Experimental and Chemoinformatics Study of Tautomerism in a Database of Commercially Available Screening Samples

Experimental and Chemoinformatics Study of Tautomerism in a Database of Commercially Available Screening Samples

Laura Guasch, Waruna Yapamudiyansel, Megan L. Peach, James A. Kelley, Joseph J. Barchi, Jr., and Marc C. Nicklaus
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5129033/
 2016 Nov 28; 56(11): 2149–2161.

Abstract: We investigated how many cases of the same chemical sold as different products (at possibly different prices) occurred in a prototypical large aggregated database and simultaneously tested the tautomerism definitions in the chemoinformatics toolkit CACTVS. We applied the standard CACTVS tautomeric transforms plus a set of recently developed ring–chain transforms to the Aldrich Market Select (AMS) database of 6 million screening samples and building blocks. In 30 000 cases, two or more AMS products were found to be just different tautomeric forms of the same compound. We purchased and analyzed 166 such tautomer pairs and triplets by 1H and 13C NMR to determine whether the CACTVS transforms accurately predicted what is the same “stuff in the bottle”. Essentially all prototropic transforms with examples in the AMS were confirmed. Some of the ring–chain transforms were found to be too “aggressive”, i.e. to equate structures with one another that were different compounds

Information
Content Type OER
Author(s) Laura Guasch, Waruna Yapamudiyansel, Megan L. Peach, James A. Kelley, Joseph J. Barchi, Jr., and Marc C. Nicklaus
DOI 10.1021/acs.jcim.6b00338
Content Link https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5129033/
License Copyright 2016 ACS, Open Access through PubMed Central
Content Status publish
Date Published
Content Tags Cheminformatics, Classroom Material, Content type, Extended Tautomers, Organic, Publication