A possible extension to the RInChI as a means of providing machine readable process data

A possible extension to the RInChI as a means of providing machine readable process data

Philipp-Maximilian Jacob, Tian Lan, Jonathan M. Goodman & Alexei A. Lapkin
Journal of Cheminformatics volume 9, Article number: 23 (2017)

Abstract: The algorithmic, large-scale use and analysis of reaction databases such as Reaxys is currently hindered by the absence of widely adopted standards for publishing reaction data in machine readable formats. Crucial data such as yields of all products or stoichiometry are frequently not explicitly stated in the published papers and, hence, not reported in the database entry for those reactions, limiting their usefulness for algorithmic analysis. This paper presents a possible extension to the IUPAC RInChI standard via an auxiliary layer, termed ProcAuxInfo, which is a standardised, extensible form in which to report certain key reaction parameters such as declaration of all products and reactants as well as auxiliaries known in the reaction, reaction stoichiometry, amounts of substances used, conversion, yield and operating conditions. The standard is demonstrated via creation of the RInChI including the ProcAuxInfo layer based on three published reactions and demonstrates accurate data recoverability via reverse translation of the created strings. Implementation of this or another method of reporting process data by the publishing community would ensure that databases, such as Reaxys, would be able to abstract crucial data for big data analysis of their contents.

Information
Content Type OER
Author(s) Philipp-Maximilian Jacob, Tian Lan, Jonathan M. Goodman & Alexei A. Lapkin
DOI https://doi.org/10.1186/s13321-017-0210-6
Content Link https://jcheminf.biomedcentral.com/articles/10.1186/s13321-017-0210-6
License CC 4.0
Content Status publish
Date Published April 11, 2017
Content Tags Content type, Publication, Reactions