About the InChI Standard

These few general questions and answers about the InChI standard are supplemented by an EXTENSIVE TECHNICAL FAQ available here.

We welcome proposals for InChI enhancements as well as technical questions, bug reports and queries, or even offers of help – please use the SourceForge inchi-discuss list as a forum.

Please register via the SourceForge web page (https://lists.sourceforge.net/lists/listinfo/inchi-discuss). Messages to the list (at inchi-discuss@lists.sourceforge.net) from unregistered participants are moderated. The SourceForge project (see http://sourceforge.net/projects/inchi) aims to enable and encourage development of facilities and applications in an Open Source context.

What is InChI?

Originally developed by the International Union of Pure and Applied Chemistry (IUPAC), the IUPAC International Chemical Identifier (InChI) is a character string generated by computer algorithm. It is a tool to be used in software applications designed and developed by those who choose to use it.

The InChI algorithm turns chemical structures into machine-readable strings of information. InChIs are unique to the compound they describe and can encode absolute stereochemistry making chemicals and chemistry machine-readable and discoverable. A simple analogy is that InChI is the bar-code for chemistry and chemical structures.

The InChI format and algorithm are non-proprietary and the software is open source, with ongoing development done by the community. A number of IUPAC working groups is currently creating standard for those areas of chemistry that are not yet handled by the InChI algorithm.

What is the InChIKey?

The InChIKey has been designed so that Internet search engines (such as Google, Yahoo, Bing, etc.) can search and find the links to a given InChI.

To make the InChIKey the InChI string is subjected to a compression algorithm to create a fixed-length string of upper-case characters. While the InChI to InChIKey hash compression is irreversible, there are a number of InChI resolvers available to look up an InChI given an InChIKey.

Why do we need the InChI and what is it used for?

The InChI with its associated InChIKey is the only structure representation standard in the public domain, open-source and freely available to the scientific community.

InChIs and InChIKeys are used by scientists, publishers and database providers to enable web-based linking between sources of chemical content whether web-page, journal or magazine, or database.

Several very large databases already have implemented InChI and/or InChIKeys.

Where can I download the InChI software?

The most current InChI software release can be downloaded from the InChI Trust download page.

What is the InChI Certification Suite?

The InChI certification suite is a software package developed and designed to check that your installation of the InChI program has been performed correctly. The programs test your installation against a broad set of structures (which are provided with the Suite).

Once the programs are run and the results sent back to the Trust, an “InChI certified” logo will be sent to the respective person or organization. The InChI Trust certification logo can then be put on the pages of the web site for all users to see. The use of the InChI Trust certification logo is optional.

The cost of the InChI certification suite is US $5000 per year.



History of InChI

1999: Steve Heller initiated a proposal at the National Institute of Standards and Technology (NIST) for a public domain structure representation standard for the NIST databases, along with Steve Stein who was the initial designer and architect for the project.

2000: from a meeting with a wide range of users of chemical nomenclature including database providers, patent officials, international trade representatives, et al, under the direction of Alan McNaught, it was decided that InChI would be an IUPAC initiative to meet the needs of the chemical and related communities.

2001: the IUPAC Chemical Identifier project began in collaboration with the US National Institute of Standards and Technology (NIST). The aim was to devise a computer based algorithm yielding a unique label for any chemical structure, regardless of how it was represented (on screen).

2005 (April): version 1 of the IUPAC International Chemical Identifier (InChI) was launched. The development and associated programming work was predominantly carried out by Dmitrii Tchekhovskoi.

2008: a shorter hash key version of InChI, known as InChIKey was developed by Igor Pletnev. This alternative format is much more suitable for use in search engines and offers many new possibilities for software applications.

2009 (January): standard versions of InChI and the InChIKey were released, which took the original algorithm with its many variable parameters and fixed them so that interoperability between databases and resources with InChIs could be achieved.

2009 (July): the InChI Trust was formed. The 2009 Board of Directors consists of 8 members: The Royal Society of Chemistry • Nature Publishing Group • Elsevier Properties SA • Thomson Reuters • John Wiley & Sons • FIZ CHEMIE • Taylor & Francis • IUPAC.

2011: version 1.04 of the InChI software was released as well as an InChI Certification Suite which is designed to check the correct installation of the InChI software. Certification logos for correct installations as well as InChI Trust member and supporter logos are provided by the InChI Trust.