Overview of Reaxys Data

Last updated on April 10, 2025

Reaxys is the most comprehensive resource of the known chemistry space. It contains data on reactions, substances and bioactivities excerpted from 119 million documents. The machine ready data representation allows for applying the Reaxys content directly in data driven workflows. The Reaxys data corpus contains:

  • 69 million reactions with experimental conditions and literature references
  • 160 million substances and associated properties
  • 48 million bioactivity data points and 43,000 biological targets
  • 16,000 journals and 119 million documents including 45 million patents

The Reaxys data, delivered through the Reaxys API and use case specific flat files, can be embedded into a user's specific data pipelines and workflows to enable more accurate search, discovery, and prediction.

Reayxs API

- The Reaxys API provides access to the full Reaxys data corpus
- The Reaxys data is updated twice a week

Reaction Flat File (RFF)

- The RFF comprises 23 million ML ready reactions most relevant for small compound synthesis analytics and predictions
- Updated quarterly

Substance Structure Flat File (SFF)

- The SFF comprises the entire set of Reaxys chemical structures (56 million) excerpted from patents and journals
- Updated weekly

Reaxys Medicinal Chemistry Flat File (RMX FF)

- The RMC FF comprises the entire Reaxys dataset of reported bioactivities with associated substances, biological targets, and assays
- Updated weekly

FAQs

       

Did we answer your question?

Related answers

Recently viewed answers

Functionality disabled due to your cookie preferences

For further assistance: