Computational Chemistry

ChemInfty: Robust Segmentation and Recognition of Chemical Structures in Low-Quality Patent Images

Fujiyoshi et al.'s segment-based approach for recognizing chemical structures in challenging Japanese patent images with …...

Computational Chemistry

Img2Mol: Accurate SMILES Recognition from Molecular Graphical Depictions

Clevert et al.'s two-stage CNN approach for converting molecular images to SMILES using CDDD embeddings and extensive …...

Computational Chemistry

MolNexTR: A Generalized Deep Learning Model for Molecular Image Recognition

Chen et al.'s dual-stream encoder approach for robust molecular structure recognition from diverse real-world images …...

Computational Chemistry

OSRA: Optical Structure Recognition for Chemical Information Extraction

Filippov & Nicklaus's open-source rule-based system for converting molecular structure images into machine-readable …...

Computational Chemistry

MolParser: End-to-end Visual Recognition of Molecule Structures in the Wild

Fang et al.'s method for converting molecular structure images from scientific documents into machine-readable formats …...

Computational Chemistry
ZINC-22 Tranche Browser showing molecular count distribution

ZINC-22: Multi-Billion Molecule Database

A dataset card for ZINC-22, the largest freely available database of commercially available compounds for virtual …

Computational Chemistry
SELFIES strings guarantee 100% valid molecules - even when generated randomly

Converting SELFIES Strings to 2D Molecular Images

Learn how to visualize SELFIES molecular representations and explore their unique advantages through random sampling, …

Computational Chemistry
Aspirin molecular structure generated from SMILES string

Converting SMILES Strings to 2D Molecular Images

Learn how to create 2D molecular images from SMILES strings using RDKit and PIL, with proper formatting and legends.

Computational Chemistry
SELFIES representation of 2-Fluoroethenimine molecule

SELFIES (Self-Referencing Embedded Strings)

SELFIES is a 100% robust string-based representation for chemical molecules, designed for machine learning applications …...

Computational Chemistry
MARCEL dataset Kraken ligand example in 3D conformation

MARCEL: Molecular Representation and Conformer Ensemble Learning

MARCEL dataset provides 722K+ conformers across 76K+ molecules for drug discovery, catalysis, and molecular …

Computational Chemistry
Müller-Brown Potential Energy Surface showing the three minima and two saddle points

Müller-Brown Potential

The Müller-Brown potential: a classic two-dimensional analytical benchmark for testing optimization algorithms, reaction …...

Computational Chemistry
Methoxybenzonitrile

SMILES (Simplified Molecular Input Line Entry System)

SMILES is a specification for describing the structure of chemical molecules using short ASCII strings....