Computational Chemistry

InChI and Tautomerism: Toward a Comprehensive Treatment

Dhaked et al.'s comprehensive analysis of tautomerism in chemoinformatics, introducing 86 new tautomeric rules and their …...

Computational Chemistry

SMILES: The Original Paper (Weininger 1988)

A summary of David Weininger's foundational 1988 paper that introduced SMILES notation - the string-based molecular …

Computational Chemistry

GTR-CoT: Graph Traversal as Visual Chain of Thought for Molecular Structure Recognition

Wang et al.'s novel VLM approach using graph traversal CoT and faithful recognition principles to improve optical …...

Computational Chemistry

αExtractor: Automatic Chemical Information Extraction from Biomedical Literature

Xiong et al.'s deep learning system for extracting chemical structures from literature images using ResNet-Transformer …...

Computational Chemistry

Img2Mol: Accurate SMILES Recognition from Molecular Graphical Depictions

Clevert et al.'s two-stage CNN approach for converting molecular images to SMILES using CDDD embeddings and extensive …...

Computational Chemistry

MolNexTR: A Generalized Deep Learning Model for Molecular Image Recognition

Chen et al.'s dual-stream encoder approach for robust molecular structure recognition from diverse real-world images …...

Computational Chemistry

OSRA: Optical Structure Recognition for Chemical Information Extraction

Filippov & Nicklaus's open-source rule-based system for converting molecular structure images into machine-readable …...

Computational Chemistry

MolParser: End-to-end Visual Recognition of Molecule Structures in the Wild

Fang et al.'s method for converting molecular structure images from scientific documents into machine-readable formats …...

Computational Chemistry
ZINC-22 Tranche Browser showing molecular count distribution

ZINC-22: Multi-Billion Molecule Database

A dataset card for ZINC-22, the largest freely available database of commercially available compounds for virtual …

Computational Chemistry
Aspirin molecular structure generated from SMILES string

Converting SMILES Strings to 2D Molecular Images

Learn how to create 2D molecular images from SMILES strings using RDKit and PIL, with proper formatting and legends.

Computational Chemistry
Methoxybenzonitrile

SMILES (Simplified Molecular Input Line Entry System)

SMILES is a specification for describing the structure of chemical molecules using short ASCII strings....

Computational Chemistry
GDB-11 molecule structure showing FC1C2OC1c3c(F)coc23

GDB-11: Chemical Universe Database (26.4M Molecules)

A dataset card for the Generated Database 11 (GDB-11), a database of 26.4 million small organic molecules for virtual …