Systematic enumeration and analysis of chemical space, including combinatorial molecule generation, complexity measures, and synthetic accessibility scoring.

Chemical Space Analysis

PaperYearKey Idea
AllChem: Generating and Searching 10^20 Structures2007Recursive synthon generation from reactions and building blocks, with topomer similarity search
VEHICLe: Heteroaromatic Rings of the Future2009Complete enumeration of 24,847 heteroaromatic ring systems with ML-based tractability prediction
ACSESS: Diverse Optimal Molecules in the SMU2015Diversity-biased stochastic search finds diverse optimal molecules without exhaustive enumeration
Surge: Fastest Open-Source Chemical Graph Generator2022Three-stage canonical generation path method, 42-161x faster than MOLGEN 5.0
Molecular Complexity from the GDB Chemical Space2025Simple graph-based molecular complexity measures (MC1, MC2) derived from GDB enumeration
CHX8: Complete Eight-Carbon Hydrocarbon Space2026Exhaustive enumeration of 31,497 stable hydrocarbons up to 8 carbons with RSE strain metric
DatasetSizeDescription
GDB-1126.4MExhaustive enumeration of molecules up to 11 atoms of C, N, O, F
GDB-13977MExhaustive enumeration of molecules up to 13 atoms of C, N, O, S, Cl
GDB-17166.4BExhaustive enumeration of molecules up to 17 atoms of C, N, O, S, halogens