Benchmark suites, scoring frameworks, evaluation studies, and surveys of the molecular generation field.

Benchmark Suites & Scoring

| Paper | Year | Key Idea |
|---|---|---|
| GuacaMol | 2019 | Distribution-learning and goal-directed generation benchmarks |
| MOSES | 2020 | Distribution-learning benchmark with curated ZINC subset and distributional metrics |
| FCD | 2018 | Adapts FID from image generation to molecules using learned chemical embeddings |
| PMO | 2022 | Sample-efficient molecular optimization comparing 25 methods under fixed oracle budget |
| MolScore | 2024 | Unified scoring framework wrapping objectives from GuacaMol, MOSES, and others |
| Tartarus | 2023 | Realistic inverse design benchmarks using physics-based oracles (DFT, xTB) |
| SPECTRA | 2025 | Out-of-domain generalizability evaluation via spectral analysis |
| MolGenBench | 2025 | Evaluation across distribution learning, property optimization, and constrained optimization |
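
To make the idea of a fixed oracle budget concrete, here is a minimal sketch of a wrapper that caps how many distinct molecules an optimizer may score. Everything in it is an assumption for illustration: `BudgetedOracle`, `BudgetExceeded`, and `toy_score` are hypothetical names, not the API of PMO or any other benchmark, and counting only unique molecules against the budget is one plausible convention rather than a documented rule.

```python
from typing import Callable, Dict, List, Tuple


class BudgetExceeded(RuntimeError):
    """Raised when more unique evaluations are requested than the budget allows."""


class BudgetedOracle:
    """Wrap a scoring function and count unique molecules evaluated.

    Hypothetical helper for illustration; not taken from any benchmark's code.
    """

    def __init__(self, score_fn: Callable[[str], float], budget: int) -> None:
        self.score_fn = score_fn
        self.budget = budget
        self.cache: Dict[str, float] = {}

    def __call__(self, smiles: str) -> float:
        # Repeated queries are served from the cache and do not use budget.
        if smiles in self.cache:
            return self.cache[smiles]
        if len(self.cache) >= self.budget:
            raise BudgetExceeded(f"budget of {self.budget} oracle calls used up")
        score = self.score_fn(smiles)
        self.cache[smiles] = score
        return score

    @property
    def calls_used(self) -> int:
        return len(self.cache)

    def top_k(self, k: int) -> List[Tuple[str, float]]:
        # Best-scoring molecules seen so far, highest score first.
        return sorted(self.cache.items(), key=lambda kv: kv[1], reverse=True)[:k]


def toy_score(smiles: str) -> float:
    # Toy stand-in objective (assumption): count uppercase "C" characters.
    return float(smiles.count("C"))


oracle = BudgetedOracle(toy_score, budget=3)
for candidate in ["CCO", "CCCC", "CCO", "c1ccccc1"]:
    oracle(candidate)
```

Because every method sees the same cap, comparisons reflect how much each optimizer learns per evaluation rather than how many evaluations it can afford. A real study would replace `toy_score` with the actual property oracle.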

Docking Benchmarks

| Paper | Year | Key Idea |
|---|---|---|
| DOCKSTRING | 2022 | Docking-based benchmarks for ligand design with precomputed scores |
| SMINA Benchmark | 2023 | SMINA docking evaluation on realistic binding tasks |

Failure Analysis & Tools

| Paper | Year | Key Idea |
|---|---|---|
| Failure Modes | 2019 | Trivial models fool distribution-learning metrics; ML scoring functions have exploitable biases |
| Sample Efficiency | 2022 | Property filters and diversity metrics substantially re-rank model performance |
| Avoiding Failure Modes | 2022 | Apparent failures stem from QSAR model disagreement, not algorithmic exploitation |
| UnCorrupt SMILES | 2023 | Transformer-based corrector recovers 60-95% of invalid generator outputs |
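
The observation that trivial models can fool distribution-learning metrics can be illustrated with a small sketch. It assumes simplified definitions of uniqueness and novelty computed on raw SMILES strings; real benchmarks typically canonicalize molecules and check validity with a cheminformatics toolkit first, which is skipped here. The `trivial_model` function is a hypothetical toy generator, not a reproduction of any published baseline.

```python
from typing import Iterable, List, Set


def uniqueness(generated: List[str]) -> float:
    """Fraction of generated strings that are distinct."""
    return len(set(generated)) / len(generated) if generated else 0.0


def novelty(generated: Iterable[str], training: Set[str]) -> float:
    """Fraction of distinct generated strings absent from the training set."""
    distinct = set(generated)
    if not distinct:
        return 0.0
    return len(distinct - training) / len(distinct)


def trivial_model(training: List[str]) -> List[str]:
    # Toy "generator" (assumption): append one carbon atom to each training string.
    return [smiles + "C" for smiles in training]


training_set = ["CCO", "CCN", "CC(=O)O"]
samples = trivial_model(training_set)
print(uniqueness(samples), novelty(samples, set(training_set)))  # prints "1.0 1.0"
```

The toy generator learns nothing about chemistry, yet it scores perfectly on both metrics because each output differs from its source by one appended atom. This is why such metrics are better read alongside property filters and diversity measures than on their own.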

Surveys & Reviews

| Paper | Year | Key Idea |
|---|---|---|
| Deep Learning for Molecular Design | 2019 | Survey of RNNs, VAEs, GANs, and RL approaches with SMILES and graph representations |
| CLMs for De Novo Drug Design | 2023 | Review of chemical language models covering architectures and training strategies |
| Inverse Molecular Design | 2022 | Review of VAE, GAN, and RL approaches for navigating chemical space |
| RNNs vs Transformers | 2023 | Empirical comparison of RNN and Transformer architectures for molecular generation |
| MolGenSurvey | 2022 | Survey across 1D string, 2D graph, and 3D geometry representations |
| Generative AI Drug Design | 2024 | Comprehensive survey covering VAEs, GANs, diffusion, and flow models |