This group covers papers that critically evaluate molecular generation methods or provide tools to improve their outputs.

PaperYearFocusKey Finding
Failure Modes2019BenchmarkingTrivial models fool distribution-learning metrics; ML scoring functions have exploitable biases
Sample Efficiency2022BenchmarkingProperty filters and diversity metrics substantially re-rank model performance
Avoiding Failure Modes2022DiagnosticsApparent failures stem from QSAR model disagreement, not algorithmic exploitation
UnCorrupt SMILES2023Post-hoc toolTransformer-based corrector recovers 60-95% of invalid generator outputs

All Notes