This group covers papers that critically evaluate molecular generation methods or provide tools to improve their outputs.
| Paper | Year | Focus | Key Finding |
|---|---|---|---|
| Failure Modes | 2019 | Benchmarking | Trivial models fool distribution-learning metrics; ML scoring functions have exploitable biases |
| Sample Efficiency | 2022 | Benchmarking | Property filters and diversity metrics substantially re-rank model performance |
| Avoiding Failure Modes | 2022 | Diagnostics | Apparent failures stem from QSAR model disagreement, not algorithmic exploitation |
| UnCorrupt SMILES | 2023 | Post-hoc tool | Transformer-based corrector recovers 60-95% of invalid generator outputs |



