This group collects work that evaluates, compares, or surveys OCSR methods rather than proposing new ones. It includes the two major review papers (Rajan et al. 2020 covering rule-based methods, Musazade et al. 2022 covering the deep learning transition), benchmark studies like Krasnov et al.’s 2024 comparison of eight tools on patent images, and ablation work on output representations (Rajan et al. 2022 on SMILES vs. SELFIES vs. InChI). The shared evaluation campaigns, TREC-Chem 2011 and CLEF-IP 2012, are represented both by their overview papers and by individual system descriptions (OSRA, ChemReader, Imago, chemoCR, and MolRec entries), providing a snapshot of the field’s state at those points in time.
| Year | Paper | Key Idea |
|---|---|---|
| 2011 | Chemical Structure Reconstruction with chemoCR (2011) | Hybrid pattern recognition and rule-based expert system for OCSR |
| 2011 | ChemReader Image-to-Structure OCR at TREC 2011 Chemical IR | ChemReader evaluation achieving 93% accuracy on TREC 2011 |
| 2011 | Imago: Open-Source Chemical Structure Recognition (2011) | Open-source C++ toolkit for 2D chemical structure image recognition |
| 2011 | MolRec: Rule-Based OCSR System at TREC 2011 Benchmark | Rule-based OCSR using vectorization, achieving 95% on TREC 2011 |
| 2011 | OSRA at TREC-CHEM 2011: Optical Structure Recognition | Open-source pipeline for converting chemical images to SMILES/SDF |
| 2011 | Overview of the TREC 2011 Chemical IR Track Benchmark | Benchmark for patent prior art and chemical image recognition |
| 2012 | CLEF-IP 2012: Patent and Chemical Structure Benchmark | Benchmarking lab for patent retrieval and chemical structure extraction |
| 2012 | MolRec: Chemical Structure Recognition at CLEF 2012 | MolRec at CLEF 2012 revealing rule-based OCSR limits |
| 2012 | MolRec at CLEF 2012: Rule-Based Structure Recognition | Failure analysis of MolRec on CLEF 2012 chemical structure task |
| 2012 | OSRA at CLEF-IP 2012: Native TIFF Processing for Patents | Native TIFF processing outperforms external splitting tools |
| 2020 | A Review of Optical Chemical Structure Recognition Tools | Review and benchmarking of 30 years of OCSR methods and tools |
| 2022 | Review of OCSR Techniques and Models (Musazade 2022) | OCSR evolution from rule-based systems to deep learning |
| 2022 | String Representations for Chemical Image Recognition | Ablation comparing SMILES, DeepSMILES, SELFIES, and InChI for OCSR |
| 2024 | Benchmarking Eight OCSR Tools on Patent Images (2024) | Benchmark of 8 open-access OCSR methods on 2702 patent images |
| 2025 | Uni-Parser: Industrial-Grade Multi-Modal PDF Parsing (2025) | Modular multi-expert PDF parsing engine with integrated OCSR |
| – | Image-to-Sequence OCSR: A Comparative Analysis | Comparative analysis of 24 image-to-sequence OCSR methods |
| – | OCSR Methods: A Taxonomy of Approaches | Taxonomy of OCSR methods by approach type |
