2026  29

March  24

Kabsch-Horn Cookbook: Differentiable Alignment

2026-03-20 · Hunter Heidenreich

MolGen: Molecular Generation with Chemical Feedback

2026-03-20 · Hunter Heidenreich

Molecular Transformer: Calibrated Reaction Prediction

2026-03-18 · Hunter Heidenreich

Arun et al.: SVD-Based Least-Squares Fitting of 3D Points

2026-03-16 · Hunter Heidenreich

Exposing Limitations of Molecular ML with Activity Cliffs

2026-03-16 · Hunter Heidenreich

Horn et al.: Absolute Orientation Using Orthonormal Matrices

2026-03-16 · Hunter Heidenreich

MoLFormer: Large-Scale Chemical Language Representations

2026-03-16 · Hunter Heidenreich

SELFormer: A SELFIES-Based Molecular Language Model

2026-03-16 · Hunter Heidenreich

Umeyama’s Method: Corrected SVD for Point Alignment

2026-03-16 · Hunter Heidenreich

AdaptMol: Domain Adaptation for Molecular OCSR (2026)

Consistency Models: Fast One-Step Diffusion Generation

2026-03-15 · Hunter Heidenreich

D3PM: Discrete Denoising Diffusion Probabilistic Models

2026-03-15 · Hunter Heidenreich

GraphReco: Probabilistic Structure Recognition (2026)

GraSP: Graph Recognition via Subgraph Prediction (2026)

Horn’s Method: Absolute Orientation via Unit Quaternions

2026-03-15 · Hunter Heidenreich

Kabsch Algorithm: Optimal Rotation for Point Set Alignment

2026-03-15 · Hunter Heidenreich

Latent Diffusion Models for High-Res Image Synthesis

2026-03-15 · Hunter Heidenreich

Uni-Parser: Industrial-Grade Multi-Modal PDF Parsing (2025)

Can Recurrent Neural Networks Warp Time? (ICLR 2018)

2026-03-14 · Hunter Heidenreich

Relational Inductive Biases in Deep Learning (2018)

2026-03-14 · Hunter Heidenreich

Scaling Laws vs Model Architectures: Inductive Bias

2026-03-14 · Hunter Heidenreich

SE(3)-Transformers: Equivariant Attention for 3D Data

2026-03-14 · Hunter Heidenreich

Spherical CNNs: Rotation-Equivariant Networks on the Sphere

2026-03-14 · Hunter Heidenreich

The Quarks of Attention: Building Blocks of Attention

2026-03-14 · Hunter Heidenreich

February  3

Molecular Sets (MOSES): A Generative Modeling Benchmark

2026-02-16 · Hunter Heidenreich

The Reliability Trap: The Limits of 99% Accuracy

2026-02-15 · 16 min · 3252 words · Hunter Heidenreich

The Evolution of Page Stream Segmentation: Rules to LLMs

2026-02-14 · 14 min · 2936 words · Hunter Heidenreich

January  2

GutenOCR: A Grounded Vision-Language Front-End for Documents

2026-01-20 · Hunter Heidenreich

PubMed-OCR: PMC Open Access OCR Annotations

2026-01-16 · Hunter Heidenreich

2025  175

December  119

ChemBERTa-3: Open Source Chemical Foundation Models

2025-12-26 · Hunter Heidenreich

ChemDFM-R: Chemical Reasoning LLM with Atomized Knowledge

2025-12-26 · Hunter Heidenreich

ChemBERTa-2: Scaling Molecular Transformers to 77M

2025-12-25 · Hunter Heidenreich

GP-MoLFormer: Molecular Generation via Transformers

2025-12-25 · Hunter Heidenreich

ChemBERTa: Molecular Property Prediction via Transformers

2025-12-23 · Hunter Heidenreich

Chemformer: A Pre-trained Transformer for Comp Chem

2025-12-23 · Hunter Heidenreich

A Convexity Principle for Interacting Gases (McCann 1997)

2025-12-21 · Hunter Heidenreich

Building Normalizing Flows with Stochastic Interpolants

2025-12-21 · Hunter Heidenreich

Flow Matching for Generative Modeling: Scalable CNFs

2025-12-21 · Hunter Heidenreich

Neural ODEs: Continuous-Depth Deep Learning Models

2025-12-21 · Hunter Heidenreich

Rectified Flow: Learning to Generate and Transfer Data

2025-12-21 · Hunter Heidenreich

Score Matching and Denoising Autoencoders: A Connection

2025-12-21 · Hunter Heidenreich

Score-Based Generative Modeling with SDEs (Song 2021)

2025-12-21 · Hunter Heidenreich

ChemDFM-X: Multimodal Foundation Model for Chemistry

2025-12-20 · Hunter Heidenreich

DynamicFlow: Integrating Protein Dynamics into Drug Design

2025-12-20 · Hunter Heidenreich

Image-to-Sequence OCSR: A Comparative Analysis

InstructMol: Multi-Modal Molecular LLM for Drug Discovery

2025-12-20 · Hunter Heidenreich

InvMSAFold: Generative Inverse Folding with Potts Models

2025-12-20 · Hunter Heidenreich

MERMaid: Multimodal Chemical Reaction Mining from PDFs

2025-12-20 · Hunter Heidenreich

MOFFlow: Flow Matching for MOF Structure Prediction

2025-12-20 · Hunter Heidenreich

Multimodal Search in Chemical Documents and Reactions

2025-12-20 · Hunter Heidenreich

OCSAug: Diffusion-Based Augmentation for Hand-Drawn OCSR

2025-12-20 · Hunter Heidenreich

STOUT V2.0: Transformer-Based SMILES to IUPAC Translation

2025-12-20 · Hunter Heidenreich

STOUT: SMILES to IUPAC Names via Neural Machine Translation

2025-12-20 · Hunter Heidenreich

Struct2IUPAC: Translating SMILES to IUPAC via Transformers

2025-12-20 · Hunter Heidenreich

Translating InChI to IUPAC Names with Transformers

2025-12-20 · Hunter Heidenreich

AtomLenz: Atom-Level OCSR with Limited Supervision

Benchmarking Eight OCSR Tools on Patent Images (2024)

ChemReco: Hand-Drawn Chemical Structure Recognition

ChemVLM: A Multimodal Large Language Model for Chemistry

2025-12-19 · Hunter Heidenreich

DECIMER.ai: Optical Chemical Structure Recognition

Dual-Path Global Awareness Transformer (DGAT) for OCSR

2025-12-19 · Hunter Heidenreich

Enhanced DECIMER for Hand-Drawn Structure Recognition

2025-12-19 · Hunter Heidenreich

Image2InChI: SwinTransformer for Molecular Recognition

2025-12-19 · Hunter Heidenreich

MarkushGrapher: Multi-modal Markush Structure Recognition

2025-12-19 · Hunter Heidenreich

MMSSC-Net: Multi-Stage Sequence Cognitive Networks

2025-12-19 · Hunter Heidenreich

MolGrapher: Graph-based Chemical Structure Recognition

MolMole: Unified Vision Pipeline for Molecule Mining

MolScribe: Robust Image-to-Graph Molecular Recognition

2025-12-19 · Hunter Heidenreich

MolSight: OCSR with RL and Multi-Granularity Learning

ABC-Net: Keypoint-Based Molecular Image Recognition

ChemPix: Hand-Drawn Hydrocarbon Structure Recognition

DECIMER 1.0: Transformers for Chemical Image Recognition

2025-12-18 · Hunter Heidenreich

End-to-End Transformer for Molecular Image Captioning

Handwritten Chemical Structure Recognition with RCGD

ICMDT: Automated Chemical Structure Image Recognition

Image-to-Graph Transformers for Chemical Structures

Image2SMILES: Transformer OCSR with Synthetic Data Pipeline

2025-12-18 · Hunter Heidenreich

MICER: Molecular Image Captioning with Transfer Learning

MolMiner: Deep Learning OCSR with YOLOv5 Detection

One Strike, You’re Out: Detecting Markush Structures

Review of OCSR Techniques and Models (Musazade 2022)

String Representations for Chemical Image Recognition

SwinOCSR: End-to-End Chemical OCR with Swin Transformers

2025-12-18 · Hunter Heidenreich

A Review of Optical Chemical Structure Recognition Tools

ChemGrapher: Deep Learning for Chemical Graph OCSR

DECIMER: Deep Learning for Chemical Image Recognition

Deep Learning for Molecular Structure Extraction (2019)

Handwritten Chemical Ring Recognition with Neural Networks

Handwritten Chemical Symbol Recognition Using SVMs

HMM-based Online Recognition of Chemical Symbols

Img2Mol: Accurate SMILES Recognition from Depictions

On-line Handwritten Chemical Expression Recognition

Online Handwritten Chemical Formula Structure Analysis

Recognition of On-line Handwritten Chemical Expressions

SVM-HMM Online Classifier for Chemical Symbols

Unified Framework for Handwritten Chemical Expressions

Chemical Structure Reconstruction with chemoCR (2011)

ChemReader Image-to-Structure OCR at TREC 2011 Chemical IR

CLEF-IP 2012: Patent and Chemical Structure Benchmark

2025-12-16 · Hunter Heidenreich

MolRec at CLEF 2012: Rule-Based Structure Recognition

OSRA at CLEF-IP 2012: Native TIFF Processing for Patents

Overview of the TREC 2011 Chemical IR Track Benchmark

2025-12-16 · Hunter Heidenreich

Probabilistic OCSR with Markov Logic Networks

Research on Chemical Expression Images Recognition

Chemical Structure Recognition (Rule-Based)

ChemInk: Real-Time Recognition for Chemical Drawings

CLiDE Pro: Optical Chemical Structure Recognition Tool

Imago: Open-Source Chemical Structure Recognition (2011)

Kekulé-1 System for Chemical Structure Recognition

OSRA at TREC-CHEM 2011: Optical Structure Recognition

Structural Analysis of Handwritten Chemical Formulas

A Spatial Model for Legislative Roll Call Analysis

2025-12-14 · Hunter Heidenreich

Automatic Recognition of Chemical Images

2025-12-14 · Hunter Heidenreich

Chaotic Evolution of the Solar System (Sussman 1992)

2025-12-14 · Hunter Heidenreich

Chemical Literature Data Extraction: The CLiDE Project

Chemical Machine Vision

ChemReader: Automated Structure Extraction

Distributed Representations: A Foundational Theory

2025-12-14 · Hunter Heidenreich

Drive to Life on Wet and Icy Worlds: Alkaline Vent Theory

2025-12-14 · Hunter Heidenreich

Dynamical Corrections to TST for Surface Diffusion

2025-12-14 · Hunter Heidenreich

Embedded-Atom Method User Guide: Voter’s 1994 Chapter

2025-12-14 · Hunter Heidenreich

Embedded-Atom Method: Theory and Applications Review

2025-12-14 · Hunter Heidenreich

Evans 1986: Thermal Conductivity of Lennard-Jones Fluid

2025-12-14 · Hunter Heidenreich

Funnels, Pathways, and Energy Landscapes of Protein Folding

2025-12-14 · Hunter Heidenreich

Graph Perception for Chemical Structure OCR

Hand-Drawn Chemical Diagram Recognition (AAAI 2007)

IMG2SMI: Translating Molecular Structure Images to SMILES

In Situ XRD of Oxidation-Reduction Oscillations on Pt/SiO2

2025-12-14 · Hunter Heidenreich

Kekulé: OCR-Optical Chemical Recognition

Kinetic Oscillations in CO Oxidation on Pt(100): Theory

2025-12-14 · Hunter Heidenreich

MD Simulation of Self-Diffusion on Metal Surfaces (1994)

2025-12-14 · Hunter Heidenreich

Mixture Density Networks: Modeling Multimodal Distributions

2025-12-14 · Hunter Heidenreich

OCSR Methods: A Taxonomy of Approaches

Optical Recognition of Chemical Graphics

2025-12-14 · Hunter Heidenreich

Oscillatory CO Oxidation on Pt(110): Temporal Modeling

2025-12-14 · Hunter Heidenreich

OSRA: Open Source Optical Structure Recognition

2025-12-14 · Hunter Heidenreich

Party Matters: Enhancing Legislative Vote Embeddings

2025-12-14 · Hunter Heidenreich

Reconstruction of Chemical Molecules from Images

2025-12-14 · Hunter Heidenreich

Second-Order Langevin Equation for Field Simulations

2025-12-14 · Hunter Heidenreich

Stillinger-Weber Potential for Silicon Simulation

2025-12-14 · Hunter Heidenreich

Tea Party in the House: Legislative Ideology via HIPTM

2025-12-14 · Hunter Heidenreich

Three Domains of Life: Woese’s Phylogenetic Revolution

2025-12-14 · Hunter Heidenreich

Adatom Dimer Diffusion on fcc(111) Crystal Surfaces

2025-12-13 · Hunter Heidenreich

AI & Physical Sciences Taxonomy: A Seven-Vector Framework

Correlations in the Motion of Atoms in Liquid Argon

2025-12-13 · Hunter Heidenreich

Terraforming Venus With the Cloud Continent Proposal

2025-12-07 · Hunter Heidenreich

Venus Evolution Through Time: Key Questions and Missions

Life on Venus? Astrobiology and the Habitability Limits

2025-12-05 · Hunter Heidenreich

November  4

Molecular String Renderer: Robust Visualization Tool

2025-11-30 · Hunter Heidenreich

Auto-Encoding Variational Bayes: VAE Paper Summary

2025-11-05 · Hunter Heidenreich

Importance Weighted Autoencoders (IWAE) for Tighter Bounds

2025-11-05 · Hunter Heidenreich

Importance Weighted Autoencoders: Beyond the Standard VAE

2025-11-05 · 7 min · 1355 words · Hunter Heidenreich

October  17

InChI and Tautomerism: Toward Comprehensive Treatment

2025-10-12 · Hunter Heidenreich

InChI: The Worldwide Chemical Structure Identifier Standard

2025-10-12 · Hunter Heidenreich

Making InChI FAIR and Sustainable for Inorganic Chemistry

2025-10-12 · Hunter Heidenreich

Mixfile & MInChI: Machine-Readable Mixture Formats

2025-10-12 · Hunter Heidenreich

NInChI: Toward a Chemical Identifier for Nanomaterials

2025-10-12 · Hunter Heidenreich

Recent Advances in the SELFIES Library: 2023 Update

2025-10-12 · Hunter Heidenreich

RInChI: The Reaction International Chemical Identifier

2025-10-12 · Hunter Heidenreich

SELFIES: The Original Paper on Robust Molecular Strings

2025-10-12 · Hunter Heidenreich

SMILES Notation: The Original Paper by Weininger (1988)

MolRec: Chemical Structure Recognition at CLEF 2012

MolRec: Rule-Based OCSR System at TREC 2011 Benchmark

What is Optical Chemical Structure Recognition (OCSR)?

2025-10-11 · 8 min · 1548 words · Hunter Heidenreich

αExtractor: Chemical Info from Biomedical Literature

ChemInfty: Chemical Structure Recognition in Patent Images

MolNexTR: A Dual-Stream Molecular Image Recognition

MolParser-7M & WildMol: Large-Scale OCSR Datasets

2025-10-03 · Hunter Heidenreich

MolParser: End-to-End Molecular Structure Recognition

September  11

ZINC-22: A Multi-Billion Scale Database for Ligand Discovery

2025-09-27 · Hunter Heidenreich

Converting SMILES and SELFIES to 2D Molecular Images

2025-09-12 · 7 min · 1470 words · Hunter Heidenreich

SELFIES: The 100% Robust Molecular String Representation

2025-09-12 · 6 min · 1172 words · Hunter Heidenreich

Communication in the Presence of Noise: Shannon’s 1949 Paper

2025-09-08 · Hunter Heidenreich

How to Fold Graciously: Levinthal’s Paradox (1969)

2025-09-08 · Hunter Heidenreich

MARCEL: Molecular Conformer Ensemble Learning Benchmark

2025-09-08 · Hunter Heidenreich

SMILES: A Compact Notation for Chemical Structures

2025-09-08 · Hunter Heidenreich

The Müller-Brown Potential: A 2D Benchmark Surface

2025-09-08 · Hunter Heidenreich

The Number of Isomeric Hydrocarbons of the Methane Series

2025-09-08 · Hunter Heidenreich

The Surface of Venus: Stratigraphy and Resurfacing History

GEOM: Energy-Annotated Molecular Conformations Dataset

2025-09-04 · Hunter Heidenreich

August  19

Exponential Random Numbers: Two Classic Algorithms

2025-08-31 · 7 min · 1326 words · Hunter Heidenreich

GDB-11: Chemical Universe Database (26.4M Molecules)

2025-08-29 · Hunter Heidenreich

Implementing the Müller-Brown Potential in PyTorch

2025-08-27 · 17 min · 3490 words · Hunter Heidenreich

Müller-Brown Potential: A PyTorch ML Testbed

2025-08-27 · Hunter Heidenreich

DenoiseVAE: Adaptive Noise for Molecular Pre-training

2025-08-24 · Hunter Heidenreich

Beyond Atoms: 3D Space Modeling for Molecular Pretraining

2025-08-23 · Hunter Heidenreich

Dark Side of Forces: Non-Conservative ML Force Models

2025-08-23 · Hunter Heidenreich

Efficient DFT Hamiltonian Prediction via Adaptive Sparsity

2025-08-23 · Hunter Heidenreich

Learning Smooth Interatomic Potentials with eSEN (ICML)

2025-08-23 · Hunter Heidenreich

Modernizing Rahman’’s 1964 Argon Simulation

2025-08-23 · Hunter Heidenreich

Modernizing Rahman’s 1964 Argon Simulation

2025-08-23 · 6 min · 1262 words · Hunter Heidenreich

Embedded-Atom Method: Impurities and Defects in Metals

2025-08-22 · Hunter Heidenreich

Umbrella Sampling: Monte Carlo Free-Energy Estimation

2025-08-21 · Hunter Heidenreich

Contrastive Learning for Variational Autoencoder Priors

2025-08-17 · Hunter Heidenreich

Lennard-Jones on Adsorption and Diffusion on Surfaces

2025-08-17 · Hunter Heidenreich

GDB-13: Chemical Universe Database (970M Molecules)

2025-08-16 · Hunter Heidenreich

GDB-17: Chemical Universe Database (166.4B Molecules)

2025-08-16 · Hunter Heidenreich

High-Performance Word2Vec in Pure PyTorch

2025-08-16 · Hunter Heidenreich

GEOM Dataset: 3D Molecular Conformer Generation

2025-08-15 · 7 min · 1381 words · Hunter Heidenreich

June  1

GTR-CoT: Graph Traversal Chain-of-Thought for Molecules

April  1

SubGrapher: Visual Fingerprinting of Chemical Structures

January  3

OCSU: Optical Chemical Structure Understanding (2025)

3D Steerable CNNs: Rotationally Equivariant Features

2025-01-16 · Hunter Heidenreich

LLMs for Insurance Document Automation

2025-01-01 · Hunter Heidenreich

2024  11

December  1

RFL: Simplifying Chemical Structure Recognition (AAAI 2025)

October  1

Optimizing Sequence Models for Dynamical Systems

2024-10-01 · Hunter Heidenreich

August  1

LLMs for Page Stream Segmentation

2024-08-21 · Hunter Heidenreich

July  1

The Nature of LUCA and Its Impact on the Early Earth System

2024-07-12 · Hunter Heidenreich

April  1

Invalid SMILES Benefit Chemical Language Models: A Study

2024-04-15 · Hunter Heidenreich

March  2

Synthetic Isomer Data Generation Pipeline

2024-03-09 · Hunter Heidenreich

Modern PyTorch VAEs: A Detailed Implementation Guide

2024-03-03 · 31 min · 6586 words · Hunter Heidenreich

February  4

Sarcasm Detection with Transformers: A Cautionary Tale

2024-02-25 · 5 min · 1004 words · Hunter Heidenreich

Hearing Molecular Shape via Coulomb Matrix Eigenvalues

2024-02-24 · 19 min · 3934 words · Hunter Heidenreich

Classifying Congressional Bills with Machine Learning

2024-02-21 · 13 min · 2717 words · Hunter Heidenreich

Coulomb Matrices for Molecular Machine Learning

2024-02-10 · 7 min · 1384 words · Hunter Heidenreich

2023  7

October  2

How Does Congress Actually Work? Data from 15K Bills

2023-10-05 · 6 min · 1164 words · Hunter Heidenreich

Kabsch Algorithm: NumPy, PyTorch, TensorFlow, and JAX

2023-10-03 · 16 min · 3370 words · Hunter Heidenreich

September  3

LAMMPS Tutorial: Copper and Platinum Adatom Diffusion

2023-09-27 · 11 min · 2309 words · Hunter Heidenreich

Automated Adatom Diffusion Workflow

2023-09-21 · Hunter Heidenreich

Generating Mini-Protein Trajectories with GROMACS

2023-09-21 · 6 min · 1150 words · Hunter Heidenreich

August  1

Mini-Protein Trajectory Generation

2023-08-01 · Hunter Heidenreich

March  1

Congressional Knowledge Graph & Policy Classification

2023-03-01 · Hunter Heidenreich

2022  4

October  1

SELFIES and the Future of Molecular String Representations

2022-10-14 · Hunter Heidenreich

May  3

IQCRNN: Certified Stability for Neural Networks

2022-05-11 · Hunter Heidenreich

Analytical Solution to Word2Vec Softmax & Bias Probing

2022-05-01 · Hunter Heidenreich

EigenNoise: Data-Free Word Vector Initialization

2022-05-01 · Hunter Heidenreich

2021  3

June  2

Look, Don’t Tweet: Unified Data Models for Social NLP

2021-06-30 · Hunter Heidenreich

PyConversations: Social Media Conversational Analysis

2021-06-01 · Hunter Heidenreich

May  1

GPT-2 Susceptibility to Universal Adversarial Triggers

2020  3

November  1

5 Axes of Multi-Arm Bandit Problems: A Practical Guide

2020-11-10 · 8 min · 1628 words · Hunter Heidenreich

August  1

NewsTweet Dataset: Social Media in Digital Journalism

2020-08-01 · Hunter Heidenreich

July  1

Coordinated Social Targeting on Twitter

2020-07-01 · Hunter Heidenreich

2019  2

November  1

Data-Driven WordNet Construction from Wiktionary

2019-11-01 · Hunter Heidenreich

January  1

A Guide to Neuroevolution: NEAT and HyperNEAT

2019-01-02 · 8 min · 1579 words · Hunter Heidenreich

2018  8

December  2

Breaking Down Machine Learning for the Average Person

2018-12-04 · 3 min · 559 words · Hunter Heidenreich

Foundations of AI: Knowledge-Based Agents and Logic

2018-12-01 · 8 min · 1674 words · Hunter Heidenreich

November  1

Cartesian Genetic Programming in Julia

2018-11-18 · Hunter Heidenreich

October  1

QuAC: Question Answering in Context Dataset

2018-10-31 · 5 min · 949 words · Hunter Heidenreich

August  3

CoQA Dataset: Advancing Conversational Question Answering

2018-08-23 · 5 min · 953 words · Hunter Heidenreich

Understanding GANs: From Fundamentals to Objective Functions

2018-08-18 · 13 min · 2585 words · Hunter Heidenreich

Word Embeddings in NLP: An Introduction

2018-08-05 · 9 min · 1839 words · Hunter Heidenreich

March  1

FFTW Compiler in Haskell

2018-03-15 · Hunter Heidenreich

2017  2

February  1

Term Schedule Optimizer

2017-02-15 · Hunter Heidenreich

January  1

Rubik’s Cube Sonification

2017-01-29 · Hunter Heidenreich

2014  1

October  1

Elemental Brawl

2014-10-24 · Hunter Heidenreich