
Optimizing Sequence Models for Dynamical Systems
Ablation study deconstructing sequence models. Attention-augmented Recurrent Highway Networks outperform Transformers on …

Summary of Kingma & Welling's foundational VAE paper introducing the reparameterization trick and variational …

The key difference between multi-sample VAEs and IWAEs: how log-of-averages creates a tighter bound on log-likelihood.

Summary of Burda, Grosse & Salakhutdinov's ICLR 2016 paper introducing Importance Weighted Autoencoders for tighter …

GTR-CoT uses graph traversal chain-of-thought reasoning to improve optical chemical structure recognition accuracy.

Novel OCSR method creating molecular fingerprints from images through functional group segmentation for database …

αExtractor uses a ResNet-Transformer to extract chemical structures from literature images, including noisy and hand-drawn …

Two-stage CNN approach for converting molecular images to SMILES using CDDD embeddings and extensive data augmentation.

Chen et al.'s dual-stream encoder approach for robust molecular structure recognition from diverse real-world images …

MolParser-7M is the largest OCSR dataset with 7.7M image-text pairs of molecules and E-SMILES, including 400k real-world …

MolParser converts molecular images from scientific documents to machine-readable formats using end-to-end learning.

Liu et al.'s ICLR 2025 paper introducing DenoiseVAE, which learns adaptive, atom-specific noise for better molecular …