
Optimizing Sequence Models for Dynamical Systems
Ablation study deconstructing sequence models. Attention-augmented Recurrent Highway Networks outperform Transformers on …

Ablation study comparing SMILES, DeepSMILES, SELFIES, and InChI for OCSR. SMILES achieves highest accuracy; SELFIES …

Open-source C++ toolkit for extracting 2D chemical structures from scientific literature using heuristic image …

MolRec achieves 95%+ accuracy on simple structures but struggles with complex diagrams, revealing rule-based OCSR limits …

Rule-based system for optical chemical structure recognition using vectorization and geometric analysis, achieving 95% …

A high-fidelity replication of foundational molecular dynamics using modern software engineering practices: caching, …

Supervised learning reveals hidden eigenvalue patterns that clustering missed, testing k-NN and logistic regression on …

Clustering analysis reveals why Coulomb matrix eigenvalues struggle with larger alkanes, using Dunn Index and silhouette …

Learn how dataset bias can lead to misleading results in NLP: a sarcasm detection model that actually learned to …

Testing ML classification of congressional bills by policy area. Comparing Naive Bayes, Logistic Regression, and XGBoost …

What happens to bills in Congress? Analyzing 15K+ bills from the 117th Congress to understand legislative patterns, …

Analytical derivation of Word2Vec's softmax objective factorization and a new framework for detecting semantic bias in …