Kabsch-Horn Cookbook: Differentiable Alignment
The Reliability Trap: The Limits of 99% Accuracy
The Evolution of Page Stream Segmentation: Rules to LLMs
GutenOCR: A Grounded Vision-Language Front-End for Documents
PubMed-OCR: PMC Open Access OCR Annotations
Molecular String Renderer: Chemical Visualization Library
Importance Weighted Autoencoders: Beyond the Standard VAE
What is Optical Chemical Structure Recognition (OCSR)?
Converting SMILES and SELFIES to 2D Molecular Images
Exponential Random Numbers: Two Classic Algorithms
Implementing the Müller-Brown Potential in PyTorch
Müller-Brown Potential: A PyTorch ML Testbed
Modernizing Rahman’s 1964 Argon Simulation
Modernizing Rahman’s 1964 Argon Simulation
Vectorized Word2Vec in Pure PyTorch
GEOM Dataset: 3D Molecular Conformer Generation
LLMs for Insurance Document Automation
Optimizing Sequence Models for Dynamical Systems
LLMs for Page Stream Segmentation
Synthetic Isomer Data Generation Pipeline
Modern PyTorch VAEs: A Detailed Implementation Guide
Sarcasm Detection with Transformers: A Cautionary Tale
Hearing Molecular Shape via Coulomb Matrix Eigenvalues
Classifying Congressional Bills with Machine Learning
Coulomb Matrices for Molecular Machine Learning
How Does Congress Actually Work? Data from 15K Bills
Kabsch Algorithm: NumPy, PyTorch, TensorFlow, and JAX
LAMMPS Tutorial: Copper and Platinum Adatom Diffusion
Automated Adatom Diffusion Workflow
Generating Mini-Protein Trajectories with GROMACS
Mini-Protein Trajectory Generation
Congressional Knowledge Graph & Policy Classification
IQCRNN: Certified Stability for Neural Networks
Analytical Solution to Word2Vec Softmax & Bias Probing
EigenNoise: Data-Free Word Vector Initialization
Look, Don’t Tweet: Unified Data Models for Social NLP
PyConversations: Social Media Conversational Analysis
GPT-2 Susceptibility to Universal Adversarial Triggers
5 Axes of Multi-Arm Bandit Problems: A Practical Guide
NewsTweet Dataset: Social Media in Digital Journalism
Coordinated Social Targeting on Twitter
Data-Driven WordNet Construction from Wiktionary
A Guide to Neuroevolution: NEAT and HyperNEAT
Breaking Down Machine Learning for the Average Person
Foundations of AI: Knowledge-Based Agents and Logic
Cartesian Genetic Programming in Julia
QuAC: Question Answering in Context Dataset
CoQA Dataset: Advancing Conversational Question Answering
Understanding GANs: From Fundamentals to Objective Functions
Word Embeddings in NLP: An Introduction
Rubik’s Cube Sonification