Time Series Forecasting
Forecasting comparison of different neural architectures on the Multiscale Lorenz-96 system

Deconstructing the "Black Box" - Optimizing Sequence Models for Dynamical Systems

A rigorous ablation study deconstructing the "black box" of sequence modeling. We synthesized novel hybrid …

Scientific Computing
Velocity Autocorrelation Function showing the signature negative region characteristic of liquid dynamics and the cage effect discovered by Rahman

Digital Restoration: Modernizing Rahman's 1964 Argon Simulation

A high-fidelity replication of foundational molecular dynamics using modern software engineering practices: caching, …

Computational Chemistry
Comparison chart showing k-NN significantly outperforming logistic regression for molecular classification across different alkane sizes

Can You Hear the Shape of a Molecule? (Part Three)

Supervised learning reveals hidden eigenvalue patterns that clustering missed, testing k-NN and logistic regression on …

Computational Chemistry
Charts showing Dunn Index, distance metrics, and computation time analysis revealing clustering performance degradation with molecular size

Can You Hear the Shape of a Molecule? (Part Two)

Clustering analysis reveals why Coulomb matrix eigenvalues struggle with larger alkanes, using Dunn Index and silhouette …

Natural Language Processing
Word vector illustration showing text classification and NLP concepts

Sarcasm Detection with Transformers: A Cautionary Tale

Learn how dataset bias can lead to misleading results in NLP: a sarcasm detection model that actually learned to …

Computational Social Science
Data visualization showing congressional bill analysis and legislative patterns

How Does Congress Actually Work? Data from 15K Bills

What happens to bills in Congress? Analyzing 15K+ bills from the 117th Congress to understand legislative patterns, …

Natural Language Processing
Information Quality Ratio plot showing statistical dependencies decay as window size increases

Analytical Solution to Word2Vec Softmax & Corpus Bias Probing

Analytical derivation of Word2Vec's softmax objective factorization and a new framework for detecting semantic bias in …

Machine Learning Fundamentals
Various symmetric and repetitive patterns generated by Compositional Pattern Producing Networks

HyperNEAT: Scaling Neuroevolution with Geometric Patterns

How HyperNEAT uses indirect encoding and geometric patterns to evolve large-scale neural networks with biological …

Natural Language Processing
Types and distribution of coreferences in QuAC dataset showing dialogue complexity

QuAC: Question Answering in Context Dataset

Analysis of QuAC's conversational QA through student-teacher interactions, featuring 100K+ context-dependent questions …

Natural Language Processing
3D visualization of word embeddings showing semantic relationships in vector space

Word Embeddings in NLP: An Introduction

Learn about word embeddings in NLP, from basic one-hot encoding to contextual models like ELMo. A guide with examples.