Deep Learning

3D Steerable CNNs: Learning Rotationally Equivariant Features in Volumetric Data

Weiler et al.'s NeurIPS 2018 paper introducing SE(3)-equivariant CNNs for volumetric data using group theory and …

Document Processing

LLMs for Page Stream Segmentation

Enhanced TABME benchmark for page stream segmentation, creating TABME++, showing fine-tuned decoder-based LLMs …

Generative Modeling
Variational Autoencoder architecture diagram showing encoder, latent space, and decoder

Modern PyTorch Techniques for VAEs: A Tutorial

VAE tutorial using modern PyTorch: torch.distributions, optimization techniques, numerical stability, and implementation …

Computational Chemistry
Comparison chart showing k-NN significantly outperforming logistic regression for molecular classification across different alkane sizes

Can You Hear the Shape of a Molecule? (Part Three)

Supervised learning reveals hidden eigenvalue patterns that clustering missed, testing k-NN and logistic regression on …

Computational Chemistry
Charts showing Dunn Index, distance metrics, and computation time analysis revealing clustering performance degradation with molecular size

Can You Hear the Shape of a Molecule? (Part Two)

Clustering analysis reveals why Coulomb matrix eigenvalues struggle with larger alkanes, using Dunn Index and silhouette …

Computational Chemistry
3D ball-and-stick model of butane molecule showing linear carbon chain structure

Can You Hear the Shape of a Molecule?

Explore molecular shape recognition using Coulomb matrix eigenvalues. Analysis of alkane isomers from data generation to …

Computational Social Science
Machine learning visualization showing automated analysis of congressional legislation

Classifying Congressional Bills with Machine Learning

Testing ML classification of congressional bills by policy area. Comparing Naive Bayes, Logistic Regression, and XGBoost …

Computational Chemistry
Coulomb matrix heatmap visualization showing molecular structure encoding on logarithmic scale

Understanding Coulomb Matrices for Molecular Machine Learning

Learn how Coulomb matrices encode 3D molecular structure for machine learning—from basic theory to Python implementation …

Computational Social Science
Data visualization showing congressional bill analysis and legislative patterns

Congressional Data Analysis & Classification

Data science project scraping 47,000+ congressional bills, analyzing legislative patterns, and building ML models …

Natural Language Processing

EigenNoise: Data-Free Word Vector Initialization

Investigation into EigenNoise, a data-free initialization scheme for word vectors that approaches pre-trained model …

Computational Chemistry

IMG2SMI: Translating Molecular Structure Images to SMILES

Campos & Ji's method for converting 2D molecular images to SMILES strings using Transformers and SELFIES representation.

Computational Social Science
Network visualization showing social media conversational analysis and text relationships

PyConversations: Social Media Conversational Analysis

Undergraduate thesis exploring representation learning for social media text and developing tools for cross-platform …