Natural Language Processing
Word vector illustration showing text classification and NLP concepts

Sarcasm Detection with Transformers: A Cautionary Tale

Learn how dataset bias can lead to misleading results in NLP: a sarcasm detection model that actually learned to …

Computational Social Science
Machine learning visualization showing automated analysis of congressional legislation

Classifying Congressional Bills with Machine Learning

Testing ML classification of congressional bills by policy area. Comparing Naive Bayes, Logistic Regression, and XGBoost …

Computational Social Science
Data visualization showing congressional bill analysis and legislative patterns

Congressional Data Analysis & Classification

Data science project scraping 47,000+ congressional bills, analyzing legislative patterns, and building ML models …...

Natural Language Processing
One-hot encoding and count vectorization visualization showing sparse vector representation

Count Vectorization with scikit-learn in Python

Learn count vectorization in Python: convert text to numerical vectors using scikit-learn's CountVectorizer with …