Text-Classification on Hunter Heidenreich, ML Research Scientist

Text-Classification on Hunter Heidenreich, ML Research Scientist https://hunterheidenreich.com/tags/text-classification/ Recent content in Text-Classification on Hunter Heidenreich, ML Research Scientist Hugo -- 0.147.7 en 2025 Hunter Heidenreich Sun, 31 Aug 2025 00:00:00 +0000 Sarcasm Detection with Transformers: A Cautionary Tale https://hunterheidenreich.com/posts/sarcasm-detection-with-transformers/ Sun, 25 Feb 2024 00:00:00 +0000 https://hunterheidenreich.com/posts/sarcasm-detection-with-transformers/ Learn how dataset bias can lead to misleading results in NLP: a sarcasm detection model that actually learned to classify news sources. Classifying Congressional Bills with Machine Learning https://hunterheidenreich.com/posts/congressional-bill-policy-area-classification/ Wed, 21 Feb 2024 00:00:00 +0000 https://hunterheidenreich.com/posts/congressional-bill-policy-area-classification/ Testing ML classification of congressional bills by policy area. Comparing Naive Bayes, Logistic Regression, and XGBoost on legislative text. Congressional Data Analysis & Classification https://hunterheidenreich.com/projects/congressional-data-analysis/ Wed, 01 Mar 2023 00:00:00 +0000 https://hunterheidenreich.com/projects/congressional-data-analysis/ Data science project scraping 47,000+ congressional bills, analyzing legislative patterns, and building ML models achieving 87%+ accuracy. Count Vectorization with scikit-learn in Python https://hunterheidenreich.com/posts/nlp-count-vectorization/ Sun, 12 Aug 2018 00:00:00 +0000 https://hunterheidenreich.com/posts/nlp-count-vectorization/ Learn count vectorization in Python: convert text to numerical vectors using scikit-learn's CountVectorizer with practical examples.