GDB-11: Chemical Universe Database (26.4M Molecules)
A dataset card for the Generated Database 11 (GDB-11), a database of 26.4 million small organic molecules for virtual …
A dataset card for the Generated Database 11 (GDB-11), a database of 26.4 million small organic molecules for virtual …
A dataset card for the Generated Database 13 (GDB-13), a database of nearly 1 billion small organic molecules for …
A dataset card for the Generated Database 17 (GDB-17), containing 166 billion small organic molecules representing the …
Learn how GEOM transforms 2D molecular graphs into dynamic 3D conformer ensembles for molecular machine learning …
Learn how dataset bias can lead to misleading results in NLP: a sarcasm detection model that actually learned to …
GROMACS simulation workflows for generating amino acid dipeptide trajectories across nine different residue types, …
Comprehensive data science project that scraped 47,000+ congressional bills, analyzed legislative patterns, and built ML …...
Describes the creation of NewsTweet, a large-scale dataset and pipeline for studying embedded tweets in online news. …...
An analysis of QuAC's approach to conversational question answering through student-teacher interactions, featuring …
An analysis of CoQA, a conversational QA dataset that introduces multi-turn dialogue, coreference resolution, and …