Design

From Pretraining Data to Language Models to Downstream Tasks: Tracking the Trails of Political Biases Leading to Unfair NLP Models

September 15, 2023

🔬 Research Summary by Shangbin Feng, Chan Young Park, and Yulia Tsvetkov. Shangbin Feng is a Ph.D. student at University of Washington.Chan Young Park is a Ph.D. student at Carnegie Mellon University, studying … [Read more...] about From Pretraining Data to Language Models to Downstream Tasks: Tracking the Trails of Political Biases Leading to Unfair NLP Models

Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback

September 15, 2023

🔬 Research Summary by Stephen Casper, an MIT PhD student working on AI interpretability, diagnostics, and safety. [Original paper by Stephen Casper,* Xander Davies,* Claudia Shi, Thomas Krendl Gilbert, Jérémy … [Read more...] about Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback

Self-Consuming Generative Models Go MAD

September 10, 2023

🔬 Research Summary by Josue Casco-Rodriguez and Sina Alemohammad. Josue is a 2nd-year PhD student at Rice University. He is interested in illuminating the intersection of machine learning and neuroscience from … [Read more...] about Self-Consuming Generative Models Go MAD