🔬 Research Summary by Shangbin Feng, Chan Young Park, and Yulia Tsvetkov. Shangbin Feng is a Ph.D. student at University of Washington. Chan Young Park is a Ph.D. student at Carnegie Mellon University, studying … [Read more...] about From Pretraining Data to Language Models to Downstream Tasks: Tracking the Trails of Political Biases Leading to Unfair NLP Models
Design
Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
🔬 Research Summary by Stephen Casper, an MIT PhD student working on AI interpretability, diagnostics, and safety. [Original paper by Stephen Casper,* Xander Davies,* Claudia Shi, Thomas Krendl Gilbert, Jérémy …
Self-Consuming Generative Models Go MAD
🔬 Research Summary by Josue Casco-Rodriguez and Sina Alemohammad. Josue is a 2nd-year PhD student at Rice University. He is interested in illuminating the intersection of machine learning and neuroscience from …
From OECD to India: Exploring cross-cultural differences in perceived trust, responsibility and reliance of AI and human experts
🔬 Research Summary by Vishakha Agrawal, an independent researcher interested in human-AI collaboration, participatory AI and AI safety. [Original paper by Vishakha Agrawal, Serhiy Kandul, Markus Kneer, and Markus …
Acceptable Risks in Europe’s Proposed AI Act: Reasonableness and Other Principles for Deciding How Much Risk Management Is Enough
🔬 Research Summary by Dr. Henry Fraser, a Research Fellow in Law, Accountability, and Data Science at the Centre of Excellence for Automated Decision-Making and Society. [Original paper by Henry Fraser and …





