🔬 Research Summary by Stephen Casper, an MIT PhD student working on AI interpretability, diagnostics, and safety. [Original paper by Stephen Casper,* Xander Davies,* Claudia Shi, Thomas Krendl Gilbert, Jérémy … [Read more...] about Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
A Critical Analysis of the What3Words Geocoding Algorithm
🔬 Research Summary by Rudy Arthur, a Senior Lecturer in Data Science at the University of Exeter. [Original paper by Rudy Arthur] Overview: What3Words (W3W) is a geocoding app that has been aggressively …
Confidence-Building Measures for Artificial Intelligence
🔬 Research Summary by Andrew W. Reddie, Sarah Shoker, and Leah Walker. Andrew W. Reddie is an Associate Research Professor at the University of California, Berkeley’s Goldman School of Public Policy, and Founder …
Self-Consuming Generative Models Go MAD
🔬 Research Summary by Josue Casco-Rodriguez and Sina Alemohammad. Josue is a 2nd-year PhD student at Rice University. He is interested in illuminating the intersection of machine learning and neuroscience from …
From OECD to India: Exploring cross-cultural differences in perceived trust, responsibility and reliance of AI and human experts
🔬 Research Summary by Vishakha Agrawal, an independent researcher interested in human-AI collaboration, participatory AI and AI safety. [Original paper by Vishakha Agrawal, Serhiy Kandul, Markus Kneer, and Markus …