🔬 Research Summary by Kate Kaye, a researcher, author, award-winning journalist, and deputy director of the World Privacy Forum, a nonprofit, non-partisan, public-interest research group. Kate is a member of the OECD.AI … [Read more...] about Risky Analysis: Assessing and Improving AI Governance Tools
Safety and Security
The Case for Anticipating Undesirable Consequences of Computing Innovations Early, Often, and Across Computer Science
🔬 Research Summary by Rock Yuren Pang, whose focus is on using HCI methods, crowdsourcing, and large language models to support researchers in anticipating the social impact of their work. [Original paper by Rock …
DICES Dataset: Diversity in Conversational AI Evaluation for Safety
🔬 Research Summary by Ding Wang, a senior researcher from the Responsible AI Group in Google Research, specializing in responsible data practices with a specific focus on accounting for the human experience and …
Defending Against Authorship Identification Attacks
🔬 Research Summary by Haining Wang, a Ph.D. student at Indiana University Bloomington, specializing in natural language processing and large language models. [Original paper by Haining Wang] Overview: …
Siren’s Song in the AI Ocean: A Survey on Hallucination in Large Language Models
🔬 Research Summary by Leyang Cui, a senior researcher at Tencent AI Lab. [Original paper by Yue Zhang, Yafu Li, Leyang Cui, Deng Cai, Lemao Liu, Tingchen Fu, Xinting Huang, Enbo Zhao, Yu Zhang, Yulong Chen, …