🔬 Research Summary by Sonali Singh, a Ph.D. Student at Texas Tech University working on Large language model(LLM). [Original paper by Sonali Singh, Faranak Abri, and Akbar Siami Namin] Overview: This paper … [Read more...] about Exploiting Large Language Models (LLMs) through Deception Techniques and Persuasion Principles
Core Principles of Responsible AI
Deployment corrections: An incident response framework for frontier AI models
🔬 Research Summary by Joe O’Brien, an Associate Researcher at the Institute for AI Policy and Strategy, focusing on corporate governance and accountability surrounding developing and deploying frontier AI … [Read more...] about Deployment corrections: An incident response framework for frontier AI models
Representation Engineering: A Top-Down Approach to AI Transparency
🔬 Research Summary by Andy Zou, a Ph.D. student at CMU, advised by Zico Kolter and Matt Fredrikson. He also cofounded the Center for AI Safety (safe.ai). [Original paper by Andy Zou, Long Phan, Sarah Chen, James … [Read more...] about Representation Engineering: A Top-Down Approach to AI Transparency
Risky Analysis: Assessing and Improving AI Governance Tools
🔬 Research Summary by Kate Kaye, a researcher, author, award-winning journalist, and deputy director of the World Privacy Forum, a nonprofit, non-partisan, public-interest research group. Kate is a member of the OECD.AI … [Read more...] about Risky Analysis: Assessing and Improving AI Governance Tools
AI Applications in Agriculture: Sustainable Farming
🔬 Original article by Aixa Lacroix from Encode Justice Canada This is a part of our Recess series in which university students from across Canada briefly explain key concepts in AI that young people should know about: … [Read more...] about AI Applications in Agriculture: Sustainable Farming