Honoring the Life and Legacy of a Leader in AI Ethics In accordance with his family's wishes, it is with profound sadness that we announce the passing of Abhishek Gupta, Founder and Principal Researcher of the … [Read more...] about In Memoriam: Abhishek Gupta (Dec 20, 1992 – Sep 30, 2024)
Spotlight
Representation Engineering: A Top-Down Approach to AI Transparency
🔬 Research Summary by Andy Zou, a Ph.D. student at CMU, advised by Zico Kolter and Matt Fredrikson. He also cofounded the Center for AI Safety (safe.ai). [Original paper by Andy Zou, Long Phan, Sarah Chen, James … [Read more...] about Representation Engineering: A Top-Down Approach to AI Transparency
Universal and Transferable Adversarial Attacks on Aligned Language Models
🔬 Research Summary by Andy Zou, a second-year PhD student at CMU, advised by Zico Kolter and Matt Fredrikson. He is also a cofounder of the Center for AI Safety (safe.ai). [Original paper by Andy Zou, Zifan … [Read more...] about Universal and Transferable Adversarial Attacks on Aligned Language Models
Oppenheimer As A Timely Warning to the AI Community
✍️ Original article by Eryn Rigley, a PhD research student at University of Southampton, specializing in the intersection of environmental and AI ethics, as well as defense & security AI ethics. Like many … [Read more...] about Oppenheimer As A Timely Warning to the AI Community
Unstable Diffusion: Ethical challenges and some ways forward
✍️ Founder's Desk column by Abhishek Gupta, Founder and Principal Researcher at the Montreal AI Ethics Institute. In a potentially prescient comment from a few months ago, I had shared thoughts with Kyle Wiggers … [Read more...] about Unstable Diffusion: Ethical challenges and some ways forward