🔬 Research Summary by Andy Zou, a second-year PhD student at CMU, advised by Zico Kolter and Matt Fredrikson. He is also a cofounder of the Center for AI Safety (safe.ai). [Original paper by Andy Zou, Zifan … [Read more...] about Universal and Transferable Adversarial Attacks on Aligned Language Models
Technical Methods
LLMCarbon: Modeling the end-to-end Carbon Footprint of Large Language Models
🔬 Research Summary by Ahmad Faiz, Masters in Data Science student at Indiana University Bloomington. [Original paper by Ahmad Faiz, Sotaro Kaneda, Ruhan Wang, Rita Osi, Parteek Sharma, Fan Chen, and Lei … [Read more...] about LLMCarbon: Modeling the end-to-end Carbon Footprint of Large Language Models
Exploring XAI for the Arts: Explaining Latent Space in Generative Music
🔬 Research Summary by Nick Bryan-Kinns, Professor of Creative Computing at the Creative Computing Institute, University of the Arts London, where he researches the human-centred approaches to the use of AI in the Arts. … [Read more...] about Exploring XAI for the Arts: Explaining Latent Space in Generative Music
Collect, Measure, Repeat: Reliability Factors for Responsible AI Data Collection
🔬 Research Summary by Oana Inel, a Postdoctoral Researcher at the University of Zurich, where she is working on responsible and reliable use of data and investigating the use of explanations to provide transparency for … [Read more...] about Collect, Measure, Repeat: Reliability Factors for Responsible AI Data Collection
Counterfactual Explanations via Locally-guided Sequential Algorithmic Recourse
🔬 Research Summary by Edward Small, a Ph.D. candidate in computer science at the Royal Melbourne Institute of Technology with his research focused on fair and explainable artificial intelligence. [Original paper … [Read more...] about Counterfactual Explanations via Locally-guided Sequential Algorithmic Recourse