🔬 Research Summary by Andy Zou, a second-year PhD student at CMU, advised by Zico Kolter and Matt Fredrikson. He is also a cofounder of the Center for AI Safety (safe.ai). [Original paper by Andy Zou, Zifan … [Read more...] about Universal and Transferable Adversarial Attacks on Aligned Language Models
Health
Collect, Measure, Repeat: Reliability Factors for Responsible AI Data Collection
🔬 Research Summary by Oana Inel, a Postdoctoral Researcher at the University of Zurich, where she is working on responsible and reliable use of data and investigating the use of explanations to provide transparency for …
Counterfactual Explanations via Locally-guided Sequential Algorithmic Recourse
🔬 Research Summary by Edward Small, a Ph.D. candidate in computer science at the Royal Melbourne Institute of Technology whose research focuses on fair and explainable artificial intelligence. [Original paper …
Bias and Fairness in Large Language Models: A Survey
🔬 Research Summary by Isabel O. Gallegos, a Ph.D. student in Computer Science at Stanford University, researching algorithmic fairness to interrogate the role of artificial intelligence in equitable …
The Ethics of AI Value Chains: An Approach for Integrating and Expanding AI Ethics Research, Practice, and Governance
🔬 Research Summary by Blair Attard-Frost, a PhD Candidate and SSHRC Joseph-Armand Bombardier Canada Graduate Scholar at the University of Toronto’s Faculty of Information. [Original paper by Blair Attard-Frost …




