Health

Universal and Transferable Adversarial Attacks on Aligned Language Models

December 2, 2023

🔬 Research Summary by Andy Zou, a second-year PhD student at CMU, advised by Zico Kolter and Matt Fredrikson. He is also a cofounder of the Center for AI Safety (safe.ai). [Original paper by Andy Zou, Zifan … [Read more...] about Universal and Transferable Adversarial Attacks on Aligned Language Models

Collect, Measure, Repeat: Reliability Factors for Responsible AI Data Collection

November 4, 2023

🔬 Research Summary by Oana Inel, a Postdoctoral Researcher at the University of Zurich, where she is working on responsible and reliable use of data and investigating the use of explanations to provide transparency for … [Read more...] about Collect, Measure, Repeat: Reliability Factors for Responsible AI Data Collection

Counterfactual Explanations via Locally-guided Sequential Algorithmic Recourse

October 4, 2023

🔬 Research Summary by Edward Small, a Ph.D. candidate in computer science at the Royal Melbourne Institute of Technology with his research focused on fair and explainable artificial intelligence. [Original paper … [Read more...] about Counterfactual Explanations via Locally-guided Sequential Algorithmic Recourse

Bias and Fairness in Large Language Models: A Survey

September 27, 2023

🔬 Research Summary by Isabel O. Gallegos, a Ph.D. student in Computer Science at Stanford University, researching algorithmic fairness to interrogate the role of artificial intelligence in equitable … [Read more...] about Bias and Fairness in Large Language Models: A Survey

The Ethics of AI Value Chains: An Approach for Integrating and Expanding AI Ethics Research, Practice, and Governance

September 21, 2023

🔬 Research Summary by Blair Attard-Frost, a PhD Candidate and SSHRC Joseph-Armand Bombardier Canada Graduate Scholar at the University of Toronto’s Faculty of Information. [Original paper by Blair Attard-Frost … [Read more...] about The Ethics of AI Value Chains: An Approach for Integrating and Expanding AI Ethics Research, Practice, and Governance

« Previous Page