🔬 Research Summary by Roel Dobbe, an Assistant Professor working at the intersection of engineering, design and governance of data-driven and algorithmic control and decision-making systems. [Original paper by … [Read more...] about System Safety and Artificial Intelligence
Research Summaries
A Prompt Array Keeps the Bias Away: Debiasing Vision-Language Models with Adversarial Learning
🔬 Research Summary by Siobhan Mackenzie Hall, PhD student at the Oxford Neural Interfacing groups at the University of Oxford. Siobhan is also a member of the Oxford Artificial Intelligence Society, along with the … [Read more...] about A Prompt Array Keeps the Bias Away: Debiasing Vision-Language Models with Adversarial Learning
A Hazard Analysis Framework for Code Synthesis Large Language Models
🔬 Research Summary by Heidy Khlaaf, an Engineering Director at Trail of Bits specializing in the evaluation, specification, and verification of complex or autonomous software implementations in safety-critical systems, … [Read more...] about A Hazard Analysis Framework for Code Synthesis Large Language Models
The Ethical Need for Watermarks in Machine-Generated Language
🔬 Research summary by Connor Wright, our Partnerships Manager. [Original paper by A. Grinbaum and L. Adomaitis] Overview: With the ability of large language models to reproduce text becoming more prominent … [Read more...] about The Ethical Need for Watermarks in Machine-Generated Language
Bots don’t Vote, but They Surely Bother! A Study of Anomalous Accounts in a National Referendum
🔬 Research Summary by Eduardo Graells-Garrido and Ricardo Baeza-Yates. Eduardo Graells-Garrido is Assistant Professor at the Department of Computer Science in Universidad de Chile. He is interested in improving … [Read more...] about Bots don’t Vote, but They Surely Bother! A Study of Anomalous Accounts in a National Referendum