🔬 Research Summary by Andy Zou, a Ph.D. student at CMU, advised by Zico Kolter and Matt Fredrikson. He also cofounded the Center for AI Safety (safe.ai). [Original paper by Andy Zou, Long Phan, Sarah Chen, James … [Read more...] about Representation Engineering: A Top-Down Approach to AI Transparency
Transparency
Who to Trust, How and Why: Untangling AI Ethics Principles, Trustworthiness and Trust
🔬 Research Summary by Andreas Duenser and David M. Douglas. Andreas Duenser is a Principal Research Scientist at CSIRO - Data61, Hobart, Australia, and is interested in the convergence of psychology and emerging … [Read more...] about Who to Trust, How and Why: Untangling AI Ethics Principles, Trustworthiness and Trust
Science Communications for Explainable Artificial Intelligence
🔬 Research Summary by Simon Hudson , a writer and researcher investigating subjects in AI governance, human-machine collaboration, and Science Communications, and is currently co-leading the core team behind Botto, a … [Read more...] about Science Communications for Explainable Artificial Intelligence
Towards an Understanding of Developers’ Perceptions of Transparency in Software Development: A Preliminary Study
🔬 Research Summary by Humphrey O. Obie, an Adjunct Research Fellow with the HumaniSE Lab at Monash University; his research is at the intersection of human values and software and AI systems. [Original paper by … [Read more...] about Towards an Understanding of Developers’ Perceptions of Transparency in Software Development: A Preliminary Study
On the Challenges of Using Black-Box APIs for Toxicity Evaluation in Research
🔬 Research Summary by Luiza Pozzobon, a Research Scholar at Cohere For AI where she currently researches model safety. She’s also a master’s student at the University of Campinas, Brazil. [Original paper by Luiza … [Read more...] about On the Challenges of Using Black-Box APIs for Toxicity Evaluation in Research