🔬 Research Summary by Ashique KhudaBukhsh, an assistant professor at the Rochester Institute of Technology specializing in natural language processing, computational social science, and responsible AI. [Original … [Read more...] about Down the Toxicity Rabbit Hole: Investigating PaLM 2 Guardrails
Safety and Security
Unpacking Human-AI interaction (HAII) in safety-critical industries
🔬 Research Summary by Tita Alissa Bach, Ph.D., a Principal Researcher at the Digital Transformation research team at DNV, Norway, focusing on Human Factors in AI in safety-critical industries. [Original paper …
A Machine Learning Challenge or a Computer Security Problem?
🔬 Research Summary by Ilia Shumailov, who holds a Ph.D. in Computer Science from the University of Cambridge and specializes in Machine Learning and Computer Security. During the Ph.D., under the supervision of Prof. Ross Anderson, Ilia …
LLM Platform Security: Applying a Systematic Evaluation Framework to OpenAI’s ChatGPT Plugins
🔬 Research Summary by Umar Iqbal, an assistant professor at Washington University in St. Louis, researching computer security and privacy. [Original paper by Umar Iqbal (Washington University in St. Louis), …
Robust Distortion-free Watermarks for Language Models
🔬 Research Summary by Rohith Kuditipudi, a third-year Ph.D. student at Stanford University advised by John Duchi and Percy Liang. [Original paper by Rohith Kuditipudi, John Thickstun, Tatsunori Hashimoto, and …