🔬 Research Summary by Sonali Singh, a Ph.D. Student at Texas Tech University working on Large language model(LLM). [Original paper by Sonali Singh, Faranak Abri, and Akbar Siami Namin] Overview: This paper … [Read more...] about Exploiting Large Language Models (LLMs) through Deception Techniques and Persuasion Principles
Blog
ABScribe: Rapid Exploration of Multiple Writing Variations in Human-AI Co-Writing Tasks using Large Language Models
🔬 Research Summary by Mohi Reza, a Computer Science Ph.D. Candidate at the University of Toronto and an Adaptive Experimentation Accelerator team member who won the Grand Prize in the $1M XPRIZE Digital Learning … [Read more...] about ABScribe: Rapid Exploration of Multiple Writing Variations in Human-AI Co-Writing Tasks using Large Language Models
Deployment corrections: An incident response framework for frontier AI models
🔬 Research Summary by Joe O’Brien, an Associate Researcher at the Institute for AI Policy and Strategy, focusing on corporate governance and accountability surrounding developing and deploying frontier AI … [Read more...] about Deployment corrections: An incident response framework for frontier AI models
Representation Engineering: A Top-Down Approach to AI Transparency
🔬 Research Summary by Andy Zou, a Ph.D. student at CMU, advised by Zico Kolter and Matt Fredrikson. He also cofounded the Center for AI Safety (safe.ai). [Original paper by Andy Zou, Long Phan, Sarah Chen, James … [Read more...] about Representation Engineering: A Top-Down Approach to AI Transparency
The irony of having a clean AI chatbot
✍️ Column by Sun Gyoo Kang, Lawyer. Disclaimer: The views expressed in this article are solely my own and do not reflect my employer's opinions, beliefs, or positions. Any opinions or information in this article … [Read more...] about The irony of having a clean AI chatbot