🔬 Research Summary by Jing Yao, a researcher at Microsoft Research Asia, working on AI value alignment, interpretability and societal AI. [Original paper by Jing Yao, Xiaoyuan Yi, Xiting Wang, Jindong Wang, and … [Read more...] about From Instructions to Intrinsic Human Values – A Survey of Alignment Goals for Big Models
Technical Methods
On the Challenges of Deploying Privacy-Preserving Synthetic Data in the Enterprise
🔬 Research Summary by Lauren Arthur, Marketing Director at Hazy, a leading synthetic data company. [Original paper by Georgi Ganev, Jason Costello, Jonathan Hardy, Will O’Brien, James Rea, Gareth Rees, and Lauren … [Read more...] about On the Challenges of Deploying Privacy-Preserving Synthetic Data in the Enterprise
Listen to What They Say: Better Understand and Detect Online Misinformation with User Feedback
🔬 Research Summary by Hubert Etienne, a researcher in AI ethics, the former Global Generative AI Ethics Lead at Meta and the inventor of Computational philosophy. [Original paper by Hubert Etienne and Onur … [Read more...] about Listen to What They Say: Better Understand and Detect Online Misinformation with User Feedback
The Design Space of Generative Models
🔬 Research Summary by Meredith Ringel Morris, Director of Human-AI Interaction Research at Google DeepMind; she is also an Affiliate Professor at the University of Washington, and is an ACM Fellow and member of the ACM … [Read more...] about The Design Space of Generative Models
From Pretraining Data to Language Models to Downstream Tasks: Tracking the Trails of Political Biases Leading to Unfair NLP Models
🔬 Research Summary by Shangbin Feng, Chan Young Park, and Yulia Tsvetkov. Shangbin Feng is a Ph.D. student at University of Washington.Chan Young Park is a Ph.D. student at Carnegie Mellon University, studying … [Read more...] about From Pretraining Data to Language Models to Downstream Tasks: Tracking the Trails of Political Biases Leading to Unfair NLP Models