AI Safety Unconference @ conf
The purpose of the AI Safety Unconference is to foster connection and shared understanding between researchers interested in AI safety. It is a series of events hosted alongside major AI/ML conferences. It welcomes both newly interested and established researchers. It features talks, moderated discussions, one-on-ones, free-form interactions, and participant-driven activities.
AISU @ NeurIPS 2022
~85 participants, both independent researchers and people from various affiliations: OpenAI, DeepMind, Cambridge, MIRI, Mila, Cornell, Anthropic, MIT, Columbia, Stanford, U Toronto, Waterloo, Cooperative AI, ...
Lightning talks
- Haydn Belfield: What standard-setting in EU + US might mean for AI safety
- Esben Kran: Hackathons in AI safety research
- Franziska Boenisch: Privacy attacks against federated learning
- Aaron Tucker: Bandits with Costly Reward Observations
- Lewis Hammond: Cooperative AI
- Adam Dziedzic: Stealing and defending self-supervised models
- David Lindner: Active Learning for Reward Modelling
- Lauro Langosco di Langosco: An empirical demonstration of deceptive alignment
- Zhijing Jin: Causally aligning language models
Facilitated discussions (1h each)
- Haydn Belfield: AI governance
- Adam Dziedzic: Is this model mine? On stealing and defending machine learning models
- Lewis Hammond: Cooperative AI
- Lauro Langosco di Langosco: Deceptive alignment
Testimonials
"This was a fascinating event that was helpful for keeping up with the cutting edge of the field, and for launching collaborations."
— Haydn Belfield
"The AI safety unconference was very useful to meet and talk with the AI safety researchers at NeurIPS."
— Esben Kran
"It was very reassuring to hear that diverse perspectives on AI risk are being studied seriously, including criticism of the AI safety community."
— Arvind Raghavan
AISU @ NeurIPS 2019
~50 participants, both independent researchers and people from various affiliations: OpenAI, DeepMind, Cambridge, MIRI, Mila, ...
Participant-driven discussions, multiple lightning talks, ...
Archived website
AISU @ NeurIPS 2018
~50 participants from various affiliations: UC Berkeley, Vector Institute, Mila, OpenAI, DeepMind, Oxford, CHAI, McGill, NYU, Partnership on AI, etc.
Talks from:
Adam Gleave, Jan Leike, David Krueger, Dan Hendrycks, Aaron Tucker, Victoria Krakovna
Testimonials
"A great way to meet the best people in the area and propel daring ideas forward."
— Stuart Armstrong
"The event was a great place to meet others with shared research interests. I particularly enjoyed the small discussion groups that exposed me to new perspectives."
— Adam Gleave