Past events

Sorted by date, newest first. Useful as a memory of what's happened in the community.

BlueDot SF Launch

mixer ★ 0.58

📅 Jul 30, 2026 – Jul 31, 2026 📍 San Francisco, USA via BlueDot Impact Events Calendar (Luma)

BlueDot Impact launch event in San Francisco bringing together the AI safety community. Part of BlueDot's expansion to build the workforce needed to safely navigate AGI. Community gathering for networking and connection among local AI safety practitioners, researchers, and fellows.

#community #alignment #governance launch networking

Breaking Barriers Part 2: AI Safety Career Fair and Builders Summit

mixer ★ 0.59

📅 Jul 24, 2026 📍 San Francisco, USA via BlueDot Impact Events Calendar (Luma)

Part 2 of the Breaking Barriers to AI Safety series, featuring a career fair connecting AI safety job seekers with organizations, plus a builders summit for technical collaboration. Organized by Jen Ba and Robin Goins (Mox) through BlueDot Impact.

#alignment #community career-fair networking

Secret Loyalties Hackathon 2026

hackathon ★ 0.51 Reg closed Jul 24, 2026

📅 Jul 24, 2026 – Jul 26, 2026 📍 Hybrid via Apart Research

Apart Research hackathon focused on AI safety research. Part of the 55+ sprints series with 6,000+ participants across 200+ global locations.

#alignment #safety #deception #evals #control hackathon sprint

AI Safety Evals Paper Reading Club - July 21

reading-group ★ 0.61

📅 Jul 21, 2026 📍 Virtual via BlueDot Impact Events Calendar (Luma)

Weekly AI safety evaluations paper reading club organized by BlueDot Impact. Virtual session on Zoom. Part of BlueDot's regular community programming building the workforce needed to safely navigate AGI.

#evals

Breaking Barriers to AI Safety Hackathon Part 1

hackathon ★ 0.60

📅 Jul 18, 2026 📍 San Francisco, USA via BlueDot Impact Events Calendar (Luma)

Part 1 of a two-part AI safety event series organized by BlueDot Impact. This hackathon focuses on building AI safety tools and breaking barriers to entry in the field. Followed by Part 2 (Career Fair and Builders Summit) on July 24. Organized by Joshua Landes and Jen Ba.

#alignment #community #evals hackathon

Secure & Sovereign AI Workshop 2026

workshop ★ 0.57 Reg closed Jul 18, 2026

📅 Jul 18, 2026 – Jul 19, 2026 📍 Berlin, Germany via Foresight Institute

Technical workshop bringing together top talent to address bottlenecks in secure AI advancement. Organized by Foresight Institute, a 40-year-old organization focused on transformative technology. Registration available.

#governance #evals #alignment #safety-research #security #control #technical-safety #ai-security #adversarial-robustness #scaling-infrastructure #safety technical Berlin workshop

Getting into AI Safety - SASH Event

mixer ★ 0.49

📅 Jul 17, 2026 📍 Singapore, Singapore via Singapore AI Safety Hub (SASH)

A community event organized by Singapore AI Safety Hub (SASH) focused on getting into AI safety. Provides networking and information for those interested in entering the AI safety field.

#community #alignment networking introductory

Second Workshop on Agents in the Wild: Safety, Security, and Beyond

workshop ★ 0.73

📅 Jul 11, 2026 📍 Seoul, South Korea via ICML: Safety-related Workshops

Second Workshop on Agents in the Wild focusing on safety and security of AI agents deployed in real-world environments. Addresses challenges in ensuring safe and secure operation of autonomous agents. Part of ICML 2026 workshop track.

#alignment #evals ICML workshop agents safety security

Pluralistic Alignment @ ICML 2026

workshop ★ 0.66 Reg closed Jun 10, 2026

📅 Jul 11, 2026 📍 Seoul, South Korea via Pluralistic Alignment Workshop at ICML

The workshop addresses how to integrate diverse perspectives, values, and expertise into pluralistic AI alignment frameworks. Examines multi-objective alignment approaches, preference elicitation methods, and human-AI interaction workflows that reflect pluralistic values across diverse communities.

#alignment #governance #pluralistic-alignment #pluralistic-values #measurement-science ICML-workshop interdisciplinary workshop ICML

Pluralistic Alignment Workshop at ICML 2026

workshop ★ 0.57 Reg closed Jun 10, 2026

📅 Jul 11, 2026 📍 Seoul, South Korea via Pluralistic Alignment Workshop at ICML

Workshop addressing Pluralistic AI: Aligning with the Diversity of Human Values. Examines how to integrate diverse perspectives into AI alignment frameworks, exploring multi-objective approaches and consensus-building practices for navigating value conflicts in pluralistic societies.

#alignment #governance

Second Workshop on Technical AI Governance Research

workshop ★ 0.73

📅 Jul 10, 2026 📍 Seoul, South Korea via ICML: Safety-related Workshops

Second Workshop on Technical AI Governance Research at ICML 2026, focusing on technical approaches to AI governance, policy, and regulation. Part of the main conference workshop track.

#governance #policy ICML workshop governance technical-governance

Failure Modes in Agentic AI: Reproducible Triggers, Trace Diagnostics, and Verified Fixes

workshop ★ 0.73

📅 Jul 10, 2026 📍 Seoul, South Korea via ICML: Safety-related Workshops

Workshop at ICML 2026 focused on identifying, diagnosing, and fixing failure modes in agentic AI systems. Covers reproducible triggers for failures, diagnostic tracing methods, and verified repair approaches. Highly relevant to AI safety and robustness.

#evals #alignment ICML workshop failure-modes agents diagnostics

ICML 2026 Workshop on Mechanistic Interpretability

workshop ★ 0.84 CFP closed May 8, 2026

📅 Jul 10, 2026 – Jul 11, 2026 📍 Seoul, South Korea via ICML: Safety-related Workshops, Mechanistic Interpretability Workshop at ICML

Workshop bringing together diverse perspectives from the community to discuss recent advances in mechanistic interpretability, build common understanding, and chart future directions. Addresses developing principled methods to analyze and understand model internals (weights and activations) to gain insight into behavior and underlying computation. Received 2.6x as many submissions as the previous year.

#interpretability #alignment #circuit-tracing #sparse-autoencoders #mechanistic-interpretability #instrument-science #adversarial-robustness ICML workshop mechanistic-interpretability ICML-workshop third-iteration

Mechanistic Interpretability Workshop at ICML 2026

workshop ★ 0.70 CFP closed Jun 12, 2026

📅 Jul 10, 2026 📍 Seoul, South Korea via Mechanistic Interpretability Workshop at ICML

Annual mechanistic interpretability workshop at ICML building community dialogue around understanding neural network internals through principled analysis methods. Features 23 spotlight presentations alongside poster presentations. Continues series from previous workshops at ICML 2024 and NeurIPS 2025.

#interpretability #mechanistic-interpretability workshop ICML

Mechanistic Interpretability Workshop @ ICML 2026

workshop ★ 0.84 CFP closed May 31, 2026

📅 Jul 10, 2026 📍 Seoul, South Korea via Mechanistic Interpretability Workshop at ICML

Annual mechanistic interpretability workshop at ICML. High-quality venue for mech interp research organized by leading researchers in the field.

#interpretability #alignment

Australian AI Safety Forum 2026

conference ★ 0.61 Reg closed Jul 7, 2026

📅 Jul 7, 2026 – Jul 8, 2026 📍 Sydney, Australia via Australian AI Safety Forum

An interdisciplinary forum grounded in the science of AI safety, bringing together participants from research, government, industry, and civil society. The 2026 forum features 79 speakers across 76 sessions with 329 participants, focusing on measurement, evaluation, and governance of AI systems.

#evals #governance #evaluation #measurement-science #alignment #evaluations #policy #technical-safety #safety interdisciplinary conference

CAMBRIA Summer 2026: July Cohort

fellowship ★ 0.93 Apps closed Jul 24, 2026

📅 Jul 6, 2026 – Jul 24, 2026 📍 New York, USA via CAMBRIA - Cambridge Bootcamp for Research in Interpretability and Alignment

3-week ML upskilling bootcamp for AI safety focusing on interpretability and RL, based on the ARENA curriculum. Run by the Cambridge Boston Alignment Initiative. Provides housing, meals, 24/7 office access, dedicated teaching assistants, and travel support to participants.

#interpretability #alignment

ICML 2026

conference ★ 0.59 CFP closed Jun 6, 2026

📅 Jul 6, 2026 – Jul 11, 2026 📍 Seoul, South Korea via ICML: Safety-related Workshops

International Conference on Machine Learning. July 6 for Expo/Tutorial Day, July 7-9 for Main Conference, July 10-11 for Workshops. Annual ML conference with safety-related workshops in scope.

#alignment #interpretability #evals #ml-research #mechanistic-interpretability #governance #machine-learning #evaluation #safety #ml-safety major-conference conference

Seoul Alignment Workshop 2026

workshop ★ 0.72 Reg closed Jul 5, 2026

📅 Jul 6, 2026 📍 Seoul, South Korea via FAR AI - Foundational AI Research

Part of the ongoing Alignment Workshop series by FAR.AI, bringing together global leaders to explore strategies for mitigating risks from artificial general intelligence. FAR AI is a foundational research organization focused on AI safety verification, secure compute, and technical safety topics.

#alignment #governance #control #interpretability #technical-safety #agi-risk #evals #agi-safety workshop

AI Safety New Zealand Conference 2026

conference ★ 0.63 Reg closed Jul 10, 2026

📅 Jul 4, 2026 📍 Christchurch, New Zealand via BlueDot Impact Events Calendar (Luma)

AI safety conference in Christchurch, New Zealand, organized by the BlueDot Impact community. In-person event bringing together the New Zealand AI safety community.

#safety #governance #alignment

ACL 2026 Workshop on Evaluating Evaluations (EvalEval)

workshop ★ 0.54 CFP closed May 14, 2026

📅 Jul 3, 2026 – Jul 4, 2026 📍 San Diego, USA via EvalEval Coalition

Workshop examining tensions between model developers and evaluation researchers, covering evaluation methodology and measurement theory, evaluation infrastructure and costs, and sociotechnical impact assessments. Co-hosted social with GEM workshop on July 3 around 8pm, main workshop sessions on July 4 afternoon.

#evals

ASSC 29: 29th Annual Meeting of the ASSC

conference ★ 0.35

📅 Jun 30, 2026 – Jul 3, 2026 📍 Santiago, Chile via Association for the Scientific Study of Consciousness (ASSC)

29th Annual Meeting of the Association for the Scientific Study of Consciousness in Santiago de Chile. Academic society promoting rigorous research on understanding the nature, function, and underlying mechanisms of consciousness. Relevant for ACO practitioners working on consciousness studies and measurement science applicable to interpretability work. Includes members from cognitive science, medicine, neuroscience, philosophy, and related disciplines.

#interpretability #measurement-science #cognitive-science

Governing AI in the Wild: AI Policy Hackathon

hackathon ★ 0.73

📅 Jun 20, 2026 – Jun 22, 2026 📍 Toronto, Canada via BlueDot Impact Events Calendar (Luma)

AI policy hackathon focused on governance challenges of deployed AI systems. Organized by BlueDot Impact community in Toronto, bringing together participants to work on practical policy frameworks for AI governance in real-world deployment contexts.

#governance #policy

Global South AI Safety Hackathon 2026

hackathon ★ 0.66 Reg closed Jun 19, 2026

📅 Jun 19, 2026 – Jun 21, 2026 📍 Hybrid via Apart Research

Regional AI safety hackathon for Latin America, Africa, and Asia. Participants build AI safety tools, evaluations, and policy research. Regional competition structure with pipeline to fellowship and placement opportunities. Funded by Schmidt Sciences.

#alignment #safety #evaluation #governance #evals #policy

UNIDIR Global Conference on AI, Security and Ethics 2026

conference ★ 0.75 Reg closed Jun 18, 2026

📅 Jun 18, 2026 📍 Geneva, Switzerland · Hybrid via UNIDIR - United Nations Institute for Disarmament Research

United Nations Institute for Disarmament Research global conference on AI governance, security, and ethics. Part of UNIDIR's Centre of Excellence on AI, Peace and Security programming. International AI policy and disarmament-related conference in scope per the safety community's broader governance interests.

#governance #sociotechnical-threats #policy #sociotechnical-threat-surface #safety

Machine Learning Summer School 2026

other ★ 0.49

📅 Jun 15, 2026 – Jun 27, 2026 📍 New York, USA via Machine Learning Summer School (MLSS) - Columbia

Two-week intensive summer school at Columbia University covering machine learning topics including mechanistic interpretability, alignment/safety, RAG & agents, and LLM systems. Approximately 200 PhD students participate alongside faculty and industry speakers. In-scope due to dedicated alignment and mechanistic interpretability tracks.

#interpretability #alignment

UNIDIR Inter-faith Dialogue on AI, Security and Ethics

conference ★ 0.60

📅 Jun 12, 2026 📍 Geneva, Switzerland via UNIDIR - United Nations Institute for Disarmament Research

Bringing together multiple faith perspectives on AI governance and security implications, exploring ethical dimensions of advanced AI through inter-religious dialogue.

#governance #sociotechnical-threat-surface

AI Risk Content Hackathon

hackathon ★ 0.68 Reg closed Jun 6, 2026

📅 Jun 6, 2026 📍 London, UK via BlueDot Impact Events Calendar (Luma)

AI Risk Content Hackathon organized by BlueDot Impact in London. Focus on creating AI safety and risk communication content. Part of BlueDot's broader mission to build the workforce needed to safely navigate AGI.

#alignment #governance #evals content-creation

Foresight Vision Weekend London 2026

conference ★ 0.64

📅 Jun 5, 2026 – Jun 7, 2026 📍 London, United Kingdom via Foresight Institute

Foresight Institute flagship event gathering leading scientists, entrepreneurs, funders, and policymakers to explore the frontiers of science and technology. Multiple tracks including AI safety topics.

#frontier-science #ai-safety

Foresight Vision Weekend UK 2026

conference ★ 0.75 Reg closed Jun 5, 2026

📅 Jun 5, 2026 – Jun 7, 2026 📍 London, United Kingdom via Foresight Institute

Flagship conference where leading scientists, entrepreneurs, funders, and policymakers convene to explore frontier technology and plan for beneficial futures. Includes AI safety track as part of broader focus on transformative technology. 40-year-old organization focused on beneficial technology development.

#governance #frontier-science #technical-safety #alignment #safety

Vision Weekend UK 2026 - AI Safety Track

conference ★ 0.70 Reg closed Jun 5, 2026

📅 Jun 5, 2026 – Jun 7, 2026 📍 London, UK via Foresight Institute

Foresight Institute Vision Weekend with frontier science and technology tracks including AI safety. 40-year-old organization focused on transformative technology. Three-day event in London featuring AI safety programming alongside other frontier tech tracks.

#alignment #governance frontier-science multi-track

BlueDot Incubator Week June 2026

hackathon ★ 0.73

📅 Jun 1, 2026 – Jun 5, 2026 📍 London, UK via BlueDot Impact

Five-day intensive programme for AI safety founders going from idea to funded. Successful pitches receive £50k in equity-free seed funding. Part of BlueDot Impact's incubator and rapid-funding initiatives supporting concrete AI safety work.

#alignment #governance startup incubator

EA Global: London 2026

conference ★ 0.79 Reg closed May 31, 2026

📅 May 29, 2026 – May 31, 2026 📍 London, UK via EA Global London 2026, EA Forum Events, EA Global Events

EA Global conference series organized by Centre for Effective Altruism. Speakers present research on effective altruism including heavy AI safety programming. Features talks, workshops, and networking opportunities. In scope for social and community event reasons with significant AI safety attendance.

#alignment #governance #evals #safety effective-altruism

ARENA 8.0

fellowship ★ 0.67 Apps closed May 25, 2026

📅 May 25, 2026 – Jun 26, 2026 📍 London, UK via ARENA: Alignment Research Engineering Accelerator

In-person bootcamp covering AI safety fundamentals, mechanistic interpretability, and reinforcement learning. Program covers travel, visas, accommodation, and meals. Duration: 4-5 weeks typical for ARENA bootcamps. Online curriculum available for independent study.

#alignment #interpretability

MAIA AI Safety Fundamentals Summer 2026

reading-group ★ 0.54 Apps closed May 22, 2026

📅 May 22, 2026 – Jul 17, 2026 📍 Virtual via MAIA - MIT AI Alignment

8-week virtual reading group run by MIT AI Alignment (MAIA), meeting 2 hours per week. Explores why AI safety matters and current mitigation approaches, covering AI trajectory, misalignment risks, technical safety solutions, policy, and career paths. No prior AI background required. Run in small groups facilitated by MAIA team members.

#alignment #governance #technical-safety #safety

Secure Program Synthesis Hackathon 2026

hackathon ★ 0.54 Reg closed May 22, 2026

📅 May 22, 2026 – May 24, 2026 📍 Hybrid via Apart Research

Hackathon focused on secure program synthesis by Apart Research. Hybrid format with online participation and in-person hubs.

#alignment #control #safety-research #evals #security #automated-research #safety-applications #technical-safety #evaluations #governance #verification #code-safety #adversarial-robustness #scaling-infrastructure #evaluation hybrid sprint

BlueDot Technical AI Safety Project Sprint May 2026

workshop ★ 0.92 Apps closed May 10, 2026

📅 May 18, 2026 – Jun 21, 2026 📍 Virtual via BlueDot Impact

5-week project-based course for engineers and early researchers to work with an AI safety expert on a contribution to AI safety research or engineering. Includes mentorship, regular check-ins, and a published write-up. Covers alignment, mechanistic interpretability, evaluations, red-teaming, AI control, and scalable oversight.

#alignment #interpretability #evals

CAMBRIA 2026 May Cohort

fellowship ★ 0.65 Apps closed Jun 5, 2026

📅 May 18, 2026 – Jun 5, 2026 📍 Cambridge, USA via CAMBRIA - Cambridge Bootcamp for Research in Interpretability and Alignment

#interpretability #alignment

Workshop on Assurance and Verification of AI Development (AViD)

workshop ★ 0.60 Apps closed May 1, 2026

📅 May 17, 2026 📍 San Francisco, USA via FAR AI - Foundational AI Research

FAR.AI and Center for AI Safety workshop on infrastructure for secure and verifiable AI, bringing together researchers, builders, and funders across ML, hardware security, systems, cryptography, and computer security to identify the most promising technical approaches and spark concrete collaborations.

#evals #alignment #safety-research #security #scaling-infrastructure #adversarial-robustness #governance workshop verification cryptography hardware-security

Technical AI Safety (TAIS) Conference 2026

conference ★ 0.68 Reg closed May 14, 2026

📅 May 14, 2026 📍 Oxford, UK via Technical AI Safety (TAIS) Conference

Free, one-day technical AI safety conference organized by Oxford Martin AI Governance Initiative and Noeon Research. Third iteration. Welcomes researchers and professionals from all backgrounds interested in discussing AI safety, regardless of prior experience. Sponsored by MATS and Apart Research.

#alignment #evals #governance

Where is AI in 2026 and Where is it Going?

mixer ★ 0.46

📅 May 5, 2026 📍 Berkeley, USA via AI Safety Awareness Group Oakland (Meetup)

A free, accessible workshop hosted by AI Safety Awareness Group Oakland exploring AI's trajectory and societal impact. No technical background required. Features live demonstrations of current AI systems, interactive forecasting activities, and discussions about AI's implications for work, relationships, and society over the next 1-5 years.

#governance #alignment

UNIDIR Cyber Stability Conference 2026

conference ★ 0.52

📅 May 4, 2026 – May 5, 2026 📍 Geneva, Switzerland · Hybrid via UNIDIR - United Nations Institute for Disarmament Research

Two-day conference on global cooperation for cybersecurity resilience and stability. Organized by United Nations Institute for Disarmament Research. Addresses international frameworks for cyber governance and security cooperation.

#governance #cyber-security UN disarmament

BASE Fellowship Spring 2026

fellowship ★ 0.55

📅 May 1, 2026 – Jul 31, 2026 📍 Virtual via BASE - Black in AI Safety & Ethics

13-week part-time remote fellowship designed to develop Black researchers, practitioners, and leaders in AI Safety, AI Security, and AI Governance. Program runs in two phases: Weeks 1-5 cover foundations and training curriculum, while Weeks 6-13 focus on research and project development. Organized by Black in AI Safety & Ethics (BASE) to empower Black researchers in the AI safety community.

#alignment #governance #evals

EAGxDC 2026

conference ★ 0.57 Apps closed Apr 28, 2026

📅 May 1, 2026 – May 3, 2026 📍 Washington, USA via EA Global Events

A community-organized regional Effective Altruism conference targeting the policy, research, and public-interest communities across the Washington DC, Maryland, and Virginia area. Designed to help professionals connect with practitioners, explore high-impact career paths, and engage with organizations focused on global challenges. Key topics include global health, AI governance, public policy, and biosecurity.

#governance #alignment #evals

LISA London AI Safety Mixer: How can I make AI go well?

mixer ★ 0.56

📅 Apr 30, 2026 📍 London, UK via London Initiative for Safe AI (LISA), Pivotal Research Fellowship

AI safety mixer for professionals exploring a move into AI safety, hosted by the London Initiative for Safe AI (LISA). A low-pressure evening to explore the field, understand the part you could play, and meet people who are already working in AI safety.

#alignment #community-building #careers #networking

Re-Align Workshop at ICLR 2026

workshop ★ 0.56 Reg closed Apr 19, 2026

📅 Apr 27, 2026 📍 Hybrid via Re-Align Workshop at ICLR

Third edition workshop bringing together researchers from machine learning, neuroscience, and cognitive science to explore representational alignment among artificial and biological information processing systems. This year focuses on what we can do with alignment, emphasizing practical affordances and how alignment transforms static representations into controllable computational primitives.

#interpretability #alignment #measurement-science #cognitive-science workshop ICLR

AIxBio Hackathon 2026

hackathon ★ 0.52 Apps closed May 11, 2026

📅 Apr 24, 2026 – Apr 26, 2026 📍 Hybrid via Apart Research

Three-day hybrid hackathon focusing on AI and biosafety. Organized by Apart Research with in-person hubs in London, Berlin, and San Francisco.

#evals #biosecurity #safety-research #governance #evaluation #safety-applications #evaluations #alignment #sociotechnical-threats hackathon biosecurity hybrid evals past-event

ControlConf 2026 - AI Control Conference

conference ★ 0.71 Apps closed Apr 17, 2026

📅 Apr 18, 2026 – Apr 19, 2026 📍 Berkeley, USA via ControlConf - AI Control Conference

Conference on AI control, focusing on reducing risks from AI misalignment through interventions that are robust even when AI models involved are attempting to undermine those safeguards. Features talks, fireside chats, and breakout discussions with experienced researchers. Free to attend, with a pre-conference workshop on April 17. Organized by Redwood Research & FAR.AI.

#control #alignment #evals

TIAP Conference 2026 - Technical Innovations for AI Policy

conference ★ 0.66

📅 Mar 30, 2026 – Mar 31, 2026 📍 Washington DC, USA via FAR AI - Foundational AI Research

Second annual Technical Innovations for AI Policy Conference, organized by FAR.AI in collaboration with leading think tanks. Focuses on technical innovations that enable AI policy implementation.

#governance #policy #technical-safety #alignment

AI Control Hackathon 2026

hackathon ★ 0.70

📅 Mar 20, 2026 – Mar 22, 2026 📍 Hybrid via Apart Research

A three-day hackathon focused on developing safety measures for potentially misaligned AI systems. Participants work on control protocols and evaluation tools to keep autonomous AI agents contained, addressing oversight challenges as AI systems become more autonomous. Co-organized by Apart Research and Redwood Research (founder of the AI control field), with 700+ participants submitting 126 projects across main competition and specialty tracks.

#control #alignment #evals #adversarial-robustness hackathon control Redwood hybrid

London Alignment Workshop 2026

workshop ★ 0.69 Reg closed Mar 1, 2026

📅 Mar 2, 2026 – Mar 3, 2026 📍 London, UK via FAR AI - Foundational AI Research

FAR.AI hosted event bringing together more than 200 researchers, policymakers, and industry experts to discuss AI alignment topics. Featured presentations on honeypot-based methods for detecting scheming, and training against interpretability-based deception detectors. Victoria Krakovna (Google DeepMind) and Stefan Heimersheim (Google DeepMind) presented.

#alignment #interpretability #control #evals #deception #governance

AIMII - AI Manipulation and Information Integrity Workshop 2026

workshop ★ 0.43

📅 Feb 26, 2026 📍 Paris, France via AIMII - AI Manipulation and Information Integrity Workshop

The inaugural AIMII workshop brings together researchers across disciplines to examine how generative AI models influence information creation and access, clarifying core concepts, evaluating evidence on AI's persuasive and manipulative capabilities, and exploring implications for society and democracy.

#governance #safety #alignment #evals #manipulation #cognitive-science #policy workshop interdisciplinary

AIMII Workshop 2026: AI, Manipulation, and Information Integrity

workshop ★ 0.50

📅 Feb 26, 2026 📍 Paris, France via AIMII - AI Manipulation and Information Integrity Workshop

Half-day workshop at IASEAI'26 bringing together researchers from computer science, cognitive science, philosophy, political science, and policy to examine AI's manipulative capabilities and information integrity concerns. Features three panel discussions with leading researchers, plus a poster session showcasing 24 accepted posters and 5 hackathon-winning projects addressing definitions and taxonomies of persuasion, manipulation, and deception.

#governance #evals #sociotechnical-risk #alignment #manipulation #safety-research #evaluation #societal-risk #evaluations #deception #information-integrity #policy past-event UNESCO interdisciplinary

SPAR Spring 2026

fellowship ★ 0.59 Apps closed Jan 14, 2026

📅 Feb 16, 2026 – May 16, 2026 📍 Virtual via SPAR - Supervised Program for Alignment Research

A part-time, remote research fellowship enabling aspiring AI safety and policy researchers to collaborate with professionals on impactful projects addressing risks from artificial intelligence. Participants commit 5-40 hours weekly for approximately 3 months, culminating in a Demo Day presentation. Mentors include professionals from Google DeepMind, RAND, Apollo Research, MATS, UK AISI.

#alignment #interpretability #governance #safety #evals fellowship remote mentorship

SPAR Spring 2026: Mentee Application

fellowship ★ 0.63 Apps closed Jan 14, 2026

📅 Feb 16, 2026 – May 16, 2026 📍 Virtual via SPAR - Supervised Program for Alignment Research

SPAR (Supervised Program for Alignment Research) Spring 2026 mentee track. Part-time remote research program pairing aspiring researchers with experienced mentors from Google DeepMind, RAND, Apollo Research, MATS, UK AISI for three-month projects. Mentees commit 5-40 hours per week. Research period: February 16 - May 16. Mentee decisions were sent out February 2-6. Applications for Spring 2026 have closed.

#alignment #governance #evals #safety-research #interpretability #technical-safety #safety remote mentorship

SPAR Spring 2026: Mentor Application

fellowship ★ 0.66

📅 Feb 16, 2026 – May 16, 2026 📍 Virtual via SPAR - Supervised Program for Alignment Research

Part-time remote research fellowship pairing aspiring researchers with 130+ experienced mentors from Google DeepMind, RAND, Apollo Research, MATS, UK AISI, and other top organizations for three-month AI alignment projects. Participants commit 5-40 hours weekly. Covers project expenses including compute and API/LLM access. Culminates in a virtual Demo Day with prizes totaling $7,000. Optional continuation beyond May 16. This listing is the mentor application track, in which experienced researchers apply to mentor a project. The mentor application deadline was December 5, 2025.

#alignment #governance #evals #safety-research #interpretability #technical-safety remote mentorship

EA Global: San Francisco 2026

conference ★ 0.54 Apps closed Feb 1, 2026

📅 Feb 13, 2026 – Feb 15, 2026 📍 San Francisco, USA via EA Global Events

A three-day conference bringing together the effective altruism community to share new thinking and research, coordinate on global projects, and network. Features keynote speakers, talks, workshops, and social activities focused on addressing pressing global problems including AI safety. Friday evening opening reception, full-day Saturday and Sunday programming.

#governance #alignment #evals

International Programme on AI Evaluation 2026

fellowship ★ 0.52 Apps closed Jan 15, 2026

📅 Feb 1, 2026 – May 31, 2026 📍 Valencia, Spain · Hybrid via International Programme on AI Evaluation

A fully-funded academic programme on AI evaluation combining technical training with policy and governance perspectives. 40 scholars globally receive 90 hours online, 20 hours hands-on courses, and a 40-hour in-person capstone week in Valencia, earning a 15 ECTS Expert Diploma via ValgrAI.

#evals #safety-research #governance #policy #alignment fellowship evals academic hybrid diploma funded prestigious evaluation scholarship

Technical AI Governance Hackathon - Berkeley

hackathon ★ 0.59

📅 Jan 30, 2026 – Feb 1, 2026 📍 Berkeley, USA via Apart Research

A three-day hackathon bringing together builders to create verification systems, compliance infrastructure, and coordination tools for international AI governance. Five focus tracks: Hardware Verification & Attestation, Compliance Infrastructure & Privacy-Preserving Proofs, Risk Thresholds & Compute Verification, International Verification & Coordination, and Research Governance & Dual-Use Detection. Organized by Apart Research in partnership with MIRI Technical Governance Team and Lucid Computing.

#governance #evals #scaling-infrastructure

AIGOV - AI Governance Workshop at AAAI 2026

workshop ★ 0.48 Reg closed Jan 26, 2026

📅 Jan 26, 2026 📍 Singapore, Singapore · Hybrid via AIGOV - AI Governance Workshop at AAAI

The 3rd International AI Governance Workshop focuses on Alignment, Morality, Law, and Design. This full-day hybrid workshop brings together researchers, industry, and policy communities to bridge technical alignment methods, policy frameworks, and ethical implementation of AI governance.

#governance #alignment #policy #evals #safety AAAI workshop

Algoverse AI Safety Fellowship 2026

fellowship ★ 0.55 Apps closed Jan 4, 2026

📅 Jan 26, 2026 – May 1, 2026 📍 Virtual via Algoverse AI Research

Twelve-week intensive AI safety research fellowship with a commitment of 25+ hours per week. Two-stage selection: 60 participants are accepted for a trial week (Jan 19-23) of foundational coursework in RLHF, interpretability, SAEs, and adversarial robustness, then 30 are selected as Research Fellows based on performance. Fully funded by Open Philanthropy, covering tuition, computing infrastructure access, mentorship, and limited conference travel support.

#alignment#interpretability

AI Safety Camp 11 (Virtual Edition 2026)

fellowship ★ 0.66

📅 Jan 10, 2026 – Apr 27, 2026 📍 Virtual via AI Safety Camp (AISC)

Three-month online AI safety research program where participants form teams to work on pre-selected projects. Opening weekend January 10-11, projects run through April 19, final presentations April 24-27. Features 27 projects across seven themes: Stop/Pause AI (5), Policy/Governance (5), Evaluate Risks from AI (5), Mech-Interp (2), Agent Foundations (4), Alternative LLM Safety (5), and Safe by Design AIs (1). Participants work 10 hours per week with weekly team meetings. Seven-year track record with alumni founding 10 organizations and securing 43 jobs in AI Safety. Application deadline November 23, 2025.

#alignment#governance#interpretability#technical-safety#community-building#mechanistic-interpretability

AI Safety Camp 11 (AISC11)

fellowship ★ 0.61 Apps closed Nov 23, 2025

📅 Jan 10, 2026 – Apr 27, 2026 📍 Virtual via AI Safety Camp (AISC)

Three-month online program where participants form teams to work on pre-selected AI Safety projects. Opening weekend is January 10-11, projects run through April 19, and final presentations are April 24-27. Three tracks available covering alignment research, governance, policy, and stop/pause advocacy. The team member application deadline was November 23.

#alignment#interpretability#governance#evals#control

Sydney AI Safety Fellowship 2026

fellowship ★ 0.52

📅 Jan 10, 2026 – Mar 1, 2026 📍 Sydney, Australia · Hybrid via Sydney AI Safety Fellowship

10-week hybrid fellowship in Sydney, Australia, including a 7-week in-person component with 2 days per week of in-person attendance. Targets strongly motivated, highly agentic, immensely talented people from diverse backgrounds, from technical researchers to governance specialists to entrepreneurs. Focuses on helping participants develop strategic awareness and identify their optimal contribution path to AI safety. Includes weekly customized discussions, expert speakers, mentorship, co-working space with compute access, social events, and potential flight reimbursement for top candidates.

#alignment#governance#strategy fellowship

CAMBRIA 2026 - ML Bootcamp for Interpretability and Alignment

fellowship ★ 0.62

📅 Jan 5, 2026 – Jan 23, 2026 📍 Cambridge, USA via CAMBRIA - Cambridge Bootcamp for Research in Interpretability and Alignment

Cambridge Bootcamp for Research in Interpretability and Alignment: 3-week ML upskilling bootcamp for AI safety, focusing on interpretability and RL. Based on ARENA curriculum. Run by Cambridge Boston Alignment Initiative in Cambridge, Massachusetts. In-person intensive programme.

#interpretability#alignment

FIG Fellowship December 2025 Cohort

fellowship ★ 0.54 Apps closed Oct 21, 2025

📅 Dec 1, 2025 – Mar 1, 2026 📍 Virtual via FIG Fellowship - Future Impact Group

Part-time, remote-first, 12-week fellowship by Future Impact Group for students and early-career researchers. Requires 8+ hours/week on research projects in AI policy, philosophy for safe AI (technical safety and ethical foundations), or AI sentience. Provides co-working sessions, issue troubleshooting, career guidance, networking, and guest speakers.

#governance#alignment

FIG Fellowship Fall 2025

fellowship ★ 0.51 Apps closed Oct 21, 2025

📅 Dec 1, 2025 – Mar 1, 2026 📍 Virtual via FIG Fellowship - Future Impact Group

Part-time, remote-first, 12-week research fellowship where participants work as research associates on specific projects under experienced supervision. Focus areas: AI governance, technical AI safety, and digital sentience. Time commitment: 8+ hours per week. Includes co-working sessions, issue troubleshooting, career guidance, opening and closing events, networking opportunities, research sprints, and guest speakers.

#governance#alignment#digital-sentience