Bringing together multiple faith perspectives on AI governance and security implications, exploring ethical dimensions of advanced AI through inter-religious dialogue.
Past events
Sorted by date, newest first. Useful as a memory of what's happened in the community.
AI Risk Content Hackathon organized by BlueDot Impact in London. Focus on creating AI safety and risk communication content. Part of BlueDot's broader mission to build the workforce needed to safely navigate AGI.
Foresight Institute flagship event gathering leading scientists, entrepreneurs, funders, and policymakers to explore the frontiers of science and technology. Multiple tracks including AI safety topics.
Flagship conference where leading scientists, entrepreneurs, funders, and policymakers convene to explore frontier technology and plan for beneficial futures. Includes AI safety track as part of broader focus on transformative technology. 40-year-old organization focused on beneficial technology development.
Foresight Institute Vision Weekend with frontier science and technology tracks including AI safety. 40-year-old organization focused on transformative technology. Three-day event in London featuring AI safety programming alongside other frontier tech tracks.
Five-day intensive programme for AI safety founders going from idea to funded. Successful pitches receive ยฃ50k in equity-free seed funding. Part of BlueDot Impact's incubator and rapid-funding initiatives supporting concrete AI safety work.
EA Global conference series organized by Centre for Effective Altruism. Speakers present research on effective altruism including heavy AI safety programming. Features talks, workshops, and networking opportunities. In scope for social and community event reasons with significant AI safety attendance.
Hackathon focused on secure program synthesis by Apart Research. Hybrid format with online participation and in-person hubs.
3-week ML upskilling bootcamp for AI safety focusing on interpretability and RL, based on ARENA curriculum. Run by Cambridge Boston Alignment Initiative. Provides housing, meals, 24/7 office access, dedicated teaching assistants, and travel support to participants.
Technical workshop in partnership with the Center for AI Safety focusing on infrastructure for secure and verifiable AI, bringing together researchers, builders, and funders across ML, hardware security, systems, cryptography, and computer security.
Third iteration of Technical AI Safety Conference, inaugural UK-hosted event. One-day conference organized by Oxford Martin AI Governance Initiative and Noeon Research. Welcomes attendees from all backgrounds with varying research experience levels. Free admission. Sponsored by MATS and Apart Research, supported by IASEAI, Foresight Institute, and OAISI.
A free, accessible workshop hosted by AI Safety Awareness Group Oakland exploring AI's trajectory and societal impact. No technical background required. Features live demonstrations of current AI systems, interactive forecasting activities, and discussions about AI's implications for work, relationships, and society over the next 1-5 years.
Two-day conference on global cooperation for cybersecurity resilience and stability. Organized by United Nations Institute for Disarmament Research. Addresses international frameworks for cyber governance and security cooperation.
A community-organized regional Effective Altruism conference targeting the policy, research, and public-interest communities across the Washington DC, Maryland, and Virginia area. Designed to help professionals connect with practitioners, explore high-impact career paths, and engage with organizations focused on global challenges. Key topics include global health, AI governance, public policy, and biosecurity.
AI safety mixer for professionals exploring a move into AI safety, hosted by the London Initiative for Safe AI (LISA). A low-pressure evening to explore the field, understand the part you could play, and meet people who are already working in AI safety. Designed for professionals interested in transitioning into the AI safety field.
Workshop on representational alignment between artificial and biological information processing systems at ICLR 2026. Two themes: Neural Control (when does representational alignment allow meaningful intervention on system behavior), and Downstream Behavior (reconfiguring features for new tasks). Includes challenge reports for leaderboard benchmarks. Double-blind review, reciprocal reviewer policy.
Three-day hybrid hackathon focusing on AI and biosafety. Organized by Apart Research with in-person hubs in London, Berlin, and San Francisco.
Second annual AI control conference organized by Redwood Research and FAR.AI. Focuses on reducing risks from AI misalignment through interventions robust even when AI models attempt to undermine safeguards. Features speakers from Redwood Research, METR, Anthropic, and CMU. Pre-conference workshop on April 17 for those new to AI control research. Free to attend with application required.
Conference connecting policymakers with leading AI technical experts to discuss policy innovations. Organized by FAR.AI. Focus on bridging technical AI safety research with policy implementation.
A three-day hackathon focused on developing safety measures for potentially misaligned AI systems. Participants work on control protocols and evaluation tools to keep autonomous AI agents contained, addressing oversight challenges as AI systems become more autonomous. Co-organized by Apart Research and Redwood Research (founder of the AI control field), with 700+ participants submitting 126 projects across main competition and specialty tracks.
FAR.AI hosted event bringing together more than 200 researchers, policymakers, and industry experts to discuss AI alignment topics. Featured presentations on honeypot-based methods for detecting scheming, and training against interpretability-based deception detectors. Victoria Krakovna (Google DeepMind) and Stefan Heimersheim (Google DeepMind) presented.
First AI, Manipulation, & Information Integrity (AIMII) Workshop at IASEAI'26. Convenes researchers across computer science, cognitive science, philosophy, political science, and policy to examine how generative AI models affect information creation and access, with focus on manipulation, deception, and the integrity of public discourse. Features three panel discussions and poster session.
Workshop on AI manipulation and information integrity at IASEAI'26. Three panel discussions plus poster session bringing together researchers from computer science, cognitive science, philosophy, political science, and policy to clarify core concepts, evaluate evidence on AI's persuasive and manipulative capabilities, and explore implications for society and democracy. Features 22 accepted poster abstracts and 5 hackathon-winning projects on topics including LLM persuasion capabilities, detection methods, psychological impacts, and policy frameworks.
Supervised Program for Alignment Research: 3-month part-time remote research program pairing aspiring researchers with experienced mentors from Google DeepMind, RAND, Apollo Research, MATS, UK AISI, etc. Accepts mentees at various experience levels without requiring prior research experience, though technical or policy backgrounds preferred. Optional continuation after May 16.
SPAR (Supervised Program for Alignment Research) Spring 2026 mentee track. Part-time remote research program pairing aspiring researchers with experienced mentors from Google DeepMind, RAND, Apollo Research, MATS, UK AISI for three-month projects. Mentees commit 5-40 hours per week. Research period: February 16 - May 16. Mentee decisions were sent out February 2-6. Applications for Spring 2026 have closed.
Part-time remote research fellowship pairing aspiring researchers with 130+ experienced mentors from Google DeepMind, RAND, Apollo Research, MATS, UK AISI, and other top organizations for three-month AI alignment projects. Participants commit 5-40 hours weekly. Covers project expenses including compute and API/LLM access. Culminates in virtual Demo Day with prizes totaling $7,000. Optional continuation beyond May 16. Mentor application track: experienced researchers from Google DeepMind, RAND, Apollo Research, MATS, UK AISI etc. apply to mentor a project. Mentor application deadline 2025-12-05 (passed).
A three-day conference bringing together the effective altruism community to share new thinking and research, coordinate on global projects, and network. Features keynote speakers, talks, workshops, and social activities focused on addressing pressing global problems including AI safety. Friday evening opening reception, full-day Saturday and Sunday programming.
First global academic programme focused on AI evaluation capabilities and safety. 150-hour programme including 90 hours online lectures and networking, 20 hours hands-on courses, and 40-hour in-person capstone week in Valencia. Fully funded scholarships for 40 participants selected globally. Faculty from Cambridge, Stanford, Princeton, EU AI Office, UK AI Safety Institute, FAR AI, and Apollo Research. Leads to 15 ECTS Expert Diploma via ValgrAI.
A three-day hackathon bringing together builders to create verification systems, compliance infrastructure, and coordination tools for international AI governance. Five focus tracks: Hardware Verification & Attestation, Compliance Infrastructure & Privacy-Preserving Proofs, Risk Thresholds & Compute Verification, International Verification & Coordination, and Research Governance & Dual-Use Detection. Organized by Apart Research in partnership with MIRI Technical Governance Team and Lucid Computing.
Third International AI Governance Workshop bringing together researchers, industry practitioners, and policy experts to address AI governance challenges. Topics include technical AI governance, policy frameworks, ethical implementation, agentic AI systems, human-AI collaboration, and safety challenges. Full-day hybrid event at AAAI 2026.
Three-month online AI safety research program where participants form teams to work on pre-selected projects. Opening weekend January 10-11, projects run through April 19, final presentations April 24-27. Features 27 projects across six themes: Stop/Pause AI (5), Policy/Governance (5), Evaluate Risks from AI (5), Mech-Interp (2), Agent Foundations (4), Alternative LLM Safety (5), and Safe by Design AIs (1). Participants work 10 hours per week with weekly team meetings. Seven-year track record with alumni founding 10 organizations and securing 43 jobs in AI Safety. Application deadline November 23, 2025.
Online part-time AI safety research program featuring 27 public projects across diverse approaches, from stop/pause AI to mech-interp, agent foundations, policy/governance, and evals. Collaborators work 10 hours per week in teams with weekly meetings, culminating in final presentations at an online conference. Projects cover alignment, interpretability, governance, and evaluations with diverse theories of change.
Cambridge Bootcamp for Research in Interpretability and Alignment: 3-week ML upskilling bootcamp for AI safety, focusing on interpretability and RL. Based on ARENA curriculum. Run by Cambridge Boston Alignment Initiative in Cambridge, Massachusetts. In-person intensive programme.
Part-time, remote-first, 12-week fellowship by Future Impact Group for students and early-career researchers. Minimum 8+ hours/week on research projects in AI policy, philosophy for safe AI (technical safety and ethical foundations), or AI sentience. Provides co-working sessions, issue troubleshooting, career guidance, networking, and guest speakers.