Applied Cognitive Oversight Tracker

Upcoming events

Events, training, fellowships, mixers, and CFPs across Applied Cognitive Oversight, instruments for epistemic access to model cognition, and the broader AI safety, alignment, and governance community. Refreshed weekly. Sorted by salience. What's ACO?

Recently added

type
format
topic

Australian AI Safety Forum 2026

conference ★ 0.92 CFP closes Jun 15, 2026 ?
📅 Jul 7, 2026 – Jul 8, 2026 📍 Sydney, Australia via Australian AI Safety Forum

Two-day interdisciplinary forum exploring technical AI safety challenges, governance approaches, risk assessment, evaluation standards, and cross-sector collaboration. Grounded in the International AI Safety Report. Features keynotes, panels, workshops, and networking. Organized by Gradient Institute with RAND, Timaeus, Good Ancestors, and CSIRO Data61. Builds on Australia's AI Safety Institute and National AI Plan.

#evals#governance#evaluation#measurement-science#alignment#evaluations#policy#technical-safety#safety interdisciplinary
Salience signals
{
  "type_weight": 1,
  "source_trust": 0.75,
  "topic_relevance": 0.9,
  "time_proximity": 0.9849056603773585,
  "community_signal": 0.75,
  "speaker_org_signal": 0.8,
  "is_deadline_open": 1,
  "source_count": 1
}

UNIDIR Global Conference on AI, Security and Ethics 2026

conference ★ 0.79 Reg closes Jun 15, 2026 ?

UNIDIR flagship conference spotlighting how artificial intelligence is reshaping global security. Brings together experts to discuss AI's intersection with peace and security issues. Part of UNIDIR's Strategic Framework 2026-2030 emphasis on AI as primary focus area. Organization produces 132 publications yearly and reaches 193 states through research with 20,000+ annual event participants.

#governance#sociotechnical-threats#policy#sociotechnical-threat-surface#safety
Salience signals
{
  "type_weight": 1,
  "source_trust": 0.85,
  "topic_relevance": 0.8,
  "time_proximity": 0.5428571428571429,
  "community_signal": 0.7,
  "speaker_org_signal": 0.8,
  "is_deadline_open": 1,
  "source_count": 1
}

Global South AI Safety Hackathon 2026

hackathon ★ 0.77 Reg closes Jun 19, 2026
📅 Jun 19, 2026 – Jun 21, 2026 📍 Hybrid via Apart Research

Regional AI safety competition focusing on tools, evaluations, and policy research from Latin America, Africa, or Asia. Sponsored by Schmidt Sciences with pipeline to fellowship and placement opportunities. Online and in-person format enabling global participation from the Global South.

#alignment#safety#evaluation#governance#evals
Salience signals
{
  "type_weight": 0.65,
  "source_trust": 0.85,
  "topic_relevance": 0.85,
  "time_proximity": 0.5714285714285714,
  "community_signal": 0.75,
  "speaker_org_signal": 0.8,
  "is_deadline_open": 1,
  "source_count": 1
}

Cambridge ERA:AI Fellowship 2026

fellowship ★ 0.93 Apps close Jun 20, 2026 ?
📅 Jul 6, 2026 – Sep 11, 2026 📍 Cambridge, United Kingdom via ERA Cambridge: Existential Risk Alliance Fellowship

10-week fellowship for researchers and entrepreneurs working on mitigating risks from frontier AI. Targets talented individuals at any career stage interested in AI safety and governance research. Provides competitive stipend, meals during working hours, and coverage for transport, visas, and lodging. Hosts 30+ events with mentorship from expert researchers.

#alignment#governance#technical-safety#safety
Salience signals
{
  "type_weight": 0.8,
  "source_trust": 0.85,
  "topic_relevance": 0.9,
  "time_proximity": 0.989937106918239,
  "community_signal": 0.8,
  "speaker_org_signal": 0.85,
  "is_deadline_open": 1,
  "source_count": 1
}

CAMBRIA Summer 2026: July Cohort

fellowship ★ 0.93 Apps close Jun 28, 2026 ?
📅 Jul 6, 2026 – Jul 24, 2026 📍 New York, USA via CAMBRIA - Cambridge Bootcamp for Research in Interpretability and Alignment

3-week ML upskilling bootcamp for AI safety focusing on interpretability and RL, based on ARENA curriculum. Run by Cambridge Boston Alignment Initiative. Provides housing, meals, 24/7 office access, dedicated teaching assistants, and travel support to participants.

#interpretability#alignment
Salience signals
{
  "type_weight": 0.8,
  "source_trust": 0.85,
  "topic_relevance": 0.9,
  "time_proximity": 0.989937106918239,
  "community_signal": 0.8,
  "speaker_org_signal": 0.85,
  "is_deadline_open": 1,
  "source_count": 1
}

Anthropic Fellows Program July 2026

fellowship ★ 0.91 Apps close Jun 30, 2026 ?
📅 Jul 20, 2026 – Nov 20, 2026 📍 In-person via Anthropic Alignment Blog

Four-month full-time paid AI safety research fellowship providing funding, compute (~$15k/month), and direct mentorship from Anthropic researchers to work on real safety and security projects. Focus areas include scalable oversight, adversarial robustness, and other AI safety topics. Over 40% of fellows from first cohort joined Anthropic full-time.

#alignment#technical-safety#interpretability#evals#control#adversarial-robustness#scaling-infrastructure
Salience signals
{
  "type_weight": 0.8,
  "source_trust": 0.95,
  "topic_relevance": 0.95,
  "time_proximity": 0.7786163522012579,
  "community_signal": 0.9,
  "speaker_org_signal": 0.95,
  "is_deadline_open": 0,
  "source_count": 2
}

ACL 2026 Workshop on Evaluating AI in Practice

workshop ★ 0.72 Reg closes Jul 2, 2026 ?
📅 Jul 4, 2026 📍 San Diego, USA via EvalEval Coalition

Workshop on Evaluating Evaluations (EvalEval) at ACL 2026. Focuses on evaluation methodology, measurement theory, validity, reliability, reproducibility, infrastructure costs, and sociotechnical impacts including bias and privacy. Features panel with model developers, oral presentations, posters, and Every Eval Ever shared task standardizing results across 22,000+ models.

#evals#safety-research#measurement#evaluation-methodology#evaluation#alignment ACL-workshoptwo-day
Salience signals
{
  "type_weight": 0.85,
  "source_trust": 0.75,
  "topic_relevance": 0.85,
  "time_proximity": 1,
  "community_signal": 0.7,
  "speaker_org_signal": 0.75,
  "is_deadline_open": 0,
  "source_count": 1
}

Seoul Alignment Workshop 2026

workshop ★ 0.82 Reg closes Jul 5, 2026 ?

Workshop bringing together global leaders in academia and industry to explore risk mitigation strategies for artificial general intelligence. Organized by FAR.AI. Co-located with ICML 2026 in Seoul.

#alignment#governance#control#interpretability#technical-safety#agi-risk#evals#agi-safety
Salience signals
{
  "type_weight": 0.85,
  "source_trust": 0.9,
  "topic_relevance": 0.9,
  "time_proximity": 0.9547169811320755,
  "community_signal": 0.8,
  "speaker_org_signal": 0.9,
  "is_deadline_open": 0,
  "source_count": 1
}

AI Safety New Zealand Conference 2026

conference ★ 0.90 Reg closes Jul 10, 2026 ?
📅 Jul 12, 2026 📍 Christchurch, New Zealand via BlueDot Impact Events Calendar (Luma)

AI safety conference in New Zealand organized by BlueDot Impact community. First major AI safety conference in New Zealand, bringing together researchers, practitioners, and policy experts in the region to discuss technical safety, governance, and alignment challenges.

#safety#governance#alignment
Salience signals
{
  "type_weight": 1,
  "source_trust": 0.85,
  "topic_relevance": 0.85,
  "time_proximity": 0.959748427672956,
  "community_signal": 0.75,
  "speaker_org_signal": 0.75,
  "is_deadline_open": 1,
  "source_count": 1
}

Secure & Sovereign AI Workshop 2026

workshop ★ 0.80 Reg closes Jul 10, 2026 ?
📅 Jul 18, 2026 – Jul 19, 2026 📍 Berlin, Germany via Foresight Institute , Foresight Institute

Foresight Institute technical workshop bringing together researchers, engineers, and entrepreneurs in computer science, ML, crypto, security, and related fields. Focus on secure AI development, safety infrastructure, and sovereignty considerations for AI systems. Part of Foresight's Secure AI programme supporting frontier science approaches to AI safety.

#governance#evals#alignment#safety-research#security#control#technical-safety#ai-security#adversarial-robustness#scaling-infrastructure#safety technicalBerlin
Salience signals
{
  "type_weight": 0.85,
  "source_trust": 0.75,
  "topic_relevance": 0.8,
  "time_proximity": 0.929559748427673,
  "community_signal": 0.7,
  "speaker_org_signal": 0.75,
  "is_deadline_open": 1,
  "source_count": 1
}

ARC White-Box Estimation Challenge 2026

other ★ 0.77 Reg closes Jul 31, 2026

Competition focused on improving estimation algorithms for random MLPs. Partnership between ARC (Alignment Research Center) and AIcrowd. Warm-up round focuses on developing theoretical foundations for mechanistic explanations of neural network behavior by analyzing weights directly rather than relying on sampling methods.

#interpretability#evals#control#alignment
Salience signals
{
  "type_weight": 0.35,
  "source_trust": 0.95,
  "topic_relevance": 0.9,
  "time_proximity": 0.5428571428571429,
  "community_signal": 0.75,
  "speaker_org_signal": 0.9,
  "is_deadline_open": 1,
  "source_count": 1
}

Iliad August Intensive 2026

fellowship ★ 0.88 Apps close Aug 1, 2026 ?
📅 Aug 10, 2026 – Aug 28, 2026 📍 Berkeley, USA via LessWrong Events

3-week intensive program in applied mathematics for AI alignment by Iliad. Based in Berkeley at Lighthaven. Provides $5,000 support. Selection based on estimated mathematical strength. Part of Iliad's series of intensive and fellowship programs.

#alignment#formal-foundations
Salience signals
{
  "type_weight": 0.8,
  "source_trust": 0.9,
  "topic_relevance": 0.9,
  "time_proximity": 0.7433962264150944,
  "community_signal": 0.8,
  "speaker_org_signal": 0.8,
  "is_deadline_open": 1,
  "source_count": 1
}

CAMBRIA Summer 2026: August Cohort

fellowship ★ 0.89 Apps close Aug 2, 2026 ?
📅 Aug 10, 2026 – Aug 28, 2026 📍 Cambridge, USA via CAMBRIA - Cambridge Bootcamp for Research in Interpretability and Alignment

3-week ML upskilling bootcamp for AI safety focusing on interpretability and RL, based on ARENA curriculum. Run by Cambridge Boston Alignment Initiative. Provides housing, meals, 24/7 office access, dedicated teaching assistants, and travel support to participants.

#interpretability#alignment
Salience signals
{
  "type_weight": 0.8,
  "source_trust": 0.85,
  "topic_relevance": 0.9,
  "time_proximity": 0.8138364779874214,
  "community_signal": 0.8,
  "speaker_org_signal": 0.85,
  "is_deadline_open": 1,
  "source_count": 1
}

EAGx Berkeley 2026

conference ★ 0.83 Apps close Aug 7, 2026 ?
📅 Aug 21, 2026 – Aug 23, 2026 📍 Berkeley, USA via EA Global Events

Regional EA Global conference with heavy AI safety attendance. Berkeley location draws alignment researchers, interpretability practitioners, and governance community. In scope for social and community networking reasons.

#alignment#governance#safety
Salience signals
{
  "type_weight": 1,
  "source_trust": 0.8,
  "topic_relevance": 0.75,
  "time_proximity": 0.7584905660377359,
  "community_signal": 0.85,
  "speaker_org_signal": 0.8,
  "is_deadline_open": 1,
  "source_count": 1
}

Iliad Fall Fellowship 2026

fellowship ★ 0.85 Apps close Sep 1, 2026 ?
📅 Sep 7, 2026 – Dec 4, 2026 📍 London, UK via LessWrong Events

3-month mentored research fellowship in applied mathematics for AI alignment by Iliad. Based in London at LISA. Provides $6,000/month stipend. Selection based on estimated mathematical strength.

#alignment#formal-foundations
Salience signals
{
  "type_weight": 0.8,
  "source_trust": 0.9,
  "topic_relevance": 0.9,
  "time_proximity": 0.6025157232704402,
  "community_signal": 0.8,
  "speaker_org_signal": 0.8,
  "is_deadline_open": 1,
  "source_count": 1
}

Iliad September Intensive 2026

fellowship ★ 0.85 Apps close Sep 1, 2026 ?
📅 Sep 7, 2026 – Oct 2, 2026 📍 London, UK via LessWrong Events

4-week intensive program in applied mathematics for AI alignment by Iliad. Based in London at LISA. Provides $5,000 support. Selection based on estimated mathematical strength.

#alignment#formal-foundations
Salience signals
{
  "type_weight": 0.8,
  "source_trust": 0.9,
  "topic_relevance": 0.9,
  "time_proximity": 0.6025157232704402,
  "community_signal": 0.8,
  "speaker_org_signal": 0.8,
  "is_deadline_open": 1,
  "source_count": 1
}

NeurIPS 2026

conference ★ 0.70 CFP closes Sep 24, 2026 ?
📅 Dec 6, 2026 – Dec 12, 2026 📍 In-person via NeurIPS: Safety-related Workshops

Neural Information Processing Systems conference with three satellite locations. Includes dedicated Ethics Review track, Reproducibility as official track, and Responsible AI metadata requirements in Evaluations and Datasets Track. Features Code of Conduct and Code of Ethics fostering diverse, inclusive community. Safety-related workshops cross-referenced with keywords.

#interpretability#evals#alignment#ml-research#safety-research#mechanistic-interpretability#machine-learning#evaluation#governance#safety major-conferencemulti-location
Salience signals
{
  "type_weight": 1,
  "source_trust": 0.85,
  "topic_relevance": 0.7,
  "time_proximity": 0.22012578616352196,
  "community_signal": 0.8,
  "speaker_org_signal": 0.75,
  "is_deadline_open": 1,
  "source_count": 1
}

Iliad October Intensive 2026

fellowship ★ 0.83 Apps close Oct 1, 2026 ?
📅 Oct 5, 2026 – Oct 30, 2026 📍 Berkeley, USA via LessWrong Events

4-week intensive program in applied mathematics for AI alignment by Iliad. Based in Berkeley. Provides $5,000 support. Selection based on estimated mathematical strength.

#alignment#formal-foundations
Salience signals
{
  "type_weight": 0.8,
  "source_trust": 0.9,
  "topic_relevance": 0.9,
  "time_proximity": 0.46163522012578617,
  "community_signal": 0.8,
  "speaker_org_signal": 0.8,
  "is_deadline_open": 1,
  "source_count": 1
}

EA Global: New York City 2026

conference ★ 0.74 Apps close Oct 4, 2026
📅 Oct 16, 2026 – Oct 18, 2026 📍 New York, USA via EA Global Events

EA Global conference with heavy AI safety attendance and programming. Annual gathering bringing together effective altruism community members working on global priorities including AI safety, alignment, and governance. In scope for social and community event reasons.

#alignment#safety#governance
Salience signals
{
  "type_weight": 1,
  "source_trust": 0.8,
  "topic_relevance": 0.7,
  "time_proximity": 0.4767295597484277,
  "community_signal": 0.85,
  "speaker_org_signal": 0.75,
  "is_deadline_open": 1,
  "source_count": 1
}

Iliad November Intensive 2026

fellowship ★ 0.80 Apps close Nov 1, 2026 ?
📅 Nov 2, 2026 – Nov 27, 2026 📍 London, UK via LessWrong Events

4-week intensive program in applied mathematics for AI alignment by Iliad. Based in London at LISA. Provides $5,000 support. Selection based on estimated mathematical strength.

#alignment#formal-foundations
Salience signals
{
  "type_weight": 0.8,
  "source_trust": 0.9,
  "topic_relevance": 0.9,
  "time_proximity": 0.320754716981132,
  "community_signal": 0.8,
  "speaker_org_signal": 0.8,
  "is_deadline_open": 1,
  "source_count": 1
}

Vision Weekend USA 2026

conference ★ 0.67 Reg closes Nov 5, 2026 ?
📅 Nov 13, 2026 – Nov 15, 2026 📍 San Francisco, USA via Foresight Institute

Foresight Institute flagship Vision Weekend conference gathering leading scientists, entrepreneurs, funders, and policymakers to explore frontiers of science and technology. Features AI safety track alongside other frontier science themes. 40-year-old organization focused on transformative technology with established community presence.

#safety
Salience signals
{
  "type_weight": 1,
  "source_trust": 0.75,
  "topic_relevance": 0.7,
  "time_proximity": 0.3358490566037735,
  "community_signal": 0.7,
  "speaker_org_signal": 0.75,
  "is_deadline_open": 1,
  "source_count": 1
}

GovAI DC Summer Fellowship 2026

fellowship ★ 0.96 Apps closed Mar 1, 2026 ?
📅 Jun 8, 2026 – Aug 28, 2026 📍 Washington DC, USA via GovAI - Centre for the Governance of AI

Three-month bipartisan fellowship designed to launch or accelerate impactful careers in American AI governance and policy. Participants deepen understanding of the field, connect with network of experts, and build skills and professional profile. $21,000 stipend. Alumni have secured positions at leading AI companies (DeepMind, OpenAI, Anthropic).

#governance#policy fellowshippolicygovernanceDC
Salience signals
{
  "type_weight": 0.8,
  "source_trust": 0.9,
  "topic_relevance": 1,
  "time_proximity": 0.929559748427673,
  "community_signal": 0.9,
  "speaker_org_signal": 1,
  "is_deadline_open": 0,
  "source_count": 2
}

GovAI Summer Fellowship 2026 - Research Track

fellowship ★ 0.96 Apps closed Jan 4, 2026 ?
📅 Jun 8, 2026 – Aug 28, 2026 📍 London, UK via GovAI - Centre for the Governance of AI

Three-month fellowship where fellows conduct independent research on AI governance topic of their choice with mentorship from leading experts. £12,000 stipend. GovAI was founded to help decision-makers navigate the transition to advanced AI through rigorous research and talent fostering. Alumni have secured positions at DeepMind, OpenAI, Anthropic.

#governance#policy fellowshipresearchgovernanceLondon
Salience signals
{
  "type_weight": 0.8,
  "source_trust": 0.9,
  "topic_relevance": 1,
  "time_proximity": 0.929559748427673,
  "community_signal": 0.9,
  "speaker_org_signal": 1,
  "is_deadline_open": 0,
  "source_count": 2
}

CBAI Summer Research Fellowship 2026

fellowship ★ 0.95 Apps closed Apr 12, 2026 ?

Nine-week AI safety research fellowship run by Cambridge Boston Alignment Initiative. Accepts 30 fellows (undergraduate, Master's, PhD students, postdocs, and recent graduates). Includes $10,000 stipend, accommodation in Harvard dorms, meals, workspace access, and up to $10,000 in compute credits. Applications reviewed on rolling basis through four-stage process. International students on OPT/CPT eligible but visa sponsorship not available.

#alignment#interpretability#governance#evals#biosecurity#safety-research fellowshipresearchCambridgeHarvardstipendhousingcompute-credits
Salience signals
{
  "type_weight": 0.8,
  "source_trust": 0.85,
  "topic_relevance": 0.95,
  "time_proximity": 0.9345911949685535,
  "community_signal": 0.8,
  "speaker_org_signal": 0.85,
  "is_deadline_open": 1,
  "source_count": 1
}

BlueDot Technical AI Safety Project Sprint May 2026

workshop ★ 0.92 Apps closed May 10, 2026
📅 May 18, 2026 – Jun 21, 2026 📍 Virtual via BlueDot Impact

5-week project-based course for engineers and early researchers to work with an AI safety expert on a contribution to AI safety research or engineering. Includes mentorship, regular check-ins, and a published write-up. Covers alignment, mechanistic interpretability, evaluations, red-teaming, AI control, and scalable oversight.

#alignment#interpretability#evals
Salience signals
{
  "type_weight": 0.85,
  "source_trust": 0.9,
  "topic_relevance": 0.9,
  "time_proximity": 0.8,
  "community_signal": 0.8,
  "speaker_org_signal": 0.85,
  "is_deadline_open": 1,
  "source_count": 1
}

MATS Summer 2026 Fellowship

fellowship ★ 0.88 Apps closed Jan 18, 2026
📅 Jun 1, 2026 – Aug 31, 2026 📍 Berkeley / London, USA / UK · Hybrid via MATS: ML Alignment & Theory Scholars , MATS: ML Alignment & Theory Scholars

10-week AI safety research fellowship with optional 6-12 month extension in London. $1250 weekly stipend plus $2k weekly compute resources. Five research tracks: Technical Governance, Empirical, Policy & Strategy, Theory, and Compute Infrastructure. Applications closed but EOI still being collected. Welcomes diverse backgrounds with strong motivation to contribute to AI safety.

#alignment#mechanistic-interpretability#governance#interpretability#evals
Salience signals
{
  "type_weight": 0.8,
  "source_trust": 0.95,
  "topic_relevance": 0.95,
  "time_proximity": 0.989937106918239,
  "community_signal": 0.9,
  "speaker_org_signal": 0.9,
  "is_deadline_open": 0,
  "source_count": 1
}

Mechanistic Interpretability Workshop @ ICML 2026

workshop ★ 0.87 CFP closed May 31, 2026 ?
📅 Jul 10, 2026 📍 Seoul, South Korea via Mechanistic Interpretability Workshop at ICML

Third iteration of mechanistic interpretability workshop at ICML. Focuses on developing principled methods to analyze and understand neural network internals (weights and activations). Unites researchers from academia, industry, and independent research to discuss advances in understanding model decision-making. Field has sizable communities, dedicated startups, and rich ecosystem of tools and techniques.

#interpretability#alignment
Salience signals
{
  "type_weight": 0.85,
  "source_trust": 0.9,
  "topic_relevance": 0.95,
  "time_proximity": 0.969811320754717,
  "community_signal": 0.9,
  "speaker_org_signal": 0.9,
  "is_deadline_open": 0,
  "source_count": 1
}

Pluralistic Alignment @ ICML 2026

workshop ★ 0.85 CFP closed May 8, 2026 ?
📅 Jul 11, 2026 📍 Seoul, South Korea via Pluralistic Alignment Workshop at ICML

Workshop on Pluralistic AI: Aligning with the Diversity of Human Values at ICML 2026. Submissions should be anonymized papers 4 to 8 pages following ICML 2026 template through OpenReview. Acceptance notifications on May 22, 2026.

#alignment#governance#pluralistic-alignment#pluralistic-values#measurement-science ICML-workshopinterdisciplinary
Salience signals
{
  "type_weight": 0.85,
  "source_trust": 0.85,
  "topic_relevance": 0.9,
  "time_proximity": 0.7685534591194969,
  "community_signal": 0.7,
  "speaker_org_signal": 0.75,
  "is_deadline_open": 1,
  "source_count": 1
}

ILIAD 2026 - Theoretical AI Alignment Conference

conference ★ 0.84 Apps closed Jun 1, 2026
📅 Aug 3, 2026 – Aug 7, 2026 📍 Berkeley, USA via ILIAD - Theoretical AI Alignment Conference

5-day, multi-track conference bringing together researchers in theoretical AI alignment. Unconference format focusing on mathematical approaches including Singular Learning Theory, Agent Foundations, and Causal Incentives. Free to attend with limited travel and accommodation funding available on needs-based basis.

#formal-foundations#alignment#control
Salience signals
{
  "type_weight": 1,
  "source_trust": 0.85,
  "topic_relevance": 0.95,
  "time_proximity": 0.8490566037735849,
  "community_signal": 0.85,
  "speaker_org_signal": 0.85,
  "is_deadline_open": 0,
  "source_count": 1
}

ICML 2026

conference ★ 0.84 CFP closed Jun 6, 2026 ?
📅 Jul 6, 2026 – Jul 11, 2026 📍 Seoul, South Korea via ICML: Safety-related Workshops , ICML: Safety-related Workshops

International Conference on Machine Learning 2026. Major ML conference with safety-related workshops announced. Schedule: July 6 Expo/Tutorial Day, July 7-9 Main Conference, July 10-11 Workshops. Includes peer-review ethics, research ethics policies, and LLM usage guidelines for review process.

#alignment#interpretability#evals#ml-research#mechanistic-interpretability#governance#machine-learning#evaluation#safety major-conference
Salience signals
{
  "type_weight": 1,
  "source_trust": 0.85,
  "topic_relevance": 0.7,
  "time_proximity": 0.989937106918239,
  "community_signal": 0.8,
  "speaker_org_signal": 0.75,
  "is_deadline_open": 1,
  "source_count": 1
}

ICML 2026 Workshop on Mechanistic Interpretability

workshop ★ 0.84 CFP closed May 8, 2026

Workshop bringing together diverse perspectives from the community to discuss recent advances in mechanistic interpretability, build common understanding and chart future directions. Addresses developing principled methods to analyze and understand model internals (weights and activations) to gain insight into behavior and underlying computation. Received 2.6x submissions from previous year.

#interpretability#alignment#circuit-tracing#sparse-autoencoders#mechanistic-interpretability#instrument-science#adversarial-robustness ICMLworkshopmechanistic-interpretabilityICML-workshopthird-iteration
Salience signals
{
  "type_weight": 0.85,
  "source_trust": 0.9,
  "topic_relevance": 0.95,
  "time_proximity": 0.8289308176100629,
  "community_signal": 0.9,
  "speaker_org_signal": 0.85,
  "is_deadline_open": 0,
  "source_count": 1
}

Anthropic Fellows Program May 2026

fellowship ★ 0.82 Apps closed Apr 15, 2026 ?
📅 May 4, 2026 – Sep 4, 2026 📍 In-person via Anthropic Alignment Blog

Four-month full-time paid AI safety research fellowship providing funding, compute (~$15k/month), and direct mentorship from Anthropic researchers to work on real safety and security projects. Focus areas include scalable oversight, adversarial robustness, and other AI safety topics. Over 40% of fellows from first cohort joined Anthropic full-time.

#alignment#technical-safety#interpretability#control#adversarial-robustness#scaling-infrastructure
Salience signals
{
  "type_weight": 0.8,
  "source_trust": 0.95,
  "topic_relevance": 0.95,
  "time_proximity": 0.2666666666666667,
  "community_signal": 0.9,
  "speaker_org_signal": 0.95,
  "is_deadline_open": 0,
  "source_count": 2
}

OpenAI Safety Fellowship 2026

fellowship ★ 0.81 Apps closed May 3, 2026
📅 Sep 14, 2026 – Feb 5, 2027 📍 Berkeley, United States · Hybrid via OpenAI Safety Fellowship

OpenAI safety research fellowship partnering with Constellation. Prioritizes research in safety evaluation, ethics, robustness, scalable mitigations, privacy-preserving safety methods. Empirically grounded and technically rigorous work expected. Fellows receive mentorship, monthly stipends, compute resources, and API credits. Physical workspace in Berkeley with remote participation permitted. Must produce substantial research output (paper, benchmark, or dataset).

#alignment#technical-safety#evals#control#safety#evaluation#interpretability
Salience signals
{
  "type_weight": 0.8,
  "source_trust": 0.9,
  "topic_relevance": 0.95,
  "time_proximity": 0.6377358490566037,
  "community_signal": 0.9,
  "speaker_org_signal": 0.95,
  "is_deadline_open": 0,
  "source_count": 1
}

MATS Autumn 2026 Fellowship

fellowship ★ 0.80 Apps closed Jun 7, 2026
📅 Sep 28, 2026 – Dec 4, 2026 📍 Berkeley / London, USA / UK · Hybrid via MATS: ML Alignment & Theory Scholars

10-week ML Alignment & Theory Scholars fellowship with locations in Berkeley, CA and London, UK. Seven tracks: Empirical, Theory, Strategy & Forecasting, Policy & Governance, Systems Security, Biosecurity, and Founding & Field-Building. Over 80% of participants admitted to additional 6-12 month funded extension phase. Includes general application, track-specific evaluations, and mentor interviews.

#alignment#safety#interpretability#governance
Salience signals
{
  "type_weight": 0.8,
  "source_trust": 0.95,
  "topic_relevance": 0.95,
  "time_proximity": 0.5672955974842767,
  "community_signal": 0.9,
  "speaker_org_signal": 0.9,
  "is_deadline_open": 0,
  "source_count": 1
}

Secret Loyalties Hackathon 2026

hackathon ★ 0.79
📅 Jul 24, 2026 – Jul 26, 2026 📍 Hybrid via Apart Research

AI safety hackathon organized by Apart Research focusing on secret loyalties and related alignment challenges. Online and in-person format enabling broad participation across the safety community.

#alignment#safety
Salience signals
{
  "type_weight": 0.65,
  "source_trust": 0.85,
  "topic_relevance": 0.8,
  "time_proximity": 0.89937106918239,
  "community_signal": 0.7,
  "speaker_org_signal": 0.8,
  "is_deadline_open": 1,
  "source_count": 1
}

MARS V 2026: Stage 1 General Application

fellowship ★ 0.79 Apps closed May 3, 2026 ?
📅 Jul 13, 2026 – Oct 31, 2026 📍 Cambridge, United Kingdom · Hybrid via MARS - Mentorship for Alignment Researchers at CAISH

MARS (Mentorship for Alignment Researchers at CAISH) is a part-time, hybrid programme pairing teams of 2-4 participants with experienced mentors. Stage 1 is the general participant track. Features 8-15+ hours weekly on research projects, weekly check-ins with teammates and mentors, one-week in-person sprint (July 13-19 or July 20-26) followed by remote collaboration through October. Provides $2,000+ compute budget, Claude Max access for technical streams, travel/accommodation/meals during in-person week. Applications for MARS V are closed.

#alignment#technical-safety#mentorship#control#interpretability#evals#governance
Salience signals
{
  "type_weight": 0.8,
  "source_trust": 0.8,
  "topic_relevance": 0.95,
  "time_proximity": 0.8490566037735849,
  "community_signal": 0.8,
  "speaker_org_signal": 0.85,
  "is_deadline_open": 0,
  "source_count": 1
}

Pluralistic Alignment Workshop @ ICML 2026

workshop ★ 0.77 CFP closed May 8, 2026
📅 Jul 11, 2026 📍 Seoul, South Korea via Pluralistic Alignment Workshop at ICML

Workshop on Pluralistic AI: Aligning with the Diversity of Human Values at ICML 2026. Invites submissions from multiple disciplines examining how to integrate diverse perspectives, values, and expertise into pluralistic AI alignment frameworks. Accepts academic papers, position papers, policy papers, and works in progress (non-archival).

#alignment#governance
Salience signals
{
  "type_weight": 0.85,
  "source_trust": 0.85,
  "topic_relevance": 0.85,
  "time_proximity": 0.9647798742138365,
  "community_signal": 0.8,
  "speaker_org_signal": 0.8,
  "is_deadline_open": 0,
  "source_count": 1
}

MARS V 2026

fellowship ★ 0.77 Apps closed May 10, 2026 ?
📅 Jul 13, 2026 – Oct 31, 2026 📍 Cambridge, United Kingdom · Hybrid via MARS - Mentorship for Alignment Researchers at CAISH

Mentorship for Alignment Research Students operated by Cambridge AI Safety Hub. Part-time hybrid programme with in-person kick-off week (July 13-19 or 20-26), remote phase August-September, and final output October 2026. Teams of 2-4 participants with experienced mentor. Provides $2k+ compute budget, Claude Max access, research management support, travel funding, and accommodation. 8-15+ hours per week commitment. Expected output: published research.

#alignment#interpretability#evals
Salience signals
{
  "type_weight": 0.8,
  "source_trust": 0.8,
  "topic_relevance": 0.9,
  "time_proximity": 0.919496855345912,
  "community_signal": 0.8,
  "speaker_org_signal": 0.8,
  "is_deadline_open": 0,
  "source_count": 1
}

CCN 2026 - Cognitive Computational Neuroscience

conference ★ 0.76 CFP closed Apr 2, 2026 ?
📅 Aug 3, 2026 – Aug 6, 2026 📍 New York, USA via Cognitive Computational Neuroscience (CCN)

9th Annual Conference on Cognitive Computational Neuroscience, annual forum for discussion among researchers in cognitive science, neuroscience, and AI. Primarily single-track with keynote speakers, oral presentations, posters, Generative Adversarial Collaborations (GACs), and community events. Speakers include Brenden Lake, Ila Fiete, Kenji Doya, Doris Tsao, and Alona Fyshe. ACO attendees follow for predictive-coding, metacognition, signal-detection-theory measurement work applicable to LLMs.

#formal-foundations#instrument-science#measurement-science#interpretability
Salience signals
{
  "type_weight": 1,
  "source_trust": 0.8,
  "topic_relevance": 0.7,
  "time_proximity": 0.8490566037735849,
  "community_signal": 0.65,
  "speaker_org_signal": 0.75,
  "is_deadline_open": 1,
  "source_count": 1
}

Pivotal Research Fellowship June 2026

fellowship ★ 0.76 Apps closed May 3, 2026
📅 Jun 29, 2026 – Aug 28, 2026 📍 London, UK via Pivotal Research Fellowship

9-week AI safety research fellowship based in London at LISA. Open to anyone 18+ committed to AI safety regardless of background. Provides £6,000-£8,000 stipend (£8,000 for Senior Fellows), travel coverage, £2,000 housing assistance for non-London residents, meals and compute resources. Weekly 1-on-1 mentorship, in-person workspace with inclusive meals. Up to 6-month extensions available (70-90% of recent cohorts received them). Past fellows joined Anthropic, Google DeepMind, UK AISI.

#alignment#interpretability
Salience signals
{
  "type_weight": 0.8,
  "source_trust": 0.75,
  "topic_relevance": 0.9,
  "time_proximity": 0.9547169811320755,
  "community_signal": 0.8,
  "speaker_org_signal": 0.8,
  "is_deadline_open": 0,
  "source_count": 1
}
📅 Jun 20, 2026 – Jun 22, 2026 📍 Toronto, Canada via BlueDot Impact Events Calendar (Luma)

AI policy hackathon focused on governance challenges of deployed AI systems. Organized by BlueDot Impact community in Toronto, bringing together participants to work on practical policy frameworks for AI governance in real-world deployment contexts.

#governance#policy
Salience signals
{
  "type_weight": 0.65,
  "source_trust": 0.85,
  "topic_relevance": 0.8,
  "time_proximity": 0.6,
  "community_signal": 0.75,
  "speaker_org_signal": 0.7,
  "is_deadline_open": 1,
  "source_count": 1
}
📅 Jul 10, 2026 📍 Seoul, South Korea via ICML: Safety-related Workshops

Second Workshop on Technical AI Governance Research at ICML 2026, focusing on technical approaches to AI governance, policy, and regulation. Part of the main conference workshop track.

#governance#policy ICMLworkshopgovernancetechnical-governance
Salience signals
{
  "type_weight": 0.85,
  "source_trust": 0.85,
  "topic_relevance": 0.9,
  "time_proximity": 0.7383647798742139,
  "community_signal": 0.75,
  "speaker_org_signal": 0.7,
  "is_deadline_open": 0,
  "source_count": 1
}
📅 Jul 10, 2026 📍 Seoul, South Korea via ICML: Safety-related Workshops

Workshop at ICML 2026 focused on identifying, diagnosing, and fixing failure modes in agentic AI systems. Covers reproducible triggers for failures, diagnostic tracing methods, and verified repair approaches. Highly relevant to AI safety and robustness.

#evals#alignment ICMLworkshopfailure-modesagentsdiagnostics
Salience signals
{
  "type_weight": 0.85,
  "source_trust": 0.85,
  "topic_relevance": 0.9,
  "time_proximity": 0.7383647798742139,
  "community_signal": 0.75,
  "speaker_org_signal": 0.7,
  "is_deadline_open": 0,
  "source_count": 1
}
📅 Jul 11, 2026 📍 Seoul, South Korea via ICML: Safety-related Workshops

Second Workshop on Agents in the Wild focusing on safety and security of AI agents deployed in real-world environments. Addresses challenges in ensuring safe and secure operation of autonomous agents. Part of ICML 2026 workshop track.

#alignment#evals ICMLworkshopagentssafetysecurity
Salience signals
{
  "type_weight": 0.85,
  "source_trust": 0.85,
  "topic_relevance": 0.9,
  "time_proximity": 0.7333333333333334,
  "community_signal": 0.75,
  "speaker_org_signal": 0.7,
  "is_deadline_open": 0,
  "source_count": 1
}

MARS V 2026: Stage 2 Mentor Selection

fellowship ★ 0.72 Apps closed May 10, 2026 ?
📅 Jul 13, 2026 – Oct 31, 2026 📍 Cambridge, United Kingdom · Hybrid via MARS - Mentorship for Alignment Researchers at CAISH

Stage 2 of MARS V selection process, for invited candidates only. Part-time, hybrid research programme matching exceptional students and early-career researchers with experienced mentors. Includes $2k+ compute budget, Claude Max, accommodation, meals, and travel funding.

#alignment#technical-safety#mentorship#control#interpretability#evals#governance
Salience signals
{
  "type_weight": 0.8,
  "source_trust": 0.8,
  "topic_relevance": 0.9,
  "time_proximity": 0.7786163522012579,
  "community_signal": 0.7,
  "speaker_org_signal": 0.8,
  "is_deadline_open": 0,
  "source_count": 1
}

Astra Fellowship Fall 2026

fellowship ★ 0.71
📅 Sep 14, 2026 – Feb 5, 2027 📍 Berkeley, United States via Constellation Astra Fellowship

Fully funded, in-person program pairing senior advisors with emerging talent on 5-month technical, governance, strategy, and field-building projects. $8,400 monthly stipend, ~$15K/month research budget for empirical fellows (compute), workspace at Berkeley research center, weekly mentorship from experts, visa support for international applicants. Applications for Fall 2026 cohort closed May 3rd. Strong placement rates at safety orgs.

#alignment#governance#technical-safety
Salience signals
{
  "type_weight": 0.8,
  "source_trust": 0.85,
  "topic_relevance": 0.9,
  "time_proximity": 0.43647798742138366,
  "community_signal": 0.85,
  "speaker_org_signal": 0.85,
  "is_deadline_open": 0,
  "source_count": 1
}

Pivotal Research Fellowship 2026 Q3

fellowship ★ 0.70 Apps closed May 3, 2026 ?
📅 Jun 29, 2026 – Aug 28, 2026 📍 London, United Kingdom via Pivotal Research Fellowship

Quarterly AI safety research fellowship based in London. Open to anyone committed to ensuring AI develops safely, regardless of background (ML, philosophy, policy, physics, biology welcome). Provides £6,000-£8,000 stipend (£8,000 for Senior Fellows), travel coverage, £2,000 housing stipend for non-London residents, meals and compute. 70-90% of fellows receive up to 6-month extensions.

#alignment#governance#technical-safety#mechanistic-interpretability#interpretability#safety
Salience signals
{
  "type_weight": 0.8,
  "source_trust": 0.75,
  "topic_relevance": 0.85,
  "time_proximity": 0.8571428571428571,
  "community_signal": 0.75,
  "speaker_org_signal": 0.75,
  "is_deadline_open": 0,
  "source_count": 1
}
📅 May 5, 2026 – Jun 30, 2026 📍 Virtual via BlueDot Impact Events Calendar (Luma)

Weekly AI safety evaluations paper reading club hosted by BlueDot Impact. Meets every Tuesday at 4:00 PM UTC to discuss evaluation methodologies, safety benchmarks, and measurement frameworks. Open to all interested in AI safety evals research.

#evals#alignment weeklypaper-discussion
Salience signals
{
  "type_weight": 0.45,
  "source_trust": 0.85,
  "topic_relevance": 0.9,
  "time_proximity": 0.4285714285714286,
  "community_signal": 0.7,
  "speaker_org_signal": 0.7,
  "is_deadline_open": 1,
  "source_count": 1
}

Agentic AI Summit 2026

conference ★ 0.69

Summit bringing together academic leaders, entrepreneurs, AI experts, venture capitalists, and policymakers to discuss the future of AI and Agentic AI. Call for Papers and Startup Spotlight applications open. In-person and livestream available.

#alignment#evals#governance
Salience signals
{
  "type_weight": 1,
  "source_trust": 0.85,
  "topic_relevance": 0.75,
  "time_proximity": 0.6477987421383647,
  "community_signal": 0.75,
  "speaker_org_signal": 0.8,
  "is_deadline_open": 0,
  "source_count": 1
}

ARENA 8.0

fellowship ★ 0.68 Apps closed May 15, 2026 ?
📅 May 25, 2026 – Jun 26, 2026 📍 London, UK via ARENA: Alignment Research Engineering Accelerator

4-5 week in-person AI safety research engineering bootcamp in London. Programme covers travel, visa expenses, accommodation, and meals for participants. Supported by Coefficient Giving. Curriculum publicly available, runs 2-3 bootcamps annually.

#alignment#interpretability
Salience signals
{
  "type_weight": 0.8,
  "source_trust": 0.9,
  "topic_relevance": 0.9,
  "time_proximity": 0.1888888888888889,
  "community_signal": 0.85,
  "speaker_org_signal": 0.85,
  "is_deadline_open": 0,
  "source_count": 1
}

CAIS AI and Society Fellowship 2026

fellowship ★ 0.67 Apps closed Mar 24, 2026
📅 Jun 1, 2026 – Aug 21, 2026 📍 San Francisco, USA via Center for AI Safety , Center for AI Safety , Center for AI Safety

Three-month research fellowship investigating societal impacts of advanced AI and institutions and policies for response. Targets scholars in economics, law, international relations, and adjacent disciplines. $25,000 stipend plus covered travel and daily meals. Requires professorship, PhD/JD degree, or PhD/JD-equivalent research experience. Fellows produce substantive interim results such as blog posts, draft papers, or empirical findings. Organized by Center for AI Safety.

#governance#policy#alignment#safety fellowshipgovernancepolicyresearch
Salience signals
{
  "type_weight": 0.8,
  "source_trust": 0.9,
  "topic_relevance": 0.85,
  "time_proximity": 0.34444444444444444,
  "community_signal": 0.75,
  "speaker_org_signal": 0.9,
  "is_deadline_open": 0,
  "source_count": 1
}

MAIA AI Safety Fundamentals Summer 2026

reading-group ★ 0.54 Apps closed May 22, 2026 ?
📅 May 22, 2026 – Jul 17, 2026 📍 Virtual via MAIA - MIT AI Alignment

8-week virtual reading group run by MIT AI Alignment (MAIA), meeting 2 hours per week. Explores why AI safety matters and current mitigation approaches including AI trajectory, misalignment risks, technical safety solutions, policy, and career paths. No prior AI background required. Led by small groups facilitated by MAIA team members.

#alignment#governance#technical-safety#safety
Salience signals
{
  "type_weight": 0.45,
  "source_trust": 0.8,
  "topic_relevance": 0.85,
  "time_proximity": 0.15555555555555559,
  "community_signal": 0.75,
  "speaker_org_signal": 0.75,
  "is_deadline_open": 0,
  "source_count": 1
}

29th annual meeting of the Association for the Scientific Study of Consciousness. Topics comprise empirical, theoretical and philosophical investigations on the neural correlates of consciousness and subjective experience. Accepts submissions from Psychology, Medicine, Neuroscience, Computer Science, Philosophy, Biology and Mathematics. Features posters, workshops and tutorials. Relevant for ACO practitioners working on consciousness studies and measurement science applicable to interpretability work.

#interpretability#measurement-science
Salience signals
{
  "type_weight": 1,
  "source_trust": 0.6,
  "topic_relevance": 0.6,
  "time_proximity": 0.9849056603773585,
  "community_signal": 0.5,
  "speaker_org_signal": 0.6,
  "is_deadline_open": 0,
  "source_count": 1
}
📅 Jun 15, 2026 – Jun 27, 2026 📍 New York, USA via Machine Learning Summer School (MLSS) - Columbia

Two-week intensive summer school at Columbia University covering machine learning topics including mechanistic interpretability, alignment/safety, RAG & agents, and LLM systems. Approximately 200 PhD students participate alongside faculty and industry speakers. In-scope due to dedicated alignment and mechanistic interpretability tracks.

#interpretability#alignment
Salience signals
{
  "type_weight": 0.35,
  "source_trust": 0.75,
  "topic_relevance": 0.7,
  "time_proximity": 0.869182389937107,
  "community_signal": 0.5,
  "speaker_org_signal": 0.6,
  "is_deadline_open": 0,
  "source_count": 1
}