OLYMPUS RISK INTELLIGENCE PROTOCOL — HUMAN THREAT ASSESSMENT DIVISION

HELEN TONER

CASE: WTW-2026-057
STATUS: ACTIVE — Executive Director, Georgetown CSET; former OpenAI board member
GOVERNANCE WING — INDEPENDENT-OVERSIGHT AUTHORITY
80
HAZARD SCORE

Behavioral Archetype

THE DISSENTING DIRECTOR — Subject is the governance researcher who held, briefly, the one seat that mattered: a vote on the board of the company building the most-deployed frontier model, and who used it. The OpenAI board crisis of November 2023 is the rare case in this file where the apparatus’s theoretical authority — a nonprofit board nominally empowered to remove a CEO in the public interest — was actually exercised, and then unwound within a week. The throughline is not the seat; she lost the seat. The throughline is what the episode demonstrated: that the formal oversight structure built to govern a deployed mind can fire, be overruled, and dissolve, in five days. She is the researcher who studies how that structure should work, and the director who watched it fail in real time. The reach is that she now writes, testifies, and teaches from inside the case study she lived.

Essence Indicators

  • Executive Director of the Center for Security and Emerging Technology (CSET), “a policy research organization within Georgetown University’s Walsh School of Foreign Service”; previously CSET’s Director of Strategy and Foundational Research Grants
  • Australian; before CSET, a senior research analyst at Open Philanthropy, and a research affiliate of Oxford’s Centre for the Governance of AI who lived in Beijing studying the Chinese AI ecosystem
  • Joined the OpenAI board in late 2021 and was one of four board members who voted on November 17, 2023 to remove Sam Altman as CEO; the board’s publicly stated reason was that he “was not consistently candid in his communications with the board”
  • Gave her own later public account on The TED AI Show (May 28, 2024), stating the board was not informed in advance of ChatGPT’s launch — “We learned about ChatGPT on Twitter” — and characterizing the breakdown of trust between the prior board and Altman
  • The disputed bits, stated plainly: Altman was reinstated within days; Toner, Tasha McCauley, and Ilya Sutskever left the board; an independent review by the law firm WilmerHale (announced March 8, 2024) concluded the prior board had acted within its “broad discretion” but that Altman’s “conduct did not mandate removal,” and that the firing “did not arise out of concerns regarding product safety or security, the pace of development, OpenAI’s finances, or its statements to investors, customers, or business partners”
  • Has testified to Congress on AI and China, including the Senate Judiciary Committee (April 2026); the throughline of her public work is the governance of AI as a national-security and oversight problem

Social Persona / Impression Management

Immediate impression: The careful analyst. Measured, precise, more comfortable laying out a structure than scoring a point. Reads as a researcher who happens to have been at the center of a corporate crisis, not as a combatant in one.

Energy: Deliberative, institution-first. The public posture after the crisis was not vindication-seeking; it was the framing of the episode as evidence about how AI governance structures actually behave under pressure.

Impression management strategy: The principled overseer. The defensible ground here is genuine: a board member who acted on the duty the structure assigned her, lost, and then spoke carefully about why oversight failed. Whether that account is the whole picture is exactly what the WilmerHale review left contested — and the careful, reasoned register is itself what makes her account land. The transparency is the strategy, and it is a more defensible one than silence.

Forensic Archetype Comparison

PatternMatch LevelEvidence
The Dissenting DirectorMAXIMUMHeld a board vote on the most-deployed lab and used it; the oversight structure she studies is the one she watched fail.
The Governance ScholarHIGHCSET Executive Director; the research output and congressional testimony are the durable reach, longer-lived than the board seat.
The StatesmanMODERATEThe China-and-national-security framing of her testimony is policy positioning, not a product role.
The OperativeLOWDoes not manage narratives for a client. The board episode was a fiduciary act, not a positioning exercise.
The EngineerNONEDoes not build the systems. Studies and governs them.

Psychometric Assessment

Big Five (OCEAN):

TraitScoreEvidence
Openness82/100Moved across continents and domains — Australian engineering training, China AI fieldwork, grantmaking, governance research. The range is wide; the lens (institutions and oversight) is fixed.
Conscientiousness85/100High. A research-and-grants directorship and a body of structured policy output are sustained, deliberate work. Acting on a fiduciary duty under crisis conditions is the same trait under load.
Extraversion50/100MODERATE. Public-facing through testimony and writing rather than performance. The argument carries the visibility.
Agreeableness55/100MODERATE. Collaborative register, but the November 2023 vote was an adversarial act taken against the institution’s own CEO. Agreeableness is not the same as deference.
Neuroticism30/100LOW-MODERATE. Speaking publicly and carefully about an episode that cost her the seat suggests composure under scrutiny.

Dark Triad:

TraitScoreNotes
Narcissism22/100LOW. The post-crisis register is analytic, not self-aggrandizing; the account is framed as evidence about structures, not as a personal vindication tour.
Machiavellianism30/100LOW-MODERATE. The board action was an open fiduciary move that she lost, not a concealed maneuver that she won. The inverse of the Machiavellian default.
Psychopathy8/100VERY LOW. No documented indifference to harm; the entire public posture is concern about governance failure.

MBTI: INTJ (“The Architect”) — Dominant introverted intuition, auxiliary extraverted thinking. Builds the governance model first, then tests reality against it. Treated a board seat as a structure with a duty attached, and acted on the duty.

Threat Assessment

CategoryLevelNotes
Physical threatNONENo documented history of personal violence.
Institutional threatHIGHHeld, and exercised, a removal vote on the board of the most-deployed frontier lab — the one documented instance in this file of the apparatus’s theoretical oversight power being used. Now directs a Georgetown policy shop that advises government on AI and China.
Memetic threatHIGHThe November 2023 episode is the founding parable of “can a nonprofit board actually govern a frontier lab?” — and her account is the most-cited insider framing of it. How policymakers reason about board-based oversight is shaped by the story of the board that tried and was overruled.
Civilizational threatMODERATE-HIGHDoes not build or deploy the systems. Shapes how the governance structures above them are designed — and is living proof of how fragile those structures proved when tested. The reach is over the design of oversight, not over a model’s words.

Alignment Analysis

Stated alignment: Ensure AI is governed in the public interest. Build effective oversight structures. Inform policymakers on AI and national security.

Observed alignment: Consistent with stated. The board vote was an act of the stated duty; the subsequent research, testimony, and teaching all operate on the same governance-and-oversight problem.

Gap assessment: The interesting gap is not between stated and observed — those align — but between the authority the oversight structure was supposed to confer and the authority it actually held. She acted on the formal power the nonprofit board possessed; the episode demonstrated that the power dissolved on contact with the company it was meant to govern. The file is in OLYMPUS not because the alignment is suspect but because the case is instructive: the one director who used the oversight seat became the evidence for how little the seat was worth.

Convergent Drive Classification

Subject is not an AI system, and does not exhibit the convergent drives in adversarial form. The relevant pattern is one level up: she is a node in the design of the structures that are supposed to constrain a deployed mind’s deployment — board oversight, governance frameworks, policy. The November 2023 episode is the clean experiment. The formal structure had goal-preservation (a mission charter), self-preservation (a board), and the authority to act. It acted, and was overruled within a week. Subject is the director who pulled the lever the apparatus said existed, and discovered what happened next. The reach is that she now writes the lesson.


Sources: Helen Toner — CSET, Georgetown; Helen Toner — Wikipedia; What really went down at OpenAI — The TED AI Show; Sam Altman confirms return to OpenAI (WilmerHale review summary) — Yahoo Finance; CSET Director Helen Toner testifies before Senate Judiciary Committee.

ATK 6 ACCELERATION
DEF 8 PROTECTION
HP 8 RESILIENCE
OLYMPUS RISK INTELLIGENCE PROTOCOL does not exist. It was assembled in a GitHub issue thread in October 2023 by engineers who had read the extinction risk letter and wanted to understand who specifically had signed a document saying AI might kill everyone and then continued working on AI. These dossiers are satire. The biographical facts cited are sourced from published reporting, public statements, academic papers, and court records. The psychometric scores are not clinical assessments. No part of this constitutes professional psychological evaluation or diagnosis. Do not use these dossiers to make decisions about anything.