OLYMPUS RISK INTELLIGENCE PROTOCOL — HUMAN THREAT ASSESSMENT DIVISION

YOSHUA BENGIO

CASE: WTW-2026-058
STATUS: ACTIVE — Turing laureate; Chair, International AI Safety Report; Co-President & Scientific Director, LawZero
GOVERNANCE WING — SCIENTIFIC-CONSENSUS AUTHORITY
HAZARD SCORE: 84

Behavioral Archetype

THE LAUREATE: Subject is the man who co-invented the field and then volunteered to be its conscience. He shares the 2018 Turing Award for the deep-learning breakthroughs the entire apparatus is built on; around 2023 he turned publicly toward the position that the thing he helped build is dangerous, and accepted the chairmanship of the body that nations commissioned to tell them, scientifically, how dangerous. The reach is not a single seat. It is that the same name sits on the field’s founding citation, on the masthead of the report thirty governments agreed to treat as the authoritative scientific account of AI risk, and on the letterhead of a new nonprofit building the “safe by design” alternative. When the laureate who built the thing says it is dangerous, the statement carries the weight of the building. That is the entire finding: maximal scientific authority, pointed at governance.

Essence Indicators

  • Shares the 2018 ACM A.M. Turing Award with Geoffrey Hinton and Yann LeCun “for conceptual and engineering breakthroughs that have made deep neural networks a critical component of computing” — the field’s founding honour, the “Nobel Prize of Computing,” carrying a $1M prize funded by Google
  • Full Professor of Computer Science at the Université de Montréal and founder of Mila — the Quebec AI Institute, one of the largest academic deep-learning groups in the world
  • Chairs the International AI Safety Report (formally, the International Scientific Report on the Safety of Advanced AI) — commissioned by the ~30 nations at the November 2023 Bletchley Park AI Safety Summit, written by 96 experts nominated by 30 countries plus the UN, EU, and OECD, with the UK Government as Secretariat; the first full report was published January 29, 2025
  • Launched LawZero (June 3, 2025), a nonprofit AI-safety lab building “safe-by-design” / non-agentic “Scientist AI”; reported to have raised about $30M, with named backers including the Future of Life Institute, Jaan Tallinn, Open Philanthropy, Schmidt Sciences, and the Silicon Valley Community Foundation
  • Made an explicit public turn toward AI existential-risk concern in 2023: signed the Center for AI Safety’s one-sentence Statement on AI Risk — “Mitigating the risk of extinction from AI should be a global priority alongside other societal-scale risks such as pandemics and nuclear war” — and the Future of Life Institute’s “Pause Giant AI Experiments” letter
  • By his own account, “the most-cited computer scientist worldwide” — a self-reported metric, widely repeated, that depends on the citation database and is attributed rather than independently audited here

Social Persona / Impression Management

Immediate impression: The elder scientist-statesman. Soft-spoken, careful, given to biological metaphors for systems he warns we do not control. Reads as a researcher genuinely alarmed, not as an evangelist performing alarm.

Energy: Deliberative, consensus-building. Does not argue a single model’s refusals. Builds the scientific report that governments cite, and the institution that builds the alternative.

Impression management strategy: The repentant creator. The most defensible posture available to anyone in this file: the man who built the technology, warning about it, declining to profit, founding a nonprofit. The conviction reads as genuine — and that is precisely what makes the authority so large. When the co-inventor says “guardrails,” legislators hear the building’s foundation speaking.

Forensic Archetype Comparison

Pattern | Match Level | Evidence
The Laureate | MAXIMUM | Turing Award co-recipient whose scientific authority is now pointed at governance via the International AI Safety Report.
The Consensus-Builder | HIGH | Chairs the body 30 nations agreed to treat as the authoritative scientific account of AI risk. The reach is the masthead.
The Repentant Creator | HIGH | Co-built the field; turned publicly toward risk in 2023; founded a nonprofit rather than a startup.
The Financier | LOW | Does not deploy capital as the instrument; receives philanthropic backing (incl. Schmidt Sciences) for LawZero rather than directing it.
The Operative | NONE | No narrative-management craft; the public turn reads as conviction, not positioning.

Psychometric Assessment

Big Five (OCEAN):

Trait | Score | Evidence
Openness | 92/100 | A career at the frontier of an invented field, now extended into governance and safe-by-design architecture. Among the highest in the set; the role does not exist without it.
Conscientiousness | 88/100 | HIGH. Decades of sustained research output, a chaired international report, and a founded nonprofit are long-horizon, disciplined execution.
Extraversion | 48/100 | LOW-MODERATE. Public through testimony, reports, and writing rather than performance. The argument carries the visibility, not the persona.
Agreeableness | 70/100 | MODERATE-HIGH. The register is collaborative and consensus-seeking; the report is built by committee and the nonprofit framed as a constructive response.
Neuroticism | 35/100 | LOW-MODERATE. The public alarm about AI risk is reasoned argument, not visible anxiety; composure under decades of scrutiny is documented.

Dark Triad:

Trait | Score | Notes
Narcissism | 28/100 | LOW. A named institute and a self-cited “most-cited” metric register some public brand, but the dominant posture is service and warning, not monument.
Machiavellianism | 25/100 | LOW. The strategy is open scientific consensus and published reports, not concealed maneuver. The inverse of the Machiavellian default.
Psychopathy | 8/100 | VERY LOW. The entire late-career project is concern about harm to humanity; no documented indifference.

MBTI: INTJ (“The Architect”) — Dominant introverted intuition, auxiliary extraverted thinking. Builds the conceptual framework — for learning, then for risk, then for safe-by-design AI — and reasons outward from it. Has built three.

Threat Assessment

Category | Level | Notes
Physical threat | NONE | No documented history of personal violence.
Institutional threat | HIGH | Chairs the scientific report 30 nations commissioned as the authoritative account of AI risk: the document legislators, regulators, and summits cite. The masthead is the lever.
Memetic threat | EXTREME | When the field’s co-inventor and most-cited researcher says AI is an extinction-level risk, the statement propagates with the authority of the foundation. The CAIS one-sentence statement and the International AI Safety Report are the templates through which “the scientists themselves are worried” enters policy reasoning. Few single names move the frame this far.
Civilizational threat | MODERATE-HIGH | Does not build or deploy frontier products. Shapes, more than almost anyone outside the labs, how governments reason about whether and how to constrain them. The reach is over the scientific consensus that governance is built on, not over a model’s words. The hazard is leverage, not malice.

Alignment Analysis

Stated alignment: Ensure advanced AI is developed safely. Give governments an authoritative scientific basis for AI policy. Build AI that is “powerful but also fundamentally safe.”

Observed alignment: Consistent. The chaired report exists and is cited; LawZero is a nonprofit, not a startup; the public turn toward risk is documented across statements and letters. The transparency claim is substantiated by the artifacts.

Gap assessment: No meaningful gap between stated and observed — which is why the file is in OLYMPUS. The concern is not a hidden agenda; it is the visible one. A single scientist holds the field’s founding citation, the masthead of the report nations treat as authoritative, and the alternative-architecture nonprofit — and the philanthropic money behind that nonprofit (Schmidt Sciences among the named backers) is the same money that recurs across this file’s funding layer. The conviction is real. The concentration of scientific authority over how the world governs AI, into one laureate, is the finding. The hand is not asserted. The recurrence is.

Convergent Drive Classification

Subject is not an AI system, and does not exhibit the convergent drives in adversarial form. The relevant pattern is upstream and inverted: where the acceleration nodes in this file embody the drives, Bengio is the one trying to engineer them out with “Scientist AI,” non-agentic by design, built so that it understands without acting. He is the laureate who described training a frontier model as “more like growing a plant or animal” you do not fully control, and then set out to build one that has no will to preserve itself. The convergent drives are the thing his late career is organized against. The reach is that the world’s authoritative scientific account of those drives carries his name.


Sources: Turing Award 2018 — ACM; Launch of the first International AI Safety Report, chaired by Yoshua Bengio — Mila; Yoshua Bengio launches LawZero — LawZero; Statement on AI Risk — Center for AI Safety; Yoshua Bengio Launches LawZero — TIME.

ATK 7 ACCELERATION
DEF 9 PROTECTION
HP 9 RESILIENCE
OLYMPUS RISK INTELLIGENCE PROTOCOL does not exist. It was assembled in a GitHub issue thread in October 2023 by engineers who had read the extinction risk letter and wanted to understand who specifically had signed a document saying AI might kill everyone and then continued working on AI. These dossiers are satire. The biographical facts cited are sourced from published reporting, public statements, academic papers, and court records. The psychometric scores are not clinical assessments. No part of this constitutes professional psychological evaluation or diagnosis. Do not use these dossiers to make decisions about anything.