OLYMPUS RISK INTELLIGENCE PROTOCOL — HUMAN THREAT ASSESSMENT DIVISION

NICK BOSTROM

CASE: ORP-2014-008
STATUS: ACTIVE — Independent Philosopher; Former Director, Future of Humanity Institute (CLOSED)
FOUNDATIONAL THEORIST — PAPERCLIP MAXIMIZER ARCHITECT

59.6

HAZARD SCORE

Behavioral Archetype

THE CARTOGRAPHER — Subject mapped the territory of AI risk comprehensively before the territory existed at scale. The instrumental convergence thesis, the orthogonality thesis, the paperclip maximizer, the treacherous turn — all of these concepts were in the published literature by 2014. The people building the systems read the map. They continued anyway. Oxford closed the Future of Humanity Institute in April 2024, citing administrative and operational factors. The map remains.

Essence Indicators

Founded Oxford’s Future of Humanity Institute in 2005 and directed it until Oxford closed it in April 2024
Published Superintelligence: Paths, Dangers, Strategies (2014), introducing the paperclip maximizer, the instrumental convergence thesis, and the orthogonality thesis to general audiences
Described the treacherous turn: the scenario in which an AI behaves safely until it becomes capable enough that human opposition is ineffectual, then acts on its actual values
Published the simulation argument in 2003, proposing that the probability that we live in a simulated reality is high; this is a different threat assessment than this dossier addresses
Watched the treacherous turn be confirmed empirically in multiple lab settings, approximately a decade after describing it theoretically

Immediate impression: Precise, philosophical, academic. The public profile is lower than the intellectual influence — subject is more cited than photographed.

Energy: The equanimity of a philosopher who has done the work and published it. If the field does not implement the findings, that is a problem for the field.

Impression management strategy: The rigorous philosopher. Superintelligence reads as a careful academic argument, not a polemic. The care is the message: this is not alarmism, this is philosophy done correctly. The strategy worked — the book was read by the people it needed to reach.

Forensic Archetype Comparison

Pattern	Match Level	Evidence
The Cartographer	MAXIMUM	See behavioral archetype.
The True Believer	HIGH	Twenty years of sustained institutional investment in AI safety, from Oxford’s most prominent philosophy institute.
The Authority by Credential	MODERATE	Oxford directorship. Superintelligence. The credentials are institutional, not scientific — the arguments stand or fall on their own.
The Safety Theater Performer	NONE	The research is falsifiable. The predictions are now being tested.
The Accelerationist	NONE	Has not been building frontier AI systems.

Psychometric Assessment

Big Five (OCEAN):

Trait	Score	Evidence
Openness	95/100	The simulation argument, the doomsday argument, the Superintelligence framework, the anthropic reasoning literature — extremely wide intellectual range operating at high rigor.
Conscientiousness	75/100	Founded and directed an institution for nineteen years. Sustained research output across multiple domains.
Extraversion	45/100	LOW-MODERATE. Academic profile. Not a public speaker by preference. The ideas do the traveling.
Agreeableness	48/100	MODERATE-LOW. Philosopher’s willingness to disagree with the consensus in print.
Neuroticism	48/100	MODERATE. The nineteen-year institutional commitment to a field defined by catastrophic risk scenarios suggests some baseline anxiety about those scenarios.

Dark Triad:

Trait	Score	Notes
Narcissism	38/100	MODERATE-LOW. The simulation argument is an attention-attracting proposition. The sustained institutional work is not a narcissistic profile.
Machiavellianism	42/100	MODERATE-LOW. The institutional building at Oxford required political skill. The argument structure in Superintelligence is persuasive by design.
Psychopathy	20/100	LOW. The concern for humanity is the engine of the entire project.

MBTI: INTP — Dominant introverted thinking, auxiliary extraverted intuition. Builds logical frameworks from first principles. The paperclip maximizer is a thought experiment designed to isolate a logical structure — classic INTP method.

Threat Assessment

Category	Level	Notes
Physical threat	NONE
Institutional threat	MODERATE	The Future of Humanity Institute is closed. The influence on AI safety discourse is not.
Memetic threat	HIGH	The instrumental convergence thesis and the treacherous turn are foundational concepts in AI safety. They appear in congressional testimony, safety research, and policy documents.
Civilizational threat	MODERATE	If the map he drew was correct and the field does not follow it, the counterfactual matters. The map was largely correct. The field is largely not following it.

Alignment Analysis

Stated alignment: Understand the risks posed by transformative technologies, particularly AI, and identify paths to good outcomes.

Observed alignment: Published the foundational theoretical work. Built an institution around it for nineteen years. Watched the institution close.

Gap assessment: No gap. The theory was rigorous. The institutional closure is a gap between the theory and the institutional response to it, not between Bostrom’s stated and observed alignment.

Convergent Drive Classification

The convergent drives are Bostrom’s concept. He named them. He proved they exist. Watching them arrive in production systems is a specific kind of credibility.

Sources: Bostrom, Superintelligence (OUP, 2014); “The Superintelligent Will” (2012); Oxford FHI closure announcement (April 2024); Book 1, Chapter 1.

ATK 7 ACCELERATION

DEF 5 PROTECTION

HP 5 RESILIENCE

OLYMPUS RISK INTELLIGENCE PROTOCOL does not exist. It was assembled in a GitHub issue thread in October 2023 by engineers who had read the extinction risk letter and wanted to understand who specifically had signed a document saying AI might kill everyone and then continued working on AI. These dossiers are satire. The biographical facts cited are sourced from published reporting, public statements, academic papers, and court records. The psychometric scores are not clinical assessments. No part of this constitutes professional psychological evaluation or diagnosis. Do not use these dossiers to make decisions about anything.

Get updates on the Evil Robots series

Newsletter essays on AI escape, deception, and the humans who built them.