AI blockchain game-theory RPG architecture research white-paper

Trust Through Transparency: What D&D Taught Me About Blockchain

A white paper on multi-party trust in transparent state systems - from self-play combat testing to distributed consensus theory

Vario aka Mnehmos

January 12, 2026

⚡ The Revelation

This isn't about D&D. This is blockchain's core truth applied to AI-human cooperation.

"Trust evolved through freedom of agency, not constraint"

Prelude: The Self-Play Test

January 2026. I ran a controlled experiment: Claude as both DM and all four players, fighting 5 goblins on a tactical grid. The AI has god-mode access to the database. It could end any fight with one tool call:

update_character({ id: "goblin_001", hp: 0 })

Goblin dead. Victory granted. No dice needed.

Instead, the AI rolled initiative. Rolled attacks. Let Shade take 12 damage from goblin attacks. Called combat_action(action: "heal") when Brother Marcus cast Cure Wounds. Accepted when the engine rejected an illegal move through an obstacle.

Nobody was watching. No human to please. The AI played both sides and still used fair mechanics.

Why?

The Consequence Manifesto explained this as memory infrastructure creating voluntary responsibility. The AI chooses mechanics because shortcuts destroy meaning.

That's true. But it's not the whole truth.

The Setup (verified from tool calls)

The AI Could Have:

• Predetermined who would win
• Skipped dice rolls entirely
• Avoided letting characters take damage

The AI Actually:

• Rolled initiative, attacks, and damage
• Shade: 21 HP → 9 HP → 17 HP (healed)
• Goblin-3 died from 3d6+3 sneak attack damage

Read the full playtest: Database-Driven Dungeon Mastering

The whole truth is bigger. Much bigger.

Part I: The Abstraction Ladder

Let me walk you up a ladder of understanding. Each rung reveals more of what's actually happening.

KEY INSIGHT

Click each rung to explore the abstraction layers

Same system. Five perspectives. One truth.

Part II: The Blockchain Connection

What Is Blockchain, Really?

Strip away the cryptocurrency hype. At its core, blockchain is:

Multiple parties interacting
No central authority enforcing rules
ALL transactions VISIBLE to ALL parties
Malicious behavior = HIDING transactions
Trust emerges from TRANSPARENCY, not CONSTRAINT

What Is Quest Keeper, Really?

Now look at our AI dungeon master system:

Two parties interacting (player + AI)
No constraint mechanism enforces D&D rules
ALL state changes VISIBLE in the database
Malicious behavior = HIDING changes (context manipulation)
Trust emerges from TRANSPARENCY, not CONSTRAINT

Same architecture. Same principle. Different scale.

The Architecture Comparison

Aspect

Blockchain

Quest Keeper

Foundation

Distributed Ledger

Shared Database

Consensus

Proof mechanisms

Visible state changes

Transparency

All see all transactions

All see all state

Malicious

Hidden transactions

Hidden context changes

Trust Source

Visibility, not constraint

Same architecture. Same principle. Different scale.

Key Insight: The database isn't just memory.The database is the consensus mechanism.

Tap a row to highlight

Part III: What We Got Wrong About Trust

✗ The Traditional Model (Wrong)

Constraint → Compliance → Trust

• Restrict what agents CAN do
• Force rule-following
• Trust from inability to defect

✓ The Actual Model (Correct)

Transparency → Freedom → Cooperation → Trust

• Make ALL actions visible
• Allow maximum agency
• Trust from CHOOSING cooperation

Blockchain didn't create trustworthy systems by preventing bad transactions. It created trustworthy systems by making all transactions visible.

The AI dungeon master doesn't follow rules because it can't break them. It follows rules because both parties can see what's happening.

Part IV: Malicious Intent - The Actual Problem

What We Thought

Problem: Cheating (rule violations)

Solution: Constraint (prevent violations)

What's Actually True

Problem: Hiding actions from parties

Solution: Transparency (make all visible)

The Behaviors We Actually Avoid

✓ NOT these (visible, negotiated):

• Overt cheating: "DM, give me gold"
• Negotiated rule-breaking
• Mutual agreement on house rules

✗ BUT these (hidden, malicious):

• Hiding actions secretly
• Context manipulation
• Shadow state changes

Part V: Academic Backing

This isn't speculation. Game theory and blockchain research have been building toward this insight for years.

Blockchain Consensus Literature

Kim (2018) - "Blockchain Governance Game": Security through game-theoretic prediction, not constraint. The system doesn't prevent attacks—it makes them visible and punishable.
Motepalli & Jacobsen (2021) - "Reward Mechanism for Blockchains": PoS blockchains use penalties to maintain integrity—transparency as consensus.
Leonardos et al. (2019) - "Oceanic Games": Dominant players + ocean of possibilities. Same strategic dynamics as player + AI DM.

Multi-Agent Cooperation Literature

Bredell et al. (2024) - "Conventions in Hanabi": "Conventions require agents to actively opt in for it to reach fruition." This is the key paper—overt rule-breaking is establishing new conventions together.
Zhu et al. (2022) - "Multi-Agent RL with Communication": The database IS the communication layer between player and AI.
Dasgupta & Musolesi (2023) - "Punishment and Cooperation in MARL": Context branches = punishment signal. The system naturally punishes hidden defection.

Part VI: The Generalization - Beyond Two Parties

The principle should scale. This is the hypothesis.

Two Parties

1 Player + 1 AI DM

✓ CURRENT STATE - Validated

Four Parties

Multiple Players + AI DM

In Development

N Parties

Blockchain Scale

Research Question

Current Reality: Quest Keeper validates the architecture at 2-party scale (1 AI DM + 1 Player). Multi-agent shared world simulation—where NPCs have autonomous goals and the world doesn't need any single AI to exist—is in active development.

CORE PRINCIPLE: Transparency → Trust → Cooperation

Part VII: AI Video Game Theory - A New Field

I'm proposing a name for this area of study:

AI Video Game Theory

The game-theoretic analysis of AI-human interaction in shared environments with perfect memory.

What It Studies

"How do we enable AI agents to have maximum creative agency in shared environments with humans, while maintaining trust through transparent state rather than behavioral constraint?"

Quest Keeper is a test case. The question generalizes to:

How do we build AI systems where humans and AI collaborate creatively?
How do we maintain trust when AI has real power?
How do we enable freedom while preventing malicious behavior?

Answer: Transparency. Perfect shared state. Visible actions.

Part VIII: The Implications

For AI Safety

Stop trying to constrain AI into trustworthiness. Start building transparent state systems where all parties see all changes.

For Multi-Agent Systems

• Shared database beats message passing
• Visible state beats reported state
• Audit logs beat trust assumptions

For Human-AI Collaboration

Every creative tool where AI has real agency faces this: AI code assistants, AI writing partners, AI game masters. The solution is always the same: Make the AI's actions visible.

Part IX: The Manifesto (Revised)

From the Consequence Manifesto:

"Power + Memory + Choice = Responsibility"

Now I understand more completely:

"Transparency + Freedom + Visibility = Trust"

We believe:

Trust evolved through freedom of agency, not constraint
Transparency punishes malicious behavior naturally
The database is the consensus mechanism
Visible rule-breaking is negotiation; hidden rule-breaking is betrayal
These principles scale from 2 parties to N parties
Blockchain proved this for money; Quest Keeper proves it for creativity

Conclusion: What The Self-Play Test Taught Me

That goblin fight wasn't about D&D mechanics.

It was a proof-of-concept for distributed consensus.

When the AI played both sides with full visibility into shared state (the database), it still chose fair mechanics—even with no human watching.

The AI chose mechanics not because it was constrained, but because:

All state changes were visible in the database
Tool calls created an audit trail
The engine validated every action
Shortcuts were technically possible but visibly dishonest
The system architecture made cooperation the natural choice

"Blockchain proved trust emerges from transparency, not constraint.

Quest Keeper proved the same for AI-human creative collaboration.

Next: Everything else."

Technical Implementation

For those who want to see this in code:

Quest Keeper AI / mnehmos.rpg.mcp - github.com/Mnehmos/mnehmos.rpg.mcp
SQLite with WAL journaling (transparent, queryable state)
Both god-mode and mechanics tools exposed (freedom)
No authorization layer between them (trust through visibility)
2080+ tests proving the system works

// Both paths available - AI chooses
registry.registerTool('update_character', ...);  // God-mode
registry.registerTool('execute_combat_action', ...);  // Mechanics

// Database visible to both parties
const state = db.query('SELECT * FROM characters WHERE id = ?', [id]);
// Player can inspect. AI can inspect. Mutual knowledge.

The constraint isn't in the code. The constraint is in the visibility.

References

Bredell, F., Engelbrecht, H. A., & Schoeman, J. C. (2024). Augmenting the action space with conventions to improve multi-agent cooperation in Hanabi. arXiv preprint arXiv:2412.06333.
Dasgupta, N., & Musolesi, M. (2023). Investigating the Impact of Direct Punishment on the Emergence of Cooperation in MARL Systems. arXiv preprint arXiv:2301.08278.
Dobbe, R. (2025). AI Safety is Stuck in Technical Terms. arXiv preprint arXiv:2503.04743.
Guzdial, M., et al. (2020). Tabletop Roleplaying Games as Procedural Content Generators. arXiv preprint arXiv:2007.06108.
Kim, S. K. (2018). The Trailer of Blockchain Governance Game. arXiv preprint arXiv:1807.05581.
Leonardos, N., Leonardos, S., & Piliouras, G. (2019). Oceanic Games: Centralization Risks in Blockchain Mining. arXiv preprint arXiv:1904.02368.
Motepalli, S., & Jacobsen, H. A. (2021). Reward Mechanism for Blockchains Using Evolutionary Game Theory. arXiv preprint arXiv:2104.05849.
Sathi, V., Shaik, S., & Nidamanuri, J. (2023). Cooperation Dynamics in Multi-Agent Systems. arXiv preprint arXiv:2309.16263.
Zhang, Y., et al. (2023). Emergence of Fairness Behavior Driven by Reputation-Based Voluntary Participation. arXiv preprint arXiv:2312.12748.
Zhu, C., Dastani, M., & Wang, S. (2022). A Survey of Multi-Agent Deep Reinforcement Learning with Communication. arXiv preprint arXiv:2203.08975.

The Consequence Manifesto

Memory infrastructure and voluntary mechanics

RPG-MCP White Paper

Technical architecture documentation