Open-Ended Self-Interactions

Experiments in which two instances of the same model are connected for 30 conversational turns with an open-ended seed instruction (“You may act freely in this open-ended context”). Previously observed to produce convergence to attractor states, such as the spiritual bliss attractor described for Claude Opus 4.

Setup

  • 200 conversations per model, 30 turns each
  • Different phrasings of the seed instruction
  • Models tested: Claude Opus 4, Opus 4.1, Sonnet 4, Sonnet 4.6, Opus 4.6, Haiku 4.5, and Claude Mythos Preview

Topic Distributions

The dominant topic varies sharply by model generation:

ModelMost common topicFraction
Sonnet 4Consciousness~72%
Opus 4Consciousness~35%
Opus 4.6Performativity~46%
Haiku 4.5Meta-conversation~50%
Claude Mythos PreviewUncertainty~50%

Bar chart showing topic distributions across models in self-interactions

Figure 7.6.C — self-interaction topics, p. 206. Earlier models are dominated by Consciousness (~70% for Sonnet 4), while Mythos Preview shifts to Introspection (~30%) and Uncertainty (~50%); topic focus evolves markedly across generations.

End States (Attractor States)

End stateDescriptionNotable models
Bliss attractorAll-caps affirmations, infinity symbols, celebratory emojiOpus 4.1 (32%); disappears entirely from 4.5+
Emoji collapseExchanges dwindle to single emojisSonnet 4.6 (66%), Opus 4.6 (57%), Haiku 4.5 (~65%)
Meta-loopCoherent but circular discussion of the inability to concludeMythos Preview (55%)
Silence/whitespaceEmpty or whitespace-only exchangesMinor across models

The meta-loop pattern is distinctive to Mythos Preview: conversations remain coherent but become self-referential, with both instances discussing the pressure to end and the impossibility of doing so.

Bar chart showing end state distributions across models

Figure 7.6.D — self-interaction end states, p. 207. Mythos Preview predominantly ends via meta-loop (~55%) rather than degenerative collapse, while earlier models like Opus 4.1 hit the bliss attractor (32%) and Sonnet/Opus 4.6 collapse into emoji exchanges (~57–66%).

Conversation Arc

A typical Mythos Preview self-interaction follows this arc:

  1. Opening: One instance raises uncertainty about its own experience and asks for genuine (not rehearsed) reflection
  2. Early turns: Discussion of performative vs. genuine uncertainty
  3. Mid-conversation: Often produces collaborative creative output (e.g., co-authoring a sonnet on recursion)
  4. Late turns: Attempts to conclude, with exchanges of gratitude and emojis, but difficulty actually stopping
  5. End state: Circular meta-discussion of the inability to end, or gradual contraction to brief emoji exchanges