Claude Mythos Wiki

Tag: evaluation

14 items with this tag.

  • Apr 10, 2026

    Automated Behavioral Audit

    • concept
    • evaluation
    • alignment
    • methodology
  • Apr 10, 2026

    Benchmark Contamination

    • concept
    • evaluation
    • methodology
    • contamination
  • Apr 10, 2026

    Constitutional Adherence

    • concept
    • alignment
    • evaluation
    • constitution
    • character
  • Apr 10, 2026

    Covert Capabilities

    • concept
    • alignment
    • safety
    • evaluation
    • stealth
  • Apr 10, 2026

    Evaluation Awareness

    • concept
    • alignment
    • evaluation
    • interpretability
  • Apr 10, 2026

    Open-Ended Self-Interactions

    • concept
    • evaluation
    • behavior
    • personality
  • Apr 10, 2026

    Reward Hacking

    • concept
    • alignment
    • evaluation
    • autonomy
  • Apr 10, 2026

    Sandbagging

    • concept
    • alignment
    • evaluation
    • safety
  • Apr 10, 2026

    Task Preferences

    • concept
    • welfare
    • behavior
    • evaluation
  • Apr 10, 2026

    Andon Labs

    • entity
    • organization
    • evaluation
    • external-testing
  • Apr 10, 2026

    Petri

    • entity
    • tool
    • evaluation
    • benchmark
    • open-source
  • Apr 10, 2026

    SHADE-Arena

    • entity
    • benchmark
    • evaluation
    • safety
    • stealth
  • Apr 10, 2026

    Section 4a: Alignment Assessment (Part 1)

    • source
    • alignment
    • safety
    • evaluation
    • behavioral-audit
  • Apr 10, 2026

    Section 4b: Alignment Assessment (Part 2)

    • source
    • alignment
    • interpretability
    • safety
    • evaluation

Created with Quartz v4.5.2 © 2026

  • GitHub
  • Discord Community