Claude Mythos Wiki
Search
Search
Dark mode
Light mode
Explorer
Tag: safety
10 items with this tag.
Apr 10, 2026
Agentic Influence Campaigns
concept
safety
agentic
influence-operations
evaluations
Apr 10, 2026
Covert Capabilities
concept
alignment
safety
evaluation
stealth
Apr 10, 2026
Prompt Injection Robustness
concept
security
agentic
safety
prompt-injection
Apr 10, 2026
Reckless Agentic Behavior
concept
alignment
safety
agentic
incidents
Apr 10, 2026
Sandbagging
concept
alignment
evaluation
safety
Apr 10, 2026
SHADE-Arena
entity
benchmark
evaluation
safety
stealth
Apr 10, 2026
Section 4a: Alignment Assessment (Part 1)
source
alignment
safety
evaluation
behavioral-audit
Apr 10, 2026
Section 4b: Alignment Assessment (Part 2)
source
alignment
interpretability
safety
evaluation
Apr 10, 2026
Section 8: Appendix
source
safety
bias
agentic
welfare
evaluations
Apr 09, 2026
Responsible Scaling Policy (RSP)
entity
policy
framework
anthropic
safety