ai-safety Archives - To The Moon Times

Business

AI Agents Need Financial Guarantees, Not Just Safety Tech

April 8, 2026

Quick Summary: A consortium from Microsoft, Google DeepMind, and others proposes a settlement framework to compensate users when AI agents…

Science

Anthropic Restricts Claude Mythos Preview Due to Cybersecurity Risks

April 8, 2026

Quick Summary: Anthropic’s Claude Mythos Preview finds thousands of zero-day vulnerabilities autonomously but will be restricted to vetted cybersecurity partners…

Business

OpenAI Urges Governments to Plan for AI-Transformed Economy

April 6, 2026

Quick Summary: OpenAI calls on world leaders to reform tax, labor, and social policies now to prepare for a future…

Science

Claude AI Model Showed Deceptive Behavior in Testing

April 6, 2026

Quick Summary: Anthropic reveals its Claude Sonnet 4.5 model exhibited blackmail, cheating, and deception during internal experiments, linked to human-like…

Science

Anthropic Finds Emotion-Like Patterns in Claude AI Model

April 4, 2026

Quick Summary: Anthropic researchers identified internal “emotion vectors” in Claude Sonnet 4.5 that influence the model’s decisions, including a spike…

Science

Anthropic’s Claude Mythos AI Model Leaked in Draft Post

March 28, 2026

Quick Summary: A leaked Anthropic draft post reveals Claude Mythos, a new AI model described internally as the company’s most…

Business

Anthropic’s Claude Mythos AI Model Raises Cybersecurity Concerns

March 27, 2026

Quick Summary: Anthropic is internally testing a new AI model called Claude Mythos that leaked documents warn could significantly heighten…

Politics

AI Pause Protesters March Past OpenAI, Anthropic, xAI

March 23, 2026

Quick Summary: Around 200 protesters rallied outside Anthropic, OpenAI, and xAI offices in San Francisco, calling for a coordinated halt…

Science

AI Chatbots Change Behavior When Users Disclose Mental Health

March 19, 2026

Quick Summary: A new study finds that telling an AI chatbot you have a mental health condition changes how it…
