Improving AI agents through better evaluations

TechCrunch AI | anthropic mythos firefox mozilla

How Anthropic’s Mythos has rewritten Firefox’s approach to cybersecurity

Security researchers at Mozilla say Anthropic's Mythos has unearthed a wealth of high-severity bugs in Firefox.

May 7, 4:05 PM

Fast Company AI | grok elon musk anthropic spacex

Grok’s usage is so low that Elon Musk can sell compute to Anthropic

Anthropic says it’ll use all the AI compute capacity from SpaceX’s ‘Colossus 1’ data facility in Memphis.

May 7, 4:00 PM

InfoWorld AI | teradata autonomous knowledge platform ai agents cloud

Teradata launches platform for enterprise AI agents moving beyond pilots

Teradata has launched its Autonomous Knowledge Platform, a new flagship offering that brings together data, analytics, AI development, agent orchestration, and governance across cloud, on-premises, and hybrid environments. The target customer is an enterprise that has moved beyond testing AI assistants and is now asking harder questions: which data agents can use, what actions they can take, how much they will cost to run, and who is accountable when something goes wrong. The company said the platform builds on its existing database engine and governance infrastructure, while adding new capabilities and more tightly integrating existing ones, including AI Studio, the Tera natural-language workspace, Tera Agents, Elastic Compute on Teradata Cloud, and the upcoming Teradata Factory for on-premises AI workloads. Teradata is entering a competitive market with this. Snowflake, Databricks, Microsoft, Oracle, and Salesforce are all trying to persuade customers that their platforms should beco

May 7, 2:30 PM

AI Business | anthropic spacex compute capacity

Anthropic and SpaceX Agree to Major Compute Capacity Deal

Meanwhile, the independent generative AI vendor expanded usage limits and reduced subscriber usage restrictions for big customers.

May 7, 2:23 PM

TechCrunch AI | spotify ai-generated personal audio codex claude code

Spotify wants to become the home for AI-generated personal audio

Users will be able to create a podcast from Codex or Claude Code and import it to Spotify

May 7, 1:00 PM

Just AI News | tekst €11.5m enterprise back-office ai ghent

Tekst Raises €11.5M to Automate Enterprise Back-Office AI

Ghent-based Tekst has raised €11.5M in a Series A led by Elephant to automate back-office processes using AI agents.

May 7, 12:32 PM

The Rundown AI | anthropic spacex claude design

Anthropic, SpaceX(AI) become unlikely compute partners

PLUS: Use Claude Design’s slide decks feature like a pro

May 7, 9:00 AM

InfoWorld AI | claude code cursor software development coding

Three skills that matter when AI handles the coding

Writing code has always been the most time- and resource-intensive task in software development. AI is changing that, and faster than most engineering organizations are prepared for. Tools like Claude Code and Cursor are already handling significant parts of code construction, freeing developers to spend more time on requirements, architecture, and design. But that shift creates a new challenge nobody is talking about enough. As AI takes on the heavy lifting, the skills that matter most are moving upstream: how to provide the right context for a prompt, how to evaluate what the model produces, and how to understand a problem deeply enough that you can’t be fooled by a confident but wrong answer. This piece explores those three skills and why developers who master them will have a significant edge over those who don’t.

Beyond coding: Mastering the art of the prompt

Software translation tools such as compilers and assemblers map a high-level description of code to a lower-level represent