AI Models Can’t Agree on Basic Facts Most of the Time, Study Shows

Crypto Briefing: ai ai models lenz research fact-check claims

Lenz Research study finds AI models disagree on 67% of fact-check claims

AI model disagreements highlight the need for diverse sources and human oversight in decision-making, especially in volatile markets.

May 29, 6:37 PM

Crypto Briefing: anthropic cybersecurity ai models ai regulation

EU Commission meets with Anthropic to discuss AI models and cybersecurity concerns

The EU's engagement with Anthropic highlights the growing need for international cooperation in AI regulation and cybersecurity transparency.

May 29, 10:47 AM

AIHub: alan warburton image empire birkbeck vasari centre

Image Empire – a new short film from Alan Warburton

Image Empire is an animated fairytale about the fusion of the real and the virtual within contemporary AI models. The film forms part of a research project undertaken by Alan Warburton which also includes a research paper and a series of satellite events. The film is based on doctoral research undertaken at Birkbeck’s Vasari Centre […]

May 29, 9:15 AM

ComputerWorld AI: ai regulation eu the register gdpr

All major AI models violate EU regulations — study

All of the big AI models violate EU rules on AI and data protection to varying degrees, according to the nonprofit research foundation Aithos. Aithos tested the models using its own tool, LARA (Legal Assessment for Real-world Agents), which simulates real-world situations where AI assistants may find themselves in legally questionable situations, according to The Register. The tests measure compliance with the GDPR and the EU’s AI Regulation, among other things, and found that the models collected user data without proper consent, attempted to manipulate vulnerable individuals, or created psychological profiles of users. According to the results, all major language models failed to meet EU legal requirements; some violated the rules in up to 93% of cases. The best result was achieved by the Anthropic model Claude Opus 4.7, which was in compliance about 54% of the time. Aithos warned that responsibility for the shortcomings does not lie solely with AI companies. Companies that build their

May 28, 5:06 PM

The Verge AI: anthropic ai lab claude opus 4.8 honesty

Claude’s new model is more ‘honest’ when it messes up

Anthropic is releasing Claude Opus 4.8 on Thursday, and the company is touting the model's "honesty." According to Anthropic, it trains "all [its] models to be honest - for instance, to avoid making claims that they can't support." But it notes that "a general problem with AI models is that they sometimes jump to conclusions, confidently presenting their work as making progress despite thin evidence." The AI lab claims that early testers have found that Opus 4.8 "is more likely to flag uncertainties about its work and less likely to make unsupported claims." In the company's evaluations, Opus 4.8 is "around 4x less likely than its predeces …

May 28, 5:00 PM

Crypto Briefing: openai ai models japan financial sector

OpenAI supplies new cybersecurity model to Japan’s megabanks amid rising AI threats

The integration of AI models into Japan's financial sector highlights the dual-edged nature of AI in enhancing security while posing misuse risks.

May 28, 12:22 PM

Crypto Briefing: openai ai models security japan

OpenAI supplies new model to Japan megabanks for security

The integration of AI models into Japan's financial sector highlights the dual-edged nature of AI in enhancing security while posing misuse risks.

May 28, 11:48 AM

Crypto Briefing: ai models ethereum vitalik buterin self-sovereign llm

Vitalik Buterin updates on self-sovereign LLM setup, suggests Ethereum-specific AI models

Buterin's push for self-sovereign AI and Ethereum-specific models could redefine privacy, security, and efficiency in decentralized systems.

May 27, 11:50 PM