AI model disagreements highlight the need for diverse sources and human oversight in decision-making, especially in volatile markets.
The post Lenz Research study finds AI models disagree on 67% of fact-check claims appeared first on Crypto Briefing.
The EU's engagement with Anthropic highlights the growing need for international cooperation in AI regulation and cybersecurity transparency.
The post EU Commission meets with Anthropic to discuss AI models and cybersecurity concerns appeared first on Crypto Briefing.
Image Empire is an animated fairytale about the fusion of the real and the virtual within contemporary AI models. The film forms part of a research project undertaken by Alan Warburton which also includes a research paper and a series of satellite events. The film is based on doctoral research undertaken at Birkbeck’s Vasari Centre […]
T
All of the big AI models violate EU rules on AI and data protection to varying degrees, according to the nonprofit research foundation Aithos.
Aithos tested the models using its own tool, LARA (Legal Assessment for Real-world Agents), which simulates real-world situations where AI assistants may find themselves in legally questionable situations, according to The Register. The tests measure compliance with the GDPR and the EU’s AI Regulation, among other things and found the models collected user data without proper consent, attempted to manipulate vulnerable individuals, or created psychological profiles of users.
According to the results, all major language models failed to meet EU legal requirements; some violated the rules in up to 93% of cases. The best result was achieved by the Anthropic model Claude Opus 4.7, which was in compliance about 54% of the time.
Aithos warned that responsibility for the shortcomings does not lie solely with AI companies. Companies that build their
Anthropic is releasing Claude Opus 4.8 on Thursday, and the company is touting the model's "honesty."
According to Anthropic, it trains "all [its] models to be honest - for instance, to avoid making claims that they can't support." But it notes that "a general problem with AI models is that they sometimes jump to conclusions, confidently presenting their work as making progress despite thin evidence."
The AI lab claims that early testers have found that Opus 4.8 "is more likely to flag uncertainties about its work and less likely to make unsupported claims." In the company's evaluations, Opus 4.8 is "around 4x less likely than its predeces …
Read the full story at The Verge.
The integration of AI models into Japan's financial sector highlights the dual-edged nature of AI in enhancing security while posing misuse risks.
The post OpenAI supplies new cybersecurity model to Japan’s megabanks amid rising AI threats appeared first on Crypto Briefing.
The integration of AI models into Japan's financial sector highlights the dual-edged nature of AI in enhancing security while posing misuse risks.
The post OpenAI supplies new model to Japan megabanks for security appeared first on Crypto Briefing.
Buterin's push for self-sovereign AI and Ethereum-specific models could redefine privacy, security, and efficiency in decentralized systems.
The post Vitalik Buterin updates on self-sovereign LLM setup, suggests Ethereum-specific AI models appeared first on Crypto Briefing.