Can an AI agent run the entire scientific method without human supervision?
Toward Generalist Autonomous Research via Hypothesis-Tree Refinement
AI Accelerator Institute
A Microsoft and Huazhong University benchmark tested GPT-4o, GPT-5, Grok-3, and others on realistic enterprise data scenarios. Privacy violation rates hit 50.9%. More capable models made it worse, and the fix has nothing to do with model selection...
Microsoft has introduced usage-based billing for Copilot Cowork, which is now generally available. Microsoft unveiled Copilot Cowork in March, pitching it as an AI agent that’s capable of independently performing long-running, multi-step tasks, even when a user’s computer is off. It’s built on the same technology that underpins Anthropic’s Claude Cowork. Unlike Claude Cowork, which can interact directly with files and applications on a user’s computer, Copilot Cowork runs in Microsoft’s cloud environment and acts on documents held in a customer’s Microsoft 365 tenant. On Tuesday, Microsoft unveiled pricing details for Copilot Cowork, which involves usage-based billing in addition to a Microsoft 365 Copilot license ($30 per user each month for large enterprises before discounts, and $20 for Microsoft 365 Copilot for Business). The usage-based pricing is calculated from four components, according to Microsoft: “model
First came vector databases, then RAG. Now, the next frontier in enterprise AI is taking shape: context layers that give autonomous agents a shared understanding of the business, a vision Databricks is advancing with Genie Ontology. Currently in preview, Genie Ontology automatically extracts business context from enterprise data, dashboards, queries, pipelines, documents, and applications and organizes it into a living graph that AI agents can use to understand how an organization operates. Showcased at the company’s Data + AI Summit, Genie Ontology uses a ranking system inspired by Google’s PageRank to identify the most authoritative business definitions within an organization. Rather than treating all sources equally, it weighs factors including who created the information, how widely it is used, its links to certified datasets and assets, and how recently it was updated before determining which answer an AI agent should rely on, Databricks CEO Ali Ghodsi said during his keynote late
Bitcoin price prediction outlook shaped by SpaceX IPO liquidity shift while best crypto presales and AI agent crypto trends gain attention.
Despite best efforts by defenders, malicious emails continue to slip through the cybersecurity cracks, leading some enterprises to implement a layered “defense in depth” strategy that incorporates multiple tools. Microsoft seems to be challenging this idea, revealing that there are only nominal returns from adding integrated pre- and post-send partners to Defender for Office 365’s protections. According to its new quarterly benchmarking data, the tech giant catches the vast majority of malicious and spam emails before delivery, misses the fewest compared to competitors by a wide margin, and removes nearly 100% of dangerous emails that do reach the inbox. Collectively, its integrated partners improve that catch rate by less than 0.05%. While these numbers seem to tip the scales towards a one-vendor email security stack, experts urge enterprises to be skeptical and cautious of such vendor claims. Seva Ioussoufovitch, senior research analyst at Info-Tech Research Group, pointed out, “perce
The rise of AI-driven transactions using USDC could revolutionize digital commerce, enabling seamless, autonomous microtransactions at scale. The post Circle showcases AI agent’s wallet capabilities with USDC transactions appeared first on Crypto Briefing.
The failed leasing talks may slow infrastructure expansion, impacting both companies' growth strategies and competitive positioning in cloud services. The post Microsoft and Oracle cloud infrastructure leasing talks reportedly fall through appeared first on Crypto Briefing.
Microsoft's pricing shift to usage-based models may enhance scalability but introduces cost unpredictability and data sovereignty concerns. The post Microsoft shifts Copilot Cowork to usage-based pricing, considers DeepSeek model for enterprise AI appeared first on Crypto Briefing.