Meta Reality Labs releases a new foundation model family for human-centric vision that pushes pose estimation, segmentation, and 3D geometry to new state-of-the-art levels — all from a single backbone.
The post Meta AI Releases Sapiens2: A High-Resolution Human-Centric Vision Model for Pose, Segmentation, Normals, Pointmap, and Albedo appeared first on MarkTechPost.
Insider Brief: Genesis AI unveiled a new robotics foundation model called GENE-26.5 alongside a proprietary robotic hand and data collection system designed to improve how robots learn complex physical tasks from human behavior. The system is intended to address the difficulty of collecting large amounts of usable training data for physical tasks, which the company […]
The Meta AI team has released NeuralBench, a unified open-source framework for benchmarking NeuroAI models, alongside NeuralBench-EEG v1.0, the largest open EEG benchmark to date. It covers 36 tasks, 94 datasets, and 14 deep learning architectures, all evaluated under a single standardized interface across 9,478 subjects and 13,603 hours of brain recordings.
The post Meta AI Releases NeuralBench: A Unified Open-Source Framework to Benchmark NeuroAI Models Across 36 EEG Tasks and 94 Datasets appeared first on MarkTechPost.
Exploring the inner workings of a decoder-only Transformer foundation model
The post Timer-XL: A Long-Context Foundation Model for Time-Series Forecasting appeared first on Towards Data Science.
Insider Brief: Defense AI startup Scout AI has raised $100 million in an oversubscribed Series A round to develop a foundation model for autonomous military operations. The round was co-led by Align Ventures and Draper Associates, with participation from Decisive Point, Booz Allen Ventures, BVVC, Neman Ventures, Evolution VC Partners, Heraclitus Capital Management, Sigmas Group, […]
The model unifies speech, environmental sound, music, and temporal reasoning into a single architecture — and outperforms every open-source model tested on general audio benchmarks, including systems more than four times its size.
The post OpenMOSS Releases MOSS-Audio: An Open-Source Foundation Model for Speech, Sound, Music, and Time-Aware Audio Reasoning appeared first on MarkTechPost.
A new Google paper argues that image generation pretraining is to computer vision what GPT-style pretraining is to NLP — and the benchmark numbers back that up.
The post Google DeepMind Introduces Vision Banana: An Instruction-Tuned Image Generator That Beats SAM 3 on Segmentation and Depth Anything V3 on Metric Depth Estimation appeared first on MarkTechPost.
Researchers from Meta AI and the King Abdullah University of Science and Technology (KAUST) have introduced Neural Computers (NCs) — a proposed machine form in which a neural network itself acts as the running computer, rather than as a layer sitting on top of one. The research team presents both a theoretical framework and two […]
The post Meta AI and KAUST Researchers Propose Neural Computers That Fold Computation, Memory, and I/O Into One Learned Model appeared first on MarkTechPost.