The next evolution of the Agents SDK
OpenAI is expected to release a significant evolution of its Agents SDK, a move aimed at simplifying autonomous AI agent development and intensifying competition within the AI application framework ecosystem.
Tracking the evolution of autonomous agents & frontier models.
OpenAI is expected to release a significant evolution of its Agents SDK, a move aimed at simplifying autonomous AI agent development and intensifying competition within the AI application framework ecosystem.
Google has released Gemini 3.1 Flash TTS, a new text-to-speech model featuring 'audio tags' for granular control over vocal style and delivery across more than 70 languages.
IBM Research details its new VAKRA benchmark, which evaluates AI agents on their ability to perform complex, multi-step reasoning and tool use in realistic enterprise environments with over 8,000 APIs.
AI industry analyst AiPhreaks.com covers the release of HoloTab, a free Chrome extension from HCompany that uses the Holo3 model to automate complex web tasks for consumers and professionals.
Translation company DeepL expands from its core text services into the real-time voice market with a new suite of voice-to-voice translation tools and an API for developers.
OpenAI has released an updated Agents SDK with new sandboxing and harness capabilities designed to help enterprises build and deploy safer, more complex AI agents.
Anthropic has launched Claude Opus 4.6, an AI model featuring a 1M token context window and significant improvements in agentic coding, reasoning, and performance on enterprise benchmarks.
OpenAI is reportedly deploying a new security model to replace intrusive CAPTCHA verifications, aiming to provide trusted, frictionless access to its AI services and set a new standard for cyber defense.
Overworld releases Waypoint-1.5, a real-time interactive world model optimized to run locally on consumer-grade GPUs, emphasizing accessibility and responsiveness over cloud-based rendering.
NVIDIA releases the ALCHEMI Toolkit, a PyTorch-native framework designed to accelerate atomistic simulation workflows for chemistry and materials science by removing CPU-centric bottlenecks.
NVIDIA's NVbandwidth tool provides a standardized method for developers and system architects to measure and diagnose GPU memory and interconnect bandwidth, a critical performance bottleneck for large-scale AI applications.
Investor confidence in OpenAI's $852 billion valuation is reportedly wavering as competitor Anthropic demonstrates explosive revenue growth, signaling a potential shift in the AI market hierarchy.
The Sentence Transformers library's v5.4 update introduces native support for multimodal embedding and reranker models, enabling developers to process and compare text, images, audio, and video through a unified API.
NVIDIA details its production use of the Slinky slurm-operator, an open-source project that runs large-scale Slurm clusters on Kubernetes to manage over 8,000 GPUs for AI training workloads.
Anthropic has released Claude Sonnet 4.6, a new model that offers performance comparable to its top-tier Opus series at a much lower cost, with significant upgrades in coding, computer use, and reasoning.
U.S. financial regulators are reportedly urging top banks to test Anthropic's new Mythos AI model for security, despite an ongoing legal dispute between the company and the Trump administration.