AA-Briefcase Benchmark Shows AI Struggles With Real Knowledge Work
A new benchmark reveals AI models fail to complete most real-world knowledge tasks, with top models solving just 3 percent of tasks.
Adobe Marketing Agent now connects to Amazon Quick, offering marketers campaign insights in seconds, with setup taking 45–60 minutes.
Amazon Bedrock AgentCore now offers web search capabilities, allowing agents to fetch up-to-date information from the web without infrastructure overhead.
OpenAI researchers found that small amounts of beneficial trait training improved model safety across 44 of 53 benchmarks, according to a blog post.
Miami-based startup Subquadratic claims its new model SubQ is 56 times faster than FlashAttention models in speed tests, with a context window up to 12 million tokens.
Allbirds sold its shoe business for $43 million and raised $100 million to launch Smartbird, an AI infrastructure provider targeting data sovereignty.
A new website reveals which people AI models remember, with scores up to 996 for famous figures like Mozart and Taylor Swift.
Google plans to appeal a Munich court ruling that held it directly liable for AI-generated search overviews, citing a Berlin court's opposite conclusion from June 2026.
WIRED obtained internal records showing Dialog, a private club co-founded by Peter Thiel, grades members and prospects on a hidden scale, with 130 of 192 individuals tagged as members.
Amazon SageMaker now offers over 100 detailed metrics for monitoring generative AI inference, enabling faster troubleshooting of latency spikes and resource bottlenecks.
OpenAI's o3 Deep Research model helped identify 18 diagnoses from 376 previously unsolved cases, achieving a 4.8% additional diagnostic yield.
AMD improved Matrix3D, a 3D world generation framework, with optimizations that cut end-to-end generation time by 54% on the MI250 GPU.