Welcome to your Sunday, March 8, 2026, developer briefing. As we close out the week, the industry is grappling with the dual realities of rapid AI integration and a challenging economic climate for technical professionals.
Hacker News
Updated 3 minutes agoTechCrunch
Updated about 2 hours agoThe Pro-Human Declaration was finalized before last week's Pentagon-Anthropic standoff, but the collision of the two events wasn’t lost on anyone involved.
A coalition of telecom operators and device makers is pushing $40 smartphones to bring up to 20 million people online, but rising component costs threaten the plan.
Most of it is tied to performance, including new stock incentives linked to Waymo and Wing, its drone delivery venture.
A recently-added feature in Grammarly purports to improve users’ writing with help from the world's great writers and thinkers — and some tech journalists, too.
Hardware executive Caitlin Kalinowski announced today that in response to OpenAI's controversial agreement with the Department of Defense, she’s resigned from her role le...
Ars Technica
Updated about 11 hours agoA unique head spike and fish-eating jaws help make sense of these dinosaurs.
Research shows apparent Iranian state hackers trying to hijack consumer-grade cameras.
The Exploration Upper Stage did not in any way get NASA closer to landing on the Moon.
Planet wants to prevent "adversarial actors" from using images for "Battle Damage Assessment" purposes.
Fishing crews face horrifying burns from dredging the dumped chemical weapons.
VentureBeat
Updated about 11 hours agoAs models get smarter and more capable, the "harnesses" around them must also evolve. This "harness engineering" is an extension of context engineering, says LangChain co...
“When you get a demo and something works 90% of the time, that’s just the first nine.” — Andrej KarpathyThe “March of Nines” frames a common production reality: You can r...
San Francisco startup Anthropic continues to ship new AI products and services at a blistering pace, despite a messy ongoing dispute with the U.S. Department of War.Today...
Enterprise AI applications that handle large documents or long-horizon tasks face a severe memory bottleneck. As the context grows longer, so does the KV cache, the area...
Google senior AI product manager Shubham Saboo has turned one of the thorniest problems in agent design into an open-source engineering exercise: persistent memory.This w...
InfoQ
Updated 1 minute agoScaling Human Judgment: How Dropbox Uses LLMs to Improve Labeling for RAG Systems
To improve the relevance of responses produced by Dropbox Dash, Dropbox engineers began using LLMs to augment human labelling, which plays a crucial role in identifying t...
AWS Introduces Nested Virtualization on EC2 Instances
AWS recently announced support for nested virtual machines within virtualized EC2 instances running KVM or Hyper-V. A long-awaited feature by the community, the new optio...
Standardizing Post-Quantum IPsec: Cloudflare Adopts Hybrid ML-KEM to Replace Ciphersuite Bloat
Cloudflare has extended hybrid post-quantum encryption to IPsec and WAN traffic, standardizing its SASE stack ahead of the NIST 2030 deadline. By adopting a streamlined M...
New Research Reassesses the Value of AGENTS.md Files for AI Coding
Despite widespread industry recommendations, a new ETH Zurich paper concludes that AGENTS.md files may often hinder AI coding agents. The researchers recommend omitting L...
Architecting for Global Scale: Inside DoorDash’s Unified, Composable Dasher Onboarding Platform
DoorDash has rebuilt its Dasher onboarding into a unified, modular platform to support global expansion. The new architecture uses reusable step modules, a centralized st...
Wired AI
Updated about 12 hours agoDeveillance’s Spectre I, developed by a recent Harvard grad, wants to give people control over the always-on wearables surrounding their lives. The problem? Physics.
I stuck Amazon’s Echo Show 15 and its Alexa+ AI assistant in my kitchen for a month. Things have not gone well.
In an exclusive interview with WIRED, Block’s cofounder and CEO says he axed 40 percent of his workforce so that he can rebuild the company “as an intelligence.”
In this episode, our hosts unpack the ongoing conflict in the Middle East, particularly as the AI industry has been entrenching itself with the Department of Defense.
Sources allege the Defense Department experimented with Microsoft’s version of OpenAI technology before the ChatGPT-maker lifted its prohibition on military applications.
Towards Data Science
Updated 41 minutes agoWhy traditional RAG loses context and how contextual retrieval dramatically improves retrieval accuracy The post Understanding Context and Contextual Retrieval in RAG app...
Five classical data science skills are becoming the scarcest resource in tech. A 90-day roadmap to build them while everyone else chases AI hype. The post The AI Bubble H...
And where is it today? The post What Makes Quantum Machine Learning “Quantum”? appeared first on Towards Data Science.
6 pillars to declutter your stack, escape the service trap, and build the missing foundations for the new primary data consumer: the AI agent. The post The Data Team’s Su...
Same notification system, two architectures. Unstructured generation couples everything into a single module. Structured generation decomposes into independent components...
Anthropic News
Updated about 2 months agoThe best model in the world for coding, agents, and computer use, with meaningful improvements to everyday tasks like slides and spreadsheets.
Claude Sonnet 4.5 sets new benchmark records in coding, reasoning, and computer use while being Anthropic's most aligned model.
Claude Haiku 4.5 matches state-of-the-art coding capabilities from months ago while delivering unprecedented speed and cost-efficiency.
Anthropic raised $13 billion in a Series F round at a $183 billion valuation to expand enterprise offerings, safety research, and international growth.
Anthropic's response to the White House AI Action Plan supports infrastructure and safety measures while calling for stronger export controls.
Windsurf News
Updated about 2 hours agoGPT-5.4 is now available in Windsurf with multiple reasoning effort levels. For a limited time, self serve users enjoy promotional pricing starting at 1x credits.
Gemini 3.1 Pro is now available in Windsurf with Low and High thinking variants. For a limited time, enjoy promotional pricing on credit usage.
Claude Sonnet 4.6 is now available in Windsurf with limited-time promotional pricing for self serve users: 2x credits without thinking and 3x credits with thinking.
GLM-5 from Zhipu AI and Minimax M2.5 are now available in Windsurf with limited-time promotional pricing. Both models are included in Arena Mode's Frontier Arena and Hybr...
OpenAI's GPT-5.3-Codex-Spark, an ultra-fast model optimized for real-time coding, is now available in Windsurf's Arena Mode Fast and Hybrid battle groups.
Cursor
Updated about 2 months agoA comprehensive guide to working with coding agents, from starting with plans to managing context, customizing workflows, and reviewing code.
As models improve as agents, we've found success by providing fewer details up front, making it easier for the agent to pull relevant context on its own.
We're partnering with ecosystem vendors who have built hooks support with Cursor.
Graphite has entered into a definitive agreement to be acquired by Cursor.
Bringing design and engineering closer together.
OpenAI News
Updated about 2 hours agoDescript uses OpenAI models to scale multilingual video dubbing, optimizing translations for both meaning and timing so dubbed speech sounds natural across languages.
Codex Security is an AI application security agent that analyzes project context to detect, validate, and patch complex vulnerabilities with higher confidence and less no...
See how Balyasny built an AI research system with GPT-5.4, rigorous model evaluation, and agent workflows to transform investment analysis at scale.
OpenAI introduces CoT-Control and finds reasoning models struggle to control their chains of thought, reinforcing monitorability as an AI safety safeguard.
Introducing GPT-5.4, OpenAI’s most most capable and efficient frontier model for professional work, with state-of-the-art coding, computer use, tool search, and 1M-token...
Google DeepMind News
Updated about 2 hours agoGemini 3.1 Flash-Lite is our fastest and most cost-efficient Gemini 3 series model yet.
Our latest image generation model offers advanced world knowledge, production ready specs, subject consistency and more, all at Flash speed.
3.1 Pro is designed for tasks where a simple answer isn’t enough.
The Gemini app now features our most advanced music generation model Lyria 3, empowering anyone to make 30-second tracks using text or images.
Google DeepMind brings National Partnerships for AI initiative to India, scaling AI for science and education
Anthropic Engineering
Updated about 2 months agoThe capabilities that make agents useful also make them difficult to evaluate. The strategies that work across deployments combine techniques to match the complexity of t...
Introducing advanced tool use on the Claude Developer Platform
Code execution with MCP: Building more efficient agents
Beyond permission prompts: making Claude Code more secure and autonomous
Hugging Face Blog
Updated about 4 hours agoNvidia Blog
Updated about 2 hours agoMarch is in full bloom, and that means a fresh wave of games heading to the cloud. 15 new titles are joining the GeForce NOW library this month. Leading the March lineup...
Autonomous networks — intelligent, self-managing telecommunications operations — are moving from a future vision to a current priority for telecom operators. In the lates...
AI-RAN is moving from lab to field, showing that a software-defined approach is the only viable way to build future AI-native wireless networks. Ahead of Mobile World Con...
Lilly this week launched the most powerful AI factory wholly owned and operated by a pharmaceutical company to help its teams make meaningful medical advancements faster,...
GeForce NOW’s anniversary celebration reaches a chilling crescendo as Capcom’s Resident Evil Requiem creeps into the cloud — and the horrors look better than ever on a Ge...
arXiv
Updated about 2 hours agoScaling imitation learning is fundamentally constrained by the efficiency of data collection. While handheld interfaces have emerged as a scalable solution for in-the-wil...
Efficient and stable training of large language models (LLMs) remains a core challenge in modern machine learning systems. To address this challenge, Reparameterized Orth...
To scale the solution of optimization and simulation problems, prior work has explored machine-learning surrogates that inexpensively map problem parameters to correspond...
Large language models sometimes produce false or misleading responses. Two approaches to this problem are honesty elicitation -- modifying prompts or weights so that the...
We provide evidence of performative chain-of-thought (CoT) in reasoning models, where a model becomes strongly confident in its final answer, but continues generating tok...
Netflix TechBlog
Updated about 2 hours agoMicrosoft Research
Updated about 2 hours agoWe are pleased to announce Phi-4-reasoning-vision-15B, a 15 billion parameter open‑weight multimodal reasoning model, available through Microsoft Foundry (opens in new ta...
Microsoft research lead Doug Burger introduces his new podcast series, "The Shape of Things to Come", an exploration into the fundamental truths about AI and how the tech...
By mid-morning, a typical knowledge worker is already juggling a client report, a budget spreadsheet, a slide deck, and an email backlog, all interdependent and all deman...
As synthetic media grows, verifying what’s real, and the origin of content, matters more than ever. Our latest report explores media integrity and authentication methods,...
Project Silica introduces new techniques for encoding data in borosilicate glass, as described in the journal Nature. These advances lower media cost and simplify writing...
FFmpeg is truly a multi-tool for media processing. As an industry-standard tool it supports a wide variety of audio and video codecs and container formats. It can also or...
Meta recognizes the long-term benefits of jemalloc, a high-performance memory allocator, in its software infrastructure. We are renewing focus on jemalloc, aiming to redu...
We are open-sourcing the initial version of RCCLX – an enhanced version of RCCL that we developed and tested on Meta’s internal workloads. RCCLX is fully integrated with...
WHAT IT IS The rise of agentic software development means code is being written, reviewed, and shipped faster than ever before across the entire industry. It also means t...
We’re sharing details of the role backend aggregation (BAG) plays in building Meta’s gigawatt-scale AI clusters like Prometheus. BAG allows us to seamlessly connect thous...
Pinterest Engineering
Updated about 2 hours agoSpotify Engineering
Updated about 2 hours agoWhen we kicked this off, we weren’t trying to ship an “AI feature.” We were trying to fix a structural... The post Our Multi-Agent Architecture for Smarter Advertising ap...
In Part 2, we will peek under the hood at the tooling that makes the Spotify release process possible. The post How We Release the Spotify App: A Look Under the Hood (Par...
TL;DR Established in 2022 as a way to help support the great open source ecosystem projects that Spotify... The post Congratulations to the recipients of the 2025 Spotify...
The technical and practical rationale for a clear separation between these domains. The post Why We Use Separate Tech Stacks for Personalization and Experimentation appea...
The system we built to ensure our AI agents produce predictable, trustworthy code. The post Background Coding Agents: Predictable Results Through Strong Feedback Loops (H...
The Airbnb Tech Blog
Updated about 2 hours agoHugging Face Trending
Updated 1 minute agoQwen/Qwen3.5-9B
Qwen3.5-9B is a 9B-parameter causal language model featuring a vision encoder and a native context length of 262,144 tokens. It utilizes a unified vision-language foundat...
Lightricks/LTX-2.3
LTX-2.3 is a DiT-based audio-video foundation model designed to generate synchronized video and audio within a single model. As an update to the LTX-2 model, it offers im...
Qwen/Qwen3.5-35B-A3B
Qwen3.5-35B-A3B is a multimodal foundation model featuring a unified vision-language architecture and a vision encoder. It utilizes an efficient hybrid architecture combi...
Qwen/Qwen3.5-0.8B
Qwen3.5-0.8B is a causal language model with a vision encoder designed for prototyping, task-specific fine-tuning, and research. This unified vision-language foundation u...
Qwen/Qwen3.5-4B
Qwen3.5-4B is a causal language model integrated with a vision encoder, designed for advanced multimodal understanding and reasoning. It utilizes a unified vision-languag...
GitHub Trending
Updated 15 minutes agoDao-AILab/flash-attention
Fast and memory-efficient exact attention
GopeedLab/gopeed
A modern download manager that supports all platforms. Built with Golang and Flutter.
donnemartin/system-design-primer
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
google/flatbuffers
FlatBuffers: Memory Efficient Serialization Library
microsoft/qlib
Qlib is an AI-oriented Quant investment platform that aims to use AI tech to empower Quant Research, from exploring ideas to implementing productions. Qlib supports diver...
AWS News Blog
Updated about 2 hours agoAWS launches OpenClaw on Amazon Lightsail to run OpenClaw instance, pairing your browser, enabling AI capabilities, and optionally connecting messaging channels. Your Lig...
This past week, I’ve been deep in the trenches helping customers transform their businesses through AI-DLC (AI-Driven Lifecycle) workshops. Throughout 2026, I’ve had the...
AWS announces the general availability of AWS Security Hub Extended, a unified, full-stack enterprise security solution. It brings together AWS detection services and cur...
AWS Elemental Inference is a fully managed AI service that automatically transforms live and on-demand video broadcasts into vertical formats optimized for mobile and soc...
Last week, my team met many developers at Developer Week in San Jose. My colleague, Vinicius Senger delivered a great keynote about renascent software—a new way of buildi...
Alibaba Cloud
Updated about 2 hours agoCloudflare
Updated about 2 hours agoCloudflare One unifies data security from endpoint to prompt: RDP clipboard controls, operation-mapped logs, on-device DLP, and Microsoft 365 Copilot scanning via API CAS...
The Cloudflare One Client now features the ability to actively probe and adjust packet sizes. This update eliminates the problems caused by tunnel layering and MTU differ...
Automatic Return Routing (ARR) solves the common enterprise challenge of overlapping private IP addresses by using stateful flow tracking instead of traditional routing t...
By transitioning the Cloudflare One Client to use QUIC streams for Proxy Mode, we eliminated the overhead of user-space TCP stacks, resulting in a 2x increase in throughp...
Cloudflare is introducing Attack Signature Detection and Full-Transaction Detection to provide continuous, high-fidelity security insights without the manual tuning of tr...
IGN News
Updated about 2 hours agoYakuza creator Toshihiro Nagoshi’s new game is now in doubt after investor NetEase warned the studio that it plans to cut off funding.
Bungie has revealed plans to change Marathon in some key ways just a few days after launch, outlining early patch notes for an update due out next week.
It sounded like bad news for those who want to get their hands on Valve’s upcoming new hardware. It seemed like the company suggested Steam Machine, Steam Controller, and...
Nintendo has sued the U.S. government over “unlawful” tariffs, demanding a refund with interest.
The Mandalorian and Grogu director Jon Favreau compares Jeremy Allen White's character Rotta the Hutt to Adonis Creed of the Rocky franchise, saying, "When you’re Jabba T...
Game Rant
Updated about 2 hours agoSims 4 fans now have a major reason to check out their local Five Below stores.
It's not easy to make an impactful ending, but these RPGs will leave you flabbergasted by how much their endings change everything.
Relive the memories of endless after-school gaming sessions with these iconic titles that dominated the late 2000s and 2010s.
A recent report suggests that Nvidia controls nearly the entirety of the graphics card on PC, capturing a large majority of the industry.
If you're having trouble with Equitable Distribution, then we've got you covered. This guide walks you through all 4 steps for Traxus's contract.