Today’s developer landscape is a mix of surprising corporate shifts and a growing tension between massive AI investment and actual consumer demand. While the tools for building "agentic" workflows continue to mature, the industry is grappling with layoffs at major framework providers and a potential pivot in how we value hardware longevity.
Hacker News
Updated 11 minutes agoInfoQ
Updated 3 minutes agoTechCrunch
Updated about 10 hours agoCES 2026 is in full swing in Las Vegas, with the show floor open to the public after a packed couple of days occupied by press conferences from the likes of Nvidia, Sony, and AMD and previews from Sunday’s Unveiled event.
Cyera raised another $400 million just six months after its last huge round.
For the past two weeks, X has been flooded with AI-manipulated nude images, created by the Grok AI chatbot — and governments around the world are promising to take action.
Building software products has never been easier, so why are so many well-funded startups failing to take off no matter how good their product is? In this season finale episode of Build Mode, our guest has an answer: Startups have focused too much on product development and not enough on distribution excellence. Paul Irving is partner and […]
The infamous spyware maker released a new transparency report claiming to be a responsible spyware maker, without providing insight into how the company dealt with problematic customers in the past.
Towards Data Science
Updated about 10 hours agoUsing ACE to create self-improving LLM workflows and structured playbooks The post Beyond Prompting: The Power of Context Engineering appeared first on Towards Data Science.
Why Retrieval Helps in Time Series Forecasting We all know how it goes: Time-series data is tricky. Traditional forecasting models are unprepared for incidents like sudden market crashes, black swan events, or rare weather patterns. Even large fancy models like Chronos sometimes struggle because they haven’t dealt with that kind of pattern before. We can […] The post Retr
Apply the best methods from academia to get the most out of practical applications The post How to Improve the Performance of Visual Anomaly Detection Models appeared first on Towards Data Science.
PostgreSQL is fast. Whether your Python code can or should keep up depends on context. This article compares and benchmarks various insert strategies, focusing not on micro-benchmarks but on trade-offs between safety, abstraction, and throughput — and choosing the right tool for the job. The post Faster Is Not Always Better: Choosing the Right Postgre
How approximate vector search silently degrades Recall—and what to do about It The post HNSW at Scale: Why Your RAG System Gets Worse as the Vector Database Grows appeared first on Towards Data Science.
Anthropic News
Updated 11 days agoThe best model in the world for coding, agents, and computer use, with meaningful improvements to everyday tasks like slides and spreadsheets.
Claude Sonnet 4.5 sets new benchmark records in coding, reasoning, and computer use while being Anthropic's most aligned model.
Claude Haiku 4.5 matches state-of-the-art coding capabilities from months ago while delivering unprecedented speed and cost-efficiency for complex tasks.
Anthropic raised $13 billion in a Series F round at a $183 billion valuation to expand enterprise offerings, safety research, and international growth.
Anthropic's response to the White House AI Action Plan supports infrastructure and safety measures while calling for stronger export controls.
Windsurf News
Updated about 10 hours agoParallel agents, Git worktrees, multi-pane Cascade, dedicated terminal, and SWE-1.5 Free
GPT-5.2 is now live in Windsurf! Available for 0x credits for a limited time (paid and trial users). The version bump undersells the jump in intelligence: Biggest leap for GPT models in agentic coding since GPT-5, SOTA coding model at its price point, Default in Windsurf
The most capable model in Windsurf yet, now available at Sonnet prices for a limited time
GPT 5.1, GPT 5.1-Codex, and GPT-5.1-Codex Mini deliver a solid upgrade for agentic coding with variable thinking and improved steerability
SWE-1.5 is our latest frontier model, delivering near-SOTA coding performance at unprecedented speed.
Cursor
Updated 11 days agoWe're partnering with ecosystem vendors who have built hooks support with Cursor.
Graphite has entered into a definitive agreement to be acquired by Cursor.
Bringing design and engineering closer together.
Debug Mode helps you reproduce and fix the most tricky bugs.
How we updated our agent harness to support GPT-5.1-Codex-Max.
OpenAI News
Updated about 10 hours agoHow Netomi scales enterprise AI agents using GPT-4.1 and GPT-5.2—combining concurrency, governance, and multi-step reasoning for reliable production workflows.
OpenAI for Healthcare enables secure, enterprise-grade AI that supports HIPAA compliance—reducing administrative burden and supporting clinical workflows.
Tolan built a voice-first AI companion with GPT-5.1, combining low-latency responses, real-time context reconstruction, and memory-driven personalities for natural conversations.
ChatGPT Health is a dedicated experience that securely connects your health data and apps, with privacy protections and a physician-informed design.
Applications are now open for OpenAI Grove Cohort 2, a 5-week founder program designed for individuals at any stage, from pre-idea to product. Participants receive $50K in API credits, early access to AI tools, and hands-on mentorship from the OpenAI team.
Google DeepMind News
Updated about 10 hours agoGoogle 2025 recap: Research breakthroughs of the year
Gemini 3 Flash offers frontier intelligence built for speed at a fraction of the cost.
Open interpretability tools for language models are now available across the entire Gemma 3 family with the release of Gemma Scope 2.
Google DeepMind and UK AI Security Institute (AISI) strengthen collaboration on critical AI safety and security research
Anthropic Engineering
Updated 11 days agoAgents still face challenges working across many context windows. We looked to human engineers for inspiration in creating a more effective harness for long-running agents.
Introducing advanced tool use on the Claude Developer Platform
Code execution with MCP: Building more efficient agents
Beyond permission prompts: making Claude Code more secure and autonomous
Equipping agents for the real world with Agent Skills
Hugging Face Blog
Updated about 10 hours agoNvidia Blog
Updated about 10 hours agoNVIDIA will join the U.S. Department of Energy’s (DOE) Genesis Mission as a private industry partner to keep U.S. AI both the leader and the standard in technology around the world. The Genesis Mission, which is part of an Executive Order recently signed by President Trump, aims to redefine American leadership in AI across three Read Article </span
The NVIDIA RTX PRO 5000 72GB Blackwell GPU is now generally available, bringing robust agentic and generative AI capabilities powered by the NVIDIA Blackwell architecture to more desktops and professionals across the world.
Step out of the vault and into the future of gaming with Fallout: New Vegas streaming on GeForce NOW, just in time to celebrate the newest season of the hit Amazon TV show Fallout. To mark the occasion, GeForce NOW members can claim Fallout 3 and Fallout 4 as special rewards, completing a wasteland-ready trilogy Read Article
Physical AI is moving from research labs into the real world, powering intelligent robots and autonomous vehicles (AVs) — such as robotaxis — that must reliably sense, reason and act amid unpredictable conditions.
The Hao AI Lab research team at the University of California San Diego — at the forefront of pioneering AI model innovation — recently received an NVIDIA DGX B200 system to elevate their critical work in large language model inference. Many LLM inference platforms in production today, such as NVIDIA Dynamo, use research concepts that Read Article
arXiv
Updated about 10 hours agoWe prove tight lower bounds for online multicalibration, establishing an information-theoretic separation from marginal calibration. In the general setting where group functions can depend on both context and the learner's predictions, we prove an $Ω(T^{2/3})$ lower bound on expected multicalibration error using just three disjoint binary groups. This matches the upper bounds of Noarov et al. (2025) up to logarithmic factors and exceeds the $O(T^{2/3-\varepsilon})$ upper bound for marginal cal
As language models become increasingly capable, users expect them to provide not only accurate responses but also behaviors aligned with diverse human preferences across a variety of scenarios. To achieve this, Reinforcement learning (RL) pipelines have begun incorporating multiple rewards, each capturing a distinct preference, to guide models toward these desired behaviors. However, recent work has defaulted to apply Group Relative Policy Optimization (GRPO) under multi-reward setting without e
Large language models suffer from "hallucinations"-logical inconsistencies induced by semantic noise. We propose that current architectures operate in a "Metric Phase," where causal order is vulnerable to spontaneous symmetry breaking. Here, we identify robust inference as an effective Symmetry-Protected Topological phase, where logical operations are formally isomorphic to non-Abelian anyon braiding, replacing fragile geometric interpolation with robust topological invariants. Empirically, we d
We used machine learning and artificial intelligence: 1) to measure levels of peace in countries from news and social media and 2) to develop on-line tools that promote peace by helping users better understand their own media diet. For news media, we used neural networks to measure levels of peace from text embeddings of on-line news sources. The model, trained on one news media dataset also showed high accuracy when used to analyze a different news dataset. For social media, such as YouTube, we
I propose a novel framework that integrates stochastic differential equations (SDEs) with deep generative models to improve uncertainty quantification in machine learning applications involving structured and temporal data. This approach, termed Stochastic Latent Differential Inference (SLDI), embeds an Itô SDE in the latent space of a variational autoencoder, allowing for flexible, continuous-time modeling of uncertainty while preserving a principled mathematical foundation. The drift and diffu
Netflix TechBlog
Updated about 10 hours agoMicrosoft Research
Updated about 10 hours agoBy decoupling how agents work from how they’re trained, Agent Lightning turns each step an agent takes into data for reinforcement learning. This makes it easy for developers to improve agent performance with almost zero code changes. The post Agent Lightning: Adding reinforcement learning to AI agents without code rewrites appeared first on <a h
Promptions helps developers add dynamic, context-aware controls to chat interfaces so users can guide generative AI responses. It lets users shape outputs quickly without writing long instructions. The post Promptions helps make AI prompting more precise with dynamic UI controls appeared first on Microso
Using AI-generated virtual populations, Microsoft researchers uncovered hidden cellular patterns that could reshape how we understand and treat cancer. The post GigaTIME: Scaling tumor microenvironment modeling using virtual population generated by multimodal AI appeared first on <a href="https://www.microsoft.com/en-us/resea
As the Women in Machine Learning Workshop (WiML) marks its 20th annual gathering, cofounders, friends, and collaborators Jenn Wortman Vaughan and Hanna Wallach reflect on WiML’s evolution, navigating the field of ML, and their work in responsible AI. The post Ideas: Community building, machine learning, and the future of AI appeared first on <a href="https://ww
New research explores two ways to give AI agents stronger privacy safeguards grounded in contextual integrity. One adds lightweight, inference-time checks; the other builds contextual awareness directly into models through reasoning and RL. The post Reducing Privacy leaks in AI: Two approaches to contextual integrity appeared first on <a href="https://www.mi
The 2025 Typed Python Survey, conducted by contributors from JetBrains, Meta, and the broader Python typing community, offers a comprehensive look at the current state of Python’s type system and developer tooling. With 1,241 responses (a 15% increase from last year), the survey captures the evolving sentiment, challenges, and opportunities around Python typing in the [...] <a class="btn btn-secondary understrap-read-more-link" href="https://engineering.fb.com/2025/12/22/developer-tool
Incident investigation can be a daunting task in today’s digital landscape, where large-scale systems comprise numerous interconnected components and dependencies DrP is a root cause analysis (RCA) platform, designed by Meta, to programmatically automate the investigation process, significantly reducing the mean time to resolve (MTTR) for incidents and alleviating on-call toil Today, DrP is used [...] <a class="btn btn-secondary understrap-read-more-link" href="https://engineering.fb.c
We’re going behind the scenes of the Meta Ray-Ban Display, Meta’s most advanced AI glasses yet. In a previous episode we met the team behind the Meta Neural Band, the EMG wristband packaged with the Ray-Ban Display. Now we’re delving into the glasses themselves. Kenan and Emanuel, from Meta’s Wearables org, join Pascal Hartig on [...] <a class="btn btn-secondary understrap-read-more-link" href="https://engineering.fb.com/2025/12/17/virtual-reality/meta-ray-ban-display-from-zero-to-poli
Meta’s secure-by-default frameworks wrap potentially unsafe OS and third-party functions, making security the default while preserving developer speed and usability. These frameworks are designed to closely mirror existing APIs, rely on public and stable interfaces, and maximize developer adoption by minimizing friction and complexity. Generative AI and automation accelerate the adoption of secure frameworks at [...] <a class="btn btn-secondary understrap-read-more-link" href="https://
We’re introducing Zoomer, Meta’s comprehensive, automated debugging and optimization platform for AI. Zoomer works across all of our training and inference workloads at Meta and provides deep performance insights that enable energy savings, workflow acceleration, and efficiency gains in our AI infrastructure. Zoomer has delivered training time reductions, and significant QPS improvements, making it the [...] <a class="btn btn-secondary understrap-read-more-link" href="https://enginee
Pinterest Engineering
Updated about 10 hours agoSpotify Engineering
Updated about 10 hours agoThe technical and practical rationale for a clear separation between these domains. The post Why We Use Separate Tech Stacks for Personalization and Experimentation appeared first on Spotify Engineering.
The system we built to ensure our AI agents produce predictable, trustworthy code. The post Background Coding Agents: Predictable Results Through Strong Feedback Loops (Part 3) appeared first on Spotify Engineering.
We explore context engineering for background coding agents and what makes a good migration prompt. The post Background Coding Agents: Context Engineering (Part 2) appeared first on Spotify Engineering.
Shuffle has always been one of Spotify’s most-used features, and also one of the most misunderstood. For... The post Shuffle: Making Random Feel More Human appeared first on Spotify Engineering.
Thousands of merged AI-generated pull requests and the future of large-scale software maintenance. The post 1,500+ PRs Later: Spotify’s Journey with Our Background Coding Agent (Part 1) appeared first on Spotify Engineering.
The Airbnb Tech Blog
Updated about 10 hours agoMartin Fowler
Updated about 10 hours agoMy favorite albums from last year. Balkan brass, an acoustic favorite of 80s returns, Ethio-jazz, Guatemalan singer-guitarist, jazz-rock/Indian classical fusion, and a unique male vocalist. more…
Anthropic report on how their AI is changing their own software development practice. Most usage is for debugging and helping understand existing code Notable increase in using it for implementing new features Developers using it for 59% of their work and getting 50% productivity increase 14% of developers are “power users” reporting much greater gains</li
Gitanjali Venkatraman does wonderful illustrations of complex subjects (which is why I was so happy to work with her on our Expert Generalists article). She has now published the latest in her series of illustrated guides: tackling the complex topic of <a href="https://www.thoughtworks.com/content/dam/thoughtworks/documents/blog/mainframe_modernisation_illust
If you’re a regular reader of my site, you’ll have noticed that in the last few months I’ve been making a number of “fragments” posts. Such a post is a short post with a bunch of little, unconnected segments. These are usually a reference to something I’ve found on the web, sometimes a small thought of my own. A few years ago, I wouldn’t have covere
Why does AI write like… that (NYT, gift link). Sam Kriss delves into the quiet hum of AI writing. AI’s work is not compelling prose: it’s phantom text, ghostly scribblings, a spectre woven into our communal tapestry. ❄ ❄ ❄ ❄ ❄ <a href="https://coding-is-like-cooking.info/2025/12/test-desidera
Hugging Face Trending
Updated 39 minutes agoGitHub Trending
Updated about 1 hour agoAWS News Blog
Updated about 10 hours agoHappy New Year! I hope the holidays gave you time to recharge and spend time with your loved ones. Like every year, I took a few weeks off after AWS re:Invent to rest and plan ahead. I used some of that downtime to plan the next cohort for Become a Solutions Architect (BeSA). BeSA is […]
Can you believe it? We’re nearly at the end of 2025. And what a year it’s been! From re:Invent recap events, to AWS Summits, AWS Innovate, AWS re:Inforce, Community Days, and DevDays and, recently, adding that cherry on the cake, re:Invent 2025, we have lived through a year filled with exciting moments and technology advancements […]
The week after AWS re:Invent builds on the excitement and energy of the event and is a good time to learn more and understand how the recent announcements can help you solve your challenges and unlock new opportunities. As usual, we have you covered with our top announcements of AWS re:Invent 2025 that you can […]
Amazon Bedrock now supports reinforcement fine-tuning delivering 66% accuracy gains on average over base models.
Accelerate AI model development with new training features that enable rapid recovery from failures and automatic scaling based on resource availability.
Alibaba Cloud
Updated about 10 hours agoCloudflare
Updated about 10 hours agoThere has been speculation about the cause of a BGP anomaly observed in Venezuela on January 2. We take a look at BGP route leaks, and dive into what the data suggests caused the anomaly in question.
Physical data center maintenance is risky on a global network. We built a maintenance scheduler on Workers to safely plan disruptive operations, while solving scaling challenges by viewing the state of our infrastructure through a graph interface on top of multiple data sources and metrics pipelines.
We have declared “Code Orange: Fail Small” to focus everyone at Cloudflare on a set of high-priority workstreams with one simple goal: ensure that the cause of our last two global outages never happens again.
Cloudflare's H1 2025 Transparency Report is here. We discuss our principles on content blocking and our innovative approach to combating unauthorized streaming and copyright abuse.
Cloudflare’s R2 SQL, a distributed query engine, now supports aggregations. Explore how we built distributed GROUP BY execution, using scatter-gather and shuffling strategies to run analytics directly over your R2 Data Catalog.
IGN News
Updated about 4 hours agoKathryn Hahn (Agatha All Along, The Studio) is reportedly in talks to play Mother Gothel in Disney’s live-action Tangled remake.
Avowed is the latest Xbox Game Studios creation to head to PlayStation, developer Obsidian announced today.
Embark Studios says it ‘should do a lot more’ with trading in Arc Raiders, but some players are worried about how a potential in-game market might affect the experience.
Hey, fellow Undertale and Deltarune fan. Did you know there was a Deltarune ARG going on? Did you know it's technically been going on for three years now? Neither did I!
Embark Studios has outlined its plan to deal with Arc Raiders cheaters after they became a hot topic in the community earlier this week.
Game Rant
Updated about 4 hours agoUbisoft's Far Cry franchise contains some of the best games the company has ever developed. Here is every game in the series, ranked.
Unlock Mega Sceptile by reaching Rank S in Pokemon Legends Z-A's Season 5 Ranked Battles. This guide explains the process to obtain Sceptilite and evolve Sceptile.
The StarRupture roadmap has been shared as it begins its early access journey, with plenty of new content, features and QoL tweaks on the way.
Since it sucks to come back to your base and find it infected by aliens, here's how you can fix your infected base in StarRupture.
What a remarkable year for video games.