Trending
Discover trending content
Hacker News
Top stories from the Hacker News community• Updated less than a minute ago
InfoQ
Latest articles from InfoQ• Updated 15 minutes ago
Cactus v1: Cross-Platform LLM Inference on Mobile with Zero Latency and Full Privacy
Cactus, a Y Combinator-backed startup, enables local AI inference to mobile phones, wearables, and other low-power devices through cross-platform, energy-efficient kernels and a native runtime. It delivers sub-50ms time-to-first-token for on-device inference, eliminates network latency, and defaults to complete privacy. By Sergio De Simone
Presentation: Ecologies and Economics of Language AI in Practice
Jade Abbott discusses the shift from massive, resource-heavy models to "Little LMs" that prioritize efficiency and cultural sustainability. She explains how techniques like LoRA, quantization, and GRPO allow for high performance with less compute. By sharing the "Ubuntu Punk" philosophy, she shares how to move beyond extractive data practices toward human-centric, sustainable AI systems. By Jade Abbott
Python Workers Redux: Wasm Snapshots and Native uv Tooling
Cloudflare's latest advancements in Python Workers revolutionize serverless performance with near-instant cold starts, expanded package compatibility, and streamlined workflows via the uv package manager. By leveraging memory snapshots and WebAssembly, Cloudflare drastically reduces startup times, making Python a prime choice for AI and data science applications. By Steef-Jan Wiggers
Nuxt Introduces Native Request Cancellation and Async Handler Extraction for Performance Gains
Nuxt 4.2 elevates the developer experience with native abort control for data fetching, improved error handling, and experimental TypeScript support. With a 39% reduction in bundle sizes and a streamlined app directory, this release enhances performance and project organization, positioning Nuxt as a leading choice for full-stack web applications built on Vue.js. By Daniel Curtis
OpenAI and Anthropic Donate AGENTS.md and Model Context Protocol to New Agentic AI Foundation
OpenAI and Anthropic have donated their AGENTS.md and Model Context Protocol projects to the Agentic AI Foundation (AAIF), a new directed fund under the Linux Foundation. Block contributed their agent framework, goose, as another founding project, and several other tech companies have joined as Platinum members. By Anthony Alford
Pinecone Introduces Dedicated Read Nodes in Public Preview for Predictable Vector Workloads
Pinecone recently announced the public preview of Dedicated Read Nodes (DRN), a new capacity mode for its vector database designed to deliver predictable performance and cost at scale for high-throughput applications such as billion-vector semantic search, recommendation systems, and mission-critical AI services. By Craig Risi
Article: Building Streaming Infrastructure That Scales: Because Viewers Won't Wait Until Tomorrow
In streaming, the challenge is immediate: customers are watching TV right now, not planning to watch it tomorrow. When systems fail during prime time, there is no recovery window; viewers leave and may not return. One and a half years ago, at ProSiebenSat.1 Media SE, we faced the challenge of scaling streaming applications for international users. By Daniele Frasca
Target Improves Add to Cart Interactions by 11 Percent with Generative AI Recommendations
Target has deployed GRAM, a GenAI-powered accessory recommendation system for the Home category, using large language models to prioritize product attributes and capture aesthetic cohesion. The system helps shoppers find compatible accessories, integrates human-in-the-loop curation, and achieved measurable improvements in engagement and conversion. By Leela Kumili
Presentation: DevOps Is for Product Engineers, Too
Lesley Cordero discusses platform engineering as a sociotechnical solution for scaling organizations. She explains the CALMS framework, the "pendulum of tension" between reliability and velocity, and how to transition from reactive to proactive leadership. By focusing on communal learning and distributed power, she shares how to build resilient systems without sacrificing human well-being. By Lesley Cordero
Top posts from tech subreddits• Updated 15 minutes ago
Samsung executives and employees indicted over leaking 10nm DRAM technology to China
New 1B parameter open-source coding model getting 76% on HumanEval [shameless but proud self-plug]
Hugging Face Trending
Popular models from Hugging Face• Updated about 1 hour ago
GitHub Trending
Popular repositories from GitHub• Updated 12 minutes ago
No repositories found
Try adjusting your search criteria.