Trending

Discover trending content

Hacker News

Top stories from the Hacker News community• Updated 13 minutes ago

InfoQ

Latest articles from InfoQ• Updated 4 minutes ago

InfoQ

Cactus v1: Cross-Platform LLM Inference on Mobile with Zero Latency and Full Privacy

Cactus, a Y Combinator-backed startup, enables local AI inference to mobile phones, wearables, and other low-power devices through cross-platform, energy-efficient kernels and a native runtime. It delivers sub-50ms time-to-first-token for on-device inference, eliminates network latency, and defaults to complete privacy. By Sergio De Simone

infoq.com
Sergio De Simone
about 2 hours ago
InfoQ

Presentation: Ecologies and Economics of Language AI in Practice

Jade Abbott discusses the shift from massive, resource-heavy models to "Little LMs" that prioritize efficiency and cultural sustainability. She explains how techniques like LoRA, quantization, and GRPO allow for high performance with less compute. By sharing the "Ubuntu Punk" philosophy, she shares how to move beyond extractive data practices toward human-centric, sustainable AI systems. By Jade Abbott

infoq.com
InfoQ

Python Workers Redux: Wasm Snapshots and Native uv Tooling

Cloudflare's latest advancements in Python Workers revolutionize serverless performance with near-instant cold starts, expanded package compatibility, and streamlined workflows via the uv package manager. By leveraging memory snapshots and WebAssembly, Cloudflare drastically reduces startup times, making Python a prime choice for AI and data science applications. By Steef-Jan Wiggers

infoq.com
Steef-Jan Wiggers
about 3 hours ago
InfoQ

Nuxt Introduces Native Request Cancellation and Async Handler Extraction for Performance Gains

Nuxt 4.2 elevates the developer experience with native abort control for data fetching, improved error handling, and experimental TypeScript support. With a 39% reduction in bundle sizes and a streamlined app directory, this release enhances performance and project organization, positioning Nuxt as a leading choice for full-stack web applications built on Vue.js. By Daniel Curtis

infoq.com
Daniel Curtis
about 21 hours ago
InfoQ

OpenAI and Anthropic Donate AGENTS.md and Model Context Protocol to New Agentic AI Foundation

OpenAI and Anthropic have donated their AGENTS.md and Model Context Protocol projects to the Agentic AI Foundation (AAIF), a new directed fund under the Linux Foundation. Block contributed their agent framework, goose, as another founding project, and several other tech companies have joined as Platinum members. By Anthony Alford

infoq.com
InfoQ

Pinecone Introduces Dedicated Read Nodes in Public Preview for Predictable Vector Workloads

Pinecone recently announced the public preview of Dedicated Read Nodes (DRN), a new capacity mode for its vector database designed to deliver predictable performance and cost at scale for high-throughput applications such as billion-vector semantic search, recommendation systems, and mission-critical AI services. By Craig Risi

infoq.com
InfoQ

Article: Building Streaming Infrastructure That Scales: Because Viewers Won't Wait Until Tomorrow

In streaming, the challenge is immediate: customers are watching TV right now, not planning to watch it tomorrow. When systems fail during prime time, there is no recovery window; viewers leave and may not return. One and a half years ago, at ProSiebenSat.1 Media SE, we faced the challenge of scaling streaming applications for international users. By Daniele Frasca

infoq.com
InfoQ

Target Improves Add to Cart Interactions by 11 Percent with Generative AI Recommendations

Target has deployed GRAM, a GenAI-powered accessory recommendation system for the Home category, using large language models to prioritize product attributes and capture aesthetic cohesion. The system helps shoppers find compatible accessories, integrates human-in-the-loop curation, and achieved measurable improvements in engagement and conversion. By Leela Kumili

infoq.com
InfoQ

Presentation: DevOps Is for Product Engineers, Too

Lesley Cordero discusses platform engineering as a sociotechnical solution for scaling organizations. She explains the CALMS framework, the "pendulum of tension" between reliability and velocity, and how to transition from reactive to proactive leadership. By focusing on communal learning and distributed power, she shares how to build resilient systems without sacrificing human well-being. By Lesley Cordero

infoq.com

Reddit

Top posts from tech subreddits• Updated 4 minutes ago

377
40
thehashimwarren
about 17 hours ago

Hugging Face Trending

Popular models from Hugging Face• Updated 40 minutes ago

GLM-4.7

Task: text-generation

Z-Image-Turbo

Task: text-to-image

functiongemma-270m-it

Task: text-generation

GitHub Trending

Popular repositories from GitHub• Updated about 1 hour ago

No repositories found

Try adjusting your search criteria.