Trending
Content tagged with "llm"
Hacker News
Top stories from the Hacker News community• Updated 11 minutes ago
InfoQ
Latest articles from InfoQ• Updated 11 minutes ago
Cactus v1: Cross-Platform LLM Inference on Mobile with Zero Latency and Full Privacy
Cactus, a Y Combinator-backed startup, enables local AI inference to mobile phones, wearables, and other low-power devices through cross-platform, energy-efficient kernels and a native runtime. It delivers sub-50ms time-to-first-token for on-device inference, eliminates network latency, and defaults to complete privacy. By Sergio De Simone
Presentation: Ecologies and Economics of Language AI in Practice
Jade Abbott discusses the shift from massive, resource-heavy models to "Little LMs" that prioritize efficiency and cultural sustainability. She explains how techniques like LoRA, quantization, and GRPO allow for high performance with less compute. By sharing the "Ubuntu Punk" philosophy, she shares how to move beyond extractive data practices toward human-centric, sustainable AI systems. By Jade Abbott
Target Improves Add to Cart Interactions by 11 Percent with Generative AI Recommendations
Target has deployed GRAM, a GenAI-powered accessory recommendation system for the Home category, using large language models to prioritize product attributes and capture aesthetic cohesion. The system helps shoppers find compatible accessories, integrates human-in-the-loop curation, and achieved measurable improvements in engagement and conversion. By Leela Kumili
Toad: A Unified CLI Tool for All Your LLMs That Promises Improved UX From Existing Ones
During his sabbatical, Will McGugan, maker of Rich and Textual( frameworks for making Textual User Interfaces (TUI)), put his UI skills to work to build Toad. The newly publicly released tool aims to provide a unified, “beautiful” GUI for multiple coding agents in your terminal, accessible via the same tool via the Agent Communication Protocol (ACP). By Olimpiu Pop
Meta Details GEM Ads Model Using LLM-Scale Training, Hybrid Parallelism, and Knowledge Transfer
Meta released details about its Generative Ads Model (GEM), a foundation model designed to improve ads recommendation across its platforms. The model addresses core challenges in recommendation systems (RecSys) by processing billions of daily user-ad interactions where meaningful signals such as clicks and conversions are very sparse. By Vinod Goje
IBM Research Introduces CUGA, an Open-Source Configurable Agent Framework on Hugging Face
IBM Research has released CUGA (Configurable Generalist Agent) on Hugging Face Spaces, making its enterprise-oriented agent framework easier to evaluate with open models and real workflows. The move positions CUGA as a practical alternative to brittle, tightly coupled agent frameworks that often struggle with tool misuse, long-horizon reasoning, and recovery from failure. By Robert Krzaczyński
Presentation: Lessons Learned From Shipping AI-Powered Healthcare Products
Clara Matos discusses the journey of shipping AI-powered healthcare products at Sword Health. She explains how to implement input/output guardrails for regulated industries and shares a framework for robust evaluations using human and LLM-based ratings. From prompt engineering to RAG and user feedback loops, she shares a data-driven roadmap for building reliable AI care agents at scale. By Clara Matos
Article: NextGen Search - Where AI Meets OpenSearch Through MCP
In this article, authors Srikanth Daggumalli and Arun Lakshmanan discuss next-generation context-aware conversational search using OpenSearch and AI agents powered by Large Language Models (LLMs) and Model Context Protocol (MCP). By Srikanth Daggumalli, Arun Lakshmanan
TornadoVM 2.0 Brings Automatic GPU Acceleration and LLM support to Java
The TornadoVM project recently reached version 2.0, a major milestone for the open-source project that aims to provide a heterogeneous hardware runtime for Java. The project automatically accelerates Java programs on multi-core CPUs, GPUs, and FPGAs. This release is likely to be of particular interest to teams developing LLM solutions on the JVM. By Ben Evans
Top posts from tech subreddits• Updated 11 minutes ago
New 1B parameter open-source coding model getting 76% on HumanEval [shameless but proud self-plug]
Eliminating LLM Hallucinations: A Methodology for AI Implementation in 100% Accuracy Business Scenarios
[D] Deep Learning/LLMs for Operations Research Problems in Production: Real-world Adoption?
Hugging Face Trending
Popular models from Hugging Face• Updated about 1 hour ago
GitHub Trending
Popular repositories from GitHub• Updated 8 minutes ago
pathway
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
khoj
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.
awesome-chatgpt-prompts
Share, discover, and collect prompts from the community. Free and open source — self-host for your organization with complete privacy.
llama_index
LlamaIndex is the leading framework for building LLM-powered agents over your data.
activepieces
AI Agents & MCPs & AI Workflow Automation • (~400 MCP servers for AI agents) • AI Automation / AI Agent with MCPs • AI Workflows & AI Agents • MCPs for AI Agents