Trending

Content tagged with "rag", "model-serving"

Hacker News

Top stories from the Hacker News community • Updated 11 minutes ago

Reddit

Top posts from tech subreddits • Updated 5 minutes ago

Hugging Face Trending

Popular models from Hugging Face • Updated 24 minutes ago

LFM2-1.2B-RAG

Task: text-generation

jina-embeddings-v4

Task: visual-document-retrieval

GitHub Trending

Popular repositories from GitHub • Updated 38 minutes ago

pytorch-image-models

The largest collection of PyTorch image encoders / backbones, including train, eval, inference, and export scripts, and pretrained weights -- ResNet, ResNeXt, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more

mindsdb

Federated query engine for AI - The only MCP Server you'll ever need

langchain

🦜🔗 The platform for reliable agents.

Language: Jupyter Notebook • Stars: 119,014 • Forks: 19,603

qlib

Qlib is an AI-oriented Quant investment platform that aims to use AI tech to empower Quant Research, from exploring ideas to implementing them in production. Qlib supports diverse ML modeling paradigms, including supervised learning, market dynamics modeling, and RL, and is now equipped with https://github.com/microsoft/RD-Agent to automate the R&D process.

pathway

Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.

llama_index

LlamaIndex is the leading framework for building LLM-powered agents over your data.

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

onnxruntime

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

amazon-sagemaker-examples

Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.