Trending
Content tagged with "model-serving"
Hacker News
Top stories from the Hacker News community• Updated 10 minutes ago
InfoQ
Latest articles from InfoQ• Updated 4 minutes ago
Podcast: Platform Engineering for AI: Scaling Agents and MCP at LinkedIn
QCon AI New York Chair Wes Reisz talks with LinkedIn’s Karthik Ramgopal and Prince Valluri about enabling AI agents at enterprise scale. They discuss how platform teams orchestrate secure, multi-agentic systems, the role of MCP, the use of foreground and background agents, improving developer experience, and reducing toil. By Karthik Ramgopal, Prince Valluri
Replit Introduces New AI Integrations for Multi-Model Development
Replit has introduced Replit AI Integrations, a feature that lets users select third-party models directly inside the IDE and automatically generate the code needed to run inference. By Daniel Dominguez
Top posts from tech subreddits• Updated about 1 hour ago
Built a GGUF memory & tok/sec calculator for inference requirements – Drop in any HF GGUF URL
Only the real ones remember (he is still the contributor with the most likes for his models)
Hugging Face Trending
Popular models from Hugging Face• Updated 22 minutes ago
No models found
Try removing the tag filter or searching for different content.
GitHub Trending
Popular repositories from GitHub• Updated 36 minutes ago
onnxruntime
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
qlib
Qlib is an AI-oriented Quant investment platform that aims to use AI tech to empower Quant Research, from exploring ideas to implementing productions. Qlib supports diverse ML modeling paradigms, including supervised learning, market dynamics modeling, and RL, and is now equipped with https://github.com/microsoft/RD-Agent to automate R&D process.
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
amazon-sagemaker-examples
Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.
super-gradients
Easily train or fine-tune SOTA computer vision models with one open source training library. The home of Yolo-NAS.