Trending
Content tagged with "model-serving"
Hacker News
Top stories from the Hacker News community• Updated 6 minutes ago
Top posts from tech subreddits• Updated less than a minute ago
New QAT-optimized int4 Gemma 3 models by Google, slash VRAM needs (54GB -> 14.1GB) while maintaining quality.
Hugging Face Trending
Popular models from Hugging Face• Updated 19 minutes ago
No models found
Try removing the tag filter or searching for different content.
GitHub Trending
Popular repositories from GitHub• Updated 32 minutes ago
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
onnxruntime
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
amazon-sagemaker-examples
Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.
super-gradients
Easily train or fine-tune SOTA computer vision models with one open source training library. The home of Yolo-NAS.