Trending
Content tagged with "model-serving"
Hacker News
Top stories from the Hacker News community• Updated 9 minutes ago
about 17 hours ago
3 days ago
5 days ago
3 days ago
3 days ago
4 days ago
7 days ago
HN
Overclocking LLM Reasoning: Monitoring and Controlling LLM Thinking Path Lengths
royeisen.github.io
55
limoce
4 days ago
Top posts from tech subreddits• Updated 24 minutes ago
3 months ago
3 months ago
Reddit
New QAT-optimized int4 Gemma 3 models by Google, slash VRAM needs (54GB -> 14.1GB) while maintaining quality.
developers.googleblog.com
3 months ago
3 months ago
3 months ago
3 months ago
3 months ago
3 months ago
Hugging Face Trending
Popular models from Hugging Face• Updated 6 minutes ago
No models found
Try removing the tag filter or searching for different content.
GitHub Trending
Popular repositories from GitHub• Updated 20 minutes ago
pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
Python
34,696
4,951
onnxruntime
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
Python
7,972
822