Trending
Content tagged with "model-serving"
Hacker News
Top stories from the Hacker News community• Updated 13 minutes ago
about 16 hours ago
3 days ago
5 days ago
3 days ago
3 days ago
4 days ago
7 days ago
HN
Overclocking LLM Reasoning: Monitoring and Controlling LLM Thinking Path Lengths
royeisen.github.io
55
limoce
4 days ago
Top posts from tech subreddits• Updated 7 minutes ago
3 months ago
3 months ago
Reddit
New QAT-optimized int4 Gemma 3 models by Google, slash VRAM needs (54GB -> 14.1GB) while maintaining quality.
developers.googleblog.com
3 months ago
3 months ago
3 months ago
3 months ago
3 months ago
3 months ago
Hugging Face Trending
Popular models from Hugging Face• Updated 25 minutes ago
No models found
Try removing the tag filter or searching for different content.
GitHub Trending
Popular repositories from GitHub• Updated 39 minutes ago
pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
Python
34,696
4,951
onnxruntime
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
Python
7,972
822