Trending

Content tagged with "model-serving"

Show all

Hacker News

Top stories from the Hacker News community• Updated 6 minutes ago

HN

Your data model is your destiny

notes.mtb.xyz

368

hunglee2

4 days ago

data-engineering system-design data-science data-modeling big-data model-serving mlops

HN

Gemini 3.0 spotted in the wild through A/B testing

ricklamers.io

399

ricklamers

2 days ago

ai mlops model-serving self-hosted generative-ai

HN

New Coding Models and Integrations

ollama.com

201

meetpateltech

2 days ago

model-serving developer-tools ai mlops self-hosted automation open-source

HN

TaxCalcBench: Evaluating Frontier Models on the Tax Calculation Task

arxiv.org

58

handfuloflight

2 days ago

data-science mlops model-serving transformers llm ai nlp generative-ai

HN

Intel Announces Inference-Optimized Xe3P Graphics Card with 160GB VRAM

phoronix.com

142

wrigby

4 days ago

ai mlops deep-learning model-serving hardware gpu

HN

MAML – a new configuration language (similar to JSON, YAML, and TOML)

maml.dev

98

birdculture

6 days ago

system-design architecture configuration devops yaml configuration-management json ai mlops model-serving automation

HN

4x faster LLM inference (Flash Attention guy's company)

together.ai

192

alecco

6 days ago

llm transformers ai model-serving

HN

Ohno Type School

ohnotype.co

191

tobr

12 days ago

ai mlops ai-ethics ai-research deep-learning model-serving

HN

Representation Engineering

vgel.me

32

kqr

12 days ago

model-serving ai representation-learning deep-learning data-engineering mlops embeddings

Reddit

Top posts from tech subreddits• Updated less than a minute ago

Reddit

🚀 Run LightRAG on a Bare Metal Server in Minutes (Fully Automated)

reddit.com

28

aospan

6 months ago

r/LocalLLaMA self-hosting model-serving mlops

Reddit

Speciality of each model

reddit.com

3

Red_Pudding_pie

6 months ago

r/AI_Agents ai model-serving mlops

Reddit

[D][Discussion] - Model Context Protocol - Exhaustively Explained

reddit.com

0

shreesrinivasan

6 months ago

r/MachineLearning ai llm model-serving mlops

Reddit

New QAT-optimized int4 Gemma 3 models by Google, slash VRAM needs (54GB -> 14.1GB) while maintaining quality.

developers.googleblog.com

203

Sea_Sympathy_495

6 months ago

r/LocalLLaMA ai model-serving google deep-learning

Reddit

[P] I'm not understanding how to select the best model

reddit.com

1

No-Discipline-2354

6 months ago

r/MachineLearning ai model-serving best-practices mlops

Reddit

MCP Explained in 3 Minutes: Model Context Protocol for AI & Tools

youtu.be

2

db191997

6 months ago

r/coding ai model-serving mlops

Reddit

We GRPO-ed a Model to Keep Retrying 'Search' Until It Found What It Needed

v.redd.it

188

Kooky-Somewhere-2883

6 months ago

r/LocalLLaMA ai search vector-search model-serving mlops

Reddit

how to make multiple models all work together for me

reddit.com

2

CommissionTricky9408

6 months ago

r/AI_Agents ai model-serving mlops

Reddit

DeepSeek is about to open-source their inference engine

i.redd.it

1410

Dr_Karminski

6 months ago

r/LocalLLaMA model-serving deep-learning open-source

Hugging Face Trending

Popular models from Hugging Face• Updated 19 minutes ago

No models found

Try removing the tag filter or searching for different content.

GitHub Trending

Popular repositories from GitHub• Updated 32 minutes ago

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python

40,419

4,583

deep-learning mlops python model-serving transformers

onnxruntime

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

C++

18,103

3,499

mlops onnx c++onnxruntime deep-learning tensorflow pytorch python ai model-serving

pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more

Python

35,437

5,031

pytorch deep-learning computer-vision transformers model-serving

amazon-sagemaker-examples

Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.

Jupyter Notebook

10,635

6,915

aws mlops deep-learning jupyter-notebook amazon-sagemaker notebooks model-serving

super-gradients

Easily train or fine-tune SOTA computer vision models with one open source training library. The home of Yolo-NAS.

Jupyter Notebook

4,833

550

computer-vision deep-learning open-source mlops ai model-serving

modelscope

ModelScope: bring the notion of Model-as-a-Service to life.

Python

7,972

822

mlops model-serving python