Trending

Content tagged with "transformers"

transformers

Hacker News

Top stories from the Hacker News community• Updated 13 minutes ago

Reddit

Top posts from tech subreddits• Updated 7 minutes ago

Hugging Face Trending

Popular models from Hugging Face• Updated 25 minutes ago

UserLM-8b

Task: text-generation

Qwen3-VL-8B-Instruct

Task: image-text-to-text

GLM-4.6

Task: text-generation

KORMo-10B-sft

Task: text-generation

Qwen3-VL-8B-Thinking

Task: image-text-to-text

Kumru-2B

Task: text-generation

GitHub Trending

Popular repositories from GitHub• Updated 39 minutes ago

flash-attention

Fast and memory-efficient exact attention

llama_index

LlamaIndex is the leading framework for building LLM-powered agents over your data.

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Megatron-LM

Ongoing research training transformer models at scale

unstructured

Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to learn more about our enterprise grade Platform product for production grade workflows, partitioning, enrichments, chunking and embedding.

pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more

whisper.cpp

Port of OpenAI's Whisper model in C/C++

yolov7

Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors