Trending
Content tagged with "transformers"
Hacker News
Top stories from the Hacker News community• Updated 13 minutes ago
Top posts from tech subreddits• Updated 4 minutes ago
[D] How to best fine-tune a T5 model for a Seq2Seq extraction task with a very small dataset?
Will we see: Phi-5, Granite 4, Gemma 4, Deepseek R2, Llama 5, Mistral Small 4, Flux 2, Whisper 4?
Hugging Face Trending
Popular models from Hugging Face• Updated 40 minutes ago
GitHub Trending
Popular repositories from GitHub• Updated about 1 hour ago
transformers
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
unstructured
Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to learn more about our enterprise grade Platform product for production grade workflows, partitioning, enrichments, chunking and embedding.
llama_index
LlamaIndex is the leading framework for building LLM-powered agents over your data.
pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
Promptify
Prompt Engineering | Prompt Versioning | Use GPT or other prompt based models to get structured output. Join our discord for Prompt-Engineering, LLMs and other latest research
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
optimum
🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimization tools