Trending

Content tagged with "nlp"

nlp

Hacker News

Top stories from the Hacker News community• Updated 3 minutes ago

Reddit

Top posts from tech subreddits• Updated about 1 hour ago

13
7
Academic_Sleep1118
30 days ago

Hugging Face Trending

Popular models from Hugging Face• Updated 31 minutes ago

PaddleOCR-VL

Task: image-text-to-text

Qwen3-VL-8B-Instruct

Task: image-text-to-text

Arch-Router-1.5B

Task: text-generation

Qwen3-VL-2B-Instruct

Task: image-text-to-text

neutts-air

Task: text-to-speech

olmOCR-2-7B-1025-FP8

Task: image-to-text

Bee-8B-RL

Task: image-text-to-text

DeepSeek-V3.2-Exp

Task: text-generation

Qwen3-VL-2B-Thinking

Task: image-text-to-text

GitHub Trending

Popular repositories from GitHub• Updated about 1 hour ago

silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

openai-cookbook

Examples and guides for using the OpenAI API

langchain

🦜🔗 Build context-aware reasoning applications

Jupyter Notebook
117,938
19,407

llama_index

LlamaIndex is the leading framework for building LLM-powered agents over your data.

Megatron-LM

Ongoing research training transformer models at scale

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Python
10,365
2,784

unstructured

Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to learn more about our enterprise grade Platform product for production grade workflows, partitioning, enrichments, chunking and embedding.