Trending

Content tagged with "nlp"

nlp

Hacker News

Top stories from the Hacker News community• Updated 8 minutes ago

16
24
smartmic
about 14 hours ago
HN

ImapGoose

whynothugo.nl

Reddit

Top posts from tech subreddits• Updated 38 minutes ago

Reddit

Good text-based widget AI chat bot?

reddit.com
2
4
BothCharge4278
22 days ago
16
9
Ok-Blueberry-1134
20 days ago

Hugging Face Trending

Popular models from Hugging Face• Updated 21 minutes ago

Ling-1T

Task: text-generation

neutts-air

Task: text-to-speech

PaddleOCR-VL

Task: image-text-to-text

Ring-1T

Task: text-generation

Qwen3-VL-8B-Instruct

Task: image-text-to-text

DeepSeek-V3.2-Exp

Task: text-generation

MinerU2.5-2509-1.2B

Task: image-text-to-text

kani-tts-370m

Task: text-to-speech

moondream3-preview

Task: text-generation

GitHub Trending

Popular repositories from GitHub• Updated 34 minutes ago

llama_index

LlamaIndex is the leading framework for building LLM-powered agents over your data.

FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

langchain

🦜🔗 Build context-aware reasoning applications

Jupyter Notebook
117,369
19,330

transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Megatron-LM

Ongoing research training transformer models at scale

openai-cookbook

Examples and guides for using the OpenAI API

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Python
10,365
2,784

unstructured

Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to learn more about our enterprise grade Platform product for production grade workflows, partitioning, enrichments, chunking and embedding.

presidio

An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.