Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • 免费去水印

  • Log In
  • Sign Up
blanchefort 's Collections
Medical
VLA models
Audio
Translate
OCR
OmniModels
Edge models
Video encoders
Judge
Datasets for Embodied
Ru text encoders
Text2Image
VLMs

Audio

updated 18 days ago
Upvote
-

  • nvidia/audio-flamingo-3-hf

    Audio-Text-to-Text • Updated Jan 27 • 176k • 173

  • facebook/sam-audio-large

    Updated Dec 30, 2025 • 29.2k • 372

  • google/medasr

    Automatic Speech Recognition • Updated Jan 26 • 37.2k • 288

  • FunAudioLLM/Fun-CosyVoice3-0.5B-2512

    Text-to-Speech • Updated 27 days ago • 6.42k • 466

  • facebook/sam-audio-large-tv

    Updated Dec 30, 2025 • 748 • 24

  • Qwen/Qwen3-TTS-12Hz-0.6B-Base

    Text-to-Speech • Updated Jan 29 • 252k • 180

  • MTUCI/spectra_0

    Audio Classification • Updated 18 days ago • 6
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets 免费Z-image图片生成 免费去水印 Vibevoice

🎉 Free Image Generator Now Available!

Totally Free + Zero Barriers + No Login Required