Models
Datasets
Spaces
Docs
Enterprise
免费去水印
Log In
Sign Up

Yifan Peng's picture

Yifan Peng

pyf98

rizwanishaq's profile picture

gotomypc's profile picture

aben118's profile picture

·

https://pyf98.github.io

pyf98

AI & ML interests

Multimodal LLMs, Speech-to-Speech, Speech Recognition

Organizations

pyf98 's collections 1

Open Whisper-style Speech Models (OWSM)

Fully open Whisper-style speech foundation models developed by CMU WAVLab: https://www.wavlab.org/activities/2024/owsm/

Running on Zero

9

OWSM V4 Demo

🌍

9

This is a demo for OWSM-V4 CTC and medium model.
Runtime error

Featured

55

OWSM Demo

🔊

55
espnet/yodas_owsmv4

Viewer • Updated Sep 1, 2025 • 4 • 6.63k • 15
espnet/owsm_ctc_v4_1B

Automatic Speech Recognition • Updated Aug 30, 2025 • 2.05k • 5

Open Whisper-style Speech Models (OWSM)

Fully open Whisper-style speech foundation models developed by CMU WAVLab: https://www.wavlab.org/activities/2024/owsm/

Running on Zero

9

OWSM V4 Demo

🌍

9

This is a demo for OWSM-V4 CTC and medium model.
Runtime error

Featured

55

OWSM Demo

🔊

55
espnet/yodas_owsmv4

Viewer • Updated Sep 1, 2025 • 4 • 6.63k • 15
espnet/owsm_ctc_v4_1B

Automatic Speech Recognition • Updated Aug 30, 2025 • 2.05k • 5

Company

TOS Privacy About Careers

Website

Models Datasets 免费Z-image图片生成免费去水印 Vibevoice

🎉 Free Image Generator Now Available!

Totally Free + Zero Barriers + No Login Required