Microsoft open-sources speech recognition model VibeVoice-ASR

Posted on 2026-1-23 23:07:06

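The model is 9B in size and supports Chinese. It recognizes timestamps, speakers, and spoken content at the same time, and can process up to 60 minutes of audio in a single pass.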

Model: https://huggingface.co/microsoft/VibeVoice-ASR
Online demo: https://dd66e23bd8ab778987.gradio.live
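For anyone wondering what "timestamps + speaker + content" could look like once it lands in your own code, here is a rough sketch. It does not call the model and it is not the model's actual output format: the `Segment` class, its field names, and the helper functions are all hypothetical, made up purely for illustration. The only number taken from the post is the 60-minute single-pass limit.

```python
from dataclasses import dataclass

# 60-minute single-pass limit, as stated in the post.
MAX_SINGLE_PASS_SECONDS = 60 * 60


@dataclass
class Segment:
    """Hypothetical transcript segment: when it was said, who said it, what was said."""
    start: float   # segment start, in seconds
    end: float     # segment end, in seconds
    speaker: str   # speaker label (assumed naming, e.g. "Speaker 1")
    text: str      # recognized content


def fits_single_pass(duration_seconds: float) -> bool:
    """Return True if the audio is short enough for one pass (at most 60 minutes)."""
    return 0 < duration_seconds <= MAX_SINGLE_PASS_SECONDS


def format_timestamp(seconds: float) -> str:
    """Format seconds as MM:SS, dropping the fractional part."""
    minutes, secs = divmod(int(seconds), 60)
    return f"{minutes:02d}:{secs:02d}"


def render(segments: list[Segment]) -> str:
    """Render segments in time order, one line per segment."""
    ordered = sorted(segments, key=lambda s: s.start)
    return "\n".join(
        f"[{format_timestamp(s.start)}-{format_timestamp(s.end)}] {s.speaker}: {s.text}"
        for s in ordered
    )


if __name__ == "__main__":
    demo = [
        Segment(12.0, 15.5, "Speaker 2", "Hi."),
        Segment(0.0, 11.2, "Speaker 1", "Hello."),
    ]
    print(fits_single_pass(45 * 60))
    print(render(demo))
```

If your recording is longer than 60 minutes, `fits_single_pass` returns False and you would need to split the audio yourself before transcribing; how best to do that is outside what the post covers.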
