ios - 分析语音模式IOS

我正在寻找一种分析语音模式的方法/库。说，房间里有6个人。我想通过语音识别每个人。

任何提示都非常感谢。

德米特里

最佳答案

The task of taking a long contiguous audio recording and splitting it up in chunks in which only one speaker is speaking - without any prior knowledge about the voice characteristics of each speaker - is called "Speaker diarization". You can find links to research code on the wikipedia page.

If you have prior recordings of each voice, and would rather do classification, this is a slightly different problem (Speaker recognition or Speaker identification). Software tools for that are available here (note that general purposes speech recognition packages like Sphinx or HTK are flexible enough to be coaxed into doing that).

在这里回答https://dsp.stackexchange.com/questions/3119/library-to-differentiate-people-by-their-voice-timbre

关于ios - 分析语音模式IOS，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/11935734/

上一篇：python - jinja2.exceptions.UndefinedError: 'response'未定义

下一篇：iphone - 在哪里可以找到用于循环的高质量音频样本

相关文章：

node.js - 转码和流式音频 - 如何发送内容范围 header

java - 我们可以将文本转换为语音，反之亦然吗？

android - 如何在 Android 中隐藏 toast 消息 “Your audio will be sent to google to provide speech recognition service.”？

ios - 无法找到 UIApplicationShortcutIcon.init(联系人 : CNContact) for loading contact image in dynamic action

animation - 在 ffmpeg 中，我可以以帧而不是秒为单位指定时间吗？

matlab - 颤音函数matlab中的wavread错误

c# - 微软语音识别平台

iphone - 使用sendSynchronousRequest时在ios中获取http响应码

ios - 从使用 AVAssetImageGenerator 获得的视频或拇指获取方向

ios - UITableView viewForFooterInSection 仅在滚动后显示