我正在寻找一种分析语音模式的方法/库。说,房间里有6个人。我想通过语音识别每个人。
任何提示都非常感谢。
德米特里
最佳答案
The task of taking a long contiguous audio recording and splitting it up in chunks in which only one speaker is speaking - without any prior knowledge about the voice characteristics of each speaker - is called "Speaker diarization". You can find links to research code on the wikipedia page.
If you have prior recordings of each voice, and would rather do classification, this is a slightly different problem (Speaker recognition or Speaker identification). Software tools for that are available here (note that general purposes speech recognition packages like Sphinx or HTK are flexible enough to be coaxed into doing that).
在这里回答https://dsp.stackexchange.com/questions/3119/library-to-differentiate-people-by-their-voice-timbre
关于ios - 分析语音模式IOS,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/11935734/