machine-learning - 语音识别方面有现有的研究可以区分不同人的声音吗？

标签 machine-learning data-mining speech-recognition voice-recognition

已关闭。这个问题是 off-topic 。目前不接受答案。

想要改进这个问题吗？ Update the question所以它是on-topic用于堆栈溢出。

已关闭10 年前。

Improve this question

我刚刚想到一个想法，我想开发一个应用程序来区分/自动检测不同人的声音。

示例用例:使用奥巴马和罗姆尼的数据进行训练后，应用程序将能够检测到任何一人再次讲话(不一定是训练数据中的相同内容)

我想知道是否有这方面的现有研究。 (我不知道如何搜索这个。我尝试了几个关键字，但没有得到明显的结果。)

如果没有，什么是开始的好方法？如何选择特征、数据表示、模型等

谢谢!

最佳答案

我找到了Speaker recognition维基百科上又链接到 An overview of text-independent speaker recognition: From features to supervectors (Kinnunen，李，2010)。

摘自论文摘要:

This paper gives an overview of automatic speaker recognition technology, with an emphasis on text-independent recognition. Speaker recognition has been studied actively for several decades. We give an overview of both the classical and the state-of-the-art methods.

关于machine-learning - 语音识别方面有现有的研究可以区分不同人的声音吗？，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/13244820/

上一篇：machine-learning - 有类似 MNIST 的数据集吗？

下一篇：machine-learning - HMM 如何用于手写识别？

python - 如何为 k 均值聚类选择初始质心

r - 计算在多项选择题中选择一个选项同时选择其他每个选项的调查回复的比例

r - 如何在 R 中正确绘制 ICE？

tensorflow - “tflite_convert”不被识别为内部或外部命令(在 Windows 中)

machine-learning - 具有缺失值和偏差的排名算法

python-2.7 - Raspberry Pi 2 的 Python 语音识别

Android - 系统音量在卸载之前不会取消静音

visual-studio-2015 - SpeechRecognizer 无法工作，COMException : Class not registered/UWP App Windows IoT (10. 0.10586) 和 Visual Studio 2015 Update 1

statistics - 文本上的逐点互信息