CN105989838A - 语音识别方法及装置 - Google Patents
语音识别方法及装置 Download PDFInfo
- Publication number
- CN105989838A CN105989838A CN201510051345.4A CN201510051345A CN105989838A CN 105989838 A CN105989838 A CN 105989838A CN 201510051345 A CN201510051345 A CN 201510051345A CN 105989838 A CN105989838 A CN 105989838A
- Authority
- CN
- China
- Prior art keywords
- data
- mfcc
- voice
- input audio
- eigenmatrix
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 38
- 230000009467 reduction Effects 0.000 claims abstract description 17
- 239000011159 matrix material Substances 0.000 claims description 50
- 238000000605 extraction Methods 0.000 claims description 19
- 230000008569 process Effects 0.000 claims description 7
- 230000035945 sensitivity Effects 0.000 claims description 6
- 230000005236 sound signal Effects 0.000 claims description 6
- 230000003595 spectral effect Effects 0.000 claims description 6
- 230000002085 persistent effect Effects 0.000 claims description 3
- 238000012512 characterization method Methods 0.000 claims 1
- 238000007634 remodeling Methods 0.000 abstract 2
- 230000006870 function Effects 0.000 description 7
- 238000001514 detection method Methods 0.000 description 4
- 235000013399 edible fruits Nutrition 0.000 description 4
- 230000008901 benefit Effects 0.000 description 3
- 230000007613 environmental effect Effects 0.000 description 3
- HUTDUHSNJYTCAR-UHFFFAOYSA-N ancymidol Chemical compound C1=CC(OC)=CC=C1C(O)(C=1C=NC=NC=1)C1CC1 HUTDUHSNJYTCAR-UHFFFAOYSA-N 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- 239000000203 mixture Substances 0.000 description 2
- 230000004913 activation Effects 0.000 description 1
- 239000012141 concentrate Substances 0.000 description 1
- 230000007423 decrease Effects 0.000 description 1
- 230000010365 information processing Effects 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
Landscapes
- Electrically Operated Instructional Devices (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Description
Claims (11)
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910827387.0A CN110895929B (zh) | 2015-01-30 | 2015-01-30 | 语音识别方法及装置 |
CN201510051345.4A CN105989838B (zh) | 2015-01-30 | 2015-01-30 | 语音识别方法及装置 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510051345.4A CN105989838B (zh) | 2015-01-30 | 2015-01-30 | 语音识别方法及装置 |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910827387.0A Division CN110895929B (zh) | 2015-01-30 | 2015-01-30 | 语音识别方法及装置 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN105989838A true CN105989838A (zh) | 2016-10-05 |
CN105989838B CN105989838B (zh) | 2019-09-06 |
Family
ID=57037166
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910827387.0A Active CN110895929B (zh) | 2015-01-30 | 2015-01-30 | 语音识别方法及装置 |
CN201510051345.4A Active CN105989838B (zh) | 2015-01-30 | 2015-01-30 | 语音识别方法及装置 |
Family Applications Before (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910827387.0A Active CN110895929B (zh) | 2015-01-30 | 2015-01-30 | 语音识别方法及装置 |
Country Status (1)
Country | Link |
---|---|
CN (2) | CN110895929B (zh) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116913258A (zh) * | 2023-09-08 | 2023-10-20 | 鹿客科技(北京)股份有限公司 | 语音信号识别方法、装置、电子设备和计算机可读介质 |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20040122667A1 (en) * | 2002-12-24 | 2004-06-24 | Mi-Suk Lee | Voice activity detector and voice activity detection method using complex laplacian model |
CN101308653A (zh) * | 2008-07-17 | 2008-11-19 | 安徽科大讯飞信息科技股份有限公司 | 一种应用于语音识别系统的端点检测方法 |
CN101593522A (zh) * | 2009-07-08 | 2009-12-02 | 清华大学 | 一种全频域数字助听方法和设备 |
CN103035244A (zh) * | 2012-11-24 | 2013-04-10 | 安徽科大讯飞信息科技股份有限公司 | 一种可实时反馈用户朗读进度的语音跟踪方法 |
CN103065631A (zh) * | 2013-01-24 | 2013-04-24 | 华为终端有限公司 | 一种语音识别的方法、装置 |
Family Cites Families (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1162838C (zh) * | 2002-07-12 | 2004-08-18 | 清华大学 | 抗噪声语音识别用语音增强-特征加权-对数谱相加方法 |
EP1473964A3 (en) * | 2003-05-02 | 2006-08-09 | Samsung Electronics Co., Ltd. | Microphone array, method to process signals from this microphone array and speech recognition method and system using the same |
GB0426347D0 (en) * | 2004-12-01 | 2005-01-05 | Ibm | Methods, apparatus and computer programs for automatic speech recognition |
JP2007114413A (ja) * | 2005-10-19 | 2007-05-10 | Toshiba Corp | 音声非音声判別装置、音声区間検出装置、音声非音声判別方法、音声区間検出方法、音声非音声判別プログラムおよび音声区間検出プログラム |
JP5505896B2 (ja) * | 2008-02-29 | 2014-05-28 | インターナショナル・ビジネス・マシーンズ・コーポレーション | 発話区間検出システム、方法及びプログラム |
CN103065627B (zh) * | 2012-12-17 | 2015-07-29 | 中南大学 | 基于dtw与hmm证据融合的特种车鸣笛声识别方法 |
CN103544963B (zh) * | 2013-11-07 | 2016-09-07 | 东南大学 | 一种基于核半监督判别分析的语音情感识别方法 |
-
2015
- 2015-01-30 CN CN201910827387.0A patent/CN110895929B/zh active Active
- 2015-01-30 CN CN201510051345.4A patent/CN105989838B/zh active Active
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20040122667A1 (en) * | 2002-12-24 | 2004-06-24 | Mi-Suk Lee | Voice activity detector and voice activity detection method using complex laplacian model |
CN101308653A (zh) * | 2008-07-17 | 2008-11-19 | 安徽科大讯飞信息科技股份有限公司 | 一种应用于语音识别系统的端点检测方法 |
CN101593522A (zh) * | 2009-07-08 | 2009-12-02 | 清华大学 | 一种全频域数字助听方法和设备 |
CN103035244A (zh) * | 2012-11-24 | 2013-04-10 | 安徽科大讯飞信息科技股份有限公司 | 一种可实时反馈用户朗读进度的语音跟踪方法 |
CN103065631A (zh) * | 2013-01-24 | 2013-04-24 | 华为终端有限公司 | 一种语音识别的方法、装置 |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116913258A (zh) * | 2023-09-08 | 2023-10-20 | 鹿客科技(北京)股份有限公司 | 语音信号识别方法、装置、电子设备和计算机可读介质 |
CN116913258B (zh) * | 2023-09-08 | 2023-11-24 | 鹿客科技(北京)股份有限公司 | 语音信号识别方法、装置、电子设备和计算机可读介质 |
Also Published As
Publication number | Publication date |
---|---|
CN105989838B (zh) | 2019-09-06 |
CN110895929B (zh) | 2022-08-12 |
CN110895929A (zh) | 2020-03-20 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110853617B (zh) | 一种模型训练的方法、语种识别的方法、装置及设备 | |
CN111210021B (zh) | 一种音频信号处理方法、模型训练方法以及相关装置 | |
JP6099556B2 (ja) | 音声識別方法および装置 | |
CN110853618A (zh) | 一种语种识别的方法、模型训练的方法、装置及设备 | |
CN110503969A (zh) | 一种音频数据处理方法、装置及存储介质 | |
CN110428842A (zh) | 语音模型训练方法、装置、设备及计算机可读存储介质 | |
CN110634474B (zh) | 一种基于人工智能的语音识别方法和装置 | |
CN108172230A (zh) | 基于声纹识别模型的声纹注册方法、终端装置及存储介质 | |
CN110544468B (zh) | 应用唤醒方法、装置、存储介质及电子设备 | |
CN106033669B (zh) | 语音识别方法及装置 | |
CN106341539A (zh) | 恶意来电者声纹的自动取证方法、装置和移动终端 | |
CN112562723B (zh) | 发音准确度确定方法、装置、存储介质和电子设备 | |
CN107622773A (zh) | 一种音频特征提取方法与装置、电子设备 | |
CN110287311A (zh) | 文本分类方法及装置、存储介质、计算机设备 | |
CN110580897B (zh) | 音频校验方法、装置、存储介质及电子设备 | |
Sadjadi et al. | Nearest neighbor discriminant analysis for language recognition | |
US20210264939A1 (en) | Attribute identifying device, attribute identifying method, and program storage medium | |
CN106024017A (zh) | 语音检测方法及装置 | |
Shivakumar et al. | Simplified and supervised i-vector modeling for speaker age regression | |
CN106710588B (zh) | 语音数据句类识别方法和装置及系统 | |
CN110728993A (zh) | 一种变声识别方法及电子设备 | |
CN106297795B (zh) | 语音识别方法及装置 | |
CN102237089A (zh) | 一种减少文本无关说话人识别系统误识率的方法 | |
CN106816157A (zh) | 语音识别方法及装置 | |
CN113064118A (zh) | 声源定位方法和装置 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20200529 Address after: 8-07, building 6, ronghuiyuan, airport economic core area, Shunyi District, Beijing Patentee after: Xin Xin finance leasing (Beijing) Co.,Ltd. Address before: Zuchongzhi road in Pudong Zhangjiang hi tech park Shanghai 201203 Lane 2288 Pudong New Area Spreadtrum Center Building 1 Patentee before: SPREADTRUM COMMUNICATIONS (SHANGHAI) Co.,Ltd. |
|
TR01 | Transfer of patent right |
Effective date of registration: 20201126 Address after: Room 2502, COFCO Plaza, 990 Nanma Road, Nankai District, Tianjin Patentee after: Xin Xin finance leasing (Tianjin) Co.,Ltd. Address before: 8-07, building 6, ronghuiyuan, airport economic core area, Shunyi District, Beijing Patentee before: Xin Xin finance leasing (Beijing) Co.,Ltd. |
|
TR01 | Transfer of patent right | ||
EE01 | Entry into force of recordation of patent licensing contract |
Application publication date: 20161005 Assignee: SPREADTRUM COMMUNICATIONS (SHANGHAI) Co.,Ltd. Assignor: Xin Xin finance leasing (Tianjin) Co.,Ltd. Contract record no.: X2021110000055 Denomination of invention: Speech recognition method and device Granted publication date: 20190906 License type: Exclusive License Record date: 20211227 |
|
EE01 | Entry into force of recordation of patent licensing contract | ||
TR01 | Transfer of patent right |
Effective date of registration: 20230711 Address after: 201203 Shanghai city Zuchongzhi road Pudong New Area Zhangjiang hi tech park, Spreadtrum Center Building 1, Lane 2288 Patentee after: SPREADTRUM COMMUNICATIONS (SHANGHAI) Co.,Ltd. Address before: Room 2502, COFCO Plaza, 990 Nanma Road, Nankai District, Tianjin 300100 Patentee before: Xin Xin finance leasing (Tianjin) Co.,Ltd. |
|
TR01 | Transfer of patent right |