CN105580071B - Method and apparatus for training a voice recognition model database - Google Patents
Method and apparatus for training a voice recognition model database
- Publication number
- CN105580071B
- Authority
- CN
- China
- Prior art keywords
- noise
- data
- user
- specific
- voice data
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/20—Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/063—Training
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Artificial Intelligence (AREA)
- User Interface Of Digital Computer (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Applications Claiming Priority (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201361819985P | 2013-05-06 | 2013-05-06 | |
US61/819,985 | 2013-05-06 | ||
US14/094,875 US9275638B2 (en) | 2013-03-12 | 2013-12-03 | Method and apparatus for training a voice recognition model database |
US14/094,875 | 2013-12-03 | ||
PCT/US2014/035117 WO2014182453A2 | 2013-05-06 | 2014-04-23 | Method and apparatus for training a voice recognition model database |
Publications (2)
Publication Number | Publication Date |
---|---|
CN105580071A (zh) | 2016-05-11 |
CN105580071B (zh) | 2020-08-21 |
Family
ID=51867838
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201480025758.9A Active CN105580071B (zh) | 2013-05-06 | 2014-04-23 | Method and apparatus for training a voice recognition model database |
Country Status (3)
Country | Link |
---|---|
EP (1) | EP2994907A2 (fr) |
CN (1) | CN105580071B (fr) |
WO (1) | WO2014182453A2 (fr) |
Families Citing this family (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
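CN110232909A (zh) * | 2018-03-02 | 2019-09-13 | 北京搜狗科技发展有限公司 | Audio processing method, apparatus, device, and readable storage medium |
CN109192216A (zh) * | 2018-08-08 | 2019-01-11 | 联智科技(天津)有限责任公司 | Method and apparatus for obtaining a training data set for voiceprint recognition by simulation |
KR20200033707A (ko) * | 2018-09-20 | 2020-03-30 | 삼성전자주식회사 | Electronic device, and method for providing or obtaining learning data thereof |
CN109545196B (zh) * | 2018-12-29 | 2022-11-29 | 深圳市科迈爱康科技有限公司 | Speech recognition method, apparatus, and computer-readable storage medium |
CN109545195B (zh) * | 2018-12-29 | 2023-02-21 | 深圳市科迈爱康科技有限公司 | Companion robot and control method thereof |
CN110544469B (zh) * | 2019-09-04 | 2022-04-19 | 秒针信息技术有限公司 | Training method and apparatus for a speech recognition model, storage medium, and electronic device |
CN110808030B (zh) * | 2019-11-22 | 2021-01-22 | 珠海格力电器股份有限公司 | Voice wake-up method, system, storage medium, and electronic device |
CN111128141B (zh) * | 2019-12-31 | 2022-04-19 | 思必驰科技股份有限公司 | Audio recognition decoding method and apparatus |
CN111369979B (zh) * | 2020-02-26 | 2023-12-19 | 广州市百果园信息技术有限公司 | Training sample acquisition method, apparatus, device, and computer storage medium |
CN113099353A (zh) * | 2021-04-21 | 2021-07-09 | 浙江吉利控股集团有限公司 | Integrated microphone for a vehicle, seat belt, steering wheel, and vehicle |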
Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1331467A (zh) * | 2000-06-28 | 2002-01-16 | 松下电器产业株式会社 | Method and apparatus for generating an acoustic model |
CN1451152A (zh) * | 2000-09-01 | 2003-10-22 | 捷装技术公司 | Computer-implemented speech recognition system training |
US20050071159A1 (en) * | 2003-09-26 | 2005-03-31 | Robert Boman | Speech recognizer performance in car and home applications utilizing novel multiple microphone configurations |
CN101023467A (zh) * | 2005-01-04 | 2007-08-22 | 三菱电机株式会社 | Method for refining a training data set for an audio classifier and method for classifying data |
US20080300871A1 (en) * | 2007-05-29 | 2008-12-04 | At&T Corp. | Method and apparatus for identifying acoustic background environments to enhance automatic speech recognition |
CN102426837A (zh) * | 2011-12-30 | 2012-04-25 | 中国农业科学院农业信息研究所 | Robustness method for speech recognition on mobile devices for agricultural field data collection |
CN102903360A (zh) * | 2011-07-26 | 2013-01-30 | 财团法人工业技术研究院 | Microphone-array-based speech recognition system and method |
CN103069480A (zh) * | 2010-06-14 | 2013-04-24 | 谷歌公司 | Speech models and noise models for speech recognition |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6876966B1 (en) * | 2000-10-16 | 2005-04-05 | Microsoft Corporation | Pattern recognition training method and apparatus using inserted noise followed by noise reduction |
2014
- 2014-04-23 EP EP14725344.7A (EP2994907A2): not active, Withdrawn
- 2014-04-23 WO PCT/US2014/035117 (WO2014182453A2): Application Filing
- 2014-04-23 CN CN201480025758.9A (CN105580071B): Active
Also Published As
Publication number | Publication date |
---|---|
CN105580071A (zh) | 2016-05-11 |
EP2994907A2 (fr) | 2016-03-16 |
WO2014182453A3 (fr) | 2014-12-31 |
WO2014182453A2 (fr) | 2014-11-13 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US9275638B2 (en) | | Method and apparatus for training a voice recognition model database |
CN105580071B (zh) | | Method and apparatus for training a voice recognition model database |
US11676581B2 (en) | | Method and apparatus for evaluating trigger phrase enrollment |
US11557310B2 (en) | | Voice trigger for a digital assistant |
CN110288987B (zh) | | System for processing sound data and method for controlling the system |
CN106201424B (zh) | | Information interaction method, apparatus, and electronic device |
CN106971723B (zh) | | Speech processing method and apparatus, and apparatus for speech processing |
CN112074900B (zh) | | Audio analysis for natural language processing |
US9542947B2 (en) | | Method and apparatus including parallell processes for voice recognition |
CN106796785B (zh) | | Sound sample verification for generating a sound detection model |
JP2019117623A (ja) | | Voice interaction method, apparatus, device, and storage medium |
JP6844608B2 (ja) | | Speech processing device and speech processing method |
US20140278392A1 (en) | | Method and Apparatus for Pre-Processing Audio Signals |
KR20160106075A (ko) | | Method and device for identifying a piece of music in an audio stream |
US11373656B2 (en) | | Speech processing method and apparatus therefor |
WO2020202862A1 (fr) | | Response generation device and response generation method |
CN112906369A (zh) | | Method and apparatus for generating a lyrics file |
US20210110838A1 (en) | | Acoustic aware voice user interface |
CN110992928A (zh) | | Audio processing method and terminal device |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||