CN105580071B - 用于训练声音识别模型数据库的方法和装置 - Google Patents

用于训练声音识别模型数据库的方法和装置 Download PDF

Info

Publication number
CN105580071B
CN105580071B CN201480025758.9A CN201480025758A CN105580071B CN 105580071 B CN105580071 B CN 105580071B CN 201480025758 A CN201480025758 A CN 201480025758A CN 105580071 B CN105580071 B CN 105580071B
Authority
CN
China
Prior art keywords
noise
data
user
specific
voice data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201480025758.9A
Other languages
English (en)
Chinese (zh)
Other versions
CN105580071A (zh
Inventor
约翰·R·梅洛尼
约耳·A·克拉克
约瑟夫·C·德怀尔
阿德里安·舒斯特
斯内海特哈·辛加拉朱
罗伯特·A·茹雷克
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Google Technology Holdings LLC
Original Assignee
Google Technology Holdings LLC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US14/094,875 external-priority patent/US9275638B2/en
Application filed by Google Technology Holdings LLC filed Critical Google Technology Holdings LLC
Publication of CN105580071A publication Critical patent/CN105580071A/zh
Application granted granted Critical
Publication of CN105580071B publication Critical patent/CN105580071B/zh
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/20Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/063Training

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Artificial Intelligence (AREA)
  • User Interface Of Digital Computer (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
CN201480025758.9A 2013-05-06 2014-04-23 用于训练声音识别模型数据库的方法和装置 Active CN105580071B (zh)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US201361819985P 2013-05-06 2013-05-06
US61/819,985 2013-05-06
US14/094,875 US9275638B2 (en) 2013-03-12 2013-12-03 Method and apparatus for training a voice recognition model database
US14/094,875 2013-12-03
PCT/US2014/035117 WO2014182453A2 (fr) 2013-05-06 2014-04-23 Procédé et appareil d'apprentissage d'une base de données de modèles de reconnaissance vocale

Publications (2)

Publication Number Publication Date
CN105580071A CN105580071A (zh) 2016-05-11
CN105580071B true CN105580071B (zh) 2020-08-21

Family

ID=51867838

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201480025758.9A Active CN105580071B (zh) 2013-05-06 2014-04-23 用于训练声音识别模型数据库的方法和装置

Country Status (3)

Country Link
EP (1) EP2994907A2 (fr)
CN (1) CN105580071B (fr)
WO (1) WO2014182453A2 (fr)

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110232909A (zh) * 2018-03-02 2019-09-13 北京搜狗科技发展有限公司 一种音频处理方法、装置、设备及可读存储介质
CN109192216A (zh) * 2018-08-08 2019-01-11 联智科技(天津)有限责任公司 一种声纹识别用训练数据集仿真获取方法及其获取装置
KR20200033707A (ko) * 2018-09-20 2020-03-30 삼성전자주식회사 전자 장치, 및 이의 학습 데이터 제공 또는 획득 방법
CN109545196B (zh) * 2018-12-29 2022-11-29 深圳市科迈爱康科技有限公司 语音识别方法、装置及计算机可读存储介质
CN109545195B (zh) * 2018-12-29 2023-02-21 深圳市科迈爱康科技有限公司 陪伴机器人及其控制方法
CN110544469B (zh) * 2019-09-04 2022-04-19 秒针信息技术有限公司 语音识别模型的训练方法及装置、存储介质、电子装置
CN110808030B (zh) * 2019-11-22 2021-01-22 珠海格力电器股份有限公司 语音唤醒方法、系统、存储介质及电子设备
CN111128141B (zh) * 2019-12-31 2022-04-19 思必驰科技股份有限公司 音频识别解码方法和装置
CN111369979B (zh) * 2020-02-26 2023-12-19 广州市百果园信息技术有限公司 训练样本获取方法、装置、设备及计算机存储介质
CN113099353A (zh) * 2021-04-21 2021-07-09 浙江吉利控股集团有限公司 一种用于车辆的集成麦克风、安全带、方向盘及车辆

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1331467A (zh) * 2000-06-28 2002-01-16 松下电器产业株式会社 产生声学模型的方法和装置
CN1451152A (zh) * 2000-09-01 2003-10-22 捷装技术公司 计算机实现的语音识别系统训练
US20050071159A1 (en) * 2003-09-26 2005-03-31 Robert Boman Speech recognizer performance in car and home applications utilizing novel multiple microphone configurations
CN101023467A (zh) * 2005-01-04 2007-08-22 三菱电机株式会社 用于提炼音频分类器的训练数据集的方法和用于分类数据的方法
US20080300871A1 (en) * 2007-05-29 2008-12-04 At&T Corp. Method and apparatus for identifying acoustic background environments to enhance automatic speech recognition
CN102426837A (zh) * 2011-12-30 2012-04-25 中国农业科学院农业信息研究所 农业现场数据采集的移动设备语音识别的鲁棒性方法
CN102903360A (zh) * 2011-07-26 2013-01-30 财团法人工业技术研究院 以麦克风阵列为基础的语音辨识系统与方法
CN103069480A (zh) * 2010-06-14 2013-04-24 谷歌公司 用于语音识别的语音模型和噪声模型

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6876966B1 (en) * 2000-10-16 2005-04-05 Microsoft Corporation Pattern recognition training method and apparatus using inserted noise followed by noise reduction

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1331467A (zh) * 2000-06-28 2002-01-16 松下电器产业株式会社 产生声学模型的方法和装置
CN1451152A (zh) * 2000-09-01 2003-10-22 捷装技术公司 计算机实现的语音识别系统训练
US20050071159A1 (en) * 2003-09-26 2005-03-31 Robert Boman Speech recognizer performance in car and home applications utilizing novel multiple microphone configurations
CN101023467A (zh) * 2005-01-04 2007-08-22 三菱电机株式会社 用于提炼音频分类器的训练数据集的方法和用于分类数据的方法
US20080300871A1 (en) * 2007-05-29 2008-12-04 At&T Corp. Method and apparatus for identifying acoustic background environments to enhance automatic speech recognition
CN103069480A (zh) * 2010-06-14 2013-04-24 谷歌公司 用于语音识别的语音模型和噪声模型
CN102903360A (zh) * 2011-07-26 2013-01-30 财团法人工业技术研究院 以麦克风阵列为基础的语音辨识系统与方法
CN102426837A (zh) * 2011-12-30 2012-04-25 中国农业科学院农业信息研究所 农业现场数据采集的移动设备语音识别的鲁棒性方法

Also Published As

Publication number Publication date
CN105580071A (zh) 2016-05-11
EP2994907A2 (fr) 2016-03-16
WO2014182453A3 (fr) 2014-12-31
WO2014182453A2 (fr) 2014-11-13

Similar Documents

Publication Publication Date Title
US9275638B2 (en) Method and apparatus for training a voice recognition model database
CN105580071B (zh) 用于训练声音识别模型数据库的方法和装置
US11676581B2 (en) Method and apparatus for evaluating trigger phrase enrollment
US11557310B2 (en) Voice trigger for a digital assistant
CN110288987B (zh) 用于处理声音数据的系统和控制该系统的方法
CN106201424B (zh) 一种信息交互方法、装置及电子设备
CN106971723B (zh) 语音处理方法和装置、用于语音处理的装置
CN112074900B (zh) 用于自然语言处理的音频分析
US9542947B2 (en) Method and apparatus including parallell processes for voice recognition
CN106796785B (zh) 用于产生声音检测模型的声音样本验证
JP2019117623A (ja) 音声対話方法、装置、デバイス及び記憶媒体
JP6844608B2 (ja) 音声処理装置および音声処理方法
US20140278392A1 (en) Method and Apparatus for Pre-Processing Audio Signals
KR20160106075A (ko) 오디오 스트림에서 음악 작품을 식별하기 위한 방법 및 디바이스
US11373656B2 (en) Speech processing method and apparatus therefor
WO2020202862A1 (fr) Dispositif de production de réponses et procédé de production de réponses
CN112906369A (zh) 一种歌词文件生成方法及装置
US20210110838A1 (en) Acoustic aware voice user interface
CN110992928A (zh) 音频处理方法及终端设备

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant