DE602006018795D1 - Kompensation der variabilität zwischen sitzungen zur automatischen extraktion von informationen aus sprache - Google Patents
Kompensation der variabilität zwischen sitzungen zur automatischen extraktion von informationen aus spracheInfo
- Publication number
- DE602006018795D1 DE602006018795D1 DE602006018795T DE602006018795T DE602006018795D1 DE 602006018795 D1 DE602006018795 D1 DE 602006018795D1 DE 602006018795 T DE602006018795 T DE 602006018795T DE 602006018795 T DE602006018795 T DE 602006018795T DE 602006018795 D1 DE602006018795 D1 DE 602006018795D1
- Authority
- DE
- Germany
- Prior art keywords
- variability
- information
- compensation
- automatic extraction
- voice signal
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000605 extraction Methods 0.000 title abstract 2
- 239000013598 vector Substances 0.000 abstract 5
- 238000000034 method Methods 0.000 abstract 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
- G10L17/04—Training, enrolment or model building
Landscapes
- Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Stereophonic System (AREA)
- Circuit For Audible Band Transducer (AREA)
- Machine Translation (AREA)
- Electrically Operated Instructional Devices (AREA)
- Complex Calculations (AREA)
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/EP2006/004598 WO2007131530A1 (en) | 2006-05-16 | 2006-05-16 | Intersession variability compensation for automatic extraction of information from voice |
Publications (1)
Publication Number | Publication Date |
---|---|
DE602006018795D1 true DE602006018795D1 (de) | 2011-01-20 |
Family
ID=37057050
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
DE602006018795T Active DE602006018795D1 (de) | 2006-05-16 | 2006-05-16 | Kompensation der variabilität zwischen sitzungen zur automatischen extraktion von informationen aus sprache |
Country Status (8)
Country | Link |
---|---|
US (1) | US8566093B2 (de) |
EP (1) | EP2022042B1 (de) |
AT (1) | ATE491202T1 (de) |
AU (1) | AU2006343470B2 (de) |
CA (1) | CA2652302C (de) |
DE (1) | DE602006018795D1 (de) |
ES (1) | ES2357674T3 (de) |
WO (1) | WO2007131530A1 (de) |
Families Citing this family (28)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8504366B2 (en) * | 2005-12-19 | 2013-08-06 | Nuance Communications, Inc. | Joint factor analysis scoring for speech processing systems |
DE602007014382D1 (de) * | 2007-11-12 | 2011-06-16 | Harman Becker Automotive Sys | Unterscheidung zwischen Vordergrundsprache und Hintergrundgeräuschen |
US9020816B2 (en) * | 2008-08-14 | 2015-04-28 | 21Ct, Inc. | Hidden markov model for speech processing with training method |
US8412525B2 (en) | 2009-04-30 | 2013-04-02 | Microsoft Corporation | Noise robust speech classifier ensemble |
US9177557B2 (en) * | 2009-07-07 | 2015-11-03 | General Motors Llc. | Singular value decomposition for improved voice recognition in presence of multi-talker background noise |
FR2965377A1 (fr) | 2010-09-24 | 2012-03-30 | Univ D Avignon Et Des Pays De Vaucluse | Procede de classification de donnees biometriques |
US9042867B2 (en) * | 2012-02-24 | 2015-05-26 | Agnitio S.L. | System and method for speaker recognition on mobile devices |
US9984678B2 (en) * | 2012-03-23 | 2018-05-29 | Microsoft Technology Licensing, Llc | Factored transforms for separable adaptation of acoustic models |
DK2713367T3 (en) | 2012-09-28 | 2017-02-20 | Agnitio S L | Speech Recognition |
US9240184B1 (en) * | 2012-11-15 | 2016-01-19 | Google Inc. | Frame-level combination of deep neural network and gaussian mixture models |
US20140222423A1 (en) * | 2013-02-07 | 2014-08-07 | Nuance Communications, Inc. | Method and Apparatus for Efficient I-Vector Extraction |
US9406298B2 (en) * | 2013-02-07 | 2016-08-02 | Nuance Communications, Inc. | Method and apparatus for efficient i-vector extraction |
US9865266B2 (en) * | 2013-02-25 | 2018-01-09 | Nuance Communications, Inc. | Method and apparatus for automated speaker parameters adaptation in a deployed speaker verification system |
US9489965B2 (en) * | 2013-03-15 | 2016-11-08 | Sri International | Method and apparatus for acoustic signal characterization |
US9258425B2 (en) | 2013-05-22 | 2016-02-09 | Nuance Communications, Inc. | Method and system for speaker verification |
US10438581B2 (en) | 2013-07-31 | 2019-10-08 | Google Llc | Speech recognition using neural networks |
US9514753B2 (en) | 2013-11-04 | 2016-12-06 | Google Inc. | Speaker identification using hash-based indexing |
EP3123468A1 (de) * | 2014-03-28 | 2017-02-01 | Intel IP Corporation | Trainingsklassifizierer mit ausgewählten kohortenprobenuntergruppen |
US10014007B2 (en) | 2014-05-28 | 2018-07-03 | Interactive Intelligence, Inc. | Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system |
US10255903B2 (en) | 2014-05-28 | 2019-04-09 | Interactive Intelligence Group, Inc. | Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system |
US9792899B2 (en) * | 2014-07-15 | 2017-10-17 | International Business Machines Corporation | Dataset shift compensation in machine learning |
CN107104994B (zh) * | 2016-02-22 | 2021-07-20 | 华硕电脑股份有限公司 | 语音识别方法、电子装置及语音识别系统 |
CA3030133C (en) | 2016-06-02 | 2022-08-09 | Genesys Telecommunications Laboratories, Inc. | Technologies for authenticating a speaker using voice biometrics |
DE102017207876A1 (de) * | 2017-05-10 | 2018-11-15 | Robert Bosch Gmbh | Parallelisierte Verarbeitung |
CN109146450A (zh) | 2017-06-16 | 2019-01-04 | 阿里巴巴集团控股有限公司 | 支付方法、客户端、电子设备、存储介质和服务器 |
US10304475B1 (en) * | 2017-08-14 | 2019-05-28 | Amazon Technologies, Inc. | Trigger word based beam selection |
US11289098B2 (en) * | 2019-03-08 | 2022-03-29 | Samsung Electronics Co., Ltd. | Method and apparatus with speaker recognition registration |
CN111833887A (zh) * | 2020-07-14 | 2020-10-27 | 山东理工大学 | 一种基于局部保持判别投影的说话人确认方法 |
Family Cites Families (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6519561B1 (en) * | 1997-11-03 | 2003-02-11 | T-Netix, Inc. | Model adaptation of neural tree networks and other fused models for speaker verification |
US6327565B1 (en) * | 1998-04-30 | 2001-12-04 | Matsushita Electric Industrial Co., Ltd. | Speaker and environment adaptation based on eigenvoices |
US6141644A (en) * | 1998-09-04 | 2000-10-31 | Matsushita Electric Industrial Co., Ltd. | Speaker verification and speaker identification based on eigenvoices |
US6571208B1 (en) * | 1999-11-29 | 2003-05-27 | Matsushita Electric Industrial Co., Ltd. | Context-dependent acoustic models for medium and large vocabulary speech recognition with eigenvoice training |
US6529872B1 (en) * | 2000-04-18 | 2003-03-04 | Matsushita Electric Industrial Co., Ltd. | Method for noise adaptation in automatic speech recognition using transformed matrices |
DE10047723A1 (de) * | 2000-09-27 | 2002-04-11 | Philips Corp Intellectual Pty | Verfahren zur Ermittlung eines Eigenraums zur Darstellung einer Mehrzahl von Trainingssprechern |
US6895376B2 (en) * | 2001-05-04 | 2005-05-17 | Matsushita Electric Industrial Co., Ltd. | Eigenvoice re-estimation technique of acoustic models for speech recognition, speaker identification and speaker verification |
US6915259B2 (en) * | 2001-05-24 | 2005-07-05 | Matsushita Electric Industrial Co., Ltd. | Speaker and environment adaptation based on linear separation of variability sources |
JP4652232B2 (ja) * | 2003-07-01 | 2011-03-16 | フランス・テレコム | 話者の圧縮表現用の音声信号の分析のための方法およびシステム |
US20080208581A1 (en) * | 2003-12-05 | 2008-08-28 | Queensland University Of Technology | Model Adaptation System and Method for Speaker Recognition |
CA2609247C (en) * | 2005-05-24 | 2015-10-13 | Loquendo S.P.A. | Automatic text-independent, language-independent speaker voice-print creation and speaker recognition |
DE602007004733D1 (de) * | 2007-10-10 | 2010-03-25 | Harman Becker Automotive Sys | Sprechererkennung |
US8050920B2 (en) * | 2008-01-18 | 2011-11-01 | Universidad De Chile | Biometric control method on the telephone network with speaker verification technology by using an intra speaker variability and additive noise unsupervised compensation |
-
2006
- 2006-05-16 EP EP06742938A patent/EP2022042B1/de not_active Not-in-force
- 2006-05-16 CA CA2652302A patent/CA2652302C/en not_active Expired - Fee Related
- 2006-05-16 ES ES06742938T patent/ES2357674T3/es active Active
- 2006-05-16 US US12/227,282 patent/US8566093B2/en active Active
- 2006-05-16 AU AU2006343470A patent/AU2006343470B2/en not_active Ceased
- 2006-05-16 DE DE602006018795T patent/DE602006018795D1/de active Active
- 2006-05-16 AT AT06742938T patent/ATE491202T1/de not_active IP Right Cessation
- 2006-05-16 WO PCT/EP2006/004598 patent/WO2007131530A1/en active Application Filing
Also Published As
Publication number | Publication date |
---|---|
US20110040561A1 (en) | 2011-02-17 |
CA2652302A1 (en) | 2007-11-22 |
CA2652302C (en) | 2015-04-07 |
EP2022042A1 (de) | 2009-02-11 |
WO2007131530A1 (en) | 2007-11-22 |
AU2006343470B2 (en) | 2012-07-19 |
AU2006343470A1 (en) | 2007-11-22 |
EP2022042B1 (de) | 2010-12-08 |
ES2357674T3 (es) | 2011-04-28 |
US8566093B2 (en) | 2013-10-22 |
ATE491202T1 (de) | 2010-12-15 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
DE602006018795D1 (de) | Kompensation der variabilität zwischen sitzungen zur automatischen extraktion von informationen aus sprache | |
MX2021014721A (es) | Sistemas y metodos para aprendizaje de maquina de atributos de voz. | |
EP3923277A3 (de) | Verzögerte antworten durch berechnungsassistenten | |
DE602007014382D1 (de) | Unterscheidung zwischen Vordergrundsprache und Hintergrundgeräuschen | |
WO2018038385A3 (ko) | 음성 인식 방법 및 이를 수행하는 전자 장치 | |
MX2010008372A (es) | Aparato y metodo para calcular coeficientes de filtro para supresion de eco. | |
TW200741650A (en) | Method and apparatus for processing a audio signal | |
WO2008114448A1 (ja) | 音声認識システム、音声認識プログラムおよび音声認識方法 | |
ATE463820T1 (de) | Sprachaktivitätdetektionssystem und verfahren | |
DK1949755T3 (da) | Høreapparat og fremgangsmåde til behandling af indgangssignaler i et høreapparat | |
WO2008087934A1 (ja) | 拡張認識辞書学習装置と音声認識システム | |
ATE403213T1 (de) | System und verfahren zur automatischen spracherkennung | |
DK2027581T3 (da) | Signalseparator, fremgangsmåde til bestemmelse af outputsignaler på basis af mikrofonsignaler og computerprogram | |
WO2007018802A3 (en) | Method and system for operation of a voice activity detector | |
WO2007035183A3 (en) | Method, system, and program product for measuring audio video synchronization independent of speaker characteristics | |
ATE425532T1 (de) | Modellbasierte verbesserung von sprachsignalen | |
DK1974587T3 (da) | Fremgangsmåde og system til udligning af en højttaler i et lokale | |
DE602007009731D1 (de) | Verfahren zur rückkopplungslöschung in einem hörgerät und hörgerät | |
FR2976710B1 (fr) | Procede de debruitage pour equipement audio multi-microphones, notamment pour un systeme de telephonie "mains libres" | |
ATE514162T1 (de) | Dynamische erzeugung von kontexten zur spracherkennung | |
ATE492875T1 (de) | Sprachanalysesystem | |
DE602005007939D1 (de) | Verfahren und system zum automatischen bereitstellen linguistischer formulierungen, die ausserhalb ekennungssystems liegen | |
WO2008105263A1 (ja) | 重み係数学習システム及び音声認識システム | |
ATE442641T1 (de) | Spracherkennungsverfahren und -system, das an die eigenschaften von nichtmuttersprachlern angepasst ist | |
ATE480063T1 (de) | Verfahren und vorrichtung zur schätzung der sprachqualität |