WO2009057739A1 - Speaker selection apparatus, speaker adoptive model making-out apparatus, speaker selection method and speaker selection program - Google Patents
Speaker selection apparatus, speaker adoptive model making-out apparatus, speaker selection method and speaker selection program Download PDFInfo
- Publication number
- WO2009057739A1 WO2009057739A1 PCT/JP2008/069853 JP2008069853W WO2009057739A1 WO 2009057739 A1 WO2009057739 A1 WO 2009057739A1 JP 2008069853 W JP2008069853 W JP 2008069853W WO 2009057739 A1 WO2009057739 A1 WO 2009057739A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- speaker
- speaker selection
- speakers
- selection
- model making
- Prior art date
Links
- 238000010187 selection method Methods 0.000 title 1
- 230000003044 adaptive effect Effects 0.000 abstract 1
- 230000006866 deterioration Effects 0.000 abstract 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
Landscapes
- Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Artificial Intelligence (AREA)
- Computational Linguistics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Circuit For Audible Band Transducer (AREA)
Abstract
It is an object to provide a speaker selection apparatus that can suppress deterioration in accuracy of an adaptive model. The speaker selection apparatus is provided with a speaker distribution density calculation means that calculates the density of distribution of a plurality of speakers where an uttering speaker takes a leading part of a speaker space by using a characteristic quantity extracted from an input voice signal of the uttering speaker and speaker models of a plurality of speakers stored in advance, and a selected speaker number calculation means that calculates the number of speakers to be selected by using the speaker distribution densities.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2009539120A JP5626558B2 (en) | 2007-10-31 | 2008-10-31 | Speaker selection device, speaker adaptive model creation device, speaker selection method, and speaker selection program |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2007283767 | 2007-10-31 | ||
JP2007-283767 | 2007-10-31 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2009057739A1 true WO2009057739A1 (en) | 2009-05-07 |
Family
ID=40591119
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/JP2008/069853 WO2009057739A1 (en) | 2007-10-31 | 2008-10-31 | Speaker selection apparatus, speaker adoptive model making-out apparatus, speaker selection method and speaker selection program |
Country Status (2)
Country | Link |
---|---|
JP (1) | JP5626558B2 (en) |
WO (1) | WO2009057739A1 (en) |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH11143486A (en) * | 1997-11-10 | 1999-05-28 | Fuji Xerox Co Ltd | Device and method adaptable for speaker |
JP2002149185A (en) * | 2000-09-27 | 2002-05-24 | Koninkl Philips Electronics Nv | Method for deciding intrinsic space to express more than one learning speakers |
WO2005034086A1 (en) * | 2003-10-03 | 2005-04-14 | Asahi Kasei Kabushiki Kaisha | Data processing device and data processing device control program |
JP3756879B2 (en) * | 2001-12-20 | 2006-03-15 | 松下電器産業株式会社 | Method for creating acoustic model, apparatus for creating acoustic model, computer program for creating acoustic model |
WO2008117626A1 (en) * | 2007-03-27 | 2008-10-02 | Nec Corporation | Speaker selecting device, speaker adaptive model making device, speaker selecting method, speaker selecting program, and speaker adaptive model making program |
-
2008
- 2008-10-31 WO PCT/JP2008/069853 patent/WO2009057739A1/en active Application Filing
- 2008-10-31 JP JP2009539120A patent/JP5626558B2/en active Active
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH11143486A (en) * | 1997-11-10 | 1999-05-28 | Fuji Xerox Co Ltd | Device and method adaptable for speaker |
JP2002149185A (en) * | 2000-09-27 | 2002-05-24 | Koninkl Philips Electronics Nv | Method for deciding intrinsic space to express more than one learning speakers |
JP3756879B2 (en) * | 2001-12-20 | 2006-03-15 | 松下電器産業株式会社 | Method for creating acoustic model, apparatus for creating acoustic model, computer program for creating acoustic model |
WO2005034086A1 (en) * | 2003-10-03 | 2005-04-14 | Asahi Kasei Kabushiki Kaisha | Data processing device and data processing device control program |
WO2008117626A1 (en) * | 2007-03-27 | 2008-10-02 | Nec Corporation | Speaker selecting device, speaker adaptive model making device, speaker selecting method, speaker selecting program, and speaker adaptive model making program |
Also Published As
Publication number | Publication date |
---|---|
JPWO2009057739A1 (en) | 2011-03-10 |
JP5626558B2 (en) | 2014-11-19 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2008117626A1 (en) | Speaker selecting device, speaker adaptive model making device, speaker selecting method, speaker selecting program, and speaker adaptive model making program | |
WO2008047339A3 (en) | Method and apparatus for large population speaker identification in telephone interactions | |
MX338524B (en) | Apparatus and method for microphone positioning based on a spatial power density. | |
EP3742436A4 (en) | Voice synthesis method, model training method, device and computer device | |
ATE491202T1 (en) | COMPENSATING BETWEEN-SESSION VARIABILITY TO AUTOMATICALLY EXTRACT INFORMATION FROM SPEECH | |
ATE484927T1 (en) | METHOD FOR AUTOMATICALLY EQUALIZING A SOUND SYSTEM | |
WO2007044370A3 (en) | System and method for tailoring music to an activity based on an activity goal | |
IN2014CN03504A (en) | ||
WO2008126347A1 (en) | Voice analysis device, voice analysis method, voice analysis program, and system integration circuit | |
TW200721109A (en) | Pronunciation diagnosis device, pronunciation diagnosis method, recording medium, and pronunciation diagnosis program | |
WO2016139670A8 (en) | System and method for generating accurate speech transcription from natural speech audio signals | |
WO2012036424A3 (en) | Method and apparatus for performing microphone beamforming | |
JP2013534651A5 (en) | ||
GB2449377A (en) | Method of simulating deformable object using geometrically motivated model | |
EP2529817A4 (en) | Toy set, game control program, and game device and toy communication system | |
JP2011171875A5 (en) | ||
JP2011085641A5 (en) | ||
SG10201809403YA (en) | Loudspeaker array with circuit board-integrated asic | |
DE602004023134D1 (en) | LANGUAGE RECOGNITION AND SYSTEM ADAPTED TO THE CHARACTERISTICS OF NON-NUT SPEAKERS | |
WO2007129156A3 (en) | Soft alignment in gaussian mixture model based transformation | |
WO2013079051A8 (en) | Device and method for simulating stereophonic sound | |
RU2015141805A (en) | SIMULATION OF ACOUSTIC PULSE RESPONSE | |
EP2277170A4 (en) | Methods and systems for simplifying copying and pasting transcriptions generated from a dictation based speech-to-text system | |
WO2009038013A1 (en) | Noise removal system, noise removal method, and noise removal program | |
EP2416314A4 (en) | Method for reproducing an audio recording with the simulation of the acoustic characteristics of the recording conditions |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 08843694 Country of ref document: EP Kind code of ref document: A1 |
|
DPE1 | Request for preliminary examination filed after expiration of 19th month from priority date (pct application filed from 20040101) | ||
WWE | Wipo information: entry into national phase |
Ref document number: 2009539120 Country of ref document: JP |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 08843694 Country of ref document: EP Kind code of ref document: A1 |