WO2009057739A1 - Speaker selection apparatus, speaker adoptive model making-out apparatus, speaker selection method and speaker selection program - Google Patents

Speaker selection apparatus, speaker adoptive model making-out apparatus, speaker selection method and speaker selection program Download PDF

Info

Publication number
WO2009057739A1
WO2009057739A1 PCT/JP2008/069853 JP2008069853W WO2009057739A1 WO 2009057739 A1 WO2009057739 A1 WO 2009057739A1 JP 2008069853 W JP2008069853 W JP 2008069853W WO 2009057739 A1 WO2009057739 A1 WO 2009057739A1
Authority
WO
WIPO (PCT)
Prior art keywords
speaker
speaker selection
speakers
selection
model making
Prior art date
Application number
PCT/JP2008/069853
Other languages
French (fr)
Japanese (ja)
Inventor
Masahiro Tani
Yoshifumi Onishi
Tadashi Emori
Takafumi Koshinaka
Original Assignee
Nec Corporation
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nec Corporation filed Critical Nec Corporation
Priority to JP2009539120A priority Critical patent/JP5626558B2/en
Publication of WO2009057739A1 publication Critical patent/WO2009057739A1/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/065Adaptation
    • G10L15/07Adaptation to the speaker
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification techniques

Landscapes

  • Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Artificial Intelligence (AREA)
  • Computational Linguistics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Circuit For Audible Band Transducer (AREA)

Abstract

It is an object to provide a speaker selection apparatus that can suppress deterioration in accuracy of an adaptive model. The speaker selection apparatus is provided with a speaker distribution density calculation means that calculates the density of distribution of a plurality of speakers where an uttering speaker takes a leading part of a speaker space by using a characteristic quantity extracted from an input voice signal of the uttering speaker and speaker models of a plurality of speakers stored in advance, and a selected speaker number calculation means that calculates the number of speakers to be selected by using the speaker distribution densities.
PCT/JP2008/069853 2007-10-31 2008-10-31 Speaker selection apparatus, speaker adoptive model making-out apparatus, speaker selection method and speaker selection program WO2009057739A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP2009539120A JP5626558B2 (en) 2007-10-31 2008-10-31 Speaker selection device, speaker adaptive model creation device, speaker selection method, and speaker selection program

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2007283767 2007-10-31
JP2007-283767 2007-10-31

Publications (1)

Publication Number Publication Date
WO2009057739A1 true WO2009057739A1 (en) 2009-05-07

Family

ID=40591119

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2008/069853 WO2009057739A1 (en) 2007-10-31 2008-10-31 Speaker selection apparatus, speaker adoptive model making-out apparatus, speaker selection method and speaker selection program

Country Status (2)

Country Link
JP (1) JP5626558B2 (en)
WO (1) WO2009057739A1 (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH11143486A (en) * 1997-11-10 1999-05-28 Fuji Xerox Co Ltd Device and method adaptable for speaker
JP2002149185A (en) * 2000-09-27 2002-05-24 Koninkl Philips Electronics Nv Method for deciding intrinsic space to express more than one learning speakers
WO2005034086A1 (en) * 2003-10-03 2005-04-14 Asahi Kasei Kabushiki Kaisha Data processing device and data processing device control program
JP3756879B2 (en) * 2001-12-20 2006-03-15 松下電器産業株式会社 Method for creating acoustic model, apparatus for creating acoustic model, computer program for creating acoustic model
WO2008117626A1 (en) * 2007-03-27 2008-10-02 Nec Corporation Speaker selecting device, speaker adaptive model making device, speaker selecting method, speaker selecting program, and speaker adaptive model making program

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH11143486A (en) * 1997-11-10 1999-05-28 Fuji Xerox Co Ltd Device and method adaptable for speaker
JP2002149185A (en) * 2000-09-27 2002-05-24 Koninkl Philips Electronics Nv Method for deciding intrinsic space to express more than one learning speakers
JP3756879B2 (en) * 2001-12-20 2006-03-15 松下電器産業株式会社 Method for creating acoustic model, apparatus for creating acoustic model, computer program for creating acoustic model
WO2005034086A1 (en) * 2003-10-03 2005-04-14 Asahi Kasei Kabushiki Kaisha Data processing device and data processing device control program
WO2008117626A1 (en) * 2007-03-27 2008-10-02 Nec Corporation Speaker selecting device, speaker adaptive model making device, speaker selecting method, speaker selecting program, and speaker adaptive model making program

Also Published As

Publication number Publication date
JPWO2009057739A1 (en) 2011-03-10
JP5626558B2 (en) 2014-11-19

Similar Documents

Publication Publication Date Title
WO2008117626A1 (en) Speaker selecting device, speaker adaptive model making device, speaker selecting method, speaker selecting program, and speaker adaptive model making program
WO2008047339A3 (en) Method and apparatus for large population speaker identification in telephone interactions
MX338524B (en) Apparatus and method for microphone positioning based on a spatial power density.
EP3742436A4 (en) Voice synthesis method, model training method, device and computer device
ATE491202T1 (en) COMPENSATING BETWEEN-SESSION VARIABILITY TO AUTOMATICALLY EXTRACT INFORMATION FROM SPEECH
ATE484927T1 (en) METHOD FOR AUTOMATICALLY EQUALIZING A SOUND SYSTEM
WO2007044370A3 (en) System and method for tailoring music to an activity based on an activity goal
IN2014CN03504A (en)
WO2008126347A1 (en) Voice analysis device, voice analysis method, voice analysis program, and system integration circuit
TW200721109A (en) Pronunciation diagnosis device, pronunciation diagnosis method, recording medium, and pronunciation diagnosis program
WO2016139670A8 (en) System and method for generating accurate speech transcription from natural speech audio signals
WO2012036424A3 (en) Method and apparatus for performing microphone beamforming
JP2013534651A5 (en)
GB2449377A (en) Method of simulating deformable object using geometrically motivated model
EP2529817A4 (en) Toy set, game control program, and game device and toy communication system
JP2011171875A5 (en)
JP2011085641A5 (en)
SG10201809403YA (en) Loudspeaker array with circuit board-integrated asic
DE602004023134D1 (en) LANGUAGE RECOGNITION AND SYSTEM ADAPTED TO THE CHARACTERISTICS OF NON-NUT SPEAKERS
WO2007129156A3 (en) Soft alignment in gaussian mixture model based transformation
WO2013079051A8 (en) Device and method for simulating stereophonic sound
RU2015141805A (en) SIMULATION OF ACOUSTIC PULSE RESPONSE
EP2277170A4 (en) Methods and systems for simplifying copying and pasting transcriptions generated from a dictation based speech-to-text system
WO2009038013A1 (en) Noise removal system, noise removal method, and noise removal program
EP2416314A4 (en) Method for reproducing an audio recording with the simulation of the acoustic characteristics of the recording conditions

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 08843694

Country of ref document: EP

Kind code of ref document: A1

DPE1 Request for preliminary examination filed after expiration of 19th month from priority date (pct application filed from 20040101)
WWE Wipo information: entry into national phase

Ref document number: 2009539120

Country of ref document: JP

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 08843694

Country of ref document: EP

Kind code of ref document: A1