FR2274101B1 - - Google Patents

Info

Publication number
FR2274101B1
FR2274101B1 FR7517404A FR7517404A FR2274101B1 FR 2274101 B1 FR2274101 B1 FR 2274101B1 FR 7517404 A FR7517404 A FR 7517404A FR 7517404 A FR7517404 A FR 7517404A FR 2274101 B1 FR2274101 B1 FR 2274101B1
Authority
FR
France
Prior art keywords
phoneme
characteristic
parameter
extracted
component
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired
Application number
FR7517404A
Other languages
French (fr)
Other versions
FR2274101A1 (en
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fujifilm Business Innovation Corp
Original Assignee
Fuji Xerox Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fuji Xerox Co Ltd filed Critical Fuji Xerox Co Ltd
Publication of FR2274101A1 publication Critical patent/FR2274101A1/en
Application granted granted Critical
Publication of FR2274101B1 publication Critical patent/FR2274101B1/fr
Granted legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition

Abstract

Speech recognition in prior art uses one extracted characteristic component (xi) to represent one phoneme (Xi) as spoken by one speaker. This invention provides for recognizing the same phoneme as spoken by different speakers, by deriving a group of such components (xik), each a slight variant of the others, to allow finding one component most similar to both the specific phoneme and specific speaker, the method comprising the steps of: normalizing the sound pressure level of an input speech from an unknown speaker; analyzing the normalized voice in a plurality of channels having different frequencies; setting, with respect to the output Fj of each frequency band thus analyzed, a weight alpha j of the output Fj so that weight alpha j corresponds to a characteristic of a predetermined phoneme Xi; extracting the characteristic component xi of the phoneme Xi, setting a weight beta j of output Fj so that, when the extracted characteristic component xi causes a malfunction or error due to another phoneme Xe, a characteristic of phoneme Xe is corresponded to; simultaneously extracting the characteristic component xe of phoneme Xe and, when the difference between the characteristic components thus extracted is greater than a predetermined threshold value gamma i, applying the difference as a characteristic parameter for the phoneme xi; expanding the characteristic parameter to obtain a characteristic parameter group based on the characteristic parameter, each being slightly different from each other so as to be adapted for individual characteristics of different speakers; subsequently extracting from the characteristic parameter group a characteristic parameter, having maximum similarity to a reference parameter previously memorized, as an adaptive parameter adaptive to the unknown speaker; and, matching a standard pattern derived from the extracted adaptive parameters with an unknown pattern corresponding to the unknown speakers, thereby effecting recognition or analysis of the voice.
FR7517404A 1974-06-04 1975-06-04 VOICE RECOGNITION PROCESS AND DEVICE IMPLEMENTING THIS PROCESS Granted FR2274101A1 (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP49062526A JPS50155105A (en) 1974-06-04 1974-06-04

Publications (2)

Publication Number Publication Date
FR2274101A1 FR2274101A1 (en) 1976-01-02
FR2274101B1 true FR2274101B1 (en) 1980-02-08

Family

ID=13202704

Family Applications (1)

Application Number Title Priority Date Filing Date
FR7517404A Granted FR2274101A1 (en) 1974-06-04 1975-06-04 VOICE RECOGNITION PROCESS AND DEVICE IMPLEMENTING THIS PROCESS

Country Status (5)

Country Link
US (1) US4060694A (en)
JP (1) JPS50155105A (en)
DE (1) DE2524804A1 (en)
FR (1) FR2274101A1 (en)
GB (1) GB1519492A (en)

Families Citing this family (28)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE2844156A1 (en) * 1978-10-10 1980-04-24 Philips Patentverwaltung METHOD FOR VERIFYING A SPEAKER
USRE31188E (en) * 1978-10-31 1983-03-22 Bell Telephone Laboratories, Incorporated Multiple template speech recognition system
US4181821A (en) * 1978-10-31 1980-01-01 Bell Telephone Laboratories, Incorporated Multiple template speech recognition system
NL177950C (en) * 1978-12-14 1986-07-16 Philips Nv VOICE ANALYSIS SYSTEM FOR DETERMINING TONE IN HUMAN SPEECH.
US4297528A (en) * 1979-09-10 1981-10-27 Interstate Electronics Corp. Training circuit for audio signal recognition computer
JPS6024994B2 (en) * 1980-04-21 1985-06-15 シャープ株式会社 Pattern similarity calculation method
JPS5710199A (en) * 1980-06-21 1982-01-19 Tokyo Shibaura Electric Co Voice information extractor
JPS5782896A (en) * 1980-11-12 1982-05-24 Hitachi Ltd Continuous voice recognition system
US4363102A (en) * 1981-03-27 1982-12-07 Bell Telephone Laboratories, Incorporated Speaker identification system using word recognition templates
US4388495A (en) * 1981-05-01 1983-06-14 Interstate Electronics Corporation Speech recognition microcomputer
US4454586A (en) * 1981-11-19 1984-06-12 At&T Bell Laboratories Method and apparatus for generating speech pattern templates
JPS6024597A (en) * 1983-07-21 1985-02-07 日本電気株式会社 Voice registration system
JPS6057475A (en) * 1983-09-07 1985-04-03 Toshiba Corp Pattern recognizing system
US4817158A (en) * 1984-10-19 1989-03-28 International Business Machines Corporation Normalization of speech signals
US4797927A (en) * 1985-10-30 1989-01-10 Grumman Aerospace Corporation Voice recognition process utilizing content addressable memory
US5129000A (en) * 1986-04-05 1992-07-07 Sharp Kabushiki Kaisha Voice recognition method by analyzing syllables
US4856067A (en) * 1986-08-21 1989-08-08 Oki Electric Industry Co., Ltd. Speech recognition system wherein the consonantal characteristics of input utterances are extracted
US4805225A (en) * 1986-11-06 1989-02-14 The Research Foundation Of The State University Of New York Pattern recognition method and apparatus
EP0290190B1 (en) * 1987-04-30 1991-10-09 Oki Electric Industry Company, Limited Pattern matching system
DE3723078A1 (en) * 1987-07-11 1989-01-19 Philips Patentverwaltung METHOD FOR DETECTING CONTINUOUSLY SPOKEN WORDS
US4949382A (en) * 1988-10-05 1990-08-14 Griggs Talkwriter Corporation Speech-controlled phonetic typewriter or display device having circuitry for analyzing fast and slow speech
US5012517A (en) * 1989-04-18 1991-04-30 Pacific Communication Science, Inc. Adaptive transform coder having long term predictor
AU6785696A (en) * 1995-09-05 1997-03-27 Frank Uldall Leonhard Method and system for processing auditory signals
US6122613A (en) * 1997-01-30 2000-09-19 Dragon Systems, Inc. Speech recognition using multiple recognizers (selectively) applied to the same input sample
EP1205906B1 (en) * 2000-11-07 2003-05-07 Telefonaktiebolaget L M Ericsson (Publ) Reference templates adaptation for speech recognition
US8239197B2 (en) * 2002-03-28 2012-08-07 Intellisist, Inc. Efficient conversion of voice messages into text
CA2480509C (en) 2002-03-28 2011-06-07 Martin Dunsmuir Closed-loop command and response system for automatic communications between interacting computer systems over an audio communications channel
DE60210174T2 (en) * 2002-08-08 2006-08-24 Alcatel Method for signal coding by means of vector quantization

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB1261385A (en) * 1968-07-24 1972-01-26 Matsushita Electric Ind Co Ltd Speech analyzing apparatus
US3509280A (en) * 1968-11-01 1970-04-28 Itt Adaptive speech pattern recognition system
US3619509A (en) * 1969-07-30 1971-11-09 Rca Corp Broad slope determining network
US3673331A (en) * 1970-01-19 1972-06-27 Texas Instruments Inc Identity verification by voice signals in the frequency domain
US3816722A (en) * 1970-09-29 1974-06-11 Nippon Electric Co Computer for calculating the similarity between patterns and pattern recognition system comprising the similarity computer
US3700815A (en) * 1971-04-20 1972-10-24 Bell Telephone Labor Inc Automatic speaker verification by non-linear time alignment of acoustic parameters
US3864518A (en) * 1972-03-20 1975-02-04 Meguer V Kalfaian Signal conversion apparatus
US3883850A (en) * 1972-06-19 1975-05-13 Threshold Tech Programmable word recognition apparatus
GB1435779A (en) * 1972-09-21 1976-05-12 Threshold Tech Word recognition

Also Published As

Publication number Publication date
JPS50155105A (en) 1975-12-15
GB1519492A (en) 1978-07-26
US4060694A (en) 1977-11-29
FR2274101A1 (en) 1976-01-02
DE2524804A1 (en) 1975-12-18

Similar Documents

Publication Publication Date Title
FR2274101B1 (en)
US5842162A (en) Method and recognizer for recognizing a sampled sound signal in noise
EP0077558B1 (en) Method and apparatus for speech recognition and reproduction
US4852181A (en) Speech recognition for recognizing the catagory of an input speech pattern
US4720863A (en) Method and apparatus for text-independent speaker recognition
KR960701428A (en) A METHOD AND APPARATUS FOR SPEAKER RECOGNITION
US5228087A (en) Speech recognition apparatus and methods
US4516215A (en) Recognition of speech or speech-like sounds
MX9505296A (en) Speech recognition bias equalization method and apparatus.
JPH0312319B2 (en)
US7216075B2 (en) Speech recognition method and apparatus with noise adaptive standard pattern
Chavan et al. Speech recognition in noisy environment, issues and challenges: A review
Toruk et al. Short utterance speaker recognition using time-delay neural network
US5001761A (en) Device for normalizing a speech spectrum
Alimuradov et al. Application of improved complete ensemble empirical mode decomposition with adaptive noise in speech signal processing
Zhu et al. Analysis of hybrid feature research based on extraction LPCC and MFCC
Koc Acoustic feature analysis for robust speech recognition
Chiu et al. Analysis of physiologically-motivated signal processing for robust speech recognition.
Mustofa Implementation speech recognition for robot control using MFCC and ANFIS
Hurmalainen et al. Compact long context spectral factorisation models for noise robust recognition of medium vocabulary speech
Das Some experiments in discrete utterance recognition
Besbes et al. Wavelet packet energy and entropy features for classification of stressed speech
Bharathi et al. Speaker verification in a noisy environment by enhancing the speech signal using various approaches of spectral subtraction
KR20130070345A (en) Apparatus and method for recognizing of speaker using vocal signal
Hsieh et al. Magnitude replacement of real and imaginary modulation spectrum of acoustic spectrograms for noise-robust speech recognition

Legal Events

Date Code Title Description
ST Notification of lapse