MX9603686A - Sistema de identificacion y verificacion de locutor. - Google Patents

Sistema de identificacion y verificacion de locutor.

Info

Publication number
MX9603686A
MX9603686A MX9603686A MX9603686A MX9603686A MX 9603686 A MX9603686 A MX 9603686A MX 9603686 A MX9603686 A MX 9603686A MX 9603686 A MX9603686 A MX 9603686A MX 9603686 A MX9603686 A MX 9603686A
Authority
MX
Mexico
Prior art keywords
speech
components
improved
transfer function
verification system
Prior art date
Application number
MX9603686A
Other languages
English (en)
Other versions
MXPA96003686A (es
Inventor
Richard J Mammone
Khaled T Assaleh
Original Assignee
Univ Rutgers
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Univ Rutgers filed Critical Univ Rutgers
Publication of MX9603686A publication Critical patent/MX9603686A/es
Publication of MXPA96003686A publication Critical patent/MXPA96003686A/es

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification techniques
    • G10L17/02Preprocessing operations, e.g. segment selection; Pattern representation or modelling, e.g. based on linear discriminant analysis [LDA] or principal components; Feature selection or extraction
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/12Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being prediction coefficients
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/18Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/24Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being the cepstrum

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Cable Transmission Systems, Equalization Of Radio And Reduction Of Echo (AREA)
  • Radar Systems Or Details Thereof (AREA)
  • Burglar Alarm Systems (AREA)
  • Selective Calling Equipment (AREA)
  • Circuit For Audible Band Transducer (AREA)
  • Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
  • Complex Calculations (AREA)

Abstract

La presente invencion se refiere a un método y sistema para el reconocimiento de locutor, el cual aplica una evaluacion de componente adaptativo a cada marco de lenguaje para atenuar los componentes del aparato no vocal y normalizar los componentes de lenguaje. Se utiliza un modelo de todos los polos de prediccion lineal para formar una nueva funcion de transferencia que tiene un componente promedio en movimiento. Un espectro normalizado es determinado a partir de la nueva funcion de transferencia. El espectro normalizado está definido como que tiene características mejoradas para los componentes del lenguaje. A partir de los componentes de lenguaje mejorados, se obtiene un reconocimiento de locutor mejorado sobre un canal.
MXPA/A/1996/003686A 1994-02-28 1995-02-28 Sistema de identificacion y verificacion delocutor MXPA96003686A (es)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US08/203,988 US5522012A (en) 1994-02-28 1994-02-28 Speaker identification and verification system
US08203988 1994-02-28
PCT/US1995/002801 WO1995023408A1 (en) 1994-02-28 1995-02-28 Speaker identification and verification system

Publications (2)

Publication Number Publication Date
MX9603686A true MX9603686A (es) 1997-12-31
MXPA96003686A MXPA96003686A (es) 1998-09-18

Family

ID=

Also Published As

Publication number Publication date
JPH10500781A (ja) 1998-01-20
EP0748500A1 (en) 1996-12-18
DE69534942T2 (de) 2006-12-07
CA2184256A1 (en) 1995-08-31
EP0748500A4 (en) 1998-09-23
ATE323933T1 (de) 2006-05-15
CN1142274A (zh) 1997-02-05
US5522012A (en) 1996-05-28
WO1995023408A1 (en) 1995-08-31
EP0748500B1 (en) 2006-04-19
DE69534942D1 (de) 2006-05-24
AU683370B2 (en) 1997-11-06
AU2116495A (en) 1995-09-11

Similar Documents

Publication Publication Date Title
EP0748500A4 (en) IDENTIFICATION AND VERIFICATION SYSTEM OF THE SPOKEN PERSON
US5933801A (en) Method for transforming a speech signal using a pitch manipulator
Hermansky et al. TRAPS-classifiers of temporal patterns.
GB2303237B (en) Method of training neural networks used for speech recognition
FI962572A (fi) Hajautettu äänentunnistusjärjestelmä
NO975475L (no) Stemmegjenkjenningssystem
BR9600762A (pt) Processo e aparelho para reduzir o ruído em si em um sinal de voz de entrada
EP0932141A3 (en) Method for signal controlled switching between different audio coding schemes
GB2307582A (en) System for recognizing spoken sounds from continuous speech and method of using same
EP0871157A3 (en) A method and a device for recognising speech
GB2308483A (en) Method and system for recognizing a boundary beween sounds in continuous speech
IL145285A0 (en) Speaker recognition
WO1999018565A3 (en) Speech coding
US5732388A (en) Feature extraction method for a speech signal
KR20010093327A (ko) 음성 인식 제거 체계
ATE252265T1 (de) Sprachaktivitätserkennung
Barker et al. Recent advances in speech fragment decoding techniques.
KR20030010432A (ko) 잡음환경에서의 음성인식장치
CN1009320B (zh) 语音识别
US20240071411A1 (en) Determining dialog quality metrics of a mixed audio signal
Weintraub Sound separation and auditory perceptual organization
Tibrewala et al. Multi-stream approach in acoustic modeling
Mauuary Blind equalization for robust telephone based speech recognition
Okuno et al. A new speech enhancement: Speech stream segregation
Cheng et al. A robust front-end algorithm for distributed speech recognition

Legal Events

Date Code Title Description
FG Grant or registration
MM Annulment or lapse due to non-payment of fees