MX9603686A - Sistema de identificacion y verificacion de locutor. - Google Patents
Sistema de identificacion y verificacion de locutor.Info
- Publication number
- MX9603686A MX9603686A MX9603686A MX9603686A MX9603686A MX 9603686 A MX9603686 A MX 9603686A MX 9603686 A MX9603686 A MX 9603686A MX 9603686 A MX9603686 A MX 9603686A MX 9603686 A MX9603686 A MX 9603686A
- Authority
- MX
- Mexico
- Prior art keywords
- speech
- components
- improved
- transfer function
- verification system
- Prior art date
Links
- 238000012795 verification Methods 0.000 title 1
- 238000001228 spectrum Methods 0.000 abstract 2
- 230000003044 adaptive effect Effects 0.000 abstract 1
- 238000000034 method Methods 0.000 abstract 1
- 230000001755 vocal effect Effects 0.000 abstract 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
- G10L17/02—Preprocessing operations, e.g. segment selection; Pattern representation or modelling, e.g. based on linear discriminant analysis [LDA] or principal components; Feature selection or extraction
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/12—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being prediction coefficients
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/18—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/24—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being the cepstrum
Landscapes
- Engineering & Computer Science (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Cable Transmission Systems, Equalization Of Radio And Reduction Of Echo (AREA)
- Radar Systems Or Details Thereof (AREA)
- Burglar Alarm Systems (AREA)
- Selective Calling Equipment (AREA)
- Circuit For Audible Band Transducer (AREA)
- Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
- Complex Calculations (AREA)
Abstract
La presente invencion se refiere a un método y sistema para el reconocimiento de locutor, el cual aplica una evaluacion de componente adaptativo a cada marco de lenguaje para atenuar los componentes del aparato no vocal y normalizar los componentes de lenguaje. Se utiliza un modelo de todos los polos de prediccion lineal para formar una nueva funcion de transferencia que tiene un componente promedio en movimiento. Un espectro normalizado es determinado a partir de la nueva funcion de transferencia. El espectro normalizado está definido como que tiene características mejoradas para los componentes del lenguaje. A partir de los componentes de lenguaje mejorados, se obtiene un reconocimiento de locutor mejorado sobre un canal.
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US08/203,988 US5522012A (en) | 1994-02-28 | 1994-02-28 | Speaker identification and verification system |
US08203988 | 1994-02-28 | ||
PCT/US1995/002801 WO1995023408A1 (en) | 1994-02-28 | 1995-02-28 | Speaker identification and verification system |
Publications (2)
Publication Number | Publication Date |
---|---|
MX9603686A true MX9603686A (es) | 1997-12-31 |
MXPA96003686A MXPA96003686A (es) | 1998-09-18 |
Family
ID=
Also Published As
Publication number | Publication date |
---|---|
JPH10500781A (ja) | 1998-01-20 |
EP0748500A1 (en) | 1996-12-18 |
DE69534942T2 (de) | 2006-12-07 |
CA2184256A1 (en) | 1995-08-31 |
EP0748500A4 (en) | 1998-09-23 |
ATE323933T1 (de) | 2006-05-15 |
CN1142274A (zh) | 1997-02-05 |
US5522012A (en) | 1996-05-28 |
WO1995023408A1 (en) | 1995-08-31 |
EP0748500B1 (en) | 2006-04-19 |
DE69534942D1 (de) | 2006-05-24 |
AU683370B2 (en) | 1997-11-06 |
AU2116495A (en) | 1995-09-11 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP0748500A4 (en) | IDENTIFICATION AND VERIFICATION SYSTEM OF THE SPOKEN PERSON | |
US5933801A (en) | Method for transforming a speech signal using a pitch manipulator | |
Hermansky et al. | TRAPS-classifiers of temporal patterns. | |
GB2303237B (en) | Method of training neural networks used for speech recognition | |
FI962572A (fi) | Hajautettu äänentunnistusjärjestelmä | |
NO975475L (no) | Stemmegjenkjenningssystem | |
BR9600762A (pt) | Processo e aparelho para reduzir o ruído em si em um sinal de voz de entrada | |
EP0932141A3 (en) | Method for signal controlled switching between different audio coding schemes | |
GB2307582A (en) | System for recognizing spoken sounds from continuous speech and method of using same | |
EP0871157A3 (en) | A method and a device for recognising speech | |
GB2308483A (en) | Method and system for recognizing a boundary beween sounds in continuous speech | |
IL145285A0 (en) | Speaker recognition | |
WO1999018565A3 (en) | Speech coding | |
US5732388A (en) | Feature extraction method for a speech signal | |
KR20010093327A (ko) | 음성 인식 제거 체계 | |
ATE252265T1 (de) | Sprachaktivitätserkennung | |
Barker et al. | Recent advances in speech fragment decoding techniques. | |
KR20030010432A (ko) | 잡음환경에서의 음성인식장치 | |
CN1009320B (zh) | 语音识别 | |
US20240071411A1 (en) | Determining dialog quality metrics of a mixed audio signal | |
Weintraub | Sound separation and auditory perceptual organization | |
Tibrewala et al. | Multi-stream approach in acoustic modeling | |
Mauuary | Blind equalization for robust telephone based speech recognition | |
Okuno et al. | A new speech enhancement: Speech stream segregation | |
Cheng et al. | A robust front-end algorithm for distributed speech recognition |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
FG | Grant or registration | ||
MM | Annulment or lapse due to non-payment of fees |