MX166745B - Method and apparatus for extracting information-bearing portions of a signal for recognizing varying instances of similar patterns
Info
- Publication number
- MX166745B
- Authority
- MX
- Mexico
- Prior art keywords
- signal
- sequence
- histogram
- vocal
- input
- Prior art date
Links
- 238000000034 method Methods 0.000 title abstract 2
- 230000006835 compression Effects 0.000 abstract 4
- 238000007906 compression Methods 0.000 abstract 4
- 230000001755 vocal effect Effects 0.000 abstract 4
Classifications
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
Landscapes
- Engineering & Computer Science (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Measurement And Recording Of Electrical Phenomena And Electrical Characteristics Of The Living Body (AREA)
Abstract
The present invention relates to a method for processing an input acoustic speech signal to extract individual utterances, comprising the steps of: (a) converting the speech signal into first and second sequences of related speech samples; (b) correlating the first sequence of related speech samples to derive a first histogram representing the input speech signal; (c) correlating the second sequence of related speech samples to derive a second histogram representing the input speech signal; (d) compressing the first and second histograms to derive a plurality of spaced channels; (e) generating a compression histogram, representing at least a portion of the input speech signal, from the spaced channels; (f) repeating steps (a) through (e) to generate a sequence of compression histograms representing a transform of the input speech signal; (g) identifying endpoints for each utterance in the sequence of compression histograms; and (h) extracting individual utterances from the sequence of compression histograms between the identified utterance endpoints.
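The flow of steps (a) through (h) can be illustrated with a minimal Python sketch. The abstract does not say how the two sample sequences are formed, which correlation is used, how channels are spaced, or how endpoints are found, so every concrete choice here is an assumption made for illustration only: the second sequence is taken to be the first difference of the samples, "correlating" is a count of sign agreements at each lag, channels are equal-width groups of lag bins, and an endpoint is any transition across a total-count threshold. All function and parameter names (`to_sequences`, `correlate`, `compress`, `frame_len`, `max_lag`, `n_channels`, `threshold`) are hypothetical and do not come from the patent.

```python
def sign(x):
    """Return -1, 0, or 1 according to the sign of x."""
    return (x > 0) - (x < 0)


def to_sequences(frame):
    """Step (a), assumed form: the raw samples and their first difference."""
    first = list(frame)
    second = [b - a for a, b in zip(frame, frame[1:])]
    return first, second


def correlate(seq, max_lag):
    """Steps (b)/(c), assumed form: bin k counts the sample pairs that are
    k apart, both nonzero, and of the same sign."""
    signs = [sign(x) for x in seq]
    return [
        sum(1 for a, b in zip(signs, signs[k:]) if a != 0 and a == b)
        for k in range(1, max_lag + 1)
    ]


def compress(hist, n_channels):
    """Step (d), assumed form: sum equal-width groups of lag bins.
    Bins left over when len(hist) is not a multiple of n_channels are dropped."""
    width = len(hist) // n_channels
    return [sum(hist[c * width:(c + 1) * width]) for c in range(n_channels)]


def compression_histograms(signal, frame_len, max_lag=8, n_channels=4):
    """Steps (e)/(f): one compression histogram per non-overlapping frame,
    built by concatenating the channels of both histograms."""
    hists = []
    for start in range(0, len(signal) - frame_len + 1, frame_len):
        first, second = to_sequences(signal[start:start + frame_len])
        h1 = correlate(first, max_lag)
        h2 = correlate(second, max_lag)
        hists.append(compress(h1, n_channels) + compress(h2, n_channels))
    return hists


def find_endpoints(hists, threshold=0):
    """Step (g), assumed form: a frame is active when its total count exceeds
    the threshold; return (start, end) frame indices with end exclusive."""
    endpoints, start = [], None
    for idx, hist in enumerate(hists):
        active = sum(hist) > threshold
        if active and start is None:
            start = idx
        elif not active and start is not None:
            endpoints.append((start, idx))
            start = None
    if start is not None:
        endpoints.append((start, len(hists)))
    return endpoints


def extract_utterances(hists, endpoints):
    """Step (h): slice the histogram sequence between identified endpoints."""
    return [hists[start:end] for start, end in endpoints]
```

Calling `compression_histograms` on a list of samples, then `find_endpoints` and `extract_utterances` on the result, yields one list of compression histograms per detected utterance. A real system would need a noise-tolerant threshold rather than the zero default used here.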
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US07/389,682 US5025471A (en) | 1989-08-04 | 1989-08-04 | Method and apparatus for extracting information-bearing portions of a signal for recognizing varying instances of similar patterns |
Publications (1)
Publication Number | Publication Date |
---|---|
MX166745B (es) | 1993-02-01 |
Family
ID=23539284
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
MX021835A MX166745B (es) | 1989-08-04 | 1990-08-03 | Metodo y aparato para extraer porciones que porten informacion de una señal para reconocer puntos variantes de modelos semilares |
Country Status (5)
Country | Link |
---|---|
US (1) | US5025471A (es) |
EP (1) | EP0411290A2 (es) |
AU (1) | AU633588B2 (es) |
CA (1) | CA2020242C (es) |
MX (1) | MX166745B (es) |
Families Citing this family (29)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5329062A (en) * | 1990-07-31 | 1994-07-12 | Casio Computer Co., Ltd. | Method of recording/reproducing waveform and apparatus for reproducing waveform |
JPH04182700A (ja) * | 1990-11-19 | 1992-06-30 | Nec Corp | Speech recognition device |
US5305244B2 (en) * | 1992-04-06 | 1997-09-23 | Computer Products & Services I | Hands-free user-supported portable computer |
US5526466A (en) * | 1993-04-14 | 1996-06-11 | Matsushita Electric Industrial Co., Ltd. | Speech recognition apparatus |
US5870705A (en) * | 1994-10-21 | 1999-02-09 | Microsoft Corporation | Method of setting input levels in a voice recognition system |
US5651056A (en) * | 1995-07-13 | 1997-07-22 | Eting; Leon | Apparatus and methods for conveying telephone numbers and other information via communication devices |
IL125649A (en) * | 1996-03-08 | 2002-12-01 | Motorola Inc | Method and device for detecting signal of a sound sampled from noise |
DE69813597T2 (de) * | 1997-10-15 | 2004-02-12 | British Telecommunications P.L.C. | Pattern recognition using multiple reference models |
EP1580747A3 (en) * | 1997-10-22 | 2005-11-02 | Victor Company of Japan Limited | Audio information processing method, audio information processing apparatus, and method of recording audio information on recording medium |
US6178400B1 (en) | 1998-07-22 | 2001-01-23 | At&T Corp. | Method and apparatus for normalizing speech to facilitate a telephone call |
KR100415217B1 (ko) * | 1998-09-09 | 2004-01-16 | 아사히 가세이 가부시키가이샤 | Speech recognition device |
US6138089A (en) * | 1999-03-10 | 2000-10-24 | Infolio, Inc. | Apparatus system and method for speech compression and decompression |
DE19929462A1 (de) * | 1999-06-26 | 2001-02-22 | Philips Corp Intellectual Pty | Method for training an automatic speech recognizer |
US6920188B1 (en) * | 2000-11-16 | 2005-07-19 | Piradian, Inc. | Method and apparatus for processing a multiple-component wide dynamic range signal |
KR20040014431A (ko) | 2001-08-06 | 2004-02-14 | 가부시키가이샤 인덱스 | Apparatus and method for determining a dog's emotions based on analysis of the vocal characteristics of its cries |
US20040102964A1 (en) * | 2002-11-21 | 2004-05-27 | Rapoport Ezra J. | Speech compression using principal component analysis |
DE10254612A1 (de) * | 2002-11-22 | 2004-06-17 | Humboldt-Universität Zu Berlin | Method for determining specifically relevant acoustic features of sound signals for the analysis of unknown sound signals from a sound source |
US20050075865A1 (en) * | 2003-10-06 | 2005-04-07 | Rapoport Ezra J. | Speech recognition |
US20050102144A1 (en) * | 2003-11-06 | 2005-05-12 | Rapoport Ezra J. | Speech synthesis |
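JP3827317B2 (ja) * | 2004-06-03 | 2006-09-27 | 任天堂株式会社 | Command processing device |
JP4318119B2 (ja) * | 2004-06-18 | 2009-08-19 | 国立大学法人京都大学 | Acoustic signal processing method, acoustic signal processing device, acoustic signal processing system, and computer program |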
US8255216B2 (en) * | 2006-10-30 | 2012-08-28 | Nuance Communications, Inc. | Speech recognition of character sequences |
EP2031581A1 (de) * | 2007-08-31 | 2009-03-04 | Deutsche Thomson OHG | Method for detecting an acoustic event in an audio signal |
US20130080165A1 (en) * | 2011-09-24 | 2013-03-28 | Microsoft Corporation | Model Based Online Normalization of Feature Distribution for Noise Robust Speech Recognition |
US9484022B2 (en) | 2014-05-23 | 2016-11-01 | Google Inc. | Training multiple neural networks with different accuracy |
US9418679B2 (en) * | 2014-08-12 | 2016-08-16 | Honeywell International Inc. | Methods and apparatus for interpreting received speech data using speech recognition |
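JP6238246B2 (ja) * | 2015-04-16 | 2017-11-29 | 本田技研工業株式会社 | Conversation processing device and conversation processing method |
JP6672114B2 (ja) * | 2016-09-13 | 2020-03-25 | 本田技研工業株式会社 | Conversation member optimization device, conversation member optimization method, and program |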
US10983888B1 (en) * | 2018-12-12 | 2021-04-20 | Amazon Technologies, Inc. | System and method for generating dynamic sparse exponential histograms |
Family Cites Families (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4161033A (en) * | 1977-12-22 | 1979-07-10 | Rca Corporation | Correlator/convolver using a second shift register to rotate sample values |
US4230906A (en) * | 1978-05-25 | 1980-10-28 | Time And Space Processing, Inc. | Speech digitizer |
CH637510A5 (de) * | 1978-10-27 | 1983-07-29 | Ibm | Method and arrangement for transmitting speech signals, and application of the method |
JPS5857758B2 (ja) * | 1979-09-28 | 1983-12-21 | 株式会社日立製作所 | Speech pitch period extraction device |
JPS5650398A (en) * | 1979-10-01 | 1981-05-07 | Hitachi Ltd | Sound synthesizer |
JPS5672499A (en) * | 1979-11-19 | 1981-06-16 | Hitachi Ltd | Pretreatment for voice identifier |
US4383135A (en) * | 1980-01-23 | 1983-05-10 | Scott Instruments Corporation | Method and apparatus for speech recognition |
US4373191A (en) * | 1980-11-10 | 1983-02-08 | Motorola Inc. | Absolute magnitude difference function generator for an LPC system |
US4441200A (en) * | 1981-10-08 | 1984-04-03 | Motorola Inc. | Digital voice processing system |
JPS58178396A (ja) * | 1982-04-12 | 1983-10-19 | 株式会社日立製作所 | Standard pattern registration method for speech recognition |
JPS5979300A (ja) * | 1982-10-28 | 1984-05-08 | 電子計算機基本技術研究組合 | Recognition device |
US4672667A (en) * | 1983-06-02 | 1987-06-09 | Scott Instruments Company | Method for signal processing |
US4700360A (en) * | 1984-12-19 | 1987-10-13 | Extrema Systems International Corporation | Extrema coding digitizing signal processing method and apparatus |
AU583871B2 (en) * | 1984-12-31 | 1989-05-11 | Itt Industries, Inc. | Apparatus and method for automatic speech recognition |
EP0212323A3 (en) * | 1985-08-29 | 1988-03-16 | Scott Instruments Corporation | Method and apparatus for generating a signal transformation and the use thereof in signal processings |
JPH01169499A (ja) * | 1987-12-24 | 1989-07-04 | Fujitsu Ltd | Word speech segment extraction method |
- 1989-08-04: US application US07/389,682, document US5025471A (not active: Expired - Fee Related)
- 1990-06-18: AU application AU57561/90A, document AU633588B2 (not active: Ceased)
- 1990-06-19: EP application EP90111563A, document EP0411290A2 (not active: Withdrawn)
- 1990-06-29: CA application CA002020242A, document CA2020242C (not active: Expired - Lifetime)
- 1990-08-03: MX application MX021835A, document MX166745B (status unknown)
Also Published As
Publication number | Publication date |
---|---|
AU5756190A (en) | 1991-02-07 |
CA2020242C (en) | 2002-08-20 |
EP0411290A2 (en) | 1991-02-06 |
US5025471A (en) | 1991-06-18 |
AU633588B2 (en) | 1993-02-04 |
CA2020242A1 (en) | 1991-02-05 |
EP0411290A3 (es) | 1994-02-09 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
MX166745B (es) | Method and apparatus for extracting information-bearing portions of a signal for recognizing varying instances of similar patterns | |
ATE480100T1 (de) | Generation of subtitles for moving pictures | |
EP0736857A3 (en) | Speech recognizing method and apparatus, and speech translating system | |
US4720863A (en) | Method and apparatus for text-independent speaker recognition | |
US5136652A (en) | Amplitude enhanced sampled clipped speech encoder and decoder | |
JPS5210003A (en) | Method and system for analyzing and synthesizing voice signals | |
FR2522179B1 (fr) | Speech recognition method and apparatus for recognizing particular phonemes of the speech signal regardless of the speaker | |
JPS6466698A (en) | Voice recognition equipment | |
SG128406A1 (en) | Character recognizing and translating system and voice recognizing and translating system | |
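CN1979491A (zh) | Method and system for classifying music files | |
CN106303695A (zh) | Method and system for processing audio translation into multilingual text | |
CN1300049A (zh) | Method and apparatus for Mandarin Chinese speech recognition | |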
US4060695A (en) | Speaker identification system using peak value envelop lines of vocal waveforms | |
DE3271705D1 (en) | A system and method for recognizing speech | |
EP1010170A4 (en) | METHOD AND SYSTEM FOR AUTOMATIC EVALUATION OF INDEPENDENT TEXT PRONUNCIATION FOR LANGUAGE LEARNING | |
MX159615A (es) | Improvements to an electronic system for identifying spoken words | |
JPS5648688A (en) | Sound analyser | |
ten Bosch | On the automatic classification of pitch movements | |
JPS6326699A (ja) | Continuous word recognition and recording method | |
EP0173986A3 (en) | Method of and device for the recognition, without previous training of connected words belonging to small vocabularies | |
JP2580768B2 (ja) | Speech recognition device | |
Wang et al. | USTC95-a Putonghua corpus | |
JP2757356B2 (ja) | Word speech recognition method and apparatus | |
KR20220122141A (ko) | Training data collection device, training data collection method, and speech recognition device | |
Patil et al. | Tracing Gujarati Dialects Philogically and Sociolinguistically |