MX166745B - Metodo y aparato para extraer porciones que porten informacion de una señal para reconocer puntos variantes de modelos semilares - Google Patents
Metodo y aparato para extraer porciones que porten informacion de una señal para reconocer puntos variantes de modelos semilaresInfo
- Publication number
- MX166745B MX166745B MX021835A MX2183590A MX166745B MX 166745 B MX166745 B MX 166745B MX 021835 A MX021835 A MX 021835A MX 2183590 A MX2183590 A MX 2183590A MX 166745 B MX166745 B MX 166745B
- Authority
- MX
- Mexico
- Prior art keywords
- signal
- sequence
- histogram
- vocal
- input
- Prior art date
Links
- 238000000034 method Methods 0.000 title abstract 2
- 230000006835 compression Effects 0.000 abstract 4
- 238000007906 compression Methods 0.000 abstract 4
- 230000001755 vocal effect Effects 0.000 abstract 4
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
Landscapes
- Engineering & Computer Science (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
- Measurement And Recording Of Electrical Phenomena And Electrical Characteristics Of The Living Body (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
La presente invención se refiere a un método para procesar una señal acústica vocal de entrada, para la extracción de pronunciaciones o modos de hablar individuales, que comprende las etapas de; a) convertir la señal vocal en una primera y una segunda secuencia de muestras vocales relacionadas, b) correlacionar la primera secuencia de muestras, vocales relacionadas para derivar un primer histograma que representa la señal vocal de entrada; c) correlacionar la segunda secuencia de muestras vocales relacionadas para segunda secuencia de muestras vocales relacionadas para derivar un segundo histograma que representa la señal vocal de entrada, d) comprimir el primer y segundo histogramas para derivar una pluralidad de canales espaciados, e) generar un histograma de compresión que representa al menos una parte de la señal vocal de entrada a partir de los canales espaciados; f) repetir las etapas a) a (e) para generar una secuencia de histogramas de compresión representando una transferencia de la señal dvocal de entrada g) identificar puntos terminales para cada pronunciación en la secuencia de los histogramas de compresión; y h) extraer pronunciaciones individuales de la secuencia de los histogramas de compresión entre las partes terminales de pronunciación identificadas.
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US07/389,682 US5025471A (en) | 1989-08-04 | 1989-08-04 | Method and apparatus for extracting information-bearing portions of a signal for recognizing varying instances of similar patterns |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| MX166745B true MX166745B (es) | 1993-02-01 |
Family
ID=23539284
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| MX021835A MX166745B (es) | 1989-08-04 | 1990-08-03 | Metodo y aparato para extraer porciones que porten informacion de una señal para reconocer puntos variantes de modelos semilares |
Country Status (5)
| Country | Link |
|---|---|
| US (1) | US5025471A (es) |
| EP (1) | EP0411290A2 (es) |
| AU (1) | AU633588B2 (es) |
| CA (1) | CA2020242C (es) |
| MX (1) | MX166745B (es) |
Families Citing this family (29)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5329062A (en) * | 1990-07-31 | 1994-07-12 | Casio Computer Co., Ltd. | Method of recording/reproducing waveform and apparatus for reproducing waveform |
| JPH04182700A (ja) * | 1990-11-19 | 1992-06-30 | Nec Corp | 音声認識装置 |
| US5305244B2 (en) * | 1992-04-06 | 1997-09-23 | Computer Products & Services I | Hands-free user-supported portable computer |
| US5526466A (en) * | 1993-04-14 | 1996-06-11 | Matsushita Electric Industrial Co., Ltd. | Speech recognition apparatus |
| US5870705A (en) * | 1994-10-21 | 1999-02-09 | Microsoft Corporation | Method of setting input levels in a voice recognition system |
| US5651056A (en) * | 1995-07-13 | 1997-07-22 | Eting; Leon | Apparatus and methods for conveying telephone numbers and other information via communication devices |
| WO1997033273A1 (en) * | 1996-03-08 | 1997-09-12 | Motorola Inc. | Method and recognizer for recognizing a sampled sound signal in noise |
| US6389392B1 (en) * | 1997-10-15 | 2002-05-14 | British Telecommunications Public Limited Company | Method and apparatus for speaker recognition via comparing an unknown input to reference data |
| EP1580747A3 (en) * | 1997-10-22 | 2005-11-02 | Victor Company of Japan Limited | Audio information processing method, audio information processing apparatus, and method of recording audio information on recording medium |
| US6178400B1 (en) | 1998-07-22 | 2001-01-23 | At&T Corp. | Method and apparatus for normalizing speech to facilitate a telephone call |
| CN1280783C (zh) * | 1998-09-09 | 2006-10-18 | 旭化成株式会社 | 声音识别装置和声音识别方法 |
| US6138089A (en) * | 1999-03-10 | 2000-10-24 | Infolio, Inc. | Apparatus system and method for speech compression and decompression |
| DE19929462A1 (de) * | 1999-06-26 | 2001-02-22 | Philips Corp Intellectual Pty | Verfahren zum Training eines automatischen Spracherkenners |
| US6920188B1 (en) * | 2000-11-16 | 2005-07-19 | Piradian, Inc. | Method and apparatus for processing a multiple-component wide dynamic range signal |
| AU2002230151B2 (en) | 2001-08-06 | 2006-08-03 | Index Corporation | Apparatus for determining dog's emotions by vocal analysis of barking sounds and method for the same |
| US20040102964A1 (en) * | 2002-11-21 | 2004-05-27 | Rapoport Ezra J. | Speech compression using principal component analysis |
| DE10254612A1 (de) * | 2002-11-22 | 2004-06-17 | Humboldt-Universität Zu Berlin | Verfahren zur Ermittlung spezifisch relevanter akustischer Merkmale von Schallsignalen für die Analyse unbekannter Schallsignale einer Schallerzeugung |
| US20050075865A1 (en) * | 2003-10-06 | 2005-04-07 | Rapoport Ezra J. | Speech recognition |
| US20050102144A1 (en) * | 2003-11-06 | 2005-05-12 | Rapoport Ezra J. | Speech synthesis |
| JP3827317B2 (ja) * | 2004-06-03 | 2006-09-27 | 任天堂株式会社 | コマンド処理装置 |
| JP4318119B2 (ja) * | 2004-06-18 | 2009-08-19 | 国立大学法人京都大学 | 音響信号処理方法、音響信号処理装置、音響信号処理システム及びコンピュータプログラム |
| US8255216B2 (en) * | 2006-10-30 | 2012-08-28 | Nuance Communications, Inc. | Speech recognition of character sequences |
| EP2031581A1 (de) * | 2007-08-31 | 2009-03-04 | Deutsche Thomson OHG | Verfahren zum Erkennen eines akustischen Ereignisses in einem Audio-Signal |
| US20130080165A1 (en) * | 2011-09-24 | 2013-03-28 | Microsoft Corporation | Model Based Online Normalization of Feature Distribution for Noise Robust Speech Recognition |
| US9484022B2 (en) | 2014-05-23 | 2016-11-01 | Google Inc. | Training multiple neural networks with different accuracy |
| US9418679B2 (en) * | 2014-08-12 | 2016-08-16 | Honeywell International Inc. | Methods and apparatus for interpreting received speech data using speech recognition |
| JP6238246B2 (ja) * | 2015-04-16 | 2017-11-29 | 本田技研工業株式会社 | 会話処理装置、および会話処理方法 |
| JP6672114B2 (ja) * | 2016-09-13 | 2020-03-25 | 本田技研工業株式会社 | 会話メンバー最適化装置、会話メンバー最適化方法およびプログラム |
| US10983888B1 (en) * | 2018-12-12 | 2021-04-20 | Amazon Technologies, Inc. | System and method for generating dynamic sparse exponential histograms |
Family Cites Families (16)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US4161033A (en) * | 1977-12-22 | 1979-07-10 | Rca Corporation | Correlator/convolver using a second shift register to rotate sample values |
| US4230906A (en) * | 1978-05-25 | 1980-10-28 | Time And Space Processing, Inc. | Speech digitizer |
| CH637510A5 (de) * | 1978-10-27 | 1983-07-29 | Ibm | Verfahren und anordnung zur uebertragung von sprachsignalen sowie anwendung des verfahrens. |
| JPS5857758B2 (ja) * | 1979-09-28 | 1983-12-21 | 株式会社日立製作所 | 音声ピッチ周期抽出装置 |
| JPS5650398A (en) * | 1979-10-01 | 1981-05-07 | Hitachi Ltd | Sound synthesizer |
| JPS5672499A (en) * | 1979-11-19 | 1981-06-16 | Hitachi Ltd | Pretreatment for voice identifier |
| US4383135A (en) * | 1980-01-23 | 1983-05-10 | Scott Instruments Corporation | Method and apparatus for speech recognition |
| US4373191A (en) * | 1980-11-10 | 1983-02-08 | Motorola Inc. | Absolute magnitude difference function generator for an LPC system |
| US4441200A (en) * | 1981-10-08 | 1984-04-03 | Motorola Inc. | Digital voice processing system |
| JPS58178396A (ja) * | 1982-04-12 | 1983-10-19 | 株式会社日立製作所 | 音声認識用標準パタ−ン登録方式 |
| JPS5979300A (ja) * | 1982-10-28 | 1984-05-08 | 電子計算機基本技術研究組合 | 認識装置 |
| US4672667A (en) * | 1983-06-02 | 1987-06-09 | Scott Instruments Company | Method for signal processing |
| US4700360A (en) * | 1984-12-19 | 1987-10-13 | Extrema Systems International Corporation | Extrema coding digitizing signal processing method and apparatus |
| AU583871B2 (en) * | 1984-12-31 | 1989-05-11 | Itt Industries, Inc. | Apparatus and method for automatic speech recognition |
| EP0212323A3 (en) * | 1985-08-29 | 1988-03-16 | Scott Instruments Corporation | Method and apparatus for generating a signal transformation and the use thereof in signal processings |
| JPH01169499A (ja) * | 1987-12-24 | 1989-07-04 | Fujitsu Ltd | 単語音声区間切出し方式 |
-
1989
- 1989-08-04 US US07/389,682 patent/US5025471A/en not_active Expired - Fee Related
-
1990
- 1990-06-18 AU AU57561/90A patent/AU633588B2/en not_active Ceased
- 1990-06-19 EP EP90111563A patent/EP0411290A2/en not_active Withdrawn
- 1990-06-29 CA CA002020242A patent/CA2020242C/en not_active Expired - Lifetime
- 1990-08-03 MX MX021835A patent/MX166745B/es unknown
Also Published As
| Publication number | Publication date |
|---|---|
| CA2020242A1 (en) | 1991-02-05 |
| AU5756190A (en) | 1991-02-07 |
| CA2020242C (en) | 2002-08-20 |
| EP0411290A3 (es) | 1994-02-09 |
| EP0411290A2 (en) | 1991-02-06 |
| US5025471A (en) | 1991-06-18 |
| AU633588B2 (en) | 1993-02-04 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| MX166745B (es) | Metodo y aparato para extraer porciones que porten informacion de una señal para reconocer puntos variantes de modelos semilares | |
| ATE480100T1 (de) | Erzeugung von untertiteln für bewegte bilder | |
| US4720863A (en) | Method and apparatus for text-independent speaker recognition | |
| JPS6466698A (en) | Voice recognition equipment | |
| SG128406A1 (en) | Character recognizing and translating system and voice recognizing and translating system | |
| JPS5972496A (ja) | 単音識別装置 | |
| CN1300049A (zh) | 汉语普通话话音识别的方法和设备 | |
| US4060695A (en) | Speaker identification system using peak value envelop lines of vocal waveforms | |
| DE3271705D1 (en) | A system and method for recognizing speech | |
| KR20220122141A (ko) | 학습데이터 수집장치, 학습데이터 수집방법, 및 음성인식장치 | |
| MX159615A (es) | Mejoras a sistema electronico para identificar palabras habladas | |
| JPS5648688A (en) | Sound analyser | |
| US4783808A (en) | Connected word recognition enrollment method | |
| Boogaart et al. | Evaluating the overall comprehensibility of speech synthesizers | |
| Bosch | On the automatic classification of pitch movements | |
| JP2813209B2 (ja) | 大語彙音声認識装置 | |
| Purton | Speech recognition using autocorrelation analysis | |
| EP0173986A3 (en) | Method of and device for the recognition, without previous training of connected words belonging to small vocabularies | |
| KR100513038B1 (ko) | 다채널 확장된 음성인식 시스템에서의 음성데이터 저장 방법 | |
| JP2580768B2 (ja) | 音声認識装置 | |
| RU98103129A (ru) | Способ построения словаря для перевода с иностранного языка | |
| Iwata et al. | Pause rule for Japanese text-to-speech conversion using pause insertion probability | |
| JPS6421498A (en) | Automatically scoring system and apparatus | |
| KR930010781A (ko) | 문서 낭독 시스템 | |
| Aziz | Nasal aspirates in Urdu |