MX166745B - Metodo y aparato para extraer porciones que porten informacion de una señal para reconocer puntos variantes de modelos semilares - Google Patents

Metodo y aparato para extraer porciones que porten informacion de una señal para reconocer puntos variantes de modelos semilares

Info

Publication number
MX166745B
MX166745B MX021835A MX2183590A MX166745B MX 166745 B MX166745 B MX 166745B MX 021835 A MX021835 A MX 021835A MX 2183590 A MX2183590 A MX 2183590A MX 166745 B MX166745 B MX 166745B
Authority
MX
Mexico
Prior art keywords
signal
sequence
histogram
vocal
input
Prior art date
Application number
MX021835A
Other languages
English (en)
Inventor
Bruce E Balentine
Lisan Soapi Lin
Brian Lee Scott
Lloyd Alan Smith
John Mark Newell
Original Assignee
Scott Instr Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Scott Instr Corp filed Critical Scott Instr Corp
Publication of MX166745B publication Critical patent/MX166745B/es

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/02Feature extraction for speech recognition; Selection of recognition unit

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Measurement And Recording Of Electrical Phenomena And Electrical Characteristics Of The Living Body (AREA)

Abstract

La presente invención se refiere a un método para procesar una señal acústica vocal de entrada, para la extracción de pronunciaciones o modos de hablar individuales, que comprende las etapas de; a) convertir la señal vocal en una primera y una segunda secuencia de muestras vocales relacionadas, b) correlacionar la primera secuencia de muestras, vocales relacionadas para derivar un primer histograma que representa la señal vocal de entrada; c) correlacionar la segunda secuencia de muestras vocales relacionadas para segunda secuencia de muestras vocales relacionadas para derivar un segundo histograma que representa la señal vocal de entrada, d) comprimir el primer y segundo histogramas para derivar una pluralidad de canales espaciados, e) generar un histograma de compresión que representa al menos una parte de la señal vocal de entrada a partir de los canales espaciados; f) repetir las etapas a) a (e) para generar una secuencia de histogramas de compresión representando una transferencia de la señal dvocal de entrada g) identificar puntos terminales para cada pronunciación en la secuencia de los histogramas de compresión; y h) extraer pronunciaciones individuales de la secuencia de los histogramas de compresión entre las partes terminales de pronunciación identificadas.
MX021835A 1989-08-04 1990-08-03 Metodo y aparato para extraer porciones que porten informacion de una señal para reconocer puntos variantes de modelos semilares MX166745B (es)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US07/389,682 US5025471A (en) 1989-08-04 1989-08-04 Method and apparatus for extracting information-bearing portions of a signal for recognizing varying instances of similar patterns

Publications (1)

Publication Number Publication Date
MX166745B true MX166745B (es) 1993-02-01

Family

ID=23539284

Family Applications (1)

Application Number Title Priority Date Filing Date
MX021835A MX166745B (es) 1989-08-04 1990-08-03 Metodo y aparato para extraer porciones que porten informacion de una señal para reconocer puntos variantes de modelos semilares

Country Status (5)

Country Link
US (1) US5025471A (es)
EP (1) EP0411290A2 (es)
AU (1) AU633588B2 (es)
CA (1) CA2020242C (es)
MX (1) MX166745B (es)

Families Citing this family (29)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5329062A (en) * 1990-07-31 1994-07-12 Casio Computer Co., Ltd. Method of recording/reproducing waveform and apparatus for reproducing waveform
JPH04182700A (ja) * 1990-11-19 1992-06-30 Nec Corp 音声認識装置
US5305244B2 (en) * 1992-04-06 1997-09-23 Computer Products & Services I Hands-free user-supported portable computer
US5526466A (en) * 1993-04-14 1996-06-11 Matsushita Electric Industrial Co., Ltd. Speech recognition apparatus
US5870705A (en) * 1994-10-21 1999-02-09 Microsoft Corporation Method of setting input levels in a voice recognition system
US5651056A (en) * 1995-07-13 1997-07-22 Eting; Leon Apparatus and methods for conveying telephone numbers and other information via communication devices
IL125649A (en) * 1996-03-08 2002-12-01 Motorola Inc Method and device for detecting signal of a sound sampled from noise
DE69813597T2 (de) * 1997-10-15 2004-02-12 British Telecommunications P.L.C. Mustererkennung, die mehrere referenzmodelle verwendet
EP1580747A3 (en) * 1997-10-22 2005-11-02 Victor Company of Japan Limited Audio information processing method, audio information processing apparatus, and method of recording audio information on recording medium
US6178400B1 (en) 1998-07-22 2001-01-23 At&T Corp. Method and apparatus for normalizing speech to facilitate a telephone call
KR100415217B1 (ko) * 1998-09-09 2004-01-16 아사히 가세이 가부시키가이샤 음성인식 장치
US6138089A (en) * 1999-03-10 2000-10-24 Infolio, Inc. Apparatus system and method for speech compression and decompression
DE19929462A1 (de) * 1999-06-26 2001-02-22 Philips Corp Intellectual Pty Verfahren zum Training eines automatischen Spracherkenners
US6920188B1 (en) * 2000-11-16 2005-07-19 Piradian, Inc. Method and apparatus for processing a multiple-component wide dynamic range signal
KR20040014431A (ko) 2001-08-06 2004-02-14 가부시키가이샤 인덱스 명성의 음성적 특징분석에 기초하는 개의 감정판별장치 및방법
US20040102964A1 (en) * 2002-11-21 2004-05-27 Rapoport Ezra J. Speech compression using principal component analysis
DE10254612A1 (de) * 2002-11-22 2004-06-17 Humboldt-Universität Zu Berlin Verfahren zur Ermittlung spezifisch relevanter akustischer Merkmale von Schallsignalen für die Analyse unbekannter Schallsignale einer Schallerzeugung
US20050075865A1 (en) * 2003-10-06 2005-04-07 Rapoport Ezra J. Speech recognition
US20050102144A1 (en) * 2003-11-06 2005-05-12 Rapoport Ezra J. Speech synthesis
JP3827317B2 (ja) * 2004-06-03 2006-09-27 任天堂株式会社 コマンド処理装置
JP4318119B2 (ja) * 2004-06-18 2009-08-19 国立大学法人京都大学 音響信号処理方法、音響信号処理装置、音響信号処理システム及びコンピュータプログラム
US8255216B2 (en) * 2006-10-30 2012-08-28 Nuance Communications, Inc. Speech recognition of character sequences
EP2031581A1 (de) * 2007-08-31 2009-03-04 Deutsche Thomson OHG Verfahren zum Erkennen eines akustischen Ereignisses in einem Audio-Signal
US20130080165A1 (en) * 2011-09-24 2013-03-28 Microsoft Corporation Model Based Online Normalization of Feature Distribution for Noise Robust Speech Recognition
US9484022B2 (en) 2014-05-23 2016-11-01 Google Inc. Training multiple neural networks with different accuracy
US9418679B2 (en) * 2014-08-12 2016-08-16 Honeywell International Inc. Methods and apparatus for interpreting received speech data using speech recognition
JP6238246B2 (ja) * 2015-04-16 2017-11-29 本田技研工業株式会社 会話処理装置、および会話処理方法
JP6672114B2 (ja) * 2016-09-13 2020-03-25 本田技研工業株式会社 会話メンバー最適化装置、会話メンバー最適化方法およびプログラム
US10983888B1 (en) * 2018-12-12 2021-04-20 Amazon Technologies, Inc. System and method for generating dynamic sparse exponential histograms

Family Cites Families (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4161033A (en) * 1977-12-22 1979-07-10 Rca Corporation Correlator/convolver using a second shift register to rotate sample values
US4230906A (en) * 1978-05-25 1980-10-28 Time And Space Processing, Inc. Speech digitizer
CH637510A5 (de) * 1978-10-27 1983-07-29 Ibm Verfahren und anordnung zur uebertragung von sprachsignalen sowie anwendung des verfahrens.
JPS5857758B2 (ja) * 1979-09-28 1983-12-21 株式会社日立製作所 音声ピッチ周期抽出装置
JPS5650398A (en) * 1979-10-01 1981-05-07 Hitachi Ltd Sound synthesizer
JPS5672499A (en) * 1979-11-19 1981-06-16 Hitachi Ltd Pretreatment for voice identifier
US4383135A (en) * 1980-01-23 1983-05-10 Scott Instruments Corporation Method and apparatus for speech recognition
US4373191A (en) * 1980-11-10 1983-02-08 Motorola Inc. Absolute magnitude difference function generator for an LPC system
US4441200A (en) * 1981-10-08 1984-04-03 Motorola Inc. Digital voice processing system
JPS58178396A (ja) * 1982-04-12 1983-10-19 株式会社日立製作所 音声認識用標準パタ−ン登録方式
JPS5979300A (ja) * 1982-10-28 1984-05-08 電子計算機基本技術研究組合 認識装置
US4672667A (en) * 1983-06-02 1987-06-09 Scott Instruments Company Method for signal processing
US4700360A (en) * 1984-12-19 1987-10-13 Extrema Systems International Corporation Extrema coding digitizing signal processing method and apparatus
AU583871B2 (en) * 1984-12-31 1989-05-11 Itt Industries, Inc. Apparatus and method for automatic speech recognition
EP0212323A3 (en) * 1985-08-29 1988-03-16 Scott Instruments Corporation Method and apparatus for generating a signal transformation and the use thereof in signal processings
JPH01169499A (ja) * 1987-12-24 1989-07-04 Fujitsu Ltd 単語音声区間切出し方式

Also Published As

Publication number Publication date
AU5756190A (en) 1991-02-07
CA2020242C (en) 2002-08-20
EP0411290A2 (en) 1991-02-06
US5025471A (en) 1991-06-18
AU633588B2 (en) 1993-02-04
CA2020242A1 (en) 1991-02-05
EP0411290A3 (es) 1994-02-09

Similar Documents

Publication Publication Date Title
MX166745B (es) Metodo y aparato para extraer porciones que porten informacion de una señal para reconocer puntos variantes de modelos semilares
ATE480100T1 (de) Erzeugung von untertiteln für bewegte bilder
EP0736857A3 (en) Speech recognizing method and apparatus, and speech translating system
US4720863A (en) Method and apparatus for text-independent speaker recognition
US5136652A (en) Amplitude enhanced sampled clipped speech encoder and decoder
JPS5210003A (en) Method and system for analyzing and synthesizing voice signals
FR2522179B1 (fr) Procede et appareil de reconnaissance de paroles permettant de reconnaitre des phonemes particuliers du signal vocal quelle que soit la personne qui parle
JPS6466698A (en) Voice recognition equipment
SG128406A1 (en) Character recognizing and translating system and voice recognizing and translating system
CN1979491A (zh) 对音乐文件分类的方法及其系统
CN106303695A (zh) 音频翻译多语言文字处理方法和系统
CN1300049A (zh) 汉语普通话话音识别的方法和设备
US4060695A (en) Speaker identification system using peak value envelop lines of vocal waveforms
DE3271705D1 (en) A system and method for recognizing speech
EP1010170A4 (en) METHOD AND SYSTEM FOR AUTOMATIC EVALUATION OF INDEPENDENT TEXT PRONUNCIATION FOR LANGUAGE LEARNING
MX159615A (es) Mejoras a sistema electronico para identificar palabras habladas
JPS5648688A (en) Sound analyser
ten Bosch On the automatic classification of pitch movements
JPS6326699A (ja) 連続語認識記録方法
EP0173986A3 (en) Method of and device for the recognition, without previous training of connected words belonging to small vocabularies
JP2580768B2 (ja) 音声認識装置
Wang et al. USTC95-a Putonghua corpus
JP2757356B2 (ja) 単語音声認識方法および装置
KR20220122141A (ko) 학습데이터 수집장치, 학습데이터 수집방법, 및 음성인식장치
Patil et al. Tracing Gujarati Dialects Philogically and Sociolinguistically