KR840003871A - Speech recognition method and apparatus therefor - Google Patents

Speech recognition method and apparatus therefor

Info

Publication number
KR840003871A
Authority
KR
South Korea
Prior art keywords
parameter
transient
speech
detecting
voice
Prior art date
Application number
KR1019830000745A
Other languages
English (en)
Other versions
KR910002198B1 (ko)
Inventor
Masao Watari (and 3 others)
Original Assignee
Norio Ohga
Sony Kabushiki Kaisha
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Norio Ohga, Sony Kabushiki Kaisha
Publication of KR840003871A
Application granted
Publication of KR910002198B1

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/90 Pitch determination of speech signals
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/93 Discriminating between voiced and unvoiced parts of speech signals

Landscapes

  • Engineering & Computer Science (AREA)
  • Acoustics & Sound (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Telephone Function (AREA)
  • Character Discrimination (AREA)
  • Image Processing (AREA)
  • Telephonic Communication Services (AREA)
  • Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)

Abstract

No content

Description

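Speech recognition method and apparatus therefor
This publication discloses only the essential parts, so the full text is not included.
Figures 1 to 4 are drawings for explaining the speech recognition apparatus.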

Claims (4)

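  1. A speech recognition method having means for detecting transient portions between phonemes, including silence, in which the speech of each detected transient portion is extracted over a predetermined length and converted into parameters, and these parameters are used as the basic unit of recognition.
  2. The speech recognition method of claim 1, characterized by a speech transient point detection method in which the means for detecting the transient portions has means for extracting acoustic parameters by equally superimposing the input speech signal in accordance with human auditory characteristics, and means for normalizing the level of these acoustic parameters, and in which the normalized acoustic parameters are monitored over a plurality of frames and peaks of the acoustic parameters are detected.
  3. A speech recognition apparatus comprising a speech transient point detector that detects transient portions between phonemes, including silence; a converter that, in response to a signal from the speech transient point detector, extracts the speech of the transient portion over a predetermined length and converts it into parameters; and a recognizer that uses the parameters from the converter as the basic unit of recognition.
  4. The speech recognition apparatus of claim 3, characterized in that the speech transient point detector comprises a speech parameter extractor that extracts acoustic parameters by equally superimposing the input speech signal in accordance with human auditory characteristics; a level normalizer that normalizes the level of the acoustic parameters from the speech parameter extractor; and a peak detector that monitors the normalized acoustic parameters over a plurality of frames and detects peaks of the acoustic parameters.
    ※ Note: Published on the basis of the contents of the original application.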
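The claims name the processing stages but give no formulas, so the following is only a minimal sketch of how the stages could fit together, under several assumptions. The input is assumed to be a list of frames, each already holding log band levels from some auditory front end (that front end is not implemented here). Level normalization is assumed to mean subtracting each frame's mean level; the "monitoring over a plurality of frames" is assumed to be a windowed variance summed over bands; and the recognizer is a hypothetical nearest-template matcher. All function names, the window size, and the threshold are illustrative and do not come from the patent.

```python
def normalize_levels(frames):
    """Subtract each frame's mean band level so overall loudness cancels out."""
    normalized = []
    for frame in frames:
        mean = sum(frame) / len(frame)
        normalized.append([value - mean for value in frame])
    return normalized


def transition_measure(frames, half_window):
    """Score each frame by the spread of every band inside a window of frames.

    The score is the sum over bands of squared deviations from the window
    mean. Frames too close to either end keep a score of 0.0.
    """
    n_frames = len(frames)
    n_bands = len(frames[0])
    scores = [0.0] * n_frames
    for t in range(half_window, n_frames - half_window):
        window = frames[t - half_window:t + half_window + 1]
        total = 0.0
        for band in range(n_bands):
            column = [frame[band] for frame in window]
            mean = sum(column) / len(column)
            total += sum((value - mean) ** 2 for value in column)
        scores[t] = total
    return scores


def find_peaks(scores, threshold):
    """Return indices of local maxima of the score that exceed the threshold."""
    peaks = []
    for t in range(1, len(scores) - 1):
        if (scores[t] > threshold
                and scores[t] > scores[t - 1]
                and scores[t] >= scores[t + 1]):
            peaks.append(t)
    return peaks


def extract_segments(frames, peaks, half_length):
    """Cut a fixed-length run of frames around each peak and flatten it."""
    segments = []
    for peak in peaks:
        start, stop = peak - half_length, peak + half_length + 1
        if start >= 0 and stop <= len(frames):
            segments.append([v for frame in frames[start:stop] for v in frame])
    return segments


def nearest_template(segment, templates):
    """Hypothetical recognizer: label of the closest template (squared distance)."""
    def distance(label):
        return sum((a - b) ** 2 for a, b in zip(segment, templates[label]))
    return min(templates, key=distance)


# Toy input: two bands, sound "A" (band 0 dominant) moving to sound "B"
# (band 1 dominant) through one intermediate frame. Loudness varies per frame.
frames = [[5.0, 1.0], [6.0, 2.0], [5.0, 1.0], [7.0, 3.0], [5.0, 1.0],
          [3.0, 3.0],
          [1.0, 5.0], [2.0, 6.0], [1.0, 5.0], [3.0, 7.0], [1.0, 5.0]]

normalized = normalize_levels(frames)
peaks = find_peaks(transition_measure(normalized, half_window=2), threshold=10.0)
segments = extract_segments(normalized, peaks, half_length=2)

templates = {
    "A->B": [2, -2, 2, -2, 0, 0, -2, 2, -2, 2],
    "B->A": [-2, 2, -2, 2, 0, 0, 2, -2, 2, -2],
}
print(peaks)                                    # [5]
print(nearest_template(segments[0], templates))  # A->B
```

In this toy input the loudness changes from frame to frame, yet the only peak falls on the frame where the spectral shape changes, which is the point of normalizing the level before looking for peaks. The fixed-length segment around that peak then serves as the unit handed to the recognizer.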
KR1019830000745A 1982-02-25 1983-02-24 Speech recognition method and apparatus therefor KR910002198B1 (ko)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
JP29471 1982-02-25
JP57-29471 1982-02-25
JP57029471A JPS58145998A (ja) 1982-02-25 1982-02-25 Speech transient point detection method

Publications (2)

Publication Number Publication Date
KR840003871A (ko) 1984-10-04
KR910002198B1 (ko) 1991-04-06

Family

ID=12277008

Family Applications (1)

Application Number Title Priority Date Filing Date
KR1019830000745A KR910002198B1 (ko) 1982-02-25 1983-02-24 Speech recognition method and apparatus therefor

Country Status (8)

Country Link
US (1) US4592085A (ko)
JP (1) JPS58145998A (ko)
KR (1) KR910002198B1 (ko)
CA (1) CA1193732A (ko)
DE (1) DE3306730A1 (ko)
FR (1) FR2522179B1 (ko)
GB (2) GB2118343B (ko)
NL (1) NL192701C (ko)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6351723B1 (en) 1996-08-29 2002-02-26 Fujitsu Limited Failure diagnostic method and apparatus for equipment and recording medium in which program causing computer system to execute process in accordance with such method is stored

Families Citing this family (50)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4972490A (en) * 1981-04-03 1990-11-20 At&T Bell Laboratories Distance measurement control of a multiple detector system
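JPS5997200A (ja) * 1982-11-26 1984-06-04 Hitachi, Ltd. Speech recognition system
JPS59166999A (ja) * 1983-03-11 1984-09-20 Sony Corporation Speech transient point detection method
JPS59170897A (ja) * 1983-03-17 1984-09-27 Sony Corporation Speech transient point detection method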
US5131043A (en) * 1983-09-05 1992-07-14 Matsushita Electric Industrial Co., Ltd. Method of and apparatus for speech recognition wherein decisions are made based on phonemes
US4991216A (en) * 1983-09-22 1991-02-05 Matsushita Electric Industrial Co., Ltd. Method for speech recognition
FR2554623B1 (fr) * 1983-11-08 1986-08-14 Texas Instruments France Speaker-independent speech analysis method
US4718093A (en) * 1984-03-27 1988-01-05 Exxon Research And Engineering Company Speech recognition method including biased principal components
US4718088A (en) * 1984-03-27 1988-01-05 Exxon Research And Engineering Company Speech recognition training method
US4713778A (en) * 1984-03-27 1987-12-15 Exxon Research And Engineering Company Speech recognition method
US4718092A (en) * 1984-03-27 1988-01-05 Exxon Research And Engineering Company Speech recognition activation and deactivation method
US4713777A (en) * 1984-05-27 1987-12-15 Exxon Research And Engineering Company Speech recognition method having noise immunity
US5241649A (en) * 1985-02-18 1993-08-31 Matsushita Electric Industrial Co., Ltd. Voice recognition method
DE3514286A1 (de) * 1985-04-19 1986-10-23 Siemens AG, 1000 Berlin und 8000 München System for recognizing individually spoken words
CA1250368A (en) * 1985-05-28 1989-02-21 Tetsu Taguchi Formant extractor
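JPS62220998A (ja) * 1986-03-22 1987-09-29 Director-General of the Agency of Industrial Science and Technology Speech recognition apparatus
JPS63158596A (ja) * 1986-12-23 1988-07-01 Toshiba Corporation Phoneme similarity calculation apparatus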
US5007093A (en) * 1987-04-03 1991-04-09 At&T Bell Laboratories Adaptive threshold voiced detector
US4860360A (en) * 1987-04-06 1989-08-22 Gte Laboratories Incorporated Method of evaluating speech
US5027408A (en) * 1987-04-09 1991-06-25 Kroeker John P Speech-recognition circuitry employing phoneme estimation
US5136653A (en) * 1988-01-11 1992-08-04 Ezel, Inc. Acoustic recognition system using accumulate power series
US5168524A (en) * 1989-08-17 1992-12-01 Eliza Corporation Speech-recognition circuitry employing nonlinear processing, speech element modeling and phoneme estimation
JPH03120598A (ja) * 1989-10-03 1991-05-22 Canon Inc Speech recognition method and apparatus
EP0438662A2 (en) * 1990-01-23 1991-07-31 International Business Machines Corporation Apparatus and method of grouping utterances of a phoneme into context-dependent categories based on sound-similarity for automatic speech recognition
DE4111781A1 (de) * 1991-04-11 1992-10-22 Ibm Computer system for speech recognition
JP3716870B2 (ja) * 1995-05-31 2005-11-16 Sony Corporation Speech recognition apparatus and speech recognition method
US5724410A (en) * 1995-12-18 1998-03-03 Sony Corporation Two-way voice messaging terminal having a speech to text converter
KR0173923B1 (ko) * 1995-12-22 1999-04-01 Yang Seung-taek Phoneme segmentation method using a multilayer neural network
US6006186A (en) * 1997-10-16 1999-12-21 Sony Corporation Method and apparatus for a parameter sharing speech recognition system
US6230122B1 (en) 1998-09-09 2001-05-08 Sony Corporation Speech detection with noise suppression based on principal components analysis
US6173258B1 (en) * 1998-09-09 2001-01-09 Sony Corporation Method for reducing noise distortions in a speech recognition system
US6768979B1 (en) 1998-10-22 2004-07-27 Sony Corporation Apparatus and method for noise attenuation in a speech recognition system
US6266642B1 (en) 1999-01-29 2001-07-24 Sony Corporation Method and portable apparatus for performing spoken language translation
US6243669B1 (en) 1999-01-29 2001-06-05 Sony Corporation Method and apparatus for providing syntactic analysis and data structure for translation knowledge in example-based language translation
US6278968B1 (en) 1999-01-29 2001-08-21 Sony Corporation Method and apparatus for adaptive speech recognition hypothesis construction and selection in a spoken language translation system
US6356865B1 (en) * 1999-01-29 2002-03-12 Sony Corporation Method and apparatus for performing spoken language translation
US6442524B1 (en) 1999-01-29 2002-08-27 Sony Corporation Analyzing inflectional morphology in a spoken language translation system
US6282507B1 (en) 1999-01-29 2001-08-28 Sony Corporation Method and apparatus for interactive source language expression recognition and alternative hypothesis presentation and selection
US6223150B1 (en) 1999-01-29 2001-04-24 Sony Corporation Method and apparatus for parsing in a spoken language translation system
US6374224B1 (en) 1999-03-10 2002-04-16 Sony Corporation Method and apparatus for style control in natural language generation
US7139708B1 (en) 1999-03-24 2006-11-21 Sony Corporation System and method for speech recognition using an enhanced phone set
US20010029363A1 (en) * 1999-05-03 2001-10-11 Lin J. T. Methods and apparatus for presbyopia correction using ultraviolet and infrared lasers
KR100608062B1 (ko) * 2004-08-04 2006-08-02 Samsung Electronics Co., Ltd. Method and apparatus for high-frequency reconstruction of audio data
US8332212B2 (en) * 2008-06-18 2012-12-11 Cogi, Inc. Method and system for efficient pacing of speech for transcription
US8903847B2 (en) * 2010-03-05 2014-12-02 International Business Machines Corporation Digital media voice tags in social networks
US20120246238A1 (en) 2011-03-21 2012-09-27 International Business Machines Corporation Asynchronous messaging tags
US8688090B2 (en) 2011-03-21 2014-04-01 International Business Machines Corporation Data session preferences
US20120244842A1 (en) 2011-03-21 2012-09-27 International Business Machines Corporation Data Session Synchronization With Phone Numbers
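JP2013164572A (ja) * 2012-01-10 2013-08-22 Toshiba Corp Speech feature extraction apparatus, speech feature extraction method, and speech feature extraction program
JP6461660B2 (ja) * 2015-03-19 2019-01-30 Toshiba Corporation Detection apparatus, detection method, and program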

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3344233A (en) * 1967-09-26 Method and apparatus for segmenting speech into phonemes
GB981154A (en) * 1961-03-20 1965-01-20 Nippon Telegraph & Telephone Improved phonetic typewriter system
US3582559A (en) * 1969-04-21 1971-06-01 Scope Inc Method and apparatus for interpretation of time-varying signals
JPS5850360B2 (ja) * 1978-05-12 1983-11-10 Hitachi, Ltd. Preprocessing method in a speech recognition apparatus
US4412098A (en) * 1979-09-10 1983-10-25 Interstate Electronics Corporation Audio signal recognition computer
US4454586A (en) * 1981-11-19 1984-06-12 At&T Bell Laboratories Method and apparatus for generating speech pattern templates

Also Published As

Publication number Publication date
NL192701B (nl) 1997-08-01
US4592085A (en) 1986-05-27
GB8305292D0 (en) 1983-03-30
GB8429480D0 (en) 1985-01-03
GB2153127A (en) 1985-08-14
GB2118343A (en) 1983-10-26
DE3306730C2 (ko) 1991-10-17
FR2522179B1 (fr) 1986-05-02
GB2118343B (en) 1986-01-02
FR2522179A1 (fr) 1983-08-26
GB2153127B (en) 1986-01-15
JPS58145998A (ja) 1983-08-31
NL192701C (nl) 1997-12-02
DE3306730A1 (de) 1983-09-01
NL8300718A (nl) 1983-09-16
CA1193732A (en) 1985-09-17
KR910002198B1 (ko) 1991-04-06
JPH0441356B2 (ko) 1992-07-08

Similar Documents

Publication Publication Date Title
KR840003871A (ko) Speech recognition method and apparatus therefor
Martin On judging pauses in spontaneous speech
CN103617799B (zh) Method for detecting the pronunciation quality of English sentences, adapted to mobile devices
US4284846A (en) System and method for sound recognition
DE69427083D1 (de) Speech recognition system for multiple languages
CA2196554A1 (en) Test Method
Ericsdotter et al. Gender differences in vowel duration in read Swedish: Preliminary results
Mishra et al. An Overview of Hindi Speech Recognition
JPH0797279B2 (ja) Speech recognition apparatus
JPS6138479B2 (ko)
EP0173986A3 (en) Method of and device for the recognition, without previous training of connected words belonging to small vocabularies
Fan et al. Power-normalized PLP (PNPLP) feature for robust speech recognition
Patil et al. Identifying Perceptually Similar Languages Using Teager Energy Based Cepstrum.
Dersch A decision logic for speech recognition
JPS6454494A (en) Voice segmentation apparatus
JPS6028698A (ja) Voice/silence detection apparatus
JPS6370298A (ja) Geminate consonant (sokuon) recognition apparatus
Undhad et al. Exploiting speech source information for vowel landmark detection for low resource language
Karlsson Voluntary and involuntary speech variations–a few examples from the VeriVox database
JPS59170894A (ja) Speech segment extraction method
JPS63217399A (ja) Speech segment detection apparatus
Schauer Very low frequency characteristics of speech
KR830009546A (ko) Method and apparatus for comparative detection of stress and pitch in speech using semiconductor memory elements
Heldner Focal accent–f0 movements and beyond: Introduction
KR950015055A (ko) Scoring device for a karaoke function

Legal Events

Date Code Title Description
A201 Request for examination
G160 Decision to publish patent application
E701 Decision to grant or registration of patent right
GRNT Written decision to grant
FPAY Annual fee payment

Payment date: 20020319

Year of fee payment: 12

EXPY Expiration of term