KR910015962A - Voice signal processing device - Google Patents

Voice signal processing device Download PDF

Info

Publication number
KR910015962A
KR910015962A KR1019910002431A KR910002431A KR910015962A KR 910015962 A KR910015962 A KR 910015962A KR 1019910002431 A KR1019910002431 A KR 1019910002431A KR 910002431 A KR910002431 A KR 910002431A KR 910015962 A KR910015962 A KR 910015962A
Authority
KR
South Korea
Prior art keywords
average value
vowel
analysis
output
consonant
Prior art date
Application number
KR1019910002431A
Other languages
Korean (ko)
Other versions
KR960005740B1 (en
Inventor
아끼라 노하라
죠지 카네
Original Assignee
다니이 아끼오
마쯔시다덴기산교 가부시기가이샤
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from JP2033211A external-priority patent/JP2959792B2/en
Priority claimed from JP3321090A external-priority patent/JP2959791B2/en
Application filed by 다니이 아끼오, 마쯔시다덴기산교 가부시기가이샤 filed Critical 다니이 아끼오
Publication of KR910015962A publication Critical patent/KR910015962A/en
Application granted granted Critical
Publication of KR960005740B1 publication Critical patent/KR960005740B1/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/02Feature extraction for speech recognition; Selection of recognition unit
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/93Discriminating between voiced and unvoiced parts of speech signals
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Quality & Reliability (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Circuit For Audible Band Transducer (AREA)
  • Noise Elimination (AREA)
  • Soundproofing, Sound Blocking, And Sound Damping (AREA)
  • Signal Processing Not Specific To The Method Of Recording And Reproducing (AREA)
  • Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
  • Electrophonic Musical Instruments (AREA)
  • Selective Calling Equipment (AREA)
  • Communication Control (AREA)

Abstract

내용 없음No content

Description

음성신호처리장치Voice signal processing device

본 내용은 요부공개 건이므로 전문내용을 수록하지 않았음Since this is an open matter, no full text was included.

제2도는 본 발명의 제1의 발명에 관한 음성신호장치의 실시예를 도시한 블록도, 제3도는 동 실시예에 있어서의 켑스트럼 피이크를 도시한 그래프, 제4도는 본 발명의 제2의 발명에 관한 음성신호처리장치의 실시예를 도시한 블록도.FIG. 2 is a block diagram showing an embodiment of the audio signal apparatus according to the first invention of the present invention, FIG. 3 is a graph showing the cepstrum peak in the embodiment, and FIG. 4 is a second diagram of the present invention. A block diagram showing an embodiment of an audio signal processing apparatus according to the invention.

Claims (6)

음성입력신호를 주파수분석하는 주파분석수단과, 그 주파수분석수단의 출력을 피치추출분석하는 피치추출분석수단과, 그 피치추출분석출력에 있어서의 피치를 검출하는 피치럼출수단과, 상기 피치추출분석수단의 분석출력에 있어서의 평균치레벨의 산출하는 평균치산출수단과, 상기 피치검출수단의 피치검출정보와 상기 평균치산출수단의 평균치정보에 의거해서, 상기 피치에 의거해서 모음을 판정하고, 상기 평균치정보의 레벨에 의거해서 자음을 판정해서 모음, 자음을 판정하는 모음/자음 판정수단을 구비한 것을 특징으로하는 음성신호처리장치.A frequency analysis means for frequency analysis of the audio input signal, a pitch extraction analysis means for pitch extraction analysis of the output of the frequency analysis means, a pitch rum extraction means for detecting a pitch in the pitch extraction analysis output, and the pitch extraction analysis Based on the average value calculating means for calculating the average value level in the analysis output of the means, the pitch detection information of the pitch detecting means and the average value information of the average value calculating means, the vowel is determined based on the pitch, and the average value information And a vowel / consonant determination means for determining vowels and consonants based on the consonants. 음성입력신호를 대역분할하는 대역분할수단과, 그 대역분할출력을 켑스트럼분석하는 켑스트럼분석수단과, 그 켑스트럼분석수단의 켑스트럼분석출력에 있어서의 켑스트럼 피이크를 검출하는 피이크검출수단과, 상기 켑스트럼분석수단의 켑스트럼분석출력에 있어서의 평균레벨을 산출하는 평균치산출수단과, 상기 피이크검출수단의 피이크검출정보와 상기 평균치산출수단의 평균치 정보에 의거해서, 상기 피이크에 의거해서 모음을 판정하고, 상기 평균치정보의 레벨에 의거해서 자음을 판정해서 모음, 자음을 판정하는 모음/자음판정수단을 구비한 것을 특징으로 하는 음성신호처리장치.Band dividing means for band-segmenting an audio input signal, thrip analysis means for performing a spectral analysis of the band division output, and a spectral peak at the spectral analysis output of the spectral analysis means; On the basis of the peak detection means, the average value calculation means for calculating an average level in the chop strm analysis output of the shock strut analysis means, the peak detection information of the peak detection means, and the average value information of the average value calculation means. And vowel / consonant determination means for determining vowel based on the peak, and for determining vowel and consonant based on the level of the average value information. 제2항에 있어서 모음/자음 판정수단은, 상기 피이크검출수단에서의 검출피이크 및 임계 설정부가 설정한 임계치를 비교하는 제1비교기와, 상기 평균치산출수단에 의한 산출평균치 및 임계설정부에서 설정된 소정의 임계치를 비교하는 제2비교기와, 그들 제1, 제2비교기의 비교결과에 의거해서, 모음, 자음을 판정하여 결과를 출력하는 모음/자음판정회로를 구비한 것을 특징으로 하는 음성신호처리장치.The vowel / consonant determination means according to claim 2, wherein the vowel / consonant determination means comprises: a first comparator for comparing the detected peak in the peak detection means and a threshold set by the threshold setting portion, and a predetermined average set by the average value calculation means and the threshold setting portion; And a vowel / consonant determination circuit for determining the vowel and the consonant and outputting the result based on the comparison results of the first and second comparators. . 음성입력신호를 주파수분석하는 주파수분석수단과, 그 주파수분석수단의 주파수분석출력을 케스트럼분석하는 켑스트럼분석 수단과, 그 켑스트럼분석수단의 켑스트럼분석출력에 있어서의 켑스트럼 피이크를 검출하는 피이크검출수단과, 상기 켑스트럼분석수단의 켑스트럼출력에 있어서의 평균치레벨을 산출하는 평균치산출수단과, 상기 피이크검출수단의 피이크검출정보와 상기 평균치산출수단의 평균치정보에 의거해서, 상기 피이크에 의거해서 모음을 판정하고, 상기 평균치정보와 레벨에 의거해서 자음을 판정해서, 모음,자음을 판정하는 모음/자음판정수단과, 그 모음/자음판정수단에서의 판정결과를 이용해서 소거계수를 설정하는 소거계수설정수단과, 상기 푸우리에 변환된 음성신호가 입려되어 그 잡음성분을 예측하는 잡음예측수단과, 그 잡음예측수단의 잡음예측출력 상기 음성신호 및 상기 소거계수설정수단에 의해 설정된 소거계수신호가 입력되고, 그 음성신호로부터 그 소거율을 고려한 잡음성분을 소거하는 소거수단과, 그 소거수단의 소거출력을 합성하는 신호합성수단을 구비하는 것을 특징으로하는 신호처리장치.Frequency analysis means for frequency analysis of the audio input signal, cepstrum analysis means for performing the spectral analysis of the frequency analysis output of the frequency analysis means, and the cepstrum in the spectral analysis output of the cepstrum analysis means. A peak detection means for detecting peaks, an average value calculating means for calculating an average value level in the chop strum output of the chop strum analyzing means, peak detection information of the peak detecting means and average value information of the average value calculating means. Based on the peak, the vowel is determined, the vowel / consonant determination means for determining the vowel and the consonant based on the average value information and the level, and the determination result in the vowel / consonant determination means. Erasing coefficient setting means for setting an erasing coefficient by using a noise predicting means for predicting a noise component by applying a voice signal converted to the puuri; Noise prediction output of the noise predicting means; an erasing means for inputting an audio signal and an erasing coefficient signal set by the erasing coefficient setting means, and canceling means for canceling a noise component in consideration of the erasing rate from the audio signal; And a signal synthesizing means for synthesizing the signal. 음성입력신호 대여분할하는 대역분할수단과, 그 대역분할수단의 대역분할출력을 켑스트럼분석하는 켑스트럼분석수단과, 그 켑스트럼분석수단의 켑스트럼분석출력에 있어서의 켑스트럼피이크를 검출하는 피이크검출수단과, 상기 켑스트럼분석수단의 켑스트럼분석출력에 있어서의 평균치 레벨을 산출하는 평균치산출수단과, 상기 피이크검출수단과의 피이크검출정보와 상기 평균치산출수단의 평균치정보에 의거해서, 상기 피이크에 의거해서 모음을 판정하고, 상기 평균치정보의 레벨에 의거해서 자음을 판정해서, 모음, 자음을 판정하는 모음/자음판정수단과, 그 모음/자음 판정수단에서의 판정결과를 이용해서 소거계수를 설정하는 소거계수 설정수단과, 상기 푸우리에 변환된 음성신호가 입력되고, 그 잡음성분을 예측하는 잡음예측수단과, 그 잡음예측수단의 잡음에 측출력, 상기 음성신호 및 상기 소거계수설정수단에 의해 설정된 소거계수신호가 입력되고, 그 음성신호로부터 그 소거율을 고려한 잡음성분을 소거하는 소거수단과, 그 소거수단의 소거출력을 대역합성하는 대역합성수단을 구비하는 것을 특징으로하는 신호처리장치.Band dividing means for dividing the voice input signal rental division; Chop strut analysis means for performing a spectral analysis on the band split output of the band dividing means; and a spectrum in the spectral analysis output of the spectral analysis means. Peak detecting means for detecting peaks, an average value calculating means for calculating an average value level in the chop stratum analysis output of the shock analyzing means, peak detection information with the peak detecting means, and an average value of the average value calculating means. Based on the information, the vowel is determined based on the peak, the vowel / consonant determination means for determining the vowel and the consonant based on the level of the average value information, and the judgment in the vowel / consonant determination means. An erase coefficient setting means for setting an erase coefficient using the result, a noise predicting means for inputting a voice signal converted into the puuri and predicting the noise component thereof, A side output, the audio signal, and an erase coefficient signal set by the erase coefficient setting means are input to the noise of the tone predicting means, and erase means for canceling a noise component in consideration of the erase rate from the voice signal; And a band synthesizing means for band synthesizing the erase output. 제5항에 있어서, 모음/자음판정수단은 적어도 상기 피이크검출수단에서의 검출피이크 및 임계설정부가 설정한 임계치를 제1비교기와, 상기 평균치 산출수단에 의한 산출평균치 및 임계설정부에서 설정된 소정의 임계치를 비교하는 제2비교기와, 그들 제1, 제2비교기의 비교결과에 의거해서, 모음, 자음을 판정하여 결과를 출력하는 모음/자음 판정회로를 구비한 것을 특징으로하는 음성신호처리장치.6. The apparatus according to claim 5, wherein the vowel / consonant determination means includes at least a detection peak in the peak detection means and a threshold value set by the threshold setting unit, a first average comparator, a predetermined average value set by the average value calculation means, and a threshold setting unit. And a vowel / consonant determination circuit for determining a vowel and a consonant and outputting a result based on a comparison result between the first and second comparators. ※ 참고사항 : 최초출원 내용에 의하여 공개하는 것임.※ Note: The disclosure is based on the initial application.
KR1019910002431A 1990-02-13 1991-02-13 Voice signal processing device KR960005740B1 (en)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
JP2033211A JP2959792B2 (en) 1990-02-13 1990-02-13 Audio signal processing device
JP2-33211 1990-02-13
JP3321090A JP2959791B2 (en) 1990-02-13 1990-02-13 Audio signal processing device
JP2-33210 1990-02-13

Publications (2)

Publication Number Publication Date
KR910015962A true KR910015962A (en) 1991-09-30
KR960005740B1 KR960005740B1 (en) 1996-05-01

Family

ID=26371868

Family Applications (1)

Application Number Title Priority Date Filing Date
KR1019910002431A KR960005740B1 (en) 1990-02-13 1991-02-13 Voice signal processing device

Country Status (9)

Country Link
US (1) US5204906A (en)
EP (1) EP0442342B1 (en)
KR (1) KR960005740B1 (en)
AU (1) AU635600B2 (en)
CA (1) CA2036199C (en)
DE (1) DE69105154T2 (en)
FI (1) FI103930B1 (en)
HK (1) HK185195A (en)
NO (1) NO306360B1 (en)

Families Citing this family (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH07104788A (en) * 1993-10-06 1995-04-21 Technol Res Assoc Of Medical & Welfare Apparatus Voice emphasis processor
JP3397568B2 (en) * 1996-03-25 2003-04-14 キヤノン株式会社 Voice recognition method and apparatus
WO1997037345A1 (en) * 1996-03-29 1997-10-09 British Telecommunications Public Limited Company Speech processing
EP1085504B1 (en) 1996-11-07 2002-05-29 Matsushita Electric Industrial Co., Ltd. CELP-Codec
JPH10247869A (en) * 1997-03-04 1998-09-14 Nec Corp Diversity circuit
DE19854341A1 (en) 1998-11-25 2000-06-08 Alcatel Sa Method and circuit arrangement for speech level measurement in a speech signal processing system
US20020150264A1 (en) * 2001-04-11 2002-10-17 Silvia Allegro Method for eliminating spurious signal components in an input signal of an auditory system, application of the method, and a hearing aid
US20040102965A1 (en) * 2002-11-21 2004-05-27 Rapoport Ezra J. Determining a pitch period
US9838784B2 (en) 2009-12-02 2017-12-05 Knowles Electronics, Llc Directional audio capture
US8880396B1 (en) * 2010-04-28 2014-11-04 Audience, Inc. Spectrum reconstruction for automatic speech recognition
DE102011006515A1 (en) 2011-03-31 2012-10-04 Siemens Medical Instruments Pte. Ltd. Method for improving speech intelligibility with a hearing aid device and hearing aid device
DE102011006511B4 (en) 2011-03-31 2016-07-14 Sivantos Pte. Ltd. Hearing aid and method for operating a hearing aid
DE102011006472B4 (en) 2011-03-31 2013-08-14 Siemens Medical Instruments Pte. Ltd. Method for improving speech intelligibility with a hearing aid device and hearing aid device
KR101247652B1 (en) * 2011-08-30 2013-04-01 광주과학기술원 Apparatus and method for eliminating noise
US9536540B2 (en) 2013-07-19 2017-01-03 Knowles Electronics, Llc Speech signal separation and synthesis based on auditory scene analysis and speech modeling
JP2015169827A (en) * 2014-03-07 2015-09-28 富士通株式会社 Speech processing device, speech processing method, and speech processing program
CN107112025A (en) 2014-09-12 2017-08-29 美商楼氏电子有限公司 System and method for recovering speech components
US9820042B1 (en) 2016-05-02 2017-11-14 Knowles Electronics, Llc Stereo separation and directional suppression with omni-directional microphones

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3566035A (en) * 1969-07-17 1971-02-23 Bell Telephone Labor Inc Real time cepstrum analyzer
US4058676A (en) * 1975-07-07 1977-11-15 International Communication Sciences Speech analysis and synthesis system
US4630305A (en) * 1985-07-01 1986-12-16 Motorola, Inc. Automatic gain selector for a noise suppression system
AU598933B2 (en) * 1987-04-03 1990-07-05 American Telephone And Telegraph Company An adaptive threshold voiced detector

Also Published As

Publication number Publication date
US5204906A (en) 1993-04-20
NO910535L (en) 1991-08-14
EP0442342B1 (en) 1994-11-17
FI103930B (en) 1999-10-15
KR960005740B1 (en) 1996-05-01
CA2036199C (en) 1997-09-30
FI103930B1 (en) 1999-10-15
NO306360B1 (en) 1999-10-25
DE69105154T2 (en) 1995-03-23
CA2036199A1 (en) 1991-08-14
AU6927891A (en) 1991-08-15
NO910535D0 (en) 1991-02-11
EP0442342A1 (en) 1991-08-21
FI910679A0 (en) 1991-02-12
FI910679A (en) 1991-08-14
DE69105154D1 (en) 1994-12-22
HK185195A (en) 1995-12-15
AU635600B2 (en) 1993-03-25

Similar Documents

Publication Publication Date Title
KR910020641A (en) Noise Prediction Device and Signal Processing Device Using It
KR910015962A (en) Voice signal processing device
KR910015109A (en) Signal processing equipment
US8140330B2 (en) System and method for detecting repeated patterns in dialog systems
KR940024660A (en) Voice recognition device
JP2007041593A (en) Method and apparatus for extracting voiced/unvoiced classification information using harmonic component of voice signal
JPH0713584A (en) Speech detecting device
KR20030064733A (en) Fast frequency-domain pitch estimation
JP3006677B2 (en) Voice recognition device
KR960007842B1 (en) Voice and noise separating device
KR910020643A (en) Voice signal processing device
WO2007026436A1 (en) Vocal fry detecting device
JP5282523B2 (en) Basic frequency extraction method, basic frequency extraction device, and program
Samad et al. Pitch detection of speech signals using the cross-correlation technique
US20060150805A1 (en) Method of automatically detecting vibrato in music
JP3106543B2 (en) Audio signal processing device
KR0136608B1 (en) Phoneme recognizing device for voice signal status detection
JPH04100099A (en) Voice detector
JP2001083978A (en) Speech recognition device
KR100345402B1 (en) An apparatus and method for real - time speech detection using pitch information
KR950013555B1 (en) Voice signal processing device
JP2666296B2 (en) Voice recognition device
JP2664136B2 (en) Voice recognition device
KR19990070595A (en) How to classify voice-voice segments in flattened spectra
KR100539176B1 (en) Device and method of extracting musical feature

Legal Events

Date Code Title Description
A201 Request for examination
E902 Notification of reason for refusal
AMND Amendment
E902 Notification of reason for refusal
AMND Amendment
E601 Decision to refuse application
J2X1 Appeal (before the patent court)

Free format text: APPEAL AGAINST DECISION TO DECLINE REFUSAL

G160 Decision to publish patent application
B701 Decision to grant
GRNT Written decision to grant
FPAY Annual fee payment

Payment date: 20070424

Year of fee payment: 12

LAPS Lapse due to unpaid annual fee