ATE412235T1 - Verfahren und vorrichtung zum erkennen von sprachsegmenten bei der sprachsignalverarbeitung - Google Patents

Verfahren und vorrichtung zum erkennen von sprachsegmenten bei der sprachsignalverarbeitung

Info

Publication number
ATE412235T1
ATE412235T1 AT05025231T AT05025231T ATE412235T1 AT E412235 T1 ATE412235 T1 AT E412235T1 AT 05025231 T AT05025231 T AT 05025231T AT 05025231 T AT05025231 T AT 05025231T AT E412235 T1 ATE412235 T1 AT E412235T1
Authority
AT
Austria
Prior art keywords
noise
signal processing
region
frame
speech
Prior art date
Application number
AT05025231T
Other languages
English (en)
Inventor
Kyung-Ho Woo
Original Assignee
Lg Electronics Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Lg Electronics Inc filed Critical Lg Electronics Inc
Application granted granted Critical
Publication of ATE412235T1 publication Critical patent/ATE412235T1/de

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/0204Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • G10L2025/783Detection of presence or absence of voice signals based on threshold decision
    • G10L2025/786Adaptive threshold

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Signal Processing (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Mobile Radio Communication Systems (AREA)
  • Telephonic Communication Services (AREA)
  • Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
  • Time-Division Multiplex Systems (AREA)
AT05025231T 2004-11-20 2005-11-18 Verfahren und vorrichtung zum erkennen von sprachsegmenten bei der sprachsignalverarbeitung ATE412235T1 (de)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
KR1020040095520A KR100677396B1 (ko) 2004-11-20 2004-11-20 음성인식장치의 음성구간 검출방법

Publications (1)

Publication Number Publication Date
ATE412235T1 true ATE412235T1 (de) 2008-11-15

Family

ID=35723587

Family Applications (1)

Application Number Title Priority Date Filing Date
AT05025231T ATE412235T1 (de) 2004-11-20 2005-11-18 Verfahren und vorrichtung zum erkennen von sprachsegmenten bei der sprachsignalverarbeitung

Country Status (7)

Country Link
US (1) US7620544B2 (de)
EP (1) EP1659570B1 (de)
JP (1) JP4282659B2 (de)
KR (1) KR100677396B1 (de)
CN (1) CN1805007B (de)
AT (1) ATE412235T1 (de)
DE (1) DE602005010525D1 (de)

Families Citing this family (26)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2008099163A (ja) * 2006-10-16 2008-04-24 Audio Technica Corp ノイズキャンセルヘッドフォンおよびヘッドフォンにおけるノイズキャンセル方法
KR100835996B1 (ko) * 2006-12-05 2008-06-09 한국전자통신연구원 적응형 발성 화면 분석 방법 및 장치
US20110035215A1 (en) * 2007-08-28 2011-02-10 Haim Sompolinsky Method, device and system for speech recognition
CN101515454B (zh) * 2008-02-22 2011-05-25 杨夙 用于语音、音乐、噪音自动分类的信号特征提取方法
EP2107553B1 (de) * 2008-03-31 2011-05-18 Harman Becker Automotive Systems GmbH Verfahren zur Erkennung einer Unterbrechung einer Sprachausgabe
US8380497B2 (en) 2008-10-15 2013-02-19 Qualcomm Incorporated Methods and apparatus for noise estimation
CN102356427B (zh) * 2009-04-02 2013-10-30 三菱电机株式会社 噪声抑制装置
KR101251045B1 (ko) * 2009-07-28 2013-04-04 한국전자통신연구원 오디오 판별 장치 및 그 방법
ES2371619B1 (es) * 2009-10-08 2012-08-08 Telefónica, S.A. Procedimiento de detección de segmentos de voz.
JP5712220B2 (ja) * 2009-10-19 2015-05-07 テレフオンアクチーボラゲット エル エム エリクソン(パブル) 音声活動検出のための方法および背景推定器
KR20140026229A (ko) 2010-04-22 2014-03-05 퀄컴 인코포레이티드 음성 액티비티 검출
CN102376303B (zh) * 2010-08-13 2014-03-12 国基电子(上海)有限公司 录音设备及利用该录音设备进行声音处理与录入的方法
US8898058B2 (en) 2010-10-25 2014-11-25 Qualcomm Incorporated Systems, methods, and apparatus for voice activity detection
US20130151248A1 (en) * 2011-12-08 2013-06-13 Forrest Baker, IV Apparatus, System, and Method For Distinguishing Voice in a Communication Stream
CN103915097B (zh) * 2013-01-04 2017-03-22 中国移动通信集团公司 一种语音信号处理方法、装置和系统
JP6221257B2 (ja) * 2013-02-26 2017-11-01 沖電気工業株式会社 信号処理装置、方法及びプログラム
KR20150105847A (ko) * 2014-03-10 2015-09-18 삼성전기주식회사 음성구간 검출 방법 및 장치
CN107613236B (zh) * 2017-09-28 2021-01-05 盐城市聚龙湖商务集聚区发展有限公司 一种音像录制方法及终端、存储介质
KR20200141860A (ko) 2019-06-11 2020-12-21 삼성전자주식회사 전자 장치 및 그 제어 방법
CN110689901B (zh) * 2019-09-09 2022-06-28 苏州臻迪智能科技有限公司 语音降噪的方法、装置、电子设备及可读存储介质
US20210169559A1 (en) * 2019-12-06 2021-06-10 Board Of Regents, The University Of Texas System Acoustic monitoring for electrosurgery
CN113098626B (zh) * 2020-01-09 2023-03-24 北京君正集成电路股份有限公司 一种近距离声波通信同步的方法
CN113098627B (zh) * 2020-01-09 2023-03-24 北京君正集成电路股份有限公司 一种实现近距离声波通信同步的系统
CN111554314A (zh) * 2020-05-15 2020-08-18 腾讯科技(深圳)有限公司 噪声检测方法、装置、终端及存储介质
CN115240696B (zh) * 2022-07-26 2023-10-03 北京集智数字科技有限公司 一种语音识别方法及可读存储介质
KR102516391B1 (ko) * 2022-09-02 2023-04-03 주식회사 액션파워 음성 구간 길이를 고려하여 오디오에서 음성 구간을 검출하는 방법

Family Cites Families (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3626492B2 (ja) * 1993-07-07 2005-03-09 ポリコム・インコーポレイテッド 会話の品質向上のための背景雑音の低減
FI100840B (fi) * 1995-12-12 1998-02-27 Nokia Mobile Phones Ltd Kohinanvaimennin ja menetelmä taustakohinan vaimentamiseksi kohinaises ta puheesta sekä matkaviestin
US6427134B1 (en) * 1996-07-03 2002-07-30 British Telecommunications Public Limited Company Voice activity detector for calculating spectral irregularity measure on the basis of spectral difference measurements
US5884255A (en) * 1996-07-16 1999-03-16 Coherent Communications Systems Corp. Speech detection system employing multiple determinants
US5866702A (en) * 1996-08-02 1999-02-02 Cv Therapeutics, Incorporation Purine inhibitors of cyclin dependent kinase 2
US6202046B1 (en) * 1997-01-23 2001-03-13 Kabushiki Kaisha Toshiba Background noise/speech classification method
FR2767334B1 (fr) * 1997-08-12 1999-10-22 Commissariat Energie Atomique Kinase activatrice des proteine-kinases cycline dependantes, et ses utilisations
US6479487B1 (en) * 1998-02-26 2002-11-12 Aventis Pharmaceuticals Inc. 6, 9-disubstituted 2-[trans-(4-aminocyclohexyl)amino] purines
US6480823B1 (en) * 1998-03-24 2002-11-12 Matsushita Electric Industrial Co., Ltd. Speech detection for noisy conditions
US6453289B1 (en) * 1998-07-24 2002-09-17 Hughes Electronics Corporation Method of noise reduction for speech codecs
US6266633B1 (en) * 1998-12-22 2001-07-24 Itt Manufacturing Enterprises Noise suppression and channel equalization preprocessor for speech and speaker recognizers: method and apparatus
US6327564B1 (en) * 1999-03-05 2001-12-04 Matsushita Electric Corporation Of America Speech detection using stochastic confidence measures on the frequency spectrum
AR029347A1 (es) * 1999-04-02 2003-06-25 Euro Celtique Sa Compuesto de adenina, compuesto de isognanina y 2,6-ditioxantina como precursor del mismo, uso de dichos compuestos para preparar una composicion farmaceutica y dicha composicion farmaceutica
US6618701B2 (en) * 1999-04-19 2003-09-09 Motorola, Inc. Method and system for noise suppression using external voice activity detection
JP2000310993A (ja) * 1999-04-28 2000-11-07 Pioneer Electronic Corp 音声検出装置
US6615170B1 (en) * 2000-03-07 2003-09-02 International Business Machines Corporation Model-based voice activity detection system and method using a log-likelihood ratio and pitch
US20020116186A1 (en) * 2000-09-09 2002-08-22 Adam Strauss Voice activity detector for integrated telecommunications processing
US7236929B2 (en) * 2001-05-09 2007-06-26 Plantronics, Inc. Echo suppression and speech detection techniques for telephony applications
US6812232B2 (en) * 2001-09-11 2004-11-02 Amr Technology, Inc. Heterocycle substituted purine derivatives as potent antiproliferative agents
US6667311B2 (en) * 2001-09-11 2003-12-23 Albany Molecular Research, Inc. Nitrogen substituted biaryl purine derivatives as potent antiproliferative agents
US7346175B2 (en) * 2001-09-12 2008-03-18 Bitwave Private Limited System and apparatus for speech communication and speech recognition
US7146314B2 (en) * 2001-12-20 2006-12-05 Renesas Technology Corporation Dynamic adjustment of noise separation in data handling, particularly voice activation

Also Published As

Publication number Publication date
US7620544B2 (en) 2009-11-17
JP4282659B2 (ja) 2009-06-24
KR20060056186A (ko) 2006-05-24
DE602005010525D1 (de) 2008-12-04
JP2006146226A (ja) 2006-06-08
US20060111901A1 (en) 2006-05-25
KR100677396B1 (ko) 2007-02-02
CN1805007A (zh) 2006-07-19
CN1805007B (zh) 2010-11-03
EP1659570B1 (de) 2008-10-22
EP1659570A1 (de) 2006-05-24

Similar Documents

Publication Publication Date Title
ATE412235T1 (de) Verfahren und vorrichtung zum erkennen von sprachsegmenten bei der sprachsignalverarbeitung
ATE540398T1 (de) Sprachaktivitätsdetektionseinrichtung und verfahren
ATE491262T1 (de) Verfahren und system zum verringern der auswirkungen von geräuschproduzierenden artefakten
ATE250801T1 (de) Verfahren und gerät zum erkennen von geräuschsignalproben aus einem geräusch
ATE386320T1 (de) Vorrichtung und verfahren zum ermitteln einer quantisierer-schrittweite
ATE339001T1 (de) Vorrichtung und verfahren zum analysieren eines audio-informationssignals
DE50209455D1 (de) Verfahren zum Training oder zur Adaption eines Spracherkenners
DE502005006550D1 (de) Verfahren und Vorrichtung zur Arcerkennung in einem Plasmaprozess
ATE352836T1 (de) Detektion von emotionen in sprachsignalen mittels analyse einer vielzahl von sprachsignalparametern
ATE526659T1 (de) Verfahren und vorrichtung zum kodieren von einem audiosignal
ATE390681T1 (de) Vorrichtung und verfahren zum ändern einer segmentierung eines audiostücks
ATE373858T1 (de) Verfahren und vorrichtung zur verringerung von geräuschbeeinträchtigung eines alternativen sensorsignals während multisensorischer sprachverstärkung
WO2004075167A3 (en) Log-likelihood ratio method for detecting voice activity and apparatus
DE60325881D1 (de) Verfahren zum betreiben eines spracherkennungssystemes
ATE319160T1 (de) Verfahren zur rauschrobusten klassifikation in der sprachkodierung
ATE522045T1 (de) Verfahren zur detektion von störung in einem kommunikationssignal
BRPI0911440A2 (pt) método e dispositivo para reconhecer um estado de uma máquina geradora de ruído a ser investigada
DE50212199D1 (de) Verfahren zum Betrieb eines Hörhilfegerätes oder Hörgerätesystem sowie Hörhilfegerät oder Hörgerätesystem
ATE523969T1 (de) Verfahren und vorrichtung zum beseitigen von schmalbandigen störungen mittels fensterverarbeitung in einem spreizspektrumsystem
DE602004026919D1 (de) Verfahren und Vorrichtungen zum implementieren eines geschwindigkeitsempfindlchen Mobilrouters
ATE467207T1 (de) Vervahren zum erzeugen eines abdrucks eines audiosignals
ATE470928T1 (de) Verfahren, vorrichtung und system zum ausführen der funktion der spracherkennung
ATE423411T1 (de) Vorrichtung und verfahren zum bestimmen eines korrelationswertes
ATE463887T1 (de) Verfahren und vorrichtung zum erkennen von impulsen
ATE422087T1 (de) Verfahren und vorrichtung zur durchführung von spracherkennung über einen sprachkanal

Legal Events

Date Code Title Description
RER Ceased as to paragraph 5 lit. 3 law introducing patent treaties