IN2012DN03323A - - Google Patents

Download PDF

Info

Publication number
IN2012DN03323A
IN2012DN03323A IN3323DEN2012A IN2012DN03323A IN 2012DN03323 A IN2012DN03323 A IN 2012DN03323A IN 3323DEN2012 A IN3323DEN2012 A IN 3323DEN2012A IN 2012DN03323 A IN2012DN03323 A IN 2012DN03323A
Authority
IN
India
Application number
Inventor
Martin Sehlstedt
Original Assignee
Ericsson Telefon Ab L M
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ericsson Telefon Ab L M filed Critical Ericsson Telefon Ab L M
Publication of IN2012DN03323A publication Critical patent/IN2012DN03323A/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • G10L25/87Detection of discrete points within a voice signal
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/18Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • G10L2025/783Detection of presence or absence of voice signals based on threshold decision
    • G10L2025/786Adaptive threshold
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Signal Processing (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Telephone Function (AREA)
  • Noise Elimination (AREA)
  • Telephonic Communication Services (AREA)
IN3323DEN2012 2009-10-19 2012-04-17 IN2012DN03323A (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US25296609P 2009-10-19 2009-10-19
PCT/SE2010/051117 WO2011049515A1 (en) 2009-10-19 2010-10-18 Method and voice activity detector for a speech encoder

Publications (1)

Publication Number Publication Date
IN2012DN03323A true IN2012DN03323A (en) 2015-10-23

Family

ID=43900544

Family Applications (1)

Application Number Title Priority Date Filing Date
IN3323DEN2012 IN2012DN03323A (en) 2009-10-19 2012-04-17

Country Status (8)

Country Link
US (2) US9401160B2 (en)
EP (1) EP2491548A4 (en)
JP (1) JP2013508773A (en)
CN (1) CN102804261B (en)
AU (1) AU2010308598A1 (en)
CA (1) CA2778343A1 (en)
IN (1) IN2012DN03323A (en)
WO (1) WO2011049515A1 (en)

Families Citing this family (34)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3726530B1 (en) * 2010-12-24 2024-05-22 Huawei Technologies Co., Ltd. Method and apparatus for adaptively detecting a voice activity in an input audio signal
JP6127143B2 (en) * 2012-08-31 2017-05-10 テレフオンアクチーボラゲット エルエム エリクソン(パブル) Method and apparatus for voice activity detection
BR112015014212B1 (en) 2012-12-21 2021-10-19 Fraunhofer-Gesellschaft Zur Forderung Der Angewandten Forschung E.V. GENERATION OF A COMFORT NOISE WITH HIGH SPECTRO-TEMPORAL RESOLUTION IN DISCONTINUOUS TRANSMISSION OF AUDIO SIGNALS
CN111145767B (en) * 2012-12-21 2023-07-25 弗劳恩霍夫应用研究促进协会 Decoder and system for generating and processing coded frequency bit stream
CN112992188B (en) * 2012-12-25 2024-06-18 中兴通讯股份有限公司 Method and device for adjusting signal-to-noise ratio threshold in activated voice detection VAD judgment
CN103971680B (en) * 2013-01-24 2018-06-05 华为终端(东莞)有限公司 A kind of method, apparatus of speech recognition
CN103065631B (en) * 2013-01-24 2015-07-29 华为终端有限公司 A kind of method of speech recognition, device
BR112016014104B1 (en) 2013-12-19 2020-12-29 Telefonaktiebolaget Lm Ericsson (Publ) background noise estimation method, background noise estimator, sound activity detector, codec, wireless device, network node, computer-readable storage medium
CN103854662B (en) * 2014-03-04 2017-03-15 中央军委装备发展部第六十三研究所 Adaptive voice detection method based on multiple domain Combined estimator
CN107293287B (en) 2014-03-12 2021-10-26 华为技术有限公司 Method and apparatus for detecting audio signal
CN105321528B (en) * 2014-06-27 2019-11-05 中兴通讯股份有限公司 A kind of Microphone Array Speech detection method and device
WO2016007528A1 (en) * 2014-07-10 2016-01-14 Analog Devices Global Low-complexity voice activity detection
CN105261375B (en) * 2014-07-18 2018-08-31 中兴通讯股份有限公司 Activate the method and device of sound detection
PL3309784T3 (en) 2014-07-29 2020-02-28 Telefonaktiebolaget Lm Ericsson (Publ) Esimation of background noise in audio signals
CN104134440B (en) * 2014-07-31 2018-05-08 百度在线网络技术(北京)有限公司 Speech detection method and speech detection device for portable terminal
US9953661B2 (en) * 2014-09-26 2018-04-24 Cirrus Logic Inc. Neural network voice activity detection employing running range normalization
KR102475869B1 (en) * 2014-10-01 2022-12-08 삼성전자주식회사 Method and apparatus for processing audio signal including noise
US20160150315A1 (en) * 2014-11-20 2016-05-26 GM Global Technology Operations LLC System and method for echo cancellation
WO2016114788A1 (en) * 2015-01-16 2016-07-21 Hewlett Packard Enterprise Development Lp Video encoder
CN110895930B (en) * 2015-05-25 2022-01-28 展讯通信(上海)有限公司 Voice recognition method and device
US9413423B1 (en) * 2015-08-18 2016-08-09 Texas Instruments Incorporated SNR calculation in impulsive noise and erasure channels
KR102446392B1 (en) * 2015-09-23 2022-09-23 삼성전자주식회사 Electronic device and method for recognizing voice of speech
US11631421B2 (en) * 2015-10-18 2023-04-18 Solos Technology Limited Apparatuses and methods for enhanced speech recognition in variable environments
JP6759898B2 (en) * 2016-09-08 2020-09-23 富士通株式会社 Utterance section detection device, utterance section detection method, and computer program for utterance section detection
EP3324407A1 (en) 2016-11-17 2018-05-23 Fraunhofer Gesellschaft zur Förderung der Angewand Apparatus and method for decomposing an audio signal using a ratio as a separation characteristic
EP3324406A1 (en) 2016-11-17 2018-05-23 Fraunhofer Gesellschaft zur Förderung der Angewand Apparatus and method for decomposing an audio signal using a variable threshold
CN107393559B (en) * 2017-07-14 2021-05-18 深圳永顺智信息科技有限公司 Method and device for checking voice detection result
KR102512614B1 (en) * 2018-12-12 2023-03-23 삼성전자주식회사 Electronic device audio enhancement and method thereof
CN111048119B (en) * 2020-03-12 2020-07-10 腾讯科技(深圳)有限公司 Call audio mixing processing method and device, storage medium and computer equipment
US20230162754A1 (en) * 2020-03-27 2023-05-25 Dolby Laboratories Licensing Corporation Automatic Leveling of Speech Content
TWI756817B (en) * 2020-09-08 2022-03-01 瑞昱半導體股份有限公司 Voice activity detection device and method
CN114283840B (en) * 2021-12-22 2023-04-18 天翼爱音乐文化科技有限公司 Instruction audio generation method, system, device and storage medium
CN114566152B (en) * 2022-04-27 2022-07-08 成都启英泰伦科技有限公司 Voice endpoint detection method based on deep learning
KR102516391B1 (en) * 2022-09-02 2023-04-03 주식회사 액션파워 Method for detecting speech segment from audio considering length of speech segment

Family Cites Families (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6122384A (en) * 1997-09-02 2000-09-19 Qualcomm Inc. Noise suppression system and method
US6023674A (en) * 1998-01-23 2000-02-08 Telefonaktiebolaget L M Ericsson Non-parametric voice activity detection
US6088668A (en) * 1998-06-22 2000-07-11 D.S.P.C. Technologies Ltd. Noise suppressor having weighted gain smoothing
JP2000172283A (en) * 1998-12-01 2000-06-23 Nec Corp System and method for detecting sound
US6556967B1 (en) * 1999-03-12 2003-04-29 The United States Of America As Represented By The National Security Agency Voice activity detector
JP3759685B2 (en) 1999-05-18 2006-03-29 三菱電機株式会社 Noise section determination device, noise suppression device, and estimated noise information update method
US7058572B1 (en) * 2000-01-28 2006-06-06 Nortel Networks Limited Reducing acoustic noise in wireless and landline based telephony
US6889187B2 (en) * 2000-12-28 2005-05-03 Nortel Networks Limited Method and apparatus for improved voice activity detection in a packet voice network
US7031916B2 (en) * 2001-06-01 2006-04-18 Texas Instruments Incorporated Method for converging a G.729 Annex B compliant voice activity detection circuit
EP1271470A1 (en) * 2001-06-25 2003-01-02 Alcatel Method and device for determining the voice quality degradation of a signal
US7283956B2 (en) * 2002-09-18 2007-10-16 Motorola, Inc. Noise suppression
CA2454296A1 (en) * 2003-12-29 2005-06-29 Nokia Corporation Method and device for speech enhancement in the presence of background noise
JP2008546341A (en) * 2005-06-18 2008-12-18 ノキア コーポレイション System and method for adaptive transmission of pseudo background noise parameters in non-continuous speech transmission
US7366658B2 (en) * 2005-12-09 2008-04-29 Texas Instruments Incorporated Noise pre-processor for enhanced variable rate speech codec
ES2525427T3 (en) * 2006-02-10 2014-12-22 Telefonaktiebolaget L M Ericsson (Publ) A voice detector and a method to suppress subbands in a voice detector
US20080010065A1 (en) * 2006-06-05 2008-01-10 Harry Bratt Method and apparatus for speaker recognition
JP4568371B2 (en) * 2006-11-16 2010-10-27 インターナショナル・ビジネス・マシーンズ・コーポレーション Computerized method and computer program for distinguishing between at least two event classes
US8121835B2 (en) * 2007-03-21 2012-02-21 Texas Instruments Incorporated Automatic level control of speech signals
US7873114B2 (en) * 2007-03-29 2011-01-18 Motorola Mobility, Inc. Method and apparatus for quickly detecting a presence of abrupt noise and updating a noise estimate
CN101681619B (en) * 2007-05-22 2012-07-04 Lm爱立信电话有限公司 Improved voice activity detector
CN101320559B (en) * 2007-06-07 2011-05-18 华为技术有限公司 Sound activation detection apparatus and method
ES2533358T3 (en) * 2007-06-22 2015-04-09 Voiceage Corporation Procedure and device to estimate the tone of a sound signal

Also Published As

Publication number Publication date
CN102804261A (en) 2012-11-28
EP2491548A4 (en) 2013-10-30
US9401160B2 (en) 2016-07-26
AU2010308598A1 (en) 2012-05-17
WO2011049515A1 (en) 2011-04-28
EP2491548A1 (en) 2012-08-29
JP2013508773A (en) 2013-03-07
US20120215536A1 (en) 2012-08-23
US20160322067A1 (en) 2016-11-03
CN102804261B (en) 2015-02-18
CA2778343A1 (en) 2011-04-28

Similar Documents

Publication Publication Date Title
BR112012012396A2 (en)
BR112012008267A2 (en)
BR112012008195A2 (en)
BRPI0925311A2 (en)
BRPI0924307A2 (en)
BR122017024704A2 (en)
BR112012012487A2 (en)
BR112012000665A2 (en)
BR112012012080A2 (en)
BR112012009797A2 (en)
BR112012009446A2 (en)
BR112012009703A2 (en)
BR112012010357A2 (en)
BR112012007656A2 (en)
BR122019005883A2 (en)
BR112012001263A2 (en)
BR112012000159A2 (en)
BR112012014856A2 (en)
BRPI0924534A2 (en)
BR112012007654A2 (en)
BR112012007672A2 (en)
BR112012000255A2 (en)
BR112012000156A2 (en)
BRPI0924617A2 (en)
BR122017013721A2 (en)