DK3882914T3 - Voice recognition method, voice recognition apparatus, electronic device and computer-readable storage medium - Google Patents

Voice recognition method, voice recognition apparatus, electronic device and computer-readable storage medium

Info

Publication number
DK3882914T3
Authority
DK
Denmark
Prior art keywords
voice recognition
electronic device
computer readable
readable storage
storage media
Prior art date
Application number
DK20201839.6T
Other languages
English (en)
Inventor
Nengjun Ouyang
Junhua Xu
Zhengbin Song
Danqing Yang
Gang Xu
Original Assignee
Apollo Intelligent Connectivity Beijing Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2020-03-17
Filing date
2020-10-14
Publication date
2022-09-05
Application filed by Apollo Intelligent Connectivity Beijing Technology Co Ltd
Application granted
Publication of DK3882914T3

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208 Noise filtering
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208 Noise filtering
    • G10L21/0216 Noise filtering characterised by the method used for estimating noise
    • B PERFORMING OPERATIONS; TRANSPORTING
    • B60 VEHICLES IN GENERAL
    • B60R VEHICLES, VEHICLE FITTINGS, OR VEHICLE PARTS, NOT OTHERWISE PROVIDED FOR
    • B60R16/00 Electric or fluid circuits specially adapted for vehicles and not otherwise provided for; Arrangement of elements of electric or fluid circuits specially adapted for vehicles and not otherwise provided for
    • B60R16/02 Electric or fluid circuits specially adapted for vehicles and not otherwise provided for; Arrangement of elements of electric or fluid circuits specially adapted for vehicles and not otherwise provided for electric constitutive elements
    • B60R16/037 Electric or fluid circuits specially adapted for vehicles and not otherwise provided for; Arrangement of elements of electric or fluid circuits specially adapted for vehicles and not otherwise provided for electric constitutive elements for occupant comfort, e.g. for automatic adjustment of appliances according to personal settings, e.g. seats, mirrors, steering wheel
    • B60R16/0373 Voice control
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/20 Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223 Execution procedure of a spoken command
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208 Noise filtering
    • G10L2021/02082 Noise filtering the noise being echo, reverberation of the speech
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208 Noise filtering
    • G10L21/0216 Noise filtering characterised by the method used for estimating noise
    • G10L2021/02161 Number of inputs available containing the signal or the noise to be suppressed
    • G10L2021/02165 Two microphones, one receiving mainly the noise signal and the other one mainly the speech signal
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208 Noise filtering
    • G10L21/0216 Noise filtering characterised by the method used for estimating noise
    • G10L2021/02161 Number of inputs available containing the signal or the noise to be suppressed
    • G10L2021/02166 Microphone arrays; Beamforming
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Quality & Reliability (AREA)
  • Signal Processing (AREA)
  • Mechanical Engineering (AREA)
  • Circuit For Audible Band Transducer (AREA)
DK20201839.6T 2020-03-17 2020-10-14 Voice recognition method, voice recognition apparatus, electronic device and computer-readable storage medium DK3882914T3 (da)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010185078.0A CN111402868B (zh) 2020-03-17 2020-03-17 Speech recognition method and apparatus, electronic device, and computer-readable storage medium

Publications (1)

Publication Number Publication Date
DK3882914T3 (da) 2022-09-05

Family

ID=71430911

Family Applications (1)

Application Number Title Priority Date Filing Date
DK20201839.6T DK3882914T3 (da) Voice recognition method, voice recognition apparatus, electronic device and computer-readable storage medium

Country Status (5)

Country Link
US (1) US20210295857A1 (da)
EP (1) EP3882914B1 (da)
JP (1) JP7209674B2 (da)
CN (1) CN111402868B (da)
DK (1) DK3882914T3 (da)

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
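CN114303188A (zh) * 2019-08-30 2022-04-08 杜比实验室特许公司 Pre-conditioning audio for machine perception
CN112583970A (zh) * 2020-12-04 2021-03-30 斑马网络技术有限公司 In-vehicle Bluetooth echo cancellation method and apparatus, in-vehicle terminal, and storage medium
CN113364840B (zh) * 2021-05-26 2022-12-23 阿波罗智联(北京)科技有限公司 Time delay estimation method and apparatus for a smart rearview mirror, and electronic device
CN113382081B (zh) * 2021-06-28 2023-04-07 阿波罗智联(北京)科技有限公司 Time delay estimation adjustment method, apparatus, device, and storage medium
CN113674739B (zh) * 2021-07-20 2023-12-19 北京字节跳动网络技术有限公司 Time determination method, apparatus, device, and storage medium
CN114039890B (zh) * 2021-11-04 2023-01-31 国家工业信息安全发展研究中心 Speech recognition latency test method, system, and storage medium
CN117880696A (zh) * 2022-10-12 2024-04-12 广州开得联软件技术有限公司 Audio mixing method and apparatus, computer device, and storage medium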

Family Cites Families (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5761638A (en) * 1995-03-17 1998-06-02 Us West Inc Telephone network apparatus and method using echo delay and attenuation
JP2006157499A (ja) * 2004-11-30 2006-06-15 Matsushita Electric Ind Co Ltd Acoustic echo canceller, hands-free telephone using the same, and acoustic echo cancellation method
US8462936B2 (en) * 2011-02-28 2013-06-11 Qnx Software Systems Limited Adaptive delay compensation for acoustic echo cancellation
CN104412323B (zh) * 2012-06-25 2017-12-12 三菱电机株式会社 In-vehicle information device
CN103516921A (zh) * 2012-06-28 2014-01-15 杜比实验室特许公司 Echo control through hidden audio signals
US9497544B2 (en) * 2012-07-02 2016-11-15 Qualcomm Incorporated Systems and methods for surround sound echo reduction
US9628141B2 (en) * 2012-10-23 2017-04-18 Interactive Intelligence Group, Inc. System and method for acoustic echo cancellation
EP3171613A1 (en) * 2015-11-20 2017-05-24 Harman Becker Automotive Systems GmbH Audio enhancement
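CN105847611B (zh) * 2016-03-21 2020-02-11 腾讯科技(深圳)有限公司 Echo delay detection method, echo cancellation chip, and terminal device
CN105872156B (zh) * 2016-05-25 2019-02-12 腾讯科技(深圳)有限公司 Echo delay tracking method and apparatus
WO2018006856A1 (zh) * 2016-07-07 2018-01-11 腾讯科技(深圳)有限公司 Echo cancellation method, terminal, and computer storage medium
CN107689228B (zh) * 2016-08-04 2020-05-12 腾讯科技(深圳)有限公司 Information processing method and terminal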
US20180190306A1 (en) * 2017-01-04 2018-07-05 2236008 Ontario Inc. Voice interface and vocal entertainment system
US10546581B1 (en) * 2017-09-08 2020-01-28 Amazon Technologies, Inc. Synchronization of inbound and outbound audio in a heterogeneous echo cancellation system
CN107610713B (zh) * 2017-10-23 2022-02-01 科大讯飞股份有限公司 Echo cancellation method and apparatus based on time delay estimation
US11238879B2 (en) * 2017-11-02 2022-02-01 Microsemi Semiconductor (U.S.) Inc. Acoustic delay measurement using adaptive filter with programmable delay buffer
US10325613B1 (en) * 2018-07-12 2019-06-18 Microsemi Semiconductor Ulc Acoustic delay estimation
CN110166882B (zh) * 2018-09-29 2021-05-25 腾讯科技(深圳)有限公司 Far-field sound pickup device and method for collecting human voice signals in a far-field sound pickup device

Also Published As

Publication number Publication date
JP7209674B2 (ja) 2023-01-20
EP3882914A1 (en) 2021-09-22
EP3882914B1 (en) 2022-08-10
CN111402868B (zh) 2023-10-24
US20210295857A1 (en) 2021-09-23
CN111402868A (zh) 2020-07-10
JP2021149086A (ja) 2021-09-27

Similar Documents

Publication Publication Date Title
DK3882914T3 (da) Voice recognition method, voice recognition apparatus, electronic device and computer-readable storage medium
SG11202106622XA (en) Image recognition method and apparatus, electronic device and storage medium
SG11202110565RA (en) Face recognition method and apparatus, electronic device, and storage medium
SG11202010916SA (en) Text recognition method and apparatus, electronic device and storage medium
SG11202006192YA (en) Face recognition method and apparatus, electronic device, and storage medium
SG11202109192QA (en) Interaction method and apparatus, electronic device and storage medium
SG11202105174XA (en) Text sequence recognition method and apparatus, electronic device, and storage medium
EP3828885C0 (en) METHOD AND DEVICE FOR SPEAKING, COMPUTER DEVICE AND COMPUTER-READABLE STORAGE MEDIUM
EP3584786A4 (en) VOICE RECOGNITION METHOD, ELECTRONIC DEVICE, AND COMPUTER STORAGE MEDIUM
EP3979122A4 (en) BEHAVIOR PREDICTION METHOD AND APPARATUS, GEAR RECOGNITION METHOD AND APPARATUS, ELECTRONIC DEVICE AND COMPUTER READABLE STORAGE MEDIUM
EP4123444A4 (en) METHOD AND DEVICE FOR PROCESSING VOICE INFORMATION AS WELL AS STORAGE MEDIUM AND ELECTRONIC DEVICE
EP4191996A4 (en) PHOTOGRAPHY METHOD AND APPARATUS, ELECTRONIC DEVICE AND COMPUTER-READABLE STORAGE MEDIUM
EP3920183A4 (en) SPEECH DATA PROCESSING METHOD AND DEVICE, ELECTRONIC DEVICE AND READABLE STORAGE MEDIUM
SG10202107630YA (en) Road information processing method and apparatus, electronic device and storage medium
EP4113961A4 (en) VOICE CALL METHOD AND APPARATUS, ELECTRONIC DEVICE AND COMPUTER-READABLE STORAGE MEDIUM
SG11201913925YA (en) Question and answer data processing method and apparatus, computer device, and storage medium
EP4273745A4 (en) GESTURE RECOGNITION METHOD AND APPARATUS, ELECTRONIC DEVICE, READABLE STORAGE MEDIUM AND CHIP
EP4277287A4 (en) MULTIMEDIA INFORMATION PROCESSING METHOD AND APPARATUS, ELECTRONIC DEVICE AND STORAGE MEDIUM
EP4273742A4 (en) HANDWRITING RECOGNITION METHOD AND APPARATUS, ELECTRONIC DEVICE AND MEDIUM
SG11202106254TA (en) Data processing method and apparatus, electronic device, and storage medium
EP4273698A4 (en) INFORMATION PROCESSING METHOD AND APPARATUS, DEVICE AND RECORDING MEDIUM
SG11202012467QA (en) Information processing method and apparatus, electronic device, and storage medium
SG11202109528SA (en) Data processing method and apparatus, electronic device and storage medium
EP3979129A4 (en) OBJECT RECOGNITION METHOD AND DEVICE, AND ELECTRONIC DEVICE AND STORAGE MEDIA
SG10202003292XA (en) Matching method and apparatus, electronic device, computer-readable storage medium, and computer program