CN114072875A - 一种语音信号处理方法及其相关设备 - Google Patents

一种语音信号处理方法及其相关设备 Download PDF

Info

Publication number
CN114072875A
CN114072875A CN202080026583.9A CN202080026583A CN114072875A CN 114072875 A CN114072875 A CN 114072875A CN 202080026583 A CN202080026583 A CN 202080026583A CN 114072875 A CN114072875 A CN 114072875A
Authority
CN
China
Prior art keywords
signal
user
voice
sensor
vibration
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202080026583.9A
Other languages
English (en)
Inventor
张立斌
杨晖
方舒
董思维
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huawei Technologies Co Ltd
Original Assignee
Huawei Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huawei Technologies Co Ltd filed Critical Huawei Technologies Co Ltd
Publication of CN114072875A publication Critical patent/CN114072875A/zh
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/16Human faces, e.g. facial parts, sketches or expressions
    • G06V40/168Feature extraction; Face representation
    • G06V40/171Local features and components; Facial parts ; Occluding parts, e.g. glasses; Geometrical relationships
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/24Speech recognition using non-acoustical features
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification techniques
    • G10L17/06Decision making techniques; Pattern matching strategies
    • G10L17/10Multimodal systems, i.e. based on the integration of multiple recognition engines or fusion of expert systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/16Speech classification or search using artificial neural networks
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification techniques
    • G10L17/18Artificial neural networks; Connectionist approaches

Landscapes

  • Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Oral & Maxillofacial Surgery (AREA)
  • General Health & Medical Sciences (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Game Theory and Decision Science (AREA)
  • Business, Economics & Management (AREA)
  • Quality & Reliability (AREA)
  • Signal Processing (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

一种语音信号处理方法及其相关设备,该方法可应用于音频领域,包括:获取传感器采集的用户语音信号;获取所述用户发出所述语音时对应的振动信号;其中所述振动信号用于表示所述用户的身体部位的振动特征;所述身体部位为当所述用户处于发声状态下,基于发声行为进行相应振动的部位;根据所述振动信号和所述传感器采集的用户语音信号,获得目标语音信息。本申请将振动信号作为语音识别的依据,由于振动信号没有包含复杂的声学传输时混入的外界非用户的语音,受其他环境噪声的影响很小(例如混响影响),因此可以相对较好的抑制住这部分噪声干扰,可以实现更好的语音识别效果。

Description

PCT国内申请,说明书已公开。

Claims (37)

  1. PCT国内申请,权利要求书已公开。
CN202080026583.9A 2020-05-29 2020-05-29 一种语音信号处理方法及其相关设备 Pending CN114072875A (zh)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2020/093523 WO2021237740A1 (zh) 2020-05-29 2020-05-29 一种语音信号处理方法及其相关设备

Publications (1)

Publication Number Publication Date
CN114072875A true CN114072875A (zh) 2022-02-18

Family

ID=78745413

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202080026583.9A Pending CN114072875A (zh) 2020-05-29 2020-05-29 一种语音信号处理方法及其相关设备

Country Status (4)

Country Link
US (1) US20230098678A1 (zh)
EP (1) EP4141867A4 (zh)
CN (1) CN114072875A (zh)
WO (1) WO2021237740A1 (zh)

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2010217453A (ja) * 2009-03-16 2010-09-30 Fujitsu Ltd 音声認識用マイクロホンシステム
CN101947152B (zh) * 2010-09-11 2012-09-05 山东科技大学 仿人形义肢的脑电-语音控制系统及工作方法
CN103871419B (zh) * 2012-12-11 2017-05-24 联想(北京)有限公司 一种信息处理方法及电子设备
US10635800B2 (en) * 2016-06-07 2020-04-28 Vocalzoom Systems Ltd. System, device, and method of voice-based user authentication utilizing a challenge
US10573323B2 (en) * 2017-12-26 2020-02-25 Intel Corporation Speaker recognition based on vibration signals
CN110248281A (zh) * 2018-03-07 2019-09-17 四川语文通科技有限责任公司 在有干扰的环境中独立出自己发声的方法之声带振动匹配
EP3582514B1 (en) * 2018-06-14 2023-01-11 Oticon A/s Sound processing apparatus
EP3618457A1 (en) * 2018-09-02 2020-03-04 Oticon A/s A hearing device configured to utilize non-audio information to process audio signals
CN110931031A (zh) * 2019-10-09 2020-03-27 大象声科(深圳)科技有限公司 一种融合骨振动传感器和麦克风信号的深度学习语音提取和降噪方法

Also Published As

Publication number Publication date
EP4141867A1 (en) 2023-03-01
EP4141867A4 (en) 2023-06-14
US20230098678A1 (en) 2023-03-30
WO2021237740A1 (zh) 2021-12-02

Similar Documents

Publication Publication Date Title
US20210304735A1 (en) Keyword detection method and related apparatus
US20220165288A1 (en) Audio signal processing method and apparatus, electronic device, and storage medium
US20220172737A1 (en) Speech signal processing method and speech separation method
WO2021249053A1 (zh) 图像处理的方法及相关装置
CN111063342B (zh) 语音识别方法、装置、计算机设备及存储介质
US20190147875A1 (en) Continuous topic detection and adaption in audio environments
WO2022156654A1 (zh) 一种文本数据处理方法及装置
CN111696570B (zh) 语音信号处理方法、装置、设备及存储介质
CN113763532B (zh) 基于三维虚拟对象的人机交互方法、装置、设备及介质
CN113539290B (zh) 语音降噪方法和装置
WO2022033556A1 (zh) 电子设备及其语音识别方法和介质
CN111863020B (zh) 语音信号处理方法、装置、设备及存储介质
CN113705665B (zh) 图像变换网络模型的训练方法和电子设备
CN114242037A (zh) 一种虚拟人物生成方法及其装置
WO2021203880A1 (zh) 一种语音增强方法、训练神经网络的方法以及相关设备
CN113750523A (zh) 三维虚拟对象的动作生成方法、装置、设备及存储介质
CN116861850A (zh) 一种数据处理方法及其装置
CN113646838B (zh) 在视频聊天过程中提供情绪修改的方法和系统
CN113611318A (zh) 一种音频数据增强方法及相关设备
US20230334907A1 (en) Emotion Detection
CN115620728B (zh) 音频处理方法、装置、存储介质及智能眼镜
EP4141867A1 (en) Voice signal processing method and related device therefor
WO2022143314A1 (zh) 一种对象注册方法及装置
CN112750449A (zh) 回声消除方法、装置、终端、服务器及存储介质
WO2022253053A1 (zh) 一种播放视频的方法及装置

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination