CN115035887A - Voice signal processing method, apparatus, device and medium - Google Patents

Voice signal processing method, apparatus, device and medium

Info

Publication number
CN115035887A
Authority
CN
China
Prior art keywords
voice
mixing
features
layer
convolution
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202210560595.0A
Other languages
English (en)
Chinese (zh)
Inventor
王炳乾
宿绍勋
夏友祥
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
BOE Technology Group Co Ltd
Original Assignee
BOE Technology Group Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2022-05-20
Filing date
2022-05-20
Publication date
2022-09-09
Application filed by BOE Technology Group Co Ltd
Priority to CN202210560595.0A
Publication of CN115035887A
Priority to PCT/CN2023/094965 (WO2023222071A1)
Current legal status: Pending

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/02 Feature extraction for speech recognition; Selection of recognition unit
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/18 Speech classification or search using natural language modelling
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Artificial Intelligence (AREA)
  • Telephonic Communication Services (AREA)
CN202210560595.0A 2022-05-20 2022-05-20 Voice signal processing method, apparatus, device and medium Pending CN115035887A (zh)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN202210560595.0A CN115035887A (zh) 2022-05-20 Voice signal processing method, apparatus, device and medium
PCT/CN2023/094965 WO2023222071A1 (fr) 2022-05-20 2023-05-18 Voice signal processing method and apparatus, and device and medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210560595.0A CN115035887A (zh) 2022-05-20 Voice signal processing method, apparatus, device and medium

Publications (1)

Publication Number Publication Date
CN115035887A (zh) 2022-09-09

Family

ID=83120469

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210560595.0A Pending CN115035887A (zh) 2022-05-20 2022-05-20 Voice signal processing method, apparatus, device and medium

Country Status (2)

Country Link
CN (1) CN115035887A (fr)
WO (1) WO2023222071A1 (fr)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2023222071A1 (fr) * 2022-05-20 2023-11-23 BOE Technology Group Co Ltd Voice signal processing method and apparatus, and device and medium

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6879952B2 (en) * 2000-04-26 2005-04-12 Microsoft Corporation Sound source separation using convolutional mixing and a priori sound source knowledge
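CN113889091A (zh) * 2021-10-26 2022-01-04 Shenzhen Horizon Robotics Technology Co Ltd Speech recognition method and apparatus, computer-readable storage medium, and electronic device
CN114333782A (zh) * 2022-01-13 2022-04-12 Ping An Technology (Shenzhen) Co Ltd Speech recognition method, apparatus, device and storage medium
CN114446318A (zh) * 2022-02-07 2022-05-06 Beijing Dajia Internet Information Technology Co Ltd Audio data separation method and apparatus, electronic device, and storage medium
CN114399996A (zh) * 2022-03-16 2022-04-26 Alibaba DAMO Academy (Hangzhou) Technology Co Ltd Method, apparatus, storage medium and system for processing voice signals
CN115035887A (zh) * 2022-05-20 2022-09-09 BOE Technology Group Co Ltd Voice signal processing method, apparatus, device and medium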

Also Published As

Publication number Publication date
WO2023222071A1 (fr) 2023-11-23

Similar Documents

Publication Publication Date Title
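CN110600017B (zh) Training method for speech processing model, speech recognition method, system and apparatus
CN110503971A (zh) Neural network-based time-frequency mask estimation and beamforming for speech processing
WO2021135577A1 (fr) Audio signal processing method and apparatus, electronic device, and storage medium
CN112185352B (zh) Speech recognition method and apparatus, and electronic device
CN108461081B (zh) Voice control method, apparatus, device and storage medium
CN108564965B (zh) Noise-robust speech recognition system
CN112037822B (zh) Speech emotion recognition method based on ICNN and Bi-LSTM
CN112071322A (zh) End-to-end voiceprint recognition method, apparatus, storage medium and device
CN113555032B (zh) Multi-speaker scene recognition and network training method and apparatus
CN109063624A (zh) Information processing method, system, electronic device and computer-readable storage medium
WO2023197749A1 (fr) Method and apparatus for determining background music insertion time point, device, and storage medium
WO2023222071A1 (fr) Voice signal processing method and apparatus, and device and medium
CN110136726A (zh) Voice gender estimation method, apparatus, system and storage medium
CN115602165A (zh) Digital employee intelligent system based on a financial system
CN117059068A (zh) Speech processing method, apparatus, storage medium and computer device
CN113053361B (zh) Speech recognition method, model training method, apparatus, device and medium
CN117496990A (zh) Speech denoising method, apparatus, computer device and storage medium
CN116631380A (zh) Audio-video multimodal keyword wake-up method and apparatus
CN115938364A (zh) Intelligent recognition control method, terminal device and readable storage medium
CN116741159A (zh) Audio classification and model training method and apparatus, electronic device and storage medium
CN116312559A (zh) Training method for cross-channel voiceprint recognition model, voiceprint recognition method and apparatus
CN113782005B (zh) Speech recognition method and apparatus, storage medium and electronic device
CN117063229A (zh) Interactive voice signal processing method, related device and system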
Li RETRACTED ARTICLE: Speech-assisted intelligent software architecture based on deep game neural network
CN116959421B (zh) Method and apparatus for processing audio data, audio data processing device and medium

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination