SG11202003722SA - Speaker separation model training method, two-speaker separation method and computing device - Google Patents

Speaker separation model training method, two-speaker separation method and computing device

Info

Publication number
SG11202003722SA
SG11202003722SA SG11202003722SA SG11202003722SA SG11202003722SA SG 11202003722S A SG11202003722S A SG 11202003722SA SG 11202003722S A SG11202003722S A SG 11202003722SA SG 11202003722S A SG11202003722S A SG 11202003722SA SG 11202003722S A SG11202003722S A SG 11202003722SA
Authority
SG
Singapore
Prior art keywords
speaker separation
computing device
model training
speaker
training method
Prior art date
Application number
SG11202003722SA
Inventor
Feng Zhao
Jianzong Wang
Jing Xiao
Original Assignee
Ping An Technology Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ping An Technology Shenzhen Co Ltd filed Critical Ping An Technology Shenzhen Co Ltd
Publication of SG11202003722SA publication Critical patent/SG11202003722SA/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification
    • G10L17/04Training, enrolment or model building
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification
    • G10L17/06Decision making techniques; Pattern matching strategies
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification
    • G10L17/18Artificial neural networks; Connectionist approaches
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0272Voice signal separating
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
SG11202003722SA 2018-05-28 2018-08-13 Speaker separation model training method, two-speaker separation method and computing device SG11202003722SA (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201810519521.6A CN108766440B (en) 2018-05-28 2018-05-28 Speaker separation model training method, two-speaker separation method and related equipment
PCT/CN2018/100174 WO2019227672A1 (en) 2018-05-28 2018-08-13 Voice separation model training method, two-speaker separation method and associated apparatus

Publications (1)

Publication Number Publication Date
SG11202003722SA true SG11202003722SA (en) 2020-12-30

Family

ID=64006219

Family Applications (1)

Application Number Title Priority Date Filing Date
SG11202003722SA SG11202003722SA (en) 2018-05-28 2018-08-13 Speaker separation model training method, two-speaker separation method and computing device

Country Status (5)

Country Link
US (1) US11158324B2 (en)
JP (1) JP2020527248A (en)
CN (1) CN108766440B (en)
SG (1) SG11202003722SA (en)
WO (1) WO2019227672A1 (en)

Families Citing this family (47)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109545186B (en) * 2018-12-16 2022-05-27 魔门塔(苏州)科技有限公司 Speech recognition training system and method
CN109686382A (en) * 2018-12-29 2019-04-26 平安科技(深圳)有限公司 A kind of speaker clustering method and device
CN110197665B (en) * 2019-06-25 2021-07-09 广东工业大学 Voice separation and tracking method for public security criminal investigation monitoring
CN110444223B (en) * 2019-06-26 2023-05-23 平安科技(深圳)有限公司 Speaker separation method and device based on cyclic neural network and acoustic characteristics
CN110289002B (en) * 2019-06-28 2021-04-27 四川长虹电器股份有限公司 End-to-end speaker clustering method and system
CN110390946A (en) * 2019-07-26 2019-10-29 龙马智芯(珠海横琴)科技有限公司 A kind of audio signal processing method, device, electronic equipment and storage medium
CN110718228B (en) * 2019-10-22 2022-04-12 中信银行股份有限公司 Voice separation method and device, electronic equipment and computer readable storage medium
CN111312256A (en) * 2019-10-31 2020-06-19 平安科技(深圳)有限公司 Voice identity recognition method and device and computer equipment
CN110853618B (en) * 2019-11-19 2022-08-19 腾讯科技(深圳)有限公司 Language identification method, model training method, device and equipment
CN110992940B (en) 2019-11-25 2021-06-15 百度在线网络技术(北京)有限公司 Voice interaction method, device, equipment and computer-readable storage medium
CN110992967A (en) * 2019-12-27 2020-04-10 苏州思必驰信息科技有限公司 Voice signal processing method and device, hearing aid and storage medium
CN111145761B (en) * 2019-12-27 2022-05-24 携程计算机技术(上海)有限公司 Model training method, voiceprint confirmation method, system, device and medium
CN111191787B (en) * 2019-12-30 2022-07-15 思必驰科技股份有限公司 Training method and device of neural network for extracting speaker embedded features
CN111370032B (en) * 2020-02-20 2023-02-14 厦门快商通科技股份有限公司 Voice separation method, system, mobile terminal and storage medium
JP7359028B2 (en) * 2020-02-21 2023-10-11 日本電信電話株式会社 Learning devices, learning methods, and learning programs
CN111370019B (en) * 2020-03-02 2023-08-29 字节跳动有限公司 Sound source separation method and device, and neural network model training method and device
CN111009258A (en) * 2020-03-11 2020-04-14 浙江百应科技有限公司 Single sound channel speaker separation model, training method and separation method
US11392639B2 (en) * 2020-03-31 2022-07-19 Uniphore Software Systems, Inc. Method and apparatus for automatic speaker diarization
CN111477240B (en) * 2020-04-07 2023-04-07 浙江同花顺智能科技有限公司 Audio processing method, device, equipment and storage medium
CN111524521B (en) * 2020-04-22 2023-08-08 北京小米松果电子有限公司 Voiceprint extraction model training method, voiceprint recognition method, voiceprint extraction model training device and voiceprint recognition device
CN111524527B (en) * 2020-04-30 2023-08-22 合肥讯飞数码科技有限公司 Speaker separation method, speaker separation device, electronic device and storage medium
CN111613249A (en) * 2020-05-22 2020-09-01 云知声智能科技股份有限公司 Voice analysis method and equipment
CN111640438B (en) * 2020-05-26 2023-09-05 同盾控股有限公司 Audio data processing method and device, storage medium and electronic equipment
CN111680631B (en) * 2020-06-09 2023-12-22 广州视源电子科技股份有限公司 Model training method and device
CN111785291A (en) * 2020-07-02 2020-10-16 北京捷通华声科技股份有限公司 Voice separation method and voice separation device
CN111933153B (en) * 2020-07-07 2024-03-08 北京捷通华声科技股份有限公司 Voice segmentation point determining method and device
CN111985934A (en) * 2020-07-30 2020-11-24 浙江百世技术有限公司 Intelligent customer service dialogue model construction method and application
CN111899755A (en) * 2020-08-11 2020-11-06 华院数据技术(上海)有限公司 Speaker voice separation method and related equipment
CN112071330B (en) * 2020-09-16 2022-09-20 腾讯科技(深圳)有限公司 Audio data processing method and device and computer readable storage medium
CN112071329B (en) * 2020-09-16 2022-09-16 腾讯科技(深圳)有限公司 Multi-person voice separation method and device, electronic equipment and storage medium
CN112489682B (en) * 2020-11-25 2023-05-23 平安科技(深圳)有限公司 Audio processing method, device, electronic equipment and storage medium
CN112700766B (en) * 2020-12-23 2024-03-19 北京猿力未来科技有限公司 Training method and device of voice recognition model, and voice recognition method and device
CN112289323B (en) * 2020-12-29 2021-05-28 深圳追一科技有限公司 Voice data processing method and device, computer equipment and storage medium
CN112820292B (en) * 2020-12-29 2023-07-18 平安银行股份有限公司 Method, device, electronic device and storage medium for generating meeting summary
AU2021203544A1 (en) * 2020-12-31 2022-07-14 Sensetime International Pte. Ltd. Methods and apparatuses for training neural network, and methods and apparatuses for detecting correlated objects
KR20220115453A (en) * 2021-02-10 2022-08-17 삼성전자주식회사 Electronic device supporting improved voice activity detection
KR20220136750A (en) * 2021-04-01 2022-10-11 삼성전자주식회사 Electronic apparatus for processing user utterance and controlling method thereof
KR20220169242A (en) * 2021-06-18 2022-12-27 삼성전자주식회사 Electronic devcie and method for personalized audio processing of the electronic device
WO2023281717A1 (en) * 2021-07-08 2023-01-12 日本電信電話株式会社 Speaker diarization method, speaker diarization device, and speaker diarization program
CN113362831A (en) * 2021-07-12 2021-09-07 科大讯飞股份有限公司 Speaker separation method and related equipment thereof
CN113571085B (en) * 2021-07-24 2023-09-22 平安科技(深圳)有限公司 Voice separation method, system, device and storage medium
CN113657289B (en) * 2021-08-19 2023-08-08 北京百度网讯科技有限公司 Training method and device of threshold estimation model and electronic equipment
WO2023047475A1 (en) * 2021-09-21 2023-03-30 日本電信電話株式会社 Estimation device, estimation method, and estimation program
CN114363531B (en) * 2022-01-14 2023-08-01 中国平安人寿保险股份有限公司 H5-based text description video generation method, device, equipment and medium
CN115171716B (en) * 2022-06-14 2024-04-19 武汉大学 Continuous voice separation method and system based on spatial feature clustering and electronic equipment
CN115659162B (en) * 2022-09-15 2023-10-03 云南财经大学 Method, system and equipment for extracting intra-pulse characteristics of radar radiation source signals
CN117037255A (en) * 2023-08-22 2023-11-10 北京中科深智科技有限公司 3D expression synthesis method based on directed graph

Family Cites Families (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0272398A (en) * 1988-09-07 1990-03-12 Hitachi Ltd Preprocessor for speech signal
KR100612840B1 (en) * 2004-02-18 2006-08-18 삼성전자주식회사 Speaker clustering method and speaker adaptation method based on model transformation, and apparatus using the same
JP2008051907A (en) * 2006-08-22 2008-03-06 Toshiba Corp Utterance section identification apparatus and method
WO2016095218A1 (en) * 2014-12-19 2016-06-23 Dolby Laboratories Licensing Corporation Speaker identification using spatial information
JP6430318B2 (en) * 2015-04-06 2018-11-28 日本電信電話株式会社 Unauthorized voice input determination device, method and program
CN106683661B (en) * 2015-11-05 2021-02-05 阿里巴巴集团控股有限公司 Role separation method and device based on voice
JP2017120595A (en) * 2015-12-29 2017-07-06 花王株式会社 Method for evaluating state of application of cosmetics
WO2018013200A1 (en) 2016-07-14 2018-01-18 Magic Leap, Inc. Deep neural network for iris identification
US9824692B1 (en) * 2016-09-12 2017-11-21 Pindrop Security, Inc. End-to-end speaker recognition using deep neural network
WO2018069974A1 (en) * 2016-10-11 2018-04-19 エスゼット ディージェイアイ テクノロジー カンパニー リミテッド Image capturing device, image capturing system, mobile body, method, and program
US10497382B2 (en) * 2016-12-16 2019-12-03 Google Llc Associating faces with voices for speaker diarization within videos
CN107180628A (en) * 2017-05-19 2017-09-19 百度在线网络技术(北京)有限公司 Set up the method, the method for extracting acoustic feature, device of acoustic feature extraction model
CN107221320A (en) * 2017-05-19 2017-09-29 百度在线网络技术(北京)有限公司 Train method, device, equipment and the computer-readable storage medium of acoustic feature extraction model
CN107342077A (en) * 2017-05-27 2017-11-10 国家计算机网络与信息安全管理中心 A kind of speaker segmentation clustering method and system based on factorial analysis
CN107680611B (en) 2017-09-13 2020-06-16 电子科技大学 Single-channel sound separation method based on convolutional neural network
US10529349B2 (en) * 2018-04-16 2020-01-07 Mitsubishi Electric Research Laboratories, Inc. Methods and systems for end-to-end speech separation with unfolded iterative phase reconstruction
US10782986B2 (en) * 2018-04-20 2020-09-22 Facebook, Inc. Assisting users with personalized and contextual communication content

Also Published As

Publication number Publication date
US11158324B2 (en) 2021-10-26
CN108766440A (en) 2018-11-06
JP2020527248A (en) 2020-09-03
WO2019227672A1 (en) 2019-12-05
US20200234717A1 (en) 2020-07-23
CN108766440B (en) 2020-01-14

Similar Documents

Publication Publication Date Title
SG11202003722SA (en) Speaker separation model training method, two-speaker separation method and computing device
EP3742436A4 (en) Voice synthesis method, model training method, device and computer device
EP3690763A4 (en) Machine learning model training method and device, and electronic device
EP3683725A4 (en) Abstract description generation method, abstract description model training method and computer device
EP3611725A4 (en) Voice signal processing model training method, electronic device, and storage medium
EP3633610A4 (en) Learning device, learning method, learning model, estimation device, and grip system
EP3537349A4 (en) Machine learning model training method and device
EP3582118A4 (en) Method and apparatus for training classification model
EP3716156A4 (en) Neural network model training method and apparatus
SG11202000749RA (en) Model training method and apparatus
EP3579153A4 (en) Learned model provision method and learned model provision device
EP3872705A4 (en) Detection model training method and apparatus and terminal device
EP3633549A4 (en) Facial detection training method, apparatus and electronic device
EP3503980A4 (en) Exercise system and method
EP3611657A4 (en) Model training method and method, apparatus, and device for determining data similarity
EP3399426A4 (en) Method and device for training model in distributed system
EP3179473A4 (en) Training method and apparatus for language model, and device
EP3951646A4 (en) Image recognition network model training method, image recognition method and device
EP3648044A4 (en) Method, apparatus, and device for training risk control model and risk control
EP3540652A4 (en) Method, device, chip and system for training neural network model
EP3440659A4 (en) Cpr training system and method
EP3579169A4 (en) Learned model provision method, and learned model provision device
EP3346394A4 (en) Question answering system training device and computer program therefor
EP3678072A4 (en) Model integration method and device
EP3633553A4 (en) Method, device and apparatus for training object detection model