KR102313387B9 - Method and Apparatus for Separating Speaker Based on Machine Learning - Google Patents

Method and Apparatus for Separating Speaker Based on Machine Learning

Info

Publication number
KR102313387B9
KR102313387B9 KR1020190141938A KR20190141938A KR102313387B9 KR 102313387 B9 KR102313387 B9 KR 102313387B9 KR 1020190141938 A KR1020190141938 A KR 1020190141938A KR 20190141938 A KR20190141938 A KR 20190141938A KR 102313387 B9 KR102313387 B9 KR 102313387B9
Authority
KR
South Korea
Prior art keywords
machine learning
speaker based
separating
separating speaker
learning
Prior art date
Application number
KR1020190141938A
Other languages
Korean (ko)
Other versions
KR20210055464A (en
KR102313387B1 (en
Inventor
조성배
김진영
Original Assignee
연세대학교 산학협력단
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 연세대학교 산학협력단 filed Critical 연세대학교 산학협력단
Priority to KR1020190141938A priority Critical patent/KR102313387B1/en
Publication of KR20210055464A publication Critical patent/KR20210055464A/en
Application granted granted Critical
Publication of KR102313387B1 publication Critical patent/KR102313387B1/en
Publication of KR102313387B9 publication Critical patent/KR102313387B9/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification techniques
    • G10L17/02Preprocessing operations, e.g. segment selection; Pattern representation or modelling, e.g. based on linear discriminant analysis [LDA] or principal components; Feature selection or extraction
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/088Non-supervised learning, e.g. competitive learning
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification techniques
    • G10L17/04Training, enrolment or model building
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0272Voice signal separating
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Multimedia (AREA)
  • Acoustics & Sound (AREA)
  • Human Computer Interaction (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Theoretical Computer Science (AREA)
  • Signal Processing (AREA)
  • Biophysics (AREA)
  • Data Mining & Analysis (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • General Health & Medical Sciences (AREA)
  • Evolutionary Computation (AREA)
  • Molecular Biology (AREA)
  • Biomedical Technology (AREA)
  • Artificial Intelligence (AREA)
  • Quality & Reliability (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Machine Translation (AREA)
  • Telephonic Communication Services (AREA)
KR1020190141938A 2019-11-07 2019-11-07 Method and Apparatus for Separating Speaker Based on Machine Learning KR102313387B1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
KR1020190141938A KR102313387B1 (en) 2019-11-07 2019-11-07 Method and Apparatus for Separating Speaker Based on Machine Learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
KR1020190141938A KR102313387B1 (en) 2019-11-07 2019-11-07 Method and Apparatus for Separating Speaker Based on Machine Learning

Publications (3)

Publication Number Publication Date
KR20210055464A KR20210055464A (en) 2021-05-17
KR102313387B1 KR102313387B1 (en) 2021-10-14
KR102313387B9 true KR102313387B9 (en) 2021-11-12

Family

ID=76158155

Family Applications (1)

Application Number Title Priority Date Filing Date
KR1020190141938A KR102313387B1 (en) 2019-11-07 2019-11-07 Method and Apparatus for Separating Speaker Based on Machine Learning

Country Status (1)

Country Link
KR (1) KR102313387B1 (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20220169242A (en) * 2021-06-18 2022-12-27 삼성전자주식회사 Electronic devcie and method for personalized audio processing of the electronic device
CN113707173B (en) * 2021-08-30 2023-12-29 平安科技(深圳)有限公司 Voice separation method, device, equipment and storage medium based on audio segmentation

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP4717872B2 (en) * 2006-12-06 2011-07-06 韓國電子通信研究院 Speaker information acquisition system and method using voice feature information of speaker
KR101178801B1 (en) * 2008-12-09 2012-08-31 한국전자통신연구원 Apparatus and method for speech recognition by using source separation and source identification
KR101304127B1 (en) * 2011-12-19 2013-09-05 세종대학교산학협력단 Apparatus and method for recognizing of speaker using vocal signal
KR101304112B1 (en) * 2011-12-27 2013-09-05 현대캐피탈 주식회사 Real time speaker recognition system and method using voice separation
KR101616112B1 (en) * 2014-07-28 2016-04-27 (주)복스유니버스 Speaker separation system and method using voice feature vectors

Also Published As

Publication number Publication date
KR20210055464A (en) 2021-05-17
KR102313387B1 (en) 2021-10-14

Similar Documents

Publication Publication Date Title
SG11202107662TA (en) Apparatus and method for cybersecurity
EP3836037A4 (en) Method and system for executing machine learning process
SG10201908562WA (en) Polishing apparatus, polishing method, and machine learning apparatus
EP3751569A4 (en) Multi-person voice separation method and apparatus
EP3740936A4 (en) Method and apparatus for pose processing
EP3529752A4 (en) Electronic apparatus for operating machine learning and method for operating machine learning
GB2589658B (en) Method and apparatus for running an applet
EP3430526A4 (en) Method and apparatus for training a learning machine
EP4068169A4 (en) Search method for machine learning model and related apparatus and device
GB2584677B (en) Method and apparatus for trajectory-planning
EP3780650C0 (en) Vibration removal apparatus and method for dual-microphone earphones
EP3785179A4 (en) Method and system for performing machine learning
GB2575043B (en) Apparatus and method for mearuring a signal
EP3794505A4 (en) Method and apparatus for image recognition
EP3857468A4 (en) Recommendation method and system and method and system for improving a machine learning system
SG11202101427SA (en) Audio processing method and apparatus
KR102313387B9 (en) Method and Apparatus for Separating Speaker Based on Machine Learning
EP3746942A4 (en) Apparatus and method for object recognition
GB2588496B (en) Recognition apparatus and method
ZA202005594B (en) Method and apparatus for processing legumes
GB201905795D0 (en) Apparatus and method
PL3113180T3 (en) Method for performing audio inpainting on a speech signal and apparatus for performing audio inpainting on a speech signal
GB201817805D0 (en) Slack separation apparatus and method
EP3645178A4 (en) A method and apparatus for sorting
EP3950156A4 (en) Sorting apparatus and sorting method

Legal Events

Date Code Title Description
E902 Notification of reason for refusal
E701 Decision to grant or registration of patent right
GRNT Written decision to grant
G170 Re-publication after modification of scope of protection [patent]