SG11202103091XA - A Speaker Recognition Method Based on Trajectories in Feature Spaces of Voice Samples - Google Patents

A Speaker Recognition Method Based on Trajectories in Feature Spaces of Voice Samples

Info

Publication number
SG11202103091XA
Authority
SG
Singapore
Prior art keywords
trajectories
method based
recognition method
speaker recognition
voice samples
Prior art date
Application number
SG11202103091XA
Inventor
Qianhua He
Keqian Wu
Wei Xie
Wenfeng Pang
Original Assignee
Univ South China Tech
Priority date
Filing date
Publication date
Application filed by Univ South China Tech
Publication of SG11202103091XA

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00 Speaker identification or verification techniques
    • G10L17/04 Training, enrolment or model building
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00 Speaker identification or verification techniques
    • G10L17/02 Preprocessing operations, e.g. segment selection; Pattern representation or modelling, e.g. based on linear discriminant analysis [LDA] or principal components; Feature selection or extraction
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00 Speaker identification or verification techniques
    • G10L17/06 Decision making techniques; Pattern matching strategies
    • G10L17/08 Use of distortion metrics or a particular distance between probe pattern and reference templates

Landscapes

  • Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Business, Economics & Management (AREA)
  • Game Theory and Decision Science (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Image Analysis (AREA)
SG11202103091XA 2019-01-11 2019-10-16 A Speaker Recognition Method Based on Trajectories in Feature Spaces of Voice Samples SG11202103091XA (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201910027145.3A CN109545229B (en) 2019-01-11 2019-01-11 Speaker recognition method based on voice sample characteristic space track
PCT/CN2019/111530 WO2020143263A1 (en) 2019-01-11 2019-10-16 Speaker identification method based on speech sample feature space trajectory

Publications (1)

Publication Number Publication Date
SG11202103091XA (en) 2021-04-29

Family

ID=65835222

Family Applications (1)

Application Number Title Priority Date Filing Date
SG11202103091XA SG11202103091XA (en) 2019-01-11 2019-10-16 A Speaker Recognition Method Based on Trajectories in Feature Spaces of Voice Samples

Country Status (3)

Country Link
CN (1) CN109545229B (en)
SG (1) SG11202103091XA (en)
WO (1) WO2020143263A1 (en)

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109545229B (en) * 2019-01-11 2023-04-21 华南理工大学 Speaker recognition method based on voice sample characteristic space track
CN111081261B (en) * 2019-12-25 2023-04-21 华南理工大学 Text-independent voiceprint recognition method based on LDA
CN111128128B (en) * 2019-12-26 2023-05-23 华南理工大学 Voice keyword detection method based on complementary model scoring fusion
CN111933156B (en) * 2020-09-25 2021-01-19 广州佰锐网络科技有限公司 High-fidelity audio processing method and device based on multiple feature recognition
CN112487978B (en) * 2020-11-30 2024-04-16 清华珠三角研究院 Method and device for positioning speaker in video and computer storage medium
CN113611285B (en) * 2021-09-03 2023-11-24 哈尔滨理工大学 Language identification method based on stacked bidirectional time sequence pooling
CN117235435B (en) * 2023-11-15 2024-02-20 世优(北京)科技有限公司 Method and device for determining audio signal loss function

Family Cites Families (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5598507A (en) * 1994-04-12 1997-01-28 Xerox Corporation Method of speaker clustering for unknown speakers in conversational audio data
US6067517A (en) * 1996-02-02 2000-05-23 International Business Machines Corporation Transcription of speech data with segments from acoustically dissimilar environments
CN1302456C (en) * 2005-04-01 2007-02-28 郑方 Sound veins identifying method
JP4901657B2 (en) * 2007-09-05 2012-03-21 日本電信電話株式会社 Voice recognition apparatus, method thereof, program thereof, and recording medium
CN102024455B (en) * 2009-09-10 2014-09-17 索尼株式会社 Speaker recognition system and method
CN102479511A (en) * 2010-11-23 2012-05-30 盛乐信息技术(上海)有限公司 Large-scale voiceprint authentication method and system
CN105845141A (en) * 2016-03-23 2016-08-10 广州势必可赢网络科技有限公司 Speaker confirmation model, speaker confirmation method and speaker confirmation device based on channel robustness
US10637898B2 (en) * 2017-05-24 2020-04-28 AffectLayer, Inc. Automatic speaker identification in calls
CN109065028B (en) * 2018-06-11 2022-12-30 平安科技(深圳)有限公司 Speaker clustering method, speaker clustering device, computer equipment and storage medium
CN109065059A (en) * 2018-09-26 2018-12-21 新巴特(安徽)智能科技有限公司 The method for identifying speaker with the voice cluster that audio frequency characteristics principal component is established
CN109545229B (en) * 2019-01-11 2023-04-21 华南理工大学 Speaker recognition method based on voice sample characteristic space track

Also Published As

Publication number Publication date
CN109545229A (en) 2019-03-29
CN109545229B (en) 2023-04-21
WO2020143263A1 (en) 2020-07-16

Similar Documents

Publication Publication Date Title
SG11202103091XA (en) A Speaker Recognition Method Based on Trajectories in Feature Spaces of Voice Samples
EP3479376A4 (en) Speech recognition method and apparatus based on speaker recognition
EP3844750A4 (en) Home appliance and method for voice recognition thereof
EP3997694A4 (en) Systems and methods for recognizing and performing voice commands during advertisement
EP3504703A4 (en) A speech recognition method and apparatus
EP3752957A4 (en) System and method for speech understanding via integrated audio and visual based speech recognition
EP3767619A4 (en) Speech recognition and speech recognition model training method and apparatus
GB2573809B (en) Speaker Recognition
EP3501023A4 (en) Speech recognition method and apparatus
EP3888084A4 (en) Method and device for providing voice recognition service
EP4083999A4 (en) Voice recognition method and related product
EP3480816A4 (en) Method for voice recognition and electronic device for performing same
EP3533052A4 (en) Speech recognition method and apparatus
EP3701521A4 (en) Voice recognition apparatus and operation method thereof cross-reference to related application
EP3850622A4 (en) Method and device for speech recognition
EP3891729A4 (en) Method and apparatus for performing speech recognition with wake on voice
EP3869509A4 (en) Voice recognition device and method
GB202306034D0 (en) Improving speech recognition transcriptions
EP3982359A4 (en) Electronic device and method for recognizing voice by same
EP4026121A4 (en) Speech recognition systems and methods
GB2604675B (en) Improving speech recognition transcriptions
EP3975172A4 (en) Voiceprint recognition method, and device
GB201909353D0 (en) Pre-processing for automatic speech recognition
GB202117611D0 (en) Systems and methods for speech recognition
EP3686882A4 (en) Method for training filter model and speech recognition method