SG11202103091XA - A Speaker Recognition Method Based on Trajectories in Feature Spaces of Voice Samples - Google Patents
A Speaker Recognition Method Based on Trajectories in Feature Spaces of Voice SamplesInfo
- Publication number
- SG11202103091XA SG11202103091XA SG11202103091XA SG11202103091XA SG11202103091XA SG 11202103091X A SG11202103091X A SG 11202103091XA SG 11202103091X A SG11202103091X A SG 11202103091XA SG 11202103091X A SG11202103091X A SG 11202103091XA SG 11202103091X A SG11202103091X A SG 11202103091XA
- Authority
- SG
- Singapore
- Prior art keywords
- trajectories
- method based
- recognition method
- speaker recognition
- voice samples
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
- G10L17/04—Training, enrolment or model building
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
- G10L17/02—Preprocessing operations, e.g. segment selection; Pattern representation or modelling, e.g. based on linear discriminant analysis [LDA] or principal components; Feature selection or extraction
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
- G10L17/06—Decision making techniques; Pattern matching strategies
- G10L17/08—Use of distortion metrics or a particular distance between probe pattern and reference templates
Landscapes
- Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Business, Economics & Management (AREA)
- Game Theory and Decision Science (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Image Analysis (AREA)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910027145.3A CN109545229B (en) | 2019-01-11 | 2019-01-11 | Speaker recognition method based on voice sample characteristic space track |
PCT/CN2019/111530 WO2020143263A1 (en) | 2019-01-11 | 2019-10-16 | Speaker identification method based on speech sample feature space trajectory |
Publications (1)
Publication Number | Publication Date |
---|---|
SG11202103091XA true SG11202103091XA (en) | 2021-04-29 |
Family
ID=65835222
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
SG11202103091XA SG11202103091XA (en) | 2019-01-11 | 2019-10-16 | A Speaker Recognition Method Based on Trajectories in Feature Spaces of Voice Samples |
Country Status (3)
Country | Link |
---|---|
CN (1) | CN109545229B (en) |
SG (1) | SG11202103091XA (en) |
WO (1) | WO2020143263A1 (en) |
Families Citing this family (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109545229B (en) * | 2019-01-11 | 2023-04-21 | 华南理工大学 | Speaker recognition method based on voice sample characteristic space track |
CN111081261B (en) * | 2019-12-25 | 2023-04-21 | 华南理工大学 | Text-independent voiceprint recognition method based on LDA |
CN111128128B (en) * | 2019-12-26 | 2023-05-23 | 华南理工大学 | Voice keyword detection method based on complementary model scoring fusion |
CN111933156B (en) * | 2020-09-25 | 2021-01-19 | 广州佰锐网络科技有限公司 | High-fidelity audio processing method and device based on multiple feature recognition |
CN112487978B (en) * | 2020-11-30 | 2024-04-16 | 清华珠三角研究院 | Method and device for positioning speaker in video and computer storage medium |
CN113611285B (en) * | 2021-09-03 | 2023-11-24 | 哈尔滨理工大学 | Language identification method based on stacked bidirectional time sequence pooling |
CN117235435B (en) * | 2023-11-15 | 2024-02-20 | 世优(北京)科技有限公司 | Method and device for determining audio signal loss function |
Family Cites Families (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5598507A (en) * | 1994-04-12 | 1997-01-28 | Xerox Corporation | Method of speaker clustering for unknown speakers in conversational audio data |
US6067517A (en) * | 1996-02-02 | 2000-05-23 | International Business Machines Corporation | Transcription of speech data with segments from acoustically dissimilar environments |
CN1302456C (en) * | 2005-04-01 | 2007-02-28 | 郑方 | Sound veins identifying method |
JP4901657B2 (en) * | 2007-09-05 | 2012-03-21 | 日本電信電話株式会社 | Voice recognition apparatus, method thereof, program thereof, and recording medium |
CN102024455B (en) * | 2009-09-10 | 2014-09-17 | 索尼株式会社 | Speaker recognition system and method |
CN102479511A (en) * | 2010-11-23 | 2012-05-30 | 盛乐信息技术(上海)有限公司 | Large-scale voiceprint authentication method and system |
CN105845141A (en) * | 2016-03-23 | 2016-08-10 | 广州势必可赢网络科技有限公司 | Speaker confirmation model, speaker confirmation method and speaker confirmation device based on channel robustness |
US10637898B2 (en) * | 2017-05-24 | 2020-04-28 | AffectLayer, Inc. | Automatic speaker identification in calls |
CN109065028B (en) * | 2018-06-11 | 2022-12-30 | 平安科技(深圳)有限公司 | Speaker clustering method, speaker clustering device, computer equipment and storage medium |
CN109065059A (en) * | 2018-09-26 | 2018-12-21 | 新巴特(安徽)智能科技有限公司 | The method for identifying speaker with the voice cluster that audio frequency characteristics principal component is established |
CN109545229B (en) * | 2019-01-11 | 2023-04-21 | 华南理工大学 | Speaker recognition method based on voice sample characteristic space track |
-
2019
- 2019-01-11 CN CN201910027145.3A patent/CN109545229B/en active Active
- 2019-10-16 WO PCT/CN2019/111530 patent/WO2020143263A1/en active Application Filing
- 2019-10-16 SG SG11202103091XA patent/SG11202103091XA/en unknown
Also Published As
Publication number | Publication date |
---|---|
CN109545229A (en) | 2019-03-29 |
CN109545229B (en) | 2023-04-21 |
WO2020143263A1 (en) | 2020-07-16 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
SG11202103091XA (en) | A Speaker Recognition Method Based on Trajectories in Feature Spaces of Voice Samples | |
EP3479376A4 (en) | Speech recognition method and apparatus based on speaker recognition | |
EP3844750A4 (en) | Home appliance and method for voice recognition thereof | |
EP3997694A4 (en) | Systems and methods for recognizing and performing voice commands during advertisement | |
EP3504703A4 (en) | A speech recognition method and apparatus | |
EP3752957A4 (en) | System and method for speech understanding via integrated audio and visual based speech recognition | |
EP3767619A4 (en) | Speech recognition and speech recognition model training method and apparatus | |
GB2573809B (en) | Speaker Recognition | |
EP3501023A4 (en) | Speech recognition method and apparatus | |
EP3888084A4 (en) | Method and device for providing voice recognition service | |
EP4083999A4 (en) | Voice recognition method and related product | |
EP3480816A4 (en) | Method for voice recognition and electronic device for performing same | |
EP3533052A4 (en) | Speech recognition method and apparatus | |
EP3701521A4 (en) | Voice recognition apparatus and operation method thereof cross-reference to related application | |
EP3850622A4 (en) | Method and device for speech recognition | |
EP3891729A4 (en) | Method and apparatus for performing speech recognition with wake on voice | |
EP3869509A4 (en) | Voice recognition device and method | |
GB202306034D0 (en) | Improving speech recognition transcriptions | |
EP3982359A4 (en) | Electronic device and method for recognizing voice by same | |
EP4026121A4 (en) | Speech recognition systems and methods | |
GB2604675B (en) | Improving speech recognition transcriptions | |
EP3975172A4 (en) | Voiceprint recognition method, and device | |
GB201909353D0 (en) | Pre-processing for automatic speech recognition | |
GB202117611D0 (en) | Systems and methods for speech recognition | |
EP3686882A4 (en) | Method for training filter model and speech recognition method |