SG11202006090RA - Voiceprint Recognition Method And Device Based On Memory Bottleneck Feature - Google Patents

Voiceprint Recognition Method And Device Based On Memory Bottleneck Feature

Info

Publication number
SG11202006090RA
SG11202006090RA SG11202006090RA SG11202006090RA SG11202006090RA SG 11202006090R A SG11202006090R A SG 11202006090RA SG 11202006090R A SG11202006090R A SG 11202006090RA SG 11202006090R A SG11202006090R A SG 11202006090RA SG 11202006090R A SG11202006090R A SG 11202006090RA
Authority
SG
Singapore
Prior art keywords
device based
recognition method
voiceprint recognition
bottleneck feature
memory bottleneck
Prior art date
Application number
SG11202006090RA
Inventor
Zhiming Wang
Jun Zhou
Xiaolong Li
Original Assignee
Alibaba Group Holding Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Alibaba Group Holding Ltd filed Critical Alibaba Group Holding Ltd
Publication of SG11202006090RA publication Critical patent/SG11202006090RA/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification techniques
    • G10L17/02Preprocessing operations, e.g. segment selection; Pattern representation or modelling, e.g. based on linear discriminant analysis [LDA] or principal components; Feature selection or extraction
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F21/00Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F21/30Authentication, i.e. establishing the identity or authorisation of security principals
    • G06F21/31User authentication
    • G06F21/32User authentication using biometric data, e.g. fingerprints, iris scans or voiceprints
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/044Recurrent networks, e.g. Hopfield networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/06Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons
    • G06N3/063Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons using electronic means
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computing arrangements using knowledge-based models
    • G06N5/04Inference or reasoning models
    • G06N5/046Forward inferencing; Production systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification techniques
    • G10L17/18Artificial neural networks; Connectionist approaches
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/27Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique
    • G10L25/30Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique using neural networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • G06N20/10Machine learning using kernel methods, e.g. support vector machines [SVM]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N7/00Computing arrangements based on specific mathematical models
    • G06N7/01Probabilistic graphical models, e.g. probabilistic networks
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/24Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being the cepstrum

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Theoretical Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • Computational Linguistics (AREA)
  • Acoustics & Sound (AREA)
  • Human Computer Interaction (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Multimedia (AREA)
  • General Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Biomedical Technology (AREA)
  • Mathematical Physics (AREA)
  • Computing Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Biophysics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Molecular Biology (AREA)
  • General Health & Medical Sciences (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Signal Processing (AREA)
  • Computer Security & Cryptography (AREA)
  • Computer Hardware Design (AREA)
  • Neurology (AREA)
  • Image Analysis (AREA)
  • Telephonic Communication Services (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
SG11202006090RA 2018-02-12 2019-01-25 Voiceprint Recognition Method And Device Based On Memory Bottleneck Feature SG11202006090RA (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201810146310.2A CN108447490B (en) 2018-02-12 2018-02-12 Voiceprint recognition method and device based on memorability bottleneck characteristics
PCT/CN2019/073101 WO2019154107A1 (en) 2018-02-12 2019-01-25 Voiceprint recognition method and device based on memorability bottleneck feature

Publications (1)

Publication Number Publication Date
SG11202006090RA true SG11202006090RA (en) 2020-07-29

Family

ID=63192672

Family Applications (1)

Application Number Title Priority Date Filing Date
SG11202006090RA SG11202006090RA (en) 2018-02-12 2019-01-25 Voiceprint Recognition Method And Device Based On Memory Bottleneck Feature

Country Status (6)

Country Link
US (1) US20200321008A1 (en)
EP (2) EP3955246B1 (en)
CN (1) CN108447490B (en)
SG (1) SG11202006090RA (en)
TW (1) TW201935464A (en)
WO (1) WO2019154107A1 (en)

Families Citing this family (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108447490B (en) * 2018-02-12 2020-08-18 阿里巴巴集团控股有限公司 Voiceprint recognition method and device based on memorability bottleneck characteristics
KR102637339B1 (en) * 2018-08-31 2024-02-16 삼성전자주식회사 Method and apparatus of personalizing voice recognition model
CN109036467B (en) * 2018-10-26 2021-04-16 南京邮电大学 TF-LSTM-based CFFD extraction method, voice emotion recognition method and system
JP7024691B2 (en) * 2018-11-13 2022-02-24 日本電信電話株式会社 Non-verbal utterance detector, non-verbal utterance detection method, and program
US11315550B2 (en) * 2018-11-19 2022-04-26 Panasonic Intellectual Property Corporation Of America Speaker recognition device, speaker recognition method, and recording medium
CN109360553B (en) * 2018-11-20 2023-06-20 华南理工大学 Delay recurrent neural network for speech recognition
CN109754812A (en) * 2019-01-30 2019-05-14 华南理工大学 A kind of voiceprint authentication method of the anti-recording attack detecting based on convolutional neural networks
EP3948848B1 (en) * 2019-03-29 2023-07-19 Microsoft Technology Licensing, LLC Speaker diarization with early-stop clustering
KR102294638B1 (en) * 2019-04-01 2021-08-27 한양대학교 산학협력단 Combined learning method and apparatus using deepening neural network based feature enhancement and modified loss function for speaker recognition robust to noisy environments
CN112333545B (en) * 2019-07-31 2022-03-22 Tcl科技集团股份有限公司 Television content recommendation method, system, storage medium and smart television
CN110379412B (en) * 2019-09-05 2022-06-17 腾讯科技(深圳)有限公司 Voice processing method and device, electronic equipment and computer readable storage medium
CN111028847B (en) * 2019-12-17 2022-09-09 广东电网有限责任公司 Voiceprint recognition optimization method based on back-end model and related device
US11899765B2 (en) 2019-12-23 2024-02-13 Dts Inc. Dual-factor identification system and method with adaptive enrollment
CN111354364B (en) * 2020-04-23 2023-05-02 上海依图网络科技有限公司 Voiceprint recognition method and system based on RNN aggregation mode
CN111653270B (en) * 2020-08-05 2020-11-20 腾讯科技(深圳)有限公司 Voice processing method and device, computer readable storage medium and electronic equipment
CN112241467A (en) * 2020-12-18 2021-01-19 北京爱数智慧科技有限公司 Audio duplicate checking method and device
TWI790647B (en) * 2021-01-13 2023-01-21 神盾股份有限公司 Voice assistant system
CN112951256B (en) * 2021-01-25 2023-10-31 北京达佳互联信息技术有限公司 Voice processing method and device
CN112992126B (en) * 2021-04-22 2022-02-25 北京远鉴信息技术有限公司 Voice authenticity verification method and device, electronic equipment and readable storage medium
CN113284508B (en) * 2021-07-21 2021-11-09 中国科学院自动化研究所 Hierarchical differentiation based generated audio detection system
CN114333900B (en) * 2021-11-30 2023-09-05 南京硅基智能科技有限公司 Method for extracting BNF (BNF) characteristics end to end, network model, training method and training system
CN114882906A (en) * 2022-06-30 2022-08-09 广州伏羲智能科技有限公司 Novel environmental noise identification method and system
CN116072123B (en) * 2023-03-06 2023-06-23 南昌航天广信科技有限责任公司 Broadcast information playing method and device, readable storage medium and electronic equipment
CN117238320B (en) * 2023-11-16 2024-01-09 天津大学 Noise classification method based on multi-feature fusion convolutional neural network

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103971690A (en) * 2013-01-28 2014-08-06 腾讯科技(深圳)有限公司 Voiceprint recognition method and device
US9324320B1 (en) * 2014-10-02 2016-04-26 Microsoft Technology Licensing, Llc Neural network-based speech processing
CN105575394A (en) * 2016-01-04 2016-05-11 北京时代瑞朗科技有限公司 Voiceprint identification method based on global change space and deep learning hybrid modeling
CN107492382B (en) * 2016-06-13 2020-12-18 阿里巴巴集团控股有限公司 Voiceprint information extraction method and device based on neural network
US9824692B1 (en) * 2016-09-12 2017-11-21 Pindrop Security, Inc. End-to-end speaker recognition using deep neural network
CN106448684A (en) * 2016-11-16 2017-02-22 北京大学深圳研究生院 Deep-belief-network-characteristic-vector-based channel-robust voiceprint recognition system
CN107610707B (en) * 2016-12-15 2018-08-31 平安科技(深圳)有限公司 A kind of method for recognizing sound-groove and device
CN106875942B (en) * 2016-12-28 2021-01-22 中国科学院自动化研究所 Acoustic model self-adaption method based on accent bottleneck characteristics
CN106952644A (en) * 2017-02-24 2017-07-14 华南理工大学 A kind of complex audio segmentation clustering method based on bottleneck characteristic
CN108447490B (en) * 2018-02-12 2020-08-18 阿里巴巴集团控股有限公司 Voiceprint recognition method and device based on memorability bottleneck characteristics

Also Published As

Publication number Publication date
EP3719798B1 (en) 2022-09-21
CN108447490B (en) 2020-08-18
EP3719798A1 (en) 2020-10-07
TW201935464A (en) 2019-09-01
WO2019154107A1 (en) 2019-08-15
CN108447490A (en) 2018-08-24
EP3955246A1 (en) 2022-02-16
EP3955246B1 (en) 2023-03-29
US20200321008A1 (en) 2020-10-08
EP3719798A4 (en) 2021-03-24

Similar Documents

Publication Publication Date Title
SG11202006090RA (en) Voiceprint Recognition Method And Device Based On Memory Bottleneck Feature
SG11202011791SA (en) Pedestrian recognition method and device
EP3693887A4 (en) Fingerprint recognition apparatus and method, and electronic device
EP3657381A4 (en) Fingerprint recognition apparatus and method, and terminal device
SG11201912620YA (en) Voiceprint recognition method, device, terminal apparatus and storage medium
SG11202012520QA (en) Method and system for facilitating payment based on facial recognition
EP3416050A4 (en) Method and device for guiding fingerprint recognition
EP3623921A4 (en) Under-screen biometric recognition apparatus and electronic device
EP3786834A4 (en) Fingerprint recognition device and electronic apparatus
EP3757873A4 (en) Facial recognition method and device
EP3933693A4 (en) Object recognition method and device
PL3528157T3 (en) Fingerprint recognition-based application starting method and device
EP3391367A4 (en) Electronic device and speech recognition method thereof
EP3796208A4 (en) Fingerprint recognition apparatus and electronic device
EP3663905A4 (en) Information processing device, speech recognition system, and information processing method
EP3779667A4 (en) Speech recognition device, speech recognition device cooperation system, and speech recognition device cooperation method
EP3805982A4 (en) Gesture recognition method, apparatus and device
EP3594835A4 (en) Contactless multiple body part recognition method and multiple body part recognition device, using multiple biometric data
EP3594834A4 (en) Contactless multiple body part recognition method and multiple body part recognition device, using multiple biometric data
EP3547186A4 (en) Fingerprint recognition method and terminal device
EP3686769A4 (en) Fingerprint information acquisition method and fingerprint recognition device
EP3839806A4 (en) Facial recognition method, facial recognition system, and electronic device
EP3975172A4 (en) Voiceprint recognition method, and device
EP3869509A4 (en) Voice recognition device and method
EP3862895A4 (en) Biometric identification device, biometric identification method, and biometric identification program