SG11202002083WA - Method and apparatus for establishing voiceprint model, computer device, and storage medium - Google Patents
Method and apparatus for establishing voiceprint model, computer device, and storage medium
- Publication number
- SG11202002083WA
- Authority
- SG
- Singapore
- Prior art keywords
- establishing
- storage medium
- computer device
- voiceprint model
- voiceprint
- Prior art date
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
- G10L17/04—Training, enrolment or model building
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
- G06F17/18—Complex mathematical operations for evaluating statistical data, e.g. average values, frequency distributions, probability functions, regression analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/23—Clustering techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/048—Activation functions
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/16—Speech classification or search using artificial neural networks
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
- G10L17/02—Preprocessing operations, e.g. segment selection; Pattern representation or modelling, e.g. based on linear discriminant analysis [LDA] or principal components; Feature selection or extraction
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
- G10L17/18—Artificial neural networks; Connectionist approaches
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
- G10L17/20—Pattern transformations or operations aimed at increasing system robustness, e.g. against channel noise or different working conditions
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- Acoustics & Sound (AREA)
- Human Computer Interaction (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Multimedia (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- General Physics & Mathematics (AREA)
- Evolutionary Computation (AREA)
- Artificial Intelligence (AREA)
- Mathematical Physics (AREA)
- Computational Linguistics (AREA)
- General Engineering & Computer Science (AREA)
- Life Sciences & Earth Sciences (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Software Systems (AREA)
- Molecular Biology (AREA)
- Computing Systems (AREA)
- Biophysics (AREA)
- Biomedical Technology (AREA)
- General Health & Medical Sciences (AREA)
- Mathematical Analysis (AREA)
- Pure & Applied Mathematics (AREA)
- Mathematical Optimization (AREA)
- Computational Mathematics (AREA)
- Evolutionary Biology (AREA)
- Bioinformatics & Computational Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Operations Research (AREA)
- Probability & Statistics with Applications (AREA)
- Algebra (AREA)
- Databases & Information Systems (AREA)
- Electrically Operated Instructional Devices (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Telephonic Communication Services (AREA)
- Machine Translation (AREA)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
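CN201810433792.XA CN108806696B (zh) | 2018-05-08 | 2018-05-08 | Method and apparatus for establishing voiceprint model, computer device, and storage medium |
PCT/CN2018/094888 WO2019214047A1 (zh) | 2018-05-08 | 2018-07-06 | Method and apparatus for establishing voiceprint model, computer device, and storage medium |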
Publications (1)
Publication Number | Publication Date |
---|---|
SG11202002083WA (en) | 2020-04-29 |
Family
ID=64092054
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
SG11202002083WA (en) | 2018-05-08 | 2018-07-06 | Method and apparatus for establishing voiceprint model, computer device, and storage medium |
Country Status (5)
Country | Link |
---|---|
US (1) | US11322155B2 (en) |
JP (1) | JP6906067B2 (ja) |
CN (1) | CN108806696B (zh) |
SG (1) | SG11202002083WA (en) |
WO (1) | WO2019214047A1 (zh) |
Families Citing this family (23)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
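CN110246503A (zh) * | 2019-05-20 | 2019-09-17 | 平安科技(深圳)有限公司 | Blacklist voiceprint library construction method and apparatus, computer device, and storage medium |
CN110265040B (zh) * | 2019-06-20 | 2022-05-17 | Oppo广东移动通信有限公司 | Voiceprint model training method and apparatus, storage medium, and electronic device |
CN110211569A (zh) * | 2019-07-09 | 2019-09-06 | 浙江百应科技有限公司 | Real-time gender recognition method based on voice spectrogram and deep learning |
CN110428853A (zh) * | 2019-08-30 | 2019-11-08 | 北京太极华保科技股份有限公司 | Voice activity detection method, voice activity detection apparatus, and electronic device |
CN110491393B (zh) * | 2019-08-30 | 2022-04-22 | 科大讯飞股份有限公司 | Training method for voiceprint representation model and related apparatus |
CN110600040B (zh) * | 2019-09-19 | 2021-05-25 | 北京三快在线科技有限公司 | Voiceprint feature registration method and apparatus, computer device, and storage medium |
CN110780741B (zh) * | 2019-10-28 | 2022-03-01 | Oppo广东移动通信有限公司 | Model training method, application running method, apparatus, medium, and electronic device |
CN111292510A (zh) * | 2020-01-16 | 2020-06-16 | 广州华铭电力科技有限公司 | Identification and early-warning method for urban cables damaged by external force |
CN113409793B (zh) * | 2020-02-28 | 2024-05-17 | 阿里巴巴集团控股有限公司 | Speech recognition method, smart home system, conference device, and computing device |
CN111414511B (zh) * | 2020-03-25 | 2023-08-22 | 合肥讯飞数码科技有限公司 | Automatic voiceprint modeling and warehousing method, apparatus, and device |
IL274741B1 (en) * | 2020-05-18 | 2024-07-01 | Cognyte Tech Israel Ltd | A system and method for obtaining voiceprints for large populations |
CN113948089B (zh) * | 2020-06-30 | 2024-06-14 | 北京猎户星空科技有限公司 | Voiceprint model training and voiceprint recognition method, apparatus, device, and medium |
TWI807203B (zh) * | 2020-07-28 | 2023-07-01 | 華碩電腦股份有限公司 | Voice recognition method and electronic device using the same |
CN112466311B (zh) * | 2020-12-22 | 2022-08-19 | 深圳壹账通智能科技有限公司 | Voiceprint recognition method and apparatus, storage medium, and computer device |
CN112637428A (zh) * | 2020-12-29 | 2021-04-09 | 平安科技(深圳)有限公司 | Invalid call determination method and apparatus, computer device, and storage medium |
CN113011302B (zh) * | 2021-03-11 | 2022-04-01 | 国网电力科学研究院武汉南瑞有限责任公司 | Thunder signal recognition system and method based on convolutional neural network |
CN113179442B (zh) * | 2021-04-20 | 2022-04-29 | 浙江工业大学 | Method for replacing audio stream in video based on speech recognition |
CN113077536B (zh) * | 2021-04-20 | 2024-05-28 | 深圳追一科技有限公司 | BERT model-based mouth action driving model training method and component |
CN113421575B (zh) * | 2021-06-30 | 2024-02-06 | 平安科技(深圳)有限公司 | Voiceprint recognition method, apparatus, device, and storage medium |
CN114113837B (zh) * | 2021-11-15 | 2024-04-30 | 国网辽宁省电力有限公司朝阳供电公司 | Acoustic feature-based live detection method and system for transformers |
CN114495948B (zh) * | 2022-04-18 | 2022-09-09 | 北京快联科技有限公司 | Voiceprint recognition method and apparatus |
CN115831152B (zh) * | 2022-11-28 | 2023-07-04 | 国网山东省电力公司应急管理中心 | Sound monitoring apparatus and method for real-time monitoring of the operating state of emergency equipment generators |
CN118155463B (zh) * | 2024-05-10 | 2024-07-19 | 兰州大学 | Computer-aided learning method and apparatus for Chinese pronunciation of hearing-impaired people in noisy environments |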
Family Cites Families (22)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8645137B2 (en) * | 2000-03-16 | 2014-02-04 | Apple Inc. | Fast, language-independent method for user authentication by voice |
KR100679051B1 (ko) * | 2005-12-14 | 2007-02-05 | 삼성전자주식회사 | Speech recognition apparatus and method using a plurality of confidence measurement algorithms |
US11074495B2 (en) * | 2013-02-28 | 2021-07-27 | Z Advanced Computing, Inc. (Zac) | System and method for extremely efficient image and pattern recognition and artificial intelligence platform |
CN104485102A (zh) * | 2014-12-23 | 2015-04-01 | 智慧眼(湖南)科技发展有限公司 | Voiceprint recognition method and apparatus |
CN106157959B (zh) * | 2015-03-31 | 2019-10-18 | 讯飞智元信息科技有限公司 | Voiceprint model updating method and system |
US10884503B2 (en) * | 2015-12-07 | 2021-01-05 | Sri International | VPA with integrated object recognition and facial expression recognition |
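CN105513597B (zh) * | 2015-12-30 | 2018-07-10 | 百度在线网络技术(北京)有限公司 | Voiceprint authentication processing method and apparatus |
CN105845140A (zh) * | 2016-03-23 | 2016-08-10 | 广州势必可赢网络科技有限公司 | Speaker verification method and apparatus applied under short-utterance conditions |
CN107492382B (zh) * | 2016-06-13 | 2020-12-18 | 阿里巴巴集团控股有限公司 | Neural network-based voiceprint information extraction method and apparatus |
CN106448684A (zh) * | 2016-11-16 | 2017-02-22 | 北京大学深圳研究生院 | Channel-robust voiceprint recognition system based on deep belief network feature vectors |
CN106847292B (zh) * | 2017-02-16 | 2018-06-19 | 平安科技(深圳)有限公司 | Voiceprint recognition method and apparatus |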
EP3607495A4 (en) * | 2017-04-07 | 2020-11-25 | Intel Corporation | METHODS AND SYSTEMS USING IMPROVED TRAINING AND LEARNING FOR DEEP NEURAL NETWORKS |
EP3607741A4 (en) * | 2017-04-07 | 2020-12-09 | INTEL Corporation | METHODS AND SYSTEMS USING CAMERA DEVICES FOR DEEP NEURAL CHANNEL AND CONVOLVING NETWORK IMAGES AND FORMATS |
US10896669B2 (en) * | 2017-05-19 | 2021-01-19 | Baidu Usa Llc | Systems and methods for multi-speaker neural text-to-speech |
US20180358003A1 (en) * | 2017-06-09 | 2018-12-13 | Qualcomm Incorporated | Methods and apparatus for improving speech communication and speech interface quality using neural networks |
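CN107357875B (zh) * | 2017-07-04 | 2021-09-10 | 北京奇艺世纪科技有限公司 | Voice search method and apparatus, and electronic device |
CN107680582B (zh) * | 2017-07-28 | 2021-03-26 | 平安科技(深圳)有限公司 | Acoustic model training method, speech recognition method, apparatus, device, and medium |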
US11055604B2 (en) * | 2017-09-12 | 2021-07-06 | Intel Corporation | Per kernel Kmeans compression for neural networks |
CN107993071A (zh) * | 2017-11-21 | 2018-05-04 | 平安科技(深圳)有限公司 | Electronic apparatus, voiceprint-based identity verification method, and storage medium |
US11264037B2 (en) * | 2018-01-23 | 2022-03-01 | Cirrus Logic, Inc. | Speaker identification |
US10437936B2 (en) * | 2018-02-01 | 2019-10-08 | Jungle Disk, L.L.C. | Generative text using a personality model |
WO2020035085A2 (en) * | 2019-10-31 | 2020-02-20 | Alipay (Hangzhou) Information Technology Co., Ltd. | System and method for determining voice characteristics |
2018
- 2018-05-08 CN application CN201810433792.XA, published as CN108806696B (zh), status: Active
- 2018-07-06 US application US16/759,384, published as US11322155B2 (en), status: Active
- 2018-07-06 WO application PCT/CN2018/094888, published as WO2019214047A1 (zh), status: Application Filing
- 2018-07-06 JP application JP2019570559A, published as JP6906067B2 (ja), status: Active
- 2018-07-06 SG application SG11202002083WA, published as SG11202002083WA (en), status: unknown
Also Published As
Publication number | Publication date |
---|---|
JP6906067B2 (ja) | 2021-07-21 |
CN108806696A (zh) | 2018-11-13 |
US11322155B2 (en) | 2022-05-03 |
JP2020524308A (ja) | 2020-08-13 |
CN108806696B (zh) | 2020-06-05 |
WO2019214047A1 (zh) | 2019-11-14 |
US20200294509A1 (en) | 2020-09-17 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
SG11202002083WA (en) | Method and apparatus for establishing voiceprint model, computer device, and storage medium | |
SG11202110565RA (en) | Face recognition method and apparatus, electronic device, and storage medium | |
SG11202002078UA (en) | Method and apparatus for training semantic segmentation model, computer device, and storage medium | |
EP3805988A4 (en) | TRAINING PROCESS FOR MODEL, STORAGE MEDIA AND COMPUTER DEVICE | |
SG11202006192YA (en) | Face recognition method and apparatus, electronic device, and storage medium | |
SG11202008322UA (en) | Neural network model training method and apparatus, computer device, and storage medium | |
SG11202002740SA (en) | Face pose analysis method and apparatus, device, storage medium, and program | |
EP3968222A4 (en) | Classification task model training method, apparatus and device and storage medium | |
SG11202100004XA (en) | Machine learning process implementation method and apparatus, device, and storage medium | |
SG11202011156YA (en) | Digital certificate verification method and apparatus, computer device, and storage medium | |
SG11202101217YA (en) | Communication connection method and apparatus, computer device, and storage medium | |
EP3611657A4 (en) | MODEL TRAINING METHOD AND METHOD, DEVICE AND DEVICE FOR DETERMINING DATA SIMILARITY | |
SG11202103527XA (en) | Interactive plot implementation method, device, computer apparatus, and storage medium | |
EP3648099A4 (en) | VOICE RECOGNITION METHOD, DEVICE, DEVICE AND STORAGE MEDIUM | |
SG11201913916QA (en) | Question data generation method and apparatus, computer device, and storage medium | |
EP3848730A4 (en) | POSITIONING PROCESS, APPARATUS AND DEVICE, AND COMPUTER READABLE STORAGE MEDIA | |
SG11202010921PA (en) | Election method and apparatus for representative node device, computer device, and storage medium | |
SG11202004541WA (en) | Chatbot configuration method and apparatus, computer device, and storage medium | |
EP3605537A4 (en) | LANGUAGE MOTION DETECTION METHOD AND DEVICE, COMPUTER DEVICE AND STORAGE MEDIUM | |
SG11202107392TA (en) | Application starting method and apparatus, computer device and storage medium | |
EP3992846A4 (en) | ACTION RECOGNITION METHOD AND APPARATUS, COMPUTER STORAGE MEDIUM AND COMPUTER DEVICE | |
EP3961441A4 (en) | IDENTITY VERIFICATION METHOD AND APPARATUS, COMPUTER DEVICE AND STORAGE MEDIA | |
EP3859685A4 (en) | METHOD AND DEVICE FOR CREATING A THREE-DIMENSIONAL MODEL, DEVICE AND STORAGE MEDIUM | |
EP3739447A4 (en) | METHOD OF EXECUTING A PROGRAM, DEVICE, COMPUTER DEVICE, AND STORAGE MEDIUM | |
SG11202103326QA (en) | Video cutting method and apparatus, computer device and storage medium |