MX2021014721A - Sistemas y metodos para aprendizaje de maquina de atributos de voz. - Google Patents
Sistemas y metodos para aprendizaje de maquina de atributos de voz.Info
- Publication number
- MX2021014721A MX2021014721A MX2021014721A MX2021014721A MX2021014721A MX 2021014721 A MX2021014721 A MX 2021014721A MX 2021014721 A MX2021014721 A MX 2021014721A MX 2021014721 A MX2021014721 A MX 2021014721A MX 2021014721 A MX2021014721 A MX 2021014721A
- Authority
- MX
- Mexico
- Prior art keywords
- systems
- methods
- machine learning
- speaker
- attributes
- Prior art date
Links
- 238000000034 method Methods 0.000 title abstract 3
- 238000010801 machine learning Methods 0.000 title abstract 2
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q40/00—Finance; Insurance; Tax strategies; Processing of corporate or income taxes
- G06Q40/08—Insurance
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
- G10L25/66—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination for extracting parameters related to health condition
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H50/00—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
- G16H50/80—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for detecting, monitoring or modelling epidemics or pandemics, e.g. flu
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61B—DIAGNOSIS; SURGERY; IDENTIFICATION
- A61B5/00—Measuring for diagnostic purposes; Identification of persons
- A61B5/40—Detecting, measuring or recording for evaluating the nervous system
- A61B5/4076—Diagnosing or monitoring particular conditions of the nervous system
- A61B5/4082—Diagnosing or monitoring movement diseases, e.g. Parkinson, Huntington or Tourette
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61B—DIAGNOSIS; SURGERY; IDENTIFICATION
- A61B5/00—Measuring for diagnostic purposes; Identification of persons
- A61B5/48—Other medical applications
- A61B5/4803—Speech analysis specially adapted for diagnostic purposes
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/16—Speech classification or search using artificial neural networks
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H50/00—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
- G16H50/20—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for computer-aided diagnosis, e.g. based on medical expert systems
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H50/00—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
- G16H50/30—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for calculating health indices; for individual health risk assessment
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N20/00—Machine learning
- G06N20/20—Ensemble learning
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N7/00—Computing arrangements based on specific mathematical models
- G06N7/01—Probabilistic graphical models, e.g. probabilistic networks
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/24—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being the cepstrum
Landscapes
- Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Physics & Mathematics (AREA)
- Public Health (AREA)
- Medical Informatics (AREA)
- Theoretical Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Biomedical Technology (AREA)
- Data Mining & Analysis (AREA)
- Computational Linguistics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Acoustics & Sound (AREA)
- Human Computer Interaction (AREA)
- Multimedia (AREA)
- General Physics & Mathematics (AREA)
- Epidemiology (AREA)
- Business, Economics & Management (AREA)
- Evolutionary Computation (AREA)
- Pathology (AREA)
- Artificial Intelligence (AREA)
- Software Systems (AREA)
- Life Sciences & Earth Sciences (AREA)
- Signal Processing (AREA)
- Primary Health Care (AREA)
- Databases & Information Systems (AREA)
- Finance (AREA)
- Accounting & Taxation (AREA)
- Mathematical Physics (AREA)
- General Engineering & Computer Science (AREA)
- Computing Systems (AREA)
- Molecular Biology (AREA)
- Biophysics (AREA)
- Technology Law (AREA)
- Economics (AREA)
- Development Economics (AREA)
- Marketing (AREA)
- Strategic Management (AREA)
- General Business, Economics & Management (AREA)
- Neurology (AREA)
- Computer Vision & Pattern Recognition (AREA)
Abstract
Se proveen sistemas y métodos para aprendizaje automático de voz y otros atributos. El sistema recibe datos de entrada, aísla sonidos predeterminados del habla aislada de un hablante de interés, resume las características para generar variables que describen al hablante y genera un modelo predictivo para detectar una característica deseada de una persona. También se proveen sistemas y métodos para detectar uno o más atributos de un hablante en base al análisis de muestras de audio u otros tipos de información almacenada digitalmente (por ejemplo, videos, fotos, etc.).
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201962854652P | 2019-05-30 | 2019-05-30 | |
US202062989485P | 2020-03-13 | 2020-03-13 | |
US202063018892P | 2020-05-01 | 2020-05-01 | |
PCT/US2020/035542 WO2020243701A1 (en) | 2019-05-30 | 2020-06-01 | Systems and methods for machine learning of voice attributes |
Publications (1)
Publication Number | Publication Date |
---|---|
MX2021014721A true MX2021014721A (es) | 2022-04-06 |
Family
ID=73549497
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
MX2021014721A MX2021014721A (es) | 2019-05-30 | 2020-06-01 | Sistemas y metodos para aprendizaje de maquina de atributos de voz. |
Country Status (12)
Country | Link |
---|---|
US (2) | US20200380957A1 (es) |
EP (1) | EP3976074A4 (es) |
JP (1) | JP2022534541A (es) |
KR (1) | KR20220024217A (es) |
CN (1) | CN114206361A (es) |
AU (1) | AU2020283065A1 (es) |
BR (1) | BR112021024196A2 (es) |
CA (1) | CA3142423A1 (es) |
IL (1) | IL288545A (es) |
MX (1) | MX2021014721A (es) |
SG (1) | SG11202113302UA (es) |
WO (1) | WO2020243701A1 (es) |
Families Citing this family (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US11315040B2 (en) * | 2020-02-12 | 2022-04-26 | Wipro Limited | System and method for detecting instances of lie using Machine Learning model |
US11329998B1 (en) | 2020-08-31 | 2022-05-10 | Secureauth Corporation | Identification (ID) proofing and risk engine integration system and method |
US20220093121A1 (en) * | 2020-09-23 | 2022-03-24 | Sruthi Kotlo | Detecting Depression Using Machine Learning Models on Human Speech Samples |
US11700250B2 (en) * | 2020-10-14 | 2023-07-11 | Paypal, Inc. | Voice vector framework for authenticating user interactions |
US11869641B2 (en) * | 2020-12-11 | 2024-01-09 | Aetna Inc. | Systems and methods for determining whether an individual is sick based on machine learning algorithms and individualized data |
US20220198140A1 (en) * | 2020-12-21 | 2022-06-23 | International Business Machines Corporation | Live audio adjustment based on speaker attributes |
EP4039187A1 (de) * | 2021-02-05 | 2022-08-10 | Siemens Aktiengesellschaft | Computerimplementiertes verfahren und werkzeug sowie datenverarbeitungsgerät zum erkennen von oberen atemwegserkrankungen beim menschen |
US11929078B2 (en) * | 2021-02-23 | 2024-03-12 | Intuit, Inc. | Method and system for user voice identification using ensembled deep learning algorithms |
US11094135B1 (en) | 2021-03-05 | 2021-08-17 | Flyreel, Inc. | Automated measurement of interior spaces through guided modeling of dimensions |
US20220293123A1 (en) * | 2021-03-10 | 2022-09-15 | Covid Cough, Inc. | Systems and methods for authentication using sound-based vocalization analysis |
EP4089682A1 (en) * | 2021-05-12 | 2022-11-16 | BIOTRONIK SE & Co. KG | Medical support system and medical support method for patient treatment |
US20240105208A1 (en) * | 2022-09-19 | 2024-03-28 | SubStrata Ltd. | Automated classification of relative dominance based on reciprocal prosodic behaviour in an audio conversation |
Family Cites Families (34)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4712242A (en) * | 1983-04-13 | 1987-12-08 | Texas Instruments Incorporated | Speaker-independent word recognizer |
US5768474A (en) * | 1995-12-29 | 1998-06-16 | International Business Machines Corporation | Method and system for noise-robust speech processing with cochlea filters in an auditory model |
US20080275349A1 (en) * | 2007-05-02 | 2008-11-06 | Earlysense Ltd. | Monitoring, predicting and treating clinical episodes |
US20120071777A1 (en) * | 2009-09-18 | 2012-03-22 | Macauslan Joel | Cough Analysis |
US8306814B2 (en) * | 2010-05-11 | 2012-11-06 | Nice-Systems Ltd. | Method for speaker source classification |
US20130158434A1 (en) * | 2011-12-20 | 2013-06-20 | Delta Electronics, Inc. | Apparatus for voice assisted medical diagnosis |
KR102081241B1 (ko) * | 2012-03-29 | 2020-02-25 | 더 유니버서티 어브 퀸슬랜드 | 환자 소리들을 처리하기 위한 방법 및 장치 |
CN103546503B (zh) * | 2012-07-10 | 2017-03-15 | 百度在线网络技术(北京)有限公司 | 基于语音的云社交系统、方法及云分析服务器 |
EP2713367B1 (en) * | 2012-09-28 | 2016-11-09 | Agnitio, S.L. | Speaker recognition |
US9579056B2 (en) * | 2012-10-16 | 2017-02-28 | University Of Florida Research Foundation, Incorporated | Screening for neurological disease using speech articulation characteristics |
US9460722B2 (en) * | 2013-07-17 | 2016-10-04 | Verint Systems Ltd. | Blind diarization of recorded calls with arbitrary number of speakers |
US9514753B2 (en) * | 2013-11-04 | 2016-12-06 | Google Inc. | Speaker identification using hash-based indexing |
US9318112B2 (en) * | 2014-02-14 | 2016-04-19 | Google Inc. | Recognizing speech in the presence of additional audio |
US9792899B2 (en) * | 2014-07-15 | 2017-10-17 | International Business Machines Corporation | Dataset shift compensation in machine learning |
EP3257043B1 (en) * | 2015-02-11 | 2018-12-12 | Bang & Olufsen A/S | Speaker recognition in multimedia system |
US10664572B2 (en) * | 2015-08-06 | 2020-05-26 | Microsoft Technology Licensing, Llc | Recommendations for health benefit resources |
US10127929B2 (en) * | 2015-08-19 | 2018-11-13 | Massachusetts Institute Of Technology | Assessing disorders through speech and a computational model |
EP3359023A4 (en) * | 2015-10-08 | 2019-05-22 | Cordio Medical Ltd. | ASSESSMENT OF A PULMONARY SUFFERING BY LANGUAGE ANALYSIS |
US10347270B2 (en) * | 2016-03-18 | 2019-07-09 | International Business Machines Corporation | Denoising a signal |
US10141009B2 (en) * | 2016-06-28 | 2018-11-27 | Pindrop Security, Inc. | System and method for cluster-based audio event detection |
CN106504773B (zh) * | 2016-11-08 | 2023-08-01 | 上海贝生医疗设备有限公司 | 一种可穿戴装置及语音与活动监测系统 |
CN106782616A (zh) * | 2016-12-28 | 2017-05-31 | 上海百芝龙网络科技有限公司 | 一种通过人声分析检测呼吸道的方法 |
WO2018148298A1 (en) * | 2017-02-07 | 2018-08-16 | Pindrop Security, Inc. | Age compensation in biometric systems using time-interval, gender, and age |
EP3580754A4 (en) * | 2017-02-12 | 2020-12-16 | Cardiokol Ltd. | VERBAL PERIODIC SCREENING FOR HEART DISEASE |
WO2018204934A1 (en) * | 2017-05-05 | 2018-11-08 | Canary Speech, LLC | Selecting speech features for building models for detecting medical conditions |
US10637898B2 (en) * | 2017-05-24 | 2020-04-28 | AffectLayer, Inc. | Automatic speaker identification in calls |
CN107705807B (zh) * | 2017-08-24 | 2019-08-27 | 平安科技(深圳)有限公司 | 基于情绪识别的语音质检方法、装置、设备及存储介质 |
CN108053841A (zh) * | 2017-10-23 | 2018-05-18 | 平安科技(深圳)有限公司 | 利用语音进行疾病预测的方法及应用服务器 |
GB2567826B (en) * | 2017-10-24 | 2023-04-26 | Cambridge Cognition Ltd | System and method for assessing physiological state |
US10825564B1 (en) * | 2017-12-11 | 2020-11-03 | State Farm Mutual Automobile Insurance Company | Biometric characteristic application using audio/video analysis |
CN109801634B (zh) * | 2019-01-31 | 2021-05-18 | 北京声智科技有限公司 | 一种声纹特征的融合方法及装置 |
US11011188B2 (en) * | 2019-03-12 | 2021-05-18 | Cordio Medical Ltd. | Diagnostic techniques based on speech-sample alignment |
US11211053B2 (en) * | 2019-05-23 | 2021-12-28 | International Business Machines Corporation | Systems and methods for automated generation of subtitles |
WO2021123462A1 (es) * | 2019-12-16 | 2021-06-24 | Sigma Technologies, S.L. | Método y sistema para estimar características de hablante sobre la marcha para hablante desconocido con alta precisión y baja latencia |
-
2020
- 2020-06-01 SG SG11202113302UA patent/SG11202113302UA/en unknown
- 2020-06-01 JP JP2021571537A patent/JP2022534541A/ja active Pending
- 2020-06-01 EP EP20814546.6A patent/EP3976074A4/en active Pending
- 2020-06-01 CN CN202080055544.1A patent/CN114206361A/zh active Pending
- 2020-06-01 MX MX2021014721A patent/MX2021014721A/es unknown
- 2020-06-01 KR KR1020217043354A patent/KR20220024217A/ko unknown
- 2020-06-01 WO PCT/US2020/035542 patent/WO2020243701A1/en unknown
- 2020-06-01 AU AU2020283065A patent/AU2020283065A1/en not_active Abandoned
- 2020-06-01 US US16/889,307 patent/US20200380957A1/en active Pending
- 2020-06-01 CA CA3142423A patent/CA3142423A1/en not_active Abandoned
- 2020-06-01 US US16/889,326 patent/US20200381130A1/en active Pending
- 2020-06-01 BR BR112021024196A patent/BR112021024196A2/pt not_active Application Discontinuation
-
2021
- 2021-11-30 IL IL288545A patent/IL288545A/en unknown
Also Published As
Publication number | Publication date |
---|---|
SG11202113302UA (en) | 2021-12-30 |
AU2020283065A1 (en) | 2022-01-06 |
US20200381130A1 (en) | 2020-12-03 |
CN114206361A (zh) | 2022-03-18 |
CA3142423A1 (en) | 2020-12-03 |
EP3976074A4 (en) | 2023-01-25 |
WO2020243701A1 (en) | 2020-12-03 |
IL288545A (en) | 2022-02-01 |
KR20220024217A (ko) | 2022-03-03 |
EP3976074A1 (en) | 2022-04-06 |
JP2022534541A (ja) | 2022-08-01 |
BR112021024196A2 (pt) | 2022-02-08 |
US20200380957A1 (en) | 2020-12-03 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
MX2021014721A (es) | Sistemas y metodos para aprendizaje de maquina de atributos de voz. | |
US10743107B1 (en) | Synchronization of audio signals from distributed devices | |
SG10201707702YA (en) | Collaborative Voice Controlled Devices | |
EP3963576B1 (en) | Speaker attributed transcript generation | |
US11875796B2 (en) | Audio-visual diarization to identify meeting attendees | |
EP4235645A3 (en) | System and method for customizing smart home speech interfaces using personalized speech profiles | |
US11138980B2 (en) | Processing overlapping speech from distributed devices | |
WO2020098828A3 (en) | System and method for personalized speaker verification | |
MX364461B (es) | Método y dispositivo para lograr el registro de audio objetivo y aparato electrónico. | |
GB2567339A (en) | Speaker recognition | |
US10812921B1 (en) | Audio stream processing for distributed device meeting | |
EP3751561A3 (en) | Hotword recognition | |
EP4235648A3 (en) | Language model biasing | |
DE602006018795D1 (de) | Kompensation der variabilität zwischen sitzungen zur automatischen extraktion von informationen aus sprache | |
Kürby et al. | Bag-of-Features Acoustic Event Detection for Sensor Networks. | |
US10065013B2 (en) | Selective amplification of an acoustic signal | |
US20160189103A1 (en) | Apparatus and method for automatically creating and recording minutes of meeting | |
WO2017027397A3 (en) | Event detection for playback management in an audio device | |
MX2022001162A (es) | Coordinacion de dispositivos de audio. | |
US11468895B2 (en) | Distributed device meeting initiation | |
JP2019113636A (ja) | 音声認識システム | |
US9466299B1 (en) | Speech source classification | |
Basu et al. | An overview of speaker diarization: Approaches, resources and challenges | |
KR20200089594A (ko) | 무대 음향 시스템 및 무대 음향 제어 방법 | |
JP2020177060A (ja) | 音声認識システム、及び、音声認識方法 |