MX2021014721A - Sistemas y metodos para aprendizaje de maquina de atributos de voz. - Google Patents

Sistemas y metodos para aprendizaje de maquina de atributos de voz.

Info

Publication number
MX2021014721A
MX2021014721A MX2021014721A MX2021014721A MX2021014721A MX 2021014721 A MX2021014721 A MX 2021014721A MX 2021014721 A MX2021014721 A MX 2021014721A MX 2021014721 A MX2021014721 A MX 2021014721A MX 2021014721 A MX2021014721 A MX 2021014721A
Authority
MX
Mexico
Prior art keywords
systems
methods
machine learning
speaker
attributes
Prior art date
Application number
MX2021014721A
Other languages
English (en)
Inventor
Erik Edwards
Zilwa Shane De
Nicholas Irwin
Amir Poorjam
Flavio Avila
Keith L Lew
Christopher Sirota
Original Assignee
Insurance Services Office Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Insurance Services Office Inc filed Critical Insurance Services Office Inc
Publication of MX2021014721A publication Critical patent/MX2021014721A/es

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q40/00Finance; Insurance; Tax strategies; Processing of corporate or income taxes
    • G06Q40/08Insurance
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • G10L25/66Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination for extracting parameters related to health condition
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H50/00ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
    • G16H50/80ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for detecting, monitoring or modelling epidemics or pandemics, e.g. flu
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61BDIAGNOSIS; SURGERY; IDENTIFICATION
    • A61B5/00Measuring for diagnostic purposes; Identification of persons
    • A61B5/40Detecting, measuring or recording for evaluating the nervous system
    • A61B5/4076Diagnosing or monitoring particular conditions of the nervous system
    • A61B5/4082Diagnosing or monitoring movement diseases, e.g. Parkinson, Huntington or Tourette
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61BDIAGNOSIS; SURGERY; IDENTIFICATION
    • A61B5/00Measuring for diagnostic purposes; Identification of persons
    • A61B5/48Other medical applications
    • A61B5/4803Speech analysis specially adapted for diagnostic purposes
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/16Speech classification or search using artificial neural networks
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H50/00ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
    • G16H50/20ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for computer-aided diagnosis, e.g. based on medical expert systems
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H50/00ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
    • G16H50/30ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for calculating health indices; for individual health risk assessment
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • G06N20/20Ensemble learning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N7/00Computing arrangements based on specific mathematical models
    • G06N7/01Probabilistic graphical models, e.g. probabilistic networks
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification techniques
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/24Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being the cepstrum

Landscapes

  • Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • Public Health (AREA)
  • Medical Informatics (AREA)
  • Theoretical Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • Data Mining & Analysis (AREA)
  • Computational Linguistics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Acoustics & Sound (AREA)
  • Human Computer Interaction (AREA)
  • Multimedia (AREA)
  • General Physics & Mathematics (AREA)
  • Epidemiology (AREA)
  • Business, Economics & Management (AREA)
  • Evolutionary Computation (AREA)
  • Pathology (AREA)
  • Artificial Intelligence (AREA)
  • Software Systems (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Signal Processing (AREA)
  • Primary Health Care (AREA)
  • Databases & Information Systems (AREA)
  • Finance (AREA)
  • Accounting & Taxation (AREA)
  • Mathematical Physics (AREA)
  • General Engineering & Computer Science (AREA)
  • Computing Systems (AREA)
  • Molecular Biology (AREA)
  • Biophysics (AREA)
  • Technology Law (AREA)
  • Economics (AREA)
  • Development Economics (AREA)
  • Marketing (AREA)
  • Strategic Management (AREA)
  • General Business, Economics & Management (AREA)
  • Neurology (AREA)
  • Computer Vision & Pattern Recognition (AREA)

Abstract

Se proveen sistemas y métodos para aprendizaje automático de voz y otros atributos. El sistema recibe datos de entrada, aísla sonidos predeterminados del habla aislada de un hablante de interés, resume las características para generar variables que describen al hablante y genera un modelo predictivo para detectar una característica deseada de una persona. También se proveen sistemas y métodos para detectar uno o más atributos de un hablante en base al análisis de muestras de audio u otros tipos de información almacenada digitalmente (por ejemplo, videos, fotos, etc.).
MX2021014721A 2019-05-30 2020-06-01 Sistemas y metodos para aprendizaje de maquina de atributos de voz. MX2021014721A (es)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US201962854652P 2019-05-30 2019-05-30
US202062989485P 2020-03-13 2020-03-13
US202063018892P 2020-05-01 2020-05-01
PCT/US2020/035542 WO2020243701A1 (en) 2019-05-30 2020-06-01 Systems and methods for machine learning of voice attributes

Publications (1)

Publication Number Publication Date
MX2021014721A true MX2021014721A (es) 2022-04-06

Family

ID=73549497

Family Applications (1)

Application Number Title Priority Date Filing Date
MX2021014721A MX2021014721A (es) 2019-05-30 2020-06-01 Sistemas y metodos para aprendizaje de maquina de atributos de voz.

Country Status (12)

Country Link
US (2) US20200380957A1 (es)
EP (1) EP3976074A4 (es)
JP (1) JP2022534541A (es)
KR (1) KR20220024217A (es)
CN (1) CN114206361A (es)
AU (1) AU2020283065A1 (es)
BR (1) BR112021024196A2 (es)
CA (1) CA3142423A1 (es)
IL (1) IL288545A (es)
MX (1) MX2021014721A (es)
SG (1) SG11202113302UA (es)
WO (1) WO2020243701A1 (es)

Families Citing this family (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11315040B2 (en) * 2020-02-12 2022-04-26 Wipro Limited System and method for detecting instances of lie using Machine Learning model
US11329998B1 (en) 2020-08-31 2022-05-10 Secureauth Corporation Identification (ID) proofing and risk engine integration system and method
US20220093121A1 (en) * 2020-09-23 2022-03-24 Sruthi Kotlo Detecting Depression Using Machine Learning Models on Human Speech Samples
US11700250B2 (en) * 2020-10-14 2023-07-11 Paypal, Inc. Voice vector framework for authenticating user interactions
US11869641B2 (en) * 2020-12-11 2024-01-09 Aetna Inc. Systems and methods for determining whether an individual is sick based on machine learning algorithms and individualized data
US20220198140A1 (en) * 2020-12-21 2022-06-23 International Business Machines Corporation Live audio adjustment based on speaker attributes
EP4039187A1 (de) * 2021-02-05 2022-08-10 Siemens Aktiengesellschaft Computerimplementiertes verfahren und werkzeug sowie datenverarbeitungsgerät zum erkennen von oberen atemwegserkrankungen beim menschen
US11929078B2 (en) * 2021-02-23 2024-03-12 Intuit, Inc. Method and system for user voice identification using ensembled deep learning algorithms
US11094135B1 (en) 2021-03-05 2021-08-17 Flyreel, Inc. Automated measurement of interior spaces through guided modeling of dimensions
US20220293123A1 (en) * 2021-03-10 2022-09-15 Covid Cough, Inc. Systems and methods for authentication using sound-based vocalization analysis
EP4089682A1 (en) * 2021-05-12 2022-11-16 BIOTRONIK SE & Co. KG Medical support system and medical support method for patient treatment
US20240105208A1 (en) * 2022-09-19 2024-03-28 SubStrata Ltd. Automated classification of relative dominance based on reciprocal prosodic behaviour in an audio conversation

Family Cites Families (34)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4712242A (en) * 1983-04-13 1987-12-08 Texas Instruments Incorporated Speaker-independent word recognizer
US5768474A (en) * 1995-12-29 1998-06-16 International Business Machines Corporation Method and system for noise-robust speech processing with cochlea filters in an auditory model
US20080275349A1 (en) * 2007-05-02 2008-11-06 Earlysense Ltd. Monitoring, predicting and treating clinical episodes
US20120071777A1 (en) * 2009-09-18 2012-03-22 Macauslan Joel Cough Analysis
US8306814B2 (en) * 2010-05-11 2012-11-06 Nice-Systems Ltd. Method for speaker source classification
US20130158434A1 (en) * 2011-12-20 2013-06-20 Delta Electronics, Inc. Apparatus for voice assisted medical diagnosis
KR102081241B1 (ko) * 2012-03-29 2020-02-25 더 유니버서티 어브 퀸슬랜드 환자 소리들을 처리하기 위한 방법 및 장치
CN103546503B (zh) * 2012-07-10 2017-03-15 百度在线网络技术(北京)有限公司 基于语音的云社交系统、方法及云分析服务器
EP2713367B1 (en) * 2012-09-28 2016-11-09 Agnitio, S.L. Speaker recognition
US9579056B2 (en) * 2012-10-16 2017-02-28 University Of Florida Research Foundation, Incorporated Screening for neurological disease using speech articulation characteristics
US9460722B2 (en) * 2013-07-17 2016-10-04 Verint Systems Ltd. Blind diarization of recorded calls with arbitrary number of speakers
US9514753B2 (en) * 2013-11-04 2016-12-06 Google Inc. Speaker identification using hash-based indexing
US9318112B2 (en) * 2014-02-14 2016-04-19 Google Inc. Recognizing speech in the presence of additional audio
US9792899B2 (en) * 2014-07-15 2017-10-17 International Business Machines Corporation Dataset shift compensation in machine learning
EP3257043B1 (en) * 2015-02-11 2018-12-12 Bang & Olufsen A/S Speaker recognition in multimedia system
US10664572B2 (en) * 2015-08-06 2020-05-26 Microsoft Technology Licensing, Llc Recommendations for health benefit resources
US10127929B2 (en) * 2015-08-19 2018-11-13 Massachusetts Institute Of Technology Assessing disorders through speech and a computational model
EP3359023A4 (en) * 2015-10-08 2019-05-22 Cordio Medical Ltd. ASSESSMENT OF A PULMONARY SUFFERING BY LANGUAGE ANALYSIS
US10347270B2 (en) * 2016-03-18 2019-07-09 International Business Machines Corporation Denoising a signal
US10141009B2 (en) * 2016-06-28 2018-11-27 Pindrop Security, Inc. System and method for cluster-based audio event detection
CN106504773B (zh) * 2016-11-08 2023-08-01 上海贝生医疗设备有限公司 一种可穿戴装置及语音与活动监测系统
CN106782616A (zh) * 2016-12-28 2017-05-31 上海百芝龙网络科技有限公司 一种通过人声分析检测呼吸道的方法
WO2018148298A1 (en) * 2017-02-07 2018-08-16 Pindrop Security, Inc. Age compensation in biometric systems using time-interval, gender, and age
EP3580754A4 (en) * 2017-02-12 2020-12-16 Cardiokol Ltd. VERBAL PERIODIC SCREENING FOR HEART DISEASE
WO2018204934A1 (en) * 2017-05-05 2018-11-08 Canary Speech, LLC Selecting speech features for building models for detecting medical conditions
US10637898B2 (en) * 2017-05-24 2020-04-28 AffectLayer, Inc. Automatic speaker identification in calls
CN107705807B (zh) * 2017-08-24 2019-08-27 平安科技(深圳)有限公司 基于情绪识别的语音质检方法、装置、设备及存储介质
CN108053841A (zh) * 2017-10-23 2018-05-18 平安科技(深圳)有限公司 利用语音进行疾病预测的方法及应用服务器
GB2567826B (en) * 2017-10-24 2023-04-26 Cambridge Cognition Ltd System and method for assessing physiological state
US10825564B1 (en) * 2017-12-11 2020-11-03 State Farm Mutual Automobile Insurance Company Biometric characteristic application using audio/video analysis
CN109801634B (zh) * 2019-01-31 2021-05-18 北京声智科技有限公司 一种声纹特征的融合方法及装置
US11011188B2 (en) * 2019-03-12 2021-05-18 Cordio Medical Ltd. Diagnostic techniques based on speech-sample alignment
US11211053B2 (en) * 2019-05-23 2021-12-28 International Business Machines Corporation Systems and methods for automated generation of subtitles
WO2021123462A1 (es) * 2019-12-16 2021-06-24 Sigma Technologies, S.L. Método y sistema para estimar características de hablante sobre la marcha para hablante desconocido con alta precisión y baja latencia

Also Published As

Publication number Publication date
SG11202113302UA (en) 2021-12-30
AU2020283065A1 (en) 2022-01-06
US20200381130A1 (en) 2020-12-03
CN114206361A (zh) 2022-03-18
CA3142423A1 (en) 2020-12-03
EP3976074A4 (en) 2023-01-25
WO2020243701A1 (en) 2020-12-03
IL288545A (en) 2022-02-01
KR20220024217A (ko) 2022-03-03
EP3976074A1 (en) 2022-04-06
JP2022534541A (ja) 2022-08-01
BR112021024196A2 (pt) 2022-02-08
US20200380957A1 (en) 2020-12-03

Similar Documents

Publication Publication Date Title
MX2021014721A (es) Sistemas y metodos para aprendizaje de maquina de atributos de voz.
US10743107B1 (en) Synchronization of audio signals from distributed devices
SG10201707702YA (en) Collaborative Voice Controlled Devices
EP3963576B1 (en) Speaker attributed transcript generation
US11875796B2 (en) Audio-visual diarization to identify meeting attendees
EP4235645A3 (en) System and method for customizing smart home speech interfaces using personalized speech profiles
US11138980B2 (en) Processing overlapping speech from distributed devices
WO2020098828A3 (en) System and method for personalized speaker verification
MX364461B (es) Método y dispositivo para lograr el registro de audio objetivo y aparato electrónico.
GB2567339A (en) Speaker recognition
US10812921B1 (en) Audio stream processing for distributed device meeting
EP3751561A3 (en) Hotword recognition
EP4235648A3 (en) Language model biasing
DE602006018795D1 (de) Kompensation der variabilität zwischen sitzungen zur automatischen extraktion von informationen aus sprache
Kürby et al. Bag-of-Features Acoustic Event Detection for Sensor Networks.
US10065013B2 (en) Selective amplification of an acoustic signal
US20160189103A1 (en) Apparatus and method for automatically creating and recording minutes of meeting
WO2017027397A3 (en) Event detection for playback management in an audio device
MX2022001162A (es) Coordinacion de dispositivos de audio.
US11468895B2 (en) Distributed device meeting initiation
JP2019113636A (ja) 音声認識システム
US9466299B1 (en) Speech source classification
Basu et al. An overview of speaker diarization: Approaches, resources and challenges
KR20200089594A (ko) 무대 음향 시스템 및 무대 음향 제어 방법
JP2020177060A (ja) 音声認識システム、及び、音声認識方法