MX2017001121A - Acoustic and domain based speech recognition for vehicles - Google Patents

Acoustic and domain based speech recognition for vehicles

Info

Publication number
MX2017001121A
Authority
MX
Mexico
Prior art keywords
acoustic
speech recognition
vehicles
domain based
based speech
Prior art date
Application number
MX2017001121A
Other languages
English (en)
Inventor
John Edward Huber
Scott Andrew Amman
Francois Charette
Gintaras Vincent Puskorius
Hassani Ali
Ji An
Rangarajan Ranjani
Frances Mora Richardson Brigitte
Original Assignee
Ford Global Tech Llc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ford Global Tech Llc
Publication of MX2017001121A

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/16 Speech classification or search using artificial neural networks
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/20 Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
    • B PERFORMING OPERATIONS; TRANSPORTING
    • B60 VEHICLES IN GENERAL
    • B60W CONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
    • B60W50/00 Details of control systems for road vehicle drive control not related to the control of a particular sub-unit, e.g. process diagnostic or vehicle driver interfaces
    • B60W50/08 Interaction between the driver and the control system
    • B60W50/10 Interpretation of driver requests or demands
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/01 Assessment or evaluation of speech recognition systems
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/14 Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/18 Speech classification or search using natural language modelling
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/18 Speech classification or search using natural language modelling
    • G10L15/183 Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • B PERFORMING OPERATIONS; TRANSPORTING
    • B60 VEHICLES IN GENERAL
    • B60W CONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
    • B60W2540/00 Input parameters relating to occupants
    • B60W2540/21 Voice
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/226 Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics

Landscapes

  • Engineering & Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Multimedia (AREA)
  • Artificial Intelligence (AREA)
  • Automation & Control Theory (AREA)
  • Probability & Statistics with Applications (AREA)
  • Transportation (AREA)
  • Mechanical Engineering (AREA)
  • Evolutionary Computation (AREA)
  • Signal Processing (AREA)
  • Navigation (AREA)
  • Traffic Control Systems (AREA)

Abstract

A processor of a vehicle speech recognition system recognizes speech via domain-specific language and acoustic models. The processor further, in response to the acoustic model having a confidence score for recognized speech that falls within a predetermined range defined relative to a confidence score for the domain-specific language model, recognizes speech via the acoustic model only.
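The selection rule in the abstract can be illustrated with a minimal Python sketch. The abstract does not say how the predetermined range is defined, so this sketch assumes a band given by lower and upper offsets around the domain-specific model's confidence score. The names `RelativeRange`, `select_models`, and the model labels are hypothetical and do not come from the patent.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class RelativeRange:
    """Hypothetical band defined relative to the domain model's confidence."""
    lower_offset: float  # assumed offset added to the domain confidence (lower bound)
    upper_offset: float  # assumed offset added to the domain confidence (upper bound)

    def contains(self, acoustic_conf: float, domain_conf: float) -> bool:
        # True when the acoustic confidence lies inside the band around domain_conf.
        low = domain_conf + self.lower_offset
        high = domain_conf + self.upper_offset
        return low <= acoustic_conf <= high


def select_models(acoustic_conf: float, domain_conf: float,
                  band: RelativeRange) -> Tuple[str, ...]:
    """Return the labels of the models used for recognition.

    If the acoustic confidence falls within the band defined relative to the
    domain-specific confidence, only the acoustic model is used; otherwise
    both models are used, as in the default case described in the abstract.
    """
    if band.contains(acoustic_conf, domain_conf):
        return ("acoustic",)
    return ("acoustic", "domain_language")


if __name__ == "__main__":
    band = RelativeRange(lower_offset=-0.25, upper_offset=0.25)
    print(select_models(0.625, 0.5, band))  # inside the assumed band
    print(select_models(0.875, 0.5, band))  # outside the assumed band
```

Whether the range is symmetric, one-sided, or expressed as a ratio is a design choice the abstract leaves open; the offsets here are placeholders only.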
MX2017001121A 2016-01-25 2017-01-24 Acoustic and domain based speech recognition for vehicles MX2017001121A (es)

Applications Claiming Priority (1)

Application Number Publication Number Priority Date Filing Date Title
US15/005,654 US10475447B2 (en) 2016-01-25 2016-01-25 Acoustic and domain based speech recognition for vehicles

Publications (1)

Publication Number Publication Date
MX2017001121A (es) 2018-07-23

Family

ID=58463100

Family Applications (1)

Application Number Publication Number Priority Date Filing Date Title
MX2017001121A MX2017001121A (es) 2016-01-25 2017-01-24 Acoustic and domain based speech recognition for vehicles

Country Status (6)

Country Link
US (1) US10475447B2 (es)
CN (1) CN107016995A (es)
DE (1) DE102017100232A1 (es)
GB (1) GB2548954A (es)
MX (1) MX2017001121A (es)
RU (1) RU2017100526A (es)

Families Citing this family (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
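CN105957516B (zh) * 2016-06-16 2019-03-08 Baidu Online Network Technology (Beijing) Co., Ltd. Multi-speech recognition model switching method and device
JP6597527B2 (ja) * 2016-09-06 2019-10-30 Toyota Motor Corporation Speech recognition device and speech recognition method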
US10535342B2 (en) * 2017-04-10 2020-01-14 Microsoft Technology Licensing, Llc Automatic learning of language models
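CN107437416B (zh) * 2017-05-23 2020-11-17 Advanced New Technologies Co., Ltd. Consulting service processing method and device based on speech recognition
CN107193973B (zh) * 2017-05-25 2021-07-20 Baidu Online Network Technology (Beijing) Co., Ltd. Domain recognition method and apparatus for semantic parsing information, device, and readable medium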
US11056104B2 (en) * 2017-05-26 2021-07-06 International Business Machines Corporation Closed captioning through language detection
US11043214B1 (en) * 2018-11-29 2021-06-22 Amazon Technologies, Inc. Speech recognition using dialog history
WO2020117586A1 (en) * 2018-12-03 2020-06-11 Google Llc Speech input processing
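KR20200072020A (ko) * 2018-12-12 2020-06-22 Hyundai Motor Company Dialogue guidance method of a speech recognition system
KR20200072021A (ko) * 2018-12-12 2020-06-22 Hyundai Motor Company Domain management method of a speech recognition system
CN110148416B (zh) * 2019-04-23 2024-03-15 Tencent Technology (Shenzhen) Co., Ltd. Speech recognition method, apparatus, device, and storage medium
DE102020200522A1 (de) 2020-01-17 2021-07-22 Volkswagen Aktiengesellschaft Method, computer program, and device for processing a speech input
CN111916089B (zh) * 2020-07-27 2022-11-04 Nanjing University of Information Science and Technology Hail detection method and device based on acoustic signal feature analysis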
US20230035752A1 (en) * 2021-07-30 2023-02-02 Nissan North America, Inc. Systems and methods for responding to audible commands and/or adjusting vehicle components based thereon
CN115472165A (zh) * 2022-07-07 2022-12-13 Lemon Inc. Method, apparatus, device, and storage medium for speech recognition
DE102022213191A1 (de) 2022-12-07 2024-06-13 Robert Bosch Gesellschaft mit beschränkter Haftung Method for parking or maneuvering assistance for a user of a vehicle, computer program, computing device, and vehicle

Family Cites Families (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6587824B1 (en) * 2000-05-04 2003-07-01 Visteon Global Technologies, Inc. Selective speaker adaptation for an in-vehicle speech recognition system
US7219058B1 (en) * 2000-10-13 2007-05-15 At&T Corp. System and method for processing speech recognition results
US7502737B2 (en) * 2002-06-24 2009-03-10 Intel Corporation Multi-pass recognition of spoken dialogue
JP4352790B2 (ja) 2002-10-31 2009-10-28 Seiko Epson Corporation Acoustic model creation method, speech recognition device, and vehicle having the speech recognition device
US7392188B2 (en) * 2003-07-31 2008-06-24 Telefonaktiebolaget Lm Ericsson (Publ) System and method enabling acoustic barge-in
KR100612839B1 (ko) * 2004-02-18 2006-08-18 Samsung Electronics Co., Ltd. Domain-based dialogue speech recognition method and apparatus
US7676363B2 (en) 2006-06-29 2010-03-09 General Motors Llc Automated speech recognition using normalized in-vehicle speech
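JP2008064885A (ja) * 2006-09-05 2008-03-21 Honda Motor Co Ltd Speech recognition device, speech recognition method, and speech recognition program
JP4188989B2 (ja) * 2006-09-15 2008-12-03 Honda Motor Co., Ltd. Speech recognition device, speech recognition method, and speech recognition program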
US20090030688A1 (en) 2007-03-07 2009-01-29 Cerra Joseph P Tagging speech recognition results based on an unstructured language model for use in a mobile communication facility application
JP4412504B2 (ja) * 2007-04-17 2010-02-10 Honda Motor Co., Ltd. Speech recognition device, speech recognition method, and program for speech recognition
US8396713B2 (en) * 2007-04-30 2013-03-12 Nuance Communications, Inc. Method and system for using a statistical language model and an action classifier in parallel with grammar for better handling of out-of-grammar utterances
US8407051B2 (en) * 2007-07-02 2013-03-26 Mitsubishi Electric Corporation Speech recognizing apparatus
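JP4990115B2 (ja) * 2007-12-06 2012-08-01 Denso Corporation Position range setting device, control method and control device for a device mounted on a moving object, and control method and control device for a vehicle air conditioner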
US8423362B2 (en) 2007-12-21 2013-04-16 General Motors Llc In-vehicle circumstantial speech recognition
US8438028B2 (en) * 2010-05-18 2013-05-07 General Motors Llc Nametag confusability determination
US9734826B2 (en) 2015-03-11 2017-08-15 Microsoft Technology Licensing, Llc Token-level interpolation for class-based language models

Also Published As

Publication number Publication date
CN107016995A (zh) 2017-08-04
US10475447B2 (en) 2019-11-12
US20170213551A1 (en) 2017-07-27
DE102017100232A1 (de) 2017-07-27
RU2017100526A (ru) 2018-07-12
GB2548954A (en) 2017-10-04
GB201701141D0 (en) 2017-03-08

Similar Documents

Publication Publication Date Title
MX2017001121A (es) Acoustic and domain based speech recognition for vehicles.
GB2551917A (en) Privacy-preserving training corpus selection
MX2015009812A (es) Method and system for voice command recognition.
MX2017000356A (es) System and method for activating functions via gesture recognition and voice command.
GB2566215A (en) Voice user interface
SG10201900178WA (en) Speech transaction processing
MX2017000938A (es) Dynamic switching of acoustic models to improve noisy speech recognition.
MX2017003754A (es) Gaze for spoken language understanding in multimodal conversational interactions.
WO2015057907A3 (en) System and method for learning alternate pronunciations for speech recognition
TW201612773A (en) Multi-command single utterance input method
GB2536836A (en) Voice command triggered speech enhancement
MX2016012195A (es) Flexible language model adaptation scheme.
MY179900A (en) Speech recognition method and speech recognition apparatus
EP4312147A3 (en) Scalable dynamic class language modeling
WO2008105263A1 (ja) Weight coefficient learning system and speech recognition system
EP3349125A4 (en) Language model generation device, language model generation method and program therefor, voice recognition device, and voice recognition method and program therefor
TWD184312S (zh) Speech recognition device
GB2540062A (en) Systems, apparatuses and methods for communication flow modification
WO2018118492A3 (en) Linguistic modeling using sets of base phonetics
WO2014005142A3 (en) Modeling l1-specific phonological errors
NZ700273A (en) Negative example (anti-word) based performance improvement for speech recognition
WO2015124259A8 (de) Method for acquiring at least two pieces of information to be acquired, having information content to be linked, by a speech dialogue device, speech dialogue device, and motor vehicle
MX2015014413A (es) Acoustic impulse response simulation.
MX2018001996A (es) Dynamic acoustic model for a vehicle.
WO2020117639A3 (en) Text independent speaker recognition