MX2017001121A - Reconocimiento del habla en base a acustica y a dominio para vehiculos. - Google Patents
Reconocimiento del habla en base a acustica y a dominio para vehiculos.Info
- Publication number
- MX2017001121A MX2017001121A MX2017001121A MX2017001121A MX2017001121A MX 2017001121 A MX2017001121 A MX 2017001121A MX 2017001121 A MX2017001121 A MX 2017001121A MX 2017001121 A MX2017001121 A MX 2017001121A MX 2017001121 A MX2017001121 A MX 2017001121A
- Authority
- MX
- Mexico
- Prior art keywords
- acoustic
- speech recognition
- vehicles
- domain based
- based speech
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/16—Speech classification or search using artificial neural networks
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/20—Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B60—VEHICLES IN GENERAL
- B60W—CONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
- B60W50/00—Details of control systems for road vehicle drive control not related to the control of a particular sub-unit, e.g. process diagnostic or vehicle driver interfaces
- B60W50/08—Interaction between the driver and the control system
- B60W50/10—Interpretation of driver requests or demands
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/01—Assessment or evaluation of speech recognition systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/14—Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B60—VEHICLES IN GENERAL
- B60W—CONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
- B60W2540/00—Input parameters relating to occupants
- B60W2540/21—Voice
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/226—Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics
Landscapes
- Engineering & Computer Science (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Multimedia (AREA)
- Artificial Intelligence (AREA)
- Automation & Control Theory (AREA)
- Probability & Statistics with Applications (AREA)
- Transportation (AREA)
- Mechanical Engineering (AREA)
- Evolutionary Computation (AREA)
- Signal Processing (AREA)
- Navigation (AREA)
- Traffic Control Systems (AREA)
Abstract
Un procesador de un sistema de reconocimiento del habla del vehículo reconoce el habla a través de modelos de lenguaje de dominioespecífico y acústicos. El procesador además, en respuesta a que el modelo acústico tenga una puntuación de confianza para el habla reconocido que cae dentro de un rango predeterminado definido con respecto a una puntuación de confianza para el modelo de lenguaje de dominio específico, reconoce el habla solo a través del modelo acústico.
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US15/005,654 US10475447B2 (en) | 2016-01-25 | 2016-01-25 | Acoustic and domain based speech recognition for vehicles |
Publications (1)
Publication Number | Publication Date |
---|---|
MX2017001121A true MX2017001121A (es) | 2018-07-23 |
Family
ID=58463100
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
MX2017001121A MX2017001121A (es) | 2016-01-25 | 2017-01-24 | Reconocimiento del habla en base a acustica y a dominio para vehiculos. |
Country Status (6)
Country | Link |
---|---|
US (1) | US10475447B2 (es) |
CN (1) | CN107016995A (es) |
DE (1) | DE102017100232A1 (es) |
GB (1) | GB2548954A (es) |
MX (1) | MX2017001121A (es) |
RU (1) | RU2017100526A (es) |
Families Citing this family (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105957516B (zh) * | 2016-06-16 | 2019-03-08 | 百度在线网络技术(北京)有限公司 | 多语音识别模型切换方法及装置 |
JP6597527B2 (ja) * | 2016-09-06 | 2019-10-30 | トヨタ自動車株式会社 | 音声認識装置および音声認識方法 |
US10535342B2 (en) * | 2017-04-10 | 2020-01-14 | Microsoft Technology Licensing, Llc | Automatic learning of language models |
CN107437416B (zh) * | 2017-05-23 | 2020-11-17 | 创新先进技术有限公司 | 一种基于语音识别的咨询业务处理方法及装置 |
CN107193973B (zh) * | 2017-05-25 | 2021-07-20 | 百度在线网络技术(北京)有限公司 | 语义解析信息的领域识别方法及装置、设备及可读介质 |
US11056104B2 (en) * | 2017-05-26 | 2021-07-06 | International Business Machines Corporation | Closed captioning through language detection |
US11043214B1 (en) * | 2018-11-29 | 2021-06-22 | Amazon Technologies, Inc. | Speech recognition using dialog history |
WO2020117586A1 (en) * | 2018-12-03 | 2020-06-11 | Google Llc | Speech input processing |
KR20200072020A (ko) * | 2018-12-12 | 2020-06-22 | 현대자동차주식회사 | 음성인식시스템의 대화 안내 방법 |
KR20200072021A (ko) * | 2018-12-12 | 2020-06-22 | 현대자동차주식회사 | 음성인식시스템의 도메인 관리 방법 |
CN110148416B (zh) * | 2019-04-23 | 2024-03-15 | 腾讯科技(深圳)有限公司 | 语音识别方法、装置、设备和存储介质 |
DE102020200522A1 (de) | 2020-01-17 | 2021-07-22 | Volkswagen Aktiengesellschaft | Verfahren, Computerprogramm und Vorrichtung zum Verarbeiten einer Spracheingabe |
CN111916089B (zh) * | 2020-07-27 | 2022-11-04 | 南京信息工程大学 | 基于声信号特征分析的冰雹检测方法和装置 |
US20230035752A1 (en) * | 2021-07-30 | 2023-02-02 | Nissan North America, Inc. | Systems and methods for responding to audible commands and/or adjusting vehicle components based thereon |
CN115472165A (zh) * | 2022-07-07 | 2022-12-13 | 脸萌有限公司 | 用于语音识别的方法、装置、设备和存储介质 |
DE102022213191A1 (de) | 2022-12-07 | 2024-06-13 | Robert Bosch Gesellschaft mit beschränkter Haftung | Verfahren zur Park- oder Manöverunterstützung eines Nutzers eines Fahrzeugs, Computerprogramm, Rechenvorrichtung und Fahrzeug |
Family Cites Families (17)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6587824B1 (en) * | 2000-05-04 | 2003-07-01 | Visteon Global Technologies, Inc. | Selective speaker adaptation for an in-vehicle speech recognition system |
US7219058B1 (en) * | 2000-10-13 | 2007-05-15 | At&T Corp. | System and method for processing speech recognition results |
US7502737B2 (en) * | 2002-06-24 | 2009-03-10 | Intel Corporation | Multi-pass recognition of spoken dialogue |
JP4352790B2 (ja) | 2002-10-31 | 2009-10-28 | セイコーエプソン株式会社 | 音響モデル作成方法および音声認識装置ならびに音声認識装置を有する乗り物 |
US7392188B2 (en) * | 2003-07-31 | 2008-06-24 | Telefonaktiebolaget Lm Ericsson (Publ) | System and method enabling acoustic barge-in |
KR100612839B1 (ko) * | 2004-02-18 | 2006-08-18 | 삼성전자주식회사 | 도메인 기반 대화 음성인식방법 및 장치 |
US7676363B2 (en) | 2006-06-29 | 2010-03-09 | General Motors Llc | Automated speech recognition using normalized in-vehicle speech |
JP2008064885A (ja) * | 2006-09-05 | 2008-03-21 | Honda Motor Co Ltd | 音声認識装置、音声認識方法、及び音声認識プログラム |
JP4188989B2 (ja) * | 2006-09-15 | 2008-12-03 | 本田技研工業株式会社 | 音声認識装置、音声認識方法、及び音声認識プログラム |
US20090030688A1 (en) | 2007-03-07 | 2009-01-29 | Cerra Joseph P | Tagging speech recognition results based on an unstructured language model for use in a mobile communication facility application |
JP4412504B2 (ja) * | 2007-04-17 | 2010-02-10 | 本田技研工業株式会社 | 音声認識装置、音声認識方法、及び音声認識用プログラム |
US8396713B2 (en) * | 2007-04-30 | 2013-03-12 | Nuance Communications, Inc. | Method and system for using a statistical language model and an action classifier in parallel with grammar for better handling of out-of-grammar utterances |
US8407051B2 (en) * | 2007-07-02 | 2013-03-26 | Mitsubishi Electric Corporation | Speech recognizing apparatus |
JP4990115B2 (ja) * | 2007-12-06 | 2012-08-01 | 株式会社デンソー | 位置範囲設定装置、移動物体搭載装置の制御方法および制御装置、ならびに車両用空調装置の制御方法および制御装置 |
US8423362B2 (en) | 2007-12-21 | 2013-04-16 | General Motors Llc | In-vehicle circumstantial speech recognition |
US8438028B2 (en) * | 2010-05-18 | 2013-05-07 | General Motors Llc | Nametag confusability determination |
US9734826B2 (en) | 2015-03-11 | 2017-08-15 | Microsoft Technology Licensing, Llc | Token-level interpolation for class-based language models |
-
2016
- 2016-01-25 US US15/005,654 patent/US10475447B2/en not_active Expired - Fee Related
-
2017
- 2017-01-09 DE DE102017100232.4A patent/DE102017100232A1/de not_active Withdrawn
- 2017-01-11 RU RU2017100526A patent/RU2017100526A/ru not_active Application Discontinuation
- 2017-01-23 GB GB1701141.2A patent/GB2548954A/en not_active Withdrawn
- 2017-01-24 MX MX2017001121A patent/MX2017001121A/es unknown
- 2017-01-25 CN CN201710055930.0A patent/CN107016995A/zh active Pending
Also Published As
Publication number | Publication date |
---|---|
CN107016995A (zh) | 2017-08-04 |
US10475447B2 (en) | 2019-11-12 |
US20170213551A1 (en) | 2017-07-27 |
DE102017100232A1 (de) | 2017-07-27 |
RU2017100526A (ru) | 2018-07-12 |
GB2548954A (en) | 2017-10-04 |
GB201701141D0 (en) | 2017-03-08 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
MX2017001121A (es) | Reconocimiento del habla en base a acustica y a dominio para vehiculos. | |
GB2551917A (en) | Privacy-preserving training corpus selection | |
MX2015009812A (es) | Metodo y sistema para el reconicimiento de comandos de voz. | |
MX2017000356A (es) | Sistema y metodo para activacion de funciones mediante reconocimiento de gestos y comando de voz. | |
GB2566215A (en) | Voice user interface | |
SG10201900178WA (en) | Speech transaction processing | |
MX2017000938A (es) | Conmutacion dinamica de modelos acusticos para mejorar el reconocimiento del habla ruidosa. | |
MX2017003754A (es) | Mirada para entendimiento de lenguaje por voz en interacciones de conversacion multimodal. | |
WO2015057907A3 (en) | System and method for learning alternate pronunciations for speech recognition | |
TW201612773A (en) | Multi-command single utterance input method | |
GB2536836A (en) | Voice command triggered speech enhancement | |
MX2016012195A (es) | Esquema flexible de adaptacion de modelo de lenguaje. | |
MY179900A (en) | Speech recognition method and speech recognition apparatus | |
EP4312147A3 (en) | Scalable dynamic class language modeling | |
WO2008105263A1 (ja) | 重み係数学習システム及び音声認識システム | |
EP3349125A4 (en) | Language model generation device, language model generation method and program therefor, voice recognition device, and voice recognition method and program therefor | |
TWD184312S (zh) | 語音辨識裝置 | |
GB2540062A (en) | Systems, apparatuses and methods for communication flow modification | |
WO2018118492A3 (en) | Linguistic modeling using sets of base phonetics | |
WO2014005142A3 (en) | Modeling l1-specific phonological errors | |
NZ700273A (en) | Negative example (anti-word) based performance improvement for speech recognition | |
WO2015124259A8 (de) | Verfahren zur erfassung wenigstens zweier zu erfassender informationen mit zu verknüpfendem informationsgehalt durch eine sprachdialogeinrichtung, sprachdialogeinrichtung und kraftfahrzeug | |
MX2015014413A (es) | Simulacion de respuesta al impulso acustico. | |
MX2018001996A (es) | Modelo acustico dinamico para un vehículo. | |
WO2020117639A3 (en) | Text independent speaker recognition |