MX359330B - Systems and methods for performing automatic speech recognition (ASR) in the presence of heterographs

Systems and methods for performing automatic speech recognition (ASR) in the presence of heterographs.

Info

Publication number
MX359330B
Authority
MX
Mexico
Prior art keywords
heterographs
systems
methods
word
words
Prior art date
Application number
MX2016017394A
Other languages
English (en)
Other versions
MX2016017394A (es)
Inventor
Barve Rakesh
Agarwal Akshat
Original Assignee
Rovi Guides Inc
Priority date
2014-07-31
Filing date
2015-07-29
Publication date
2018-09-25
Application filed by Rovi Guides Inc
Publication of MX2016017394A
Publication of MX359330B

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/18 Speech classification or search using natural language modelling
    • G10L15/183 Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • G10L15/187 Phonemic context, e.g. pronunciation rules, phonotactical constraints or phoneme n-grams
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/30 Semantic analysis
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/18 Speech classification or search using natural language modelling
    • G10L15/183 Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • G10L15/19 Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules
    • G10L15/193 Formal grammars, e.g. finite state automata, context free grammars or word networks
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/18 Speech classification or search using natural language modelling
    • G10L15/1815 Semantic context, e.g. disambiguation of the recognition hypotheses based on word meaning

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Multimedia (AREA)
  • Acoustics & Sound (AREA)
  • Human Computer Interaction (AREA)
  • Theoretical Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • User Interface Of Digital Computer (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
  • Information Transfer Between Computers (AREA)
  • Machine Translation (AREA)
  • Steroid Compounds (AREA)

Abstract

Systems and methods for performing automatic speech recognition (ASR) in the presence of heterographs are provided. A verbal input that includes a plurality of utterances is received from the user. A first of the plurality of utterances is matched to a first word. It is then determined that a second utterance in the plurality of utterances matches a plurality of words that are in the same heterograph set. Next, it is identified which one of the plurality of words is associated with a context of the first word. Finally, a function is performed based on the first word and the identified one of the plurality of words.
MX2016017394A 2014-07-31 2015-07-29 Systems and methods for performing automatic speech recognition (ASR) in the presence of heterographs. MX359330B (es)
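
The steps in the abstract can be illustrated with a minimal Python sketch. This is not the patented implementation: the phoneme strings, the `LEXICON` and `CONTEXT` tables, and the overlap-count scoring are all hypothetical choices made only to show the idea of picking, from a heterograph set, the spelling that fits the context of an already recognized word.

```python
from typing import Dict, FrozenSet, List, Set

# Hypothetical pronunciation lexicon: phoneme string -> every spelling that
# shares that pronunciation. An entry with more than one spelling is a
# heterograph set.
LEXICON: Dict[str, FrozenSet[str]] = {
    "N AY T": frozenset({"knight", "night"}),
    "R AY D ER": frozenset({"rider"}),
    "L EY T": frozenset({"late"}),
    "W AA CH": frozenset({"watch"}),
}

# Hypothetical context table: word -> words it is commonly associated with.
CONTEXT: Dict[str, Set[str]] = {
    "knight": {"rider", "armor", "castle"},
    "night": {"late", "show", "tonight"},
}


def resolve(utterances: List[str]) -> List[str]:
    """Map each utterance to one word, using context to split heterographs."""
    # Look up the candidate spellings of every utterance
    # (an unknown utterance raises KeyError in this sketch).
    candidates = [LEXICON[u] for u in utterances]

    # Words with a single candidate are unambiguous and form the context.
    known = {next(iter(c)) for c in candidates if len(c) == 1}

    words = []
    for cands in candidates:
        if len(cands) == 1:
            words.append(next(iter(cands)))
        else:
            # Assumed scoring: count how many context words of each candidate
            # were already recognized. Ties fall back to alphabetical order.
            best = max(sorted(cands),
                       key=lambda w: len(CONTEXT.get(w, set()) & known))
            words.append(best)
    return words


if __name__ == "__main__":
    print(resolve(["N AY T", "R AY D ER"]))
    print(resolve(["L EY T", "N AY T"]))
```

In this sketch the same utterance "N AY T" resolves to "knight" next to "rider" and to "night" next to "late". A real system would pass the resolved words to whatever function follows, such as a media search, which is outside the scope of this example.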

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US14/448,308 US9721564B2 (en) 2014-07-31 2014-07-31 Systems and methods for performing ASR in the presence of heterographs
PCT/US2015/042584 WO2016018981A1 (en) 2014-07-31 2015-07-29 Systems and methods for performing asr in the presence of heterographs

Publications (2)

Publication Number Publication Date
MX2016017394A MX2016017394A (es) 2017-04-27
MX359330B MX359330B (es) 2018-09-25

Family

ID=53784025

Family Applications (1)

Application Number Title Priority Date Filing Date
MX2016017394A MX359330B (es) 2014-07-31 2015-07-29 Systems and methods for performing automatic speech recognition (ASR) in the presence of heterographs.

Country Status (13)

Country Link
US (1) US9721564B2 (es)
EP (2) EP3364408B1 (es)
JP (1) JP6684231B2 (es)
KR (3) KR20230130761A (es)
CN (1) CN106471571A (es)
AU (1) AU2015296597A1 (es)
CA (2) CA2954197C (es)
DK (1) DK3175442T3 (es)
ES (1) ES2675302T3 (es)
GB (1) GB2530871B (es)
MX (1) MX359330B (es)
PT (2) PT3175442T (es)
WO (1) WO2016018981A1 (es)

Families Citing this family (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11023541B2 (en) 2014-12-30 2021-06-01 Rovi Guides, Inc. Methods and systems for providing media recommendations based on user location
US9854049B2 (en) 2015-01-30 2017-12-26 Rovi Guides, Inc. Systems and methods for resolving ambiguous terms in social chatter based on a user profile
US10628009B2 (en) 2015-06-26 2020-04-21 Rovi Guides, Inc. Systems and methods for automatic formatting of images for media assets based on user profile
US9576578B1 (en) * 2015-08-12 2017-02-21 Google Inc. Contextual improvement of voice query recognition
US10133735B2 (en) 2016-02-29 2018-11-20 Rovi Guides, Inc. Systems and methods for training a model to determine whether a query with multiple segments comprises multiple distinct commands or a combined command
US10031967B2 (en) 2016-02-29 2018-07-24 Rovi Guides, Inc. Systems and methods for using a trained model for determining whether a query comprising multiple segments relates to an individual query or several queries
US20170272825A1 (en) 2016-03-16 2017-09-21 Rovi Guides, Inc. System and method for locating content related to a media asset
US10169470B2 (en) 2016-04-11 2019-01-01 Rovi Guides, Inc. Systems and methods for identifying a meaning of an ambiguous term in a natural language query
US10503832B2 (en) 2016-07-29 2019-12-10 Rovi Guides, Inc. Systems and methods for disambiguating a term based on static and temporal knowledge graphs
US9959864B1 (en) 2016-10-27 2018-05-01 Google Llc Location-based voice query recognition
US10097898B2 (en) 2016-11-21 2018-10-09 Rovi Guides, Inc. Systems and methods for generating for display recommendations that are temporally relevant to activities of a user and are contextually relevant to a portion of a media asset that the user is consuming
US11094317B2 (en) * 2018-07-31 2021-08-17 Samsung Electronics Co., Ltd. System and method for personalized natural language understanding
CN110176237A (zh) * 2019-07-09 2019-08-27 北京金山数字娱乐科技有限公司 Speech recognition method and apparatus
US11721322B2 (en) 2020-02-28 2023-08-08 Rovi Guides, Inc. Automated word correction in speech recognition systems

Family Cites Families (37)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS60130798A (ja) * 1983-12-19 1985-07-12 松下電器産業株式会社 Speech identification device
US4980918A (en) 1985-05-09 1990-12-25 International Business Machines Corporation Speech recognition system with efficient storage and rapid assembly of phonological graphs
US6239794B1 (en) 1994-08-31 2001-05-29 E Guide, Inc. Method and system for simultaneously displaying a television program and information about the program
US6388714B1 (en) 1995-10-02 2002-05-14 Starsight Telecast Inc Interactive computer system for providing television schedule information
US6177931B1 (en) 1996-12-19 2001-01-23 Index Systems, Inc. Systems and methods for displaying and recording control interface with television programs, video, advertising information and program scheduling information
US5963957A (en) * 1997-04-28 1999-10-05 Philips Electronics North America Corporation Bibliographic music data base with normalized musical themes
US6182038B1 (en) 1997-12-01 2001-01-30 Motorola, Inc. Context dependent phoneme networks for encoding speech information
US6564378B1 (en) 1997-12-08 2003-05-13 United Video Properties, Inc. Program guide system with browsing display
CA2322217C (en) 1998-03-04 2007-04-10 United Video Properties, Inc. Program guide system with targeted advertising
US6236968B1 (en) 1998-05-14 2001-05-22 International Business Machines Corporation Sleep prevention dialog based car system
CN1867068A (zh) 1998-07-14 2006-11-22 联合视频制品公司 Interactive television program guide system and method thereof
CA2730344C (en) 1998-07-17 2014-10-21 United Video Properties, Inc. Interactive television program guide system having multiple devices within a household
AR020608A1 (es) 1998-07-17 2002-05-22 United Video Properties Inc A method and an arrangement for providing a user with remote access to an interactive program guide via a remote access link
US6269335B1 (en) 1998-08-14 2001-07-31 International Business Machines Corporation Apparatus and methods for identifying homophones among words in a speech recognition system
US7165098B1 (en) 1998-11-10 2007-01-16 United Video Properties, Inc. On-line schedule system with personalization features
US6370503B1 (en) 1999-06-30 2002-04-09 International Business Machines Corp. Method and apparatus for improving speech recognition accuracy
MX347698B (es) 2001-02-21 2017-05-09 United Video Properties Inc Systems and methods for interactive program guides with personal recording features.
US7693720B2 (en) 2002-07-15 2010-04-06 Voicebox Technologies, Inc. Mobile systems and methods for responding to natural language speech utterance
JP2006085565A (ja) * 2004-09-17 2006-03-30 Fuji Xerox Co Ltd Information processing apparatus, information processing method, and computer program
US7818179B2 (en) 2004-11-12 2010-10-19 International Business Machines Corporation Devices and methods providing automated assistance for verbal communication
KR100755677B1 (ko) * 2005-11-02 2007-09-05 삼성전자주식회사 Apparatus and method for conversational speech recognition using topic domain detection
US20100153885A1 (en) 2005-12-29 2010-06-17 Rovi Technologies Corporation Systems and methods for interacting with advanced displays provided by an interactive media guidance application
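JP4734155B2 (ja) * 2006-03-24 2011-07-27 株式会社東芝 Speech recognition apparatus, speech recognition method, and speech recognition program
CN101118541B (zh) * 2006-08-03 2011-08-17 苗玉水 Chinese speech recognition method based on Chinese phonetic codes
JP5121252B2 (ja) 2007-02-26 2013-01-16 株式会社東芝 Apparatus, method, and program for translating speech in a source language into a target language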
US20080270110A1 (en) 2007-04-30 2008-10-30 Yurick Steven J Automatic speech recognition with textual content input
WO2009105639A1 (en) 2008-02-22 2009-08-27 Vocera Communications, Inc. System and method for treating homonyms in a speech recognition system
CN101655837B (zh) * 2009-09-08 2010-10-13 北京邮电大学 Method for detecting and correcting errors in text after speech recognition
US8744860B2 (en) * 2010-08-02 2014-06-03 At&T Intellectual Property I, L.P. Apparatus and method for providing messages in a social network
JP6131249B2 (ja) 2011-06-19 2017-05-17 エムモーダル アイピー エルエルシー Speech recognition using context-aware recognition models
WO2013006215A1 (en) * 2011-07-01 2013-01-10 Nec Corporation Method and apparatus of confidence measure calculation
US8606577B1 (en) 2012-06-25 2013-12-10 Google Inc. Visual confirmation of voice recognized text input
US8909526B2 (en) 2012-07-09 2014-12-09 Nuance Communications, Inc. Detecting potential significant errors in speech recognition results
US9588964B2 (en) * 2012-09-18 2017-03-07 Adobe Systems Incorporated Natural language vocabulary generation and usage
US20140122069A1 (en) 2012-10-30 2014-05-01 International Business Machines Corporation Automatic Speech Recognition Accuracy Improvement Through Utilization of Context Analysis
US9189742B2 (en) 2013-11-20 2015-11-17 Justin London Adaptive virtual intelligent agent
US10296160B2 (en) * 2013-12-06 2019-05-21 Apple Inc. Method for extracting salient dialog usage from live data

Also Published As

Publication number Publication date
MX2016017394A (es) 2017-04-27
KR20230130761A (ko) 2023-09-12
ES2675302T3 (es) 2018-07-10
EP3175442A1 (en) 2017-06-07
EP3364408B1 (en) 2021-05-19
CA2954197A1 (en) 2016-02-04
KR20170040134A (ko) 2017-04-12
CA2954197C (en) 2023-03-21
KR102574333B1 (ko) 2023-09-01
AU2015296597A1 (en) 2017-01-12
GB2530871B (en) 2018-11-21
JP6684231B2 (ja) 2020-04-22
CA3187269A1 (en) 2016-02-04
US20160035347A1 (en) 2016-02-04
GB201513493D0 (en) 2015-09-16
KR102438752B1 (ko) 2022-08-30
DK3175442T3 (en) 2018-06-18
WO2016018981A1 (en) 2016-02-04
PT3175442T (pt) 2018-06-19
EP3364408A1 (en) 2018-08-22
CN106471571A (zh) 2017-03-01
KR20220123347A (ko) 2022-09-06
GB2530871A (en) 2016-04-06
PT3364408T (pt) 2021-06-14
US9721564B2 (en) 2017-08-01
EP3175442B1 (en) 2018-06-06
JP2017525993A (ja) 2017-09-07

Similar Documents

Publication Publication Date Title
MX2016017394A (es) Systems and methods for performing automatic speech recognition (ASR) in the presence of heterographs.
EP3472831B8 (en) Techniques for wake-up word recognition and related systems and methods
MX2017003316A (es) Disambiguation of keyboard input.
EP3114679A4 (en) Predicting pronunciation in speech recognition
EP3501023A4 (en) Method and apparatus for voice recognition
EP3172729A4 (en) Text rule based multi-accent speech recognition with single acoustic model and automatic accent detection
EP3479376A4 (en) Method and apparatus for voice recognition based on recognition of speaker
EP3479377A4 (en) Speech recognition
EP3373293A4 (en) Method and apparatus for voice recognition
EP3563373A4 (en) Voice recognition system
WO2016044027A8 (en) Method and apparatus for performing speaker recognition
GB2530131B (en) Speech recognition methods, devices, and systems
EP3535751A4 (en) Method for language-independent way recognition
EP4235395A3 (en) Device voice control
EP3544002A4 (en) Speech recognition device and speech recognition system
EP3384488A4 (en) System and method for implementing a voice user interface by combining a speech-text system and a speech-intention system
EP3183727A4 (en) System and method for speech validation
EP3092638A4 (en) General expressions in automatic speech recognition systems
SG10201807147TA (en) Verification methods and verification devices
SG11201707861UA (en) Systems and methods for executing cryptographically secure transactions using voice and natural language processing
GB201501383D0 (en) Adjusting speech recognition using contextual information
EP3767620A3 (en) Speech endpointing based on word comparisons
EP3232160A4 (en) Voice input assistance device, voice input assistance system, and voice input method
GB2540702A (en) Nucleic acid processing of a nucleic acid fragment with a triazole linkage
EP3204944A4 (en) Method, device, and system of noise reduction and speech enhancement

Legal Events

Date Code Title Description
FG Grant or registration