MX361307B - Contenido visual modificado para facilitar el reconocimiento mejorado de voz. - Google Patents
Contenido visual modificado para facilitar el reconocimiento mejorado de voz.Info
- Publication number
- MX361307B MX361307B MX2016016131A MX2016016131A MX361307B MX 361307 B MX361307 B MX 361307B MX 2016016131 A MX2016016131 A MX 2016016131A MX 2016016131 A MX2016016131 A MX 2016016131A MX 361307 B MX361307 B MX 361307B
- Authority
- MX
- Mexico
- Prior art keywords
- visual content
- speech recognition
- modified
- modification
- facilitate improved
- Prior art date
Links
- 230000000007 visual effect Effects 0.000 title abstract 5
- 230000004048 modification Effects 0.000 title 1
- 238000012986 modification Methods 0.000 title 1
- 238000005516 engineering process Methods 0.000 abstract 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/24—Speech recognition using non-acoustical features
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/103—Formatting, i.e. changing of presentation of documents
- G06F40/106—Display of layout of documents; Previewing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/166—Editing, e.g. inserting or deleting
- G06F40/174—Form filling; Merging
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N5/00—Details of television systems
- H04N5/222—Studio circuitry; Studio devices; Studio equipment
- H04N5/262—Studio circuits, e.g. for mixing, switching-over, change of character of image, other special effects ; Cameras specially adapted for the electronic generation of special effects
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/18—Closed-circuit television [CCTV] systems, i.e. systems in which the video signal is not broadcast
- H04N7/183—Closed-circuit television [CCTV] systems, i.e. systems in which the video signal is not broadcast for receiving images from a single remote source
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Multimedia (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- General Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Signal Processing (AREA)
- User Interface Of Digital Computer (AREA)
- Rehabilitation Tools (AREA)
- Eye Examination Apparatus (AREA)
- Controls And Circuits For Display Device (AREA)
- Road Signs Or Road Markings (AREA)
- Digital Computer Display Output (AREA)
Abstract
Se describen tecnologías que se refieren a la modificación de contenido visual para presentarse en una presentación para facilitar el mejoramiento del desempeño de un sistema de reconocimiento automático de voz (ASR). El contenido visual se modifica para mover elementos lejos uno del otro, en donde los elementos movidos dan lugar a la ambigüedad desde la perspectiva del sistema ASR. El contenido visual se modifica para tomar en cuenta la exactitud del rastreo de mirada. Cuando un usuario ve un elemento en el contenido visual modificado, el sistema ASR se personaliza como una función del elemento que se está siendo visto por el usuario.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US14/297,742 US9583105B2 (en) | 2014-06-06 | 2014-06-06 | Modification of visual content to facilitate improved speech recognition |
PCT/US2015/033865 WO2015187756A2 (en) | 2014-06-06 | 2015-06-03 | Modification of visual content to facilitate improved speech recognition |
Publications (2)
Publication Number | Publication Date |
---|---|
MX2016016131A MX2016016131A (es) | 2017-03-08 |
MX361307B true MX361307B (es) | 2018-12-03 |
Family
ID=54540159
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
MX2016016131A MX361307B (es) | 2014-06-06 | 2015-06-03 | Contenido visual modificado para facilitar el reconocimiento mejorado de voz. |
Country Status (11)
Country | Link |
---|---|
US (1) | US9583105B2 (es) |
EP (1) | EP3152754B1 (es) |
JP (1) | JP6545716B2 (es) |
KR (1) | KR102393147B1 (es) |
CN (1) | CN106463119B (es) |
AU (1) | AU2015271726B2 (es) |
BR (1) | BR112016026904B1 (es) |
CA (1) | CA2948523C (es) |
MX (1) | MX361307B (es) |
RU (1) | RU2684475C2 (es) |
WO (1) | WO2015187756A2 (es) |
Families Citing this family (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9613267B2 (en) * | 2012-05-31 | 2017-04-04 | Xerox Corporation | Method and system of extracting label:value data from a document |
KR102342117B1 (ko) * | 2015-03-13 | 2021-12-21 | 엘지전자 주식회사 | 단말기, 및 이를 구비하는 홈 어플라이언스 시스템 |
KR101904889B1 (ko) | 2016-04-21 | 2018-10-05 | 주식회사 비주얼캠프 | 표시 장치와 이를 이용한 입력 처리 방법 및 시스템 |
WO2017183943A1 (ko) * | 2016-04-21 | 2017-10-26 | 주식회사 비주얼캠프 | 표시 장치와 이를 이용한 입력 처리 방법 및 시스템 |
SG11201908535XA (en) * | 2017-03-17 | 2019-10-30 | Uilicious Private Ltd | Systems, methods and computer readable media for ambiguity resolution in instruction statement interpretation |
US10142686B2 (en) * | 2017-03-30 | 2018-11-27 | Rovi Guides, Inc. | System and methods for disambiguating an ambiguous entity in a search query based on the gaze of a user |
CN109445757B (zh) * | 2018-09-21 | 2022-07-29 | 深圳变设龙信息科技有限公司 | 新设计图生成方法、装置及终端设备 |
JP7414231B2 (ja) | 2019-07-11 | 2024-01-16 | 中部電力株式会社 | マルチモーダル音声認識装置およびマルチモーダル音声認識方法 |
KR20210133600A (ko) * | 2020-04-29 | 2021-11-08 | 현대자동차주식회사 | 차량 음성 인식 방법 및 장치 |
Family Cites Families (27)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP3530591B2 (ja) * | 1994-09-14 | 2004-05-24 | キヤノン株式会社 | 音声認識装置及びこれを用いた情報処理装置とそれらの方法 |
US6629074B1 (en) * | 1997-08-14 | 2003-09-30 | International Business Machines Corporation | Resource utilization indication and commit mechanism in a data processing system and method therefor |
US7720682B2 (en) | 1998-12-04 | 2010-05-18 | Tegic Communications, Inc. | Method and apparatus utilizing voice input to resolve ambiguous manually entered text input |
DE50104533D1 (de) | 2000-01-27 | 2004-12-23 | Siemens Ag | System und verfahren zur blickfokussierten sprachverarbeitung |
US6741791B1 (en) * | 2000-01-31 | 2004-05-25 | Intel Corporation | Using speech to select a position in a program |
US7036080B1 (en) | 2001-11-30 | 2006-04-25 | Sap Labs, Inc. | Method and apparatus for implementing a speech interface for a GUI |
US20050182558A1 (en) * | 2002-04-12 | 2005-08-18 | Mitsubishi Denki Kabushiki Kaisha | Car navigation system and speech recognizing device therefor |
US7158779B2 (en) * | 2003-11-11 | 2007-01-02 | Microsoft Corporation | Sequential multimodal input |
CN102272827B (zh) * | 2005-06-01 | 2013-07-10 | 泰吉克通讯股份有限公司 | 利用语音输入解决模糊的手工输入文本输入的方法和装置 |
US7627819B2 (en) * | 2005-11-01 | 2009-12-01 | At&T Intellectual Property I, L.P. | Visual screen indicator |
JP4399607B2 (ja) * | 2006-02-13 | 2010-01-20 | 国立大学法人埼玉大学 | 視線制御表示装置と表示方法 |
BRPI0708456A2 (pt) * | 2006-03-03 | 2011-05-31 | Koninkl Philips Electronics Nv | método para prover um sumário de diversas imagens, dispositivo adaptado para gerar um sumário de diversas imagens, sistema, código de programa executável por computador, e, portador de dados |
US9250703B2 (en) | 2006-03-06 | 2016-02-02 | Sony Computer Entertainment Inc. | Interface with gaze detection and voice input |
US8793620B2 (en) | 2011-04-21 | 2014-07-29 | Sony Computer Entertainment Inc. | Gaze-assisted computer interface |
US20080141166A1 (en) * | 2006-12-11 | 2008-06-12 | Cisco Technology, Inc. | Using images in alternative navigation |
US7983915B2 (en) * | 2007-04-30 | 2011-07-19 | Sonic Foundry, Inc. | Audio content search engine |
JP5230120B2 (ja) * | 2007-05-07 | 2013-07-10 | 任天堂株式会社 | 情報処理システム、情報処理プログラム |
US20130125051A1 (en) * | 2007-09-28 | 2013-05-16 | Adobe Systems Incorporated | Historical review using manipulable visual indicators |
US8386260B2 (en) * | 2007-12-31 | 2013-02-26 | Motorola Mobility Llc | Methods and apparatus for implementing distributed multi-modal applications |
US8438485B2 (en) * | 2009-03-17 | 2013-05-07 | Unews, Llc | System, method, and apparatus for generating, customizing, distributing, and presenting an interactive audio publication |
US9197736B2 (en) | 2009-12-31 | 2015-11-24 | Digimarc Corporation | Intuitive computing methods and systems |
US9507418B2 (en) * | 2010-01-21 | 2016-11-29 | Tobii Ab | Eye tracker based contextual action |
JP2012022589A (ja) * | 2010-07-16 | 2012-02-02 | Hitachi Ltd | 商品選択支援方法 |
US10120438B2 (en) * | 2011-05-25 | 2018-11-06 | Sony Interactive Entertainment Inc. | Eye gaze to alter device behavior |
US9423870B2 (en) | 2012-05-08 | 2016-08-23 | Google Inc. | Input determination method |
US9823742B2 (en) * | 2012-05-18 | 2017-11-21 | Microsoft Technology Licensing, Llc | Interaction and management of devices using gaze detection |
KR102156175B1 (ko) * | 2012-10-09 | 2020-09-15 | 삼성전자주식회사 | 멀티 모달리티를 활용한 유저 인터페이스를 제공하는 인터페이싱 장치 및 그 장치를 이용한 방법 |
-
2014
- 2014-06-06 US US14/297,742 patent/US9583105B2/en active Active
-
2015
- 2015-06-03 EP EP15793931.5A patent/EP3152754B1/en active Active
- 2015-06-03 BR BR112016026904-7A patent/BR112016026904B1/pt active IP Right Grant
- 2015-06-03 JP JP2016567801A patent/JP6545716B2/ja active Active
- 2015-06-03 KR KR1020167037034A patent/KR102393147B1/ko active IP Right Grant
- 2015-06-03 CA CA2948523A patent/CA2948523C/en active Active
- 2015-06-03 MX MX2016016131A patent/MX361307B/es active IP Right Grant
- 2015-06-03 WO PCT/US2015/033865 patent/WO2015187756A2/en active Application Filing
- 2015-06-03 CN CN201580029986.8A patent/CN106463119B/zh active Active
- 2015-06-03 AU AU2015271726A patent/AU2015271726B2/en active Active
- 2015-06-03 RU RU2016147071A patent/RU2684475C2/ru active
Also Published As
Publication number | Publication date |
---|---|
WO2015187756A3 (en) | 2016-01-28 |
JP2017525002A (ja) | 2017-08-31 |
EP3152754B1 (en) | 2018-01-10 |
RU2684475C2 (ru) | 2019-04-09 |
AU2015271726B2 (en) | 2020-04-09 |
RU2016147071A3 (es) | 2018-12-29 |
AU2015271726A1 (en) | 2016-11-17 |
KR20170016399A (ko) | 2017-02-13 |
CA2948523C (en) | 2021-12-07 |
KR102393147B1 (ko) | 2022-04-29 |
WO2015187756A2 (en) | 2015-12-10 |
CA2948523A1 (en) | 2015-12-10 |
RU2016147071A (ru) | 2018-06-01 |
US20150356971A1 (en) | 2015-12-10 |
BR112016026904A2 (pt) | 2017-08-15 |
US9583105B2 (en) | 2017-02-28 |
JP6545716B2 (ja) | 2019-07-17 |
BR112016026904A8 (pt) | 2021-07-13 |
EP3152754A2 (en) | 2017-04-12 |
MX2016016131A (es) | 2017-03-08 |
CN106463119B (zh) | 2020-07-10 |
CN106463119A (zh) | 2017-02-22 |
BR112016026904B1 (pt) | 2023-03-14 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
MX361307B (es) | Contenido visual modificado para facilitar el reconocimiento mejorado de voz. | |
EP3785187A4 (en) | PERSONALIZED GESTURES RECOGNITION FOR USER INTERACTION WITH ASSISTANCE SYSTEMS | |
MX2016015045A (es) | Aparato domestico de operacion electrica teniendo dispositivo de reconocimiento de voz. | |
EP3241372A4 (en) | Contextual based gesture recognition and control | |
EP2775377A3 (en) | Automatic fitting of haptic effects | |
WO2015107225A3 (en) | Interactive system | |
EP3384488A4 (en) | SYSTEM AND METHOD FOR IMPLEMENTING A VOICE USER INTERFACE BY COMBINING A SPEECH-TEXT SYSTEM AND A SPEECH-INTENTION SYSTEM | |
MX2015017625A (es) | Reconocimiento de evento adaptable. | |
EP3286669A4 (en) | Automatic content recognition fingerprint sequence matching | |
WO2014125380A3 (en) | Systems and methods of eye tracking calibration | |
MX2015010771A (es) | Expansor del iris. | |
EP4086897A3 (en) | Recognizing accented speech | |
EP4239628A3 (en) | Determining hotword suitability | |
MY164536A (en) | Syringe | |
WO2014165392A3 (en) | Content presentation based on social recommendations | |
TWD178036S (zh) | 自動注射器之外殼之部分 | |
WO2014143885A3 (en) | Automatic invocation of a dialog user interface for translation applications | |
MX2016002668A (es) | Metodo y dispositivo para empujar informacion. | |
GB2538392A (en) | Ranging using current profiling | |
GB2553443A (en) | Assist layer with automated extraction | |
WO2016050086A8 (en) | Interaction method for user interfaces | |
MX2015004950A (es) | Metodo y aparato para determinar la temperatura. | |
EP3171354A4 (en) | Language learning system utilizing component unit, more segmented than phoneme, or various games | |
EP3128240A3 (en) | Household appliance system and household appliance | |
EP3516540A4 (en) | TECHNIQUES FOR RETRIEVING DATA IN MEMORY |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
FG | Grant or registration |