CN105074815B - Visual feedback for speech recognition system - Google Patents
Visual feedback for speech recognition system
- Publication number
- CN105074815B (application CN201480005988.9A)
- Authority
- CN
- China
- Prior art keywords
- appearance
- indicator
- user
- voice input
- speech recognition
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 230000000007 visual effect Effects 0.000 title abstract description 9
- 238000000034 method Methods 0.000 claims abstract description 41
- 230000001419 dependent effect Effects 0.000 claims abstract description 3
- 230000004044 response Effects 0.000 claims description 5
- 230000004048 modification Effects 0.000 claims description 4
- 238000012986 modification Methods 0.000 claims description 4
- 238000004891 communication Methods 0.000 description 11
- 230000008569 process Effects 0.000 description 10
- 230000008713 feedback mechanism Effects 0.000 description 9
- 230000008859 change Effects 0.000 description 7
- 238000012545 processing Methods 0.000 description 5
- 238000004458 analytical method Methods 0.000 description 4
- 238000005352 clarification Methods 0.000 description 3
- 238000010586 diagram Methods 0.000 description 3
- 230000014509 gene expression Effects 0.000 description 3
- 230000003068 static effect Effects 0.000 description 3
- 238000004364 calculation method Methods 0.000 description 2
- 238000013461 design Methods 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 230000001133 acceleration Effects 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 230000000712 assembly Effects 0.000 description 1
- 238000000429 assembly Methods 0.000 description 1
- 230000007177 brain activity Effects 0.000 description 1
- 238000004422 calculation algorithm Methods 0.000 description 1
- 239000003086 colorant Substances 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 230000005684 electric field Effects 0.000 description 1
- 238000005538 encapsulation Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 238000010295 mobile communication Methods 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 230000000644 propagated effect Effects 0.000 description 1
- 230000008439 repair process Effects 0.000 description 1
- 239000004065 semiconductor Substances 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 230000026683 transduction Effects 0.000 description 1
- 238000010361 transduction Methods 0.000 description 1
- 230000002618 waking effect Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/06—Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
- G10L21/10—Transforming into visible information
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
- G06F3/167—Audio in a user interface, e.g. using voice commands for navigating, audio feedback
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/03—Arrangements for converting the position or the displacement of a member into a coded form
- G06F3/0304—Detection arrangements using opto-electronic means
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/225—Feedback of the input speech
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- General Engineering & Computer Science (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Multimedia (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- General Health & Medical Sciences (AREA)
- Data Mining & Analysis (AREA)
- Computational Linguistics (AREA)
- Quality & Reliability (AREA)
- Signal Processing (AREA)
- Acoustics & Sound (AREA)
- User Interface Of Digital Computer (AREA)
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US13/749,392 US9721587B2 (en) | 2013-01-24 | 2013-01-24 | Visual feedback for speech recognition system |
US13/749392 | 2013-01-24 | ||
PCT/US2014/012229 WO2014116548A1 (fr) | 2013-01-24 | 2014-01-21 | Visual feedback for speech recognition system |
Publications (2)
Publication Number | Publication Date |
---|---|
CN105074815A (zh) | 2015-11-18 |
CN105074815B (zh) | 2019-01-22 |
Family
ID=50033842
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201480005988.9A Active CN105074815B (zh) | Visual feedback for speech recognition system | 2013-01-24 | 2014-01-21 |
Country Status (4)
Country | Link |
---|---|
US (1) | US9721587B2 (fr) |
EP (1) | EP2948944B1 (fr) |
CN (1) | CN105074815B (fr) |
WO (1) | WO2014116548A1 (fr) |
Families Citing this family (34)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20110067059A1 (en) * | 2009-09-15 | 2011-03-17 | At&T Intellectual Property I, L.P. | Media control |
US20130339030A1 (en) * | 2012-06-13 | 2013-12-19 | Fluential, Llc | Interactive spoken dialogue interface for collection of structured data |
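JP6229287B2 (ja) * | 2013-04-03 | 2017-11-15 | ソニー株式会社 | Information processing device, information processing method, and computer program |
KR101456974B1 (ko) | 2013-05-21 | 2014-10-31 | 삼성전자 주식회사 | User terminal, speech recognition server, and speech recognition guide method |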
US9575720B2 (en) * | 2013-07-31 | 2017-02-21 | Google Inc. | Visual confirmation for a recognized voice-initiated action |
GB2518002B (en) * | 2013-09-10 | 2017-03-29 | Jaguar Land Rover Ltd | Vehicle interface system |
US11132173B1 (en) * | 2014-02-20 | 2021-09-28 | Amazon Technologies, Inc. | Network scheduling of stimulus-based actions |
US9430186B2 (en) * | 2014-03-17 | 2016-08-30 | Google Inc | Visual indication of a recognized voice-initiated action |
EP3202125B1 (fr) * | 2014-09-30 | 2019-07-31 | Hewlett-Packard Development Company, L.P. | Sound signal conditioning |
US9564130B2 (en) * | 2014-12-03 | 2017-02-07 | Samsung Electronics Co., Ltd. | Wireless controller including indicator |
US9946862B2 (en) * | 2015-12-01 | 2018-04-17 | Qualcomm Incorporated | Electronic device generating notification based on context data in response to speech phrase from user |
WO2017188801A1 (fr) * | 2016-04-29 | 2017-11-02 | 주식회사 브이터치 | Optimal control method based on multi-mode operation-voice command, and electronic device to which it is applied |
US10261752B2 (en) * | 2016-08-02 | 2019-04-16 | Google Llc | Component libraries for voice interaction services |
US10026403B2 (en) * | 2016-08-12 | 2018-07-17 | Paypal, Inc. | Location based voice association system |
US10409552B1 (en) * | 2016-09-19 | 2019-09-10 | Amazon Technologies, Inc. | Speech-based audio indicators |
EP3561653A4 (fr) * | 2016-12-22 | 2019-11-20 | Sony Corporation | Information processing device and information processing method |
KR20180085931A (ko) | 2017-01-20 | 2018-07-30 | 삼성전자주식회사 | Voice input processing method and electronic device supporting the same |
US10359993B2 (en) | 2017-01-20 | 2019-07-23 | Essential Products, Inc. | Contextual user interface based on environment |
US10166465B2 (en) * | 2017-01-20 | 2019-01-01 | Essential Products, Inc. | Contextual user interface based on video game playback |
CN106873937A (zh) * | 2017-02-16 | 2017-06-20 | 北京百度网讯科技有限公司 | Voice input method and device |
DE102017206876B4 (de) | 2017-04-24 | 2021-12-09 | Volkswagen Aktiengesellschaft | Method for operating a voice control system in a motor vehicle, and voice control system |
CN107155121B (zh) * | 2017-04-26 | 2020-01-10 | 海信集团有限公司 | Method and device for displaying voice control text |
US11158317B2 (en) * | 2017-05-08 | 2021-10-26 | Signify Holding B.V. | Methods, systems and apparatus for voice control of a utility |
CN107277630B (zh) * | 2017-07-20 | 2019-07-09 | 海信集团有限公司 | Method and device for displaying voice prompt information |
CN108108391A (zh) * | 2017-11-21 | 2018-06-01 | 众安信息技术服务有限公司 | Method and device for processing information for data visualization |
US11182567B2 (en) * | 2018-03-29 | 2021-11-23 | Panasonic Corporation | Speech translation apparatus, speech translation method, and recording medium storing the speech translation method |
US11544591B2 (en) | 2018-08-21 | 2023-01-03 | Google Llc | Framework for a computing system that alters user behavior |
CN109274828B (zh) * | 2018-09-30 | 2021-01-15 | 华为技术有限公司 | Method for generating a screenshot, control method, and electronic device |
US11482215B2 (en) * | 2019-03-27 | 2022-10-25 | Samsung Electronics Co., Ltd. | Multi-modal interaction with intelligent assistants in voice command devices |
DE102019134874A1 (de) * | 2019-06-25 | 2020-12-31 | Miele & Cie. Kg | Method for operating a device by a user by means of voice control |
CN112533041A (zh) * | 2019-09-19 | 2021-03-19 | 百度在线网络技术(北京)有限公司 | Video playback method and device, electronic device, and readable storage medium |
EP3933560A1 (fr) * | 2020-06-30 | 2022-01-05 | Spotify AB | Methods and systems for providing animated visual feedback for voice commands |
TWI755037B (zh) * | 2020-08-21 | 2022-02-11 | 陳筱涵 | Audio-video recording device and audio-video editing and playback system |
WO2022254667A1 (fr) * | 2021-06-03 | 2022-12-08 | 日産自動車株式会社 | Display control device and display control method |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6075534A (en) * | 1998-03-26 | 2000-06-13 | International Business Machines Corporation | Multiple function graphical user interface minibar for speech recognition |
CN1615508A (zh) * | 2001-12-17 | 2005-05-11 | 旭化成株式会社 | Speech recognition method, remote controller, information terminal, telephone communication terminal, and speech recognizer |
Family Cites Families (22)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5819225A (en) | 1996-05-30 | 1998-10-06 | International Business Machines Corporation | Display indications of speech processing states in speech recognition system |
US5933804A (en) * | 1997-04-10 | 1999-08-03 | Microsoft Corporation | Extensible speech recognition system that provides a user with audio feedback |
US6965863B1 (en) | 1998-11-12 | 2005-11-15 | Microsoft Corporation | Speech recognition user interface |
ES2243451T3 (es) | 2000-01-27 | 2005-12-01 | Siemens Aktiengesellschaft | System and method for vision-focused speech processing with generation of a visual feedback signal |
US7324947B2 (en) | 2001-10-03 | 2008-01-29 | Promptu Systems Corporation | Global speech user interface |
US7099829B2 (en) | 2001-11-06 | 2006-08-29 | International Business Machines Corporation | Method of dynamically displaying speech recognition system information |
US7047200B2 (en) * | 2002-05-24 | 2006-05-16 | Microsoft Corporation | Voice recognition status display |
KR100754385B1 (ko) * | 2004-09-30 | 2007-08-31 | 삼성전자주식회사 | Apparatus and method for localizing, tracking, and separating using audio/video sensors |
US8510109B2 (en) * | 2007-08-22 | 2013-08-13 | Canyon Ip Holdings Llc | Continuous speech transcription performance indication |
US20090037171A1 (en) * | 2007-08-03 | 2009-02-05 | Mcfarland Tim J | Real-time voice transcription system |
US8140335B2 (en) | 2007-12-11 | 2012-03-20 | Voicebox Technologies, Inc. | System and method for providing a natural language voice user interface in an integrated voice navigation services environment |
US10496753B2 (en) * | 2010-01-18 | 2019-12-03 | Apple Inc. | Automatically adapting user interfaces for hands-free interaction |
JP5326934B2 (ja) | 2009-01-23 | 2013-10-30 | 株式会社Jvcケンウッド | Electronic device |
US11012732B2 (en) * | 2009-06-25 | 2021-05-18 | DISH Technologies L.L.C. | Voice enabled media presentation systems and methods |
US9159151B2 (en) * | 2009-07-13 | 2015-10-13 | Microsoft Technology Licensing, Llc | Bringing a visual representation to life via learned input from the user |
US8265341B2 (en) * | 2010-01-25 | 2012-09-11 | Microsoft Corporation | Voice-body identity correlation |
US8898324B2 (en) | 2010-06-24 | 2014-11-25 | International Business Machines Corporation | Data access management in a hybrid memory server |
US10496714B2 (en) * | 2010-08-06 | 2019-12-03 | Google Llc | State-dependent query response |
US20120089392A1 (en) | 2010-10-07 | 2012-04-12 | Microsoft Corporation | Speech recognition user interface |
WO2012169679A1 (fr) * | 2011-06-10 | 2012-12-13 | 엘지전자 주식회사 | Display apparatus, method for controlling a display apparatus, and speech recognition system for a display apparatus |
US9129591B2 (en) * | 2012-03-08 | 2015-09-08 | Google Inc. | Recognizing speech in multiple languages |
WO2014028068A1 (fr) * | 2012-08-17 | 2014-02-20 | Flextronics Ap, Llc | Media center |
2013
- 2013-01-24: US application US13/749,392, patent US9721587B2 (en), Active
2014
- 2014-01-21: EP application EP14702725.4A, patent EP2948944B1 (fr), Active
- 2014-01-21: CN application CN201480005988.9A, patent CN105074815B (zh), Active
- 2014-01-21: WO application PCT/US2014/012229, publication WO2014116548A1 (fr), Application Filing
Also Published As
Publication number | Publication date |
---|---|
US20140207452A1 (en) | 2014-07-24 |
CN105074815A (zh) | 2015-11-18 |
EP2948944B1 (fr) | 2021-03-10 |
WO2014116548A1 (fr) | 2014-07-31 |
US9721587B2 (en) | 2017-08-01 |
EP2948944A1 (fr) | 2015-12-02 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
CN105074815B (zh) | Visual feedback for speech recognition system | |
US10453443B2 (en) | Providing an indication of the suitability of speech recognition | |
CN105009031B (zh) | Augmented reality device and method for operating a user interface thereon | |
US11871109B2 (en) | Interactive application adapted for use by multiple users via a distributed computer-based system | |
Csapó et al. | Overview of auditory representations in human-machine interfaces | |
US10824310B2 (en) | Augmented reality virtual personal assistant for external representation | |
Jain et al. | Head-mounted display visualizations to support sound awareness for the deaf and hard of hearing | |
KR102357633B1 (ko) | Conversation detection | |
WO2018045553A1 (fr) | Human-machine interaction system and method | |
US10409552B1 (en) | Speech-based audio indicators | |
US11183187B2 (en) | Dialog method, dialog system, dialog apparatus and program that gives impression that dialog system understands content of dialog | |
US10521723B2 (en) | Electronic apparatus, method of providing guide and non-transitory computer readable recording medium | |
JP2023525173A (ja) | Conversational AI platform utilizing rendered graphical output | |
KR102515023B1 (ko) | Electronic device and control method thereof | |
JP6545716B2 (ja) | Modification of visual content to facilitate improved speech recognition | |
WO2014122416A1 (fr) | Analysis of emotion in speech | |
KR20150144031A (ko) | Method and apparatus for providing a user interface using speech recognition | |
US20230259540A1 (en) | Conversational ai platform with extractive question answering | |
KR102469712B1 (ko) | Electronic device and natural language generation method thereof | |
Karpov et al. | Information enquiry kiosk with multimodal user interface | |
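KR20200080389A (ko) | Electronic device and control method thereof | |
CN108804897A (zh) | Screen control method and device, computer equipment, and storage medium | |
KR102585795B1 (ko) | Method for providing multilingual translation through a multimedia-providing application | |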
Prischepa et al. | Hierarchical dialogue system for guide robot in shopping mall environments | |
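KR20190091189A (ko) | Method for controlling, by a conversation-understanding AI system, an administrator display associated with a conversation session for a user, computer-readable recording medium, and computer device |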
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||