CN105074815B - 针对语音识别系统的视觉反馈 - Google Patents

针对语音识别系统的视觉反馈 Download PDF

Info

Publication number
CN105074815B
CN105074815B CN201480005988.9A CN201480005988A CN105074815B CN 105074815 B CN105074815 B CN 105074815B CN 201480005988 A CN201480005988 A CN 201480005988A CN 105074815 B CN105074815 B CN 105074815B
Authority
CN
China
Prior art keywords
appearance
indicator
user
voice input
speech recognition
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201480005988.9A
Other languages
English (en)
Chinese (zh)
Other versions
CN105074815A (zh
Inventor
C.克莱因
M.尼曼
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Microsoft Technology Licensing LLC
Original Assignee
Microsoft Technology Licensing LLC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Microsoft Technology Licensing LLC filed Critical Microsoft Technology Licensing LLC
Publication of CN105074815A publication Critical patent/CN105074815A/zh
Application granted granted Critical
Publication of CN105074815B publication Critical patent/CN105074815B/zh
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/06Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
    • G10L21/10Transforming into visible information
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16Sound input; Sound output
    • G06F3/167Audio in a user interface, e.g. using voice commands for navigating, audio feedback
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/03Arrangements for converting the position or the displacement of a member into a coded form
    • G06F3/0304Detection arrangements using opto-electronic means
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/225Feedback of the input speech

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Health & Medical Sciences (AREA)
  • Data Mining & Analysis (AREA)
  • Computational Linguistics (AREA)
  • Quality & Reliability (AREA)
  • Signal Processing (AREA)
  • Acoustics & Sound (AREA)
  • User Interface Of Digital Computer (AREA)
CN201480005988.9A 2013-01-24 2014-01-21 针对语音识别系统的视觉反馈 Active CN105074815B (zh)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US13/749,392 US9721587B2 (en) 2013-01-24 2013-01-24 Visual feedback for speech recognition system
US13/749392 2013-01-24
PCT/US2014/012229 WO2014116548A1 (fr) 2013-01-24 2014-01-21 Rétroaction visuelle pour système de reconnaissance vocale

Publications (2)

Publication Number Publication Date
CN105074815A CN105074815A (zh) 2015-11-18
CN105074815B true CN105074815B (zh) 2019-01-22

Family

ID=50033842

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201480005988.9A Active CN105074815B (zh) 2013-01-24 2014-01-21 针对语音识别系统的视觉反馈

Country Status (4)

Country Link
US (1) US9721587B2 (fr)
EP (1) EP2948944B1 (fr)
CN (1) CN105074815B (fr)
WO (1) WO2014116548A1 (fr)

Families Citing this family (34)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110067059A1 (en) * 2009-09-15 2011-03-17 At&T Intellectual Property I, L.P. Media control
US20130339030A1 (en) * 2012-06-13 2013-12-19 Fluential, Llc Interactive spoken dialogue interface for collection of structured data
JP6229287B2 (ja) * 2013-04-03 2017-11-15 ソニー株式会社 情報処理装置、情報処理方法及びコンピュータプログラム
KR101456974B1 (ko) 2013-05-21 2014-10-31 삼성전자 주식회사 사용자 단말기, 음성인식 서버 및 음성인식 가이드 방법
US9575720B2 (en) * 2013-07-31 2017-02-21 Google Inc. Visual confirmation for a recognized voice-initiated action
GB2518002B (en) * 2013-09-10 2017-03-29 Jaguar Land Rover Ltd Vehicle interface system
US11132173B1 (en) * 2014-02-20 2021-09-28 Amazon Technologies, Inc. Network scheduling of stimulus-based actions
US9430186B2 (en) * 2014-03-17 2016-08-30 Google Inc Visual indication of a recognized voice-initiated action
EP3202125B1 (fr) * 2014-09-30 2019-07-31 Hewlett-Packard Development Company, L.P. Conditionnement de signaux sonores
US9564130B2 (en) * 2014-12-03 2017-02-07 Samsung Electronics Co., Ltd. Wireless controller including indicator
US9946862B2 (en) * 2015-12-01 2018-04-17 Qualcomm Incorporated Electronic device generating notification based on context data in response to speech phrase from user
WO2017188801A1 (fr) * 2016-04-29 2017-11-02 주식회사 브이터치 Procédé de commande optimale basé sur une commande multimode de voix opérationnelle, et dispositif électronique auquel celui-ci est appliqué
US10261752B2 (en) * 2016-08-02 2019-04-16 Google Llc Component libraries for voice interaction services
US10026403B2 (en) * 2016-08-12 2018-07-17 Paypal, Inc. Location based voice association system
US10409552B1 (en) * 2016-09-19 2019-09-10 Amazon Technologies, Inc. Speech-based audio indicators
EP3561653A4 (fr) * 2016-12-22 2019-11-20 Sony Corporation Dispositif de traitement d'informations et procédé de traitement d'informations
KR20180085931A (ko) 2017-01-20 2018-07-30 삼성전자주식회사 음성 입력 처리 방법 및 이를 지원하는 전자 장치
US10359993B2 (en) 2017-01-20 2019-07-23 Essential Products, Inc. Contextual user interface based on environment
US10166465B2 (en) * 2017-01-20 2019-01-01 Essential Products, Inc. Contextual user interface based on video game playback
CN106873937A (zh) * 2017-02-16 2017-06-20 北京百度网讯科技有限公司 语音输入方法和装置
DE102017206876B4 (de) 2017-04-24 2021-12-09 Volkswagen Aktiengesellschaft Verfahren zum Betreiben eines Sprachsteuerungssystems in einem Kraftfahrzeug undSprachsteuerungssystem
CN107155121B (zh) * 2017-04-26 2020-01-10 海信集团有限公司 语音控制文本的显示方法及装置
US11158317B2 (en) * 2017-05-08 2021-10-26 Signify Holding B.V. Methods, systems and apparatus for voice control of a utility
CN107277630B (zh) * 2017-07-20 2019-07-09 海信集团有限公司 语音提示信息的显示方法及装置
CN108108391A (zh) * 2017-11-21 2018-06-01 众安信息技术服务有限公司 用于数据可视化的信息的处理方法以及装置
US11182567B2 (en) * 2018-03-29 2021-11-23 Panasonic Corporation Speech translation apparatus, speech translation method, and recording medium storing the speech translation method
US11544591B2 (en) 2018-08-21 2023-01-03 Google Llc Framework for a computing system that alters user behavior
CN109274828B (zh) * 2018-09-30 2021-01-15 华为技术有限公司 一种生成截图的方法、控制方法及电子设备
US11482215B2 (en) * 2019-03-27 2022-10-25 Samsung Electronics Co., Ltd. Multi-modal interaction with intelligent assistants in voice command devices
DE102019134874A1 (de) * 2019-06-25 2020-12-31 Miele & Cie. Kg Verfahren zur Bedienung eines Geräts durch einen Benutzer mittels einer Sprachsteuerung
CN112533041A (zh) * 2019-09-19 2021-03-19 百度在线网络技术(北京)有限公司 视频播放方法、装置、电子设备和可读存储介质
EP3933560A1 (fr) * 2020-06-30 2022-01-05 Spotify AB Procédés et systèmes permettant de fournir une rétroaction visuelle animée pour les commandes vocales
TWI755037B (zh) * 2020-08-21 2022-02-11 陳筱涵 影音錄製裝置與影音編輯播放系統
WO2022254667A1 (fr) * 2021-06-03 2022-12-08 日産自動車株式会社 Dispositif de commande d'affichage et procédé de commande d'affichage

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6075534A (en) * 1998-03-26 2000-06-13 International Business Machines Corporation Multiple function graphical user interface minibar for speech recognition
CN1615508A (zh) * 2001-12-17 2005-05-11 旭化成株式会社 语音识别方法、遥控器、信息终端、电话通信终端以及语音识别器

Family Cites Families (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5819225A (en) 1996-05-30 1998-10-06 International Business Machines Corporation Display indications of speech processing states in speech recognition system
US5933804A (en) * 1997-04-10 1999-08-03 Microsoft Corporation Extensible speech recognition system that provides a user with audio feedback
US6965863B1 (en) 1998-11-12 2005-11-15 Microsoft Corporation Speech recognition user interface
ES2243451T3 (es) 2000-01-27 2005-12-01 Siemens Aktiengesellschaft Sistema y procedimiento para el procesamiento de voz enfocado a la vision con generacion de una señal de reaccion visual.
US7324947B2 (en) 2001-10-03 2008-01-29 Promptu Systems Corporation Global speech user interface
US7099829B2 (en) 2001-11-06 2006-08-29 International Business Machines Corporation Method of dynamically displaying speech recognition system information
US7047200B2 (en) * 2002-05-24 2006-05-16 Microsoft, Corporation Voice recognition status display
KR100754385B1 (ko) * 2004-09-30 2007-08-31 삼성전자주식회사 오디오/비디오 센서를 이용한 위치 파악, 추적 및 분리장치와 그 방법
US8510109B2 (en) * 2007-08-22 2013-08-13 Canyon Ip Holdings Llc Continuous speech transcription performance indication
US20090037171A1 (en) * 2007-08-03 2009-02-05 Mcfarland Tim J Real-time voice transcription system
US8140335B2 (en) 2007-12-11 2012-03-20 Voicebox Technologies, Inc. System and method for providing a natural language voice user interface in an integrated voice navigation services environment
US10496753B2 (en) * 2010-01-18 2019-12-03 Apple Inc. Automatically adapting user interfaces for hands-free interaction
JP5326934B2 (ja) 2009-01-23 2013-10-30 株式会社Jvcケンウッド 電子機器
US11012732B2 (en) * 2009-06-25 2021-05-18 DISH Technologies L.L.C. Voice enabled media presentation systems and methods
US9159151B2 (en) * 2009-07-13 2015-10-13 Microsoft Technology Licensing, Llc Bringing a visual representation to life via learned input from the user
US8265341B2 (en) * 2010-01-25 2012-09-11 Microsoft Corporation Voice-body identity correlation
US8898324B2 (en) 2010-06-24 2014-11-25 International Business Machines Corporation Data access management in a hybrid memory server
US10496714B2 (en) * 2010-08-06 2019-12-03 Google Llc State-dependent query response
US20120089392A1 (en) 2010-10-07 2012-04-12 Microsoft Corporation Speech recognition user interface
WO2012169679A1 (fr) * 2011-06-10 2012-12-13 엘지전자 주식회사 Appareil d'affichage, procédé de commande d'un appareil d'affichage et système de reconnaissance vocale pour un appareil d'affichage
US9129591B2 (en) * 2012-03-08 2015-09-08 Google Inc. Recognizing speech in multiple languages
WO2014028068A1 (fr) * 2012-08-17 2014-02-20 Flextronics Ap, Llc Centre multimédia

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6075534A (en) * 1998-03-26 2000-06-13 International Business Machines Corporation Multiple function graphical user interface minibar for speech recognition
CN1615508A (zh) * 2001-12-17 2005-05-11 旭化成株式会社 语音识别方法、遥控器、信息终端、电话通信终端以及语音识别器

Also Published As

Publication number Publication date
US20140207452A1 (en) 2014-07-24
CN105074815A (zh) 2015-11-18
EP2948944B1 (fr) 2021-03-10
WO2014116548A1 (fr) 2014-07-31
US9721587B2 (en) 2017-08-01
EP2948944A1 (fr) 2015-12-02

Similar Documents

Publication Publication Date Title
CN105074815B (zh) 针对语音识别系统的视觉反馈
US10453443B2 (en) Providing an indication of the suitability of speech recognition
CN105009031B (zh) 增强现实设备以及在其上操作用户界面的方法
US11871109B2 (en) Interactive application adapted for use by multiple users via a distributed computer-based system
Csapó et al. Overview of auditory representations in human-machine interfaces
US10824310B2 (en) Augmented reality virtual personal assistant for external representation
Jain et al. Head-mounted display visualizations to support sound awareness for the deaf and hard of hearing
KR102357633B1 (ko) 대화 감지
WO2018045553A1 (fr) Système et procédé d'interaction homme-machine
US10409552B1 (en) Speech-based audio indicators
US11183187B2 (en) Dialog method, dialog system, dialog apparatus and program that gives impression that dialog system understands content of dialog
US10521723B2 (en) Electronic apparatus, method of providing guide and non-transitory computer readable recording medium
JP2023525173A (ja) レンダリングされたグラフィカル出力を利用する会話型aiプラットフォーム
KR102515023B1 (ko) 전자 장치 및 그 제어 방법
JP6545716B2 (ja) 改善された音声認識を容易にする視覚的コンテンツの修正
WO2014122416A1 (fr) Analyse de l'émotion dans un discours
KR20150144031A (ko) 음성 인식을 이용하는 사용자 인터페이스 제공 방법 및 사용자 인터페이스 제공 장치
US20230259540A1 (en) Conversational ai platform with extractive question answering
KR102469712B1 (ko) 전자 장치 및 이의 자연어 생성 방법
Karpov et al. Information enquiry kiosk with multimodal user interface
KR20200080389A (ko) 전자 장치 및 그 제어 방법
CN108804897A (zh) 屏幕控制方法、装置、计算机设备及存储介质
KR102585795B1 (ko) 멀티미디어 제공 애플리케이션을 통한 다언어 번역 제공 방법
Prischepa et al. Hierarchical dialogue system for guide robot in shopping mall environments
KR20190091189A (ko) 대화 이해 ai 시스템에 의한, 사용자를 위한 대화 세션에 연관된 관리자 디스플레이를 제어하는 방법, 컴퓨터 판독가능 기록 매체 및 컴퓨터 장치

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant