WO2016027909A8 - データ構造、音声対話装置及び電子機器 - Google Patents

データ構造、音声対話装置及び電子機器 Download PDF

Info

Publication number
WO2016027909A8
WO2016027909A8 PCT/JP2015/078633 JP2015078633W WO2016027909A8 WO 2016027909 A8 WO2016027909 A8 WO 2016027909A8 JP 2015078633 W JP2015078633 W JP 2015078633W WO 2016027909 A8 WO2016027909 A8 WO 2016027909A8
Authority
WO
WIPO (PCT)
Prior art keywords
data structure
voice response
interactive voice
electronic device
speech content
Prior art date
Application number
PCT/JP2015/078633
Other languages
English (en)
French (fr)
Other versions
WO2016027909A1 (ja
Inventor
晃二 福永
Original Assignee
シャープ株式会社
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by シャープ株式会社 filed Critical シャープ株式会社
Priority to US15/328,169 priority Critical patent/US20170221481A1/en
Publication of WO2016027909A1 publication Critical patent/WO2016027909A1/ja
Publication of WO2016027909A8 publication Critical patent/WO2016027909A8/ja

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/08Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/1822Parsing for meaning understanding
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/28Constructional details of speech recognition systems
    • G10L15/30Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L2015/088Word spotting

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Artificial Intelligence (AREA)
  • Telephonic Communication Services (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Navigation (AREA)

Abstract

 高い処理能力を必要とせず、会話が発散した場合であっても、対話を適切なタイミングで継続して行うことを可能にする。本発明のデータ構造は、少なくとも、使用者に対して発話する発話内容(Speak)と、当該発話内容に対して会話が成り立つ応答内容(Return)と、当該発話内容の属性を示す属性情報(Entity)と、を一つのセットとしたデータ構造である。
PCT/JP2015/078633 2014-08-20 2015-10-08 データ構造、音声対話装置及び電子機器 WO2016027909A1 (ja)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US15/328,169 US20170221481A1 (en) 2014-08-20 2015-10-08 Data structure, interactive voice response device, and electronic device

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2014-167856 2014-08-20
JP2014167856A JP6448950B2 (ja) 2014-08-20 2014-08-20 音声対話装置及び電子機器

Publications (2)

Publication Number Publication Date
WO2016027909A1 WO2016027909A1 (ja) 2016-02-25
WO2016027909A8 true WO2016027909A8 (ja) 2016-04-14

Family

ID=55350847

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2015/078633 WO2016027909A1 (ja) 2014-08-20 2015-10-08 データ構造、音声対話装置及び電子機器

Country Status (3)

Country Link
US (1) US20170221481A1 (ja)
JP (1) JP6448950B2 (ja)
WO (1) WO2016027909A1 (ja)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP7224116B2 (ja) 2018-06-15 2023-02-17 シャープ株式会社 空気調和機

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108369804A (zh) * 2015-12-07 2018-08-03 雅马哈株式会社 语音交互设备和语音交互方法
JP2018054790A (ja) * 2016-09-28 2018-04-05 トヨタ自動車株式会社 音声対話システムおよび音声対話方法
JP6690767B1 (ja) * 2019-09-30 2020-04-28 大日本印刷株式会社 対話シナリオのデータ構造、対話システム、サーバ装置、クライアント装置、及びコンピュータプログラム

Family Cites Families (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0792993A (ja) * 1993-09-20 1995-04-07 Fujitsu Ltd 音声認識装置
JP2003091299A (ja) * 2001-07-13 2003-03-28 Honda Motor Co Ltd 車載用音声認識装置
US7519534B2 (en) * 2002-10-31 2009-04-14 Agiletv Corporation Speech controlled access to content on a presentation medium
JP4729902B2 (ja) * 2003-12-12 2011-07-20 株式会社豊田中央研究所 音声対話システム
US7487085B2 (en) * 2004-08-24 2009-02-03 International Business Machines Corporation Method and system of building a grammar rule with baseforms generated dynamically from user utterances
JP4353212B2 (ja) * 2006-07-20 2009-10-28 株式会社デンソー 単語列認識装置
US8374874B2 (en) * 2006-09-11 2013-02-12 Nuance Communications, Inc. Establishing a multimodal personality for a multimodal application in dependence upon attributes of user interaction
US8073681B2 (en) * 2006-10-16 2011-12-06 Voicebox Technologies, Inc. System and method for a cooperative conversational voice user interface
US7949526B2 (en) * 2007-06-04 2011-05-24 Microsoft Corporation Voice aware demographic personalization
US8374859B2 (en) * 2008-08-20 2013-02-12 Universal Entertainment Corporation Automatic answering device, automatic answering system, conversation scenario editing device, conversation server, and automatic answering method
JP5195405B2 (ja) * 2008-12-25 2013-05-08 トヨタ自動車株式会社 応答生成装置及びプログラム
US20130211841A1 (en) * 2012-02-15 2013-08-15 Fluential, Llc Multi-Dimensional Interactions and Recall
US8977555B2 (en) * 2012-12-20 2015-03-10 Amazon Technologies, Inc. Identification of utterance subjects
JP6126870B2 (ja) * 2013-03-01 2017-05-10 本田技研工業株式会社 音声対話システム及び音声対話方法
US10726831B2 (en) * 2014-05-20 2020-07-28 Amazon Technologies, Inc. Context interpretation in natural language processing using previous dialog acts

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP7224116B2 (ja) 2018-06-15 2023-02-17 シャープ株式会社 空気調和機

Also Published As

Publication number Publication date
US20170221481A1 (en) 2017-08-03
JP2016045253A (ja) 2016-04-04
JP6448950B2 (ja) 2019-01-09
WO2016027909A1 (ja) 2016-02-25

Similar Documents

Publication Publication Date Title
AU2019268131A1 (en) Speech recognition method, speech wakeup apparatus, speech recognition apparatus, and terminal
WO2018038385A3 (ko) 음성 인식 방법 및 이를 수행하는 전자 장치
WO2015009586A3 (en) Performing an operation relative to tabular data based upon voice input
CN106687908A8 (zh) 用于调用话音输入的手势快捷方式
WO2014107635A3 (en) Speech modification for distributed story reading
EP3751561A3 (en) Hotword recognition
WO2015184196A3 (en) Speech summary and action item generation
EP4283613A3 (en) Noise mitigation for a voice interface device
MY179900A (en) Speech recognition method and speech recognition apparatus
EP3057093A3 (en) Operating method for voice function and electronic device supporting the same
WO2014124332A3 (en) Voice trigger for a digital assistant
WO2012173941A3 (en) Speech recognition using loosely coupled components
EP2963643A3 (en) Entity name recognition
WO2014004536A3 (en) Voice-based image tagging and searching
EP4239628A3 (en) Determining hotword suitability
MX2015009812A (es) Metodo y sistema para el reconicimiento de comandos de voz.
WO2016027909A8 (ja) データ構造、音声対話装置及び電子機器
MX2014010795A (es) Dispositivo para extraer informacion a partir de un dialogo.
WO2016033480A3 (en) Intermediate compression for higher order ambisonic audio data
WO2014022306A3 (en) Dynamic context-based language determination
WO2016009444A3 (en) Music performance system and method thereof
EP2385520A3 (en) Method and device for generating text from spoken word
WO2018118492A3 (en) Linguistic modeling using sets of base phonetics
EP2816489A3 (en) Text entry at electronic communication device
EP3444819A4 (en) LANGUAGE SIGNAL CASCADE PROCESSING AND SENDING DEVICE AND COMPUTER-READABLE STORAGE MEDIUM

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 15833600

Country of ref document: EP

Kind code of ref document: A1

WWE Wipo information: entry into national phase

Ref document number: 15328169

Country of ref document: US

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 15833600

Country of ref document: EP

Kind code of ref document: A1