JP2005234572A5 - - Google Patents

Publication number
JP2005234572A5
Authority
JP
Japan
Prior art keywords
discourse
function
determining
theory
prosodic
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP2005039648A
Other languages
English (en)
Other versions
JP2005234572A (ja)
Priority claimed from US10/781,443 (US7542903B2)
Publication of JP2005234572A
Publication of JP2005234572A5

Claims (15)

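  1. A method of determining a predictive model for discourse functions, comprising the steps of:
    determining a corpus of speech utterances;
    determining at least one discourse function associated with at least one speech utterance;
    determining at least one prosodic feature associated with the at least one discourse function; and
    determining at least one predictive model of discourse functions based on the prosodic feature and the discourse function.
  2. The method as recited in the claim, wherein the discourse function is determined based on at least one theory of discourse analysis selected from the Unified Linguistic Discourse Model, Rhetorical Structure Theory, Discourse Structure Theory, and Structured Discourse Representation Theory.
  3. The method of claim 1, wherein the predictive model is determined based on at least one of machine learning, statistical learning, rule induction, Naive Bayes, decision trees, and support vector machines.
  4. The method of claim 1, wherein the prosodic feature is encoded in a prosodic feature vector.
  5. The method as recited in the claim, wherein a plurality of prosodic features associated with one discourse function are combined into a single prosodic feature vector.
  6. The method of claim 1, wherein the discourse function is an intra-sentential phenomenon used to accomplish task, text, and interaction level discourse activities.
  7. The method of claim 1, wherein the discourse function is an inter-sentential phenomenon used to accomplish task, text, and interaction level discourse activities.
  8. A system for determining a predictive model of discourse functions, comprising:
    an input/output circuit for retrieving a corpus of at least one speech utterance; and
    a processor that determines prosodic features associated with the at least one speech utterance, wherein the processor determines at least one discourse function associated with the corpus of at least one speech utterance, determines at least one prosodic feature associated with the at least one discourse function, and determines a predictive model for discourse functions based on the prosodic feature and the discourse function.
  9. The system as recited in the claim, wherein the discourse function is determined based on at least one theory of discourse analysis selected from the Unified Linguistic Discourse Model, Rhetorical Structure Theory, Discourse Structure Theory, and Structured Discourse Representation Theory.
  10. The system as recited in the claim, wherein the predictive model is determined based on at least one of machine learning, statistical learning, rule induction, Naive Bayes, decision trees, and support vector machines.
  11. The system as recited in the claim, wherein the prosodic feature is encoded in a prosodic feature vector.
  12. The system of claim 11, wherein a plurality of prosodic features associated with one discourse function are combined into a single prosodic feature vector.
  13. The system as recited in the claim, wherein the discourse function is an intra-sentential phenomenon used to accomplish task, text, and interaction level discourse activities.
  14. The system as recited in the claim, wherein the discourse function is an inter-sentential phenomenon used to accomplish task, text, and interaction level discourse activities.
  15. A computer program that programs a computer to determine a predictive model for discourse functions, comprising:
    a procedure for determining a corpus of speech utterances;
    a procedure for determining at least one discourse function associated with at least one speech utterance;
    a procedure for determining at least one prosodic feature associated with the at least one discourse function; and
    a procedure for determining at least one predictive model of discourse functions based on the prosodic feature and the discourse function.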
JP2005039648A 2004-02-18 2005-02-16 Method and system for determining a predictive model for discourse functions Pending JP2005234572A (ja)
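
The four steps of claim 1, together with the feature vector of claims 4 and 5 and the Naive Bayes option named in claim 3, can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the patent: the toy corpus, the discourse-function labels `subordination` and `coordination`, the feature names `pitch_hz`, `pause_s`, and `energy`, and the choice of a Gaussian form of Naive Bayes.

```python
import math
from collections import defaultdict

# Hypothetical toy corpus (step 1). Each utterance already carries a
# discourse-function label (step 2) and a few prosodic measurements (step 3).
# Labels, feature names, and values are illustrative assumptions.
CORPUS = [
    {"function": "subordination", "pitch_hz": 180.0, "pause_s": 0.10, "energy": 0.60},
    {"function": "subordination", "pitch_hz": 175.0, "pause_s": 0.12, "energy": 0.58},
    {"function": "subordination", "pitch_hz": 185.0, "pause_s": 0.08, "energy": 0.62},
    {"function": "coordination", "pitch_hz": 220.0, "pause_s": 0.40, "energy": 0.80},
    {"function": "coordination", "pitch_hz": 225.0, "pause_s": 0.45, "energy": 0.78},
    {"function": "coordination", "pitch_hz": 215.0, "pause_s": 0.38, "energy": 0.82},
]

FEATURES = ("pitch_hz", "pause_s", "energy")


def feature_vector(utterance):
    """Combine several prosodic features into one vector (claims 4 and 5)."""
    return [utterance[name] for name in FEATURES]


def train(corpus):
    """Fit a Gaussian naive Bayes model (step 4): prior, means, variances per label."""
    grouped = defaultdict(list)
    for utt in corpus:
        grouped[utt["function"]].append(feature_vector(utt))
    model = {}
    for label, vectors in grouped.items():
        n = len(vectors)
        columns = list(zip(*vectors))
        means = [sum(col) / n for col in columns]
        # A small floor keeps every variance strictly positive.
        variances = [
            max(sum((x - m) ** 2 for x in col) / n, 1e-6)
            for col, m in zip(columns, means)
        ]
        model[label] = (n / len(corpus), means, variances)
    return model


def predict(model, vector):
    """Return the discourse-function label with the highest log posterior."""
    def log_posterior(label):
        prior, means, variances = model[label]
        score = math.log(prior)
        for x, m, v in zip(vector, means, variances):
            score += -0.5 * math.log(2 * math.pi * v) - (x - m) ** 2 / (2 * v)
        return score
    return max(model, key=log_posterior)


model = train(CORPUS)
print(predict(model, [178.0, 0.11, 0.59]))
```

In this sketch the labels are supplied by hand; under claim 2 they would instead come from an analysis based on one of the named discourse theories. Any of the other learners listed in claim 3 (decision trees, support vector machines, rule induction) could replace the Naive Bayes step without changing the shape of the feature vectors.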

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US10/781,443 US7542903B2 (en) 2004-02-18 2004-02-18 Systems and methods for determining predictive models of discourse functions

Publications (2)

Publication Number Publication Date
JP2005234572A (ja) 2005-09-02
JP2005234572A5 (ja) 2008-04-03

Family

ID=34838743

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2005039648A Pending JP2005234572A (ja) 2004-02-18 2005-02-16 Method and system for determining a predictive model for discourse functions

Country Status (2)

Country Link
US (3) US7542903B2 (ja)
JP (1) JP2005234572A (ja)

Families Citing this family (96)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7398209B2 (en) * 2002-06-03 2008-07-08 Voicebox Technologies, Inc. Systems and methods for responding to natural language speech utterance
US7693720B2 (en) 2002-07-15 2010-04-06 Voicebox Technologies, Inc. Mobile systems and methods for responding to natural language speech utterance
DE602004025616D1 (de) * 2003-12-26 2010-04-01 Kenwood Corp Device control device, method, and program
US7542903B2 (en) * 2004-02-18 2009-06-02 Fuji Xerox Co., Ltd. Systems and methods for determining predictive models of discourse functions
US20050187772A1 (en) * 2004-02-25 2005-08-25 Fuji Xerox Co., Ltd. Systems and methods for synthesizing speech using discourse function level prosodic features
KR100590553B1 (ko) * 2004-05-21 2006-06-19 삼성전자주식회사 Method and apparatus for generating dialog prosody structure, and speech synthesis system employing the same
US8340971B1 (en) * 2005-01-05 2012-12-25 At&T Intellectual Property Ii, L.P. System and method of dialog trajectory analysis
US7640160B2 (en) 2005-08-05 2009-12-29 Voicebox Technologies, Inc. Systems and methods for responding to natural language speech utterance
US7620549B2 (en) * 2005-08-10 2009-11-17 Voicebox Technologies, Inc. System and method of supporting adaptive misrecognition in conversational speech
US8977636B2 (en) 2005-08-19 2015-03-10 International Business Machines Corporation Synthesizing aggregate data of disparate data types into data of a uniform data type
US8924212B1 (en) * 2005-08-26 2014-12-30 At&T Intellectual Property Ii, L.P. System and method for robust access and entry to large structured data using voice form-filling
US7949529B2 (en) 2005-08-29 2011-05-24 Voicebox Technologies, Inc. Mobile systems and methods of supporting natural language human-machine interactions
US7634409B2 (en) 2005-08-31 2009-12-15 Voicebox Technologies, Inc. Dynamic speech sharpening
US8447592B2 (en) * 2005-09-13 2013-05-21 Nuance Communications, Inc. Methods and apparatus for formant-based voice systems
US8694319B2 (en) * 2005-11-03 2014-04-08 International Business Machines Corporation Dynamic prosody adjustment for voice-rendering synthesized data
US20070129943A1 (en) * 2005-12-06 2007-06-07 Microsoft Corporation Speech recognition using adaptation and prior knowledge
US9135339B2 (en) * 2006-02-13 2015-09-15 International Business Machines Corporation Invoking an audio hyperlink
US8032375B2 (en) * 2006-03-17 2011-10-04 Microsoft Corporation Using generic predictive models for slot values in language modeling
JP4353202B2 (ja) * 2006-05-25 2009-10-28 ソニー株式会社 Prosody identification device and method, and speech recognition device and method
US8121890B2 (en) * 2006-06-09 2012-02-21 International Business Machines Corporation Method and system for automated service climate measurement based on social signals
US8073681B2 (en) 2006-10-16 2011-12-06 Voicebox Technologies, Inc. System and method for a cooperative conversational voice user interface
US9318100B2 (en) 2007-01-03 2016-04-19 International Business Machines Corporation Supplementing audio recorded in a media file
US7818176B2 (en) 2007-02-06 2010-10-19 Voicebox Technologies, Inc. System and method for selecting and presenting advertisements based on natural language processing of voice-based input
US8126860B2 (en) * 2007-07-17 2012-02-28 Ricoh Company, Limited Method and apparatus for processing data
US8346756B2 (en) * 2007-08-31 2013-01-01 Microsoft Corporation Calculating valence of expressions within documents for searching a document index
US8712758B2 (en) 2007-08-31 2014-04-29 Microsoft Corporation Coreference resolution in an ambiguity-sensitive natural language processing system
US8229730B2 (en) * 2007-08-31 2012-07-24 Microsoft Corporation Indexing role hierarchies for words in a search index
US20090070322A1 (en) * 2007-08-31 2009-03-12 Powerset, Inc. Browsing knowledge on the basis of semantic relations
US8868562B2 (en) * 2007-08-31 2014-10-21 Microsoft Corporation Identification of semantic relationships within reported speech
US8280721B2 (en) * 2007-08-31 2012-10-02 Microsoft Corporation Efficiently representing word sense probabilities
US8229970B2 (en) * 2007-08-31 2012-07-24 Microsoft Corporation Efficient storage and retrieval of posting lists
US8463593B2 (en) * 2007-08-31 2013-06-11 Microsoft Corporation Natural language hypernym weighting for word sense disambiguation
US8316036B2 (en) * 2007-08-31 2012-11-20 Microsoft Corporation Checkpointing iterators during search
US8639708B2 (en) * 2007-08-31 2014-01-28 Microsoft Corporation Fact-based indexing for natural language search
US7996214B2 (en) * 2007-11-01 2011-08-09 At&T Intellectual Property I, L.P. System and method of exploiting prosodic features for dialog act tagging in a discriminative modeling framework
US8589366B1 (en) * 2007-11-01 2013-11-19 Google Inc. Data extraction using templates
US8140335B2 (en) 2007-12-11 2012-03-20 Voicebox Technologies, Inc. System and method for providing a natural language voice user interface in an integrated voice navigation services environment
US8061142B2 (en) * 2008-04-11 2011-11-22 General Electric Company Mixer for a combustor
US9305548B2 (en) 2008-05-27 2016-04-05 Voicebox Technologies Corporation System and method for an integrated, multi-modal, multi-device natural language voice services environment
US8589161B2 (en) 2008-05-27 2013-11-19 Voicebox Technologies, Inc. System and method for an integrated, multi-modal, multi-device natural language voice services environment
US10127231B2 (en) 2008-07-22 2018-11-13 At&T Intellectual Property I, L.P. System and method for rich media annotation
US8374873B2 (en) * 2008-08-12 2013-02-12 Morphism, Llc Training and applying prosody models
CN102160359B (zh) 2008-09-18 2015-07-08 皇家飞利浦电子股份有限公司 Method of controlling a system, and signal processing system
WO2010045375A1 (en) * 2008-10-14 2010-04-22 Honda Motor Co., Ltd. Improving dialog coherence using semantic features
US9129601B2 (en) 2008-11-26 2015-09-08 At&T Intellectual Property I, L.P. System and method for dialog modeling
US8326637B2 (en) 2009-02-20 2012-12-04 Voicebox Technologies, Inc. System and method for processing multi-modal device interactions in a natural language voice services environment
US8484225B1 (en) 2009-07-22 2013-07-09 Google Inc. Predicting object identity using an ensemble of predictors
WO2011059997A1 (en) 2009-11-10 2011-05-19 Voicebox Technologies, Inc. System and method for providing a natural language content dedication service
US9171541B2 (en) 2009-11-10 2015-10-27 Voicebox Technologies Corporation System and method for hybrid processing in a natural language voice services environment
CN102237081B (zh) * 2010-04-30 2013-04-24 国际商业机器公司 Method and system for evaluating speech prosody
WO2012110690A1 (en) * 2011-02-15 2012-08-23 Nokia Corporation Method apparatus and computer program product for prosodic tagging
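TWI441163B (zh) * 2011-05-10 2014-06-11 Univ Nat Chiao Tung Chinese speech recognition device and recognition method thereof
JP5983604B2 (ja) * 2011-05-25 2016-08-31 日本電気株式会社 Segment information generation device, speech synthesis device, speech synthesis method, and speech synthesis program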
US8959082B2 (en) 2011-10-31 2015-02-17 Elwha Llc Context-sensitive query enrichment
US10008206B2 (en) * 2011-12-23 2018-06-26 National Ict Australia Limited Verifying a user
US10528913B2 (en) 2011-12-30 2020-01-07 Elwha Llc Evidence-based healthcare information management protocols
US10679309B2 (en) 2011-12-30 2020-06-09 Elwha Llc Evidence-based healthcare information management protocols
US20130173295A1 (en) 2011-12-30 2013-07-04 Elwha LLC, a limited liability company of the State of Delaware Evidence-based healthcare information management protocols
US10475142B2 (en) 2011-12-30 2019-11-12 Elwha Llc Evidence-based healthcare information management protocols
US10340034B2 (en) 2011-12-30 2019-07-02 Elwha Llc Evidence-based healthcare information management protocols
US10552581B2 (en) 2011-12-30 2020-02-04 Elwha Llc Evidence-based healthcare information management protocols
US10559380B2 (en) 2011-12-30 2020-02-11 Elwha Llc Evidence-based healthcare information management protocols
US20130325482A1 (en) * 2012-05-29 2013-12-05 GM Global Technology Operations LLC Estimating congnitive-load in human-machine interaction
US8577671B1 (en) 2012-07-20 2013-11-05 Veveo, Inc. Method of and system for using conversation state information in a conversational interaction system
US9465833B2 (en) 2012-07-31 2016-10-11 Veveo, Inc. Disambiguating user intent in conversational interaction system for large corpus information retrieval
US9798799B2 (en) * 2012-11-15 2017-10-24 Sri International Vehicle personal assistant that interprets spoken natural language input based upon vehicle context
RU2530268C2 (ru) 2012-11-28 2014-10-10 Общество с ограниченной ответственностью "Спиктуит" Method for user training of an information dialogue system
US9761247B2 (en) * 2013-01-31 2017-09-12 Microsoft Technology Licensing, Llc Prosodic and lexical addressee detection
DK2994908T3 (da) 2013-05-07 2019-09-23 Veveo Inc Interface for incremental speech input with real-time feedback
US10186262B2 (en) * 2013-07-31 2019-01-22 Microsoft Technology Licensing, Llc System with multiple simultaneous speech recognizers
US9898459B2 (en) 2014-09-16 2018-02-20 Voicebox Technologies Corporation Integration of domain information into state transitions of a finite state transducer for natural language processing
EP3195145A4 (en) 2014-09-16 2018-01-24 VoiceBox Technologies Corporation Voice commerce
WO2016061309A1 (en) 2014-10-15 2016-04-21 Voicebox Technologies Corporation System and method for providing follow-up responses to prior natural language inputs of a user
US10614799B2 (en) 2014-11-26 2020-04-07 Voicebox Technologies Corporation System and method of providing intent predictions for an utterance prior to a system detection of an end of the utterance
US10431214B2 (en) 2014-11-26 2019-10-01 Voicebox Technologies Corporation System and method of determining a domain and/or an action related to a natural language input
US9852136B2 (en) 2014-12-23 2017-12-26 Rovi Guides, Inc. Systems and methods for determining whether a negation statement applies to a current or past query
US9854049B2 (en) 2015-01-30 2017-12-26 Rovi Guides, Inc. Systems and methods for resolving ambiguous terms in social chatter based on a user profile
TWI562000B (en) * 2015-12-09 2016-12-11 Ind Tech Res Inst Internet question answering system and method, and computer readable recording media
US11210324B2 (en) * 2016-06-03 2021-12-28 Microsoft Technology Licensing, Llc Relation extraction across sentence boundaries
US10331784B2 (en) 2016-07-29 2019-06-25 Voicebox Technologies Corporation System and method of disambiguating natural language processing requests
JP6461058B2 (ja) * 2016-09-06 2019-01-30 国立大学法人京都大学 Voice dialogue device and automatic dialogue method using the voice dialogue device
US10373515B2 (en) 2017-01-04 2019-08-06 International Business Machines Corporation System and method for cognitive intervention on human interactions
US10235990B2 (en) 2017-01-04 2019-03-19 International Business Machines Corporation System and method for cognitive intervention on human interactions
US10318639B2 (en) 2017-02-03 2019-06-11 International Business Machines Corporation Intelligent action recommendation
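CN108717413B (zh) * 2018-03-26 2021-10-08 浙江大学 Open-domain question answering method based on hypothetical semi-supervised learning
JP6969491B2 (ja) * 2018-05-11 2021-11-24 トヨタ自動車株式会社 Voice dialogue system, voice dialogue method, and program
JP7063779B2 (ja) * 2018-08-31 2022-05-09 国立大学法人京都大学 Voice dialogue system, voice dialogue method, program, learning model generation device, and learning model generation method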
US11140110B2 (en) 2018-10-26 2021-10-05 International Business Machines Corporation Adaptive dialog strategy for multi turn conversation systems using interaction sequences
DE102018133694B4 (de) * 2018-12-28 2023-09-07 Volkswagen Aktiengesellschaft Method for improving the speech recognition of a user interface
US11256868B2 (en) 2019-06-03 2022-02-22 Microsoft Technology Licensing, Llc Architecture for resolving ambiguous user utterance
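CN110400576B (zh) * 2019-07-29 2021-10-15 北京声智科技有限公司 Method and device for processing voice requests
TWI721516B (zh) * 2019-07-31 2021-03-11 國立交通大學 Method for generating an estimate of the local inverse speaking rate, and device and method for generating a predicted value of the local inverse speaking rate based thereon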
US11928430B2 (en) * 2019-09-12 2024-03-12 Oracle International Corporation Detecting unrelated utterances in a chatbot system
CN110782871B (zh) * 2019-10-30 2020-10-30 百度在线网络技术(北京)有限公司 Prosodic pause prediction method, device, and electronic equipment
US11361754B2 (en) * 2020-01-22 2022-06-14 Conduent Business Services, Llc Method and system for speech effectiveness evaluation and enhancement
CN113688685B (zh) * 2021-07-26 2023-09-22 天津大学 Sign language recognition method based on interaction scenarios

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CA2119397C (en) * 1993-03-19 2007-10-02 Kim E.A. Silverman Improved automated voice synthesis employing enhanced prosodic treatment of text, spelling of text and rate of annunciation
JP3350293B2 (ja) * 1994-08-09 2002-11-25 株式会社東芝 Dialogue processing device and dialogue processing method
US5751907A (en) 1995-08-16 1998-05-12 Lucent Technologies Inc. Speech synthesizer having an acoustic element database
US5790978A (en) 1995-09-15 1998-08-04 Lucent Technologies, Inc. System and method for determining pitch contours
JP2000200273A (ja) * 1998-11-04 2000-07-18 Atr Interpreting Telecommunications Res Lab Utterance intention recognition device
US20040049391A1 (en) * 2002-09-09 2004-03-11 Fuji Xerox Co., Ltd. Systems and methods for dynamic reading fluency proficiency assessment
US7610190B2 (en) 2003-10-15 2009-10-27 Fuji Xerox Co., Ltd. Systems and methods for hybrid text summarization
US7542971B2 (en) 2004-02-02 2009-06-02 Fuji Xerox Co., Ltd. Systems and methods for collaborative note-taking
US7542903B2 (en) * 2004-02-18 2009-06-02 Fuji Xerox Co., Ltd. Systems and methods for determining predictive models of discourse functions
US20050187772A1 (en) 2004-02-25 2005-08-25 Fuji Xerox Co., Ltd. Systems and methods for synthesizing speech using discourse function level prosodic features

Similar Documents

Publication Publication Date Title
JP2005234572A5 (ja)
Walker et al. Sphinx-4: A flexible open source framework for speech recognition
McGraw et al. Personalized speech recognition on mobile devices
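JP3991914B2 (ja) Speech recognition device for mobile bodies
JP2001215993A (ja) Dialogue processing device, dialogue processing method, and recording medium
KR20220108169A (ko) Attention-based clockwork hierarchical variational encoder
JP2004198831A (ja) Speech recognition device and method, program, and recording medium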
Salishev et al. Voice activity detector (VAD) based on long-term mel frequency band features
US11783824B1 (en) Cross-assistant command processing
US20040006469A1 (en) Apparatus and method for updating lexicon
JP6712754B2 (ja) Discourse function estimation device and computer program therefor
Wester et al. A comparison of data-derived and knowledge-based modeling of pronunciation variation
Zhou et al. Two-way speech-to-speech translation on handheld devices.
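JP5243325B2 (ja) Terminal, method, and program using a kana-kanji conversion system for speech recognition
JP3566977B2 (ja) Natural language processing device and method
KR100480790B1 (ko) Continuous speech recognition method and apparatus using a bidirectional n-gram language model
CN100380442C (zh) System and method for Mandarin speech recognition using an optimized phoneme set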
Li et al. A dialectal Chinese speech recognition framework
JP2000222406A (ja) Speech recognition and translation device and method
Vesnicer et al. A voice-driven web browser for blind people.
US11961514B1 (en) Streaming self-attention in a neural network
KR102458830B1 (ko) User-centered voice dialogue system
Rajput et al. Speech in Mobile and Pervasive Environments
Takahashi et al. Interactive voice technology development for telecommunications applications
JP6121313B2 (ja) Pause estimation device, method, and program