CN102160043A - 针对集成多语气多装置自然语言语音服务环境的系统和方法 - Google Patents

针对集成多语气多装置自然语言语音服务环境的系统和方法 Download PDF

Info

Publication number
CN102160043A
CN102160043A CN2008801303038A CN200880130303A CN102160043A CN 102160043 A CN102160043 A CN 102160043A CN 2008801303038 A CN2008801303038 A CN 2008801303038A CN 200880130303 A CN200880130303 A CN 200880130303A CN 102160043 A CN102160043 A CN 102160043A
Authority
CN
China
Prior art keywords
intention
natural language
input media
input
lingual
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN2008801303038A
Other languages
English (en)
Other versions
CN102160043B (zh
Inventor
罗伯特·肯尼维克
克里斯·魏德
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Weber Assets Co., Ltd.
Original Assignee
VoiceBox Technologies Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by VoiceBox Technologies Corp filed Critical VoiceBox Technologies Corp
Publication of CN102160043A publication Critical patent/CN102160043A/zh
Application granted granted Critical
Publication of CN102160043B publication Critical patent/CN102160043B/zh
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/28Constructional details of speech recognition systems
    • G10L15/30Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/28Constructional details of speech recognition systems
    • G10L15/32Multiple recognisers used in sequence or in parallel; Score combination systems therefor, e.g. voting systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16Sound input; Sound output

Abstract

提供了一种针对集成多语气多装置自然语言语音服务环境的系统和方法。具体来说,该环境包括多个语音装置,每个语音装置具有用于处理多语气自然语言输入的意图确定能力,除此之外还具有环境中其他装置的意图确定能力的知识。另外,该环境可被布置为集中方式、分布对等方式、或者这些方式的各种组合。这样,各个装置可协作来确定多语气自然语言输入的意图,并且可将命令、询问或其他请求传递到最适于作出响应来进行动作的一个或多个装置中。

Description

针对集成多语气多装置自然语言语音服务环境的系统和方法
技术领域
本发明涉及一种集成语音服务环境,其中多个装置可通过协同处理自由形式多语气的自然语言输入来提供各种语音服务,从而便于用户与集成环境中的一个或多个装置之间的会话式交互。
背景技术
作为近年来发展的技术,消耗品电子装置已呈现出在许多人的日常生活中几乎无处不在。为了满足由移动电话、导航装置、嵌入装置及其它类似装置的功能性和移动性而产生的日益增加的需求,除了核心应用程序之外还常常在这些装置中提供大量特征和功能。然而更强的功能性还引出了这样的折衷,它们包括常常约束用户无法完全发觉其电子装置的全部性能的学习曲线。例如,许多现有的电子装置包括复杂的并不特别用户友好的人机界面,这约束了许多技术被投入大量的市场应用。而且,繁琐的界面常导致期望的特征不能被发现(例如,在一些冗长而不易导航的菜单中),这趋向于使许多用户不去使用甚至并不知晓其装置的潜在性能。
这样,由许多电子装置提供的增加的功能性常趋于被浪费,正如市场调查所表明的那样,许多用户只使用给定装置上的一小部分可用特征或应用程序。而且,在一个无线网络和宽带接入越来越普及的社会中,消费者自然倾向于希望他们的电子装置具有无缝移动能力。因此,随着消费者要求强化更简单的机制来与电子装置进行交互,阻止快速和准确交互的繁琐界面会变成一个重要的关注点。因此,对以直观方式使用技术的机制的持续增长的需求还在很大程度上未得到满足。
简化电子装置中的人机交互的一种方法包括使用语音识别软件,这种软件可使用户发觉不熟悉的、未知的或难使用的特征。例如,最近由Navteq公司进行的一项调查提供了在自动导航和基于页面应用程序之类的各种应用程序中使用的数据,该调查表明语音识别在电子装置用户最想要的特征当中名列前茅。即便如此,现有的语音用户界面在其实际工作时仍倾向于要求用户方面进行大量学习。
例如,许多现有的语音用户界面只支持根据特定命令和控制序列或语法而被公式化的请求。而且,很多现有的语音用户界面由于不准确的话语识别而使用户失望或不满意。类似地,通过强制用户提供预先建立的命令或关键字来以系统可理解的方式传达请求,现有的语音用户界面不能有效地使用户进行丰富和协作的对话来解决请求并使会话朝向互相满意的目标发展(尤其是例如当用户不确定特别需要、可用信息或装置性能时)。因此,现有的语音用户界面存在各种缺点,包括大大限制了用户以协作和会话的方式进行对话。
另外,许多现有的语音用户界面在利用分布在各种不同领域或装置中的信息来解决自然语言基于语音的输入方面都存在不足。因此,现有的语音用户界面被限于它们所设计针对的有限的应用程序组或被限于它们所属的装置。尽管技术性的进步使用户常具有多种装置来满足他们的不同需要,但现有的语音用户界面还不能使用户充分摆脱装置的约束。例如,用户可能会对与不同应用程序和装置有关的多种服务感兴趣,但现有的语音用户界面却趋于限制用户以合适的方法访问应用程序和装置。而且,用户通常仅能一次实际携带有限数量的装置,但在各种状况下也可能会需要与用户的其它正在使用的装置有关的内容或服务。因此,尽管用户有各种需要,在各种背景或环境下会需要与不同装置有关的内容或服务,但现有的语音技术在提供集成的环境来使用户能够请求与实质上的任意装置或网络相关的内容或服务方面存在不足。对信息有效性的限制和现有语音服务环境中的装置交互机制趋于妨碍用户以直观、自然和有效的方式来体验技术。现有系统受到这些和其它问题的影响。
发明内容
根据本发明的各个方面,针对集成多语气多装置的自然语言语音服务环境的系统和方法包括多个语音装置,每个语音装置除了具有对环境中其它装置的意图确定能力的知识以外,还具有处理多语气自然语言输入的意图确定能力。另外,该环境可以以集中方式、分布对等(peer-to-peer)方式或这两种方式的各种组合方式来布置。这样,各个装置能够协作来确定多语气自然语言输入的意图,并且可将命令、询问或其它请求发送到最适合响应这些命令、询问或请求来进行活动的一个或多个装置。
根据本发明的各个方面,以集中方式布置的集成的自然语言语音服务环境包括:接收多语气自然语言输入的输入装置、通信地耦接到输入装置的中央装置、以及通信地耦接到中央装置的一个或多个次级装置。输入装置、中央装置以及一个或多个次级装置的每一个都可具有用于处理多语气自然语言输入的意图确定能力。这样,可采用集中方式通过将多语气自然语言输入从输入装置传送到中央装置来确定一个给出的多语气自然语言输入的意图。此后,中央装置可集合输入装置的意图确定能力和一个或多个次级装置的意图确定能力并使用所集合的意图确定能力来确定多语气自然语言输入的意图。输入装置随后可从中央装置接收所确定的意图并基于所确定的意图来在输入装置、中央装置或次级装置的一个或多个处至少调用一个动作。
根据本发明的各个方面,以分布方式布置的集成的自然语言语音服务环境包括:接收多语气自然语言输入的输入装置、通信地耦接到输入装置的中央装置、以及通信地耦接到输入装置的一个或多个次级装置,其中如集中方式的实现中一样,输入装置以及一个或多个次级装置的每一个都可具有用于处理多语气自然语言输入的意图确定能力。然而,分布方式的实现与集中方式的实现的区别在于可使用本地意图确定能力来在输入装置处确定多语气自然语言输入的初步意图。随后多语气自然语言输入可被传送到次级装置的一个或多个(例如,当输入装置处的意图确定可信度下降到低于一个给定阈值时)。这种情况下,每一个次级装置使用本地意图确定能力来确定多语气自然语言输入的意图。输入装置比较初步意图确定和次级装置的意图确定,并可在所比较的意图确定当中作出裁断来确定多语气自然输入的活动意图。
根据本发明的各个方面,以在集中模型和分布模型之间动态选择的方式布置集成的自然语言语音服务环境。例如,该环境包括接收多语气自然语言输入的输入装置、以及通信地耦接到输入装置的一个或多个次级装置,其中输入装置和一个或多个次级装置的每一个都可具有用于处理多语气自然语言输入的意图确定能力。合成模型(constellation model)可访问输入装置和一个或多个次级装置的每一个,其中合成模型描述了输入装置和一个或多个次级装置的意图确定能力。可发送多语气自然语言输入来用于在输入装置或次级装置的一个或多个处进行处理,从而根据合成模型中所描述的意图确定能力来确定其意图。例如,当合成模型将输入装置和次级装置布置按集中方式布置时,次级装置之一可被指定为中央装置并按上述方式处理自然语言输入。然而,当无法将多语气自然语言传送到中央装置时,合成模型可按照分布方式动态地进行重新布置,从而输入装置与次级装置共享与各个本地意图确定能力有关的知识并如协作节点那样进行操作,以使用共享的与本地意图确定能力有关的知识来确定多语气自然语言输入的意图。
本发明的其它目的和优点将根据以下附图和详细描述来进行说明。
附图说明
图1示出了根据本发明各个方面的可被提供在集成多装置自然语言语音服务环境中的示例性多语气电子装置的框图。
图2示出了根据本发明各个方面的集成多语气多装置的自然语言语音服务环境的集中实现方式的示例的框图。
图3示出了根据本发明的各个方面,在集成多语气多装置的自然语言语音服务环境的集中实现方式中,在输入装置处对多语气自然语言输入进行处理的示例的流程图。
图4示出了根据本发明的各个方面,在集成多语气多装置的自然语言语音服务环境的集中实现方式中,在中央装置处对多语气自然语言输入进行处理的示例的流程图。
图5示出了根据本发明的各个方面,在集成多语气多装置的自然语言语音服务环境的集中实现方式中,在次级装置处对多语气自然语言输入进行处理的示例的流程图。
图6示出了根据本发明各个方面的集成多语气多装置的自然语言语音服务环境的分布实现方式的示例的框图。
图7示出了根据本发明的各个方面,在集成多语气多装置的自然语言语音服务环境的分布实现方式中,在输入装置处对多语气自然语言输入进行处理的示例的流程图。
具体实施方式
根据本发明的各个方面,图1示出了可被提供在包括一个或多个附加多语气装置(例如,如图2和图6所示)的自然语言语音服务环境中的示例性多语气电子装置100的框图。将会明了,图1所示的电子装置100可以是任何适当的语音电子装置(例如,电信息通信装置、个人导航装置、移动电话、VoIP节点、个人计算机、媒体装置、嵌入装置、服务器、或其它电子装置)。装置100可包括各种组件来共同提供用于处理会话式多语气自然语言输入的能力。这样,装置100的用户可与语音电子装置100进行多语气会话式的对话,从而以形式自由且协作的方式来解决请求。
例如,自然语言处理组件可支持形式自由的自然语言发言来使用户摆脱关于如何对命令、询问或者其它请求进行公式化的限制。而且,用户可使用任何感觉自然的说话方式来请求装置100所能提供的内容或服务(例如与电信息通信、通信、媒体、消息传递、导航、交易、信息检索等有关的内容或服务)。例如,在各种实现方式中,装置100可利用2003年6月3日提交的题为“Systems and Methods for Responding to Natural Language Speech Utterance”的共同未决美国专利申请10/452,147和2003年6月15日提交的题为“Mobile Systems and Methods for Responding to Natural Language Speech Utterance”的美国专利申请10/618,633中所描述的技术来处理自然语言发言,通过参考上述两个申请的全部内容来将其公开并入本文。
而且,因为装置100可被运用到集成的多装置环境中,所以用户可进一步请求环境中的其它装置所能够提供的内容或服务。具体来说,集成的语音服务环境可包括多个多语气装置,每个多语气装置都包括与图1所示的自然语言组件大体类似的组件。然而环境中的各个装置可用于不同的目的,使得在环境中的装置之间会有可用内容、服务、应用程序或其它能力的变化(例如,媒体装置的核心功能可由个人导航装置的核心功能变化而来)。因此,环境中的每个装置包括装置100在内都具有对内容、服务、应用程序、意图确定能力、以及其它能通过合成模型130b来从其它装置得到的特征的认识。因此,如下将要详细说明的那样,电子装置100可与集成环境中的其它装置协作,通过在其它事以外还分享前后内容、先前信息、领域知识、短期知识、长期知识和认知模型来解决请求。
根据本发明的各个方面,电子装置100可包括输入机构105,其能够接收多语气自然语言输入,该输入至少包括用户所讲的发言。如下所述,输入机构105可包括任何能够接收讲话输入的合适装置(例如,方向性麦克风、麦克风阵列、或其它能够产生编码语音的装置)或装置的组合。而且,在各种实现方式中,输入机构105可被构成为通过例如最大化用户方向的增益、消除回声、清除点噪声源、执行各种采样率的采样或者对环境噪声(例如背景对话)滤波来使编码语音的逼真度最大化。这样,输入机构105可通过能够容忍可能在其它方式中干扰发言的精确译释的噪声或其它因素的方式来生成编码语音。
而且,在各种实现方式中,输入机构105可包括各种其它输入形式(即,输入机构105可被布置在多语气环境中),其中非语音输入可与一个或多个在先的、同时的或在后的多语气自然语言输入相关联和/或被相联系地处理。例如,输入机构105可被耦接到触摸屏接口、触针和输入板接口、键区或键盘、或者任何其它合适的输入机构。结果,处理多语气输入时潜在可用的信息量可被最大化,因为用户能够澄清发言或者另外通过使用各种输入形式来在给定的多语气自然语言输入中提供附加信息。例如,在示例描述中,用户可用触针或其它点击装置来触摸装置100的触摸屏接口的一部分,同时还提供与所触摸的接口部分相关的发言(例如,“为我显示此处周围的餐馆”)。在该示例中,自然语言发言可与通过触摸屏接口接收的输入相关联,从而将“此处周围”译释成与接口的触摸部分有关(例如,不被译释成用户的当前位置或一些其它含义)。
根据本发明的各个方面,装置100可包括自动语音识别器110,其生成一个或多个对编码语音的初步译释,可从输入机构105接收这些译释。例如,自动语音识别器110可使用一个或多个动态适配识别语法来识别发言中所含的音节、词或短语。可使用动态识别语法来基于一个或多个声音模型识别通过语音口述的音素流。而且,如2005年8月5日提交的题为“Systems and Methods for Responding to Natural Language Speech Utterance”的共同未决美国专利申请11/197,504所述,自动语音识别器110能够进行多通道分析,其中主语音识别引擎可生成对发言的主译释(例如使用大的口述语法列表)并可从一个或多个次语音识别引擎请求次译释(例如使用具有超出词汇表外的诱饵字的虚拟口述语法),将该申请的全部内容通过引用并入本文。
因此,自动语音识别器110能够以各种方式生成对发言的主译释,这些方式包括对口述语法或虚拟口述语法的排他使用、或对这些语法的各种组合的使用(例如,当装置100支持多通道分析时)。在任何情况下,自动语音识别器110可提供超出词汇表外的能力并能容忍被落下的语音信号、用户的误讲或者其它可能发生在自然语言语音中的变数(例如停止、开始、结巴等)。而且,自动语音识别器110所使用的识别语法可包括词汇、字典、音节、词、短语或其它根据各种前后关系或应用特定的领域(例如,导航、音乐、电影、天气、购物、新闻、语言、时间或地域临近、或其它适合的领域)而优化的信息。另外,可使用环境知识(例如对等亲和、环境中装置的能力等)、历史知识(例如频繁请求、上文等)或其它类型的知识来持续动态优化包含在识别语法中的信息。
例如,可动态优化包含在识别语法中的信息来改善给定发言被精确识别的可能性(例如,对一个词的不正确译释之后,可从语法中去除该不正确译释以减小该不正确译释被重复的可能性)。因此,自动语音识别器110可使用多个技术来生成对自然语言发言的主译释,包括那些例如在2006年8月31日提交的题为“Dynamic Speech Sharpening”的共同未决美国专利申请11/513,269中公开的内容,该申请的全部内容以引用方式并入本文。而且,与装置100有关的自动语音识别器110所使用的技术可被认为是定义了装置100的意图确定能力,并且该能力可与环境中的其它装置共享来使整个环境中的语音识别集中(例如,由于各种装置可使用特有的语音识别技术或具有特有的语法或词汇表,所以装置可共享词汇翻译机制来增强系统范围内的识别)。
根据本发明的各个方面,自动语音识别器110可向会话语言处理器120提供对其中包含发言的多语气输入的一个或多个主译释。该会话语言处理器120可包括各种组件,它们集中操作来将每日的人与人的会话建模,从而与用户进行协作式的会话来解决基于用户意图的请求。例如,会话语言处理器120可包括但不限于意图确定引擎130a、合成模型130b、一个或多个领域接口进程(domain agent)130c、语境跟踪引擎130d、误识别引擎130e、以及语音搜索引擎130f。另外,会话语言处理器120可耦接到一个或多个数据存储器160以及与一个或多个领域有关的应用程序。因此,装置100的意图确定能力可根据自动语音识别器110和会话语言处理器120的数据和处理能力来定义。
更特别的,意图确定引擎130a可基于对装置100的意图确定能力以及对集成语音服务环境中其它装置的意图确定能力的考虑来为一个给定的多语气自然语言输入建立含义。例如,装置100的意图确定能力可被定义为处理资源的功能,处理对语法、语境、接口进程或其它数据的存储的功能,以及处理与装置100有关的内容或服务的功能(例如,具有较少存储量的媒体装置与具有较大存储量的装置相比具有更少的可识别歌曲列表)。因此,意图确定引擎130a可确定是否在本地处理给定的输入(例如,当装置100的意图确定能力表明了识别的有利条件时),或者是否将输入相关的信息传递到其它能帮助确定输入意图的装置。
这样,为了确定应当由哪个装置或装置的组合来处理输入,意图确定引擎130a可评估合成模型130b,该模型提供了针对集成语音服务环境中每个装置的意图确定能力的模型。例如,合成模型130b可包含但不限于为环境中每个装置可用的处理知识和存储资源、以及为环境中每个装置可用的领域接口进程、语境、内容、服务和其它信息的特征和范围。这样,使用该合成模型130b,意图确定引擎130a能够确定是否存在任何一个具有意图确定能力的其它装置能够被调用来增大或提高装置100的意图确定能力(例如,通过将多语气自然语言输入相关的信息传递到表现为最适合分析该信息从而确定输入意图的一个或多个装置)。因此,意图确定引擎130a可利用广泛的合成模型130b来建立给定发言的含义,该广泛的合成模型130b描述了装置100内以及整个集成环境的能力。因此,意图确定引擎130a可根据整个环境的能力来对给定自然语言输入的处理进行优化(例如,可在装置100本地处理发言,可根据合成模型130b中的信息来将发言传递到特定装置,或者可将发言发送到环境中的所有装置并作出裁断来选出一个对意图确定的最佳猜测)。
尽管以下将针对能被用来确定集成多装置环境中多语气自然语言输入的意图的各种技术进行讨论,但显然任何一个装置的自然语言处理能力都不仅限于这里所提供的特定讨论的范围。这样,除了以上所参考的共同未决美国专利申请以外,还可利用以下申请中所描述的其它自然语言处理能力,这些申请包括:2005年8月5日提交的题为“Systems and Methods for Responding to Natural Language Speech Utterance”的共同未决美国专利申请11/197,504;2005年8月10日提交的题为“System and Method of Supporting Adaptive Misrecognition in Conversational Speech”的美国专利申请11/200,164;2005年8月29日提交的题为“Mobile Systems and Methods of Supporting Natural Language Human-Machine Interactions”的美国专利申请11/212,693;2006年10月16日提交的题为“System and Method for a Cooperative Conversational Voice User Interface”的美国专利申请11/580,926;2007年2月6日提交的题为“System and Method for Selecting and Presenting Advertisements Based on Natural Language Processing of Voice-Based Input”的美国专利申请11/671,526;以及2007年12月11日提交的题为“System and Method for Providing a Natural Language Voice User Interface in an Integrated Voice Navigation Services Environment”的美国专利申请11/954,064,这些申请的全部公开内容都通过引用并入本文。
根据本发明的各个方面,图2示出了集成多语气多装置的自然语言语音服务环境的集中实现方式的示例的框图。如稍后将要描述的,该集成多语气多装置的自然语言语音服务环境的集中实现方式可使用户与任何一个语音装置210a-n或中央语音装置220进行会话式的多语气自然语言交互。这样,多装置语音服务环境可集中确定任意给定多语气自然语言输入的意图,从而用户可不受限制地请求与环境中的任意装置或应用程序有关的内容或语音服务。
如图2所示,多装置语音服务环境的集中实现可包括多个语音装置210a-n,每个语音装置包括如图1所描述的能够确定自然语言发言意图的各种组件。另外,集中实现包括中央装置220,其包含与每个其它语音装置210a-n的意图确定能力有关的信息。例如,在各种示例实现方式中,中央装置220可被设计成其优点是作为最能够确定发言意图的装置(例如,具有重要的处理力、存储资源和通信能力来使装置适于管理整个环境的意图确定的服务器主数据中心或其它装置)。在其它示例实现方式中,可根据给定多语气自然语言输入、对话或交互的一个或多个特征来动态选择中央装置220(例如在当前发言与特定领域有关时可以将一个装置指定为中央装置220)。
在图2所示的集中实现方式中,可在语音装置210a-n之一处接收多语气自然语言输入。因此,装置210a-n中进行接收的一个装置可被指定为针对那一输入的输入装置,而装置210a-n中剩下的装置可被指定为针对那一输入的次级装置。换句话说,对于任意给定的多语气自然语言输入,多装置环境可包括一个用来收集输入的输入装置,对环境中所有装置210a-n的意图确定、推断和处理能力进行集合的中央装置220,以及也可被用于意图确定处理中的一个或多个次级装置。这样,环境中的每个装置210可提供有合成模型来对具有输入和输出通信能力的所有装置210进行鉴别,因此指示了其它装置可能能够确定针对给定多语气自然语言输入的范围。合成模型还可定义中央装置220的位置,该中央装置集合了来自环境中各个装置210a-n的语境、词汇表、内容、识别语法、错误识别、共享知识、意图确定能力、推断能力、以及其它信息。
因此,只要通信和处理能力允许,中央装置220可被用作第一个或最后一个识别器手段。例如,由于中央装置220集合了整个环境的意图确定能力(例如通过集合来自环境中装置210a-n的语境、词汇表、装置能力以及其它信息),所以当输入装置210处的本地处理无法以满意的信任等级确定输入意图时,在中央装置220被用作第一个手段的识别器或被用作最后一个手段的识别器的情况下将输入自动地传送到该中央装置220。然而,还应明了在某些情况下输入装置210可能由于各种原因而无法与中央装置220连接(例如,无法使用网络连接、或者中央装置220的处理瓶颈会引起通信延迟)。在这种情况下,起初与中央装置220连接的输入装置210可被移到分布式处理中(例如参考图6描述的那样)并且以合成模型来与一个或多个其它装置210a-n进行能力的通信。因此,当中央装置220由于各种原因而无法被调用时,剩下的装置210a-n可作为协作节点来进行操作,以通过分散方式确定意图。
此外,在多装置语音服务环境中,中央装置220和各个其它装置210a-n可协作来创建一个整个环境能力的集合模型。例如,如上面所指出的,除了具有基于处理资源、存储资源、以及装置能力的意图确定能力之外,每个装置210a-n和中央装置220可包括各种其它自然语言处理组件。通过不仅仅维持与各个装置210a-n有关的数据、内容和服务的完整模型,还维持与各个装置210a-n有关的其它自然语言处理能力和动态状态,可因此使语音服务环境以集成方式工作。这样,各个装置210a-n能够以集中整个装置的能力、数据、状态和其它信息为目标进行工作,这种工作方式可以是针对一个装置(例如,中央装置220),也可以是遍布各个装置210a-n(例如,如图6所描述的分布实现方案中那样)。
例如,如上所述,每个装置210都包括一个自动语音识别器、一个或多个动态适配识别语法、以及列出了用来产生对自然语言发言的因素译释的词汇表。而且,每个装置210都包括在本地建立的语境,该语境的范围包括语境堆中所含的信息、语境和命名空间变量、词汇翻译机制、与当前对话或会话交互有关的短期共享知识、与用户经过长时间得知的喜好有关的长期共享知识、或者其它语境信息。而且,每个装置210可具有彼此相关的各种服务或应用,并且可在本地执行自然语言处理的各个方面。因此,将在整个环境中集中的附加信息可包括部分或初步发言识别、误识别或含糊识别、推断能力、以及全部装置状态信息(例如,环境中播放的歌曲、环境中设置的警报等)。
因此,各种数据同步化和参照完整性算法可被各个装置210a-n和中央装置220一齐使用来提供对环境一致的观点。例如,使用为计算机辅助装置设计的通用即插即用协议来在整个环境中描述和传递信息以用于同步化和集中的目的,尽管环境也可工作在对等断开模式下(例如,当无法达到中央装置220时)。然而,在各种实现方式中,例如当装置210a-n具有针对自然语言处理足够的相称资源和能力时,环境也可工作在如图6所示的对等模式下而无需考虑断开状态。
通常,环境中用于集中的算法能够以各种间隔来执行,尽管希望其限制数据传输以避免处理瓶颈。例如,由于集中和同步化技术与其中典型地在几秒钟时间内表达给定发言的自然语言处理有关,所以与语境和词汇有关的信息无需在少于很少几秒的时间范围内更新。然而,只要通信能力允许,就能够更加频繁地更新语境和词汇来提供实时识别或表现实时识别。在另一实现方式中,允许执行集中和同步化直至完成(例如,当此时没有未决请求时),或者当达到预定时间或资源消耗限制时(例如,当集中与使用截止时间的未决请求、具有最高信任等级的意图确定有关时)可暂停或终止集中和同步化。
通过在整个环境中建立对能力、数据、状态和其它信息一致的观点,在处理任意给定的多语气自然语言输入过程中输入装置210可与中央装置220和一个或多个次级装置(即,输入装置以外的一个或多个装置210a-n)协作。而且,通过为每个装置210和中央装置220提供描述了环境的同步状态的合成模型,环境可容忍由一个或者多个装置210a-n或中央装置220产生的故障。例如,如果输入装置210无法与中央装置220通信(例如由于服务冲突),则输入装置210可进入断开的对等模式,从而能够与通信保持可用的一个或多个装置210a-n交换能力。这样,当装置210建立了与词汇、语境、误识别、接口进程适配、意图确定能力、推断能力或其它有关的新信息时,除了询问合成模型来确定信息是否应被发送到一个或多个其它装置210a-n中以外,如上所述,装置210还会发送这些信息到中央装置220以用于集中目的。
例如,假设环境包括具有与播放音乐或其它媒体有关的标称功能的语音移动电话,并且该语音移动电话还具有有限量的本地存储空间,而环境还包括语音家庭媒体系统,该系统具有能够提供专用的媒体功能的较大存储介质。如果移动电话将要建立新的词汇、语境、或其它与歌曲相关的信息(例如用户在路上下载歌曲或铃声到移动电话),则移动电话除了可将新建立的信息发送中央装置220以外还可将这些信息发送到家庭媒体系统。这样,通过具有环境中所有装置210a-n的模型并将新信息发送到信息最可能被使用的装置,在中央装置220由于任何原因而无法使用时各个装置都可掌握操作的断开模式,同时可在整个环境中有效分配资源。
因此,根据前面的讨论,将明了集成多装置服务环境的集中实现方式通常包括中央装置220,其操作来集合或集中关于内容、服务、能力、以及其它与环境中所使用的各个语音装置210a-n有关的信息的知识。在这样的集中实现方式中,如参考图3至图5将详细描述的那样,中央装置220可被调用来作为第一个手段或最后一个手段的识别器,而且,环境中的其它装置210a-n可被构成来在中央装置220由于任意原因而无法被调用时自动进入断开或对等工作模式(即,装置可进入分散或分布模式,如参考图6至图7将要详细描述的)。因此可使每个装置210a-n的知识和能力以集中方式、分布方式或它们的各种组合方式而在整个语音服务环境中可用,从而优化了在确定任意给定多语气自然语言输入时所使用的自然语言处理资源的量。
根据本发明的各个方面,图3示出了在集成多语气多装置的自然语言语音服务环境的集中实现方式中,在输入装置处对多语气自然语言输入进行处理的示例的流程图。类似地,图4和图5示出了分别与集中语音服务环境中的中央装置和一个或多个次级装置有关的对应方法。而且,显然图3至图5所描述的处理技术通常可基于图2所示的上述集中实现方式,从而可假设输入装置与中央装置不同,并且可假设一个或多个次级装置与中央装置和输入装置不同。然而,显然的是各个示例都会涉及在中央装置或其它装置处接收的自然语言输入,在此情况下,图3至图5所示的技术可根据环境的具体情况而变化(例如,关于将发言传递到特定的一个或几个装置的决定可在本地协作地做出,或是根据诸如整体系统状态、通信能力、意图确定能力或其它因素之类的各种因素来以其它方式做出)。
如图3所示,在操作310中,多语气自然语言输入可在输入装置处接收。多语气输入可至少包括用户提供的自然语言发言,并且还可包括诸如音频、文本、按键按下、手势、或其它非语音输入之类的其它输入形式。还应当明确的是在操作310中接收自然语言输入之前,输入装置可被构成来建立自然语言处理能力。例如,建立自然语言处理能力可包括但不限于:加载自动语音识别器和任意相关的识别语法,启动会话语言处理器来掌管与用户的对话,以及安装一个或多个领域接口进程来提供针对各个应用领域或语境领域的功能(例如,导航、音乐、电影、天气、信息检索、装置控制等)。
输入装置还可被配置成在操作310接收输入之前使意图确定能力、共享知识和其它信息的同步化与环境中的中央装置和次级装置协调。例如,当输入装置建立了一个领域接口进程时,所安装的领域接口进程可从系统中其它装置导入(bootstrap)语境变量、语意、命名空间变量、判定值以及其它与领域接口进程有关的语境。类似地,可从中央装置和次级装置中接收误识别从而对使用了与所接收的误识别有关的信息的接口进程进行校正,并且在装置之间对词汇表和相关翻译机制进行同步以对各个装置所使用的自动语音识别器之间的潜在变化负责(例如,环境中的每个装置都无法确保使用相同的自动语音识别器或者将在共享意图确定能力的几个装置之间共享的识别语法、必须词汇以及翻译机制)。
一旦建立和同步了自然语言处理能力并随后在操作310中接收了多语气自然语言输入,输入装置就可在确定操作320中确定是否已经建立了环境来自动将输入发送到中央装置。在这种情况下,处理进行到操作360来将输入发送到中央装置,该中央装置随后根据参考图4描述的技术来处理输入。然而,如果还未建立将输入自动发送到中央装置的环境,则处理进行到操作330,此处输入装置执行对包含在多语气输入中的自然语言发言的转述。例如,输入装置可使用自动语音识别器和与该识别器相关的识别语法来根据上述技术以及以上参考的美国专利申请中的技术来对该发言进行转述。
接着,在操作340中,可使用本地自然语言处理能力和资源来在输入装置处确定多语气自然语言输入的意图。例如,输入中所包含的任何非语音输入形式都可与发言转述合并,并且与输入装置有关的会话语言处理器可利用与语境、领域知识、共享知识、语境变量、判定变量、或其它在自然语言处理中有用的信息有关的本地信息。这样,输入装置可尝试确定对于提供输入的用户的诸如鉴别会话类型(例如,询问、教训或试探)之类的意图或者对于可能包含在输入中的请求(例如与一个或多个领域接口进程或应用领域有关的命令或询问)的最佳猜测。
可为输入装置的意图确定指定一个信任等级(例如,具有实现多通道分析的自动语音识别器的装置可对其所创建的发言转述指定相对更高的信任等级,这可能会产生针对意图确定的更高信任等级)。可根据各种因素来指定信任等级,正如上述参考的美国专利申请所描述的一样。这样,确定操作350可包括确定输入装置的意图确定是否满足信任的可接受等级。当意图确定满足了可接受的信任等级时,处理可直接进行到操作380,在该操作中可响应于该意图确定来进行动作。例如,当意图确定指示了用户已请求了某一信息时,可阐释一个或多个询问来从可能包括一个或多个其它装置的适当信息源取回信息。在另一示例中,当意图确定指示了用户已请求了一个给定命令时(例如控制特定装置的命令),可将命令传递到适当装置来执行。
因此,在输入装置能够无需中央装置或次级装置的协助而确定自然语言输入的意图的情况下,可通过进行可能是合适的即刻动作来保存通信和处理资源。另一方面,当输入装置的意图确定不满足可接受的信任等级时,确定操作350会导致在操作360中进行输入装置向中央装置寻求协助的操作。在这种情况下,多语气自然语言输入可被传送到整个中央装置,从而中央装置按照图4所示的技术来处理该输入。然而,假如由于一些原因而使得向中央装置的传输失败,则输入装置可被切换到断开的对等模式,在该模式中可利用一个或多个次级装置,如下文将参考图7所描述的那样。然而当发生了向中央装置的传输而未发生任何意外时,在操作370中输入装置会从中央装置接收意图确定,并且会进一步接收中央装置能够解决的一个或多个请求、或者已被中央装置制定来进行对输入装置的进一步处理的请求。结果,输入装置可在操作380中根据在操作370中从中央装置接收的信息来进行动作。例如,输入装置可根据意图确定来将询问或命令传递到本地或远程信息源或装置,或者可向用户呈现由中央装置处理的请求结果。
参考图4,在操作410中,中央装置可从输入装置接收多语气自然语言输入。集合了来自整个环境的语境和其它知识的中央装置可因此在操作420中对发言进行转述并在操作430中根据所转述的发言来确定输入的意图。这样,中央装置可在确定发言意图的过程中考虑与整个环境中的语境、领域接口进程、应用和装置能力有关的信息,包括对与输入相关的一个或多个领域进行鉴别。然而,应当明了的是利用从整个环境集合的信息会引起在不同情况下的含混或不明确(例如,包含词语“交通”的发言在与电影、音乐和导航有关的不同领域中会具有不同的意图)。
这样,一旦中央装置尝试确定自然语言输入的意图,就会在操作440中进行关于一个或多个次级装置(即,除了输入装置以外存在于合成模型中的其它装置)是否也能在所鉴别的一个或几个领域中进行意图确定的确定步骤。当没有鉴别出这样的次级装置时,确定操作440直接分支到操作480来将确定的意图和任何从确定意图中鉴别出的命令、询问、或其它请求发送到输入装置。
另一方面,当环境中的一个或多个次级装置具有在鉴别的一个或几个领域中的意图确定能力时,可在操作450中将自然语言输入发送到这些次级装置。随后如图5所示次级装置可确定意图,该确定步骤中包括了大体上与上述输入装置和中央装置的技术类似的技术(即,可在操作510中接收自然语言输入,包含在其中的发言可在操作520中转述,并且在操作530中进行的意图确定可在操作540中返回到中央装置)。
回到图4,中央装置可在操作460中对从次级装置接收到的意图确定响应进行比较。例如,如上所述,中央装置可鉴别一个或多个次级装置能否在中央装置所鉴别出的与自然语言发言有关的领域中进行意图确定。将要明了的是,在操作450中被调用的次级装置通常可包括多个装置,并且可根据处理资源、通信通过量、或其它因素(例如,次级装置可包括具有大量处理能力和宽带网络连接的信息通讯装置、以及具有较小处理能力和单一的蜂窝式连接的嵌入式移动电话,在这种情况下,很可能是信息通讯装置在嵌入式移动电话之前就将结果提供给中央装置)来以交错方式从次级装置接收意图确定响应。因此,根据次级装置响应时间的潜在变化,中央装置可被构成为对比较操作460中的比较进行约束。例如,一旦从满足可接受信任等级的次级装置之一接收到意图确定就终止比较操作460,或者当经过预定量的时间或消耗了预定量的资源时切断操作460。然而,在其它实现方式中,显然可将比较操作460构成为运行至完成而不考虑发生了延迟或接收到了适当的意图确定。另外,应当明了的是可使用各种标准来确定是否或何时结束比较操作460,这些标准包括但不限于给定自然语言输入或对话的特性、或其它交互或系统或用户偏好的特性。
在任何情况下,当比较操作460完成时,顺序的操作470可包括中央装置在从一个或多个先前在操作450中被调用的次级装置接收的意图确定响应中作出裁断。例如,产生意图确定的每个被调用的次级装置都可为该意图确定指定信任等级,并且中央装置会在对响应进行裁断的过程中考虑这些信任等级。而且,中央装置会将其它标准与次级装置或从次级装置接收的意图确定相关联,从而进一步增强使用最佳意图确定的可能性。例如,各个次级装置都只会针对专用领域中的部分识别而被调用,并且中央装置可集合并裁断部分识别来创建完整的转述。在另一示例中,多个次级装置可被调用来执行覆盖意图确定,并且中央装置会考虑次级装置的能力来对各个信任等级进行加权(例如,当两个同样的次级装置之一使用了多通道语音识别分析时,使用多通道语音识别分析的次级装置可被加权为具有更高的成功可能性)。应当清楚中央装置可被构成为从所有的意图假定中(包括在操作430中由中央装置产生的意图确定假定)裁断并选择一个意图确定。一旦选择了最佳意图确定假定,则中央装置随后在操作480中将该意图确定连同任何可能与其相关的命令、询问、或其它请求一起提供到输入装置。输入装置随后可进行如上图3所述的适当动作。
根据本发明的各个方面,图6示出了集成多语气多装置的自然语言语音服务环境的分布实现方式的示例的框图。如上所述,分布实现方式还可被分类为断开或对等模式,在集中实现方式中的中央装置无法达到或无法满足环境需要时使用该模式。图6所示的分布实现方式大体上以类似于上述集中实现方式的目的来工作(即,确保环境包括集合了环境中多个装置610a-n的知识和能力的广泛的模型)。但是,分布实现方式可工作在多少稍有不同的方式下,其中为一个或多个装置610a-n提供整体合成模型,或者将各个不同方面的模型分布到多个装置610a-n或者它们的各种组合。
总的来说,多个语音装置610a-n可通过语音服务接口630彼此耦接,该语音服务接口630包括任何适当的实体或虚拟接口(例如,通用信息总线或网络接口、服务定向提取层等)。各个装置610a-n因此可作为协作节点工作来对任何一个装置610所接收的多语气自然语言发言进行意图确定。而且,在同步某些形式的数据以保证装置610a-n之间一致处理的同时,装置610a-n可共享词汇表、语境、能力和其它信息的知识。例如,由于在装置610a-n中使用的自然语言处理组件会发生变化(例如,存在不同识别语法或语音识别技术),所以在意图确定处理中使用的词汇翻译机制、误识别、语境变量、标准值、标准处理器和其它信息应被同步为通信能力所允许的程度。
通过共享意图确定能力、装置能力、推断能力、领域知识和其它信息,可本地地(例如在一个输入装置)、协作地(例如具有与发言相关的特定能力的装置可发送请求来处理发言)、或者结合这两种方式(例如输入装置可考虑仅在无法确定发言意图时传递到次级装置)来确定将发言传递到装置610a-n中特定的一个。类似地,在一个或多个装置610a-n处执行的部分识别可被用于确定传递针对发言的另一意图确定的策略。例如,可以在只能确定一个领域意图的输入装置处接收包含了关于多个不同领域的多个请求的发言。在该示例中,输入装置可执行与该输入装置有关的领域中的部分识别,并且该部分识别还鉴别出该输入装置不具有足够识别信息的其它发言领域。因此,输入装置执行的部分识别可产生对其它潜在相关领域的鉴别,并可形成策略来将该发言传递到环境中包括了针对那些领域的识别信息的其它装置。
结果,包括了自然语言发言的多语气自然语言输入可被传递到各个装置610a-n当中,从而以分布方式执行意图确定。然而,由于装置610a-n的任意一个所具有的能力和知识会变化,所以每个装置610a-n会与各个装置610a-n所产生的针对意图确定的可靠因数相关联。这样,为了保证最终的意图确定能够以一个足够的信任等级而被信任,可在装置610a-n之间分布知识来保证由每个装置610a-n所提供的针对意图确定的可靠性度量在整个环境中都是相称的。例如,即使附加知识会导致环境中的冗余,也会将该知识提供到具有较低意图确定可靠性的装置,从而保证环境范围内意图确定的相称可靠性。
因此,在集成语音服务环境的分布实现方式中,可通过各种方式来处理发言,这些方式取决于给定时间的情况(例如,系统状态、系统或用户喜好等)。例如,可在一个输入装置处对一个发言进行本地处理,并在意图确定信任等级降到给定阈值以下时仅将该发言传递到次级装置。在另一示例中,根据上述对知识和能力的建模来将发言传递到特定装置。在又一示例中,发言可遍布到环境中的所有装置当中,并且会发生裁断,由此比较意图确定并对意图确定的最佳猜测进行裁断。
因此,可以通过各种方式来处理发言,包括通过本地技术、集中技术、分布技术以及这些技术的各种组合。尽管明了存在许多变型,图7只示出了根据本发明的各个方面、在语音服务环境的分布实现方式中对多语气自然语言输入的本地和分布处理进行结合的示例的流程图。具体来说,在操作710开始分布处理,其中在输入装置处接收多语气自然语言输入。输入装置随后在操作720中利用各种相关的自然语言处理能力来对多语气输入中所包含的发言进行转述(例如,使用自动语音识别器和相关的识别语法),并接着在操作730中确定多语气自然语言输入的初步意图。应当明了的是操作710到操作730通常可使用与输入装置相关的本地意图确定能力来执行。
此后,在操作740中,输入装置可调用一个或多个次级装置的意图确定能力。具体来说,输入装置可向一个或多个次级装置提供与多语气自然语言输入有关的信息,该一个或多个次级装置可利用本地意图确定能力来使用上述图5所描述的技术进行对输入的意图确定。还应当明了,在各种实现方式中,在操作740中被调用的次级装置可仅仅包含具有与关于输入所鉴别出的特定领域有关的意图确定能力的装置。在任意情况下,输入装置可在操作750中从所调用的次级装置接收意图确定,并且随后输入装置可比较从次级装置接收的意图确定。输入装置随后在各个意图确定中作出裁断,或者结合各个意图确定(例如,当专用次级装置确定了专用领域中的意图时),又或者在意图确定中进行裁断来确定对多语气自然语言输入的意图的最佳猜测(例如,根据与各个意图确定有关的信任等级)。根据确定的意图,输入装置随后可在操作770中进行适当的动作,例如发出将在输入装置或次级装置的一个或多个处执行的一个或多个命令、询问或其它请求。
而且,除了上述示例的实现方式之外,各种实现方式还包括操作的持续收听模式,其中多个装置可对基于多语气语音的输入进行持续收听。在持续收听模式中,当发生一个或多个预定事件时,环境中的每个装置可被触发来接收一个多语气输入。例如,每个装置都会与一个或多个关注词语相关,诸如“电话,<多语气请求>”针对移动电话,或者“计算机,<多语气请求>”针对个人计算机。当环境中的一个或多个装置识别出相关的关注词语时,会导致关键字激活,其中相关装置触发来接受接下来的多语气请求。另外,在合成模型中的多个装置可进行收听的情况下,该合成模型可使用所有可用输入来增加识别率。
而且,应当明了的是可将持续收听模式应用到集中语音服务环境、分布集中语音服务环境或各种组合环境中。例如,当合成模型中的每个装置具有不同的关注词语时,识别一个关注词语的任意给定装置都会参照一种合成模型来确定与该关注词语相关的目标装置或功能。在另一示例中,当合成模型中的多个装置共享一个或多个关注词语时,多个装置会彼此协作来对用于处理多语气输入的信息进行同步,该信息诸如是包含在该多语气输入中的发言的开始时间。
本发明的实现方式可以通过硬件、固件、软件或这些方式的各种组合来实现。本发明还可被实现为存储在机器可读介质中的指令,可由一个或多个处理器来读取和执行这些指令。机器可读介质可包括各种用于存储或发送具有机器(例如计算装置)可读形式的信息的机构。例如,机器可读存储介质可包括只读存储器、随机访问存储器、磁盘存储介质、光存储介质、闪速存储装置及其它,并且机器可读发送介质可包括传播信号的形式,诸如载波、红外信号、数字信号及其它。另外,固件、软件、程序或指令可以在上述关于特定示例方面和本发明的实现方式方面的公开来描述,并执行某些动作。然而,显然这些描述仅仅是为了方便,这样的动作事实上是由计算装置、处理器、控制器、或其它执行固件、软件、程序或指令的装置所进行的。
描述了各个方面和实现方式包括特定的特征、结构或特性,但每个方面或实现方式并非必须包括这些特定的特征、结构或特性。另外,当特定的特征、结构或特性结合一个方面或实现方式而被描述时,不管是否进行了直接的描述都应当理解该特征、结构或特性可与其它方面或实现方式相关地存在。因此可在不超出本发明范围和精神的前提下对前面的描述进行各种变化和修改,并且说明书和附图应被看作仅仅用来示例,本发明的范围仅由所附权利要求来确定。

Claims (27)

1.用于提供集成的自然语言语音服务环境的方法,其中,所述集成的自然语言语音服务环境包括接收多语气自然语言输入的输入装置、通信地耦接到输入装置的中央装置、以及通信地耦接到中央装置的一个或多个次级装置,输入装置、中央装置以及一个或多个次级装置的每一个都具有用于处理多语气自然语言输入的意图确定能力,所述方法包括:
将多语气自然语言输入从输入装置传送到中央装置,中央装置操作来执行:
集合输入装置的意图确定能力和一个或多个次级装置的意图确定能力;并
使用所集合的意图确定能力来确定在输入装置处接收的多语气自然语言输入的意图;
所述方法还包括:
在输入装置处接收多语气自然语言输入的所确定的意图;以及
基于所确定的意图来在输入装置、中央装置或次级装置的一个或多个处至少调用一个动作。
2.根据权利要求1所述的方法,其中针对输入装置、中央装置或次级装置中给定的一个的意图确定能力是基于处理能力、存储资源、自然语言处理能力或本地知识中的至少一个。
3.根据权利要求1所述的方法,中央装置还操作来执行:
将多语气自然语言输入传送到次级装置的每一个,其中次级装置的每一个都使用本地意图确定能力来确定在输入装置处接收的多语气自然语言输入的意图;
接收由次级装置的每一个确定的意图;以及
在次级装置的意图确定之间进行裁断以确定多语气自然语言输入的意图。
4.根据权利要求3所述的方法,其中,中央装置根据与次级装置的意图确定有关的各个信任等级来在次级装置的意图确定之间进行裁断。
5.根据权利要求4所述的方法,中央装置还操作来对次级装置的意图确定的信任等级进行加权,其中使用多通道语音识别技术的次级装置的意图确定接收到更高的权重。
6.根据权利要求1所述的方法,还包括在输入装置、中央装置和次级装置之间共享知识,从而对这些装置的意图确定能力进行更新。
7.根据权利要求6所述的方法,其中所共享的知识包括与环境中装置的词汇表、翻译机制、误识别、语境或状态的至少一个有关的信息。
8.根据权利要求1所述的方法,输入装置操作来执行:
使用本地意图确定能力来对输入装置处接收到的多语气自然语言输入的意图进行确定;以及
在输入装置的意图确定满足预定的信任等级时,调用至少一个动作而不将多语气自然语言输入传送到中央装置。
9.根据权利要求1所述的方法,其中语音服务环境工作在持续收听模式下。
10.用于提供集成的自然语言语音服务环境的方法,其中,所述集成的自然语言语音服务环境包括接收多语气自然语言输入的输入装置、以及通信地耦接到输入装置的一个或多个次级装置,输入装置和一个或多个次级装置的每一个都具有用于处理多语气自然语言输入的意图确定能力,所述方法包括:
确定多语气自然语言输入的初步意图,其中输入装置使用本地意图确定能力来确定初步意图;
将多语气自然语言输入传送到一个或多个次级装置,每个次级装置都使用本地意图确定能力来确定多语气自然语言输入的意图;
将初步意图确定与次级装置的意图确定进行比较;
在所比较的意图确定之间进行裁断来确定多语气自然语言输入的可作用意图;和
基于所确定的可作用意图来在输入装置或次级装置的一个或多个处至少调用一个动作。
11.根据权利要求10所述的方法,还包括在与各个本地意图确定能力有关的输入装置和次级装置之间共享知识。
12.根据权利要求11所述的方法,还包括将可靠性因数与输入装置和一个或多个次级装置的每一个的本地意图确定能力相关联,其中对针对本地意图确定能力的可靠性因数的集中使整个环境中的意图确定能力相称。
13.根据权利要求11所述的方法,其中所共享的知识包括与处理能力、存储资源、自然语言处理能力的至少一个有关的信息或与输入装置和次级装置联系在一起的知识。
14.根据权利要求10所述的方法,还包括在输入装置和次级装置之间共享知识,从而对意图确定能力进行更新。
15.根据权利要求14所述的方法,其中共享知识包括将建立在输入装置上的知识发送到一个或多个次级装置。
16.根据权利要求14所述的方法,其中共享知识包括将建立在次级装置之一上的知识发送到输入装置和除建立该知识的次级装置以外的其它的次级装置。
17.根据权利要求14所述的方法,其中所共享的知识包括与环境中装置的词汇表、翻译机制、误识别、语境或状态的至少一个有关的信息。
18.根据权利要求10所述的方法,其中语音服务环境工作在持续收听模式下。
19.用于提供集成的自然语言语音服务环境的系统,该系统包括:
输入装置,其接收多语气自然语言输入;
一个或多个次级装置,其通信地耦接到输入装置,输入装置和一个或多个次级装置的每一个都具有用于处理多语气自然语言输入的意图确定能力;以及
合成模型,其可访问输入装置和一个或多个次级装置的每一个,该合成模型描述了输入装置和一个或多个次级装置的意图确定能力,对多语气自然语言输入进行传递以在输入装置或次级装置的一个或多个处进行处理,从而基于合成模型中所描述的意图确定能力来确定该多语气自然语言输入的意图。
20.根据权利要求19所述的系统,其中合成模型针对输入装置和一个或多个次级装置的每一个装置描述了与输入装置和次级装置联系在一起的处理能力、存储资源、自然语言处理能力或知识的一个或多个。
21.根据权利要求19所述的系统,其中合成模型动态地选择集中模型或分布模型之一来用于操作环境。
22.根据权利要求21所述的系统,所述集中模型将次级装置之一指定为中央装置,该中央装置操作来执行:
集合输入装置的意图确定能力和一个或多个次级装置的意图确定能力;并
使用所集合的意图确定能力来确定多语气自然语言输入的意图。
23.根据权利要求22所述的系统,其中在多语气自然语言无法被传送到中央装置时,合成模型选择分布模型。
24.根据权利要求23所述的系统,其中选择分布模型使得输入装置和一个或多个次级装置执行以下操作:
共享与各个本地意图确定能力有关的知识;以及
作为协作节点进行工作以使用所共享的与本地意图确定能力有关的知识来确定多语气自然语言输入的意图。
25.根据权利要求24所述的系统,其中作为协作节点进行工作包括确定是应该通过输入装置、还是应该通过一个或多个次级装置、还是应该通过输入装置和一个或多个次级装置的每一个来确定多语气自然语言输入的意图。
26.根据权利要求19所述的系统,还包括与输入装置关联的至少一个领域接口进程,其中至少一个领域接口进程从一个或多个次级装置导入与领域接口进程关联的语境信息。
27.根据权利要求19所述的系统,其中语音服务环境工作在持续收听模式下。
CN200880130303.8A 2008-05-27 2008-07-09 针对集成多语气多装置自然语言语音服务环境的系统和方法 Active CN102160043B (zh)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US12/127,343 2008-05-27
US12/127,343 US8589161B2 (en) 2008-05-27 2008-05-27 System and method for an integrated, multi-modal, multi-device natural language voice services environment
PCT/US2008/069524 WO2009145796A1 (en) 2008-05-27 2008-07-09 System and method for an integrated, multi-modal, multi-device natural language voice services environment

Publications (2)

Publication Number Publication Date
CN102160043A true CN102160043A (zh) 2011-08-17
CN102160043B CN102160043B (zh) 2015-04-22

Family

ID=41377402

Family Applications (1)

Application Number Title Priority Date Filing Date
CN200880130303.8A Active CN102160043B (zh) 2008-05-27 2008-07-09 针对集成多语气多装置自然语言语音服务环境的系统和方法

Country Status (4)

Country Link
US (1) US8589161B2 (zh)
EP (1) EP2283431B1 (zh)
CN (1) CN102160043B (zh)
WO (1) WO2009145796A1 (zh)

Cited By (146)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105229727A (zh) * 2013-01-08 2016-01-06 赛普拉斯半导体公司 分布式语音识别系统
US9548050B2 (en) 2010-01-18 2017-01-17 Apple Inc. Intelligent automated assistant
US9582608B2 (en) 2013-06-07 2017-02-28 Apple Inc. Unified ranking with entropy-weighted information for phrase-based semantic auto-completion
US9620104B2 (en) 2013-06-07 2017-04-11 Apple Inc. System and method for user-specified pronunciation of words for speech synthesis and recognition
US9626955B2 (en) 2008-04-05 2017-04-18 Apple Inc. Intelligent text-to-speech conversion
US9633660B2 (en) 2010-02-25 2017-04-25 Apple Inc. User profiling for voice input processing
US9646614B2 (en) 2000-03-16 2017-05-09 Apple Inc. Fast, language-independent method for user authentication by voice
US9668024B2 (en) 2014-06-30 2017-05-30 Apple Inc. Intelligent automated assistant for TV user interactions
US9934775B2 (en) 2016-05-26 2018-04-03 Apple Inc. Unit-selection text-to-speech synthesis based on predicted concatenation parameters
US9953088B2 (en) 2012-05-14 2018-04-24 Apple Inc. Crowd sourcing information to fulfill user requests
US9966068B2 (en) 2013-06-08 2018-05-08 Apple Inc. Interpreting and acting upon commands that involve sharing information with remote devices
US9971774B2 (en) 2012-09-19 2018-05-15 Apple Inc. Voice-based media searching
US9972304B2 (en) 2016-06-03 2018-05-15 Apple Inc. Privacy preserving distributed evaluation framework for embedded personalized systems
US9986419B2 (en) 2014-09-30 2018-05-29 Apple Inc. Social reminders
CN108228699A (zh) * 2016-12-22 2018-06-29 谷歌有限责任公司 协作性语音控制装置
US10043516B2 (en) 2016-09-23 2018-08-07 Apple Inc. Intelligent automated assistant
US10049663B2 (en) 2016-06-08 2018-08-14 Apple, Inc. Intelligent automated assistant for media exploration
US10049668B2 (en) 2015-12-02 2018-08-14 Apple Inc. Applying neural network language models to weighted finite state transducers for automatic speech recognition
US10067938B2 (en) 2016-06-10 2018-09-04 Apple Inc. Multilingual word prediction
CN108541315A (zh) * 2016-12-30 2018-09-14 谷歌有限责任公司 语音激活数据分组的数据结构池化
US10079014B2 (en) 2012-06-08 2018-09-18 Apple Inc. Name recognition system
US10083690B2 (en) 2014-05-30 2018-09-25 Apple Inc. Better resolution when referencing to concepts
US10089072B2 (en) 2016-06-11 2018-10-02 Apple Inc. Intelligent device arbitration and control
US10102359B2 (en) 2011-03-21 2018-10-16 Apple Inc. Device access using voice authentication
US10108612B2 (en) 2008-07-31 2018-10-23 Apple Inc. Mobile device having human language translation capability with positional feedback
US10169329B2 (en) 2014-05-30 2019-01-01 Apple Inc. Exemplar-based natural language processing
US10176167B2 (en) 2013-06-09 2019-01-08 Apple Inc. System and method for inferring user intent from speech inputs
US10185542B2 (en) 2013-06-09 2019-01-22 Apple Inc. Device, method, and graphical user interface for enabling conversation persistence across two or more instances of a digital assistant
US10192552B2 (en) 2016-06-10 2019-01-29 Apple Inc. Digital assistant providing whispered speech
US10223066B2 (en) 2015-12-23 2019-03-05 Apple Inc. Proactive assistance based on dialog communication between devices
US10249300B2 (en) 2016-06-06 2019-04-02 Apple Inc. Intelligent list reading
US10269345B2 (en) 2016-06-11 2019-04-23 Apple Inc. Intelligent task discovery
CN109716325A (zh) * 2016-09-13 2019-05-03 微软技术许可有限责任公司 计算机化的自然语言查询意图分派
US10283110B2 (en) 2009-07-02 2019-05-07 Apple Inc. Methods and apparatuses for automatic speech recognition
US10297253B2 (en) 2016-06-11 2019-05-21 Apple Inc. Application integration with a digital assistant
US10303715B2 (en) 2017-05-16 2019-05-28 Apple Inc. Intelligent automated assistant for media exploration
US10311144B2 (en) 2017-05-16 2019-06-04 Apple Inc. Emoji word sense disambiguation
US10311871B2 (en) 2015-03-08 2019-06-04 Apple Inc. Competing devices responding to voice triggers
US10318871B2 (en) 2005-09-08 2019-06-11 Apple Inc. Method and apparatus for building an intelligent automated assistant
US10332518B2 (en) 2017-05-09 2019-06-25 Apple Inc. User interface for correcting recognition errors
US10356243B2 (en) 2015-06-05 2019-07-16 Apple Inc. Virtual assistant aided communication with 3rd party service in a communication session
US10354011B2 (en) 2016-06-09 2019-07-16 Apple Inc. Intelligent automated assistant in a home environment
US10366158B2 (en) 2015-09-29 2019-07-30 Apple Inc. Efficient word encoding for recurrent neural network language models
US10381016B2 (en) 2008-01-03 2019-08-13 Apple Inc. Methods and apparatus for altering audio output signals
US10395654B2 (en) 2017-05-11 2019-08-27 Apple Inc. Text normalization based on a data-driven learning network
US10403283B1 (en) 2018-06-01 2019-09-03 Apple Inc. Voice interaction at a primary device to access call functionality of a companion device
US10403278B2 (en) 2017-05-16 2019-09-03 Apple Inc. Methods and systems for phonetic matching in digital assistant services
US10410637B2 (en) 2017-05-12 2019-09-10 Apple Inc. User-specific acoustic models
US10417266B2 (en) 2017-05-09 2019-09-17 Apple Inc. Context-aware ranking of intelligent response suggestions
US10431204B2 (en) 2014-09-11 2019-10-01 Apple Inc. Method and apparatus for discovering trending terms in speech requests
US10438595B2 (en) 2014-09-30 2019-10-08 Apple Inc. Speaker identification and unsupervised speaker adaptation techniques
US10446143B2 (en) 2016-03-14 2019-10-15 Apple Inc. Identification of voice inputs providing credentials
US10445429B2 (en) 2017-09-21 2019-10-15 Apple Inc. Natural language understanding using vocabularies with compressed serialized tries
US10453443B2 (en) 2014-09-30 2019-10-22 Apple Inc. Providing an indication of the suitability of speech recognition
US10474753B2 (en) 2016-09-07 2019-11-12 Apple Inc. Language identification using recurrent neural networks
US10482874B2 (en) 2017-05-15 2019-11-19 Apple Inc. Hierarchical belief states for digital assistants
US10490187B2 (en) 2016-06-10 2019-11-26 Apple Inc. Digital assistant providing automated status report
US10496753B2 (en) 2010-01-18 2019-12-03 Apple Inc. Automatically adapting user interfaces for hands-free interaction
US10497365B2 (en) 2014-05-30 2019-12-03 Apple Inc. Multi-command single utterance input method
US10496705B1 (en) 2018-06-03 2019-12-03 Apple Inc. Accelerated task performance
US10509862B2 (en) 2016-06-10 2019-12-17 Apple Inc. Dynamic phrase expansion of language input
US10521466B2 (en) 2016-06-11 2019-12-31 Apple Inc. Data driven natural language event detection and classification
US10529332B2 (en) 2015-03-08 2020-01-07 Apple Inc. Virtual assistant activation
US10553216B2 (en) 2008-05-27 2020-02-04 Oracle International Corporation System and method for an integrated, multi-modal, multi-device natural language voice services environment
US10553209B2 (en) 2010-01-18 2020-02-04 Apple Inc. Systems and methods for hands-free notification summaries
US10567477B2 (en) 2015-03-08 2020-02-18 Apple Inc. Virtual assistant continuity
US10592604B2 (en) 2018-03-12 2020-03-17 Apple Inc. Inverse text normalization for automatic speech recognition
US10593346B2 (en) 2016-12-22 2020-03-17 Apple Inc. Rank-reduced token representation for automatic speech recognition
US10636424B2 (en) 2017-11-30 2020-04-28 Apple Inc. Multi-turn canned dialog
US10643611B2 (en) 2008-10-02 2020-05-05 Apple Inc. Electronic devices with voice command and contextual data processing capabilities
US10657328B2 (en) 2017-06-02 2020-05-19 Apple Inc. Multi-task recurrent neural network architecture for efficient morphology handling in neural language modeling
US10671428B2 (en) 2015-09-08 2020-06-02 Apple Inc. Distributed personal assistant
US10679605B2 (en) 2010-01-18 2020-06-09 Apple Inc. Hands-free list-reading by intelligent automated assistant
US10684703B2 (en) 2018-06-01 2020-06-16 Apple Inc. Attention aware virtual assistant dismissal
US10691473B2 (en) 2015-11-06 2020-06-23 Apple Inc. Intelligent automated assistant in a messaging environment
US10699717B2 (en) 2014-05-30 2020-06-30 Apple Inc. Intelligent assistant for home automation
US10705794B2 (en) 2010-01-18 2020-07-07 Apple Inc. Automatically adapting user interfaces for hands-free interaction
US10714117B2 (en) 2013-02-07 2020-07-14 Apple Inc. Voice trigger for a digital assistant
US10726832B2 (en) 2017-05-11 2020-07-28 Apple Inc. Maintaining privacy of personal information
US10733982B2 (en) 2018-01-08 2020-08-04 Apple Inc. Multi-directional dialog
US10733993B2 (en) 2016-06-10 2020-08-04 Apple Inc. Intelligent digital assistant in a multi-tasking environment
US10733375B2 (en) 2018-01-31 2020-08-04 Apple Inc. Knowledge-based framework for improving natural language understanding
US10741185B2 (en) 2010-01-18 2020-08-11 Apple Inc. Intelligent automated assistant
US10748546B2 (en) 2017-05-16 2020-08-18 Apple Inc. Digital assistant services based on device capabilities
US10747498B2 (en) 2015-09-08 2020-08-18 Apple Inc. Zero latency digital assistant
US10755051B2 (en) 2017-09-29 2020-08-25 Apple Inc. Rule-based natural language processing
US10755703B2 (en) 2017-05-11 2020-08-25 Apple Inc. Offline personal assistant
US10789959B2 (en) 2018-03-02 2020-09-29 Apple Inc. Training speaker recognition models for digital assistants
US10791176B2 (en) 2017-05-12 2020-09-29 Apple Inc. Synchronization and task delegation of a digital assistant
US10789945B2 (en) 2017-05-12 2020-09-29 Apple Inc. Low-latency intelligent automated assistant
US10795541B2 (en) 2009-06-05 2020-10-06 Apple Inc. Intelligent organization of tasks items
US10810274B2 (en) 2017-05-15 2020-10-20 Apple Inc. Optimizing dialogue policy decisions for digital assistants using implicit feedback
US10818288B2 (en) 2018-03-26 2020-10-27 Apple Inc. Natural assistant interaction
US10839159B2 (en) 2018-09-28 2020-11-17 Apple Inc. Named entity normalization in a spoken dialog system
US10892996B2 (en) 2018-06-01 2021-01-12 Apple Inc. Variable latency device coordination
US10909331B2 (en) 2018-03-30 2021-02-02 Apple Inc. Implicit identification of translation payload with neural machine translation
US10928918B2 (en) 2018-05-07 2021-02-23 Apple Inc. Raise to speak
US10984780B2 (en) 2018-05-21 2021-04-20 Apple Inc. Global semantic word embeddings using bi-directional recurrent neural networks
US11010561B2 (en) 2018-09-27 2021-05-18 Apple Inc. Sentiment prediction from textual data
US11010550B2 (en) 2015-09-29 2021-05-18 Apple Inc. Unified language modeling framework for word prediction, auto-completion and auto-correction
US11010127B2 (en) 2015-06-29 2021-05-18 Apple Inc. Virtual assistant for media playback
US11023513B2 (en) 2007-12-20 2021-06-01 Apple Inc. Method and apparatus for searching using an active ontology
US11025565B2 (en) 2015-06-07 2021-06-01 Apple Inc. Personalized prediction of responses for instant messaging
US11069336B2 (en) 2012-03-02 2021-07-20 Apple Inc. Systems and methods for name pronunciation
US11070949B2 (en) 2015-05-27 2021-07-20 Apple Inc. Systems and methods for proactively identifying and surfacing relevant content on an electronic device with a touch-sensitive display
US11080012B2 (en) 2009-06-05 2021-08-03 Apple Inc. Interface for a virtual digital assistant
US11120372B2 (en) 2011-06-03 2021-09-14 Apple Inc. Performing actions associated with task items that represent tasks to perform
US11127397B2 (en) 2015-05-27 2021-09-21 Apple Inc. Device voice control
US11133008B2 (en) 2014-05-30 2021-09-28 Apple Inc. Reducing the need for manual start/end-pointing and trigger phrases
US11140099B2 (en) 2019-05-21 2021-10-05 Apple Inc. Providing message response suggestions
US11145294B2 (en) 2018-05-07 2021-10-12 Apple Inc. Intelligent automated assistant for delivering content from user experiences
US11170166B2 (en) 2018-09-28 2021-11-09 Apple Inc. Neural typographical error modeling via generative adversarial networks
US11204787B2 (en) 2017-01-09 2021-12-21 Apple Inc. Application integration with a digital assistant
US11217251B2 (en) 2019-05-06 2022-01-04 Apple Inc. Spoken notifications
US11227589B2 (en) 2016-06-06 2022-01-18 Apple Inc. Intelligent list reading
US11231904B2 (en) 2015-03-06 2022-01-25 Apple Inc. Reducing response latency of intelligent automated assistants
US11237797B2 (en) 2019-05-31 2022-02-01 Apple Inc. User activity shortcut suggestions
US11269678B2 (en) 2012-05-15 2022-03-08 Apple Inc. Systems and methods for integrating third party services with a digital assistant
US11281993B2 (en) 2016-12-05 2022-03-22 Apple Inc. Model and ensemble compression for metric learning
US11289073B2 (en) 2019-05-31 2022-03-29 Apple Inc. Device text to speech
US11301477B2 (en) 2017-05-12 2022-04-12 Apple Inc. Feedback analysis of a digital assistant
US11307752B2 (en) 2019-05-06 2022-04-19 Apple Inc. User configurable task triggers
US11314370B2 (en) 2013-12-06 2022-04-26 Apple Inc. Method for extracting salient dialog usage from live data
US11348573B2 (en) 2019-03-18 2022-05-31 Apple Inc. Multimodality in digital assistant systems
US11350253B2 (en) 2011-06-03 2022-05-31 Apple Inc. Active transport based notifications
US11360641B2 (en) 2019-06-01 2022-06-14 Apple Inc. Increasing the relevance of new available information
US11388291B2 (en) 2013-03-14 2022-07-12 Apple Inc. System and method for processing voicemail
US11386266B2 (en) 2018-06-01 2022-07-12 Apple Inc. Text correction
US11423908B2 (en) 2019-05-06 2022-08-23 Apple Inc. Interpreting spoken requests
US11462215B2 (en) 2018-09-28 2022-10-04 Apple Inc. Multi-modal inputs for voice commands
US11468282B2 (en) 2015-05-15 2022-10-11 Apple Inc. Virtual assistant in a communication session
US11475898B2 (en) 2018-10-26 2022-10-18 Apple Inc. Low-latency multi-speaker speech recognition
US11475884B2 (en) 2019-05-06 2022-10-18 Apple Inc. Reducing digital assistant latency when a language is incorrectly determined
US11488406B2 (en) 2019-09-25 2022-11-01 Apple Inc. Text detection using global geometry estimators
US11496600B2 (en) 2019-05-31 2022-11-08 Apple Inc. Remote execution of machine-learned models
US11495218B2 (en) 2018-06-01 2022-11-08 Apple Inc. Virtual assistant operation in multi-device environments
US11532306B2 (en) 2017-05-16 2022-12-20 Apple Inc. Detecting a trigger of a digital assistant
US11587559B2 (en) 2015-09-30 2023-02-21 Apple Inc. Intelligent device identification
US11638059B2 (en) 2019-01-04 2023-04-25 Apple Inc. Content playback on multiple devices
US11657813B2 (en) 2019-05-31 2023-05-23 Apple Inc. Voice identification in digital assistant systems
US11671920B2 (en) 2007-04-03 2023-06-06 Apple Inc. Method and system for operating a multifunction portable electronic device using voice-activation
US11765209B2 (en) 2020-05-11 2023-09-19 Apple Inc. Digital assistant hardware abstraction
US11798547B2 (en) 2013-03-15 2023-10-24 Apple Inc. Voice activated device for use with a voice-based digital assistant
US11809483B2 (en) 2015-09-08 2023-11-07 Apple Inc. Intelligent automated assistant for media search and playback
US11853536B2 (en) 2015-09-08 2023-12-26 Apple Inc. Intelligent automated assistant in a media environment
US11886805B2 (en) 2015-11-09 2024-01-30 Apple Inc. Unconventional virtual assistant interactions

Families Citing this family (195)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
AU6630800A (en) 1999-08-13 2001-03-13 Pixo, Inc. Methods and apparatuses for display and traversing of links in page character array
ITFI20010199A1 (it) 2001-10-22 2003-04-22 Riccardo Vieri Sistema e metodo per trasformare in voce comunicazioni testuali ed inviarle con una connessione internet a qualsiasi apparato telefonico
US7398209B2 (en) 2002-06-03 2008-07-08 Voicebox Technologies, Inc. Systems and methods for responding to natural language speech utterance
US7693720B2 (en) 2002-07-15 2010-04-06 Voicebox Technologies, Inc. Mobile systems and methods for responding to natural language speech utterance
US7669134B1 (en) 2003-05-02 2010-02-23 Apple Inc. Method and apparatus for displaying information during an instant messaging session
US9224394B2 (en) * 2009-03-24 2015-12-29 Sirius Xm Connected Vehicle Services Inc Service oriented speech recognition for in-vehicle automated interaction and in-vehicle user interfaces requiring minimal cognitive driver processing for same
US20060271520A1 (en) * 2005-05-27 2006-11-30 Ragan Gene Z Content-based implicit search query
US7640160B2 (en) 2005-08-05 2009-12-29 Voicebox Technologies, Inc. Systems and methods for responding to natural language speech utterance
US7620549B2 (en) 2005-08-10 2009-11-17 Voicebox Technologies, Inc. System and method of supporting adaptive misrecognition in conversational speech
US7949529B2 (en) 2005-08-29 2011-05-24 Voicebox Technologies, Inc. Mobile systems and methods of supporting natural language human-machine interactions
WO2007027989A2 (en) 2005-08-31 2007-03-08 Voicebox Technologies, Inc. Dynamic speech sharpening
US7633076B2 (en) 2005-09-30 2009-12-15 Apple Inc. Automated response to and sensing of user activity in portable devices
US8073681B2 (en) 2006-10-16 2011-12-06 Voicebox Technologies, Inc. System and method for a cooperative conversational voice user interface
US7818176B2 (en) 2007-02-06 2010-10-19 Voicebox Technologies, Inc. System and method for selecting and presenting advertisements based on natural language processing of voice-based input
ITFI20070177A1 (it) 2007-07-26 2009-01-27 Riccardo Vieri Sistema per la creazione e impostazione di una campagna pubblicitaria derivante dall'inserimento di messaggi pubblicitari all'interno di uno scambio di messaggi e metodo per il suo funzionamento.
US9053089B2 (en) 2007-10-02 2015-06-09 Apple Inc. Part-of-speech tagging using latent analogy
US8595642B1 (en) 2007-10-04 2013-11-26 Great Northern Research, LLC Multiple shell multi faceted graphical user interface
US8165886B1 (en) 2007-10-04 2012-04-24 Great Northern Research LLC Speech interface system and method for control and interaction with applications on a computing system
US8364694B2 (en) 2007-10-26 2013-01-29 Apple Inc. Search assistant for digital media assets
US8620662B2 (en) 2007-11-20 2013-12-31 Apple Inc. Context-aware unit selection
US8140335B2 (en) 2007-12-11 2012-03-20 Voicebox Technologies, Inc. System and method for providing a natural language voice user interface in an integrated voice navigation services environment
US8327272B2 (en) 2008-01-06 2012-12-04 Apple Inc. Portable multifunction device, method, and graphical user interface for viewing and managing electronic calendars
US8626152B2 (en) 2008-01-31 2014-01-07 Agero Connected Sevices, Inc. Flexible telematics system and method for providing telematics to a vehicle
US8065143B2 (en) 2008-02-22 2011-11-22 Apple Inc. Providing text input using speech data and non-speech data
US8289283B2 (en) 2008-03-04 2012-10-16 Apple Inc. Language input interface on a device
US8464150B2 (en) 2008-06-07 2013-06-11 Apple Inc. Automatic language identification for dynamic text processing
US8768702B2 (en) 2008-09-05 2014-07-01 Apple Inc. Multi-tiered voice feedback in an electronic device
US8898568B2 (en) 2008-09-09 2014-11-25 Apple Inc. Audio user interface
US8396714B2 (en) 2008-09-29 2013-03-12 Apple Inc. Systems and methods for concatenation of words in text to speech synthesis
US8352272B2 (en) 2008-09-29 2013-01-08 Apple Inc. Systems and methods for text to speech synthesis
US8583418B2 (en) 2008-09-29 2013-11-12 Apple Inc. Systems and methods of detecting language and natural language strings for text to speech synthesis
US8712776B2 (en) 2008-09-29 2014-04-29 Apple Inc. Systems and methods for selective text to speech synthesis
US8352268B2 (en) 2008-09-29 2013-01-08 Apple Inc. Systems and methods for selective rate of speech and speech preferences for text to speech synthesis
US8355919B2 (en) 2008-09-29 2013-01-15 Apple Inc. Systems and methods for text normalization for text to speech synthesis
WO2010067118A1 (en) 2008-12-11 2010-06-17 Novauris Technologies Limited Speech recognition involving a mobile device
US8862252B2 (en) 2009-01-30 2014-10-14 Apple Inc. Audio user interface for displayless electronic device
US8326637B2 (en) 2009-02-20 2012-12-04 Voicebox Technologies, Inc. System and method for processing multi-modal device interactions in a natural language voice services environment
US8380507B2 (en) 2009-03-09 2013-02-19 Apple Inc. Systems and methods for determining the language to use for speech generated by a text to speech engine
US10540976B2 (en) 2009-06-05 2020-01-21 Apple Inc. Contextual voice commands
US9858925B2 (en) 2009-06-05 2018-01-02 Apple Inc. Using context information to facilitate processing of commands in a virtual assistant
US9171541B2 (en) 2009-11-10 2015-10-27 Voicebox Technologies Corporation System and method for hybrid processing in a natural language voice services environment
US9502025B2 (en) 2009-11-10 2016-11-22 Voicebox Technologies Corporation System and method for providing a natural language content dedication service
US8682649B2 (en) 2009-11-12 2014-03-25 Apple Inc. Sentiment prediction from textual data
EP4318463A3 (en) * 2009-12-23 2024-02-28 Google LLC Multi-modal input on an electronic device
US11416214B2 (en) 2009-12-23 2022-08-16 Google Llc Multi-modal input on an electronic device
US8600743B2 (en) 2010-01-06 2013-12-03 Apple Inc. Noise profile determination for voice-related feature
US8381107B2 (en) 2010-01-13 2013-02-19 Apple Inc. Adaptive audio feedback system and method
US8311838B2 (en) 2010-01-13 2012-11-13 Apple Inc. Devices and methods for identifying a prompt corresponding to a voice input in a sequence of prompts
US8639516B2 (en) 2010-06-04 2014-01-28 Apple Inc. User-specific noise suppression for voice quality improvements
US8713021B2 (en) 2010-07-07 2014-04-29 Apple Inc. Unsupervised document clustering using latent semantic density analysis
US9104670B2 (en) 2010-07-21 2015-08-11 Apple Inc. Customized search or acquisition of digital media assets
US8731939B1 (en) * 2010-08-06 2014-05-20 Google Inc. Routing queries based on carrier phrase registration
US8719006B2 (en) 2010-08-27 2014-05-06 Apple Inc. Combined statistical and rule-based part-of-speech tagging for text-to-speech synthesis
US8719014B2 (en) 2010-09-27 2014-05-06 Apple Inc. Electronic device with text error correction based on voice recognition data
US10515147B2 (en) 2010-12-22 2019-12-24 Apple Inc. Using statistical language models for contextual lookup
US10762293B2 (en) 2010-12-22 2020-09-01 Apple Inc. Using parts-of-speech tagging and named entity recognition for spelling correction
US8781836B2 (en) 2011-02-22 2014-07-15 Apple Inc. Hearing assistance system for providing consistent human speech
US9263045B2 (en) * 2011-05-17 2016-02-16 Microsoft Technology Licensing, Llc Multi-mode text input
US10672399B2 (en) 2011-06-03 2020-06-02 Apple Inc. Switching between text data and audio data based on a mapping
US8812294B2 (en) 2011-06-21 2014-08-19 Apple Inc. Translating phrases from one language into another using an order-based set of declarative rules
US8706472B2 (en) 2011-08-11 2014-04-22 Apple Inc. Method for disambiguating multiple readings in language conversion
US8994660B2 (en) 2011-08-29 2015-03-31 Apple Inc. Text correction processing
JP2013068532A (ja) 2011-09-22 2013-04-18 Clarion Co Ltd 情報端末、サーバー装置、検索システムおよびその検索方法
US8762156B2 (en) 2011-09-28 2014-06-24 Apple Inc. Speech recognition repair using contextual information
US9483461B2 (en) 2012-03-06 2016-11-01 Apple Inc. Handling speech synthesis of content for multiple languages
US9431012B2 (en) 2012-04-30 2016-08-30 2236008 Ontario Inc. Post processing of natural language automatic speech recognition
US9093076B2 (en) 2012-04-30 2015-07-28 2236008 Ontario Inc. Multipass ASR controlling multiple applications
US8775442B2 (en) 2012-05-15 2014-07-08 Apple Inc. Semantic search using a single-source semantic model
WO2013185109A2 (en) 2012-06-08 2013-12-12 Apple Inc. Systems and methods for recognizing textual identifiers within a plurality of words
US9734839B1 (en) 2012-06-20 2017-08-15 Amazon Technologies, Inc. Routing natural language commands to the appropriate applications
US9495129B2 (en) 2012-06-29 2016-11-15 Apple Inc. Device, method, and user interface for voice-activated navigation and browsing of a document
US9400633B2 (en) 2012-08-02 2016-07-26 Nuance Communications, Inc. Methods and apparatus for voiced-enabling a web application
US9292252B2 (en) * 2012-08-02 2016-03-22 Nuance Communications, Inc. Methods and apparatus for voiced-enabling a web application
US10157612B2 (en) 2012-08-02 2018-12-18 Nuance Communications, Inc. Methods and apparatus for voice-enabling a web application
US9781262B2 (en) 2012-08-02 2017-10-03 Nuance Communications, Inc. Methods and apparatus for voice-enabling a web application
US9292253B2 (en) 2012-08-02 2016-03-22 Nuance Communications, Inc. Methods and apparatus for voiced-enabling a web application
US9424840B1 (en) 2012-08-31 2016-08-23 Amazon Technologies, Inc. Speech recognition platforms
US9576574B2 (en) 2012-09-10 2017-02-21 Apple Inc. Context-sensitive handling of interruptions by intelligent digital assistant
US8935167B2 (en) 2012-09-25 2015-01-13 Apple Inc. Exemplar-based latent perceptual modeling for automatic speech recognition
US9286898B2 (en) 2012-11-14 2016-03-15 Qualcomm Incorporated Methods and apparatuses for providing tangible control of sound
US9196250B2 (en) 2012-11-16 2015-11-24 2236008 Ontario Inc. Application services interface to ASR
EP3232436A3 (en) * 2012-11-16 2017-10-25 2236008 Ontario Inc. Application services interface to asr
US10572476B2 (en) 2013-03-14 2020-02-25 Apple Inc. Refining a search based on schedule items
US9977779B2 (en) 2013-03-14 2018-05-22 Apple Inc. Automatic supplementation of word correction dictionaries
US9733821B2 (en) 2013-03-14 2017-08-15 Apple Inc. Voice control to diagnose inadvertent activation of accessibility features
US9368114B2 (en) 2013-03-14 2016-06-14 Apple Inc. Context-sensitive handling of interruptions
US10642574B2 (en) 2013-03-14 2020-05-05 Apple Inc. Device, method, and graphical user interface for outputting captions
CN105027197B (zh) 2013-03-15 2018-12-14 苹果公司 训练至少部分语音命令系统
KR102057795B1 (ko) 2013-03-15 2019-12-19 애플 인크. 콘텍스트-민감성 방해 처리
WO2014144579A1 (en) 2013-03-15 2014-09-18 Apple Inc. System and method for updating an adaptive speech recognition model
EP2973002B1 (en) 2013-03-15 2019-06-26 Apple Inc. User training by intelligent digital assistant
US9384751B2 (en) * 2013-05-06 2016-07-05 Honeywell International Inc. User authentication of voice controlled devices
WO2014197336A1 (en) 2013-06-07 2014-12-11 Apple Inc. System and method for detecting errors in interactions with a voice-based digital assistant
CN105265005B (zh) 2013-06-13 2019-09-17 苹果公司 用于由语音命令发起的紧急呼叫的系统和方法
WO2014203495A1 (ja) * 2013-06-19 2014-12-24 パナソニック インテレクチュアル プロパティ コーポレーション オブ アメリカ 音声対話方法、及び機器
EP2816552B1 (en) * 2013-06-20 2018-10-17 2236008 Ontario Inc. Conditional multipass automatic speech recognition
US9733894B2 (en) * 2013-07-02 2017-08-15 24/7 Customer, Inc. Method and apparatus for facilitating voice user interface design
AU2014306221B2 (en) 2013-08-06 2017-04-06 Apple Inc. Auto-activating smart responses based on activities from remote devices
US9892745B2 (en) 2013-08-23 2018-02-13 At&T Intellectual Property I, L.P. Augmented multi-tier classifier for multi-modal voice activity detection
WO2015029362A1 (ja) * 2013-08-29 2015-03-05 パナソニック インテレクチュアル プロパティ コーポレーション オブ アメリカ 機器制御方法及び機器制御システム
US10885918B2 (en) 2013-09-19 2021-01-05 Microsoft Technology Licensing, Llc Speech recognition using phoneme matching
US9373321B2 (en) * 2013-12-02 2016-06-21 Cypress Semiconductor Corporation Generation of wake-up words
US9601108B2 (en) 2014-01-17 2017-03-21 Microsoft Technology Licensing, Llc Incorporating an exogenous large-vocabulary model into rule-based speech recognition
US10749989B2 (en) 2014-04-01 2020-08-18 Microsoft Technology Licensing Llc Hybrid client/server architecture for parallel processing
US20150278370A1 (en) * 2014-04-01 2015-10-01 Microsoft Corporation Task completion for natural language input
US20150310853A1 (en) 2014-04-25 2015-10-29 GM Global Technology Operations LLC Systems and methods for speech artifact compensation in speech recognition systems
JP6440513B2 (ja) * 2014-05-13 2018-12-19 パナソニック インテレクチュアル プロパティ コーポレーション オブ アメリカPanasonic Intellectual Property Corporation of America 音声認識機能を用いた情報提供方法および機器の制御方法
US9620105B2 (en) 2014-05-15 2017-04-11 Apple Inc. Analyzing audio input for efficient speech and music recognition
US10592095B2 (en) 2014-05-23 2020-03-17 Apple Inc. Instantaneous speaking of content on touch devices
US9502031B2 (en) 2014-05-27 2016-11-22 Apple Inc. Method for supporting dynamic grammars in WFST-based ASR
US10289433B2 (en) 2014-05-30 2019-05-14 Apple Inc. Domain specific language for encoding assistant dialog
US10078631B2 (en) 2014-05-30 2018-09-18 Apple Inc. Entropy-guided text prediction using combined word and character n-gram language models
US9842101B2 (en) 2014-05-30 2017-12-12 Apple Inc. Predictive conversion of language input
US9760559B2 (en) 2014-05-30 2017-09-12 Apple Inc. Predictive text input
US9734193B2 (en) 2014-05-30 2017-08-15 Apple Inc. Determining domain salience ranking from ambiguous words in natural speech
US9785630B2 (en) 2014-05-30 2017-10-10 Apple Inc. Text prediction using combined word N-gram and unigram language models
US10659851B2 (en) 2014-06-30 2020-05-19 Apple Inc. Real-time digital assistant knowledge updates
US10446141B2 (en) 2014-08-28 2019-10-15 Apple Inc. Automatic speech recognition based on user feedback
US10789041B2 (en) 2014-09-12 2020-09-29 Apple Inc. Dynamic thresholds for always listening speech trigger
US9898459B2 (en) 2014-09-16 2018-02-20 Voicebox Technologies Corporation Integration of domain information into state transitions of a finite state transducer for natural language processing
EP3195145A4 (en) 2014-09-16 2018-01-24 VoiceBox Technologies Corporation Voice commerce
US9886432B2 (en) 2014-09-30 2018-02-06 Apple Inc. Parsimonious handling of word inflection via categorical stem + suffix N-gram language models
US9646609B2 (en) 2014-09-30 2017-05-09 Apple Inc. Caching apparatus for serving phonetic pronunciations
US9812128B2 (en) 2014-10-09 2017-11-07 Google Inc. Device leadership negotiation among voice interface devices
CN107003999B (zh) 2014-10-15 2020-08-21 声钰科技 对用户的在先自然语言输入的后续响应的系统和方法
US9842593B2 (en) 2014-11-14 2017-12-12 At&T Intellectual Property I, L.P. Multi-level content analysis and response
US10614799B2 (en) 2014-11-26 2020-04-07 Voicebox Technologies Corporation System and method of providing intent predictions for an utterance prior to a system detection of an end of the utterance
US10431214B2 (en) 2014-11-26 2019-10-01 Voicebox Technologies Corporation System and method of determining a domain and/or an action related to a natural language input
US10552013B2 (en) 2014-12-02 2020-02-04 Apple Inc. Data detection
US9711141B2 (en) 2014-12-09 2017-07-18 Apple Inc. Disambiguating heteronyms in speech synthesis
KR102387567B1 (ko) * 2015-01-19 2022-04-18 삼성전자주식회사 음성 인식 방법 및 음성 인식 장치
US9865280B2 (en) 2015-03-06 2018-01-09 Apple Inc. Structured dictation using intelligent automated assistants
US9899019B2 (en) 2015-03-18 2018-02-20 Apple Inc. Systems and methods for structured stem and suffix language models
US9842105B2 (en) 2015-04-16 2017-12-12 Apple Inc. Parsimonious continuous-space phrase representations for natural language processing
US9966073B2 (en) 2015-05-27 2018-05-08 Google Llc Context-sensitive dynamic update of voice to text model in a voice-enabled electronic device
US9870196B2 (en) 2015-05-27 2018-01-16 Google Llc Selective aborting of online processing of voice inputs in a voice-enabled electronic device
US10083697B2 (en) 2015-05-27 2018-09-25 Google Llc Local persisting of data for selectively offline capable voice action in a voice-enabled electronic device
US10127220B2 (en) 2015-06-04 2018-11-13 Apple Inc. Language identification from short strings
US10101822B2 (en) 2015-06-05 2018-10-16 Apple Inc. Language input correction
US10255907B2 (en) 2015-06-07 2019-04-09 Apple Inc. Automatic accent detection using acoustic models
US10186254B2 (en) 2015-06-07 2019-01-22 Apple Inc. Context-based endpoint detection
US10026399B2 (en) 2015-09-11 2018-07-17 Amazon Technologies, Inc. Arbitration between voice-enabled devices
US9697820B2 (en) 2015-09-24 2017-07-04 Apple Inc. Unit-selection text-to-speech synthesis using concatenation-sensitive neural networks
WO2017091883A1 (en) * 2015-12-01 2017-06-08 Tandemlaunch Inc. System and method for implementing a vocal user interface by combining a speech to text system and a speech to intent system
US10095470B2 (en) 2016-02-22 2018-10-09 Sonos, Inc. Audio response playback
US9826306B2 (en) 2016-02-22 2017-11-21 Sonos, Inc. Default playback device designation
US10264030B2 (en) 2016-02-22 2019-04-16 Sonos, Inc. Networked microphone device control
US10587708B2 (en) 2016-03-28 2020-03-10 Microsoft Technology Licensing, Llc Multi-modal conversational intercom
US10171410B2 (en) 2016-03-28 2019-01-01 Microsoft Technology Licensing, Llc Cross-mode communiation
US11487512B2 (en) 2016-03-29 2022-11-01 Microsoft Technology Licensing, Llc Generating a services application
WO2018023106A1 (en) 2016-07-29 2018-02-01 Erik SWART System and method of disambiguating natural language processing requests
US10115400B2 (en) 2016-08-05 2018-10-30 Sonos, Inc. Multiple voice services
US10540513B2 (en) 2016-09-13 2020-01-21 Microsoft Technology Licensing, Llc Natural language processor extension transmission data protection
US10783883B2 (en) * 2016-11-03 2020-09-22 Google Llc Focus session at a voice interface device
US10600418B2 (en) 2016-12-07 2020-03-24 Google Llc Voice to text conversion based on third-party agent content
US20180213396A1 (en) * 2017-01-20 2018-07-26 Essential Products, Inc. Privacy control in a connected environment based on speech characteristics
US10861450B2 (en) 2017-02-10 2020-12-08 Samsung Electronics Co., Ltd. Method and apparatus for managing voice-based interaction in internet of things network system
US10409659B2 (en) 2017-03-16 2019-09-10 Honeywell International Inc. Systems and methods for command management
US10748531B2 (en) * 2017-04-13 2020-08-18 Harman International Industries, Incorporated Management layer for multiple intelligent personal assistant services
KR101924852B1 (ko) * 2017-04-14 2018-12-04 네이버 주식회사 네트워크에 연결된 음향기기와의 멀티모달 인터렉션 방법 및 시스템
JP6508251B2 (ja) * 2017-04-27 2019-05-08 トヨタ自動車株式会社 音声対話システムおよび情報処理装置
DK180048B1 (en) 2017-05-11 2020-02-04 Apple Inc. MAINTAINING THE DATA PROTECTION OF PERSONAL INFORMATION
US10636428B2 (en) * 2017-06-29 2020-04-28 Microsoft Technology Licensing, Llc Determining a target device for voice command interaction
US10475449B2 (en) 2017-08-07 2019-11-12 Sonos, Inc. Wake-word detection suppression
US10048930B1 (en) 2017-09-08 2018-08-14 Sonos, Inc. Dynamic computation of system response volume
US10482868B2 (en) 2017-09-28 2019-11-19 Sonos, Inc. Multi-channel acoustic echo cancellation
US10466962B2 (en) 2017-09-29 2019-11-05 Sonos, Inc. Media playback system with voice assistance
US11175880B2 (en) 2018-05-10 2021-11-16 Sonos, Inc. Systems and methods for voice-assisted media content selection
US10959029B2 (en) 2018-05-25 2021-03-23 Sonos, Inc. Determining and adapting to changes in microphone performance of playback devices
US10803865B2 (en) 2018-06-05 2020-10-13 Voicify, LLC Voice application platform
US10235999B1 (en) 2018-06-05 2019-03-19 Voicify, LLC Voice application platform
US11437029B2 (en) 2018-06-05 2022-09-06 Voicify, LLC Voice application platform
US10636425B2 (en) 2018-06-05 2020-04-28 Voicify, LLC Voice application platform
US11016968B1 (en) * 2018-09-18 2021-05-25 Amazon Technologies, Inc. Mutation architecture for contextual data aggregator
US11024331B2 (en) 2018-09-21 2021-06-01 Sonos, Inc. Voice detection optimization using sound metadata
US11100923B2 (en) 2018-09-28 2021-08-24 Sonos, Inc. Systems and methods for selective wake word detection using neural network models
US11899519B2 (en) 2018-10-23 2024-02-13 Sonos, Inc. Multiple stage network microphone device with reduced power consumption and processing load
US11183183B2 (en) 2018-12-07 2021-11-23 Sonos, Inc. Systems and methods of operating media playback systems having multiple voice assistant services
US11132989B2 (en) 2018-12-13 2021-09-28 Sonos, Inc. Networked microphone devices, systems, and methods of localized arbitration
US11120794B2 (en) 2019-05-03 2021-09-14 Sonos, Inc. Voice assistant persistence across multiple network microphone devices
US11468890B2 (en) 2019-06-01 2022-10-11 Apple Inc. Methods and user interfaces for voice-based control of electronic devices
US11875231B2 (en) 2019-06-26 2024-01-16 Samsung Electronics Co., Ltd. System and method for complex task machine learning
US11189286B2 (en) 2019-10-22 2021-11-30 Sonos, Inc. VAS toggle based on device orientation
US11200900B2 (en) 2019-12-20 2021-12-14 Sonos, Inc. Offline voice control
US20210210099A1 (en) * 2020-01-06 2021-07-08 Soundhound, Inc. Multi Device Proxy
US11562740B2 (en) 2020-01-07 2023-01-24 Sonos, Inc. Voice verification for media playback
US11308958B2 (en) 2020-02-07 2022-04-19 Sonos, Inc. Localized wakeword verification
US11061543B1 (en) 2020-05-11 2021-07-13 Apple Inc. Providing relevant data items based on context
US11810578B2 (en) 2020-05-11 2023-11-07 Apple Inc. Device arbitration for digital assistant-based intercom systems
US11482224B2 (en) 2020-05-20 2022-10-25 Sonos, Inc. Command keywords with input detection windowing
US11490204B2 (en) 2020-07-20 2022-11-01 Apple Inc. Multi-device audio adjustment coordination
US11438683B2 (en) 2020-07-21 2022-09-06 Apple Inc. User identification using headphones
US11829720B2 (en) 2020-09-01 2023-11-28 Apple Inc. Analysis and validation of language models
US11461681B2 (en) * 2020-10-14 2022-10-04 Openstream Inc. System and method for multi-modality soft-agent for query population and information mining
CN116830190A (zh) * 2020-12-21 2023-09-29 塞伦妮经营公司 跨不同种生态系统路由用户命令

Family Cites Families (473)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US559864A (en) * 1896-05-12 Potato-digging machine
US4430669A (en) 1981-05-29 1984-02-07 Payview Limited Transmitting and receiving apparatus for permitting the transmission and reception of multi-tier subscription programs
US5208748A (en) 1985-11-18 1993-05-04 Action Technologies, Inc. Method and apparatus for structuring and managing human communications by explicitly defining the types of communications permitted between participants
US4910784A (en) 1987-07-30 1990-03-20 Texas Instruments Incorporated Low cost speech recognition system and method
CA1268228A (en) 1987-09-14 1990-04-24 Gary Lennartz Voice interactive security system
US5027406A (en) 1988-12-06 1991-06-25 Dragon Systems, Inc. Method for interactive speech recognition and training
SE466029B (sv) 1989-03-06 1991-12-02 Ibm Svenska Ab Anordning och foerfarande foer analys av naturligt spraak i ett datorbaserat informationsbehandlingssystem
JPH03129469A (ja) 1989-10-14 1991-06-03 Canon Inc 自然言語処理装置
JP3266246B2 (ja) 1990-06-15 2002-03-18 インターナシヨナル・ビジネス・マシーンズ・コーポレーシヨン 自然言語解析装置及び方法並びに自然言語解析用知識ベース構築方法
US5164904A (en) 1990-07-26 1992-11-17 Farradyne Systems, Inc. In-vehicle traffic congestion information system
US5722084A (en) 1990-09-28 1998-02-24 At&T Corp. Cellular/PCS handset NAM download capability using a wide-area paging system
DE69116167D1 (de) 1990-11-27 1996-02-15 Gordon M Jacobs Digitaler datenumsetzer
US5274560A (en) 1990-12-03 1993-12-28 Audio Navigation Systems, Inc. Sensor free vehicle navigation system utilizing a voice input/output interface for routing a driver from his source point to his destination point
EP0543329B1 (en) 1991-11-18 2002-02-06 Kabushiki Kaisha Toshiba Speech dialogue system for facilitating human-computer interaction
US5608635A (en) 1992-04-14 1997-03-04 Zexel Corporation Navigation system for a vehicle with route recalculation between multiple locations
CA2102077C (en) 1992-12-21 1997-09-16 Steven Lloyd Greenspan Call billing and measurement methods for redirected calls
US5465289A (en) 1993-03-05 1995-11-07 E-Systems, Inc. Cellular based traffic sensor system
US5471318A (en) 1993-04-22 1995-11-28 At&T Corp. Multimedia communications network
US5377350A (en) 1993-04-30 1994-12-27 International Business Machines Corporation System for cooperative communication between local object managers to provide verification for the performance of remote calls by object messages
US5537436A (en) 1993-06-14 1996-07-16 At&T Corp. Simultaneous analog and digital communication applications
US5983161A (en) 1993-08-11 1999-11-09 Lemelson; Jerome H. GPS vehicle collision avoidance warning and control system and method
EP0645757B1 (en) 1993-09-23 2000-04-05 Xerox Corporation Semantic co-occurrence filtering for speech recognition and signal transcription applications
US5475733A (en) 1993-11-04 1995-12-12 At&T Corp. Language accommodated message relaying for hearing impaired callers
CA2118278C (en) 1993-12-21 1999-09-07 J. David Garland Multimedia system
US5748841A (en) 1994-02-25 1998-05-05 Morin; Philippe Supervised contextual language acquisition system
US5533108A (en) 1994-03-18 1996-07-02 At&T Corp. Method and system for routing phone calls based on voice and data transport capability
US5488652A (en) 1994-04-14 1996-01-30 Northern Telecom Limited Method and apparatus for training speech recognition algorithms for directory assistance applications
US5652570A (en) 1994-05-19 1997-07-29 Lepkofker; Robert Individual location system
US5752052A (en) 1994-06-24 1998-05-12 Microsoft Corporation Method and system for bootstrapping statistical processing into a rule-based natural language parser
JP2674521B2 (ja) 1994-09-21 1997-11-12 日本電気株式会社 移動体誘導装置
US5539744A (en) 1994-10-17 1996-07-23 At&T Corp. Hand-off management for cellular telephony
US5696965A (en) 1994-11-03 1997-12-09 Intel Corporation Electronic information appraisal agent
JP2855409B2 (ja) 1994-11-17 1999-02-10 日本アイ・ビー・エム株式会社 自然言語処理方法及びシステム
US6571279B1 (en) 1997-12-05 2003-05-27 Pinpoint Incorporated Location enhanced information delivery system
US5499289A (en) 1994-12-06 1996-03-12 At&T Corp. Systems, methods and articles of manufacture for performing distributed telecommunications
US5748974A (en) 1994-12-13 1998-05-05 International Business Machines Corporation Multimodal natural language interface for cross-application tasks
US5774859A (en) 1995-01-03 1998-06-30 Scientific-Atlanta, Inc. Information system having a speech interface
US5794050A (en) 1995-01-04 1998-08-11 Intelligent Text Processing, Inc. Natural language understanding system
US5892900A (en) 1996-08-30 1999-04-06 Intertrust Technologies Corp. Systems and methods for secure transaction management and electronic rights protection
US5918222A (en) 1995-03-17 1999-06-29 Kabushiki Kaisha Toshiba Information disclosing apparatus and multi-modal information input/output system
US6965864B1 (en) 1995-04-10 2005-11-15 Texas Instruments Incorporated Voice activated hypermedia systems using grammatical metadata
DE69622565T2 (de) 1995-05-26 2003-04-03 Speechworks Int Inc Verfahren und vorrichtung zur dynamischen anpassung eines spracherkennungssystems mit grossem wortschatz und zur verwendung von einschränkungen aus einer datenbank in einem spracherkennungssystem mit grossem wortschatz
US5708422A (en) 1995-05-31 1998-01-13 At&T Transaction authorization and alert system
JP3716870B2 (ja) 1995-05-31 2005-11-16 ソニー株式会社 音声認識装置および音声認識方法
US5721938A (en) 1995-06-07 1998-02-24 Stuckey; Barbara K. Method and device for parsing and analyzing natural language sentences and text
US5617407A (en) 1995-06-21 1997-04-01 Bareis; Monica M. Optical disk having speech recognition templates for information access
US5794196A (en) 1995-06-30 1998-08-11 Kurzweil Applied Intelligence, Inc. Speech recognition system distinguishing dictation from commands by arbitration between continuous speech and isolated word modules
US6292767B1 (en) 1995-07-18 2001-09-18 Nuance Communications Method and system for building and running natural language understanding systems
US5963940A (en) 1995-08-16 1999-10-05 Syracuse University Natural language information retrieval system and method
US5911120A (en) 1995-09-08 1999-06-08 At&T Wireless Services Wireless communication system having mobile stations establish a communication link through the base station without using a landline or regional cellular network and without a call in progress
US5675629A (en) 1995-09-08 1997-10-07 At&T Cordless cellular system base station
US5855000A (en) 1995-09-08 1998-12-29 Carnegie Mellon University Method and apparatus for correcting and repairing machine-transcribed input using independent or cross-modal secondary input
US6192110B1 (en) 1995-09-15 2001-02-20 At&T Corp. Method and apparatus for generating sematically consistent inputs to a dialog manager
US5774841A (en) 1995-09-20 1998-06-30 The United States Of America As Represented By The Adminstrator Of The National Aeronautics And Space Administration Real-time reconfigurable adaptive speech recognition command and control apparatus and method
US5799276A (en) 1995-11-07 1998-08-25 Accent Incorporated Knowledge-based speech recognition system and methods having frame length computed based upon estimated pitch period of vocalic intervals
US5960447A (en) 1995-11-13 1999-09-28 Holt; Douglas Word tagging and editing system for speech recognition
WO1997023068A2 (en) 1995-12-15 1997-06-26 Philips Electronic N.V. An adaptive noise cancelling arrangement, a noise reduction system and a transceiver
US6567778B1 (en) 1995-12-21 2003-05-20 Nuance Communications Natural language speech recognition using slot semantic confidence scores related to their word recognition confidence scores
US5802510A (en) 1995-12-29 1998-09-01 At&T Corp Universal directory service
US5742763A (en) 1995-12-29 1998-04-21 At&T Corp. Universal message delivery system for handles identifying network presences
US5633922A (en) 1995-12-29 1997-05-27 At&T Process and apparatus for restarting call routing in a telephone network
US5832221A (en) 1995-12-29 1998-11-03 At&T Corp Universal message storage system
US5987404A (en) 1996-01-29 1999-11-16 International Business Machines Corporation Statistical natural language understanding using hidden clumpings
US6314420B1 (en) 1996-04-04 2001-11-06 Lycos, Inc. Collaborative/adaptive search engine
US5848396A (en) 1996-04-26 1998-12-08 Freedom Of Information, Inc. Method and apparatus for determining behavioral profile of a computer user
US5878386A (en) 1996-06-28 1999-03-02 Microsoft Corporation Natural language parser with dictionary-based part-of-speech probabilities
US5953393A (en) 1996-07-15 1999-09-14 At&T Corp. Personal telephone agent
US5867817A (en) 1996-08-19 1999-02-02 Virtual Vision, Inc. Speech recognition manager
US6009382A (en) 1996-08-19 1999-12-28 International Business Machines Corporation Word storage table for natural language determination
US6385646B1 (en) 1996-08-23 2002-05-07 At&T Corp. Method and system for establishing voice communications in an internet environment
US6470315B1 (en) 1996-09-11 2002-10-22 Texas Instruments Incorporated Enrollment and modeling method and apparatus for robust speaker dependent speech models
US5878385A (en) 1996-09-16 1999-03-02 Ergo Linguistic Technologies Method and apparatus for universal parsing of language
US6085186A (en) 1996-09-20 2000-07-04 Netbot, Inc. Method and system using information written in a wrapper description language to execute query on a network
US6961700B2 (en) 1996-09-24 2005-11-01 Allvoice Computing Plc Method and apparatus for processing the output of a speech recognition engine
US6035267A (en) 1996-09-26 2000-03-07 Mitsubishi Denki Kabushiki Kaisha Interactive processing apparatus having natural language interfacing capability, utilizing goal frames, and judging action feasibility
US5892813A (en) 1996-09-30 1999-04-06 Matsushita Electric Industrial Co., Ltd. Multimodal voice dialing digital key telephone with dialog manager
US5995928A (en) 1996-10-02 1999-11-30 Speechworks International, Inc. Method and apparatus for continuous spelling speech recognition with early identification
US5902347A (en) 1996-11-19 1999-05-11 American Navigation Systems, Inc. Hand-held GPS-mapping device
US5839107A (en) 1996-11-29 1998-11-17 Northern Telecom Limited Method and apparatus for automatically generating a speech recognition vocabulary from a white pages listing
US6154526A (en) 1996-12-04 2000-11-28 Intellivoice Communications, Inc. Data acquisition and error correcting speech recognition system
US5960399A (en) 1996-12-24 1999-09-28 Gte Internetworking Incorporated Client/server speech processor/recognizer
US6456974B1 (en) 1997-01-06 2002-09-24 Texas Instruments Incorporated System and method for adding speech recognition capabilities to java
US6122613A (en) 1997-01-30 2000-09-19 Dragon Systems, Inc. Speech recognition using multiple recognizers (selectively) applied to the same input sample
JPH10254486A (ja) 1997-03-13 1998-09-25 Canon Inc 音声認識装置および方法
GB2323693B (en) 1997-03-27 2001-09-26 Forum Technology Ltd Speech to text conversion
US6167377A (en) 1997-03-28 2000-12-26 Dragon Systems, Inc. Speech recognition language models
FR2761837B1 (fr) 1997-04-08 1999-06-11 Sophie Sommelet Dispositif d'aide a la navigation ayant une architecture distribuee basee sur internet
US6014559A (en) 1997-04-10 2000-01-11 At&T Wireless Services, Inc. Method and system for delivering a voice mail notification to a private base station using cellular phone network
US6078886A (en) 1997-04-14 2000-06-20 At&T Corporation System and method for providing remote automatic speech recognition services via a packet network
US6058187A (en) 1997-04-17 2000-05-02 At&T Corp. Secure telecommunications data transmission
US5895464A (en) 1997-04-30 1999-04-20 Eastman Kodak Company Computer program product and a method for using natural language for the description, search and retrieval of multi-media objects
CA2292959A1 (en) 1997-05-06 1998-11-12 Speechworks International, Inc. System and method for developing interactive speech applications
US6128369A (en) 1997-05-14 2000-10-03 A.T.&T. Corp. Employing customer premises equipment in communications network maintenance
US5960397A (en) 1997-05-27 1999-09-28 At&T Corp System and method of recognizing an acoustic environment to adapt a set of based recognition models to the current acoustic environment for subsequent speech recognition
US5995119A (en) 1997-06-06 1999-11-30 At&T Corp. Method for generating photo-realistic animated characters
FI972723A0 (fi) 1997-06-24 1997-06-24 Nokia Mobile Phones Ltd Mobila kommunikationsanordningar
US6199043B1 (en) 1997-06-24 2001-03-06 International Business Machines Corporation Conversation management in speech recognition interfaces
US6101241A (en) 1997-07-16 2000-08-08 At&T Corp. Telephone-based speech recognition for data collection
US5926784A (en) 1997-07-17 1999-07-20 Microsoft Corporation Method and system for natural language parsing using podding
US5933822A (en) 1997-07-22 1999-08-03 Microsoft Corporation Apparatus and methods for an information retrieval system that employs natural language processing of search results to improve overall precision
US6275231B1 (en) 1997-08-01 2001-08-14 American Calcar Inc. Centralized control and management system for automobiles
US6044347A (en) 1997-08-05 2000-03-28 Lucent Technologies Inc. Methods and apparatus object-oriented rule-based dialogue management
US6144667A (en) 1997-08-07 2000-11-07 At&T Corp. Network-based method and apparatus for initiating and completing a telephone call via the internet
US6192338B1 (en) 1997-08-12 2001-02-20 At&T Corp. Natural language knowledge servers as network resources
US6360234B2 (en) 1997-08-14 2002-03-19 Virage, Inc. Video cataloger system with synchronized encoders
US5895466A (en) 1997-08-19 1999-04-20 At&T Corp Automated natural language understanding customer service system
US6081774A (en) 1997-08-22 2000-06-27 Novell, Inc. Natural language information retrieval system and method
US6018708A (en) 1997-08-26 2000-01-25 Nortel Networks Corporation Method and apparatus for performing speech recognition utilizing a supplementary lexicon of frequently used orthographies
US6076059A (en) 1997-08-29 2000-06-13 Digital Equipment Corporation Method for aligning text with audio signals
US6049602A (en) 1997-09-18 2000-04-11 At&T Corp Virtual call center
US6650747B1 (en) 1997-09-18 2003-11-18 At&T Corp. Control of merchant application by system monitor in virtual contact center
DE19742054A1 (de) 1997-09-24 1999-04-01 Philips Patentverwaltung Eingabesystem wenigstens für Orts- und/oder Straßennamen
US6134235A (en) 1997-10-08 2000-10-17 At&T Corp. Pots/packet bridge
US5897613A (en) 1997-10-08 1999-04-27 Lucent Technologies Inc. Efficient transmission of voice silence intervals
US6272455B1 (en) 1997-10-22 2001-08-07 Lucent Technologies, Inc. Method and apparatus for understanding natural language
JPH11126090A (ja) 1997-10-23 1999-05-11 Pioneer Electron Corp 音声認識方法及び音声認識装置並びに音声認識装置を動作させるためのプログラムが記録された記録媒体
US6021384A (en) 1997-10-29 2000-02-01 At&T Corp. Automatic generation of superwords
US6498797B1 (en) 1997-11-14 2002-12-24 At&T Corp. Method and apparatus for communication services on a network
US6188982B1 (en) 1997-12-01 2001-02-13 Industrial Technology Research Institute On-line background noise adaptation of parallel model combination HMM with discriminative learning using weighted HMM for noisy speech recognition
US5970412A (en) 1997-12-02 1999-10-19 Maxemchuk; Nicholas Frank Overload control in a packet-switching cellular environment
US6614773B1 (en) 1997-12-02 2003-09-02 At&T Corp. Packet transmissions over cellular radio
US6219346B1 (en) 1997-12-02 2001-04-17 At&T Corp. Packet switching architecture in cellular radio
US6195634B1 (en) 1997-12-24 2001-02-27 Nortel Networks Corporation Selection of decoys for non-vocabulary utterances rejection
US6301560B1 (en) 1998-01-05 2001-10-09 Microsoft Corporation Discrete speech recognition system with ballooning active grammar
US6278377B1 (en) 1999-08-25 2001-08-21 Donnelly Corporation Indicator for vehicle accessory
US6226612B1 (en) 1998-01-30 2001-05-01 Motorola, Inc. Method of evaluating an utterance in a speech recognition system
US6385596B1 (en) 1998-02-06 2002-05-07 Liquid Audio, Inc. Secure online music distribution system
US6160883A (en) 1998-03-04 2000-12-12 At&T Corporation Telecommunications network system and method
US6119087A (en) 1998-03-13 2000-09-12 Nuance Communications System architecture for and method of voice processing
US6233559B1 (en) 1998-04-01 2001-05-15 Motorola, Inc. Speech control of multiple applications using applets
US6420975B1 (en) 1999-08-25 2002-07-16 Donnelly Corporation Interior rearview mirror sound processing system
US6173279B1 (en) 1998-04-09 2001-01-09 At&T Corp. Method of using a natural language interface to retrieve information from one or more data resources
US6144938A (en) 1998-05-01 2000-11-07 Sun Microsystems, Inc. Voice user interface with personality
US6574597B1 (en) 1998-05-08 2003-06-03 At&T Corp. Fully expanded context-dependent networks for speech recognition
US6236968B1 (en) 1998-05-14 2001-05-22 International Business Machines Corporation Sleep prevention dialog based car system
US20070094223A1 (en) 1998-05-28 2007-04-26 Lawrence Au Method and system for using contextual meaning in voice to text conversion
CN1652107A (zh) 1998-06-04 2005-08-10 松下电器产业株式会社 语言变换规则产生装置、语言变换装置及程序记录媒体
US6219643B1 (en) 1998-06-26 2001-04-17 Nuance Communications, Inc. Method of analyzing dialogs in a natural language speech recognition system
US6175858B1 (en) 1998-07-13 2001-01-16 At&T Corp. Intelligent network messaging agent and method
US6553372B1 (en) 1998-07-13 2003-04-22 Microsoft Corporation Natural language information retrieval system
US6393428B1 (en) 1998-07-13 2002-05-21 Microsoft Corporation Natural language information retrieval system
US6269336B1 (en) 1998-07-24 2001-07-31 Motorola, Inc. Voice browser for interactive services and methods thereof
US6539348B1 (en) 1998-08-24 2003-03-25 Virtual Research Associates, Inc. Systems and methods for parsing a natural language sentence
US6208964B1 (en) 1998-08-31 2001-03-27 Nortel Networks Limited Method and apparatus for providing unsupervised adaptation of transcriptions
US6499013B1 (en) 1998-09-09 2002-12-24 One Voice Technologies, Inc. Interactive user interface using speech recognition and natural language processing
US6434524B1 (en) 1998-09-09 2002-08-13 One Voice Technologies, Inc. Object interactive user interface using speech recognition and natural language processing
US6049607A (en) 1998-09-18 2000-04-11 Lamar Signal Processing Interference canceling method and apparatus
US6405170B1 (en) 1998-09-22 2002-06-11 Speechworks International, Inc. Method and system of reviewing the behavior of an interactive speech recognition application
US6606598B1 (en) 1998-09-22 2003-08-12 Speechworks International, Inc. Statistical computing and reporting for interactive speech applications
US7003463B1 (en) * 1998-10-02 2006-02-21 International Business Machines Corporation System and method for providing network coordinated conversational services
CN1151488C (zh) 1998-10-02 2004-05-26 国际商业机器公司 通过一般分层对象进行有效语音导航的结构框架
US6185535B1 (en) 1998-10-16 2001-02-06 Telefonaktiebolaget Lm Ericsson (Publ) Voice control of a user interface to service applications
WO2000024131A1 (en) 1998-10-21 2000-04-27 American Calcar, Inc. Positional camera and gps data interchange device
US6453292B2 (en) 1998-10-28 2002-09-17 International Business Machines Corporation Command boundary identifier for conversational natural language
US6028514A (en) 1998-10-30 2000-02-22 Lemelson Jerome H. Personal emergency, safety warning system and method
US6477200B1 (en) 1998-11-09 2002-11-05 Broadcom Corporation Multi-pair gigabit ethernet transceiver
US8121891B2 (en) 1998-11-12 2012-02-21 Accenture Global Services Gmbh Personalized product report
US6208972B1 (en) 1998-12-23 2001-03-27 Richard Grant Method for integrating computer processes with an interface controlled by voice actuated grammars
US6195651B1 (en) 1998-11-19 2001-02-27 Andersen Consulting Properties Bv System, method and article of manufacture for a tuned user application experience
US6246981B1 (en) 1998-11-25 2001-06-12 International Business Machines Corporation Natural language task-oriented dialog manager and method
US7881936B2 (en) 1998-12-04 2011-02-01 Tegic Communications, Inc. Multimodal disambiguation of speech recognition
US6430285B1 (en) 1998-12-15 2002-08-06 At&T Corp. Method and apparatus for an automated caller interaction system
US6721001B1 (en) 1998-12-16 2004-04-13 International Business Machines Corporation Digital camera with voice recognition annotation
US6233556B1 (en) 1998-12-16 2001-05-15 Nuance Communications Voice processing and verification system
US6754485B1 (en) 1998-12-23 2004-06-22 American Calcar Inc. Technique for effectively providing maintenance and information to vehicles
US6570555B1 (en) 1998-12-30 2003-05-27 Fuji Xerox Co., Ltd. Method and apparatus for embodied conversational characters with multimodal input/output in an interface device
US6851115B1 (en) 1999-01-05 2005-02-01 Sri International Software-based architecture for communication and cooperation among distributed electronic agents
US6742021B1 (en) 1999-01-05 2004-05-25 Sri International, Inc. Navigating network-based electronic information using spoken input with multimodal error feedback
US6523061B1 (en) 1999-01-05 2003-02-18 Sri International, Inc. System, method, and article of manufacture for agent-based navigation in a speech-based data navigation system
US7036128B1 (en) 1999-01-05 2006-04-25 Sri International Offices Using a community of distributed electronic agents to support a highly mobile, ambient computing environment
US6757718B1 (en) 1999-01-05 2004-06-29 Sri International Mobile navigation of network-based electronic information using spoken input
US6429813B2 (en) 1999-01-14 2002-08-06 Navigation Technologies Corp. Method and system for providing end-user preferences with a navigation system
US6567797B1 (en) 1999-01-26 2003-05-20 Xerox Corporation System and method for providing recommendations based on multi-modal user clusters
WO2000045375A1 (en) 1999-01-27 2000-08-03 Kent Ridge Digital Labs Method and apparatus for voice annotation and retrieval of multimedia data
US6556970B1 (en) 1999-01-28 2003-04-29 Denso Corporation Apparatus for determining appropriate series of words carrying information to be recognized
US6278968B1 (en) 1999-01-29 2001-08-21 Sony Corporation Method and apparatus for adaptive speech recognition hypothesis construction and selection in a spoken language translation system
TWM253017U (en) 1999-02-03 2004-12-11 Matsushita Electric Ind Co Ltd Emergency reporting apparatus emergency reporting network system
US6430531B1 (en) 1999-02-04 2002-08-06 Soliloquy, Inc. Bilateral speech system
US6643620B1 (en) 1999-03-15 2003-11-04 Matsushita Electric Industrial Co., Ltd. Voice activated controller for recording and retrieving audio/video programs
JP4176228B2 (ja) 1999-03-15 2008-11-05 株式会社東芝 自然言語対話装置及び自然言語対話方法
US6631346B1 (en) 1999-04-07 2003-10-07 Matsushita Electric Industrial Co., Ltd. Method and apparatus for natural language parsing using multiple passes and tags
US6233561B1 (en) 1999-04-12 2001-05-15 Matsushita Electric Industrial Co., Ltd. Method for goal-oriented speech translation in hand-held devices using meaning extraction and dialogue
US6408272B1 (en) 1999-04-12 2002-06-18 General Magic, Inc. Distributed voice user interface
US6570964B1 (en) 1999-04-16 2003-05-27 Nuance Communications Technique for recognizing telephone numbers and other spoken information embedded in voice messages stored in a voice messaging system
US6434523B1 (en) 1999-04-23 2002-08-13 Nuance Communications Creating and editing grammars for speech recognition graphically
US6314402B1 (en) 1999-04-23 2001-11-06 Nuance Communications Method and apparatus for creating modifiable and combinable speech objects for acquiring information from a speaker in an interactive voice response system
US6804638B2 (en) 1999-04-30 2004-10-12 Recent Memory Incorporated Device and method for selective recall and preservation of events prior to decision to record the events
US6356869B1 (en) 1999-04-30 2002-03-12 Nortel Networks Limited Method and apparatus for discourse management
US6505155B1 (en) 1999-05-06 2003-01-07 International Business Machines Corporation Method and system for automatically adjusting prompt feedback based on predicted recognition accuracy
US6308151B1 (en) 1999-05-14 2001-10-23 International Business Machines Corp. Method and system using a speech recognition system to dictate a body of text in response to an available body of text
US6604075B1 (en) 1999-05-20 2003-08-05 Lucent Technologies Inc. Web-based voice dialog interface
GB9911971D0 (en) * 1999-05-21 1999-07-21 Canon Kk A system, a server for a system and a machine for use in a system
US6584439B1 (en) 1999-05-21 2003-06-24 Winbond Electronics Corporation Method and apparatus for controlling voice controlled devices
US20020032564A1 (en) 2000-04-19 2002-03-14 Farzad Ehsani Phrase-based dialogue modeling with particular application to creating a recognition grammar for a voice-controlled user interface
US20020107694A1 (en) 1999-06-07 2002-08-08 Traptec Corporation Voice-recognition safety system for aircraft and method of using the same
US6374214B1 (en) 1999-06-24 2002-04-16 International Business Machines Corp. Method and apparatus for excluding text phrases during re-dictation in a speech recognition system
DE60026637T2 (de) 1999-06-30 2006-10-05 International Business Machines Corp. Verfahren zur Erweiterung des Wortschatzes eines Spracherkennungssystems
US6321196B1 (en) 1999-07-02 2001-11-20 International Business Machines Corporation Phonetic spelling for speech recognition
US6377913B1 (en) * 1999-08-13 2002-04-23 International Business Machines Corporation Method and system for multi-client access to a dialog system
US7069220B2 (en) 1999-08-13 2006-06-27 International Business Machines Corporation Method for determining and maintaining dialog focus in a conversational speech system
US6513006B2 (en) 1999-08-26 2003-01-28 Matsushita Electronic Industrial Co., Ltd. Automatic control of household activity using speech recognition and natural language
US6901366B1 (en) 1999-08-26 2005-05-31 Matsushita Electric Industrial Co., Ltd. System and method for assessing TV-related information over the internet
US6415257B1 (en) 1999-08-26 2002-07-02 Matsushita Electric Industrial Co., Ltd. System for identifying and adapting a TV-user profile by means of speech technology
EP1083545A3 (en) 1999-09-09 2001-09-26 Xanavi Informatics Corporation Voice recognition of proper names in a navigation apparatus
US6658388B1 (en) 1999-09-10 2003-12-02 International Business Machines Corporation Personality generator for conversational systems
US6850603B1 (en) 1999-09-13 2005-02-01 Microstrategy, Incorporated System and method for the creation and automatic deployment of personalized dynamic and interactive voice services
US7340040B1 (en) 1999-09-13 2008-03-04 Microstrategy, Incorporated System and method for real-time, personalized, dynamic, interactive voice services for corporate-analysis related information
US6631351B1 (en) 1999-09-14 2003-10-07 Aidentity Matrix Smart toys
US6601026B2 (en) 1999-09-17 2003-07-29 Discern Communications, Inc. Information retrieval by natural language querying
US6587858B1 (en) 1999-09-30 2003-07-01 Steven Paul Strazza Systems and methods for the control of dynamic data and request criteria in a data repository
US6868385B1 (en) 1999-10-05 2005-03-15 Yomobile, Inc. Method and apparatus for the provision of information signals based upon speech recognition
US6937977B2 (en) 1999-10-05 2005-08-30 Fastmobile, Inc. Method and apparatus for processing an input speech signal during presentation of an output audio signal
US6442522B1 (en) 1999-10-12 2002-08-27 International Business Machines Corporation Bi-directional natural language system for interfacing with multiple back-end applications
US6721697B1 (en) 1999-10-18 2004-04-13 Sony Corporation Method and system for reducing lexical ambiguity
US7447635B1 (en) 1999-10-19 2008-11-04 Sony Corporation Natural language interface control system
US6581103B1 (en) 1999-10-22 2003-06-17 Dedicated Radio, Llc Method for internet radio broadcasting including listener requests of audio and/or video files with input dedications
US6594367B1 (en) 1999-10-25 2003-07-15 Andrea Electronics Corporation Super directional beamforming design and implementation
CA2389186A1 (en) 1999-10-29 2001-05-03 British Telecommunications Public Limited Company Method and apparatus for processing queries
US6622119B1 (en) 1999-10-30 2003-09-16 International Business Machines Corporation Adaptive command predictor and method for a natural language dialog system
US6522746B1 (en) 1999-11-03 2003-02-18 Tellabs Operations, Inc. Synchronization of voice boundaries and their use by echo cancellers in a voice processing system
US6681206B1 (en) 1999-11-05 2004-01-20 At&T Corporation Method for generating morphemes
US8482535B2 (en) 1999-11-08 2013-07-09 Apple Inc. Programmable tactile touch screen displays and man-machine interfaces for improved vehicle instrumentation and telematics
US6615172B1 (en) 1999-11-12 2003-09-02 Phoenix Solutions, Inc. Intelligent query engine for processing voice based queries
US7392185B2 (en) 1999-11-12 2008-06-24 Phoenix Solutions, Inc. Speech based learning/training system using semantic decoding
US9076448B2 (en) * 1999-11-12 2015-07-07 Nuance Communications, Inc. Distributed real time speech recognition system
US6633846B1 (en) 1999-11-12 2003-10-14 Phoenix Solutions, Inc. Distributed realtime speech recognition system
US6418210B1 (en) 1999-11-29 2002-07-09 At&T Corp Method and apparatus for providing information between a calling network and a called network
US6751612B1 (en) 1999-11-29 2004-06-15 Xerox Corporation User query generate search results that rank set of servers where ranking is based on comparing content on each server with user query, frequency at which content on each server is altered using web crawler in a search engine
GB9928420D0 (en) * 1999-12-02 2000-01-26 Ibm Interactive voice response system
US6288319B1 (en) 1999-12-02 2001-09-11 Gary Catona Electronic greeting card with a custom audio mix
US6591239B1 (en) 1999-12-09 2003-07-08 Steris Inc. Voice controlled surgical suite
US6598018B1 (en) 1999-12-15 2003-07-22 Matsushita Electric Industrial Co., Ltd. Method for natural dialog interface to car devices
US6976229B1 (en) 1999-12-16 2005-12-13 Ricoh Co., Ltd. Method and apparatus for storytelling with digital photographs
US6832230B1 (en) 1999-12-22 2004-12-14 Nokia Corporation Apparatus and associated method for downloading an application with a variable lifetime to a mobile terminal
US6920421B2 (en) 1999-12-28 2005-07-19 Sony Corporation Model adaptive apparatus for performing adaptation of a model used in pattern recognition considering recentness of a received pattern data
US6678680B1 (en) 2000-01-06 2004-01-13 Mark Woo Music search engine
US6701294B1 (en) * 2000-01-19 2004-03-02 Lucent Technologies, Inc. User interface for translating natural language inquiries into database queries and data presentations
US6829603B1 (en) 2000-02-02 2004-12-07 International Business Machines Corp. System, method and program product for interactive natural dialog
US6560590B1 (en) 2000-02-14 2003-05-06 Kana Software, Inc. Method and apparatus for multiple tiered matching of natural language queries to positions in a text corpus
US6434529B1 (en) 2000-02-16 2002-08-13 Sun Microsystems, Inc. System and method for referencing object instances and invoking methods on those object instances from within a speech recognition grammar
EP3367268A1 (en) 2000-02-22 2018-08-29 Nokia Technologies Oy Spatially coding and displaying information
US7110951B1 (en) 2000-03-03 2006-09-19 Dorothy Lemelson, legal representative System and method for enhancing speech intelligibility for the hearing impaired
US6466654B1 (en) 2000-03-06 2002-10-15 Avaya Technology Corp. Personal virtual assistant with semantic tagging
US6510417B1 (en) 2000-03-21 2003-01-21 America Online, Inc. System and method for voice access to internet-based information
US7974875B1 (en) 2000-03-21 2011-07-05 Aol Inc. System and method for using voice over a telephone to access, process, and carry out transactions over the internet
US6868380B2 (en) 2000-03-24 2005-03-15 Eliza Corporation Speech recognition system and method for generating phonotic estimates
ATE494610T1 (de) 2000-03-24 2011-01-15 Eliza Corp Spracherkennung
AU2001249768A1 (en) 2000-04-02 2001-10-15 Tangis Corporation Soliciting information based on a computer user's context
US6980092B2 (en) 2000-04-06 2005-12-27 Gentex Corporation Vehicle rearview mirror assembly incorporating a communication system
US6578022B1 (en) 2000-04-18 2003-06-10 Icplanet Corporation Interactive intelligent searching with executable suggestions
US6556973B1 (en) 2000-04-19 2003-04-29 Voxi Ab Conversion between data representation formats
US6560576B1 (en) 2000-04-25 2003-05-06 Nuance Communications Method and apparatus for providing active help to a user of a voice-enabled application
US20010054087A1 (en) 2000-04-26 2001-12-20 Michael Flom Portable internet services
AU2001259446A1 (en) 2000-05-02 2001-11-12 Dragon Systems, Inc. Error correction in speech recognition
CN1252975C (zh) 2000-05-16 2006-04-19 约翰·塔歇罗 提供地理目标信息和广告的方法和系统
CN100477704C (zh) 2000-05-26 2009-04-08 皇家菲利浦电子有限公司 用于与自适应波束形成组合的回声抵消的方法和设备
US6487495B1 (en) 2000-06-02 2002-11-26 Navigation Technologies Corporation Navigation applications using related location-referenced keywords
US7082469B2 (en) 2000-06-09 2006-07-25 Gold Mustache Publishing, Inc. Method and system for electronic song dedication
WO2001097558A2 (en) 2000-06-13 2001-12-20 Gn Resound Corporation Fixed polar-pattern-based adaptive directionality systems
US6990513B2 (en) 2000-06-22 2006-01-24 Microsoft Corporation Distributed computing services platform
JP3567864B2 (ja) 2000-07-21 2004-09-22 株式会社デンソー 音声認識装置及び記録媒体
US7143039B1 (en) 2000-08-11 2006-11-28 Tellme Networks, Inc. Providing menu and other services for an information processing system using a telephone or other audio interface
JP2004505322A (ja) 2000-07-28 2004-02-19 シーメンス ヴィディーオー オートモーティヴ コーポレイション 遠隔操作系のユーザーインターフェイス
US7092928B1 (en) 2000-07-31 2006-08-15 Quantum Leap Research, Inc. Intelligent portal engine
US7027975B1 (en) 2000-08-08 2006-04-11 Object Services And Consulting, Inc. Guided natural language interface system and method
US7653748B2 (en) 2000-08-10 2010-01-26 Simplexity, Llc Systems, methods and computer program products for integrating advertising within web content
US6574624B1 (en) 2000-08-18 2003-06-03 International Business Machines Corporation Automatic topic identification and switch for natural language search of textual document collections
AU2001283579A1 (en) 2000-08-21 2002-03-04 Yahoo, Inc. Method and system of interpreting and presenting web content using a voice browser
CN1226717C (zh) 2000-08-30 2005-11-09 国际商业机器公司 自动新词提取方法和系统
US7062488B1 (en) 2000-08-30 2006-06-13 Richard Reisman Task/domain segmentation in applying feedback to command control
EP1184841A1 (de) 2000-08-31 2002-03-06 Siemens Aktiengesellschaft Sprachgesteuerte Anordnung und Verfahren zur Spracheingabe und -erkennung
US6813341B1 (en) 2000-08-31 2004-11-02 Ivoice, Inc. Voice activated/voice responsive item locator
JP2004508636A (ja) 2000-09-07 2004-03-18 テレフオンアクチーボラゲット エル エム エリクソン(パブル) 情報提供システム及びその制御方法
US20040205671A1 (en) 2000-09-13 2004-10-14 Tatsuya Sukehiro Natural-language processing system
CA2423200A1 (en) 2000-09-21 2002-03-28 American Calcar Inc. Technique for operating a vehicle effectively and safely
US7085708B2 (en) 2000-09-23 2006-08-01 Ravenflow, Inc. Computer system with natural language to machine language translator
US6362748B1 (en) 2000-09-27 2002-03-26 Lite Vision Corporation System for communicating among vehicles and a communication system control center
US6704576B1 (en) 2000-09-27 2004-03-09 At&T Corp. Method and system for communicating multimedia content in a unicast, multicast, simulcast or broadcast environment
JP2003044708A (ja) 2000-10-02 2003-02-14 Omron Corp 情報仲介システムとそれに用いられる情報仲介方法
US6922670B2 (en) 2000-10-24 2005-07-26 Sanyo Electric Co., Ltd. User support apparatus and system using agents
US6721706B1 (en) 2000-10-30 2004-04-13 Koninklijke Philips Electronics N.V. Environment-responsive user interface/entertainment device that simulates personal interaction
US6795808B1 (en) 2000-10-30 2004-09-21 Koninklijke Philips Electronics N.V. User interface/entertainment device that simulates personal interaction and charges external database with relevant data
US6934756B2 (en) 2000-11-01 2005-08-23 International Business Machines Corporation Conversational networking via transport, coding and control conversational protocols
GB0027178D0 (en) 2000-11-07 2000-12-27 Canon Kk Speech processing system
US6941266B1 (en) * 2000-11-15 2005-09-06 At&T Corp. Method and system for predicting problematic dialog situations in a task classification system
US6735592B1 (en) 2000-11-16 2004-05-11 Discern Communications System, method, and computer program product for a network-based content exchange system
US7013308B1 (en) 2000-11-28 2006-03-14 Semscript Ltd. Knowledge storage and retrieval system and method
US20020065568A1 (en) 2000-11-30 2002-05-30 Silfvast Robert Denton Plug-in modules for digital signal processor functionalities
US6973429B2 (en) 2000-12-04 2005-12-06 A9.Com, Inc. Grammar generation for voice-based searches
US7016847B1 (en) 2000-12-08 2006-03-21 Ben Franklin Patent Holdings L.L.C. Open architecture for a voice user interface
US6456711B1 (en) 2000-12-12 2002-09-24 At&T Corp. Method for placing a call intended for an enhanced network user on hold while the enhanced network user is unavailable to take the call using a distributed feature architecture
US20020082911A1 (en) 2000-12-22 2002-06-27 Dunn Charles L. Online revenue sharing
US6973427B2 (en) 2000-12-26 2005-12-06 Microsoft Corporation Method for adding phonetic descriptions to a speech recognition lexicon
US20020087326A1 (en) 2000-12-29 2002-07-04 Lee Victor Wai Leung Computer-implemented web page summarization method and system
DE10101282A1 (de) 2001-01-12 2002-07-18 Siemens Ag Notrufmeldung mittels mobiler Telekommunikationsgeräte
US6751591B1 (en) 2001-01-22 2004-06-15 At&T Corp. Method and system for predicting understanding errors in a task classification system
US7069207B2 (en) 2001-01-26 2006-06-27 Microsoft Corporation Linguistically intelligent text compression
US7206418B2 (en) 2001-02-12 2007-04-17 Fortemedia, Inc. Noise suppression for a wireless communication device
EP1231788A1 (en) 2001-02-12 2002-08-14 Koninklijke Philips Electronics N.V. Arrangement for distributing content, profiling center, receiving device and method
US6549629B2 (en) 2001-02-21 2003-04-15 Digisonix Llc DVE system with normalized selection
US6754627B2 (en) 2001-03-01 2004-06-22 International Business Machines Corporation Detecting speech recognition errors in an embedded speech recognition system
US20020173961A1 (en) 2001-03-09 2002-11-21 Guerra Lisa M. System, method and computer program product for dynamic, robust and fault tolerant audio output in a speech recognition framework
US7024364B2 (en) 2001-03-09 2006-04-04 Bevocal, Inc. System, method and computer program product for looking up business addresses and directions based on a voice dial-up session
US20020133402A1 (en) 2001-03-13 2002-09-19 Scott Faber Apparatus and method for recruiting, communicating with, and paying participants of interactive advertising
US7574362B2 (en) 2001-03-14 2009-08-11 At&T Intellectual Property Ii, L.P. Method for automated sentence planning in a task classification system
US7729918B2 (en) 2001-03-14 2010-06-01 At&T Intellectual Property Ii, Lp Trainable sentence planning system
WO2002073453A1 (en) 2001-03-14 2002-09-19 At & T Corp. A trainable sentence planning system
US6801897B2 (en) 2001-03-28 2004-10-05 International Business Machines Corporation Method of providing concise forms of natural commands
US7406421B2 (en) 2001-10-26 2008-07-29 Intellisist Inc. Systems and methods for reviewing informational content in a vehicle
US8175886B2 (en) 2001-03-29 2012-05-08 Intellisist, Inc. Determination of signal-processing approach based on signal destination characteristics
JP2002358095A (ja) 2001-03-30 2002-12-13 Sony Corp 音声処理装置および音声処理方法、並びにプログラムおよび記録媒体
WO2002079896A2 (en) 2001-03-30 2002-10-10 British Telecommunications Public Limited Company Multi-modal interface
US6996531B2 (en) 2001-03-30 2006-02-07 Comverse Ltd. Automated database assistance using a telephone for a speech based or text based multimedia communication mode
FR2822994B1 (fr) 2001-03-30 2004-05-21 Bouygues Telecom Sa Assistance au conducteur d'un vehicule automobile
US6885989B2 (en) 2001-04-02 2005-04-26 International Business Machines Corporation Method and system for collaborative speech recognition for small-area network
US6856990B2 (en) 2001-04-09 2005-02-15 Intel Corporation Network dedication system
US7437295B2 (en) 2001-04-27 2008-10-14 Accenture Llp Natural language processing for a location-based services system
US7970648B2 (en) 2001-04-27 2011-06-28 Accenture Global Services Limited Advertising campaign and business listing management for a location-based services system
US6950821B2 (en) 2001-05-04 2005-09-27 Sun Microsystems, Inc. System and method for resolving distributed network search queries to information providers
US6804684B2 (en) 2001-05-07 2004-10-12 Eastman Kodak Company Method for associating semantic information with multiple images in an image database environment
US6944594B2 (en) 2001-05-30 2005-09-13 Bellsouth Intellectual Property Corporation Multi-context conversational environment system and method
JP2003005897A (ja) 2001-06-20 2003-01-08 Alpine Electronics Inc 情報入力方法および装置
US6801604B2 (en) 2001-06-25 2004-10-05 International Business Machines Corporation Universal IP-based and scalable architectures across conversational applications using web services for speech and audio processing resources
US20020198714A1 (en) 2001-06-26 2002-12-26 Guojun Zhou Statistical spoken dialog system
US20100029261A1 (en) 2001-06-27 2010-02-04 John Mikkelsen Virtual wireless data cable method, apparatus and system
US20050234727A1 (en) 2001-07-03 2005-10-20 Leo Chiu Method and apparatus for adapting a voice extensible markup language-enabled voice system for natural speech recognition and system response
US6983307B2 (en) 2001-07-11 2006-01-03 Kirusa, Inc. Synchronization among plural browsers
US7123727B2 (en) 2001-07-18 2006-10-17 Agere Systems Inc. Adaptive close-talking differential microphone array
US7283951B2 (en) 2001-08-14 2007-10-16 Insightful Corporation Method and system for enhanced data searching
US6757544B2 (en) 2001-08-15 2004-06-29 Motorola, Inc. System and method for determining a location relevant to a communication device and/or its associated user
US7920682B2 (en) 2001-08-21 2011-04-05 Byrne William J Dynamic interactive voice interface
US7305381B1 (en) 2001-09-14 2007-12-04 Ricoh Co., Ltd Asynchronous unconscious retrieval in a network of information appliances
US6959276B2 (en) 2001-09-27 2005-10-25 Microsoft Corporation Including the category of environmental noise when processing speech signals
US6721633B2 (en) 2001-09-28 2004-04-13 Robert Bosch Gmbh Method and device for interfacing a driver information system using a voice portal server
US7289606B2 (en) 2001-10-01 2007-10-30 Sandeep Sibal Mode-swapping in multi-modal telephonic applications
JP3997459B2 (ja) 2001-10-02 2007-10-24 株式会社日立製作所 音声入力システムおよび音声ポータルサーバおよび音声入力端末
US7254384B2 (en) 2001-10-03 2007-08-07 Accenture Global Services Gmbh Multi-modal messaging
US7640006B2 (en) 2001-10-03 2009-12-29 Accenture Global Services Gmbh Directory assistance with multi-modal messaging
JP4065936B2 (ja) 2001-10-09 2008-03-26 独立行政法人情報通信研究機構 機械学習法を用いた言語解析処理システムおよび機械学習法を用いた言語省略解析処理システム
US6501834B1 (en) 2001-11-21 2002-12-31 At&T Corp. Message sender status monitor
US20030101054A1 (en) 2001-11-27 2003-05-29 Ncc, Llc Integrated system and method for electronic speech recognition and transcription
US7165028B2 (en) 2001-12-12 2007-01-16 Texas Instruments Incorporated Method of speech recognition resistant to convolutive distortion and additive distortion
GB2383247A (en) 2001-12-13 2003-06-18 Hewlett Packard Co Multi-modal picture allowing verbal interaction between a user and the picture
US7231343B1 (en) 2001-12-20 2007-06-12 Ianywhere Solutions, Inc. Synonyms mechanism for natural language systems
US20030120493A1 (en) 2001-12-21 2003-06-26 Gupta Sunil K. Method and system for updating and customizing recognition vocabulary
EP1324274A3 (en) 2001-12-28 2005-11-02 Matsushita Electric Industrial Co., Ltd. Vehicle information recording system
US7203644B2 (en) 2001-12-31 2007-04-10 Intel Corporation Automating tuning of speech recognition systems
US7493259B2 (en) 2002-01-04 2009-02-17 Siebel Systems, Inc. Method for accessing data via voice
US7493559B1 (en) 2002-01-09 2009-02-17 Ricoh Co., Ltd. System and method for direct multi-modal annotation of objects
US7117200B2 (en) 2002-01-11 2006-10-03 International Business Machines Corporation Synthesizing information-bearing content from multiple channels
US7111248B2 (en) 2002-01-15 2006-09-19 Openwave Systems Inc. Alphanumeric information input method
US7536297B2 (en) 2002-01-22 2009-05-19 International Business Machines Corporation System and method for hybrid text mining for finding abbreviations and their definitions
US7054817B2 (en) 2002-01-25 2006-05-30 Canon Europa N.V. User interface for speech model generation and testing
US20030144846A1 (en) 2002-01-31 2003-07-31 Denenberg Lawrence A. Method and system for modifying the behavior of an application based upon the application's grammar
US7130390B2 (en) 2002-02-01 2006-10-31 Microsoft Corporation Audio messaging system and method
US7177814B2 (en) 2002-02-07 2007-02-13 Sap Aktiengesellschaft Dynamic grammar for voice-enabled applications
US7058890B2 (en) 2002-02-13 2006-06-06 Siebel Systems, Inc. Method and system for enabling connectivity to a data system
US8249880B2 (en) 2002-02-14 2012-08-21 Intellisist, Inc. Real-time display of system instructions
US7587317B2 (en) 2002-02-15 2009-09-08 Microsoft Corporation Word training interface
JP3974419B2 (ja) 2002-02-18 2007-09-12 株式会社日立製作所 音声入力を用いた情報取得方法及び情報取得システム
US6704396B2 (en) 2002-02-27 2004-03-09 Sbc Technology Resources, Inc. Multi-modal communications method
EP1478982B1 (en) 2002-02-27 2014-11-05 Y Indeed Consulting L.L.C. System and method that facilitates customizing media
US7016849B2 (en) 2002-03-25 2006-03-21 Sri International Method and apparatus for providing speech-driven routing between spoken language applications
US7136875B2 (en) 2002-09-24 2006-11-14 Google, Inc. Serving advertisements based on content
US7072834B2 (en) 2002-04-05 2006-07-04 Intel Corporation Adapting to adverse acoustic environment in speech processing using playback training data
US7197460B1 (en) 2002-04-23 2007-03-27 At&T Corp. System for handling frequently asked questions in a natural language dialog service
US6877001B2 (en) 2002-04-25 2005-04-05 Mitsubishi Electric Research Laboratories, Inc. Method and system for retrieving documents with spoken queries
US7167568B2 (en) 2002-05-02 2007-01-23 Microsoft Corporation Microphone array signal enhancement
US20030212558A1 (en) 2002-05-07 2003-11-13 Matula Valentine C. Method and apparatus for distributed interactive voice processing
US20030212550A1 (en) 2002-05-10 2003-11-13 Ubale Anil W. Method, apparatus, and system for improving speech quality of voice-over-packets (VOP) systems
US20030212562A1 (en) 2002-05-13 2003-11-13 General Motors Corporation Manual barge-in for server-based in-vehicle voice recognition systems
JP2003329477A (ja) 2002-05-15 2003-11-19 Pioneer Electronic Corp ナビゲーション装置及び対話型情報提供プログラム
US7107210B2 (en) 2002-05-20 2006-09-12 Microsoft Corporation Method of noise reduction based on dynamic aspects of speech
US7127400B2 (en) 2002-05-22 2006-10-24 Bellsouth Intellectual Property Corporation Methods and systems for personal interactive voice response
US7546382B2 (en) 2002-05-28 2009-06-09 International Business Machines Corporation Methods and systems for authoring of mixed-initiative multi-modal interactions and related browsing mechanisms
US20040140989A1 (en) 2002-05-28 2004-07-22 John Papageorge Content subscription and delivery service
US7398209B2 (en) 2002-06-03 2008-07-08 Voicebox Technologies, Inc. Systems and methods for responding to natural language speech utterance
US7143037B1 (en) 2002-06-12 2006-11-28 Cisco Technology, Inc. Spelling words using an arbitrary phonetic alphabet
US7502737B2 (en) 2002-06-24 2009-03-10 Intel Corporation Multi-pass recognition of spoken dialogue
US20050021470A1 (en) 2002-06-25 2005-01-27 Bose Corporation Intelligent music track selection
US7177816B2 (en) 2002-07-05 2007-02-13 At&T Corp. System and method of handling problematic input during context-sensitive help for multi-modal dialog systems
US7177815B2 (en) 2002-07-05 2007-02-13 At&T Corp. System and method of context-sensitive help for multi-modal dialog systems
US7693720B2 (en) 2002-07-15 2010-04-06 Voicebox Technologies, Inc. Mobile systems and methods for responding to natural language speech utterance
EP1391830A1 (fr) 2002-07-19 2004-02-25 Albert Inc. S.A. Système d'extraction d'informations dans un texte en langage naturel
EP1394692A1 (en) 2002-08-05 2004-03-03 Alcatel Method, terminal, browser application, and mark-up language for multimodal interaction between a user and a terminal
US7236923B1 (en) 2002-08-07 2007-06-26 Itt Manufacturing Enterprises, Inc. Acronym extraction system and method of identifying acronyms and extracting corresponding expansions from text
US6741931B1 (en) 2002-09-05 2004-05-25 Daimlerchrysler Corporation Vehicle navigation system with off-board server
US7184957B2 (en) 2002-09-25 2007-02-27 Toyota Infotechnology Center Co., Ltd. Multiple pass speech recognition method and system
US7328155B2 (en) 2002-09-25 2008-02-05 Toyota Infotechnology Center Co., Ltd. Method and system for speech recognition using grammar weighted based upon location information
US20030115062A1 (en) 2002-10-29 2003-06-19 Walker Marilyn A. Method for automated sentence planning
US8321427B2 (en) 2002-10-31 2012-11-27 Promptu Systems Corporation Method and apparatus for generation and augmentation of search terms from external and internal sources
US6739556B1 (en) 2002-11-20 2004-05-25 Raytheon Company Method and apparatus for providing an aircraft emergency safety control system
US6834265B2 (en) 2002-12-13 2004-12-21 Motorola, Inc. Method and apparatus for selective speech recognition
US7890324B2 (en) 2002-12-19 2011-02-15 At&T Intellectual Property Ii, L.P. Context-sensitive interface widgets for multi-modal dialog systems
US20040158555A1 (en) 2003-02-11 2004-08-12 Terradigtal Systems Llc. Method for managing a collection of media objects
GB2398913B (en) 2003-02-27 2005-08-17 Motorola Inc Noise estimation in speech recognition
JP4103639B2 (ja) 2003-03-14 2008-06-18 セイコーエプソン株式会社 音響モデル作成方法および音響モデル作成装置ならびに音声認識装置
US7146319B2 (en) 2003-03-31 2006-12-05 Novauris Technologies Ltd. Phonetically based speech recognition system and method
US20050021826A1 (en) 2003-04-21 2005-01-27 Sunil Kumar Gateway controller for a multimodal system that provides inter-communication among different data and voice servers through various mobile devices, and interface for that controller
US7421393B1 (en) 2004-03-01 2008-09-02 At&T Corp. System for developing a dialog manager using modular spoken-dialog components
US20050015256A1 (en) 2003-05-29 2005-01-20 Kargman James B. Method and apparatus for ordering food items, and in particular, pizza
JP2005003926A (ja) 2003-06-11 2005-01-06 Sony Corp 情報処理装置および方法、並びにプログラム
KR100577387B1 (ko) 2003-08-06 2006-05-10 삼성전자주식회사 음성 대화 시스템에서의 음성 인식 오류 처리 방법 및 장치
US20050043940A1 (en) 2003-08-20 2005-02-24 Marvin Elder Preparing a data source for a natural language query
US7428497B2 (en) 2003-10-06 2008-09-23 Utbk, Inc. Methods and apparatuses for pay-per-call advertising in mobile/wireless applications
US20070162296A1 (en) 2003-10-06 2007-07-12 Utbk, Inc. Methods and apparatuses for audio advertisements
GB0325497D0 (en) 2003-10-31 2003-12-03 Vox Generation Ltd Automated speech application creation deployment and management
US7454608B2 (en) 2003-10-31 2008-11-18 International Business Machines Corporation Resource configuration in multi-modal distributed computing systems
JP2005157494A (ja) 2003-11-20 2005-06-16 Aruze Corp 会話制御装置及び会話制御方法
JP4558308B2 (ja) 2003-12-03 2010-10-06 ニュアンス コミュニケーションズ,インコーポレイテッド 音声認識システム、データ処理装置、そのデータ処理方法及びプログラム
US20050137877A1 (en) 2003-12-17 2005-06-23 General Motors Corporation Method and system for enabling a device function of a vehicle
US7027586B2 (en) 2003-12-18 2006-04-11 Sbc Knowledge Ventures, L.P. Intelligently routing customer communications
US20050137850A1 (en) 2003-12-23 2005-06-23 Intel Corporation Method for automation of programmable interfaces
US7386443B1 (en) 2004-01-09 2008-06-10 At&T Corp. System and method for mobile automatic speech recognition
JP3924583B2 (ja) 2004-02-03 2007-06-06 松下電器産業株式会社 ユーザ適応型装置およびその制御方法
US7542903B2 (en) 2004-02-18 2009-06-02 Fuji Xerox Co., Ltd. Systems and methods for determining predictive models of discourse functions
US20050216254A1 (en) 2004-03-24 2005-09-29 Gupta Anurag K System-resource-based multi-modal input fusion
US20050246174A1 (en) 2004-04-28 2005-11-03 Degolia Richard C Method and system for presenting dynamic commercial content to clients interacting with a voice extensible markup language system
US20050283752A1 (en) 2004-05-17 2005-12-22 Renate Fruchter DiVAS-a cross-media system for ubiquitous gesture-discourse-sketch knowledge capture and reuse
US20060206310A1 (en) 2004-06-29 2006-09-14 Damaka, Inc. System and method for natural language processing in a peer-to-peer hybrid communications network
DE102004037858A1 (de) 2004-08-04 2006-03-16 Harman Becker Automotive Systems Gmbh Navigationssystem mit sprachgesteuerter Angabe von Sonderzielen
US7480618B2 (en) 2004-09-02 2009-01-20 Microsoft Corporation Eliminating interference of noisy modality in a multimodal application
US20060074660A1 (en) 2004-09-29 2006-04-06 France Telecom Method and apparatus for enhancing speech recognition accuracy by using geographic data to filter a set of words
US7376645B2 (en) 2004-11-29 2008-05-20 The Intellection Group, Inc. Multimodal natural language query system and architecture for processing voice and proximity-based queries
US20070214182A1 (en) 2005-01-15 2007-09-13 Outland Research, Llc Establishment-based media and messaging service
US7873654B2 (en) 2005-01-24 2011-01-18 The Intellection Group, Inc. Multimodal natural language query system for processing and analyzing voice and proximity-based queries
US7437297B2 (en) 2005-01-27 2008-10-14 International Business Machines Corporation Systems and methods for predicting consequences of misinterpretation of user commands in automated systems
US7606708B2 (en) 2005-02-01 2009-10-20 Samsung Electronics Co., Ltd. Apparatus, method, and medium for generating grammar network for use in speech recognition and dialogue speech recognition
US7831433B1 (en) 2005-02-03 2010-11-09 Hrl Laboratories, Llc System and method for using context in navigation dialog
US7461059B2 (en) 2005-02-23 2008-12-02 Microsoft Corporation Dynamically updated search results based upon continuously-evolving search query that is based at least in part upon phrase suggestion, search engine uses previous result sets performing additional search tasks
US7283829B2 (en) 2005-03-25 2007-10-16 Cisco Technology, Inc. Management of call requests in multi-modal communication environments
US7813485B2 (en) 2005-05-26 2010-10-12 International Business Machines Corporation System and method for seamlessly integrating an interactive visual menu with an voice menu provided in an interactive voice response system
US7917365B2 (en) 2005-06-16 2011-03-29 Nuance Communications, Inc. Synchronizing visual and speech events in a multimodal application
US7873523B2 (en) 2005-06-30 2011-01-18 Microsoft Corporation Computer implemented method of analyzing recognition results between a user and an interactive application utilizing inferred values instead of transcribed speech
US20070043868A1 (en) 2005-07-07 2007-02-22 V-Enable, Inc. System and method for searching for network-based content in a multi-modal system using spoken keywords
US7424431B2 (en) 2005-07-11 2008-09-09 Stragent, Llc System, method and computer program product for adding voice activation and voice control to a media player
US7640160B2 (en) 2005-08-05 2009-12-29 Voicebox Technologies, Inc. Systems and methods for responding to natural language speech utterance
US7620549B2 (en) 2005-08-10 2009-11-17 Voicebox Technologies, Inc. System and method of supporting adaptive misrecognition in conversational speech
US7949529B2 (en) 2005-08-29 2011-05-24 Voicebox Technologies, Inc. Mobile systems and methods of supporting natural language human-machine interactions
WO2007027989A2 (en) 2005-08-31 2007-03-08 Voicebox Technologies, Inc. Dynamic speech sharpening
US7672852B2 (en) 2005-09-29 2010-03-02 Microsoft Corporation Localization of prompts
US8626588B2 (en) 2005-09-30 2014-01-07 Google Inc. Advertising with audio content
US20070078708A1 (en) 2005-09-30 2007-04-05 Hua Yu Using speech recognition to determine advertisements relevant to audio content and/or audio content relevant to advertisements
US7477909B2 (en) 2005-10-31 2009-01-13 Nuance Communications, Inc. System and method for conducting a search using a wireless mobile device
US7587308B2 (en) 2005-11-21 2009-09-08 Hewlett-Packard Development Company, L.P. Word recognition using ontologies
US20070135101A1 (en) 2005-12-08 2007-06-14 Comverse, Ltd. Enhanced visual IVR capabilities
US8325398B2 (en) 2005-12-22 2012-12-04 Canon Kabushiki Kaisha Image editing system, image management apparatus, and image editing program
US20070186165A1 (en) 2006-02-07 2007-08-09 Pudding Ltd. Method And Apparatus For Electronically Providing Advertisements
EP2011017A4 (en) 2006-03-30 2010-07-07 Stanford Res Inst Int METHOD AND APPARATUS FOR ANNOTATING MULTIMEDIA STREAMS
US7533089B2 (en) 2006-06-27 2009-05-12 International Business Machines Corporation Hybrid approach for query recommendation in conversation systems
CA2657134C (en) 2006-07-10 2016-04-05 Accenture Global Services Gmbh Mobile personal services platform for providing feedback
US8145493B2 (en) 2006-09-11 2012-03-27 Nuance Communications, Inc. Establishing a preferred mode of interaction between a user and a multimodal application
US8086463B2 (en) 2006-09-12 2011-12-27 Nuance Communications, Inc. Dynamically generating a vocal help prompt in a multimodal application
WO2008032329A2 (en) 2006-09-13 2008-03-20 Alon Atsmon Providing content responsive to multimedia signals
US7788084B2 (en) 2006-09-19 2010-08-31 Xerox Corporation Labeling of work of art titles in text for natural language processing
US8073681B2 (en) 2006-10-16 2011-12-06 Voicebox Technologies, Inc. System and method for a cooperative conversational voice user interface
US20080109285A1 (en) 2006-10-26 2008-05-08 Mobile Content Networks, Inc. Techniques for determining relevant advertisements in response to queries
US7805740B2 (en) 2006-11-10 2010-09-28 Audiogate Technologies Ltd. System and method for providing advertisement based on speech recognition
US7640272B2 (en) 2006-12-07 2009-12-29 Microsoft Corporation Using automated content analysis for audio/video content consumption
US7818176B2 (en) 2007-02-06 2010-10-19 Voicebox Technologies, Inc. System and method for selecting and presenting advertisements based on natural language processing of voice-based input
US8909532B2 (en) 2007-03-23 2014-12-09 Nuance Communications, Inc. Supporting multi-lingual user interaction with a multimodal application
US8060367B2 (en) 2007-06-26 2011-11-15 Targus Information Corporation Spatially indexed grammar and methods of use
US8219399B2 (en) 2007-07-11 2012-07-10 Garmin Switzerland Gmbh Automated speech recognition (ASR) tiling
DE102007044792B4 (de) 2007-09-19 2012-12-13 Siemens Ag Verfahren, Steuergerät und System zur Steuerung oder Bedienung
US8140335B2 (en) 2007-12-11 2012-03-20 Voicebox Technologies, Inc. System and method for providing a natural language voice user interface in an integrated voice navigation services environment
US8077975B2 (en) 2008-02-26 2011-12-13 Microsoft Corporation Handwriting symbol recognition accuracy using speech input
US8255224B2 (en) 2008-03-07 2012-08-28 Google Inc. Voice recognition grammar selection based on context
US20090276700A1 (en) 2008-04-30 2009-11-05 Nokia Corporation Method, apparatus, and computer program product for determining user status indicators
US8909810B2 (en) 2008-08-05 2014-12-09 Isabella Products, Inc. Systems and methods for multimedia content sharing
US8224652B2 (en) 2008-09-26 2012-07-17 Microsoft Corporation Speech and text driven HMM-based body animation synthesis
US8326637B2 (en) 2009-02-20 2012-12-04 Voicebox Technologies, Inc. System and method for processing multi-modal device interactions in a natural language voice services environment
US9502025B2 (en) 2009-11-10 2016-11-22 Voicebox Technologies Corporation System and method for providing a natural language content dedication service
US9171541B2 (en) 2009-11-10 2015-10-27 Voicebox Technologies Corporation System and method for hybrid processing in a natural language voice services environment

Cited By (223)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9646614B2 (en) 2000-03-16 2017-05-09 Apple Inc. Fast, language-independent method for user authentication by voice
US10318871B2 (en) 2005-09-08 2019-06-11 Apple Inc. Method and apparatus for building an intelligent automated assistant
US11928604B2 (en) 2005-09-08 2024-03-12 Apple Inc. Method and apparatus for building an intelligent automated assistant
US11671920B2 (en) 2007-04-03 2023-06-06 Apple Inc. Method and system for operating a multifunction portable electronic device using voice-activation
US11023513B2 (en) 2007-12-20 2021-06-01 Apple Inc. Method and apparatus for searching using an active ontology
US10381016B2 (en) 2008-01-03 2019-08-13 Apple Inc. Methods and apparatus for altering audio output signals
US9865248B2 (en) 2008-04-05 2018-01-09 Apple Inc. Intelligent text-to-speech conversion
US9626955B2 (en) 2008-04-05 2017-04-18 Apple Inc. Intelligent text-to-speech conversion
US10553216B2 (en) 2008-05-27 2020-02-04 Oracle International Corporation System and method for an integrated, multi-modal, multi-device natural language voice services environment
US10108612B2 (en) 2008-07-31 2018-10-23 Apple Inc. Mobile device having human language translation capability with positional feedback
US11348582B2 (en) 2008-10-02 2022-05-31 Apple Inc. Electronic devices with voice command and contextual data processing capabilities
US10643611B2 (en) 2008-10-02 2020-05-05 Apple Inc. Electronic devices with voice command and contextual data processing capabilities
US10795541B2 (en) 2009-06-05 2020-10-06 Apple Inc. Intelligent organization of tasks items
US11080012B2 (en) 2009-06-05 2021-08-03 Apple Inc. Interface for a virtual digital assistant
US10283110B2 (en) 2009-07-02 2019-05-07 Apple Inc. Methods and apparatuses for automatic speech recognition
US10706841B2 (en) 2010-01-18 2020-07-07 Apple Inc. Task flow identification based on user intent
US9548050B2 (en) 2010-01-18 2017-01-17 Apple Inc. Intelligent automated assistant
US10679605B2 (en) 2010-01-18 2020-06-09 Apple Inc. Hands-free list-reading by intelligent automated assistant
US11423886B2 (en) 2010-01-18 2022-08-23 Apple Inc. Task flow identification based on user intent
US10705794B2 (en) 2010-01-18 2020-07-07 Apple Inc. Automatically adapting user interfaces for hands-free interaction
US10496753B2 (en) 2010-01-18 2019-12-03 Apple Inc. Automatically adapting user interfaces for hands-free interaction
US10553209B2 (en) 2010-01-18 2020-02-04 Apple Inc. Systems and methods for hands-free notification summaries
US10741185B2 (en) 2010-01-18 2020-08-11 Apple Inc. Intelligent automated assistant
US10692504B2 (en) 2010-02-25 2020-06-23 Apple Inc. User profiling for voice input processing
US10049675B2 (en) 2010-02-25 2018-08-14 Apple Inc. User profiling for voice input processing
US9633660B2 (en) 2010-02-25 2017-04-25 Apple Inc. User profiling for voice input processing
US10102359B2 (en) 2011-03-21 2018-10-16 Apple Inc. Device access using voice authentication
US10417405B2 (en) 2011-03-21 2019-09-17 Apple Inc. Device access using voice authentication
US11120372B2 (en) 2011-06-03 2021-09-14 Apple Inc. Performing actions associated with task items that represent tasks to perform
US11350253B2 (en) 2011-06-03 2022-05-31 Apple Inc. Active transport based notifications
US11069336B2 (en) 2012-03-02 2021-07-20 Apple Inc. Systems and methods for name pronunciation
US9953088B2 (en) 2012-05-14 2018-04-24 Apple Inc. Crowd sourcing information to fulfill user requests
US11321116B2 (en) 2012-05-15 2022-05-03 Apple Inc. Systems and methods for integrating third party services with a digital assistant
US11269678B2 (en) 2012-05-15 2022-03-08 Apple Inc. Systems and methods for integrating third party services with a digital assistant
US10079014B2 (en) 2012-06-08 2018-09-18 Apple Inc. Name recognition system
US9971774B2 (en) 2012-09-19 2018-05-15 Apple Inc. Voice-based media searching
CN105229727A (zh) * 2013-01-08 2016-01-06 赛普拉斯半导体公司 分布式语音识别系统
US10978090B2 (en) 2013-02-07 2021-04-13 Apple Inc. Voice trigger for a digital assistant
US11636869B2 (en) 2013-02-07 2023-04-25 Apple Inc. Voice trigger for a digital assistant
US10714117B2 (en) 2013-02-07 2020-07-14 Apple Inc. Voice trigger for a digital assistant
US11388291B2 (en) 2013-03-14 2022-07-12 Apple Inc. System and method for processing voicemail
US11798547B2 (en) 2013-03-15 2023-10-24 Apple Inc. Voice activated device for use with a voice-based digital assistant
US9966060B2 (en) 2013-06-07 2018-05-08 Apple Inc. System and method for user-specified pronunciation of words for speech synthesis and recognition
US9620104B2 (en) 2013-06-07 2017-04-11 Apple Inc. System and method for user-specified pronunciation of words for speech synthesis and recognition
US9582608B2 (en) 2013-06-07 2017-02-28 Apple Inc. Unified ranking with entropy-weighted information for phrase-based semantic auto-completion
US9966068B2 (en) 2013-06-08 2018-05-08 Apple Inc. Interpreting and acting upon commands that involve sharing information with remote devices
US10657961B2 (en) 2013-06-08 2020-05-19 Apple Inc. Interpreting and acting upon commands that involve sharing information with remote devices
US10769385B2 (en) 2013-06-09 2020-09-08 Apple Inc. System and method for inferring user intent from speech inputs
US11048473B2 (en) 2013-06-09 2021-06-29 Apple Inc. Device, method, and graphical user interface for enabling conversation persistence across two or more instances of a digital assistant
US10185542B2 (en) 2013-06-09 2019-01-22 Apple Inc. Device, method, and graphical user interface for enabling conversation persistence across two or more instances of a digital assistant
US10176167B2 (en) 2013-06-09 2019-01-08 Apple Inc. System and method for inferring user intent from speech inputs
US11727219B2 (en) 2013-06-09 2023-08-15 Apple Inc. System and method for inferring user intent from speech inputs
US11314370B2 (en) 2013-12-06 2022-04-26 Apple Inc. Method for extracting salient dialog usage from live data
US11810562B2 (en) 2014-05-30 2023-11-07 Apple Inc. Reducing the need for manual start/end-pointing and trigger phrases
US10699717B2 (en) 2014-05-30 2020-06-30 Apple Inc. Intelligent assistant for home automation
US10417344B2 (en) 2014-05-30 2019-09-17 Apple Inc. Exemplar-based natural language processing
US10878809B2 (en) 2014-05-30 2020-12-29 Apple Inc. Multi-command single utterance input method
US10083690B2 (en) 2014-05-30 2018-09-25 Apple Inc. Better resolution when referencing to concepts
US11699448B2 (en) 2014-05-30 2023-07-11 Apple Inc. Intelligent assistant for home automation
US10169329B2 (en) 2014-05-30 2019-01-01 Apple Inc. Exemplar-based natural language processing
US10714095B2 (en) 2014-05-30 2020-07-14 Apple Inc. Intelligent assistant for home automation
US11133008B2 (en) 2014-05-30 2021-09-28 Apple Inc. Reducing the need for manual start/end-pointing and trigger phrases
US11257504B2 (en) 2014-05-30 2022-02-22 Apple Inc. Intelligent assistant for home automation
US10657966B2 (en) 2014-05-30 2020-05-19 Apple Inc. Better resolution when referencing to concepts
US11670289B2 (en) 2014-05-30 2023-06-06 Apple Inc. Multi-command single utterance input method
US10497365B2 (en) 2014-05-30 2019-12-03 Apple Inc. Multi-command single utterance input method
US9668024B2 (en) 2014-06-30 2017-05-30 Apple Inc. Intelligent automated assistant for TV user interactions
US10904611B2 (en) 2014-06-30 2021-01-26 Apple Inc. Intelligent automated assistant for TV user interactions
US11516537B2 (en) 2014-06-30 2022-11-29 Apple Inc. Intelligent automated assistant for TV user interactions
US10431204B2 (en) 2014-09-11 2019-10-01 Apple Inc. Method and apparatus for discovering trending terms in speech requests
US9986419B2 (en) 2014-09-30 2018-05-29 Apple Inc. Social reminders
US10438595B2 (en) 2014-09-30 2019-10-08 Apple Inc. Speaker identification and unsupervised speaker adaptation techniques
US10390213B2 (en) 2014-09-30 2019-08-20 Apple Inc. Social reminders
US10453443B2 (en) 2014-09-30 2019-10-22 Apple Inc. Providing an indication of the suitability of speech recognition
US11231904B2 (en) 2015-03-06 2022-01-25 Apple Inc. Reducing response latency of intelligent automated assistants
US10567477B2 (en) 2015-03-08 2020-02-18 Apple Inc. Virtual assistant continuity
US10311871B2 (en) 2015-03-08 2019-06-04 Apple Inc. Competing devices responding to voice triggers
US11842734B2 (en) 2015-03-08 2023-12-12 Apple Inc. Virtual assistant activation
US11087759B2 (en) 2015-03-08 2021-08-10 Apple Inc. Virtual assistant activation
US10529332B2 (en) 2015-03-08 2020-01-07 Apple Inc. Virtual assistant activation
US10930282B2 (en) 2015-03-08 2021-02-23 Apple Inc. Competing devices responding to voice triggers
US11468282B2 (en) 2015-05-15 2022-10-11 Apple Inc. Virtual assistant in a communication session
US11070949B2 (en) 2015-05-27 2021-07-20 Apple Inc. Systems and methods for proactively identifying and surfacing relevant content on an electronic device with a touch-sensitive display
US11127397B2 (en) 2015-05-27 2021-09-21 Apple Inc. Device voice control
US10681212B2 (en) 2015-06-05 2020-06-09 Apple Inc. Virtual assistant aided communication with 3rd party service in a communication session
US10356243B2 (en) 2015-06-05 2019-07-16 Apple Inc. Virtual assistant aided communication with 3rd party service in a communication session
US11025565B2 (en) 2015-06-07 2021-06-01 Apple Inc. Personalized prediction of responses for instant messaging
US11010127B2 (en) 2015-06-29 2021-05-18 Apple Inc. Virtual assistant for media playback
US11947873B2 (en) 2015-06-29 2024-04-02 Apple Inc. Virtual assistant for media playback
US11500672B2 (en) 2015-09-08 2022-11-15 Apple Inc. Distributed personal assistant
US11550542B2 (en) 2015-09-08 2023-01-10 Apple Inc. Zero latency digital assistant
US10671428B2 (en) 2015-09-08 2020-06-02 Apple Inc. Distributed personal assistant
US11853536B2 (en) 2015-09-08 2023-12-26 Apple Inc. Intelligent automated assistant in a media environment
US11126400B2 (en) 2015-09-08 2021-09-21 Apple Inc. Zero latency digital assistant
US11809483B2 (en) 2015-09-08 2023-11-07 Apple Inc. Intelligent automated assistant for media search and playback
US10747498B2 (en) 2015-09-08 2020-08-18 Apple Inc. Zero latency digital assistant
US10366158B2 (en) 2015-09-29 2019-07-30 Apple Inc. Efficient word encoding for recurrent neural network language models
US11010550B2 (en) 2015-09-29 2021-05-18 Apple Inc. Unified language modeling framework for word prediction, auto-completion and auto-correction
US11587559B2 (en) 2015-09-30 2023-02-21 Apple Inc. Intelligent device identification
US11526368B2 (en) 2015-11-06 2022-12-13 Apple Inc. Intelligent automated assistant in a messaging environment
US10691473B2 (en) 2015-11-06 2020-06-23 Apple Inc. Intelligent automated assistant in a messaging environment
US11886805B2 (en) 2015-11-09 2024-01-30 Apple Inc. Unconventional virtual assistant interactions
US10049668B2 (en) 2015-12-02 2018-08-14 Apple Inc. Applying neural network language models to weighted finite state transducers for automatic speech recognition
US10354652B2 (en) 2015-12-02 2019-07-16 Apple Inc. Applying neural network language models to weighted finite state transducers for automatic speech recognition
US10942703B2 (en) 2015-12-23 2021-03-09 Apple Inc. Proactive assistance based on dialog communication between devices
US10223066B2 (en) 2015-12-23 2019-03-05 Apple Inc. Proactive assistance based on dialog communication between devices
US10446143B2 (en) 2016-03-14 2019-10-15 Apple Inc. Identification of voice inputs providing credentials
US9934775B2 (en) 2016-05-26 2018-04-03 Apple Inc. Unit-selection text-to-speech synthesis based on predicted concatenation parameters
US9972304B2 (en) 2016-06-03 2018-05-15 Apple Inc. Privacy preserving distributed evaluation framework for embedded personalized systems
US11227589B2 (en) 2016-06-06 2022-01-18 Apple Inc. Intelligent list reading
US10249300B2 (en) 2016-06-06 2019-04-02 Apple Inc. Intelligent list reading
US11069347B2 (en) 2016-06-08 2021-07-20 Apple Inc. Intelligent automated assistant for media exploration
US10049663B2 (en) 2016-06-08 2018-08-14 Apple, Inc. Intelligent automated assistant for media exploration
US10354011B2 (en) 2016-06-09 2019-07-16 Apple Inc. Intelligent automated assistant in a home environment
US11037565B2 (en) 2016-06-10 2021-06-15 Apple Inc. Intelligent digital assistant in a multi-tasking environment
US10067938B2 (en) 2016-06-10 2018-09-04 Apple Inc. Multilingual word prediction
US10192552B2 (en) 2016-06-10 2019-01-29 Apple Inc. Digital assistant providing whispered speech
US10490187B2 (en) 2016-06-10 2019-11-26 Apple Inc. Digital assistant providing automated status report
US11657820B2 (en) 2016-06-10 2023-05-23 Apple Inc. Intelligent digital assistant in a multi-tasking environment
US10733993B2 (en) 2016-06-10 2020-08-04 Apple Inc. Intelligent digital assistant in a multi-tasking environment
US10509862B2 (en) 2016-06-10 2019-12-17 Apple Inc. Dynamic phrase expansion of language input
US10297253B2 (en) 2016-06-11 2019-05-21 Apple Inc. Application integration with a digital assistant
US10269345B2 (en) 2016-06-11 2019-04-23 Apple Inc. Intelligent task discovery
US11809783B2 (en) 2016-06-11 2023-11-07 Apple Inc. Intelligent device arbitration and control
US11152002B2 (en) 2016-06-11 2021-10-19 Apple Inc. Application integration with a digital assistant
US10521466B2 (en) 2016-06-11 2019-12-31 Apple Inc. Data driven natural language event detection and classification
US10942702B2 (en) 2016-06-11 2021-03-09 Apple Inc. Intelligent device arbitration and control
US11749275B2 (en) 2016-06-11 2023-09-05 Apple Inc. Application integration with a digital assistant
US10580409B2 (en) 2016-06-11 2020-03-03 Apple Inc. Application integration with a digital assistant
US10089072B2 (en) 2016-06-11 2018-10-02 Apple Inc. Intelligent device arbitration and control
US10474753B2 (en) 2016-09-07 2019-11-12 Apple Inc. Language identification using recurrent neural networks
CN109716325B (zh) * 2016-09-13 2023-09-12 微软技术许可有限责任公司 计算机化的自然语言查询意图分派
CN109716325A (zh) * 2016-09-13 2019-05-03 微软技术许可有限责任公司 计算机化的自然语言查询意图分派
US10553215B2 (en) 2016-09-23 2020-02-04 Apple Inc. Intelligent automated assistant
US10043516B2 (en) 2016-09-23 2018-08-07 Apple Inc. Intelligent automated assistant
US11281993B2 (en) 2016-12-05 2022-03-22 Apple Inc. Model and ensemble compression for metric learning
CN108228699A (zh) * 2016-12-22 2018-06-29 谷歌有限责任公司 协作性语音控制装置
US11893995B2 (en) 2016-12-22 2024-02-06 Google Llc Generating additional synthesized voice output based on prior utterance and synthesized voice output provided in response to the prior utterance
US11521618B2 (en) 2016-12-22 2022-12-06 Google Llc Collaborative voice controlled devices
US10593346B2 (en) 2016-12-22 2020-03-17 Apple Inc. Rank-reduced token representation for automatic speech recognition
CN108541315B (zh) * 2016-12-30 2022-01-11 谷歌有限责任公司 语音激活数据分组的数据结构池化
CN108541315A (zh) * 2016-12-30 2018-09-14 谷歌有限责任公司 语音激活数据分组的数据结构池化
US11204787B2 (en) 2017-01-09 2021-12-21 Apple Inc. Application integration with a digital assistant
US11656884B2 (en) 2017-01-09 2023-05-23 Apple Inc. Application integration with a digital assistant
US10332518B2 (en) 2017-05-09 2019-06-25 Apple Inc. User interface for correcting recognition errors
US10741181B2 (en) 2017-05-09 2020-08-11 Apple Inc. User interface for correcting recognition errors
US10417266B2 (en) 2017-05-09 2019-09-17 Apple Inc. Context-aware ranking of intelligent response suggestions
US10395654B2 (en) 2017-05-11 2019-08-27 Apple Inc. Text normalization based on a data-driven learning network
US11599331B2 (en) 2017-05-11 2023-03-07 Apple Inc. Maintaining privacy of personal information
US10847142B2 (en) 2017-05-11 2020-11-24 Apple Inc. Maintaining privacy of personal information
US10755703B2 (en) 2017-05-11 2020-08-25 Apple Inc. Offline personal assistant
US10726832B2 (en) 2017-05-11 2020-07-28 Apple Inc. Maintaining privacy of personal information
US10791176B2 (en) 2017-05-12 2020-09-29 Apple Inc. Synchronization and task delegation of a digital assistant
US10789945B2 (en) 2017-05-12 2020-09-29 Apple Inc. Low-latency intelligent automated assistant
US11405466B2 (en) 2017-05-12 2022-08-02 Apple Inc. Synchronization and task delegation of a digital assistant
US11580990B2 (en) 2017-05-12 2023-02-14 Apple Inc. User-specific acoustic models
US10410637B2 (en) 2017-05-12 2019-09-10 Apple Inc. User-specific acoustic models
US11380310B2 (en) 2017-05-12 2022-07-05 Apple Inc. Low-latency intelligent automated assistant
US11301477B2 (en) 2017-05-12 2022-04-12 Apple Inc. Feedback analysis of a digital assistant
US10810274B2 (en) 2017-05-15 2020-10-20 Apple Inc. Optimizing dialogue policy decisions for digital assistants using implicit feedback
US10482874B2 (en) 2017-05-15 2019-11-19 Apple Inc. Hierarchical belief states for digital assistants
US11532306B2 (en) 2017-05-16 2022-12-20 Apple Inc. Detecting a trigger of a digital assistant
US10748546B2 (en) 2017-05-16 2020-08-18 Apple Inc. Digital assistant services based on device capabilities
US10403278B2 (en) 2017-05-16 2019-09-03 Apple Inc. Methods and systems for phonetic matching in digital assistant services
US10303715B2 (en) 2017-05-16 2019-05-28 Apple Inc. Intelligent automated assistant for media exploration
US11217255B2 (en) 2017-05-16 2022-01-04 Apple Inc. Far-field extension for digital assistant services
US11675829B2 (en) 2017-05-16 2023-06-13 Apple Inc. Intelligent automated assistant for media exploration
US10311144B2 (en) 2017-05-16 2019-06-04 Apple Inc. Emoji word sense disambiguation
US10909171B2 (en) 2017-05-16 2021-02-02 Apple Inc. Intelligent automated assistant for media exploration
US10657328B2 (en) 2017-06-02 2020-05-19 Apple Inc. Multi-task recurrent neural network architecture for efficient morphology handling in neural language modeling
US10445429B2 (en) 2017-09-21 2019-10-15 Apple Inc. Natural language understanding using vocabularies with compressed serialized tries
US10755051B2 (en) 2017-09-29 2020-08-25 Apple Inc. Rule-based natural language processing
US10636424B2 (en) 2017-11-30 2020-04-28 Apple Inc. Multi-turn canned dialog
US10733982B2 (en) 2018-01-08 2020-08-04 Apple Inc. Multi-directional dialog
US10733375B2 (en) 2018-01-31 2020-08-04 Apple Inc. Knowledge-based framework for improving natural language understanding
US10789959B2 (en) 2018-03-02 2020-09-29 Apple Inc. Training speaker recognition models for digital assistants
US10592604B2 (en) 2018-03-12 2020-03-17 Apple Inc. Inverse text normalization for automatic speech recognition
US10818288B2 (en) 2018-03-26 2020-10-27 Apple Inc. Natural assistant interaction
US11710482B2 (en) 2018-03-26 2023-07-25 Apple Inc. Natural assistant interaction
US10909331B2 (en) 2018-03-30 2021-02-02 Apple Inc. Implicit identification of translation payload with neural machine translation
US11145294B2 (en) 2018-05-07 2021-10-12 Apple Inc. Intelligent automated assistant for delivering content from user experiences
US10928918B2 (en) 2018-05-07 2021-02-23 Apple Inc. Raise to speak
US11487364B2 (en) 2018-05-07 2022-11-01 Apple Inc. Raise to speak
US11900923B2 (en) 2018-05-07 2024-02-13 Apple Inc. Intelligent automated assistant for delivering content from user experiences
US11854539B2 (en) 2018-05-07 2023-12-26 Apple Inc. Intelligent automated assistant for delivering content from user experiences
US11169616B2 (en) 2018-05-07 2021-11-09 Apple Inc. Raise to speak
US10984780B2 (en) 2018-05-21 2021-04-20 Apple Inc. Global semantic word embeddings using bi-directional recurrent neural networks
US10720160B2 (en) 2018-06-01 2020-07-21 Apple Inc. Voice interaction at a primary device to access call functionality of a companion device
US11386266B2 (en) 2018-06-01 2022-07-12 Apple Inc. Text correction
US11495218B2 (en) 2018-06-01 2022-11-08 Apple Inc. Virtual assistant operation in multi-device environments
US10403283B1 (en) 2018-06-01 2019-09-03 Apple Inc. Voice interaction at a primary device to access call functionality of a companion device
US11360577B2 (en) 2018-06-01 2022-06-14 Apple Inc. Attention aware virtual assistant dismissal
US10984798B2 (en) 2018-06-01 2021-04-20 Apple Inc. Voice interaction at a primary device to access call functionality of a companion device
US10684703B2 (en) 2018-06-01 2020-06-16 Apple Inc. Attention aware virtual assistant dismissal
US11009970B2 (en) 2018-06-01 2021-05-18 Apple Inc. Attention aware virtual assistant dismissal
US10892996B2 (en) 2018-06-01 2021-01-12 Apple Inc. Variable latency device coordination
US11431642B2 (en) 2018-06-01 2022-08-30 Apple Inc. Variable latency device coordination
US10504518B1 (en) 2018-06-03 2019-12-10 Apple Inc. Accelerated task performance
US10944859B2 (en) 2018-06-03 2021-03-09 Apple Inc. Accelerated task performance
US10496705B1 (en) 2018-06-03 2019-12-03 Apple Inc. Accelerated task performance
US11010561B2 (en) 2018-09-27 2021-05-18 Apple Inc. Sentiment prediction from textual data
US10839159B2 (en) 2018-09-28 2020-11-17 Apple Inc. Named entity normalization in a spoken dialog system
US11462215B2 (en) 2018-09-28 2022-10-04 Apple Inc. Multi-modal inputs for voice commands
US11170166B2 (en) 2018-09-28 2021-11-09 Apple Inc. Neural typographical error modeling via generative adversarial networks
US11475898B2 (en) 2018-10-26 2022-10-18 Apple Inc. Low-latency multi-speaker speech recognition
US11638059B2 (en) 2019-01-04 2023-04-25 Apple Inc. Content playback on multiple devices
US11348573B2 (en) 2019-03-18 2022-05-31 Apple Inc. Multimodality in digital assistant systems
US11705130B2 (en) 2019-05-06 2023-07-18 Apple Inc. Spoken notifications
US11423908B2 (en) 2019-05-06 2022-08-23 Apple Inc. Interpreting spoken requests
US11475884B2 (en) 2019-05-06 2022-10-18 Apple Inc. Reducing digital assistant latency when a language is incorrectly determined
US11217251B2 (en) 2019-05-06 2022-01-04 Apple Inc. Spoken notifications
US11307752B2 (en) 2019-05-06 2022-04-19 Apple Inc. User configurable task triggers
US11140099B2 (en) 2019-05-21 2021-10-05 Apple Inc. Providing message response suggestions
US11888791B2 (en) 2019-05-21 2024-01-30 Apple Inc. Providing message response suggestions
US11657813B2 (en) 2019-05-31 2023-05-23 Apple Inc. Voice identification in digital assistant systems
US11289073B2 (en) 2019-05-31 2022-03-29 Apple Inc. Device text to speech
US11237797B2 (en) 2019-05-31 2022-02-01 Apple Inc. User activity shortcut suggestions
US11496600B2 (en) 2019-05-31 2022-11-08 Apple Inc. Remote execution of machine-learned models
US11360739B2 (en) 2019-05-31 2022-06-14 Apple Inc. User activity shortcut suggestions
US11360641B2 (en) 2019-06-01 2022-06-14 Apple Inc. Increasing the relevance of new available information
US11488406B2 (en) 2019-09-25 2022-11-01 Apple Inc. Text detection using global geometry estimators
US11924254B2 (en) 2020-05-11 2024-03-05 Apple Inc. Digital assistant hardware abstraction
US11765209B2 (en) 2020-05-11 2023-09-19 Apple Inc. Digital assistant hardware abstraction

Also Published As

Publication number Publication date
US20090299745A1 (en) 2009-12-03
US8589161B2 (en) 2013-11-19
WO2009145796A1 (en) 2009-12-03
EP2283431A1 (en) 2011-02-16
EP2283431A4 (en) 2012-09-05
EP2283431B1 (en) 2021-12-15
CN102160043B (zh) 2015-04-22

Similar Documents

Publication Publication Date Title
CN102160043B (zh) 针对集成多语气多装置自然语言语音服务环境的系统和方法
US10553216B2 (en) System and method for an integrated, multi-modal, multi-device natural language voice services environment
JP7044415B2 (ja) ホームアシスタント装置を制御するための方法及びシステム
KR102505597B1 (ko) 어시스턴트 애플리케이션을 위한 음성 사용자 인터페이스 단축
US10755706B2 (en) Voice-based user interface with dynamically switchable endpoints
KR102543693B1 (ko) 전자 장치 및 그의 동작 방법
KR101912058B1 (ko) 자연어 음성 서비스 환경에서 하이브리드 처리를 위한 시스템 및 방법
CN107564510A (zh) 一种语音虚拟角色管理方法、装置、服务器和存储介质
US10249296B1 (en) Application discovery and selection in language-based systems
JP2015122084A (ja) 自然言語音声サービス環境においてマルチモーダル機器対話を処理するシステム及び方法
CN103714813A (zh) 短语辨认系统和方法
JP6783339B2 (ja) 音声を処理する方法及び装置
CN110459222A (zh) 语音控制方法、语音控制装置及终端设备
CN112292724A (zh) 用于调用自动助理的动态和/或场境特定热词
CN109144458A (zh) 用于执行与语音输入相对应的操作的电子设备
JP2019040602A (ja) 人工知能機器における連続会話機能
CN116417003A (zh) 语音交互系统、方法、电子设备和存储介质
CN111292749B (zh) 智能语音平台的会话控制方法及装置
CN112700770A (zh) 语音控制方法、音箱设备、计算设备和存储介质
JP2023553453A (ja) グループホットワード
US11600260B1 (en) Utterance generation and evaluation
US20210049215A1 (en) Shared Context Manager for Cohabitating Agents

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
C41 Transfer of patent application or patent right or utility model
TR01 Transfer of patent right

Effective date of registration: 20160921

Address after: Washington, USA

Patentee after: Voicebox Technologies Inc.

Address before: Washington, USA

Patentee before: Voicebox Technologies Inc.

TR01 Transfer of patent right

Effective date of registration: 20181212

Address after: Washington, USA

Patentee after: Weber Assets Co., Ltd.

Address before: Washington, USA

Patentee before: Voicebox Technologies Inc.

TR01 Transfer of patent right