CN108874766A - Methods and systems for phonetic matching in digital assistant services

Publication number
CN108874766A
Authority
CN
China
Prior art keywords
user
media item
speech input
voice
speech
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201810072173.2A
Other languages
English (en)
Other versions
CN108874766B (zh)
Inventor
A·斯吉林
M·J·亨特
G·埃弗曼
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Apple Inc
Original Assignee
Apple Computer Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Apple Computer Inc
Publication of CN108874766A
Application granted
Publication of CN108874766B
Legal status: Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/20 Natural language analysis
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/40 Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
    • G06F16/43 Querying
    • G06F16/432 Query formulation
    • G06F16/433 Query formulation using audio data
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16 Sound input; Sound output
    • G06F3/167 Audio in a user interface, e.g. using voice commands for navigating, audio feedback
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/02 Feature extraction for speech recognition; Selection of recognition unit
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/10 Speech classification or search using distance or distortion measures between unknown speech and reference templates
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/18 Speech classification or search using natural language modelling
    • G10L15/183 Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • G10L15/187 Phonemic context, e.g. pronunciation rules, phonotactical constraints or phoneme n-grams
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/26 Speech to text systems
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/02 Feature extraction for speech recognition; Selection of recognition unit
    • G10L2015/025 Phonemes, fenemes or fenones being the recognition units
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L2015/088 Word spotting

Abstract

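The present invention is entitled "Methods and systems for phonetic matching in digital assistant services." Systems and processes are provided for operating an intelligent automated assistant to provide media items based on phonetic matching techniques. An example method includes receiving a speech input from a user and determining whether the speech input includes a user request for a media item. The method further includes, in accordance with a determination that the speech input includes a user request for obtaining a media item, determining a candidate media item from a plurality of media items. The method further includes determining, based on a difference between a phonetic representation of the candidate media item and a phonetic representation of the speech input, whether the candidate media item is to be provided to the user. The method further includes, in accordance with a determination that the candidate media item is to be provided to the user, providing the candidate media item to the user.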

Description

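Methods and systems for phonetic matching in digital assistant services
Cross-reference to related applications
This application claims priority to U.S. Non-Provisional Patent Application No. 15/703,013, entitled "METHODS AND SYSTEMS FOR PHONETIC MATCHING IN DIGITAL ASSISTANT SERVICES," filed September 13, 2017, which claims priority to U.S. Provisional Patent Application No. 62/506,871, entitled "METHODS AND SYSTEMS FOR PHONETIC MATCHING IN DIGITAL ASSISTANT SERVICES," filed May 16, 2017, and to U.S. Provisional Patent Application No. 62/555,311, entitled "METHODS AND SYSTEMS FOR PHONETIC MATCHING IN DIGITAL ASSISTANT SERVICES," filed September 7, 2017.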
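Technical field
The present invention relates generally to intelligent automated assistants and, more specifically, to intelligent automated assistants that provide media items based on phonetic matching techniques.
Background
Intelligent automated assistants (or digital assistants) can provide a beneficial interface between human users and electronic devices. Such assistants can allow users to interact with devices or systems using natural language in spoken and/or text form. For example, a user can provide a speech input containing a user request to a digital assistant operating on an electronic device. The digital assistant can interpret the user's intent from the speech input and operationalize the user's intent into tasks. The tasks can then be performed by executing one or more services of the electronic device, and a relevant output responsive to the user request can be returned to the user.
A digital assistant can be used to obtain media items based on a user's speech input. For example, when attempting to obtain a song, the user may say "Play Skrrt Skrrt by 21 Savage." For various reasons (e.g., similarity in pronunciation, similarity in the names of available media items, a lack of contextual information for intent inference), interpreting the user intent of a speech input for obtaining a media item can be difficult and inaccurate. An inaccurate user intent may cause the digital assistant to obtain an incorrect media item or to fail to obtain any media item. In the above example, if the digital assistant interprets the user intent as "Skirt by Kodak Black," "Skrrt by Kodak Black," or "Skrt Skrt by 21 Savage," the digital assistant may fail to obtain the correct media item. As another example, the user may want the album named "Candyman by Zedd," and the digital assistant may incorrectly interpret the request as "Candy Man by Zedd," which may fail to identify and return the correct album.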
发明内容
本公开提供了用于提供数字助理服务以获得媒体项目的系统和过程。
本文公开了示例性方法。一种示例性方法包括在具有一个或多个处理 器的电子设备处,接收来自用户的言语输入和确定该言语输入是否包括对 媒体项目的用户请求。该方法还包括根据确定所述言语输入包括对获得媒 体项目的用户请求,从多个媒体项目来确定候选媒体项目。该方法还包括 基于候选媒体项目的语音表示和言语输入的语音表示之间的差值,确定是 否要将候选媒体项目提供给用户。该方法还包括根据确定要将候选媒体项 目提供给用户,将所述候选媒体项目提供给所述用户。
本文公开了示例性非暂态计算机可读介质。公开了一种存储一个或多 个程序的示例性非暂态计算机可读存储介质。一个或多个程序包括指令, 该指令当由电子设备的一个或多个处理器执行时使得电子设备接收来自用 户的言语输入并确定该言语输入是否包括对媒体项目的用户请求。一个或 多个程序还包括指令,该指令使得电子设备根据确定言语输入包括对获得 媒体项目的用户请求,从多个媒体项目来确定候选媒体项目。一个或多个 程序还包括指令,该指令使得电子设备基于候选媒体项目的语音表示和言 语输入的语音表示之间的差值,确定是否要将候选媒体项目提供给用户。 一个或多个程序还包括指令,该指令使得电子设备根据确定要将候选媒体 项目提供给用户,将候选媒体项目提供给用户。
本文公开了示例性电子设备。一种示例性电子设备包括一个或多个处 理器;存储器;一个或多个程序,其中所述一个或多个程序被存储在所述 存储器中并被配置为由所述一个或多个处理器执行,所述一个或多个程序 包括用于接收来自用户的言语输入并确定所述言语输入是否包括对媒体项 目的用户请求的指令。一个或多个程序还包括指令,该指令用于根据确定 所述言语输入包括对获得媒体项目的用户请求,从多个媒体项目来确定候 选媒体项目。一个或多个程序还包括指令,该指令用于基于候选媒体项目 的语音表示和言语输入的语音表示之间的差值,确定是否要将候选媒体项 目提供给用户。一个或多个程序还包括指令,该指令用于根据确定要将候 选媒体项目提供给用户,将候选媒体项目提供给用户。
一种示例性电子设备包括用于接收来自用户的言语输入和确定所述言 语输入是否包括对媒体项目的用户请求的装置。该电子设备还包括用于根 据确定所述言语输入包括对获得媒体项目的用户请求,从多个媒体项目来 确定候选媒体项目的装置。该电子设备还包括用于基于候选媒体项目的语 音表示和言语输入的语音表示之间的差值,确定是否要将候选媒体项目提 供给用户的装置。该电子设备还包括用于根据确定要将候选媒体项目提供 给用户,将所述候选媒体项目提供给所述用户的装置。
语音匹配技术可用于提高基于言语输入来获得媒体项目的准确度。如 上所述,基于用户的言语输入获得媒体项目可与用户意图解读中的难度相 关联,这就导致无法获得正确的媒体项目。可施用语音匹配技术以确定从 媒体项目的储存库中所获得候选媒体项目的语音表示和言语输入的语音表 示之间的差值,从而确定是否应将候选媒体项目提供给用户。在某些情况 下,候选媒体项目可表示储存库中的最佳可用媒体项目,但可能仍不是用 户预期的媒体项目。通过执行语音匹配,可使获得与用户意图不相匹配的 媒体项目的错误率降低。此外,语音匹配增强了设备的可操作性并使得用 户设备界面更为有效(例如,通过降低获得不正确的媒体项目的错误率),这 就通过使用户更快速高效地使用设备进一步降低了用电量并提高了设备的 电池寿命。
附图说明
图1是示出根据各种示例的用于实现数字助理的系统和环境的框图。
图2A是示出根据各种示例的实现数字助理的客户端侧部分的便携式 多功能设备的框图。
图2B是示出根据各种示例的用于事件处理的示例性部件的框图。
图3示出了根据各种示例的实现数字助理的客户端侧部分的便携式多 功能设备。
图4是根据各种示例的具有显示器和触敏表面的示例性多功能设备的 框图。
图5A示出了根据各种示例的便携式多功能设备上的应用程序菜单的 示例性用户界面。
图5B示出了根据各种示例的具有与显示器分开的触敏表面的多功能设 备的示例性用户界面。
图6A示出了根据各种示例的个人电子设备。
图6B是示出根据各种示例的个人电子设备的框图。
图7A是示出根据各种示例的数字助理系统或其服务器部分的框图。
图7B示出了根据各种示例的图7A所示的数字助理的功能。
图7C示出了根据各种示例的知识本体的一部分。
图8示出了根据各种示例的用于基于语音匹配技术提供媒体项目的数 字助理的框图。
图9示出了根据各种示例的用于接收来自用户的言语输入的数字助理 的框图。
图10示出了根据各种示例的请求检测器的框图。
图11A示出了根据各种示例的媒体项目搜索引擎的框图。
图11B示出了根据各种示例的用于生成语音图的框图。
图11C示出了根据各种示例的用于示例性言语输入的示例性语音图。
图11D示出了根据各种示例的另一示例性语音图。
图12A-图12E示出了根据各种示例的用于操作数字助理以基于言语输 入来提供媒体项目的过程。
具体实施方式
在以下对示例的描述中将引用附图,在附图中以例示的方式示出了可 被实施的特定示例。应当理解,在不脱离各个示例的范围的情况下,可使 用其他示例并且可作出结构性变更。
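Techniques for providing media items to a user based on the user's speech input are desirable. Phonetic matching techniques can be used to improve the accuracy of obtaining media items. In some examples, a speech input is received from a user, and it is determined whether the speech input includes a user request for a media item. If the speech input includes a user request for a media item, a candidate media item is determined from a repository of media items. The candidate media item can represent the closest match between the requested media item derived from the speech input and the media items available in the repository. Phonetic matching techniques can be applied to determine a difference between a phonetic representation of the candidate media item and a phonetic representation of the speech input, and thereby to determine whether the candidate media item should be provided to the user. If the difference does not satisfy a threshold condition, the candidate media item likely does not match the user request and therefore is not provided to the user. Phonetic matching techniques can reduce the error rate in obtaining media items and improve the efficiency of user interaction.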
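The decision step described above can be sketched as follows. This is only an illustration under stated assumptions: the text does not say how the "difference" between phonetic representations is computed, so the sketch substitutes a Levenshtein distance over phoneme sequences, normalized by the longer sequence. The function names, the ARPAbet-style phoneme symbols, and the 0.3 threshold are all hypothetical.

```python
from typing import Sequence


def phoneme_edit_distance(a: Sequence[str], b: Sequence[str]) -> int:
    """Levenshtein distance between two phoneme sequences (assumed metric)."""
    prev = list(range(len(b) + 1))
    for i, pa in enumerate(a, start=1):
        curr = [i]
        for j, pb in enumerate(b, start=1):
            cost = 0 if pa == pb else 1
            curr.append(min(prev[j] + 1,          # delete a phoneme
                            curr[j - 1] + 1,      # insert a phoneme
                            prev[j - 1] + cost))  # substitute a phoneme
        prev = curr
    return prev[-1]


def should_provide(candidate: Sequence[str], spoken: Sequence[str],
                   max_normalized_difference: float = 0.3) -> bool:
    """Provide the candidate only if the normalized phonetic difference
    satisfies the (hypothetical) threshold condition."""
    longest = max(len(candidate), len(spoken), 1)
    difference = phoneme_edit_distance(candidate, spoken) / longest
    return difference <= max_normalized_difference
```

With this sketch, a candidate whose phonemes match the spoken phonemes exactly is provided, while a candidate that differs in most phonemes is withheld even if it was the best available item in the repository.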
尽管以下描述使用术语第一、第二等来描述各种元素,但这些元素不 应受术语的限制。这些术语只是用于将一个元件与另一元件区分开。例 如,在不脱离各种所述示例的范围的情况下,第一输入可被称为第二输 入,并且类似地,第二输入可被称为第一输入。第一输入和第二输入均为 输入,并且在一些情况下为独立且不同的输入。
在本文中对各种所述示例的描述中所使用的术语只是为了描述特定示 例,而并非旨在进行限制。如在对各种所述示例的描述和所附权利要求书 中所使用的那样,单数形式“一个”和“该”旨在也包括复数形式,除非 上下文另外明确地指示。还应当理解,本文中所使用的术语“和/或”是指 并且涵盖相关联地列出的项目中的一个或多个项目的任何和全部可能的组 合。还将理解的是,术语“包括”和/或“包含”当在本说明书中使用时指 定存在所陈述的特征、整数、步骤、操作、元素、和/或部件,但是并不排 除存在或添加一个或多个其他特征、整数、步骤、操作、元素、部件、和/ 或其分组。
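The term "if" can be construed to mean "when" or "upon" or "in response to determining" or "in response to detecting," depending on the context. Similarly, the phrase "if it is determined..." or "if [a stated condition or event] is detected" can be construed to mean "upon determining..." or "in response to determining..." or "upon detecting [the stated condition or event]" or "in response to detecting [the stated condition or event]," depending on the context.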
1.系统和环境
图1示出了根据各种示例的系统100的框图。在一些示例中,系统100 实现数字助理。术语“数字助理”、“虚拟助理”、“智能自动化助理” 或“自动数字助理”是指解译口头形式和/或文本形式的自然语言输入来推 断用户意图并基于推断出的用户意图来执行动作的任何信息处理系统。例 如,为了作用于推断出的用户意图,系统执行下述步骤中的一个或多个: 识别具有设计用于实现推断出的用户意图的步骤和参数的任务流,根据推 断出的用户意图将特定要求输入到任务流中;通过调用程序、方法、服 务、API等来执行任务流;以及以可听(例如,言语)和/或可视形式来生 成对用户的输出响应。
具体地讲,数字助理能够接受至少部分地为自然语言命令、请求、声 明、讲述和/或询问的形式的用户请求。通常,用户请求要么寻求数字助理 作出信息性回答,要么寻求数字助理执行任务。对用户请求的令人满意的 响应包括提供所请求的信息性回答、执行所请求的任务或这两者的组合。 例如,用户向数字助理提出问题,诸如“我现在在哪里?”。基于用户的当 前位置,数字助理回答“你在中央公园西门附近。”用户还请求执行任 务,例如“请邀请我的朋友们下周来参加我女朋友的生日聚会。”作为响 应,数字助理可通过讲出“好的,马上”来确认请求,并然后代表用户将 合适的日历邀请发送至用户的电子通讯录中列出的用户朋友中的每个朋 友。在执行所请求的任务期间,数字助理有时在很长时间段内在涉及多次 信息交换的持续对话中与用户进行交互。存在与数字助理进行交互以请求 信息或执行各种任务的许多其他方法。除提供口头响应并采取经编程的行 动之外,数字助理还提供其他视频或音频形式的响应,例如作为文本、警 报、音乐、视频、动画等。
如图1所示,在一些示例中,数字助理根据客户端-服务器模型来实 现。数字助理包括在用户设备104上执行的客户端侧部分102(后文称作 “DA客户端102”)以及在服务器系统108上执行的服务器侧部分106 (后文称作“DA服务器106”)。DA客户端102通过一个或多个网络110 与DA服务器106通信。DA客户端102提供客户端侧功能,诸如面向用户 的输入和输出处理,以及与DA服务器106通信。DA服务器106为各自位 于相应用户设备104上的任意数量的DA客户端102提供服务器侧功能。
在一些示例中,DA服务器106包括面向客户端的I/O接口112、一个 或多个处理模块114、数据与模型116,以及到外部服务的I/O接口118。 面向客户端的I/O接口112有利于DA服务器106的面向客户端的输入和输 出处理。一个或多个处理模块114利用数据与模型116来处理言语输入, 并基于自然语言输入来确定用户意图。此外,一个或多个处理模块114基 于推断出的用户意图来进行任务执行。在一些示例中,DA服务器106通过 一个或多个网络110与外部服务120通信以完成任务或采集信息。到外部 服务的I/O接口118促成此类通信。
用户设备104可以是任何合适的电子设备。在一些示例中,用户设备 是便携式多功能设备(例如,下面参考图2A描述的设备200)、多功能设 备(例如,下面参考图4描述的设备400)或个人电子设备(例如,下面参 考图6A至图6B描述的设备600)。便携式多功能设备是例如还包含诸如 PDA和/或音乐播放器功能的其他功能的移动电话。便携式多功能设备的特 定示例包括来自Apple Inc.(Cupertino,California)的iPod 设备。便携式多功能设备的其他示例包括但不限于膝上型电脑或平板 电脑。此外,在一些示例中,用户设备104是非便携式多功能设备。具体 地讲,用户设备104是台式计算机、游戏机、电视或电视机顶盒。在一些 示例中,用户设备104包括触敏表面(例如,触摸屏显示器和/或触控 板)。此外,用户设备104任选地包括一个或多个其他物理用户接口设 备,诸如物理键盘、鼠标和/或操纵杆。下文更详细地描述了电子设备诸如 多功能设备的各种示例。
一个或多个通信网络110的示例包括局域网(LAN)和广域网(WAN), 例如互联网。一个或多个通信网络110使用任何已知的网络协议来实现, 包括各种有线或无线协议,诸如以太网、通用串行总线(USB)、火线 (FIREWIRE)、全球移动通信系统(GSM)、增强型数据GSM环境(EDGE)、 码分多址(CDMA)、时分多址(TDMA)、蓝牙、Wi-Fi、互联网协议语音(VoIP)、Wi-MAX或任何其他合适的通信协议。
服务器系统108在一个或多个独立式数据处理设备或分布式计算机网 络上实现。在一些示例中,服务器系统108还采用第三方服务提供方(例 如,第三方云服务提供方)的各种虚拟设备和/或服务来提供服务器系统 108的潜在计算资源和/或基础结构资源。
在一些示例中,用户设备104经由第二用户设备122与DA服务器106 通信。第二用户设备122与用户设备104相似或相同。例如,第二用户设 备122类似于下文参考图2A、图4和图6A至图6B描述的设备200、设备 400或设备600。用户设备104被配置为经由直接通信连接诸如蓝牙、 NFC、BTLE等或者经由有线或无线网络诸如局域Wi-Fi网络而通信耦接到 第二用户设备122。在一些示例中,第二用户设备122被配置为充当用户设 备104与DA服务器106之间的代理。例如,用户设备104的DA客户端 102被配置为经由第二用户设备122向DA服务器106传输信息(例如,在 用户设备104处接收的用户请求)。DA服务器106处理该信息,并经由第 二用户设备122将相关数据(例如,响应于用户请求的数据内容)返回到 用户设备104。
在一些示例中,用户设备104可被配置为将针对数据的缩略请求发送 到第二用户设备122,以减少从用户设备104传输的信息量。第二用户设备 122被配置为确定添加到缩略请求的补充信息,以生成完整的请求来传输到 DA服务器106。该系统架构可有利地通过使用具有较强通信能力和/或电池 电力的第二用户设备122(例如,移动电话、膝上型计算机、平板电脑等) 作为至DA服务器106的代理而允许具有有限通信能力和/或有限电池电力的用户设备104(例如,手表或类似的紧凑型电子设备)访问由DA服务器 106所提供的服务。虽然图1中仅示出两个用户设备104和122,但应当理 解,在一些示例中,系统100可包括在此代理配置中被配置为与DA服务 器系统106通信的任意数量和类型的用户设备。
虽然图1中所示的数字助理包括客户端侧部分(例如,DA客户端 102)和服务器侧部分(例如,DA服务器106)两者,但在一些示例中, 数字助理的功能被实现为被安装在用户设备上的独立式应用程序。此外, 数字助理的客户端部分与服务器部分之间的功能划分在不同的具体实施中 可变化。例如,在一些示例中,DA客户端为仅提供面向用户的输入和输出 处理功能,并将数字助理的所有其他功能委派给后端服务器的瘦客户端。
2.电子设备
现在将注意力转至用于实现数字助理的客户端侧部分的电子设备的实 施方案。图2A是示出了根据一些实施方案的具有触敏显示系统212的便携 式多功能设备200的框图。触敏显示器212有时为了方便被叫做“触摸 屏”,并且有时可被称为或被叫做“触敏显示器系统”。设备200包括存 储器202(其任选地包括一个或多个计算机可读存储介质)、存储器控制器 222、一个或多个处理单元(CPU)220、外围设备接口218、RF电路208、音 频电路210、扬声器211、麦克风213、输入/输出(I/O)子系统206、其他输 入控制设备216、和外部端口224。设备200任选地包括一个或多个光学传 感器264。设备200任选地包括用于检测设备200(例如,触敏表面,诸如 设备200的触敏显示器系统212)上的接触的强度的一个或多个接触强度传 感器265。设备200任选地包括用于在设备200上生成触觉输出的一个或多 个触觉输出发生器267(例如在触敏表面诸如设备200的触敏显示器系统 212或设备400的触摸板455上生成触觉输出)。这些部件任选地通过一个 或多个通信总线或信号线203进行通信。
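As used in the specification and claims, the term "intensity" of a contact on a touch-sensitive surface refers to the force or pressure (force per unit area) of a contact (e.g., a finger contact) on the touch-sensitive surface, or to a substitute (proxy) for the force or pressure of a contact on the touch-sensitive surface. The intensity of a contact has a range of values that includes at least four distinct values and more typically includes hundreds of distinct values (e.g., at least 256). The intensity of a contact is, optionally, determined (or measured) using various approaches and various sensors or combinations of sensors. For example, one or more force sensors underneath or adjacent to the touch-sensitive surface are, optionally, used to measure force at various points on the touch-sensitive surface. In some implementations, force measurements from multiple force sensors are combined (e.g., as a weighted average) to determine an estimated contact force. Similarly, a pressure-sensitive tip of a stylus is, optionally, used to determine the pressure of the stylus on the touch-sensitive surface. Alternatively, the size of the contact area detected on the touch-sensitive surface and/or changes thereto, the capacitance of the touch-sensitive surface proximate to the contact and/or changes thereto, and/or the resistance of the touch-sensitive surface proximate to the contact and/or changes thereto are, optionally, used as a substitute for the force or pressure of the contact on the touch-sensitive surface. In some implementations, the substitute measurements for contact force or pressure are used directly to determine whether an intensity threshold has been exceeded (e.g., the intensity threshold is described in units corresponding to the substitute measurements). In some implementations, the substitute measurements are converted to an estimated force or pressure, and the estimated force or pressure is used to determine whether an intensity threshold has been exceeded (e.g., the intensity threshold is a pressure threshold measured in units of pressure). Using the intensity of a contact as an attribute of a user input gives the user access to additional device functionality that might otherwise be inaccessible on a reduced-size device with limited real estate for displaying affordances (e.g., on a touch-sensitive display) and/or receiving user input (e.g., via a touch-sensitive display, a touch-sensitive surface, or a physical/mechanical control such as a knob or a button).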
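A minimal sketch of the weighted-average combination and the threshold comparison mentioned above. The function names, the weights, and the units are hypothetical; the text does not specify how weights are chosen.

```python
from typing import Sequence


def estimated_contact_force(readings: Sequence[float],
                            weights: Sequence[float]) -> float:
    """Combine per-sensor force readings into one estimate (weighted average).

    The weights are an assumption, e.g. larger for sensors nearer the contact.
    """
    if not readings or len(readings) != len(weights):
        raise ValueError("need one weight per reading")
    total_weight = sum(weights)
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive value")
    return sum(r * w for r, w in zip(readings, weights)) / total_weight


def exceeds_intensity_threshold(force: float, threshold: float) -> bool:
    """True when the estimate is above the threshold (same units assumed)."""
    return force > threshold
```

Because the threshold is compared in software, it could be adjusted without changing the sensor hardware, which matches the later remark that intensity thresholds may be set by software parameters.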
如本说明书和权利要求书中所使用的,术语“触觉输出”是指将由用 户利用用户的触感检测到的设备相对于设备的先前位置的物理位移、设备 的部件(例如,触敏表面)相对于设备的另一个部件(例如,外壳)的物 理位移、或部件相对于设备的质心的位移。例如,在设备或设备的部件与 用户的对触摸敏感的表面(例如,手指、手掌或用户手部的其他部分)接 触的情况下,通过物理位移生成的触觉输出将由用户解释为触感,该触感 对应于设备或设备部件的物理特征所感知的变化。例如,触敏表面(例 如,触敏显示器或触控板)的移动任选地由用户解释为对物理致动按钮的 “按下点击”或“松开点击”。在一些情况下,用户将感觉到触感,诸如 “按下点击”或“松开点击”,即使在通过用户的移动而物理地被按压(例如,被移位)的与触敏表面相关联的物理致动按钮没有移动时。作为 另一个示例,即使在触敏表面的光滑度无变化时,触敏表面的移动也将任 选地由用户解释为或感测为触敏表面的“粗糙度”。虽然由用户对触摸的 此类解释将受到用户的个体化感官知觉的限制,但是存在触摸的许多感官 知觉是大多数用户共有的。因此,当触觉输出被描述为对应于用户的特定 感官知觉(例如,“按下点击”、“松开点击”、“粗糙度”)时,除非 另外陈述,否则所生成的触觉输出对应于设备或其部件的物理位移,该物 理位移将会生成典型(或普通)用户的所述感官知觉。
应当理解,设备200仅是便携式多功能设备的一个示例,并且设备 200任选地具有比所示出的更多或更少的部件,任选地组合两个或更多个部 件,或者任选地具有这些部件的不同配置或布置。图2A中所示的各种部件 以硬件、软件、或硬件与软件两者的组合来实现,包括一个或多个信号处 理电路和/或专用集成电路。
存储器202包括一个或多个计算机可读存储介质。这些计算机可读存 储介质例如为有形的和非暂态的。存储器202包括高速随机存取存储器, 并且还包括非易失性存储器,诸如一个或多个磁盘存储设备、闪存存储器 设备或其他非易失性固态存储器设备。存储器控制器222控制设备200的 其他部件访问存储器202。
在一些示例中,存储器202的非暂态计算机可读存储介质用于存储指 令(例如,用于执行下文描述的过程的各方面)以供指令执行系统、装置 或设备诸如基于计算机的系统、包含处理器的系统或可从指令执行系统、 装置或设备取出指令并执行指令的其他系统使用或与其结合使用。在其他 示例中,指令(例如,用于执行下文描述的过程的各方面)存储在服务器 系统108的非暂态计算机可读存储介质(未示出)上,或在存储器202的 非暂态计算机可读存储介质与服务器系统108的非暂态计算机可读存储介 质之间划分。
外围设备接口218用于将该设备的输入和输出外围设备耦接到CPU 220和存储器202。一个或多个处理器220运行或执行存储在存储器202中 的各种软件程序和/或指令集以执行设备200的各种功能并处理数据。在一 些实施方案中,外围设备接口218、CPU 220和存储器控制器222在单个芯 片诸如芯片204上实现。在一些其他实施方案中,它们在独立的芯片上实 现。
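RF (radio frequency) circuitry 208 receives and sends RF signals, also called electromagnetic signals. RF circuitry 208 converts electrical signals to/from electromagnetic signals and communicates with communications networks and other communications devices via the electromagnetic signals. RF circuitry 208 optionally includes well-known circuitry for performing these functions, including but not limited to an antenna system, an RF transceiver, one or more amplifiers, a tuner, one or more oscillators, a digital signal processor, a CODEC chipset, a subscriber identity module (SIM) card, memory, and so forth. RF circuitry 208 optionally communicates by wireless communication with networks, such as the Internet (also referred to as the World Wide Web (WWW)), an intranet, and/or a wireless network (such as a cellular telephone network, a wireless local area network (LAN), and/or a metropolitan area network (MAN)), and with other devices. RF circuitry 208 optionally includes well-known circuitry for detecting near field communication (NFC) fields, such as by a short-range communication radio. The wireless communication optionally uses any of a plurality of communications standards, protocols, and technologies, including but not limited to Global System for Mobile Communications (GSM), Enhanced Data GSM Environment (EDGE), high-speed downlink packet access (HSDPA), high-speed uplink packet access (HSUPA), Evolution, Data-Only (EV-DO), HSPA, HSPA+, Dual-Cell HSPA (DC-HSDPA), long term evolution (LTE), near field communication (NFC), wideband code division multiple access (W-CDMA), code division multiple access (CDMA), time division multiple access (TDMA), Bluetooth, Bluetooth Low Energy (BTLE), Wireless Fidelity (Wi-Fi) (e.g., IEEE 802.11a, IEEE 802.11b, IEEE 802.11g, IEEE 802.11n, and/or IEEE 802.11ac), voice over Internet Protocol (VoIP), Wi-MAX, a protocol for e-mail (e.g., Internet message access protocol (IMAP) and/or post office protocol (POP)), instant messaging (e.g., extensible messaging and presence protocol (XMPP), Session Initiation Protocol for Instant Messaging and Presence Leveraging Extensions (SIMPLE), Instant Messaging and Presence Service (IMPS)), and/or Short Message Service (SMS), or any other suitable communication protocol, including communication protocols not yet developed as of the filing date of this document.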
音频电路210、扬声器211和麦克风213提供用户和设备200之间的音 频接口。音频电路210从外围设备接口218接收音频数据,将音频数据转 换为电信号,并将电信号传输到扬声器211。扬声器211将电信号转换为人 类可听的声波。音频电路210还接收由麦克风213根据声波转换的电信 号。音频电路210将电信号转换为音频数据,并将音频数据传输到外围设 备接口218,以用于处理。音频数据通过外围设备接口218检索自和/或传 输至存储器202和/或RF电路208。在一些实施方案中,音频电路210还包 括耳麦插孔(例如,图3中的312)。耳麦插孔提供音频电路210与可移除 的音频输入/输出外围设备之间的接口,该外围设备诸如仅输出的耳机或者 具有输出(例如,单耳或双耳耳机)和输入(例如,麦克风)二者的耳 麦。
I/O子系统206将设备200上的输入/输出外围设备诸如触摸屏212和 其他输入控制设备216耦接到外围设备接口218。I/O子系统206任选地包 括显示控制器256、光学传感器控制器258、强度传感器控制器259、触觉 反馈控制器261,以及用于其他输入或控制设备的一个或多个输入控制器 260。该一个或多个输入控制器260从其他输入控制设备216接收电信号/将 电信号发送到其他输入控制设备216。其他输入控制设备216任选地包括物 理按钮(例如,下压按钮、摇臂按钮等)、拨号盘、滑动开关、操纵杆、 点击轮等等。在一些另选的实施方案中,一个或多个输入控制器260任选 地耦接到以下各项中的任一者(或不耦接到以下各项中的任一者):键 盘、红外线端口、USB端口以及指针设备诸如鼠标。一个或多个按钮(例 如,图3中的308)任选地包括用于扬声器211和/或麦克风213的音量控 制的增大/减小按钮。一个或多个按钮任选地包括下压按钮(例如,图3中 的306)。
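A quick press of the push button can disengage a lock of touch screen 212 or begin a process that uses gestures on the touch screen to unlock the device, as described in U.S. Patent Application No. 11/322,549, "Unlocking a Device by Performing Gestures on an Unlock Image," filed December 23, 2005 (U.S. Patent No. 7,657,849), which is hereby incorporated by reference in its entirety. A longer press of the push button (e.g., 306) turns power to device 200 on or off. The user is able to customize the functionality of one or more of the buttons. Touch screen 212 is used to implement virtual or soft buttons and one or more soft keyboards.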
触敏显示器212提供设备和用户之间的输入接口和输出接口。显示控 制器256从触摸屏212接收电信号和/或将电信号发送至触摸屏212。触摸 屏212向用户显示视觉输出。视觉输出包括图形、文本、图标、视频及其 任何组合(统称为“图形”)。在一些实施方案中,一些视觉输出或全部 视觉输出对应于用户界面对象。
触摸屏212具有基于触觉和/或触感接触来接受来自用户的输入的触敏 表面、传感器或传感器组。触摸屏212和显示控制器256(与存储器202中 的任何相关联的模块和/或指令集一起)检测触摸屏212上的接触(和该接 触的任何移动或中断),并且将所检测到的接触转换为与显示在触摸屏212 上的用户界面对象(例如,一个或多个软键、图标、网页或图像)的交 互。在一个示例性实施方案中,触摸屏212和用户之间的接触点与用户的 手指对应。
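Touch screen 212 uses LCD (liquid crystal display) technology, LPD (light-emitting polymer display) technology, or LED (light-emitting diode) technology, although other display technologies may be used in other embodiments. Touch screen 212 and display controller 256 detect contact and any movement or breaking thereof using any of a plurality of touch sensing technologies now known or later developed, including but not limited to capacitive, resistive, infrared, and surface acoustic wave technologies, as well as other proximity sensor arrays or other elements for determining one or more points of contact with touch screen 212. In an exemplary embodiment, projected mutual capacitance sensing technology is used, such as that found in devices from Apple Inc. (Cupertino, California), including the iPod.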
在一些实施方案中,触摸屏212的触敏显示器类似于下文美国专利: 6,323,846(Westerman等人)、6,570,557(Westerman等人)和/或6,677,932 (Westerman)和/或美国专利公告2002/0015024A1中所述的多触敏触摸板, 这些专利申请均据此全文以引用方式并入本文。然而,触摸屏212显示来 自设备200的视觉输出,而触敏触摸板不提供视觉输出。
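In some embodiments, the touch-sensitive display of touch screen 212 is as described in the following applications: (1) U.S. Patent Application No. 11/381,313, "Multipoint Touch Surface Controller," filed May 2, 2006; (2) U.S. Patent Application No. 10/840,862, "Multipoint Touchscreen," filed May 6, 2004; (3) U.S. Patent Application No. 10/903,964, "Gestures For Touch Sensitive Input Devices," filed July 30, 2004; (4) U.S. Patent Application No. 11/048,264, "Gestures For Touch Sensitive Input Devices," filed January 31, 2005; (5) U.S. Patent Application No. 11/038,590, "Mode-Based Graphical User Interfaces For Touch Sensitive Input Devices," filed January 18, 2005; (6) U.S. Patent Application No. 11/228,758, "Virtual Input Device Placement On A Touch Screen User Interface," filed September 16, 2005; (7) U.S. Patent Application No. 11/228,700, "Operation Of A Computer With A Touch Screen Interface," filed September 16, 2005; (8) U.S. Patent Application No. 11/228,737, "Activating Virtual Keys Of A Touch-Screen Virtual Keyboard," filed September 16, 2005; and (9) U.S. Patent Application No. 11/367,749, "Multi-Functional Hand-Held Device," filed March 3, 2006. All of these applications are incorporated by reference herein in their entirety.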
触摸屏212例如具有超过100dpi的视频分辨率。在一些实施方案中, 触摸屏具有约160dpi的视频分辨率。用户使用任何合适的对象或附加物诸 如触笔、手指等与触摸屏212进行接触。在一些实施方案中,将用户界面 设计用来主要与基于手指的接触和手势一起工作,由于手指在触摸屏上的 接触面积较大,因此这可能不如基于触笔的输入那样精确。在一些实施方 案中,设备将基于手指的粗略输入转化为精确的指针/光标位置或命令,以 用于执行用户所期望的动作。
在一些实施方案中,除了触摸屏之外,设备200还包括用于激活或去 激活特定功能的触控板(未示出)。在一些实施方案中,触控板是设备的 触敏区域,该触敏区域与触摸屏不同,其不显示视觉输出。触控板是与触 摸屏212分开的触敏表面,或者是由触摸屏形成的触敏表面的延伸。
设备200还包括用于为各种部件供电的电力系统262。电力系统262包 括电力管理系统、一个或多个电源(例如,电池、交流电(AC))、再充电 系统、电力故障检测电路、功率转换器或逆变器、电力状态指示器(例 如,发光二极管(LED))和与便携式设备中电力的生成、管理和分配相关联 的任何其他部件。
设备200还包括一个或多个光学传感器264。图2A示出了耦接至I/O 子系统206中的光学传感器控制器258的光学传感器。光学传感器264包 括电荷耦合器件(CCD)或互补金属氧化物半导体(CMOS)光电晶体管。光学 传感器264从环境接收通过一个或多个透镜而投射的光,并且将光转换为 表示图像的数据。结合成像模块243(也叫做相机模块),光学传感器264 捕获静态图像或视频。在一些实施方案中,光学传感器位于设备200的后 部,与设备前部的触摸屏显示器212相背对,使得触摸屏显示器被用作用 于静态图像和/或视频图像采集的取景器。在一些实施方案中,光学传感器 位于设备的前部,使得在用户在触摸屏显示器上查看其他视频会议参与者 的同时获得该用户的图像以用于视频会议。在一些实施方案中,光学传感 器264的位置可由用户改变(例如,通过旋转设备外壳中的透镜和传感器),使得单个光学传感器264与触摸屏显示器一起使用,以用于视频会 议和静态图像和/或视频图像采集两者。
设备200任选地还包括一个或多个接触强度传感器265。图2A示出了 耦接至I/O子系统206中的强度传感器控制器259的接触强度传感器。接触 强度传感器265任选地包括一个或多个压阻应变仪、电容式力传感器、电 气力传感器、压电力传感器、光学力传感器、电容式触敏表面或其他强度 传感器(例如,用于测量触敏表面上的接触的力(或压力)的传感器)。 接触强度传感器265从环境接收接触强度信息(例如,压力信息或压力信 息的代用物)。在一些实施方案中,至少一个接触强度传感器与触敏表面 (例如,触敏显示系统212)并置排列或邻近。在一些实施方案中,至少一 个接触强度传感器位于设备200的后部上,与位于设备200的前部上的触 摸屏显示器212相背对。
设备200还包括一个或多个接近传感器266。图2A示出了耦接至外围 设备接口218的接近传感器266。另选地,接近传感器266耦接到I/O子系 统206中的输入控制器260。接近传感器266如名称为“Proximity Detector In Handheld Device”的美国专利申请11/241,839;名称为“Proximity Detector In Handheld Device”的美国专利申请11/240,788;名称为“Using Ambient Light Sensor To Augment Proximity Sensor Output”的美国专利申请 11/620,702;名称为“Automated Response To And Sensing Of User Activity InPortable Devices”的美国专利申请11/586,862;和名称为“Methods And Systems ForAutomatic Configuration Of Peripherals”的美国专利申请 11/638,251中所述的那样执行,这些专利申请据此全文以引用方式并入。在 一些实施方案中,当多功能设备被置于用户的耳朵附近时(例如,当用户 正在进行电话呼叫时),接近传感器关闭并且禁用触摸屏212。
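Device 200 optionally also includes one or more tactile output generators 267. FIG. 2A shows a tactile output generator coupled to haptic feedback controller 261 in I/O subsystem 206. Tactile output generator 267 optionally includes one or more electroacoustic devices, such as speakers or other audio components, and/or electromechanical devices that convert energy into linear motion, such as a motor, solenoid, electroactive polymer, piezoelectric actuator, electrostatic actuator, or other tactile output generating component (e.g., a component that converts electrical signals into tactile outputs on the device). Tactile output generator 267 receives tactile feedback generation instructions from haptic feedback module 233 and generates tactile outputs on device 200 that are capable of being sensed by a user of device 200. In some embodiments, at least one tactile output generator is collocated with, or proximate to, a touch-sensitive surface (e.g., touch-sensitive display system 212) and, optionally, generates a tactile output by moving the touch-sensitive surface vertically (e.g., in/out of a surface of device 200) or laterally (e.g., back and forth in the same plane as a surface of device 200). In some embodiments, at least one tactile output generator is located on the back of device 200, opposite touch screen display 212, which is located on the front of device 200.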
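Device 200 also includes one or more accelerometers 268. FIG. 2A shows accelerometer 268 coupled to peripherals interface 218. Alternatively, accelerometer 268 is coupled to an input controller 260 in I/O subsystem 206. Accelerometer 268 performs as described in U.S. Patent Publication No. 20050190059, "Acceleration-based Theft Detection System for Portable Electronic Devices," and U.S. Patent Publication No. 20060017692, "Methods And Apparatuses For Operating A Portable Device Based On An Accelerometer," both of which are incorporated by reference herein in their entirety. In some embodiments, information is displayed on the touch screen display in a portrait view or a landscape view based on an analysis of data received from the one or more accelerometers. Device 200 optionally includes, in addition to accelerometer(s) 268, a magnetometer (not shown) and a GPS (or GLONASS or other global navigation system) receiver (not shown) for obtaining information concerning the location and orientation (e.g., portrait or landscape) of device 200.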
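One way the portrait/landscape choice could be derived from accelerometer data is sketched below. The text only says the view is chosen "based on an analysis" of the data, so the rule here is an assumption: gravity is compared along two device axes, with the y axis taken to run along the long edge of the device. The function name and axis convention are hypothetical.

```python
def view_orientation(ax: float, ay: float) -> str:
    """Pick a view from the gravity components along the device's x and y axes.

    Assumes y runs along the long edge: if gravity pulls mostly along y,
    the device is held upright (portrait); otherwise it is on its side.
    """
    return "portrait" if abs(ay) >= abs(ax) else "landscape"
```

A practical implementation would likely also filter noise and add hysteresis so the view does not flip back and forth near the diagonal; that is omitted here.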
在一些实施方案中,存储于存储器202中的软件部件包括操作系统 226、通信模块(或指令集)228、接触/运动模块(或指令集)230、图形模 块(或指令集)232、文本输入模块(或指令集)234、全球定位系统(GPS) 模块(或指令集)235、数字助理客户端模块229以及应用程序(或指令 集)236。此外,存储器202存储数据与模型,诸如用户数据与模型231。 此外,在一些实施方案中,存储器202(图2A)或470(图4)存储设备/ 全局内部状态257,如图2A和图4中所示。设备/全局内部状态257包括以 下各项中的一者或多者:活动应用程序状态,用于指示哪些应用程序(如 果有的话)当前处于活动态;显示状态,用于指示什么应用程序、视图或 其它信息占据触摸屏显示器212的各个区域;传感器状态,包括从设备的 各个传感器和输入控制设备216获得的信息;以及关于设备的位置和/或姿 态的位置信息。
操作系统226(例如,Darwin、RTXC、LINUX、UNIX、OS X、 iOS、WINDOWS、或嵌入式操作系统诸如VxWorks)包括用于控制和管理 一般系统任务(例如,存储器管理、存储设备控制、功率管理等)的各种 软件部件和/或驱动程序,并且促进各种硬件部件和软件部件之间的通信。
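Communication module 228 facilitates communication with other devices over one or more external ports 224 and also includes various software components for handling data received by RF circuitry 208 and/or external port 224. External port 224 (e.g., Universal Serial Bus (USB), FIREWIRE, etc.) is adapted for coupling directly to other devices or indirectly over a network (e.g., the Internet, wireless LAN, etc.). In some embodiments, the external port is a multi-pin (e.g., 30-pin) connector that is the same as, or similar to and/or compatible with, the 30-pin connector used on iPod (trademark of Apple Inc.) devices.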
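Contact/motion module 230 optionally detects contact with touch screen 212 (in conjunction with display controller 256) and other touch-sensitive devices (e.g., a touchpad or physical click wheel). Contact/motion module 230 includes various software components for performing various operations related to detection of contact, such as determining if contact has occurred (e.g., detecting a finger-down event), determining an intensity of the contact (e.g., the force or pressure of the contact or a substitute for the force or pressure of the contact), determining if there is movement of the contact and tracking the movement across the touch-sensitive surface (e.g., detecting one or more finger-dragging events), and determining if the contact has ceased (e.g., detecting a finger-up event or a break in contact). Contact/motion module 230 receives contact data from the touch-sensitive surface. Determining movement of the point of contact, which is represented by a series of contact data, optionally includes determining speed (magnitude), velocity (magnitude and direction), and/or an acceleration (a change in magnitude and/or direction) of the point of contact. These operations are, optionally, applied to single contacts (e.g., one finger contacts) or to multiple simultaneous contacts (e.g., "multitouch"/multiple finger contacts). In some embodiments, contact/motion module 230 and display controller 256 detect contact on a touchpad.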
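The distinction between speed, velocity, and acceleration of the point of contact can be made concrete with finite differences over a series of contact samples. This is a sketch under assumptions: the `ContactSample` type, the time units, and the finite-difference scheme are all illustrative and are not taken from the text.

```python
import math
from typing import NamedTuple, Tuple


class ContactSample(NamedTuple):
    t: float  # timestamp in seconds (assumed unit)
    x: float  # position on the touch-sensitive surface
    y: float


def velocity(a: ContactSample, b: ContactSample) -> Tuple[float, float]:
    """Velocity (magnitude and direction) as a vector between two samples."""
    dt = b.t - a.t
    if dt <= 0:
        raise ValueError("samples must be in increasing time order")
    return ((b.x - a.x) / dt, (b.y - a.y) / dt)


def speed(a: ContactSample, b: ContactSample) -> float:
    """Speed is the magnitude of the velocity vector."""
    return math.hypot(*velocity(a, b))


def acceleration(a: ContactSample, b: ContactSample,
                 c: ContactSample) -> Tuple[float, float]:
    """Change in velocity across three samples, per unit time."""
    v1 = velocity(a, b)
    v2 = velocity(b, c)
    dt = (c.t - a.t) / 2.0  # time between the midpoints of the two intervals
    return ((v2[0] - v1[0]) / dt, (v2[1] - v1[1]) / dt)
```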
在一些实施方案中,接触/运动模块230使用一组一个或多个强度阈值 来确定操作是否已由用户执行(例如,确定用户是否已“点击”图标)。 在一些实施方案中,根据软件参数来确定强度阈值的至少一个子集(例 如,强度阈值不是由特定物理致动器的激活阈值来确定的,并且可在不改 变设备200的物理硬件的情况下被调节)。例如,在不改变触控板或触摸 屏显示器硬件的情况下,触控板或触摸屏显示器的鼠标“点击”阈值可被 设定成预定义的阈值的大范围中的任一个阈值。另外,在一些具体实施 中,向设备的用户提供用于调节一组强度阈值中的一个或多个强度阈值 (例如,通过调节各个强度阈值和/或通过利用对“强度”参数的系统级点 击来一次调节多个强度阈值)的软件设置。
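Contact/motion module 230 optionally detects a gesture input by a user. Different gestures on the touch-sensitive surface have different contact patterns (e.g., different motions, timings, and/or intensities of detected contacts). Thus, a gesture is, optionally, detected by detecting a particular contact pattern. For example, detecting a finger tap gesture includes detecting a finger-down event followed by detecting a finger-up (liftoff) event at the same position (or substantially the same position) as the finger-down event (e.g., at the position of an icon). As another example, detecting a finger swipe gesture on the touch-sensitive surface includes detecting a finger-down event followed by detecting one or more finger-dragging events, and subsequently followed by detecting a finger-up (liftoff) event.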
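The two contact patterns described above (tap and swipe) can be sketched as a small classifier over a sequence of finger events. The event tuple layout, the event names, and the `tap_slop` tolerance used for "substantially the same position" are hypothetical.

```python
import math
from typing import List, Tuple

# (kind, x, y) where kind is "down", "drag", or "up" (assumed encoding)
Event = Tuple[str, float, float]


def classify_gesture(events: List[Event], tap_slop: float = 10.0) -> str:
    """Return "tap", "swipe", or "unknown" for one finger's event sequence."""
    if len(events) < 2 or events[0][0] != "down" or events[-1][0] != "up":
        return "unknown"
    if any(kind != "drag" for kind, _, _ in events[1:-1]):
        return "unknown"
    _, x0, y0 = events[0]
    _, x1, y1 = events[-1]
    moved = math.hypot(x1 - x0, y1 - y0)
    has_drag = len(events) > 2
    if has_drag:
        # finger-down, one or more finger-dragging events, then finger-up
        return "swipe"
    if moved <= tap_slop:
        # finger-down then finger-up at substantially the same position
        return "tap"
    return "unknown"
```

Timing and intensity, which the text also lists as parts of a contact pattern, are left out of this sketch.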
图形模块232包括用于在触摸屏212或其他显示器上呈现和显示图形 的各种已知的软件部件,包括用于改变所显示的图形的视觉冲击(例如, 亮度、透明度、饱和度、对比度或其他视觉特征)的部件。如本文所用, 术语“图形”包括可被显示给用户的任何对象,非限制性地包括文本、网 页、图标(诸如,包括软键的用户界面对象)、数字图像、视频、动画 等。
在一些实施方案中,图形模块232存储用于表示待使用图形的数据。 每个图形任选地被分配有对应的代码。图形模块232从应用程序等接收用 于指定待显示的图形的一个或多个代码,在必要的情况下还接收坐标数据 和其他图形属性数据,并且然后生成屏幕图像数据,以输出至显示控制器 256。
触觉反馈模块233包括用于生成指令的各种软件部件,该指令由一个 或多个触觉输出发生器267使用,以便响应于用户与设备200的交互而在 设备200上的一个或多个位置处产生触觉输出。
在一些示例中作为图形模块232的部件的文本输入模块234提供用于 在各种应用程序(例如,联系人237、电子邮件240、IM 241、浏览器247 和需要文本输入的任何其他应用程序)中输入文本的软键盘。
GPS模块235确定设备的位置并将该信息提供用于各种应用程序中(例 如,提供给电话238以用于基于位置的拨号;提供给相机243用作图片/视 频元数据;以及提供给提供基于位置的服务的应用程序,诸如天气桌面小 程序、本地黄页桌面小程序和地图/导航桌面小程序)。
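Digital assistant client module 229 includes various client-side digital assistant instructions to provide the client-side functionalities of the digital assistant. For example, digital assistant client module 229 is capable of accepting voice input (e.g., speech input), text input, touch input, and/or gestural input through various user interfaces (e.g., microphone 213, accelerometer(s) 268, touch-sensitive display system 212, optical sensor(s) 264, other input control devices 216, etc.) of portable multifunction device 200. Digital assistant client module 229 is also capable of providing output in audio (e.g., speech output), visual, and/or tactile forms through various output interfaces (e.g., speaker 211, touch-sensitive display system 212, tactile output generator(s) 267, etc.) of portable multifunction device 200. For example, output is provided as voice, sound, alerts, text messages, menus, graphics, videos, animations, vibrations, and/or combinations of two or more of the above. During operation, digital assistant client module 229 communicates with DA server 106 using RF circuitry 208.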
用户数据与模型231包括与用户相关联的各种数据(例如,用户特定 的词汇数据、用户偏好数据、用户指定的名称发音、来自用户电子通讯录 的数据、待办事项、购物清单等)以提供数字助理的客户端侧功能。此 外,用户数据与模型231包括用于处理用户输入并且确定用户意图的各种 模型(例如,言语识别模型、统计语言模型、自然语言处理模型、知识本 体、任务流模型、服务模型等)。
在一些示例中,数字助理客户端模块229利用便携式多功能设备200 的各种传感器、子系统和外围设备来从便携式多功能设备200的周围环境 采集附加信息,以建立与用户、当前用户交互和/或当前用户输入相关联的 上下文。在一些示例中,数字助理客户端模块229将上下文信息或其子集 与用户输入一起提供至DA服务器106以帮助推断用户意图。在一些示例 中,数字助理还使用上下文信息来确定如何准备输出并将其传送给用户。 上下文信息被称为上下文数据。
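In some examples, the contextual information that accompanies the user input includes sensor information, e.g., lighting, ambient noise, ambient temperature, images or videos of the surrounding environment, etc. In some examples, the contextual information can also include the physical state of the device, e.g., device orientation, device location, device temperature, power level, speed, acceleration, motion patterns, cellular signal strength, etc. In some examples, information related to the software state of portable multifunction device 200, e.g., running processes, installed programs, past and present network activities, background services, error logs, resource usage, etc., is provided to DA server 106 as contextual information associated with a user input.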
在一些示例中,数字助理客户端模块229响应于来自DA服务器106 的请求而选择性地提供存储在便携式多功能设备200上的信息(例如,用 户数据231)。在一些示例中,数字助理客户端模块229还在DA服务器 106请求时引出来自用户经由自然语言对话或其他用户接口的附加输入。数 字助理客户端模块229将该附加输入传送至DA服务器106,以帮助DA服 务器106进行意图推断和/或满足在用户请求中表达的用户意图。
下面参考图7A-图7C对数字助理进行更详细的描述。应当认识到,数 字助理客户端模块229可包括下文所述的数字助理模块726的任何数量的 子模块。
应用程序236包括以下模块(或指令集)或者其子集或超集:
·联系人模块237(有时称为通讯录或联系人列表);
·电话模块238;
·视频会议模块239;
·电子邮件客户端模块240;
·即时消息(IM)模块241;
·健身支持模块242;
·用于静态图像和/或视频图像的相机模块243;
·图像管理模块244;
·视频播放器模块;
·音乐播放器模块;
·浏览器模块247;
·日历模块248;
·桌面小程序模块249,其在一些示例中包括以下各项中的一者或多 者:天气桌面小程序249-1、股票桌面小程序249-2、计算器桌面小 程序249-3、闹钟桌面小程序249-4、词典桌面小程序249-5和用户 获得的其他桌面小程序,以及用户创建的桌面小程序249-6;
·用于制作用户创建的桌面小程序249-6的桌面小程序创建器模块 250;
·搜索模块251;
·视频和音乐播放器模块252,其合并视频播放器模块和音乐播放器 模块;
·记事本模块253;
·地图模块254;和/或
·在线视频模块255。
存储在存储器202中的其他应用程序236的示例包括其他文字处理应 用程序、其他图像编辑应用程序、绘图应用程序、呈现应用程序、支持 JAVA的应用程序、加密、数字版权管理、声音识别和声音复制。
结合触摸屏212、显示控制器256、接触/运动模块230、图形模块 232、和文本输入模块234,联系人模块237用于管理通讯录或联系人列表 (例如,存储在存储器202或存储器470中的联系人模块237的应用程序 内部状态292中),包括:将一个或多个姓名添加到通讯录;从通讯录删 除一个或多个姓名;将一个或多个电话号码、一个或多个电子邮件地址、一个或多个物理地址或其他信息与姓名相关联;将图像与姓名进行关联; 对姓名进行分类和排序;提供电话号码或电子邮件地址来发起和/或促进通 过电话238、视频会议模块239、电子邮件240或即时消息241的通信;等 等。
结合RF电路208、音频电路210、扬声器211、麦克风213、触摸屏 212、显示控制器256、接触/运动模块230、图形模块232和文本输入模块 234,电话模块238用于输入对应于电话号码的字符序列、访问联系人模块 237中的一个或多个电话号码、修改已经输入的电话号码、拨打相应的电话 号码、进行会话以及当会话完成时断开或挂断。如上所述,无线通信使用 多种通信标准、协议和技术中的任一种。
结合RF电路208、音频电路210、扬声器211、麦克风213、触摸屏 212、显示控制器256、光学传感器264、光学传感器控制器258、接触/运 动模块230、图形模块232、文本输入模块234、联系人模块237和电话模 块238,视频会议模块239包括根据用户指令来发起、进行和终止用户与一 个或多个其他参与者之间的视频会议的可执行指令。
结合RF电路208、触摸屏212、显示控制器256、接触/运动模块 230、图形模块232和文本输入模块234,电子邮件客户端模块240包括响 应于用户指令来创建、发送、接收和管理电子邮件的可执行指令。结合图 像管理模块244,电子邮件客户端模块240使得非常容易创建和发送具有由 相机模块243拍摄的静态图像或视频图像的电子邮件。
结合RF电路208、触摸屏212、显示控制器256、接触/运动模块 230、图形模块232和文本输入模块234,即时消息模块241包括用于以下 操作的可执行指令:输入与即时消息对应的字符序列、修改先前输入的字 符、传输相应即时消息(例如,使用短消息服务(SMS)或多媒体消息服务 (MMS)协议以用于基于电话的即时消息或者使用XMPP、SIMPLE或IMPS 以用于基于互联网的即时消息)、接收即时消息以及查看所接收的即时消 息。在一些实施方案中,所传输和/或接收的即时消息包括图形、照片、音 频文件、视频文件和/或如MMS和/或增强型消息服务(EMS)中支持的其他 附件。如本文所用,“即时消息”是指基于电话的消息(例如,使用SMS 或MMS发送的消息)和基于互联网的消息(例如,使用XMPP、 SIMPLE、或IMPS发送的消息)两者。
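In conjunction with RF circuitry 208, touch screen 212, display controller 256, contact/motion module 230, graphics module 232, text input module 234, GPS module 235, map module 254, and the music player module, workout support module 242 includes executable instructions to create workouts (e.g., with time, distance, and/or calorie burning goals); communicate with workout sensors (sports devices); receive workout sensor data; calibrate sensors used to monitor a workout; select and play music for a workout; and display, store, and transmit workout data.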
结合触摸屏212、显示控制器256、一个或多个光学传感器264、光学 传感器控制器258、接触/运动模块230、图形模块232和图像管理模块 244,相机模块243包括用于以下操作的可执行指令:捕获静态图像或视频 (包括视频流)并且将它们存储到存储器202中、修改静态图像或视频的 特征,或从存储器202删除静态图像或视频。
结合触摸屏212、显示控制器256、接触/运动模块230、图形模块 232、文本输入模块234和相机模块243,图像管理模块244包括用于以下 操作的可执行指令:排列、修改(例如,编辑),或以其他方式操控、加 标签、删除、呈现(例如,在数字幻灯片或相册中),以及存储静态图像 和/或视频图像。
结合RF电路208、触摸屏212、显示控制器256、接触/运动模块 230、图形模块232和文本输入模块234,浏览器模块247包括根据用户指 令来浏览互联网(包括搜索、链接至、接收和显示网页或其部分,以及链 接至网页的附件和其他文件)的可执行指令。
结合RF电路208、触摸屏212、显示控制器256、接触/运动模块 230、图形模块232、文本输入模块234、电子邮件客户端模块240和浏览 器模块247,日历模块248包括根据用户指令来创建、显示、修改和存储日 历以及与日历相关联的数据(例如,日历条目、待办事项等)的可执行指 令。
结合RF电路208、触摸屏212、显示控制器256、接触/运动模块 230、图形模块232、文本输入模块234和浏览器模块247,桌面小程序模 块249是可由用户下载并使用的微型应用程序(例如,天气桌面小程序 249-1、股市桌面小程序249-2、计算器桌面小程序249-3、闹钟桌面小程序 249-4和词典桌面小程序249-5)或由用户创建的微型应用程序(例如,用 户创建的桌面小程序249-6)。在一些实施方案中,桌面小程序包括HTML (超文本标记语言)文件、CSS(层叠样式表)文件和JavaScript文件。在 一些实施方案中,桌面小程序包括XML(可扩展标记语言)文件和 JavaScript文件(例如,Yahoo!桌面小程序)。
结合RF电路208、触摸屏212、显示控制器256、接触/运动模块 230、图形模块232、文本输入模块234和浏览器模块247,桌面小程序创 建器模块250被用户用于创建桌面小程序(例如,使网页的用户指定部分 变成桌面小程序)。
结合触摸屏212、显示控制器256、接触/运动模块230、图形模块232 和文本输入模块234,搜索模块251包括根据用户指令来搜索存储器202中 的匹配一个或多个搜索条件(例如,一个或多个用户指定的搜索词)的文 本、音乐、声音、图像、视频和/或其他文件的可执行指令。
结合触摸屏212、显示控制器256、接触/运动模块230、图形模块 232、音频电路系统210、扬声器211、RF电路系统208和浏览器模块 247,视频和音乐播放器模块252包括允许用户下载和回放以一种或多种文 件格式(诸如MP3或AAC文件)存储的所记录的音乐和其他声音文件的 可执行指令,以及用于显示、呈现或以其他方式回放视频(例如,在触摸 屏212上或在经由外部端口224连接的外部显示器上)的可执行指令。在 一些实施方案中,设备200任选地包括MP3播放器诸如iPod(Apple Inc.的 商标)的功能。
结合触摸屏212、显示控制器256、接触/运动模块230、图形模块232 和文本输入模块234,记事本模块253包括根据用户指令来创建和管理记事 本、待办事项等的可执行指令。
结合RF电路208、触摸屏212、显示控制器256、接触/运动模块 230、图形模块232、文本输入模块234、GPS模块235和浏览器模块247, 地图模块254用于根据用户指令接收、显示、修改和存储地图以及与地图 相关联的数据(例如,驾驶方向、与特定位置处或附近的商店及其他兴趣 点有关的数据,以及其他基于位置的数据)。
结合触摸屏212、显示控制器256、接触/运动模块230、图形模块 232、音频电路210、扬声器211、RF电路208、文本输入模块234、电子 邮件客户端模块240和浏览器模块247,在线视频模块255包括允许用户访 问、浏览、接收(例如,通过流式传输和/或下载)、回放(例如,在触摸 屏上或经由外部端口224在所连接的外部显示器上)、发送具有至特定在线视频的链接的电子邮件,以及以其他方式管理一种或多种文件格式(诸 如,H.264)的在线视频的指令。在一些实施方案中,即时消息模块241而 不是电子邮件客户端模块240用于发送至特定在线视频的链接。在线视频 应用程序的附加描述可在于2007年6月20日提交的标题为“Portable Multifunction Device,Method,and Graphical User Interface forPlaying Online Videos”的美国临时专利申请60/936,562、和于2007年12月31日提交的标题为“Portable Multifunction Device,Method,and Graphical User Interface forPlaying Online Videos”的美国专利申请11/968,067中找到,这两个专利 申请的内容据此全文以引用方式并入本文。
上述模块和应用程序中的每个模块和应用程序对应于用于执行上述一 种或多种功能以及在该专利申请中所述的方法(例如,本文所述的计算机 实现的方法和其他信息处理方法)的可执行指令集。这些模块(例如,指 令集)不必被实现为独立的软件程序、过程或模块,并因此在各种实施方 案中可组合或以其他方式重新布置这些模块的各种子集。例如,视频播放 器模块可与音乐播放器模块组合成单个模块(例如,图2A中的视频和音乐 播放器模块252)。在一些实施方案中,存储器202存储上述模块和数据结 构的子集。此外,存储器202存储上文未描述的另外的模块和数据结构。
在一些实施方案中,设备200是该设备上的预定义的一组功能的操作 唯一地通过触摸屏和/或触控板来执行的设备。通过使用触摸屏和/或触控板 作为用于设备200的操作的主要输入控制设备,减少设备200上的物理输 入控制设备(诸如下压按钮、拨盘等)的数量。
唯一地通过触摸屏和/或触控板执行的该预定义的一组功能任选地包括 在用户界面之间进行导航。在一些实施方案中,触控板在被用户触摸时将 设备200从被显示在设备200上的任何用户界面导航到主菜单、home菜单 或根菜单。在此类实施方案中,使用触摸板来实现“菜单按钮”。在一些 其他实施方案中,菜单按钮是物理下压按钮或者其他物理输入控制设备, 而不是触控板。
图2B是示出了根据一些实施方案用于事件处理的示例性部件的框图。 在一些实施方案中,存储器202(图2A)或存储器470(图4)包括事件分 类器270(例如,在操作系统226中)以及相应的应用程序236-1(例如, 前述应用程序237至251、255、480至490中的任一个应用程序)。
事件分类器270接收事件信息并确定要将事件信息递送到的应用程序 236-1和应用程序236-1的应用程序视图291。事件分类器270包括事件监 视器271和事件分配器模块274。在一些实施方案中,应用程序236-1包括 应用程序内部状态292,该应用程序内部状态指示当应用程序是活动的或正 在执行时被显示在触敏显示器212上的一个或多个当前应用程序视图。在 一些实施方案中,设备/全局内部状态257被事件分类器270用于确定哪个(哪些)应用程序当前是活动的,并且应用程序内部状态292被事件分类 器270用于确定要将事件信息递送到的应用程序视图291。
在一些实施方案中,应用程序内部状态292包括附加信息,诸如以下 各项中的一者或多者:当应用程序236-1恢复执行时将被使用的恢复信息、 指示正被应用程序236-1显示的信息或准备好用于被应用程序236-1显示的 信息的用户界面状态信息、用于使得用户能够返回到应用程序236-1的前一 状态或视图的状态队列、以及用户采取的先前动作的重复/撤销队列。
事件监视器271从外围设备接口218接收事件信息。事件信息包括关 于子事件(例如,作为多点触摸手势一部分的触敏显示器212上的用户触 摸)的信息。外围设备接口218传输其从I/O子系统206或传感器诸如接近 传感器266、加速度计268和/或麦克风213(通过音频电路210)接收的信 息。外围设备接口218从I/O子系统206接收的信息包括来自触敏显示器 212或触敏表面的信息。
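In some embodiments, event monitor 271 sends requests to peripherals interface 218 at predetermined intervals. In response, peripherals interface 218 transmits event information. In other embodiments, peripherals interface 218 transmits event information only when there is a significant event (e.g., receiving an input above a predetermined noise threshold and/or for more than a predetermined duration).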
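The "significant event" filter in the second variant can be sketched as a simple predicate. The parameter names and default values are hypothetical, and the "and/or" in the text is read here as an inclusive or.

```python
def is_significant_event(amplitude: float, duration_s: float,
                         noise_threshold: float = 0.2,
                         min_duration_s: float = 0.05) -> bool:
    """True if an input is above the noise threshold or lasts long enough.

    Thresholds are illustrative placeholders, not values from the text.
    """
    return amplitude > noise_threshold or duration_s > min_duration_s
```

Under this reading, event information would be transmitted only for inputs where the predicate is true, rather than in reply to every polling request.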
在一些实施方案中,事件分类器270还包括命中视图确定模块272和/ 或活动事件识别器确定模块273。
当触敏显示器212显示多于一个视图时,命中视图确定模块272提供 用于确定子事件已在一个或多个视图内的何处发生的软件过程。视图由用 户能在显示器上看到的控件和其他元素构成。
与应用程序相关联的用户界面的另一方面是一组视图,在本文中有时 也被称为应用程序视图或用户界面窗口,在其中显示信息并且发生基于触 摸的手势。在其中检测到触摸的(相应应用程序的)应用程序视图对应于 应用程序的程序化分级结构或视图分级结构内的程序化水平。例如,在其 中检测到触摸的最低水平视图被称为命中视图,并且被认为是正确输入的 事件集至少部分地基于初始触摸的命中视图来确定,该初始触摸开始基于 触摸的手势。
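Hit view determination module 272 receives information related to sub-events of a touch-based gesture. When an application has multiple views organized in a hierarchy, hit view determination module 272 identifies a hit view as the lowest view in the hierarchy that should handle the sub-event. In most circumstances, the hit view is the lowest-level view in which an initiating sub-event occurs (e.g., the first sub-event in the sequence of sub-events that form an event or potential event). Once the hit view is identified by hit view determination module 272, the hit view typically receives all sub-events related to the same touch or input source for which it was identified as the hit view.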
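Finding the lowest view in a hierarchy that contains the initial touch can be sketched as a recursive search. The `View` class, its absolute-coordinate rectangles, and the rule that later subviews are checked first are assumptions made for illustration; the text does not describe a particular data structure.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class View:
    name: str
    x: float       # absolute coordinates are assumed for simplicity
    y: float
    width: float
    height: float
    subviews: List["View"] = field(default_factory=list)

    def contains(self, px: float, py: float) -> bool:
        return (self.x <= px < self.x + self.width
                and self.y <= py < self.y + self.height)


def hit_view(root: View, px: float, py: float) -> Optional[View]:
    """Return the lowest view in the hierarchy that contains the point."""
    if not root.contains(px, py):
        return None
    # Later subviews are assumed to be drawn on top, so check them first.
    for child in reversed(root.subviews):
        found = hit_view(child, px, py)
        if found is not None:
            return found
    return root
```

Once a hit view is found for the initiating sub-event, later sub-events from the same touch would be routed to that same view instead of repeating the search.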
活动事件识别器确定模块273确定视图分级结构内的哪个或哪些视图 应接收特定子事件序列。在一些实施方案中,活动事件识别器确定模块273 确定仅命中视图应接收特定子事件序列。在其他实施方案中,活动事件识 别器确定模块273确定包括子事件的物理位置的所有视图都是活跃参与的 视图,因此确定所有活跃参与的视图都应接收特定子事件序列。在其他实 施方案中,即使触摸子事件完全被局限到与一个特定视图相关联的区域, 但在分级结构中较高的视图将仍然保持为活跃参与的视图。
事件分配器模块274将事件信息分配到事件识别器(例如,事件识别 器280)。在包括活动事件识别器确定模块273的实施方案中,事件分配器 模块274将事件信息递送到由活动事件识别器确定模块273确定的事件识 别器。在一些实施方案中,事件分配器模块274在事件队列中存储事件信 息,该事件信息由相应事件接收器282进行检索。
在一些实施方案中,操作系统226包括事件分类器270。另选地,应 用程序236-1包括事件分类器270。在其他实施方案中,事件分类器270是 独立的模块,或者是被存储在存储器202中的另一个模块(诸如接触/运动 模块230)的一部分。
在一些实施方案中,应用236-1包括多个事件处理程序290和一个或 多个应用视图291,其中每个应用视图包括用于处理发生在应用的用户界面 的相应视图内的触摸事件的指令。应用程序236-1的每个应用程序视图291 包括一个或多个事件识别器280。通常,相应应用程序视图291包括多个事 件识别器280。在其他实施方案中,事件识别器280中的一个或多个事件识 别器是独立模块的一部分,该独立模块诸如用户界面工具包(未示出)或应用程序236-1从中继承方法和其他属性的更高水平的对象。在一些实施方 案中,相应事件处理程序290包括以下各项中的一者或多者:数据更新器 276、对象更新器277、GUI更新器278、和/或从事件分类器270接收的事 件数据279。事件处理程序290利用或调用数据更新器276、对象更新器 277或GUI更新器278来更新应用程序内部状态292。另选地,应用视图291中的一个或多个应用视图包括一个或多个相应事件处理程序290。另 外,在一些实施方案中,数据更新器276、对象更新器277和GUI更新器 278中的一者或多者被包括在相应应用程序视图291中。
相应的事件识别器280从事件分类器270接收事件信息(例如,事件 数据279),并且从事件信息识别事件。事件识别器280包括事件接收器 282和事件比较器284。在一些实施方案中,事件识别器280还包括元数据 283和事件传递指令288(其包括子事件传递指令)的至少一个子集。
事件接收器282接收来自事件分类器270的事件信息。事件信息包括 关于子事件例如触摸或触摸移动的信息。根据子事件,事件信息还包括附 加信息,诸如子事件的位置。当子事件涉及触摸的运动时,事件信息还包 括子事件的速率和方向。在一些实施方案中,事件包括设备从一个取向旋 转到另一取向(例如,从纵向取向到横向取向,或反之亦然)的旋转,并 且事件信息包括关于设备的当前取向(也被称为设备姿态)的对应信息。
事件比较器284将事件信息与预定义的事件或子事件定义进行比较, 并且基于该比较,确定事件或子事件,或者确定或更新事件或子事件的状 态。在一些实施方案中,事件比较器284包括事件定义286。事件定义286 包含事件的定义(例如,预定义的子事件序列),例如事件1(287-1)、 事件2(287-2)以及其他事件。在一些实施方案中,事件(287)中的子事件例如包括触摸开始、触摸结束、触摸移动、触摸取消和多点触摸。在一 个示例中,事件1(287-1)的定义是在被显示对象上的双击。例如,双击包 括被显示对象上的预先确定时长的第一次触摸(触摸开始)、预先确定时 长的第一次抬离(触摸结束)、被显示对象上的预先确定时长的第二次触 摸(触摸开始)、以及预先确定时长的第二次抬离(触摸结束)。在另一个示例中,事件2(287-2)的定义是被显示对象上的拖动。例如,拖动包括 所显示对象上的预先确定时长的触摸(或接触)、触摸在触敏显示器212 上的移动,以及触摸的抬离(触摸结束)。在一些实施方案中,事件还包 括用于一个或多个相关联的事件处理程序290的信息。
在一些实施方案中,事件定义287包括对用于相应用户界面对象的事 件的定义。在一些实施方案中,事件比较器284执行命中测试,以确定哪 个用户界面对象与子事件相关联。例如,在触敏显示器212上显示三个用 户界面对象的应用程序视图中,当在触敏显示器212上检测到触摸时,事 件比较器284执行命中测试以确定这三个用户界面对象中的哪一个用户界 面对象与该触摸(子事件)相关联。如果每个所显示的对象与相应的事件 处理程序290相关联,则事件比较器使用该命中测试的结果,以确定哪个 事件处理程序290应当被激活。例如,事件比较器284选择与子事件和触 发该命中测试的对象相关联的事件处理程序。
在一些实施方案中,相应事件(287)的定义还包括延迟动作,该延迟 动作延迟事件信息的递送,直到已确定子事件序列是否确实对应于或不对 应于事件识别器的事件类型。
当相应事件识别器280确定子事件序列不与事件定义286中的任何事 件匹配时,该相应事件识别器280进入事件不可能、事件失败或事件结束 状态,在此之后忽略基于触摸的手势的后续子事件。在这种情况下,对于 命中视图保持活动的其他事件识别器(如果有的话)继续跟踪并处理持续 进行的基于触摸的手势的子事件。
在一些实施方案中,相应事件识别器280包括具有指示事件递送系统 应该如何执行对活跃参与的事件识别器的子事件递送的可配置属性、标志 和/或列表的元数据283。在一些实施方案中,元数据283包括指示事件识 别器彼此如何交互或如何能够交互的可配置属性、标志和/或列表。在一些 实施方案中,元数据283包括指示子事件是否递送到视图或程序化分级结 构中的不同层级的可配置属性、标志和/或列表。
在一些实施方案中,当识别事件的一个或多个特定子事件时,相应事 件识别器280激活与事件相关联的事件处理程序290。在一些实施方案中, 相应事件识别器280将与事件相关联的事件信息递送到事件处理程序290。 激活事件处理程序290不同于将子事件发送(和延期发送)到相应命中视 图。在一些实施方案中,事件识别器280抛出与所识别的事件相关联的标 志,并且与该标志相关联的事件处理程序290获得该标志并执行预定义的过程。
在一些实施方案中,事件递送指令288包括递送关于子事件的事件信 息而无需激活事件处理程序的子事件递送指令。相反,子事件递送指令将 事件信息递送到与子事件系列相关联的事件处理程序或递送到活跃参与的 视图。与子事件系列或与活跃参与的视图相关联的事件处理程序接收事件 信息并执行预先确定的过程。
在一些实施方案中,数据更新器276创建并更新在应用程序236-1中 使用的数据。例如,数据更新器276对联系人模块237中所使用的电话号 码进行更新,或者对视频播放器模块中所使用的视频文件进行存储。在一 些实施方案中,对象更新器277创建和更新在应用程序236-1中使用的对 象。例如,对象更新器277创建新用户界面对象或更新用户界面对象的位 置。GUI更新器278更新GUI。例如,GUI更新器278准备显示信息并将 其发送至图形模块232以用于在触敏显示器上显示。
在一些实施方案中,一个或多个事件处理程序290包括数据更新器 276、对象更新器277和GUI更新器278或者具有对该数据更新器、该对象 更新器和该GUI更新器的访问权限。在一些实施方案中,数据更新器 276、对象更新器277和GUI更新器278被包括在相应应用程序236-1或应 用程序视图291的单个模块中。在其他实施方案中,它们被包括在两个或更多个软件模块中。
应当理解,关于触敏显示器上的用户触摸的事件处理的上述论述还适 用于利用输入设备来操作多功能设备200的其他形式的用户输入,并不是 所有用户输入都是在触摸屏上发起的。例如,任选地与单次或多次键盘按 下或按住协作的鼠标移动和鼠标按钮按下;触控板上的接触移动,诸如轻 击、拖拽、滚动等;触笔输入;设备的移动;口头指令;检测到的眼睛移 动;生物特征输入;和/或它们的任何组合任选地被用作对应于限定要识别 的事件的子事件的输入。
图3示出了根据一些实施方案的具有触摸屏212的便携式多功能设备 200。触摸屏任选地在用户界面(UI)300内显示一个或多个图形。在本实施 方案中以及在下文中描述的其他实施方案中,用户能够通过例如利用一个 或多个手指302(在附图中没有按比例绘制)或者利用一个或多个触笔303 (在附图中没有按比例绘制),在图形上作出手势来选择这些图形中的一 个或多个图形。在一些实施方案中,当用户中断与一个或多个图形的接触 时,将发生对一个或多个图形的选择。在一些实施方案中,手势任选地包 括一次或多次轻击、一次或多次轻扫(从左向右、从右向左、向上和/或向 下)和/或已与设备200发生接触的手指的滚动(从右向左、从左向右、向 上和/或向下)。在一些具体实施中或在一些情况下,不经意地与图形接触 不会选择图形。例如,当与选择对应的手势是轻击时,在应用程序图标上 方扫动的轻扫手势任选地不选择对应应用程序。
设备200还包括一个或多个物理按钮,诸如“主桌面”或菜单按钮 304。如前所述,菜单按钮304用于导航到在设备200上执行的一组应用程 序中的任何应用程序236。另选地,在一些实施方案中,菜单按钮被实现为 被显示在触摸屏212上的GUI中的软键。
在一个实施方案中,设备200包括触摸屏212、菜单按钮304、用于使 设备通电/断电和用于锁定设备的下压按钮306、一个或多个音量调节按钮 308、用户身份模块(SIM)卡槽310、耳麦插孔312、和对接/充电外部端口 224。下压按钮306任选地用于通过压下该按钮并且将该按钮保持在压下状 态预定义的时间间隔来对设备进行开/关机;通过压下该按钮并在该预定义 的时间间隔过去之前释放该按钮来锁定设备;和/或对设备进行解锁或发起解锁过程。在另选的实施方案中,设备200还通过麦克风213来接受用于 激活或去激活某些功能的口头输入。设备200还任选地包括用于检测触摸 屏212上的接触的强度的一个或多个接触强度传感器265,和/或用于为设 备200的用户生成触觉输出的一个或多个触觉输出发生器267。
图4是根据一些实施方案的具有显示器和触敏表面的示例性多功能设 备的框图。设备400不必是便携式的。在一些实施方案中,设备400是膝 上型计算机、台式计算机、平板电脑、多媒体播放器设备、导航设备、教 育设备(诸如儿童学习玩具)、游戏系统、或控制设备(例如,家用控制 器或工业用控制器)。设备400通常包括一个或多个处理单元(CPU)410、一个或多个网络或其他通信接口460、存储器470和用于使这些部件互连的 一个或多个通信总线420。通信总线420任选地包括使系统部件互连并且控 制系统部件之间的通信的电路(有时称为芯片组)。设备400包括具有显 示器440的输入/输出(I/O)接口430,该显示器通常是触摸屏显示器。I/O接 口430还任选地包括键盘和/或鼠标(或其他指向设备)450和触摸板455、 用于在设备400上生成触觉输出的触觉输出发生器457(例如,类似于以上 参考图2A所述的触觉输出发生器267)、传感器459(例如,光学传感 器、加速度传感器、接近传感器、触敏传感器和/或一个或多个接触强度传 感器(类似于以上参考图2A所述的接触强度传感器265))。存储器470 包括高速随机存取存储器,诸如DRAM、SRAM、DDR RAM或其它随机 存取固态存储器设备;并且任选地包括非易失性存储器,诸如一个或多个 磁盘存储设备、光盘存储设备、闪存设备、或其它非易失性固态存储设 备。存储器470任选地包括远离一个或多个CPU 410定位的一个或多个存 储设备。在一些实施方案中,存储器470存储与在便携式多功能设备200 (图2A)的存储器202中所存储的程序、模块和数据结构类似的程序、模 块和数据结构,或它们的子集。此外,存储器470任选地存储在便携式多 功能设备200的存储器202中不存在的附加程序、模块和数据结构。例 如,设备400的存储器470任选地存储绘图模块480、呈现模块482、文字 处理模块484、网站创建模块486、盘编辑模块488,和/或电子表格模块 490,而便携式多功能设备200(图2A)的存储器202任选地不存储这些模 块。
图4中的上述元件中的每一者在一些示例中存储在一个或多个先前提 到的存储器设备中。上述模块中的每个模块与用于执行上述功能的指令集 对应。上述模块或程序(例如,指令集)不必被实现为独立的软件程序、 过程或模块,因此这些模块的各种子集在各种实施方案中组合或以其他方 式重新布置。在一些实施方案中,存储器470存储上述模块和数据结构的 子集。此外,存储器470存储上文未描述的附加的模块和数据结构。
现在将注意力转到可在例如便携式多功能设备200上实现的用户界面 的实施方案。
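FIG. 5A illustrates an exemplary user interface for a menu of applications on portable multifunction device 200, in accordance with some embodiments. Similar user interfaces are implemented on device 400. In some embodiments, user interface 500 includes the following elements, or a subset or superset thereof:
·Signal strength indicator(s) 502 for wireless communication(s), such as cellular and Wi-Fi signals;
·Time 504;
·Bluetooth indicator 505;
·Battery status indicator 506;
·Tray 508 with icons for frequently used applications, such as:
○Icon 516 for telephone module 238, labeled "Phone," which optionally includes an indicator 514 of the number of missed calls or voicemail messages;
○Icon 518 for e-mail client module 240, labeled "Mail," which optionally includes an indicator 510 of the number of unread e-mails;
○Icon 520 for browser module 247, labeled "Browser;" and
○Icon 522 for video and music player module 252, also referred to as iPod (trademark of Apple Inc.) module 252, labeled "iPod;" and
·Icons for other applications, such as:
○Icon 524 for IM module 241, labeled "Messages;"
○Icon 526 for calendar module 248, labeled "Calendar;"
○Icon 528 for image management module 244, labeled "Photos;"
○Icon 530 for camera module 243, labeled "Camera;"
○Icon 532 for online video module 255, labeled "Online Video;"
○Icon 534 for stocks widget 249-2, labeled "Stocks;"
○Icon 536 for map module 254, labeled "Maps;"
○Icon 538 for weather widget 249-1, labeled "Weather;"
○Icon 540 for alarm clock widget 249-4, labeled "Clock;"
○Icon 542 for workout support module 242, labeled "Workout Support;"
○Icon 544 for notes module 253, labeled "Notes;" and
○Icon 546 for a settings application or module, labeled "Settings," which provides access to settings for device 200 and its various applications 236.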
需注意,图5A中示出的图标标签仅是示例性的。例如,视频和音乐 播放器模块252的图标522任选地被标记为“音乐”或“音乐播放器”。 对于各种应用程序图标任选地使用其他标签。在一些实施方案中,相应应 用程序图标的标签包括与该相应应用程序图标对应的应用程序的名称。在 一些实施方案中,特定应用图标的标签不同于与该特定应用图标对应的应 用的名称。
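FIG. 5B illustrates an exemplary user interface on a device (e.g., device 400, FIG. 4) with a touch-sensitive surface 551 (e.g., a tablet or touchpad 455, FIG. 4) that is separate from the display 550 (e.g., touch screen display 212). Device 400 also, optionally, includes one or more contact intensity sensors (e.g., one or more of sensors 459) for detecting the intensity of contacts on touch-sensitive surface 551, and/or one or more tactile output generators 457 for generating tactile outputs for a user of device 400.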
尽管将参考触摸屏显示器212(其中组合了触敏表面和显示器)上的 输入给出随后的示例中的一些示例,但是在一些实施方案中,设备检测与 显示器分开的触敏表面上的输入,如图5B所示。在一些实施方案中,触敏 表面(例如,图5B中的551)具有与显示器(例如,550)上的主轴(例 如,图5B中的553)对应的主轴(例如,图5B中的552)。根据这些实施方案,设备检测在与显示器上的相应位置对应的位置(例如,在图5B中, 560对应于568并且562对应于570)处的与触敏表面551的接触(例如, 图5B中的560和562)。这样,在触敏表面(例如,图5B中的551)与多 功能设备的显示器(图5B中的550)分开时,由设备在触敏表面上检测到 的用户输入(例如,接触560和562以及它们的移动)被该设备用于操纵 显示器上的用户界面。应当理解,类似的方法任选地用于本文所述的其他 用户界面。
另外,虽然主要是参考手指输入(例如,手指接触、单指轻击手势、 手指轻扫手势)来给出下面的示例,但是应当理解,在一些实施方案中, 这些手指输入中的一个或多个手指输入由来自另一输入设备的输入(例 如,基于鼠标的输入或触笔输入)替代。例如,轻扫手势任选地由鼠标点 击(例如,而不是接触),之后是光标沿着轻扫的路径的移动(例如,而不是接触的移动)来替代。又如,轻击手势任选地由在光标位于轻击手势 的位置上方时的鼠标点击(例如,而不是对接触的检测,以及之后的停止 检测接触)来代替。类似地,当同时检测到多个用户输入时,应当理解的 是,多个计算机鼠标任选地被同时使用,或鼠标和手指接触任选地被同时 使用。
图6A示出了示例性个人电子设备600。设备600包括主体602。在一 些实施方案中,设备600包括相对于设备200和400(例如,图2A至图 4)所述的特征中的一些或全部特征。在一些实施方案中,设备600具有在 下文中称为触摸屏604的触敏显示屏604。另选地,或作为触摸屏604的补 充,设备600具有显示器和触敏表面。与设备200和400的情况一样,在 一些实施方案中,触摸屏604(或触敏表面)具有用于检测正在施加的接触 (例如,触摸)的强度的一个或多个强度传感器。触摸屏604(或触敏表 面)的一个或多个强度传感器提供表示触摸的强度的输出数据。设备600 的用户界面基于触摸强度来对触摸作出响应,这意味着不同强度的触摸可 调用设备600上的不同的用户界面操作。
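Techniques for detecting and processing touch intensity are found in related applications: for example, International Patent Application Serial No. PCT/US2013/040061, titled "Device, Method, and Graphical User Interface for Displaying User Interface Objects Corresponding to an Application," filed May 8, 2013, and International Patent Application Serial No. PCT/US2013/069483, titled "Device, Method, and Graphical User Interface for Transitioning Between Touch Input to Display Output Relationships," filed November 11, 2013, each of which is hereby incorporated by reference in its entirety.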
在一些实施方案中,设备600具有一个或多个输入机构606和608。输 入机构606和608(如果包括的话)是物理形式的。物理输入机构的示例包 括下压按钮和可旋转机构。在一些实施方案中,设备600具有一个或多个 附接机构。此类附接机构(如果包括的话)可允许将设备600与例如帽 子、眼镜、耳环、项链、衬衣、夹克、手镯、表带、手链、裤子、腰带、 鞋子、钱包、背包等附接。这些附接机构允许设备600被用户穿戴。
图6B示出了示例性个人电子设备600。在一些实施方案中,设备600 包括相对于图2A、图2B和图4所述的部件中的一些或全部部件。设备600 具有总线612,该总线将I/O部分614与一个或多个计算机处理器616和存 储器618操作性地耦接。I/O部分614被连接到显示器604,该显示器可具 有触敏部件622并且任选地还具有触摸强度敏感部件624。此外,I/O部分 614与通信单元630连接,以用于使用Wi-Fi、蓝牙、近场通信(NFC)、蜂 窝和/或其他无线通信技术来接收应用程序和操作系统数据。设备600包括 输入机构606和/或608。例如,输入机构606是可旋转输入设备或者可按 压输入设备以及可旋转输入设备。在一些示例中,输入机构608是按钮。
在一些示例中,输入机构608是麦克风。个人电子设备600包括例如 各种传感器,诸如GPS传感器632、加速度计634、定向传感器640(例 如,罗盘)、陀螺仪636、运动传感器638和/或其组合,所有这些设备均 可操作性连接到I/O部分614。
个人电子设备600的存储器618包括用于存储计算机可执行指令的一 个或多个非暂态计算机可读存储介质,该指令当由一个或多个计算机处理 器616执行时例如使得计算机处理器执行上述技术和过程。该计算机可执 行指令也例如在任何非暂态计算机可读存储介质内进行存储和/或传送,以 供指令执行系统、装置或设备诸如基于计算机的系统、包含处理器的系统 或可从指令执行系统、装置或设备获取指令并执行指令的其他系统使用或 与其结合。个人电子设备600不限于图6B的部件和配置,而是可包括多种 配置的其他部件或附加部件。
如本文所用,术语“示能表示”是指在设备200、400和/或600(图 2、图4和图6)的显示屏上显示的用户交互式图形用户界面对象。例如, 图像(例如,图标)、按钮和文本(例如,超链接)各自构成示能表示。
如本文所用,术语“焦点选择器”是指用于指示用户正与之进行交互 的用户界面的当前部分的输入元素。在包括光标或其他位置标记的一些具 体实施中,光标充当“焦点选择器”,使得在光标在特定用户界面元素 (例如,按钮、窗口、滑块或其他用户界面元素)上方时在触敏表面(例 如,图4中的触控板455或图5B中的触敏表面551)上检测到输入(例如,按压输入)的情况下,该特定用户界面元素根据所检测到的输入而被 调节。在包括能够实现与触摸屏显示器上的用户界面元素的直接交互的触 摸屏显示器(例如,图2A中的触敏显示系统212或图5A中的触摸屏 212)的一些具体实施中,触摸屏上所检测到的接触充当“焦点选择器”, 使得当在触摸屏显示器上在特定用户界面元素(例如,按钮、窗口、滑块 或其他用户界面元素)的位置处检测到输入(例如,由接触进行的按压输 入)时,该特定用户界面元素根据所检测到的输入而被调节。在一些具体 实施中,焦点从用户界面的一个区域移动到用户界面的另一个区域,而无 需光标的对应移动或触摸屏显示器上的接触的移动(例如,通过使用制表 键或箭头键将焦点从一个按钮移动到另一个按钮);在这些具体实施中, 焦点选择器根据焦点在用户界面的不同区域之间的移动而移动。不考虑焦 点选择器所采取的具体形式,焦点选择器通常是由用户控制的以便递送与 用户界面的用户预期的交互(例如,通过向设备指示用户界面的用户期望 与其进行交互的元素)的用户界面元素(或触摸屏显示器上的接触)。例 如,在触敏表面(例如,触摸板或触摸屏)上检测到按压输入时,焦点选 择器(例如,光标、接触或选择框)在相应按钮上方的位置将指示用户意 图激活相应按钮(而不是设备的显示器上示出的其他用户界面元素)。
如说明书和权利要求书中所使用的,接触的“特征强度”这一术语是 指基于接触的一个或多个强度的接触的特征。在一些实施方案中,特征强 度基于多个强度样本。特征强度任选地基于相对于预定义事件(例如,在 检测到接触之后,在检测到接触抬离之前,在检测到接触开始移动之前或 之后,在检测到接触结束之前,在检测到接触强度增大之前或之后和/或在 检测到接触强度减小之前或之后)而言在预先确定的时间段(例如,0.05 秒、0.1秒、0.2秒、0.5秒、1秒、2秒、5秒、10秒)期间采集的预定义数 量的强度样本或一组强度样本。接触的特征强度任选地基于以下各项中的 一者或多者:接触强度的最大值、接触强度的均值、接触强度的平均值、 接触强度的前10%处的值、接触强度的半最大值、接触强度的90%最大值 等。在一些实施方案中,在确定特征强度时使用接触的持续时间(例如, 在特征强度是接触强度在时间上的平均值时)。在一些实施方案中,将特 征强度与一组一个或多个强度阈值进行比较,以确定用户是否已执行操 作。例如,该组一个或多个强度阈值包括第一强度阈值和第二强度阈值。 在该示例中,特征强度未超过第一阈值的接触导致第一操作,特征强度超 过第一强度阈值但未超过第二强度阈值的接触导致第二操作,并且特征强 度超过第二阈值的接触导致第三操作。在一些实施方案中,使用特征强度 与一个或多个阈值之间的比较来确定是否要执行一个或多个操作(例如, 是执行相应操作还是放弃执行相应操作),而不是用于确定执行第一操作 还是第二操作。
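The three-way threshold comparison described above can be sketched in a few lines of Python. This is only an illustrative sketch: the function names, the `mode` parameter, and the numeric threshold values are assumptions made for the example and do not appear in the specification.

```python
from statistics import mean


def characteristic_intensity(samples, mode="mean"):
    """Reduce a window of intensity samples to one characteristic value.

    Only two of the reductions listed in the text are shown (maximum, mean).
    """
    if not samples:
        raise ValueError("need at least one intensity sample")
    if mode == "max":
        return max(samples)
    if mode == "mean":
        return mean(samples)
    raise ValueError(f"unknown mode: {mode}")


def select_operation(intensity, first_threshold, second_threshold):
    """Map a characteristic intensity to the first, second, or third operation."""
    if intensity <= first_threshold:
        return "first"   # does not exceed the first threshold
    if intensity <= second_threshold:
        return "second"  # exceeds the first but not the second threshold
    return "third"       # exceeds the second threshold
```

For instance, with hypothetical thresholds 0.25 and 0.6, a contact whose mean intensity is about 0.2 maps to the first operation, while a contact whose peak intensity is 0.9 maps to the third.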
在一些实施方案中,识别手势的一部分,目的是确定特征强度。例 如,触敏表面接收连续的轻扫接触,该连续的轻扫接触从起始位置过渡并 到达结束位置,在该结束位置处,接触的强度增加。在该示例中,接触在 结束位置处的特征强度仅基于连续轻扫接触的一部分,而不是整个轻扫接 触(例如,仅轻扫接触在结束位置处的部分)。在一些实施方案中,在确 定接触的特征强度之前向轻扫手势的强度应用平滑化算法。例如,该平滑 化算法任选地包括以下各项中的一者或多者:不加权滑动平均平滑化算 法、三角平滑化算法、中值滤波器平滑化算法和/或指数平滑化算法。在一 些情况下,这些平滑化算法消除了轻扫接触强度中的窄的尖峰或倾斜,目 的是确定特征强度。
相对于一个或多个强度阈值诸如接触检测强度阈值、轻按压强度阈 值、深按压强度阈值和/或一个或多个其他强度阈值来表征触敏表面上的接 触的强度。在一些实施方案中,轻按压强度阈值对应于这样的强度:在该 强度下设备将执行通常与点击物理鼠标的按钮或触控板相关联的操作。在 一些实施方案中,深按压强度阈值对应于这样的强度:在该强度下设备将 执行与通常与点击物理鼠标或触控板的按钮相关联的操作不同的操作。在一些实施方案中,当检测到特征强度低于轻按压强度阈值(例如,并且高 于标称接触检测强度阈值,比该标称接触检测强度阈值低的接触不再被检 测到)的接触时,设备将根据接触在触敏表面上的移动来移动焦点选择 器,而不执行与轻按压强度阈值或深按压强度阈值相关联的操作。一般来 讲,除非另有陈述,否则这些强度阈值在不同组的用户界面附图之间是一 致的。
接触的特征强度从低于轻按压强度阈值的强度增大到介于轻按压强度 阈值与深按压强度阈值之间的强度有时被称为“轻按压”输入。接触特征 强度从低于深按压强度阈值的强度增大到高于深按压强度阈值的强度有时 被称为“深按压”输入。接触的特征强度从低于接触检测强度阈值的强度 增大到介于接触检测强度阈值与轻按压强度阈值之间的强度有时被称为检 测到触摸表面上的接触。接触的特征强度从高于接触检测强度阈值的强度 减小到低于接触检测强度阈值的强度有时被称为检测到接触从触摸表面抬 离。在一些实施方案中,接触检测强度阈值为零。在一些实施方案中,接 触检测强度阈值大于零。
在本文中所述的一些实施方案中,响应于检测到包括相应按压输入的 手势或响应于检测到利用相应接触(或多个接触)执行的相应按压输入来 执行一个或多个操作,其中至少部分地基于检测到该接触(或多个接触) 的强度增大到高于按压输入强度阈值而检测到相应按压输入。在一些实施 方案中,响应于检测到相应接触强度增大到高于按压输入强度阈值(例 如,相应按压输入的“向下冲程”)而执行相应操作。在一些实施方案 中,按压输入包括相应接触强度增大到高于按压输入强度阈值以及该接触 强度随后减小到低于按压输入强度阈值,并且响应于检测到相应接触强度 随后减小到低于按压输入阈值(例如,相应按压输入的“向上冲程”)而 执行相应操作。
在一些实施方案中,设备采用强度滞后以避免有时被称为“抖动”的 意外输入,其中设备限定或选择与按压输入强度阈值具有预定义关系的滞 后强度阈值(例如,滞后强度阈值比按压输入强度阈值低X个强度单位, 或滞后强度阈值是按压输入强度阈值的75%、90%或某个合理比例)。因 此,在一些实施方案中,按压输入包括相应接触强度增大到高于按压输入 强度阈值以及该接触强度随后减小到低于与按压输入强度阈值对应的滞后 强度阈值,并且响应于检测到相应接触强度随后减小到低于滞后强度阈值(例如,相应按压输入的“向上冲程”)而执行相应操作。类似地,在一 些实施方案中,仅在设备检测到接触强度从等于或低于滞后强度阈值的强 度增大到等于或高于按压输入强度阈值的强度并且任选地接触强度随后减 小到等于或低于滞后强度的强度时才检测到按压输入,并且响应于检测到 按压输入(例如,根据环境,接触强度增大或接触强度减小)而执行相应 操作。
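The effect of intensity hysteresis can be illustrated with a small Python sketch. It assumes the variant in which the hysteresis threshold is a fixed proportion of the press-input threshold; the function name, the default ratio of 0.75, and the sample values are illustrative assumptions only.

```python
def detect_presses(intensities, press_threshold, hysteresis_ratio=0.75):
    """Return (down_index, up_index) pairs, one per detected press input.

    Down-stroke: intensity rises above press_threshold.
    Up-stroke: intensity later falls below the hysteresis threshold,
    here assumed to be press_threshold * hysteresis_ratio.
    """
    hysteresis_threshold = press_threshold * hysteresis_ratio
    presses = []
    down_index = None  # index of the pending down-stroke, if any
    for i, value in enumerate(intensities):
        if down_index is None:
            if value > press_threshold:
                down_index = i
        elif value < hysteresis_threshold:
            presses.append((down_index, i))
            down_index = None
    return presses
```

With samples `[0.0, 0.5, 1.1, 0.95, 1.05, 0.9, 0.7, 0.2]` and a press threshold of 1.0, the default ratio reports a single press, whereas a ratio of 1.0 (no hysteresis) reports two presses because the intensity wobbles around the threshold. That wobble is the "jitter" the hysteresis threshold is meant to suppress.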
为了容易解释,任选地,响应于检测到以下各种情况中的任一种情况 而触发对响应于与按压输入强度阈值相关联的按压输入或响应于包括按压 输入的手势而执行的操作的描述:接触强度增大到高于按压输入强度阈 值、接触强度从低于滞后强度阈值的强度增大到高于按压输入强度阈值的 强度、接触强度减小到低于按压输入强度阈值、和/或接触强度减小到低于 与按压输入强度阈值对应的滞后强度阈值。另外,在将操作描述为响应于检测到接触强度减小到低于按压输入强度阈值而执行的示例中,任选地响 应于检测到接触强度减小到低于对应于并且小于按压输入强度阈值的滞后 强度阈值来执行操作。
3.数字助理系统
图7A示出根据各种示例的数字助理系统700的框图。在一些示例中, 数字助理系统700在独立式计算机系统上实现。在一些示例中,数字助理 系统700跨多个计算机分布。在一些示例中,数字助理的模块和功能中的 一些被划分成服务器部分和客户端部分,其中客户端部分位于一个或多个 用户设备(例如,设备104、设备122、设备200、设备400或设备600) 上并通过一个或多个网络与服务器部分(例如,服务器系统108)通信,例 如,如图1中所示。在一些示例中,数字助理系统700是图1中所示的服 务器系统108(和/或DA服务器106)的具体实施。应当指出,数字助理系 统700仅为数字助理系统的一个示例,且该数字助理系统700具有比所示 更多或更少的部件、组合两个或更多个部件,或者可具有部件的不同配置 或布局。图7A中所示的各种部件在硬件、用于由一个或多个处理器执行的 软件指令、固件(包括一个或多个信号处理集成电路和/或专用集成电 路),或其组合中实现。
数字助理系统700包括存储器702、输入/输出(I/O)接口706、网络通 信接口708,以及一个或多个处理器704。这些部件可通过一个或多个通信 总线或信号线710彼此进行通信。
在一些示例中,存储器702包括非暂态计算机可读介质,诸如高速随 机存取存储器和/或非易失性计算机可读存储介质(例如,一个或多个磁盘 存储设备、闪存存储器设备或其他非易失性固态存储器设备)。
在一些示例中,I/O接口706将数字助理系统700的输入/输出设备716 诸如显示器、键盘、触摸屏和麦克风耦接至用户界面模块722。I/O接口 706,与用户界面模块722结合,接收用户输入(例如,言语输入、键盘输 入、触摸输入等)并相应地对这些输入进行处理。在一些示例中,例如, 当数字助理在独立式用户设备上实现时,数字助理系统700包括相对于图 2A、图4、图6A至图6B中各自的设备200、设备400或设备600所描述 的部件和I/O通信接口中的任一者。在一些示例中,数字助理系统700代表 数字助理具体实施的服务器部分,并且可通过位于用户设备(例如,设备 104、设备200、设备400或设备600)上的客户端侧部分与用户进行交 互。
在一些示例中,网络通信接口708包括一个或多个有线通信端口 712,以及/或者无线传输和接收电路714。一个或多个有线通信端口经由一 个或多个有线接口例如以太网、通用串行总线(USB)、火线等接收和发送通 信信号。无线电路714从通信网络及其他通信设备接收RF信号和/或光学 信号以及将RF信号和/或光学信号发送至通信网络及其他通信设备。无线 通信使用多种通信标准、协议和技术中的任一种,诸如GSM、EDGE、 CDMA、TDMA、蓝牙、Wi-Fi、VoIP、Wi-MAX、或任何其他合适的通信 协议。网络通信接口708使数字助理系统700通过网络,诸如互联网、内 联网和/或无线网络诸如蜂窝电话网络、无线局域网(LAN)和/或城域网 (MAN),与其他设备之间的通信成为可能。
在一些示例中,存储器702或存储器702的计算机可读存储介质存储 程序、模块、指令和数据结构,包括以下内容中的全部或其子集:操作系 统718、通信模块720、用户界面模块722、一个或多个应用程序724和数 字助理模块726。具体地讲,存储器702或存储器702的计算机可读存储介 质存储用于执行上述过程的指令。一个或多个处理器704执行这些程序、 模块和指令,并从数据结构读取数据或将数据写到数据结构。
操作系统718(例如,Darwin、RTXC、LINUX、UNIX、iOS、OS X、WINDOWS、或嵌入式操作系统诸如VxWorks)包括用于控制和管理 一般系统任务(例如,存储器管理、存储设备控制、电源管理等)的各种 软件组件和/或驱动器,并且有利于各种硬件、固件和软件组件之间的通 信。
通信模块720促成数字助理系统700与其他设备之间通过网络通信接 口708进行的通信。例如,通信模块720与电子设备诸如分别在图2A、图 4、图6A至图6B中所示的设备200、400或600的RF电路208通信。通 信模块720还包括各种部件,用于处理由无线电路714和/或有线通信端口 712所接收的数据。
用户界面模块722经由I/O接口706接收来自用户(例如,来自键 盘、触摸屏、指向设备、控制器和/或麦克风)的命令和/或输入,并在显示 器上生成用户界面对象。用户界面模块722还准备输出(例如,言语、声 音、动画、文本、图标、振动、触觉反馈、光照等)并将其经由I/O接口 706(例如,通过显示器、音频通道、扬声器、触控板等)传送给用户。
应用程序724包括被配置为由所述一个或多个处理器704执行的程序 和/或模块。例如,如果数字助理系统在独立式用户设备上实施,则应用程 序724包括用户应用程序,诸如游戏、日历应用程序、导航应用程序或邮 件应用程序。如果数字助理系统700在服务器上实现,则应用程序724包 括例如资源管理应用程序、诊断应用程序、或调度应用程序。
存储器702还存储数字助理模块726(或数字助理的服务器部分)。 在一些示例中,数字助理模块726包括以下子模块或者其子集或超集:输 入/输出处理模块728、言语到文本(STT)处理模块730、自然语言处理模块 732、对话流处理模块734、任务流处理模块736、服务处理模块738和言 语合成模块740。这些模块中的每一者均具有对以下数字助理模块726的系 统或数据与模型中的一者或多者或者其子集或超集的访问权限:知识本体 760、词汇索引744、用户数据748、任务流模型754、服务模型756和 ASR系统758。
在一些示例中,使用数字助理模块726中实现的处理模块、数据和模 型,数字助理可执行下述各项的至少一部分:将言语输入转换为文本;识 别在从用户接收的自然语言输入中表达的用户意图;主动引出并获得充分 推断用户意图所需的信息(例如,通过消除词、名称、意图的歧义等); 确定用于满足推断出的意图的任务流;以及执行该任务流以满足推断出的 意图。
在一些示例中,如图7B中所示,I/O处理模块728可通过图7A中的 I/O设备716与用户交互或通过图7A中的网络通信接口708与用户设备 (例如,设备104、设备200、设备400或设备600)交互,以获得用户输 入(例如,言语输入)并提供对用户输入的响应(例如,作为言语输 出)。I/O处理模块728随同接收到用户输入一起或在接收到用户输入之后 不久任选地获得与来自用户设备的用户输入相关联的上下文信息。上下文 信息包括特定于用户的数据、词汇,和/或与用户输入相关的偏好。在一些 示例中,该上下文信息还包括在接收到用户请求时的用户设备的软件状态 和硬件状态,和/或与在接收到用户请求时的用户的周围环境相关的信息。 在一些示例中,I/O处理模块728还向用户发送与用户请求有关的跟进问 题,并从用户接收回答。在用户请求被I/O处理模块728接收且用户请求包 括言语输入时,I/O处理模块728将言语输入转发至STT处理模块730(或 言语识别器)以进行言语到文本转换。
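STT processing module 730 includes one or more ASR systems 758. The one or more ASR systems 758 can process the speech input received through I/O processing module 728 to produce a recognition result. Each ASR system 758 can include a front-end speech pre-processor. The front-end speech pre-processor extracts representative features from the speech input. For example, the front-end speech pre-processor performs a Fourier transform on the speech input to extract spectral features that characterize the speech input as a sequence of representative multi-dimensional vectors. Further, each ASR system 758 includes one or more speech recognition models (e.g., acoustic models and/or language models) and implements one or more speech recognition engines. Examples of speech recognition models include hidden Markov models, Gaussian mixture models, deep neural network models, n-gram language models, and other statistical models. Examples of speech recognition engines include dynamic time warping based engines and weighted finite-state transducer (WFST) based engines. The one or more speech recognition models and the one or more speech recognition engines are used to process the extracted representative features of the front-end speech pre-processor to produce intermediate recognition results (e.g., phonemes, phonemic strings, and sub-words), and ultimately, text recognition results (e.g., words, word strings, or sequences of tokens). In some examples, the speech input is processed at least partially by a third-party service or on the user's device (e.g., device 104, device 200, device 400, or device 600) to produce the recognition result. Once STT processing module 730 produces a recognition result containing a text string (e.g., a word, a sequence of words, or a sequence of tokens), the recognition result is passed to natural language processing module 732 for intent inference. In some examples, STT processing module 730 produces multiple candidate text representations of the speech input. Each candidate text representation is a sequence of words or tokens corresponding to the speech input. In some examples, each candidate text representation is associated with a speech recognition confidence score. Based on the speech recognition confidence scores, STT processing module 730 ranks the candidate text representations and provides the n-best (e.g., the n highest-ranked) candidate text representations to natural language processing module 732 for intent inference, where n is a predetermined integer greater than zero. In one example, only the highest-ranked (n=1) candidate text representation is passed to natural language processing module 732 for intent inference. In another example, the five highest-ranked (n=5) candidate text representations are passed to natural language processing module 732 for intent inference.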
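The n-best selection step can be sketched as follows. This is a minimal illustration that assumes each candidate is a `(text, confidence)` pair; the function name and the sample candidates are hypothetical and not part of the specification.

```python
def n_best(candidates, n=1):
    """Return the n candidate text representations with the highest confidence.

    candidates: list of (text, confidence_score) pairs.
    n: predetermined integer greater than zero.
    """
    if n < 1:
        raise ValueError("n must be an integer greater than zero")
    ranked = sorted(candidates, key=lambda pair: pair[1], reverse=True)
    return ranked[:n]
```

Calling `n_best(candidates, 1)` hands only the top hypothesis to intent inference, while a larger `n` keeps lower-ranked hypotheses available in case the top one turns out to be a misrecognition.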
有关言语转文本处理的更多细节在提交于2011年9月20日的标题为“Consolidating Speech Recognition Results”的美国实用新型专利申请序列 号13/236,942中有所描述,其全部公开内容以引用方式并入本文。
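In some examples, STT processing module 730 includes and/or accesses, via phonetic alphabet conversion module 731, a vocabulary of recognizable words. Each vocabulary word is associated with one or more candidate pronunciations of the word represented in a speech recognition phonetic alphabet. In particular, the vocabulary of recognizable words includes a word that is associated with a plurality of candidate pronunciations. For example, the vocabulary includes the word "tomato," which is associated with the candidate pronunciations […]. Further, vocabulary words are associated with custom candidate pronunciations that are based on previous speech inputs from the user. Such custom candidate pronunciations are stored in STT processing module 730 and are associated with a particular user via the user's profile on the device. In some examples, the candidate pronunciations for a word are determined based on the spelling of the word and one or more linguistic and/or phonetic rules. In some examples, the candidate pronunciations are manually generated, e.g., based on known canonical pronunciations.
In some examples, the candidate pronunciations are ranked based on how common each candidate pronunciation is. For example, the candidate pronunciation […] is ranked higher than […], because the former is the more commonly used pronunciation (e.g., among all users, for users in a particular geographical region, or for any other appropriate subset of users). In some examples, candidate pronunciations are ranked based on whether the candidate pronunciation is a custom candidate pronunciation associated with the user. For example, custom candidate pronunciations are ranked higher than canonical candidate pronunciations. This can be useful for recognizing proper nouns having a unique pronunciation that deviates from the canonical pronunciation. In some examples, candidate pronunciations are associated with one or more speech characteristics, such as geographic origin, nationality, or ethnicity. For example, the candidate pronunciation […] is associated with the United States, whereas the candidate pronunciation […] is associated with Great Britain. Further, the rank of a candidate pronunciation is based on one or more characteristics (e.g., geographic origin, nationality, ethnicity, etc.) of the user stored in the user's profile on the device. For example, it can be determined from the user's profile that the user is associated with the United States. Based on the user being associated with the United States, the candidate pronunciation […] (associated with the United States) can be ranked higher than the candidate pronunciation […] (associated with Great Britain). In some examples, one of the ranked candidate pronunciations can be selected as a predicted pronunciation (e.g., the most likely pronunciation).
When a speech input is received, STT processing module 730 is used to determine the phonemes corresponding to the speech input (e.g., using an acoustic model), and then attempts to determine words that match the phonemes (e.g., using a language model). For example, if STT processing module 730 first identifies the sequence of phonemes […] corresponding to a portion of the speech input, it can then determine, based on vocabulary index 744, that this sequence corresponds to the word "tomato."
In some examples, STT processing module 730 uses approximate matching techniques to determine words in an utterance. Thus, for example, STT processing module 730 determines that the sequence of phonemes […] corresponds to the word "tomato," even if that particular sequence of phonemes is not one of the candidate sequences of phonemes for that word.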
数字助理的自然语言处理模块732(“自然语言处理器”)获取由 STT处理模块730生成的n个最佳候选文字表示(“字词序列”或“符号 序列”),并尝试将每个候选文本表示与由数字助理所识别的一个或多个 “可执行意图”相关联。“可执行意图”(或“用户意图”)表示可由数 字助理执行并且可具有在任务流模型754中实现的相关联的任务流的任 务。相关联任务流是数字助理为了执行任务而采取的一系列经编程的动作 和步骤。数字助理的能力范围取决于已在任务流模型754中实现并存储的 任务流的数量和种类,或换言之,取决于数字助理所识别的“可执行意 图”的数量和种类。然而,数字助理的有效性还取决于助理从以自然语言 表达的用户请求中推断出正确的“一个或多个可执行意图”的能力。
在一些示例中,除从STT处理模块730获得的字词或符号的序列之 外,自然语言处理模块732还(例如,从I/O处理模块728)接收与用户请 求相关联的上下文信息。自然语言处理模块732任选地使用上下文信息来 明确、补充和/或进一步限定在从STT处理模块730接收的候选文本表示中 包含的信息。上下文信息包括例如用户偏好,用户设备的硬件和/或软件状 态,在用户请求之前、期间或之后不久收集的传感器信息,数字助理与用 户之间的先前交互(例如,对话),等等。如本文所述,在一些示例中, 上下文信息是动态的,并且随对话的时间、位置、内容、以及其他因素而 变化。
在一些示例中,自然语言处理基于例如知识本体760。知识本体760 为包含许多节点的分级结构,每个节点表示“可执行意图”或与“可执行 意图”或其他“属性”中的一者或多者相关的“属性”。如上所述,“可 执行意图”表示数字助理能够执行的任务,即,该任务为“可执行的”或 可被进行的。“属性”代表与可执行意图或另一属性的子方面相关联的参数。知识本体760中可执行意图节点与属性节点之间的连接定义由属性节 点表示的参数如何从属于由可执行意图节点表示的任务。
在一些示例中,知识本体760由可执行意图节点和属性节点组成。在 知识本体760内,每个可执行意图节点直接连接至或通过一个或多个中间 属性节点连接至一个或多个属性节点。类似地,每个属性节点直接连接至 或通过一个或多个中间属性节点连接至一个或多个可执行意图节点。例 如,如图7C所示,知识本体760包括“餐厅预订”节点(即,可执行意图 节点)。属性节点“餐厅”、“日期/时间”(针对预订)和“同行人数”均 直接连接至可执行意图节点(即,“餐厅预订”节点)。
此外,属性节点“菜系”、“价格区间”、“电话号码”和“位置” 是属性节点“餐厅”的子节点,并且均通过中间属性节点“餐厅”连接至 “餐厅预订”节点(即,可执行意图节点)。又如,如图7C所示,知识本体 760还包括“设定提醒”节点(即,另一个可执行意图节点)。属性节点“日 期/时间”(针对设定提醒)和“主题”(针对提醒)均连接至“设定提 醒”节点。由于属性“日期/时间”与进行餐厅预订的任务和设定提醒的任 务二者相关,因此属性节点“日期/时间”连接至知识本体760中的“餐厅 预订”节点和“设定提醒”节点二者。
可执行意图节点连同其连接的概念节点一起,被描述为“域”。在本 讨论中,每个域与相应的可执行意图相关联,并涉及与特定可执行意图相 关联的一组节点(以及这些节点之间的关系)。例如,图7C中示出的知识 本体760包括在知识本体760内的餐厅预订域762的示例以及提醒域764的 示例。餐厅预订域包括可执行意图节点“餐厅预订”、属性节点“餐厅”、“日期/时间”和“同行人数”以及子属性节点“菜系”、“价格范 围”、“电话号码”和“位置”。提醒域764包括可执行意图节点“设定 提醒”和属性节点“主题”和“日期/时间”。在一些示例中,知识本体 760由多个域组成。每个域与一个或多个其他域共享一个或多个属性节点。 例如,除了餐厅预订域762和提醒域764之外,“日期/时间”属性节点还 与许多不同域(例如,行程安排域、旅行预订域、电影票域等)相关联。
尽管图7C示出知识本体760内的两个示例性域,但其他域包括例如 “查找电影”、“发起电话呼叫”、“查找方向”、“安排会议”、“发 送消息”以及“提供问题的回答”、“阅读列表”、“提供导航指令”、 “提供针对任务的指令”等。“发送消息”域与“发送消息”可执行意图节点相关联,并且进一步包括属性节点诸如“一个或多个接收人”、“消 息类型”和“消息正文”。属性节点“接收人”进一步例如由子属性节点 诸如“接收人姓名”和“消息地址”来限定。
在一些示例中,知识本体760包括数字助理能够理解并对其起作用的 所有域(以及因而可执行意图)。在一些示例中,知识本体760诸如通过 添加或移除整个域或节点,或者通过修改知识本体760内的节点之间的关 系进行修改。
在一些示例中,将与多个相关可执行意图相关联的节点群集在知识本 体760中的“超级域”下。例如,“旅行”超级域包括与旅行相关的属性 节点和可执行意图节点的群集。与旅行相关的可执行意图节点包括“机票 预订”、“酒店预订”、“汽车租赁”、“路线规划”、“寻找感兴趣的 点”,等等。同一超级域(例如,“旅行”超级域)下的可执行意图节点 具有多个共用的属性节点。例如,针对“机票预订”、“酒店预订”、 “汽车租赁”、“路线规划”和“寻找感兴趣的点”的可执行意图节点共 享属性节点“起始位置”、“目的地”、“出发日期/时间”、“到达日期/ 时间”和“同行人数”中的一者或多者。
在一些示例中,知识本体760中的每个节点与跟由节点代表的属性或 可执行意图有关的一组字词和/或短语相关联。与每个节点相关联的相应组 的字词和/或短语是所谓的与节点相关联的“词汇”。将与每个节点相关联 的相应组的字词和/或短语存储在与由节点所代表的属性或可执行意图相关 联的词汇索引744中。例如,返回图7B,与“餐厅”属性的节点相关联的 词汇包括字词诸如“美食”、“酒水”、“菜系”、“饥饿”、“吃”、 “披萨”、“快餐”、“膳食”等。又如,与“发起电话呼叫”可执行意 图的节点相关联的词汇包括字词和短语诸如“呼叫”、“打电话”、“拨 打”、“与……通电话”、“呼叫该号码”、“打电话给”等。词汇索引 744任选地包括不同语言的字词和短语。
自然语言处理模块732接收来自STT处理模块730的候选文本表示(例 如,一个或多个文本串或一个或多个符号序列),并针对每个候选表示,确 定候选文本表示中的字词涉及到哪些节点。在一些示例中,如果发现候选 文本表示中的字词或短语(经由词汇索引744)与知识本体760中的一个或 多个节点相关联,则所述字词或短语“触发”或“激活”这些节点。基于 已激活节点的数量和/或相对重要性,自然语言处理模块732选择可执行意 图中的一个可执行意图作为用户意图使数字助理执行的任务。在一些示例 中,选择具有最多“已触发”节点的域。在一些示例中,选择具有最高置 信度(例如,基于其各个已触发节点的相对重要性)的域。在一些示例 中,基于已触发节点的数量和重要性的组合来选择域。在一些示例中,在 选择节点的过程中还考虑附加因素,诸如数字助理先前是否已正确解译来自用户的类似请求。
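The node-triggering and domain-selection logic can be sketched roughly as below. The vocabulary index contents, the per-node weights, and all function names are hypothetical stand-ins chosen for illustration; the specification does not prescribe this data layout.

```python
# Hypothetical vocabulary index: word -> ontology nodes it triggers.
VOCABULARY_INDEX = {
    "reserve": ["restaurant reservation"],
    "table": ["restaurant reservation"],
    "sushi": ["cuisine"],
    "7pm": ["date/time"],
    "remind": ["set reminder"],
}

# Hypothetical domains: actionable intent -> nodes that belong to its domain.
DOMAINS = {
    "restaurant reservation": {
        "restaurant reservation", "restaurant", "date/time", "party size",
        "cuisine", "price range", "phone number", "location",
    },
    "set reminder": {"set reminder", "subject", "date/time"},
}


def triggered_nodes(words):
    """Collect every ontology node triggered by the words of a candidate text."""
    nodes = set()
    for word in words:
        nodes.update(VOCABULARY_INDEX.get(word.lower(), []))
    return nodes


def select_domain(words, node_weights=None):
    """Pick the domain with the highest weighted count of triggered nodes.

    node_weights optionally models the relative importance of nodes;
    every node defaults to weight 1.0, which reduces to a plain count.
    """
    weights = node_weights or {}
    nodes = triggered_nodes(words)
    if not nodes:
        return None
    scores = {
        domain: sum(weights.get(node, 1.0) for node in nodes & members)
        for domain, members in DOMAINS.items()
    }
    return max(scores, key=scores.get)
```

Note that the shared "date/time" node contributes to both domains, so a word such as "7pm" alone cannot disambiguate; the remaining triggered nodes decide the outcome.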
用户数据748包括特定于用户的信息,诸如特定于用户的词汇、用户 偏好、用户地址、用户的默认语言和第二语言、用户的联系人列表,以及 每位用户的其他短期或长期信息。在一些示例中,自然语言处理模块732 使用特定于用户的信息来补充用户输入中所包含的信息以进一步限定用户 意图。例如,针对用户请求“邀请我的朋友参加我的生日派对”,自然语 言处理模块732能够访问用户数据748以确定“朋友”是哪些人以及“生 日派对”将于何时何地举行,而不需要用户在其请求中明确地提供此类信 息。
应认识到,在一些示例中,利用一个或多个机器学习机构(例如,神经 网络)来实现自然语言处理模块732。具体地,一个或多个机器学习机构被 配置为接收候选文本表示和与候选文本表示相关联的上下文信息。基于候 选文本表示和相关联的上下文信息,一个或多个机器学习机构被配置为基 于一组候选可执行意图确定意图置信度得分。自然语言处理模块732可基 于所确定的意图置信度得分从一组候选可执行意图中选择一个或多个候选 可执行意图。在一些示例中,还利用知识本体(例如,知识本体760)从一组 候选可执行意图中选择一个或多个候选可执行意图。
基于符号串搜索知识本体的其他细节在于2008年12月22日提交的标 题为“Method and Apparatus for Searching Using An Active Ontology”的美 国实用新型专利申请序列号12/341,743中有所描述,其全部公开内容以引 用方式并入本文。
在一些示例中,一旦自然语言处理模块732基于用户请求识别出可执 行意图(或域),自然语言处理模块732便生成结构化查询以表示所识别 的可执行意图。在一些示例中,结构化查询包括针对可执行意图的域内的 一个或多个节点的参数,并且所述参数中的至少一些参数填充有用户请求 中指定的特定信息和要求。例如,用户说“帮我在寿司店预订晚上7点的 座位。”在这种情况下,自然语言处理模块732能够基于用户输入将可执 行意图正确地识别为“餐厅预订”。根据知识本体,“餐厅预订”域的结 构化查询包括参数诸如{菜系}、{时间}、{日期}、{同行人数}等。在一些 示例中,基于言语输入和使用STT处理模块730从言语输入得出的文本, 自然语言处理模块732针对餐厅预订域生成部分结构化查询,其中部分结 构化查询包括参数{菜系=“寿司类”}以及{时间=“晚上7点”}。然而, 在该示例中,用户话语包含不足以完成与域相关联的结构化查询的信息。 因此,基于当前可用信息,在结构化查询中未指定其他必要参数,诸如{同 行人数}和{日期}。在一些示例中,自然语言处理模块732用所接收的上下 文信息来填充结构化查询的一些参数。例如,在一些示例中,如果请求 “附近的”寿司店,自然语言处理模块732用来自用户设备的GPS坐标来 填充结构化查询中的{位置}参数。
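A partial structured query and the detection of its missing parameters can be sketched as follows. The dictionary layout, the `REQUIRED_PARAMETERS` table, and the function names are assumptions made for illustration only.

```python
# Hypothetical table of parameters each domain needs before it can be executed.
REQUIRED_PARAMETERS = {
    "restaurant reservation": ["cuisine", "time", "date", "party size"],
}


def build_partial_query(intent, extracted, context=None):
    """Build a structured query from parameters found in the utterance.

    Contextual information (e.g., GPS coordinates for "near me") only fills
    parameters that the utterance itself did not specify.
    """
    query = {"intent": intent, "params": dict(extracted)}
    for name, value in (context or {}).items():
        query["params"].setdefault(name, value)
    return query


def missing_parameters(query):
    """List required parameters that are still unspecified."""
    required = REQUIRED_PARAMETERS[query["intent"]]
    return [name for name in required if name not in query["params"]]
```

For the sushi example in the text, the query built from `{"cuisine": "sushi", "time": "7pm"}` still lacks `date` and `party size`, which is the information a follow-up dialogue would have to request.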
在一些示例中,自然语言处理模块732识别针对从STT处理模块730 所接收的每个候选文本表示的多个候选可执行意图。另外,在一些示例 中,针对每个所识别的候选可执行意图生成相应的结构化查询(部分地或全 部地)。自然语言处理模块732确定针对每个候选可执行意图的意图置信 度得分,并基于意图置信度得分对候选可执行意图进行排序。在一些示例 中,自然语言处理模块732将所生成的一个或多个结构化查询(包括任何 已完成的参数)传送至任务流处理模块736(“任务流处理器”)。在一些 示例中,针对m个最佳(例如,m个排名最高的)候选可执行意图的一个 或多个结构化查询被提供给任务流处理模块736,其中m为预先确定的大 于零的整数。在一些示例中,将针对m个最佳候选可执行意图的一个或多 个结构化查询连同对应的候选文本表示提供给任务流处理模块736。
基于根据言语输入的多个候选文本表示所确定的多个候选可执行意图 推断用户意图的其他细节在提交于2014年6月6日的“System and Method for Inferring UserIntent From Speech Inputs”的美国实用新型申请号 14/298,725中有所描述,其全部公开内容以引用方式并入本文。
任务流处理模块736被配置为接收来自自然语言处理模块732的一个 或多个结构化查询,(必要时)完成结构化查询,以及执行“完成”用户 最终请求所需的动作。在一些示例中,完成这些任务所必需的各种过程在 任务流模型754中提供。在一些示例中,任务流模型754包括用于获得来 自用户的附加信息的过程,以及用于执行与可执行意图相关联的动作的任 务流。
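As described above, in order to complete a structured query, task flow processing module 736 needs to initiate additional dialogue with the user to obtain additional information and/or disambiguate potentially ambiguous utterances. When such interactions are necessary, task flow processing module 736 invokes dialogue flow processing module 734 to engage in a dialogue with the user. In some examples, dialogue flow processing module 734 determines how (and/or when) to ask the user for the additional information, and receives and processes the user responses. The questions are provided to, and answers are received from, the user through I/O processing module 728. In some examples, dialogue flow processing module 734 presents dialogue output to the user via audio and/or visual output, and receives input from the user via spoken or physical (e.g., clicking) responses. Continuing with the example above, when task flow processing module 736 invokes dialogue flow processing module 734 to determine the "party size" and "date" information for the structured query associated with the domain "restaurant reservation," dialogue flow processing module 734 generates questions such as "For how many people?" and "On which day?" to pass to the user. Once answers are received from the user, dialogue flow processing module 734 populates the structured query with the missing information, or passes the information to task flow processing module 736 to complete the missing information in the structured query.
Once task flow processing module 736 has completed the structured query for an actionable intent, task flow processing module 736 proceeds to perform the ultimate task associated with the actionable intent. Accordingly, task flow processing module 736 executes the steps and instructions in the task flow model according to the specific parameters contained in the structured query. For example, the task flow model for the actionable intent "restaurant reservation" includes steps and instructions for contacting a restaurant and actually requesting a reservation for a particular party size at a particular time. For example, using a structured query such as {restaurant reservation, restaurant=ABC Café, date=3/12/2012, time=7pm, party size=5}, task flow processing module 736 can perform the steps of: (1) logging onto a server of ABC Café or a restaurant reservation system such as […], (2) entering the date, time, and party size information in a form on the website, (3) submitting the form, and (4) making a calendar entry for the reservation in the user's calendar.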
在一些示例中,任务流处理模块736在服务处理模块738(“服务处 理模块”)的辅助下完成用户输入中所请求的任务或者提供用户输入中所 请求的信息性回答。例如,服务处理模块738代表任务流处理模块736发 起电话呼叫、设定日历条目、调用地图搜索、调用用户设备上安装的其他 用户应用程序或与所述其他应用程序进行交互,以及调用第三方服务(例 如,餐厅预订门户网站、社交网站、银行门户网站等)或与第三方服务进 行交互。在一些示例中,通过服务模型756中的相应服务模型指定每项服 务所需的协议和应用程序编程接口(API)。服务处理模块738针对服务访问 适当的服务模型,并依据服务模型根据该服务所需的协议和API生成针对 该服务的请求。
例如,如果餐厅已启用在线预订服务,则餐厅提交服务模型,该服务 模型指定进行预订的必要参数以及将必要参数的值传送至在线预订服务的 API。在被任务流处理模块736请求时,服务处理模块738可使用存储在服 务模型中的web地址来建立与在线预订服务的网络连接,并将预订的必要 参数(例如,时间、日期、同行人数)以根据在线预订服务的API的格式 发送至在线预订接口。
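In some examples, natural language processing module 732, dialogue flow processing module 734, and task flow processing module 736 are used collectively and iteratively to infer and define the user's intent, obtain information to further clarify and refine the user intent, and finally generate a response (i.e., an output to the user, or the completion of a task) to fulfill the user's intent. The generated response is a dialogue response to the speech input that at least partially fulfills the user's intent. Further, in some examples, the generated response is output as a speech output. In these examples, the generated response is sent to speech synthesis module 740 (e.g., a speech synthesizer), where it can be processed to synthesize the dialogue response in speech form. In other examples, the generated response is data content relevant to satisfying the user request in the speech input.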
在任务流处理模块736接收到来自自然语言处理模块732的多个结构 化查询的示例中,任务流处理模块736首先处理所接收结构化查询的第一 结构化查询以试图完成第一结构化查询和/或执行由第一结构化查询所表示 的一个或多个任务或动作。在一些示例中,第一结构化查询对应于排名最 高的可执行意图。在其他示例中,第一结构化查询选自基于对应的语音识 别置信度得分和对应的意图置信度得分的结合所接收的结构化查询。在一 些示例中,如果任务流处理模块736在第一结构化查询的处理期间(例如, 由于无法确定必要的参数)遇到错误,任务流处理模块736可继续选择和处 理所接收结构化查询的与排名较低的可执行意图对应的第二结构化查询。 例如基于对应候选文本表示的语音识别置信度得分、对应候选可执行意图 的意图置信度得分、第一结构化查询中的缺失必要参数或它们的任何组合 来选择第二结构化查询。
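Speech synthesis module 740 is configured to synthesize speech outputs for presentation to the user. Speech synthesis module 740 synthesizes speech outputs based on text provided by the digital assistant. For example, the generated dialogue response is in the form of a text string. Speech synthesis module 740 converts the text string to an audible speech output. Speech synthesis module 740 uses any appropriate speech synthesis technique to generate speech outputs from text, including, but not limited to, concatenative synthesis, unit selection synthesis, diphone synthesis, domain-specific synthesis, formant synthesis, articulatory synthesis, hidden Markov model (HMM) based synthesis, and sinewave synthesis. In some examples, speech synthesis module 740 is configured to synthesize individual words based on phonemic strings corresponding to the words. For example, a phonemic string is associated with a word in the generated dialogue response. The phonemic string is stored in metadata associated with the word. Speech synthesis module 740 is configured to directly process the phonemic string in the metadata to synthesize the word in speech form.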
在一些示例中,替代使用言语合成模块740(或除此之外),在远程 设备(例如,服务器系统108)上执行言语合成,并且将合成的言语发送至 用户设备以输出给用户。例如,这可发生在一些具体实施中,其中在服务 器系统处生成数字助理的输出。而且由于服务器系统通常比用户设备具有 更强的处理能力或更多的资源,其有可能获得比客户端侧合成将实现的质 量更高的言语输出。
有关数字助理的附加细节可在于2011年1月10日提交的标题为 “IntelligentAutomated Assistant”的美国实用新型专利申请12/987,982和于 2011年9月30日提交的标题为“Generating and Processing Task Items That Represent Tasks to Perform”的美国实用新型专利申请13/251,088中找到, 其全部公开内容以引用方式并入本文。
4. Exemplary functions and architecture of a digital assistant providing media items based on phonetic matching techniques
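FIG. 8 illustrates a block diagram of a digital assistant 800 for providing media items based on phonetic matching techniques, according to various examples. In some examples, digital assistant 800 (e.g., digital assistant system 700) is implemented by a user device. In some examples, the user device, a server (e.g., server 108), or a combination thereof can implement digital assistant 800. The user device can be implemented using, for example, device 104, 200, 400, or 600 as illustrated in FIGS. 1, 2A-2B, 4, and 6. In some examples, digital assistant 800 can be implemented using digital assistant module 726 of digital assistant system 700. Digital assistant 800 includes one or more modules, models, applications, vocabularies, and user data similar to those of digital assistant module 726. For example, digital assistant 800 includes the following sub-modules, or a subset or superset thereof: an input/output processing module, an STT processing module, a natural language processing module, a task flow processing module, and a speech synthesis module. These modules can also be implemented similarly to the corresponding modules illustrated in FIG. 7B, and are therefore not shown and not described again.
As illustrated in FIG. 8, in some embodiments, digital assistant 800 can include a request detector 810, a media item search engine 820, a repository of media items 830, and a phonetic matching module 840. Digital assistant 800 can receive a speech input 802. FIG. 9 illustrates a block diagram of digital assistant 800 receiving speech inputs from a user 902, according to various examples. With reference to FIGS. 8 and 9, speech input 802 can include unstructured natural language information. For example, as shown in FIG. 9, speech input 802A can include "Play 'Skrrt Skrrt' by 21 Savage," and speech input 802B can include "Find me 'Candyman' by Zedd." While speech inputs 802A and 802B include information such as a song title, an album title, or an artist name, it should be understood that a speech input can also include other information such as a genre, a leading actor, or a famous scene.
In some examples, digital assistant 800 is required to accurately determine the user intent and obtain the correct media item despite similarities in pronunciation, similarities among the media items available in the repository, and a lack of contextual information for intent inference. In the above examples, digital assistant 800 would need to determine, for example, whether speech input 802A includes the word "Skirt," "Skrrt," or "Skrt," and whether speech input 802B includes the word "Candyman" or the word sequence "Candy man." In some examples, the user may not provide additional information, and speech input 802A or 802B may be all the information available to digital assistant 800 for inferring the user intent. Phonetic matching techniques can improve the accuracy of obtaining the correct media item, as described in more detail below.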
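As described above, digital assistant 800 can include request detector 810. FIG. 10 illustrates a block diagram of request detector 810, according to various examples. Request detector 810 can determine whether speech input 802 includes a user request for a media item. Speech input 802 can include any information or any type of request. For example, speech input 802 can include a request for weather information, stock prices, or the like, and be unrelated to requesting a media item. It is therefore desirable to first determine whether speech input 802 includes a user request for a media item. A media item can be an audio item (e.g., a song, an audiobook or chapter, a spoken rendition of news, etc.), a video item (e.g., a movie, a documentary, a TV show, etc.), or a combination thereof.
With reference to FIG. 10, request detector 810 receives speech input 802. In some embodiments, to determine whether speech input 802 includes a user request for a media item, request detector 810 can obtain a sequence of tokens or words based on speech input 802. For example, request detector 810 can include a speech-to-text (STT) processing module 1010. STT processing module 1010 can be implemented using STT processing module 730 as described above. Accordingly, STT processing module 1010 can process speech input 802 (e.g., perform feature extraction and speech recognition using various acoustic and/or language models) and produce a recognition result containing a text string (e.g., a word, a sequence of words, or a sequence of tokens). Examples of speech recognition models include hidden Markov models, Gaussian mixture models, deep neural network models, n-gram language models, and other statistical models.
In some examples, STT processing module 1010 can produce multiple candidate text representations of speech input 802. Each candidate text representation is a sequence of words or tokens corresponding to the speech input. In some examples, each candidate text representation is associated with a speech recognition confidence score. Based on the speech recognition confidence scores, STT processing module 1010 can rank the candidate text representations and provide the n-best (e.g., the n highest-ranked) candidate text representations to request analyzer 1020, where n is a predetermined integer greater than zero. In one example, only the highest-ranked (n=1) candidate text representation is passed to request analyzer 1020 to determine whether speech input 802 includes a user request for a media item. In another example, the five highest-ranked (n=5) candidate text representations are passed to request analyzer 1020 to determine whether speech input 802 includes a user request for a media item.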
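In some embodiments, as illustrated in FIG. 10, request analyzer 1020 receives the one or more word sequences provided by STT processing module 1010. Request analyzer 1020 can determine whether the one or more word sequences include a representation of a user request for a media item. In one example, request analyzer 1020 can determine whether the one or more word sequences include one or more predetermined words. For example, a predetermined word (e.g., "play," "shuffle," "listen," "find") or a predetermined word sequence (e.g., "can you play …?") may indicate that speech input 802 includes a user request for a media item.
In one example, request analyzer 1020 can determine whether the one or more word sequences correspond to one or more predetermined grammars. For example, the grammar of a predetermined word sequence can have a pattern such as: a song title followed by an artist name (e.g., "Skrrt Skrrt by 21 Savage"), an artist name followed by a song title (e.g., "21 Savage Skrrt Skrrt"), a definite article followed by a song title (e.g., "the Skrrt Skrrt"), any word followed by an artist name (e.g., "… 21 Savage"), a definite article followed by an album title and/or an artist name (e.g., "the Candyman by Zedd"), or any word preceding or following a genre (e.g., "please play light music"). In some examples, if the one or more word sequences obtained from speech input 802 correspond to one or more predetermined grammars, it is likely that speech input 802 includes a user request for a media item.
In some examples, request analyzer 1020 can determine whether the one or more word sequences provided by STT processing module 1010 include a user request for a media item based on a combination of predetermined words and predetermined grammars. For example, request analyzer 1020 can determine whether a word sequence both includes a predetermined word and corresponds to any of the patterns described above (e.g., "Play Skrrt Skrrt by 21 Savage"). If the word sequence both includes a predetermined word and corresponds to any of the patterns described above, it is likely that speech input 802 includes a user request for a media item.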
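In some examples, to reduce the likelihood of incorrectly determining whether speech input 802 includes a user request for a media item, a stop word detector 1030 can be included in request detector 810. Stop word detector 1030 can determine whether the one or more word sequences include one or more stop words or phrases. Stop words include common words that are frequently found to indicate that a speech input includes a request the digital assistant is unlikely to be able to recognize. In many cases, such speech inputs can include one or more stop words, such as "playlist" or "podcast." These words can be associated with a large number of items that may not correspond to, or may not be supported by, the collection of candidate media items 824, such that they are unlikely to be recognized by phonetic matching module 840. In some examples, stop word detector 1030 can include a finite state machine for detecting stop words. If one or more stop words are detected, speech input 802 is unlikely to include a user request for a media item.
In some examples, request detector 810 can determine a probability and/or score for a word sequence and compare the probability or score with a threshold probability or score. If the probability or score satisfies the threshold probability or score, request detector 810 can determine that speech input 802 includes a user request for a media item. For example, if the word sequence includes one or more predetermined words (e.g., "play") and/or matches a predetermined grammar (e.g., a song title followed by an artist name), request detector 810 can increase the probability and/or score of the word sequence, and if the word sequence includes one or more stop words, request detector 810 can decrease its probability and/or score.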
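The score-and-threshold decision made by the request detector can be sketched as follows. The trigger and stop word sets echo the examples in the text, but the weights (0.5, 0.4, 0.6), the threshold, the function names, and the `matches_grammar` flag are illustrative assumptions; the specification does not give numeric values or state how the grammar check is computed.

```python
TRIGGER_WORDS = {"play", "shuffle", "listen", "find"}  # predetermined words
STOP_WORDS = {"playlist", "podcast"}                   # stop words


def request_score(words, matches_grammar=False):
    """Score how likely a lower-cased word sequence is a media-item request."""
    score = 0.0
    if any(word in TRIGGER_WORDS for word in words):
        score += 0.5  # hypothetical boost for a predetermined word
    if matches_grammar:
        score += 0.4  # hypothetical boost for matching a predetermined grammar
    if any(word in STOP_WORDS for word in words):
        score -= 0.6  # hypothetical penalty for a stop word
    return score


def is_media_request(words, matches_grammar=False, threshold=0.5):
    """Decide whether the score satisfies the (assumed) threshold."""
    return request_score(words, matches_grammar) >= threshold
```

Under these assumed weights, "play skrrt skrrt by 21 savage" with a matching grammar passes the threshold, whereas "play my podcast" is pulled below it by the stop word even though it contains "play."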
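As illustrated in FIGS. 8 and 10, in some embodiments, in accordance with a determination that speech input 802 includes a user request for a media item, request detector 810 can provide word sequence 812 representing speech input 802 to media item search engine 820. Media item search engine 820 can determine a candidate media item from repository of media items 830. FIG. 11A illustrates a block diagram of media item search engine 820, according to various examples. In some embodiments, media item search engine 820 can include a phonetic symbol sequence generator 1110, a symbol search engine 1120, and a search result refining module 1130. Each of these components of media item search engine 820 is described in more detail below.
With reference to FIG. 11A, phonetic symbol sequence generator 1110 can generate a phonetic symbol sequence 1112 representing speech input 802. In some embodiments, phonetic symbol sequence generator 1110 can include speech synthesis module 740 as described above. In some examples, using speech synthesis module 740, phonetic symbol sequence 1112 can be generated from word sequence 812 using any appropriate speech synthesis technique, including, but not limited to, concatenative synthesis, unit selection synthesis, diphone synthesis, domain-specific synthesis, formant synthesis, articulatory synthesis, hidden Markov model (HMM) based synthesis, and sinewave synthesis. As described above, word sequence 812 is obtained based on speech input 802. In some examples, because of the user's imperfect pronunciation in speech input 802 (e.g., the user's accent), a noisy environment, and/or the inexact nature of speech-to-text conversion, word sequence 812 may or may not accurately represent the user intent. For example, speech input 802 can include the artist name "James Smith," whereas speech-to-text processing module 1010 shown in FIG. 10 can generate word sequence 812 including the words "fChame Snix." As such, based on word sequence 812, phonetic symbol sequence generator 1110 can generate a phonetic symbol sequence 1112 such as "f-ch-ay-m-[]-s-n-ix-[]."
In some embodiments, as illustrated in FIG. 11A, symbol search engine 1120 can determine one or more reference phonetic symbol sequences 1122 based on phonetic symbol sequence 1112 representing speech input 802. A reference phonetic symbol sequence can represent a word sequence obtained from one or more dictionaries. As such, the words represented in a reference phonetic symbol sequence can be words in their canonical form (e.g., their accepted or error-free form); accordingly, the pronunciation included in the reference phonetic symbols can be the canonical pronunciation. Using the above example, the reference phonetic symbol sequence of the word sequence "James Smith" can be "[]-jh-ey-m-z-s-m-ih-th."
In some embodiments, to determine reference phonetic symbol sequence 1122, symbol search engine 1120 can use a phonomap. A phonomap associates one or more reference phonetic symbol sequences (e.g., phonetic symbol sequences obtained based on words in a dictionary) with the phonetic symbol sequence representing the speech input. FIG. 11B illustrates a block diagram for generating a phonomap, according to various examples. FIG. 11C illustrates an exemplary phonomap for the word sequence "James Smith."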
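With reference to FIGS. 11A-11C, in some embodiments, symbol search engine 1120 can generate phonomap 1146 based on a plurality of predetermined operations that can be performed during the generation of phonetic symbol sequence 1112 representing speech input 802. As described above, because of the user's imperfect pronunciation in speech input 802 (e.g., the user's accent), a noisy environment, and/or the inexact nature of speech-to-text conversion, word sequence 812 may or may not accurately represent the intended words. In some examples, one or more operations (e.g., insertion, deletion, or substitution) can be performed during the generation of phonetic symbol sequence 1112 representing speech input 802, resulting in inaccurate, extra, and/or missing phonetic symbols. Accordingly, to determine the reference phonetic symbols in reference phonetic symbol sequence 1122, symbol search engine 1120 can determine whether one or more predetermined operations (e.g., an insertion operation, a substitution operation, and/or a deletion operation) were performed during the generation of phonetic symbol sequence 1112 representing speech input 802. The insertion, substitution, and/or deletion operations can be used to map phonetic symbols that differ from each other but can frequently be associated with each other. For example, for each phonetic symbol of phonetic symbol sequence 1112 representing speech input 802, symbol search engine 1120 can determine whether that phonetic symbol corresponds to a different reference phonetic symbol of reference phonetic symbol sequence 1122. The correspondence can be determined based on the predetermined operations (e.g., insertion, substitution, or deletion) that can be performed during the generation of phonetic symbol sequence 1112 representing speech input 802.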
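As illustrated in FIGS. 11A and 11C, because the generation of phonetic symbol sequences is imperfect, phonetic symbol sequence 1112 representing speech input 802 can include phonetic symbol 1154A (e.g., "f"). Phonetic symbol 1154A can be an extra phonetic symbol that is frequently inserted during the phonetic symbol sequence generation process. Symbol search engine 1120 can therefore determine that phonetic symbol sequence generator 1110 likely performed an insertion operation in generating phonetic symbol sequence 1112. Accordingly, symbol search engine 1120 can determine that phonetic symbol 1154A (e.g., the symbol "f") in phonetic symbol sequence 1112 corresponds to phonetic symbol 1152A (e.g., a silent symbol or no symbol) in reference phonetic symbol sequence 1122.
As another example, because the generation of phonetic symbol sequences is imperfect, phonetic symbol sequence 1112 representing the speech input can include phonetic symbol 1154B (e.g., "ch"). Phonetic symbol 1154B can be a common substitute or alternative form of phonetic symbol 1152B (e.g., "jh") during the phonetic symbol sequence generation process. Symbol search engine 1120 can therefore determine that phonetic symbol sequence generator 1110 likely performed a substitution operation in generating phonetic symbol sequence 1112. Accordingly, symbol search engine 1120 can determine that phonetic symbol 1154B (e.g., the symbol "ch") in phonetic symbol sequence 1112 corresponds to phonetic symbol 1152B (e.g., the symbol "jh") in reference phonetic symbol sequence 1122.
As another example, because the generation of phonetic symbol sequences is imperfect, phonetic symbol sequence 1112 representing the speech input can include phonetic symbol 1154D (e.g., a silent symbol or no symbol), which stands in for a phonetic symbol that is frequently omitted or deleted during the phonetic symbol sequence generation process. Symbol search engine 1120 can therefore determine that phonetic symbol sequence generator 1110 likely performed a deletion operation in generating phonetic symbol sequence 1112. Accordingly, symbol search engine 1120 can determine that phonetic symbol 1154D (e.g., a silent symbol or no symbol) in phonetic symbol sequence 1112 corresponds to phonetic symbol 1152D (e.g., the symbol "z") in reference phonetic symbol sequence 1122.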
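In some examples, if there is an exact match between a phonetic symbol generated from the current speech input and a reference phonetic symbol, symbol search engine 1120 can determine that phonetic symbol sequence generator 1110 likely did not perform an operation such as insertion, substitution, or deletion. For example, for each phonetic symbol of phonetic symbol sequence 1112 representing speech input 802, symbol search engine 1120 can determine whether that phonetic symbol matches the same reference phonetic symbol of reference phonetic symbol sequence 1122. As illustrated in FIG. 11C, symbol search engine 1120 can determine that phonetic symbol 1154C (e.g., "m") in phonetic symbol sequence 1112 exactly matches reference phonetic symbol 1152C (e.g., "m") in reference phonetic symbol sequence 1122. Accordingly, phonomap 1146 indicates that there is a match and that phonetic symbol sequence generator 1110 likely performed no operation.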
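One way to recover which insertion, substitution, deletion, or match relates each observed symbol to a reference symbol is a standard edit-distance alignment. The sketch below uses unit costs and a plain Levenshtein backtrace, which is an assumption made for illustration rather than the method the specification prescribes; the function name and the shortened symbol sequences are likewise hypothetical.

```python
def align(observed, reference):
    """Align an observed phonetic symbol sequence with a reference sequence.

    Returns (distance, ops). Each op is (kind, observed_symbol, reference_symbol):
      "match"      - symbols are identical
      "substitute" - observed symbol replaces a different reference symbol
      "insert"     - extra observed symbol with no reference counterpart
      "delete"     - reference symbol missing from the observed sequence
    """
    n, m = len(observed), len(reference)
    # d[i][j] = minimum number of operations relating observed[:i] to reference[:j]
    d = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        d[i][0] = i
    for j in range(1, m + 1):
        d[0][j] = j
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            diag = d[i - 1][j - 1] + (observed[i - 1] != reference[j - 1])
            d[i][j] = min(diag, d[i - 1][j] + 1, d[i][j - 1] + 1)

    # Walk back from the end to recover one optimal sequence of operations.
    ops, i, j = [], n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0:
            same = observed[i - 1] == reference[j - 1]
            if d[i][j] == d[i - 1][j - 1] + (0 if same else 1):
                kind = "match" if same else "substitute"
                ops.append((kind, observed[i - 1], reference[j - 1]))
                i, j = i - 1, j - 1
                continue
        if i > 0 and d[i][j] == d[i - 1][j] + 1:
            ops.append(("insert", observed[i - 1], None))
            i -= 1
            continue
        ops.append(("delete", None, reference[j - 1]))
        j -= 1
    ops.reverse()
    return d[n][m], ops
```

For example, aligning the observed sequence `f jh ey m s` with the reference `jh ey m z s` yields one inserted "f," one deleted "z," and four matches. When several alignments tie on cost, this backtrace returns just one of them, so a probability-weighted cost would be needed to prefer the phonetically likeliest alignment.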
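In some examples, for each phonetic symbol in phonetic symbol sequence 1112 representing speech input 802, symbol search engine 1120 can also determine the probability that the phonetic symbol corresponds to a reference phonetic symbol in reference phonetic symbol sequence 1122. Symbol search engine 1120 can include these probabilities in the phonomap. FIG. 11D illustrates such an exemplary phonomap, according to various examples.
With reference to FIGS. 11A and 11D, symbol search engine 1120 can generate phonomap 1160. Phonomap 1160 can include a plurality of phonetic symbols 1164 from phonetic symbol sequence 1112 representing speech input 802. Phonetic symbols 1164 can include, for example, "Ae-Eh-B-D-Dx-Del," where "Del" represents a deletion or omission of a symbol. Phonomap 1160 can also include a plurality of reference phonetic symbols 1162 from reference phonetic symbol sequence 1122. Reference phonetic symbols 1162 can include, for example, "ae0-ae1-ae2-eh0-eh1-eh2-b<-b>-d<-d>-dx-ins," where "ins" represents an insertion operation. In phonomap 1160, the number associated with a phonetic symbol 1164 and a reference phonetic symbol 1162 represents the probability that the particular phonetic symbol 1164 corresponds to the particular reference phonetic symbol 1162. For example, element 1166A in phonomap 1160 represents the probability (e.g., 92%) that phonetic symbol 1164A (e.g., "Ae") in phonetic symbol sequence 1112 corresponds to reference phonetic symbol 1162A (e.g., "ae0") in reference phonetic symbol sequence 1122. That is, element 1166A represents the probability that phonetic symbol sequence generator 1110 performed a substitution operation (e.g., substituting "Ae" for "ae0") in phonetic symbol sequence 1112 representing speech input 802 when generating phonetic symbol 1164A.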
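As another example, element 1166B in phonomap 1160 represents the probability (e.g., 6%) that phonetic symbol 1164B (e.g., "D") in phonetic symbol sequence 1112 representing speech input 802 was inserted by phonetic symbol sequence generator 1110. As yet another example, element 1166C in phonomap 1160 represents the probability (e.g., 5%) that phonetic symbol sequence generator 1110 performed a deletion operation in, or omitted reference phonetic symbol 1162C (e.g., "b<") from, phonetic symbol sequence 1112 representing speech input 802. In some examples, the probabilities in phonomap 1160 can be determined based on training data. The training of phonomaps is described in detail in U.S. Patent Application No. 10/401,572, titled "PHONETICALLY BASED SPEECH RECOGNITION SYSTEM AND METHOD," filed March 31, 2003 and issued as U.S. Patent No. 7,146,319, the content of which is hereby incorporated by reference in its entirety and included in the Appendix.
In some examples, phonomap 1160 can also include other information, such as the relative frequency with which a particular reference phonetic symbol occurs among all reference phonetic symbols. For example, element 1166D indicates that the relative frequency with which reference phonetic symbol 1162B (e.g., "ae1") occurs among all reference phonetic symbols is 1. Phonomap 1160 can also include information such as the expected score when phonetic symbol sequence generator 1110 generates a phonetic symbol 1164. The score can indicate the accuracy or likelihood with which phonetic symbol 1164 represents speech input 802.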
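With reference again to FIG. 11A, in some examples, search result refining module 1130 can determine a candidate media item based on reference phonetic symbol sequences 1122. For example, for each of the one or more reference phonetic symbol sequences 1122, search result refining module 1130 can determine a score indicating the degree of matching between phonetic symbol sequence 1112 representing speech input 802 and the particular reference phonetic symbol sequence 1122. For example, if a particular reference phonetic symbol sequence 1122 has a higher degree of matching (e.g., no operations, or only a few operations such as substitutions, insertions, or deletions), that reference phonetic symbol sequence 1122 can have a higher score. In some examples, search result refining module 1130 can determine one or more media items based on the scores associated with the one or more reference phonetic symbol sequences 1122. For example, search result refining module 1130 can rank the one or more reference phonetic symbol sequences 1122 by their respective scores, select the reference phonetic symbol sequence with the highest score, and identify the candidate media item based on that sequence.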
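The scoring and ranking of reference sequences could be sketched as below. The specification does not state how the score is computed, so summing log probabilities from a phonomap is purely an assumed scoring rule; the probability values, the `floor` fallback, the function names, and the candidate labels are also hypothetical.

```python
import math

# Hypothetical phonomap: (observed symbol, reference symbol) -> probability.
# None stands for "no symbol" at an inserted or deleted position.
PHONOMAP = {
    ("m", "m"): 0.95,
    ("s", "s"): 0.95,
    ("ch", "jh"): 0.30,
    ("ay", "ey"): 0.25,
    ("f", None): 0.05,
    (None, "z"): 0.10,
}


def alignment_score(aligned_pairs, phonomap, floor=1e-4):
    """Sum of log probabilities over aligned (observed, reference) pairs.

    Pairs missing from the phonomap fall back to a small floor probability.
    """
    return sum(math.log(phonomap.get(pair, floor)) for pair in aligned_pairs)


def rank_references(candidates, phonomap):
    """Return candidate names ordered from best to worst match.

    candidates: dict mapping a name to its list of aligned symbol pairs.
    """
    scored = [(alignment_score(pairs, phonomap), name)
              for name, pairs in candidates.items()]
    return [name for _, name in sorted(scored, reverse=True)]
```

Under this assumed rule, a reference whose differences from the observed sequence are all frequent confusions (such as "ch" for "jh") outranks a reference that requires confusions absent from the phonomap.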
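With reference to FIGS. 8 and 11A, in some examples, identifying the candidate media item can be performed using the reference phonetic symbol sequence with the highest score and phonetic representations 832 of the media items available in repository of media items 830. For example, based on repository of media items 830, phonetic symbol sequences can be generated for information (e.g., title, album, genre, artist, etc.) related to each media item available in repository 830. The phonetic symbol sequences of the information of all media items available in repository 830 can form phonetic representations 832. Search result refining module 1130 can determine the candidate media item by mapping the reference phonetic symbol sequence with the highest score to phonetic representations 832 using, for example, a weighted finite-state transducer (WFST). A WFST enables mapping between two sets of symbols (e.g., the phonetic symbols in reference phonetic symbol sequence 1122 and the phonetic symbols included in each phonetic symbol sequence of phonetic representations 832). Accordingly, based on the mapping of phonetic symbol sequences, the candidate media item can be identified. The candidate media item can correspond to the phonetic symbol sequence of phonetic representations 832 that best matches the reference phonetic symbol sequence with the highest score. Using a WFST allows the candidate media item to be determined more quickly and efficiently. As illustrated in FIG. 11A, search result refining module 1130 can provide phonetic symbol sequence 1132 of the candidate media item for further processing. Phonetic symbol sequence 1132 of the candidate media item represents the phonetic symbol sequence of phonetic representations 832 that best matches the reference phonetic symbol sequence with the highest score.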
再次参考图8,媒体项目搜索引擎820可提供言语输入802的语音表示 822和候选媒体项目的语音表示824。言语输入802的语音表示822可包括 例如表示言语输入802的语音符号序列1112,如图11A所示。候选媒体项 目的语音表示824可包括候选媒体项目的语音符号序列1132,如图11A所 示。在一些示例中,基于候选媒体项目的语音表示824和言语输入802的 语音表示822之间的差值,语音匹配模块840可确定是否需向用户提供候 选媒体项目。
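As described above, media item search engine 820 can determine phonetic representation 824 of the candidate media item. The candidate media item can be identified from repository 830 of media items. The candidate media item thus represents the best-matching media item available from repository 830. The candidate media item, however, may or may not be the item the user intended. For example, if the particular media item the user intended is not stored in repository 830, the candidate media item may represent a closely related media item but not the exact item the user intended. Further determining whether the candidate media item matches the user intent derived from speech input 802 can therefore improve accuracy and provide a more efficient user interface. For example, if the media item the user intended is not found in repository 830, the candidate media item (e.g., the best-matching item in repository 830) may not be provided to the user. In some examples, if the score associated with phonetic representation 824 of the candidate media item satisfies a threshold condition, it may not be necessary to further determine whether the candidate media item matches the user intent. For example, the score associated with phonetic representation 824 can correspond to the highest score among reference phonetic symbol sequences 1122. If the highest score exceeds a particular threshold, this can indicate that the candidate media item is highly likely to match the user intent, and the phonetic matching process (described in more detail below) may not be needed. Accordingly, the determination of whether the candidate media item matches the user intent can be forgone (e.g., not performed or skipped), and the candidate media item can be provided to the user. Forgoing this determination can improve the speed of responding to the user.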
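Referring to FIGS. 8 and 11A, in some embodiments, to determine whether to provide the candidate media item to the user (e.g., to determine whether the candidate media item sufficiently matches the item the user intended), phonetic matching module 840 can determine the difference between phonetic representation 824 of the candidate media item and phonetic representation 822 of speech input 802. As described above, in some examples, phonetic representation 822 of speech input 802 can include phonetic symbol sequence 1112 representing speech input 802, and phonetic representation 824 can include phonetic symbol sequence 1132 of the candidate media item. In some examples, the difference can be determined based on an expected score associated with phonetic symbol sequence 1112 and a score indicating the degree of match between phonetic symbol sequence 1112 and phonetic symbol sequence 1132 of the candidate media item.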
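In some examples, the expected score associated with phonetic symbol sequence 1112 representing speech input 802 can be the best possible score associated with generating the phonetic symbols included in phonetic symbol sequence 1112 representing speech input 802. In some examples, the best possible score can be a confidence score associated with generating phonetic symbol sequence 1112 representing speech input 802. As described above, in some examples, generating phonetic symbol sequence 1112 can include generating word sequence 812 from speech input 802 and generating phonetic symbol sequence 1112 based on word sequence 812. These processes can require speech-to-text conversion to convert speech input 802 into word sequence 812, as well as a speech synthesis process to convert word sequence 812 into phonetic symbol sequence 1112. Accordingly, one or more confidence scores can be determined to indicate the likely accuracy of the processes used to generate phonetic symbol sequence 1112 from speech input 802.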
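Referring to FIGS. 8 and 11A, phonetic matching module 840 can determine whether the difference between phonetic representation 824 of the candidate media item and phonetic representation 822 of speech input 802 satisfies a threshold condition and, in accordance with a determination that the difference satisfies the threshold condition, determine that the candidate media item is to be provided to the user. As described above, phonetic symbol sequence 1112 representing speech input 802 can be included in phonetic representation 822, and phonetic symbol sequence 1132 of the candidate media item can be included in phonetic representation 824. In some examples, phonetic matching module 840 can determine whether the difference between phonetic representation 824 of the candidate media item and phonetic representation 822 of speech input 802 satisfies the threshold condition according to formula (1) below: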
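(expected score associated with phonetic symbol sequence 1112 representing speech input 802 − highest score indicating the highest degree of match between phonetic symbol sequence 1112 and phonetic symbol sequence 1132 of the candidate media item) / N < T (1)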
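In formula (1) above, “N” represents the number of phonetic symbols included in phonetic symbol sequence 1112 representing speech input 802, and “T” represents the threshold condition (e.g., a score difference) for keeping phonetic symbol sequence 1132 of the candidate media item. Thus, if the formula is satisfied, the candidate media item identified from the available media items stored in repository 830 is likely to match the user intent represented by speech input 802, and phonetic matching module 840 can determine that the candidate media item represented by phonetic symbol sequence 1132 is to be provided to the user. In some examples, phonetic matching module 840 can include a look-ahead mechanism for forgoing or terminating the determination of whether the candidate media item is to be provided to the user. The determination can be forgone or terminated based on the expected score. For example, the expected score associated with phonetic symbol sequence 1112 representing speech input 802 can indicate that formula (1) is very unlikely to be satisfied, so that the determination can be forgone or terminated. Performing phonetic matching between phonetic representation 824 of the candidate media item and phonetic representation 822 of speech input 802 can improve the likelihood of accurately identifying a media item that matches the user intent. This reduces the error rate and improves end-to-end accuracy with respect to user intent.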
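Formula (1) reduces to a one-line comparison once the two scores are available. The sketch below is a direct transcription under the assumption that both scores are on the same scale (for example, log-probabilities); the function and parameter names and the example numbers are hypothetical.

```python
def satisfies_formula_1(expected_score, highest_match_score, n_symbols, threshold):
    """Return True when (expected - highest) / N < T, i.e. formula (1) holds.

    n_symbols is N, the number of phonetic symbols in the sequence that
    represents the speech input; threshold is T.
    """
    if n_symbols <= 0:
        raise ValueError("the speech input must contain at least one phonetic symbol")
    per_symbol_difference = (expected_score - highest_match_score) / n_symbols
    return per_symbol_difference < threshold
```

With an expected score of -0.4, a best match score of -1.2, eight symbols, and a threshold of 0.2, the per-symbol difference is about 0.1 and the candidate is kept; with a best match score of -4.4 the difference is about 0.5 and the candidate is withheld. Dividing by N keeps long utterances from being penalized merely for containing more symbols.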
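Referring again to FIG. 8, in some embodiments, in accordance with a determination that the candidate media item is to be provided to the user, digital assistant 800 can provide candidate media item 842 to the user. For example, digital assistant 800 can output audio and/or video associated with candidate media item 842 (e.g., play a song or a movie). In some examples, digital assistant 800 can also determine additional information associated with candidate media item 842 (e.g., title, artist, source, album name, genre) and provide the additional information to the user. For example, digital assistant 800 can output the additional information as audio and/or display it visually to the user.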
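In some embodiments, digital assistant 800 can determine whether phonetic matching may be needed and can provide one or more media items with or without phonetic matching based on that determination. Such a determination can be based on one or more criteria, such as a language model score. A language model score indicates the likelihood that a word sequence occurs, based on prior analysis of a very large body of text. It can indicate how accurately a user request is recognized and/or interpreted based on the natural language processing results (described above) in the absence of phonetic matching. For example, if digital assistant 800 determines that the language model score satisfies (e.g., is greater than or equal to) a model-score threshold condition, it can determine that phonetic matching is not needed to carry out the user request. A language model score that satisfies the model-score threshold condition can indicate that the natural language processing performed on the user input is highly accurate, so that further phonetic matching is unlikely to help. For example, the user input can include “play some music,” which is a common phrase that the natural language processing components of digital assistant 800 recognize and interpret well. Digital assistant 800 can therefore determine that the language model score associated with the user input “play some music” is above the model-score threshold condition, and thus not provide the user input for phonetic matching. Instead, one or more media items can be identified and provided to the user without phonetic matching.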
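In some examples, if digital assistant 800 determines that the language model score does not satisfy (e.g., is below) the model-score threshold condition, this can indicate that recognition or interpretation of the user input based on natural language processing is uncertain without phonetic matching. Digital assistant 800 can therefore determine that phonetic matching is needed to carry out the user request, and provide the user input for phonetic matching. In some embodiments, determining whether phonetic matching is needed before providing the user input for phonetic matching can improve the accuracy of identifying the media item the user requested and the efficiency of providing it. For example, if all user inputs were provided for phonetic matching without first determining whether phonetic matching is needed, digital assistant 800 might fail to identify the appropriate media item for certain user requests (e.g., common phrases such as “play some music”), because a specific item that names an artist or title can sometimes match better owing to variations arising in automatic speech recognition.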
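The gating decision described here amounts to a single branch on the language model score. The sketch below assumes that “satisfies” means greater than or equal to, as in the parenthetical example; the function name `route_request` and the two callables passed in are hypothetical stand-ins for the natural language processing path and the phonetic matching path.

```python
def route_request(word_sequence, language_model_score, model_score_threshold,
                  nlp_lookup, phonetic_lookup):
    """Choose between the NLP-only path and the phonetic matching path.

    nlp_lookup and phonetic_lookup are caller-supplied callables (assumed
    here) that take the word sequence and return media items.
    """
    if language_model_score >= model_score_threshold:
        # Common, well-modelled phrasing: phonetic matching is unlikely to help.
        return nlp_lookup(word_sequence)
    # Unusual phrasing (e.g., rare titles or artist names): use phonetic matching.
    return phonetic_lookup(word_sequence)
```

A request such as “play some music” would typically take the first branch, while a rare title that the language model scores poorly would take the second.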
5. Process for providing media items based on phonetic matching techniques
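FIGS. 12A-12E illustrate process 1200 for operating a digital assistant to provide media items based on phonetic matching techniques, according to various examples. Process 1200 can be performed using, for example, one or more electronic devices implementing a digital assistant. In some examples, process 1200 is performed using a client-server system (e.g., system 100), and the blocks of process 1200 are divided up in any manner between the server (e.g., DA server 106) and a client device. In other examples, the blocks of process 1200 are divided up between the server and multiple client devices (e.g., a mobile phone and a smart watch). Thus, although portions of process 1200 are described herein as being performed by particular devices of a client-server system, it should be understood that process 1200 is not so limited. In other examples, process 1200 is performed using only a client device (e.g., user device 104) or only multiple client devices. In process 1200, some blocks are optionally combined, the order of some blocks is optionally changed, and some blocks are optionally omitted. In some examples, additional steps can be performed in combination with process 1200.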
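Referring to FIG. 12A, at block 1202, a speech input is received from a user. At block 1204, the speech input includes unstructured natural language information (e.g., “Play Skrrt Skrrt by 21 Savage”).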
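At block 1206, it is determined whether the speech input includes a user request for a media item. At block 1208, to determine whether the speech input includes a user request for a media item, a word sequence is obtained based on the speech input. At block 1210, to obtain the word sequence, speech-to-text conversion of the speech input is performed based on a statistical language model.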
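At block 1212, based on the obtained word sequence, it is determined whether the word sequence includes a representation of a user request for a media item. At block 1214, in some examples, to determine whether the word sequence includes a representation of the user request, it is determined whether the word sequence includes one or more predetermined words (e.g., “play”). At block 1216, in some examples, to determine whether the word sequence includes a representation of the user request, it is determined whether the word sequence corresponds to one or more predetermined grammars (e.g., a media item name followed by an artist). At block 1218, in some examples, to determine whether the word sequence includes a representation of the user request, it is determined whether the word sequence includes one or more stop words (e.g., “playlist”).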
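The three checks at blocks 1214, 1216, and 1218 can be computed independently from the word sequence. The sketch below reports each check as a separate signal and deliberately does not combine them, since the text does not say how the signals are weighed against one another. The word lists, the regular expression used as a stand-in for a predetermined grammar, and all names are hypothetical.

```python
import re
from typing import NamedTuple


class RequestSignals(NamedTuple):
    has_predetermined_word: bool  # block 1214
    matches_grammar: bool         # block 1216
    has_stop_word: bool           # block 1218


PREDETERMINED_WORDS = {"play"}  # assumed word list
STOP_WORDS = {"playlist"}       # assumed word list
# Assumed grammar: "play <media item name> by <artist>".
GRAMMAR = re.compile(r"^play\s+(?P<title>.+?)\s+by\s+(?P<artist>.+)$", re.IGNORECASE)


def media_request_signals(word_sequence):
    """Compute the three request signals for a list of words."""
    words = [word.lower() for word in word_sequence]
    text = " ".join(word_sequence)
    return RequestSignals(
        has_predetermined_word=any(word in PREDETERMINED_WORDS for word in words),
        matches_grammar=GRAMMAR.match(text) is not None,
        has_stop_word=any(word in STOP_WORDS for word in words),
    )
```

For the example utterance, `media_request_signals("Play Skrrt Skrrt by 21 Savage".split())` reports a predetermined word, a grammar match, and no stop word.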
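Referring to FIG. 12B, at block 1220, in accordance with a determination that the speech input includes a user request for a media item, a candidate media item is determined from a repository of media items. At block 1222, to determine the candidate media item, a phonetic symbol sequence representing the speech input is generated. At block 1224, the phonetic symbol sequence is generated based on a word sequence obtained from speech-to-text conversion of the speech input.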
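At block 1226, one or more reference phonetic symbol sequences are determined based on the phonetic symbol sequence representing the speech input. At block 1228, the reference phonetic symbol sequences represent word sequences obtained from one or more dictionaries. At block 1230, in some examples, to determine the reference phonetic symbol sequences, it is determined, for each phonetic symbol in the phonetic symbol sequence representing the speech input, whether that phonetic symbol matches an identical reference phonetic symbol.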
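At block 1232, in some examples, to determine the reference phonetic symbol sequences, it is determined, for each phonetic symbol in the phonetic symbol sequence representing the speech input, whether that phonetic symbol corresponds to a different reference phonetic symbol. At block 1234, to determine the correspondence, it is determined whether one or more predetermined operations were performed during generation of the phonetic symbol sequence representing the speech input. In some examples, the predetermined operations are a substitution operation, a deletion operation, and/or an insertion operation. At block 1236, for each phonetic symbol in the phonetic symbol sequence representing the speech input, the probability that the phonetic symbol corresponds to a reference phonetic symbol is determined.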
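Referring to FIG. 12C, at block 1238, the one or more reference phonetic symbol sequences are determined based on matching one or more patterns associated with the phonetic symbol sequence representing the speech input against one or more reference patterns.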
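At block 1240, a candidate media item is determined based on the one or more determined reference phonetic symbol sequences. At block 1242, to determine the candidate media item, a score is determined for each of the one or more reference phonetic symbol sequences, the score indicating the degree of match between the phonetic symbol sequence representing the speech input and that reference phonetic symbol sequence.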
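At block 1244, the candidate media item is determined based on the scores associated with the one or more reference phonetic symbol sequences. At block 1246, to determine the candidate media item, the one or more reference phonetic symbol sequences are ranked by their respective scores. At block 1248, the reference phonetic symbol sequence having the highest score is selected. At block 1250, the candidate media item is identified based on the reference phonetic symbol sequence having the highest score.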
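At block 1252, to identify the candidate media item, the reference phonetic symbol sequence having the highest score is mapped to the phonetic representations of the media items available in the repository of media items. As described above, the phonetic representations of the media items available in the repository can include a plurality of phonetic symbol sequences for information associated with the media items (e.g., names of songs, artists, albums, genres). At block 1254, the candidate media item is identified based on the mapping. For example, the candidate media item can be identified based on the best-matching phonetic symbol sequence among the media items available in the repository. At block 1256, in some examples, it is determined whether the highest score satisfies a score-threshold condition. At block 1258, in accordance with a determination that the highest score satisfies the score-threshold condition, the determination of whether the candidate media item is to be provided to the user is forgone. For example, the score may be high enough to satisfy the score-threshold condition, so that there is no need to determine, based on phonetic matching techniques, whether the candidate media item should be provided to the user.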
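Referring to FIG. 12D, at block 1260, it is determined whether the candidate media item is to be provided to the user based on the difference between the phonetic representation of the candidate media item and the phonetic representation of the speech input.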
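At block 1262, to determine whether the candidate media item is to be provided to the user, the difference between the phonetic representation of the candidate media item and the phonetic representation of the speech input is determined. As described above, phonetic matching can be applied between the two phonetic representations. In some examples, at block 1264, the difference is determined based on an expected score associated with the phonetic symbol sequence representing the speech input and a score indicating the degree of match between the phonetic symbol sequence representing the speech input and the phonetic symbol sequence of the candidate media item. For example, the difference can be determined using the formula described above. In some examples, the expected score associated with the phonetic symbol sequence of the speech input is a confidence score associated with generating the phonetic symbol sequence of the speech input. At block 1266, in some examples, while determining whether the candidate media item is to be provided to the user, the determination is terminated or forgone based on the expected score associated with the phonetic symbol sequence of the speech input. For example, it can be determined that the expected score is too low to satisfy formula (1) above, so that no further determination is needed.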
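At block 1268, it is determined whether the difference between the phonetic representation of the candidate media item and the phonetic representation of the speech input satisfies a threshold condition. At block 1270, in accordance with a determination that the difference satisfies the threshold condition, it is determined that the candidate media item is to be provided to the user. In some examples, if the difference satisfies the threshold condition, the candidate media item is likely to closely match the user intent and is therefore provided to the user.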
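Blocks 1256 through 1270 can be read as a two-stage decision: a shortcut when the best reference score is already high, followed by the per-symbol difference test. The sketch below assumes that the score-threshold condition is satisfied when the score is greater than or equal to the threshold; the enum, the function name, and every parameter name are hypothetical.

```python
from enum import Enum


class Decision(Enum):
    PROVIDE_WITHOUT_CHECK = "provide; phonetic check forgone"
    PROVIDE = "provide"
    WITHHOLD = "withhold"


def decide(best_reference_score, score_threshold,
           expected_score, candidate_match_score, n_symbols, difference_threshold):
    """Sketch of blocks 1256-1270 under the assumptions stated above."""
    # Blocks 1256-1258: a high enough top score makes the check unnecessary.
    if best_reference_score >= score_threshold:
        return Decision.PROVIDE_WITHOUT_CHECK
    # Blocks 1262-1264: per-symbol difference between expected and achieved score.
    difference = (expected_score - candidate_match_score) / n_symbols
    # Blocks 1268-1270: provide only if the difference is under the threshold.
    if difference < difference_threshold:
        return Decision.PROVIDE
    return Decision.WITHHOLD
```

Returning a distinct value for the shortcut path makes it easy to measure how often the phonetic check is skipped.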
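At block 1272, in accordance with a determination that the candidate media item is to be provided to the user, the candidate media item is provided to the user. At block 1274, audio associated with the candidate media item is output (e.g., a song is played). At block 1276, additional information associated with the candidate media item is determined. The additional information can include, for example, artist, album, and genre. At block 1278, the additional information associated with the candidate media item is provided to the user.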
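Referring to FIG. 12E, at block 1280, before determining whether the speech input includes a user request for a media item, natural language processing of the speech input is performed. At block 1282, it is determined whether phonetic matching is needed based on the results of the natural language processing. At block 1284, in accordance with a determination that phonetic matching is needed, the determination of whether the speech input includes a user request for a media item is initiated. At block 1286, in accordance with a determination that phonetic matching is not needed, a media item obtained based on the natural language processing results is provided.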
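The operations described above with reference to FIGS. 12A-12E are optionally implemented by the components depicted in FIGS. 1-4, 6A-6B, and 7A-7C. For example, the operations of process 1200 can be implemented by digital assistant system 700. It would be clear to a person having ordinary skill in the art how other processes can be implemented based on the components depicted in FIGS. 1-4, 6A-6B, and 7A-7C.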
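In accordance with some implementations, a computer-readable storage medium (e.g., a non-transitory computer-readable storage medium) is provided, the computer-readable storage medium storing one or more programs for execution by one or more processors of an electronic device, the one or more programs including instructions for performing any of the methods or processes described herein.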
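In accordance with some implementations, an electronic device (e.g., a portable electronic device) is provided that comprises means for performing any of the methods and processes described herein.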
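In accordance with some implementations, an electronic device (e.g., a portable electronic device) is provided that comprises a processing unit configured to perform any of the methods and processes described herein.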
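In accordance with some implementations, an electronic device (e.g., a portable electronic device) is provided that comprises one or more processors and memory storing one or more programs for execution by the one or more processors, the one or more programs including instructions for performing any of the methods and processes described herein.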
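The foregoing description, for purposes of explanation, has been given with reference to specific embodiments. However, the illustrative discussions above are not intended to be exhaustive or to limit the invention to the precise forms disclosed. Many modifications and variations are possible in view of the above teachings. The embodiments were chosen and described in order to best explain the principles of the techniques and their practical applications. Others skilled in the art are thereby enabled to best utilize the techniques and the various embodiments with various modifications as are suited to the particular use contemplated.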
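Although the disclosure and examples have been fully described with reference to the accompanying drawings, it is to be noted that various changes and modifications will become apparent to those skilled in the art. Such changes and modifications are to be understood as being included within the scope of the disclosure and examples as defined by the claims. In addition, although this disclosure uses media items as examples, those skilled in the art will appreciate that the techniques can be applied to any other items, such as lists of geographic items including points of interest and postal addresses.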

Claims (35)

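1. A method for providing a digital assistant service, comprising:
at one or more electronic devices having memory and one or more processors:
receiving a speech input from a user;
determining whether the speech input includes a user request for a media item;
in accordance with a determination that the speech input includes a user request for a media item, determining a candidate media item from a repository of media items;
determining, based on a difference between a phonetic representation of the candidate media item and a phonetic representation of the speech input, whether the candidate media item is to be provided to the user; and
in accordance with a determination that the candidate media item is to be provided to the user, providing the candidate media item to the user.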
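2. The method of claim 1, wherein the speech input includes unstructured natural language information.
3. The method of any of claims 1 and 2, wherein determining whether the speech input includes a user request for a media item comprises:
obtaining a word sequence based on the speech input; and
determining whether the word sequence includes a representation of the user request for a media item.
4. The method of claim 3, wherein obtaining the word sequence based on the speech input comprises performing speech-to-text conversion of the speech input based on a statistical language model.
5. The method of any of claims 3 and 4, wherein determining whether the word sequence includes a representation of the user request for a media item comprises determining whether the word sequence includes one or more predetermined words.
6. The method of any of claims 3-5, wherein determining whether the word sequence includes a representation of the user request for a media item comprises determining whether the word sequence corresponds to one or more predetermined grammars.
7. The method of any of claims 3-6, wherein determining whether the word sequence includes a representation of the user request for a media item comprises determining whether the word sequence includes one or more stop words.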
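8. The method of any of claims 1-7, wherein determining the candidate media item from the repository of media items comprises:
generating a phonetic symbol sequence representing the speech input;
determining one or more reference phonetic symbol sequences based on the phonetic symbol sequence representing the speech input; and
determining the candidate media item based on the determined one or more reference phonetic symbol sequences.
9. The method of claim 8, wherein generating the phonetic symbol sequence representing the speech input comprises:
generating the phonetic symbol sequence representing the speech input based on a word sequence obtained from speech-to-text conversion of the speech input.
10. The method of any of claims 8 and 9, wherein the reference phonetic symbol sequences represent word sequences obtained from one or more dictionaries.
11. The method of any of claims 8-10, wherein determining the one or more reference phonetic symbol sequences based on the phonetic symbol sequence representing the speech input comprises:
determining, for each phonetic symbol in the phonetic symbol sequence representing the speech input, whether the phonetic symbol in the phonetic symbol sequence representing the speech input matches an identical reference phonetic symbol.
12. The method of any of claims 8-11, wherein determining the one or more reference phonetic symbol sequences based on the phonetic symbol sequence representing the speech input comprises:
determining, for each phonetic symbol in the phonetic symbol sequence representing the speech input, whether the phonetic symbol in the phonetic symbol sequence representing the speech input corresponds to a different reference phonetic symbol.
13. The method of claim 12, wherein determining whether the phonetic symbol in the phonetic symbol sequence representing the speech input corresponds to a different reference phonetic symbol comprises:
determining whether one or more predetermined operations were performed during generation of the phonetic symbol sequence representing the speech input.
14. The method of claim 13, wherein the one or more predetermined operations include a substitution operation.
15. The method of any of claims 13 and 14, wherein the one or more predetermined operations include a deletion operation.
16. The method of any of claims 13-15, wherein the one or more predetermined operations include an insertion operation.
17. The method of any of claims 8-16, further comprising:
determining, for each phonetic symbol in the phonetic symbol sequence representing the speech input, a probability that the phonetic symbol in the phonetic symbol sequence representing the speech input corresponds to a reference phonetic symbol.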
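18. The method of any of claims 8-17, wherein determining the candidate media item based on the determined one or more reference phonetic symbol sequences comprises:
determining, for each reference phonetic symbol sequence of the one or more reference phonetic symbol sequences, a score indicating a degree of match between the phonetic symbol sequence representing the speech input and each reference phonetic symbol sequence of the one or more reference phonetic symbol sequences; and
determining the candidate media item based on the scores associated with the one or more reference phonetic symbol sequences.
19. The method of claim 18, wherein determining the candidate media item based on the scores associated with the one or more reference phonetic symbol sequences comprises:
ranking the one or more reference phonetic symbol sequences based on their respective scores;
selecting the reference phonetic symbol sequence having the highest score; and
identifying the candidate media item based on the reference phonetic symbol sequence having the highest score.
20. The method of claim 19, further comprising:
determining whether the highest score satisfies a score-threshold condition;
in accordance with a determination that the highest score satisfies the score-threshold condition, forgoing the determination of whether the candidate media item is to be provided to the user.
21. The method of claim 19, wherein identifying the candidate media item based on the reference phonetic symbol sequence having the highest score comprises:
mapping the reference phonetic symbol sequence having the highest score to phonetic representations of media items available in the repository of media items;
identifying the candidate media item based on the mapping.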
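22. The method of any of claims 1-21, wherein determining whether the candidate media item is to be provided to the user comprises:
determining the difference between the phonetic representation of the candidate media item and the phonetic representation of the speech input;
determining whether the difference between the phonetic representation of the candidate media item and the phonetic representation of the speech input satisfies a threshold condition; and
in accordance with a determination that the difference satisfies the threshold condition, determining that the candidate media item is to be provided to the user.
23. The method of claim 22, wherein determining the difference between the phonetic representation of the candidate media item and the phonetic representation of the speech input comprises:
determining the difference based on an expected score associated with a phonetic symbol sequence representing the speech input and a score indicating a degree of match between the phonetic symbol sequence representing the speech input and the phonetic symbol sequence of the candidate media item.
24. The method of claim 23, wherein the expected score associated with the phonetic symbol sequence of the speech input is a confidence score associated with generating the phonetic symbol sequence of the speech input.
25. The method of any of claims 23-24, further comprising:
while determining whether the candidate media item is to be provided to the user, terminating the determination based on the expected score associated with the phonetic symbol sequence of the speech input.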
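26. The method of any of claims 1-25, wherein providing the candidate media item to the user comprises outputting audio associated with the candidate media item.
27. The method of any of claims 1-26, further comprising:
determining additional information associated with the candidate media item; and
providing the additional information associated with the candidate media item to the user.
28. The method of any of claims 1-27, further comprising, before determining whether the speech input includes a user request for a media item:
performing natural language processing of the speech input;
determining whether phonetic matching is needed based on results of the natural language processing;
in accordance with a determination that phonetic matching is needed, initiating the determination of whether the speech input includes a user request for a media item; and
in accordance with a determination that phonetic matching is not needed, providing a media item obtained based on the natural language processing results.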
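29. A non-transitory computer-readable storage medium storing one or more programs, the one or more programs comprising instructions which, when executed by one or more processors of an electronic device, cause the electronic device to:
receive a speech input from a user;
determine whether the speech input includes a user request for a media item;
in accordance with a determination that the speech input includes a user request to obtain a media item, determine a candidate media item from a repository of media items;
determine, based on a difference between a phonetic representation of the candidate media item and a phonetic representation of the speech input, whether the candidate media item is to be provided to the user; and
in accordance with a determination that the candidate media item is to be provided to the user, provide the candidate media item to the user.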
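30. An electronic device, comprising:
one or more processors;
memory; and
one or more programs stored in the memory, the one or more programs including instructions for:
receiving a speech input from a user;
determining whether the speech input includes a user request for a media item;
in accordance with a determination that the speech input includes a user request to obtain a media item,
determining a candidate media item from a repository of media items;
determining, based on a difference between a phonetic representation of the candidate media item and a phonetic representation of the speech input, whether the candidate media item is to be provided to the user; and
in accordance with a determination that the candidate media item is to be provided to the user, providing the candidate media item to the user.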
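31. An electronic device, comprising:
means for receiving a speech input from a user;
means for determining whether the speech input includes a user request for a media item;
means for, in accordance with a determination that the speech input includes a user request to obtain a media item, determining a candidate media item from a repository of media items;
means for determining, based on a difference between a phonetic representation of the candidate media item and a phonetic representation of the speech input, whether the candidate media item is to be provided to the user; and
means for, in accordance with a determination that the candidate media item is to be provided to the user, providing the candidate media item to the user.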
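32. An electronic device, comprising:
one or more processors;
memory; and
one or more programs stored in the memory, the one or more programs including instructions for performing the method of any of claims 1-28.
33. An electronic device, comprising:
means for performing any of the methods of claims 1-28.
34. A non-transitory computer-readable storage medium comprising one or more programs for execution by one or more processors of an electronic device, the one or more programs including instructions which, when executed by the one or more processors, cause the electronic device to perform the method of any of claims 1-28.
35. A system for operating a digital assistant, the system comprising means for performing any of the methods of claims 1-28.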
CN201810072173.2A 2017-05-16 2018-01-25 Methods and systems for phonetic matching in digital assistant services Active CN108874766B (zh)

Applications Claiming Priority (6)

Application Number Priority Date Filing Date Title
US201762506871P 2017-05-16 2017-05-16
US62/506,871 2017-05-16
US201762555311P 2017-09-07 2017-09-07
US62/555,311 2017-09-07
US15/703,013 2017-09-13
US15/703,013 US10403278B2 (en) 2017-05-16 2017-09-13 Methods and systems for phonetic matching in digital assistant services

Publications (2)

Publication Number Publication Date
CN108874766A true CN108874766A (zh) 2018-11-23
CN108874766B CN108874766B (zh) 2022-07-01

Family

ID=61131975

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810072173.2A Active CN108874766B (zh) 2017-05-16 2018-01-25 Methods and systems for phonetic matching in digital assistant services

Country Status (3)

Country Link
US (1) US10403278B2 (zh)
EP (1) EP3404653B1 (zh)
CN (1) CN108874766B (zh)




Family Cites Families (2443)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3859005A (en) 1973-08-13 1975-01-07 Albert L Huebner Erosion reduction in wet turbines
US4826405A (en) 1985-10-15 1989-05-02 Aeroquip Corporation Fan blade fabrication system
US6222525B1 (en) 1992-03-05 2001-04-24 Brad A. Armstrong Image controllers with sheet connected sensors
US8073695B1 (en) 1992-12-09 2011-12-06 Adrea, LLC Electronic book with voice emulation features
US7835989B1 (en) 1992-12-09 2010-11-16 Discovery Communications, Inc. Electronic book alternative delivery systems
US6311157B1 (en) 1992-12-31 2001-10-30 Apple Computer, Inc. Assigning meanings to utterances in a speech recognition system
US6594688B2 (en) 1993-10-01 2003-07-15 Collaboration Properties, Inc. Dedicated echo canceler for a workstation
JP2813728B2 (ja) 1993-11-01 1998-10-22 インターナショナル・ビジネス・マシーンズ・コーポレイション ズーム/パン機能付パーソナル通信機
US5901287A (en) 1996-04-01 1999-05-04 The Sabre Group Inc. Information aggregation and synthesization system
US7113958B1 (en) 1996-08-12 2006-09-26 Battelle Memorial Institute Three-dimensional display of document set
US6199076B1 (en) 1996-10-02 2001-03-06 James Logan Audio program player including a dynamic program selection controller
US7787647B2 (en) 1997-01-13 2010-08-31 Micro Ear Technology, Inc. Portable system for programming hearing aids
US7321783B2 (en) 1997-04-25 2008-01-22 Minerva Industries, Inc. Mobile entertainment and communication device
US6026233A (en) 1997-05-27 2000-02-15 Microsoft Corporation Method and apparatus for presenting and selecting options to modify a programming language statement
US7046813B1 (en) 1997-09-25 2006-05-16 Fumio Denda Auditory sense training method and sound processing method for auditory sense training
US9292111B2 (en) 1998-01-26 2016-03-22 Apple Inc. Gesturing with a multipoint sensing device
US7840912B2 (en) 2006-01-30 2010-11-23 Apple Inc. Multi-touch gesture dictionary
US8479122B2 (en) 2004-07-30 2013-07-02 Apple Inc. Gestures for touch sensitive input devices
US7663607B2 (en) 2004-05-06 2010-02-16 Apple Inc. Multipoint touchscreen
US7614008B2 (en) 2004-07-30 2009-11-03 Apple Inc. Operation of a computer with touch screen interface
KR100595912B1 (ko) 1998-01-26 2006-07-07 웨인 웨스터만 수동 입력 통합 방법 및 장치
US6963871B1 (en) 1998-03-25 2005-11-08 Language Analysis Systems, Inc. System and method for adaptive multi-cultural searching and matching of personal names
US6424983B1 (en) 1998-05-26 2002-07-23 Global Information Research And Technologies, Llc Spelling and grammar checking system
US20070094224A1 (en) 1998-05-28 2007-04-26 Lawrence Au Method and system for determining contextual meaning for network search applications
US7711672B2 (en) 1998-05-28 2010-05-04 Lawrence Au Semantic network methods to disambiguate natural language meaning
US6742003B2 (en) 2001-04-30 2004-05-25 Microsoft Corporation Apparatus and accompanying methods for visualizing clusters of data and hierarchical cluster classifications
ATE383640T1 (de) 1998-10-02 2008-01-15 Ibm Vorrichtung und verfahren zur bereitstellung von netzwerk-koordinierten konversationsdiensten
US6163794A (en) 1998-10-23 2000-12-19 General Magic Network system extensible by users
US6321092B1 (en) 1998-11-03 2001-11-20 Signal Soft Corporation Multiple input data management for wireless location-based applications
US7447637B1 (en) 1998-12-23 2008-11-04 Eastern Investments, Llc System and method of processing speech within a graphic user interface
US7319957B2 (en) 2004-02-11 2008-01-15 Tegic Communications, Inc. Handwriting and voice input with automatic correction
US7679534B2 (en) 1998-12-04 2010-03-16 Tegic Communications, Inc. Contextual prediction of user words and user actions
US7881936B2 (en) 1998-12-04 2011-02-01 Tegic Communications, Inc. Multimodal disambiguation of speech recognition
US7712053B2 (en) 1998-12-04 2010-05-04 Tegic Communications, Inc. Explicit character filtering of ambiguous text entry
US8938688B2 (en) 1998-12-04 2015-01-20 Nuance Communications, Inc. Contextual prediction of user words and user actions
US6842877B2 (en) 1998-12-18 2005-01-11 Tangis Corporation Contextual responses based on automated learning techniques
FR2787902B1 (fr) 1998-12-23 2004-07-30 France Telecom Modele et procede d'implementation d'un agent rationnel dialoguant, serveur et systeme multi-agent pour la mise en oeuvre
GB2388938B (en) 1999-02-22 2004-03-17 Nokia Corp A communication terminal having a predictive editor application
GB9904662D0 (en) 1999-03-01 1999-04-21 Canon Kk Natural language search method and apparatus
US7596606B2 (en) 1999-03-11 2009-09-29 Codignotto John D Message publishing system for publishing messages from identified, authorized senders
US7761296B1 (en) 1999-04-02 2010-07-20 International Business Machines Corporation System and method for rescoring N-best hypotheses of an automatic speech recognition system
US7558381B1 (en) 1999-04-22 2009-07-07 Agere Systems Inc. Retrieval of deleted voice messages in voice messaging system
US7030863B2 (en) 2000-05-26 2006-04-18 America Online, Incorporated Virtual keyboard system with automatic correction
AU5299700A (en) 1999-05-27 2000-12-18 America Online, Inc. Keyboard system with automatic correction
US7821503B2 (en) 2003-04-09 2010-10-26 Tegic Communications, Inc. Touch screen and graphical user interface
AU5451800A (en) 1999-05-28 2000-12-18 Sehda, Inc. Phrase-based dialogue modeling with particular application to creating recognition grammars for voice-controlled user interfaces
US20140098247A1 (en) 1999-06-04 2014-04-10 Ip Holdings, Inc. Home Automation And Smart Home Control Using Mobile Devices And Wireless Enabled Electrical Switches
US8065155B1 (en) 1999-06-10 2011-11-22 Gazdzinski Robert F Adaptive advertising apparatus and methods
US7711565B1 (en) 1999-06-10 2010-05-04 Gazdzinski Robert F “Smart” elevator system and method
AUPQ138199A0 (en) 1999-07-02 1999-07-29 Telstra R & D Management Pty Ltd A search system
US7451177B1 (en) 1999-08-12 2008-11-11 Avintaquin Capital, Llc System for and method of implementing a closed loop response architecture for electronic commerce
US7743188B2 (en) 1999-08-12 2010-06-22 Palm, Inc. Method and apparatus for accessing a contacts database and telephone services
US7925610B2 (en) 1999-09-22 2011-04-12 Google Inc. Determining a meaning of a knowledge item using document-based information
US6789231B1 (en) 1999-10-05 2004-09-07 Microsoft Corporation Method and system for providing alternatives for text derived from stochastic input sources
US7176372B2 (en) 1999-10-19 2007-02-13 Medialab Solutions Llc Interactive digital music recorder and player
KR100812109B1 (ko) 1999-10-19 2008-03-12 소니 일렉트로닉스 인코포레이티드 자연어 인터페이스 제어 시스템
US8392188B1 (en) 1999-11-05 2013-03-05 At&T Intellectual Property Ii, L.P. Method and system for building a phonotactic model for domain independent speech recognition
US7050977B1 (en) 1999-11-12 2006-05-23 Phoenix Solutions, Inc. Speech-enabled server for internet website and method
US9076448B2 (en) 1999-11-12 2015-07-07 Nuance Communications, Inc. Distributed real time speech recognition system
US7392185B2 (en) 1999-11-12 2008-06-24 Phoenix Solutions, Inc. Speech based learning/training system using semantic decoding
US7725307B2 (en) 1999-11-12 2010-05-25 Phoenix Solutions, Inc. Query engine for processing voice based queries including semantic decoding
US7412643B1 (en) 1999-11-23 2008-08-12 International Business Machines Corporation Method and apparatus for linking representation and realization data
US7337389B1 (en) 1999-12-07 2008-02-26 Microsoft Corporation System and method for annotating an electronic document independently of its content
US7434177B1 (en) 1999-12-20 2008-10-07 Apple Inc. User interface for providing consolidation and access
US8271287B1 (en) 2000-01-14 2012-09-18 Alcatel Lucent Voice command remote control system
GB2360106B (en) 2000-02-21 2004-09-22 Ac Properties Bv Ordering playable works
AU2001245447A1 (en) 2000-03-06 2001-09-17 Kanisa Inc. A system and method for providing an intelligent multi-step dialog with a user
US6757362B1 (en) 2000-03-06 2004-06-29 Avaya Technology Corp. Personal virtual assistant
US8024415B2 (en) 2001-03-16 2011-09-20 Microsoft Corporation Priorities generation and management
US8645137B2 (en) 2000-03-16 2014-02-04 Apple Inc. Fast, language-independent method for user authentication by voice
US7187947B1 (en) 2000-03-28 2007-03-06 Affinity Labs, Llc System and method for communicating selected information to an electronic device
US7478129B1 (en) 2000-04-18 2009-01-13 Helen Jeanne Chemtob Method and apparatus for providing group interaction via communications networks
WO2001082111A2 (en) 2000-04-24 2001-11-01 Microsoft Corporation Computer-aided reading system and method with cross-language reading wizard
US6912498B2 (en) 2000-05-02 2005-06-28 Scansoft, Inc. Error correction in speech recognition by correcting text around selected area
US7020718B2 (en) 2000-05-15 2006-03-28 Hewlett-Packard Development Company, L.P. System and method of aggregating discontiguous address ranges into addresses and masks using a plurality of repeating address blocks
FR2809509B1 (fr) 2000-05-26 2003-09-12 Bull Sa Systeme et procede d'internationalisation du contenu de documents a balises dans un systeme informatique
US7080315B1 (en) 2000-06-28 2006-07-18 International Business Machines Corporation Method and apparatus for coupling a visual browser to a voice browser
US7389225B1 (en) 2000-10-18 2008-06-17 Novell, Inc. Method and mechanism for superpositioning state vectors in a semantic abstract
US7672952B2 (en) 2000-07-13 2010-03-02 Novell, Inc. System and method of semantic correlation of rich content
US7139709B2 (en) 2000-07-20 2006-11-21 Microsoft Corporation Middleware layer between speech related applications and engines
JP2002041276A (ja) 2000-07-24 2002-02-08 Sony Corp 対話型操作支援システム及び対話型操作支援方法、並びに記憶媒体
US7853664B1 (en) 2000-07-31 2010-12-14 Landmark Digital Services Llc Method and system for purchasing pre-recorded music
US6915294B1 (en) 2000-08-18 2005-07-05 Firstrain, Inc. Method and apparatus for searching network resources
AU2001283579A1 (en) 2000-08-21 2002-03-04 Yahoo, Inc. Method and system of interpreting and presenting web content using a voice browser
US6836759B1 (en) 2000-08-22 2004-12-28 Microsoft Corporation Method and system of handling the selection of alternates for recognized words
US7287009B1 (en) 2000-09-14 2007-10-23 Raanan Liebermann System and a method for carrying out personal and business transactions
US7688306B2 (en) 2000-10-02 2010-03-30 Apple Inc. Methods and apparatuses for operating a portable device based on an accelerometer
US7218226B2 (en) 2004-03-01 2007-05-15 Apple Inc. Acceleration-based theft detection system for portable electronic devices
US7457750B2 (en) 2000-10-13 2008-11-25 At&T Corp. Systems and methods for dynamic re-configurable speech recognition
US7369993B1 (en) 2000-11-02 2008-05-06 At&T Corp. System and method of pattern recognition in very high-dimensional space
US6915262B2 (en) 2000-11-30 2005-07-05 Telesector Resources Group, Inc. Methods and apparatus for performing speech recognition and using speech recognition results
US7016847B1 (en) 2000-12-08 2006-03-21 Ben Franklin Patent Holdings L.L.C. Open architecture for a voice user interface
US7340446B2 (en) 2000-12-11 2008-03-04 Microsoft Corporation Method and system for query-based management of multiple network resources
US6973427B2 (en) 2000-12-26 2005-12-06 Microsoft Corporation Method for adding phonetic descriptions to a speech recognition lexicon
US7257537B2 (en) 2001-01-12 2007-08-14 International Business Machines Corporation Method and apparatus for performing dialog management in a computer conversational interface
US6677932B1 (en) 2001-01-28 2004-01-13 Finger Works, Inc. System and method for recognizing touch typing under limited tactile feedback conditions
US8213910B2 (en) 2001-02-09 2012-07-03 Harris Technology, Llc Telephone using a connection network for processing data remotely from the telephone
US6570557B1 (en) 2001-02-10 2003-05-27 Finger Works, Inc. Multi-touch system and method for emulating modifier keys via fingertip chords
US7171365B2 (en) 2001-02-16 2007-01-30 International Business Machines Corporation Tracking time using portable recorders and speech recognition
US7290039B1 (en) 2001-02-27 2007-10-30 Microsoft Corporation Intent based processing
US7366979B2 (en) 2001-03-09 2008-04-29 Copernicus Investments, Llc Method and apparatus for annotating a document
EP1490790A2 (en) 2001-03-13 2004-12-29 Intelligate Ltd. Dynamic natural language understanding
WO2002073449A1 (en) 2001-03-14 2002-09-19 At & T Corp. Automated sentence planning in a task classification system
US7209880B1 (en) 2001-03-20 2007-04-24 At&T Corp. Systems and methods for dynamic re-configurable speech recognition
JP2002358092A (ja) 2001-06-01 2002-12-13 Sony Corp 音声合成システム
US20020194003A1 (en) 2001-06-05 2002-12-19 Mozer Todd F. Client-server security system and method
US7328250B2 (en) 2001-06-29 2008-02-05 Nokia, Inc. Apparatus and method for handling electronic mail
US20050134578A1 (en) 2001-07-13 2005-06-23 Universal Electronics Inc. System and methods for interacting with a control environment
US7987151B2 (en) 2001-08-10 2011-07-26 General Dynamics Advanced Info Systems, Inc. Apparatus and method for problem solving using intelligent agents
US7920682B2 (en) 2001-08-21 2011-04-05 Byrne William J Dynamic interactive voice interface
US7774388B1 (en) 2001-08-31 2010-08-10 Margaret Runchey Model of everything with UR-URL combination identity-identifier-addressing-indexing method, means, and apparatus
US8121649B2 (en) 2001-09-05 2012-02-21 Vocera Communications, Inc. Voice-controlled communications system and method having an access device
US7953447B2 (en) 2001-09-05 2011-05-31 Vocera Communications, Inc. Voice-controlled communications system and method using a badge application
US7103848B2 (en) 2001-09-13 2006-09-05 International Business Machines Corporation Handheld electronic book reader with annotation and usage tracking capabilities
US7477240B2 (en) 2001-09-21 2009-01-13 Lenovo Singapore Pte. Ltd. Input apparatus, computer apparatus, method for identifying input object, method for identifying input object in keyboard, and computer program
US7403938B2 (en) 2001-09-24 2008-07-22 Iac Search & Media, Inc. Natural language query processing
US7324947B2 (en) 2001-10-03 2008-01-29 Promptu Systems Corporation Global speech user interface
US7167832B2 (en) 2001-10-15 2007-01-23 At&T Corp. Method for dialog management
US7345671B2 (en) 2001-10-22 2008-03-18 Apple Inc. Method and apparatus for use of rotational user inputs
ITFI20010199A1 (en) 2001-10-22 2003-04-22 Riccardo Vieri System and method for converting text communications into voice and sending them over an internet connection to any telephone apparatus
US7312785B2 (en) 2001-10-22 2007-12-25 Apple Inc. Method and apparatus for accelerated scrolling
US7913185B1 (en) 2001-10-25 2011-03-22 Adobe Systems Incorporated Graphical insertion of JavaScript pop-up menus
US7359671B2 (en) 2001-10-30 2008-04-15 Unwired Technology Llc Multiple channel wireless communication system
US20030101054A1 (en) 2001-11-27 2003-05-29 Ncc, Llc Integrated system and method for electronic speech recognition and transcription
US7447624B2 (en) 2001-11-27 2008-11-04 Sun Microsystems, Inc. Generation of localized software applications
US7483832B2 (en) 2001-12-10 2009-01-27 At&T Intellectual Property I, L.P. Method and system for customizing voice translation of text to speech
US7490039B1 (en) 2001-12-13 2009-02-10 Cisco Technology, Inc. Text to speech system and method having interactive spelling capabilities
US7103542B2 (en) 2001-12-14 2006-09-05 Ben Franklin Patent Holding Llc Automatically improving a voice recognition system
WO2003065179A2 (en) 2002-02-01 2003-08-07 John Fairweather A system and method for mining data
US8374879B2 (en) 2002-02-04 2013-02-12 Microsoft Corporation Systems and methods for managing interactions from multiple speech-enabled applications
US7272377B2 (en) 2002-02-07 2007-09-18 At&T Corp. System and method of ubiquitous language translation for wireless devices
US8249880B2 (en) 2002-02-14 2012-08-21 Intellisist, Inc. Real-time display of system instructions
WO2003071393A2 (en) 2002-02-15 2003-08-28 Mathsoft Engineering And Education, Inc. Linguistic support for a recognizer of mathematical expressions
US7009663B2 (en) 2003-12-17 2006-03-07 Planar Systems, Inc. Integrated optical light sensitive active matrix liquid crystal display
US7221287B2 (en) 2002-03-05 2007-05-22 Triangle Software Llc Three-dimensional traffic report
JP4039086B2 (en) 2002-03-05 2008-01-30 Sony Corporation Information processing apparatus and information processing method, information processing system, recording medium, and program
US7360158B1 (en) 2002-03-28 2008-04-15 At&T Mobility Ii Llc Interactive education tool
JP2003295882A (en) 2002-04-02 2003-10-15 Canon Inc Text structure for speech synthesis, speech synthesis method, speech synthesis apparatus, and computer program therefor
US7707221B1 (en) 2002-04-03 2010-04-27 Yahoo! Inc. Associating and linking compact disc metadata
US7359493B1 (en) 2002-04-11 2008-04-15 Aol Llc, A Delaware Limited Liability Company Bulk voicemail
US7043474B2 (en) 2002-04-15 2006-05-09 International Business Machines Corporation System and method for measuring image similarity based on semantic meaning
US7073193B2 (en) 2002-04-16 2006-07-04 Microsoft Corporation Media content descriptions
US7869998B1 (en) 2002-04-23 2011-01-11 At&T Intellectual Property Ii, L.P. Voice-enabled dialog system
US8135115B1 (en) 2006-11-22 2012-03-13 Securus Technologies, Inc. System and method for multi-channel recording
US7490034B2 (en) 2002-04-30 2009-02-10 Microsoft Corporation Lexicon with sectionalized data and method of using the same
US7221937B2 (en) 2002-05-06 2007-05-22 Research In Motion Limited Event reminder method
US7380203B2 (en) 2002-05-14 2008-05-27 Microsoft Corporation Natural input recognition tool
US7436947B2 (en) 2002-05-14 2008-10-14 Avaya Inc. Method and apparatus for automatic notification and response based on communication flow expressions
US7493560B1 (en) 2002-05-20 2009-02-17 Oracle International Corporation Definition links in online documentation
US8611919B2 (en) 2002-05-23 2013-12-17 Wounder Gmbh., Llc System, method, and computer program product for providing location based services and mobile e-commerce
US7546382B2 (en) 2002-05-28 2009-06-09 International Business Machines Corporation Methods and systems for authoring of mixed-initiative multi-modal interactions and related browsing mechanisms
US7398209B2 (en) 2002-06-03 2008-07-08 Voicebox Technologies, Inc. Systems and methods for responding to natural language speech utterance
US7680649B2 (en) 2002-06-17 2010-03-16 International Business Machines Corporation System, method, program product, and networking use for recognizing words and their parts of speech in one or more natural languages
US8219608B2 (en) 2002-06-20 2012-07-10 Koninklijke Philips Electronics N.V. Scalable architecture for web services
US7568151B2 (en) 2002-06-27 2009-07-28 Microsoft Corporation Notification of activity around documents
US7286987B2 (en) 2002-06-28 2007-10-23 Conceptual Speech Llc Multi-phoneme streamer and knowledge representation speech recognition system and method
US7079713B2 (en) 2002-06-28 2006-07-18 Microsoft Corporation Method and system for displaying and linking ink objects with recognized text and objects
US7656393B2 (en) 2005-03-04 2010-02-02 Apple Inc. Electronic device having display and surrounding touch sensitive bezel for user interface and control
US7693720B2 (en) 2002-07-15 2010-04-06 Voicebox Technologies, Inc. Mobile systems and methods for responding to natural language speech utterance
US6876727B2 (en) 2002-07-24 2005-04-05 Sbc Properties, Lp Voice over IP method for developing interactive voice response system
US7535997B1 (en) 2002-07-29 2009-05-19 At&T Intellectual Property I, L.P. Systems and methods for silent message delivery
US7027842B2 (en) 2002-09-24 2006-04-11 Bellsouth Intellectual Property Corporation Apparatus and method for providing hands-free operation of a device
US7328155B2 (en) 2002-09-25 2008-02-05 Toyota Infotechnology Center Co., Ltd. Method and system for speech recognition using grammar weighted based upon location information
US7467087B1 (en) 2002-10-10 2008-12-16 Gillick Laurence S Training and using pronunciation guessers in speech recognition
US7373612B2 (en) 2002-10-21 2008-05-13 Battelle Memorial Institute Multidimensional structured data visualization method and apparatus, text visualization method and apparatus, method and apparatus for visualizing and graphically navigating the world wide web, method and apparatus for visualizing hierarchies
US7386799B1 (en) 2002-11-21 2008-06-10 Forterra Systems, Inc. Cinematic techniques in avatar-centric communication during a multi-user online simulation
AU2003293071A1 (en) 2002-11-22 2004-06-18 Roy Rosser Autonomous response engine
US7298930B1 (en) 2002-11-29 2007-11-20 Ricoh Company, Ltd. Multimodal access of meeting recordings
US7684985B2 (en) 2002-12-10 2010-03-23 Richard Dominach Techniques for disambiguating speech input using multimodal interfaces
US7386449B2 (en) 2002-12-11 2008-06-10 Voice Enabling Systems Technology Inc. Knowledge-based flexible natural speech dialogue system
US7353139B1 (en) 2002-12-13 2008-04-01 Garmin Ltd. Portable apparatus with performance monitoring and audio entertainment features
FR2848688A1 (en) 2002-12-17 2004-06-18 France Telecom Identification of the language of a text
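JP3974511B2 (en) 2002-12-19 2007-09-12 International Business Machines Corporation Computer system for generating a data structure for information retrieval, method therefor, computer-executable program for generating a data structure for information retrieval, computer-readable storage medium storing a computer-executable program for generating a data structure for information retrieval, information retrieval system, and graphical user interface system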
US7797331B2 (en) 2002-12-20 2010-09-14 Nokia Corporation Method and device for organizing user provided information with meta-information
US8661112B2 (en) 2002-12-20 2014-02-25 Nuance Communications, Inc. Customized interactive voice response menus
JP2004205605A (en) 2002-12-24 2004-07-22 Yamaha Corp Speech and music reproduction apparatus and sequence data format
US7703091B1 (en) 2002-12-31 2010-04-20 Emc Corporation Methods and apparatus for installing agents in a managed network
US7003464B2 (en) 2003-01-09 2006-02-21 Motorola, Inc. Dialog recognition and control in a voice browser
US7593868B2 (en) 2003-01-29 2009-09-22 Innovation Interactive Llc Systems and methods for providing contextual advertising information via a communication network
US7617094B2 (en) 2003-02-28 2009-11-10 Palo Alto Research Center Incorporated Methods, apparatus, and products for identifying a conversation
WO2004079720A1 (en) 2003-03-01 2004-09-16 Robert E Coifman Method and apparatus for improving the transcription accuracy of speech recognition software
US7809565B2 (en) 2003-03-01 2010-10-05 Coifman Robert E Method and apparatus for improving the transcription accuracy of speech recognition software
US7805299B2 (en) 2004-03-01 2010-09-28 Coifman Robert E Method and apparatus for improving the transcription accuracy of speech recognition software
US7529671B2 (en) 2003-03-04 2009-05-05 Microsoft Corporation Block synchronous decoding
JP4828091B2 (en) 2003-03-05 2011-11-30 Hewlett-Packard Company Clustering method, program, and apparatus
US8064753B2 (en) 2003-03-05 2011-11-22 Freeman Alan D Multi-feature media article and method for manufacture of same
US7835504B1 (en) 2003-03-16 2010-11-16 Palm, Inc. Telephone number parsing and linking
US8244712B2 (en) 2003-03-18 2012-08-14 Apple Inc. Localized viewing of file system names
US7613797B2 (en) 2003-03-19 2009-11-03 Unisys Corporation Remote discovery and system architecture
US7496498B2 (en) 2003-03-24 2009-02-24 Microsoft Corporation Front-end architecture for a multi-lingual text-to-speech system
US8745541B2 (en) 2003-03-25 2014-06-03 Microsoft Corporation Architecture for controlling a computer using hand gestures
US7146319B2 (en) 2003-03-31 2006-12-05 Novauris Technologies Ltd. Phonetically based speech recognition system and method
US7394947B2 (en) 2003-04-08 2008-07-01 The Penn State Research Foundation System and method for automatic linguistic indexing of images by a statistical modeling approach
US7941009B2 (en) 2003-04-08 2011-05-10 The Penn State Research Foundation Real-time computerized annotation of pictures
US7711550B1 (en) 2003-04-29 2010-05-04 Microsoft Corporation Methods and system for recognizing names in a computer-generated document and for providing helpful actions associated with recognized names
US7669134B1 (en) 2003-05-02 2010-02-23 Apple Inc. Method and apparatus for displaying information during an instant messaging session
US7421393B1 (en) 2004-03-01 2008-09-02 At&T Corp. System for developing a dialog manager using modular spoken-dialog components
US7407384B2 (en) 2003-05-29 2008-08-05 Robert Bosch Gmbh System, method and device for language education through a voice portal server
US7493251B2 (en) 2003-05-30 2009-02-17 Microsoft Corporation Using source-channel models for word segmentation
US7496230B2 (en) 2003-06-05 2009-02-24 International Business Machines Corporation System and method for automatic natural language translation of embedded text regions in images during information transfer
WO2004110099A2 (en) 2003-06-06 2004-12-16 Gn Resound A/S A hearing aid wireless network
US7720683B1 (en) 2003-06-13 2010-05-18 Sensory, Inc. Method and apparatus of specifying and performing speech recognition operations
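KR100634496B1 (en) 2003-06-16 2006-10-13 Samsung Electronics Co., Ltd. Method and apparatus for recognizing input language mode, and method and apparatus for automatically switching input language mode using the same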
US7559026B2 (en) 2003-06-20 2009-07-07 Apple Inc. Video conferencing system having focus control
US7827047B2 (en) 2003-06-24 2010-11-02 At&T Intellectual Property I, L.P. Methods and systems for assisting scheduling with automation
US7757182B2 (en) 2003-06-25 2010-07-13 Microsoft Corporation Taskbar media player
US7363586B1 (en) 2003-06-26 2008-04-22 Microsoft Corporation Component localization
US7634732B1 (en) 2003-06-26 2009-12-15 Microsoft Corporation Persona menu
US7739588B2 (en) 2003-06-27 2010-06-15 Microsoft Corporation Leveraging markup language data for semantically labeling text strings and data and for providing actions based on semantically labeled text strings and data
US7580551B1 (en) 2003-06-30 2009-08-25 The Research Foundation Of State University Of Ny Method and apparatus for analyzing and/or comparing handwritten and/or biometric samples
EP1639441A1 (en) 2003-07-01 2006-03-29 Nokia Corporation Method and device for operating a user-input area on an electronic display device
US20080097937A1 (en) 2003-07-10 2008-04-24 Ali Hadjarian Distributed method for integrating data mining and text categorization techniques
US8373660B2 (en) 2003-07-14 2013-02-12 Matt Pallakoff System and method for a portable multimedia client
US7757173B2 (en) 2003-07-18 2010-07-13 Apple Inc. Voice menu system
KR100811232B1 (en) 2003-07-18 2008-03-07 LG Electronics Inc. Turn-by-turn navigation system and next guidance method
US20080101584A1 (en) 2003-08-01 2008-05-01 Mitel Networks Corporation Method of providing context aware announcements
US7386438B1 (en) 2003-08-04 2008-06-10 Google Inc. Identifying language attributes through probabilistic analysis
US7386110B2 (en) 2003-08-14 2008-06-10 Hewlett-Packard Development Company, L.P. Directory assistance utilizing a personalized cache
US8311835B2 (en) 2003-08-29 2012-11-13 Microsoft Corporation Assisted multi-modal dialogue
US7475010B2 (en) 2003-09-03 2009-01-06 Lingospot, Inc. Adaptive and scalable method for resolving natural language ambiguities
US20050054381A1 (en) 2003-09-05 2005-03-10 Samsung Electronics Co., Ltd. Proactive user interface
US7539619B1 (en) 2003-09-05 2009-05-26 Spoken Translation Ind. Speech-enabled language translation system and method enabling interactive user supervision of translation and speech recognition accuracy
US7475015B2 (en) 2003-09-05 2009-01-06 International Business Machines Corporation Semantic language modeling and confidence measurement
US7337108B2 (en) 2003-09-10 2008-02-26 Microsoft Corporation System and method for providing high-quality stretching and compression of a digital audio signal
JP4663223B2 (en) 2003-09-11 2011-04-06 Panasonic Corporation Arithmetic processing unit
AU2003260819A1 (en) 2003-09-12 2005-04-06 Nokia Corporation Method and device for handling missed calls in a mobile communications environment
US7418392B1 (en) 2003-09-25 2008-08-26 Sensory, Inc. System and method for controlling the operation of a device by voice commands
US7460652B2 (en) 2003-09-26 2008-12-02 At&T Intellectual Property I, L.P. VoiceXML and rule engine based switchboard for interactive voice response (IVR) services
US7386440B2 (en) 2003-10-01 2008-06-10 International Business Machines Corporation Method, system, and apparatus for natural language mixed-initiative dialogue processing
US20060008256A1 (en) 2003-10-01 2006-01-12 Khedouri Robert K Audio visual player apparatus and system and method of content distribution using the same
WO2005034086A1 (en) 2003-10-03 2005-04-14 Asahi Kasei Kabushiki Kaisha Data processing apparatus and data processing apparatus control program
US7318020B1 (en) 2003-10-08 2008-01-08 Microsoft Corporation Methods and systems for external localization
US7620894B1 (en) 2003-10-08 2009-11-17 Apple Inc. Automatic, dynamic user interface configuration
US7383170B2 (en) 2003-10-10 2008-06-03 At&T Knowledge Ventures, L.P. System and method for analyzing automatic speech recognition performance data
US7487092B2 (en) 2003-10-17 2009-02-03 International Business Machines Corporation Interactive debugging and tuning method for CTTS voice building
US7643990B1 (en) 2003-10-23 2010-01-05 Apple Inc. Global boundary-centric feature extraction and associated discontinuity metrics
US7409347B1 (en) 2003-10-23 2008-08-05 Apple Inc. Data-driven global boundary optimization
US7669177B2 (en) 2003-10-24 2010-02-23 Microsoft Corporation System and method for preference application installation and execution
US7292726B2 (en) 2003-11-10 2007-11-06 Microsoft Corporation Recognition of electronic ink with late strokes
WO2005048239A1 (en) 2003-11-12 2005-05-26 Honda Motor Co., Ltd. Speech recognition device
US7584092B2 (en) 2004-11-15 2009-09-01 Microsoft Corporation Unsupervised learning of paraphrase/translation alternations and selective application thereof
US7561069B2 (en) 2003-11-12 2009-07-14 Legalview Assets, Limited Notification systems and methods enabling a response to change particulars of delivery or pickup
US7841533B2 (en) 2003-11-13 2010-11-30 Metrologic Instruments, Inc. Method of capturing and processing digital images of an object within the field of view (FOV) of a hand-supportable digitial image capture and processing system
US7779356B2 (en) 2003-11-26 2010-08-17 Griesmer James P Enhanced data tip system and method
US20090018918A1 (en) 2004-11-04 2009-01-15 Manyworlds Inc. Influence-based Social Network Advertising
WO2005062293A1 (en) 2003-12-05 2005-07-07 Kabushikikaisha Kenwood Audio device control device, audio device control method, and program
US7689412B2 (en) 2003-12-05 2010-03-30 Microsoft Corporation Synonymous collocation extraction using translation information
US7412388B2 (en) 2003-12-12 2008-08-12 International Business Machines Corporation Language-enhanced programming tools
US7427024B1 (en) 2003-12-17 2008-09-23 Gazdzinski Mark J Chattel management apparatus and methods
US7356748B2 (en) 2003-12-19 2008-04-08 Telefonaktiebolaget Lm Ericsson (Publ) Partial spectral loss concealment in transform codecs
DE602004025616D1 (en) 2003-12-26 2010-04-01 Kenwood Corp Device control device, method, and program
US7404143B2 (en) 2003-12-26 2008-07-22 Microsoft Corporation Server-based single roundtrip spell checking
US7401300B2 (en) 2004-01-09 2008-07-15 Nokia Corporation Adaptive user interface input device
US7552055B2 (en) 2004-01-10 2009-06-23 Microsoft Corporation Dialog component re-use in recognition systems
US8160883B2 (en) 2004-01-10 2012-04-17 Microsoft Corporation Focus tracking in dialogs
US7660715B1 (en) 2004-01-12 2010-02-09 Avaya Inc. Transparent monitoring and intervention to improve automatic adaptation of speech models
US8281339B1 (en) 2004-01-12 2012-10-02 United Video Properties, Inc. Customizable flip and browse overlays in an interactive television system
US7359851B2 (en) 2004-01-14 2008-04-15 Clairvoyance Corporation Method of identifying the language of a textual passage using short word and/or n-gram comparisons
US7707039B2 (en) 2004-02-15 2010-04-27 Exbiblio B.V. Automatic modification of web pages
DE602004017955D1 (en) 2004-01-29 2009-01-08 Daimler Ag Method and system for a speech dialogue interface
CA2640927C (en) 2004-01-30 2012-01-17 Research In Motion Limited Contact query data system and method
US7610258B2 (en) 2004-01-30 2009-10-27 Microsoft Corporation System and method for exposing a child list
US7542971B2 (en) 2004-02-02 2009-06-02 Fuji Xerox Co., Ltd. Systems and methods for collaborative note-taking
US7596499B2 (en) 2004-02-02 2009-09-29 Panasonic Corporation Multilingual text-to-speech system with limited resources
JP4262113B2 (en) 2004-02-13 2009-05-13 Citizen Electronics Co., Ltd. Backlight
US7721226B2 (en) 2004-02-18 2010-05-18 Microsoft Corporation Glom widget
US7433876B2 (en) 2004-02-23 2008-10-07 Radar Networks, Inc. Semantic web portal and platform
US8654936B1 (en) 2004-02-24 2014-02-18 At&T Intellectual Property I, L.P. Home control, monitoring and communication system using remote voice commands
KR100462292B1 (en) 2004-02-26 2004-12-17 NHN Corporation Method and system for providing a search result list reflecting importance information
US20050195094A1 (en) 2004-03-05 2005-09-08 White Russell W. System and method for utilizing a bicycle computer to monitor athletic performance
US7693715B2 (en) 2004-03-10 2010-04-06 Microsoft Corporation Generating large units of graphonemes with mutual information criterion for letter to sound conversion
US7711129B2 (en) 2004-03-11 2010-05-04 Apple Inc. Method and system for approximating graphic equalizers using dynamic filter order reduction
US7983835B2 (en) 2004-11-03 2011-07-19 Lagassey Paul J Modular intelligent transportation system
US7478033B2 (en) 2004-03-16 2009-01-13 Google Inc. Systems and methods for translating Chinese pinyin to Chinese characters
US7409337B1 (en) 2004-03-30 2008-08-05 Microsoft Corporation Natural language processing interface
US7716216B1 (en) 2004-03-31 2010-05-11 Google Inc. Document ranking based on semantic distance between terms in a document
US8713418B2 (en) 2004-04-12 2014-04-29 Google Inc. Adding value to a rendered document
US7496512B2 (en) 2004-04-13 2009-02-24 Microsoft Corporation Refining of segmental boundaries in speech waveforms using contextual-dependent models
US7623119B2 (en) 2004-04-21 2009-11-24 Nokia Corporation Graphical functions by gestures
EP1738291A1 (en) 2004-04-23 2007-01-03 Novauris Technologies Limited Tree index based method for accessing automatic directory
US7657844B2 (en) 2004-04-30 2010-02-02 International Business Machines Corporation Providing accessibility compliance within advanced componentry
JP4296598B2 (en) 2004-04-30 2009-07-15 Casio Computer Co., Ltd. Communication terminal device and communication terminal processing program
US7447665B2 (en) 2004-05-10 2008-11-04 Kinetx, Inc. System and method of self-learning conceptual mapping to organize and interpret data
US20080126491A1 (en) 2004-05-14 2008-05-29 Koninklijke Philips Electronics, N.V. Method for Transmitting Messages from a Sender to a Recipient, a Messaging System and Message Converting Means
US7366461B1 (en) 2004-05-17 2008-04-29 Wendell Brown Method and apparatus for improving the quality of a recorded broadcast audio program
US7778830B2 (en) 2004-05-19 2010-08-17 International Business Machines Corporation Training speaker-dependent, phrase-based speech grammars using an unsupervised automated technique
US8130929B2 (en) 2004-05-25 2012-03-06 Galileo Processing, Inc. Methods for obtaining complex data in an interactive voice response system
CN100524457C (en) 2004-05-31 2009-08-05 International Business Machines Corporation Apparatus and method for text-to-speech conversion and corpus adjustment
US7873149B2 (en) 2004-06-01 2011-01-18 Verizon Business Global Llc Systems and methods for gathering information
US8095364B2 (en) 2004-06-02 2012-01-10 Tegic Communications, Inc. Multimodal disambiguation of speech recognition
US8224649B2 (en) 2004-06-02 2012-07-17 International Business Machines Corporation Method and apparatus for remote command, control and diagnostics of systems using conversational or audio interface
US7673340B1 (en) 2004-06-02 2010-03-02 Clickfox Llc System and method for analyzing system user behavior
US8060138B2 (en) 2004-06-02 2011-11-15 Research In Motion Limited Handheld electronic device and associated method employing a multiple-axis input device and providing a learning function in a text disambiguation environment
US20090187402A1 (en) 2004-06-04 2009-07-23 Koninklijke Philips Electronics, N.V. Performance Prediction For An Interactive Speech Recognition System
US7472065B2 (en) 2004-06-04 2008-12-30 International Business Machines Corporation Generating paralinguistic phenomena via markup in text-to-speech synthesis
CA2573002A1 (en) 2004-06-04 2005-12-22 Benjamin Firooz Ghassabian Systems to enhance data entry in mobile and fixed environment
US7613881B2 (en) 2004-06-08 2009-11-03 Dartdevices Interop Corporation Method and system for configuring and using virtual pointers to access one or more independent address spaces
US20090018829A1 (en) 2004-06-08 2009-01-15 Metaphor Solutions, Inc. Speech Recognition Dialog Management
US7565104B1 (en) 2004-06-16 2009-07-21 Wendell Brown Broadcast audio program guide
US8321786B2 (en) 2004-06-17 2012-11-27 Apple Inc. Routine and interface for correcting electronic text
GB0413743D0 (en) 2004-06-19 2004-07-21 Ibm Method and system for approximate string matching
US20070214133A1 (en) 2004-06-23 2007-09-13 Edo Liberty Methods for filtering data and filling in missing data using nonlinear inference
US8099395B2 (en) 2004-06-24 2012-01-17 Oracle America, Inc. System level identity object
JP4416643B2 (en) 2004-06-29 2010-02-17 Canon Inc. Multimodal input method
US7505795B1 (en) 2004-07-07 2009-03-17 Advanced Micro Devices, Inc. Power save management with customized range for user configuration and tuning value based upon recent usage
US7823123B2 (en) 2004-07-13 2010-10-26 The Mitre Corporation Semantic system for integrating software components
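JP4652737B2 (en) 2004-07-14 2011-03-16 International Business Machines Corporation Word boundary probability estimation apparatus and method, probabilistic language model construction apparatus and method, kana-kanji conversion apparatus and method, and unknown word model construction method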
US8036893B2 (en) 2004-07-22 2011-10-11 Nuance Communications, Inc. Method and system for identifying and correcting accent-induced speech recognition difficulties
US7936861B2 (en) 2004-07-23 2011-05-03 At&T Intellectual Property I, L.P. Announcement system and method of use
US7603349B1 (en) 2004-07-29 2009-10-13 Yahoo! Inc. User interfaces for search systems using in-line contextual queries
US8381135B2 (en) 2004-07-30 2013-02-19 Apple Inc. Proximity detector in handheld device
US7725318B2 (en) 2004-07-30 2010-05-25 Nice Systems Inc. System and method for improving the accuracy of audio searching
US7653883B2 (en) 2004-07-30 2010-01-26 Apple Inc. Proximity detector in handheld device
US7831601B2 (en) 2004-08-04 2010-11-09 International Business Machines Corporation Method for automatically searching for documents related to calendar and email entries
US7508324B2 (en) 2004-08-06 2009-03-24 Daniel Suraqui Finger activated reduced keyboard and a method for performing text input
US7728821B2 (en) 2004-08-06 2010-06-01 Touchtable, Inc. Touch detecting interactive display
US7724242B2 (en) 2004-08-06 2010-05-25 Touchtable, Inc. Touch driven method and apparatus to integrate and display multiple image layers forming alternate depictions of same subject matter
JP4563106B2 (en) 2004-08-09 2010-10-13 Alpine Electronics, Inc. In-vehicle device and audio output method thereof
US7869999B2 (en) 2004-08-11 2011-01-11 Nuance Communications, Inc. Systems and methods for selecting from multiple phonectic transcriptions for text-to-speech synthesis
US8117542B2 (en) 2004-08-16 2012-02-14 Microsoft Corporation User interface for displaying selectable software functionality controls that are contextually relevant to a selected object
US7895531B2 (en) 2004-08-16 2011-02-22 Microsoft Corporation Floating command object
US7912699B1 (en) 2004-08-23 2011-03-22 At&T Intellectual Property Ii, L.P. System and method of lattice-based search for spoken utterance retrieval
US20060048055A1 (en) 2004-08-25 2006-03-02 Jun Wu Fault-tolerant romanized input method for non-roman characters
US7853574B2 (en) 2004-08-26 2010-12-14 International Business Machines Corporation Method of generating a context-inferenced search query and of sorting a result of the query
US7477238B2 (en) 2004-08-31 2009-01-13 Research In Motion Limited Handheld electronic device with text disambiguation
RU2007114059A (en) 2004-09-14 2008-10-27 Intellectual Property Bank Corp. (JP) Drawing device for a document relationship diagram that arranges documents in chronological order
US20060059424A1 (en) 2004-09-15 2006-03-16 Petri Jonah W Real-time data localization
US7319385B2 (en) 2004-09-17 2008-01-15 Nokia Corporation Sensor data sharing
US7447360B2 (en) 2004-09-22 2008-11-04 Microsoft Corporation Analyzing tabular structures in expression recognition
US7716056B2 (en) 2004-09-27 2010-05-11 Robert Bosch Corporation Method and system for interactive conversational dialogue for cognitively overloaded device users
KR100754385B1 (en) 2004-09-30 2007-08-31 Samsung Electronics Co., Ltd. Apparatus and method for localization, tracking, and separation using audio/video sensors
US7936863B2 (en) 2004-09-30 2011-05-03 Avaya Inc. Method and apparatus for providing communication tasks in a workflow
US8107401B2 (en) 2004-09-30 2012-01-31 Avaya Inc. Method and apparatus for providing a virtual assistant to a communication participant
US7603381B2 (en) 2004-09-30 2009-10-13 Microsoft Corporation Contextual action publishing
US8744852B1 (en) 2004-10-01 2014-06-03 Apple Inc. Spoken interfaces
US7756871B2 (en) 2004-10-13 2010-07-13 Hewlett-Packard Development Company, L.P. Article extraction
US7543232B2 (en) 2004-10-19 2009-06-02 International Business Machines Corporation Intelligent web based help system
US7693719B2 (en) 2004-10-29 2010-04-06 Microsoft Corporation Providing personalized voice font for text-to-speech applications
KR101087483B1 (en) 2004-11-04 2011-11-28 LG Electronics Inc. Method and apparatus for controlling output of guidance voice signals in a navigation system
US7735012B2 (en) 2004-11-04 2010-06-08 Apple Inc. Audio user interface for computing devices
US7885844B1 (en) 2004-11-16 2011-02-08 Amazon Technologies, Inc. Automatically generating task recommendations for human task performers
JP4604178B2 (en) 2004-11-22 2010-12-22 National Institute of Advanced Industrial Science and Technology Speech recognition apparatus, method, and program
BRPI0419230B1 (en) 2004-11-23 2018-09-25 Nokia Corp Portable radio communication device and method for processing a message received from the mobile cellular network
US7702500B2 (en) 2004-11-24 2010-04-20 Blaedow Karen R Method and apparatus for determining the meaning of natural language
US7376645B2 (en) 2004-11-29 2008-05-20 The Intellection Group, Inc. Multimodal natural language query system and architecture for processing voice and proximity-based queries
US8498865B1 (en) 2004-11-30 2013-07-30 Vocera Communications, Inc. Speech recognition system and method using group call statistics
JP4297442B2 (en) 2004-11-30 2009-07-15 Fujitsu Limited Handwritten information input device
US20080255837A1 (en) 2004-11-30 2008-10-16 Jonathan Kahn Method for locating an audio segment within an audio file
US7630900B1 (en) 2004-12-01 2009-12-08 Tellme Networks, Inc. Method and system for selecting grammars based on geographic information associated with a caller
US8214214B2 (en) 2004-12-03 2012-07-03 Phoenix Solutions, Inc. Emotion detection device and method for use in distributed systems
US7636657B2 (en) 2004-12-09 2009-12-22 Microsoft Corporation Method and apparatus for automatic grammar generation from data entries
US7853445B2 (en) 2004-12-10 2010-12-14 Deception Discovery Technologies LLC Method and system for the automatic recognition of deceptive language
US20080004881A1 (en) 2004-12-22 2008-01-03 David Attwater Turn-taking model
US8275618B2 (en) 2004-12-22 2012-09-25 Nuance Communications, Inc. Mobile dictation correction user interface
US7483692B2 (en) 2004-12-28 2009-01-27 Sony Ericsson Mobile Communications Ab System and method of predicting user input to a mobile terminal
US7444589B2 (en) 2004-12-30 2008-10-28 At&T Intellectual Property I, L.P. Automated patent office documentation
US7987244B1 (en) 2004-12-30 2011-07-26 At&T Intellectual Property Ii, L.P. Network repository for voice fonts
US7818672B2 (en) 2004-12-30 2010-10-19 Microsoft Corporation Floating action buttons
US8478589B2 (en) 2005-01-05 2013-07-02 At&T Intellectual Property Ii, L.P. Library of existing spoken dialog data for use in generating new natural language spoken dialog systems
US7536565B2 (en) 2005-01-07 2009-05-19 Apple Inc. Techniques for improved playlist processing on media devices
US7363227B2 (en) 2005-01-10 2008-04-22 Herman Miller, Inc. Disruption of speech understanding by adding a privacy sound thereto
US8069422B2 (en) 2005-01-10 2011-11-29 Samsung Electronics, Co., Ltd. Contextual task recommendation system and method for determining user's context and suggesting tasks
US7418389B2 (en) 2005-01-11 2008-08-26 Microsoft Corporation Defining atom units between phone and syllable for TTS systems
WO2006076516A2 (en) 2005-01-12 2006-07-20 Howard Friedman Customizable delivery of audio information
US7337170B2 (en) 2005-01-18 2008-02-26 International Business Machines Corporation System and method for planning and generating queries for multi-dimensional analysis using domain models and data federation
US8150872B2 (en) 2005-01-24 2012-04-03 The Intellection Group, Inc. Multimodal natural language query system for processing and analyzing voice and proximity-based queries
US7873654B2 (en) 2005-01-24 2011-01-18 The Intellection Group, Inc. Multimodal natural language query system for processing and analyzing voice and proximity-based queries
US8228299B1 (en) 2005-01-27 2012-07-24 Singleton Technology, Llc Transaction automation and archival system using electronic contract and disclosure units
US7508373B2 (en) 2005-01-28 2009-03-24 Microsoft Corporation Form factor and input method for language input
US7734569B2 (en) 2005-02-03 2010-06-08 Strands, Inc. Recommender system for identifying a new set of media items responsive to an input set of media items and knowledge base metrics
GB0502259D0 (en) 2005-02-03 2005-03-09 British Telecomm Document searching tool and method
US8200495B2 (en) 2005-02-04 2012-06-12 Vocollect, Inc. Methods and systems for considering information about an expected response when performing speech recognition
US7895039B2 (en) 2005-02-04 2011-02-22 Vocollect, Inc. Methods and systems for optimizing model adaptation for a speech recognition system
US7813481B1 (en) 2005-02-18 2010-10-12 At&T Mobility Ii Llc Conversation recording with real-time notification for users of communication terminals
JP4911028B2 (ja) 2005-02-24 2012-04-04 富士ゼロックス株式会社 Word translation device, translation method, and translation program
US7634413B1 (en) 2005-02-25 2009-12-15 Apple Inc. Bitrate constrained variable bitrate audio encoding
US7412389B2 (en) 2005-03-02 2008-08-12 Yang George L Document animation system
WO2005057425A2 (en) 2005-03-07 2005-06-23 Linguatec Sprachtechnologien Gmbh Hybrid machine translation system
US7676026B1 (en) 2005-03-08 2010-03-09 Baxtech Asia Pte Ltd Desktop telephony system
US7814514B2 (en) 2005-03-10 2010-10-12 Panasonic Corporation Digital broadcast receiving apparatus configured for use with copy control information
JP4404211B2 (ja) 2005-03-14 2010-01-27 富士ゼロックス株式会社 Multilingual translation memory, translation method, and translation program
US7706510B2 (en) 2005-03-16 2010-04-27 Research In Motion System and method for personalized text-to-voice synthesis
US20060218506A1 (en) 2005-03-23 2006-09-28 Edward Srenger Adaptive menu for a user interface
US7565380B1 (en) 2005-03-24 2009-07-21 Netlogic Microsystems, Inc. Memory optimized pattern searching
US7925525B2 (en) 2005-03-25 2011-04-12 Microsoft Corporation Smart reminders
US7810050B2 (en) 2005-03-28 2010-10-05 Panasonic Corporation User interface system
US8219398B2 (en) 2005-03-28 2012-07-10 Lessac Technologies, Inc. Computerized speech synthesizer for synthesizing speech from text
US7721301B2 (en) 2005-03-31 2010-05-18 Microsoft Corporation Processing files from a mobile device using voice commands
US7664558B2 (en) 2005-04-01 2010-02-16 Apple Inc. Efficient techniques for modifying audio playback rates
GB2424969A (en) 2005-04-04 2006-10-11 Messagelabs Ltd Training an anti-spam filter
US20090058860A1 (en) 2005-04-04 2009-03-05 Mor (F) Dynamics Pty Ltd. Method for Transforming Language Into a Visual Form
US20080120312A1 (en) 2005-04-07 2008-05-22 Iofy Corporation System and Method for Creating a New Title that Incorporates a Preexisting Title
US20080120342A1 (en) 2005-04-07 2008-05-22 Iofy Corporation System and Method for Providing Data to be Used in a Presentation on a Device
US20080120330A1 (en) 2005-04-07 2008-05-22 Iofy Corporation System and Method for Linking User Generated Data Pertaining to Sequential Content
US20080120196A1 (en) 2005-04-07 2008-05-22 Iofy Corporation System and Method for Offering a Title for Sale Over the Internet
US20080119953A1 (en) 2005-04-07 2008-05-22 Iofy Corporation Device and System for Utilizing an Information Unit to Present Content and Metadata on a Device
US20080140702A1 (en) 2005-04-07 2008-06-12 Iofy Corporation System and Method for Correlating a First Title with a Second Title
US20080141180A1 (en) 2005-04-07 2008-06-12 Iofy Corporation Apparatus and Method for Utilizing an Information Unit to Provide Navigation Features on a Device
GB0507036D0 (en) 2005-04-07 2005-05-11 Ibm Method and system for language identification
US20080120311A1 (en) 2005-04-07 2008-05-22 Iofy Corporation Device and Method for Protecting Unauthorized Data from being used in a Presentation on a Device
GB0507148D0 (en) 2005-04-08 2005-05-18 Ibm Method and apparatus for multimodal voice and web services
US7516123B2 (en) 2005-04-14 2009-04-07 International Business Machines Corporation Page rank for the semantic web query
WO2006113597A2 (en) 2005-04-14 2006-10-26 The Regents Of The University Of California Method for information retrieval
US8260617B2 (en) 2005-04-18 2012-09-04 Nuance Communications, Inc. Automating input when testing voice-enabled applications
US7627481B1 (en) 2005-04-19 2009-12-01 Apple Inc. Adapting masking thresholds for encoding a low frequency transient signal in audio data
US7996589B2 (en) 2005-04-22 2011-08-09 Microsoft Corporation Auto-suggest lists and handwritten input
US7584093B2 (en) 2005-04-25 2009-09-01 Microsoft Corporation Method and system for generating spelling suggestions
US7684990B2 (en) 2005-04-29 2010-03-23 Nuance Communications, Inc. Method and apparatus for multiple value confirmation and correction in spoken dialog systems
ATE476065T1 (de) 2005-05-03 2010-08-15 Oticon As System and method for sharing network resources between hearing aids
US8046374B1 (en) 2005-05-06 2011-10-25 Symantec Corporation Automatic training of a database intrusion detection system
US7606580B2 (en) 2005-05-11 2009-10-20 Aol Llc Personalized location information for mobile devices
US8117540B2 (en) 2005-05-18 2012-02-14 Neuer Wall Treuhand Gmbh Method and device incorporating improved text input mechanism
US7886233B2 (en) 2005-05-23 2011-02-08 Nokia Corporation Electronic text input involving word completion functionality for predicting word candidates for partial word inputs
FR2886445A1 (fr) 2005-05-30 2006-12-01 France Telecom Method, device, and computer program for speech recognition
US7539882B2 (en) 2005-05-30 2009-05-26 Rambus Inc. Self-powered devices and methods
US8041570B2 (en) 2005-05-31 2011-10-18 Robert Bosch Corporation Dialogue management using scripts
CA2610269C (en) 2005-06-01 2016-02-02 Loquendo S.P.A. Method of adapting a neural network of an automatic speech recognition device
US7580576B2 (en) 2005-06-02 2009-08-25 Microsoft Corporation Stroke localization and binding to electronic document
US8024195B2 (en) 2005-06-27 2011-09-20 Sensory, Inc. Systems and methods of performing speech recognition using historical information
US7538685B1 (en) 2005-06-28 2009-05-26 Avaya Inc. Use of auditory feedback and audio queues in the realization of a personal virtual assistant
US8396456B2 (en) 2005-06-28 2013-03-12 Avaya Integrated Cabinet Solutions Inc. Visual voicemail management
GB0513225D0 (en) 2005-06-29 2005-08-03 Ibm Method and system for building and contracting a linguistic dictionary
US7542967B2 (en) 2005-06-30 2009-06-02 Microsoft Corporation Searching an index of media content
US7433869B2 (en) 2005-07-01 2008-10-07 Ebrary, Inc. Method and apparatus for document clustering and document sketching
US7826945B2 (en) 2005-07-01 2010-11-02 You Zhang Automobile speech-recognition interface
US7885390B2 (en) 2005-07-01 2011-02-08 Soleo Communications, Inc. System and method for multi-modal personal communication services
US7881283B2 (en) 2005-07-13 2011-02-01 Research In Motion Limited Customizability of event notification on telephony-enabled devices
US9094636B1 (en) 2005-07-14 2015-07-28 Zaxcom, Inc. Systems and methods for remotely controlling local audio devices in a virtual wireless multitrack recording system
US7809572B2 (en) 2005-07-20 2010-10-05 Panasonic Corporation Voice quality change portion locating apparatus
US7912720B1 (en) 2005-07-20 2011-03-22 At&T Intellectual Property Ii, L.P. System and method for building emotional machines
US7613264B2 (en) 2005-07-26 2009-11-03 Lsi Corporation Flexible sampling-rate encoder
US20090048821A1 (en) 2005-07-27 2009-02-19 Yahoo! Inc. Mobile language interpreter with text to speech
US7571092B1 (en) 2005-07-29 2009-08-04 Sun Microsystems, Inc. Method and apparatus for on-demand localization of files
US8694322B2 (en) 2005-08-05 2014-04-08 Microsoft Corporation Selective confirmation for execution of a voice activated user interface
US7640160B2 (en) 2005-08-05 2009-12-29 Voicebox Technologies, Inc. Systems and methods for responding to natural language speech utterance
US7844037B2 (en) 2005-08-08 2010-11-30 Palm, Inc. Method and device for enabling message responses to incoming phone calls
JP5320064B2 (ja) 2005-08-09 2013-10-23 モバイル・ヴォイス・コントロール・エルエルシー Voice-controlled wireless communication device system
US7362738B2 (en) 2005-08-09 2008-04-22 Deere & Company Method and system for delivering information to a user
US7620549B2 (en) 2005-08-10 2009-11-17 Voicebox Technologies, Inc. System and method of supporting adaptive misrecognition in conversational speech
KR20080043358A (ko) 2005-08-19 2008-05-16 그레이스노트 아이엔씨 Method and system for controlling the operation of a playback device
US7949529B2 (en) 2005-08-29 2011-05-24 Voicebox Technologies, Inc. Mobile systems and methods of supporting natural language human-machine interactions
WO2007026365A2 (en) 2005-08-31 2007-03-08 Intuview Ltd. Decision-support expert system and methods for real-time exploitation of documents in non-english languages
EP1934971A4 (en) 2005-08-31 2010-10-27 Voicebox Technologies Inc Dynamic speech sharpening
US8265939B2 (en) 2005-08-31 2012-09-11 Nuance Communications, Inc. Hierarchical methods and apparatus for extracting user intent from spoken utterances
US7443316B2 (en) 2005-09-01 2008-10-28 Motorola, Inc. Entering a character into an electronic device
US8677377B2 (en) 2005-09-08 2014-03-18 Apple Inc. Method and apparatus for building an intelligent automated assistant
GB2430101A (en) 2005-09-09 2007-03-14 Mitsubishi Electric Inf Tech Applying metadata for video navigation
US7378963B1 (en) 2005-09-20 2008-05-27 Begault Durand R Reconfigurable auditory-visual display
US8275399B2 (en) 2005-09-21 2012-09-25 Buckyball Mobile Inc. Dynamic context-data tag cloud
US7505784B2 (en) 2005-09-26 2009-03-17 Barbera Melvin A Safety features for portable electronic device
US8270933B2 (en) 2005-09-26 2012-09-18 Zoomsafer, Inc. Safety features for portable electronic device
US7788590B2 (en) 2005-09-26 2010-08-31 Microsoft Corporation Lightweight reference user interface
US7992085B2 (en) 2005-09-26 2011-08-02 Microsoft Corporation Lightweight reference user interface
US7711562B1 (en) 2005-09-27 2010-05-04 At&T Intellectual Property Ii, L.P. System and method for testing a TTS voice
US7693716B1 (en) 2005-09-27 2010-04-06 At&T Intellectual Property Ii, L.P. System and method of developing a TTS voice
US9009046B1 (en) 2005-09-27 2015-04-14 At&T Intellectual Property Ii, L.P. System and method for disambiguating multiple intents in a natural language dialog system
JP5120826B2 (ja) 2005-09-29 2013-01-16 独立行政法人産業技術総合研究所 Pronunciation diagnosis device, pronunciation diagnosis method, recording medium, and pronunciation diagnosis program
US7633076B2 (en) 2005-09-30 2009-12-15 Apple Inc. Automated response to and sensing of user activity in portable devices
JP4908094B2 (ja) 2005-09-30 2012-04-04 株式会社リコー Information processing system, information processing method, and information processing program
US7577522B2 (en) 2005-12-05 2009-08-18 Outland Research, Llc Spatially associated personal reminder system and method
US7930168B2 (en) 2005-10-04 2011-04-19 Robert Bosch Gmbh Natural language processing of disfluent sentences
CN100483399C (zh) 2005-10-09 2009-04-29 株式会社东芝 Method and apparatus for training a transliteration model and a segmentation statistical model
WO2007044806A2 (en) 2005-10-11 2007-04-19 Aol Llc Ordering of conversations based on monitored recipient user interaction with corresponding electronic messages
US8401163B1 (en) 2005-10-18 2013-03-19 Callwave Communications, Llc Methods and systems for call processing and for providing call progress status over a network
US7707032B2 (en) 2005-10-20 2010-04-27 National Cheng Kung University Method and system for matching speech data
US8204266B2 (en) 2005-10-21 2012-06-19 Sfx Technologies Limited Audio devices
US20070094024A1 (en) 2005-10-22 2007-04-26 International Business Machines Corporation System and method for improving text input in a shorthand-on-keyboard interface
US8688148B2 (en) 2005-10-25 2014-04-01 Qualcomm Incorporated Dynamic resource matching system
US7395959B2 (en) 2005-10-27 2008-07-08 International Business Machines Corporation Hands free contact database information entry at a communication device
US7792253B2 (en) 2005-10-27 2010-09-07 International Business Machines Corporation Communications involving devices having different communication modes
KR100755678B1 (ko) 2005-10-28 2007-09-05 삼성전자주식회사 Apparatus and method for detecting named entities
US7778632B2 (en) 2005-10-28 2010-08-17 Microsoft Corporation Multi-modal device capable of automated actions
US9026915B1 (en) 2005-10-31 2015-05-05 At&T Intellectual Property Ii, L.P. System and method for creating a presentation using natural language
US7936339B2 (en) 2005-11-01 2011-05-03 Leapfrog Enterprises, Inc. Method and system for invoking computer functionality by interaction with dynamically generated interface regions of a writing surface
US7640158B2 (en) 2005-11-08 2009-12-29 Multimodal Technologies, Inc. Automatic detection and application of editing patterns in draft documents
US7676463B2 (en) 2005-11-15 2010-03-09 Kroll Ontrack, Inc. Information exploration systems and method
US20070112572A1 (en) 2005-11-15 2007-05-17 Fail Keith W Method and apparatus for assisting vision impaired individuals with selecting items from a list
US8042048B2 (en) 2005-11-17 2011-10-18 Att Knowledge Ventures, L.P. System and method for home automation
WO2007057879A1 (en) 2005-11-17 2007-05-24 Shaul Simhi Personalized voice activity detection
US7909326B2 (en) 2005-11-22 2011-03-22 Walker Digital, Llc Systems, products and processes for conducting instant lottery games
US8055707B2 (en) 2005-11-30 2011-11-08 Alcatel Lucent Calendar interface for digital communications
DE102005057406A1 (de) 2005-11-30 2007-06-06 Valenzuela, Carlos Alberto, Dr.-Ing. Method for recording a sound source with time-variable directional characteristics and for playback, and system for carrying out the method
US20100304342A1 (en) 2005-11-30 2010-12-02 Linguacomm Enterprises Inc. Interactive Language Education System and Method
US8209182B2 (en) 2005-11-30 2012-06-26 University Of Southern California Emotion recognition system
TW200611546A (en) 2005-12-02 2006-04-01 Univ Chang Gung Mobile phone providing remotely activated and touch power-on and voice response system
ATE432563T1 (de) 2005-12-05 2009-06-15 Ericsson Telefon Ab L M Method and system relating to network management
KR100810500B1 (ko) 2005-12-08 2008-03-07 한국전자통신연구원 Method for enhancing user convenience in a conversational voice interface system
US7461043B2 (en) 2005-12-14 2008-12-02 Siemens Aktiengesellschaft Methods and apparatus to abstract events in software applications or services
GB2433403B (en) 2005-12-16 2009-06-24 Emil Ltd A text editing apparatus and method
US8234494B1 (en) 2005-12-21 2012-07-31 At&T Intellectual Property Ii, L.P. Speaker-verification digital signatures
DE102005061365A1 (de) 2005-12-21 2007-06-28 Siemens Ag Method for controlling at least a first and a second background application via a universal speech dialogue system
US7996228B2 (en) 2005-12-22 2011-08-09 Microsoft Corporation Voice initiated network operations
US7657849B2 (en) 2005-12-23 2010-02-02 Apple Inc. Unlocking a device by performing gestures on an unlock image
US7599918B2 (en) 2005-12-29 2009-10-06 Microsoft Corporation Dynamic search with implicit user intention mining
US7685144B1 (en) 2005-12-29 2010-03-23 Google Inc. Dynamically autocompleting a data entry
US7890330B2 (en) 2005-12-30 2011-02-15 Alpine Electronics Inc. Voice recording tool for creating database used in text to speech synthesis system
TWI302265B (en) 2005-12-30 2008-10-21 High Tech Comp Corp Moving determination apparatus
US7673238B2 (en) 2006-01-05 2010-03-02 Apple Inc. Portable media device with video acceleration capabilities
US7684991B2 (en) 2006-01-05 2010-03-23 Alpine Electronics, Inc. Digital audio file search method and apparatus using text-to-speech processing
US8006180B2 (en) 2006-01-10 2011-08-23 Microsoft Corporation Spell checking in network browser based applications
JP2007183864A (ja) 2006-01-10 2007-07-19 Fujitsu Ltd File search method and system therefor
EP1977312A2 (en) 2006-01-16 2008-10-08 Zlango Ltd. Iconic communication
JP4241736B2 (ja) 2006-01-19 2009-03-18 株式会社東芝 Speech processing device and method therefor
FR2896603B1 (fr) 2006-01-20 2008-05-02 Thales Sa Method and device for extracting information from a textual document and transforming it into qualitative data
US9600568B2 (en) 2006-01-23 2017-03-21 Veritas Technologies Llc Methods and systems for automatic evaluation of electronic discovery review and productions
US9275129B2 (en) 2006-01-23 2016-03-01 Symantec Corporation Methods and systems to efficiently find similar and near-duplicate emails and files
US7929805B2 (en) 2006-01-31 2011-04-19 The Penn State Research Foundation Image-based CAPTCHA generation system
IL174107A0 (en) 2006-02-01 2006-08-01 Grois Dan Method and system for advertising by means of a search engine over a data network
US7818291B2 (en) 2006-02-03 2010-10-19 The General Electric Company Data object access system and method using dedicated task object
US8352183B2 (en) 2006-02-04 2013-01-08 Microsoft Corporation Maps for social networking and geo blogs
US7836437B2 (en) 2006-02-10 2010-11-16 Microsoft Corporation Semantic annotations for virtual objects
EP1818837B1 (en) 2006-02-10 2009-08-19 Harman Becker Automotive Systems GmbH System for a speech-driven selection of an audio file and method therefor
US20090222270A2 (en) 2006-02-14 2009-09-03 Ivc Inc. Voice command interface device
US9101279B2 (en) 2006-02-15 2015-08-11 Virtual Video Reality By Ritchey, Llc Mobile user borne brain activity data and surrounding environment data correlation system
US7541940B2 (en) 2006-02-16 2009-06-02 International Business Machines Corporation Proximity-based task alerts
EP2511833B1 (en) 2006-02-17 2020-02-05 Google LLC Encoding and adaptive, scalable accessing of distributed translation models
KR100764174B1 (ko) 2006-03-03 2007-10-08 삼성전자주식회사 Apparatus and method for voice dialogue service
US7983910B2 (en) 2006-03-03 2011-07-19 International Business Machines Corporation Communicating across voice and text channels with emotion preservation
US9250703B2 (en) 2006-03-06 2016-02-02 Sony Computer Entertainment Inc. Interface with gaze detection and voice input
CN1984207B (zh) 2006-03-07 2010-05-12 华为技术有限公司 Charging method and device for a PoC service
US8532678B2 (en) 2006-03-08 2013-09-10 Tomtom International B.V. Portable GPS navigation device
US7752152B2 (en) 2006-03-17 2010-07-06 Microsoft Corporation Using predictive user models for language modeling on a personal device with user behavior models based on statistical modeling
ATE414975T1 (de) 2006-03-17 2008-12-15 Svox Ag Text-to-speech synthesis
DE102006037156A1 (de) 2006-03-22 2007-09-27 Volkswagen Ag Interactive operating device and method for operating the interactive operating device
JP4734155B2 (ja) 2006-03-24 2011-07-27 株式会社東芝 Speech recognition device, speech recognition method, and speech recognition program
US7930183B2 (en) 2006-03-29 2011-04-19 Microsoft Corporation Automatic identification of dialog timing problems for an interactive speech dialog application using speech log data indicative of cases of barge-in and timing problems
US7724696B1 (en) 2006-03-29 2010-05-25 Amazon Technologies, Inc. Predictive reader power management
US8018431B1 (en) 2006-03-29 2011-09-13 Amazon Technologies, Inc. Page turner for handheld electronic book reader device
US20070233806A1 (en) 2006-03-29 2007-10-04 Mehrzad Asadi Method and system for conducting an internet search using a mobile radio terminal
US7283072B1 (en) 2006-03-30 2007-10-16 International Business Machines Corporation Methods of creating a dictionary for data compression
JP4551961B2 (ja) 2006-03-31 2010-09-29 パイオニア株式会社 Voice input support device, method therefor, program therefor, recording medium recording the program, and navigation device
US7756708B2 (en) 2006-04-03 2010-07-13 Google Inc. Automatic language model update
WO2007123797A1 (en) 2006-04-04 2007-11-01 Johnson Controls Technology Company System and method for extraction of meta data from a digital media storage device for media selection in a vehicle
US8510109B2 (en) 2007-08-22 2013-08-13 Canyon Ip Holdings Llc Continuous speech transcription performance indication
US7777717B2 (en) 2006-04-05 2010-08-17 Research In Motion Limited Handheld electronic device and method for performing spell checking during text entry and for integrating the output from such spell checking into the output from disambiguation
US7996769B2 (en) 2006-04-05 2011-08-09 Research In Motion Limited Handheld electronic device and method for performing spell checking during text entry and for providing a spell-check learning feature
US7797629B2 (en) 2006-04-05 2010-09-14 Research In Motion Limited Handheld electronic device and method for performing optimized spell checking during text entry by providing a sequentially ordered series of spell-check algorithms
US7693717B2 (en) 2006-04-12 2010-04-06 Custom Speech Usa, Inc. Session file modification with annotation using speech recognition or text to speech
US8046363B2 (en) 2006-04-13 2011-10-25 Lg Electronics Inc. System and method for clustering documents
DE602006010323D1 (de) 2006-04-13 2009-12-24 Fraunhofer Ges Forschung Audio signal decorrelator
US7707027B2 (en) 2006-04-13 2010-04-27 Nuance Communications, Inc. Identification and rejection of meaningless input during natural language classification
US7475063B2 (en) 2006-04-19 2009-01-06 Google Inc. Augmenting queries with synonyms selected using language statistics
US8077153B2 (en) 2006-04-19 2011-12-13 Microsoft Corporation Precise selection techniques for multi-touch screens
KR100771626B1 (ko) 2006-04-25 2007-10-31 엘지전자 주식회사 Terminal and command input method therefor
US8392183B2 (en) 2006-04-25 2013-03-05 Frank Elmo Weber Character-based automated media summarization
US20070255554A1 (en) 2006-04-26 2007-11-01 Lucent Technologies Inc. Language translation service for text message communications
US8214213B1 (en) 2006-04-27 2012-07-03 At&T Intellectual Property Ii, L.P. Speech recognition based on pronunciation modeling
US9020804B2 (en) 2006-05-10 2015-04-28 Xerox Corporation Method for aligning sentences at the word level enforcing selective contiguity constraints
JP4969645B2 (ja) 2006-05-10 2012-07-04 コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ Automatic external defibrillator with voice prompts of enhanced clarity
WO2007133716A2 (en) 2006-05-11 2007-11-22 Cerebode, Inc. Multimodal communication and command control systems and related methods
US20090307594A1 (en) 2006-05-12 2009-12-10 Timo Kosonen Adaptive User Interface
CN101075228B (zh) 2006-05-15 2012-05-23 松下电器产业株式会社 Method and apparatus for identifying named entities in natural language
US7779353B2 (en) 2006-05-19 2010-08-17 Microsoft Corporation Error checking web documents
US7596765B2 (en) 2006-05-23 2009-09-29 Sony Ericsson Mobile Communications Ab Sound feedback on menu navigation
US7831423B2 (en) 2006-05-25 2010-11-09 Multimodal Technologies, Inc. Replacing text representing a concept with an alternate written form of the concept
US20100257160A1 (en) 2006-06-07 2010-10-07 Yu Cao Methods & apparatus for searching with awareness of different types of information
US7483894B2 (en) 2006-06-07 2009-01-27 Platformation Technologies, Inc Methods and apparatus for entity search
US7523108B2 (en) 2006-06-07 2009-04-21 Platformation, Inc. Methods and apparatus for searching with awareness of geography and languages
TW200801988A (en) 2006-06-08 2008-01-01 George Ko Concurrent multilingual translation system
US7853577B2 (en) 2006-06-09 2010-12-14 Ebay Inc. Shopping context engine
US20080010273A1 (en) 2006-06-12 2008-01-10 Metacarta, Inc. Systems and methods for hierarchical organization and presentation of geographic search results
US7774202B2 (en) 2006-06-12 2010-08-10 Lockheed Martin Corporation Speech activated control system and related methods
US8332218B2 (en) 2006-06-13 2012-12-11 Nuance Communications, Inc. Context-based grammars for automated speech recognition
US20080141125A1 (en) 2006-06-23 2008-06-12 Firooz Ghassabian Combined data entry systems
WO2008001485A1 (fr) 2006-06-26 2008-01-03 Nec Corporation Language model generation system, language model generation method, and language model generation program
JP4675840B2 (ja) 2006-06-29 2011-04-27 三菱電機株式会社 Remote controller and home electrical appliance
US7548895B2 (en) 2006-06-30 2009-06-16 Microsoft Corporation Communication-prompted user assistance
US7586423B2 (en) 2006-06-30 2009-09-08 Research In Motion Limited Handheld electronic device and method for dual-mode disambiguation of text input
US8279171B2 (en) 2006-07-06 2012-10-02 Panasonic Corporation Voice input device
US8050500B1 (en) 2006-07-06 2011-11-01 Senapps, LLC Recognition method and system
WO2008008730A2 (en) 2006-07-08 2008-01-17 Personics Holdings Inc. Personal audio assistant device and method
EP1879000A1 (en) 2006-07-10 2008-01-16 Harman Becker Automotive Systems GmbH Transmission of text messages by navigation systems
JP2008021002A (ja) 2006-07-11 2008-01-31 Fuji Xerox Co Ltd Web server device, display information speech synthesis device, and program
US7747445B2 (en) 2006-07-12 2010-06-29 Nuance Communications, Inc. Distinguishing among different types of abstractions consisting of plurality of commands specified by particular sequencing and or timing or no timing and sequencing using voice commands
US7756710B2 (en) 2006-07-13 2010-07-13 Sri International Method and apparatus for error correction in speech recognition applications
US20080016575A1 (en) 2006-07-14 2008-01-17 Motorola, Inc. Method and system of auto message deletion using expiration
TWI312103B (en) 2006-07-17 2009-07-11 Asia Optical Co Inc Image pickup systems and methods
US20080013751A1 (en) 2006-07-17 2008-01-17 Per Hiselius Volume dependent audio frequency gain profile
US20080022208A1 (en) 2006-07-18 2008-01-24 Creative Technology Ltd System and method for personalizing the user interface of audio rendering devices
JP2008026381A (ja) 2006-07-18 2008-02-07 Konica Minolta Business Technologies Inc Image forming apparatus
US20080042970A1 (en) 2006-07-24 2008-02-21 Yih-Shiuan Liang Associating a region on a surface with a sound or with another region
US8234120B2 (en) 2006-07-26 2012-07-31 Nuance Communications, Inc. Performing a safety analysis for user-defined voice commands to ensure that the voice commands do not cause speech recognition ambiguities
US20080027726A1 (en) 2006-07-28 2008-01-31 Eric Louis Hansen Text to audio mapping, and animation of the text
US8135047B2 (en) 2006-07-31 2012-03-13 Qualcomm Incorporated Systems and methods for including an identifier with a packet associated with a speech signal
JP4728905B2 (ja) 2006-08-02 2011-07-20 クラリオン株式会社 Spoken dialogue device and spoken dialogue program
KR100883652B1 (ko) 2006-08-03 2009-02-18 삼성전자주식회사 Method and apparatus for detecting speech segments, and speech recognition system using the same
US8106742B2 (en) 2006-08-04 2012-01-31 Tegic Communications, Inc. Remotely controlling one or more client devices detected over a wireless network using a mobile device
US20080034044A1 (en) 2006-08-04 2008-02-07 International Business Machines Corporation Electronic mail reader capable of adapting gender and emotions of sender
MX2009001087A (es) 2006-08-04 2009-04-28 Jps Communications Inc Voice modulation recognition in a radio-to-SIP adapter
US20080046948A1 (en) 2006-08-07 2008-02-21 Apple Computer, Inc. Creation, management and delivery of personalized media items
US20080040339A1 (en) 2006-08-07 2008-02-14 Microsoft Corporation Learning question paraphrases from log data
KR100753838B1 (ko) 2006-08-11 2007-08-31 한국전자통신연구원 Adaptive vehicle driving assistance apparatus and method
US7646296B2 (en) 2006-08-11 2010-01-12 Honda Motor Co., Ltd. Method and system for receiving and sending navigational data via a wireless messaging service on a navigation system
US8134481B2 (en) 2006-08-11 2012-03-13 Honda Motor Co., Ltd. Method and system for receiving and sending navigational data via a wireless messaging service on a navigation system
US7796980B1 (en) 2006-08-11 2010-09-14 Sprint Communications Company L.P. Remote mobile voice control of digital/personal video recorder
KR20080015567A (ko) 2006-08-16 2008-02-20 삼성전자주식회사 Voice-based file information guidance system and method for portable devices
KR100764649B1 (ko) 2006-08-18 2007-10-08 삼성전자주식회사 Apparatus and method for controlling a media player in a portable terminal
DE102006039126A1 (de) 2006-08-21 2008-03-06 Robert Bosch Gmbh Method for speech recognition and speech reproduction
WO2008024797A2 (en) 2006-08-21 2008-02-28 Pinger, Inc. Graphical user interface for managing voice messages
US20080059190A1 (en) 2006-08-22 2008-03-06 Microsoft Corporation Speech unit selection using HMM acoustic models
US20080052262A1 (en) 2006-08-22 2008-02-28 Serhiy Kosinov Method for personalized named entity recognition
US20080059200A1 (en) 2006-08-22 2008-03-06 Accenture Global Services Gmbh Multi-Lingual Telephonic Service
US7970216B2 (en) 2006-08-24 2011-06-28 Dell Products L.P. Methods and apparatus for reducing storage size
US20100174544A1 (en) 2006-08-28 2010-07-08 Mark Heifets System, method and end-user device for vocal delivery of textual data
JP4582238B2 (ja) 2006-08-30 2010-11-17 日本電気株式会社 Audio mixing method, and multipoint conference server and program using the method
US8239480B2 (en) 2006-08-31 2012-08-07 Sony Ericsson Mobile Communications Ab Methods of searching using captured portions of digital audio content and additional information separate therefrom and related systems and computer program products
US20080055194A1 (en) 2006-08-31 2008-03-06 Motorola, Inc. Method and system for context based user interface information presentation and positioning
US9071701B2 (en) 2006-08-31 2015-06-30 Qualcomm Incorporated Using wireless characteristic to trigger generation of position fix
US8402499B2 (en) 2006-08-31 2013-03-19 Accenture Global Services Gmbh Voicemail interface system and method
US9552349B2 (en) 2006-08-31 2017-01-24 International Business Machines Corporation Methods and apparatus for performing spelling corrections using one or more variant hash tables
US7689408B2 (en) 2006-09-01 2010-03-30 Microsoft Corporation Identifying language of origin for words using estimates of normalized appearance frequency
US7881928B2 (en) 2006-09-01 2011-02-01 International Business Machines Corporation Enhanced linguistic transformation
US20080077393A1 (en) 2006-09-01 2008-03-27 Yuqing Gao Virtual keyboard adaptation for multilingual input
JP4666648B2 (ja) 2006-09-01 2011-04-06 本田技研工業株式会社 Voice response system and voice response program
US8170790B2 (en) 2006-09-05 2012-05-01 Garmin Switzerland Gmbh Apparatus for switching navigation device mode
US7683886B2 (en) 2006-09-05 2010-03-23 Research In Motion Limited Disambiguated text message review function
US8253695B2 (en) 2006-09-06 2012-08-28 Apple Inc. Email client for a portable multifunction device
US8564544B2 (en) 2006-09-06 2013-10-22 Apple Inc. Touch screen device, method, and graphical user interface for customizing display of content category icons
US7996792B2 (en) 2006-09-06 2011-08-09 Apple Inc. Voicemail manager for portable multifunction device
US8589869B2 (en) 2006-09-07 2013-11-19 Wolfram Alpha Llc Methods and systems for determining a formula
US7771320B2 (en) 2006-09-07 2010-08-10 Nike, Inc. Athletic performance sensing and/or tracking systems and methods
TWI322610B (en) 2006-09-08 2010-03-21 Htc Corp Handheld electronic device
US9318108B2 (en) 2010-01-18 2016-04-19 Apple Inc. Intelligent automated assistant
JP2008064687A (ja) 2006-09-08 2008-03-21 Toyota Motor Corp Travel information guidance device
US8374874B2 (en) 2006-09-11 2013-02-12 Nuance Communications, Inc. Establishing a multimodal personality for a multimodal application in dependence upon attributes of user interaction
US8564543B2 (en) 2006-09-11 2013-10-22 Apple Inc. Media player with imaged based browsing
US8036766B2 (en) 2006-09-11 2011-10-11 Apple Inc. Intelligent audio mixing among media playback and at least one other non-playback application
US9892650B2 (en) 2006-09-11 2018-02-13 Houghton Mifflin Harcourt Publishing Company Recovery of polled data after an online test platform failure
US20080071544A1 (en) 2006-09-14 2008-03-20 Google Inc. Integrating Voice-Enabled Local Search and Contact Lists
WO2008033095A1 (en) 2006-09-15 2008-03-20 Agency For Science, Technology And Research Apparatus and method for speech utterance verification
US8027837B2 (en) 2006-09-15 2011-09-27 Apple Inc. Using non-speech sounds during text-to-speech synthesis
US20100278453A1 (en) 2006-09-15 2010-11-04 King Martin T Capture and display of annotations in paper and electronic documents
US8407229B2 (en) 2006-09-19 2013-03-26 Iac Search & Media, Inc. Systems and methods for aggregating search results
US20080076972A1 (en) 2006-09-21 2008-03-27 Apple Inc. Integrated sensors for tracking performance metrics
JP4393494B2 (ja) 2006-09-22 2010-01-06 株式会社東芝 Machine translation apparatus, machine translation method, and machine translation program
US20080077384A1 (en) 2006-09-22 2008-03-27 International Business Machines Corporation Dynamically translating a software application to a user selected target language that is not natively provided by the software application
US7865282B2 (en) 2006-09-22 2011-01-04 General Motors Llc Methods of managing communications for an in-vehicle telematics system
US20080084974A1 (en) 2006-09-25 2008-04-10 International Business Machines Corporation Method and system for interactively synthesizing call center responses using multi-language text-to-speech synthesizers
KR100813170B1 (ko) 2006-09-27 2008-03-17 삼성전자주식회사 Method and system for semantic indexing of photos
US7649454B2 (en) 2006-09-28 2010-01-19 Ektimisi Semiotics Holdings, Llc System and method for providing a task reminder based on historical travel information
US8014308B2 (en) 2006-09-28 2011-09-06 Microsoft Corporation Hardware architecture for cloud services
US8214208B2 (en) 2006-09-28 2012-07-03 Reqall, Inc. Method and system for sharing portable voice profiles
US7930197B2 (en) 2006-09-28 2011-04-19 Microsoft Corporation Personal data mining
US7528713B2 (en) 2006-09-28 2009-05-05 Ektimisi Semiotics Holdings, Llc Apparatus and method for providing a task reminder based on travel history
US7945470B1 (en) 2006-09-29 2011-05-17 Amazon Technologies, Inc. Facilitating performance of submitted tasks by mobile task performers
US7831432B2 (en) 2006-09-29 2010-11-09 International Business Machines Corporation Audio menus describing media contents of media players
US7885222B2 (en) 2006-09-29 2011-02-08 Advanced Micro Devices, Inc. Task scheduler responsive to connectivity prerequisites
US20080082338A1 (en) 2006-09-29 2008-04-03 O'neil Michael P Systems and methods for secure voice identification and medical device interface
JP2008090545A (ja) 2006-09-29 2008-04-17 Toshiba Corp Voice dialogue apparatus and voice dialogue method
EP1909263B1 (en) 2006-10-02 2009-01-28 Harman Becker Automotive Systems GmbH Exploitation of language identification of media file data in speech dialog systems
US7673251B1 (en) 2006-10-02 2010-03-02 Adobe Systems, Incorporated Panel presentation
US20080082390A1 (en) 2006-10-02 2008-04-03 International Business Machines Corporation Methods for Generating Auxiliary Data Operations for a Role Based Personalized Business User Workplace
US7801721B2 (en) 2006-10-02 2010-09-21 Google Inc. Displaying original text in a user interface with translated text
JP2008092269A (ja) 2006-10-02 2008-04-17 Matsushita Electric Ind Co Ltd Hands-free call device
US7937075B2 (en) 2006-10-06 2011-05-03 At&T Intellectual Property I, L.P. Mode changing of a mobile communications device and vehicle settings when the mobile communications device is in proximity to a vehicle
US8145473B2 (en) 2006-10-10 2012-03-27 Abbyy Software Ltd. Deep model statistics method for machine translation
US8024193B2 (en) 2006-10-10 2011-09-20 Apple Inc. Methods and apparatus related to pruning for concatenative text-to-speech synthesis
CN101162153A (zh) 2006-10-11 2008-04-16 丁玉国 Voice-controlled vehicle-mounted GPS navigation system and implementation method thereof
US20080091426A1 (en) 2006-10-12 2008-04-17 Rod Rempel Adaptive context for automatic speech recognition systems
US7793228B2 (en) 2006-10-13 2010-09-07 Apple Inc. Method, system, and graphical user interface for text entry with partial word display
US8041568B2 (en) 2006-10-13 2011-10-18 Google Inc. Business listing search
US8073681B2 (en) 2006-10-16 2011-12-06 Voicebox Technologies, Inc. System and method for a cooperative conversational voice user interface
US7697922B2 (en) 2006-10-18 2010-04-13 At&T Intellectual Property I., L.P. Event notification systems and related methods
US20080098480A1 (en) 2006-10-20 2008-04-24 Hewlett-Packard Development Company Lp Information association
WO2008050225A2 (en) 2006-10-24 2008-05-02 Edgetech America, Inc. Method for spell-checking location-bound words within a document
US20080096533A1 (en) 2006-10-24 2008-04-24 Kallideas Spa Virtual Assistant With Real-Time Emotions
US20080124695A1 (en) 2006-10-26 2008-05-29 Cary Michael Myers Non-intrusive audio book
US8204739B2 (en) 2008-04-15 2012-06-19 Mobile Technologies, Llc System and methods for maintaining speech-to-speech translation in the field
US8972268B2 (en) 2008-04-15 2015-03-03 Facebook, Inc. Enhanced speech-to-speech translation system and methods for adding a new word
US8255216B2 (en) 2006-10-30 2012-08-28 Nuance Communications, Inc. Speech recognition of character sequences
US8037179B2 (en) 2006-11-02 2011-10-11 Storz Endoskop Produktions Gmbh Device control system employing extensible markup language for defining information resources
JP2008116298A (ja) 2006-11-02 2008-05-22 Denso Corp In-vehicle emergency reporting device and in-vehicle emergency reporting system
US9471333B2 (en) 2006-11-03 2016-10-18 Conceptual Speech, Llc Contextual speech-recognition user-interface driven system and method
US20080109222A1 (en) 2006-11-04 2008-05-08 Edward Liu Advertising using extracted context sensitive information and data of interest from voice/audio transmissions and recordings
US7873517B2 (en) 2006-11-09 2011-01-18 Volkswagen Of America, Inc. Motor vehicle with a speech interface
US9329753B2 (en) 2006-11-10 2016-05-03 Blackberry Limited Handheld electronic device having selectable language indicator and menus for language selection and method therefor
US9355568B2 (en) 2006-11-13 2016-05-31 Joyce S. Stone Systems and methods for providing an electronic reader having interactive and educational features
US8718538B2 (en) 2006-11-13 2014-05-06 Joseph Harb Real-time remote purchase-list capture system
US20080114841A1 (en) 2006-11-14 2008-05-15 Lambert Daniel T System and method for interfacing with event management software
US20080114604A1 (en) 2006-11-15 2008-05-15 Motorola, Inc. Method and system for a user interface using higher order commands
US7904298B2 (en) 2006-11-17 2011-03-08 Rao Ashwin P Predictive speech-to-text input
CN101193460B (zh) 2006-11-20 2011-09-28 松下电器产业株式会社 Apparatus and method for detecting sound
US8090194B2 (en) 2006-11-21 2012-01-03 Mantis Vision Ltd. 3D geometric modeling and motion capture using both single and dual imaging
US8010338B2 (en) 2006-11-27 2011-08-30 Sony Ericsson Mobile Communications Ab Dynamic modification of a messaging language
US20080126075A1 (en) 2006-11-27 2008-05-29 Sony Ericsson Mobile Communications Ab Input prediction
US8600760B2 (en) 2006-11-28 2013-12-03 General Motors Llc Correcting substitution errors during automatic speech recognition by accepting a second best when first best is confusable
US8055502B2 (en) 2006-11-28 2011-11-08 General Motors Llc Voice dialing using a rejection reference
US20080126093A1 (en) 2006-11-28 2008-05-29 Nokia Corporation Method, Apparatus and Computer Program Product for Providing a Language Based Interactive Multimedia System
JP2008134949A (ja) 2006-11-29 2008-06-12 Fujitsu Ltd Portable terminal device and schedule creation screen display method
US8571862B2 (en) 2006-11-30 2013-10-29 Ashwin P. Rao Multimodal interface for input of text
DE602006005830D1 (de) 2006-11-30 2009-04-30 Harman Becker Automotive Sys Interactive speech recognition system
US8676802B2 (en) 2006-11-30 2014-03-18 Oracle Otc Subsidiary Llc Method and system for information retrieval with clustering
US8401847B2 (en) 2006-11-30 2013-03-19 National Institute Of Advanced Industrial Science And Technology Speech recognition system and program therefor
US8355915B2 (en) 2006-11-30 2013-01-15 Rao Ashwin P Multimodal speech recognition system
GB0623915D0 (en) 2006-11-30 2007-01-10 Ibm Phonetic decoding and concatentive speech synthesis
US9830912B2 (en) 2006-11-30 2017-11-28 Ashwin P Rao Speak and touch auto correction interface
US8001400B2 (en) 2006-12-01 2011-08-16 Apple Inc. Power consumption management for functional preservation in a battery-powered electronic device
US20080129520A1 (en) 2006-12-01 2008-06-05 Apple Computer, Inc. Electronic device with enhanced audio feedback
US20080133245A1 (en) 2006-12-04 2008-06-05 Sehda, Inc. Methods for speech-to-speech translation
US8045808B2 (en) 2006-12-04 2011-10-25 Trend Micro Incorporated Pure adversarial approach for identifying text content in images
US8208624B2 (en) 2006-12-05 2012-06-26 Hewlett-Packard Development Company, L.P. Hearing aid compatible mobile phone
EP2095250B1 (en) 2006-12-05 2014-11-12 Nuance Communications, Inc. Wireless server based text to speech email
US8311590B2 (en) 2006-12-05 2012-11-13 Hewlett-Packard Development Company, L.P. System and method for improved loudspeaker functionality
US7676249B2 (en) 2006-12-05 2010-03-09 Research In Motion Limited Alert methods and apparatus for call appointments in a calendar application based on communication conditions of a mobile station
US20080140652A1 (en) 2006-12-07 2008-06-12 Jonathan Travis Millman Authoring tool
US20080140413A1 (en) 2006-12-07 2008-06-12 Jonathan Travis Millman Synchronization of audio to reading
US7831246B1 (en) 2006-12-08 2010-11-09 At&T Mobility Ii, Llc Mobile merchant
US10185779B2 (en) 2008-03-03 2019-01-22 Oath Inc. Mechanisms for content aggregation, syndication, sharing, and updating
US7630972B2 (en) 2007-01-05 2009-12-08 Yahoo! Inc. Clustered search processing
US9522332B2 (en) 2006-12-13 2016-12-20 Voodoo Gaming Llc Video games including real-life attributes and/or fantasy team settings
US20100080398A1 (en) 2006-12-13 2010-04-01 Phonak Ag Method and system for hearing device fitting
US7783644B1 (en) 2006-12-13 2010-08-24 Google Inc. Query-independent entity importance in books
US8731610B2 (en) 2006-12-13 2014-05-20 Samsung Electronics Co., Ltd. Method for adaptive user interface in mobile devices
US7646297B2 (en) 2006-12-15 2010-01-12 At&T Intellectual Property I, L.P. Context-detected auto-mode switching
US20080146290A1 (en) 2006-12-18 2008-06-19 Motorola, Inc. Changing a mute state of a voice call from a bluetooth headset
US7552045B2 (en) 2006-12-18 2009-06-23 Nokia Corporation Method, apparatus and computer program product for providing flexible text based language identification
US8204182B2 (en) 2006-12-19 2012-06-19 Nuance Communications, Inc. Dialect translator for a speech application environment extended for interactive text exchanges
US20080147411A1 (en) 2006-12-19 2008-06-19 International Business Machines Corporation Adaptation of a speech processing system from external input that is not directly related to sounds in an operational acoustic environment
KR101405284B1 (ko) 2006-12-20 2014-06-10 삼성전자 주식회사 Image forming apparatus and method for displaying a multilingual keyboard thereof
GB0625642D0 (en) 2006-12-21 2007-01-31 Symbian Software Ltd Mobile sensor feedback
EP1936606B1 (en) 2006-12-21 2011-10-05 Harman Becker Automotive Systems GmbH Multi-stage speech recognition
US20080154600A1 (en) 2006-12-21 2008-06-26 Nokia Corporation System, Method, Apparatus and Computer Program Product for Providing Dynamic Vocabulary Prediction for Speech Recognition
US7991724B2 (en) 2006-12-21 2011-08-02 Support Machines Ltd. Method and a computer program product for providing a response to a statement of a user
US8630855B2 (en) 2006-12-22 2014-01-14 Anthony Oddo Call system and method
US8010367B2 (en) 2006-12-22 2011-08-30 Nuance Communications, Inc. Spoken free-form passwords for light-weight speaker verification using standard speech recognition engines
CN101563682A (zh) 2006-12-22 2009-10-21 日本电气株式会社 Sentence paraphrasing method, program, and system
US20080154612A1 (en) 2006-12-26 2008-06-26 Voice Signal Technologies, Inc. Local storage and use of search results for voice-enabled mobile communications devices
US20080154577A1 (en) 2006-12-26 2008-06-26 Sehda,Inc. Chunk-based statistical machine translation system
JP4867654B2 (ja) 2006-12-28 2012-02-01 日産自動車株式会社 Speech recognition device and speech recognition method
US20080163119A1 (en) 2006-12-28 2008-07-03 Samsung Electronics Co., Ltd. Method for providing menu and multimedia device using the same
US7865817B2 (en) 2006-12-29 2011-01-04 Amazon Technologies, Inc. Invariant referencing in digital works
EP1939759A1 (en) 2006-12-29 2008-07-02 Vodafone Holding GmbH Method for providing content to a mobile device, gateway for providing content and mobile device
US8019271B1 (en) 2006-12-29 2011-09-13 Nextel Communications, Inc. Methods and systems for presenting information on mobile devices
US8019050B2 (en) 2007-01-03 2011-09-13 Motorola Solutions, Inc. Method and apparatus for providing feedback of vocal quality to a user
US8493330B2 (en) 2007-01-03 2013-07-23 Apple Inc. Individual channel phase delay scheme
WO2008086112A1 (en) 2007-01-04 2008-07-17 Sound Id Personalized sound system hearing profile selection process
US20080167876A1 (en) 2007-01-04 2008-07-10 International Business Machines Corporation Methods and computer program products for providing paraphrasing in a text-to-speech system
US8074172B2 (en) 2007-01-05 2011-12-06 Apple Inc. Method, system, and graphical user interface for providing word recommendations
US7889184B2 (en) 2007-01-05 2011-02-15 Apple Inc. Method, system and graphical user interface for displaying hyperlink information
US7957955B2 (en) 2007-01-05 2011-06-07 Apple Inc. Method and system for providing word recommendations for text input
EP2099652B1 (en) 2007-01-05 2016-11-16 Visteon Global Technologies, Inc. System and method for customized audio prompting
US7889185B2 (en) 2007-01-05 2011-02-15 Apple Inc. Method, system, and graphical user interface for activating hyperlinks
US8060824B2 (en) 2007-01-05 2011-11-15 Starz Entertainment Llc User interface for a multimedia service
US7978176B2 (en) 2007-01-07 2011-07-12 Apple Inc. Portrait-landscape rotation heuristics for a portable multifunction device
US8553856B2 (en) 2007-01-07 2013-10-08 Apple Inc. Voicemail systems and methods
US8391844B2 (en) 2007-01-07 2013-03-05 Apple Inc. Voicemail systems and methods
WO2008085742A2 (en) 2007-01-07 2008-07-17 Apple Inc. Portable multifunction device, method and graphical user interface for interacting with user input elements in displayed content
FR2911201A1 (fr) 2007-01-08 2008-07-11 Sagem Comm Method for editing a text expressed in a language
GB2445666A (en) 2007-01-09 2008-07-16 Spinvox Ltd Method of replying to an electronically received message
EP2116035A1 (en) 2007-01-09 2009-11-11 Spinvox Limited Voice messages converted to text for display on a web site
US20080165994A1 (en) 2007-01-10 2008-07-10 Magnadyne Corporation Bluetooth enabled hearing aid
US8056070B2 (en) 2007-01-10 2011-11-08 Goller Michael D System and method for modifying and updating a speech recognition program
KR100799195B1 (ko) 2007-01-12 2008-01-29 삼성전자주식회사 Method and apparatus for connecting an emergency call in a portable terminal
US20080172698A1 (en) 2007-01-12 2008-07-17 Berger Adam L Performing support functions on a portable device
US7912724B1 (en) 2007-01-18 2011-03-22 Adobe Systems Incorporated Audio comparison using phoneme matching
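KR100837166B1 (ko) 2007-01-20 2008-06-11 엘지전자 주식회사 Method for displaying information on an electronic device, and the electronic device
KR100883657B1 (ko) 2007-01-26 2009-02-18 삼성전자주식회사 Method and apparatus for searching music based on speech recognition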
JP5270841B2 (ja) 2007-01-29 2013-08-21 株式会社タイトー Lesson program and storage medium
US7707226B1 (en) 2007-01-29 2010-04-27 Aol Inc. Presentation of content items based on dynamic monitoring of real-time context
JP2008185805A (ja) 2007-01-30 2008-08-14 Internatl Business Mach Corp <Ibm> Technique for generating high-quality synthesized speech
US20080186196A1 (en) 2007-02-01 2008-08-07 Sony Ericsson Mobile Communications Ab Non-time based snooze
US20080189606A1 (en) 2007-02-02 2008-08-07 Michal Rybak Handheld electronic device including predictive accent mechanism, and associated method
US20110047605A1 (en) 2007-02-06 2011-02-24 Vidoop, Llc System And Method For Authenticating A User To A Computer System
US20080186960A1 (en) 2007-02-06 2008-08-07 Access Systems Americas, Inc. System and method of controlling media streams in an electronic device
US7873710B2 (en) 2007-02-06 2011-01-18 5O9, Inc. Contextual data communication platform
US7818176B2 (en) 2007-02-06 2010-10-19 Voicebox Technologies, Inc. System and method for selecting and presenting advertisements based on natural language processing of voice-based input
EP2126900B1 (en) 2007-02-06 2013-04-24 Nuance Communications Austria GmbH Method and system for creating entries in a speech recognition lexicon
US7912700B2 (en) 2007-02-08 2011-03-22 Microsoft Corporation Context based word prediction
US9465791B2 (en) 2007-02-09 2016-10-11 International Business Machines Corporation Method and apparatus for automatic detection of spelling errors in one or more documents
US8078978B2 (en) 2007-10-19 2011-12-13 Google Inc. Method and system for predicting text
US20080195630A1 (en) 2007-02-13 2008-08-14 Amadeus S.A.S. Web service interrogation method and apparatus
US7941133B2 (en) 2007-02-14 2011-05-10 At&T Intellectual Property I, L.P. Methods, systems, and computer program products for schedule management based on locations of wireless devices
JP4890289B2 (ja) 2007-02-14 2012-03-07 ヤフー株式会社 Remote control character input control method, server, and remote control character input control program
US7853240B2 (en) 2007-02-15 2010-12-14 Research In Motion Limited Emergency number selection for mobile communications device
US20080201434A1 (en) 2007-02-16 2008-08-21 Microsoft Corporation Context-Sensitive Searches and Functionality for Instant Messaging Applications
US20080201000A1 (en) 2007-02-20 2008-08-21 Nokia Corporation Contextual grouping of media items
WO2008103925A1 (en) 2007-02-22 2008-08-28 Personics Holdings Inc. Method and device for sound detection and audio control
US20080204379A1 (en) 2007-02-22 2008-08-28 Microsoft Corporation Display with integrated audio transducer device
US7912828B2 (en) 2007-02-23 2011-03-22 Apple Inc. Pattern searching methods and apparatuses
US7801728B2 (en) 2007-02-26 2010-09-21 Nuance Communications, Inc. Document session replay for multimodal applications
US8112402B2 (en) 2007-02-26 2012-02-07 Microsoft Corporation Automatic disambiguation based on a reference resource
US7797265B2 (en) 2007-02-26 2010-09-14 Siemens Corporation Document clustering that applies a locality sensitive hashing function to a feature vector to obtain a limited set of candidate clusters
US7822608B2 (en) 2007-02-27 2010-10-26 Nuance Communications, Inc. Disambiguating a speech recognition grammar in a multimodal application
US7840409B2 (en) 2007-02-27 2010-11-23 Nuance Communications, Inc. Ordering recognition results produced by an automatic speech recognition engine for a multimodal application
US7826872B2 (en) 2007-02-28 2010-11-02 Sony Ericsson Mobile Communications Ab Audio nickname tag associated with PTT user
US8362642B2 (en) 2007-03-01 2013-01-29 Rambus Inc. Optimized power supply for an electronic system
US8457959B2 (en) 2007-03-01 2013-06-04 Edward C. Kaiser Systems and methods for implicitly interpreting semantically redundant communication modes
US8521519B2 (en) 2007-03-02 2013-08-27 Panasonic Corporation Adaptive audio signal source vector quantization device and adaptive audio signal source vector quantization method that search for pitch period based on variable resolution
JP2008217468A (ja) 2007-03-05 2008-09-18 Mitsubishi Electric Corp Information processing apparatus and menu item generation program
US20080221866A1 (en) 2007-03-06 2008-09-11 Lalitesh Katragadda Machine Learning For Transliteration
US8886540B2 (en) 2007-03-07 2014-11-11 Vlingo Corporation Using speech recognition results based on an unstructured language model in a mobile communication facility application
US20090030685A1 (en) 2007-03-07 2009-01-29 Cerra Joseph P Using speech recognition results based on an unstructured language model with a navigation system
US20080221901A1 (en) 2007-03-07 2008-09-11 Joseph Cerra Mobile general search environment speech processing facility
US8838457B2 (en) 2007-03-07 2014-09-16 Vlingo Corporation Using results of unstructured language model based speech recognition to control a system-level function of a mobile communications facility
US8635243B2 (en) 2007-03-07 2014-01-21 Research In Motion Limited Sending a communications header with voice recording to send metadata for use in speech recognition, formatting, and search mobile search application
US20110060587A1 (en) 2007-03-07 2011-03-10 Phillips Michael S Command and control utilizing ancillary information in a mobile voice-to-speech application
US8886545B2 (en) 2007-03-07 2014-11-11 Vlingo Corporation Dealing with switch latency in speech recognition
SE530911C2 (sv) 2007-03-07 2008-10-14 Hexaformer Ab Transformer arrangement
US8880405B2 (en) 2007-03-07 2014-11-04 Vlingo Corporation Application text entry in a mobile environment using a speech processing facility
US8949266B2 (en) 2007-03-07 2015-02-03 Vlingo Corporation Multiple web-based content category searching in mobile search application
US20110054894A1 (en) 2007-03-07 2011-03-03 Phillips Michael S Speech recognition through the collection of contact information in mobile dictation application
US20080219641A1 (en) 2007-03-09 2008-09-11 Barry Sandrew Apparatus and method for synchronizing a secondary audio track to the audio track of a video source
GB0704772D0 (en) 2007-03-12 2007-04-18 Mongoose Ventures Ltd Aural similarity measuring system for text
US20080256613A1 (en) 2007-03-13 2008-10-16 Grover Noel J Voice print identification portal
US7801729B2 (en) 2007-03-13 2010-09-21 Sensory, Inc. Using multiple attributes to create a voice search playlist
US8924844B2 (en) 2007-03-13 2014-12-30 Visual Cues Llc Object annotation
US7945851B2 (en) 2007-03-14 2011-05-17 Nuance Communications, Inc. Enabling dynamic voiceXML in an X+V page of a multimodal application
JP4466666B2 (ja) 2007-03-14 2010-05-26 日本電気株式会社 Method, apparatus, and program for creating meeting minutes
US20080229218A1 (en) 2007-03-14 2008-09-18 Joon Maeng Systems and methods for providing additional information for objects in electronic documents
JP4793291B2 (ja) 2007-03-15 2011-10-12 パナソニック株式会社 Remote control device
US8626930B2 (en) 2007-03-15 2014-01-07 Apple Inc. Multimedia content filtering
US8144920B2 (en) 2007-03-15 2012-03-27 Microsoft Corporation Automated location estimation using image analysis
US8219406B2 (en) 2007-03-15 2012-07-10 Microsoft Corporation Speech-centric multimodal user interface design in mobile technology
WO2008114448A1 (ja) 2007-03-20 2008-09-25 Fujitsu Limited Speech recognition system, speech recognition program, and speech recognition method
US8886537B2 (en) 2007-03-20 2014-11-11 Nuance Communications, Inc. Method and system for text-to-speech synthesis with personalized voice
US8515757B2 (en) 2007-03-20 2013-08-20 Nuance Communications, Inc. Indexing digitized speech with words represented in the digitized speech
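JP2008236448A (ja) 2007-03-22 2008-10-02 Clarion Co Ltd Audio signal processing device, hands-free call device, audio signal processing method, and control program
JP2008233678A (ja) 2007-03-22 2008-10-02 Honda Motor Co Ltd Voice dialogue apparatus, voice dialogue method, and voice dialogue program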
US8909532B2 (en) 2007-03-23 2014-12-09 Nuance Communications, Inc. Supporting multi-lingual user interaction with a multimodal application
US8126484B2 (en) 2007-03-26 2012-02-28 Qualcomm, Incorporated Apparatus and methods of sharing contact information between mobile communication devices using short message service
JP2008271481A (ja) 2007-03-27 2008-11-06 Brother Ind Ltd Telephone device
US8498628B2 (en) 2007-03-27 2013-07-30 Iocast Llc Content delivery system and method
CA2682000A1 (en) 2007-03-28 2008-10-02 Breakthrough Performancetech, Llc Systems and methods for computerized interactive training
WO2008120036A1 (en) 2007-03-29 2008-10-09 Nokia Corporation Method at a central server for managing a translation dictionary and a translation server system
JP2008250375A (ja) 2007-03-29 2008-10-16 Toshiba Corp Character input device, method, and program
US7797269B2 (en) 2007-03-29 2010-09-14 Nokia Corporation Method and apparatus using a context sensitive dictionary
JP4713532B2 (ja) 2007-03-29 2011-06-29 株式会社エヌ・ティ・ティ・ドコモ Communication terminal and program therefor
US8370145B2 (en) 2007-03-29 2013-02-05 Panasonic Corporation Device for extracting keywords in a conversation
US20080244446A1 (en) 2007-03-29 2008-10-02 Lefevre John Disambiguation of icons and other media in text-based applications
TWI502380B (zh) 2007-03-29 2015-10-01 Nokia Corp Method, apparatus, server, system, and computer program product for use with predictive text input
US8775931B2 (en) 2007-03-30 2014-07-08 Blackberry Limited Spell check function that applies a preference to a spell check algorithm based upon extensive user selection of spell check results generated by the algorithm, and associated handheld electronic device
US8650030B2 (en) 2007-04-02 2014-02-11 Google Inc. Location based responses to telephone requests
US8131556B2 (en) 2007-04-03 2012-03-06 Microsoft Corporation Communications using different modalities
US20080247529A1 (en) 2007-04-03 2008-10-09 Microsoft Corporation Incoming Call Classification And Disposition
US8977255B2 (en) 2007-04-03 2015-03-10 Apple Inc. Method and system for operating a multi-function portable electronic device using voice-activation
US8032472B2 (en) 2007-04-04 2011-10-04 Tuen Solutions Limited Liability Company Intelligent agent for distributed services for mobile devices
US7920902B2 (en) 2007-04-04 2011-04-05 Carroll David W Mobile personal audio device
US7809610B2 (en) 2007-04-09 2010-10-05 Platformation, Inc. Methods and apparatus for freshness and completeness of information
EP1981253B1 (en) 2007-04-10 2011-06-22 Oticon A/S A user interface for a communications device
CN101286094A (zh) 2007-04-10 2008-10-15 谷歌股份有限公司 Multi-mode input method editor
US20080253577A1 (en) 2007-04-13 2008-10-16 Apple Inc. Multi-channel sound panner
US20100142740A1 (en) 2007-04-16 2010-06-10 Gn Resound A/S Hearing aid wireless communication adaptor
JP4412504B2 (ja) 2007-04-17 2010-02-10 本田技研工業株式会社 Speech recognition device, speech recognition method, and speech recognition program
US7848924B2 (en) 2007-04-17 2010-12-07 Nokia Corporation Method, apparatus and computer program product for providing voice conversion using temporal dynamic features
KR100769156B1 (ko) 2007-04-20 2007-10-22 주식회사 서비전자 Home network system and control method thereof
JP2008268684A (ja) 2007-04-24 2008-11-06 Seiko Instruments Inc Audio playback device, electronic dictionary, audio playback method, and audio playback program
US7953600B2 (en) 2007-04-24 2011-05-31 Novaspeech Llc System and method for hybrid speech synthesis
JP5243730B2 (ja) 2007-04-24 2013-07-24 株式会社エヌ・ティ・ティ・ドコモ Search support system and search support method
US8457946B2 (en) 2007-04-26 2013-06-04 Microsoft Corporation Recognition architecture for generating Asian characters
US20080270151A1 (en) 2007-04-26 2008-10-30 Bd Metrics Method and system for developing an audience of buyers and obtaining their behavioral preferences to promote commerce on a communication network
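JP4769223B2 (ja) 2007-04-26 2011-09-07 旭化成株式会社 Text-to-phonetic-symbol conversion dictionary creation device, recognition vocabulary dictionary creation device, and speech recognition device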
US8695074B2 (en) 2007-04-26 2014-04-08 Microsoft Corporation Pre-authenticated calling for voice applications
KR100819928B1 (ko) 2007-04-26 2008-04-08 (주)부성큐 Speech recognition apparatus for a portable terminal and method thereof
US20080270344A1 (en) 2007-04-30 2008-10-30 Yurick Steven J Rich media content search engine
US8005664B2 (en) 2007-04-30 2011-08-23 Tachyon Technologies Pvt. Ltd. System, method to generate transliteration and method for generating decision tree to obtain transliteration
US7983915B2 (en) 2007-04-30 2011-07-19 Sonic Foundry, Inc. Audio content search engine
US7912289B2 (en) 2007-05-01 2011-03-22 Microsoft Corporation Image text replacement
US20080273672A1 (en) 2007-05-03 2008-11-06 Microsoft Corporation Automated attendant grammar tuning
US8032383B1 (en) 2007-05-04 2011-10-04 Foneweb, Inc. Speech controlled services and devices using internet
US7899666B2 (en) 2007-05-04 2011-03-01 Expert System S.P.A. Method and system for automatically extracting relations between concepts included in text
US9292807B2 (en) 2007-05-10 2016-03-22 Microsoft Technology Licensing, Llc Recommending actions based on context
TWI336048B (en) 2007-05-11 2011-01-11 Delta Electronics Inc Input system for mobile search and method therefor
KR20090001716A (ko) 2007-05-14 2009-01-09 이병수 System and method for operating a growing intelligent virtual assistant
CN102968441B (zh) 2007-05-15 2016-03-30 Tivo有限公司 Multimedia content search and recording scheduling system
US8538757B2 (en) 2007-05-17 2013-09-17 Redstart Systems, Inc. System and method of a list commands utility for a speech recognition command system
US8886521B2 (en) 2007-05-17 2014-11-11 Redstart Systems, Inc. System and method of dictation for a speech recognition command system
US8620652B2 (en) 2007-05-17 2013-12-31 Microsoft Corporation Speech recognition macro runtime
EP2168378A1 (en) 2007-05-18 2010-03-31 Giacomo Poretti System and method to consume web content using television set
US8990215B1 (en) 2007-05-21 2015-03-24 Amazon Technologies, Inc. Obtaining and verifying search indices
EG25474A (en) 2007-05-21 2012-01-11 Sherikat Link Letatweer Elbarmaguey At Sae Method for translitering and suggesting arabic replacement for a given user input
US20080294981A1 (en) 2007-05-21 2008-11-27 Advancis.Com, Inc. Page clipping tool for digital publications
US20080294517A1 (en) 2007-05-25 2008-11-27 Gregory Howard Hill Customized image based calendar, method and system
US8099418B2 (en) 2007-05-28 2012-01-17 Panasonic Corporation Information search support method and information search support device
US8831941B2 (en) 2007-05-29 2014-09-09 At&T Intellectual Property Ii, L.P. System and method for tracking fraudulent electronic transactions using voiceprints of uncommon words
US8762143B2 (en) 2007-05-29 2014-06-24 At&T Intellectual Property Ii, L.P. Method and apparatus for identifying acoustic background environments based on time and speed to enhance automatic speech recognition
US8189880B2 (en) 2007-05-29 2012-05-29 Microsoft Corporation Interactive photo annotation based on face clustering
US8494137B2 (en) 2007-05-31 2013-07-23 Centurylink Intellectual Property Llc System and method for pre-call messaging
TWI338269B (en) 2007-05-31 2011-03-01 Univ Nat Taiwan Teaching materials generation methods and systems, and machine readable medium thereof
US8285206B2 (en) 2007-06-01 2012-10-09 Research In Motion Limited Proximity-dependent events
US8055708B2 (en) 2007-06-01 2011-11-08 Microsoft Corporation Multimedia spaces
JP2008299221A (ja) 2007-06-01 2008-12-11 Fujitsu Ten Ltd Utterance detection device
US8204238B2 (en) 2007-06-08 2012-06-19 Sensory, Inc Systems and methods of sonic communication
US8004493B2 (en) 2007-06-08 2011-08-23 Apple Inc. Methods and systems for providing sensory information to devices and peripherals
US8135577B2 (en) 2007-06-09 2012-03-13 Apple Inc. Braille support
CN101325756B (zh) 2007-06-11 2013-02-13 英华达(上海)电子有限公司 Mobile phone voice recognition device and method for activating mobile phone voice recognition
US20080312928A1 (en) 2007-06-12 2008-12-18 Robert Patrick Goebel Natural language speech recognition calculator
KR20080109322A (ko) 2007-06-12 2008-12-17 엘지전자 주식회사 Method and apparatus for providing a service based on identifying a user's intuitive intent
WO2008151623A1 (en) 2007-06-13 2008-12-18 Widex A/S A system and a method for establishing a conversation group among a number of hearing aids
WO2008151624A1 (en) 2007-06-13 2008-12-18 Widex A/S Hearing aid system establishing a conversation group among hearing aids used by different users
US20080313335A1 (en) 2007-06-15 2008-12-18 Searete Llc, A Limited Liability Corporation Of The State Of Delaware Communicator establishing aspects with context identifying
JP4970160B2 (ja) 2007-06-22 2012-07-04 アルパイン株式会社 In-vehicle system and current-position landmark point guidance method
US8059101B2 (en) 2007-06-22 2011-11-15 Apple Inc. Swipe gestures for touch screen keyboards
US8527262B2 (en) 2007-06-22 2013-09-03 International Business Machines Corporation Systems and methods for automatic semantic role labeling of high morphological text for natural language processing applications
KR101465770B1 (ko) 2007-06-25 2014-11-27 구글 인코포레이티드 Word probability determination
US8027834B2 (en) 2007-06-25 2011-09-27 Nuance Communications, Inc. Technique for training a phonetic decision tree with limited phonetic exceptional terms
US7689421B2 (en) 2007-06-27 2010-03-30 Microsoft Corporation Voice persona service for embedding text-to-speech features into software programs
US8090621B1 (en) 2007-06-27 2012-01-03 Amazon Technologies, Inc. Method and system for associating feedback with recommendation rules
US8065624B2 (en) 2007-06-28 2011-11-22 Panasonic Corporation Virtual keypad systems and methods
US8260809B2 (en) 2007-06-28 2012-09-04 Microsoft Corporation Voice-based search processing
US7861008B2 (en) 2007-06-28 2010-12-28 Apple Inc. Media management and routing within an electronic device
US8190627B2 (en) 2007-06-28 2012-05-29 Microsoft Corporation Machine assisted query formulation
US8041438B2 (en) 2007-06-28 2011-10-18 Apple Inc. Data-driven media management within an electronic device
US9794605B2 (en) 2007-06-28 2017-10-17 Apple Inc. Using time-stamped event entries to facilitate synchronizing data streams
US9632561B2 (en) 2007-06-28 2017-04-25 Apple Inc. Power-gating media decoders to reduce power consumption
US8019606B2 (en) 2007-06-29 2011-09-13 Microsoft Corporation Identification and selection of a software application via speech
KR100930802B1 (ko) 2007-06-29 2009-12-09 엔에이치엔(주) Method and system for controlling a browser using an image
US7962344B2 (en) 2007-06-29 2011-06-14 Microsoft Corporation Depicting a speech user interface via graphical elements
US8290775B2 (en) 2007-06-29 2012-10-16 Microsoft Corporation Pronunciation correction of text-to-speech systems between different spoken languages
JP4424382B2 (ja) 2007-07-04 2010-03-03 ソニー株式会社 Content playback device and automatic content reception method
US7617074B2 (en) 2007-07-06 2009-11-10 Microsoft Corporation Suppressing repeated events and storing diagnostic information
US8219399B2 (en) 2007-07-11 2012-07-10 Garmin Switzerland Gmbh Automated speech recognition (ASR) tiling
US8306235B2 (en) 2007-07-17 2012-11-06 Apple Inc. Method and apparatus for using a sound sensor to adjust the audio output for a device
DE102007033472A1 (de) 2007-07-18 2009-01-29 Siemens Ag Method for speech recognition
US7890493B2 (en) 2007-07-20 2011-02-15 Google Inc. Translating a search query into multiple languages
CN101354746B (zh) 2007-07-23 2011-08-31 夏普株式会社 Character image extraction device and character image extraction method
ITFI20070177A1 (it) 2007-07-26 2009-01-27 Riccardo Vieri System for creating and configuring an advertising campaign based on inserting advertising messages into an exchange of messages, and method for its operation.
EP2183913A4 (en) 2007-07-30 2011-06-22 Lg Electronics Inc DISPLAY ARRANGEMENT AND SPEAKER SYSTEM FOR THE DISPLAY ARRANGEMENT
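CN105045777A (zh) 2007-08-01 2015-11-11 金格软件有限公司 Automatic context-sensitive language correction and enhancement using an Internet corpus
JP2009036999A (ja) 2007-08-01 2009-02-19 Infocom Corp Computer-based dialogue method, dialogue system, computer program, and computer-readable storage medium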
TW200907695A (en) 2007-08-06 2009-02-16 jian-qiang Peng System and method of fast opening network link service
US9342496B2 (en) 2007-08-06 2016-05-17 Apple Inc. Auto-completion of names
US20090043583A1 (en) 2007-08-08 2009-02-12 International Business Machines Corporation Dynamic modification of voice selection based on user specific factors
US7983919B2 (en) 2007-08-09 2011-07-19 At&T Intellectual Property Ii, L.P. System and method for performing speech synthesis with a cache of phoneme sequences
US7983478B2 (en) 2007-08-10 2011-07-19 Microsoft Corporation Hidden markov model based handwriting/calligraphy generation
US8321222B2 (en) 2007-08-14 2012-11-27 Nuance Communications, Inc. Synthesis by generation and concatenation of multi-form segments
JP2009048245A (ja) 2007-08-14 2009-03-05 Konami Digital Entertainment:Kk Input receiving device, area control method, and program
US8478598B2 (en) 2007-08-17 2013-07-02 International Business Machines Corporation Apparatus, system, and method for voice chat transcription
KR101490687B1 (ko) 2007-08-20 2015-02-06 삼성전자주식회사 Method for devices to share secret information in a home network, and apparatus therefor
JP4987623B2 (ja) 2007-08-20 2012-07-25 株式会社東芝 Apparatus and method for interacting with a user by voice
US7788276B2 (en) 2007-08-22 2010-08-31 Yahoo! Inc. Predictive stemming for web search with statistical machine translation models
US8296377B1 (en) 2007-08-22 2012-10-23 Canyon IP Holdings, LLC. Facilitating presentation by mobile device of additional content for a word or phrase upon utterance thereof
US20090055186A1 (en) 2007-08-23 2009-02-26 International Business Machines Corporation Method to voice id tag content to ease reading for visually impaired
US7917355B2 (en) 2007-08-23 2011-03-29 Google Inc. Word detection
US7983902B2 (en) 2007-08-23 2011-07-19 Google Inc. Domain dictionary creation by detection of new topic words using divergence value comparison
KR101359715B1 (ko) 2007-08-24 2014-02-10 삼성전자주식회사 Method and apparatus for providing mobile voice web
US8126274B2 (en) 2007-08-30 2012-02-28 Microsoft Corporation Visual language modeling for image classification
US8190359B2 (en) 2007-08-31 2012-05-29 Proxpro, Inc. Situation-aware personal information management for a mobile device
US20090058823A1 (en) 2007-09-04 2009-03-05 Apple Inc. Virtual Keyboards in Multi-Language Environment
US8683378B2 (en) 2007-09-04 2014-03-25 Apple Inc. Scrolling techniques for user interfaces
US8826132B2 (en) 2007-09-04 2014-09-02 Apple Inc. Methods and systems for navigating content on a portable device
US8683197B2 (en) 2007-09-04 2014-03-25 Apple Inc. Method and apparatus for providing seamless resumption of video playback
US20090106397A1 (en) 2007-09-05 2009-04-23 O'keefe Sean Patrick Method and apparatus for interactive content distribution
US9812023B2 (en) 2007-09-10 2017-11-07 Excalibur Ip, Llc Audible metadata
US20090070109A1 (en) 2007-09-12 2009-03-12 Microsoft Corporation Speech-to-Text Transcription for Personal Communication Devices
US8661340B2 (en) 2007-09-13 2014-02-25 Apple Inc. Input methods for device having multi-language environment
US20090074214A1 (en) 2007-09-13 2009-03-19 Bionica Corporation Assistive listening system with plug in enhancement platform and communication port to download user preferred processing algorithms
US20090076825A1 (en) 2007-09-13 2009-03-19 Bionica Corporation Method of enhancing sound for hearing impaired individuals
JP4990077B2 (ja) 2007-09-14 2012-08-01 株式会社日立製作所 Navigation device
US8713144B2 (en) 2007-09-14 2014-04-29 Ricoh Co., Ltd. Workflow-enabled client
KR100920267B1 (ko) 2007-09-17 2009-10-05 한국전자통신연구원 Voice dialogue analysis system and method thereof
US8706476B2 (en) 2007-09-18 2014-04-22 Ariadne Genomics, Inc. Natural language processing method by analyzing primitive sentences, logical clauses, clause types and verbal blocks
KR100919225B1 (ko) 2007-09-19 2009-09-28 한국전자통신연구원 Apparatus and method for post-processing dialogue errors using multi-level verification in a voice dialogue system
US8583438B2 (en) 2007-09-20 2013-11-12 Microsoft Corporation Unnatural prosody detection in speech synthesis
ATE509345T1 (de) 2007-09-21 2011-05-15 Boeing Co Spoken vehicle control
US8042053B2 (en) 2007-09-24 2011-10-18 Microsoft Corporation Method for making digital documents browseable
US8069051B2 (en) 2007-09-25 2011-11-29 Apple Inc. Zero-gap playback using predictive mixing
US20090083035A1 (en) 2007-09-25 2009-03-26 Ritchie Winson Huang Text pre-processing for text-to-speech generation
US20090079622A1 (en) 2007-09-26 2009-03-26 Broadcom Corporation Sharing of gps information between mobile devices
WO2009041101A1 (ja) 2007-09-28 2009-04-02 Nec Corporation Data classification method and data classification device
US9053089B2 (en) 2007-10-02 2015-06-09 Apple Inc. Part-of-speech tagging using latent analogy
TWI360761B (en) 2007-10-03 2012-03-21 Inventec Corp An electronic apparatus and a method for automatic
US8923491B2 (en) 2007-10-03 2014-12-30 At&T Intellectual Property I, L.P. System and method for connecting to addresses received in spoken communications
US8515095B2 (en) 2007-10-04 2013-08-20 Apple Inc. Reducing annoyance by managing the acoustic noise produced by a device
US7995732B2 (en) 2007-10-04 2011-08-09 At&T Intellectual Property I, Lp Managing audio in a multi-source audio environment
US8462959B2 (en) 2007-10-04 2013-06-11 Apple Inc. Managing acoustic noise produced by a device
US8165886B1 (en) 2007-10-04 2012-04-24 Great Northern Research LLC Speech interface system and method for control and interaction with applications on a computing system
US8036901B2 (en) 2007-10-05 2011-10-11 Sensory, Incorporated Systems and methods of performing speech recognition using sensory inputs of human position
IL186505A0 (en) 2007-10-08 2008-01-20 Excelang Ltd Grammar checker
WO2009049049A1 (en) 2007-10-09 2009-04-16 Language Analytics Llc Method and system for adaptive transliteration
US8139763B2 (en) 2007-10-10 2012-03-20 Spansion Llc Randomized RSA-based cryptographic exponentiation resistant to side channel and fault attacks
US20090097634A1 (en) 2007-10-16 2009-04-16 Ullas Balan Nambiar Method and System for Call Processing
US8594996B2 (en) 2007-10-17 2013-11-26 Evri Inc. NLP-based entity recognition and disambiguation
JP2009098490A (ja) 2007-10-18 2009-05-07 Kddi Corp Speech recognition result editing device, speech recognition device, and computer program
US8209384B2 (en) 2007-10-23 2012-06-26 Yahoo! Inc. Persistent group-based instant messaging
US20090112677A1 (en) 2007-10-24 2009-04-30 Rhett Randolph L Method for automatically developing suggested optimal work schedules from unsorted group and individual task lists
US8606562B2 (en) 2007-10-25 2013-12-10 Blackberry Limited Disambiguated text message retype function
US8000972B2 (en) 2007-10-26 2011-08-16 Sony Corporation Remote controller with speech recognition
US8280885B2 (en) 2007-10-29 2012-10-02 Cornell University System and method for automatically summarizing fine-grained opinions in digital text
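JP2009110300A (ja) 2007-10-30 2009-05-21 Nippon Telegr & Teleph Corp <Ntt> Information home appliance network control device, information home appliance network control system, information home appliance network control method, and program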
US7840447B2 (en) 2007-10-30 2010-11-23 Leonard Kleinrock Pricing and auctioning of bundled items among multiple sellers and buyers
US20090112572A1 (en) 2007-10-30 2009-04-30 Karl Ola Thorn System and method for input of text to an application operating on a device
US9063979B2 (en) 2007-11-01 2015-06-23 Ebay, Inc. Analyzing event streams of user sessions
US7983997B2 (en) 2007-11-02 2011-07-19 Florida Institute For Human And Machine Cognition, Inc. Interactive complex task teaching system that allows for natural language input, recognizes a user's intent, and automatically performs tasks in document object model (DOM) nodes
CN101179754A (zh) 2007-11-08 2008-05-14 深圳市戴文科技有限公司 Method for implementing an interactive service, and mobile terminal
US8065152B2 (en) 2007-11-08 2011-11-22 Demand Media, Inc. Platform for enabling voice commands to resolve phoneme based domain name registrations
DE102008051756A1 (de) 2007-11-12 2009-05-14 Volkswagen Ag Multimodal user interface of a driver assistance system for inputting and presenting information
JP4926004B2 (ja) 2007-11-12 2012-05-09 株式会社リコー Document processing device, document processing method, and document processing program
US7890525B2 (en) 2007-11-14 2011-02-15 International Business Machines Corporation Foreign language abbreviation translation in an instant messaging system
US20090125602A1 (en) 2007-11-14 2009-05-14 International Business Machines Corporation Automatic priority adjustment for incoming emails
US8112280B2 (en) 2007-11-19 2012-02-07 Sensory, Inc. Systems and methods of performing speech recognition with barge-in for use in a bluetooth system
US8294669B2 (en) 2007-11-19 2012-10-23 Palo Alto Research Center Incorporated Link target accuracy in touch-screen mobile devices by layout adjustment
US8620662B2 (en) 2007-11-20 2013-12-31 Apple Inc. Context-aware unit selection
US20150046537A1 (en) 2007-11-21 2015-02-12 Vdoqwest, Inc., A Delaware Corporation Retrieving video annotation metadata using a p2p network and copyright free indexes
US20110246471A1 (en) 2010-04-06 2011-10-06 Selim Shlomo Rakib Retrieving video annotation metadata using a p2p network
CN101448340B (zh) 2007-11-26 2011-12-07 联想(北京)有限公司 Method and system for detecting the state of a mobile terminal, and the mobile terminal
TWI373708B (en) 2007-11-27 2012-10-01 Htc Corp Power management method for handheld electronic device
US8213999B2 (en) 2007-11-27 2012-07-03 Htc Corporation Controlling method and system for handheld communication device and recording medium using the same
JP5212379B2 (ja) 2007-11-28 2013-06-19 富士通株式会社 Metal pipe managed by a wireless tag, and the wireless tag
US8190596B2 (en) 2007-11-28 2012-05-29 International Business Machines Corporation Method for assembly of personalized enterprise information integrators over conjunctive queries
JP2009134409A (ja) 2007-11-29 2009-06-18 Sony Ericsson Mobilecommunications Japan Inc Reminder device, reminder method, reminder program, and portable terminal device
US7805286B2 (en) 2007-11-30 2010-09-28 Bose Corporation System and method for sound system simulation
US8543622B2 (en) 2007-12-07 2013-09-24 Patrick Giblin Method and system for meta-tagging media content and distribution
ATE516658T1 (de) 2007-12-07 2011-07-15 Research In Motion Ltd System and method for event-dependent state activation for a mobile communication device
JP5493267B2 (ja) 2007-12-11 2014-05-14 大日本印刷株式会社 Product search device and product search method
US8140335B2 (en) 2007-12-11 2012-03-20 Voicebox Technologies, Inc. System and method for providing a natural language voice user interface in an integrated voice navigation services environment
US8385588B2 (en) 2007-12-11 2013-02-26 Eastman Kodak Company Recording audio metadata for stored images
US9767681B2 (en) 2007-12-12 2017-09-19 Apple Inc. Handheld electronic devices with remote control functionality and gesture recognition
US8275607B2 (en) 2007-12-12 2012-09-25 Microsoft Corporation Semi-supervised part-of-speech tagging
US20090158423A1 (en) 2007-12-14 2009-06-18 Symbol Technologies, Inc. Locking mobile device cradle
US20090152349A1 (en) 2007-12-17 2009-06-18 Bonev Robert Family organizer communications network system
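KR101300839B1 (ko) 2007-12-18 2013-09-10 삼성전자주식회사 Method and system for expanding voice search queries
JP5327054B2 (ja) 2007-12-18 2013-10-30 日本電気株式会社 Pronunciation variation rule extraction device, pronunciation variation rule extraction method, and pronunciation variation rule extraction program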
US8145196B2 (en) 2007-12-18 2012-03-27 Apple Inc. Creation and management of voicemail greetings for mobile communication devices
US20090164937A1 (en) 2007-12-20 2009-06-25 Alden Alviar Scroll Apparatus and Method for Manipulating Data on an Electronic Device Display
US10002189B2 (en) 2007-12-20 2018-06-19 Apple Inc. Method and apparatus for searching using an active ontology
US8095680B2 (en) 2007-12-20 2012-01-10 Telefonaktiebolaget Lm Ericsson (Publ) Real-time network transport protocol interface method and apparatus
US20090164301A1 (en) 2007-12-21 2009-06-25 Yahoo! Inc. Targeted Ad System Using Metadata
JP5239328B2 (ja) 2007-12-21 2013-07-17 ソニー株式会社 Information processing device and touch operation recognition method
US8019604B2 (en) 2007-12-21 2011-09-13 Motorola Mobility, Inc. Method and apparatus for uniterm discovery and voice-to-voice search on mobile device
WO2009079736A1 (en) 2007-12-21 2009-07-02 Bce Inc. Method and apparatus for interrupting an active telephony session to deliver information to a subscriber
CN101188644A (zh) 2007-12-26 2008-05-28 中国工商银行股份有限公司 Bank voice service method and system
KR20090071077A (ko) 2007-12-27 2009-07-01 엘지전자 주식회사 Navigation device and method for providing turn point information thereof
US8583416B2 (en) 2007-12-27 2013-11-12 Fluential, Llc Robust information extraction from utterances
US8219407B1 (en) 2007-12-27 2012-07-10 Great Northern Research, LLC Method for processing the output of a speech recognizer
US20090172108A1 (en) 2007-12-28 2009-07-02 Surgo Systems and methods for a telephone-accessible message communication system
US8138896B2 (en) 2007-12-31 2012-03-20 Apple Inc. Tactile feedback in an electronic device
US9330720B2 (en) 2008-01-03 2016-05-03 Apple Inc. Methods and apparatus for altering audio output signals
US20090177966A1 (en) 2008-01-06 2009-07-09 Apple Inc. Content Sheet for Media Player
US8405621B2 (en) 2008-01-06 2013-03-26 Apple Inc. Variable rate media playback methods for electronic devices with touch interfaces
US7609179B2 (en) 2008-01-08 2009-10-27 International Business Machines Corporation Method for compressed data with reduced dictionary sizes by coding value prefixes
US8478578B2 (en) 2008-01-09 2013-07-02 Fluential, Llc Mobile speech-to-speech interpretation system
US8232973B2 (en) 2008-01-09 2012-07-31 Apple Inc. Method, device, and graphical user interface providing word recommendations for text input
US20090204243A1 (en) 2008-01-09 2009-08-13 8 Figure, Llc Method and apparatus for creating customized text-to-speech podcasts and videos incorporating associated media
WO2009087860A1 (ja) 2008-01-10 2009-07-16 Brother Kogyo Kabushiki Kaisha Voice dialogue device and computer-readable medium storing a voice dialogue program
US7870133B2 (en) 2008-01-14 2011-01-11 Infosys Technologies Ltd. Method for semantic based storage and retrieval of information
US10176827B2 (en) 2008-01-15 2019-01-08 Verint Americas Inc. Active lab
EP2081185B1 (en) 2008-01-16 2014-11-26 Nuance Communications, Inc. Speech recognition on large lists using fragments
US20090187950A1 (en) 2008-01-18 2009-07-23 At&T Knowledge Ventures, L.P. Audible menu system
US20090187577A1 (en) 2008-01-20 2009-07-23 Aviv Reznik System and Method Providing Audio-on-Demand to a User's Personal Online Device as Part of an Online Audio Community
ITPO20080002A1 (it) 2008-01-22 2009-07-23 Riccardo Vieri System and method for generating contextual advertising during the sending of SMS messages, and related device and interface.
US8175882B2 (en) 2008-01-25 2012-05-08 International Business Machines Corporation Method and system for accent correction
US20120284015A1 (en) 2008-01-28 2012-11-08 William Drewes Method for Increasing the Accuracy of Subject-Specific Statistical Machine Translation (SMT)
US20090192782A1 (en) 2008-01-28 2009-07-30 William Drewes Method for increasing the accuracy of statistical machine translation (SMT)
US8223988B2 (en) 2008-01-29 2012-07-17 Qualcomm Incorporated Enhanced blind source separation algorithm for highly correlated mixtures
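KR101629873B1 (ko) 2008-01-30 2016-06-21 구글 인코포레이티드 Notification of mobile device events
CN101500041A (zh) 2008-01-30 2009-08-05 中兴通讯股份有限公司 Call control method and device
CN101499156A (zh) 2008-01-31 2009-08-05 上海亿动信息技术有限公司 Control method and device for publishing advertisements based on multiple advertisement information publishing devices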
US7840581B2 (en) 2008-02-01 2010-11-23 Realnetworks, Inc. Method and system for improving the quality of deep metadata associated with media content
KR20090085376A (ko) 2008-02-04 2009-08-07 삼성전자주식회사 Service method and apparatus using speech synthesis of text messages
US10269024B2 (en) 2008-02-08 2019-04-23 Outbrain Inc. Systems and methods for identifying and measuring trends in consumer content demand within vertically associated websites and related content
US8000956B2 (en) 2008-02-08 2011-08-16 Xerox Corporation Semantic compatibility checking for automatic correction and discovery of named entities
KR101334066B1 (ko) 2008-02-11 2013-11-29 이점식 Evolving cyber robot system and method for providing the same
US8195656B2 (en) 2008-02-13 2012-06-05 Yahoo, Inc. Social network search
US8099289B2 (en) 2008-02-13 2012-01-17 Sensory, Inc. Voice interface and search for electronic devices including bluetooth headsets and remote systems
US20090210391A1 (en) 2008-02-14 2009-08-20 Hall Stephen G Method and system for automated search for, and retrieval and distribution of, information
JP2009193457A (ja) 2008-02-15 2009-08-27 Oki Electric Ind Co Ltd Information retrieval device, method, and program
US8165884B2 (en) 2008-02-15 2012-04-24 Microsoft Corporation Layered prompting: self-calibrating instructional prompting for verbal interfaces
JP2009193448A (ja) 2008-02-15 2009-08-27 Oki Electric Ind Co Ltd Dialogue system, method, and program
JP2009193532A (ja) 2008-02-18 2009-08-27 Oki Electric Ind Co Ltd Dialogue management device, method, and program, and consciousness extraction system
EP2094032A1 (en) 2008-02-19 2009-08-26 Deutsche Thomson OHG Audio signal, method and apparatus for encoding or transmitting the same and method and apparatus for processing the same
WO2009104126A1 (en) 2008-02-20 2009-08-27 Koninklijke Philips Electronics N.V. Audio device and method of operation therefor
US8065143B2 (en) 2008-02-22 2011-11-22 Apple Inc. Providing text input using speech data and non-speech data
US20090215466A1 (en) 2008-02-22 2009-08-27 Darcy Ahl Mobile phone based system for disabling a cell phone while traveling
US8706474B2 (en) 2008-02-23 2014-04-22 Fair Isaac Corporation Translation of entity names based on source document publication date, and frequency and co-occurrence of the entity names
US8015144B2 (en) 2008-02-26 2011-09-06 Microsoft Corporation Learning transportation modes from raw GPS data
JP4433061B2 (ja) 2008-02-27 2010-03-17 株式会社デンソー Driving support system
US8068604B2 (en) 2008-12-19 2011-11-29 Computer Product Introductions Corporation Method and system for event notifications
EP2096840B1 (en) 2008-02-29 2012-07-04 Research In Motion Limited Visual event notification on a handheld communications device
US9049255B2 (en) 2008-02-29 2015-06-02 Blackberry Limited Visual event notification on a handheld communications device
US20090221274A1 (en) 2008-02-29 2009-09-03 Venkatakrishnan Poornima System, method and device for enabling alternative call handling routines for incoming calls
JP2009205579A (ja) 2008-02-29 2009-09-10 Toshiba Corp Speech translation device and program
US8201109B2 (en) 2008-03-04 2012-06-12 Apple Inc. Methods and graphical user interfaces for editing on a portable multifunction device
US8205157B2 (en) 2008-03-04 2012-06-19 Apple Inc. Methods and graphical user interfaces for conducting searches on a portable multifunction device
US8650507B2 (en) 2008-03-04 2014-02-11 Apple Inc. Selecting of text using gestures
US20090228273A1 (en) 2008-03-05 2009-09-10 Microsoft Corporation Handwriting-based user interface for correction of speech recognition errors
US20090228439A1 (en) 2008-03-07 2009-09-10 Microsoft Corporation Intent-aware search
US8255224B2 (en) 2008-03-07 2012-08-28 Google Inc. Voice recognition grammar selection based on context
US8380512B2 (en) 2008-03-10 2013-02-19 Yahoo! Inc. Navigation using a search engine and phonetic voice recognition
US20090235280A1 (en) 2008-03-12 2009-09-17 Xerox Corporation Event extraction system for electronic messages
US8364486B2 (en) 2008-03-12 2013-01-29 Intelligent Mechatronic Systems Inc. Speech understanding method and system
US20090234655A1 (en) 2008-03-13 2009-09-17 Jason Kwon Mobile electronic device with active speech recognition
CN101246020B (zh) 2008-03-14 2011-05-25 深圳市凯立德科技股份有限公司 Voice broadcast device, navigation system using the device, and method used thereby
US20090234638A1 (en) 2008-03-14 2009-09-17 Microsoft Corporation Use of a Speech Grammar to Recognize Instant Message Input
US20090235176A1 (en) 2008-03-14 2009-09-17 Madhavi Jayanthi Social interaction system for facilitating display of current location of friends and location of businesses of interest
US7958136B1 (en) 2008-03-18 2011-06-07 Google Inc. Systems and methods for identifying similar documents
JP2009223840A (ja) 2008-03-19 2009-10-01 Fujitsu Ltd Schedule management program, schedule management device, and schedule management method
US20090239552A1 (en) 2008-03-24 2009-09-24 Yahoo! Inc. Location-based opportunistic recommendations
CN101547396B (zh) 2008-03-24 2012-07-04 展讯通信(上海)有限公司 Method for rapid location reporting during an emergency call
US8856009B2 (en) 2008-03-25 2014-10-07 Intelligent Mechatronic Systems Inc. Multi-participant, mixed-initiative voice interaction system
WO2009118716A1 (en) 2008-03-27 2009-10-01 Markport Limited Processing of messaging service attributes in communication systems
US8615388B2 (en) 2008-03-28 2013-12-24 Microsoft Corporation Intra-language statistical machine translation
US20090248456A1 (en) 2008-03-28 2009-10-01 Passkey International, Inc. Notifications and reports in a reservation system
EP2107553B1 (en) 2008-03-31 2011-05-18 Harman Becker Automotive Systems GmbH Method for determining barge-in
US7472061B1 (en) 2008-03-31 2008-12-30 International Business Machines Corporation Systems and methods for building a native language phoneme lexicon having native pronunciations of non-native words derived from non-native pronunciations
US8417298B2 (en) 2008-04-01 2013-04-09 Apple Inc. Mounting structures for portable electronic devices
US20090249198A1 (en) 2008-04-01 2009-10-01 Yahoo! Inc. Techniques for input recognition and completion
US8312376B2 (en) 2008-04-03 2012-11-13 Microsoft Corporation Bookmark interpretation service
TWI446780B (zh) 2008-04-03 2014-07-21 Hon Hai Prec Ind Co Ltd Communication device and communication method thereof
US20090253457A1 (en) 2008-04-04 2009-10-08 Apple Inc. Audio signal processing for certification enhancement in a handheld wireless communications device
US8996376B2 (en) 2008-04-05 2015-03-31 Apple Inc. Intelligent text-to-speech conversion
KR101491581B1 (ko) 2008-04-07 2015-02-24 삼성전자주식회사 System and method for correcting spelling errors
KR20090107364A (ko) 2008-04-08 2009-10-13 엘지전자 주식회사 Mobile terminal and menu control method thereof
KR20090107365A (ko) 2008-04-08 2009-10-13 엘지전자 주식회사 Mobile terminal and menu control method thereof
US8958848B2 (en) 2008-04-08 2015-02-17 Lg Electronics Inc. Mobile terminal and menu control method thereof
US7889101B2 (en) 2008-04-14 2011-02-15 Alpine Electronics, Inc Method and apparatus for generating location based reminder message for navigation system
JP4656177B2 (ja) 2008-04-14 2011-03-23 トヨタ自動車株式会社 Navigation device and operation unit display method
US8046222B2 (en) 2008-04-16 2011-10-25 Google Inc. Segmenting words using scaled probabilities
US8490050B2 (en) 2008-04-17 2013-07-16 Microsoft Corporation Automatic generation of user interfaces
US8433778B1 (en) 2008-04-22 2013-04-30 Marvell International Ltd Device configuration
US8666824B2 (en) 2008-04-23 2014-03-04 Dell Products L.P. Digital media content location and purchasing system
US8972432B2 (en) 2008-04-23 2015-03-03 Google Inc. Machine translation using information retrieval
US8407049B2 (en) 2008-04-23 2013-03-26 Cogi, Inc. Systems and methods for conversation enhancement
US8594995B2 (en) 2008-04-24 2013-11-26 Nuance Communications, Inc. Multilingual asynchronous communications of speech messages recorded in digital media files
US8121837B2 (en) 2008-04-24 2012-02-21 Nuance Communications, Inc. Adjusting a speech engine for a mobile computing device based on background noise
US8249857B2 (en) 2008-04-24 2012-08-21 International Business Machines Corporation Multilingual administration of enterprise data with user selected target language translation
US8249858B2 (en) 2008-04-24 2012-08-21 International Business Machines Corporation Multilingual administration of enterprise data with default target languages
US8082148B2 (en) 2008-04-24 2011-12-20 Nuance Communications, Inc. Testing a grammar used in speech recognition for reliability in a plurality of operating environments having different background noise
US8693698B2 (en) 2008-04-30 2014-04-08 Qualcomm Incorporated Method and apparatus to reduce non-linear distortion in mobile computing devices
US8521512B2 (en) 2008-04-30 2013-08-27 Deep Sky Concepts, Inc Systems and methods for natural language communication with a computer
US8400405B2 (en) 2008-05-09 2013-03-19 Research In Motion Limited Handheld electronic device and associated method enabling text input in a language employing non-roman characters
US8254829B1 (en) 2008-05-09 2012-08-28 Sprint Communications Company L.P. Network media service with track delivery adapted to a user cadence
US8219115B1 (en) 2008-05-12 2012-07-10 Google Inc. Location based reminders
US10496753B2 (en) 2010-01-18 2019-12-03 Apple Inc. Automatically adapting user interfaces for hands-free interaction
US20130275899A1 (en) 2010-01-18 2013-10-17 Apple Inc. Application Gateway for Providing Different User Interfaces for Limited Distraction and Non-Limited Distraction Contexts
US9965035B2 (en) 2008-05-13 2018-05-08 Apple Inc. Device, method, and graphical user interface for synchronizing two or more displays
US8174503B2 (en) 2008-05-17 2012-05-08 David H. Cain Touch-based authentication of a mobile device through user generated pattern creation
US8131267B2 (en) 2008-05-19 2012-03-06 Tbm, Llc Interactive voice access and retrieval of information
DE102008024258A1 (de) 2008-05-20 2009-11-26 Siemens Aktiengesellschaft Method for classifying and removing unwanted portions from an utterance in speech recognition
US8285344B2 (en) 2008-05-21 2012-10-09 DP Technologies, Inc. Method and apparatus for adjusting audio for a user environment
US20090292987A1 (en) 2008-05-22 2009-11-26 International Business Machines Corporation Formatting selected content of an electronic document based on analyzed formatting
CN101281745B (zh) 2008-05-23 2011-08-10 深圳市北科瑞声科技有限公司 In-vehicle voice interaction system
US8589161B2 (en) 2008-05-27 2013-11-19 Voicebox Technologies, Inc. System and method for an integrated, multi-modal, multi-device natural language voice services environment
US8082498B2 (en) 2008-05-27 2011-12-20 Appfolio, Inc. Systems and methods for automatic spell checking of dynamically generated web pages
US9305548B2 (en) 2008-05-27 2016-04-05 Voicebox Technologies Corporation System and method for an integrated, multi-modal, multi-device natural language voice services environment
US20130100268A1 (en) 2008-05-27 2013-04-25 University Health Network Emergency detection and response system and method
US20090326938A1 (en) 2008-05-28 2009-12-31 Nokia Corporation Multiword text correction
US8126435B2 (en) 2008-05-30 2012-02-28 Hewlett-Packard Development Company, L.P. Techniques to manage vehicle communications
US8473279B2 (en) 2008-05-30 2013-06-25 Eiman Al-Shammari Lemmatizing, stemming, and query expansion method and system
US8694355B2 (en) 2008-05-30 2014-04-08 Sri International Method and apparatus for automated assistance with task management
US8233366B2 (en) 2008-06-02 2012-07-31 Apple Inc. Context-based error indication methods and apparatus
US20090298529A1 (en) 2008-06-03 2009-12-03 Symbol Technologies, Inc. Audio HTML (aHTML): Audio Access to Web/Data
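JP5377889B2 (ja) 2008-06-05 2013-12-25 日本放送協会 Language processing device and program
JP5136228B2 (ja) 2008-06-05 2013-02-06 日本電気株式会社 Automatic work environment save and restore system, automatic work environment save and restore method, and automatic work environment save and restore program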
US8180630B2 (en) 2008-06-06 2012-05-15 Zi Corporation Of Canada, Inc. Systems and methods for an automated personalized dictionary generator for portable devices
US8140326B2 (en) 2008-06-06 2012-03-20 Fuji Xerox Co., Ltd. Systems and methods for reducing speech intelligibility while preserving environmental sounds
US8831948B2 (en) 2008-06-06 2014-09-09 At&T Intellectual Property I, L.P. System and method for synthetically generated speech describing media content
TWM348993U (en) 2008-06-06 2009-01-11 Ming-Ying Chen Smart voice-controlled device to control home appliance with infrared controller
US8464150B2 (en) 2008-06-07 2013-06-11 Apple Inc. Automatic language identification for dynamic text processing
US9626363B2 (en) 2008-06-08 2017-04-18 Apple Inc. System and method for placeshifting media playback
KR100988397B1 (ko) 2008-06-09 2010-10-19 엘지전자 주식회사 Mobile terminal and text correction method thereof
US20090306967A1 (en) 2008-06-09 2009-12-10 J.D. Power And Associates Automatic Sentiment Analysis of Surveys
US8219397B2 (en) 2008-06-10 2012-07-10 Nuance Communications, Inc. Data processing system for autonomously building speech identification and tagging data
ATE501478T1 (de) 2008-06-11 2011-03-15 Exb Asset Man Gmbh Device and method with improved text input mechanism
US20090313020A1 (en) 2008-06-12 2009-12-17 Nokia Corporation Text-to-speech user interface control
US20090313564A1 (en) 2008-06-12 2009-12-17 Apple Inc. Systems and methods for adjusting playback of media files based on previous usage
KR101513615B1 (ko) 2008-06-12 2015-04-20 엘지전자 주식회사 Mobile terminal and voice recognition method thereof
US8527876B2 (en) 2008-06-12 2013-09-03 Apple Inc. System and methods for adjusting graphical representations of media files based on previous usage
US20090313023A1 (en) 2008-06-17 2009-12-17 Ralph Jones Multilingual text-to-speech system
DE102008028885A1 (de) 2008-06-18 2009-12-31 Epcos Ag Method for tuning a resonance frequency of a piezoelectric component
US9510044B1 (en) 2008-06-18 2016-11-29 Gracenote, Inc. TV content segmentation, categorization and identification and time-aligned applications
CA2727951A1 (en) 2008-06-19 2009-12-23 E-Lane Systems Inc. Communication system with voice mail access and call by spelling functionality
EP2304660A4 (en) 2008-06-19 2013-11-27 Wize Technologies Inc SYSTEM AND METHOD FOR ENHANCING AND SUMMING A FEELING FOR A PRODUCT / SUBJECT
GB2462800A (en) 2008-06-20 2010-02-24 New Voice Media Ltd Monitoring a conversation between an agent and a customer and performing real time analytics on the audio signal for determining future handling of the call
WO2009156438A1 (en) 2008-06-24 2009-12-30 Llinxx Method and system for entering an expression
US9081590B2 (en) 2008-06-24 2015-07-14 Microsoft Technology Licensing, Llc Multimodal input using scratchpad graphical user interface to edit speech text input with keyboard input
US8300801B2 (en) 2008-06-26 2012-10-30 Centurylink Intellectual Property Llc System and method for telephone based noise cancellation
US20110106736A1 (en) 2008-06-26 2011-05-05 Intuitive User Interfaces Ltd. System and method for intuitive user interaction
US8423288B2 (en) 2009-11-30 2013-04-16 Apple Inc. Dynamic alerts for calendar events
US8364481B2 (en) 2008-07-02 2013-01-29 Google Inc. Speech recognition with parallel recognition tasks
WO2010000322A1 (en) 2008-07-03 2010-01-07 Mobiter Dicta Oy Method and device for converting speech
US20100005085A1 (en) 2008-07-03 2010-01-07 Oracle International Corporation Creating relationship maps from enterprise application system data
KR101059631B1 (ko) 2008-07-04 2011-08-25 Yahoo! Inc. Translator with automatic input/output interface and interfacing method thereof
US8478592B2 (en) 2008-07-08 2013-07-02 Nuance Communications, Inc. Enhancing media playback with speech recognition
US8781833B2 (en) 2008-07-17 2014-07-15 Nuance Communications, Inc. Speech recognition semantic classification training
US8521761B2 (en) 2008-07-18 2013-08-27 Google Inc. Transliteration for query expansion
US8166019B1 (en) 2008-07-21 2012-04-24 Sprint Communications Company L.P. Providing suggested actions in response to textual communications
JP5791861B2 (ja) 2008-07-25 2015-10-07 Sharp Corporation Information processing apparatus and information processing method
US20100030549A1 (en) 2008-07-31 2010-02-04 Lee Michael M Mobile device having human language translation capability with positional feedback
US8386485B2 (en) 2008-07-31 2013-02-26 George Mason Intellectual Properties, Inc. Case-based framework for collaborative semantic search
US8041848B2 (en) 2008-08-04 2011-10-18 Apple Inc. Media processing method and device
US8589149B2 (en) 2008-08-05 2013-11-19 Nuance Communications, Inc. Probability-based approach to recognition of user-entered data
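KR100998566B1 (ko) 2008-08-11 2010-12-07 LG Electronics Inc. Method and apparatus for language translation using speech recognition
JPWO2010018796A1 (ja) 2008-08-11 2012-01-26 Asahi Kasei Corporation Exception word dictionary creation device, exception word dictionary creation method and program therefor, and speech recognition device and speech recognition method
JP4577428B2 (ja) 2008-08-11 2010-11-10 Sony Corporation Display device, display method, and program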
US8170969B2 (en) 2008-08-13 2012-05-01 Siemens Aktiengesellschaft Automated computation of semantic similarity of pairs of named entity phrases using electronic document corpora as background knowledge
US8520979B2 (en) 2008-08-19 2013-08-27 Digimarc Corporation Methods and systems for content processing
US8805110B2 (en) 2008-08-19 2014-08-12 Digimarc Corporation Methods and systems for content processing
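JP5459214B2 (ja) 2008-08-20 2014-04-02 NEC Corporation Language model creation device, language model creation method, speech recognition device, speech recognition method, program, and recording medium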
US20100050064A1 (en) 2008-08-22 2010-02-25 At & T Labs, Inc. System and method for selecting a multimedia presentation to accompany text
US8112269B2 (en) 2008-08-25 2012-02-07 Microsoft Corporation Determining utility of a question
WO2010025460A1 (en) 2008-08-29 2010-03-04 O3 Technologies, Llc System and method for speech-to-speech translation
US8117136B2 (en) 2008-08-29 2012-02-14 Hewlett-Packard Development Company, L.P. Relationship management on a mobile computing device
WO2010022561A1 (en) 2008-08-29 2010-03-04 Mediatek (Hefei) Inc. Method for playing voice guidance and navigation device using the same
US8442248B2 (en) 2008-09-03 2013-05-14 Starkey Laboratories, Inc. Systems and methods for managing wireless communication links for hearing assistance devices
US8380959B2 (en) 2008-09-05 2013-02-19 Apple Inc. Memory management system and method
WO2010028169A2 (en) 2008-09-05 2010-03-11 Fotonauts, Inc. Reverse tagging of images in system for managing and sharing digital images
US20100063825A1 (en) 2008-09-05 2010-03-11 Apple Inc. Systems and Methods for Memory Management and Crossfading in an Electronic Device
US8098262B2 (en) 2008-09-05 2012-01-17 Apple Inc. Arbitrary fractional pixel movement
US8768702B2 (en) 2008-09-05 2014-07-01 Apple Inc. Multi-tiered voice feedback in an electronic device
US7936736B2 (en) 2008-09-08 2011-05-03 Proctor Jr James Arthur Enforcing policies in wireless communication using exchanged identities
US8290971B2 (en) 2008-09-09 2012-10-16 Applied Systems, Inc. Method and apparatus for remotely displaying a list by determining a quantity of data to send based on the list size and the display control size
US8898568B2 (en) 2008-09-09 2014-11-25 Apple Inc. Audio user interface
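JP2010066519A (ja) 2008-09-11 2010-03-25 Brother Ind Ltd Voice dialogue device, voice dialogue method, and voice dialogue program
CN101673274A (zh) 2008-09-12 2010-03-17 Shenzhen Futaihong Precision Industry Co., Ltd. Film subtitle retrieval system and method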
US8259082B2 (en) 2008-09-12 2012-09-04 At&T Intellectual Property I, L.P. Multimodal portable communication interface for accessing video content
US8929877B2 (en) 2008-09-12 2015-01-06 Digimarc Corporation Methods and systems for content processing
US8756519B2 (en) 2008-09-12 2014-06-17 Google Inc. Techniques for sharing content on a web page
US8239201B2 (en) 2008-09-13 2012-08-07 At&T Intellectual Property I, L.P. System and method for audibly presenting selected text
US20100071003A1 (en) 2008-09-14 2010-03-18 Modu Ltd. Content personalization
US8775154B2 (en) 2008-09-18 2014-07-08 Xerox Corporation Query translation through dictionary adaptation
US8326622B2 (en) 2008-09-23 2012-12-04 International Business Machines Corporation Dialog filtering for filling out a form
US20100077350A1 (en) 2008-09-25 2010-03-25 Microsoft Corporation Combining elements in presentation of content
JP2010078979A (ja) 2008-09-26 2010-04-08 Nec Infrontia Corp Voice recording device, recorded voice search method, and program
US8712776B2 (en) 2008-09-29 2014-04-29 Apple Inc. Systems and methods for selective text to speech synthesis
US20100082327A1 (en) 2008-09-29 2010-04-01 Apple Inc. Systems and methods for mapping phonemes for text to speech synthesis
US8352272B2 (en) 2008-09-29 2013-01-08 Apple Inc. Systems and methods for text to speech synthesis
US8583418B2 (en) 2008-09-29 2013-11-12 Apple Inc. Systems and methods of detecting language and natural language strings for text to speech synthesis
US8396714B2 (en) 2008-09-29 2013-03-12 Apple Inc. Systems and methods for concatenation of words in text to speech synthesis
US8352268B2 (en) 2008-09-29 2013-01-08 Apple Inc. Systems and methods for selective rate of speech and speech preferences for text to speech synthesis
US8355919B2 (en) 2008-09-29 2013-01-15 Apple Inc. Systems and methods for text normalization for text to speech synthesis
US20100082328A1 (en) 2008-09-29 2010-04-01 Apple Inc. Systems and methods for speech preprocessing in text to speech synthesis
GB2476011B (en) 2008-09-29 2013-05-15 Fisher Rosemount Systems Inc Efficient design and configuration of elements in a process control system
US8411953B2 (en) 2008-09-30 2013-04-02 International Business Machines Corporation Tagging images by determining a set of similar pre-tagged images and extracting prominent tags from that set
JP2010086230A (ja) 2008-09-30 2010-04-15 Sony Corp Information processing apparatus, information processing method, and program
US8401178B2 (en) 2008-09-30 2013-03-19 Apple Inc. Multiple microphone switching and configuration
US8798956B2 (en) 2008-09-30 2014-08-05 Apple Inc. Method and apparatus for surface sensing input device
US9077526B2 (en) 2008-09-30 2015-07-07 Apple Inc. Method and system for ensuring sequential playback of digital media
US20100255858A1 (en) 2008-10-02 2010-10-07 Juhasz Paul R Dead Zone for Wireless Communication Device
US8676904B2 (en) 2008-10-02 2014-03-18 Apple Inc. Electronic devices with voice command and contextual data processing capabilities
US8285545B2 (en) 2008-10-03 2012-10-09 Volkswagen Ag Voice command acquisition system and method
US9200913B2 (en) 2008-10-07 2015-12-01 Telecommunication Systems, Inc. User interface for predictive traffic
US9442648B2 (en) 2008-10-07 2016-09-13 Blackberry Limited Portable electronic device and method of controlling same
US8380497B2 (en) 2008-10-15 2013-02-19 Qualcomm Incorporated Methods and apparatus for noise estimation
US8543913B2 (en) 2008-10-16 2013-09-24 International Business Machines Corporation Identifying and using textual widgets
US20100114887A1 (en) 2008-10-17 2010-05-06 Google Inc. Textual Disambiguation Using Social Connections
US20100131899A1 (en) 2008-10-17 2010-05-27 Darwin Ecosystem Llc Scannable Cloud
US8364487B2 (en) 2008-10-21 2013-01-29 Microsoft Corporation Speech recognition system with display information
US8670546B2 (en) 2008-10-22 2014-03-11 At&T Intellectual Property I, L.P. Systems and methods for providing a personalized communication processing service
US8218397B2 (en) 2008-10-24 2012-07-10 Qualcomm Incorporated Audio source proximity estimation using sensor array for noise reduction
US8190437B2 (en) 2008-10-24 2012-05-29 Nuance Communications, Inc. Speaker verification methods and apparatus
US8577685B2 (en) 2008-10-24 2013-11-05 At&T Intellectual Property I, L.P. System and method for targeted advertising
US8724829B2 (en) 2008-10-24 2014-05-13 Qualcomm Incorporated Systems, methods, apparatus, and computer-readable media for coherence detection
US8645123B2 (en) 2008-10-27 2014-02-04 Microsoft Corporation Image-based semantic distance
US8412529B2 (en) 2008-10-29 2013-04-02 Verizon Patent And Licensing Inc. Method and system for enhancing verbal communication sessions
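TWI487385B (zh) 2008-10-31 2015-06-01 Chi Mei Comm Systems Inc Volume adjustment device and adjustment method thereof
JP5230358B2 (ja) 2008-10-31 2013-07-10 Canon Inc. Information retrieval apparatus, information retrieval method, program, and storage medium
KR101543221B1 (ko) 2008-10-31 2015-08-12 SK Planet Co., Ltd. Method, apparatus, and system for providing multi-user multi-service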
US8788261B2 (en) 2008-11-04 2014-07-22 Saplo Ab Method and system for analyzing text
US8122094B1 (en) 2008-11-05 2012-02-21 Kotab Dominic M Methods for performing an action relating to the scheduling of an event by performing one or more actions based on a response to a message
US20100205628A1 (en) 2009-02-12 2010-08-12 Davis Bruce L Media processing methods and arrangements
US8122353B2 (en) 2008-11-07 2012-02-21 Yahoo! Inc. Composing a message in an online textbox using a non-latin script
EP3258468B1 (en) 2008-11-10 2019-08-21 Google LLC Multisensory speech detection
US8249870B2 (en) 2008-11-12 2012-08-21 Massachusetts Institute Of Technology Semi-automatic speech transcription
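KR20100053149A (ko) 2008-11-12 2010-05-20 Samsung Electronics Co., Ltd. Apparatus and method for generating a schedule in consideration of the circumstances of attendees in a mobile communication terminal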
US8386261B2 (en) 2008-11-14 2013-02-26 Vocollect Healthcare Systems, Inc. Training/coaching system for a voice-enabled work environment
US8832319B2 (en) 2008-11-18 2014-09-09 Amazon Technologies, Inc. Synchronization of digital content
US8584031B2 (en) 2008-11-19 2013-11-12 Apple Inc. Portable touch screen device, method, and graphical user interface for using emoji characters
US8108214B2 (en) 2008-11-19 2012-01-31 Robert Bosch Gmbh System and method for recognizing proper names in dialog systems
US8296124B1 (en) 2008-11-21 2012-10-23 Google Inc. Method and apparatus for detecting incorrectly translated text in a document
US9202455B2 (en) 2008-11-24 2015-12-01 Qualcomm Incorporated Systems, methods, apparatus, and computer program products for enhanced active noise cancellation
US20100131498A1 (en) 2008-11-26 2010-05-27 General Electric Company Automated healthcare information composition and query enhancement
US8442824B2 (en) 2008-11-26 2013-05-14 Nuance Communications, Inc. Device, system, and method of liveness detection utilizing voice biometrics
US8140328B2 (en) 2008-12-01 2012-03-20 At&T Intellectual Property I, L.P. User intention based on N-best list of recognition hypotheses for utterances in a dialog
US8489599B2 (en) 2008-12-02 2013-07-16 Palo Alto Research Center Incorporated Context and activity-driven content delivery and interaction
US20100138680A1 (en) 2008-12-02 2010-06-03 At&T Mobility Ii Llc Automatic display and voice command activation with hand edge sensing
US8117036B2 (en) 2008-12-03 2012-02-14 At&T Intellectual Property I, L.P. Non-disruptive side conversation information retrieval
US8073693B2 (en) 2008-12-04 2011-12-06 At&T Intellectual Property I, L.P. System and method for pronunciation modeling
US8589157B2 (en) 2008-12-05 2013-11-19 Microsoft Corporation Replying to text messages via automated voice search techniques
JP5257311B2 (ja) 2008-12-05 2013-08-07 Sony Corporation Information processing apparatus and information processing method
US8054180B1 (en) 2008-12-08 2011-11-08 Amazon Technologies, Inc. Location aware reminders
US20100185949A1 (en) 2008-12-09 2010-07-22 Denny Jaeger Method for using gesture objects for computer control
EP2196989B1 (en) 2008-12-10 2012-06-27 Nuance Communications, Inc. Grammar and template-based speech recognition of spoken utterances
US20100153448A1 (en) 2008-12-12 2010-06-17 International Business Machines Corporation Persistent search notification
US8121842B2 (en) 2008-12-12 2012-02-21 Microsoft Corporation Audio output of a document from mobile device
US8208609B2 (en) 2008-12-15 2012-06-26 Centurylink Intellectual Property Llc System and method for voice activated dialing from a home phone
US8160881B2 (en) 2008-12-15 2012-04-17 Microsoft Corporation Human-assisted pronunciation generation
DE112009003645B4 (de) 2008-12-16 2014-05-15 Mitsubishi Electric Corporation Navigation device
US8447588B2 (en) 2008-12-18 2013-05-21 Palo Alto Research Center Incorporated Region-matching transducers for natural language processing
US9323854B2 (en) 2008-12-19 2016-04-26 Intel Corporation Method, apparatus and system for location assisted translation
EP2368199B1 (en) 2008-12-22 2018-10-31 Google LLC Asynchronous distributed de-duplication for replicated content addressable storage clusters
US8635068B2 (en) 2008-12-23 2014-01-21 At&T Intellectual Property I, L.P. System and method for recognizing speech with dialect grammars
JP5326892B2 (ja) 2008-12-26 2013-10-30 Fujitsu Limited Information processing apparatus, program, and method for generating an acoustic model
US8447609B2 (en) 2008-12-31 2013-05-21 Intel Corporation Adjustment of temporal acoustical characteristics
CA2748695C (en) 2008-12-31 2017-11-07 Bce Inc. System and method for unlocking a device
US8456420B2 (en) 2008-12-31 2013-06-04 Intel Corporation Audible list traversal
KR101543326B1 (ko) 2009-01-05 2015-08-10 Samsung Electronics Co., Ltd. System on chip and driving method thereof
EP2205010A1 (en) 2009-01-06 2010-07-07 BRITISH TELECOMMUNICATIONS public limited company Messaging
TW201027515A (en) 2009-01-06 2010-07-16 High Tech Comp Corp Electronic event-recording device and method thereof
US8332205B2 (en) 2009-01-09 2012-12-11 Microsoft Corporation Mining transliterations for out-of-vocabulary query terms
US10088976B2 (en) 2009-01-15 2018-10-02 Em Acquisition Corp., Inc. Systems and methods for multiple voice document narration
US8364488B2 (en) 2009-01-15 2013-01-29 K-Nfb Reading Technology, Inc. Voice models for document narration
US20100180218A1 (en) 2009-01-15 2010-07-15 International Business Machines Corporation Editing metadata in a social network
EP2211336B1 (en) 2009-01-23 2014-10-08 Harman Becker Automotive Systems GmbH Improved speech input using navigation information
US8213911B2 (en) 2009-01-28 2012-07-03 Virtual Hold Technology Llc Mobile communication device for establishing automated call back
US8200489B1 (en) 2009-01-29 2012-06-12 The United States Of America As Represented By The Secretary Of The Navy Multi-resolution hidden markov model using class specific features
US8862252B2 (en) 2009-01-30 2014-10-14 Apple Inc. Audio user interface for displayless electronic device
US9070282B2 (en) 2009-01-30 2015-06-30 Altorr Corp. Smartphone control of electrical devices
US20100197359A1 (en) 2009-01-30 2010-08-05 Harris Technology, Llc Automatic Detection of Wireless Phone
US20110307491A1 (en) 2009-02-04 2011-12-15 Fisk Charles M Digital photo organizing and tagging method
US9489131B2 (en) 2009-02-05 2016-11-08 Apple Inc. Method of presenting a web page for accessibility browsing
US8254972B2 (en) 2009-02-13 2012-08-28 Sony Mobile Communications Ab Device and method for handling messages
US8428758B2 (en) 2009-02-16 2013-04-23 Apple Inc. Dynamic audio ducking
CN105930311B (zh) 2009-02-18 2018-10-09 Google LLC Method, mobile device, and readable medium for performing actions associated with a rendered document
US8032602B2 (en) 2009-02-18 2011-10-04 International Business Machines Corporation Prioritization of recipient email messages
US8326637B2 (en) 2009-02-20 2012-12-04 Voicebox Technologies, Inc. System and method for processing multi-modal device interactions in a natural language voice services environment
EP2401711A4 (en) 2009-02-25 2016-12-28 Miri Systems Llc Payment system and method
US8155630B2 (en) 2009-02-27 2012-04-10 Research In Motion Limited Communications system providing mobile device notification based upon personal interest information and calendar events
US20100223131A1 (en) 2009-02-27 2010-09-02 Research In Motion Limited Communications system providing mobile device notification based upon contact web pages and related methods
KR101041039B1 (ko) 2009-02-27 2011-06-14 Korea University Research and Business Foundation Method and apparatus for spatio-temporal voice activity detection using audio and video information
US9646603B2 (en) 2009-02-27 2017-05-09 Longsand Limited Various apparatus and methods for a speech recognition system
EP2224705B1 (en) 2009-02-27 2012-02-01 Research In Motion Limited Mobile wireless communications device with speech to text conversion and related method
US8280434B2 (en) 2009-02-27 2012-10-02 Research In Motion Limited Mobile wireless communications device for hearing and/or speech impaired user
US9280971B2 (en) 2009-02-27 2016-03-08 Blackberry Limited Mobile wireless communications device with speech to text conversion and related methods
US9171284B2 (en) 2009-03-02 2015-10-27 Microsoft Technology Licensing, Llc Techniques to restore communications sessions for applications having conversation and meeting environments
US20100229100A1 (en) 2009-03-03 2010-09-09 Sprint Spectrum L.P. Methods and Systems for Storing and Accessing Application History
US8239333B2 (en) 2009-03-03 2012-08-07 Microsoft Corporation Media tag recommendation technologies
US8805439B2 (en) 2009-03-05 2014-08-12 Lg Electronics Inc. Mobile terminal and method for controlling the same
JP5138810B2 (ja) 2009-03-06 2013-02-06 Sharp Corporation Bookmark using device, bookmark creating device, bookmark sharing system, control method, control program, and recording medium
US8605039B2 (en) 2009-03-06 2013-12-10 Zimpl Ab Text input
US20100225809A1 (en) 2009-03-09 2010-09-09 Sony Corporation And Sony Electronics Inc. Electronic book with enhanced features
US8380507B2 (en) 2009-03-09 2013-02-19 Apple Inc. Systems and methods for determining the language to use for speech generated by a text to speech engine
US8165321B2 (en) 2009-03-10 2012-04-24 Apple Inc. Intelligent clip mixing
WO2010105245A2 (en) 2009-03-12 2010-09-16 Exbiblio B.V. Automatically providing content associated with captured information, such as information captured in real-time
US8417526B2 (en) 2009-03-13 2013-04-09 Adacel, Inc. Speech recognition learning system and method
US8286106B2 (en) 2009-03-13 2012-10-09 Oracle America, Inc. System and method for interacting with status information on a touch screen device
US8370736B2 (en) 2009-03-16 2013-02-05 Apple Inc. Methods and graphical user interfaces for editing on a multifunction device with a touch screen display
US20100235780A1 (en) 2009-03-16 2010-09-16 Westerman Wayne C System and Method for Identifying Words Based on a Sequence of Keyboard Events
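CN102439540B (zh) 2009-03-19 2015-04-08 Google Inc. Input method editor
JP2010224194A (ja) 2009-03-23 2010-10-07 Sony Corp Speech recognition device and speech recognition method, language model generation device and language model generation method, and computer program
JP5419136B2 (ja) 2009-03-24 2014-02-19 Alpine Electronics, Inc. Audio output device
KR101078864B1 (ko) 2009-03-26 2011-11-02 Korea Advanced Institute of Science and Technology System and method for analyzing changes in query/document topic categories, and query-expansion-based information retrieval system and method using the same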
US9424246B2 (en) 2009-03-30 2016-08-23 Touchtype Ltd. System and method for inputting text into electronic devices
GB0905457D0 (en) 2009-03-30 2009-05-13 Touchtype Ltd System and method for inputting text into electronic devices
US9189472B2 (en) 2009-03-30 2015-11-17 Touchtype Limited System and method for inputting text into small screen devices
US10191654B2 (en) 2009-03-30 2019-01-29 Touchtype Limited System and method for inputting text into electronic devices
US20100250599A1 (en) 2009-03-30 2010-09-30 Nokia Corporation Method and apparatus for integration of community-provided place data
GB201016385D0 (en) 2010-09-29 2010-11-10 Touchtype Ltd System and method for inputting text into electronic devices
US8798255B2 (en) 2009-03-31 2014-08-05 Nice Systems Ltd Methods and apparatus for deep interaction analysis
US8166032B2 (en) 2009-04-09 2012-04-24 MarketChorus, Inc. System and method for sentiment-based text classification and relevancy ranking
US8805823B2 (en) 2009-04-14 2014-08-12 Sri International Content processing systems and methods
KR101537706B1 (ko) 2009-04-16 2015-07-20 LG Electronics Inc. Mobile terminal and control method thereof
US8209174B2 (en) 2009-04-17 2012-06-26 Saudi Arabian Oil Company Speaker verification system
US20110065456A1 (en) 2009-04-20 2011-03-17 Brennan Joseph P Cellular device deactivation system
US9761219B2 (en) 2009-04-21 2017-09-12 Creative Technology Ltd System and method for distributed text-to-speech synthesis and intelligibility
US8660970B1 (en) 2009-04-23 2014-02-25 The Boeing Company Passive learning and autonomously interactive system for leveraging user knowledge in networked environments
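JP5911796B2 (ja) 2009-04-30 2016-04-27 Samsung Electronics Co., Ltd. Apparatus and method for inferring user intention using multimodal information
KR101032792B1 (ko) 2009-04-30 2011-05-06 Kolon Corporation Polyester fabric for airbag and method of manufacturing the same
KR101581883B1 (ko) 2009-04-30 2016-01-11 Samsung Electronics Co., Ltd. Apparatus and method for voice detection using motion information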
US9298823B2 (en) 2009-05-08 2016-03-29 International Business Machines Corporation Identifying core content based on citations
EP2428028A4 (en) 2009-05-08 2014-07-02 Obdedge Llc Systems, methods and devices for policy-based control and monitoring the use of mobile devices by vehicle operators
WO2010131256A1 (en) 2009-05-13 2010-11-18 Rajesh Mehra A keyboard for linguistic scripts
US20100293460A1 (en) 2009-05-14 2010-11-18 Budelli Joe G Text selection method and system based on gestures
US8583511B2 (en) 2009-05-19 2013-11-12 Bradley Marshall Hendrickson Systems and methods for storing customer purchasing and preference data and enabling a customer to pre-register orders and events
US8498857B2 (en) 2009-05-19 2013-07-30 Tata Consultancy Services Limited System and method for rapid prototyping of existing speech recognition solutions in different languages
KR101577607B1 (ko) 2009-05-22 2015-12-15 Samsung Electronics Co., Ltd. Apparatus and method for language expression based on context and intent awareness
US20100302056A1 (en) 2009-05-27 2010-12-02 Geodelic, Inc. Location discovery system and method
US8577543B2 (en) 2009-05-28 2013-11-05 Intelligent Mechatronic Systems Inc. Communication system with personal information management and remote vehicle monitoring and control features
US8369822B2 (en) 2009-05-28 2013-02-05 At&T Intellectual Property I, Lp Systems and methods for providing emergency callback procedures
US20120310652A1 (en) 2009-06-01 2012-12-06 O'sullivan Daniel Adaptive Human Computer Interface (AAHCI)
US8095119B2 (en) 2009-06-02 2012-01-10 Microsoft Corporation In-call contact information display
EP2259252B1 (en) 2009-06-02 2012-08-01 Nuance Communications, Inc. Speech recognition method for selecting a combination of list elements via a speech input
US8560313B2 (en) 2010-05-13 2013-10-15 General Motors Llc Transient noise rejection for speech recognition
US9858925B2 (en) 2009-06-05 2018-01-02 Apple Inc. Using context information to facilitate processing of commands in a virtual assistant
US10241752B2 (en) 2011-09-30 2019-03-26 Apple Inc. Interface for a virtual digital assistant
US10540976B2 (en) 2009-06-05 2020-01-21 Apple Inc. Contextual voice commands
US10706373B2 (en) 2011-06-03 2020-07-07 Apple Inc. Performing actions associated with task items that represent tasks to perform
US10241644B2 (en) 2011-06-03 2019-03-26 Apple Inc. Actionable reminder entries
US20120327009A1 (en) 2009-06-07 2012-12-27 Apple Inc. Devices, methods, and graphical user interfaces for accessibility using a touch-sensitive surface
KR101562792B1 (ko) 2009-06-10 2015-10-23 Samsung Electronics Co., Ltd. Apparatus and method for providing a goal prediction interface
US8412531B2 (en) 2009-06-10 2013-04-02 Microsoft Corporation Touch anywhere to speak
JP2010287063A (ja) 2009-06-11 2010-12-24 Zenrin Datacom Co Ltd Information providing device, information providing system, and program
US8484027B1 (en) 2009-06-12 2013-07-09 Skyreader Media Inc. Method for live remote narration of a digital book
US8290777B1 (en) 2009-06-12 2012-10-16 Amazon Technologies, Inc. Synchronizing the playing and displaying of digital content
US8533622B2 (en) 2009-06-17 2013-09-10 Microsoft Corporation Integrating digital book and zoom interface displays
US8306238B2 (en) 2009-06-17 2012-11-06 Sony Ericsson Mobile Communications Ab Method and circuit for controlling an output of an audio signal of a battery-powered device
US10353967B2 (en) 2009-06-22 2019-07-16 Microsoft Technology Licensing, Llc Assigning relevance weights based on temporal dynamics
US20100324709A1 (en) 2009-06-22 2010-12-23 Tree Of Life Publishing E-book reader with voice annotation
US9215212B2 (en) 2009-06-22 2015-12-15 Citrix Systems, Inc. Systems and methods for providing a visualizer for rules of an application firewall
US11012732B2 (en) 2009-06-25 2021-05-18 DISH Technologies L.L.C. Voice enabled media presentation systems and methods
US20100330909A1 (en) 2009-06-25 2010-12-30 Blueant Wireless Pty Limited Voice-enabled walk-through pairing of telecommunications devices
US20100332236A1 (en) 2009-06-25 2010-12-30 Blueant Wireless Pty Limited Voice-triggered operation of electronic devices
US9754224B2 (en) 2009-06-26 2017-09-05 International Business Machines Corporation Action based to-do list
US8219930B2 (en) 2009-06-26 2012-07-10 Verizon Patent And Licensing Inc. Radial menu display systems and methods
US8527278B2 (en) 2009-06-29 2013-09-03 Abraham Ben David Intelligent home automation
US20100332224A1 (en) 2009-06-30 2010-12-30 Nokia Corporation Method and apparatus for converting text to audio and tactile output
US9431006B2 (en) 2009-07-02 2016-08-30 Apple Inc. Methods and apparatuses for automatic speech recognition
US20110002487A1 (en) 2009-07-06 2011-01-06 Apple Inc. Audio Channel Assignment for Audio Output in a Movable Device
US8943423B2 (en) 2009-07-07 2015-01-27 International Business Machines Corporation User interface indicators for changed user interface elements
KR101083540B1 (ko) 2009-07-08 2011-11-14 NHN Corp. System and method for converting Chinese characters into native-language pronunciation sequences using a statistical method
US8344847B2 (en) 2009-07-09 2013-01-01 Medtronic Minimed, Inc. Coordination of control commands in a medical device system having at least one therapy delivery device and at least one wireless controller device
US8892439B2 (en) 2009-07-15 2014-11-18 Microsoft Corporation Combination and federation of local and remote speech recognition
US20110016150A1 (en) 2009-07-20 2011-01-20 Engstroem Jimmy System and method for tagging multiple digital images
US8213962B2 (en) 2009-07-21 2012-07-03 Verizon Patent And Licensing Inc. Vehicle computer link to mobile phone
US7953679B2 (en) 2009-07-22 2011-05-31 Xerox Corporation Scalable indexing for layout based document retrieval and ranking
CA2761700C (en) 2009-07-24 2014-12-02 Research In Motion Limited Method and apparatus for a touch-sensitive display
US9117448B2 (en) 2009-07-27 2015-08-25 Cisco Technology, Inc. Method and system for speech recognition using social networks
US9489577B2 (en) 2009-07-27 2016-11-08 Cxense Asa Visual similarity for video content
US8239129B2 (en) 2009-07-27 2012-08-07 Robert Bosch Gmbh Method and system for improving speech recognition accuracy by use of geographic information
US20110029616A1 (en) 2009-07-29 2011-02-03 Guanming Wang Unified auto-reply to an email coming from unified messaging service
US8875219B2 (en) 2009-07-30 2014-10-28 Blackberry Limited Apparatus and method for controlled sharing of personal information
JP2011033874A (ja) 2009-08-03 2011-02-17 Alpine Electronics Inc Multilingual speech recognition device and multilingual speech recognition dictionary creation method
US8340312B2 (en) 2009-08-04 2012-12-25 Apple Inc. Differential mode noise cancellation with active real-time control for microphone-speaker combinations used in two way audio communications
US8160877B1 (en) 2009-08-06 2012-04-17 Narus, Inc. Hierarchical real-time speaker recognition for biometric VoIP verification and targeting
US20110047072A1 (en) 2009-08-07 2011-02-24 Visa U.S.A. Inc. Systems and Methods for Propensity Analysis and Validation
US8233919B2 (en) 2009-08-09 2012-07-31 Hntb Holdings Ltd. Intelligently providing user-specific transportation-related information
JP5201599B2 (ja) 2009-08-11 2013-06-05 NEC Casio Mobile Communications, Ltd. Terminal device and program
US20110040707A1 (en) 2009-08-12 2011-02-17 Ford Global Technologies, Llc Intelligent music selection in vehicles
US8768313B2 (en) 2009-08-17 2014-07-01 Digimarc Corporation Methods and systems for image or audio recognition processing
US8626133B2 (en) 2009-08-19 2014-01-07 Cisco Technology, Inc. Matching a location of a contact with a task location
US8654952B2 (en) 2009-08-20 2014-02-18 T-Mobile Usa, Inc. Shareable applications on telecommunications devices
KR101496649B1 (ko) 2009-08-21 2015-03-02 Samsung Electronics Co., Ltd. Method and apparatus for sharing functions of an external device through a complex network
US9277021B2 (en) 2009-08-21 2016-03-01 Avaya Inc. Sending a user associated telecommunication address
EP2629211A1 (en) 2009-08-21 2013-08-21 Mikko Kalervo Väänänen Method and means for data searching and language translation
JP2011045005A (ja) 2009-08-24 2011-03-03 Fujitsu Toshiba Mobile Communications Ltd Mobile phone
EP2471064A4 (en) 2009-08-25 2014-01-08 Univ Nanyang Tech Method and system for reconstructing speech from an input signal comprising whispers
US20110054647A1 (en) 2009-08-26 2011-03-03 Nokia Corporation Network service for an audio interface unit
JP2011048671A (ja) 2009-08-27 2011-03-10 Kyocera Corp Input device and control method of input device
CN101996631B (zh) 2009-08-28 2014-12-03 International Business Machines Corporation Method and apparatus for aligning text
US20110238407A1 (en) 2009-08-31 2011-09-29 O3 Technologies, Llc Systems and methods for speech-to-speech translation
WO2011028844A2 (en) 2009-09-02 2011-03-10 Sri International Method and apparatus for tailoring the output of an intelligent automated assistant to a user
US8451238B2 (en) 2009-09-02 2013-05-28 Amazon Technologies, Inc. Touch-screen user interface
US8624851B2 (en) 2009-09-02 2014-01-07 Amazon Technologies, Inc. Touch-screen user interface
US8675084B2 (en) 2009-09-04 2014-03-18 Apple Inc. Systems and methods for remote camera control
US9031834B2 (en) 2009-09-04 2015-05-12 Nuance Communications, Inc. Speech enhancement techniques on the power spectrum
TW201110108A (en) 2009-09-04 2011-03-16 Chunghwa Telecom Co Ltd Voice noise elimination method for microphone array
US20120265535A1 (en) 2009-09-07 2012-10-18 Donald Ray Bryant-Rich Personal voice operated reminder system
US8560300B2 (en) 2009-09-09 2013-10-15 International Business Machines Corporation Error correction using fact repositories
US8788267B2 (en) 2009-09-10 2014-07-22 Mitsubishi Electric Research Laboratories, Inc. Multi-purpose contextual control
US8321527B2 (en) 2009-09-10 2012-11-27 Tribal Brands System and method for tracking user location and associated activity and responsively providing mobile device updates
WO2011032060A1 (en) 2009-09-11 2011-03-17 Telenav, Inc. Location based system with contextual contact manager mechanism and method of operation thereof
US20110066468A1 (en) 2009-09-11 2011-03-17 International Business Machines Corporation Dynamic event planning through location awareness
US10587833B2 (en) 2009-09-16 2020-03-10 Disney Enterprises, Inc. System and method for automated network search and companion display of result relating to audio-video metadata
US8972878B2 (en) 2009-09-21 2015-03-03 Avaya Inc. Screen icon manipulation by context and frequency of Use
US8768308B2 (en) 2009-09-29 2014-07-01 Deutsche Telekom Ag Apparatus and method for creating and managing personal schedules via context-sensing and actuation
US9111538B2 (en) 2009-09-30 2015-08-18 T-Mobile Usa, Inc. Genius button secondary commands
KR20110036385A (ko) 2009-10-01 2011-04-07 Samsung Electronics Co., Ltd. Apparatus and method for analyzing user intention
TW201113741A (en) 2009-10-01 2011-04-16 Htc Corp Lock-state switching method, electronic apparatus and computer program product
US20110083079A1 (en) 2009-10-02 2011-04-07 International Business Machines Corporation Apparatus, system, and method for improved type-ahead functionality in a type-ahead field based on activity of a user within a user interface
US9338274B2 (en) 2009-10-02 2016-05-10 Blackberry Limited Method of interacting with electronic devices in a locked state and handheld electronic device configured to permit interaction when in a locked state
JP5473520B2 (ja) 2009-10-06 2014-04-16 キヤノン株式会社 Input device and control method therefor
US7809550B1 (en) 2009-10-08 2010-10-05 Joan Barry Barrows System for reading chinese characters in seconds
US20110087685A1 (en) 2009-10-09 2011-04-14 Microsoft Corporation Location-based service middleware
CN101673544B (zh) 2009-10-10 2012-07-04 上海电虹软件有限公司 Cross-monitoring method and system based on voiceprint recognition and location tracking
US8335689B2 (en) 2009-10-14 2012-12-18 Cogi, Inc. Method and system for efficient management of speech transcribers
US8611876B2 (en) 2009-10-15 2013-12-17 Larry Miller Configurable phone with interactive voice response engine
US8510103B2 (en) 2009-10-15 2013-08-13 Paul Angott System and method for voice recognition
US8255217B2 (en) 2009-10-16 2012-08-28 At&T Intellectual Property I, Lp Systems and methods for creating and using geo-centric language models
US8451112B2 (en) 2009-10-19 2013-05-28 Qualcomm Incorporated Methods and apparatus for estimating departure time based on known calendar events
US8332748B1 (en) 2009-10-22 2012-12-11 Google Inc. Multi-directional auto-complete menu
US8554537B2 (en) 2009-10-23 2013-10-08 Samsung Electronics Co., Ltd Method and device for transliteration
US8326624B2 (en) 2009-10-26 2012-12-04 International Business Machines Corporation Detecting and communicating biometrics of recorded voice during transcription process
US20110099507A1 (en) 2009-10-28 2011-04-28 Google Inc. Displaying a collection of interactive elements that trigger actions directed to an item
US9197736B2 (en) 2009-12-31 2015-11-24 Digimarc Corporation Intuitive computing methods and systems
US8386574B2 (en) 2009-10-29 2013-02-26 Xerox Corporation Multi-modality classification for one-class classification in social networks
US8315617B2 (en) 2009-10-31 2012-11-20 Btpatent Llc Controlling mobile device functions
US8832205B2 (en) 2009-11-02 2014-09-09 Lextine Software, Llc System and method for extracting calendar events from free-form email
US20120137367A1 (en) 2009-11-06 2012-05-31 Cataphora, Inc. Continuous anomaly detection based on behavior modeling and heterogeneous information analysis
WO2011055410A1 (ja) 2009-11-06 2011-05-12 株式会社 東芝 Speech recognition device
US9502025B2 (en) 2009-11-10 2016-11-22 Voicebox Technologies Corporation System and method for providing a natural language content dedication service
US20110111724A1 (en) 2009-11-10 2011-05-12 David Baptiste Method and apparatus for combating distracted driving
US9171541B2 (en) 2009-11-10 2015-10-27 Voicebox Technologies Corporation System and method for hybrid processing in a natural language voice services environment
US8527859B2 (en) 2009-11-10 2013-09-03 Dulcetta, Inc. Dynamic audio playback of soundtracks for electronic visual works
US8358747B2 (en) 2009-11-10 2013-01-22 International Business Machines Corporation Real time automatic caller speech profiling
US8321209B2 (en) 2009-11-10 2012-11-27 Research In Motion Limited System and method for low overhead frequency domain voice authentication
CN102860039B (zh) 2009-11-12 2016-10-19 罗伯特·亨利·弗莱特 Hands-free phone and/or microphone arrays and methods and systems of using them
US8732180B2 (en) 2009-11-12 2014-05-20 Apple Inc. Recommending media items
US20130166303A1 (en) 2009-11-13 2013-06-27 Adobe Systems Incorporated Accessing media data using metadata repository
US8712759B2 (en) 2009-11-13 2014-04-29 Clausal Computing Oy Specializing disambiguation of a natural language expression
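KR20110052863A (ko) 2009-11-13 2011-05-19 삼성전자주식회사 Mobile device and method for generating a control signal thereof
TWI391915B (zh) 2009-11-17 2013-04-01 Inst Information Industry Apparatus and method for building a phonetic variation model, and speech recognition system and method using the apparatus
KR101595029B1 (ko) 2009-11-18 2016-02-17 엘지전자 주식회사 Mobile terminal and control method thereof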
US8358752B2 (en) 2009-11-19 2013-01-22 At&T Mobility Ii Llc User profile based speech to text conversion for visual voice mail
US8630971B2 (en) 2009-11-20 2014-01-14 Indian Institute Of Science System and method of using Multi Pattern Viterbi Algorithm for joint decoding of multiple patterns
US8358749B2 (en) 2009-11-21 2013-01-22 At&T Intellectual Property I, L.P. System and method to search a media content database based on voice input data
KR101960835B1 (ko) 2009-11-24 2019-03-21 삼성전자주식회사 Schedule management system using a conversational robot and method thereof
US20110153330A1 (en) 2009-11-27 2011-06-23 i-SCROLL System and method for rendering text synchronized audio
US8731901B2 (en) 2009-12-02 2014-05-20 Content Savvy, Inc. Context aware back-transliteration and translation of names and common phrases using web resources
JP5844274B2 (ja) 2009-12-04 2016-01-13 ティヴォ インク Multifunction multimedia device
US8396888B2 (en) 2009-12-04 2013-03-12 Google Inc. Location-based searching using a search area that corresponds to a geographical location of a computing device
US8224300B2 (en) 2009-12-11 2012-07-17 Alpine Electronics, Inc. Method and apparatus to enhance navigation user experience for a smart phone device
US8543917B2 (en) 2009-12-11 2013-09-24 Nokia Corporation Method and apparatus for presenting a first-person world view of content
KR101622111B1 (ko) 2009-12-11 2016-05-18 삼성전자 주식회사 Dialogue system and dialogue method thereof
US8812990B2 (en) 2009-12-11 2014-08-19 Nokia Corporation Method and apparatus for presenting a first person world view of content
US9766089B2 (en) 2009-12-14 2017-09-19 Nokia Technologies Oy Method and apparatus for correlating and navigating between a live image and a prerecorded panoramic image
US20110144857A1 (en) 2009-12-14 2011-06-16 Theodore Charles Wingrove Anticipatory and adaptive automobile hmi
US8892443B2 (en) 2009-12-15 2014-11-18 At&T Intellectual Property I, L.P. System and method for combining geographic metadata in automatic speech recognition language and acoustic models
KR101211796B1 (ko) 2009-12-16 2012-12-13 포항공과대학교 산학협력단 Foreign language learning apparatus and method for providing the same
US8341037B2 (en) 2009-12-18 2012-12-25 Apple Inc. Mixed source media playback
US20110154193A1 (en) 2009-12-21 2011-06-23 Nokia Corporation Method and Apparatus for Text Input
US9100809B2 (en) 2009-12-21 2015-08-04 Julia Olincy Automatic response option mobile system for responding to incoming texts or calls or both
US8385982B2 (en) 2009-12-21 2013-02-26 At&T Intellectual Property I, L.P. Controlling use of a communications device in accordance with motion of the device
US8805711B2 (en) 2009-12-22 2014-08-12 International Business Machines Corporation Two-layer data architecture for reservation management systems
KR20110072847A (ko) 2009-12-23 2011-06-29 삼성전자주식회사 Dialogue management system and method for handling open user intent
EP2339576B1 (en) 2009-12-23 2019-08-07 Google LLC Multi-modal input on an electronic device
US20110161309A1 (en) 2009-12-29 2011-06-30 Lx1 Technology Limited Method Of Sorting The Result Set Of A Search Engine
US8988356B2 (en) 2009-12-31 2015-03-24 Google Inc. Touch sensor and touchscreen user input combination
US8479107B2 (en) 2009-12-31 2013-07-02 Nokia Corporation Method and apparatus for fluid graphical user interface
US8494852B2 (en) 2010-01-05 2013-07-23 Google Inc. Word-level correction of speech input
US20110167350A1 (en) 2010-01-06 2011-07-07 Apple Inc. Assist Features For Content Display Device
US8600743B2 (en) 2010-01-06 2013-12-03 Apple Inc. Noise profile determination for voice-related feature
US8346562B2 (en) 2010-01-06 2013-01-01 Csr Technology Inc. Method and apparatus for voice controlled operation of a media player
US8381107B2 (en) 2010-01-13 2013-02-19 Apple Inc. Adaptive audio feedback system and method
US8311838B2 (en) 2010-01-13 2012-11-13 Apple Inc. Devices and methods for identifying a prompt corresponding to a voice input in a sequence of prompts
US8334842B2 (en) 2010-01-15 2012-12-18 Microsoft Corporation Recognizing user intent in motion capture system
US20110179372A1 (en) 2010-01-15 2011-07-21 Bradford Allen Moore Automatic Keyboard Layout Determination
US10553209B2 (en) 2010-01-18 2020-02-04 Apple Inc. Systems and methods for hands-free notification summaries
US10679605B2 (en) 2010-01-18 2020-06-09 Apple Inc. Hands-free list-reading by intelligent automated assistant
US10276170B2 (en) 2010-01-18 2019-04-30 Apple Inc. Intelligent automated assistant
US10705794B2 (en) 2010-01-18 2020-07-07 Apple Inc. Automatically adapting user interfaces for hands-free interaction
US8417575B2 (en) 2010-01-19 2013-04-09 Apple Inc. On-device offline purchases using credits
US20110179002A1 (en) 2010-01-19 2011-07-21 Dell Products L.P. System and Method for a Vector-Space Search Engine
US8626511B2 (en) 2010-01-22 2014-01-07 Google Inc. Multi-dimensional disambiguation of voice commands
US8301121B2 (en) 2010-01-22 2012-10-30 Sony Ericsson Mobile Communications Ab Regulating alerts generated by communication terminals responsive to sensed movement
US20110184736A1 (en) 2010-01-26 2011-07-28 Benjamin Slotznick Automated method of recognizing inputted information items and selecting information items
US8346590B2 (en) 2010-01-27 2013-01-01 Google Inc. Automatically schedule and re-schedule meetings through search interface
US8406745B1 (en) 2010-01-28 2013-03-26 Sprint Communications Company L.P. Synchronization of voice mail greeting and email auto-reply by a wireless communication device
JP5633042B2 (ja) 2010-01-28 2014-12-03 本田技研工業株式会社 Speech recognition device, speech recognition method, and speech recognition robot
US20120330662A1 (en) 2010-01-29 2012-12-27 Nec Corporation Input supporting system, method and program
US8600967B2 (en) 2010-02-03 2013-12-03 Apple Inc. Automatic organization of browsing histories
US8687777B1 (en) 2010-02-03 2014-04-01 Tal Lavian Systems and methods for visual presentation and selection of IVR menu
US8645287B2 (en) 2010-02-04 2014-02-04 Microsoft Corporation Image tagging based upon cross domain context
US8886541B2 (en) 2010-02-04 2014-11-11 Sony Corporation Remote controller with position actuated voice transmission
US8751218B2 (en) 2010-02-09 2014-06-10 Siemens Aktiengesellschaft Indexing content at semantic level
US8179370B1 (en) 2010-02-09 2012-05-15 Google Inc. Proximity based keystroke resolution
US9413869B2 (en) 2010-02-10 2016-08-09 Qualcomm Incorporated Mobile device having plurality of input modes
US8782556B2 (en) 2010-02-12 2014-07-15 Microsoft Corporation User-centric soft keyboard predictive technologies
US8402018B2 (en) 2010-02-12 2013-03-19 Korea Advanced Institute Of Science And Technology Semantic search system using semantic ranking scheme
US8812056B2 (en) 2010-02-12 2014-08-19 Christopher D. Higginbotham Voice-based command driven computer implemented method
US9965165B2 (en) 2010-02-19 2018-05-08 Microsoft Technology Licensing, Llc Multi-finger gestures
US8850360B2 (en) 2010-02-23 2014-09-30 Hewlett-Packard Development Company, L.P. Skipping through electronic content on an electronic device
US9665344B2 (en) 2010-02-24 2017-05-30 GM Global Technology Operations LLC Multi-modal input system for a voice-based menu and content navigation service
US8682667B2 (en) 2010-02-25 2014-03-25 Apple Inc. User profiling for selecting user specific voice input processing information
US9710556B2 (en) 2010-03-01 2017-07-18 Vcvc Iii Llc Content recommendation based on collections of entities
US20110218855A1 (en) 2010-03-03 2011-09-08 Platformation, Inc. Offering Promotions Based on Query Analysis
US20120066303A1 (en) 2010-03-03 2012-03-15 Waldeck Technology, Llc Synchronized group location updates
US8502837B2 (en) 2010-03-04 2013-08-06 Research In Motion Limited System and method for activating components on an electronic device using orientation data
US8903847B2 (en) 2010-03-05 2014-12-02 International Business Machines Corporation Digital media voice tags in social networks
US8948515B2 (en) 2010-03-08 2015-02-03 Sightera Technologies Ltd. Method and system for classifying one or more images
US8521513B2 (en) 2010-03-12 2013-08-27 Microsoft Corporation Localization for interactive voice response systems
US9104312B2 (en) 2010-03-12 2015-08-11 Nuance Communications, Inc. Multimodal text input system, such as for use with touch screens on mobile phones
US20110228913A1 (en) 2010-03-16 2011-09-22 Telcordia Technologies, Inc. Automatic extraction of information from ongoing voice communication system and methods
US8374864B2 (en) 2010-03-17 2013-02-12 Cisco Technology, Inc. Correlation of transcribed text with corresponding audio
CN102893327B (zh) 2010-03-19 2015-05-27 数字标记公司 Intuitive computing methods and systems
US9323756B2 (en) 2010-03-22 2016-04-26 Lenovo (Singapore) Pte. Ltd. Audio book and e-book synchronization
US8554280B2 (en) 2010-03-23 2013-10-08 Ebay Inc. Free-form entries during payment processes
US20110239111A1 (en) 2010-03-24 2011-09-29 Avaya Inc. Spell checker interface
US20110238676A1 (en) 2010-03-25 2011-09-29 Palm, Inc. System and method for data capture, storage, and retrieval
US9378202B2 (en) 2010-03-26 2016-06-28 Virtuoz Sa Semantic clustering
US8428759B2 (en) 2010-03-26 2013-04-23 Google Inc. Predictive pre-recording of audio for voice input
EP2550651B1 (en) 2010-03-26 2016-06-15 Nuance Communications, Inc. Context based voice activity detection sensitivity
US20110242007A1 (en) 2010-04-01 2011-10-06 Gray Theodore W E-Book with User-Manipulatable Graphical Objects
US8296380B1 (en) 2010-04-01 2012-10-23 Kel & Partners LLC Social media based messaging systems and methods
US8930176B2 (en) 2010-04-01 2015-01-06 Microsoft Corporation Interactive multilingual word-alignment techniques
CA2795812A1 (en) 2010-04-07 2011-10-13 Max Value Solutions INTL, LLC Method and system for name pronunciation guide services
US8448084B2 (en) 2010-04-08 2013-05-21 Twitter, Inc. User interface mechanics
US8810684B2 (en) 2010-04-09 2014-08-19 Apple Inc. Tagging images in a mobile communications device using a contacts list
KR101369810B1 (ko) 2010-04-09 2014-03-05 이초강 Computer-readable recording medium storing a program that executes an empirical context-awareness method for a robot
JP5315289B2 (ja) 2010-04-12 2013-10-16 トヨタ自動車株式会社 Operating system and operating method
WO2011127640A1 (en) 2010-04-12 2011-10-20 Google Inc. Extension framework for input method editor
US8140567B2 (en) 2010-04-13 2012-03-20 Microsoft Corporation Measuring entity extraction complexity
US8265928B2 (en) 2010-04-14 2012-09-11 Google Inc. Geotagged environmental audio for enhanced speech recognition accuracy
US8756233B2 (en) 2010-04-16 2014-06-17 Video Semantics Semantic segmentation and tagging engine
US8595014B2 (en) 2010-04-19 2013-11-26 Qualcomm Incorporated Providing audible navigation system direction updates during predetermined time windows so as to minimize impact on conversations
US20110260829A1 (en) 2010-04-21 2011-10-27 Research In Motion Limited Method of providing security on a portable electronic device having a touch-sensitive display
US20130238647A1 (en) 2010-04-21 2013-09-12 Proteus Digital Health, Inc. Diagnostic System and Method
US20110264495A1 (en) 2010-04-22 2011-10-27 Apple Inc. Aggregation of tagged media item information
US20110264999A1 (en) 2010-04-23 2011-10-27 Research In Motion Limited Electronic device including touch-sensitive input device and method of controlling same
US8452037B2 (en) 2010-05-05 2013-05-28 Apple Inc. Speaker clip
US8380504B1 (en) 2010-05-06 2013-02-19 Sprint Communications Company L.P. Generation of voice profiles
US8756571B2 (en) 2010-05-07 2014-06-17 Hewlett-Packard Development Company, L.P. Natural language text instructions
US8938436B2 (en) 2010-05-10 2015-01-20 Verizon Patent And Licensing Inc. System for and method of providing reusable software service information based on natural language queries
JP2011238022A (ja) 2010-05-11 2011-11-24 Panasonic Corp Terminal, method for tracking content usage, and content usage system
US20110279368A1 (en) 2010-05-12 2011-11-17 Microsoft Corporation Inferring user intent to engage a motion capture system
US20110283189A1 (en) 2010-05-12 2011-11-17 Rovi Technologies Corporation Systems and methods for adjusting media guide interaction modes
US9015139B2 (en) 2010-05-14 2015-04-21 Rovi Guides, Inc. Systems and methods for performing a search based on a media content snapshot image
US8392186B2 (en) 2010-05-18 2013-03-05 K-Nfb Reading Technology, Inc. Audio synchronization for document narration with user-selected playback
US8745091B2 (en) 2010-05-18 2014-06-03 Integro, Inc. Electronic document classification
US8694313B2 (en) 2010-05-19 2014-04-08 Google Inc. Disambiguation of contact information using historical data
US9552355B2 (en) 2010-05-20 2017-01-24 Xerox Corporation Dynamic bi-phrases for statistical machine translation
US8522283B2 (en) 2010-05-20 2013-08-27 Google Inc. Television remote control data transfer
WO2011143827A1 (en) 2010-05-21 2011-11-24 Google Inc. Input method editor
US9236047B2 (en) 2010-05-21 2016-01-12 Microsoft Technology Licensing, Llc Voice stream augmented note taking
US8606579B2 (en) 2010-05-24 2013-12-10 Microsoft Corporation Voice print identification for identifying speakers
JP2011250027A (ja) 2010-05-25 2011-12-08 Panasonic Electric Works Co Ltd Remote control device and information communication system
US9569549B1 (en) 2010-05-25 2017-02-14 Amazon Technologies, Inc. Location based recommendation and tagging of media content items
US8468012B2 (en) 2010-05-26 2013-06-18 Google Inc. Acoustic model adaptation using geographic information
JP2013533996A (ja) 2010-05-31 2013-08-29 バイドゥ オンライン ネットワーク テクノロジー(ペキン) カンパニー リミテッド Method and apparatus for mixed input of English and another script
US8639516B2 (en) 2010-06-04 2014-01-28 Apple Inc. User-specific noise suppression for voice quality improvements
ES2534047T3 (es) 2010-06-08 2015-04-16 Vodafone Holding Gmbh Smart card with microphone
US8458115B2 (en) 2010-06-08 2013-06-04 Microsoft Corporation Mining topic-related aspects from user generated content
US8954425B2 (en) 2010-06-08 2015-02-10 Microsoft Corporation Snippet extraction and ranking
US20110306426A1 (en) 2010-06-10 2011-12-15 Microsoft Corporation Activity Participation Based On User Intent
US20110307810A1 (en) 2010-06-11 2011-12-15 Isreal Hilerio List integration
US8234111B2 (en) 2010-06-14 2012-07-31 Google Inc. Speech and noise models for speech recognition
US20120136572A1 (en) 2010-06-17 2012-05-31 Norton Kenneth S Distance and Location-Aware Reminders in a Calendar System
US20110314003A1 (en) 2010-06-17 2011-12-22 Microsoft Corporation Template concatenation for capturing multiple concepts in a voice query
WO2011160140A1 (en) 2010-06-18 2011-12-22 Susan Bennett System and method of semantic based searching
US9009592B2 (en) 2010-06-22 2015-04-14 Microsoft Technology Licensing, Llc Population of lists and tasks from captured voice and audio content
US8375320B2 (en) 2010-06-22 2013-02-12 Microsoft Corporation Context-based task generation
EP2400373A1 (en) 2010-06-22 2011-12-28 Vodafone Holding GmbH Inputting symbols into an electronic device having a touch-screen
US8581844B2 (en) 2010-06-23 2013-11-12 Google Inc. Switching between a first operational mode and a second operational mode using a natural motion gesture
US8655901B1 (en) 2010-06-23 2014-02-18 Google Inc. Translation-based query pattern mining
US11068657B2 (en) 2010-06-28 2021-07-20 Skyscanner Limited Natural language question answering system and method based on deep semantics
US8411874B2 (en) 2010-06-30 2013-04-02 Google Inc. Removing noise from audio
JP5323770B2 (ja) 2010-06-30 2013-10-23 日本放送協会 User instruction acquisition device, user instruction acquisition program, and television receiver
CN101894547A (zh) 2010-06-30 2010-11-24 北京捷通华声语音技术有限公司 Speech synthesis method and system
US8250071B1 (en) 2010-06-30 2012-08-21 Amazon Technologies, Inc. Disambiguation of term meaning
US20120005602A1 (en) 2010-07-02 2012-01-05 Nokia Corporation Methods and apparatuses for facilitating task switching
EP2402867B1 (en) 2010-07-02 2018-08-22 Accenture Global Services Limited A computer-implemented method, a computer program product and a computer system for image processing
US8699821B2 (en) 2010-07-05 2014-04-15 Apple Inc. Aligning images
US20120010886A1 (en) 2010-07-06 2012-01-12 Javad Razavilar Language Identification
US8848882B2 (en) 2010-07-07 2014-09-30 Verizon Patent And Licensing Inc. System for and method of measuring caller interactions during a call session
US8249556B2 (en) 2010-07-13 2012-08-21 Google Inc. Securing a mobile computing device
US9104670B2 (en) 2010-07-21 2015-08-11 Apple Inc. Customized search or acquisition of digital media assets
US8260247B2 (en) 2010-07-21 2012-09-04 Research In Motion Limited Portable electronic device and method of operation
DK2596647T3 (en) 2010-07-23 2016-02-15 Sonova Ag Hearing system and method for operating a hearing system
US9786159B2 (en) 2010-07-23 2017-10-10 Tivo Solutions Inc. Multi-function remote control device
US8861925B1 (en) 2010-07-28 2014-10-14 Intuit Inc. Methods and systems for audio-visual synchronization
KR101699720B1 (ko) 2010-08-03 2017-01-26 삼성전자주식회사 Voice command recognition apparatus and voice command recognition method
BRPI1004128A2 (pt) 2010-08-04 2012-04-10 Magneti Marelli Sist S Automotivos Ind E Com Ltda Definition of the top-level key parameters for a biodiesel logic sensor
US9349368B1 (en) 2010-08-05 2016-05-24 Google Inc. Generating an audio notification based on detection of a triggering event
US8775156B2 (en) 2010-08-05 2014-07-08 Google Inc. Translating languages in response to device motion
US8402533B2 (en) 2010-08-06 2013-03-19 Google Inc. Input to locked computing device
US8731939B1 (en) 2010-08-06 2014-05-20 Google Inc. Routing queries based on carrier phrase registration
US8359020B2 (en) 2010-08-06 2013-01-22 Google Inc. Automatically monitoring for voice input based on context
US8473289B2 (en) 2010-08-06 2013-06-25 Google Inc. Disambiguating input based on context
WO2012019637A1 (en) 2010-08-09 2012-02-16 Jadhav, Shubhangi Mahadeo Visual music playlist creation and visual music track exploration
CN101951553B (zh) 2010-08-17 2012-10-10 深圳市车音网科技有限公司 Navigation method and system based on voice commands
US8719006B2 (en) 2010-08-27 2014-05-06 Apple Inc. Combined statistical and rule-based part-of-speech tagging for text-to-speech synthesis
EP2609752A4 (en) 2010-08-27 2015-04-08 Intel Corp Remote control device
US8478519B2 (en) 2010-08-30 2013-07-02 Google Inc. Providing results to parameterless search queries
WO2012030838A1 (en) 2010-08-30 2012-03-08 Honda Motor Co., Ltd. Belief tracking and action selection in spoken dialog systems
US9800721B2 (en) 2010-09-07 2017-10-24 Securus Technologies, Inc. Multi-party conversation analyzer and logger
US20120059655A1 (en) 2010-09-08 2012-03-08 Nuance Communications, Inc. Methods and apparatus for providing input to a speech-enabled application program
US8341142B2 (en) 2010-09-08 2012-12-25 Nuance Communications, Inc. Methods and apparatus for searching the Internet
WO2012033492A1 (en) 2010-09-09 2012-03-15 Sony Ericsson Mobile Communications Ab Annotating e-books/e-magazines with application results
US9538229B2 (en) 2010-09-15 2017-01-03 Verizon Patent And Licensing Inc. Media experience for touch screen devices
US8560229B1 (en) 2010-09-15 2013-10-15 Google Inc. Sensor based activity detection
US20120068937A1 (en) 2010-09-16 2012-03-22 Sony Ericsson Mobile Communications Ab Quick input language/virtual keyboard/language dictionary change on a touch screen device
US20120078635A1 (en) 2010-09-24 2012-03-29 Apple Inc. Voice control system
CN101937194B (zh) 2010-09-27 2012-12-19 鸿富锦精密工业(深圳)有限公司 Intelligent control system and method with learning function
KR20120031722A (ko) 2010-09-27 2012-04-04 삼성전자주식회사 Apparatus and method for generating dynamic responses
US8719014B2 (en) 2010-09-27 2014-05-06 Apple Inc. Electronic device with text error correction based on voice recognition data
US8594997B2 (en) 2010-09-27 2013-11-26 Sap Ag Context-aware conversational user interface
US10037319B2 (en) 2010-09-29 2018-07-31 Touchtype Limited User input prediction
CN102436456B (zh) 2010-09-29 2016-03-30 国际商业机器公司 Method and apparatus for classifying named entities
US20120084248A1 (en) 2010-09-30 2012-04-05 Microsoft Corporation Providing suggestions based on user intent
US8644519B2 (en) 2010-09-30 2014-02-04 Apple Inc. Electronic devices with improved audio
US8812321B2 (en) 2010-09-30 2014-08-19 At&T Intellectual Property I, L.P. System and method for combining speech recognition outputs from a plurality of domain-specific speech recognizers via machine learning
US20120084634A1 (en) 2010-10-05 2012-04-05 Sony Corporation Method and apparatus for annotating text
US8606293B2 (en) 2010-10-05 2013-12-10 Qualcomm Incorporated Mobile device location estimation using environmental information
US9679256B2 (en) 2010-10-06 2017-06-13 The Chancellor, Masters And Scholars Of The University Of Cambridge Automated assessment of examination scripts
US10900799B2 (en) 2010-10-12 2021-01-26 Toyota Motor Engineering & Manufacturing North America, Inc. Systems and methods for determining a destination location from a communication
AU2011316437A1 (en) 2010-10-15 2013-05-09 Intelligent Mechatronic Systems Inc. Implicit association and polymorphism driven human machine interaction
JP5572059B2 (ja) 2010-10-21 2014-08-13 京セラ株式会社 Display device
US20120108221A1 (en) 2010-10-28 2012-05-03 Microsoft Corporation Augmenting communication sessions with applications
JP5883014B2 (ja) 2010-10-29 2016-03-09 科大訊飛股▲分▼有限公司iFLYTEK Co., Ltd. Method and system for automatic detection of the end point of a recording
US8660531B2 (en) 2010-11-03 2014-02-25 Blackberry Limited Access to locked functions
US20120116770A1 (en) 2010-11-08 2012-05-10 Ming-Fu Chen Speech data retrieving and presenting device
US8881057B2 (en) 2010-11-09 2014-11-04 Blackberry Limited Methods and apparatus to display mobile device contexts
MY177511A (en) 2010-11-16 2020-09-17 Shardul Suresh Shroff System and method for providing virtual arbitration
US20120124126A1 (en) 2010-11-17 2012-05-17 Microsoft Corporation Contextual and task focused computing
US10144440B2 (en) 2010-11-17 2018-12-04 General Electric Company Methods and systems for data communications
US9484018B2 (en) 2010-11-23 2016-11-01 At&T Intellectual Property I, L.P. System and method for building and evaluating automatic speech recognition via an application programmer interface
US8938216B2 (en) 2010-11-24 2015-01-20 Cisco Technology, Inc. Geographical location information/signal quality-context based recording and playback of multimedia data from a conference session
US9105008B2 (en) 2010-11-29 2015-08-11 Yahoo! Inc. Detecting controversial events
US8489625B2 (en) 2010-11-29 2013-07-16 Microsoft Corporation Mobile query suggestions with time-location awareness
JP5652913B2 (ja) 2010-12-03 2015-01-14 アイシン・エィ・ダブリュ株式会社 In-vehicle terminal device
US9135241B2 (en) 2010-12-08 2015-09-15 At&T Intellectual Property I, L.P. System and method for learning latent representations for natural language tasks
US8312096B2 (en) 2010-12-08 2012-11-13 Google Inc. Priority inbox notifications and synchronization for mobile messaging application
US9244606B2 (en) 2010-12-20 2016-01-26 Apple Inc. Device, method, and graphical user interface for navigation of concurrently open software applications
US8666726B2 (en) 2010-12-21 2014-03-04 Nuance Communications, Inc. Sample clustering to reduce manual transcriptions in speech recognition system
US20120158422A1 (en) 2010-12-21 2012-06-21 General Electric Company Methods and systems for scheduling appointments in healthcare systems
US20120158293A1 (en) 2010-12-21 2012-06-21 General Electric Company Methods and systems for dynamically providing users with appointment reminders
US20130035086A1 (en) 2010-12-22 2013-02-07 Logitech Europe S.A. Remote control system for providing content suggestions
US8532377B2 (en) 2010-12-22 2013-09-10 Xerox Corporation Image ranking based on abstract concepts
US8838449B2 (en) 2010-12-23 2014-09-16 Microsoft Corporation Word-dependent language model
TWI413105B (zh) 2010-12-30 2013-10-21 Ind Tech Res Inst Multilingual text-to-speech synthesis system and method
US8626681B1 (en) 2011-01-04 2014-01-07 Google Inc. Training a probabilistic spelling checker from structured data
KR101828273B1 (ko) 2011-01-04 2018-02-14 삼성전자주식회사 Apparatus and method for combination-based voice command recognition
EP2661705A4 (en) 2011-01-05 2016-06-01 Google Inc Method and system for facilitating text input
US8589950B2 (en) 2011-01-05 2013-11-19 Blackberry Limited Processing user input events in a web browser
JP5712618B2 (ja) 2011-01-07 2015-05-07 サクサ株式会社 Telephone system
US10049669B2 (en) 2011-01-07 2018-08-14 Nuance Communications, Inc. Configurable speech recognition system using multiple recognizers
US9183843B2 (en) 2011-01-07 2015-11-10 Nuance Communications, Inc. Configurable speech recognition system using multiple recognizers
CA2821565C (en) 2011-01-07 2017-04-18 Research In Motion Limited System and method for controlling mobile communication devices
US8689116B2 (en) 2011-01-14 2014-04-01 Apple Inc. Email user interface
US20120192096A1 (en) 2011-01-25 2012-07-26 Research In Motion Limited Active command line driven user interface
US8666895B2 (en) 2011-01-31 2014-03-04 Bank Of America Corporation Single action mobile transaction device
US8943054B2 (en) 2011-01-31 2015-01-27 Social Resolve, Llc Social media content management system and method
AU2012212517A1 (en) 2011-02-04 2013-08-22 Google Inc. Posting to social networks by voice
US8862612B2 (en) 2011-02-11 2014-10-14 Sony Corporation Direct search launch on a second display
US10631246B2 (en) 2011-02-14 2020-04-21 Microsoft Technology Licensing, Llc Task switching on mobile devices
US9916420B2 (en) 2011-02-18 2018-03-13 Nuance Communications, Inc. Physician and clinical documentation specialist workflow integration
US8694335B2 (en) 2011-02-18 2014-04-08 Nuance Communications, Inc. Methods and apparatus for applying user corrections to medical fact extraction
US10145960B2 (en) 2011-02-24 2018-12-04 Ford Global Technologies, Llc System and method for cell phone restriction
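KR101178310B1 (ko) 2011-02-24 2012-08-29 포항공과대학교 산학협력단 Dialogue management method and system for executing the same
CN102651217A (zh) 2011-02-25 2012-08-29 株式会社东芝 Method and device for synthesizing speech, and acoustic model training method for speech synthesis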
US8688453B1 (en) 2011-02-28 2014-04-01 Nuance Communications, Inc. Intent mining via analysis of utterances
US20120221552A1 (en) 2011-02-28 2012-08-30 Nokia Corporation Method and apparatus for providing an active search user interface element
US9632677B2 (en) 2011-03-02 2017-04-25 The Boeing Company System and method for navigating a 3-D environment using a multi-input interface
US8972275B2 (en) 2011-03-03 2015-03-03 Brightedge Technologies, Inc. Optimization of social media engagement
EP2498250B1 (en) 2011-03-07 2021-05-05 Accenture Global Services Limited Client and server system for natural language-based control of a digital network of devices
US9081760B2 (en) 2011-03-08 2015-07-14 At&T Intellectual Property I, L.P. System and method for building diverse language models
US20120233266A1 (en) 2011-03-11 2012-09-13 Microsoft Corporation Peer-to-peer group with renegotiation of group owner
CN202092650U (zh) 2011-03-14 2011-12-28 深圳市车乐数码科技有限公司 Vehicle-mounted multimedia device with buttons and voice navigation
US8849931B2 (en) 2011-03-15 2014-09-30 Idt Messaging, Llc Linking context-based information to text messages
US9262612B2 (en) 2011-03-21 2016-02-16 Apple Inc. Device access using voice authentication
US8862255B2 (en) 2011-03-23 2014-10-14 Audible, Inc. Managing playback of synchronized content
US20120246064A1 (en) 2011-03-23 2012-09-27 Ebay, Inc. Customer refunds using payment service providers
US9202465B2 (en) 2011-03-25 2015-12-01 General Motors Llc Speech recognition dependent on text message content
US8766793B2 (en) 2011-03-25 2014-07-01 Microsoft Corporation Contextually-appropriate task reminders
US9171546B1 (en) 2011-03-29 2015-10-27 Google Inc. Performing functions based on commands in context of telephonic communication
CN202035047U (zh) 2011-03-29 2011-11-09 张磊 Mobile terminal that extracts address information for navigation
US9154555B2 (en) 2011-03-30 2015-10-06 Paypal, Inc. Device specific remote disabling of applications
US9280535B2 (en) 2011-03-31 2016-03-08 Infosys Limited Natural language querying with cascaded conditional random fields
EP2691870A4 (en) 2011-03-31 2015-05-20 Microsoft Technology Licensing Llc User intentions oriented on tasks
US9337999B2 (en) 2011-04-01 2016-05-10 Intel Corporation Application usage continuum across platforms
US9098488B2 (en) 2011-04-03 2015-08-04 Microsoft Technology Licensing, Llc Translation of multilingual embedded phrases
US20120252367A1 (en) 2011-04-04 2012-10-04 Meditalk Devices, Llc Auditory Speech Module For Medical Devices
US8914275B2 (en) 2011-04-06 2014-12-16 Microsoft Corporation Text prediction
US9292877B2 (en) 2011-04-13 2016-03-22 Longsand Limited Methods and systems for generating concept-based hash tags
CN102137193A (zh) 2011-04-13 2011-07-27 深圳凯虹移动通信有限公司 Mobile communication terminal and communication control method thereof
EP2702473A1 (en) 2011-04-25 2014-03-05 Veveo, Inc. System and method for an intelligent personal timeline assistant
US9444692B2 (en) 2011-04-26 2016-09-13 Openet Telecom Ltd. Systems, devices and methods of crowd-sourcing across multiple domains
JP5592433B2 (ja) 2011-05-03 2014-09-17 宏達國際電子股份有限公司 Handheld electronic device and multimedia clip recording method thereof
DE112011100058T5 (de) 2011-05-04 2013-02-07 Research In Motion Ltd. Method for adapting a presentation of graphical data displayed on a graphical user interface
US8171137B1 (en) 2011-05-09 2012-05-01 Google Inc. Transferring application state across devices
US8150385B1 (en) 2011-05-09 2012-04-03 Loment, Inc. Automated reply messages among end user communication devices
WO2012155079A2 (en) 2011-05-12 2012-11-15 Johnson Controls Technology Company Adaptive voice recognition systems and methods
US9064006B2 (en) 2012-08-23 2015-06-23 Microsoft Technology Licensing, Llc Translating natural language utterances to keyword search queries
KR101233561B1 (ko) 2011-05-12 2013-02-14 엔에이치엔(주) Speech recognition system and method based on word-level candidate generation
JP2014524059A (ja) 2011-05-13 2014-09-18 プリンプトン,デーヴィッド Calendar-based search engine
US20120290291A1 (en) 2011-05-13 2012-11-15 Gabriel Lee Gilbert Shelley Input processing for character matching and predicted word matching
US8793624B2 (en) 2011-05-18 2014-07-29 Google Inc. Control of a device using gestures
US8972240B2 (en) 2011-05-19 2015-03-03 Microsoft Corporation User-modifiable word lattice display for editing documents and search queries
US8914290B2 (en) 2011-05-20 2014-12-16 Vocollect, Inc. Systems and methods for dynamically improving user intelligibility of synthesized speech in a work environment
US9236045B2 (en) 2011-05-23 2016-01-12 Nuance Communications, Inc. Methods and apparatus for proofing of a text input
US20120304124A1 (en) 2011-05-23 2012-11-29 Microsoft Corporation Context aware input engine
US8731936B2 (en) 2011-05-26 2014-05-20 Microsoft Corporation Energy-efficient unobtrusive identification of a speaker
JP5463385B2 (ja) 2011-06-03 2014-04-09 アップル インコーポレイテッド Automatically creating a mapping between text data and audio data
US20120310642A1 (en) 2011-06-03 2012-12-06 Apple Inc. Automatically creating a mapping between text data and audio data
US10057736B2 (en) 2011-06-03 2018-08-21 Apple Inc. Active transport based notifications
US9268857B2 (en) 2011-06-03 2016-02-23 Facebook, Inc. Suggesting search results to users before receiving any search query from the users
US20120317498A1 (en) 2011-06-07 2012-12-13 Research In Motion Limited Electronic communication device and method for displaying icons
US8781841B1 (en) 2011-06-07 2014-07-15 Cisco Technology, Inc. Name recognition of virtual meeting participants
US20120316875A1 (en) 2011-06-10 2012-12-13 Red Shift Company, Llc Hosted speech handling
WO2012170817A1 (en) 2011-06-10 2012-12-13 Google Inc. Augmenting statistical machine translation with linguistic knowledge
US8732319B2 (en) 2011-06-10 2014-05-20 Qualcomm Incorporated Context awareness proximity-based establishment of wireless communication connection
US20130158977A1 (en) 2011-06-14 2013-06-20 Andrew Senior System and Method for Evaluating Speech Exposure
US20120321112A1 (en) 2011-06-16 2012-12-20 Apple Inc. Selecting a digital stream based on an audio sample
US20120324391A1 (en) 2011-06-16 2012-12-20 Microsoft Corporation Predictive word completion
US20120329529A1 (en) 2011-06-21 2012-12-27 GreatCall, Inc. Gesture activate help process and system
CN104011712B (zh) 2011-06-24 2018-04-24 谷歌有限责任公司 Evaluating query translations for cross-language query suggestion
US10984387B2 (en) 2011-06-28 2021-04-20 Microsoft Technology Licensing, Llc Automatic task extraction and calendar entry
US20130006633A1 (en) 2011-07-01 2013-01-03 Qualcomm Incorporated Learning speech models for mobile device users
DE102011078642A1 (de) 2011-07-05 2013-01-10 Robert Bosch Gmbh Method for checking an m-out-of-n code
US8209183B1 (en) 2011-07-07 2012-06-26 Google Inc. Systems and methods for correction of text from different input types, sources, and contexts
US20130010575A1 (en) 2011-07-07 2013-01-10 International Business Machines Corporation Systems and methods of managing electronic calendar applications
US8682670B2 (en) 2011-07-07 2014-03-25 International Business Machines Corporation Statistical enhancement of speech output from a statistical text-to-speech synthesis system
US20130018659A1 (en) 2011-07-12 2013-01-17 Google Inc. Systems and Methods for Speech Command Processing
CA2747153A1 (en) 2011-07-19 2013-01-19 Suleman Kaheer Natural language processing dialog system for obtaining goods, services or information
US20130024576A1 (en) 2011-07-22 2013-01-24 Microsoft Corporation Proximity-Based Detection
US20130031476A1 (en) 2011-07-25 2013-01-31 Coin Emmett Voice activated virtual assistant
US8781810B2 (en) 2011-07-25 2014-07-15 Xerox Corporation System and method for productive generation of compound words in statistical machine translation
US9009041B2 (en) 2011-07-26 2015-04-14 Nuance Communications, Inc. Systems and methods for improving the accuracy of a transcription using auxiliary data such as personal data
US8732028B2 (en) 2011-07-26 2014-05-20 Expose Retail Strategies Inc. Scheduling of order processing for remotely ordered goods
US9292112B2 (en) 2011-07-28 2016-03-22 Hewlett-Packard Development Company, L.P. Multimodal interface
EP2551784A1 (en) 2011-07-28 2013-01-30 Roche Diagnostics GmbH Method of controlling the display of a dataset
US9031842B2 (en) 2011-07-28 2015-05-12 Blackberry Limited Methods and devices for facilitating communications
CN102905499B (zh) 2011-07-29 2015-12-09 纬创资通股份有限公司 Riser card module and electronic device
US20130031216A1 (en) 2011-07-29 2013-01-31 Myxer, Inc. Systems and methods for generation of customized media playlists
US20130030789A1 (en) 2011-07-29 2013-01-31 Reginald Dalce Universal Language Translator
US20130035117A1 (en) 2011-08-04 2013-02-07 GM Global Technology Operations LLC System and method for restricting driver mobile device feature usage while vehicle is in motion
EP3754997B1 (en) 2011-08-05 2023-08-30 Samsung Electronics Co., Ltd. Method for controlling electronic apparatus based on voice recognition and motion recognition, and electronic apparatus applying the same
WO2013022218A2 (en) 2011-08-05 2013-02-14 Samsung Electronics Co., Ltd. Electronic apparatus and method for providing user interface thereof
US9417754B2 (en) 2011-08-05 2016-08-16 P4tents1, LLC User interface system, method, and computer program product
US8595015B2 (en) 2011-08-08 2013-11-26 Verizon New Jersey Inc. Audio communication assessment
CN102929710B (zh) 2011-08-09 2017-10-27 中兴通讯股份有限公司 Method for invoking an application module and mobile terminal
WO2013022135A1 (en) 2011-08-11 2013-02-14 Lg Electronics Inc. Electronic device and method of controlling the same
US8706472B2 (en) 2011-08-11 2014-04-22 Apple Inc. Method for disambiguating multiple readings in language conversion
US8589160B2 (en) 2011-08-19 2013-11-19 Dolbey & Company, Inc. Systems and methods for providing an electronic dictation interface
US20130055099A1 (en) 2011-08-22 2013-02-28 Rose Yao Unified Messaging System with Integration of Call Log Data
US8943071B2 (en) 2011-08-23 2015-01-27 At&T Intellectual Property I, L.P. Automatic sort and propagation associated with electronic documents
US9195768B2 (en) 2011-08-26 2015-11-24 Amazon Technologies, Inc. Remote browsing session management
US20130055147A1 (en) 2011-08-29 2013-02-28 Salesforce.Com, Inc. Configuration, generation, and presentation of custom graphical user interface components for a virtual cloud-based application
US20130054706A1 (en) 2011-08-29 2013-02-28 Mary Graham Modulation of Visual Notification Parameters Based on Message Activity and Notification Value
US8994660B2 (en) 2011-08-29 2015-03-31 Apple Inc. Text correction processing
US8819012B2 (en) 2011-08-30 2014-08-26 International Business Machines Corporation Accessing anchors in voice site content
US8554729B2 (en) 2011-08-31 2013-10-08 Google Inc. System and method for synchronization of actions in the background of an application
US8914288B2 (en) 2011-09-01 2014-12-16 At&T Intellectual Property I, L.P. System and method for advanced turn-taking for interactive spoken dialog systems
JP6050362B2 (ja) 2011-09-09 2016-12-21 グーグル インコーポレイテッド User interface for translation web pages
US9596084B2 (en) 2011-09-09 2017-03-14 Facebook, Inc. Initializing camera subsystem for face detection based on sensor inputs
US20130066832A1 (en) 2011-09-12 2013-03-14 Microsoft Corporation Application state synchronization
US20130073346A1 (en) 2011-09-16 2013-03-21 David Chun Identifying companies most closely related to a given company
US20130073286A1 (en) 2011-09-20 2013-03-21 Apple Inc. Consolidating Speech Recognition Results
US9129606B2 (en) 2011-09-23 2015-09-08 Microsoft Technology Licensing, Llc User query history expansion for improving language model adaptation
US8798995B1 (en) 2011-09-23 2014-08-05 Amazon Technologies, Inc. Key word determinations from voice data
US20130080251A1 (en) 2011-09-26 2013-03-28 Accenture Global Services Limited Product registration and tracking system
US8812301B2 (en) 2011-09-26 2014-08-19 Xerox Corporation Linguistically-adapted structural query annotation
US8996381B2 (en) 2011-09-27 2015-03-31 Sensory, Incorporated Background speech recognition assistant
US8768707B2 (en) 2011-09-27 2014-07-01 Sensory Incorporated Background speech recognition assistant using speaker verification
US8762156B2 (en) 2011-09-28 2014-06-24 Apple Inc. Speech recognition repair using contextual information
EP2575128A3 (en) 2011-09-30 2013-08-14 Apple Inc. Using context information to facilitate processing of commands in a virtual assistant
US8452597B2 (en) 2011-09-30 2013-05-28 Google Inc. Systems and methods for continual speech recognition and detection in mobile computing devices
WO2013048880A1 (en) 2011-09-30 2013-04-04 Apple Inc. Automatically adapting user interfaces for hands-free interaction
US8468022B2 (en) 2011-09-30 2013-06-18 Google Inc. Voice control for asynchronous notifications
AU2015203483A1 (en) 2011-09-30 2015-07-16 Apple Inc. Using context information to facilitate processing of commands in a virtual assistant
US8340975B1 (en) 2011-10-04 2012-12-25 Theodore Alfred Rosenberger Interactive speech recognition device and system for hands-free building control
US8386926B1 (en) 2011-10-06 2013-02-26 Google Inc. Network-based custom dictionary, auto-correction and text entry preferences
US9640175B2 (en) 2011-10-07 2017-05-02 Microsoft Technology Licensing, Llc Pronunciation learning from user correction
US9521175B2 (en) 2011-10-07 2016-12-13 Henk B. Rogers Media tagging
US9021565B2 (en) 2011-10-13 2015-04-28 At&T Intellectual Property I, L.P. Authentication techniques utilizing a computing device
US8738363B2 (en) 2011-10-13 2014-05-27 Xerox Corporation System and method for suggestion mining
US20130097566A1 (en) 2011-10-17 2013-04-18 Carl Fredrik Alexander BERGLUND System and method for displaying items on electronic devices
KR101873741B1 (ko) 2011-10-26 2018-07-03 엘지전자 주식회사 Portable terminal and control method thereof
US20130111330A1 (en) 2011-11-01 2013-05-02 Research In Motion Limited Accelerated compositing of fixed position elements on an electronic device
US9223948B2 (en) 2011-11-01 2015-12-29 Blackberry Limited Combined passcode and activity launch modifier
US8996350B1 (en) 2011-11-02 2015-03-31 Dub Software Group, Inc. System and method for automatic document management
US20130110943A1 (en) 2011-11-02 2013-05-02 Apple Inc. Notification and reminder generation, distribution, and storage system
US9471666B2 (en) 2011-11-02 2016-10-18 Salesforce.Com, Inc. System and method for supporting natural language queries and requests against a user's personal data cloud
JP5681611B2 (ja) 2011-11-09 2015-03-11 株式会社日立製作所 Navigation system, navigation device, method, and server
US9711137B2 (en) 2011-11-10 2017-07-18 At&T Intellectual Property I, Lp Network-based background expert
US8972263B2 (en) 2011-11-18 2015-03-03 Soundhound, Inc. System and method for performing dual mode speech recognition
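KR101830656B1 (ko) 2011-12-02 2018-02-21 엘지전자 주식회사 Mobile terminal and control method thereof
KR101193668B1 (ko) 2011-12-06 2012-12-14 위준성 Method for providing a context-awareness-based foreign language acquisition and learning service using a smart device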
US9214157B2 (en) 2011-12-06 2015-12-15 At&T Intellectual Property I, L.P. System and method for machine-mediated human-human conversation
US9323746B2 (en) 2011-12-06 2016-04-26 At&T Intellectual Property I, L.P. System and method for collaborative language translation
US9082402B2 (en) 2011-12-08 2015-07-14 Sri International Generic virtual personal assistant platform
US9646313B2 (en) 2011-12-13 2017-05-09 Microsoft Technology Licensing, Llc Gesture-based tagging to view related content
WO2013090839A1 (en) 2011-12-14 2013-06-20 Realnetworks, Inc. Customizable media auto-reply systems and methods
US8622836B2 (en) 2011-12-22 2014-01-07 Igt Use of wireless signal strength to determine connection
JP2013134430A (ja) 2011-12-27 2013-07-08 Toyota Motor Corp Command processing device, method, and program
US8996729B2 (en) 2012-04-12 2015-03-31 Nokia Corporation Method and apparatus for synchronizing tasks performed by multiple devices
US9094534B2 (en) 2011-12-29 2015-07-28 Apple Inc. Device, method, and graphical user interface for configuring and implementing restricted interactions with a user interface
JP5547216B2 (ja) 2012-01-06 2014-07-09 株式会社東芝 Electronic apparatus and display control method
JP5887937B2 (ja) 2012-01-06 2016-03-16 株式会社リコー Output control system, output control method, output control device, and output control program
US9547832B2 (en) 2012-01-10 2017-01-17 Oracle International Corporation Identifying individual intentions and determining responses to individual intentions
US8825020B2 (en) 2012-01-12 2014-09-02 Sensory, Incorporated Information access and device control using mobile phones and audio in the home environment
US8812302B2 (en) 2012-01-17 2014-08-19 Google Inc. Techniques for inserting diacritical marks to text input via a user device
US9099098B2 (en) 2012-01-20 2015-08-04 Qualcomm Incorporated Voice activity detection in presence of background noise
US20130204813A1 (en) 2012-01-20 2013-08-08 Fluential, Llc Self-learning, context aware virtual assistants, systems and methods
WO2013113010A1 (en) 2012-01-26 2013-08-01 Telecommunication Systems, Inc. Navigational lane guidance
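JP5682578B2 (ja) 2012-01-27 2015-03-11 日本電気株式会社 Speech recognition result correction support system, speech recognition result correction support method, and speech recognition result correction support program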
US8626748B2 (en) 2012-02-03 2014-01-07 International Business Machines Corporation Combined word tree text visualization system
US8995960B2 (en) 2012-02-10 2015-03-31 Dedo Interactive, Inc. Mobile device authentication
CN102629246B (zh) 2012-02-10 2017-06-27 百纳(武汉)信息技术有限公司 Server for recognizing browser voice commands and browser voice command recognition method
US10209954B2 (en) 2012-02-14 2019-02-19 Microsoft Technology Licensing, Llc Equal access to speech and touch input
JP2013167806A (ja) 2012-02-16 2013-08-29 Toshiba Corp Information notification support device, information notification support method, and program
US9064497B2 (en) 2012-02-22 2015-06-23 Htc Corporation Method and apparatus for audio intelligibility enhancement and computing apparatus
DE112012000189B4 (de) 2012-02-24 2023-06-15 Blackberry Limited Touchscreen keyboard providing word predictions in partitions of the touchscreen keyboard in proximate association with candidate letters
US9042867B2 (en) 2012-02-24 2015-05-26 Agnitio S.L. System and method for speaker recognition on mobile devices
US8543398B1 (en) 2012-02-29 2013-09-24 Google Inc. Training an automatic speech recognition system using compressed word frequencies
US10134385B2 (en) 2012-03-02 2018-11-20 Apple Inc. Systems and methods for name pronunciation
US20130235987A1 (en) 2012-03-06 2013-09-12 Jose Arroniz-Escobar Automatic machine to machine distribution of subscriber contact information
US9639174B2 (en) 2012-03-09 2017-05-02 Paypal, Inc. Mobile device display content based on shaking the device
US20150006157A1 (en) 2012-03-14 2015-01-01 Nec Corporation Term synonym acquisition method and term synonym acquisition apparatus
WO2013138633A1 (en) 2012-03-15 2013-09-19 Regents Of The University Of Minnesota Automated verbal fluency assessment
EP2639792A1 (en) 2012-03-16 2013-09-18 France Télécom Voice control of applications by associating user input with action-context idendifier pairs
US9223497B2 (en) 2012-03-16 2015-12-29 Blackberry Limited In-context word prediction and word correction
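JP5870790B2 (ja) 2012-03-19 2016-03-01 富士通株式会社 Text proofreading device and text proofreading method
JP2013200423A (ja) 2012-03-23 2013-10-03 Toshiba Corp Voice dialogue support device, method, and program
JP5965175B2 (ja) 2012-03-27 2016-08-03 ヤフー株式会社 Response generation device, response generation method, and response generation program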
US8681950B2 (en) 2012-03-28 2014-03-25 Interactive Intelligence, Inc. System and method for fingerprinting datasets
WO2013144759A1 (en) 2012-03-29 2013-10-03 Telmap Ltd. Location-based assistance for personal planning
US8892419B2 (en) 2012-04-10 2014-11-18 Artificial Solutions Iberia SL System and methods for semiautomatic generation and tuning of natural language interaction applications
US8346563B1 (en) 2012-04-10 2013-01-01 Artificial Solutions Ltd. System and methods for delivering advanced natural language interaction applications
US20130275117A1 (en) 2012-04-11 2013-10-17 Morgan H. Winer Generalized Phonetic Transliteration Engine
US9685160B2 (en) 2012-04-16 2017-06-20 Htc Corporation Method for offering suggestion during conversation, electronic device using the same, and non-transitory storage medium
US9223537B2 (en) 2012-04-18 2015-12-29 Next It Corporation Conversation user interface
US9117449B2 (en) 2012-04-26 2015-08-25 Nuance Communications, Inc. Embedded system for construction of small footprint speech recognition with user-definable constraints
CN102682771B (zh) 2012-04-27 2013-11-20 厦门思德电子科技有限公司 Multi-voice control method suitable for a cloud platform
TWI511537B (zh) 2012-04-27 2015-12-01 Wistron Corp Smart TV system, smart TV, mobile device, and input operation method thereof
US20130289991A1 (en) 2012-04-30 2013-10-31 International Business Machines Corporation Application of Voice Tags in a Social Media Context
KR101946364B1 (ko) 2012-05-01 2019-02-11 엘지전자 주식회사 Mobile device having at least one microphone sensor and control method thereof
US9423870B2 (en) 2012-05-08 2016-08-23 Google Inc. Input determination method
US8732560B2 (en) 2012-05-08 2014-05-20 Infineon Technologies Ag Method and device for correction of ternary stored binary data
WO2013169842A2 (en) 2012-05-09 2013-11-14 Yknots Industries Llc Device, method, and graphical user interface for selecting object within a group of objects
US8725808B2 (en) 2012-05-10 2014-05-13 Intel Mobile Communications GmbH Method for transferring data between a first device and a second device
JP5996262B2 (ja) 2012-05-11 2016-09-21 シャープ株式会社 Character input device, electronic apparatus, control method, control program, and recording medium
US9002768B2 (en) 2012-05-12 2015-04-07 Mikhail Fedorov Human-computer interface system
US8897822B2 (en) 2012-05-13 2014-11-25 Wavemarket, Inc. Auto responder
US9280610B2 (en) 2012-05-14 2016-03-08 Apple Inc. Crowd sourcing information to fulfill user requests
US20130308922A1 (en) 2012-05-15 2013-11-21 Microsoft Corporation Enhanced video discovery and productivity through accessibility
US10417037B2 (en) 2012-05-15 2019-09-17 Apple Inc. Systems and methods for integrating third party services with a digital assistant
US20130307855A1 (en) 2012-05-16 2013-11-21 Mathew J. Lamb Holographic story telling
US9247306B2 (en) 2012-05-21 2016-01-26 Intellectual Ventures Fund 83 Llc Forming a multimedia product using video chat
US8484573B1 (en) 2012-05-23 2013-07-09 Google Inc. Predictive virtual keyboard
US9173074B2 (en) 2012-05-27 2015-10-27 Qualcomm Incorporated Personal hub presence and response
KR20130133629A (ko) 2012-05-29 2013-12-09 삼성전자주식회사 Apparatus and method for executing a voice command in an electronic device
US20130325436A1 (en) 2012-05-29 2013-12-05 Wright State University Large Scale Distributed Syntactic, Semantic and Lexical Language Models
US20130325447A1 (en) 2012-05-31 2013-12-05 Elwha LLC, a limited liability corporation of the State of Delaware Speech recognition adaptation systems based on adaptation data
US9620128B2 (en) 2012-05-31 2017-04-11 Elwha Llc Speech recognition adaptation systems based on adaptation data
US8768693B2 (en) 2012-05-31 2014-07-01 Yahoo! Inc. Automatic tag extraction from audio annotated photos
US9123338B1 (en) 2012-06-01 2015-09-01 Google Inc. Background audio identification for speech disambiguation
US8725823B2 (en) 2012-06-05 2014-05-13 Forget You Not, LLC Location-based communications
US9230556B2 (en) 2012-06-05 2016-01-05 Apple Inc. Voice instructions during navigation
US8515750B1 (en) 2012-06-05 2013-08-20 Google Inc. Realtime acoustic adaptation using stability measures
US9002380B2 (en) 2012-06-08 2015-04-07 Apple Inc. Proximity-based notifications in a mobile device
WO2013185109A2 (en) 2012-06-08 2013-12-12 Apple Inc. Systems and methods for recognizing textual identifiers within a plurality of words
US20130332168A1 (en) 2012-06-08 2013-12-12 Samsung Electronics Co., Ltd. Voice activated search and control for applications
US9721563B2 (en) 2012-06-08 2017-08-01 Apple Inc. Name recognition system
US20130332159A1 (en) 2012-06-08 2013-12-12 Apple Inc. Using fan throttling to enhance dictation accuracy
US9230218B2 (en) 2012-06-08 2016-01-05 Spotify Ab Systems and methods for recognizing ambiguity in metadata
US9916514B2 (en) 2012-06-11 2018-03-13 Amazon Technologies, Inc. Text recognition driven functionality
EP2862102A4 (en) 2012-06-14 2016-01-27 Nokia Technologies Oy Method and apparatus for associating labels of interest with multimedia elements based on social diffusions between users
US9734839B1 (en) 2012-06-20 2017-08-15 Amazon Technologies, Inc. Routing natural language commands to the appropriate applications
US20140012574A1 (en) 2012-06-21 2014-01-09 Maluuba Inc. Interactive timeline for presenting and organizing tasks
US20130346347A1 (en) 2012-06-22 2013-12-26 Google Inc. Method to Predict a Communicative Action that is Most Likely to be Executed Given a Context
US20130346068A1 (en) 2012-06-25 2013-12-26 Apple Inc. Voice-Based Image Tagging and Searching
US20150201064A1 (en) 2012-06-26 2015-07-16 Blackberry Limited Methods and apparatus to detect and add impact events to a calendar program
US20140006153A1 (en) 2012-06-27 2014-01-02 Infosys Limited System for making personalized offers for business facilitation of an entity and methods thereof
KR101961139B1 (ko) 2012-06-28 2019-03-25 엘지전자 주식회사 Mobile terminal and voice recognition method thereof
JP5852930B2 (ja) 2012-06-29 2016-02-03 Kddi株式会社 Input character estimation device and program
US9996628B2 (en) 2012-06-29 2018-06-12 Verisign, Inc. Providing audio-activated resource access for user devices based on speaker voiceprint
US9495129B2 (en) 2012-06-29 2016-11-15 Apple Inc. Device, method, and user interface for voice-activated navigation and browsing of a document
US20140006012A1 (en) 2012-07-02 2014-01-02 Microsoft Corporation Learning-Based Processing of Natural Language Questions
US9536528B2 (en) 2012-07-03 2017-01-03 Google Inc. Determining hotword suitability
US9064493B2 (en) 2012-07-09 2015-06-23 Nuance Communications, Inc. Detecting potential significant errors in speech recognition results
CN103544140A (zh) 2012-07-12 2014-01-29 国际商业机器公司 Data processing method, presentation method, and corresponding apparatus
US9053708B2 (en) 2012-07-18 2015-06-09 International Business Machines Corporation System, method and program product for providing automatic speech recognition (ASR) in a shared resource environment
US20140026101A1 (en) 2012-07-20 2014-01-23 Barnesandnoble.Com Llc Accessible Menu Navigation Techniques For Electronic Devices
US9953584B2 (en) 2012-07-24 2018-04-24 Nook Digital, Llc Lighting techniques for display devices
US8838436B2 (en) 2012-07-25 2014-09-16 Aro, Inc. Labeling context slices to produce a storyline from mobile device data
JP2014026629A (ja) 2012-07-26 2014-02-06 Panasonic Corp Input device and input support method
US8589911B1 (en) 2012-07-26 2013-11-19 Google Inc. Intent fulfillment
US8442821B1 (en) 2012-07-27 2013-05-14 Google Inc. Multi-frame prediction for hybrid neural network/hidden Markov models
US9465833B2 (en) 2012-07-31 2016-10-11 Veveo, Inc. Disambiguating user intent in conversational interaction system for large corpus information retrieval
US8831957B2 (en) 2012-08-01 2014-09-09 Google Inc. Speech recognition models based on location indicia
US20140035823A1 (en) 2012-08-01 2014-02-06 Apple Inc. Dynamic Context-Based Language Determination
US9390174B2 (en) 2012-08-08 2016-07-12 Google Inc. Search result ranking and presentation
KR20150046100A (ko) 2012-08-10 2015-04-29 뉘앙스 커뮤니케이션즈, 인코포레이티드 Virtual agent communication for electronic devices
US20140052791A1 (en) 2012-08-14 2014-02-20 International Business Machines Corporation Task Based Filtering of Unwanted Electronic Communications
US10163058B2 (en) 2012-08-14 2018-12-25 Sri International Method, system and device for inferring a mobile user's current context and proactively providing assistance
US9292487B1 (en) 2012-08-16 2016-03-22 Amazon Technologies, Inc. Discriminative language model pruning
KR101922464B1 (ko) 2012-08-16 2018-11-27 삼성전자주식회사 Method for transmitting and receiving messages and electronic device thereof
KR20150045404A (ko) 2012-08-16 2015-04-28 뉘앙스 커뮤니케이션즈, 인코포레이티드 User interface for entertainment systems
US9497515B2 (en) 2012-08-16 2016-11-15 Nuance Communications, Inc. User interface for entertainment systems
US9229924B2 (en) 2012-08-24 2016-01-05 Microsoft Technology Licensing, Llc Word detection and domain dictionary recommendation
WO2014029099A1 (en) 2012-08-24 2014-02-27 Microsoft Corporation I-vector based clustering training data in speech recognition
US9049295B1 (en) 2012-08-28 2015-06-02 West Corporation Intelligent interactive voice response system for processing customer communications
JP6393021B2 (ja) 2012-08-28 2018-09-19 京セラ株式会社 Electronic device, control method, and control program
US9026425B2 (en) 2012-08-28 2015-05-05 Xerox Corporation Lexical and phrasal feature domain adaptation in statistical machine translation
EP2891347B1 (en) 2012-08-28 2019-10-09 Nokia Technologies Oy Discovery method and system for discovery
KR102081925B1 (ko) 2012-08-29 2020-02-26 엘지전자 주식회사 Display device and speech search method
US9218333B2 (en) 2012-08-31 2015-12-22 Microsoft Technology Licensing, Llc Context sensitive auto-correction
US8826415B2 (en) 2012-09-04 2014-09-02 Apple Inc. Automated device access
US9325809B1 (en) 2012-09-07 2016-04-26 Mindmeld, Inc. Audio recall during voice conversations
US9536049B2 (en) 2012-09-07 2017-01-03 Next It Corporation Conversational virtual healthcare assistant
US20140074466A1 (en) 2012-09-10 2014-03-13 Google Inc. Answering questions using environmental context
US20150088523A1 (en) 2012-09-10 2015-03-26 Google Inc. Systems and Methods for Designing Voice Applications
US20140074470A1 (en) 2012-09-11 2014-03-13 Google Inc. Phonetic pronunciation
US20140074472A1 (en) 2012-09-12 2014-03-13 Chih-Hung Lin Voice control system with portable voice control device
US20140078065A1 (en) 2012-09-15 2014-03-20 Ahmet Akkok Predictive Keyboard With Suppressed Keys
US9081482B1 (en) 2012-09-18 2015-07-14 Google Inc. Text input suggestion ranking
JP6057637B2 (ja) 2012-09-18 2017-01-11 株式会社アイ・オー・データ機器 Portable information terminal device, function switching method, and function switching program
US10042603B2 (en) 2012-09-20 2018-08-07 Samsung Electronics Co., Ltd. Context aware service provision method and apparatus of user device
US9076450B1 (en) 2012-09-21 2015-07-07 Amazon Technologies, Inc. Directed audio for speech recognition
US9092415B2 (en) 2012-09-25 2015-07-28 Rovi Guides, Inc. Systems and methods for automatic program recommendations based on user interactions
US8983383B1 (en) 2012-09-25 2015-03-17 Rawles Llc Providing hands-free service to multiple devices
US8983836B2 (en) 2012-09-26 2015-03-17 International Business Machines Corporation Captioning using socially derived acoustic profiles
CA2886566A1 (en) 2012-09-27 2014-04-03 John Joseph Geyer Mobile device context incorporating near field communications
US8498864B1 (en) 2012-09-27 2013-07-30 Google Inc. Methods and systems for predicting a text
JP2014072586A (ja) 2012-09-27 2014-04-21 Sharp Corp Display device, display method, television receiver, program, and recording medium
US10096316B2 (en) 2013-11-27 2018-10-09 Sri International Sharing intents to provide virtual assistance in a multi-person dialog
US20140095172A1 (en) 2012-10-01 2014-04-03 Nuance Communications, Inc. Systems and methods for providing a voice agent user interface
US10276157B2 (en) 2012-10-01 2019-04-30 Nuance Communications, Inc. Systems and methods for providing a voice agent user interface
US20140095171A1 (en) 2012-10-01 2014-04-03 Nuance Communications, Inc. Systems and methods for providing a voice agent user interface
US9230560B2 (en) 2012-10-08 2016-01-05 Nant Holdings Ip, Llc Smart home automation systems and methods
US8606568B1 (en) 2012-10-10 2013-12-10 Google Inc. Evaluating pronouns in context
JP6066471B2 (ja) 2012-10-12 2017-01-25 本田技研工業株式会社 Dialogue system and method for discriminating utterances for a dialogue system
US8843845B2 (en) 2012-10-16 2014-09-23 Google Inc. Multi-gesture text input prediction
US20150241962A1 (en) 2012-10-22 2015-08-27 Vid Scale, Inc. User presence detection in mobile devices
US8527276B1 (en) 2012-10-25 2013-09-03 Google Inc. Speech synthesis using deep neural networks
US9305439B2 (en) 2012-10-25 2016-04-05 Google Inc. Configurable indicator on computing device
US20140122086A1 (en) 2012-10-26 2014-05-01 Microsoft Corporation Augmenting speech recognition with depth imaging
US10304465B2 (en) 2012-10-30 2019-05-28 Google Technology Holdings LLC Voice control user interface for low power mode
US9734151B2 (en) 2012-10-31 2017-08-15 Tivo Solutions Inc. Method and system for voice based media search
US20140122153A1 (en) 2012-10-31 2014-05-01 DoWhatILikeBest, LLC Favorite and serendipitous event correlation and notification
US9093069B2 (en) 2012-11-05 2015-07-28 Nuance Communications, Inc. Privacy-sensitive speech model creation via aggregation of multiple user models
JP6018881B2 (ja) 2012-11-07 2016-11-02 株式会社日立製作所 Navigation device and navigation method
US9247387B2 (en) 2012-11-13 2016-01-26 International Business Machines Corporation Proximity based reminders
KR20140060995A (ko) 2012-11-13 2014-05-21 삼성전자주식회사 Method for providing context-specific rejection messages and terminal supporting the same
US9275642B2 (en) 2012-11-13 2016-03-01 Unified Computer Intelligence Corporation Voice-operated internet-ready ubiquitous computing device and method thereof
US9235321B2 (en) 2012-11-14 2016-01-12 Facebook, Inc. Animation sequence associated with content item
KR101709187B1 (ko) 2012-11-14 2017-02-23 한국전자통신연구원 Spoken dialogue system based on dual dialogue management using a hierarchical dialogue task library
US9798799B2 (en) 2012-11-15 2017-10-24 Sri International Vehicle personal assistant that interprets spoken natural language input based upon vehicle context
US9032219B2 (en) 2012-11-16 2015-05-12 Nuance Communications, Inc. Securing speech recognition data
US8965754B2 (en) 2012-11-20 2015-02-24 International Business Machines Corporation Text prediction using environment hints
US10551928B2 (en) 2012-11-20 2020-02-04 Samsung Electronics Company, Ltd. GUI transitions on wearable electronic device
JP2014102669A (ja) 2012-11-20 2014-06-05 Toshiba Corp Information processing device, information processing method, and program
US9756049B2 (en) 2012-11-22 2017-09-05 8303142 Canada Inc. System and method for managing several mobile devices simultaneously
US9875741B2 (en) 2013-03-15 2018-01-23 Google Llc Selective speech recognition for chat and digital personal assistant systems
WO2014209157A1 (en) 2013-06-27 2014-12-31 Obschestvo S Ogranichennoy Otvetstvennostiyu "Speaktoit" Generating dialog recommendations for chat information systems
US20140146200A1 (en) 2012-11-28 2014-05-29 Research In Motion Limited Entries to an electronic calendar
AU2013352236B2 (en) 2012-11-29 2018-08-02 Edsense, L.L.C. System and method for displaying multiple applications
US9589149B2 (en) 2012-11-30 2017-03-07 Microsoft Technology Licensing, Llc Combining personalization and privacy locally on devices
US9549323B2 (en) 2012-12-03 2017-01-17 Samsung Electronics Co., Ltd. Method and mobile terminal for controlling screen lock
US9819786B2 (en) 2012-12-05 2017-11-14 Facebook, Inc. Systems and methods for a symbol-adaptable keyboard
US20140164476A1 (en) 2012-12-06 2014-06-12 At&T Intellectual Property I, Lp Apparatus and method for providing a virtual assistant
US8930181B2 (en) 2012-12-06 2015-01-06 Prashant Parikh Automatic dynamic contextual data entry completion
US9244905B2 (en) 2012-12-06 2016-01-26 Microsoft Technology Licensing, Llc Communication context based predictive-text suggestion
US20140163951A1 (en) 2012-12-07 2014-06-12 Xerox Corporation Hybrid adaptation of named entity recognition
US9697827B1 (en) 2012-12-11 2017-07-04 Amazon Technologies, Inc. Error reduction in speech processing
US9148394B2 (en) 2012-12-11 2015-09-29 Nuance Communications, Inc. Systems and methods for user interface presentation of virtual agent
US20140164532A1 (en) 2012-12-11 2014-06-12 Nuance Communications, Inc. Systems and methods for virtual agent participation in multiparty conversation
US9190057B2 (en) 2012-12-12 2015-11-17 Amazon Technologies, Inc. Speech model retrieval in distributed speech recognition systems
US9117450B2 (en) 2012-12-12 2015-08-25 Nuance Communications, Inc. Combining re-speaking, partial agent transcription and ASR for improved accuracy / human guided ASR
KR102014778B1 (ko) 2012-12-14 2019-08-27 엘지전자 주식회사 Digital device providing a text messaging service and control method thereof
CN110223495A (zh) 2012-12-18 2019-09-10 三星电子株式会社 Method and apparatus for remotely controlling home devices in a home network system
US9098467B1 (en) 2012-12-19 2015-08-04 Rawles Llc Accepting voice commands based on user identity
US9070366B1 (en) 2012-12-19 2015-06-30 Amazon Technologies, Inc. Architecture for multi-domain utterance processing
US8977555B2 (en) 2012-12-20 2015-03-10 Amazon Technologies, Inc. Identification of utterance subjects
US8645138B1 (en) 2012-12-20 2014-02-04 Google Inc. Two-pass decoding for speech recognition of search and action requests
WO2014096506A1 (en) 2012-12-21 2014-06-26 Nokia Corporation Method, apparatus, and computer program product for personalizing speech recognition
KR20140082157A (ko) 2012-12-24 2014-07-02 한국전자통신연구원 Apparatus and method for recognizing speech using multiple acoustic models
JP2014126600A (ja) 2012-12-25 2014-07-07 Panasonic Corp Speech recognition device, speech recognition method, and television
JP2014124332A (ja) 2012-12-26 2014-07-07 Daiichi Shokai Co Ltd Game machine
US8571851B1 (en) 2012-12-31 2013-10-29 Google Inc. Semantic interpretation using user gaze order
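CN103020047A (zh) 2012-12-31 2013-04-03 威盛电子股份有限公司 Method for correcting a voice response and natural language dialogue system
KR20140093303A (ko) 2013-01-07 2014-07-28 삼성전자주식회사 Display apparatus and control method thereof
KR20140089862A (ko) 2013-01-07 2014-07-16 삼성전자주식회사 Display apparatus and control method thereof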
US20140195233A1 (en) 2013-01-08 2014-07-10 Spansion Llc Distributed Speech Recognition System
CN104919278B (zh) 2013-01-09 2017-09-19 三菱电机株式会社 Speech recognition device and display method
US20140198047A1 (en) 2013-01-14 2014-07-17 Nuance Communications, Inc. Reducing error rates for touch based keyboards
US8731912B1 (en) 2013-01-16 2014-05-20 Google Inc. Delaying audio notifications
US9292489B1 (en) 2013-01-16 2016-03-22 Google Inc. Sub-lexical language models with word level pronunciation lexicons
US20140203939A1 (en) 2013-01-21 2014-07-24 Rtc Inc. Control and monitoring of light-emitting-diode (led) bulbs
US9047274B2 (en) 2013-01-21 2015-06-02 Xerox Corporation Machine translation-driven authoring system and method
US9530409B2 (en) 2013-01-23 2016-12-27 Blackberry Limited Event-triggered hands-free multitasking for media playback
US9165566B2 (en) 2013-01-24 2015-10-20 Microsoft Technology Licensing, Llc Indefinite speech inputs
DE102013001219B4 (de) 2013-01-25 2019-08-29 Inodyn Newmedia Gmbh Method and system for voice activation of a software agent from a standby mode
JP6251958B2 (ja) 2013-01-28 2017-12-27 富士通株式会社 Utterance analysis device, voice dialogue control device, method, and program
KR20140098947A (ko) 2013-01-31 2014-08-11 삼성전자주식회사 Advertisement providing system, user terminal, and advertisement providing method
JP2014150323A (ja) 2013-01-31 2014-08-21 Sharp Corp Character input device
US10055091B2 (en) 2013-02-01 2018-08-21 Microsoft Technology Licensing, Llc Autosave and manual save modes for software applications
US20140218372A1 (en) 2013-02-05 2014-08-07 Apple Inc. Intelligent digital assistant in a desktop environment
US8694315B1 (en) 2013-02-05 2014-04-08 Visa International Service Association System and method for authentication using speaker verification techniques and fraud model
US20140223481A1 (en) 2013-02-07 2014-08-07 United Video Properties, Inc. Systems and methods for updating a search request
JP2016508007A (ja) 2013-02-07 2016-03-10 アップル インコーポレイテッド Voice trigger for a digital assistant
US10078437B2 (en) 2013-02-20 2018-09-18 Blackberry Limited Method and apparatus for responding to a notification via a capacitive physical keyboard
US20140236986A1 (en) 2013-02-21 2014-08-21 Apple Inc. Natural language document search
US9734819B2 (en) 2013-02-21 2017-08-15 Google Technology Holdings LLC Recognizing accented speech
US9621619B2 (en) 2013-02-21 2017-04-11 International Business Machines Corporation Enhanced notification for relevant communications
US20140245140A1 (en) 2013-02-22 2014-08-28 Next It Corporation Virtual Assistant Transfer between Smart Devices
US9484023B2 (en) 2013-02-22 2016-11-01 International Business Machines Corporation Conversion of non-back-off language models for efficient speech decoding
US9894312B2 (en) 2013-02-22 2018-02-13 The Directv Group, Inc. Method and system for controlling a user receiving device using voice commands
US9172747B2 (en) 2013-02-25 2015-10-27 Artificial Solutions Iberia SL System and methods for virtual assistant networks
US9865266B2 (en) 2013-02-25 2018-01-09 Nuance Communications, Inc. Method and apparatus for automated speaker parameters adaptation in a deployed speaker verification system
KR101383552B1 (ko) 2013-02-25 2014-04-10 미디어젠(주) Speech recognition method for a single sentence containing multiple commands
US9330659B2 (en) 2013-02-25 2016-05-03 Microsoft Technology Licensing, Llc Facilitating development of a spoken natural language interface
US9280981B2 (en) 2013-02-27 2016-03-08 Blackberry Limited Method and apparatus for voice control of a mobile device
US9218819B1 (en) 2013-03-01 2015-12-22 Google Inc. Customizing actions based on contextual data and voice-based inputs
US9251467B2 (en) 2013-03-03 2016-02-02 Microsoft Technology Licensing, Llc Probabilistic parsing
US9554050B2 (en) 2013-03-04 2017-01-24 Apple Inc. Mobile device using images and location for reminders
US9460715B2 (en) 2013-03-04 2016-10-04 Amazon Technologies, Inc. Identification using audio signatures and additional characteristics
US9454957B1 (en) 2013-03-05 2016-09-27 Amazon Technologies, Inc. Named entity resolution in spoken language processing
US9293129B2 (en) 2013-03-05 2016-03-22 Microsoft Technology Licensing, Llc Speech recognition assisted evaluation on text-to-speech pronunciation issue detection
KR101952179B1 (ko) 2013-03-05 2019-05-22 엘지전자 주식회사 Mobile terminal and control method thereof
US10795528B2 (en) 2013-03-06 2020-10-06 Nuance Communications, Inc. Task assistant having multiple visual displays
US9990611B2 (en) 2013-03-08 2018-06-05 Baydin, Inc. Systems and methods for incorporating calendar functionality into electronic messages
US9496968B2 (en) 2013-03-08 2016-11-15 Google Inc. Proximity detection by mobile devices
US9112984B2 (en) 2013-03-12 2015-08-18 Nuance Communications, Inc. Methods and apparatus for detecting a voice command
US11393461B2 (en) 2013-03-12 2022-07-19 Cerence Operating Company Methods and apparatus for detecting a voice command
US9361885B2 (en) 2013-03-12 2016-06-07 Nuance Communications, Inc. Methods and apparatus for detecting a voice command
WO2014159581A1 (en) 2013-03-12 2014-10-02 Nuance Communications, Inc. Methods and apparatus for detecting a voice command
US9477753B2 (en) 2013-03-12 2016-10-25 International Business Machines Corporation Classifier-based system combination for spoken term detection
US9076459B2 (en) 2013-03-12 2015-07-07 Intermec Ip, Corp. Apparatus and method to classify sound to detect speech
US10229697B2 (en) 2013-03-12 2019-03-12 Google Technology Holdings LLC Apparatus and method for beamforming to obtain voice and noise signals
US9129013B2 (en) 2013-03-12 2015-09-08 Nuance Communications, Inc. Methods and apparatus for entity detection
US9135248B2 (en) 2013-03-13 2015-09-15 Arris Technology, Inc. Context demographic determination system
US10219100B2 (en) 2013-03-13 2019-02-26 Aliphcom Determining proximity for devices interacting with media devices
US9282423B2 (en) 2013-03-13 2016-03-08 Aliphcom Proximity and interface controls of media devices for media presentations
US9378739B2 (en) 2013-03-13 2016-06-28 Nuance Communications, Inc. Identifying corresponding positions in different representations of a textual work
US20140278349A1 (en) 2013-03-14 2014-09-18 Microsoft Corporation Language Model Dictionaries for Text Predictions
KR20140112910A (ko) 2013-03-14 2014-09-24 삼성전자주식회사 Input control method and electronic device supporting the same
US20140267599A1 (en) 2013-03-14 2014-09-18 360Brandvision, Inc. User interaction with a holographic poster via a secondary mobile device
US9189196B2 (en) 2013-03-14 2015-11-17 Google Inc. Compartmentalized self registration of external devices
US10572476B2 (en) 2013-03-14 2020-02-25 Apple Inc. Refining a search based on schedule items
US9733821B2 (en) 2013-03-14 2017-08-15 Apple Inc. Voice control to diagnose inadvertent activation of accessibility features
WO2014144579A1 (en) 2013-03-15 2014-09-18 Apple Inc. System and method for updating an adaptive speech recognition model
EP2973315A4 (en) 2013-03-15 2016-11-16 Adityo Prakash Systems and methods for facilitating integrated behavioral support
US9189157B2 (en) 2013-03-15 2015-11-17 Blackberry Limited Method and apparatus for word prediction selection
CN112230878A (zh) 2013-03-15 2021-01-15 苹果公司 Context-sensitive handling of interruptions
KR101759009B1 (ko) 2013-03-15 2017-07-17 애플 인크. Training an at least partial voice command system
US9378065B2 (en) 2013-03-15 2016-06-28 Advanced Elemental Technologies, Inc. Purposeful computing
CN105190607B (zh) 2013-03-15 2018-11-30 苹果公司 User training by intelligent digital assistant
US9558743B2 (en) 2013-03-15 2017-01-31 Google Inc. Integration of semantic context information
CN105431809B (zh) 2013-03-15 2018-12-18 谷歌有限责任公司 Virtual keyboard input for international languages
US9176649B2 (en) 2013-03-15 2015-11-03 American Megatrends, Inc. Method and apparatus of remote management of computer system using voice and gesture based input
WO2014143959A2 (en) 2013-03-15 2014-09-18 Bodhi Technology Ventures Llc Volume control for mobile device using a wireless device
US9201865B2 (en) 2013-03-15 2015-12-01 Bao Tran Automated assistance for user request that determines semantics by domain, task, and parameter
US10638198B2 (en) 2013-03-15 2020-04-28 Ebay Inc. Shoppable video
US9299041B2 (en) 2013-03-15 2016-03-29 Business Objects Software Ltd. Obtaining data from unstructured data for a structured data collection
US9886160B2 (en) 2013-03-15 2018-02-06 Google Llc Managing audio at the tab level for user notification and control
US9479499B2 (en) 2013-03-21 2016-10-25 Tencent Technology (Shenzhen) Company Limited Method and apparatus for identity authentication via mobile capturing code
JP6221301B2 (ja) 2013-03-28 2017-11-01 富士通株式会社 Speech processing device, speech processing system, and speech processing method
US20140297288A1 (en) 2013-03-29 2014-10-02 Orange Telephone voice personal assistant
JP2014203207A (ja) 2013-04-03 2014-10-27 ソニー株式会社 Information processing apparatus, information processing method, and computer program
US9300718B2 (en) 2013-04-09 2016-03-29 Avaya Inc. System and method for keyword-based notification and delivery of content
WO2014169269A1 (en) 2013-04-12 2014-10-16 Nant Holdings Ip, Llc Virtual teller systems and methods
US8825474B1 (en) 2013-04-16 2014-09-02 Google Inc. Text suggestion output using past interaction data
US9875494B2 (en) 2013-04-16 2018-01-23 Sri International Using intents to analyze and personalize a user's dialog experience with a virtual personal assistant
US9760644B2 (en) 2013-04-17 2017-09-12 Google Inc. Embedding event creation link in a document
US20150193392A1 (en) 2013-04-17 2015-07-09 Google Inc. User Interface for Quickly Checking Agenda and Creating New Events
NL2010662C2 (en) 2013-04-18 2014-10-21 Bosch Gmbh Robert Remote maintenance.
US10445115B2 (en) 2013-04-18 2019-10-15 Verint Americas Inc. Virtual assistant focused user interfaces
US9177318B2 (en) 2013-04-22 2015-11-03 Palo Alto Research Center Incorporated Method and apparatus for customizing conversation agents based on user characteristics using a relevance score for automatic statements, and a response prediction function
US9384751B2 (en) 2013-05-06 2016-07-05 Honeywell International Inc. User authentication of voice controlled devices
KR20140132246A (ko) 2013-05-07 2014-11-17 삼성전자주식회사 Object selection method and object selection apparatus
US9223898B2 (en) 2013-05-08 2015-12-29 Facebook, Inc. Filtering suggested structured queries on online social networks
US9923849B2 (en) 2013-05-09 2018-03-20 Ebay Inc. System and method for suggesting a phrase based on a context
US9489625B2 (en) 2013-05-10 2016-11-08 Sri International Rapid development of virtual personal assistant applications
US9081411B2 (en) 2013-05-10 2015-07-14 Sri International Rapid development of virtual personal assistant applications
US20140337751A1 (en) 2013-05-13 2014-11-13 Microsoft Corporation Automatic creation of calendar items
US9495266B2 (en) 2013-05-16 2016-11-15 Advantest Corporation Voice recognition virtual test engineering assistant
US20140344687A1 (en) 2013-05-16 2014-11-20 Lenitra Durham Techniques for Natural User Interface Input based on Context
KR101334342B1 (ko) 2013-05-16 2013-11-29 주식회사 네오패드 Character input device and character input method
US9432499B2 (en) 2013-05-18 2016-08-30 Loralee Hajdu Peripheral specific selection of automated response messages
WO2014189486A1 (en) 2013-05-20 2014-11-27 Intel Corporation Natural human-computer interaction for virtual personal assistant systems
US20150199077A1 (en) 2013-05-23 2015-07-16 Google Inc. Scheduling and viewing a calender event using time zones based on a user's location at event time
US9747900B2 (en) 2013-05-24 2017-08-29 Google Technology Holdings LLC Method and apparatus for using image data to aid voice recognition
US20140350933A1 (en) 2013-05-24 2014-11-27 Samsung Electronics Co., Ltd. Voice recognition apparatus and control method thereof
US20140351760A1 (en) 2013-05-24 2014-11-27 Google Inc. Order-independent text input
US20140358523A1 (en) 2013-05-30 2014-12-04 Wright State University Topic-specific sentiment extraction
US20140358519A1 (en) 2013-06-03 2014-12-04 Xerox Corporation Confidence-driven rewriting of source texts for improved translation
US9286029B2 (en) 2013-06-06 2016-03-15 Honda Motor Co., Ltd. System and method for multimodal human-vehicle interaction and belief tracking
KR102429833B1 (ko) 2013-06-07 2022-08-05 애플 인크. Intelligent automated assistant
US9582608B2 (en) 2013-06-07 2017-02-28 Apple Inc. Unified ranking with entropy-weighted information for phrase-based semantic auto-completion
WO2014197335A1 (en) 2013-06-08 2014-12-11 Apple Inc. Interpreting and acting upon commands that involve sharing information with remote devices
CN110442699A (zh) 2013-06-09 2019-11-12 苹果公司 Method for operating a digital assistant, computer-readable medium, electronic device, and system
US9449600B2 (en) 2013-06-11 2016-09-20 Plantronics, Inc. Character data entry
US9892115B2 (en) 2013-06-11 2018-02-13 Facebook, Inc. Translation training with cross-lingual multi-media support
US9508040B2 (en) 2013-06-12 2016-11-29 Microsoft Technology Licensing, Llc Predictive pre-launch for applications
CN105265005B (zh) 2013-06-13 2019-09-17 苹果公司 System and method for emergency calls initiated by voice command
US9728184B2 (en) 2013-06-18 2017-08-08 Microsoft Technology Licensing, Llc Restructuring deep neural network acoustic models
US9437186B1 (en) 2013-06-19 2016-09-06 Amazon Technologies, Inc. Enhanced endpoint detection for speech recognition
US20140379334A1 (en) 2013-06-20 2014-12-25 Qnx Software Systems Limited Natural language understanding automatic speech recognition post processing
KR102160767B1 (ko) 2013-06-20 2020-09-29 삼성전자주식회사 Mobile terminal and method for controlling a function by detecting a gesture
US9311298B2 (en) 2013-06-21 2016-04-12 Microsoft Technology Licensing, Llc Building conversational understanding systems using a toolset
US10496743B2 (en) 2013-06-26 2019-12-03 Nuance Communications, Inc. Methods and apparatus for extracting facts from a medical text
US9747899B2 (en) 2013-06-27 2017-08-29 Amazon Technologies, Inc. Detecting self-generated wake expressions
US8947596B2 (en) 2013-06-27 2015-02-03 Intel Corporation Alignment of closed captions
US20150006148A1 (en) 2013-06-27 2015-01-01 Microsoft Corporation Automatically Creating Training Data For Language Identifiers
US9741339B2 (en) 2013-06-28 2017-08-22 Google Inc. Data driven word pronunciation learning and scoring with crowd sourcing based on the word's phonemes pronunciation scores
EP3014610B1 (en) 2013-06-28 2023-10-04 Harman International Industries, Incorporated Wireless control of linked devices
US9646606B2 (en) 2013-07-03 2017-05-09 Google Inc. Speech recognition using domain knowledge
DE102014109121B4 (de) 2013-07-10 2023-05-04 Gm Global Technology Operations, Llc Systems and methods for arbitration of a spoken dialog service
US9396727B2 (en) 2013-07-10 2016-07-19 GM Global Technology Operations LLC Systems and methods for spoken dialog service arbitration
CN110096253B (zh) 2013-07-11 2022-08-30 英特尔公司 Device wake-up and speaker verification using the same audio input
TWI508057B (zh) 2013-07-15 2015-11-11 Chunghwa Picture Tubes Ltd Speech recognition system and method
US9311912B1 (en) 2013-07-22 2016-04-12 Amazon Technologies, Inc. Cost efficient distributed text-to-speech processing
US20150032238A1 (en) 2013-07-23 2015-01-29 Motorola Mobility Llc Method and Device for Audio Input Routing
US9335983B2 (en) 2013-07-28 2016-05-10 Oded Haim Breiner Method and system for displaying a non-installed android application and for requesting an action from a non-installed android application
US9575720B2 (en) 2013-07-31 2017-02-21 Google Inc. Visual confirmation for a recognized voice-initiated action
US9311915B2 (en) 2013-07-31 2016-04-12 Google Inc. Context-based speech recognition
TWI601032B (zh) 2013-08-02 2017-10-01 晨星半導體股份有限公司 Controller for a voice-controlled device and related method
JP6163266B2 (ja) 2013-08-06 2017-07-12 アップル インコーポレイテッド Automatic activation of smart responses based on activation from remote devices
ES2849254T3 (es) 2013-08-06 2021-08-17 Saronikos Trading & Services Unipessoal Lda System for controlling electronic devices by means of voice commands, more specifically a remote control for controlling a plurality of electronic devices by means of voice commands
JP2015041845A (ja) 2013-08-21 2015-03-02 カシオ計算機株式会社 Character input device and program
EP2862164B1 (en) 2013-08-23 2017-05-31 Nuance Communications, Inc. Multiple pass automatic speech recognition
KR102147935B1 (ko) 2013-08-29 2020-08-25 삼성전자주식회사 Data processing method and electronic device thereof
WO2015029379A1 (ja) 2013-08-29 2015-03-05 パナソニック インテレクチュアル プロパティ コーポレーション オブ アメリカ Device control method, display control method, and purchase settlement method
US20150066506A1 (en) 2013-08-30 2015-03-05 Verint Systems Ltd. System and Method of Text Zoning
CN112989840A (zh) 2013-08-30 2021-06-18 英特尔公司 Extensible context-aware natural language interactions for virtual personal assistants
US10867597B2 (en) 2013-09-02 2020-12-15 Microsoft Technology Licensing, Llc Assignment of semantic labels to a sequence of words using neural network architectures
US9633669B2 (en) 2013-09-03 2017-04-25 Amazon Technologies, Inc. Smart circular audio buffer
US9316400B2 (en) 2013-09-03 2016-04-19 Panasonic Intellectual Property Corporation of America Appliance control method, speech-based appliance control system, and cooking appliance
KR102065409B1 (ko) 2013-09-04 2020-01-13 엘지전자 주식회사 Mobile terminal and control method thereof
GB2517952B (en) 2013-09-05 2017-05-31 Barclays Bank Plc Biometric verification using predicted signatures
US9208779B2 (en) 2013-09-06 2015-12-08 Google Inc. Mixture of n-gram language models
US9460704B2 (en) 2013-09-06 2016-10-04 Google Inc. Deep networks for unit selection speech synthesis
US20150074524A1 (en) 2013-09-10 2015-03-12 Lenovo (Singapore) Pte. Ltd. Management of virtual assistant action items
CN104700832B (zh) 2013-12-09 2018-05-25 联发科技股份有限公司 Voice keyword detection system and method
EP3047481A4 (en) 2013-09-20 2017-03-01 Amazon Technologies Inc. Local and remote speech processing
US20150088511A1 (en) 2013-09-24 2015-03-26 Verizon Patent And Licensing Inc. Named-entity based speech recognition
US10134395B2 (en) 2013-09-25 2018-11-20 Amazon Technologies, Inc. In-call virtual assistants
CN104516522B (zh) 2013-09-29 2018-05-01 北京三星通信技术研究有限公司 Method and apparatus for nine-grid keyboard input
US20150095031A1 (en) 2013-09-30 2015-04-02 At&T Intellectual Property I, L.P. System and method for crowdsourcing of word pronunciation verification
US20150095278A1 (en) 2013-09-30 2015-04-02 Manyworlds, Inc. Adaptive Probabilistic Semantic System and Method
US20150100537A1 (en) 2013-10-03 2015-04-09 Microsoft Corporation Emoji for Text Predictions
US20150100983A1 (en) 2013-10-06 2015-04-09 Yang Pan Personal Mobile Device as Ad hoc Set-Top Box for Television
US9436918B2 (en) 2013-10-07 2016-09-06 Microsoft Technology Licensing, Llc Smart selection of text spans
US8996639B1 (en) 2013-10-15 2015-03-31 Google Inc. Predictive responses to incoming communications
US9063640B2 (en) 2013-10-17 2015-06-23 Spotify Ab System and method for switching between media items in a plurality of sequences of media items
US20150120723A1 (en) 2013-10-24 2015-04-30 Xerox Corporation Methods and systems for processing speech queries
US10055681B2 (en) 2013-10-31 2018-08-21 Verint Americas Inc. Mapping actions and objects to tasks
US9183830B2 (en) 2013-11-01 2015-11-10 Google Inc. Method and system for non-parametric voice conversion
US10088973B2 (en) 2013-11-08 2018-10-02 Google Llc Event scheduling presentation in a graphical user interface environment
JP6493866B2 (ja) 2013-11-12 2019-04-03 インターナショナル・ビジネス・マシーンズ・コーポレーション (International Business Machines Corporation) Information processing apparatus, information processing method, and program
GB2520266A (en) 2013-11-13 2015-05-20 Ibm Cursor-Based Character input interface
US10430024B2 (en) 2013-11-13 2019-10-01 Microsoft Technology Licensing, Llc Media item selection using user-specific grammar
US9361084B1 (en) 2013-11-14 2016-06-07 Google Inc. Methods and systems for installing and executing applications
US10454783B2 (en) 2014-02-05 2019-10-22 Apple Inc. Accessory management system using environment model
US9443522B2 (en) 2013-11-18 2016-09-13 Beijing Lenovo Software Ltd. Voice recognition method, voice controlling method, information processing method, and electronic apparatus
US9898554B2 (en) 2013-11-18 2018-02-20 Google Inc. Implicit question query identification
US10162813B2 (en) 2013-11-21 2018-12-25 Microsoft Technology Licensing, Llc Dialogue evaluation via multiple hypothesis ranking
US20150149354A1 (en) 2013-11-27 2015-05-28 Bank Of America Corporation Real-Time Data Recognition and User Interface Field Updating During Voice Entry
US10079013B2 (en) 2013-11-27 2018-09-18 Sri International Sharing intents to provide virtual assistance in a multi-person dialog
US9451434B2 (en) 2013-11-27 2016-09-20 At&T Intellectual Property I, L.P. Direct interaction between a user and a communication network
US9698999B2 (en) 2013-12-02 2017-07-04 Amazon Technologies, Inc. Natural language control of secondary device
US9215510B2 (en) 2013-12-06 2015-12-15 Rovi Guides, Inc. Systems and methods for automatically tagging a media asset based on verbal input and playback adjustments
EP3077999B1 (en) 2013-12-06 2022-02-02 The ADT Security Corporation Voice activated application for mobile devices
US20150162001A1 (en) 2013-12-10 2015-06-11 Honeywell International Inc. System and method for textually and graphically presenting air traffic control voice information
US9208153B1 (en) 2013-12-13 2015-12-08 Symantec Corporation Filtering relevant event notifications in a file sharing and collaboration environment
EP3063646A4 (en) 2013-12-16 2017-06-21 Nuance Communications, Inc. Systems and methods for providing a virtual assistant
US20170017501A1 (en) 2013-12-16 2017-01-19 Nuance Communications, Inc. Systems and methods for providing a virtual assistant
US9571645B2 (en) 2013-12-16 2017-02-14 Nuance Communications, Inc. Systems and methods for providing a virtual assistant
US9804820B2 (en) 2013-12-16 2017-10-31 Nuance Communications, Inc. Systems and methods for providing a virtual assistant
WO2015092943A1 (en) 2013-12-17 2015-06-25 Sony Corporation Electronic devices and methods for compensating for environmental noise in text-to-speech applications
GB2523984B (en) 2013-12-18 2017-07-26 Cirrus Logic Int Semiconductor Ltd Processing received speech data
US10565268B2 (en) 2013-12-19 2020-02-18 Adobe Inc. Interactive communication augmented with contextual information
US9741343B1 (en) 2013-12-19 2017-08-22 Amazon Technologies, Inc. Voice interaction application selection
US20150221307A1 (en) 2013-12-20 2015-08-06 Saurin Shah Transition from low power always listening mode to high power speech recognition mode
KR102179506B1 (ko) 2013-12-23 2020-11-17 삼성전자 주식회사 Electronic apparatus and control method thereof
KR102092164B1 (ko) 2013-12-27 2020-03-23 삼성전자주식회사 Display apparatus, server apparatus, display system including them, and content providing methods thereof
US9640181B2 (en) 2013-12-27 2017-05-02 Kopin Corporation Text editing with gesture control and natural speech
US9460735B2 (en) 2013-12-28 2016-10-04 Intel Corporation Intelligent ancillary electronic device
US9390726B1 (en) 2013-12-30 2016-07-12 Google Inc. Supplementing speech commands with gestures
US10078489B2 (en) 2013-12-30 2018-09-18 Microsoft Technology Licensing, Llc Voice interface to a social networking service
US9152307B2 (en) 2013-12-31 2015-10-06 Google Inc. Systems and methods for simultaneously displaying clustered, in-line electronic messages in one display
US9274673B2 (en) 2013-12-31 2016-03-01 Google Inc. Methods, systems, and media for rewinding media content based on detected audio events
US9823811B2 (en) 2013-12-31 2017-11-21 Next It Corporation Virtual assistant team identification
US9424241B2 (en) 2013-12-31 2016-08-23 Barnes & Noble College Booksellers, Llc Annotation mode including multiple note types for paginated digital content
US9742836B2 (en) 2014-01-03 2017-08-22 Yahoo Holdings, Inc. Systems and methods for content delivery
US20150193379A1 (en) 2014-01-06 2015-07-09 Apple Inc. System and method for cognizant time-based reminders
US9924215B2 (en) 2014-01-09 2018-03-20 Hsni, Llc Digital media content management system and method
US9443516B2 (en) 2014-01-09 2016-09-13 Honeywell International Inc. Far-field speech recognition systems and methods
US8938394B1 (en) 2014-01-09 2015-01-20 Google Inc. Audio triggers based on context
US9514748B2 (en) 2014-01-15 2016-12-06 Microsoft Technology Licensing, Llc Digital personal assistant interaction with impersonations and rich multimedia in responses
US20150199965A1 (en) 2014-01-16 2015-07-16 CloudCar Inc. System and method for recognition and automatic correction of voice commands
US8868409B1 (en) 2014-01-16 2014-10-21 Google Inc. Evaluating transcriptions with a semantic parser
US9336300B2 (en) 2014-01-17 2016-05-10 Facebook, Inc. Client-side search templates for online social networks
US10171643B2 (en) 2014-01-22 2019-01-01 Sony Corporation Directing audio output based on gestures
US9858039B2 (en) 2014-01-28 2018-01-02 Oracle International Corporation Voice recognition of commands extracted from user interface screen devices
US11386886B2 (en) 2014-01-28 2022-07-12 Lenovo (Singapore) Pte. Ltd. Adjusting speech recognition using contextual information
US20160173960A1 (en) 2014-01-31 2016-06-16 EyeGroove, Inc. Methods and systems for generating audiovisual media items
EP3100259A4 (en) 2014-01-31 2017-08-30 Hewlett-Packard Development Company, L.P. Voice input command
US9292488B2 (en) 2014-02-01 2016-03-22 Soundhound, Inc. Method for embedding voice mail in a spoken utterance using a natural language processing computer system
US20160336007A1 (en) 2014-02-06 2016-11-17 Mitsubishi Electric Corporation Speech search device and speech search method
US20150228281A1 (en) 2014-02-07 2015-08-13 First Principles, Inc. Device, system, and method for active listening
US10083205B2 (en) 2014-02-12 2018-09-25 Samsung Electronics Co., Ltd. Query cards
US9037967B1 (en) 2014-02-18 2015-05-19 King Fahd University Of Petroleum And Minerals Arabic spell checking technique
US9589562B2 (en) 2014-02-21 2017-03-07 Microsoft Technology Licensing, Llc Pronunciation learning through correction logs
US10691292B2 (en) 2014-02-24 2020-06-23 Microsoft Technology Licensing, Llc Unified presentation of contextually connected information to improve user efficiency and interaction performance
US9495959B2 (en) 2014-02-27 2016-11-15 Ford Global Technologies, Llc Disambiguation of dynamic commands
US20150248651A1 (en) 2014-02-28 2015-09-03 Christine E. Akutagawa Social networking event planning
US9412363B2 (en) 2014-03-03 2016-08-09 Microsoft Technology Licensing, Llc Model based approach for on-screen item selection and disambiguation
US20150256873A1 (en) 2014-03-04 2015-09-10 Microsoft Technology Licensing, Llc Relayed voice control of devices
US9489171B2 (en) 2014-03-04 2016-11-08 Microsoft Technology Licensing, Llc Voice-command suggestions based on user identity
US9582246B2 (en) 2014-03-04 2017-02-28 Microsoft Technology Licensing, Llc Voice-command suggestions based on computer context
US9286910B1 (en) 2014-03-13 2016-03-15 Amazon Technologies, Inc. System for resolving ambiguous queries based on user context
US9430186B2 (en) 2014-03-17 2016-08-30 Google Inc. Visual indication of a recognized voice-initiated action
US9336306B2 (en) 2014-03-21 2016-05-10 International Business Machines Corporation Automatic evaluation and improvement of ontologies for natural language processing tasks
IN2014DE00899A (zh) 2014-03-28 2015-10-02 Samsung Electronics Co Ltd
US9196243B2 (en) 2014-03-31 2015-11-24 International Business Machines Corporation Method and system for efficient spoken term detection using confusion networks
US20150278370A1 (en) 2014-04-01 2015-10-01 Microsoft Corporation Task completion for natural language input
US9286892B2 (en) 2014-04-01 2016-03-15 Google Inc. Language modeling in speech recognition
KR101873671B1 (ko) 2014-04-02 2018-07-02 소니 주식회사 Power-efficient proximity detection
US20150286627A1 (en) 2014-04-03 2015-10-08 Adobe Systems Incorporated Contextual sentiment text analysis
KR20150115555A (ko) 2014-04-04 2015-10-14 삼성전자주식회사 Electronic device and information providing method thereof
KR102249086B1 (ko) 2014-04-04 2021-05-10 삼성전자주식회사 Electronic device and method for supporting recording
JP6282516B2 (ja) 2014-04-08 2018-02-21 パナソニック インテレクチュアル プロパティ コーポレーション オブ アメリカ (Panasonic Intellectual Property Corporation of America) Voice operation system for multiple devices, voice operation method, and program
US20150294516A1 (en) 2014-04-10 2015-10-15 Kuo-Ching Chiang Electronic device with security module
US9888452B2 (en) 2014-04-10 2018-02-06 Twin Harbor Labs Llc Methods and apparatus notifying a user of the operating condition of a household appliance
WO2015157013A1 (en) 2014-04-11 2015-10-15 Analog Devices, Inc. Apparatus, systems and methods for providing blind source separation services
US9582499B2 (en) 2014-04-14 2017-02-28 Xerox Corporation Retrieval of domain relevant phrase tables
CN108551675B (zh) 2014-04-14 2022-04-15 创新先进技术有限公司 Application client, server, and corresponding Portal authentication method
US20150294086A1 (en) 2014-04-14 2015-10-15 Elwha Llc Devices, systems, and methods for automated enhanced care rooms
US20150302856A1 (en) 2014-04-17 2015-10-22 Qualcomm Incorporated Method and apparatus for performing function by speech input
US10770075B2 (en) 2014-04-21 2020-09-08 Qualcomm Incorporated Method and apparatus for activating application by speech input
US9607613B2 (en) 2014-04-23 2017-03-28 Google Inc. Speech endpointing based on word comparisons
US20150310862A1 (en) 2014-04-24 2015-10-29 Microsoft Corporation Deep learning for semantic parsing including semantic utterance classification
US10845982B2 (en) 2014-04-28 2020-11-24 Facebook, Inc. Providing intelligent transcriptions of sound messages in a messaging application
US9520127B2 (en) 2014-04-29 2016-12-13 Microsoft Technology Licensing, Llc Shared hidden layer combination for speech recognition systems
US9600600B2 (en) 2014-04-30 2017-03-21 Excalibur Ip, Llc Method and system for evaluating query suggestions quality
KR102248474B1 (ko) 2014-04-30 2021-05-07 삼성전자 주식회사 Method and apparatus for providing voice commands
US9501163B2 (en) 2014-05-06 2016-11-22 Symbol Technologies, Llc Apparatus and method for activating a trigger mechanism
KR102282487B1 (ko) 2014-05-08 2021-07-26 삼성전자주식회사 Apparatus and method for executing an application
US9620105B2 (en) 2014-05-15 2017-04-11 Apple Inc. Analyzing audio input for efficient speech and music recognition
US9459889B2 (en) 2014-05-19 2016-10-04 Qualcomm Incorporated Systems and methods for context-aware application control
KR102216048B1 (ko) 2014-05-20 2021-02-15 삼성전자주식회사 Apparatus and method for recognizing voice commands
KR102223278B1 (ko) 2014-05-22 2021-03-05 엘지전자 주식회사 Glass-type terminal and control method thereof
US9990433B2 (en) 2014-05-23 2018-06-05 Samsung Electronics Co., Ltd. Method for searching and device thereof
US10592095B2 (en) 2014-05-23 2020-03-17 Apple Inc. Instantaneous speaking of content on touch devices
US9502031B2 (en) 2014-05-27 2016-11-22 Apple Inc. Method for supporting dynamic grammars in WFST-based ASR
US9437189B2 (en) 2014-05-29 2016-09-06 Google Inc. Generating language models
US9760559B2 (en) 2014-05-30 2017-09-12 Apple Inc. Predictive text input
US9519634B2 (en) 2014-05-30 2016-12-13 Educational Testing Service Systems and methods for determining lexical associations among words in a corpus
US10579212B2 (en) 2014-05-30 2020-03-03 Apple Inc. Structured suggestions
US9842101B2 (en) 2014-05-30 2017-12-12 Apple Inc. Predictive conversion of language input
US9430463B2 (en) 2014-05-30 2016-08-30 Apple Inc. Exemplar-based natural language processing
US10078631B2 (en) 2014-05-30 2018-09-18 Apple Inc. Entropy-guided text prediction using combined word and character n-gram language models
US9633004B2 (en) 2014-05-30 2017-04-25 Apple Inc. Better resolution when referencing to concepts
US10170123B2 (en) 2014-05-30 2019-01-01 Apple Inc. Intelligent assistant for home automation
TWI520007B (zh) 2014-05-30 2016-02-01 由田新技股份有限公司 Eye-controlled password input apparatus, method, computer-readable recording medium, and computer program product
US9715875B2 (en) 2014-05-30 2017-07-25 Apple Inc. Reducing the need for manual start/end-pointing and trigger phrases
US9537852B2 (en) 2014-06-04 2017-01-03 Sonos, Inc. Cloud queue access control
JP6307356B2 (ja) 2014-06-06 2018-04-04 株式会社デンソー Driving context information generation device
CN107113222B (zh) 2014-06-06 2020-09-01 谷歌有限责任公司 Proactive environment-based chat information system
US20150364140A1 (en) 2014-06-13 2015-12-17 Sony Corporation Portable Electronic Equipment and Method of Operating a User Interface
US9462112B2 (en) 2014-06-19 2016-10-04 Microsoft Technology Licensing, Llc Use of a digital assistant in communications
US10186282B2 (en) 2014-06-19 2019-01-22 Apple Inc. Robust end-pointing of speech signals using speaker recognition
US9632748B2 (en) 2014-06-24 2017-04-25 Google Inc. Device designation for audio input monitoring
US9384738B2 (en) 2014-06-24 2016-07-05 Google Inc. Dynamic threshold for speaker verification
US10659851B2 (en) 2014-06-30 2020-05-19 Apple Inc. Real-time digital assistant knowledge updates
US9338493B2 (en) 2014-06-30 2016-05-10 Apple Inc. Intelligent automated assistant for TV user interactions
US10321204B2 (en) 2014-07-11 2019-06-11 Lenovo (Singapore) Pte. Ltd. Intelligent closed captioning
KR20160009344A (ko) 2014-07-16 2016-01-26 삼성전자주식회사 Method and apparatus for recognizing whispered speech
US9257120B1 (en) 2014-07-18 2016-02-09 Google Inc. Speaker verification using co-location information
US9301256B2 (en) 2014-07-24 2016-03-29 Verizon Patent And Licensing Inc. Low battery indication for callers to mobile device
US20160028666A1 (en) 2014-07-24 2016-01-28 Framy Inc. System and method for instant messaging
US20160086116A1 (en) 2014-07-27 2016-03-24 Supriya Rao Method and system of an automatically managed calendar and contextual task list
US20160034811A1 (en) 2014-07-31 2016-02-04 Apple Inc. Efficient generation of complementary acoustic models for performing automatic speech recognition system combination
US9377871B2 (en) 2014-08-01 2016-06-28 Nuance Communications, Inc. System and methods for determining keyboard input in the presence of multiple contact points
US9767794B2 (en) 2014-08-11 2017-09-19 Nuance Communications, Inc. Dialog flow management in hierarchical task dialogs
US9548066B2 (en) 2014-08-11 2017-01-17 Amazon Technologies, Inc. Voice application architecture
US9361442B2 (en) 2014-08-12 2016-06-07 International Business Machines Corporation Triggering actions on a user device based on biometrics of nearby individuals
US10345767B2 (en) 2014-08-19 2019-07-09 Samsung Electronics Co., Ltd. Apparatus and method for gamification of sensor data interpretation in smart home
US20160055240A1 (en) 2014-08-22 2016-02-25 Microsoft Corporation Orphaned utterance detection system and method
US10446141B2 (en) 2014-08-28 2019-10-15 Apple Inc. Automatic speech recognition based on user feedback
US9990610B2 (en) 2014-08-29 2018-06-05 Google Llc Systems and methods for providing suggested reminders
US9959863B2 (en) 2014-09-08 2018-05-01 Qualcomm Incorporated Keyword detection using speaker-independent keyword models for user-designated keywords
WO2016037311A1 (en) 2014-09-09 2016-03-17 Microsoft Technology Licensing, Llc Variable-component deep neural network for robust speech recognition
US9818400B2 (en) 2014-09-11 2017-11-14 Apple Inc. Method and apparatus for discovering trending terms in speech requests
US10789041B2 (en) 2014-09-12 2020-09-29 Apple Inc. Dynamic thresholds for always listening speech trigger
US9508028B2 (en) 2014-09-24 2016-11-29 Nuance Communications, Inc. Converting text strings into number strings, such as via a touchscreen input
US10317992B2 (en) 2014-09-25 2019-06-11 Microsoft Technology Licensing, Llc Eye gaze for spoken language understanding in multi-modal conversational interactions
US9378740B1 (en) 2014-09-30 2016-06-28 Amazon Technologies, Inc. Command suggestions during automatic speech recognition
US9646609B2 (en) 2014-09-30 2017-05-09 Apple Inc. Caching apparatus for serving phonetic pronunciations
US9830321B2 (en) 2014-09-30 2017-11-28 Rovi Guides, Inc. Systems and methods for searching for a media asset
US10127911B2 (en) 2014-09-30 2018-11-13 Apple Inc. Speaker identification and unsupervised speaker adaptation techniques
US9668121B2 (en) 2014-09-30 2017-05-30 Apple Inc. Social reminders
US9886432B2 (en) 2014-09-30 2018-02-06 Apple Inc. Parsimonious handling of word inflection via categorical stem + suffix N-gram language models
US10074360B2 (en) 2014-09-30 2018-09-11 Apple Inc. Providing an indication of the suitability of speech recognition
US9318107B1 (en) 2014-10-09 2016-04-19 Google Inc. Hotword detection on multiple devices
US9741344B2 (en) 2014-10-20 2017-08-22 Vocalzoom Systems Ltd. System and method for operating devices using voice commands
US20160117386A1 (en) 2014-10-22 2016-04-28 International Business Machines Corporation Discovering terms using statistical corpus analysis
CN104460593B (zh) 2014-10-29 2017-10-10 小米科技有限责任公司 Mode switching method and apparatus
US9880714B2 (en) 2014-10-30 2018-01-30 Ebay Inc. Dynamic loading of contextual ontologies for predictive touch screen typing
CN105574067B (zh) 2014-10-31 2020-01-21 株式会社东芝 Item recommendation device and item recommendation method
GB2532075A (en) 2014-11-10 2016-05-11 Lego As System and method for toy recognition and detection based on convolutional neural networks
US9582493B2 (en) 2014-11-10 2017-02-28 Oracle International Corporation Lemma mapping to universal ontologies in computer natural language processing
US20160139662A1 (en) 2014-11-14 2016-05-19 Sachin Dabhade Controlling a visual device based on a proximity between a user and the visual device
US9258604B1 (en) 2014-11-24 2016-02-09 Facebook, Inc. Commercial detection based on audio fingerprinting
US9886430B2 (en) 2014-11-25 2018-02-06 Microsoft Technology Licensing, Llc Entity based content selection
US10614799B2 (en) 2014-11-26 2020-04-07 Voicebox Technologies Corporation System and method of providing intent predictions for an utterance prior to a system detection of an end of the utterance
US10192549B2 (en) 2014-11-28 2019-01-29 Microsoft Technology Licensing, Llc Extending digital personal assistant action providers
US9812126B2 (en) 2014-11-28 2017-11-07 Microsoft Technology Licensing, Llc Device arbitration for listening devices
US9495345B2 (en) 2014-12-09 2016-11-15 Idibon, Inc. Methods and systems for modeling complex taxonomies with natural language understanding
US9711141B2 (en) 2014-12-09 2017-07-18 Apple Inc. Disambiguating heteronyms in speech synthesis
US9466297B2 (en) 2014-12-09 2016-10-11 Microsoft Technology Licensing, Llc Communication system
US20160170966A1 (en) 2014-12-10 2016-06-16 Brian Kolo Methods and systems for automated language identification
WO2016094807A1 (en) 2014-12-11 2016-06-16 Vishal Sharma Virtual assistant system to enable actionable messaging
US9904673B2 (en) 2014-12-17 2018-02-27 International Business Machines Corporation Conversation advisor
US9911415B2 (en) 2014-12-19 2018-03-06 Lenovo (Singapore) Pte. Ltd. Executing a voice command during voice input
JP6504808B2 (ja) 2014-12-22 2019-04-24 キヤノン株式会社 Imaging apparatus, method for setting a voice command function, computer program, and storage medium
US9811312B2 (en) 2014-12-22 2017-11-07 Intel Corporation Connected device voice command support
US9837081B2 (en) 2014-12-30 2017-12-05 Microsoft Technology Licensing, Llc Discovering capabilities of third-party voice-enabled resources
US9367541B1 (en) 2015-01-20 2016-06-14 Xerox Corporation Terminological adaptation of statistical machine translation system through automatic generation of phrasal contexts for bilingual terms
US9424412B1 (en) 2015-02-02 2016-08-23 Bank Of America Corporation Authenticating customers using biometrics
US20160225372A1 (en) 2015-02-03 2016-08-04 Samsung Electronics Company, Ltd. Smart home connected device contextual learning using audio commands
US9613022B2 (en) 2015-02-04 2017-04-04 Lenovo (Singapore) Pte. Ltd. Context based customization of word assistance functions
EP3259688A4 (en) 2015-02-19 2018-12-12 Digital Reasoning Systems, Inc. Systems and methods for neural language modeling
US9928232B2 (en) 2015-02-27 2018-03-27 Microsoft Technology Licensing, Llc Topically aware word suggestions
US10152299B2 (en) 2015-03-06 2018-12-11 Apple Inc. Reducing response latency of intelligent automated assistants
US9865280B2 (en) 2015-03-06 2018-01-09 Apple Inc. Structured dictation using intelligent automated assistants
US9886953B2 (en) 2015-03-08 2018-02-06 Apple Inc. Virtual assistant activation
US10567477B2 (en) 2015-03-08 2020-02-18 Apple Inc. Virtual assistant continuity
US9721566B2 (en) 2015-03-08 2017-08-01 Apple Inc. Competing devices responding to voice triggers
US20160266871A1 (en) 2015-03-11 2016-09-15 Adapx, Inc. Speech recognizer for multimodal systems and signing in/out with and/or for a digital pen
US9805713B2 (en) 2015-03-13 2017-10-31 Google Inc. Addressing missing features in models
US9899019B2 (en) 2015-03-18 2018-02-20 Apple Inc. Systems and methods for structured stem and suffix language models
US20160286045A1 (en) 2015-03-23 2016-09-29 Vonage Network Llc System and method for providing an informative message when rejecting an incoming call
US9703394B2 (en) 2015-03-24 2017-07-11 Google Inc. Unlearning techniques for adaptive language models in text entry
US9672725B2 (en) 2015-03-25 2017-06-06 Microsoft Technology Licensing, Llc Proximity-based reminders
US9484021B1 (en) 2015-03-30 2016-11-01 Amazon Technologies, Inc. Disambiguation in speech recognition
US10049099B2 (en) 2015-04-10 2018-08-14 Facebook, Inc. Spell correction with hidden markov models on online social networks
US9678664B2 (en) 2015-04-10 2017-06-13 Google Inc. Neural network for keyboard input decoding
US10095683B2 (en) 2015-04-10 2018-10-09 Facebook, Inc. Contextual speller models on online social networks
US9842105B2 (en) 2015-04-16 2017-12-12 Apple Inc. Parsimonious continuous-space phrase representations for natural language processing
GB2537903B (en) 2015-04-30 2019-09-04 Toshiba Res Europe Limited Device and method for a spoken dialogue system
US9953063B2 (en) 2015-05-02 2018-04-24 Lithium Technologies, Llc System and method of providing a content discovery platform for optimizing social network engagements
US9892363B2 (en) 2015-05-07 2018-02-13 Truemotion, Inc. Methods and systems for sensor-based driving data collection
US9953648B2 (en) 2015-05-11 2018-04-24 Samsung Electronics Co., Ltd. Electronic device and method for controlling the same
US20160337299A1 (en) 2015-05-13 2016-11-17 Google Inc. Prioritized notification display
US9906482B2 (en) 2015-05-13 2018-02-27 The Travelers Indemnity Company Predictive electronic message management systems and controllers
US10061848B2 (en) 2015-05-22 2018-08-28 Microsoft Technology Licensing, Llc Ontology-crowd-relevance deep response generation
US10083688B2 (en) 2015-05-27 2018-09-25 Apple Inc. Device voice control for selecting a displayed affordance
US10127220B2 (en) 2015-06-04 2018-11-13 Apple Inc. Language identification from short strings
US10101822B2 (en) 2015-06-05 2018-10-16 Apple Inc. Language input correction
US10505884B2 (en) 2015-06-05 2019-12-10 Microsoft Technology Licensing, Llc Entity classification and/or relationship identification
US9578173B2 (en) 2015-06-05 2017-02-21 Apple Inc. Virtual assistant aided communication with 3rd party service in a communication session
US9865265B2 (en) 2015-06-06 2018-01-09 Apple Inc. Multi-microphone speech recognition systems and related techniques
US11025565B2 (en) 2015-06-07 2021-06-01 Apple Inc. Personalized prediction of responses for instant messaging
US20160357861A1 (en) 2015-06-07 2016-12-08 Apple Inc. Natural language event detection
US10186254B2 (en) 2015-06-07 2019-01-22 Apple Inc. Context-based endpoint detection
US10255907B2 (en) 2015-06-07 2019-04-09 Apple Inc. Automatic accent detection using acoustic models
US20160371250A1 (en) 2015-06-16 2016-12-22 Microsoft Technology Licensing, Llc Text suggestion using a predictive grammar model
US10325590B2 (en) 2015-06-26 2019-06-18 Intel Corporation Language model modification for local speech recognition systems using remote sources
US20160379641A1 (en) 2015-06-29 2016-12-29 Microsoft Technology Licensing, Llc Auto-Generation of Notes and Tasks From Passive Recording
US20160378747A1 (en) 2015-06-29 2016-12-29 Apple Inc. Virtual assistant for media playback
US9536527B1 (en) 2015-06-30 2017-01-03 Amazon Technologies, Inc. Reporting operational metrics in speech-based systems
KR102371188B1 (ko) 2015-06-30 2022-03-04 삼성전자주식회사 Speech recognition apparatus and method, and electronic device
US10426037B2 (en) 2015-07-15 2019-09-24 International Business Machines Corporation Circuitized structure with 3-dimensional configuration
US10311384B2 (en) 2015-07-29 2019-06-04 Microsoft Technology Licensing, Llc Automatic creation and maintenance of a taskline
US9691361B2 (en) 2015-08-03 2017-06-27 International Business Machines Corporation Adjusting presentation of content on a display
US10362978B2 (en) 2015-08-28 2019-07-30 Comcast Cable Communications, Llc Computational model for mood
US10747498B2 (en) 2015-09-08 2020-08-18 Apple Inc. Zero latency digital assistant
US10671428B2 (en) 2015-09-08 2020-06-02 Apple Inc. Distributed personal assistant
US10331312B2 (en) 2015-09-08 2019-06-25 Apple Inc. Intelligent automated assistant in a media environment
US10740384B2 (en) 2015-09-08 2020-08-11 Apple Inc. Intelligent automated assistant for media search and playback
US10026399B2 (en) 2015-09-11 2018-07-17 Amazon Technologies, Inc. Arbitration between voice-enabled devices
US9875081B2 (en) 2015-09-21 2018-01-23 Amazon Technologies, Inc. Device selection for providing a response
US11010550B2 (en) 2015-09-29 2021-05-18 Apple Inc. Unified language modeling framework for word prediction, auto-completion and auto-correction
US20170092278A1 (en) 2015-09-30 2017-03-30 Apple Inc. Speaker recognition
US11587559B2 (en) 2015-09-30 2023-02-21 Apple Inc. Intelligent device identification
US10891106B2 (en) 2015-10-13 2021-01-12 Google Llc Automatic batch voice commands
US9691378B1 (en) 2015-11-05 2017-06-27 Amazon Technologies, Inc. Methods and devices for selectively ignoring captured audio data
US10956666B2 (en) 2015-11-09 2021-03-23 Apple Inc. Unconventional virtual assistant interactions
KR102432620B1 (ko) 2015-11-12 2022-08-16 삼성전자주식회사 Electronic device for performing an operation according to the proximity of an external object, and method thereof
US10796693B2 (en) 2015-12-09 2020-10-06 Lenovo (Singapore) Pte. Ltd. Modifying input based on determined characteristics
US20170193083A1 (en) 2016-01-06 2017-07-06 International Business Machines Corporation Identifying message content related to an event utilizing natural language processing and performing an action pertaining to the event
US9747289B2 (en) 2016-01-13 2017-08-29 Disney Enterprises, Inc. System and method for proximity-based personalized content recommendations
US10743101B2 (en) 2016-02-22 2020-08-11 Sonos, Inc. Content mixing
US9922648B2 (en) 2016-03-01 2018-03-20 Google Llc Developer voice actions system
DK201670539A1 (en) 2016-03-14 2017-10-02 Apple Inc Dictation that allows editing
US10431205B2 (en) 2016-04-27 2019-10-01 Conduent Business Services, Llc Dialog device with dialog support generated using a mixture of language models combined using a recurrent neural network
KR20170128820A (ko) 2016-05-16 2017-11-24 엘지전자 주식회사 Mobile terminal and control method thereof
US9934775B2 (en) 2016-05-26 2018-04-03 Apple Inc. Unit-selection text-to-speech synthesis based on predicted concatenation parameters
US9972304B2 (en) 2016-06-03 2018-05-15 Apple Inc. Privacy preserving distributed evaluation framework for embedded personalized systems
US10249300B2 (en) 2016-06-06 2019-04-02 Apple Inc. Intelligent list reading
DK179309B1 (en) 2016-06-09 2018-04-23 Apple Inc Intelligent automated assistant in a home environment
US10192552B2 (en) 2016-06-10 2019-01-29 Apple Inc. Digital assistant providing whispered speech
US10586535B2 (en) 2016-06-10 2020-03-10 Apple Inc. Intelligent digital assistant in a multi-tasking environment
US10509862B2 (en) 2016-06-10 2019-12-17 Apple Inc. Dynamic phrase expansion of language input
US10067938B2 (en) 2016-06-10 2018-09-04 Apple Inc. Multilingual word prediction
US10490187B2 (en) 2016-06-10 2019-11-26 Apple Inc. Digital assistant providing automated status report
US10592601B2 (en) 2016-06-10 2020-03-17 Apple Inc. Multilingual word prediction
DK179049B1 (en) 2016-06-11 2017-09-18 Apple Inc Data driven natural language event detection and classification
DK179343B1 (en) 2016-06-11 2018-05-14 Apple Inc Intelligent task discovery
DK179415B1 (en) 2016-06-11 2018-06-14 Apple Inc Intelligent device arbitration and control
US10043516B2 (en) 2016-09-23 2018-08-07 Apple Inc. Intelligent automated assistant

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7720674B2 (en) * 2004-06-29 2010-05-18 Sap Ag Systems and methods for processing natural language queries
US20140081633A1 (en) * 2012-09-19 2014-03-20 Apple Inc. Voice-Based Media Searching
US20140365216A1 (en) * 2013-06-07 2014-12-11 Apple Inc. System and method for user-specified pronunciation of words for speech synthesis and recognition
US20140365226A1 (en) * 2013-06-07 2014-12-11 Apple Inc. System and method for detecting errors in interactions with a voice-based digital assistant
US20140365209A1 (en) * 2013-06-09 2014-12-11 Apple Inc. System and method for inferring user intent from speech inputs
CN106471570A (zh) * 2014-05-30 2017-03-01 苹果公司 Multi-command single utterance input method

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111161717A (zh) * 2019-12-26 2020-05-15 苏州思必驰信息科技有限公司 Skill scheduling method and system for a voice dialog platform
CN111161717B (zh) * 2019-12-26 2022-03-22 思必驰科技股份有限公司 Skill scheduling method and system for a voice dialog platform

Also Published As

Publication number Publication date
US20180336197A1 (en) 2018-11-22
US10403278B2 (en) 2019-09-03
EP3404653A1 (en) 2018-11-21
CN108874766B (zh) 2022-07-01
EP3404653B1 (en) 2023-06-07

Similar Documents

Publication Publication Date Title
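CN109328381B (zh) Detecting a trigger of a digital assistant
CN108874766A (zh) Methods and systems for phonetic matching in digital assistant services
CN107978313B (zh) Intelligent automated assistant
CN107491285B (zh) Intelligent device arbitration and control
CN110019752B (zh) Multi-directional dialog
CN107195306B (zh) Identifying voice inputs providing credentials
CN107430501B (zh) Competing devices responding to voice triggers
CN107408387B (zh) Virtual assistant activation
CN106471570B (zh) Multi-command single utterance input method
CN110364148A (zh) Natural assistant interaction
CN110457000A (zh) Intelligent automated assistant for delivering content from user experiences
CN109635130A (zh) Intelligent automated assistant for media exploration
CN109313898A (zh) Digital assistant providing whispered speech
CN107491284A (zh) Digital assistant providing automated status report
CN107949823A (zh) Zero latency digital assistant
CN107735833A (zh) Automatic accent detection
CN110223698A (zh) Training speaker recognition models for digital assistants
CN107257950A (zh) Virtual assistant continuity
CN107491469A (zh) Intelligent task discovery
CN108292203A (zh) Proactive assistance based on dialog communication between devices
CN108733438A (zh) Integration of applications with a digital assistant
CN107493374A (zh) Application integration with a digital assistant
CN107608998A (zh) Application integration with a digital assistant
CN107615378A (zh) Device voice control
CN109814832A (zh) Intelligent digital assistant in a multi-tasking environment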

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant