CN102763159A - 话音输入的处理 - Google Patents
话音输入的处理 Download PDFInfo
- Publication number
- CN102763159A CN102763159A CN201180009581XA CN201180009581A CN102763159A CN 102763159 A CN102763159 A CN 102763159A CN 201180009581X A CN201180009581X A CN 201180009581XA CN 201180009581 A CN201180009581 A CN 201180009581A CN 102763159 A CN102763159 A CN 102763159A
- Authority
- CN
- China
- Prior art keywords
- prompting
- time stamp
- time
- input
- electronic installation
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000012545 processing Methods 0.000 title claims abstract description 33
- 238000000034 method Methods 0.000 claims abstract description 64
- 238000009434 installation Methods 0.000 claims description 146
- 230000008859 change Effects 0.000 claims description 16
- 230000004044 response Effects 0.000 claims description 10
- 238000004590 computer program Methods 0.000 claims 2
- 230000008569 process Effects 0.000 abstract description 35
- 238000012544 monitoring process Methods 0.000 abstract description 2
- 238000010586 diagram Methods 0.000 description 18
- 230000008676 import Effects 0.000 description 10
- 230000005055 memory storage Effects 0.000 description 9
- 230000007246 mechanism Effects 0.000 description 4
- 230000003287 optical effect Effects 0.000 description 3
- 230000008878 coupling Effects 0.000 description 2
- 238000010168 coupling process Methods 0.000 description 2
- 238000005859 coupling reaction Methods 0.000 description 2
- 238000013500 data storage Methods 0.000 description 2
- 238000003860 storage Methods 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- 238000012546 transfer Methods 0.000 description 2
- 230000000007 visual effect Effects 0.000 description 2
- 208000004350 Strabismus Diseases 0.000 description 1
- 206010047571 Visual impairment Diseases 0.000 description 1
- 230000001133 acceleration Effects 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 238000013475 authorization Methods 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 230000037213 diet Effects 0.000 description 1
- 235000005911 diet Nutrition 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 239000003607 modifier Substances 0.000 description 1
- 230000008786 sensory perception of smell Effects 0.000 description 1
- 230000014860 sensory perception of taste Effects 0.000 description 1
- 238000005496 tempering Methods 0.000 description 1
- 208000029257 vision disease Diseases 0.000 description 1
- 230000004393 visual impairment Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
- G06F3/167—Audio in a user interface, e.g. using voice commands for navigating, audio feedback
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/226—Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics
- G10L2015/228—Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics of application context
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Computational Linguistics (AREA)
- Theoretical Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- User Interface Of Digital Computer (AREA)
Abstract
本发明是针对在提供提示时处理由电子装置接收的话音输入。明确地说,本发明是针对在监视话音输入的同时将提示序列提供给用户(例如,旁白提示)。当接收到所述话音输入时,可为所述话音输入识别特性时戳,且可将所述特性时戳和与所述已提供提示中的每一者相关联的周期或窗口进行比较。所述电子装置接着可确定对应于包含所述特性时戳的窗口的所述提示曾是所述用户曾希望将所述话音输入应用到的所述提示。所述装置可处理所述话音输入以提取用户指令,且将所述指令应用到所述已识别提示(例如,且执行与所述提示相关联的操作)。
Description
技术领域
本发明是针对响应于连续提供的提示而处理由电子装置接收的话音输入。明确地说,本发明是针对识别特定接收的话音输入与之相关联的提示。
背景技术
许多电子装置提供用户可接入的大量特征或操作。可用特征或操作的数目常常可超过使用电子装置的输入接口可得到的输入的数目。为了允许用户接入未特定联系到特定输入(例如,不与按键序列或按钮按下相关联的输入,例如可从苹果(Apple)公司购得的iPod上的MENU按钮)的电子装置操作,电子装置可提供具有可选择选项的菜单,其中所述选项与电子装置操作相关联。举例来说,电子装置可(例如)响应于从输入接口(例如,MENU按钮)中接收到与具有可选择选项的菜单相关联的输入而在显示器上显示所述菜单。
因为菜单通常显示在电子装置显示器上,所以可能要求用户看着所述显示器以选择特定选项。这有时可能是不合意的。举例来说,如果用户希望节约电力(例如,在便携式电子装置中),那么要求所述电子装置显示菜单并移动由用户导览的突出显示区以提供选择可能需要可通过不驱动显示器而节省的电力。作为另一实例,如果用户处于暗环境中,且显示器不包含背光,那么用户可能不能够辨别菜单的已显示选项。作为又一实例,如果用户是盲人或视力受损,那么用户可能不能够查看已显示菜单。
为了克服此问题,一些系统可允许用户通过话音来提供指令。明确地说,电子装置可包含用于检测用户所说的词语的音频输入电路。所述装置的处理电路接着可处理所述词语以向所述电子装置识别对应指令,且执行所述对应指令。为了处理已接收话音输入,电子装置可确定话音输入的内容以及对应于所述内容的指令。
然而,在一些情况下,可响应于由装置所提供的提示而接收话音输入。举例来说,可在电子装置提供描述特定可选择选项的话音提示或旁白时提供话音输入。作为另一实例,可在电子装置依序显示一系列可选择选项时提供话音输入。由于接收整个话音输入、处理话音输入以及确定话音输入的内容所需要的时间,可在第一提示已结束之后且在提供第二提示时处理和理解用户响应于第一提示而提供的特定话音输入。所述装置因而可难以确定哪一提示与已接收话音输入相关联。
发明内容
本发明是针对用于在提供对应于可选择选项的提示序列时处理已接收话音输入的系统和方法。明确地说,本发明是针对识别特定提示以与已接收话音输入相关联。
电子装置可将提示序列提供给用户,其中每一提示与用户可选择的电子装置操作相关联。可使用任何合适方法来提供提示,包含(例如)作为已显示提示、音频提示或触觉提示。当提供用户感兴趣的提示时,用户可将输入提供给装置以指导所述装置执行与所述提示相关联的操作。
用户可使用任何合适方法将输入提供给装置。在一些实施例中,用户可提供话音输入。因为话音输入可要求整体上接收特定持续时间且接着处理特定持续时间以确定其内容,所以可在接收和处理话音输入所花费的时间提供若干提示。接着可要求电子装置确定已提供提示中的哪一者与话音输入有关。在一些实施例中,电子装置可界定与提示中的每一者相关联的输入窗口或持续时间,使得输入窗口或持续时间可指定期间已接收话音输入与对应提示有关的特定时间帧。输入窗口或持续时间可具有任何合适长度(例如,提供提示的时间量),且可从提供提示的时间偏移(例如,输入窗口在提示后偏移了2秒)。在一些情况下,不同提示可(例如)基于每一提示的选择的相对重要性或可能性或基于特定提示的长度(例如,装置提供提示的持续时间)而具有可变的输入窗口大小或长度。
为了使话音输入与提示有关,电子装置可使特性时间与已接收话音输入相关联。接着可将特性时间和提示的输入窗口进行比较,以确定哪一输入窗口包含特性时间。电子装置接着可确定或推断对应于包含特性时间的输入窗口的提示曾是用户感兴趣的提示。特性时间可包含期间曾接收到话音输入的任何合适时间或时间范围。举例来说,特性时间可包含曾接收到话音输入的初始时间、从初始时间偏移的时间,或任何其它合适时间。
在一些实施例中,从连续提供的提示起的输入窗口或持续时间可重叠(例如,如果所述提示中的一者较重要且具有扩大的输入窗口)。如果与话音输入相关联的特性时间包含在重叠的输入窗口或持续时间中,那么电子装置可识别一个或一个以上额外特性时间以与话音输入相关联。电子装置接着可选择包含原始特性时间以及一个或一个以上额外特性时间两者的特定输入窗口和对应提示。
电子装置可处理已接收话音输入以提取与所述话音输入相关联的指令。电子装置接着可将已提取指令应用到与对应于已接收话音输入的提示相关联的一个或一个以上装置操作。在一些实施例中,可由从已提取指令确定的变量或值(例如,用以充当界定新媒体播放列表的种子的媒体项目)来表征或修改装置操作。在一些实施例中,已处理话音输入可改为或另外用以识别所述话音输入与之相关联的特定提示(例如,指导装置执行与已提供提示相关联的特定操作的话音输入)。
附图说明
在考虑以下结合附图进行的详细描述后,本发明的上述和其它特征、本发明的性质和各种优点随即将更明显,在所述附图中:
图1是根据本发明的一个实施例的电子装置的示意图;
图2是根据本发明的一个实施例的用于处理随着按顺序提供提示而接收的话音输入的说明性系统的示意图;
图3是根据本发明的一个实施例的用于提供提示并接收话音输入的说明性时间线的示意图;
图4是根据本发明的一个实施例的具有关联周期的提示的示意图;
图5是根据本发明的一个实施例的待提供的说明性提示序列的示意图;
图6是根据本发明的一个实施例的说明性提示序列和待处理的话音输入的示意图;
图7是根据本发明的一个实施例的用于处理对应于提示的话音输入的说明性过程的流程图;
图8是根据本发明的一个实施例的用于处理对应于提示的话音输入的说明性过程的流程图;以及
图9是根据本发明的一个实施例的用于为提示界定输入窗口的说明性过程的流程图。
具体实施方式
电子装置可操作以接收由用户提供的话音输入来控制电子装置操作。在一些情况下,所提供的话音输入可对应于来自电子装置的提示,包含(例如)依序提供的一系列提示中的一者。
电子装置可使用任何合适方法提示用户与所述装置交互。在一些实施例中,电子装置可提供一个或一个以上提示,所述提示各自与一装置操作或指令相关联,用户可选择所述提示以指导所述装置执行操作。举例来说,电子装置可提供用于控制媒体重放的旁白提示。作为另一实例,电子装置可提供列出电子装置可启动的应用程序的已显示提示。每一提示可被提供达特定持续时间,且随后由队列中的下一提示替换。
响应于检测到针对用户所需要的操作的提示,用户可提供指导装置执行与当前提示相关联的操作或指令的话音输入。归因于话音输入的长度以及处理话音输入所需要的时间,电子装置可在提供序列中的后续提示的同时终结处理输入。为了防止装置不正确地确定在话音输入处理结束时提供的提示与话音输入相关联,电子装置可界定与每一提示相关联的一个或一个以上时戳或时间范围。当话音输入被起初提供或由特定时戳完成或在特定时间范围(例如,如由话音输入的特性时间设置)内时,电子装置可使话音输入与对应提示相关联。明确地说,可将时戳或时间范围界定成使得在提示结束之后处理的话音输入仍可与前一提示相关联。
每一提示可与时戳或时间范围的任何合适组合相关联。举例来说,提示可与延长超出期间提供所述提示的时间的时间范围相关联。在一些情况下,与特定提示相关联的时戳和时间范围可基于用户选择提示的历史记录、提示的类型或用户的话音输入或所述提示的任何其它特性而动态地改变。
图1是根据本发明的一个实施例的电子装置的示意图。电子装置100可包含处理器102、存储装置104、存储器106、输入接口108以及输出接口110。在一些实施例中,可组合或省略电子装置组件100中的一者或一者以上(例如,组合存储装置104与存储器106)。在一些实施例中,电子装置100可包含未组合或包含于图1所示的组件中的其它组件(例如,通信电路、定位电路、检测装置环境的感测电路、电源或总线),或图1所示的组件的若干例子。为了简单起见,图1中仅展示所述组件中的每一组件中的一者。
处理器102可包含操作以控制电子装置100的操作和性能的任何处理电路或控制电路。举例来说,处理器102可用以运行操作系统应用程序、固件应用程序、媒体重放应用程序、媒体编辑应用程序,或任何其它应用程序。在一些实施例中,处理器可驱动显示器,且处理从用户接口中接收的输入。
存储装置104可包含(例如)一个或一个以上存储媒体,所述存储媒体包含硬盘驱动器、固态驱动器、快闪存储器、永久存储器(例如ROM)、任何其它合适类型的存储组件,或其任何组合。存储装置104可存储(例如)媒体数据(例如,音乐和视频文件)、应用程序数据(例如,用于实施装置100上的功能)、固件、用户偏好信息(例如,媒体重放偏好)、验证信息(例如,与经授权用户相关联的数据库)、生活方式信息(例如,饮食偏好)、锻炼信息(例如,由锻炼监视设备获得的信息)、交易信息(例如,诸如信用卡信息等信息)、无线连接信息(例如,可使电子装置100能够建立无线连接的信息)、预订信息(例如,跟踪播客或电视放映或用户预订的其它媒体的信息)、联系人信息(例如,电话号码和电子邮件地址)、日历信息,以及任何其它合适数据或其任何组合。
存储器106可包含高速缓存存储器、半永久存储器(例如RAM),和/或用于临时存储数据的一种或一种以上不同类型的存储器。在一些实施例中,存储器106也可用于存储用以操作电子装置应用程序的数据,或可存储在存储装置104中的任何其它类型的数据。在一些实施例中,存储器106和存储装置104可被组合为单个存储媒体。
输入接口108可将输入提供到电子装置的输入/输出电路。输入接口108可包含任何合适输入接口,例如按钮、小键盘、拨号盘、点按式选盘或触摸屏。在一些实施例中,电子装置100可包含电容性感测机构,或多触摸电容性感测机构。在一些实施例中,输入接口可包含用于接收用户的话音输入的麦克风或其它音频输入接口。输入接口可包含用于将对应于话音输入的已接收模拟信号转换为可经处理和分析以识别特定词语或指令的数字信号的模/数转换器。
输出接口110可包含用于提供音频输出、视觉输出或其它类型的输出(例如,嗅觉、味觉或触觉输出)的一个或一个以上接口。举例来说,输出接口110可包含构建到电子装置100中的一个或一个以上扬声器(例如,单声道或立体声扬声器),或操作以耦合到音频输出机构的音频连接器(例如,音频插孔或适当的蓝牙连接)。输出接口110可操作以使用有线或无线连接将音频数据提供给耳机、头戴式耳机或耳塞。作为另一实例,输出接口110可包含用于提供用户可见的显示的显示电路(例如,屏幕或投影系统)。显示器可包含并入在电子装置100中的屏幕(例如,LCD屏幕)、用于在远离电子装置100的表面上提供内容显示的可移动显示器或投影系统(例如视频投影仪),或任何其它合适显示器。输出接口110可与输入/输出电路(未图示)介接以将输出提供给装置的用户。
在一些实施例中,电子装置100可包含操作以提供数据传送路径的总线,所述数据传送路径用于向控制处理器102、存储装置104、存储器106、输入接口108、输出接口110以及包含于电子装置中的任何其它组件、从控制处理器102、存储装置104、存储器106、输入接口108、输出接口110以及包含于电子装置中的任何其它组件中或在控制处理器102、存储装置104、存储器106、输入接口108、输出接口110以及包含于电子装置中的任何其它组件之间传送数据。
用户可使用任何合适方法来与电子装置交互。在一些实施例中,用户可使用触摸输入接口(例如键盘、按钮、鼠标或触敏表面)的一个或一个以上手指来提供输入。在一些实施例中,用户可改为或另外通过以特定方式摇晃或移动电子装置(例如,使得输入接口的运动感测组件检测用户移动)来提供输入。在一些实施例中,用户可改为或另外将话音输入提供给电子装置。举例来说,用户可向嵌入在电子装置中或连接到电子装置的麦克风讲话。
用户可在任何合适时间将话音输入提供给电子装置。在一些实施例中,电子装置可连续地监视话音输入(例如,当所述装置不处于休眠模式时,或在所有时间)。在一些实施例中,电子装置可响应于进入话音输入的用户输入或指令而监视话音输入。举例来说,用户可选择按钮或选项,或以使得传感器检测到用户希望被提供话音输入(例如,近程传感器检测到用户已将电子装置放到用户的嘴边)的方式放置所述装置。在一些实施例中,电子装置可在一个或一个以上特定应用程序或进程正在所述装置上运行时监视用户输入。举例来说,电子装置可在媒体重放应用程序、话音控制应用程序、搜索应用程序或任何其它合适应用程序中监视话音输入。
在一个实施方案中,电子装置可将可选择提示提供给用户,且可响应于所述提示而监视话音输入或其它类型的输入。电子装置可提供任何合适类型的提示,包含(例如)视觉提示(例如,提供于显示器上)、音频提示(例如,由音频输出接口输出)、触觉提示(例如,使用所述装置内的振动机构)或任何其它合适类型的提示中的一者或一者以上。举例来说,不包含视觉或显示输出接口的电子装置(例如,可从苹果公司购得的iPod Shuffle)可提供音频菜单,音频菜单包含各自与一装置操作相关联的一连串提示。在一个实施方案中,音频菜单可包含用于创建新播放列表、选择现有播放列表、根据艺术家、专辑或标题来选择媒体项目的音频提示,或与控制不具有显示器的装置上的媒体重放有关的任何其它指令或操作。由用户提供的提示可以特定速率自动循环,使得每一提示被提供达特定持续时间(例如,对应于提示的内容的话音输出所需要的持续时间)。
用户可使用任何合适方法来提供选择提示中的一者的输入。在一些实施例中,用户可使用装置的输入接口(例如按钮或触敏表面)来提供输入。用户可通过与输入接口交互(例如,执行示意动作或按下按钮)来提供输入。当输入较短以使得电子装置可在提示的持续时间内接收和处理输入时,用户可较容易地选择提示,并接收指示恰当提示曾被选择的反馈。
由用户提供的一些输入可要求接收和处理较长的时间量。举例来说,接收和处理话音输入所需要的持续时间可长于接收和处理按钮按下或加速计输出所需要的持续时间。明确地说,持续时间可长得致使电子装置可在起初接收到话音输入时提供第一提示,且在最终处理话音输入时提供第二提示。电子装置接着可需要确定第一提示和第二提示中的哪一者与已接收话音输入相关联。
图2是根据本发明的一个实施例的用于处理随着按顺序提供提示而接收的话音输入的说明性系统的示意图。系统200可包含处理模块202,处理模块202经由路径230和232而连接到提示210和话音输入220。处理模块202可包含在电子装置(例如,电子装置100,图1)中作为硬件、固件和软件的任何合适组合。举例来说,处理模块202可被提供为指导控制电路或处理器的操作的代码。处理模块202可依序将一系列提示210提供给装置的用户(例如,使用输出接口)。举例来说,响应于进入菜单的用户请求,处理模块202可识别与涉及所述菜单的指令或操作有关或对应的一组提示,且可指导输出接口提供所述提示。可以任何合适形式提供提示,包含(例如)作为视觉提示(例如,所显示的可选择选项)、音频提示(例如,旁白选项)、触觉提示(例如,对应于消息的振动),或任何其它形式。
处理模块202可识别待提供的任何合适数目个提示,包含(例如)根据电子装置可用的内容而确定的数目。举例来说,处理模块202可针对存储在装置上的每一播放列表或针对存储在装置上的媒体项目的每一艺术家提供提示。可使用任何合适方法来提供提示。举例来说,可依序提供个别提示,使得在特定时间仅提供单个提示。或者,处理模块202可同时提供若干提示。在一些实施例中,处理模块202可提供提示210,使得在不同时刻提供一个或一个以上不同提示。明确地说,处理模块202可重覆循环不同组提示210(例如,重覆循环个别提供的提示,或重覆循环所提供的多组提示),使得用户可在不同时间选择不同提示。
当用户检测到感兴趣提示被提供时,用户可将话音输入220提供给处理模块202。话音输入220可具有任何合适内容,包含(例如)指示感兴趣提示的选择的内容。处理模块202可接收话音输入220,且处理话音输入以识别所述输入的特定词语或短语。处理模块202可使用任何合适方法来处理话音输入,包含(例如)通过将已接收话音输入220与已知词语库进行比较,以及确定已识别库词语或短语的组合的含义。通过处理话音输入220,处理模块202可识别用户感兴趣的对应提示210,且执行对应于所述提示的操作或提供对应于所述提示的指令。
如上文所论述,因为可花费时间来检测、接收(例如,记录以供处理)和处理话音输入,所以处理模块可在用户感兴趣的提示已被另一提示替换之后终结处理话音输入。图3是根据本发明的一个实施例的用于提供提示并接收话音输入的说明性时间线的示意图。时间线300可包含描绘时间推移的时间轴302。在适当时间,电子装置(例如,处理模块)可依序提供提示310、312、314和316。提示310、312、314和316可包含任何合适类型的提示,包含(例如)个别音频提示、已显示提示的集合,或任何其它提示。描绘提示310、312、314和316的方框中的每一者的长度可提供期间提供提示的持续时间的指示(例如,用于重放对应于音频提示的音频剪辑的时间)。当用户听到感兴趣提示时,用户可将话音输入320提供给装置。表示话音输入320的方框可指示用于检测和接收话音输入的持续时间(例如,部分322),以及用于处理话音输入并确定所述输入的内容的持续时间(例如,部分324)。从时间线300的实例可看出,话音输入320可同提示312、314和316重叠。此外,话音输入320的部分322仅同提示312和314重叠,且话音输入320的部分324仅同提示314和316重叠。另外,话音输入322在提示310结束之后不久开始。因此,话音输入320可合理地应用到提示310、312、314和316中的任一者。因此,处理模块可需要用于确保话音输入与对应提示恰当地相关联的系统或程序。
为了确保话音输入与适当提供的提示相关联,可使每一提示与界定周期或输入窗口的时序信息相关联。如果在所述周期期间接收到话音输入,那么话音输入将对应于提示。可使用任何合适方法使一周期与每一提示相关联。图4是根据本发明的一个实施例的具有关联周期的提示的示意图。提示400可具有任何合适持续时间,包含(例如)由时间线410上的时戳412和414界定的持续时间。可基于提示的类型或基于由提示提供的信息而选择持续时间。举例来说,时戳412与414之间的持续时间对用户来说可至少长得足以阅读和理解书面或图形提示。作为另一实例,可将时戳412与414之间的持续时间选择成使得所述持续时间对于话音输入至少足够长以供完全听到特定指令(例如,至少长得足以重放对应于话音输出提示的整个音频剪辑)。在一些实施例中,可将时戳412与414之间的持续时间选择为长于使用户理解提示所需要的最小值,以向用户提供较长的输入窗口或周期来提供选择输入(例如,选择话音输入)。
提示400可与期间将假定已检测话音输入与提示400有关的输入窗口或周期420相关联。周期420可同时戳412与414之间的持续时间的某一部分或全部重叠。举例来说,周期420可与提示400的持续时间匹配。在一些实施例中,周期420可延长超出提示400的开始和结束中的一者或两者。因为可连续提供若干提示,所以可将周期420界定成使得其不同与另一提示相关联的周期重叠,或同所述周期最低程度地重叠。在提示400的实例中,周期420可由时戳422且由时戳424界定,时戳422是在时戳412与414之间(例如,在提供提示400时的周期期间),时戳424是在时戳414之后(例如,当不再提供提示400)。时戳412与422之间的持续时间可和时戳414与424之间的持续时间实质上相同,使得当在提示400后接有后续提示时,与所述后续提示相关联的周期或输入窗口将仅在时戳424时开始,而不在时戳414时开始(例如,限制与提示400和后继提示相关联的输入窗口之间的重叠)。
可使用任何合适方法来界定每一提示400的输入窗口或周期420的长度和位置。在一些实施例中,可基于提示的开始和结束而界定持续时间。举例来说,每一周期可在从提示的开始起的特定持续时间(例如,在开始之后5秒,或在提示的2%已被提供之后)开始,且在从提示的结束起的特定持续时间(例如,在提示的结束时、在当前或下一提示的持续时间的2%之后,或在5秒之后)结束。可使用初始时戳和最终时戳而为处理模块界定周期,初始时戳和最终时戳两者均可与提示相关联。
在一些实施例中,输入窗口或周期420的长度和位置可基于输入窗口或周期420与之相关联的特定提示而变化。明确地说,可将一些提示确定为较重要或较可能由用户选择。与那些提示相关联的周期因而可长于与较不重要的提示或较不可能被选择的提示相关联的周期。举例来说,与较可能被选择的提示相关联的周期可在或较靠近提示的开始时开始、可进一步延长超出提示的结束,或此两者。
电子装置可使用任何合适方法来确定提示选择的重要性或可能性。在一些实施例中,电子装置可提示用户提供最感兴趣的操作类型的指示,或用户很可能选择的特定提示。或者或另外,电子装置可从与所述装置的过去用户交互确定用户通常选择的特定提示,或用户提供给所述装置的提示或指令类型(例如,创建用户在不同情形下选择的提示的历史简档)。在一些实施例中,电子装置可识别使用所述装置的若干用户中的每一者,且确定所述若干用户中的每一者感兴趣的提示。
在一些实施例中,可基于提示的相对重要性或基于与每一提示相关联的周期的长度而确定提示的次序。因为当提示周期延长超过提示的结束时,提示周期固有地限制开始点,且因此限制与后续提示相关联的周期的持续时间。因此,可能需要将较不重要的提示放在由装置提供的较重要的提示之间。图5是根据本发明的一个实施例的待提供的说明性提示序列的示意图。序列500可包含沿着时间线501连续提供的提示502、504、506和508。在序列500中,提示504和508可比提示502和506重要。提示中的每一者可分别与对应周期512、514、516和518相关联。如图5所示,对应于较重要或相关的提示514和518的周期514和518可实质上长于对应于较不重要或相关的提示502和506的周期512和516。明确地说,周期512可实质上在提示502的结束时结束,而周期514可在提示504的结束以及进入提示506的显著部分之后结束。周期516可在提示506之后不久结束(例如,延长进入期间提供提示508的时间的较短量),而周期518可延长超出提示508的结束。在序列500的实例中,周期516和518可部分地重叠。通过将较不重要的提示506放在提示504与508之间,周期514和518两者可分别延长超出提示504和508的持续时间,且减小周期516的持续时间。如果提示506在提示504与508之间尚不可用,那么周期514和518中的一者或两者可能已被要求较小以适应彼此,或可能已显著地重叠。
在一些实施例中,电子装置可改为或另外通过调整提供提示的时间长度来间接控制与所述提示相关联的周期的持续时间。举例来说,电子装置可将每一周期界定为与提示的持续时间匹配或对应(例如,所述周期与提示开始和结束时间匹配,或从开始和结束时间稍微偏移),且改变每一提示的持续时间以增加或减小期间已接收输入将对应于已提供提示的周期。然而,此方法可提供用户体验,其中一些提示可被急冲或加速,而其它提示被抽出。
一旦已确定与每一提示相关联的周期或输入窗口,电子装置(例如,处理模块)就可确定话音输入的哪一或哪些部分将用作识别话音输入所对应的对应提示的时戳。图6是根据本发明的一个实施例的说明性提示序列和待处理的话音输入的示意图。序列600可包含沿着时间线602依序提供的提示610、612、614和616。每一提示可分别与一对应周期或输入窗口620、622、624和626相关联,在所述周期或输入窗口期间,已检测话音输入与对应提示相关联。话音输入630可在序列600被提供时予以提供,且可包含对应于由电子装置检测和记录话音输入的已检测部分632,以及对应于对已检测话音输入进行处理以确定用户的输入的内容的处理部分634。
在一些情况下,可在提供若干相异提示时发生话音输入630。在图6的特定实例中,话音输入630在期间提供提示612的时戳640时开始,且在期间提供提示616的时戳646时结束。因此,话音输入630在期间曾提供提示614的整个周期期间持续。此外,因为对应于提示610的周期620延长到期间提供提示612的时间中,所以话音输入630曾在周期620、622、624和626期间被提供。电子装置可使用任何合适方法来确定使话音输入630与所述周期中的哪一者相关联。在一些实施例中,电子装置起初可确定话音输入是否同若干周期重叠。如果所述输入同若干周期重叠,那么电子装置可审查话音输入的内容,且尝试基于话音输入内容而确定将使话音输入与之相关联的特定提示。举例来说,电子装置可确定话音输入内容是否调出所述提示中的一者的指令或操作(例如,“播放播放列表3”,当术语“播放列表3”包含在所述提示的一者中或包含在与所述提示中的一者相关联的元数据中时)。作为另一实例,电子装置可处理话音输入以确定指令是否与任何提示有关(例如,指令代替地为不与提示有关的任意命令,例如“关机”)。
在一些情况下,电子装置可改为或另外从话音输入630中选择特定的特性时戳以与整个话音输入相关联。在一些情况下,电子装置可改为或另外界定时间范围或持续时间,以表征曾接收到话音输入630的时间。举例来说,电子装置可选择时戳640或时戳646(例如,话音输入的开始或结束)。或者,电子装置可选择时戳644,时戳644指示用户提供的话音输入的结束(例如,已检测部分632的结束)。作为又一实例,电子装置可从所述装置检测到用户提供的输入时的周期内选择时戳642。时戳642可对应于在用户提供的输入期间的任何合适时间,包含(例如)输入的中间(例如,时戳640与644之间的中途,或时戳640与646之间的中途)、从话音输入的开始或结束起的预定时间(例如,在用户开始讲话之后2秒,或进入已接收话音输入的10%)、当接收到关键词或短语时(例如,当曾接收到指令关键字(例如“播放”、“暂停”或“跳过”)时),或在话音输入630内的任何其它合适时间。
一旦电子装置已选择特定时戳以与话音输入相关联,电子装置就可确定包含所述时戳的提示周期或输入窗口,且接着确定对应于所述周期或输入窗口的提示。如果若干重叠的周期或输入窗口包含所述时戳,那么电子装置可选择第二或替代时戳以应用到话音输入。电子装置接着可选择对应于其中含有第二时戳的周期的提示。在一些情况下,电子装置可改为或另外比较同若干周期或输入窗口中的每一者或同对应提示重叠的话音输入630(或部分632和634)的量(例如,分别同提示610和612的周期620和622重叠的话音输入630的量)。
一旦已识别特定提示,就可从已处理话音输入的内容中提取指令,且可将所述指令应用到所述特定提示。举例来说,如果指令包含“选择”指令,那么可执行与特定提示相关联的操作或进程。作为另一实例,如果指令包含“选择下一个”或“回到上一个”指令,那么电子装置可执行涉及提供提示(例如,且提供上一提示)或涉及实施与不同于所识别的特定提示的提示相关联的操作或进程的操作或进程(例如,改为执行来自下一提示的操作)。作为又一实例,指令可提供用于执行与提示相关联的特定操作的一个或一个以上变量或值(例如,提供媒体项目以充当用于产生新播放列表的种子)。一旦已接收到指令且执行对应操作,电子装置就可退出其中提供提示的模式(例如,假如所述指令不与提供提示序列有关)。然而,在一些实施例中,电子装置可改为或另外在确定话音输入是否对应于已提供提示之前处理话音输入以识别指令。明确地说,电子装置起初可确定话音输入指令是否与提示中的一者有关(例如,话音输入为“选择这个”),且如果话音输入对应于一提示,那么电子装置起初可仅确定哪一提示与所述输入相关联。
图7是根据本发明的一个实施例的用于处理对应于提示的话音输入的说明性过程的流程图。过程700可在步骤702处开始。在步骤704处,电子装置可确定是否曾提供提示。举例来说,电子装置可确定是否已启用用于提供提示的模式(例如,用户是否已接入旁白菜单模式)。如果电子装置确定尚未提供提示,那么过程700可移动到步骤706并结束。
如果在步骤704处电子装置改为确定提示被提供,那么过程700可移动到步骤708。在步骤708处,电子装置可依序将提示提供给用户。举例来说,电子装置可重覆循环一组提示,其中并非所有提示均同时被提供。明确地说,电子装置可依序提供一系列旁白提示。在步骤710处,电子装置可确定曾接收到还是正在接收话音输入。举例来说,电子装置可确定输入接口(例如,麦克风)是否已检测到对应于话音输入的信号。如果电子装置确定尚未接收到或未在接收话音输入,那么过程700可返回到步骤708,且继续依序提供提示。如果在步骤710处电子装置改为确定话音输入曾被或正在被接收,那么过程700可移动到步骤712。
在步骤712处,电子装置可识别与已接收话音输入相关联的特性时戳。举例来说,电子装置可识别曾接收到话音输入的开始时间、话音输入曾结束的结束时间、曾处理话音输入的时间,或期间曾提供或处理话音输入的任何其它合适时间。时戳可包含任何合适的时间度量,包含(例如)装置时间、相对于一个或一个以上提示的时间,或可返回与已接收提示有关的任何其它时间。在步骤714处,电子装置可识别对应于已提供提示中包含特性时戳的提示的时间周期。举例来说,电子装置可识别与已提供提示中的每一者相关联的时间周期或输入窗口,且将所述时间周期或输入窗口的范围和所述特性时戳进行比较。在步骤716处,电子装置可确定是否曾识别出若干时间周期。举例来说,电子装置可确定特性时戳是否属于与已接收提示相关联的时间周期或输入窗口中的若干者(例如,如果时间周期或输入窗口重叠)。如果电子装置确定特性时戳仅属于一个时间周期,那么过程700可移动到步骤718。在步骤718处,电子装置可处理话音输入以提取指令。举例来说,电子装置可识别话音输入的特定词语或短语(例如,通过与词典进行比较),且识别与已识别词语或短语相关联的指令。过程700接着可移动到步骤724。
如果在步骤716处电子装置改为确定特性时戳属于若干时间周期,那么过程700可移动到步骤720。在步骤720处,电子装置可识别与已接收话音输入相关联的额外特性时戳。举例来说,电子装置可选择在期间曾提供话音输入的时间范围内的另一时戳。在一些情况下,电子装置可改为或另外识别特性时间范围以与话音输入相关联。在步骤722处,电子装置可识别已提供提示的包含原始特性时戳和额外时戳的时间周期。举例来说,电子装置可识别与已提供提示中的每一者相关联的时间周期或输入窗口,且将所述时间周期或输入窗口的范围与所述特性时戳和额外时戳进行比较。过程700接着可移动到上文所描述的步骤718。然而,在一些实施例中,过程700可返回到步骤716以确定若干时间周期是否仍与原始特性时戳和额外特性时戳相关联。如果识别出若干时间周期,那么过程700可返回到步骤720,在步骤720处,电子装置可识别又一额外特性时戳。
在步骤724处,电子装置可将已提取指令应用到对应于已识别时间周期的提示。举例来说,电子装置可执行选择特定提示或提供执行与特定提示相关联的操作所需要的一个或一个以上变量(例如,提供媒体项目以充当用于产生新播放列表的种子)的指令。过程700接着可在步骤706处结束。
图8是根据本发明的一个实施例的用于处理对应于提示的话音输入的说明性过程的流程图。过程800可在步骤802处开始。在步骤804处,电子装置可提供提示序列,所述提示各自与一时间周期相关联。举例来说,电子装置可依序显示或提供用于若干装置选项的音频输出。在步骤806处,电子装置可接收话音输入。举例来说,所述装置的输入接口可接收话音输入。在步骤808处,电子装置可识别与话音输入相关联的特性时间。举例来说,电子装置可识别期间曾在接收或处理话音输入的特定时间。在步骤810处,电子装置可识别包含所述特性时间的时间周期。举例来说,电子装置可识别特性时间所属的特定时间周期或窗口。在步骤812处,电子装置可将话音输入应用到与已识别时间相关联的提示。举例来说,电子装置可从话音输入中提取指令,且将所述指令应用到所述提示。过程800接着可在步骤814处结束。
图9是根据本发明的一个实施例的用于为提示界定输入窗口的说明性过程的流程图。过程900可在步骤902处开始。在步骤904处,电子装置可识别多个提示以依序提供给用户。可使用任何合适方法来提供提示,包含(例如)使用视觉、音频或触觉提示。在步骤806处,电子装置可相对于用于提供提示的开始时间和结束时间中的至少一者界定偏移量。举例来说,电子装置可将经分配用于提供提示的时间的持续时间或百分比界定为偏移量。在步骤908处,电子装置可确定界定用于提供的输入窗口的边界的初始时间和最终时间,其中初始时间和最终时间中的至少一者从开始时间和结束时间偏移了已界定偏移量。举例来说,用于确定哪些话音输入与已提供提示相关联的输入窗口可由从期间曾提供提示的开始时间和结束时间偏移的初始时戳和最终时戳界定(例如,输入窗口比曾提供提示的时间晚五秒)。过程900接着可在步骤912处结束。
尽管本文关于个人计算装置而描述本发明的实施例中的许多实施例,但应理解,本发明不限于个人计算应用,而是通常适用于其它应用。
本发明的实施例优选地由软件实施,但也可在硬件或硬件与软件的组合中实施。还可将本发明的实施例体现为计算机可读媒体上的计算机可读代码。计算机可读媒体为可存储可在此后由计算机系统读取的数据的任何数据存储装置。计算机可读媒体的实例包含只读存储器、随机存取存储器、CD-ROM、DVD、磁带,以及光学数据存储装置。计算机可读媒体也可分布在网络耦合计算机系统上,使得计算机可读代码以分布式方式予以存储和执行。
现在已知或日后想出的对如所属领域的技术人员所看到的所主张标的物的非实质性改变被明确预期为同等地在所附权利要求书的范围内。因此,所属领域的技术人员现在或日后已知的明显替换被界定为在已界定要素的范围内。
出于说明而非限制的目的而呈现本发明的上述实施例。
Claims (23)
1.一种用于处理响应于提示而提供的话音输入的方法,其包括:
自动提供提示序列,其中每一提示与一时间周期相关联;
随着提供所述提示序列而接收话音输入;
识别与所述已接收话音输入相关联的特性时间;
识别包含所述特性时间的所述时间周期;以及
将所述已接收话音输入应用到与所述已识别时间周期相关联的所述提示。
2.根据权利要求1所述的方法,其进一步包括:
为每一提示界定初始时戳和最终时戳,其中所述初始时戳与所述最终时戳之间的周期组成与所述提示相关联的所述时间周期。
3.根据权利要求2所述的方法,其中:
所述初始时戳不同于对应于开始提供所述提示的时戳;且
所述最终时戳不同于对应于停止提供所述提示的时戳。
4.根据权利要求3所述的方法,其中:
所述最终时戳是在对应于停止提供所述提示的所述时戳之后。
5.根据权利要求2所述的方法,其进一步包括:
界定最终时戳和初始时戳中的至少一者,使得与按顺序提供的提示相关联的时间周期重叠。
6.根据权利要求1所述的方法,其进一步包括:
确定每一提示的相对重要性;以及
基于所述提示的所述已确定相对重要性而改变每一提示的所述时间周期的长度。
7.根据权利要求6所述的方法,其中改变进一步包括:
改变所述最终时戳超过对应于停止提供所述提示的所述时戳的量。
8.根据权利要求7所述的方法,其进一步包括:
对所述提示进行排序,使得较不重要的提示在较重要的提示之间,以防止与所述较重要的提示相关联的所述时间周期重叠。
9.根据权利要求1所述的方法,其中识别特性时间进一步包括:
识别期间接收所述话音输入的特性时戳。
10.根据权利要求9所述的方法,其进一步包括:
为每一提示界定初始时戳和最终时戳,其中所述初始时戳与所述最终时戳之间的周期组成与所述提示相关联的所述时间周期;以及
识别初始时戳与最终时戳的组合,对于所述组合,所述特性时戳大于所述初始时戳但小于所述最终时戳。
11.根据权利要求1所述的方法,其中:
自动提供提示序列进一步包括自动提供话音输出提示序列,其中每一提示与一电子装置操作相关联。
12.一种用于处理话音输入的电子装置,其包括:
输出接口,其用于输出多个音频提示,其中所述音频提示是连续提供的;
输入接口,其用于接收话音输入;以及
处理模块,其操作以:
确定在曾接收到所述话音输入时曾输出至少两个音频提示;
为所述话音输入界定特性时戳;
将所述特性时戳和与所述至少两个音频提示中的每一者相关联的输入窗口进行比较,其中每一输入窗口界定期间已接收输入对应于所述输入窗口的所述音频提示的持续时间;且
使所述已接收话音输入与包含所述特性时戳的所述输入窗口的所述音频提示相关联。
13.根据权利要求12所述的电子装置,其中所述处理模块进一步操作以:
确定所述特性时戳包含于所述至少两个音频提示的所述输入窗口中;
为所述话音输入界定额外特性时戳;且
确定所述输入窗口中的哪一者包含所述特性时戳和所述额外特性时戳两者。
14.根据权利要求12所述的电子装置,其中所述处理模块进一步操作以:
从所述话音输入中提取指令;且
将所述已提取指令应用到包含所述特性时戳的所述输入窗口的所述音频提示。
15.根据权利要求14所述的电子装置,其中所述处理模块进一步操作以:
识别与包含所述特性时戳的所述输入窗口的所述音频提示相关联的操作;
基于所述已接收指令而确定执行所述操作的方式;且
以所述已确定方式执行所述操作。
16.根据权利要求15所述的电子装置,其中所述处理模块进一步操作以:
从所述指令确定表征所述操作的至少一个变量;且
使用来自所述指令的所述至少一个变量执行所述操作。
17.一种用于界定输入窗口以与已提供提示相关联的方法,其包括:
识别多个提示以依序提供,其中每一提示与一电子装置操作相关联;
相对于用于提供所述多个提示中的每一者的开始时间和结束时间中的至少一者界定偏移量;以及
为所述多个提示中的每一者确定由用于确定所述多个提示中的哪一已提供提示与已接收话音输入相关联的初始时间和最终时间界定的输入窗口,其中所述初始时间和所述最终时间中的至少一者从所述开始时间和所述结束时间偏移了所述已界定偏移量。
18.根据权利要求17所述的方法,其进一步包括:
确定每一提示的重要性;以及
基于所述提示的所述重要性而改变用于每一提示的所述已界定偏移量。
19.根据权利要求17所述的方法,其进一步包括:
相对于所述开始时间界定第一偏移量以应用到所述初始时间;以及
相对于所述结束时间界定第二偏移量以应用到所述最终时间,其中所述第一偏移量和所述第二偏移量不同。
20.根据权利要求17所述的方法,其中所述偏移量被界定为以下各项中的至少一者:
持续时间;
所述持续时间的提供所述提示的百分比;以及
所述持续时间的提供多个提示的序列中的另一提示的百分比。
21.一种用于处理响应于提示而提供的话音输入的计算机可读媒体,所述计算机可读媒体包括记录在其上的计算机程序逻辑,所述计算机程序逻辑用于:
自动提供提示序列,其中每一提示与一时间周期相关联;
随着提供所述提示序列而接收话音输入;
识别与所述已接收话音输入相关联的特性时间;
识别包含所述特性时间的所述时间周期;以及
将所述已接收话音输入应用到与所述已识别时间周期相关联的所述提示。
22.根据权利要求21所述的计算机可读媒体,其进一步包括记录在其上的额外计算机程序逻辑,所述额外计算机程序逻辑用于:
为每一提示界定初始时戳和最终时戳,其中所述初始时戳与所述最终时戳之间的周期组成与所述提示相关联的所述时间周期。
23.根据权利要求22所述的计算机可读媒体,其中:
所述初始时戳不同于对应于开始提供所述提示的时戳;且
所述最终时戳不同于对应于停止提供所述提示的时戳。
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410244527.9A CN104020978B (zh) | 2010-01-13 | 2011-01-11 | 话音输入的处理 |
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US12/686,774 US8311838B2 (en) | 2010-01-13 | 2010-01-13 | Devices and methods for identifying a prompt corresponding to a voice input in a sequence of prompts |
US12/686,774 | 2010-01-13 | ||
PCT/US2011/020825 WO2011088038A1 (en) | 2010-01-13 | 2011-01-11 | Processing of voice inputs |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201410244527.9A Division CN104020978B (zh) | 2010-01-13 | 2011-01-11 | 话音输入的处理 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN102763159A true CN102763159A (zh) | 2012-10-31 |
CN102763159B CN102763159B (zh) | 2014-07-09 |
Family
ID=43640481
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201410244527.9A Expired - Fee Related CN104020978B (zh) | 2010-01-13 | 2011-01-11 | 话音输入的处理 |
CN201180009581.XA Expired - Fee Related CN102763159B (zh) | 2010-01-13 | 2011-01-11 | 话音输入的处理 |
Family Applications Before (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201410244527.9A Expired - Fee Related CN104020978B (zh) | 2010-01-13 | 2011-01-11 | 话音输入的处理 |
Country Status (6)
Country | Link |
---|---|
US (2) | US8311838B2 (zh) |
EP (1) | EP2524369B1 (zh) |
KR (1) | KR101393816B1 (zh) |
CN (2) | CN104020978B (zh) |
AU (1) | AU2011205411B2 (zh) |
WO (1) | WO2011088038A1 (zh) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107066227A (zh) * | 2013-01-07 | 2017-08-18 | 三星电子株式会社 | 显示装置和用于控制显示装置的方法 |
CN108447476A (zh) * | 2017-02-06 | 2018-08-24 | 北京嘀嘀无限科技发展有限公司 | 用于请求服务以及服务资源分配的方法及装置 |
Families Citing this family (197)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8645137B2 (en) | 2000-03-16 | 2014-02-04 | Apple Inc. | Fast, language-independent method for user authentication by voice |
US8677377B2 (en) | 2005-09-08 | 2014-03-18 | Apple Inc. | Method and apparatus for building an intelligent automated assistant |
US9318108B2 (en) | 2010-01-18 | 2016-04-19 | Apple Inc. | Intelligent automated assistant |
US8977255B2 (en) | 2007-04-03 | 2015-03-10 | Apple Inc. | Method and system for operating a multi-function portable electronic device using voice-activation |
US10002189B2 (en) | 2007-12-20 | 2018-06-19 | Apple Inc. | Method and apparatus for searching using an active ontology |
US9330720B2 (en) | 2008-01-03 | 2016-05-03 | Apple Inc. | Methods and apparatus for altering audio output signals |
US8996376B2 (en) | 2008-04-05 | 2015-03-31 | Apple Inc. | Intelligent text-to-speech conversion |
US20100030549A1 (en) | 2008-07-31 | 2010-02-04 | Lee Michael M | Mobile device having human language translation capability with positional feedback |
US8463053B1 (en) | 2008-08-08 | 2013-06-11 | The Research Foundation Of State University Of New York | Enhanced max margin learning on multimodal data mining in a multimedia database |
US8676904B2 (en) | 2008-10-02 | 2014-03-18 | Apple Inc. | Electronic devices with voice command and contextual data processing capabilities |
US10241752B2 (en) | 2011-09-30 | 2019-03-26 | Apple Inc. | Interface for a virtual digital assistant |
US10241644B2 (en) | 2011-06-03 | 2019-03-26 | Apple Inc. | Actionable reminder entries |
US10706373B2 (en) | 2011-06-03 | 2020-07-07 | Apple Inc. | Performing actions associated with task items that represent tasks to perform |
US9431006B2 (en) | 2009-07-02 | 2016-08-30 | Apple Inc. | Methods and apparatuses for automatic speech recognition |
US8311838B2 (en) | 2010-01-13 | 2012-11-13 | Apple Inc. | Devices and methods for identifying a prompt corresponding to a voice input in a sequence of prompts |
US10276170B2 (en) | 2010-01-18 | 2019-04-30 | Apple Inc. | Intelligent automated assistant |
US8682667B2 (en) | 2010-02-25 | 2014-03-25 | Apple Inc. | User profiling for selecting user specific voice input processing information |
US9634855B2 (en) | 2010-05-13 | 2017-04-25 | Alexander Poltorak | Electronic personal interactive device that determines topics of interest using a conversational agent |
US8442835B2 (en) | 2010-06-17 | 2013-05-14 | At&T Intellectual Property I, L.P. | Methods, systems, and products for measuring health |
US8666768B2 (en) | 2010-07-27 | 2014-03-04 | At&T Intellectual Property I, L. P. | Methods, systems, and products for measuring health |
US9262612B2 (en) | 2011-03-21 | 2016-02-16 | Apple Inc. | Device access using voice authentication |
US9741226B1 (en) * | 2011-06-01 | 2017-08-22 | Cox Communications, Inc | System, method and device for monitoring the status of an entity based upon an established monitoring profile |
US10057736B2 (en) | 2011-06-03 | 2018-08-21 | Apple Inc. | Active transport based notifications |
DE102011079034A1 (de) | 2011-07-12 | 2013-01-17 | Siemens Aktiengesellschaft | Ansteuerung eines technischen Systems |
US8994660B2 (en) | 2011-08-29 | 2015-03-31 | Apple Inc. | Text correction processing |
WO2013109525A1 (en) | 2012-01-20 | 2013-07-25 | Sly Ward | Use of human input recognition to prevent contamination |
US9557903B2 (en) * | 2012-02-13 | 2017-01-31 | Lg Electronics Inc. | Method for providing user interface on terminal |
US10134385B2 (en) | 2012-03-02 | 2018-11-20 | Apple Inc. | Systems and methods for name pronunciation |
US9280610B2 (en) | 2012-05-14 | 2016-03-08 | Apple Inc. | Crowd sourcing information to fulfill user requests |
US10417037B2 (en) | 2012-05-15 | 2019-09-17 | Apple Inc. | Systems and methods for integrating third party services with a digital assistant |
US9721563B2 (en) | 2012-06-08 | 2017-08-01 | Apple Inc. | Name recognition system |
US20130339455A1 (en) * | 2012-06-19 | 2013-12-19 | Research In Motion Limited | Method and Apparatus for Identifying an Active Participant in a Conferencing Event |
CN103632664B (zh) * | 2012-08-20 | 2017-07-25 | 联想(北京)有限公司 | 一种语音识别的方法及电子设备 |
US9547647B2 (en) | 2012-09-19 | 2017-01-17 | Apple Inc. | Voice-based media searching |
US9542936B2 (en) | 2012-12-29 | 2017-01-10 | Genesys Telecommunications Laboratories, Inc. | Fast out-of-vocabulary search in automatic speech recognition systems |
KR20240132105A (ko) | 2013-02-07 | 2024-09-02 | 애플 인크. | 디지털 어시스턴트를 위한 음성 트리거 |
US10652394B2 (en) | 2013-03-14 | 2020-05-12 | Apple Inc. | System and method for processing voicemail |
US10748529B1 (en) | 2013-03-15 | 2020-08-18 | Apple Inc. | Voice activated device for use with a voice-based digital assistant |
US20140337901A1 (en) * | 2013-05-07 | 2014-11-13 | Ericsson Television Inc. | Network personal video recorder system, method and associated subscriber device |
WO2014197334A2 (en) | 2013-06-07 | 2014-12-11 | Apple Inc. | System and method for user-specified pronunciation of words for speech synthesis and recognition |
US9582608B2 (en) | 2013-06-07 | 2017-02-28 | Apple Inc. | Unified ranking with entropy-weighted information for phrase-based semantic auto-completion |
WO2014197336A1 (en) | 2013-06-07 | 2014-12-11 | Apple Inc. | System and method for detecting errors in interactions with a voice-based digital assistant |
WO2014197335A1 (en) | 2013-06-08 | 2014-12-11 | Apple Inc. | Interpreting and acting upon commands that involve sharing information with remote devices |
KR101772152B1 (ko) | 2013-06-09 | 2017-08-28 | 애플 인크. | 디지털 어시스턴트의 둘 이상의 인스턴스들에 걸친 대화 지속성을 가능하게 하기 위한 디바이스, 방법 및 그래픽 사용자 인터페이스 |
US10176167B2 (en) | 2013-06-09 | 2019-01-08 | Apple Inc. | System and method for inferring user intent from speech inputs |
USRE49014E1 (en) * | 2013-06-19 | 2022-04-05 | Panasonic Intellectual Property Corporation Of America | Voice interaction method, and device |
DE112014003653B4 (de) | 2013-08-06 | 2024-04-18 | Apple Inc. | Automatisch aktivierende intelligente Antworten auf der Grundlage von Aktivitäten von entfernt angeordneten Vorrichtungen |
US10296160B2 (en) | 2013-12-06 | 2019-05-21 | Apple Inc. | Method for extracting salient dialog usage from live data |
CN108469937B (zh) * | 2014-03-26 | 2020-11-20 | 联想(北京)有限公司 | 一种信息处理方法及电子设备 |
US9633004B2 (en) | 2014-05-30 | 2017-04-25 | Apple Inc. | Better resolution when referencing to concepts |
US10170123B2 (en) | 2014-05-30 | 2019-01-01 | Apple Inc. | Intelligent assistant for home automation |
US9430463B2 (en) | 2014-05-30 | 2016-08-30 | Apple Inc. | Exemplar-based natural language processing |
CN110797019B (zh) | 2014-05-30 | 2023-08-29 | 苹果公司 | 多命令单一话语输入方法 |
US9715875B2 (en) | 2014-05-30 | 2017-07-25 | Apple Inc. | Reducing the need for manual start/end-pointing and trigger phrases |
US9842101B2 (en) | 2014-05-30 | 2017-12-12 | Apple Inc. | Predictive conversion of language input |
US9933994B2 (en) * | 2014-06-24 | 2018-04-03 | Lenovo (Singapore) Pte. Ltd. | Receiving at a device audible input that is spelled |
US9338493B2 (en) | 2014-06-30 | 2016-05-10 | Apple Inc. | Intelligent automated assistant for TV user interactions |
US9257120B1 (en) | 2014-07-18 | 2016-02-09 | Google Inc. | Speaker verification using co-location information |
US11676608B2 (en) | 2021-04-02 | 2023-06-13 | Google Llc | Speaker verification using co-location information |
US11942095B2 (en) | 2014-07-18 | 2024-03-26 | Google Llc | Speaker verification using co-location information |
US9818400B2 (en) | 2014-09-11 | 2017-11-14 | Apple Inc. | Method and apparatus for discovering trending terms in speech requests |
US10789041B2 (en) | 2014-09-12 | 2020-09-29 | Apple Inc. | Dynamic thresholds for always listening speech trigger |
US9886432B2 (en) | 2014-09-30 | 2018-02-06 | Apple Inc. | Parsimonious handling of word inflection via categorical stem + suffix N-gram language models |
US9646609B2 (en) | 2014-09-30 | 2017-05-09 | Apple Inc. | Caching apparatus for serving phonetic pronunciations |
US9668121B2 (en) | 2014-09-30 | 2017-05-30 | Apple Inc. | Social reminders |
US10127911B2 (en) | 2014-09-30 | 2018-11-13 | Apple Inc. | Speaker identification and unsupervised speaker adaptation techniques |
US10074360B2 (en) | 2014-09-30 | 2018-09-11 | Apple Inc. | Providing an indication of the suitability of speech recognition |
US9424841B2 (en) | 2014-10-09 | 2016-08-23 | Google Inc. | Hotword detection on multiple devices |
US9812128B2 (en) | 2014-10-09 | 2017-11-07 | Google Inc. | Device leadership negotiation among voice interface devices |
US9318107B1 (en) | 2014-10-09 | 2016-04-19 | Google Inc. | Hotword detection on multiple devices |
JP5907231B1 (ja) * | 2014-10-15 | 2016-04-26 | 富士通株式会社 | 入力情報支援装置、入力情報支援方法および入力情報支援プログラム |
US9865280B2 (en) | 2015-03-06 | 2018-01-09 | Apple Inc. | Structured dictation using intelligent automated assistants |
US10152299B2 (en) | 2015-03-06 | 2018-12-11 | Apple Inc. | Reducing response latency of intelligent automated assistants |
US9886953B2 (en) | 2015-03-08 | 2018-02-06 | Apple Inc. | Virtual assistant activation |
US10567477B2 (en) | 2015-03-08 | 2020-02-18 | Apple Inc. | Virtual assistant continuity |
US9721566B2 (en) | 2015-03-08 | 2017-08-01 | Apple Inc. | Competing devices responding to voice triggers |
US9899019B2 (en) | 2015-03-18 | 2018-02-20 | Apple Inc. | Systems and methods for structured stem and suffix language models |
US9842105B2 (en) | 2015-04-16 | 2017-12-12 | Apple Inc. | Parsimonious continuous-space phrase representations for natural language processing |
US10460227B2 (en) | 2015-05-15 | 2019-10-29 | Apple Inc. | Virtual assistant in a communication session |
US10200824B2 (en) | 2015-05-27 | 2019-02-05 | Apple Inc. | Systems and methods for proactively identifying and surfacing relevant content on a touch-sensitive device |
US10083688B2 (en) | 2015-05-27 | 2018-09-25 | Apple Inc. | Device voice control for selecting a displayed affordance |
US10127220B2 (en) | 2015-06-04 | 2018-11-13 | Apple Inc. | Language identification from short strings |
US9578173B2 (en) | 2015-06-05 | 2017-02-21 | Apple Inc. | Virtual assistant aided communication with 3rd party service in a communication session |
US10101822B2 (en) | 2015-06-05 | 2018-10-16 | Apple Inc. | Language input correction |
US10255907B2 (en) | 2015-06-07 | 2019-04-09 | Apple Inc. | Automatic accent detection using acoustic models |
US11025565B2 (en) | 2015-06-07 | 2021-06-01 | Apple Inc. | Personalized prediction of responses for instant messaging |
US10186254B2 (en) | 2015-06-07 | 2019-01-22 | Apple Inc. | Context-based endpoint detection |
US20160378747A1 (en) | 2015-06-29 | 2016-12-29 | Apple Inc. | Virtual assistant for media playback |
US10178218B1 (en) | 2015-09-04 | 2019-01-08 | Vishal Vadodaria | Intelligent agent / personal virtual assistant with animated 3D persona, facial expressions, human gestures, body movements and mental states |
US10747498B2 (en) | 2015-09-08 | 2020-08-18 | Apple Inc. | Zero latency digital assistant |
US10740384B2 (en) | 2015-09-08 | 2020-08-11 | Apple Inc. | Intelligent automated assistant for media search and playback |
US10671428B2 (en) | 2015-09-08 | 2020-06-02 | Apple Inc. | Distributed personal assistant |
US10331312B2 (en) | 2015-09-08 | 2019-06-25 | Apple Inc. | Intelligent automated assistant in a media environment |
US9697820B2 (en) | 2015-09-24 | 2017-07-04 | Apple Inc. | Unit-selection text-to-speech synthesis using concatenation-sensitive neural networks |
US10366158B2 (en) | 2015-09-29 | 2019-07-30 | Apple Inc. | Efficient word encoding for recurrent neural network language models |
US11010550B2 (en) | 2015-09-29 | 2021-05-18 | Apple Inc. | Unified language modeling framework for word prediction, auto-completion and auto-correction |
US11587559B2 (en) | 2015-09-30 | 2023-02-21 | Apple Inc. | Intelligent device identification |
US10691473B2 (en) | 2015-11-06 | 2020-06-23 | Apple Inc. | Intelligent automated assistant in a messaging environment |
US10956666B2 (en) | 2015-11-09 | 2021-03-23 | Apple Inc. | Unconventional virtual assistant interactions |
CN106782627B (zh) * | 2015-11-23 | 2019-08-27 | 广州酷狗计算机科技有限公司 | 音频文件的重录方法及装置 |
US10049668B2 (en) | 2015-12-02 | 2018-08-14 | Apple Inc. | Applying neural network language models to weighted finite state transducers for automatic speech recognition |
US10223066B2 (en) | 2015-12-23 | 2019-03-05 | Apple Inc. | Proactive assistance based on dialog communication between devices |
US9779735B2 (en) | 2016-02-24 | 2017-10-03 | Google Inc. | Methods and systems for detecting and processing speech signals |
US10446143B2 (en) | 2016-03-14 | 2019-10-15 | Apple Inc. | Identification of voice inputs providing credentials |
US9934775B2 (en) | 2016-05-26 | 2018-04-03 | Apple Inc. | Unit-selection text-to-speech synthesis based on predicted concatenation parameters |
US9972304B2 (en) | 2016-06-03 | 2018-05-15 | Apple Inc. | Privacy preserving distributed evaluation framework for embedded personalized systems |
US11227589B2 (en) | 2016-06-06 | 2022-01-18 | Apple Inc. | Intelligent list reading |
US10249300B2 (en) | 2016-06-06 | 2019-04-02 | Apple Inc. | Intelligent list reading |
US10049663B2 (en) | 2016-06-08 | 2018-08-14 | Apple, Inc. | Intelligent automated assistant for media exploration |
DK179588B1 (en) | 2016-06-09 | 2019-02-22 | Apple Inc. | INTELLIGENT AUTOMATED ASSISTANT IN A HOME ENVIRONMENT |
US10509862B2 (en) | 2016-06-10 | 2019-12-17 | Apple Inc. | Dynamic phrase expansion of language input |
US10192552B2 (en) | 2016-06-10 | 2019-01-29 | Apple Inc. | Digital assistant providing whispered speech |
US10067938B2 (en) | 2016-06-10 | 2018-09-04 | Apple Inc. | Multilingual word prediction |
US10490187B2 (en) | 2016-06-10 | 2019-11-26 | Apple Inc. | Digital assistant providing automated status report |
US10586535B2 (en) | 2016-06-10 | 2020-03-10 | Apple Inc. | Intelligent digital assistant in a multi-tasking environment |
DK179343B1 (en) | 2016-06-11 | 2018-05-14 | Apple Inc | Intelligent task discovery |
DK179049B1 (en) | 2016-06-11 | 2017-09-18 | Apple Inc | Data driven natural language event detection and classification |
DK179415B1 (en) | 2016-06-11 | 2018-06-14 | Apple Inc | Intelligent device arbitration and control |
DK201670540A1 (en) | 2016-06-11 | 2018-01-08 | Apple Inc | Application integration with a digital assistant |
US9972320B2 (en) | 2016-08-24 | 2018-05-15 | Google Llc | Hotword detection on multiple devices |
US10474753B2 (en) | 2016-09-07 | 2019-11-12 | Apple Inc. | Language identification using recurrent neural networks |
US10282537B2 (en) | 2016-09-20 | 2019-05-07 | International Business Machines Corporation | Single prompt multiple-response user authentication method |
US10043516B2 (en) | 2016-09-23 | 2018-08-07 | Apple Inc. | Intelligent automated assistant |
JP6515897B2 (ja) | 2016-09-28 | 2019-05-22 | トヨタ自動車株式会社 | 音声対話システムおよび発話意図理解方法 |
WO2018085192A1 (en) | 2016-11-07 | 2018-05-11 | Google Llc | Recorded media hotword trigger suppression |
US11281993B2 (en) | 2016-12-05 | 2022-03-22 | Apple Inc. | Model and ensemble compression for metric learning |
US20180166073A1 (en) * | 2016-12-13 | 2018-06-14 | Ford Global Technologies, Llc | Speech Recognition Without Interrupting The Playback Audio |
US10593346B2 (en) | 2016-12-22 | 2020-03-17 | Apple Inc. | Rank-reduced token representation for automatic speech recognition |
US11204787B2 (en) | 2017-01-09 | 2021-12-21 | Apple Inc. | Application integration with a digital assistant |
CN117577099A (zh) | 2017-04-20 | 2024-02-20 | 谷歌有限责任公司 | 设备上的多用户认证的方法、系统和介质 |
US10417266B2 (en) | 2017-05-09 | 2019-09-17 | Apple Inc. | Context-aware ranking of intelligent response suggestions |
DK201770383A1 (en) | 2017-05-09 | 2018-12-14 | Apple Inc. | USER INTERFACE FOR CORRECTING RECOGNITION ERRORS |
US10395654B2 (en) | 2017-05-11 | 2019-08-27 | Apple Inc. | Text normalization based on a data-driven learning network |
DK180048B1 (en) | 2017-05-11 | 2020-02-04 | Apple Inc. | MAINTAINING THE DATA PROTECTION OF PERSONAL INFORMATION |
DK201770439A1 (en) | 2017-05-11 | 2018-12-13 | Apple Inc. | Offline personal assistant |
US10726832B2 (en) | 2017-05-11 | 2020-07-28 | Apple Inc. | Maintaining privacy of personal information |
DK179496B1 (en) | 2017-05-12 | 2019-01-15 | Apple Inc. | USER-SPECIFIC Acoustic Models |
US11301477B2 (en) | 2017-05-12 | 2022-04-12 | Apple Inc. | Feedback analysis of a digital assistant |
DK201770428A1 (en) | 2017-05-12 | 2019-02-18 | Apple Inc. | LOW-LATENCY INTELLIGENT AUTOMATED ASSISTANT |
DK179745B1 (en) | 2017-05-12 | 2019-05-01 | Apple Inc. | SYNCHRONIZATION AND TASK DELEGATION OF A DIGITAL ASSISTANT |
DK201770411A1 (en) | 2017-05-15 | 2018-12-20 | Apple Inc. | MULTI-MODAL INTERFACES |
DK201770431A1 (en) | 2017-05-15 | 2018-12-20 | Apple Inc. | Optimizing dialogue policy decisions for digital assistants using implicit feedback |
DK201770432A1 (en) | 2017-05-15 | 2018-12-21 | Apple Inc. | Hierarchical belief states for digital assistants |
DK179549B1 (en) | 2017-05-16 | 2019-02-12 | Apple Inc. | FAR-FIELD EXTENSION FOR DIGITAL ASSISTANT SERVICES |
US10311144B2 (en) | 2017-05-16 | 2019-06-04 | Apple Inc. | Emoji word sense disambiguation |
US20180336275A1 (en) | 2017-05-16 | 2018-11-22 | Apple Inc. | Intelligent automated assistant for media exploration |
US20180336892A1 (en) | 2017-05-16 | 2018-11-22 | Apple Inc. | Detecting a trigger of a digital assistant |
US10403278B2 (en) | 2017-05-16 | 2019-09-03 | Apple Inc. | Methods and systems for phonetic matching in digital assistant services |
US10657328B2 (en) | 2017-06-02 | 2020-05-19 | Apple Inc. | Multi-task recurrent neural network architecture for efficient morphology handling in neural language modeling |
US10395650B2 (en) | 2017-06-05 | 2019-08-27 | Google Llc | Recorded media hotword trigger suppression |
US11120817B2 (en) * | 2017-08-25 | 2021-09-14 | David Tuk Wai LEONG | Sound recognition apparatus |
US10445429B2 (en) | 2017-09-21 | 2019-10-15 | Apple Inc. | Natural language understanding using vocabularies with compressed serialized tries |
US10755051B2 (en) | 2017-09-29 | 2020-08-25 | Apple Inc. | Rule-based natural language processing |
US10636424B2 (en) | 2017-11-30 | 2020-04-28 | Apple Inc. | Multi-turn canned dialog |
US10733982B2 (en) | 2018-01-08 | 2020-08-04 | Apple Inc. | Multi-directional dialog |
US10733375B2 (en) | 2018-01-31 | 2020-08-04 | Apple Inc. | Knowledge-based framework for improving natural language understanding |
US10789959B2 (en) | 2018-03-02 | 2020-09-29 | Apple Inc. | Training speaker recognition models for digital assistants |
US10592604B2 (en) | 2018-03-12 | 2020-03-17 | Apple Inc. | Inverse text normalization for automatic speech recognition |
US10818288B2 (en) | 2018-03-26 | 2020-10-27 | Apple Inc. | Natural assistant interaction |
US10909331B2 (en) | 2018-03-30 | 2021-02-02 | Apple Inc. | Implicit identification of translation payload with neural machine translation |
US10928918B2 (en) | 2018-05-07 | 2021-02-23 | Apple Inc. | Raise to speak |
US11145294B2 (en) | 2018-05-07 | 2021-10-12 | Apple Inc. | Intelligent automated assistant for delivering content from user experiences |
US10984780B2 (en) | 2018-05-21 | 2021-04-20 | Apple Inc. | Global semantic word embeddings using bi-directional recurrent neural networks |
US10692496B2 (en) | 2018-05-22 | 2020-06-23 | Google Llc | Hotword suppression |
DK201870355A1 (en) | 2018-06-01 | 2019-12-16 | Apple Inc. | VIRTUAL ASSISTANT OPERATION IN MULTI-DEVICE ENVIRONMENTS |
DK179822B1 (da) | 2018-06-01 | 2019-07-12 | Apple Inc. | Voice interaction at a primary device to access call functionality of a companion device |
US11386266B2 (en) | 2018-06-01 | 2022-07-12 | Apple Inc. | Text correction |
US10892996B2 (en) | 2018-06-01 | 2021-01-12 | Apple Inc. | Variable latency device coordination |
DK180639B1 (en) | 2018-06-01 | 2021-11-04 | Apple Inc | DISABILITY OF ATTENTION-ATTENTIVE VIRTUAL ASSISTANT |
US11076039B2 (en) | 2018-06-03 | 2021-07-27 | Apple Inc. | Accelerated task performance |
US11010561B2 (en) | 2018-09-27 | 2021-05-18 | Apple Inc. | Sentiment prediction from textual data |
US11170166B2 (en) | 2018-09-28 | 2021-11-09 | Apple Inc. | Neural typographical error modeling via generative adversarial networks |
US11462215B2 (en) | 2018-09-28 | 2022-10-04 | Apple Inc. | Multi-modal inputs for voice commands |
US10839159B2 (en) | 2018-09-28 | 2020-11-17 | Apple Inc. | Named entity normalization in a spoken dialog system |
US11475898B2 (en) | 2018-10-26 | 2022-10-18 | Apple Inc. | Low-latency multi-speaker speech recognition |
CN109410944B (zh) * | 2018-12-12 | 2020-06-09 | 百度在线网络技术(北京)有限公司 | 语音交互方法、装置和终端 |
US11638059B2 (en) | 2019-01-04 | 2023-04-25 | Apple Inc. | Content playback on multiple devices |
KR20200098025A (ko) * | 2019-02-11 | 2020-08-20 | 삼성전자주식회사 | 전자 장치 및 그 제어 방법 |
US11348573B2 (en) | 2019-03-18 | 2022-05-31 | Apple Inc. | Multimodality in digital assistant systems |
US11475884B2 (en) | 2019-05-06 | 2022-10-18 | Apple Inc. | Reducing digital assistant latency when a language is incorrectly determined |
US11307752B2 (en) | 2019-05-06 | 2022-04-19 | Apple Inc. | User configurable task triggers |
US11423908B2 (en) | 2019-05-06 | 2022-08-23 | Apple Inc. | Interpreting spoken requests |
DK201970509A1 (en) | 2019-05-06 | 2021-01-15 | Apple Inc | Spoken notifications |
US11140099B2 (en) | 2019-05-21 | 2021-10-05 | Apple Inc. | Providing message response suggestions |
DK180129B1 (en) | 2019-05-31 | 2020-06-02 | Apple Inc. | USER ACTIVITY SHORTCUT SUGGESTIONS |
US11496600B2 (en) | 2019-05-31 | 2022-11-08 | Apple Inc. | Remote execution of machine-learned models |
US11289073B2 (en) | 2019-05-31 | 2022-03-29 | Apple Inc. | Device text to speech |
DK201970511A1 (en) | 2019-05-31 | 2021-02-15 | Apple Inc | Voice identification in digital assistant systems |
US11227599B2 (en) | 2019-06-01 | 2022-01-18 | Apple Inc. | Methods and user interfaces for voice-based control of electronic devices |
US11360641B2 (en) | 2019-06-01 | 2022-06-14 | Apple Inc. | Increasing the relevance of new available information |
KR20210015428A (ko) | 2019-08-02 | 2021-02-10 | 삼성전자주식회사 | 사용자 인터페이스를 제공하는 전자 장치 및 방법 |
WO2021056255A1 (en) | 2019-09-25 | 2021-04-01 | Apple Inc. | Text detection using global geometry estimators |
US11061543B1 (en) | 2020-05-11 | 2021-07-13 | Apple Inc. | Providing relevant data items based on context |
US11038934B1 (en) | 2020-05-11 | 2021-06-15 | Apple Inc. | Digital assistant hardware abstraction |
US11755276B2 (en) | 2020-05-12 | 2023-09-12 | Apple Inc. | Reducing description length based on confidence |
US11490204B2 (en) | 2020-07-20 | 2022-11-01 | Apple Inc. | Multi-device audio adjustment coordination |
US11438683B2 (en) | 2020-07-21 | 2022-09-06 | Apple Inc. | User identification using headphones |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5930751A (en) * | 1997-05-30 | 1999-07-27 | Lucent Technologies Inc. | Method of implicit confirmation for automatic speech recognition |
WO2001046946A1 (en) * | 1999-12-22 | 2001-06-28 | Ambush Interactive, Inc. | Hands-free, voice-operated remote control transmitter |
US20030171928A1 (en) * | 2002-02-04 | 2003-09-11 | Falcon Stephen Russel | Systems and methods for managing interactions from multiple speech-enabled applications |
US20060247931A1 (en) * | 2005-04-29 | 2006-11-02 | International Business Machines Corporation | Method and apparatus for multiple value confirmation and correction in spoken dialog systems |
CN101228503A (zh) * | 2005-03-23 | 2008-07-23 | 摩托罗拉公司 | 用于用户界面的自适应菜单 |
Family Cites Families (585)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US3828132A (en) | 1970-10-30 | 1974-08-06 | Bell Telephone Labor Inc | Speech synthesis by concatenation of formant encoded words |
US3704345A (en) | 1971-03-19 | 1972-11-28 | Bell Telephone Labor Inc | Conversion of printed text into synthetic speech |
US3979557A (en) | 1974-07-03 | 1976-09-07 | International Telephone And Telegraph Corporation | Speech processor system for pitch period extraction using prediction filters |
BG24190A1 (en) | 1976-09-08 | 1978-01-10 | Antonov | Method of synthesis of speech and device for effecting same |
JPS597120B2 (ja) | 1978-11-24 | 1984-02-16 | 日本電気株式会社 | 音声分析装置 |
US4310721A (en) | 1980-01-23 | 1982-01-12 | The United States Of America As Represented By The Secretary Of The Army | Half duplex integral vocoder modem system |
US4348553A (en) | 1980-07-02 | 1982-09-07 | International Business Machines Corporation | Parallel pattern verifier with dynamic time warping |
US5047617A (en) | 1982-01-25 | 1991-09-10 | Symbol Technologies, Inc. | Narrow-bodied, single- and twin-windowed portable laser scanning head for reading bar code symbols |
DE3382796T2 (de) | 1982-06-11 | 1996-03-28 | Mitsubishi Electric Corp | Vorrichtung zur Zwischenbildkodierung. |
US4688195A (en) | 1983-01-28 | 1987-08-18 | Texas Instruments Incorporated | Natural-language interface generating system |
JPS603056A (ja) | 1983-06-21 | 1985-01-09 | Toshiba Corp | 情報整理装置 |
DE3335358A1 (de) | 1983-09-29 | 1985-04-11 | Siemens AG, 1000 Berlin und 8000 München | Verfahren zur bestimmung von sprachspektren fuer die automatische spracherkennung und sprachcodierung |
US5164900A (en) | 1983-11-14 | 1992-11-17 | Colman Bernath | Method and device for phonetically encoding Chinese textual data for data processing entry |
US4726065A (en) | 1984-01-26 | 1988-02-16 | Horst Froessl | Image manipulation by speech signals |
US4955047A (en) | 1984-03-26 | 1990-09-04 | Dytel Corporation | Automated attendant with direct inward system access |
US4811243A (en) | 1984-04-06 | 1989-03-07 | Racine Marsh V | Computer aided coordinate digitizing system |
US4692941A (en) | 1984-04-10 | 1987-09-08 | First Byte | Real-time text-to-speech conversion system |
US4783807A (en) | 1984-08-27 | 1988-11-08 | John Marley | System and method for sound recognition with feature selection synchronized to voice pitch |
US4718094A (en) | 1984-11-19 | 1988-01-05 | International Business Machines Corp. | Speech recognition system |
US5165007A (en) | 1985-02-01 | 1992-11-17 | International Business Machines Corporation | Feneme-based Markov models for words |
US4944013A (en) | 1985-04-03 | 1990-07-24 | British Telecommunications Public Limited Company | Multi-pulse speech coder |
US4819271A (en) | 1985-05-29 | 1989-04-04 | International Business Machines Corporation | Constructing Markov model word baseforms from multiple utterances by concatenating model sequences for word segments |
US4833712A (en) | 1985-05-29 | 1989-05-23 | International Business Machines Corporation | Automatic generation of simple Markov model stunted baseforms for words in a vocabulary |
EP0218859A3 (en) | 1985-10-11 | 1989-09-06 | International Business Machines Corporation | Signal processor communication interface |
US4776016A (en) | 1985-11-21 | 1988-10-04 | Position Orientation Systems, Inc. | Voice control system |
JPH0833744B2 (ja) | 1986-01-09 | 1996-03-29 | 株式会社東芝 | 音声合成装置 |
US4724542A (en) | 1986-01-22 | 1988-02-09 | International Business Machines Corporation | Automatic reference adaptation during dynamic signature verification |
US5759101A (en) | 1986-03-10 | 1998-06-02 | Response Reward Systems L.C. | Central and remote evaluation of responses of participatory broadcast audience with automatic crediting and couponing |
US5128752A (en) | 1986-03-10 | 1992-07-07 | Kohorn H Von | System and method for generating and redeeming tokens |
US5032989A (en) | 1986-03-19 | 1991-07-16 | Realpro, Ltd. | Real estate search and location system and method |
DE3779351D1 (zh) | 1986-03-28 | 1992-07-02 | American Telephone And Telegraph Co., New York, N.Y., Us | |
US4903305A (en) | 1986-05-12 | 1990-02-20 | Dragon Systems, Inc. | Method for representing word models for use in speech recognition |
ES2047494T3 (es) | 1986-10-03 | 1994-03-01 | British Telecomm | Sistema de traduccion de lenguas. |
US4878230A (en) | 1986-10-16 | 1989-10-31 | Mitsubishi Denki Kabushiki Kaisha | Amplitude-adaptive vector quantization system |
US4829576A (en) | 1986-10-21 | 1989-05-09 | Dragon Systems, Inc. | Voice recognition system |
US4852168A (en) | 1986-11-18 | 1989-07-25 | Sprague Richard P | Compression of stored waveforms for artificial speech |
US4727354A (en) | 1987-01-07 | 1988-02-23 | Unisys Corporation | System for selecting best fit vector code in vector quantization encoding |
US4827520A (en) | 1987-01-16 | 1989-05-02 | Prince Corporation | Voice actuated control system for use in a vehicle |
US4965763A (en) | 1987-03-03 | 1990-10-23 | International Business Machines Corporation | Computer method for automatic extraction of commonly specified information from business correspondence |
US5644727A (en) | 1987-04-15 | 1997-07-01 | Proprietary Financial Products, Inc. | System for the operation and management of one or more financial accounts through the use of a digital communication and computation system for exchange, investment and borrowing |
EP0293259A3 (en) | 1987-05-29 | 1990-03-07 | Kabushiki Kaisha Toshiba | Voice recognition system used in telephone apparatus |
DE3723078A1 (de) | 1987-07-11 | 1989-01-19 | Philips Patentverwaltung | Verfahren zur erkennung von zusammenhaengend gesprochenen woertern |
US4974191A (en) | 1987-07-31 | 1990-11-27 | Syntellect Software Inc. | Adaptive natural language computer interface system |
CA1288516C (en) | 1987-07-31 | 1991-09-03 | Leendert M. Bijnagte | Apparatus and method for communicating textual and image information between a host computer and a remote display terminal |
US5022081A (en) | 1987-10-01 | 1991-06-04 | Sharp Kabushiki Kaisha | Information recognition system |
US4852173A (en) | 1987-10-29 | 1989-07-25 | International Business Machines Corporation | Design and construction of a binary-tree system for language modelling |
DE3876379T2 (de) | 1987-10-30 | 1993-06-09 | Ibm | Automatische bestimmung von kennzeichen und markov-wortmodellen in einem spracherkennungssystem. |
US5072452A (en) | 1987-10-30 | 1991-12-10 | International Business Machines Corporation | Automatic determination of labels and Markov word models in a speech recognition system |
US4914586A (en) | 1987-11-06 | 1990-04-03 | Xerox Corporation | Garbage collector for hypermedia systems |
US4992972A (en) | 1987-11-18 | 1991-02-12 | International Business Machines Corporation | Flexible context searchable on-line information system with help files and modules for on-line computer system documentation |
US5220657A (en) | 1987-12-02 | 1993-06-15 | Xerox Corporation | Updating local copy of shared data in a collaborative system |
US4984177A (en) | 1988-02-05 | 1991-01-08 | Advanced Products And Technologies, Inc. | Voice language translator |
US5194950A (en) | 1988-02-29 | 1993-03-16 | Mitsubishi Denki Kabushiki Kaisha | Vector quantizer |
US4914590A (en) | 1988-05-18 | 1990-04-03 | Emhart Industries, Inc. | Natural language understanding system |
FR2636163B1 (fr) | 1988-09-02 | 1991-07-05 | Hamon Christian | Procede et dispositif de synthese de la parole par addition-recouvrement de formes d'onde |
US4839853A (en) | 1988-09-15 | 1989-06-13 | Bell Communications Research, Inc. | Computer information retrieval using latent semantic structure |
JPH0293597A (ja) | 1988-09-30 | 1990-04-04 | Nippon I B M Kk | 音声認識装置 |
US4905163A (en) | 1988-10-03 | 1990-02-27 | Minnesota Mining & Manufacturing Company | Intelligent optical navigator dynamic information presentation and navigation system |
US5282265A (en) | 1988-10-04 | 1994-01-25 | Canon Kabushiki Kaisha | Knowledge information processing system |
DE3837590A1 (de) | 1988-11-05 | 1990-05-10 | Ant Nachrichtentech | Verfahren zum reduzieren der datenrate von digitalen bilddaten |
EP0372734B1 (en) | 1988-11-23 | 1994-03-09 | Digital Equipment Corporation | Name pronunciation by synthesizer |
US5027406A (en) | 1988-12-06 | 1991-06-25 | Dragon Systems, Inc. | Method for interactive speech recognition and training |
US5127055A (en) | 1988-12-30 | 1992-06-30 | Kurzweil Applied Intelligence, Inc. | Speech recognition apparatus & method having dynamic reference pattern adaptation |
US5293448A (en) | 1989-10-02 | 1994-03-08 | Nippon Telegraph And Telephone Corporation | Speech analysis-synthesis method and apparatus therefor |
SE466029B (sv) | 1989-03-06 | 1991-12-02 | Ibm Svenska Ab | Anordning och foerfarande foer analys av naturligt spraak i ett datorbaserat informationsbehandlingssystem |
JPH0782544B2 (ja) | 1989-03-24 | 1995-09-06 | インターナショナル・ビジネス・マシーンズ・コーポレーション | マルチテンプレートを用いるdpマツチング方法及び装置 |
US4977598A (en) | 1989-04-13 | 1990-12-11 | Texas Instruments Incorporated | Efficient pruning algorithm for hidden markov model speech recognition |
US5197005A (en) | 1989-05-01 | 1993-03-23 | Intelligent Business Systems | Database retrieval system having a natural language interface |
US5010574A (en) | 1989-06-13 | 1991-04-23 | At&T Bell Laboratories | Vector quantizer search arrangement |
JP2940005B2 (ja) | 1989-07-20 | 1999-08-25 | 日本電気株式会社 | 音声符号化装置 |
US5091945A (en) | 1989-09-28 | 1992-02-25 | At&T Bell Laboratories | Source dependent channel coding with error protection |
CA2027705C (en) | 1989-10-17 | 1994-02-15 | Masami Akamine | Speech coding system utilizing a recursive computation technique for improvement in processing speed |
US5020112A (en) | 1989-10-31 | 1991-05-28 | At&T Bell Laboratories | Image recognition method using two-dimensional stochastic grammars |
US5220639A (en) | 1989-12-01 | 1993-06-15 | National Science Council | Mandarin speech input method for Chinese computers and a mandarin speech recognition machine |
US5021971A (en) | 1989-12-07 | 1991-06-04 | Unisys Corporation | Reflective binary encoder for vector quantization |
US5179652A (en) | 1989-12-13 | 1993-01-12 | Anthony I. Rozmanith | Method and apparatus for storing, transmitting and retrieving graphical and tabular data |
CH681573A5 (en) | 1990-02-13 | 1993-04-15 | Astral | Automatic teller arrangement involving bank computers - is operated by user data card carrying personal data, account information and transaction records |
EP0443548B1 (en) | 1990-02-22 | 2003-07-23 | Nec Corporation | Speech coder |
US5301109A (en) | 1990-06-11 | 1994-04-05 | Bell Communications Research, Inc. | Computerized cross-language document retrieval using latent semantic indexing |
JP3266246B2 (ja) | 1990-06-15 | 2002-03-18 | インターナシヨナル・ビジネス・マシーンズ・コーポレーシヨン | 自然言語解析装置及び方法並びに自然言語解析用知識ベース構築方法 |
US5202952A (en) | 1990-06-22 | 1993-04-13 | Dragon Systems, Inc. | Large-vocabulary continuous speech prefiltering and processing system |
GB9017600D0 (en) | 1990-08-10 | 1990-09-26 | British Aerospace | An assembly and method for binary tree-searched vector quanisation data compression processing |
US5309359A (en) | 1990-08-16 | 1994-05-03 | Boris Katz | Method and apparatus for generating and utlizing annotations to facilitate computer text retrieval |
US5404295A (en) | 1990-08-16 | 1995-04-04 | Katz; Boris | Method and apparatus for utilizing annotations to facilitate computer retrieval of database material |
US5297170A (en) | 1990-08-21 | 1994-03-22 | Codex Corporation | Lattice and trellis-coded quantization |
US5400434A (en) | 1990-09-04 | 1995-03-21 | Matsushita Electric Industrial Co., Ltd. | Voice source for synthetic speech system |
US5216747A (en) | 1990-09-20 | 1993-06-01 | Digital Voice Systems, Inc. | Voiced/unvoiced estimation of an acoustic signal |
US5128672A (en) | 1990-10-30 | 1992-07-07 | Apple Computer, Inc. | Dynamic predictive keyboard |
US5317507A (en) | 1990-11-07 | 1994-05-31 | Gallant Stephen I | Method for document retrieval and for word sense disambiguation using neural networks |
US5325298A (en) | 1990-11-07 | 1994-06-28 | Hnc, Inc. | Methods for generating or revising context vectors for a plurality of word stems |
US5247579A (en) | 1990-12-05 | 1993-09-21 | Digital Voice Systems, Inc. | Methods for speech transmission |
US5345536A (en) | 1990-12-21 | 1994-09-06 | Matsushita Electric Industrial Co., Ltd. | Method of speech recognition |
US5127053A (en) | 1990-12-24 | 1992-06-30 | General Electric Company | Low-complexity method for improving the performance of autocorrelation-based pitch detectors |
US5133011A (en) | 1990-12-26 | 1992-07-21 | International Business Machines Corporation | Method and apparatus for linear vocal control of cursor position |
US5268990A (en) | 1991-01-31 | 1993-12-07 | Sri International | Method for recognizing speech using linguistically-motivated hidden Markov models |
GB9105367D0 (en) | 1991-03-13 | 1991-04-24 | Univ Strathclyde | Computerised information-retrieval database systems |
US5303406A (en) | 1991-04-29 | 1994-04-12 | Motorola, Inc. | Noise squelch circuit with adaptive noise shaping |
US5475587A (en) | 1991-06-28 | 1995-12-12 | Digital Equipment Corporation | Method and apparatus for efficient morphological text analysis using a high-level language for compact specification of inflectional paradigms |
US5293452A (en) | 1991-07-01 | 1994-03-08 | Texas Instruments Incorporated | Voice log-in using spoken name input |
US5687077A (en) | 1991-07-31 | 1997-11-11 | Universal Dynamics Limited | Method and apparatus for adaptive control |
US5199077A (en) | 1991-09-19 | 1993-03-30 | Xerox Corporation | Wordspotting for voice editing and indexing |
JP2662120B2 (ja) | 1991-10-01 | 1997-10-08 | インターナショナル・ビジネス・マシーンズ・コーポレイション | 音声認識装置および音声認識用処理ユニット |
US5222146A (en) | 1991-10-23 | 1993-06-22 | International Business Machines Corporation | Speech recognition apparatus having a speech coder outputting acoustic prototype ranks |
KR940002854B1 (ko) | 1991-11-06 | 1994-04-04 | 한국전기통신공사 | 음성 합성시스팀의 음성단편 코딩 및 그의 피치조절 방법과 그의 유성음 합성장치 |
US5386494A (en) | 1991-12-06 | 1995-01-31 | Apple Computer, Inc. | Method and apparatus for controlling a speech recognition function using a cursor control device |
US6081750A (en) | 1991-12-23 | 2000-06-27 | Hoffberg; Steven Mark | Ergonomic man-machine interface incorporating adaptive pattern recognition based control system |
US5903454A (en) | 1991-12-23 | 1999-05-11 | Hoffberg; Linda Irene | Human-factored interface corporating adaptive pattern recognition based controller apparatus |
US5502790A (en) | 1991-12-24 | 1996-03-26 | Oki Electric Industry Co., Ltd. | Speech recognition method and system using triphones, diphones, and phonemes |
US5349645A (en) | 1991-12-31 | 1994-09-20 | Matsushita Electric Industrial Co., Ltd. | Word hypothesizer for continuous speech decoding using stressed-vowel centered bidirectional tree searches |
US5267345A (en) | 1992-02-10 | 1993-11-30 | International Business Machines Corporation | Speech recognition apparatus which predicts word classes from context and words from word classes |
DE69322894T2 (de) | 1992-03-02 | 1999-07-29 | At & T Corp., New York, N.Y. | Lernverfahren und Gerät zur Spracherkennung |
US6055514A (en) | 1992-03-20 | 2000-04-25 | Wren; Stephen Corey | System for marketing foods and services utilizing computerized centraland remote facilities |
US5317647A (en) | 1992-04-07 | 1994-05-31 | Apple Computer, Inc. | Constrained attribute grammars for syntactic pattern recognition |
US5412804A (en) | 1992-04-30 | 1995-05-02 | Oracle Corporation | Extending the semantics of the outer join operator for un-nesting queries to a data base |
JPH07506908A (ja) | 1992-05-20 | 1995-07-27 | インダストリアル リサーチ リミテッド | 広帯域残響支援システム |
US5293584A (en) | 1992-05-21 | 1994-03-08 | International Business Machines Corporation | Speech recognition system for natural language translation |
US5434777A (en) | 1992-05-27 | 1995-07-18 | Apple Computer, Inc. | Method and apparatus for processing natural language |
US5390281A (en) | 1992-05-27 | 1995-02-14 | Apple Computer, Inc. | Method and apparatus for deducing user intent and providing computer implemented services |
US5734789A (en) | 1992-06-01 | 1998-03-31 | Hughes Electronics | Voiced, unvoiced or noise modes in a CELP vocoder |
US5333275A (en) | 1992-06-23 | 1994-07-26 | Wheatley Barbara J | System and method for time aligning speech |
US5325297A (en) | 1992-06-25 | 1994-06-28 | System Of Multiple-Colored Images For Internationally Listed Estates, Inc. | Computer implemented method and system for storing and retrieving textual data and compressed image data |
US5999908A (en) | 1992-08-06 | 1999-12-07 | Abelow; Daniel H. | Customer-based product design module |
US5412806A (en) | 1992-08-20 | 1995-05-02 | Hewlett-Packard Company | Calibration of logical cost formulae for queries in a heterogeneous DBMS using synthetic database |
GB9220404D0 (en) | 1992-08-20 | 1992-11-11 | Nat Security Agency | Method of identifying,retrieving and sorting documents |
US5333236A (en) | 1992-09-10 | 1994-07-26 | International Business Machines Corporation | Speech recognizer having a speech coder for an acoustic match based on context-dependent speech-transition acoustic models |
US5384893A (en) | 1992-09-23 | 1995-01-24 | Emerson & Stern Associates, Inc. | Method and apparatus for speech synthesis based on prosodic analysis |
FR2696036B1 (fr) | 1992-09-24 | 1994-10-14 | France Telecom | Procédé de mesure de ressemblance entre échantillons sonores et dispositif de mise en Óoeuvre de ce procédé. |
JPH0772840B2 (ja) | 1992-09-29 | 1995-08-02 | 日本アイ・ビー・エム株式会社 | 音声モデルの構成方法、音声認識方法、音声認識装置及び音声モデルの訓練方法 |
US5758313A (en) | 1992-10-16 | 1998-05-26 | Mobile Information Systems, Inc. | Method and apparatus for tracking vehicle location |
US5455888A (en) | 1992-12-04 | 1995-10-03 | Northern Telecom Limited | Speech bandwidth extension method and apparatus |
US5412756A (en) | 1992-12-22 | 1995-05-02 | Mitsubishi Denki Kabushiki Kaisha | Artificial intelligence software shell for plant operation simulation |
US5734791A (en) | 1992-12-31 | 1998-03-31 | Apple Computer, Inc. | Rapid tree-based method for vector quantization |
US5390279A (en) | 1992-12-31 | 1995-02-14 | Apple Computer, Inc. | Partitioning speech rules by context for speech recognition |
US5384892A (en) | 1992-12-31 | 1995-01-24 | Apple Computer, Inc. | Dynamic language model for speech recognition |
US5613036A (en) | 1992-12-31 | 1997-03-18 | Apple Computer, Inc. | Dynamic categories for a speech recognition system |
US6122616A (en) | 1993-01-21 | 2000-09-19 | Apple Computer, Inc. | Method and apparatus for diphone aliasing |
US5864844A (en) | 1993-02-18 | 1999-01-26 | Apple Computer, Inc. | System and method for enhancing a user interface with a computer based training tool |
CA2091658A1 (en) | 1993-03-15 | 1994-09-16 | Matthew Lennig | Method and apparatus for automation of directory assistance using speech recognition |
US6055531A (en) | 1993-03-24 | 2000-04-25 | Engate Incorporated | Down-line transcription system having context sensitive searching capability |
US5536902A (en) | 1993-04-14 | 1996-07-16 | Yamaha Corporation | Method of and apparatus for analyzing and synthesizing a sound by extracting and controlling a sound parameter |
US5444823A (en) | 1993-04-16 | 1995-08-22 | Compaq Computer Corporation | Intelligent search engine for associated on-line documentation having questionless case-based knowledge base |
US5574823A (en) | 1993-06-23 | 1996-11-12 | Her Majesty The Queen In Right Of Canada As Represented By The Minister Of Communications | Frequency selective harmonic coding |
US5515475A (en) | 1993-06-24 | 1996-05-07 | Northern Telecom Limited | Speech recognition method using a two-pass search |
JPH0756933A (ja) | 1993-06-24 | 1995-03-03 | Xerox Corp | 文書検索方法 |
JP3685812B2 (ja) | 1993-06-29 | 2005-08-24 | ソニー株式会社 | 音声信号送受信装置 |
US5794207A (en) | 1996-09-04 | 1998-08-11 | Walker Asset Management Limited Partnership | Method and apparatus for a cryptographically assisted commercial network system designed to facilitate buyer-driven conditional purchase offers |
US5495604A (en) | 1993-08-25 | 1996-02-27 | Asymetrix Corporation | Method and apparatus for the modeling and query of database structures using natural language-like constructs |
US5619694A (en) | 1993-08-26 | 1997-04-08 | Nec Corporation | Case database storage/retrieval system |
US5940811A (en) | 1993-08-27 | 1999-08-17 | Affinity Technology Group, Inc. | Closed loop financial transaction method and apparatus |
US5377258A (en) | 1993-08-30 | 1994-12-27 | National Medical Research Council | Method and apparatus for an automated and interactive behavioral guidance system |
US5873056A (en) | 1993-10-12 | 1999-02-16 | The Syracuse University | Natural language processing system for semantic vector representation which accounts for lexical ambiguity |
US5578808A (en) | 1993-12-22 | 1996-11-26 | Datamark Services, Inc. | Data card that can be used for transactions involving separate card issuers |
WO1995017711A1 (en) | 1993-12-23 | 1995-06-29 | Diacom Technologies, Inc. | Method and apparatus for implementing user feedback |
US5621859A (en) | 1994-01-19 | 1997-04-15 | Bbn Corporation | Single tree method for grammar directed, very large vocabulary speech recognizer |
US5584024A (en) | 1994-03-24 | 1996-12-10 | Software Ag | Interactive database query system and method for prohibiting the selection of semantically incorrect query parameters |
US5642519A (en) | 1994-04-29 | 1997-06-24 | Sun Microsystems, Inc. | Speech interpreter with a unified grammer compiler |
KR100250509B1 (ko) | 1994-05-25 | 2000-04-01 | 슈즈이 다께오 | 가변 전송속도 데이터 전송장치 |
US5493677A (en) | 1994-06-08 | 1996-02-20 | Systems Research & Applications Corporation | Generation, archiving, and retrieval of digital images with evoked suggestion-set captions and natural language interface |
US5675819A (en) | 1994-06-16 | 1997-10-07 | Xerox Corporation | Document information retrieval using global word co-occurrence patterns |
JPH0869470A (ja) | 1994-06-21 | 1996-03-12 | Canon Inc | 自然言語処理装置及びその方法 |
US5948040A (en) | 1994-06-24 | 1999-09-07 | Delorme Publishing Co. | Travel reservation information and planning system |
US5682539A (en) | 1994-09-29 | 1997-10-28 | Conrad; Donovan | Anticipated meaning natural language interface |
GB2293667B (en) | 1994-09-30 | 1998-05-27 | Intermation Limited | Database management system |
US5715468A (en) | 1994-09-30 | 1998-02-03 | Budzinski; Robert Lucius | Memory system for storing and retrieving experience and knowledge with natural language |
US5845255A (en) | 1994-10-28 | 1998-12-01 | Advanced Health Med-E-Systems Corporation | Prescription management system |
US5577241A (en) | 1994-12-07 | 1996-11-19 | Excite, Inc. | Information retrieval system and method with implementation extensible query architecture |
US5748974A (en) | 1994-12-13 | 1998-05-05 | International Business Machines Corporation | Multimodal natural language interface for cross-application tasks |
US5794050A (en) | 1995-01-04 | 1998-08-11 | Intelligent Text Processing, Inc. | Natural language understanding system |
CN1912885B (zh) | 1995-02-13 | 2010-12-22 | 英特特拉斯特技术公司 | 用于安全交易管理和电子权利保护的系统和方法 |
US5701400A (en) | 1995-03-08 | 1997-12-23 | Amado; Carlos Armando | Method and apparatus for applying if-then-else rules to data sets in a relational data base and generating from the results of application of said rules a database of diagnostics linked to said data sets to aid executive analysis of financial data |
US5749081A (en) | 1995-04-06 | 1998-05-05 | Firefly Network, Inc. | System and method for recommending items to a user |
US5642464A (en) | 1995-05-03 | 1997-06-24 | Northern Telecom Limited | Methods and apparatus for noise conditioning in digital speech compression systems using linear predictive coding |
US5664055A (en) | 1995-06-07 | 1997-09-02 | Lucent Technologies Inc. | CS-ACELP speech compression system with adaptive pitch prediction filter gain based on a measure of periodicity |
US5710886A (en) | 1995-06-16 | 1998-01-20 | Sellectsoft, L.C. | Electric couponing method and apparatus |
JP3284832B2 (ja) | 1995-06-22 | 2002-05-20 | セイコーエプソン株式会社 | 音声認識対話処理方法および音声認識対話装置 |
US6038533A (en) | 1995-07-07 | 2000-03-14 | Lucent Technologies Inc. | System and method for selecting training text |
US6026388A (en) | 1995-08-16 | 2000-02-15 | Textwise, Llc | User interface and other enhancements for natural language information retrieval system and method |
JP3697748B2 (ja) | 1995-08-21 | 2005-09-21 | セイコーエプソン株式会社 | 端末、音声認識装置 |
US5712957A (en) | 1995-09-08 | 1998-01-27 | Carnegie Mellon University | Locating and correcting erroneously recognized portions of utterances by rescoring based on two n-best lists |
US5790978A (en) | 1995-09-15 | 1998-08-04 | Lucent Technologies, Inc. | System and method for determining pitch contours |
US6173261B1 (en) | 1998-09-30 | 2001-01-09 | At&T Corp | Grammar fragment acquisition using syntactic and semantic clustering |
US5737734A (en) | 1995-09-15 | 1998-04-07 | Infonautics Corporation | Query word relevance adjustment in a search of an information retrieval system |
US5884323A (en) | 1995-10-13 | 1999-03-16 | 3Com Corporation | Extendible method and apparatus for synchronizing files on two different computer systems |
US5799276A (en) | 1995-11-07 | 1998-08-25 | Accent Incorporated | Knowledge-based speech recognition system and methods having frame length computed based upon estimated pitch period of vocalic intervals |
US5794237A (en) | 1995-11-13 | 1998-08-11 | International Business Machines Corporation | System and method for improving problem source identification in computer systems employing relevance feedback and statistical source ranking |
US5706442A (en) | 1995-12-20 | 1998-01-06 | Block Financial Corporation | System for on-line financial services using distributed objects |
US6119101A (en) | 1996-01-17 | 2000-09-12 | Personal Agents, Inc. | Intelligent agents for electronic commerce |
US6125356A (en) | 1996-01-18 | 2000-09-26 | Rosefaire Development, Ltd. | Portable sales presentation system with selective scripted seller prompts |
US5987404A (en) | 1996-01-29 | 1999-11-16 | International Business Machines Corporation | Statistical natural language understanding using hidden clumpings |
US5729694A (en) | 1996-02-06 | 1998-03-17 | The Regents Of The University Of California | Speech coding, reconstruction and recognition using acoustics and electromagnetic waves |
US6076088A (en) | 1996-02-09 | 2000-06-13 | Paik; Woojin | Information extraction system and method using concept relation concept (CRC) triples |
US5835893A (en) | 1996-02-15 | 1998-11-10 | Atr Interpreting Telecommunications Research Labs | Class-based word clustering for speech recognition using a three-level balanced hierarchical similarity |
US5901287A (en) | 1996-04-01 | 1999-05-04 | The Sabre Group Inc. | Information aggregation and synthesization system |
US5867799A (en) | 1996-04-04 | 1999-02-02 | Lang; Andrew K. | Information system and method for filtering a massive flow of information entities to meet user information classification needs |
US5987140A (en) | 1996-04-26 | 1999-11-16 | Verifone, Inc. | System, method and article of manufacture for secure network electronic payment and credit collection |
US5963924A (en) | 1996-04-26 | 1999-10-05 | Verifone, Inc. | System, method and article of manufacture for the use of payment instrument holders and payment instruments in network electronic commerce |
US5913193A (en) | 1996-04-30 | 1999-06-15 | Microsoft Corporation | Method and system of runtime acoustic unit selection for speech synthesis |
US5857184A (en) | 1996-05-03 | 1999-01-05 | Walden Media, Inc. | Language and method for creating, organizing, and retrieving data from a database |
US5828999A (en) | 1996-05-06 | 1998-10-27 | Apple Computer, Inc. | Method and system for deriving a large-span semantic language model for large-vocabulary recognition systems |
FR2748342B1 (fr) | 1996-05-06 | 1998-07-17 | France Telecom | Procede et dispositif de filtrage par egalisation d'un signal de parole, mettant en oeuvre un modele statistique de ce signal |
US5826261A (en) | 1996-05-10 | 1998-10-20 | Spencer; Graham | System and method for querying multiple, distributed databases by selective sharing of local relative significance information for terms related to the query |
US6366883B1 (en) | 1996-05-15 | 2002-04-02 | Atr Interpreting Telecommunications | Concatenation of speech segments by use of a speech synthesizer |
US5727950A (en) | 1996-05-22 | 1998-03-17 | Netsage Corporation | Agent based instruction system and method |
US5966533A (en) | 1996-06-11 | 1999-10-12 | Excite, Inc. | Method and system for dynamically synthesizing a computer program by differentially resolving atoms based on user context data |
US5915249A (en) | 1996-06-14 | 1999-06-22 | Excite, Inc. | System and method for accelerated query evaluation of very large full-text databases |
US5987132A (en) | 1996-06-17 | 1999-11-16 | Verifone, Inc. | System, method and article of manufacture for conditionally accepting a payment method utilizing an extensible, flexible architecture |
US5825881A (en) | 1996-06-28 | 1998-10-20 | Allsoft Distributing Inc. | Public network merchandising system |
US6070147A (en) | 1996-07-02 | 2000-05-30 | Tecmark Services, Inc. | Customer identification and marketing analysis systems |
EP0912954B8 (en) | 1996-07-22 | 2006-06-14 | Cyva Research Corporation | Personal information security and exchange tool |
EP0829811A1 (en) | 1996-09-11 | 1998-03-18 | Nippon Telegraph And Telephone Corporation | Method and system for information retrieval |
US6181935B1 (en) | 1996-09-27 | 2001-01-30 | Software.Com, Inc. | Mobility extended telephone application programming interface and method of use |
US5794182A (en) | 1996-09-30 | 1998-08-11 | Apple Computer, Inc. | Linear predictive speech encoding systems with efficient combination pitch coefficients computation |
US5721827A (en) | 1996-10-02 | 1998-02-24 | James Logan | System for electrically distributing personalized information |
US5913203A (en) | 1996-10-03 | 1999-06-15 | Jaesent Inc. | System and method for pseudo cash transactions |
US5930769A (en) | 1996-10-07 | 1999-07-27 | Rose; Andrea | System and method for fashion shopping |
US5836771A (en) | 1996-12-02 | 1998-11-17 | Ho; Chi Fai | Learning method and system based on questioning |
US6665639B2 (en) | 1996-12-06 | 2003-12-16 | Sensory, Inc. | Speech recognition in consumer electronic products |
US6078914A (en) | 1996-12-09 | 2000-06-20 | Open Text Corporation | Natural language meta-search system and method |
US5839106A (en) | 1996-12-17 | 1998-11-17 | Apple Computer, Inc. | Large-vocabulary speech recognition using an integrated syntactic and semantic statistical language model |
US5966126A (en) | 1996-12-23 | 1999-10-12 | Szabo; Andrew J. | Graphic user interface for database system |
US5932869A (en) | 1996-12-27 | 1999-08-03 | Graphic Technology, Inc. | Promotional system with magnetic stripe and visual thermo-reversible print surfaced medium |
JP3579204B2 (ja) | 1997-01-17 | 2004-10-20 | 富士通株式会社 | 文書要約装置およびその方法 |
US5941944A (en) | 1997-03-03 | 1999-08-24 | Microsoft Corporation | Method for providing a substitute for a requested inaccessible object by identifying substantially similar objects using weights corresponding to object features |
US6076051A (en) | 1997-03-07 | 2000-06-13 | Microsoft Corporation | Information retrieval utilizing semantic representation of text |
US5930801A (en) | 1997-03-07 | 1999-07-27 | Xerox Corporation | Shared-data environment in which each file has independent security properties |
US5822743A (en) | 1997-04-08 | 1998-10-13 | 1215627 Ontario Inc. | Knowledge-based information retrieval system |
US5970474A (en) | 1997-04-24 | 1999-10-19 | Sears, Roebuck And Co. | Registry information system for shoppers |
US5895464A (en) | 1997-04-30 | 1999-04-20 | Eastman Kodak Company | Computer program product and a method for using natural language for the description, search and retrieval of multi-media objects |
US5860063A (en) | 1997-07-11 | 1999-01-12 | At&T Corp | Automated meaningful phrase clustering |
US5933822A (en) | 1997-07-22 | 1999-08-03 | Microsoft Corporation | Apparatus and methods for an information retrieval system that employs natural language processing of search results to improve overall precision |
US5974146A (en) | 1997-07-30 | 1999-10-26 | Huntington Bancshares Incorporated | Real time bank-centric universal payment system |
US5895466A (en) | 1997-08-19 | 1999-04-20 | At&T Corp | Automated natural language understanding customer service system |
US6081774A (en) | 1997-08-22 | 2000-06-27 | Novell, Inc. | Natural language information retrieval system and method |
US6404876B1 (en) | 1997-09-25 | 2002-06-11 | Gte Intelligent Network Services Incorporated | System and method for voice activated dialing and routing under open access network control |
US6023684A (en) | 1997-10-01 | 2000-02-08 | Security First Technologies, Inc. | Three tier financial transaction system with cache memory |
EP0911808B1 (en) | 1997-10-23 | 2002-05-08 | Sony International (Europe) GmbH | Speech interface in a home network environment |
US6108627A (en) | 1997-10-31 | 2000-08-22 | Nortel Networks Corporation | Automatic transcription tool |
US5943670A (en) | 1997-11-21 | 1999-08-24 | International Business Machines Corporation | System and method for categorizing objects in combined categories |
US5960422A (en) | 1997-11-26 | 1999-09-28 | International Business Machines Corporation | System and method for optimized source selection in an information retrieval system |
US6026375A (en) | 1997-12-05 | 2000-02-15 | Nortel Networks Corporation | Method and apparatus for processing orders from customers in a mobile environment |
US6064960A (en) | 1997-12-18 | 2000-05-16 | Apple Computer, Inc. | Method and apparatus for improved duration modeling of phonemes |
US6094649A (en) | 1997-12-22 | 2000-07-25 | Partnet, Inc. | Keyword searches of structured databases |
US6173287B1 (en) | 1998-03-11 | 2001-01-09 | Digital Equipment Corporation | Technique for ranking multimedia annotations of interest |
US6195641B1 (en) | 1998-03-27 | 2001-02-27 | International Business Machines Corp. | Network universal spoken language vocabulary |
US6026393A (en) | 1998-03-31 | 2000-02-15 | Casebank Technologies Inc. | Configuration knowledge as an aid to case retrieval |
US6233559B1 (en) | 1998-04-01 | 2001-05-15 | Motorola, Inc. | Speech control of multiple applications using applets |
US6173279B1 (en) | 1998-04-09 | 2001-01-09 | At&T Corp. | Method of using a natural language interface to retrieve information from one or more data resources |
US6088731A (en) | 1998-04-24 | 2000-07-11 | Associative Computing, Inc. | Intelligent assistant for use with a local computer and with the internet |
US6029132A (en) | 1998-04-30 | 2000-02-22 | Matsushita Electric Industrial Co. | Method for letter-to-sound in text-to-speech synthesis |
US6016471A (en) | 1998-04-29 | 2000-01-18 | Matsushita Electric Industrial Co., Ltd. | Method and apparatus using decision trees to generate and score multiple pronunciations for a spelled word |
US6285786B1 (en) | 1998-04-30 | 2001-09-04 | Motorola, Inc. | Text recognizer and method using non-cumulative character scoring in a forward search |
US6144938A (en) | 1998-05-01 | 2000-11-07 | Sun Microsystems, Inc. | Voice user interface with personality |
US7526466B2 (en) | 1998-05-28 | 2009-04-28 | Qps Tech Limited Liability Company | Method and system for analysis of intended meaning of natural language |
US7711672B2 (en) | 1998-05-28 | 2010-05-04 | Lawrence Au | Semantic network methods to disambiguate natural language meaning |
US6778970B2 (en) | 1998-05-28 | 2004-08-17 | Lawrence Au | Topological methods to organize semantic network data flows for conversational applications |
US6144958A (en) | 1998-07-15 | 2000-11-07 | Amazon.Com, Inc. | System and method for correcting spelling errors in search queries |
US6105865A (en) | 1998-07-17 | 2000-08-22 | Hardesty; Laurence Daniel | Financial transaction system with retirement saving benefit |
US6499013B1 (en) | 1998-09-09 | 2002-12-24 | One Voice Technologies, Inc. | Interactive user interface using speech recognition and natural language processing |
US6434524B1 (en) | 1998-09-09 | 2002-08-13 | One Voice Technologies, Inc. | Object interactive user interface using speech recognition and natural language processing |
DE19841541B4 (de) | 1998-09-11 | 2007-12-06 | Püllen, Rainer | Teilnehmereinheit für einen Multimediadienst |
US6266637B1 (en) | 1998-09-11 | 2001-07-24 | International Business Machines Corporation | Phrase splicing and variable substitution using a trainable speech synthesizer |
US6792082B1 (en) | 1998-09-11 | 2004-09-14 | Comverse Ltd. | Voice mail system with personal assistant provisioning |
US6317831B1 (en) | 1998-09-21 | 2001-11-13 | Openwave Systems Inc. | Method and apparatus for establishing a secure connection over a one-way data path |
WO2000021232A2 (en) | 1998-10-02 | 2000-04-13 | International Business Machines Corporation | Conversational browser and conversational systems |
US6275824B1 (en) | 1998-10-02 | 2001-08-14 | Ncr Corporation | System and method for managing data privacy in a database management system |
GB9821969D0 (en) | 1998-10-08 | 1998-12-02 | Canon Kk | Apparatus and method for processing natural language |
US6928614B1 (en) | 1998-10-13 | 2005-08-09 | Visteon Global Technologies, Inc. | Mobile office with speech recognition |
US6453292B2 (en) | 1998-10-28 | 2002-09-17 | International Business Machines Corporation | Command boundary identifier for conversational natural language |
US6208971B1 (en) | 1998-10-30 | 2001-03-27 | Apple Computer, Inc. | Method and apparatus for command recognition using data-driven semantic inference |
US6321092B1 (en) | 1998-11-03 | 2001-11-20 | Signal Soft Corporation | Multiple input data management for wireless location-based applications |
US6446076B1 (en) | 1998-11-12 | 2002-09-03 | Accenture Llp. | Voice interactive web-based agent system responsive to a user location for prioritizing and formatting information |
AU772874B2 (en) | 1998-11-13 | 2004-05-13 | Scansoft, Inc. | Speech synthesis using concatenation of speech waveforms |
US6606599B2 (en) | 1998-12-23 | 2003-08-12 | Interactive Speech Technologies, Llc | Method for integrating computing processes with an interface controlled by voice actuated grammars |
US6246981B1 (en) | 1998-11-25 | 2001-06-12 | International Business Machines Corporation | Natural language task-oriented dialog manager and method |
US7082397B2 (en) | 1998-12-01 | 2006-07-25 | Nuance Communications, Inc. | System for and method of creating and browsing a voice web |
US6260024B1 (en) | 1998-12-02 | 2001-07-10 | Gary Shkedy | Method and apparatus for facilitating buyer-driven purchase orders on a commercial network system |
US7881936B2 (en) | 1998-12-04 | 2011-02-01 | Tegic Communications, Inc. | Multimodal disambiguation of speech recognition |
US6317707B1 (en) | 1998-12-07 | 2001-11-13 | At&T Corp. | Automatic clustering of tokens from a corpus for grammar acquisition |
US6308149B1 (en) | 1998-12-16 | 2001-10-23 | Xerox Corporation | Grouping words with equivalent substrings by automatic clustering based on suffix relationships |
US6523172B1 (en) | 1998-12-17 | 2003-02-18 | Evolutionary Technologies International, Inc. | Parser translator system and method |
US6460029B1 (en) | 1998-12-23 | 2002-10-01 | Microsoft Corporation | System for improving search text |
US6742021B1 (en) | 1999-01-05 | 2004-05-25 | Sri International, Inc. | Navigating network-based electronic information using spoken input with multimodal error feedback |
US6757718B1 (en) | 1999-01-05 | 2004-06-29 | Sri International | Mobile navigation of network-based electronic information using spoken input |
US6851115B1 (en) | 1999-01-05 | 2005-02-01 | Sri International | Software-based architecture for communication and cooperation among distributed electronic agents |
US6513063B1 (en) | 1999-01-05 | 2003-01-28 | Sri International | Accessing network-based electronic information through scripted online interfaces using spoken input |
US7036128B1 (en) | 1999-01-05 | 2006-04-25 | Sri International Offices | Using a community of distributed electronic agents to support a highly mobile, ambient computing environment |
US6523061B1 (en) | 1999-01-05 | 2003-02-18 | Sri International, Inc. | System, method, and article of manufacture for agent-based navigation in a speech-based data navigation system |
US7152070B1 (en) | 1999-01-08 | 2006-12-19 | The Regents Of The University Of California | System and method for integrating and accessing multiple data sources within a data warehouse architecture |
US6505183B1 (en) | 1999-02-04 | 2003-01-07 | Authoria, Inc. | Human resource knowledge modeling and delivery system |
US6317718B1 (en) | 1999-02-26 | 2001-11-13 | Accenture Properties (2) B.V. | System, method and article of manufacture for location-based filtering for shopping agent in the physical world |
GB9904662D0 (en) | 1999-03-01 | 1999-04-21 | Canon Kk | Natural language search method and apparatus |
US6356905B1 (en) | 1999-03-05 | 2002-03-12 | Accenture Llp | System, method and article of manufacture for mobile communication utilizing an interface support framework |
US6928404B1 (en) | 1999-03-17 | 2005-08-09 | International Business Machines Corporation | System and methods for acoustic and language modeling for automatic speech recognition with large vocabularies |
US6584464B1 (en) | 1999-03-19 | 2003-06-24 | Ask Jeeves, Inc. | Grammar template query system |
WO2000058942A2 (en) | 1999-03-26 | 2000-10-05 | Koninklijke Philips Electronics N.V. | Client-server speech recognition |
US6356854B1 (en) | 1999-04-05 | 2002-03-12 | Delphi Technologies, Inc. | Holographic object position and type sensing system and method |
US6631346B1 (en) | 1999-04-07 | 2003-10-07 | Matsushita Electric Industrial Co., Ltd. | Method and apparatus for natural language parsing using multiple passes and tags |
WO2000060435A2 (en) | 1999-04-07 | 2000-10-12 | Rensselaer Polytechnic Institute | System and method for accessing personal information |
US6647260B2 (en) | 1999-04-09 | 2003-11-11 | Openwave Systems Inc. | Method and system facilitating web based provisioning of two-way mobile communications devices |
US6924828B1 (en) | 1999-04-27 | 2005-08-02 | Surfnotes | Method and apparatus for improved information representation |
US6697780B1 (en) | 1999-04-30 | 2004-02-24 | At&T Corp. | Method and apparatus for rapid acoustic unit selection from a large speech corpus |
EP1224569A4 (en) | 1999-05-28 | 2005-08-10 | Sehda Inc | PHRASE BASED DIALOGUE MODELING WITH SPECIAL APPLICATION FOR GENERATING RECOGNITION GRAMMARK FOR LANGUAGE-CONTROLLED USER INTERFACE |
US20020032564A1 (en) | 2000-04-19 | 2002-03-14 | Farzad Ehsani | Phrase-based dialogue modeling with particular application to creating a recognition grammar for a voice-controlled user interface |
US6931384B1 (en) | 1999-06-04 | 2005-08-16 | Microsoft Corporation | System and method providing utility-based decision making about clarification dialog given communicative uncertainty |
US6598039B1 (en) | 1999-06-08 | 2003-07-22 | Albert-Inc. S.A. | Natural language interface for searching database |
US7093693B1 (en) | 1999-06-10 | 2006-08-22 | Gazdzinski Robert F | Elevator access control system and method |
US8065155B1 (en) | 1999-06-10 | 2011-11-22 | Gazdzinski Robert F | Adaptive advertising apparatus and methods |
US6615175B1 (en) | 1999-06-10 | 2003-09-02 | Robert F. Gazdzinski | “Smart” elevator system and method |
US7711565B1 (en) | 1999-06-10 | 2010-05-04 | Gazdzinski Robert F | “Smart” elevator system and method |
US6711585B1 (en) | 1999-06-15 | 2004-03-23 | Kanisa Inc. | System and method for implementing a knowledge management system |
JP3361291B2 (ja) | 1999-07-23 | 2003-01-07 | コナミ株式会社 | 音声合成方法、音声合成装置及び音声合成プログラムを記録したコンピュータ読み取り可能な媒体 |
US6421672B1 (en) | 1999-07-27 | 2002-07-16 | Verizon Services Corp. | Apparatus for and method of disambiguation of directory listing searches utilizing multiple selectable secondary search keys |
US7451177B1 (en) | 1999-08-12 | 2008-11-11 | Avintaquin Capital, Llc | System for and method of implementing a closed loop response architecture for electronic commerce |
EP1079387A3 (en) | 1999-08-26 | 2003-07-09 | Matsushita Electric Industrial Co., Ltd. | Mechanism for storing information about recorded television broadcasts |
US6697824B1 (en) | 1999-08-31 | 2004-02-24 | Accenture Llp | Relationship management in an E-commerce application framework |
US6601234B1 (en) | 1999-08-31 | 2003-07-29 | Accenture Llp | Attribute dictionary in a business logic services environment |
US6912499B1 (en) | 1999-08-31 | 2005-06-28 | Nortel Networks Limited | Method and apparatus for training a multilingual speech model set |
US7127403B1 (en) | 1999-09-13 | 2006-10-24 | Microstrategy, Inc. | System and method for personalizing an interactive voice broadcast of a voice service based on particulars of a request |
US6601026B2 (en) | 1999-09-17 | 2003-07-29 | Discern Communications, Inc. | Information retrieval by natural language querying |
US6505175B1 (en) | 1999-10-06 | 2003-01-07 | Goldman, Sachs & Co. | Order centric tracking system |
US6625583B1 (en) | 1999-10-06 | 2003-09-23 | Goldman, Sachs & Co. | Handheld trading system interface |
US7020685B1 (en) | 1999-10-08 | 2006-03-28 | Openwave Systems Inc. | Method and apparatus for providing internet content to SMS-based wireless devices |
US7447635B1 (en) | 1999-10-19 | 2008-11-04 | Sony Corporation | Natural language interface control system |
US6807574B1 (en) | 1999-10-22 | 2004-10-19 | Tellme Networks, Inc. | Method and apparatus for content personalization over a telephone interface |
JP2001125896A (ja) | 1999-10-26 | 2001-05-11 | Victor Co Of Japan Ltd | 自然言語対話システム |
US7310600B1 (en) | 1999-10-28 | 2007-12-18 | Canon Kabushiki Kaisha | Language recognition using a similarity measure |
US6615172B1 (en) | 1999-11-12 | 2003-09-02 | Phoenix Solutions, Inc. | Intelligent query engine for processing voice based queries |
US7050977B1 (en) | 1999-11-12 | 2006-05-23 | Phoenix Solutions, Inc. | Speech-enabled server for internet website and method |
US6665640B1 (en) | 1999-11-12 | 2003-12-16 | Phoenix Solutions, Inc. | Interactive speech based learning/training system formulating search queries based on natural language parsing of recognized user queries |
US7725307B2 (en) | 1999-11-12 | 2010-05-25 | Phoenix Solutions, Inc. | Query engine for processing voice based queries including semantic decoding |
US7392185B2 (en) | 1999-11-12 | 2008-06-24 | Phoenix Solutions, Inc. | Speech based learning/training system using semantic decoding |
US9076448B2 (en) | 1999-11-12 | 2015-07-07 | Nuance Communications, Inc. | Distributed real time speech recognition system |
US6633846B1 (en) | 1999-11-12 | 2003-10-14 | Phoenix Solutions, Inc. | Distributed realtime speech recognition system |
US6532446B1 (en) | 1999-11-24 | 2003-03-11 | Openwave Systems Inc. | Server based speech recognition user interface for wireless devices |
US6526382B1 (en) | 1999-12-07 | 2003-02-25 | Comverse, Inc. | Language-oriented user interfaces for voice activated services |
US6526395B1 (en) | 1999-12-31 | 2003-02-25 | Intel Corporation | Application of personality models and interaction with synthetic characters in a computing system |
US6556983B1 (en) | 2000-01-12 | 2003-04-29 | Microsoft Corporation | Methods and apparatus for finding semantic information, such as usage logs, similar to a query using a pattern lattice data space |
US6546388B1 (en) | 2000-01-14 | 2003-04-08 | International Business Machines Corporation | Metadata search results ranking system |
US6701294B1 (en) | 2000-01-19 | 2004-03-02 | Lucent Technologies, Inc. | User interface for translating natural language inquiries into database queries and data presentations |
US6829603B1 (en) | 2000-02-02 | 2004-12-07 | International Business Machines Corp. | System, method and program product for interactive natural dialog |
US6895558B1 (en) | 2000-02-11 | 2005-05-17 | Microsoft Corporation | Multi-access mode electronic personal assistant |
US6640098B1 (en) | 2000-02-14 | 2003-10-28 | Action Engine Corporation | System for obtaining service-related information for local interactive wireless devices |
US6847979B2 (en) | 2000-02-25 | 2005-01-25 | Synquiry Technologies, Ltd | Conceptual factoring and unification of graphs representing semantic models |
US6449620B1 (en) | 2000-03-02 | 2002-09-10 | Nimble Technology, Inc. | Method and apparatus for generating information pages using semi-structured data stored in a structured manner |
US6895380B2 (en) | 2000-03-02 | 2005-05-17 | Electro Standards Laboratories | Voice actuation with contextual learning for intelligent machine control |
US6757362B1 (en) | 2000-03-06 | 2004-06-29 | Avaya Technology Corp. | Personal virtual assistant |
US6466654B1 (en) | 2000-03-06 | 2002-10-15 | Avaya Technology Corp. | Personal virtual assistant with semantic tagging |
EP1275042A2 (en) | 2000-03-06 | 2003-01-15 | Kanisa Inc. | A system and method for providing an intelligent multi-step dialog with a user |
US6477488B1 (en) | 2000-03-10 | 2002-11-05 | Apple Computer, Inc. | Method for dynamic context scope selection in hybrid n-gram+LSA language modeling |
US6615220B1 (en) | 2000-03-14 | 2003-09-02 | Oracle International Corporation | Method and mechanism for data consolidation |
US6510417B1 (en) | 2000-03-21 | 2003-01-21 | America Online, Inc. | System and method for voice access to internet-based information |
GB2366009B (en) | 2000-03-22 | 2004-07-21 | Canon Kk | Natural language machine interface |
JP3728172B2 (ja) | 2000-03-31 | 2005-12-21 | キヤノン株式会社 | 音声合成方法および装置 |
US7177798B2 (en) | 2000-04-07 | 2007-02-13 | Rensselaer Polytechnic Institute | Natural language interface using constrained intermediate dictionary of results |
US6810379B1 (en) | 2000-04-24 | 2004-10-26 | Sensory, Inc. | Client/server architecture for text-to-speech synthesis |
US6684187B1 (en) | 2000-06-30 | 2004-01-27 | At&T Corp. | Method and system for preselection of suitable units for concatenative speech |
US6691111B2 (en) | 2000-06-30 | 2004-02-10 | Research In Motion Limited | System and method for implementing a natural language user interface |
US6505158B1 (en) | 2000-07-05 | 2003-01-07 | At&T Corp. | Synthesis-based pre-selection of suitable units for concatenative speech |
JP3949356B2 (ja) | 2000-07-12 | 2007-07-25 | 三菱電機株式会社 | 音声対話システム |
US7143040B2 (en) * | 2000-07-20 | 2006-11-28 | British Telecommunications Public Limited Company | Interactive dialogues |
US7139709B2 (en) | 2000-07-20 | 2006-11-21 | Microsoft Corporation | Middleware layer between speech related applications and engines |
US20060143007A1 (en) | 2000-07-24 | 2006-06-29 | Koh V E | User interaction with voice information services |
JP2002041276A (ja) | 2000-07-24 | 2002-02-08 | Sony Corp | 対話型操作支援システム及び対話型操作支援方法、並びに記憶媒体 |
US7092928B1 (en) | 2000-07-31 | 2006-08-15 | Quantum Leap Research, Inc. | Intelligent portal engine |
US6778951B1 (en) | 2000-08-09 | 2004-08-17 | Concerto Software, Inc. | Information retrieval method with natural language interface |
US6766320B1 (en) | 2000-08-24 | 2004-07-20 | Microsoft Corporation | Search engine with natural language-based robust parsing for user query and relevance feedback learning |
DE10042944C2 (de) | 2000-08-31 | 2003-03-13 | Siemens Ag | Graphem-Phonem-Konvertierung |
AU2001290882A1 (en) | 2000-09-15 | 2002-03-26 | Lernout And Hauspie Speech Products N.V. | Fast waveform synchronization for concatenation and time-scale modification of speech |
US7216080B2 (en) | 2000-09-29 | 2007-05-08 | Mindfabric Holdings Llc | Natural-language voice-activated personal assistant |
US6832194B1 (en) | 2000-10-26 | 2004-12-14 | Sensory, Incorporated | Audio recognition peripheral system |
US7027974B1 (en) | 2000-10-27 | 2006-04-11 | Science Applications International Corporation | Ontology-based parser for natural language processing |
US7006969B2 (en) | 2000-11-02 | 2006-02-28 | At&T Corp. | System and method of pattern recognition in very high-dimensional space |
WO2002050816A1 (en) | 2000-12-18 | 2002-06-27 | Koninklijke Philips Electronics N.V. | Store speech, select vocabulary to recognize word |
US6937986B2 (en) | 2000-12-28 | 2005-08-30 | Comverse, Inc. | Automatic dynamic speech recognition vocabulary based on external sources of information |
AU2001255568A1 (en) | 2000-12-29 | 2002-07-16 | General Electric Company | Method and system for identifying repeatedly malfunctioning equipment |
US7257537B2 (en) | 2001-01-12 | 2007-08-14 | International Business Machines Corporation | Method and apparatus for performing dialog management in a computer conversational interface |
US6964023B2 (en) | 2001-02-05 | 2005-11-08 | International Business Machines Corporation | System and method for multi-modal focus detection, referential ambiguity resolution and mood classification using multi-modal input |
US7290039B1 (en) | 2001-02-27 | 2007-10-30 | Microsoft Corporation | Intent based processing |
US6721728B2 (en) | 2001-03-02 | 2004-04-13 | The United States Of America As Represented By The Administrator Of The National Aeronautics And Space Administration | System, method and apparatus for discovering phrases in a database |
EP1490790A2 (en) | 2001-03-13 | 2004-12-29 | Intelligate Ltd. | Dynamic natural language understanding |
US6996531B2 (en) | 2001-03-30 | 2006-02-07 | Comverse Ltd. | Automated database assistance using a telephone for a speech based or text based multimedia communication mode |
US6654740B2 (en) | 2001-05-08 | 2003-11-25 | Sunflare Co., Ltd. | Probabilistic information retrieval based on differential latent semantic space |
US7085722B2 (en) | 2001-05-14 | 2006-08-01 | Sony Computer Entertainment America Inc. | System and method for menu-driven voice control of characters in a game environment |
US6775358B1 (en) * | 2001-05-17 | 2004-08-10 | Oracle Cable, Inc. | Method and system for enhanced interactive playback of audio content to telephone callers |
US6944594B2 (en) | 2001-05-30 | 2005-09-13 | Bellsouth Intellectual Property Corporation | Multi-context conversational environment system and method |
US20020194003A1 (en) | 2001-06-05 | 2002-12-19 | Mozer Todd F. | Client-server security system and method |
US20020198714A1 (en) | 2001-06-26 | 2002-12-26 | Guojun Zhou | Statistical spoken dialog system |
US7139722B2 (en) | 2001-06-27 | 2006-11-21 | Bellsouth Intellectual Property Corporation | Location and time sensitive wireless calendaring |
US6604059B2 (en) | 2001-07-10 | 2003-08-05 | Koninklijke Philips Electronics N.V. | Predictive calendar |
US7987151B2 (en) | 2001-08-10 | 2011-07-26 | General Dynamics Advanced Info Systems, Inc. | Apparatus and method for problem solving using intelligent agents |
US6813491B1 (en) | 2001-08-31 | 2004-11-02 | Openwave Systems Inc. | Method and apparatus for adapting settings of wireless communication devices in accordance with user proximity |
US7403938B2 (en) | 2001-09-24 | 2008-07-22 | Iac Search & Media, Inc. | Natural language query processing |
US20050196732A1 (en) | 2001-09-26 | 2005-09-08 | Scientific Learning Corporation | Method and apparatus for automated training of language learning skills |
US6985865B1 (en) | 2001-09-26 | 2006-01-10 | Sprint Spectrum L.P. | Method and system for enhanced response to voice commands in a voice command platform |
US6650735B2 (en) | 2001-09-27 | 2003-11-18 | Microsoft Corporation | Integrated voice access to a variety of personal information services |
US7324947B2 (en) | 2001-10-03 | 2008-01-29 | Promptu Systems Corporation | Global speech user interface |
US7167832B2 (en) | 2001-10-15 | 2007-01-23 | At&T Corp. | Method for dialog management |
GB2381409B (en) | 2001-10-27 | 2004-04-28 | Hewlett Packard Ltd | Asynchronous access to synchronous voice services |
US7069213B2 (en) * | 2001-11-09 | 2006-06-27 | Netbytel, Inc. | Influencing a voice recognition matching operation with user barge-in time |
NO316480B1 (no) | 2001-11-15 | 2004-01-26 | Forinnova As | Fremgangsmåte og system for tekstuell granskning og oppdagelse |
US20030101054A1 (en) | 2001-11-27 | 2003-05-29 | Ncc, Llc | Integrated system and method for electronic speech recognition and transcription |
TW541517B (en) | 2001-12-25 | 2003-07-11 | Univ Nat Cheng Kung | Speech recognition system |
US7197460B1 (en) | 2002-04-23 | 2007-03-27 | At&T Corp. | System for handling frequently asked questions in a natural language dialog service |
US6847966B1 (en) | 2002-04-24 | 2005-01-25 | Engenium Corporation | Method and system for optimally searching a document database using a representative semantic space |
US7546382B2 (en) | 2002-05-28 | 2009-06-09 | International Business Machines Corporation | Methods and systems for authoring of mixed-initiative multi-modal interactions and related browsing mechanisms |
US7398209B2 (en) | 2002-06-03 | 2008-07-08 | Voicebox Technologies, Inc. | Systems and methods for responding to natural language speech utterance |
US7299033B2 (en) | 2002-06-28 | 2007-11-20 | Openwave Systems Inc. | Domain-based management of distribution of digital content from multiple suppliers to multiple wireless services subscribers |
US7233790B2 (en) | 2002-06-28 | 2007-06-19 | Openwave Systems, Inc. | Device capability based discovery, packaging and provisioning of content for wireless mobile devices |
US7693720B2 (en) | 2002-07-15 | 2010-04-06 | Voicebox Technologies, Inc. | Mobile systems and methods for responding to natural language speech utterance |
US7467087B1 (en) | 2002-10-10 | 2008-12-16 | Gillick Laurence S | Training and using pronunciation guessers in speech recognition |
US7152033B2 (en) * | 2002-11-12 | 2006-12-19 | Motorola, Inc. | Method, system and module for multi-modal data fusion |
US7783486B2 (en) | 2002-11-22 | 2010-08-24 | Roy Jonathan Rosser | Response generator for mimicking human-computer natural language conversation |
EP2017828A1 (en) | 2002-12-10 | 2009-01-21 | Kirusa, Inc. | Techniques for disambiguating speech input using multimodal interfaces |
US7386449B2 (en) | 2002-12-11 | 2008-06-10 | Voice Enabling Systems Technology Inc. | Knowledge-based flexible natural speech dialogue system |
US7956766B2 (en) | 2003-01-06 | 2011-06-07 | Panasonic Corporation | Apparatus operating system |
US7529671B2 (en) | 2003-03-04 | 2009-05-05 | Microsoft Corporation | Block synchronous decoding |
US6980949B2 (en) | 2003-03-14 | 2005-12-27 | Sonum Technologies, Inc. | Natural language processor |
US7496498B2 (en) | 2003-03-24 | 2009-02-24 | Microsoft Corporation | Front-end architecture for a multi-lingual text-to-speech system |
US7421393B1 (en) | 2004-03-01 | 2008-09-02 | At&T Corp. | System for developing a dialog manager using modular spoken-dialog components |
US7200559B2 (en) | 2003-05-29 | 2007-04-03 | Microsoft Corporation | Semantic object synchronous understanding implemented with speech application language tags |
US7720683B1 (en) | 2003-06-13 | 2010-05-18 | Sensory, Inc. | Method and apparatus of specifying and performing speech recognition operations |
US7475010B2 (en) | 2003-09-03 | 2009-01-06 | Lingospot, Inc. | Adaptive and scalable method for resolving natural language ambiguities |
US7418392B1 (en) | 2003-09-25 | 2008-08-26 | Sensory, Inc. | System and method for controlling the operation of a device by voice commands |
US7383170B2 (en) * | 2003-10-10 | 2008-06-03 | At&T Knowledge Ventures, L.P. | System and method for analyzing automatic speech recognition performance data |
US7155706B2 (en) | 2003-10-24 | 2006-12-26 | Microsoft Corporation | Administrative tool environment |
US7584092B2 (en) | 2004-11-15 | 2009-09-01 | Microsoft Corporation | Unsupervised learning of paraphrase/translation alternations and selective application thereof |
US7412385B2 (en) | 2003-11-12 | 2008-08-12 | Microsoft Corporation | System for identifying paraphrases using machine translation |
US7206391B2 (en) * | 2003-12-23 | 2007-04-17 | Apptera Inc. | Method for creating and deploying system changes in a voice application system |
US7447630B2 (en) | 2003-11-26 | 2008-11-04 | Microsoft Corporation | Method and apparatus for multi-sensory speech enhancement |
JP4533845B2 (ja) | 2003-12-05 | 2010-09-01 | 株式会社ケンウッド | オーディオ機器制御装置、オーディオ機器制御方法及びプログラム |
ATE404967T1 (de) | 2003-12-16 | 2008-08-15 | Loquendo Spa | Text-zu-sprache-system und verfahren, computerprogramm dafür |
US7427024B1 (en) | 2003-12-17 | 2008-09-23 | Gazdzinski Mark J | Chattel management apparatus and methods |
US7552055B2 (en) | 2004-01-10 | 2009-06-23 | Microsoft Corporation | Dialog component re-use in recognition systems |
US7567896B2 (en) | 2004-01-16 | 2009-07-28 | Nuance Communications, Inc. | Corpus-based speech synthesis based on segment recombination |
US20050165607A1 (en) | 2004-01-22 | 2005-07-28 | At&T Corp. | System and method to disambiguate and clarify user intention in a spoken dialog system |
EP1560200B8 (en) | 2004-01-29 | 2009-08-05 | Harman Becker Automotive Systems GmbH | Method and system for spoken dialogue interface |
KR100462292B1 (ko) | 2004-02-26 | 2004-12-17 | 엔에이치엔(주) | 중요도 정보를 반영한 검색 결과 리스트 제공 방법 및 그시스템 |
US7693715B2 (en) | 2004-03-10 | 2010-04-06 | Microsoft Corporation | Generating large units of graphonemes with mutual information criterion for letter to sound conversion |
US7409337B1 (en) | 2004-03-30 | 2008-08-05 | Microsoft Corporation | Natural language processing interface |
US7496512B2 (en) | 2004-04-13 | 2009-02-24 | Microsoft Corporation | Refining of segmental boundaries in speech waveforms using contextual-dependent models |
US7673340B1 (en) * | 2004-06-02 | 2010-03-02 | Clickfox Llc | System and method for analyzing system user behavior |
US8095364B2 (en) | 2004-06-02 | 2012-01-10 | Tegic Communications, Inc. | Multimodal disambiguation of speech recognition |
US7720674B2 (en) | 2004-06-29 | 2010-05-18 | Sap Ag | Systems and methods for processing natural language queries |
TWI252049B (en) | 2004-07-23 | 2006-03-21 | Inventec Corp | Sound control system and method |
US7725318B2 (en) | 2004-07-30 | 2010-05-25 | Nice Systems Inc. | System and method for improving the accuracy of audio searching |
US7853574B2 (en) | 2004-08-26 | 2010-12-14 | International Business Machines Corporation | Method of generating a context-inferenced search query and of sorting a result of the query |
US7716056B2 (en) | 2004-09-27 | 2010-05-11 | Robert Bosch Corporation | Method and system for interactive conversational dialogue for cognitively overloaded device users |
US8107401B2 (en) | 2004-09-30 | 2012-01-31 | Avaya Inc. | Method and apparatus for providing a virtual assistant to a communication participant |
US7546235B2 (en) | 2004-11-15 | 2009-06-09 | Microsoft Corporation | Unsupervised learning of paraphrase/translation alternations and selective application thereof |
US7552046B2 (en) | 2004-11-15 | 2009-06-23 | Microsoft Corporation | Unsupervised learning of paraphrase/translation alternations and selective application thereof |
US7702500B2 (en) | 2004-11-24 | 2010-04-20 | Blaedow Karen R | Method and apparatus for determining the meaning of natural language |
CN1609859A (zh) | 2004-11-26 | 2005-04-27 | 孙斌 | 搜索结果聚类的方法 |
US7376645B2 (en) | 2004-11-29 | 2008-05-20 | The Intellection Group, Inc. | Multimodal natural language query system and architecture for processing voice and proximity-based queries |
US20060122834A1 (en) | 2004-12-03 | 2006-06-08 | Bennett Ian M | Emotion detection device & method for use in distributed systems |
US8214214B2 (en) | 2004-12-03 | 2012-07-03 | Phoenix Solutions, Inc. | Emotion detection device and method for use in distributed systems |
US7636657B2 (en) | 2004-12-09 | 2009-12-22 | Microsoft Corporation | Method and apparatus for automatic grammar generation from data entries |
WO2006069381A2 (en) * | 2004-12-22 | 2006-06-29 | Enterprise Integration Group | Turn-taking confidence |
US7873654B2 (en) | 2005-01-24 | 2011-01-18 | The Intellection Group, Inc. | Multimodal natural language query system for processing and analyzing voice and proximity-based queries |
US7508373B2 (en) | 2005-01-28 | 2009-03-24 | Microsoft Corporation | Form factor and input method for language input |
GB0502259D0 (en) | 2005-02-03 | 2005-03-09 | British Telecomm | Document searching tool and method |
US7676026B1 (en) | 2005-03-08 | 2010-03-09 | Baxtech Asia Pte Ltd | Desktop telephony system |
US7925525B2 (en) | 2005-03-25 | 2011-04-12 | Microsoft Corporation | Smart reminders |
WO2006129967A1 (en) | 2005-05-30 | 2006-12-07 | Daumsoft, Inc. | Conversation system and method using conversational agent |
US8041570B2 (en) | 2005-05-31 | 2011-10-18 | Robert Bosch Corporation | Dialogue management using scripts |
US8024195B2 (en) | 2005-06-27 | 2011-09-20 | Sensory, Inc. | Systems and methods of performing speech recognition using historical information |
US7826945B2 (en) | 2005-07-01 | 2010-11-02 | You Zhang | Automobile speech-recognition interface |
WO2007019480A2 (en) | 2005-08-05 | 2007-02-15 | Realnetworks, Inc. | System and computer program product for chronologically presenting data |
US7640160B2 (en) | 2005-08-05 | 2009-12-29 | Voicebox Technologies, Inc. | Systems and methods for responding to natural language speech utterance |
US7620549B2 (en) | 2005-08-10 | 2009-11-17 | Voicebox Technologies, Inc. | System and method of supporting adaptive misrecognition in conversational speech |
US7949529B2 (en) | 2005-08-29 | 2011-05-24 | Voicebox Technologies, Inc. | Mobile systems and methods of supporting natural language human-machine interactions |
EP1934971A4 (en) | 2005-08-31 | 2010-10-27 | Voicebox Technologies Inc | DYNAMIC LANGUAGE SCRIPTURE |
US8265939B2 (en) | 2005-08-31 | 2012-09-11 | Nuance Communications, Inc. | Hierarchical methods and apparatus for extracting user intent from spoken utterances |
US8677377B2 (en) | 2005-09-08 | 2014-03-18 | Apple Inc. | Method and apparatus for building an intelligent automated assistant |
JP4908094B2 (ja) | 2005-09-30 | 2012-04-04 | 株式会社リコー | 情報処理システム、情報処理方法及び情報処理プログラム |
US7930168B2 (en) | 2005-10-04 | 2011-04-19 | Robert Bosch Gmbh | Natural language processing of disfluent sentences |
US8620667B2 (en) | 2005-10-17 | 2013-12-31 | Microsoft Corporation | Flexible speech-activated command and control |
US7707032B2 (en) | 2005-10-20 | 2010-04-27 | National Cheng Kung University | Method and system for matching speech data |
US20070106674A1 (en) | 2005-11-10 | 2007-05-10 | Purusharth Agrawal | Field sales process facilitation systems and methods |
US20070185926A1 (en) | 2005-11-28 | 2007-08-09 | Anand Prahlad | Systems and methods for classifying and transferring information in a storage network |
KR100810500B1 (ko) | 2005-12-08 | 2008-03-07 | 한국전자통신연구원 | 대화형 음성 인터페이스 시스템에서의 사용자 편의성증대 방법 |
DE102005061365A1 (de) | 2005-12-21 | 2007-06-28 | Siemens Ag | Verfahren zur Ansteuerung zumindest einer ersten und zweiten Hintergrundapplikation über ein universelles Sprachdialogsystem |
US7996228B2 (en) | 2005-12-22 | 2011-08-09 | Microsoft Corporation | Voice initiated network operations |
US7599918B2 (en) | 2005-12-29 | 2009-10-06 | Microsoft Corporation | Dynamic search with implicit user intention mining |
JP2007183864A (ja) | 2006-01-10 | 2007-07-19 | Fujitsu Ltd | ファイル検索方法及びそのシステム |
US20070174188A1 (en) | 2006-01-25 | 2007-07-26 | Fish Robert D | Electronic marketplace that facilitates transactions between consolidated buyers and/or sellers |
IL174107A0 (en) | 2006-02-01 | 2006-08-01 | Grois Dan | Method and system for advertising by means of a search engine over a data network |
KR100764174B1 (ko) | 2006-03-03 | 2007-10-08 | 삼성전자주식회사 | 음성 대화 서비스 장치 및 방법 |
US7752152B2 (en) | 2006-03-17 | 2010-07-06 | Microsoft Corporation | Using predictive user models for language modeling on a personal device with user behavior models based on statistical modeling |
JP4734155B2 (ja) | 2006-03-24 | 2011-07-27 | 株式会社東芝 | 音声認識装置、音声認識方法および音声認識プログラム |
US7930183B2 (en) * | 2006-03-29 | 2011-04-19 | Microsoft Corporation | Automatic identification of dialog timing problems for an interactive speech dialog application using speech log data indicative of cases of barge-in and timing problems |
US7707027B2 (en) | 2006-04-13 | 2010-04-27 | Nuance Communications, Inc. | Identification and rejection of meaningless input during natural language classification |
US8423347B2 (en) | 2006-06-06 | 2013-04-16 | Microsoft Corporation | Natural language personal information management |
US20100257160A1 (en) | 2006-06-07 | 2010-10-07 | Yu Cao | Methods & apparatus for searching with awareness of different types of information |
US7523108B2 (en) | 2006-06-07 | 2009-04-21 | Platformation, Inc. | Methods and apparatus for searching with awareness of geography and languages |
US7483894B2 (en) | 2006-06-07 | 2009-01-27 | Platformation Technologies, Inc | Methods and apparatus for entity search |
KR100776800B1 (ko) | 2006-06-16 | 2007-11-19 | 한국전자통신연구원 | 지능형 가제트를 이용한 맞춤형 서비스 제공 방법 및시스템 |
US7548895B2 (en) | 2006-06-30 | 2009-06-16 | Microsoft Corporation | Communication-prompted user assistance |
US9318108B2 (en) | 2010-01-18 | 2016-04-19 | Apple Inc. | Intelligent automated assistant |
US8073681B2 (en) | 2006-10-16 | 2011-12-06 | Voicebox Technologies, Inc. | System and method for a cooperative conversational voice user interface |
US8718538B2 (en) | 2006-11-13 | 2014-05-06 | Joseph Harb | Real-time remote purchase-list capture system |
US20080129520A1 (en) | 2006-12-01 | 2008-06-05 | Apple Computer, Inc. | Electronic device with enhanced audio feedback |
WO2008085742A2 (en) | 2007-01-07 | 2008-07-17 | Apple Inc. | Portable multifunction device, method and graphical user interface for interacting with user input elements in displayed content |
KR100883657B1 (ko) | 2007-01-26 | 2009-02-18 | 삼성전자주식회사 | 음성 인식 기반의 음악 검색 방법 및 장치 |
US7818176B2 (en) | 2007-02-06 | 2010-10-19 | Voicebox Technologies, Inc. | System and method for selecting and presenting advertisements based on natural language processing of voice-based input |
US7801728B2 (en) * | 2007-02-26 | 2010-09-21 | Nuance Communications, Inc. | Document session replay for multimodal applications |
US7822608B2 (en) | 2007-02-27 | 2010-10-26 | Nuance Communications, Inc. | Disambiguating a speech recognition grammar in a multimodal application |
US20080221900A1 (en) | 2007-03-07 | 2008-09-11 | Cerra Joseph P | Mobile local search environment speech processing facility |
US7801729B2 (en) | 2007-03-13 | 2010-09-21 | Sensory, Inc. | Using multiple attributes to create a voice search playlist |
US8219406B2 (en) | 2007-03-15 | 2012-07-10 | Microsoft Corporation | Speech-centric multimodal user interface design in mobile technology |
US8977255B2 (en) | 2007-04-03 | 2015-03-10 | Apple Inc. | Method and system for operating a multi-function portable electronic device using voice-activation |
US7809610B2 (en) | 2007-04-09 | 2010-10-05 | Platformation, Inc. | Methods and apparatus for freshness and completeness of information |
US7983915B2 (en) | 2007-04-30 | 2011-07-19 | Sonic Foundry, Inc. | Audio content search engine |
US8055708B2 (en) | 2007-06-01 | 2011-11-08 | Microsoft Corporation | Multimedia spaces |
US8204238B2 (en) | 2007-06-08 | 2012-06-19 | Sensory, Inc | Systems and methods of sonic communication |
US8190627B2 (en) | 2007-06-28 | 2012-05-29 | Microsoft Corporation | Machine assisted query formulation |
US8019606B2 (en) | 2007-06-29 | 2011-09-13 | Microsoft Corporation | Identification and selection of a software application via speech |
JP2009036999A (ja) | 2007-08-01 | 2009-02-19 | Infocom Corp | コンピュータによる対話方法、対話システム、コンピュータプログラムおよびコンピュータに読み取り可能な記憶媒体 |
KR101359715B1 (ko) | 2007-08-24 | 2014-02-10 | 삼성전자주식회사 | 모바일 음성 웹 제공 방법 및 장치 |
US8190359B2 (en) | 2007-08-31 | 2012-05-29 | Proxpro, Inc. | Situation-aware personal information management for a mobile device |
US20090058823A1 (en) | 2007-09-04 | 2009-03-05 | Apple Inc. | Virtual Keyboards in Multi-Language Environment |
US20090106397A1 (en) | 2007-09-05 | 2009-04-23 | O'keefe Sean Patrick | Method and apparatus for interactive content distribution |
US8171117B2 (en) | 2007-09-14 | 2012-05-01 | Ricoh Co. Ltd. | Workflow manager for a distributed system |
KR100920267B1 (ko) | 2007-09-17 | 2009-10-05 | 한국전자통신연구원 | 음성 대화 분석 시스템 및 그 방법 |
US8706476B2 (en) | 2007-09-18 | 2014-04-22 | Ariadne Genomics, Inc. | Natural language processing method by analyzing primitive sentences, logical clauses, clause types and verbal blocks |
US8165886B1 (en) | 2007-10-04 | 2012-04-24 | Great Northern Research LLC | Speech interface system and method for control and interaction with applications on a computing system |
US8036901B2 (en) | 2007-10-05 | 2011-10-11 | Sensory, Incorporated | Systems and methods of performing speech recognition using sensory inputs of human position |
US20090112677A1 (en) | 2007-10-24 | 2009-04-30 | Rhett Randolph L | Method for automatically developing suggested optimal work schedules from unsorted group and individual task lists |
US7840447B2 (en) | 2007-10-30 | 2010-11-23 | Leonard Kleinrock | Pricing and auctioning of bundled items among multiple sellers and buyers |
US7983997B2 (en) | 2007-11-02 | 2011-07-19 | Florida Institute For Human And Machine Cognition, Inc. | Interactive complex task teaching system that allows for natural language input, recognizes a user's intent, and automatically performs tasks in document object model (DOM) nodes |
US8112280B2 (en) | 2007-11-19 | 2012-02-07 | Sensory, Inc. | Systems and methods of performing speech recognition with barge-in for use in a bluetooth system |
US8140335B2 (en) | 2007-12-11 | 2012-03-20 | Voicebox Technologies, Inc. | System and method for providing a natural language voice user interface in an integrated voice navigation services environment |
US10002189B2 (en) | 2007-12-20 | 2018-06-19 | Apple Inc. | Method and apparatus for searching using an active ontology |
US8219407B1 (en) | 2007-12-27 | 2012-07-10 | Great Northern Research, LLC | Method for processing the output of a speech recognizer |
US8099289B2 (en) | 2008-02-13 | 2012-01-17 | Sensory, Inc. | Voice interface and search for electronic devices including bluetooth headsets and remote systems |
US8958848B2 (en) | 2008-04-08 | 2015-02-17 | Lg Electronics Inc. | Mobile terminal and menu control method thereof |
US8666824B2 (en) | 2008-04-23 | 2014-03-04 | Dell Products L.P. | Digital media content location and purchasing system |
US8285344B2 (en) | 2008-05-21 | 2012-10-09 | DP Technlogies, Inc. | Method and apparatus for adjusting audio for a user environment |
US8589161B2 (en) | 2008-05-27 | 2013-11-19 | Voicebox Technologies, Inc. | System and method for an integrated, multi-modal, multi-device natural language voice services environment |
US8694355B2 (en) | 2008-05-30 | 2014-04-08 | Sri International | Method and apparatus for automated assistance with task management |
US8423288B2 (en) | 2009-11-30 | 2013-04-16 | Apple Inc. | Dynamic alerts for calendar events |
US8166019B1 (en) | 2008-07-21 | 2012-04-24 | Sprint Communications Company L.P. | Providing suggested actions in response to textual communications |
US9200913B2 (en) | 2008-10-07 | 2015-12-01 | Telecommunication Systems, Inc. | User interface for predictive traffic |
US8140328B2 (en) | 2008-12-01 | 2012-03-20 | At&T Intellectual Property I, L.P. | User intention based on N-best list of recognition hypotheses for utterances in a dialog |
US8326637B2 (en) | 2009-02-20 | 2012-12-04 | Voicebox Technologies, Inc. | System and method for processing multi-modal device interactions in a natural language voice services environment |
US8805823B2 (en) | 2009-04-14 | 2014-08-12 | Sri International | Content processing systems and methods |
KR101581883B1 (ko) | 2009-04-30 | 2016-01-11 | 삼성전자주식회사 | 모션 정보를 이용하는 음성 검출 장치 및 방법 |
EP2426598B1 (en) | 2009-04-30 | 2017-06-21 | Samsung Electronics Co., Ltd. | Apparatus and method for user intention inference using multimodal information |
US9858925B2 (en) | 2009-06-05 | 2018-01-02 | Apple Inc. | Using context information to facilitate processing of commands in a virtual assistant |
US10706373B2 (en) | 2011-06-03 | 2020-07-07 | Apple Inc. | Performing actions associated with task items that represent tasks to perform |
US10540976B2 (en) | 2009-06-05 | 2020-01-21 | Apple Inc. | Contextual voice commands |
KR101562792B1 (ko) | 2009-06-10 | 2015-10-23 | 삼성전자주식회사 | 목표 예측 인터페이스 제공 장치 및 그 방법 |
US8527278B2 (en) | 2009-06-29 | 2013-09-03 | Abraham Ben David | Intelligent home automation |
US20110047072A1 (en) | 2009-08-07 | 2011-02-24 | Visa U.S.A. Inc. | Systems and Methods for Propensity Analysis and Validation |
US8768313B2 (en) | 2009-08-17 | 2014-07-01 | Digimarc Corporation | Methods and systems for image or audio recognition processing |
EP2473916A4 (en) | 2009-09-02 | 2013-07-10 | Stanford Res Inst Int | METHOD AND DEVICE FOR USING A HUMAN FEEDBACK IN AN INTELLIGENT AUTOMATED ASSISTANT |
US8321527B2 (en) | 2009-09-10 | 2012-11-27 | Tribal Brands | System and method for tracking user location and associated activity and responsively providing mobile device updates |
KR20110036385A (ko) | 2009-10-01 | 2011-04-07 | 삼성전자주식회사 | 사용자 의도 분석 장치 및 방법 |
CN101673544B (zh) * | 2009-10-10 | 2012-07-04 | 上海电虹软件有限公司 | 一种基于声纹识别和定位跟踪的交叉监控方法和系统 |
US20110099507A1 (en) | 2009-10-28 | 2011-04-28 | Google Inc. | Displaying a collection of interactive elements that trigger actions directed to an item |
US9197736B2 (en) | 2009-12-31 | 2015-11-24 | Digimarc Corporation | Intuitive computing methods and systems |
US20120137367A1 (en) | 2009-11-06 | 2012-05-31 | Cataphora, Inc. | Continuous anomaly detection based on behavior modeling and heterogeneous information analysis |
US9502025B2 (en) | 2009-11-10 | 2016-11-22 | Voicebox Technologies Corporation | System and method for providing a natural language content dedication service |
US9171541B2 (en) | 2009-11-10 | 2015-10-27 | Voicebox Technologies Corporation | System and method for hybrid processing in a natural language voice services environment |
US8712759B2 (en) | 2009-11-13 | 2014-04-29 | Clausal Computing Oy | Specializing disambiguation of a natural language expression |
KR101960835B1 (ko) | 2009-11-24 | 2019-03-21 | 삼성전자주식회사 | 대화 로봇을 이용한 일정 관리 시스템 및 그 방법 |
US8396888B2 (en) | 2009-12-04 | 2013-03-12 | Google Inc. | Location-based searching using a search area that corresponds to a geographical location of a computing device |
KR101622111B1 (ko) | 2009-12-11 | 2016-05-18 | 삼성전자 주식회사 | 대화 시스템 및 그의 대화 방법 |
US20110161309A1 (en) | 2009-12-29 | 2011-06-30 | Lx1 Technology Limited | Method Of Sorting The Result Set Of A Search Engine |
US8494852B2 (en) | 2010-01-05 | 2013-07-23 | Google Inc. | Word-level correction of speech input |
US8311838B2 (en) | 2010-01-13 | 2012-11-13 | Apple Inc. | Devices and methods for identifying a prompt corresponding to a voice input in a sequence of prompts |
US8334842B2 (en) | 2010-01-15 | 2012-12-18 | Microsoft Corporation | Recognizing user intent in motion capture system |
US8626511B2 (en) | 2010-01-22 | 2014-01-07 | Google Inc. | Multi-dimensional disambiguation of voice commands |
US20110218855A1 (en) | 2010-03-03 | 2011-09-08 | Platformation, Inc. | Offering Promotions Based on Query Analysis |
US8265928B2 (en) | 2010-04-14 | 2012-09-11 | Google Inc. | Geotagged environmental audio for enhanced speech recognition accuracy |
US20110279368A1 (en) | 2010-05-12 | 2011-11-17 | Microsoft Corporation | Inferring user intent to engage a motion capture system |
US8694313B2 (en) | 2010-05-19 | 2014-04-08 | Google Inc. | Disambiguation of contact information using historical data |
US8522283B2 (en) | 2010-05-20 | 2013-08-27 | Google Inc. | Television remote control data transfer |
US8468012B2 (en) | 2010-05-26 | 2013-06-18 | Google Inc. | Acoustic model adaptation using geographic information |
US20110306426A1 (en) | 2010-06-10 | 2011-12-15 | Microsoft Corporation | Activity Participation Based On User Intent |
US8234111B2 (en) | 2010-06-14 | 2012-07-31 | Google Inc. | Speech and noise models for speech recognition |
US8411874B2 (en) | 2010-06-30 | 2013-04-02 | Google Inc. | Removing noise from audio |
US8775156B2 (en) | 2010-08-05 | 2014-07-08 | Google Inc. | Translating languages in response to device motion |
US8359020B2 (en) | 2010-08-06 | 2013-01-22 | Google Inc. | Automatically monitoring for voice input based on context |
US8473289B2 (en) | 2010-08-06 | 2013-06-25 | Google Inc. | Disambiguating input based on context |
EP2702473A1 (en) | 2011-04-25 | 2014-03-05 | Veveo, Inc. | System and method for an intelligent personal timeline assistant |
-
2010
- 2010-01-13 US US12/686,774 patent/US8311838B2/en active Active
-
2011
- 2011-01-11 AU AU2011205411A patent/AU2011205411B2/en not_active Ceased
- 2011-01-11 KR KR1020127021219A patent/KR101393816B1/ko active IP Right Grant
- 2011-01-11 CN CN201410244527.9A patent/CN104020978B/zh not_active Expired - Fee Related
- 2011-01-11 WO PCT/US2011/020825 patent/WO2011088038A1/en active Application Filing
- 2011-01-11 CN CN201180009581.XA patent/CN102763159B/zh not_active Expired - Fee Related
- 2011-01-11 EP EP11700488.7A patent/EP2524369B1/en not_active Not-in-force
-
2012
- 2012-09-13 US US13/615,418 patent/US8670985B2/en active Active
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5930751A (en) * | 1997-05-30 | 1999-07-27 | Lucent Technologies Inc. | Method of implicit confirmation for automatic speech recognition |
WO2001046946A1 (en) * | 1999-12-22 | 2001-06-28 | Ambush Interactive, Inc. | Hands-free, voice-operated remote control transmitter |
US20030171928A1 (en) * | 2002-02-04 | 2003-09-11 | Falcon Stephen Russel | Systems and methods for managing interactions from multiple speech-enabled applications |
CN101228503A (zh) * | 2005-03-23 | 2008-07-23 | 摩托罗拉公司 | 用于用户界面的自适应菜单 |
US20060247931A1 (en) * | 2005-04-29 | 2006-11-02 | International Business Machines Corporation | Method and apparatus for multiple value confirmation and correction in spoken dialog systems |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107066227A (zh) * | 2013-01-07 | 2017-08-18 | 三星电子株式会社 | 显示装置和用于控制显示装置的方法 |
CN108447476A (zh) * | 2017-02-06 | 2018-08-24 | 北京嘀嘀无限科技发展有限公司 | 用于请求服务以及服务资源分配的方法及装置 |
Also Published As
Publication number | Publication date |
---|---|
CN104020978A (zh) | 2014-09-03 |
AU2011205411B2 (en) | 2015-03-05 |
US8311838B2 (en) | 2012-11-13 |
WO2011088038A1 (en) | 2011-07-21 |
EP2524369A1 (en) | 2012-11-21 |
EP2524369B1 (en) | 2014-03-05 |
CN102763159B (zh) | 2014-07-09 |
KR20120108044A (ko) | 2012-10-04 |
US20110172994A1 (en) | 2011-07-14 |
US8670985B2 (en) | 2014-03-11 |
CN104020978B (zh) | 2017-05-31 |
KR101393816B1 (ko) | 2014-05-12 |
AU2011205411A1 (en) | 2012-08-09 |
US20130006643A1 (en) | 2013-01-03 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN102763159B (zh) | 话音输入的处理 | |
US10692504B2 (en) | User profiling for voice input processing | |
US11520467B2 (en) | Input device and user interface interactions | |
CN102144209B (zh) | 电子设备中的多层次话音反馈 | |
CN108984081A (zh) | 一种搜索页面交互方法、装置、终端及存储介质 | |
CN101794208A (zh) | 用于无显示器的电子设备的音频用户接口 | |
CN102112946A (zh) | 用于实现用户接口的电子装置和方法 | |
CN101882366A (zh) | 主机设备和附件的遥控信号学习和处理 | |
US20180275756A1 (en) | System And Method Of Controlling Based On A Button Having Multiple Layers Of Pressure | |
CN107147957A (zh) | 视频播放方法和装置 | |
KR20130068303A (ko) | 음성 명령 수행장치, 이를 구비한 이동 단말기 및 음성 명령 수행방법 | |
WO2016112791A1 (zh) | 移动终端应用程序页面的展现方法和装置 | |
CN108073291B (zh) | 一种输入方法和装置、一种用于输入的装置 | |
CN103744658A (zh) | 一种提示启动滚轮浮层的方法及电子设备 | |
KR101096572B1 (ko) | 터치스크린 입력방법과 입력 장치, 이를 포함하는 휴대용 단말 | |
CN103218038A (zh) | 电子设备及其控制方法 | |
CN101571784B (zh) | 电子装置与自动隐藏键盘方法 | |
AU2015203169B2 (en) | Processing of voice inputs |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20140709 |