CN101366075A - Control center for a voice controlled wireless communication device system - Google Patents
Control center for a voice controlled wireless communication device system
- Publication number
- CN101366075A; CNA2006800349872A; CN200680034987A
- Authority
- CN
- China
- Prior art keywords
- file
- speech recognition
- voice
- communication device
- voice commands
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G10L15/30—Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
- G06F3/167—Audio in a user interface, e.g. using voice commands for navigating, audio feedback
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M3/00—Automatic or semi-automatic exchanges
- H04M3/42—Systems providing special services or facilities to subscribers
- H04M3/487—Arrangements for providing information services, e.g. recorded voice services or time announcements
- H04M3/493—Interactive information services, e.g. directory enquiries ; Arrangements therefor, e.g. interactive voice response [IVR] systems or voice portals
- H04M3/4936—Speech interaction details
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/223—Execution procedure of a spoken command
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M2250/00—Details of telephonic subscriber devices
- H04M2250/74—Details of telephonic subscriber devices with voice recognition means
Landscapes
- Engineering & Computer Science (AREA)
- Human Computer Interaction (AREA)
- Multimedia (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Computational Linguistics (AREA)
- Theoretical Computer Science (AREA)
- Signal Processing (AREA)
- General Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Telephonic Communication Services (AREA)
- Mobile Radio Communication Systems (AREA)
Abstract
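The present invention discloses a wireless communication device that accepts recorded audio data from an end user. The audio data may take the form of a command requesting a user action. Likewise, the audio data may be text to be converted into a text file. The audio data is reduced to a digital voice file in a format supported by the device hardware, for example a .wav, .mp3, or .vnf file or the like. The digital voice file is sent over secured or unsecured wireless communication to one or more server computers for further processing. According to an important aspect of the invention, the system evaluates the confidence level of the speech recognition process. If the confidence level is high, the system automatically constructs the application command or creates the text file for transmission to the communication device. Alternatively, if the confidence level of the speech recognition is low, the recorded audio data file is routed to a human transcriber employed by the telecommunications service, who manually reviews the digital voice file and constructs the application command or text file. Once the application command is created, it is transmitted to the communication device. As a result of the invention, speech recognition in the context of communication devices has been shown to be accurate more than 90% of the time.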
Description
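Cross-Reference to Related Applications
This application claims priority to and the benefit of U.S. Provisional Patent Application No. 60/706,806, filed August 9, 2005, which is incorporated herein by reference.
Technical Field
The present invention relates to the use of wireless server-based voice recognition tools to control various wireless communication devices by voice command.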
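Background
Voice controlled systems have been available and in use for many years. These systems, which typically combine computer hardware and software resident on the controlled device, allow an end user to control the device by reciting spoken commands. The spoken commands are then converted into executable commands that control the electronic device. Today, the voice recognition systems that drive voice controlled devices can be found in technologies ranging from computer interfaces and automobiles to cellular telephones and other handheld devices.
Wireless communication devices lend themselves particularly well to voice control. Such devices typically combine cellular telephony, email, contact lists, calendars, Internet web browsing, multimedia players, and many other similar electronic applications into a single electronic package small enough to fit in a pocket or purse. Interaction with a wireless device is usually through a small keypad attached to the device. Because the keypad is much smaller than a standard keyboard, errors are frequent and can be disruptive. In addition, these devices are often used while, for example, driving, which makes it impractical to watch the device and enter commands manually. Ultimately, these problems discourage use of the device for its intended purposes. It is therefore desirable to be able to control a wireless device by voice rather than by keypad.
Existing wireless communication devices rely on programming that resides entirely on the device. The capabilities of such systems are severely limited by the reduced memory and computing power generally available on mobile voice controlled devices. Moreover, speech recognition accuracy is poor, largely because of the environmental challenges facing mobile users, such as background noise, user accents, and cost-efficient hardware, for example microphones that do not provide high quality audio.
U.S. Patent No. 7,027,987 ("the '987 patent") discloses a method of interfacing voice to a search engine. However, as the inventors of the '987 patent reported in their paper, correct recognition of spoken words was achieved only 60% of the time in their experiments. See Alex Franz and Brian Milch, Searching the Web by Voice, Proc. 19th International Conference on Computational Linguistics, 1213-1217 (2002).
Accordingly, there is a need for a voice controlled wireless communication device that can execute a variety of applications while maintaining very high accuracy in recognizing spoken words.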
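Summary of the Invention
A wireless communication device accepts recorded audio data from an end user. The audio data may take the form of a command requesting an action that is normally performed manually on the device, such as sending an email, scheduling an appointment, initiating a telephone call, searching the Internet, playing a multimedia file (for example, an MP3 song), or requesting news-related information (for example, sports scores or stock quotes). Likewise, the audio data may be text to be converted into a text file and saved as a note, letter, or other text data. The audio data is reduced to a digital voice file in a format supported by the device hardware, for example a .wav, .mp3, or .vnf file or the like. The digital voice file is sent over secured or unsecured wireless communication to one or more server computers for further processing. The server computers are typically managed by the same telecommunications service that provides telephone and email access for the communication device. Once the audio data has been recognized, the server processes it by building an application command or a text file and sends the resulting information to the wireless device for proper execution.
Transporting the audio data to a server for speech recognition allows requests to be processed by a more powerful speech engine. By itself, however, this does not improve the quality of the data being interpreted. As many studies and failed systems have demonstrated (http://www.cs.berkeley.edu/%7Emilch/papers/gvs.pdf), if the audio quality is poor, even the best speech recognition cannot produce accurate results. This causes users to stop using the system.
The present invention therefore overcomes the problem of word recognition accuracy not only by using the power of the server computer to perform speech recognition, but also by evaluating the confidence level of the speech recognition process. If the confidence level is high, the system automatically builds the application command or creates the text file for transmission to the communication device. Alternatively, if the confidence level of the speech recognition is low, the recorded audio data file is routed to a human transcriber employed by the telecommunications service, who manually reviews the digital voice file and builds the application command or text file. Once the application command is created, it is transmitted to the communication device. As a result of the invention, speech recognition in the context of communication devices has been shown to be accurate more than 90% of the time.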
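Brief Description of the Drawings
Detailed Description
FIG. 1 shows a voice controlled system for wireless communication devices. The system comprises a handheld communication device 10 in wireless communication with one or more server computers 20. At a minimum, the communication device 10 is able to run programs, also referred to as applications. The communication device 10 also has an audio recording capability, such as a microphone, so that audio data in the form of a voice command from the user can be recorded and saved as a recorded voice command file 30.
The user of the communication device 10 accesses a voice command application resident on the device 10 and speaks a command for controlling the device 10 into the device's microphone. The device 10 records the voice command and creates the recorded voice command file 30. The device 10 may optionally store the recorded voice command file 30 internally for future use. The communication device 10 then sends the recorded voice command file 30 wirelessly to the server computer 20 and waits for the server's response.
Upon receiving the recorded voice command file 30, the server computer 20 executes a series of programming modules to process it. Initially, the server computer 20 performs speech recognition 40 on the recorded voice command file 30, producing an interpreted voice command 50. Where multiple servers are running parallel speech recognition processes, the system determines to which server computer 20 the recorded voice command file 30 is directed for speech recognition, based on various parameters including, but not limited to, the activity of the individual servers. The server computer 20 may optionally store the recorded voice command file 30 internally for future use. The server computer 20 evaluates the confidence level of the speech recognition process 60 to determine the accuracy of the speech recognition. If the confidence level is at or above a predetermined level, the server 20 invokes automatic creation of a machine readable command 70 to create an application command 80.
On the other hand, if the confidence level of the speech recognition process 40 is below the predetermined level, the server 20 routes the recorded voice command file 30 to a human transcriber for manual review and creation of the machine readable command 90.
Once the machine readable command 80 has been created, the server computer 20 transmits the application command 80 to the communication device 10. The communication device 10 directs the received application command 80 to the appropriate application for execution.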
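The confidence-based routing of FIG. 1 can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the names `Recognition`, `process_voice_command`, `recognize`, and `transcribe_manually` are hypothetical, the confidence is assumed to be a number between 0 and 1, and the default threshold of 0.8 is an arbitrary placeholder, since the patent does not give a value for the predetermined level.

```python
from dataclasses import dataclass
from typing import Callable, Tuple


@dataclass
class Recognition:
    text: str          # interpreted voice command
    confidence: float  # recognizer confidence, assumed to lie in [0, 1]


def process_voice_command(
    audio: bytes,
    recognize: Callable[[bytes], Recognition],
    transcribe_manually: Callable[[bytes], str],
    threshold: float = 0.8,  # hypothetical "predetermined level"
) -> Tuple[str, str]:
    """Return (command_text, route), where route is 'automatic' or 'human'."""
    result = recognize(audio)
    # At or above the predetermined level: build the command automatically.
    if result.confidence >= threshold:
        return result.text, "automatic"
    # Below the level: hand the original recording to a human transcriber.
    return transcribe_manually(audio), "human"
```

Passing the recognizer and the transcriber in as callables keeps the sketch independent of any particular speech engine or staffing workflow.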
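The communication device 10 may be any of the many similar types of devices available today. A typical communication device 10 will be able to run various applications including, but not limited to, wireless telephone communication, wireless email, calendaring, contact lists, wireless Internet web browsing, and multimedia presentation. The applications are written in languages that the native device hardware can support, such as C++, Symbian, Java, Linux, and the like. In addition, the device 10 may also be able to run applications other than those provided by the device vendor.
FIG. 2 shows the voice command application running on the communication device. The user starts the application in any of various ways, preferably by pressing a button on the device that launches the application 100. The application prompts the user for an audio recording, such as a spoken command, which it receives 110 and saves as a recorded voice command file 130 in a format supported by the device, for example a .wav, .mp3, or .vnf file. Other file formats may be preferable depending on the hardware. If the user is recording a voice command, the application may optionally present a list of possible commands 105.
The device then establishes a wireless data connection with the server computer and transmits the recorded voice command file 130 to the server. The connection may be secured or unsecured, depending on the preferences of the user and the system administrator. Preferably, the device maintains the connection with the server computer until the server responds 140. Occasionally, the response may take too long and the data connection terminates before the response is received. In that case, the device or the server can re-establish communication at a later time to transmit (or receive) the server's response in the form of an application command 180, and then terminate the connection.
The communication device receives the application command file 180 and interrogates the application command to determine what action the communication device must take 150. Based on the application command file 180, the command is directed to the appropriate application for execution 160.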
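One way to picture steps 150 and 160 on the device is a lookup table from command type to application handler. The sketch below is an assumption-laden illustration, not the patent's implementation: the function `dispatch_command`, the dictionary keys `commandtype`, `contact`, and `song`, and the handlers themselves are all hypothetical.

```python
from typing import Callable, Dict

Handler = Callable[[Dict[str, str]], str]


def dispatch_command(command: Dict[str, str], handlers: Dict[str, Handler]) -> str:
    """Inspect the command type and hand the command to the matching application."""
    command_type = command.get("commandtype")
    handler = handlers.get(command_type)
    if handler is None:
        raise LookupError(f"no application registered for {command_type!r}")
    return handler(command)


# Hypothetical handlers standing in for the phone and media player applications.
example_handlers: Dict[str, Handler] = {
    "call": lambda c: f"dialing {c['contact']}",
    "play": lambda c: f"playing {c['song']}",
}
```

A table of handlers means a new command type only needs a new entry, which matches the document's stated goal of making additional commands easy to add.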
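Furthermore, based on the voice command, execution of an application can be directed to a particular content provider. For example, a request for Internet content could be served by any of several sources on the Internet. The telecommunications service running the system may enter into an agreement with an Internet content provider to direct all such requests only to that provider. Such an agreement can be financially beneficial to the telecommunications service. Likewise, the user may choose which Internet content provider is to be used and may predefine that provider as the source for such requests.
When the audio recording is a voice command, the voice command preferably has a standard format that is followed for all commands. A standardized format for voice commands allows additional commands to be implemented more easily. The voice command should begin with a key phrase that identifies the type of command. Examples of key phrases include, but are not limited to, "call contact", "email", "search web", "find movie", and "play song". The key phrase is followed by additional parameters that depend on the type of voice command. For example, if the key phrase is "call contact", the additional parameter is the contact's name. A more elaborate example is an email command, which would include several additional parameters such as the contact name or email address, the subject, and the text. Some parameters may be introduced by a parameter phrase (for example, "subject" in an email voice command), while others are simply appended to the key phrase without a parameter phrase, as with the contact name that follows the key phrase "call contact".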
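The standard command format described above (a key phrase followed by parameters, some of them introduced by a parameter phrase) could be parsed roughly as follows. This is a minimal sketch: the key phrases come from the text, but the parameter phrases `subject` and `text`, the `target` key for the unnamed first parameter, and the function names are assumptions made for illustration. A message body that itself contains the word "subject" or "text" would be split incorrectly by this naive approach.

```python
import re
from typing import Dict

KEY_PHRASES = ("call contact", "email", "search web", "find movie", "play song")
PARAMETER_PHRASES = ("subject", "text")  # assumed parameter phrases for email


def split_parameters(rest: str) -> Dict[str, str]:
    """Split the remainder of a command on parameter phrases."""
    pattern = r"\b(" + "|".join(PARAMETER_PHRASES) + r")\b"
    parts = re.split(pattern, rest, flags=re.IGNORECASE)
    params: Dict[str, str] = {}
    if parts[0].strip():
        # Parameter appended directly to the key phrase, e.g. a contact name.
        params["target"] = parts[0].strip()
    # re.split with a capturing group alternates: value, name, value, name, ...
    for name, value in zip(parts[1::2], parts[2::2]):
        params[name.lower()] = value.strip()
    return params


def parse_voice_command(text: str) -> Dict[str, object]:
    """Identify the leading key phrase and collect the parameters after it."""
    stripped = text.strip()
    lowered = stripped.lower()
    for phrase in KEY_PHRASES:
        if lowered == phrase or lowered.startswith(phrase + " "):
            rest = stripped[len(phrase):].strip()
            return {"command": phrase, "parameters": split_parameters(rest)}
    raise ValueError(f"no known key phrase at the start of {text!r}")
```

For example, `parse_voice_command("call contact Bob Jones")` yields the command `call contact` with a single `target` parameter holding the contact name.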
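Once the user has recited the voice command into the communication device, the device saves the recorded voice command in an appropriate digital file format for transmission to the server computer. Optionally, the system may also attach a unique device identifier indicating the communication device from which the recorded voice command was received. Based on the unique device identifier, the system can identify additional useful information, as described below.
If a contact list is maintained on the communication device, the list can be periodically transmitted along with the recorded audio file and maintained on the server computer. The saved contact list is used to increase the accuracy of speech translation. The speech recognition process uses the list to assist in automatically translating voice commands that require input from the contact list. In addition, if a voice command is sent to a human transcriber for review, the transcriber can access the particular user's contact list, or the contact list can be presented to the transcriber automatically.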
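The text does not say how the recognizer uses the stored contact list. One plausible illustration, offered purely as an assumption, is to snap a recognized name to the closest entry in the user's contact list using approximate string matching from Python's standard `difflib` module. The function name `snap_to_contact` and the cutoff of 0.8 are hypothetical.

```python
import difflib
from typing import List, Optional


def snap_to_contact(recognized_name: str, contacts: List[str],
                    cutoff: float = 0.8) -> Optional[str]:
    """Return the stored contact closest to the recognized name, if any is close enough."""
    by_lower = {c.lower(): c for c in contacts}
    matches = difflib.get_close_matches(
        recognized_name.lower(), list(by_lower), n=1, cutoff=cutoff
    )
    return by_lower[matches[0]] if matches else None
```

With a contact list containing "Bob Jones", a misrecognized "Bob Jonas" would be corrected to the stored name, while a name with no close match returns `None` and could be left for a human transcriber.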
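Once the contact list has been sent to the server computer, the list can be manipulated as needed. For example, the server computer can manage contact names both with and without a middle initial, so that a record without a middle initial resolves back to the record with the middle initial. For example, if the user requests the contact for Robert Smith but the only record in the user's contact list is Robert T. Smith, the system can find Robert T. Smith and return that result to the user.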
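The middle-initial resolution described above can be sketched as a two-step lookup: try an exact match first, then compare names with single-letter middle initials removed. The helper names below are hypothetical, and the rule for what counts as a middle initial (a single letter, optionally followed by a period, that is neither the first nor the last token) is an assumption.

```python
import re
from typing import List


def normalize(name: str) -> str:
    """Collapse whitespace and ignore case."""
    return " ".join(name.split()).lower()


def without_middle_initials(name: str) -> str:
    """Drop single-letter middle tokens such as 'T.' from a name."""
    tokens = name.split()
    last = len(tokens) - 1
    kept = [t for i, t in enumerate(tokens)
            if i in (0, last) or not re.fullmatch(r"[A-Za-z]\.?", t)]
    return " ".join(kept).lower()


def find_contacts(requested: str, contacts: List[str]) -> List[str]:
    """Prefer exact matches; otherwise match while ignoring middle initials."""
    exact = [c for c in contacts if normalize(c) == normalize(requested)]
    if exact:
        return exact
    key = without_middle_initials(requested)
    return [c for c in contacts if without_middle_initials(c) == key]
```

Calling `find_contacts("Robert Smith", ["Robert T. Smith", "Alice Jones"])` returns the record for Robert T. Smith, mirroring the example in the text.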
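FIG. 3 shows the server computer's processing of the recorded voice command file transmitted from the communication device. The server computer, and all processing of the voice command, is typically controlled by the telecommunications service that provides wireless communication for the communication device. The communication device establishes a wireless data connection with the server computer and transmits the recorded voice command file to the server computer 200. The server computer performs speech recognition 210 on the recorded voice command file 230. A commercially available speech recognition program can be used, such as Dragon Naturally Speaking, available from Nuance, Inc., or a custom speech recognition program can be used. The speech recognition process results in the creation of an interpreted voice command file 250. The speech recognition software should also be able to provide a confidence level measuring how certain the software is that it has recognized the voice command accurately. Such confidence measurements are typically built into the recognition process.
The critical confidence level, that is, the level below which the confidence of the recognition process is insufficient and additional processing must be performed, can be adjusted by the system administrator or by the system itself. If the confidence level produced by speech recognition is at or above the critical confidence level, the application command 280 is created automatically 240 from the interpreted voice command 250 produced by the speech recognition process 210. Conversely, if the confidence level produced by speech recognition is below the critical confidence level, the recorded voice command file 230 is routed to a human transcriber for manual creation of the machine readable command file 280.
The machine readable command file 280 should be in a standard format, such as XML. A standard format allows new commands to be included easily. For example, if the voice command is "call contact Bob Jones", the system identifies "call contact" as the key phrase and builds the XML code for a telephone call command type (for example, <commandtype>call). Knowing the command type, the system next parses out the name and creates the XML code (for example, <contact>Bob Jones). The application command file 280 would thus be <commandtype>call<contact>Bob Jones. Other formats are well known to those skilled in the art and can readily be substituted for XML.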
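As an illustration of building such a command file, the sketch below uses Python's standard `xml.etree.ElementTree`. The element names `commandtype` and `contact` come from the text; the enclosing `command` root element and the closing tags are assumptions added so that the output is well-formed XML, since the fragment quoted in the text shows only opening tags.

```python
import xml.etree.ElementTree as ET


def build_call_command(contact_name: str) -> str:
    """Build a well-formed XML application command for a telephone call."""
    root = ET.Element("command")  # hypothetical root element
    ET.SubElement(root, "commandtype").text = "call"
    ET.SubElement(root, "contact").text = contact_name
    return ET.tostring(root, encoding="unicode")


def read_contact(command_xml: str) -> str:
    """Device side: pull the contact name back out of the command."""
    return ET.fromstring(command_xml).findtext("contact")
```

For "call contact Bob Jones", `build_call_command("Bob Jones")` produces `<command><commandtype>call</commandtype><contact>Bob Jones</contact></command>`.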
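Once the application command file 280 has been created, regardless of the process used to create it, the server computer returns the file 280 to the communication device over the established wireless data connection. As noted above, if the data connection has terminated, the server computer can re-establish the connection to transmit the file 280 to the communication device.
FIG. 4 shows another embodiment of the invention, which uses different speech recognition processes running in parallel rather than a single speech recognition process. The advantage of this approach is that it exploits the differences between speech recognition systems to obtain the most accurate recognition. When all of the speech recognition processes 310 have completed, the system evaluates the confidence level of each process 320. If at least one of the confidence levels of the speech recognition processes 310 is at or above the critical confidence level, the system selects the interpreted voice command file with the highest confidence level 340 and automatically creates the application command 390 from the interpreted voice command file 395. If none of the processes produces a confidence level at or above the critical confidence level, the recorded voice command is routed to a human transcriber for review and manual creation of the application command 360.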
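The parallel embodiment of FIG. 4 can be sketched with a thread pool that runs every recognizer on the same recording and keeps the most confident result. The sketch assumes each recognizer is a callable returning a `(text, confidence)` pair with confidence in the range 0 to 1; the function name, that interface, and the default threshold are hypothetical.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, List, Optional, Tuple

Recognizer = Callable[[bytes], Tuple[str, float]]  # returns (text, confidence)


def best_interpretation(audio: bytes, recognizers: List[Recognizer],
                        threshold: float = 0.8) -> Optional[str]:
    """Run all recognizers in parallel and return the most confident text.

    Returns None when no recognizer reaches the threshold, which signals
    that the recording should go to a human transcriber instead.
    """
    if not recognizers:
        return None
    with ThreadPoolExecutor(max_workers=len(recognizers)) as pool:
        results = list(pool.map(lambda recognize: recognize(audio), recognizers))
    text, confidence = max(results, key=lambda result: result[1])
    return text if confidence >= threshold else None
```

Returning `None` rather than raising keeps the fallback to manual transcription an ordinary branch in the caller.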
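Optionally, an additional content-oriented speech recognition process 335 may be needed. A content-oriented speech recognition process 335 is one that uses a specialized dictionary (for example, a legal dictionary) or a particular language (for example, a Spanish dictionary). Based on the results of the initial speech recognition process 310, and assuming that process is above the critical confidence level 320, it may be determined that the recorded voice command requires additional processing by a content-oriented speech recognition process 335. Likewise, an additional content-oriented speech recognition process 335 may be invoked because the user has opted for it. The system can determine which additional content-oriented speech recognition processes a particular user has requested from the encoded unique identifier.
In one embodiment of the invention, if the recorded voice command file is routed to a human transcriber, the system attempts to direct the file to the most appropriate transcriber. The appropriate transcriber can be selected according to a number of user-defined criteria. For example, the system can access the workload of any particular transcriber and assign the file to the least busy one. Another option is to determine the command type and assign the recorded voice command file to the transcriber best suited to that type of command. This is particularly useful where the command may require a large amount of typing, such as an email command, which typically requires additional information (for example, the text of the email) to be typed. Commands with heavy typing requirements are therefore directed to transcribers who have been identified as the best typists.
The recorded voice command file can also be directed to a transcriber who has past experience with the user who created the voice command. Because a unique identifier is optionally attached to each recorded voice command file, the system can determine which transcriber has previously reviewed voice commands from the user who recorded the command. Because of regional dialects and accents, it may be desirable to have the same transcriber review the voice commands from the same user. That way the transcriber becomes familiar with the user's accent, and future transcriptions are easier for that transcriber.
Commands can also be prioritized according to their time sensitivity. For example, a command that requires an immediate response (such as a command to initiate a call) has higher priority than a command that typically does not (such as a command to send an email), and is therefore assigned to a faster transcriber.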
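The transcriber selection criteria above (familiarity with the user, typing speed for heavy or urgent commands, and workload) could be combined into a single ranking, for example as below. The order in which the criteria are applied is an assumption, as are the `Transcriber` fields, the use of words per minute as a stand-in for "faster", and the set of speed-sensitive command types; the patent lists the criteria without saying how they are weighed against each other.

```python
from dataclasses import dataclass, field
from typing import List, Set, Tuple

# Assumed: email needs heavy typing, and a call needs an immediate response.
SPEED_SENSITIVE = {"email", "call contact"}


@dataclass
class Transcriber:
    name: str
    queue_length: int       # current workload
    words_per_minute: int   # proxy for typing skill and turnaround speed
    known_users: Set[str] = field(default_factory=set)  # device IDs handled before


def choose_transcriber(transcribers: List[Transcriber], device_id: str,
                       command_type: str) -> Transcriber:
    """Pick a transcriber: familiarity first, then speed if relevant, then workload."""
    def rank(t: Transcriber) -> Tuple[int, int, int]:
        unfamiliar = 0 if device_id in t.known_users else 1
        speed = t.words_per_minute if command_type in SPEED_SENSITIVE else 0
        return (unfamiliar, -speed, t.queue_length)
    return min(transcribers, key=rank)
```

Because the ranking is a plain tuple, an operator could reorder the criteria by reordering the tuple elements.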
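Once the recorded voice command file has been routed to a human transcriber, the transcriber can be presented with an automated screen containing visual cues, including the user's past history and other speed techniques designed to accelerate processing by the human transcriber. After the transcriber has manually created the application command file, the system can prompt the transcriber to update the user's speech recognition grammar file, which assists the speech recognition process in recognizing voice commands, as described in more detail below.
FIG. 5 shows another embodiment. In this embodiment, the user records text information to be saved as, for example, a note, letter, memo, or reminder, and the resulting text file is stored on the communication device 410. As in the previous embodiments, the audio data is stored in a recorded audio file 430 and transmitted to the server computer 420. The recorded audio file 430 is processed by a speech recognition server module 440, which creates a text file 450. The server computer 420 evaluates the confidence level of the speech recognition process 460 to determine the accuracy of the speech recognition. If the confidence level is at or above a predetermined level, the automatically created text file 450 is passed to a server module 480 for transmission to the communication device 410. Conversely, if the confidence level of the speech recognition process 440 is below the predetermined level, the server 420 routes the recorded audio file 430 to a human transcriber 470 for manual review and creation of a text file 455.
Regardless of the method used to create the text file 450 or 455, the text file is transmitted 480 over the already established wireless data connection. Once returned to the communication device 410, the text file can be saved on the communication device and displayed using an application suited to displaying text data, such as a notepad or word processor.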
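In another embodiment, the system has one or more application program interfaces that can determine the type of voice command and direct creation of the application command to another organization. In this way, an organization can construct its own set of voice commands unique to an application running on the communication device. This is advantageous where an organization has information that it can readily access but does not want to, or cannot, make available to the telecommunications service running the system. For example, a sales organization may want its sales force to access the company's confidential information through the communication device without that information being accessible to the telecommunications service. When the system determines that a voice command is one of these particular types of command, the recorded voice command file is passed to the organization to create the application command file. The resulting application command file is preferably encrypted using any well-known encryption method known to those skilled in the art. The encrypted application command file is passed back to the telecommunications service for transmission to the communication device. Once received on the communication device, the encrypted application command is directed to the unique application on the communication device provided by the organization.
In yet another embodiment, the unique identifier of the communication device attached to the recorded voice command is used to identify the user who recited the voice command. Thus, when the server computer receives a recorded voice command from the communication device, the system can determine who the user is and whether that user is eligible for the voice command service provided by the telecommunications service. In addition, the speech recognition process can access a user grammar file created for the particular user. The grammar file contains examples of the user's speech patterns and can be used to assist the speech recognition process. Grammar files for particular users are well known in the art and are a standard component of most commercially available speech recognition systems. The grammar file can be constructed by the user, or a human transcriber can create it as described above.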
Claims (7)
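1. A control center for receiving voice control commands from wireless communication devices, comprising:
receiving one or more recorded voice commands transmitted from a wireless communication device and obtained from a server-based speech recognition program in response to a confidence level; directing the voice commands to one or more human transcribers; and
the human transcribers reviewing the voice commands and creating one or more application commands to be transmitted back to the communication device.
2. The control center of claim 1, wherein the voice commands are directed to a particular human transcriber based on human transcriber criteria.
3. The control center of claim 1, wherein the voice commands are directed to a particular human transcriber based on the volume of calls that the particular human transcriber has received.
4. The control center of claim 1, wherein the voice commands are directed to a particular human transcriber based on the user who created the voice command.
5. The control center of claim 1, wherein the voice commands are directed to a particular human transcriber based on the type of command.
6. The control center of claim 1, wherein the human transcriber updates the speech recognition program with the device user's pronunciation of words that were not correctly translated by the server-based speech recognition program.
7. A control center for receiving audio data from wireless communication devices, comprising:
receiving recorded audio data from a wireless communication device, obtained from a server-based speech recognition program in response to a confidence level; directing the audio data to one or more human transcribers; and
the human transcribers reviewing the audio data and creating a text version of the audio data to be transmitted back to the communication device.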
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US70680605P | 2005-08-09 | 2005-08-09 | |
US60/706,806 | 2005-08-09 | ||
PCT/US2006/031265 WO2007055766A2 (en) | 2005-08-09 | 2006-08-09 | Control center for a voice controlled wireless communication device system |
Publications (2)
Publication Number | Publication Date |
---|---|
CN101366075A (zh) | 2009-02-11 |
CN101366075B (zh) | 2016-04-20 |
Family ID=38023732
Family Applications (3)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN200680034897.3A Expired - Fee Related CN101366073B (zh) | 2005-08-09 | 2006-08-09 | 多种语音识别软件实例的使用 |
CN2006800348901A Expired - Fee Related CN101366074B (zh) | 2005-08-09 | 2006-08-09 | 话音控制式无线通信装置系统 |
CN200680034987.2A Expired - Fee Related CN101366075B (zh) | 2005-08-09 | 2006-08-09 | 话音控制式无线通信装置系统的控制中心 |
Family Applications Before (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN200680034897.3A Expired - Fee Related CN101366073B (zh) | 2005-08-09 | 2006-08-09 | Use of multiple speech recognition software instances |
CN2006800348901A Expired - Fee Related CN101366074B (zh) | 2005-08-09 | 2006-08-09 | Voice controlled wireless communication device system |
Country Status (6)
Country | Link |
---|---|
US (7) | US8775189B2 (zh) |
EP (3) | EP1920432A4 (zh) |
JP (3) | JP5394738B2 (zh) |
CN (3) | CN101366073B (zh) |
CA (3) | CA2618547C (zh) |
WO (3) | WO2007055766A2 (zh) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
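CN105096952A (zh) * | 2015-09-01 | 2015-11-25 | 联想(北京)有限公司 | Auxiliary processing method for speech recognition, and server |
CN106537493A (zh) * | 2015-09-29 | 2017-03-22 | 深圳市全圣时代科技有限公司 | Speech recognition system and method, client device, and cloud server |
CN110476150A (zh) * | 2017-03-28 | 2019-11-19 | 三星电子株式会社 | Method for operating a speech recognition service and electronic device supporting the same |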
Families Citing this family (254)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9037451B2 (en) * | 1998-09-25 | 2015-05-19 | Rpx Corporation | Systems and methods for multiple mode voice and data communications using intelligently bridged TDM and packet buses and methods for implementing language capabilities using the same |
US8645137B2 (en) | 2000-03-16 | 2014-02-04 | Apple Inc. | Fast, language-independent method for user authentication by voice |
US8239197B2 (en) * | 2002-03-28 | 2012-08-07 | Intellisist, Inc. | Efficient conversion of voice messages into text |
US20150371629A9 (en) * | 2005-01-03 | 2015-12-24 | Luc Julia | System and method for enabling search and retrieval operations to be performed for data items and records using data obtained from associated voice files |
JP5394738B2 (ja) | 2005-08-09 | 2014-01-22 | モバイル・ヴォイス・コントロール・エルエルシー | Voice controlled wireless communication device system |
US8677377B2 (en) | 2005-09-08 | 2014-03-18 | Apple Inc. | Method and apparatus for building an intelligent automated assistant |
US8510109B2 (en) | 2007-08-22 | 2013-08-13 | Canyon Ip Holdings Llc | Continuous speech transcription performance indication |
US9086737B2 (en) * | 2006-06-15 | 2015-07-21 | Apple Inc. | Dynamically controlled keyboard |
US20080063156A1 (en) * | 2006-08-28 | 2008-03-13 | Sony Ericsson Mobile Communications Ab | System and method for coordinating audiovisual content with contact list information |
US9318108B2 (en) | 2010-01-18 | 2016-04-19 | Apple Inc. | Intelligent automated assistant |
US8635243B2 (en) | 2007-03-07 | 2014-01-21 | Research In Motion Limited | Sending a communications header with voice recording to send metadata for use in speech recognition, formatting, and search mobile search application |
US8886540B2 (en) | 2007-03-07 | 2014-11-11 | Vlingo Corporation | Using speech recognition results based on an unstructured language model in a mobile communication facility application |
US20090030691A1 (en) * | 2007-03-07 | 2009-01-29 | Cerra Joseph P | Using an unstructured language model associated with an application of a mobile communication facility |
US8949130B2 (en) | 2007-03-07 | 2015-02-03 | Vlingo Corporation | Internal and external speech recognition use with a mobile communication facility |
US8996379B2 (en) | 2007-03-07 | 2015-03-31 | Vlingo Corporation | Speech recognition text entry for software applications |
US10056077B2 (en) | 2007-03-07 | 2018-08-21 | Nuance Communications, Inc. | Using speech recognition results based on an unstructured language model with a music system |
US8949266B2 (en) | 2007-03-07 | 2015-02-03 | Vlingo Corporation | Multiple web-based content category searching in mobile search application |
US8886545B2 (en) | 2007-03-07 | 2014-11-11 | Vlingo Corporation | Dealing with switch latency in speech recognition |
US8838457B2 (en) | 2007-03-07 | 2014-09-16 | Vlingo Corporation | Using results of unstructured language model based speech recognition to control a system-level function of a mobile communications facility |
US8977255B2 (en) | 2007-04-03 | 2015-03-10 | Apple Inc. | Method and system for operating a multi-function portable electronic device using voice-activation |
US9973450B2 (en) | 2007-09-17 | 2018-05-15 | Amazon Technologies, Inc. | Methods and systems for dynamically updating web service profile information by parsing transcribed message strings |
US9794348B2 (en) | 2007-06-04 | 2017-10-17 | Todd R. Smith | Using voice commands from a mobile device to remotely access and control a computer |
US9026447B2 (en) * | 2007-11-16 | 2015-05-05 | Centurylink Intellectual Property Llc | Command and control of devices and applications by voice using a communication base system |
US10002189B2 (en) | 2007-12-20 | 2018-06-19 | Apple Inc. | Method and apparatus for searching using an active ontology |
US9330720B2 (en) | 2008-01-03 | 2016-05-03 | Apple Inc. | Methods and apparatus for altering audio output signals |
US8067701B2 (en) * | 2008-01-07 | 2011-11-29 | Apple Inc. | I/O connectors with extendable faraday cage |
US20090234655A1 (en) * | 2008-03-13 | 2009-09-17 | Jason Kwon | Mobile electronic device with active speech recognition |
US8676577B2 (en) * | 2008-03-31 | 2014-03-18 | Canyon IP Holdings, LLC | Use of metadata to post process speech recognition output |
US8996376B2 (en) | 2008-04-05 | 2015-03-31 | Apple Inc. | Intelligent text-to-speech conversion |
KR20090107365A (ko) * | 2008-04-08 | 2009-10-13 | 엘지전자 주식회사 | Mobile terminal and menu control method thereof |
US10496753B2 (en) | 2010-01-18 | 2019-12-03 | Apple Inc. | Automatically adapting user interfaces for hands-free interaction |
JP2010008601A (ja) * | 2008-06-25 | 2010-01-14 | Fujitsu Ltd | Guidance information display device, guidance information display method, and program |
US8364481B2 (en) * | 2008-07-02 | 2013-01-29 | Google Inc. | Speech recognition with parallel recognition tasks |
US20100030549A1 (en) | 2008-07-31 | 2010-02-04 | Lee Michael M | Mobile device having human language translation capability with positional feedback |
US8110744B2 (en) * | 2008-08-19 | 2012-02-07 | Apple Inc. | Flexible shielded cable |
US8676904B2 (en) | 2008-10-02 | 2014-03-18 | Apple Inc. | Electronic devices with voice command and contextual data processing capabilities |
CN101780011B (zh) * | 2009-01-20 | 2013-11-13 | 仝小林 | Traditional Chinese medicine decoction device |
US9858925B2 (en) | 2009-06-05 | 2018-01-02 | Apple Inc. | Using context information to facilitate processing of commands in a virtual assistant |
US10241752B2 (en) | 2011-09-30 | 2019-03-26 | Apple Inc. | Interface for a virtual digital assistant |
US10241644B2 (en) | 2011-06-03 | 2019-03-26 | Apple Inc. | Actionable reminder entries |
US20120309363A1 (en) | 2011-06-03 | 2012-12-06 | Apple Inc. | Triggering notifications associated with tasks items that represent tasks to perform |
US9431006B2 (en) | 2009-07-02 | 2016-08-30 | Apple Inc. | Methods and apparatuses for automatic speech recognition |
US9865263B2 (en) * | 2009-12-01 | 2018-01-09 | Nuance Communications, Inc. | Real-time voice recognition on a handheld device |
US10679605B2 (en) | 2010-01-18 | 2020-06-09 | Apple Inc. | Hands-free list-reading by intelligent automated assistant |
US10705794B2 (en) | 2010-01-18 | 2020-07-07 | Apple Inc. | Automatically adapting user interfaces for hands-free interaction |
US10553209B2 (en) | 2010-01-18 | 2020-02-04 | Apple Inc. | Systems and methods for hands-free notification summaries |
US10276170B2 (en) | 2010-01-18 | 2019-04-30 | Apple Inc. | Intelligent automated assistant |
US8626511B2 (en) * | 2010-01-22 | 2014-01-07 | Google Inc. | Multi-dimensional disambiguation of voice commands |
DE202011111062U1 (de) | 2010-01-25 | 2019-02-19 | Newvaluexchange Ltd. | Device and system for a digital conversation management platform |
US8682667B2 (en) | 2010-02-25 | 2014-03-25 | Apple Inc. | User profiling for selecting user specific voice input processing information |
US8645136B2 (en) * | 2010-07-20 | 2014-02-04 | Intellisist, Inc. | System and method for efficiently reducing transcription error using hybrid voice transcription |
US9472185B1 (en) | 2011-01-05 | 2016-10-18 | Interactions Llc | Automated recognition system for natural language understanding |
US9245525B2 (en) | 2011-01-05 | 2016-01-26 | Interactions Llc | Automated speech recognition proxy system for natural language understanding |
US9262612B2 (en) | 2011-03-21 | 2016-02-16 | Apple Inc. | Device access using voice authentication |
US9202465B2 (en) * | 2011-03-25 | 2015-12-01 | General Motors Llc | Speech recognition dependent on text message content |
US10057736B2 (en) | 2011-06-03 | 2018-08-21 | Apple Inc. | Active transport based notifications |
US8994660B2 (en) | 2011-08-29 | 2015-03-31 | Apple Inc. | Text correction processing |
US9536517B2 (en) * | 2011-11-18 | 2017-01-03 | At&T Intellectual Property I, L.P. | System and method for crowd-sourced data labeling |
US9620122B2 (en) * | 2011-12-08 | 2017-04-11 | Lenovo (Singapore) Pte. Ltd | Hybrid speech recognition |
US9931116B2 (en) | 2012-02-10 | 2018-04-03 | Covidien Lp | Buttress composition |
US10134385B2 (en) | 2012-03-02 | 2018-11-20 | Apple Inc. | Systems and methods for name pronunciation |
US9483461B2 (en) | 2012-03-06 | 2016-11-01 | Apple Inc. | Handling speech synthesis of content for multiple languages |
US20130238326A1 (en) * | 2012-03-08 | 2013-09-12 | Lg Electronics Inc. | Apparatus and method for multiple device voice control |
US9002702B2 (en) * | 2012-05-03 | 2015-04-07 | International Business Machines Corporation | Confidence level assignment to information from audio transcriptions |
US9280610B2 (en) | 2012-05-14 | 2016-03-08 | Apple Inc. | Crowd sourcing information to fulfill user requests |
US10417037B2 (en) | 2012-05-15 | 2019-09-17 | Apple Inc. | Systems and methods for integrating third party services with a digital assistant |
US9721563B2 (en) | 2012-06-08 | 2017-08-01 | Apple Inc. | Name recognition system |
US9495129B2 (en) | 2012-06-29 | 2016-11-15 | Apple Inc. | Device, method, and user interface for voice-activated navigation and browsing of a document |
US9715879B2 (en) * | 2012-07-02 | 2017-07-25 | Salesforce.Com, Inc. | Computer implemented methods and apparatus for selectively interacting with a server to build a local database for speech recognition at a device |
US9547647B2 (en) | 2012-09-19 | 2017-01-17 | Apple Inc. | Voice-based media searching |
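KR20140072927A (ko) * | 2012-11-15 | 2014-06-16 | 엘지전자 주식회사 | Mobile terminal and control method thereof |
TWI515719B (zh) * | 2012-12-28 | 2016-01-01 | 財團法人工業技術研究院 | Shared voice control method and device based on target name recognition, and recording medium and program product thereof |
CN113470640B (zh) | 2013-02-07 | 2022-04-26 | 苹果公司 | Voice trigger for a digital assistant |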
US9894312B2 (en) * | 2013-02-22 | 2018-02-13 | The Directv Group, Inc. | Method and system for controlling a user receiving device using voice commands |
US9384732B2 (en) * | 2013-03-14 | 2016-07-05 | Microsoft Technology Licensing, Llc | Voice command definitions used in launching application with a command |
US10652394B2 (en) | 2013-03-14 | 2020-05-12 | Apple Inc. | System and method for processing voicemail |
US10748529B1 (en) | 2013-03-15 | 2020-08-18 | Apple Inc. | Voice activated device for use with a voice-based digital assistant |
US9582608B2 (en) | 2013-06-07 | 2017-02-28 | Apple Inc. | Unified ranking with entropy-weighted information for phrase-based semantic auto-completion |
WO2014197336A1 (en) | 2013-06-07 | 2014-12-11 | Apple Inc. | System and method for detecting errors in interactions with a voice-based digital assistant |
WO2014197334A2 (en) | 2013-06-07 | 2014-12-11 | Apple Inc. | System and method for user-specified pronunciation of words for speech synthesis and recognition |
WO2014197335A1 (en) | 2013-06-08 | 2014-12-11 | Apple Inc. | Interpreting and acting upon commands that involve sharing information with remote devices |
US10176167B2 (en) | 2013-06-09 | 2019-01-08 | Apple Inc. | System and method for inferring user intent from speech inputs |
WO2014200728A1 (en) | 2013-06-09 | 2014-12-18 | Apple Inc. | Device, method, and graphical user interface for enabling conversation persistence across two or more instances of a digital assistant |
JP6025785B2 (ja) * | 2013-07-08 | 2016-11-16 | インタラクションズ リミテッド ライアビリティ カンパニー | Automated speech recognition proxy system for natural language understanding |
US10186262B2 (en) * | 2013-07-31 | 2019-01-22 | Microsoft Technology Licensing, Llc | System with multiple simultaneous speech recognizers |
WO2015020942A1 (en) | 2013-08-06 | 2015-02-12 | Apple Inc. | Auto-activating smart responses based on activities from remote devices |
US9646613B2 (en) | 2013-11-29 | 2017-05-09 | Daon Holdings Limited | Methods and systems for splitting a digital signal |
US10296160B2 (en) | 2013-12-06 | 2019-05-21 | Apple Inc. | Method for extracting salient dialog usage from live data |
US20150278737A1 (en) * | 2013-12-30 | 2015-10-01 | Google Inc. | Automatic Calendar Event Generation with Structured Data from Free-Form Speech |
WO2015102082A1 (ja) * | 2014-01-06 | 2015-07-09 | 株式会社Nttドコモ | Terminal device, program, and server device for providing information in response to user data input |
US9430463B2 (en) | 2014-05-30 | 2016-08-30 | Apple Inc. | Exemplar-based natural language processing |
US9760559B2 (en) | 2014-05-30 | 2017-09-12 | Apple Inc. | Predictive text input |
US10078631B2 (en) | 2014-05-30 | 2018-09-18 | Apple Inc. | Entropy-guided text prediction using combined word and character n-gram language models |
US9633004B2 (en) | 2014-05-30 | 2017-04-25 | Apple Inc. | Better resolution when referencing to concepts |
EP3480811A1 (en) | 2014-05-30 | 2019-05-08 | Apple Inc. | Multi-command single utterance input method |
US9715875B2 (en) | 2014-05-30 | 2017-07-25 | Apple Inc. | Reducing the need for manual start/end-pointing and trigger phrases |
US9785630B2 (en) | 2014-05-30 | 2017-10-10 | Apple Inc. | Text prediction using combined word N-gram and unigram language models |
US10170123B2 (en) | 2014-05-30 | 2019-01-01 | Apple Inc. | Intelligent assistant for home automation |
US9842101B2 (en) | 2014-05-30 | 2017-12-12 | Apple Inc. | Predictive conversion of language input |
US10418034B1 (en) | 2014-06-20 | 2019-09-17 | Nvoq Incorporated | Systems and methods for a wireless microphone to access remotely hosted applications |
US10659851B2 (en) | 2014-06-30 | 2020-05-19 | Apple Inc. | Real-time digital assistant knowledge updates |
US9338493B2 (en) | 2014-06-30 | 2016-05-10 | Apple Inc. | Intelligent automated assistant for TV user interactions |
US9548066B2 (en) * | 2014-08-11 | 2017-01-17 | Amazon Technologies, Inc. | Voice application architecture |
US10446141B2 (en) | 2014-08-28 | 2019-10-15 | Apple Inc. | Automatic speech recognition based on user feedback |
US9953646B2 (en) | 2014-09-02 | 2018-04-24 | Belleau Technologies | Method and system for dynamic speech recognition and tracking of prewritten script |
US9818400B2 (en) | 2014-09-11 | 2017-11-14 | Apple Inc. | Method and apparatus for discovering trending terms in speech requests |
US10789041B2 (en) | 2014-09-12 | 2020-09-29 | Apple Inc. | Dynamic thresholds for always listening speech trigger |
US9646609B2 (en) | 2014-09-30 | 2017-05-09 | Apple Inc. | Caching apparatus for serving phonetic pronunciations |
US10074360B2 (en) | 2014-09-30 | 2018-09-11 | Apple Inc. | Providing an indication of the suitability of speech recognition |
US9668121B2 (en) | 2014-09-30 | 2017-05-30 | Apple Inc. | Social reminders |
US9886432B2 (en) | 2014-09-30 | 2018-02-06 | Apple Inc. | Parsimonious handling of word inflection via categorical stem + suffix N-gram language models |
US10127911B2 (en) | 2014-09-30 | 2018-11-13 | Apple Inc. | Speaker identification and unsupervised speaker adaptation techniques |
US10552013B2 (en) | 2014-12-02 | 2020-02-04 | Apple Inc. | Data detection |
US9865280B2 (en) | 2015-03-06 | 2018-01-09 | Apple Inc. | Structured dictation using intelligent automated assistants |
US10152299B2 (en) | 2015-03-06 | 2018-12-11 | Apple Inc. | Reducing response latency of intelligent automated assistants |
US10567477B2 (en) | 2015-03-08 | 2020-02-18 | Apple Inc. | Virtual assistant continuity |
US9721566B2 (en) | 2015-03-08 | 2017-08-01 | Apple Inc. | Competing devices responding to voice triggers |
US9886953B2 (en) | 2015-03-08 | 2018-02-06 | Apple Inc. | Virtual assistant activation |
US9899019B2 (en) | 2015-03-18 | 2018-02-20 | Apple Inc. | Systems and methods for structured stem and suffix language models |
US9842105B2 (en) | 2015-04-16 | 2017-12-12 | Apple Inc. | Parsimonious continuous-space phrase representations for natural language processing |
US10460227B2 (en) | 2015-05-15 | 2019-10-29 | Apple Inc. | Virtual assistant in a communication session |
US10083688B2 (en) | 2015-05-27 | 2018-09-25 | Apple Inc. | Device voice control for selecting a displayed affordance |
US10200824B2 (en) | 2015-05-27 | 2019-02-05 | Apple Inc. | Systems and methods for proactively identifying and surfacing relevant content on a touch-sensitive device |
US10127220B2 (en) | 2015-06-04 | 2018-11-13 | Apple Inc. | Language identification from short strings |
US9578173B2 (en) | 2015-06-05 | 2017-02-21 | Apple Inc. | Virtual assistant aided communication with 3rd party service in a communication session |
US10101822B2 (en) | 2015-06-05 | 2018-10-16 | Apple Inc. | Language input correction |
US10186254B2 (en) | 2015-06-07 | 2019-01-22 | Apple Inc. | Context-based endpoint detection |
US11025565B2 (en) | 2015-06-07 | 2021-06-01 | Apple Inc. | Personalized prediction of responses for instant messaging |
US10255907B2 (en) | 2015-06-07 | 2019-04-09 | Apple Inc. | Automatic accent detection using acoustic models |
US20160378747A1 (en) | 2015-06-29 | 2016-12-29 | Apple Inc. | Virtual assistant for media playback |
US10740384B2 (en) | 2015-09-08 | 2020-08-11 | Apple Inc. | Intelligent automated assistant for media search and playback |
US10331312B2 (en) | 2015-09-08 | 2019-06-25 | Apple Inc. | Intelligent automated assistant in a media environment |
US10747498B2 (en) | 2015-09-08 | 2020-08-18 | Apple Inc. | Zero latency digital assistant |
US10671428B2 (en) | 2015-09-08 | 2020-06-02 | Apple Inc. | Distributed personal assistant |
US9697820B2 (en) | 2015-09-24 | 2017-07-04 | Apple Inc. | Unit-selection text-to-speech synthesis using concatenation-sensitive neural networks |
US11010550B2 (en) | 2015-09-29 | 2021-05-18 | Apple Inc. | Unified language modeling framework for word prediction, auto-completion and auto-correction |
US10366158B2 (en) | 2015-09-29 | 2019-07-30 | Apple Inc. | Efficient word encoding for recurrent neural network language models |
US11587559B2 (en) | 2015-09-30 | 2023-02-21 | Apple Inc. | Intelligent device identification |
US10691473B2 (en) | 2015-11-06 | 2020-06-23 | Apple Inc. | Intelligent automated assistant in a messaging environment |
US9653075B1 (en) * | 2015-11-06 | 2017-05-16 | Google Inc. | Voice commands across devices |
US10956666B2 (en) | 2015-11-09 | 2021-03-23 | Apple Inc. | Unconventional virtual assistant interactions |
US10049668B2 (en) | 2015-12-02 | 2018-08-14 | Apple Inc. | Applying neural network language models to weighted finite state transducers for automatic speech recognition |
CN105446489B (zh) * | 2015-12-08 | 2017-09-22 | 广州神马移动信息科技有限公司 | Voice dual-mode control method, apparatus, and user terminal |
US10223066B2 (en) | 2015-12-23 | 2019-03-05 | Apple Inc. | Proactive assistance based on dialog communication between devices |
CN105786441B (zh) * | 2016-01-29 | 2019-01-25 | 腾讯科技(深圳)有限公司 | Audio processing method, server, user equipment, and system |
US10484484B2 (en) | 2016-02-05 | 2019-11-19 | International Business Machines Corporation | Context-aware task processing for multiple devices |
US10044798B2 (en) | 2016-02-05 | 2018-08-07 | International Business Machines Corporation | Context-aware task offloading among multiple devices |
EP3414758B1 (en) * | 2016-02-12 | 2020-09-23 | Samsung Electronics Co., Ltd. | Method and electronic device for performing voice based actions |
US10446143B2 (en) | 2016-03-14 | 2019-10-15 | Apple Inc. | Identification of voice inputs providing credentials |
US9934775B2 (en) | 2016-05-26 | 2018-04-03 | Apple Inc. | Unit-selection text-to-speech synthesis based on predicted concatenation parameters |
US9972304B2 (en) | 2016-06-03 | 2018-05-15 | Apple Inc. | Privacy preserving distributed evaluation framework for embedded personalized systems |
US11227589B2 (en) | 2016-06-06 | 2022-01-18 | Apple Inc. | Intelligent list reading |
US10249300B2 (en) | 2016-06-06 | 2019-04-02 | Apple Inc. | Intelligent list reading |
US10049663B2 (en) | 2016-06-08 | 2018-08-14 | Apple, Inc. | Intelligent automated assistant for media exploration |
DK179309B1 (en) | 2016-06-09 | 2018-04-23 | Apple Inc | Intelligent automated assistant in a home environment |
US10067938B2 (en) | 2016-06-10 | 2018-09-04 | Apple Inc. | Multilingual word prediction |
US10586535B2 (en) | 2016-06-10 | 2020-03-10 | Apple Inc. | Intelligent digital assistant in a multi-tasking environment |
US10490187B2 (en) | 2016-06-10 | 2019-11-26 | Apple Inc. | Digital assistant providing automated status report |
US10192552B2 (en) | 2016-06-10 | 2019-01-29 | Apple Inc. | Digital assistant providing whispered speech |
US10509862B2 (en) | 2016-06-10 | 2019-12-17 | Apple Inc. | Dynamic phrase expansion of language input |
DK179415B1 (en) | 2016-06-11 | 2018-06-14 | Apple Inc | Intelligent device arbitration and control |
DK179049B1 (en) | 2016-06-11 | 2017-09-18 | Apple Inc | Data driven natural language event detection and classification |
DK201670540A1 (en) | 2016-06-11 | 2018-01-08 | Apple Inc | Application integration with a digital assistant |
DK179343B1 (en) | 2016-06-11 | 2018-05-14 | Apple Inc | Intelligent task discovery |
US9619202B1 (en) | 2016-07-07 | 2017-04-11 | Intelligently Interactive, Inc. | Voice command-driven database |
KR20180022021A (ko) * | 2016-08-23 | 2018-03-06 | 삼성전자주식회사 | Speech recognition method and electronic device performing the same |
US10474753B2 (en) | 2016-09-07 | 2019-11-12 | Apple Inc. | Language identification using recurrent neural networks |
US10043516B2 (en) | 2016-09-23 | 2018-08-07 | Apple Inc. | Intelligent automated assistant |
US11281993B2 (en) | 2016-12-05 | 2022-03-22 | Apple Inc. | Model and ensemble compression for metric learning |
US10593346B2 (en) | 2016-12-22 | 2020-03-17 | Apple Inc. | Rank-reduced token representation for automatic speech recognition |
US11204787B2 (en) | 2017-01-09 | 2021-12-21 | Apple Inc. | Application integration with a digital assistant |
US10360914B2 (en) * | 2017-01-26 | 2019-07-23 | Essence, Inc | Speech recognition based on context and multiple recognition engines |
US11100384B2 (en) | 2017-02-14 | 2021-08-24 | Microsoft Technology Licensing, Llc | Intelligent device user interactions |
US11010601B2 (en) | 2017-02-14 | 2021-05-18 | Microsoft Technology Licensing, Llc | Intelligent assistant device communicating non-verbal cues |
US10467510B2 (en) | 2017-02-14 | 2019-11-05 | Microsoft Technology Licensing, Llc | Intelligent assistant |
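KR20180101926A (ko) * | 2017-03-06 | 2018-09-14 | 삼성전자주식회사 | Electronic device and method for controlling applications of the electronic device |
CN106936908A (zh) * | 2017-03-10 | 2017-07-07 | 广州华多网络科技有限公司 | Web-based voice alarm method and related apparatus |
KR102343084B1 (ko) * | 2017-03-27 | 2021-12-27 | 삼성전자주식회사 | Electronic device and method of executing function of electronic device |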
US10547729B2 (en) | 2017-03-27 | 2020-01-28 | Samsung Electronics Co., Ltd. | Electronic device and method of executing function of electronic device |
DK201770383A1 (en) | 2017-05-09 | 2018-12-14 | Apple Inc. | USER INTERFACE FOR CORRECTING RECOGNITION ERRORS |
US10417266B2 (en) | 2017-05-09 | 2019-09-17 | Apple Inc. | Context-aware ranking of intelligent response suggestions |
DK201770439A1 (en) | 2017-05-11 | 2018-12-13 | Apple Inc. | Offline personal assistant |
DK180048B1 (en) | 2017-05-11 | 2020-02-04 | Apple Inc. | MAINTAINING THE DATA PROTECTION OF PERSONAL INFORMATION |
US10395654B2 (en) | 2017-05-11 | 2019-08-27 | Apple Inc. | Text normalization based on a data-driven learning network |
US10726832B2 (en) | 2017-05-11 | 2020-07-28 | Apple Inc. | Maintaining privacy of personal information |
DK179496B1 (en) | 2017-05-12 | 2019-01-15 | Apple Inc. | USER-SPECIFIC Acoustic Models |
DK201770428A1 (en) | 2017-05-12 | 2019-02-18 | Apple Inc. | LOW-LATENCY INTELLIGENT AUTOMATED ASSISTANT |
DK179745B1 (en) | 2017-05-12 | 2019-05-01 | Apple Inc. | SYNCHRONIZATION AND TASK DELEGATION OF A DIGITAL ASSISTANT |
US11301477B2 (en) | 2017-05-12 | 2022-04-12 | Apple Inc. | Feedback analysis of a digital assistant |
DK201770411A1 (en) | 2017-05-15 | 2018-12-20 | Apple Inc. | MULTI-MODAL INTERFACES |
DK201770431A1 (en) | 2017-05-15 | 2018-12-20 | Apple Inc. | Optimizing dialogue policy decisions for digital assistants using implicit feedback |
DK201770432A1 (en) | 2017-05-15 | 2018-12-21 | Apple Inc. | Hierarchical belief states for digital assistants |
EP3459076B1 (en) * | 2017-05-16 | 2020-07-22 | Apple Inc. | Far-field extension for digital assistant services |
US20180336892A1 (en) | 2017-05-16 | 2018-11-22 | Apple Inc. | Detecting a trigger of a digital assistant |
US10311144B2 (en) | 2017-05-16 | 2019-06-04 | Apple Inc. | Emoji word sense disambiguation |
US20180336275A1 (en) | 2017-05-16 | 2018-11-22 | Apple Inc. | Intelligent automated assistant for media exploration |
DK179560B1 (en) | 2017-05-16 | 2019-02-18 | Apple Inc. | FAR-FIELD EXTENSION FOR DIGITAL ASSISTANT SERVICES |
US10403278B2 (en) | 2017-05-16 | 2019-09-03 | Apple Inc. | Methods and systems for phonetic matching in digital assistant services |
US10657328B2 (en) | 2017-06-02 | 2020-05-19 | Apple Inc. | Multi-task recurrent neural network architecture for efficient morphology handling in neural language modeling |
US10607606B2 (en) | 2017-06-19 | 2020-03-31 | Lenovo (Singapore) Pte. Ltd. | Systems and methods for execution of digital assistant |
US10445429B2 (en) | 2017-09-21 | 2019-10-15 | Apple Inc. | Natural language understanding using vocabularies with compressed serialized tries |
US10755051B2 (en) | 2017-09-29 | 2020-08-25 | Apple Inc. | Rule-based natural language processing |
US10636424B2 (en) | 2017-11-30 | 2020-04-28 | Apple Inc. | Multi-turn canned dialog |
EP3496090A1 (en) * | 2017-12-07 | 2019-06-12 | Thomson Licensing | Device and method for privacy-preserving vocal interaction |
US10713007B2 (en) | 2017-12-12 | 2020-07-14 | Amazon Technologies, Inc. | Architecture for a hub configured to control a second device while a connection to a remote system is unavailable |
US10733982B2 (en) | 2018-01-08 | 2020-08-04 | Apple Inc. | Multi-directional dialog |
US10733375B2 (en) | 2018-01-31 | 2020-08-04 | Apple Inc. | Knowledge-based framework for improving natural language understanding |
US10789959B2 (en) | 2018-03-02 | 2020-09-29 | Apple Inc. | Training speaker recognition models for digital assistants |
US11676062B2 (en) * | 2018-03-06 | 2023-06-13 | Samsung Electronics Co., Ltd. | Dynamically evolving hybrid personalized artificial intelligence system |
EP3596729A1 (en) * | 2018-03-07 | 2020-01-22 | Google LLC. | Systems and methods for voice-based initiation of custom device actions |
US10592604B2 (en) | 2018-03-12 | 2020-03-17 | Apple Inc. | Inverse text normalization for automatic speech recognition |
US10818288B2 (en) | 2018-03-26 | 2020-10-27 | Apple Inc. | Natural assistant interaction |
US10909331B2 (en) | 2018-03-30 | 2021-02-02 | Apple Inc. | Implicit identification of translation payload with neural machine translation |
US11145299B2 (en) | 2018-04-19 | 2021-10-12 | X Development Llc | Managing voice interface devices |
US11145294B2 (en) | 2018-05-07 | 2021-10-12 | Apple Inc. | Intelligent automated assistant for delivering content from user experiences |
US10928918B2 (en) | 2018-05-07 | 2021-02-23 | Apple Inc. | Raise to speak |
US10984780B2 (en) | 2018-05-21 | 2021-04-20 | Apple Inc. | Global semantic word embeddings using bi-directional recurrent neural networks |
DK201870355A1 (en) | 2018-06-01 | 2019-12-16 | Apple Inc. | VIRTUAL ASSISTANT OPERATION IN MULTI-DEVICE ENVIRONMENTS |
DK179822B1 (da) | 2018-06-01 | 2019-07-12 | Apple Inc. | Voice interaction at a primary device to access call functionality of a companion device |
US11386266B2 (en) | 2018-06-01 | 2022-07-12 | Apple Inc. | Text correction |
DK180639B1 (en) | 2018-06-01 | 2021-11-04 | Apple Inc | DISABILITY OF ATTENTION-ATTENTIVE VIRTUAL ASSISTANT |
US10892996B2 (en) | 2018-06-01 | 2021-01-12 | Apple Inc. | Variable latency device coordination |
US10944859B2 (en) | 2018-06-03 | 2021-03-09 | Apple Inc. | Accelerated task performance |
US11010561B2 (en) | 2018-09-27 | 2021-05-18 | Apple Inc. | Sentiment prediction from textual data |
US10839159B2 (en) | 2018-09-28 | 2020-11-17 | Apple Inc. | Named entity normalization in a spoken dialog system |
US11170166B2 (en) | 2018-09-28 | 2021-11-09 | Apple Inc. | Neural typographical error modeling via generative adversarial networks |
US11462215B2 (en) | 2018-09-28 | 2022-10-04 | Apple Inc. | Multi-modal inputs for voice commands |
US11627012B2 (en) | 2018-10-09 | 2023-04-11 | NewTekSol, LLC | Home automation management system |
US11475898B2 (en) | 2018-10-26 | 2022-10-18 | Apple Inc. | Low-latency multi-speaker speech recognition |
US11638059B2 (en) | 2019-01-04 | 2023-04-25 | Apple Inc. | Content playback on multiple devices |
US11348573B2 (en) | 2019-03-18 | 2022-05-31 | Apple Inc. | Multimodality in digital assistant systems |
KR20200117317A (ko) * | 2019-04-03 | 2020-10-14 | 현대자동차주식회사 | Dialogue system and dialogue processing method |
US11170782B2 (en) * | 2019-04-08 | 2021-11-09 | Speech Cloud, Inc | Real-time audio transcription, video conferencing, and online collaboration system and methods |
US11307752B2 (en) | 2019-05-06 | 2022-04-19 | Apple Inc. | User configurable task triggers |
US11475884B2 (en) | 2019-05-06 | 2022-10-18 | Apple Inc. | Reducing digital assistant latency when a language is incorrectly determined |
DK201970509A1 (en) | 2019-05-06 | 2021-01-15 | Apple Inc | Spoken notifications |
US11423908B2 (en) | 2019-05-06 | 2022-08-23 | Apple Inc. | Interpreting spoken requests |
US11140099B2 (en) | 2019-05-21 | 2021-10-05 | Apple Inc. | Providing message response suggestions |
US11496600B2 (en) | 2019-05-31 | 2022-11-08 | Apple Inc. | Remote execution of machine-learned models |
DK180129B1 (en) | 2019-05-31 | 2020-06-02 | Apple Inc. | USER ACTIVITY SHORTCUT SUGGESTIONS |
US11289073B2 (en) | 2019-05-31 | 2022-03-29 | Apple Inc. | Device text to speech |
DK201970511A1 (en) | 2019-05-31 | 2021-02-15 | Apple Inc | Voice identification in digital assistant systems |
US11360641B2 (en) | 2019-06-01 | 2022-06-14 | Apple Inc. | Increasing the relevance of new available information |
US11468890B2 (en) | 2019-06-01 | 2022-10-11 | Apple Inc. | Methods and user interfaces for voice-based control of electronic devices |
CN110335598A (zh) * | 2019-06-26 | 2019-10-15 | 重庆金美通信有限责任公司 | Voice communication method over a wireless narrowband channel based on speech recognition |
WO2021056255A1 (en) | 2019-09-25 | 2021-04-01 | Apple Inc. | Text detection using global geometry estimators |
US11676496B2 (en) | 2020-03-19 | 2023-06-13 | Honeywell International Inc. | Methods and systems for querying for parameter retrieval |
US11183193B1 (en) | 2020-05-11 | 2021-11-23 | Apple Inc. | Digital assistant hardware abstraction |
US11061543B1 (en) | 2020-05-11 | 2021-07-13 | Apple Inc. | Providing relevant data items based on context |
US11755276B2 (en) | 2020-05-12 | 2023-09-12 | Apple Inc. | Reducing description length based on confidence |
US11490204B2 (en) | 2020-07-20 | 2022-11-01 | Apple Inc. | Multi-device audio adjustment coordination |
US11438683B2 (en) | 2020-07-21 | 2022-09-06 | Apple Inc. | User identification using headphones |
US20220129543A1 (en) * | 2020-10-27 | 2022-04-28 | Arris Enterprises Llc | Secure voice interface in a streaming media device to avoid vulnerability attacks |
US12021806B1 (en) | 2021-09-21 | 2024-06-25 | Apple Inc. | Intelligent message delivery |
Family Cites Families (103)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5153905A (en) | 1989-11-27 | 1992-10-06 | Dictaphone Corporation | Priority voice message delivery system |
CN1020365C (zh) * | 1991-08-15 | 1993-04-21 | 北京海淀志远开发公司 | Method and apparatus for automatic answering and call transfer in a private branch exchange |
WO1994014270A1 (en) * | 1992-12-17 | 1994-06-23 | Bell Atlantic Network Services, Inc. | Mechanized directory assistance |
US6594628B1 (en) * | 1995-09-21 | 2003-07-15 | Qualcomm, Incorporated | Distributed voice recognition system |
US5488652A (en) * | 1994-04-14 | 1996-01-30 | Northern Telecom Limited | Method and apparatus for training speech recognition algorithms for directory assistance applications |
US5754978A (en) * | 1995-10-27 | 1998-05-19 | Speech Systems Of Colorado, Inc. | Speech recognition system |
US6122613A (en) * | 1997-01-30 | 2000-09-19 | Dragon Systems, Inc. | Speech recognition using multiple recognizers (selectively) applied to the same input sample |
GB2323693B (en) * | 1997-03-27 | 2001-09-26 | Forum Technology Ltd | Speech to text conversion |
US6173259B1 (en) * | 1997-03-27 | 2001-01-09 | Speech Machines Plc | Speech to text conversion |
EP0980574B1 (en) * | 1997-10-20 | 2004-03-10 | Koninklijke Philips Electronics N.V. | Pattern recognition enrolment in a distributed system |
US6151572A (en) * | 1998-04-27 | 2000-11-21 | Motorola, Inc. | Automatic and attendant speech to text conversion in a selective call radio system and method |
US6614885B2 (en) * | 1998-08-14 | 2003-09-02 | Intervoice Limited Partnership | System and method for operating a highly distributed interactive voice response system |
US6839410B2 (en) * | 1998-09-01 | 2005-01-04 | At&T Corp. | Method and apparatus for setting user communication parameters based on voice identification of users |
US6167251A (en) * | 1998-10-02 | 2000-12-26 | Telespree Communications | Keyless portable cellular phone system having remote voice recognition |
US8275617B1 (en) * | 1998-12-17 | 2012-09-25 | Nuance Communications, Inc. | Speech command input recognition system for interactive computer display with interpretation of ancillary relevant speech query terms into commands |
FI116991B (fi) * | 1999-01-18 | 2006-04-28 | Nokia Corp | Method in speech recognition, speech recognition device, and voice-controlled wireless communication device |
US6643622B2 (en) * | 1999-02-19 | 2003-11-04 | Robert O. Stuart | Data retrieval assistance system and method utilizing a speech recognition system and a live operator |
US6243684B1 (en) * | 1999-02-19 | 2001-06-05 | Usada, Inc. | Directory assistance system and method utilizing a speech recognition system and a live operator |
DE19910236A1 (de) * | 1999-03-09 | 2000-09-21 | Philips Corp Intellectual Pty | Method for speech recognition |
DE19910234A1 (de) * | 1999-03-09 | 2000-09-21 | Philips Corp Intellectual Pty | Method with a plurality of speech recognizers |
GB9911971D0 (en) * | 1999-05-21 | 1999-07-21 | Canon Kk | A system, a server for a system and a machine for use in a system |
US6865258B1 (en) * | 1999-08-13 | 2005-03-08 | Intervoice Limited Partnership | Method and system for enhanced transcription |
US6990514B1 (en) | 1999-09-03 | 2006-01-24 | Cisco Technology, Inc. | Unified messaging system using web based application server for management of messages using standardized servers |
US6738803B1 (en) * | 1999-09-03 | 2004-05-18 | Cisco Technology, Inc. | Proxy browser providing voice enabled web application audio control for telephony devices |
US7725307B2 (en) * | 1999-11-12 | 2010-05-25 | Phoenix Solutions, Inc. | Query engine for processing voice based queries including semantic decoding |
JP3444486B2 (ja) * | 2000-01-26 | 2003-09-08 | インターナショナル・ビジネス・マシーンズ・コーポレーション | Automatic voice response system and method using speech recognition means |
US6438215B1 (en) | 2000-02-29 | 2002-08-20 | Ameritech Corporation | Method and system for filter based message processing in a unified messaging system |
US6578007B1 (en) * | 2000-02-29 | 2003-06-10 | Dictaphone Corporation | Global document creation system including administrative server computer |
US6760699B1 (en) * | 2000-04-24 | 2004-07-06 | Lucent Technologies Inc. | Soft feature decoding in a distributed automatic speech recognition system for use over wireless channels |
US6778961B2 (en) * | 2000-05-17 | 2004-08-17 | Wconect, Llc | Method and system for delivering text-to-speech in a real time telephony environment |
AU2001268293A1 (en) * | 2000-06-12 | 2001-12-24 | L And H Holdings Usa, Inc. | Using utterance-level confidence estimates |
US6621892B1 (en) | 2000-07-14 | 2003-09-16 | America Online, Inc. | System and method for converting electronic mail text to audio for telephonic delivery |
AU2001279101A1 (en) * | 2000-07-31 | 2002-02-13 | Eliza Corporation | Method of and system for improving accuracy in a speech recognition system |
JP2002150039A (ja) * | 2000-08-31 | 2002-05-24 | Hitachi Ltd | Service mediation device |
US7236932B1 (en) * | 2000-09-12 | 2007-06-26 | Avaya Technology Corp. | Method of and apparatus for improving productivity of human reviewers of automatically transcribed documents generated by media conversion systems |
JP2002116796A (ja) * | 2000-10-11 | 2002-04-19 | Canon Inc | Speech processing apparatus, speech processing method, and storage medium |
JP2002140243A (ja) * | 2000-10-31 | 2002-05-17 | Arcadia:Kk | Network system and processing management device |
US6980953B1 (en) * | 2000-10-31 | 2005-12-27 | International Business Machines Corp. | Real-time remote transcription or translation service |
JP2002182691A (ja) * | 2000-12-14 | 2002-06-26 | Matsushita Electric Ind Co Ltd | Control device for controlling equipment that outputs sound |
US6671354B2 (en) * | 2001-01-23 | 2003-12-30 | Ivoice.Com, Inc. | Speech enabled, automatic telephone dialer using names, including seamless interface with computer-based address book programs, for telephones without private branch exchanges |
US20030004720A1 (en) * | 2001-01-30 | 2003-01-02 | Harinath Garudadri | System and method for computing and transmitting parameters in a distributed voice recognition system |
US7027987B1 (en) | 2001-02-07 | 2006-04-11 | Google Inc. | Voice interface for a search engine |
US20020178003A1 (en) * | 2001-03-09 | 2002-11-28 | Motorola, Inc. | Method and apparatus for providing voice recognition service to a wireless communication device |
US7593920B2 (en) * | 2001-04-04 | 2009-09-22 | West Services, Inc. | System, method, and software for identifying historically related legal opinions |
US20020152071A1 (en) * | 2001-04-12 | 2002-10-17 | David Chaiken | Human-augmented, automatic speech recognition engine |
US6760705B2 (en) * | 2001-05-31 | 2004-07-06 | Motorola, Inc. | Virtual speech interface system and method of using same |
US6701293B2 (en) * | 2001-06-13 | 2004-03-02 | Intel Corporation | Combining N-best lists from multiple speech recognizers |
US6996525B2 (en) * | 2001-06-15 | 2006-02-07 | Intel Corporation | Selecting one of multiple speech recognizers in a system based on performance predections resulting from experience |
US20030046350A1 (en) * | 2001-09-04 | 2003-03-06 | Systel, Inc. | System for transcribing dictation |
US8583430B2 (en) * | 2001-09-06 | 2013-11-12 | J. Albert Avila | Semi-automated intermodal voice to data transcription method and apparatus |
US20030050783A1 (en) * | 2001-09-13 | 2003-03-13 | Shinichi Yoshizawa | Terminal device, server device and speech recognition method |
US7313525B1 (en) * | 2001-09-26 | 2007-12-25 | Sprint Spectrum L.P. | Method and system for bookmarking navigation points in a voice command title platform |
US20030065724A1 (en) | 2001-09-28 | 2003-04-03 | Openwave Systems Inc. | Managing messages in unified messaging systems |
US7308404B2 (en) * | 2001-09-28 | 2007-12-11 | Sri International | Method and apparatus for speech recognition using a dynamic vocabulary |
JP3997459B2 (ja) * | 2001-10-02 | 2007-10-24 | 株式会社日立製作所 | Voice input system, voice portal server, and voice input terminal |
US7146321B2 (en) * | 2001-10-31 | 2006-12-05 | Dictaphone Corporation | Distributed speech recognition system |
JP2003140691A (ja) * | 2001-11-07 | 2003-05-16 | Hitachi Ltd | Speech recognition device |
US6785654B2 (en) * | 2001-11-30 | 2004-08-31 | Dictaphone Corporation | Distributed speech recognition system with speech recognition engines offering multiple functionalities |
US7103542B2 (en) * | 2001-12-14 | 2006-09-05 | Ben Franklin Patent Holding Llc | Automatically improving a voice recognition system |
US6898567B2 (en) * | 2001-12-29 | 2005-05-24 | Motorola, Inc. | Method and apparatus for multi-level distributed speech recognition |
US8170197B2 (en) * | 2002-03-15 | 2012-05-01 | Intellisist, Inc. | System and method for providing automated call center post-call processing |
US7099825B1 (en) * | 2002-03-15 | 2006-08-29 | Sprint Communications Company L.P. | User mobility in a voice recognition environment |
US8239197B2 (en) * | 2002-03-28 | 2012-08-07 | Intellisist, Inc. | Efficient conversion of voice messages into text |
AU2003222132A1 (en) * | 2002-03-28 | 2003-10-13 | Martin Dunsmuir | Closed-loop command and response system for automatic communications between interacting computer systems over an audio communications channel |
JP2003295890A (ja) * | 2002-04-04 | 2003-10-15 | Nec Corp | Speech recognition dialogue selection device, speech recognition dialogue system, speech recognition dialogue selection method, and program |
WO2003093766A1 (fr) * | 2002-04-30 | 2003-11-13 | Hitachi, Ltd. | Communication-type navigation system and navigation method |
US7292975B2 (en) * | 2002-05-01 | 2007-11-06 | Nuance Communications, Inc. | Systems and methods for evaluating speaker suitability for automatic speech recognition aided transcription |
US7502737B2 (en) * | 2002-06-24 | 2009-03-10 | Intel Corporation | Multi-pass recognition of spoken dialogue |
US7421390B2 (en) * | 2002-09-13 | 2008-09-02 | Sun Microsystems, Inc. | Method and system for voice control of software applications |
US7184957B2 (en) * | 2002-09-25 | 2007-02-27 | Toyota Infotechnology Center Co., Ltd. | Multiple pass speech recognition method and system |
US7016844B2 (en) * | 2002-09-26 | 2006-03-21 | Core Mobility, Inc. | System and method for online transcription services |
US7228275B1 (en) * | 2002-10-21 | 2007-06-05 | Toyota Infotechnology Center Co., Ltd. | Speech recognition system having multiple speech recognizers |
US7539086B2 (en) * | 2002-10-23 | 2009-05-26 | J2 Global Communications, Inc. | System and method for the secure, real-time, high accuracy conversion of general-quality speech into text |
JP4059059B2 (ja) * | 2002-10-29 | 2008-03-12 | 日産自動車株式会社 | Information acquisition device and information providing system |
US6714631B1 (en) * | 2002-10-31 | 2004-03-30 | Sbc Properties, L.P. | Method and system for an automated departure strategy |
US6889188B2 (en) * | 2002-11-22 | 2005-05-03 | Intel Corporation | Methods and apparatus for controlling an electronic device |
US6834265B2 (en) * | 2002-12-13 | 2004-12-21 | Motorola, Inc. | Method and apparatus for selective speech recognition |
CA2419526A1 (en) * | 2002-12-16 | 2004-06-16 | John Taschereau | Voice recognition system |
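JP2004198597A (ja) * | 2002-12-17 | 2004-07-15 | Advanced Telecommunication Research Institute International | Computer program for operating a computer as a speech recognition device and sentence classification device, computer program for operating a computer to implement a method of creating a hierarchical language model, and storage medium |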
US7822612B1 (en) * | 2003-01-03 | 2010-10-26 | Verizon Laboratories Inc. | Methods of processing a voice command from a caller |
US20040138885A1 (en) * | 2003-01-09 | 2004-07-15 | Xiaofan Lin | Commercial automatic speech recognition engine combinations |
US7426468B2 (en) * | 2003-03-01 | 2008-09-16 | Coifman Robert E | Method and apparatus for improving the transcription accuracy of speech recognition software |
US20040181467A1 (en) * | 2003-03-14 | 2004-09-16 | Samir Raiyani | Multi-modal warehouse applications |
US20040204941A1 (en) * | 2003-03-28 | 2004-10-14 | Wetype4U | Digital transcription system and method |
JP2004310692A (ja) * | 2003-04-10 | 2004-11-04 | Mitsubishi Electric Corp | Fault resolution support device |
EP1618734A2 (en) * | 2003-04-22 | 2006-01-25 | Spinvox Limited | Operator performed voicemail transcription |
JP2005003997A (ja) * | 2003-06-12 | 2005-01-06 | Toyota Motor Corp | Speech recognition device, speech recognition method, and vehicle |
US20040264677A1 (en) * | 2003-06-30 | 2004-12-30 | Horvitz Eric J. | Ideal transfer of call handling from automated systems to human operators based on forecasts of automation efficacy and operator load |
EP1661122B1 (en) * | 2003-08-29 | 2008-10-08 | Johnson Controls Technology Company | System and method of operating a speech recognition system in a vehicle |
US7917364B2 (en) * | 2003-09-23 | 2011-03-29 | Hewlett-Packard Development Company, L.P. | System and method using multiple automated speech recognition engines |
US7376561B2 (en) * | 2004-02-23 | 2008-05-20 | Louis Ralph Rennillo | Real-time transcription system |
US7340395B2 (en) * | 2004-04-23 | 2008-03-04 | Sap Aktiengesellschaft | Multiple speech recognition engines |
US20060171775A1 (en) * | 2005-01-31 | 2006-08-03 | Mclaughlin Ronald | Articulated torque rod with elastomer retainer |
US20060004570A1 (en) * | 2004-06-30 | 2006-01-05 | Microsoft Corporation | Transcribing speech data with dialog context and/or recognition alternative information |
US8589156B2 (en) * | 2004-07-12 | 2013-11-19 | Hewlett-Packard Development Company, L.P. | Allocation of speech recognition tasks and combination of results thereof |
KR100695127B1 (ko) * | 2004-10-08 | 2007-03-14 | 삼성전자주식회사 | Multi-stage speech recognition apparatus and method |
US7437297B2 (en) * | 2005-01-27 | 2008-10-14 | International Business Machines Corporation | Systems and methods for predicting consequences of misinterpretation of user commands in automated systems |
US7548977B2 (en) * | 2005-02-11 | 2009-06-16 | International Business Machines Corporation | Client / server application task allocation based upon client resources |
US8265930B1 (en) * | 2005-04-13 | 2012-09-11 | Sprint Communications Company L.P. | System and method for recording voice data and converting voice data to a text file |
US20060235684A1 (en) * | 2005-04-14 | 2006-10-19 | Sbc Knowledge Ventures, Lp | Wireless device to access network-based voice-activated services using distributed speech recognition |
JP5394738B2 (ja) | 2005-08-09 | 2014-01-22 | モバイル・ヴォイス・コントロール・エルエルシー | Voice-controlled wireless communication device system |
US8121838B2 (en) * | 2006-04-11 | 2012-02-21 | Nuance Communications, Inc. | Method and system for automatic transcription prioritization |
US8364481B2 (en) * | 2008-07-02 | 2013-01-29 | Google Inc. | Speech recognition with parallel recognition tasks |
-
2006
- 2006-08-09 JP JP2008526224A patent/JP5394738B2/ja not_active Expired - Fee Related
- 2006-08-09 EP EP06801336A patent/EP1920432A4/en not_active Ceased
- 2006-08-09 US US11/501,950 patent/US8775189B2/en not_active Expired - Fee Related
- 2006-08-09 JP JP2008526207A patent/JP5320064B2/ja not_active Expired - Fee Related
- 2006-08-09 EP EP06801186A patent/EP1922719A4/en not_active Ceased
- 2006-08-09 WO PCT/US2006/031265 patent/WO2007055766A2/en active Application Filing
- 2006-08-09 US US11/502,030 patent/US7957975B2/en not_active Expired - Fee Related
- 2006-08-09 CN CN200680034897.3A patent/CN101366073B/zh not_active Expired - Fee Related
- 2006-08-09 EP EP06801224A patent/EP1922717A4/en not_active Ceased
- 2006-08-09 CN CN2006800348901A patent/CN101366074B/zh not_active Expired - Fee Related
- 2006-08-09 CN CN200680034987.2A patent/CN101366075B/zh not_active Expired - Fee Related
- 2006-08-09 US US11/501,998 patent/US7822610B2/en active Active
- 2006-08-09 WO PCT/US2006/031334 patent/WO2007092044A1/en active Application Filing
- 2006-08-09 CA CA2618547A patent/CA2618547C/en not_active Expired - Fee Related
- 2006-08-09 JP JP2008526257A patent/JP5394739B2/ja not_active Expired - Fee Related
- 2006-08-09 CA CA2618626A patent/CA2618626C/en not_active Expired - Fee Related
- 2006-08-09 CA CA2618623A patent/CA2618623C/en not_active Expired - Fee Related
- 2006-08-09 WO PCT/US2006/031500 patent/WO2007061466A2/en active Application Filing
-
2010
- 2010-09-17 US US12/884,540 patent/US8812325B2/en not_active Expired - Fee Related
-
2011
- 2011-03-18 US US13/051,167 patent/US8315878B1/en active Active
-
2012
- 2012-11-15 US US13/677,553 patent/US8682676B2/en active Active
-
2014
- 2014-02-11 US US14/177,769 patent/US9293139B2/en active Active
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
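CN105096952A (zh) * | 2015-09-01 | 2015-11-25 | 联想(北京)有限公司 | Auxiliary processing method and server for speech recognition |
CN106537493A (zh) * | 2015-09-29 | 2017-03-22 | 深圳市全圣时代科技有限公司 | Speech recognition system and method, client device, and cloud server |
CN110476150A (zh) * | 2017-03-28 | 2019-11-19 | 三星电子株式会社 | Method for operating speech recognition service and electronic device supporting the same |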
US11733964B2 (en) | 2017-03-28 | 2023-08-22 | Samsung Electronics Co., Ltd. | Method for operating speech recognition service and electronic device supporting the same |
CN110476150B (zh) * | 2017-03-28 | 2023-12-29 | 三星电子株式会社 | Method for operating speech recognition service and electronic device supporting the same |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
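CN101366074B (zh) | Voice-controlled wireless communication device system | |
CN103035240B (zh) | Method and system for speech recognition repair using contextual information | |
US7980465B2 (en) | Hands free contact database information entry at a communication device | |
CN104541325A (zh) | Hybrid model speech recognition | |
KR20200011198A (ko) | Method, apparatus, and program for implementing interactive messages | |
KR100380829B1 (ko) | System and method for operating a conversational interface using an agent, and recording medium storing its program source | |
US20080046230A1 (en) | Reception support system and program therefor | |
KR20060065789A (ko) | Method for reading input characters aloud in real time on a mobile terminal | |
TW201004282A (en) | System and method for playing text short messages | |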
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee | Granted publication date: 20160420 Termination date: 20200809 |