CN1849579A - Voice information system - Google Patents
- Publication number
- CN1849579A
- Authority
- CN
- China
- Prior art keywords
- audio file
- menu
- text string
- media
- user
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/048—Interaction techniques based on graphical user interfaces [GUI]
- G06F3/0481—Interaction techniques based on graphical user interfaces [GUI] based on specific properties of the displayed interaction object or a metaphor-based environment, e.g. interaction with desktop elements like windows or icons, or assisted by a cursor's changing behaviour or appearance
- G06F3/0482—Interaction with lists of selectable items, e.g. menus
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
- G06F3/167—Audio in a user interface, e.g. using voice commands for navigating, audio feedback
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/226—Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics
- G10L2015/228—Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics of application context
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- General Engineering & Computer Science (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Multimedia (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- General Health & Medical Sciences (AREA)
- User Interface Of Digital Computer (AREA)
- Circuits Of Receivers In General (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
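A voice information system is disclosed. The invention is generally applicable to updatable audio information (for example, menus). Although a device may have some preinstalled menu components, other menu components can be received from a server. Each menu component, whether original or received from the server, has an associated voice name. When the user highlights a menu option, its voice name can be played. The user then has the choice of selecting that menu option or scrolling to a new one. In this way the user can navigate the menu without actually looking at a visual display, which may be particularly useful for users who cannot see the display or who are visually impaired.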
Description
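Technical Field
The present invention relates to media players and, more specifically, to providing spoken information on a media player.
Background
In the information age, the ability of computers to share information is very important. A network is the mechanism by which computers communicate with one another. In general, a device that provides resources is called a server, and a device that uses those resources is called a client. Depending on the type of network, a device may be dedicated to one type of task, or may act as both client and server, depending on whether it is offering or requesting resources.
Increasingly, the kinds of resources people want to share relate to entertainment. Specifically, music, movies, pictures and printed matter are all types of entertainment-related media that a user may want to access over a network. For example, although a music library may reside on a desktop computer, the media owner may want to listen to the music on a portable media player.
To achieve portability, many portable media players use minimalist displays that let the user access music through a simple graphical user interface. The display is not always well lit and may not be navigable in the dark. Moreover, there are situations (for example, while driving) in which it is inconvenient or inappropriate for the user to look at the display, and a user may have a disability that makes visual navigation of the menu impossible. In addition, many people simply find the display too small and inconvenient to use in the conventional way.
Although the described technologies work well in many applications, continued effort is needed to further improve the user experience.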
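Summary of the Invention
The present invention provides a method for providing audio information. In one embodiment, the audio information pertains to an audio menu. First, text strings are provided on a server, each text string capable of representing a menu option. Next, audio files are generated, each representing the voice name of one of the text strings, and each audio file is associated with its text string. The server then transfers the audio files and their associations to a client.
A menu containing the menu options represented by the text strings is then presented on the client; the menu options can be highlighted or selected. When a menu option associated with an audio file is highlighted, that audio file is played on the client.
In another aspect of the invention, a server comprising a processor, memory and a network interface is provided. The server's processor is operable to execute instructions, including instructions for providing a text string. It is also operable to execute other instructions, such as generating an audio file that is an audio representation of the text string and transferring the audio file to a client device. In one embodiment, the text string represents a menu component. A menu component may be one of several options selectable from a menu on the client device. In one embodiment, the client device is a media player, for example a handheld media player.
In yet another aspect of the invention, a client device comprising a processor, memory and a network interface is provided. The client's processor is operable to execute instructions for receiving from a server an audio file that is an audio representation of a menu component, the menu component being one of several options selectable from a menu. The client's processor is also operable to execute instructions for updating the menu to include the menu component and for playing the audio file when the menu component is highlighted.
In still another aspect of the invention, a media management system is provided. The media management system includes a media database, media collection records, media records, a voice name database and string association records. The media database stores media files. The media collection records contain data relating to groupings of the media files. The media records contain metadata relating to the media files. The voice name database stores audio files. The string association records associate the audio files with the data in the media collection records and with the metadata in the media records.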
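Brief Description of the Drawings
The invention may best be understood by reference to the following description taken in conjunction with the accompanying drawings, in which:
Figure 1 is a block diagram illustrating an exemplary environment in which the invention may be implemented;
Figure 2 is a block diagram illustrating the organization of a media management system according to an embodiment of the invention;
Figure 3 is a flowchart illustrating general steps that may be used in connection with an embodiment of the invention;
Figure 4 is a flowchart illustrating one possible method of generating a voice name according to the embodiment of the invention shown in Figure 3;
Figure 5 is a flowchart illustrating steps performed when audible menu options are activated on a client device according to an embodiment of the invention;
Figure 6 is a flowchart illustrating steps that may be performed during menu navigation according to an embodiment of the invention; and
Figure 7 is a diagram illustrating an exemplary computing device in which various embodiments of the invention may be implemented.
It should be understood that like reference numerals in the drawings designate like structural elements. It should also be understood that the depictions in the figures are not necessarily to scale.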
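Detailed Description
In the following description, numerous specific details are set forth to provide a thorough understanding of the invention. It will be apparent to those skilled in the art, however, that the invention may be practiced without some or all of these specific details. In other instances, well-known process steps are not described in detail so as not to obscure the description unnecessarily.
The present invention provides a method for providing audio information. In one embodiment, the audio information pertains to an audio menu.
The invention generally contemplates updatable audible menus. Although a device may have some preinstalled menu components, other menu components are received from a server. For example, a music player may ship with some preinstalled menu components (for example, a top menu level of "Playlists", "Songs", "Artists", "Settings" and "About") while also allowing other menu components to be added to the various menu options (for example, a user-added top-level menu "Genre", or second-level lists of available playlists, songs and artists). Each menu component, whether original or received from the server, has an associated voice name. When the user highlights a menu option, its voice name is played. The user can then select that menu option or scroll to a new one. In this way the user can navigate the menu without looking at the display.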
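Figure 1 is a block diagram illustrating an exemplary environment in which the invention may be implemented. A network 105 connects a server 110 to various clients 115, 120, 125 and 130. The network 105 is typically a data network such as a LAN, a WAN or the Internet. The server 110 may or may not be a dedicated device; in the example shown in Figure 1, the server 110 is a general-purpose computer. The clients 115, 120, 125 and 130 may be thick or thin clients with differing levels of processing power. Clients may include a portable computer 115, a desktop computer 120, a dedicated device such as the iPod™ 125 available from Apple Computer, Inc. of Cupertino, California, or even a network-aware audio/video component 130 designed to work across the network 105. Some devices, such as the iPod 125, may be connected directly to the server 110 via FireWire, USB or some other external bus that lets the client 125 and the server 110 be networked together more directly.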
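Figure 2 is a block diagram illustrating the organization of a media management system 200 according to an embodiment of the invention. The media management system 200 is a computer program that allows a user to organize and access digital media. For simplicity, the following discussion assumes that the digital media is limited to music. It should be appreciated, however, that any reference to "songs" or "music" can be generalized to any form of digital media, including sound files, picture data, movies, text files or any other type of media that can be stored digitally on a computer. Similarly, any reference to "playlists" can be generalized to media collections, including collections of mixed digital media.
Although the server 110 and the clients 115, 120, 125 and 130 may each have a different version of the media management system 200 tailored to the specific functionality that device requires, the basic components of the media management system 200 are similar. Specifically, the media management system 200 may include a media manager 205, a music database 210 and a voice name database 215. The media manager 205 manages the databases 210 and 215.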
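The music database 210 has a number of song records 220 and playlist records 225 that are used to classify, identify and/or describe the media (that is, the media items) in the music database 210. The song records 220 contain metadata about each media item available in the database 210. The metadata may include, for example, the song name, artist, album, song size, song format and any other appropriate information. The type of information may of course depend on the type of media. A video file might additionally have director and producer fields but might not use the album field.
The playlist records 225 contain information about each playlist available in the music database 210. The information for a given playlist may include identifying information for each song in that playlist. A playlist may be a collection of media in a particular order or in no particular order. Users may choose to group media by genre, mood, artist, audience or any other meaningful arrangement.
Some of the information contained in the various records 220, 225 and 230 is used as menu components. For example, top-level menu components may allow the user to navigate by "Songs", "Artists" or "Playlists". These categories may come preinstalled with the media management system 200 or, where the media management system 200 allows modification, may have been modified by the user. The user is then able to navigate to a particular media item through several different paths.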
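For example, if the user wants to access the song "Little Angel of Mine" through the "Songs" menu component, the user scrolls through the top-level options until the "Songs" menu component is highlighted. Once it is highlighted, the user selects "Songs" and is presented with a second-level list of menu components. This second-level list might simply be an alphabetical list of all songs available to the user, with each song being a second-level menu component. Typically, none of these second-level menu components is preinstalled; they depend entirely on the user's particular musical preferences. The user scrolls through the songs until "Little Angel of Mine" is highlighted and then selects that menu component to play the song.
Alternatively, if the user wants to access the song through "Artists", the user scrolls through the top level of menu components until "Artists" is highlighted and then selects "Artists" to be presented with a second level of menu components. The user scrolls through the alphabetical list of artists until the group "No Secrets" is highlighted. Selecting the "No Secrets" second-level menu component takes the user to a third level of menu components listing all songs performed by the group "No Secrets". The song "Little Angel of Mine" would then be among the third-level menu components.
Another alternative way to navigate to the song is through a user-defined playlist. Selecting the top-level menu component "Playlists" takes the user to a second-level list of all playlists the user has created. The song "Little Angel of Mine" might be listed under several different playlists. For example, a "Stuart Little 2 Soundtrack" playlist or a "Songs Written by Orrin Hatch" playlist might contain the song. Selecting either of these second-level menu components takes the user to a third-level list of the songs in that playlist.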
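The three navigation paths described above can be sketched as a menu tree built from song and playlist records. This is a minimal illustration only: the record layout, the `build_menu` and `paths_to` helpers, and the sample data are assumptions made for the example, not structures defined in the patent.

```python
from collections import defaultdict

# Hypothetical records; the field names are illustrative assumptions.
SONGS = [{"title": "Little Angel of Mine", "artist": "No Secrets"}]
PLAYLISTS = {
    "Stuart Little 2 Soundtrack": ["Little Angel of Mine"],
    "Songs Written by Orrin Hatch": ["Little Angel of Mine"],
}


def build_menu(songs, playlists):
    """Build top-level components and their second/third-level lists."""
    by_artist = defaultdict(list)
    for song in songs:
        by_artist[song["artist"]].append(song["title"])
    return {
        "Songs": sorted(song["title"] for song in songs),
        "Artists": {a: sorted(t) for a, t in sorted(by_artist.items())},
        "Playlists": {name: list(titles) for name, titles in playlists.items()},
    }


def paths_to(menu, title):
    """List every menu path that ends at the given song title."""
    paths = []
    if title in menu["Songs"]:
        paths.append(["Songs", title])
    for top in ("Artists", "Playlists"):
        for name, titles in menu[top].items():
            if title in titles:
                paths.append([top, name, title])
    return paths
```

With the sample data, `paths_to(build_menu(SONGS, PLAYLISTS), "Little Angel of Mine")` yields one path through "Songs", one through "Artists" and one through each playlist.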
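Each of the menu components described is derived directly from the records 220 and 225. Associated with each menu component is an audio representation of that component. In the preceding examples, "Songs", "Artists", "Playlists", "No Secrets", "Stuart Little 2 Soundtrack", "Songs Written by Orrin Hatch" and "Little Angel of Mine" all need associated pronunciations so that the user can navigate the menu without any visual cue.
One mechanism for holding the pronunciations is the voice name database 215. The voice name database 215 contains an audio file for each pronunciation, together with a number of records 230 that hold the associations between the audio files and their corresponding menu components. Although other mechanisms could be used (for example, embedding the pronunciations in the song records 220 and playlist records 225, so that no voice name database 215 is needed), a separate voice name database 215 allows a single pronunciation to be used regardless of how the user navigates to a particular menu component.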
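A separate voice name store with association records might be sketched as follows. The class name, its fields and the use of an audio id as the link between a text string and its audio file are assumptions made for illustration; the patent does not specify a storage layout.

```python
from dataclasses import dataclass, field


@dataclass
class VoiceNameDatabase:
    """Hypothetical stand-in for the voice name database 215 and records 230."""

    audio_files: dict = field(default_factory=dict)   # audio id -> audio bytes
    associations: dict = field(default_factory=dict)  # text string -> audio id

    def associate(self, text, audio_id, audio_bytes):
        """Store an audio file and link it to a menu component's text string."""
        self.audio_files[audio_id] = audio_bytes
        self.associations[text] = audio_id

    def lookup(self, text):
        """Return the audio for a text string, or None if no voice name exists."""
        audio_id = self.associations.get(text)
        return None if audio_id is None else self.audio_files[audio_id]
```

Because the lookup is keyed by the text string alone, the same pronunciation of "No Secrets" is found whether the user arrives through "Artists" or through a playlist.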
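Figure 3 is a flowchart illustrating general steps that may be performed in connection with an embodiment of the invention. At step 305, a text string representing a new menu component is introduced to the server 110. The introduction may occur when the user manually enters a new item, such as a new playlist, or it may occur automatically, for example when a new song file is purchased bundled with a song record 220.
At step 310, an audio file of the voice name for the menu component is generated if necessary. No voice name need be generated if the purchased song includes one or if the voice name already exists in the voice name database 215. For example, if the user already has a voice name for "The Beatles", an identical voice name need not be created each time a new Beatles song is added to the music database 210.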
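The "generate only if necessary" rule of step 310 amounts to a lookup before synthesis. The sketch below assumes a plain dictionary as the voice name store and a caller-supplied `synthesize` function standing in for whatever text-to-speech tool is used; both are illustrative assumptions.

```python
def ensure_voice_name(text, voice_names, synthesize):
    """Return the audio for `text`, generating it only if none exists yet.

    voice_names: dict mapping text strings to audio (hypothetical store)
    synthesize:  callable text -> audio (hypothetical text-to-speech stand-in)
    """
    if text not in voice_names:
        voice_names[text] = synthesize(text)
    return voice_names[text]
```

Adding a second Beatles song therefore calls `synthesize` for the new song title but not again for "The Beatles".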
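Figure 4 is a flowchart illustrating the detailed steps involved in generating a voice name according to an embodiment of the invention. At step 405, the media management system 200 receives a trigger to create a voice name. Typically the trigger arises when a new menu component is created by the introduction of a new song record 220 or a new playlist record 225. However, if the voice name option was previously turned off, turning it on for the first time produces a trigger telling the media management system 200 that voice names are needed.
Once a trigger has been generated, the media management system 200 determines at step 410 whether a voice name already exists for the particular string. If none exists, the server 110 can use a standard text-to-speech tool at step 415 to generate an audio file. Preferably, the files are also compressed to save space. One widely used codec for encoding and compressing speech is Qualcomm PureVoice, available from Qualcomm Inc. of San Diego, California.
Once an audio file has been created, the server 110 may optionally play the voice name back at step 420 so that the user can hear the audio file. At step 425, the user may choose to approve or reject the pronunciation. If the user approves it, the media management system 200 creates the appropriate string association record 230 at step 430 so that the audio file is associated with the appropriate menu component.
If the user does not approve the pronunciation at step 425, the user may choose at step 435 to modify the text that the text-to-speech tool uses to create the voice name. The text entered by the user can optionally be kept independent of the menu component, allowing the user to adjust how the menu component is pronounced without changing the actual text used in the records 220 and 225, so that the menu component is correct in both spelling and pronunciation. The new pronunciation is then played to the user at step 420, giving the user the opportunity to approve it.
Alternatively, if the user does not choose to change the text at 435, the media management system 200 may allow the user at 440 to record his or her own pronunciation or to supply another audio file. The user's own voice can thus be used later for menu navigation.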
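The approval loop of Figure 4 can be sketched as a single function. All four callables are hypothetical stand-ins for user interaction and the text-to-speech tool, and the sketch assumes that a user recording made at step 440 is accepted without a further approval round, which the description does not state either way.

```python
def create_voice_name(menu_text, voice_names, synthesize, approve, revise_text, record):
    """Sketch of steps 410 to 440; every callable is a hypothetical stand-in.

    synthesize(text) -> audio      text-to-speech conversion (step 415)
    approve(audio) -> bool         play back and ask the user (steps 420 and 425)
    revise_text(text) -> str|None  alternative spoken text, or None (step 435)
    record() -> audio              user's own recording or other file (step 440)
    """
    if menu_text in voice_names:           # step 410: a voice name already exists
        return voice_names[menu_text]
    spoken_text = menu_text                # spoken text may diverge from menu text
    audio = synthesize(spoken_text)
    while not approve(audio):
        revised = revise_text(spoken_text)
        if revised is None:                # user declined to change the text
            audio = record()               # assumed to be accepted as-is
            break
        spoken_text = revised
        audio = synthesize(spoken_text)
    voice_names[menu_text] = audio         # step 430: keyed by the menu text
    return audio
```

Note that the association is stored under the original menu text, so a revised spoken text changes only the audio, not the spelling shown in the menu.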
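Referring again to Figure 3, after the audio file for the voice name has been created at step 310, the server 110 transfers all new files to the client device 115, 120, 125 or 130 at step 315. Typically, the contents of the voice name database 215 and the string association records 230 are transferred when the user downloads the music database 210 and its associated records 220 and 225 from the server 110 to the client device 115, 120, 125 or 130. However, there is no reason why the voice name database 215 and the association records 230 could not be transferred independently of the music database 210 and its records 220 and 225.
At step 320, the client device 115, 120, 125 or 130 receives the audio files together with all appropriate new menu components. Once they are received, the menus on the client's media management system 200 are updated at step 325 to reflect any changes. Then, at step 330, whenever the user highlights any menu component, the appropriate audio file is played back, letting the user navigate the menu by audio cues.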
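On the client side, steps 320 to 330 reduce to storing the received audio, extending the menu, and playing audio on highlight. The `ClientMenu` class below is an illustrative assumption; it records "played" audio in a list rather than driving real audio hardware.

```python
class ClientMenu:
    """Hypothetical client-side menu with voice names."""

    def __init__(self, preinstalled):
        # text string -> audio for components shipped with the device
        self.voice_names = dict(preinstalled)
        self.items = list(preinstalled)
        self.played = []  # stands in for real audio output

    def receive(self, new_voice_names):
        """Steps 320 and 325: store received audio and add new menu components."""
        for text, audio in new_voice_names.items():
            if text not in self.voice_names:
                self.items.append(text)
            self.voice_names[text] = audio

    def highlight(self, text):
        """Step 330: play the associated audio file when a component is highlighted."""
        audio = self.voice_names.get(text)
        if audio is not None:
            self.played.append(audio)
```

A component without a voice name is simply highlighted silently in this sketch.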
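Typically, the media management system 200 lets the user choose whether to turn audible menus on or off. Figure 5 is a flowchart illustrating steps that may be performed when setting the audible menu options according to an embodiment of the invention. At step 505, the user may optionally select a language option. The language option allows the preinstalled menu components to be presented in other languages. For example, the "Songs" menu component could be presented to the user as "Canciones" in Spanish, "Chansons" in French or "Canzoni" in Italian. The English version of the voice name would then no longer be appropriate and can be replaced with the appropriate foreign-language pronunciation. Foreign-language pronunciations may be preinstalled in the media management system 200 or may need to be downloaded from the server 110. Typically, once the language options are set they are not changed.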
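The language option can be pictured as a label table plus a check for which localized voice names still have to be fetched. The table layout, the language codes, the helper names and the fallback to English are all assumptions for illustration; only the four "Songs" labels come from the description.

```python
# Labels for one preinstalled component; keys and layout are illustrative.
LABELS = {"songs": {"en": "Songs", "es": "Canciones", "fr": "Chansons", "it": "Canzoni"}}


def localized_label(component_id, language, labels=LABELS):
    """Return the label for a preinstalled component, falling back to English."""
    names = labels[component_id]
    return names.get(language, names["en"])


def missing_voice_names(language, preinstalled_audio, labels=LABELS):
    """List localized labels whose audio is not on the device yet."""
    wanted = [localized_label(cid, language, labels) for cid in labels]
    return [label for label in wanted if label not in preinstalled_audio]
```

Any label returned by `missing_voice_names` would correspond to a pronunciation that needs to be downloaded from the server.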
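At step 510, the user activates the audible menu feature. Although this may cause the client device 115, 120, 125 or 130 to use predefined settings, the user can also be presented with various customization options. For example, at step 515 the user can choose whether music plays while browsing the menus. Once the user has selected a song to play, the user may want to queue up another song while listening to the first selection. The user can therefore be given the option of having voice names presented while the first selected song is playing. If the user does not want music to play during menu navigation, the system can be set at 520 to pause or mute.
If the user wants to hear music while navigating the menus, the user may be allowed at step 525 to mix the music with the voice names. Mixing is accomplished simply by playing the audio file over the currently playing song. If mixing is desired, the mix option is set at step 530. If mixing is not desired but the user still wants music to play during menu navigation, the media management system 200 may at step 535, by setting a mono option, play the music in one channel (the left or right speaker) and the voice names in the other. Thus, when the user is wearing headphones, the voice name is presented in one ear without interrupting the music playing in the other. In addition, even if the user has selected the mix option at step 530 or the pause-music option at step 520, the user may still have reason to choose, at step 540, to output voice names in a single channel.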
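The pause, mix and mono options can be compared on plain sample lists. This sketch assumes stereo music as `(left, right)` float frames and a mono voice name at the same sample rate; the mode names, the choice of the right channel for the voice, and the averaging of the music into the left channel are illustrative assumptions. Clipping after mixing is not handled.

```python
def render(music, voice, mode):
    """Combine stereo music frames [(left, right), ...] with mono voice samples.

    mode: "pause" -> voice only, music silenced (step 520)
          "mix"   -> voice added to both channels (step 530)
          "mono"  -> music in the left channel, voice in the right (step 535)
    """
    out = []
    for i, (left, right) in enumerate(music):
        v = voice[i] if i < len(voice) else 0.0  # silence once the voice name ends
        if mode == "pause":
            frame = (v, v)
        elif mode == "mix":
            frame = (left + v, right + v)
        elif mode == "mono":
            frame = ((left + right) / 2, v)
        else:
            raise ValueError(f"unknown mode: {mode}")
        out.append(frame)
    return out
```

In "mono" mode the music never stops in the left ear, which matches the headphone scenario described above.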
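Once all audible menu features have been set, the client device 115, 120, 125 or 130 is ready to use voice names during menu navigation. Figure 6 is a flowchart illustrating steps that may be performed during menu navigation according to an embodiment of the invention.
The menu is activated at step 605. If the menu is always active, activation may be unnecessary, but some client devices 115, 120, 125 or 130 put the menu to sleep after a period of inactivity. Typically the menu is woken by pressing a navigation control. Navigation controls may include a dial, buttons, a touch screen or any other convenient input mechanism. They may be present on the client device 115, 120, 125 or 130 or provided by a remote control. It should be noted that many remote controls have no visual display at all, so menu navigation becomes inconvenient if the visual display on the client device 115, 120, 125 or 130 must be used.
Once the menu is active, the media management system 200 determines at step 610 whether a menu component has been highlighted for a sufficient time. It can be annoying for a user scrolling through menu components to hear the beginning of each voice name, only for it to be interrupted by the next component's voice name, and then by the next. Preferably, the media management system 200 applies a short delay so that the user can scroll rapidly through the options without this annoyance. At 615, the media management system 200 waits until the user stops scrolling and pauses on a single menu component long enough to allow the voice name to be played at 620. The delay need not be long, generally no more than a few seconds, and may be only a fraction of a second.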
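The dwell check of steps 610 to 620 behaves like a debounce: a voice name plays only after the highlight has rested on one component for the delay period. The class below is a sketch with timestamps passed in explicitly (in milliseconds) so it needs no real timer; the class name, the polling `tick` method and the `play` callback are assumptions for illustration.

```python
class DwellAnnouncer:
    """Play a voice name only after a component stays highlighted long enough."""

    def __init__(self, delay_ms, play):
        self.delay_ms = delay_ms
        self.play = play          # callable taking the highlighted item
        self.current = None
        self.since = None
        self.announced = False

    def highlight(self, item, now_ms):
        """The user scrolled to a new component; restart the dwell timer."""
        self.current, self.since, self.announced = item, now_ms, False

    def tick(self, now_ms):
        """Called periodically; announces once the dwell time has elapsed."""
        if self.current is None or self.announced:
            return
        if now_ms - self.since >= self.delay_ms:
            self.play(self.current)
            self.announced = True
```

Scrolling quickly past "Songs" to "Artists" restarts the timer, so only "Artists" is spoken once the user pauses there.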
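At 625, the user then has the option of navigating to a new menu component and beginning the process again. Navigation may be accomplished by scrolling or, if the currently highlighted menu component leads to another menu level, by selecting the current component. Alternatively, the process may end if the user simply stops navigating the menu or selects a menu component that does not lead to further menu options (for example, one that plays a song).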
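In general, the methods of the invention may be implemented in software and/or hardware. For example, they may be implemented in an operating system, in a separate user process, in a library package bound into an application, or on a specially constructed device. In a particular embodiment of the invention, the methods are implemented in software such as an operating system and/or an application running on an operating system.
A software or hybrid software/hardware implementation of the techniques of the invention may be realized on a general-purpose programmable device that is selectively activated or reconfigured by a computer program stored in memory. In alternative embodiments, the methods of the invention may be implemented on a general-purpose network host such as a personal computer, workstation or server. Moreover, the invention may be at least partially implemented on a general-purpose computing device.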
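Referring now to Figure 7, a computing device 700 suitable for implementing the techniques of the invention includes a main central processing unit (CPU) 705, interfaces 710, memory 715 and a bus 720. When operating under the control of appropriate software or firmware, the CPU 705 may be responsible for carrying out the specific functions associated with the desired computing device. Preferably, the CPU 705 accomplishes all of these functions under the control of software including an operating system (for example, Mac OS X) and any suitable application software (for example, iTunes).
The CPU 705 may include one or more processors, such as those from the Motorola or MIPS families of microprocessors. In alternative embodiments, the processor is specially designed hardware for controlling the operation of the computing device 700.
The interfaces 710 are typically provided as interface cards. In general, they control the sending and receiving of data packets over the network and sometimes support other peripherals used with the computing device 700. Interfaces that may be provided include Ethernet, frame relay, cable, DSL and token ring interfaces, among others. In addition, various very-high-speed interfaces may be provided, such as Fast Ethernet, Gigabit Ethernet, ATM, HSSI, POS, FDDI, ASI, DHEI, FireWire and USB interfaces. In general, these interfaces may include ports suitable for communicating with the appropriate media. In some cases they may also include an independent processor and, in some instances, volatile RAM.
Regardless of the computing device's configuration, one or more memories or memory modules (for example, memory 715) may be employed, configured to store data, program instructions and/or other information relating to the functionality of the techniques described herein. The program instructions may, for example, control the operation of an operating system and/or one or more applications.
Because such information and program instructions may be used to implement the systems and methods described herein, the invention relates to machine-readable (for example, computer-readable) media that include program instructions, state information and the like for performing the various operations described herein. Examples of machine-readable media include, but are not limited to, magnetic media such as hard disks, floppy disks and magnetic tape; optical media such as CD-ROM disks; magneto-optical media such as floptical disks; and hardware devices specially configured to store program instructions, such as read-only memory (ROM) devices and random-access memory (RAM). The invention may also be embodied in a carrier wave traveling over an appropriate medium such as airwaves, optical lines or electric lines. Examples of program instructions include machine code, such as that produced by a compiler, and higher-level code that may be executed by the computer (for example, using an interpreter).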
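Although illustrative embodiments and applications of the invention are shown and described herein, many variations and modifications are possible that remain within the concept, scope and spirit of the invention, and these would become clear to those skilled in the art after perusal of this application. For example, the terms "scrolling" and "highlighting", as used in the context of menus, are not limited to their literal interpretations. Menu options can be "scrolled" on a single line by replacing one menu component with the next. Likewise, a menu option is "highlighted" even if it is merely italicized, bolded or marked with a bullet. Accordingly, the presented embodiments are to be considered illustrative and not restrictive, and the invention is not limited to the details given herein but may be modified within the scope and equivalents of the appended claims.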
Claims (24)
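1. A method for providing an audible menu, comprising:
providing text strings on a server, each text string capable of representing a menu option;
generating audio files, each audio file representing a voice name of one of the text strings;
associating each audio file with its corresponding text string;
transferring the audio files from the server to a client;
presenting on the client a menu comprising the menu options represented by the text strings, the menu options being capable of being highlighted or selected; and
playing the audio file on the client when its associated menu option is highlighted.
2. The method of claim 1, further comprising:
providing a remote control capable of navigating through the menu on the client.
3. The method of claim 1, wherein:
the voice name is in a language other than English.
4. The method of claim 1, wherein:
the client is capable of playing music; and
playing the audio file while music is playing does not stop playback of the music.
5. The method of claim 4, wherein:
the client produces audio output in at least two channels; and
the audio file is output through only one channel.
6. The method of claim 5, wherein:
exactly two channels are used for the client's audio output, the two channels being a left channel and a right channel.
7. The method of claim 4, wherein:
the audio file is mixed with the music while the music is playing.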
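8. A method of creating, on a server computer, an audio representation for use on a client device, comprising:
providing a text string;
generating an audio file that is an audio representation of the text string; and
transferring the audio file to a client device.
9. The method of claim 8, wherein: the text string pertains to a menu component, the menu component being one of several options selectable from a menu displayed on the client device.
10. The method of claim 8, wherein: the client device is a media player and the text string pertains to a media item.
11. The method of any one of claims 8 to 10, further comprising:
playing the audio file; and
requesting approval of the played audio file before transferring the audio file to the client device.
12. The method of claim 11, wherein:
the audio file is generated by a text-to-speech conversion algorithm.
13. The method of claim 12, wherein:
if approval is not obtained, an opportunity to modify the text string is provided; and
if the text string is modified, the audio file is replaced with a new audio file generated from the modified text string;
the audio file is played; and
approval of the played audio file is requested.
14. The method of claim 13, wherein:
if the text string is not modified, an opportunity is provided to replace the audio file with a new audio file generated from a recording.
15. The method of any one of claims 8 to 10, wherein:
generating the audio file includes at least compressing the audio file.
16. The method of any one of claims 8 to 10, wherein:
transferring the audio file includes embedding the audio file in metadata.
17. The method of any one of claims 8 to 10, further comprising:
determining whether the audio file is present on the client device;
wherein the transfer of the audio file is performed only if the audio file is not present on the client device.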
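18. A server, comprising:
a processor; and
memory operatively coupled to the processor;
wherein the processor is operable to execute instructions comprising:
providing a text string representing a menu component, the menu component being one of several options selectable from a menu on a client device;
generating an audio file that is an audio representation of the menu component; and
transferring the audio file to the client device.
19. A method of using an audio file in a menu, comprising:
receiving from a server an audio file that is an audio representation of a menu component, the menu component being one of several options selectable from the menu;
updating the menu to include the menu component; and
playing the audio file when the menu component is highlighted.
20. The method of claim 19, wherein:
the menu includes menu components that were not received from the server; and
preinstalled audio files are associated with the menu components that were not received from the server.
21. The method of claim 19, wherein:
the audio file is played only after the menu component has been highlighted for a predetermined period of time.
22. A client device, comprising:
a processor; and
memory operatively coupled to the processor;
wherein the processor is operable to execute instructions comprising:
receiving from a server an audio file that is an audio representation of a text string;
storing in the memory the audio file in association with the corresponding text string; and
playing the audio file when the corresponding text string is displayed.
23. A media management system, comprising:
a media database storing media files;
media collection records containing data relating to groupings of the media files;
media records containing metadata relating to the media files;
a voice name database storing audio files; and
string association records associating the audio files with the data in the media collection records and with the metadata in the media records.
24. The media management system of claim 23, wherein:
the media management system runs on a portable digital music player.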
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US10/623,339 US7757173B2 (en) | 2003-07-18 | 2003-07-18 | Voice menu system |
US10/623,339 | 2003-07-18 |
Publications (1)
Publication Number | Publication Date |
---|---|
CN1849579A (zh) | 2006-10-18 |
Family
ID=34063359
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNA2004800262085A Pending CN1849579A (zh) | 2003-07-18 | 2004-05-25 | Voice information system |
Country Status (4)
Country | Link |
---|---|
US (1) | US7757173B2 (zh) |
EP (1) | EP1646936A2 (zh) |
CN (1) | CN1849579A (zh) |
WO (1) | WO2005015382A2 (zh) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
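CN101458093B (zh) * | 2007-12-12 | 2011-12-07 | Xanavi Informatics Corp. | Navigation device |
CN101419528B (zh) * | 2007-10-24 | 2012-08-29 | Brother Industries, Ltd. | Data processing device |
CN113766414A (zh) * | 2013-04-03 | 2021-12-07 | Dolby Laboratories Licensing Corp. | Methods and systems for interactive rendering of object-based audio |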
Families Citing this family (270)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8645137B2 (en) | 2000-03-16 | 2014-02-04 | Apple Inc. | Fast, language-independent method for user authentication by voice |
US8046689B2 (en) * | 2004-11-04 | 2011-10-25 | Apple Inc. | Media presentation with supplementary media |
US6934812B1 (en) * | 2001-10-22 | 2005-08-23 | Apple Computer, Inc. | Media player with instant play capability |
US8151259B2 (en) | 2006-01-03 | 2012-04-03 | Apple Inc. | Remote content updates for portable media devices |
US7433546B2 (en) * | 2004-10-25 | 2008-10-07 | Apple Inc. | Image scaling arrangement |
US8372112B2 (en) * | 2003-04-11 | 2013-02-12 | St. Jude Medical, Cardiology Division, Inc. | Closure devices, related delivery methods, and related methods of use |
US7724716B2 (en) | 2006-06-20 | 2010-05-25 | Apple Inc. | Wireless communication system |
US7831199B2 (en) | 2006-01-03 | 2010-11-09 | Apple Inc. | Media data exchange, transfer or delivery for portable electronic devices |
US7653542B2 (en) * | 2004-05-26 | 2010-01-26 | Verizon Business Global Llc | Method and system for providing synthesized speech |
TWI254576B (en) * | 2004-10-22 | 2006-05-01 | Lite On It Corp | Auxiliary function-switching method for digital video player |
US7706637B2 (en) * | 2004-10-25 | 2010-04-27 | Apple Inc. | Host configured for interoperation with coupled portable media player device |
US7593782B2 (en) * | 2005-01-07 | 2009-09-22 | Apple Inc. | Highly portable media device |
US8300841B2 (en) | 2005-06-03 | 2012-10-30 | Apple Inc. | Techniques for presenting sound effects on a portable media player |
US7424431B2 (en) * | 2005-07-11 | 2008-09-09 | Stragent, Llc | System, method and computer program product for adding voice activation and voice control to a media player |
US8977636B2 (en) | 2005-08-19 | 2015-03-10 | International Business Machines Corporation | Synthesizing aggregate data of disparate data types into data of a uniform data type |
US7590772B2 (en) | 2005-08-22 | 2009-09-15 | Apple Inc. | Audio status information for a portable electronic device |
US7417202B2 (en) * | 2005-09-02 | 2008-08-26 | White Electronic Designs Corporation | Switches and systems employing the same to enhance switch reliability and control |
US7439465B2 (en) * | 2005-09-02 | 2008-10-21 | White Electronics Designs Corporation | Switch arrays and systems employing the same to enhance system reliability |
US8677377B2 (en) | 2005-09-08 | 2014-03-18 | Apple Inc. | Method and apparatus for building an intelligent automated assistant |
US8266220B2 (en) * | 2005-09-14 | 2012-09-11 | International Business Machines Corporation | Email management and rendering |
US7930369B2 (en) | 2005-10-19 | 2011-04-19 | Apple Inc. | Remotely configured media device |
US8694319B2 (en) | 2005-11-03 | 2014-04-08 | International Business Machines Corporation | Dynamic prosody adjustment for voice-rendering synthesized data |
US8654993B2 (en) | 2005-12-07 | 2014-02-18 | Apple Inc. | Portable audio device providing automated control of audio volume parameters for hearing protection |
US8255640B2 (en) | 2006-01-03 | 2012-08-28 | Apple Inc. | Media device with intelligent cache utilization |
US7673238B2 (en) * | 2006-01-05 | 2010-03-02 | Apple Inc. | Portable media device with video acceleration capabilities |
US8271107B2 (en) | 2006-01-13 | 2012-09-18 | International Business Machines Corporation | Controlling audio operation for data management and data rendering |
US9135339B2 (en) | 2006-02-13 | 2015-09-15 | International Business Machines Corporation | Invoking an audio hyperlink |
US20070192683A1 (en) * | 2006-02-13 | 2007-08-16 | Bodin William K | Synthesizing the content of disparate data types |
US7996754B2 (en) * | 2006-02-13 | 2011-08-09 | International Business Machines Corporation | Consolidated content management |
US7505978B2 (en) * | 2006-02-13 | 2009-03-17 | International Business Machines Corporation | Aggregating content of disparate data types from disparate data sources for single point access |
US20070192674A1 (en) * | 2006-02-13 | 2007-08-16 | Bodin William K | Publishing content through RSS feeds |
US7848527B2 (en) * | 2006-02-27 | 2010-12-07 | Apple Inc. | Dynamic power management in a portable media delivery system |
US9092542B2 (en) * | 2006-03-09 | 2015-07-28 | International Business Machines Corporation | Podcasting content associated with a user account |
US9361299B2 (en) * | 2006-03-09 | 2016-06-07 | International Business Machines Corporation | RSS content administration for rendering RSS content on a digital audio player |
US8849895B2 (en) * | 2006-03-09 | 2014-09-30 | International Business Machines Corporation | Associating user selected content management directives with user selected ratings |
US8607149B2 (en) * | 2006-03-23 | 2013-12-10 | International Business Machines Corporation | Highlighting related user interface controls |
US8073984B2 (en) * | 2006-05-22 | 2011-12-06 | Apple Inc. | Communication protocol for use with portable electronic devices |
US9137309B2 (en) | 2006-05-22 | 2015-09-15 | Apple Inc. | Calibration techniques for activity sensing devices |
US7643895B2 (en) | 2006-05-22 | 2010-01-05 | Apple Inc. | Portable media device with workout support |
US20070270663A1 (en) * | 2006-05-22 | 2007-11-22 | Apple Computer, Inc. | System including portable media player and physiologic data gathering device |
US20070271116A1 (en) * | 2006-05-22 | 2007-11-22 | Apple Computer, Inc. | Integrated media jukebox and physiologic data handling application |
US8358273B2 (en) | 2006-05-23 | 2013-01-22 | Apple Inc. | Portable media device with power-managed display |
US7596765B2 (en) * | 2006-05-23 | 2009-09-29 | Sony Ericsson Mobile Communications Ab | Sound feedback on menu navigation |
US7778980B2 (en) * | 2006-05-24 | 2010-08-17 | International Business Machines Corporation | Providing disparate content as a playlist of media files |
US20070277088A1 (en) * | 2006-05-24 | 2007-11-29 | Bodin William K | Enhancing an existing web page |
US8286229B2 (en) * | 2006-05-24 | 2012-10-09 | International Business Machines Corporation | Token-based content subscription |
EP2059924A4 (en) * | 2006-08-28 | 2010-08-25 | Shaul Shalev | SYSTEMS AND METHODS FOR AUDIO MARKING INFORMATION ELEMENTS FOR IDENTIFYING AND APPLYING LINKS TO INFORMATION, OR PROCESSES RELATING TO THE MARKED ELEMENTS |
US7913297B2 (en) * | 2006-08-30 | 2011-03-22 | Apple Inc. | Pairing of wireless devices using a wired medium |
US7813715B2 (en) * | 2006-08-30 | 2010-10-12 | Apple Inc. | Automated pairing of wireless accessories with host devices |
US9318108B2 (en) | 2010-01-18 | 2016-04-19 | Apple Inc. | Intelligent automated assistant |
US8090130B2 (en) * | 2006-09-11 | 2012-01-03 | Apple Inc. | Highly portable media devices |
US8341524B2 (en) * | 2006-09-11 | 2012-12-25 | Apple Inc. | Portable electronic device with local search capabilities |
US7729791B2 (en) | 2006-09-11 | 2010-06-01 | Apple Inc. | Portable media playback device including user interface event passthrough to non-media-playback processing |
US8036766B2 (en) * | 2006-09-11 | 2011-10-11 | Apple Inc. | Intelligent audio mixing among media playback and at least one other non-playback application |
US7831432B2 (en) * | 2006-09-29 | 2010-11-09 | International Business Machines Corporation | Audio menus describing media contents of media players |
US9196241B2 (en) * | 2006-09-29 | 2015-11-24 | International Business Machines Corporation | Asynchronous communications using messages recorded on handheld devices |
US8001400B2 (en) * | 2006-12-01 | 2011-08-16 | Apple Inc. | Power consumption management for functional preservation in a battery-powered electronic device |
US8219402B2 (en) | 2007-01-03 | 2012-07-10 | International Business Machines Corporation | Asynchronous receipt of information from a user |
US9318100B2 (en) * | 2007-01-03 | 2016-04-19 | International Business Machines Corporation | Supplementing audio recorded in a media file |
US8132104B2 (en) * | 2007-01-24 | 2012-03-06 | Cerner Innovation, Inc. | Multi-modal entry for electronic clinical documentation |
KR20080073868A (ko) * | 2007-02-07 | 2008-08-12 | LG Electronics Inc. | Terminal and menu display method |
KR20080073869A (ko) * | 2007-02-07 | 2008-08-12 | LG Electronics Inc. | Terminal and menu display method |
US20080194175A1 (en) * | 2007-02-09 | 2008-08-14 | Intellitoys Llc | Interactive toy providing, dynamic, navigable media content |
CN101247247B (zh) * | 2007-02-15 | 2012-06-27 | Huawei Technologies Co., Ltd. | Method, system and server for disseminating advertisements using presence information |
US7589629B2 (en) * | 2007-02-28 | 2009-09-15 | Apple Inc. | Event recorder for portable media device |
US7698101B2 (en) * | 2007-03-07 | 2010-04-13 | Apple Inc. | Smart garment |
US8977255B2 (en) | 2007-04-03 | 2015-03-10 | Apple Inc. | Method and system for operating a multi-function portable electronic device using voice-activation |
US8010345B2 (en) * | 2007-12-18 | 2011-08-30 | International Business Machines Corporation | Providing speech recognition data to a speech enabled device when providing a new entry that is selectable via a speech recognition interface of the device |
US10002189B2 (en) | 2007-12-20 | 2018-06-19 | Apple Inc. | Method and apparatus for searching using an active ontology |
US9330720B2 (en) | 2008-01-03 | 2016-05-03 | Apple Inc. | Methods and apparatus for altering audio output signals |
US8996376B2 (en) | 2008-04-05 | 2015-03-31 | Apple Inc. | Intelligent text-to-speech conversion |
US10496753B2 (en) | 2010-01-18 | 2019-12-03 | Apple Inc. | Automatically adapting user interfaces for hands-free interaction |
US20100077322A1 (en) | 2008-05-20 | 2010-03-25 | Petro Michael Anthony | Systems and methods for a realtime creation and modification of a dynamic media player and a disabled user compliant video player |
US20100030549A1 (en) | 2008-07-31 | 2010-02-04 | Lee Michael M | Mobile device having human language translation capability with positional feedback |
US8073590B1 (en) | 2008-08-22 | 2011-12-06 | Boadin Technology, LLC | System, method, and computer program product for utilizing a communication channel of a mobile device by a vehicular assembly |
US8265862B1 (en) | 2008-08-22 | 2012-09-11 | Boadin Technology, LLC | System, method, and computer program product for communicating location-related information |
US8131458B1 (en) | 2008-08-22 | 2012-03-06 | Boadin Technology, LLC | System, method, and computer program product for instant messaging utilizing a vehicular assembly |
US8078397B1 (en) | 2008-08-22 | 2011-12-13 | Boadin Technology, LLC | System, method, and computer program product for social networking utilizing a vehicular assembly |
US8768702B2 (en) * | 2008-09-05 | 2014-07-01 | Apple Inc. | Multi-tiered voice feedback in an electronic device |
US8898568B2 (en) * | 2008-09-09 | 2014-11-25 | Apple Inc. | Audio user interface |
US8676904B2 (en) | 2008-10-02 | 2014-03-18 | Apple Inc. | Electronic devices with voice command and contextual data processing capabilities |
US20100131845A1 (en) * | 2008-11-26 | 2010-05-27 | Toyota Motor Engineering & Manufacturing North America, Inc. | Human interface of a media playing device |
WO2010067118A1 (en) | 2008-12-11 | 2010-06-17 | Novauris Technologies Limited | Speech recognition involving a mobile device |
US8862252B2 (en) * | 2009-01-30 | 2014-10-14 | Apple Inc. | Audio user interface for displayless electronic device |
CN201408397Y (zh) * | 2009-05-12 | 2010-02-17 | 李厚敦 | Single rotary button device with voice-prompted menu selection |
US20120309363A1 (en) | 2011-06-03 | 2012-12-06 | Apple Inc. | Triggering notifications associated with tasks items that represent tasks to perform |
US10241752B2 (en) | 2011-09-30 | 2019-03-26 | Apple Inc. | Interface for a virtual digital assistant |
US9858925B2 (en) | 2009-06-05 | 2018-01-02 | Apple Inc. | Using context information to facilitate processing of commands in a virtual assistant |
US10241644B2 (en) | 2011-06-03 | 2019-03-26 | Apple Inc. | Actionable reminder entries |
US9431006B2 (en) | 2009-07-02 | 2016-08-30 | Apple Inc. | Methods and apparatuses for automatic speech recognition |
US10705794B2 (en) | 2010-01-18 | 2020-07-07 | Apple Inc. | Automatically adapting user interfaces for hands-free interaction |
US10276170B2 (en) | 2010-01-18 | 2019-04-30 | Apple Inc. | Intelligent automated assistant |
US10679605B2 (en) | 2010-01-18 | 2020-06-09 | Apple Inc. | Hands-free list-reading by intelligent automated assistant |
US10553209B2 (en) | 2010-01-18 | 2020-02-04 | Apple Inc. | Systems and methods for hands-free notification summaries |
DE202011111062U1 (de) | 2010-01-25 | 2019-02-19 | Newvaluexchange Ltd. | Device and system for a digital conversation management platform |
US8879698B1 (en) | 2010-02-03 | 2014-11-04 | Tal Lavian | Device and method for providing enhanced telephony |
US8406388B2 (en) | 2011-07-18 | 2013-03-26 | Zvi Or-Bach | Systems and methods for visual presentation and selection of IVR menu |
US8594280B1 (en) | 2010-02-03 | 2013-11-26 | Zvi Or-Bach | Systems and methods for visual presentation and selection of IVR menu |
US8625756B1 (en) | 2010-02-03 | 2014-01-07 | Tal Lavian | Systems and methods for visual presentation and selection of IVR menu |
US8537989B1 (en) | 2010-02-03 | 2013-09-17 | Tal Lavian | Device and method for providing enhanced telephony |
US8548131B1 (en) | 2010-02-03 | 2013-10-01 | Tal Lavian | Systems and methods for communicating with an interactive voice response system |
US8572303B2 (en) | 2010-02-03 | 2013-10-29 | Tal Lavian | Portable universal communication device |
US8687777B1 (en) | 2010-02-03 | 2014-04-01 | Tal Lavian | Systems and methods for visual presentation and selection of IVR menu |
US9001819B1 (en) | 2010-02-18 | 2015-04-07 | Zvi Or-Bach | Systems and methods for visual presentation and selection of IVR menu |
US8553859B1 (en) | 2010-02-03 | 2013-10-08 | Tal Lavian | Device and method for providing enhanced telephony |
US8548135B1 (en) | 2010-02-03 | 2013-10-01 | Tal Lavian | Systems and methods for visual presentation and selection of IVR menu |
US8681951B1 (en) | 2010-02-03 | 2014-03-25 | Tal Lavian | Systems and methods for visual presentation and selection of IVR menu |
US8903073B2 (en) | 2011-07-20 | 2014-12-02 | Zvi Or-Bach | Systems and methods for visual presentation and selection of IVR menu |
US8682667B2 (en) | 2010-02-25 | 2014-03-25 | Apple Inc. | User profiling for selecting user specific voice input processing information |
US20130318553A1 (en) * | 2010-02-26 | 2013-11-28 | Echostar Ukraine, L.L.C. | System and methods for enhancing operation of a graphical user interface |
US10762293B2 (en) | 2010-12-22 | 2020-09-01 | Apple Inc. | Using parts-of-speech tagging and named entity recognition for spelling correction |
US9262612B2 (en) | 2011-03-21 | 2016-02-16 | Apple Inc. | Device access using voice authentication |
US10057736B2 (en) | 2011-06-03 | 2018-08-21 | Apple Inc. | Active transport based notifications |
US8994660B2 (en) | 2011-08-29 | 2015-03-31 | Apple Inc. | Text correction processing |
US10134385B2 (en) | 2012-03-02 | 2018-11-20 | Apple Inc. | Systems and methods for name pronunciation |
US8731148B1 (en) | 2012-03-02 | 2014-05-20 | Tal Lavian | Systems and methods for visual presentation and selection of IVR menu |
US8867708B1 (en) | 2012-03-02 | 2014-10-21 | Tal Lavian | Systems and methods for visual presentation and selection of IVR menu |
US9483461B2 (en) | 2012-03-06 | 2016-11-01 | Apple Inc. | Handling speech synthesis of content for multiple languages |
US9280610B2 (en) | 2012-05-14 | 2016-03-08 | Apple Inc. | Crowd sourcing information to fulfill user requests |
US10417037B2 (en) | 2012-05-15 | 2019-09-17 | Apple Inc. | Systems and methods for integrating third party services with a digital assistant |
US9721563B2 (en) | 2012-06-08 | 2017-08-01 | Apple Inc. | Name recognition system |
US9495129B2 (en) | 2012-06-29 | 2016-11-15 | Apple Inc. | Device, method, and user interface for voice-activated navigation and browsing of a document |
US9576574B2 (en) | 2012-09-10 | 2017-02-21 | Apple Inc. | Context-sensitive handling of interruptions by intelligent digital assistant |
US9547647B2 (en) | 2012-09-19 | 2017-01-17 | Apple Inc. | Voice-based media searching |
US9164954B2 (en) | 2012-10-08 | 2015-10-20 | The Coca-Cola Company | Vending accommodation and accessibility |
DE212014000045U1 (de) | 2013-02-07 | 2015-09-24 | Apple Inc. | Voice trigger for a digital assistant |
US10652394B2 (en) | 2013-03-14 | 2020-05-12 | Apple Inc. | System and method for processing voicemail |
US9368114B2 (en) | 2013-03-14 | 2016-06-14 | Apple Inc. | Context-sensitive handling of interruptions |
WO2014144579A1 (en) | 2013-03-15 | 2014-09-18 | Apple Inc. | System and method for updating an adaptive speech recognition model |
US10748529B1 (en) | 2013-03-15 | 2020-08-18 | Apple Inc. | Voice activated device for use with a voice-based digital assistant |
US9507561B2 (en) * | 2013-03-15 | 2016-11-29 | Verizon Patent And Licensing Inc. | Method and apparatus for facilitating use of touchscreen devices |
CN105027197B (zh) | 2013-03-15 | 2018-12-14 | Apple Inc. | Training an at least partial voice command system |
US9582608B2 (en) | 2013-06-07 | 2017-02-28 | Apple Inc. | Unified ranking with entropy-weighted information for phrase-based semantic auto-completion |
WO2014197336A1 (en) | 2013-06-07 | 2014-12-11 | Apple Inc. | System and method for detecting errors in interactions with a voice-based digital assistant |
WO2014197334A2 (en) | 2013-06-07 | 2014-12-11 | Apple Inc. | System and method for user-specified pronunciation of words for speech synthesis and recognition |
WO2014197335A1 (en) | 2013-06-08 | 2014-12-11 | Apple Inc. | Interpreting and acting upon commands that involve sharing information with remote devices |
AU2014278592B2 (en) | 2013-06-09 | 2017-09-07 | Apple Inc. | Device, method, and graphical user interface for enabling conversation persistence across two or more instances of a digital assistant |
US10176167B2 (en) | 2013-06-09 | 2019-01-08 | Apple Inc. | System and method for inferring user intent from speech inputs |
EP3008964B1 (en) | 2013-06-13 | 2019-09-25 | Apple Inc. | System and method for emergency calls initiated by voice command |
WO2015020942A1 (en) | 2013-08-06 | 2015-02-12 | Apple Inc. | Auto-activating smart responses based on activities from remote devices |
US10296160B2 (en) | 2013-12-06 | 2019-05-21 | Apple Inc. | Method for extracting salient dialog usage from live data |
US9620105B2 (en) | 2014-05-15 | 2017-04-11 | Apple Inc. | Analyzing audio input for efficient speech and music recognition |
US10592095B2 (en) | 2014-05-23 | 2020-03-17 | Apple Inc. | Instantaneous speaking of content on touch devices |
US9502031B2 (en) | 2014-05-27 | 2016-11-22 | Apple Inc. | Method for supporting dynamic grammars in WFST-based ASR |
US9715875B2 (en) | 2014-05-30 | 2017-07-25 | Apple Inc. | Reducing the need for manual start/end-pointing and trigger phrases |
US10289433B2 (en) | 2014-05-30 | 2019-05-14 | Apple Inc. | Domain specific language for encoding assistant dialog |
US9430463B2 (en) | 2014-05-30 | 2016-08-30 | Apple Inc. | Exemplar-based natural language processing |
US9760559B2 (en) | 2014-05-30 | 2017-09-12 | Apple Inc. | Predictive text input |
US10170123B2 (en) | 2014-05-30 | 2019-01-01 | Apple Inc. | Intelligent assistant for home automation |
US9842101B2 (en) | 2014-05-30 | 2017-12-12 | Apple Inc. | Predictive conversion of language input |
US9633004B2 (en) | 2014-05-30 | 2017-04-25 | Apple Inc. | Better resolution when referencing to concepts |
US10078631B2 (en) | 2014-05-30 | 2018-09-18 | Apple Inc. | Entropy-guided text prediction using combined word and character n-gram language models |
US9734193B2 (en) | 2014-05-30 | 2017-08-15 | Apple Inc. | Determining domain salience ranking from ambiguous words in natural speech |
US9785630B2 (en) | 2014-05-30 | 2017-10-10 | Apple Inc. | Text prediction using combined word N-gram and unigram language models |
AU2015266863B2 (en) | 2014-05-30 | 2018-03-15 | Apple Inc. | Multi-command single utterance input method |
US9338493B2 (en) | 2014-06-30 | 2016-05-10 | Apple Inc. | Intelligent automated assistant for TV user interactions |
US10659851B2 (en) | 2014-06-30 | 2020-05-19 | Apple Inc. | Real-time digital assistant knowledge updates |
WO2016022496A2 (en) | 2014-08-06 | 2016-02-11 | Apple Inc. | Reduced-size user interfaces for battery management |
US10446141B2 (en) | 2014-08-28 | 2019-10-15 | Apple Inc. | Automatic speech recognition based on user feedback |
EP4027227A1 (en) | 2014-09-02 | 2022-07-13 | Apple Inc. | Reduced-size interfaces for managing alerts |
US9818400B2 (en) | 2014-09-11 | 2017-11-14 | Apple Inc. | Method and apparatus for discovering trending terms in speech requests |
US10789041B2 (en) | 2014-09-12 | 2020-09-29 | Apple Inc. | Dynamic thresholds for always listening speech trigger |
US9886432B2 (en) | 2014-09-30 | 2018-02-06 | Apple Inc. | Parsimonious handling of word inflection via categorical stem + suffix N-gram language models |
US10074360B2 (en) | 2014-09-30 | 2018-09-11 | Apple Inc. | Providing an indication of the suitability of speech recognition |
US9646609B2 (en) | 2014-09-30 | 2017-05-09 | Apple Inc. | Caching apparatus for serving phonetic pronunciations |
US10127911B2 (en) | 2014-09-30 | 2018-11-13 | Apple Inc. | Speaker identification and unsupervised speaker adaptation techniques |
US9668121B2 (en) | 2014-09-30 | 2017-05-30 | Apple Inc. | Social reminders |
US10552013B2 (en) | 2014-12-02 | 2020-02-04 | Apple Inc. | Data detection |
US9711141B2 (en) | 2014-12-09 | 2017-07-18 | Apple Inc. | Disambiguating heteronyms in speech synthesis |
US9865280B2 (en) | 2015-03-06 | 2018-01-09 | Apple Inc. | Structured dictation using intelligent automated assistants |
US10152299B2 (en) | 2015-03-06 | 2018-12-11 | Apple Inc. | Reducing response latency of intelligent automated assistants |
US10567477B2 (en) | 2015-03-08 | 2020-02-18 | Apple Inc. | Virtual assistant continuity |
US9886953B2 (en) | 2015-03-08 | 2018-02-06 | Apple Inc. | Virtual assistant activation |
US9721566B2 (en) | 2015-03-08 | 2017-08-01 | Apple Inc. | Competing devices responding to voice triggers |
CN104679415A (zh) * | 2015-03-18 | 2015-06-03 | 吴爱好 (Wu Aihao) | Intelligent recipe recommendation and broadcast device and implementation method |
US9899019B2 (en) | 2015-03-18 | 2018-02-20 | Apple Inc. | Systems and methods for structured stem and suffix language models |
US9842105B2 (en) | 2015-04-16 | 2017-12-12 | Apple Inc. | Parsimonious continuous-space phrase representations for natural language processing |
US10460227B2 (en) | 2015-05-15 | 2019-10-29 | Apple Inc. | Virtual assistant in a communication session |
US10083688B2 (en) | 2015-05-27 | 2018-09-25 | Apple Inc. | Device voice control for selecting a displayed affordance |
US10127220B2 (en) | 2015-06-04 | 2018-11-13 | Apple Inc. | Language identification from short strings |
US9578173B2 (en) | 2015-06-05 | 2017-02-21 | Apple Inc. | Virtual assistant aided communication with 3rd party service in a communication session |
US10101822B2 (en) | 2015-06-05 | 2018-10-16 | Apple Inc. | Language input correction |
US10186254B2 (en) | 2015-06-07 | 2019-01-22 | Apple Inc. | Context-based endpoint detection |
US11025565B2 (en) | 2015-06-07 | 2021-06-01 | Apple Inc. | Personalized prediction of responses for instant messaging |
US10255907B2 (en) | 2015-06-07 | 2019-04-09 | Apple Inc. | Automatic accent detection using acoustic models |
US20160378747A1 (en) | 2015-06-29 | 2016-12-29 | Apple Inc. | Virtual assistant for media playback |
US10671428B2 (en) | 2015-09-08 | 2020-06-02 | Apple Inc. | Distributed personal assistant |
US10747498B2 (en) | 2015-09-08 | 2020-08-18 | Apple Inc. | Zero latency digital assistant |
US9697820B2 (en) | 2015-09-24 | 2017-07-04 | Apple Inc. | Unit-selection text-to-speech synthesis using concatenation-sensitive neural networks |
US10366158B2 (en) | 2015-09-29 | 2019-07-30 | Apple Inc. | Efficient word encoding for recurrent neural network language models |
US11010550B2 (en) | 2015-09-29 | 2021-05-18 | Apple Inc. | Unified language modeling framework for word prediction, auto-completion and auto-correction |
US11587559B2 (en) | 2015-09-30 | 2023-02-21 | Apple Inc. | Intelligent device identification |
US10691473B2 (en) | 2015-11-06 | 2020-06-23 | Apple Inc. | Intelligent automated assistant in a messaging environment |
US10049668B2 (en) | 2015-12-02 | 2018-08-14 | Apple Inc. | Applying neural network language models to weighted finite state transducers for automatic speech recognition |
US10223066B2 (en) | 2015-12-23 | 2019-03-05 | Apple Inc. | Proactive assistance based on dialog communication between devices |
US10446143B2 (en) | 2016-03-14 | 2019-10-15 | Apple Inc. | Identification of voice inputs providing credentials |
US9934775B2 (en) | 2016-05-26 | 2018-04-03 | Apple Inc. | Unit-selection text-to-speech synthesis based on predicted concatenation parameters |
US9972304B2 (en) | 2016-06-03 | 2018-05-15 | Apple Inc. | Privacy preserving distributed evaluation framework for embedded personalized systems |
US10249300B2 (en) | 2016-06-06 | 2019-04-02 | Apple Inc. | Intelligent list reading |
US11227589B2 (en) | 2016-06-06 | 2022-01-18 | Apple Inc. | Intelligent list reading |
US10049663B2 (en) | 2016-06-08 | 2018-08-14 | Apple, Inc. | Intelligent automated assistant for media exploration |
DK179309B1 (en) | 2016-06-09 | 2018-04-23 | Apple Inc | Intelligent automated assistant in a home environment |
US10586535B2 (en) | 2016-06-10 | 2020-03-10 | Apple Inc. | Intelligent digital assistant in a multi-tasking environment |
US10067938B2 (en) | 2016-06-10 | 2018-09-04 | Apple Inc. | Multilingual word prediction |
US10192552B2 (en) | 2016-06-10 | 2019-01-29 | Apple Inc. | Digital assistant providing whispered speech |
US10509862B2 (en) | 2016-06-10 | 2019-12-17 | Apple Inc. | Dynamic phrase expansion of language input |
US10490187B2 (en) | 2016-06-10 | 2019-11-26 | Apple Inc. | Digital assistant providing automated status report |
DK179343B1 (en) | 2016-06-11 | 2018-05-14 | Apple Inc | Intelligent task discovery |
DK179049B1 (en) | 2016-06-11 | 2017-09-18 | Apple Inc | Data driven natural language event detection and classification |
DK179415B1 (en) | 2016-06-11 | 2018-06-14 | Apple Inc | Intelligent device arbitration and control |
DK201670540A1 (en) | 2016-06-11 | 2018-01-08 | Apple Inc | Application integration with a digital assistant |
US10474753B2 (en) | 2016-09-07 | 2019-11-12 | Apple Inc. | Language identification using recurrent neural networks |
US10043516B2 (en) | 2016-09-23 | 2018-08-07 | Apple Inc. | Intelligent automated assistant |
US11281993B2 (en) | 2016-12-05 | 2022-03-22 | Apple Inc. | Model and ensemble compression for metric learning |
US10593346B2 (en) | 2016-12-22 | 2020-03-17 | Apple Inc. | Rank-reduced token representation for automatic speech recognition |
US11204787B2 (en) | 2017-01-09 | 2021-12-21 | Apple Inc. | Application integration with a digital assistant |
US10417266B2 (en) | 2017-05-09 | 2019-09-17 | Apple Inc. | Context-aware ranking of intelligent response suggestions |
DK201770383A1 (en) | 2017-05-09 | 2018-12-14 | Apple Inc. | USER INTERFACE FOR CORRECTING RECOGNITION ERRORS |
DK201770439A1 (en) | 2017-05-11 | 2018-12-13 | Apple Inc. | Offline personal assistant |
US10395654B2 (en) | 2017-05-11 | 2019-08-27 | Apple Inc. | Text normalization based on a data-driven learning network |
US10726832B2 (en) | 2017-05-11 | 2020-07-28 | Apple Inc. | Maintaining privacy of personal information |
US11301477B2 (en) | 2017-05-12 | 2022-04-12 | Apple Inc. | Feedback analysis of a digital assistant |
DK201770428A1 (en) | 2017-05-12 | 2019-02-18 | Apple Inc. | LOW-LATENCY INTELLIGENT AUTOMATED ASSISTANT |
DK179496B1 (en) | 2017-05-12 | 2019-01-15 | Apple Inc. | USER-SPECIFIC Acoustic Models |
DK179745B1 (en) | 2017-05-12 | 2019-05-01 | Apple Inc. | SYNCHRONIZATION AND TASK DELEGATION OF A DIGITAL ASSISTANT |
DK201770432A1 (en) | 2017-05-15 | 2018-12-21 | Apple Inc. | Hierarchical belief states for digital assistants |
DK201770431A1 (en) | 2017-05-15 | 2018-12-20 | Apple Inc. | Optimizing dialogue policy decisions for digital assistants using implicit feedback |
DK179560B1 (en) | 2017-05-16 | 2019-02-18 | Apple Inc. | FAR-FIELD EXTENSION FOR DIGITAL ASSISTANT SERVICES |
US10303715B2 (en) | 2017-05-16 | 2019-05-28 | Apple Inc. | Intelligent automated assistant for media exploration |
US20180336892A1 (en) | 2017-05-16 | 2018-11-22 | Apple Inc. | Detecting a trigger of a digital assistant |
US10403278B2 (en) | 2017-05-16 | 2019-09-03 | Apple Inc. | Methods and systems for phonetic matching in digital assistant services |
US10311144B2 (en) | 2017-05-16 | 2019-06-04 | Apple Inc. | Emoji word sense disambiguation |
US10657328B2 (en) | 2017-06-02 | 2020-05-19 | Apple Inc. | Multi-task recurrent neural network architecture for efficient morphology handling in neural language modeling |
US10445429B2 (en) | 2017-09-21 | 2019-10-15 | Apple Inc. | Natural language understanding using vocabularies with compressed serialized tries |
US10755051B2 (en) | 2017-09-29 | 2020-08-25 | Apple Inc. | Rule-based natural language processing |
US10636424B2 (en) | 2017-11-30 | 2020-04-28 | Apple Inc. | Multi-turn canned dialog |
US10733982B2 (en) | 2018-01-08 | 2020-08-04 | Apple Inc. | Multi-directional dialog |
US10733375B2 (en) | 2018-01-31 | 2020-08-04 | Apple Inc. | Knowledge-based framework for improving natural language understanding |
US10789959B2 (en) | 2018-03-02 | 2020-09-29 | Apple Inc. | Training speaker recognition models for digital assistants |
US10592604B2 (en) | 2018-03-12 | 2020-03-17 | Apple Inc. | Inverse text normalization for automatic speech recognition |
US10818288B2 (en) | 2018-03-26 | 2020-10-27 | Apple Inc. | Natural assistant interaction |
US10909331B2 (en) | 2018-03-30 | 2021-02-02 | Apple Inc. | Implicit identification of translation payload with neural machine translation |
US11145294B2 (en) | 2018-05-07 | 2021-10-12 | Apple Inc. | Intelligent automated assistant for delivering content from user experiences |
US10928918B2 (en) | 2018-05-07 | 2021-02-23 | Apple Inc. | Raise to speak |
US10984780B2 (en) | 2018-05-21 | 2021-04-20 | Apple Inc. | Global semantic word embeddings using bi-directional recurrent neural networks |
DK179822B1 (da) | 2018-06-01 | 2019-07-12 | Apple Inc. | Voice interaction at a primary device to access call functionality of a companion device |
DK180639B1 (en) | 2018-06-01 | 2021-11-04 | Apple Inc | DISABILITY OF ATTENTION-ATTENTIVE VIRTUAL ASSISTANT |
DK201870355A1 (en) | 2018-06-01 | 2019-12-16 | Apple Inc. | VIRTUAL ASSISTANT OPERATION IN MULTI-DEVICE ENVIRONMENTS |
US11386266B2 (en) | 2018-06-01 | 2022-07-12 | Apple Inc. | Text correction |
US10892996B2 (en) | 2018-06-01 | 2021-01-12 | Apple Inc. | Variable latency device coordination |
US10944859B2 (en) | 2018-06-03 | 2021-03-09 | Apple Inc. | Accelerated task performance |
US11010561B2 (en) | 2018-09-27 | 2021-05-18 | Apple Inc. | Sentiment prediction from textual data |
US10839159B2 (en) | 2018-09-28 | 2020-11-17 | Apple Inc. | Named entity normalization in a spoken dialog system |
US11170166B2 (en) | 2018-09-28 | 2021-11-09 | Apple Inc. | Neural typographical error modeling via generative adversarial networks |
US11462215B2 (en) | 2018-09-28 | 2022-10-04 | Apple Inc. | Multi-modal inputs for voice commands |
US11475898B2 (en) | 2018-10-26 | 2022-10-18 | Apple Inc. | Low-latency multi-speaker speech recognition |
US11638059B2 (en) | 2019-01-04 | 2023-04-25 | Apple Inc. | Content playback on multiple devices |
US11348573B2 (en) | 2019-03-18 | 2022-05-31 | Apple Inc. | Multimodality in digital assistant systems |
US11475884B2 (en) | 2019-05-06 | 2022-10-18 | Apple Inc. | Reducing digital assistant latency when a language is incorrectly determined |
US11423908B2 (en) | 2019-05-06 | 2022-08-23 | Apple Inc. | Interpreting spoken requests |
US11307752B2 (en) | 2019-05-06 | 2022-04-19 | Apple Inc. | User configurable task triggers |
DK201970509A1 (en) | 2019-05-06 | 2021-01-15 | Apple Inc | Spoken notifications |
US11140099B2 (en) | 2019-05-21 | 2021-10-05 | Apple Inc. | Providing message response suggestions |
US11289073B2 (en) | 2019-05-31 | 2022-03-29 | Apple Inc. | Device text to speech |
DK201970510A1 (en) | 2019-05-31 | 2021-02-11 | Apple Inc | Voice identification in digital assistant systems |
DK180129B1 (en) | 2019-05-31 | 2020-06-02 | Apple Inc. | USER ACTIVITY SHORTCUT SUGGESTIONS |
US11496600B2 (en) | 2019-05-31 | 2022-11-08 | Apple Inc. | Remote execution of machine-learned models |
US11360641B2 (en) | 2019-06-01 | 2022-06-14 | Apple Inc. | Increasing the relevance of new available information |
US11488406B2 (en) | 2019-09-25 | 2022-11-01 | Apple Inc. | Text detection using global geometry estimators |
US11810578B2 (en) | 2020-05-11 | 2023-11-07 | Apple Inc. | Device arbitration for digital assistant-based intercom systems |
Family Cites Families (50)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH05108065A (ja) * | 1991-10-15 | 1993-04-30 | Kawai Musical Instr Mfg Co Ltd | Automatic performance device |
US5890122A (en) * | 1993-02-08 | 1999-03-30 | Microsoft Corporation | Voice-controlled computer simulateously displaying application menu and list of available commands |
US5661787A (en) * | 1994-10-27 | 1997-08-26 | Pocock; Michael H. | System for on-demand remote access to a self-generating audio recording, storage, indexing and transaction system |
US5999895A (en) * | 1995-07-24 | 1999-12-07 | Forest; Donald K. | Sound operated menu method and apparatus |
US6560707B2 (en) * | 1995-11-06 | 2003-05-06 | Xerox Corporation | Multimedia coordination system |
US5802526A (en) * | 1995-11-15 | 1998-09-01 | Microsoft Corporation | System and method for graphically displaying and navigating through an interactive voice response menu |
US5912952A (en) * | 1996-06-27 | 1999-06-15 | At&T Corp | Voice response unit with a visual menu interface |
US5950123A (en) * | 1996-08-26 | 1999-09-07 | Telefonaktiebolaget L M | Cellular telephone network support of audible information delivery to visually impaired subscribers |
US5721827A (en) | 1996-10-02 | 1998-02-24 | James Logan | System for electrically distributing personalized information |
US20070026852A1 (en) * | 1996-10-02 | 2007-02-01 | James Logan | Multimedia telephone system |
US20020002039A1 (en) * | 1998-06-12 | 2002-01-03 | Safi Qureshey | Network-enabled audio device |
US6563769B1 (en) * | 1998-06-11 | 2003-05-13 | Koninklijke Philips Electronics N.V. | Virtual jukebox |
US6493428B1 (en) | 1998-08-18 | 2002-12-10 | Siemens Information & Communication Networks, Inc | Text-enhanced voice menu system |
US6360237B1 (en) * | 1998-10-05 | 2002-03-19 | Lernout & Hauspie Speech Products N.V. | Method and system for performing text edits during audio recording playback |
US6983251B1 (en) * | 1999-02-15 | 2006-01-03 | Sharp Kabushiki Kaisha | Information selection apparatus selecting desired information from plurality of audio information by mainly using audio |
US20020013852A1 (en) * | 2000-03-03 | 2002-01-31 | Craig Janik | System for providing content, management, and interactivity for thin client devices |
AU2299701A (en) * | 1999-10-22 | 2001-04-30 | Tellme Networks, Inc. | Streaming content over a telephone interface |
US6978127B1 (en) * | 1999-12-16 | 2005-12-20 | Koninklijke Philips Electronics N.V. | Hand-ear user interface for hand-held device |
US6519566B1 (en) * | 2000-03-01 | 2003-02-11 | International Business Machines Corporation | Method for hands-free operation of a pointer |
NL1014847C1 (nl) | 2000-04-05 | 2001-10-08 | Minos B V I O | Data transfer. |
EP1285330B1 (en) * | 2000-05-11 | 2006-08-30 | Nes Stewart Irvine | Zeroclick |
KR100867760B1 (ko) * | 2000-05-15 | 2008-11-10 | Sony Corporation | Reproducing apparatus, reproducing method and recording medium |
US6754504B1 (en) * | 2000-06-10 | 2004-06-22 | Motorola, Inc. | Method and apparatus for controlling environmental conditions using a personal area network |
US20020013784A1 (en) * | 2000-07-31 | 2002-01-31 | Swanson Raymond H. | Audio data transmission system and method of operation thereof |
US6529586B1 (en) * | 2000-08-31 | 2003-03-04 | Oracle Cable, Inc. | System and method for gathering, personalized rendering, and secure telephonic transmission of audio data |
US6556971B1 (en) | 2000-09-01 | 2003-04-29 | Snap-On Technologies, Inc. | Computer-implemented speech recognition system training |
US20020046315A1 (en) * | 2000-10-13 | 2002-04-18 | Interactive Objects, Inc. | System and method for mapping interface functionality to codec functionality in a portable audio device |
US6947728B2 (en) * | 2000-10-13 | 2005-09-20 | Matsushita Electric Industrial Co., Ltd. | Mobile phone with music reproduction function, music data reproduction method by mobile phone with music reproduction function, and the program thereof |
US6731312B2 (en) | 2001-01-08 | 2004-05-04 | Apple Computer, Inc. | Media player interface |
US7149319B2 (en) * | 2001-01-23 | 2006-12-12 | Phonak Ag | Telecommunication system, speech recognizer, and terminal, and method for adjusting capacity for vocal commanding |
US6448485B1 (en) * | 2001-03-16 | 2002-09-10 | Intel Corporation | Method and system for embedding audio titles |
US6834264B2 (en) * | 2001-03-29 | 2004-12-21 | Provox Technologies Corporation | Method and apparatus for voice dictation and document production |
US6892083B2 (en) * | 2001-09-05 | 2005-05-10 | Vocera Communications Inc. | Voice-controlled wireless communications system and method |
US7010581B2 (en) * | 2001-09-24 | 2006-03-07 | International Business Machines Corporation | Method and system for providing browser functions on a web page for client-specific accessibility |
US7027990B2 (en) | 2001-10-12 | 2006-04-11 | Lester Sussman | System and method for integrating the visual display of text menus for interactive voice response systems |
EP1440402A1 (en) | 2001-10-22 | 2004-07-28 | Apple Computer, Inc. | Intelligent synchronization for a media player |
US20030167318A1 (en) | 2001-10-22 | 2003-09-04 | Apple Computer, Inc. | Intelligent synchronization of media player with host computer |
EP1309142B1 (en) * | 2001-10-30 | 2007-06-20 | Hewlett-Packard Company | Communication system and method |
EP1311102A1 (en) | 2001-11-08 | 2003-05-14 | Hewlett-Packard Company | Streaming audio under voice control |
US6996777B2 (en) * | 2001-11-29 | 2006-02-07 | Nokia Corporation | Method and apparatus for presenting auditory icons in a mobile terminal |
US20030158737A1 (en) * | 2002-02-15 | 2003-08-21 | Csicsatka Tibor George | Method and apparatus for incorporating additional audio information into audio data file identifying information |
US6999066B2 (en) * | 2002-06-24 | 2006-02-14 | Xerox Corporation | System for audible feedback for touch screen displays |
US7166791B2 (en) * | 2002-07-30 | 2007-01-23 | Apple Computer, Inc. | Graphical user interface and methods of use thereof in a multimedia player |
US7136874B2 (en) * | 2002-10-16 | 2006-11-14 | Microsoft Corporation | Adaptive menu system for media players |
US7054888B2 (en) * | 2002-10-16 | 2006-05-30 | Microsoft Corporation | Optimizing media player memory during rendering |
US20040218451A1 (en) * | 2002-11-05 | 2004-11-04 | Said Joe P. | Accessible user interface and navigation system and method |
KR101156827B1 (ko) * | 2003-04-24 | 2012-06-18 | Thomson Licensing | Creation of playlists using audio identification |
US6728729B1 (en) | 2003-04-25 | 2004-04-27 | Apple Computer, Inc. | Accessing media across networks |
US20050045373A1 (en) * | 2003-05-27 | 2005-03-03 | Joseph Born | Portable media device with audio prompt menu |
KR20050072256A (ko) * | 2004-01-06 | 2005-07-11 | LG Electronics Inc. | Method for configuring and reproducing menu sound of a high-density optical disc, and recording/reproducing apparatus |
Date | Office | Application number | Publication | Status |
---|---|---|---|---|
2003-07-18 | US | US10/623,339 | US7757173B2 | Not active: Expired - Fee Related |
2004-05-25 | WO | PCT/US2004/016519 | WO2005015382A2 | Active: Application Filing |
2004-05-25 | EP | EP04753362A | EP1646936A2 | Not active: Withdrawn |
2004-05-25 | CN | CNA2004800262085A | CN1849579A | Active: Pending |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
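CN101419528B (zh) * | 2007-10-24 | 2012-08-29 | Brother Industries, Ltd. | Data processing device |
CN101458093B (zh) * | 2007-12-12 | 2011-12-07 | Xanavi Informatics Corporation | Navigation device |
CN113766414A (zh) * | 2013-04-03 | 2021-12-07 | Dolby Laboratories Licensing Corporation | Method and system for interactive rendering of object-based audio |
CN113766414B (zh) * | 2013-04-03 | 2024-03-01 | Dolby Laboratories Licensing Corporation | Method and system for interactive rendering of object-based audio |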
Also Published As
Publication number | Publication date |
---|---|
WO2005015382A2 (en) | 2005-02-17 |
WO2005015382A3 (en) | 2006-01-05 |
US7757173B2 (en) | 2010-07-13 |
EP1646936A2 (en) | 2006-04-19 |
US20050015254A1 (en) | 2005-01-20 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN1849579A (zh) | Voice information system | |
US7779357B2 (en) | Audio user interface for computing devices | |
US11080474B2 (en) | Calculations on sound associated with cells in spreadsheets | |
EP2324416B1 (en) | Audio user interface | |
US8108462B2 (en) | Information processing apparatus, information processing method, information processing program and recording medium for storing the program | |
US8438485B2 (en) | System, method, and apparatus for generating, customizing, distributing, and presenting an interactive audio publication | |
KR101242040B1 (ko) | Method and apparatus for automatically generating a playlist on a portable device | |
US20110153330A1 (en) | System and method for rendering text synchronized audio | |
EP2301014A2 (en) | Method and apparatus for generating voice annotations for playlists of digital media | |
US20080312760A1 (en) | Method and system for generating and processing digital content based on text-to-speech conversion | |
US20240126500A1 (en) | Device and method for creating a sharable clip of a podcast | |
CN1818899A (zh) | Data retrieval method for an MPEG player | |
EP2119070A2 (en) | Templates and style sheets for audio broadcasts | |
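CN2679758Y (zh) | Music player with a music search function | |
TW201340693A (zh) | Personalized voice broadcast device and method for stock market monitoring on a smart TV | |
KR20080052525A (ko) | Direct encoding system with accompanying metadata | |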
Mazzoni et al. | Podcasting with Audacity: Creating a Podcast With Free Audio Software (Digital Short Cut) | |
Proctor | Microware Review | |
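KR20070016620A (ko) | Apparatus and method for providing metadata of audio data by voice | |
KR20090005665A (ko) | Method and apparatus for storing song information data, and file playback apparatus including the same |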
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C12 | Rejection of a patent application after its publication | ||
RJ01 | Rejection of invention patent application after publication | Application publication date: 20061018 |