WO2016107001A1 - 一种记录语音通信信息的方法、终端及计算机存储介质 - Google Patents

一种记录语音通信信息的方法、终端及计算机存储介质 Download PDF

Info

Publication number
WO2016107001A1
WO2016107001A1 PCT/CN2015/076208 CN2015076208W WO2016107001A1 WO 2016107001 A1 WO2016107001 A1 WO 2016107001A1 CN 2015076208 W CN2015076208 W CN 2015076208W WO 2016107001 A1 WO2016107001 A1 WO 2016107001A1
Authority
WO
WIPO (PCT)
Prior art keywords
voice
communication information
stream
voice communication
terminal
Prior art date
Application number
PCT/CN2015/076208
Other languages
English (en)
French (fr)
Inventor
陈新
Original Assignee
中兴通讯股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中兴通讯股份有限公司 filed Critical 中兴通讯股份有限公司
Publication of WO2016107001A1 publication Critical patent/WO2016107001A1/zh

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/26Devices for calling a subscriber
    • H04M1/27Devices whereby a plurality of signals may be stored simultaneously
    • H04M1/274Devices whereby a plurality of signals may be stored simultaneously with provision for storing more than one subscriber number at a time, e.g. using toothed disc
    • H04M1/2745Devices whereby a plurality of signals may be stored simultaneously with provision for storing more than one subscriber number at a time, e.g. using toothed disc using static electronic memories, e.g. chips
    • H04M1/275Devices whereby a plurality of signals may be stored simultaneously with provision for storing more than one subscriber number at a time, e.g. using toothed disc using static electronic memories, e.g. chips implemented by means of portable electronic directories

Definitions

  • the present invention relates to the field of terminal applications, and in particular, to a method, a terminal, and a computer storage medium for recording voice communication information.
  • the terminal needs the user to manually record these important voice communication information in the voice communication process, and then add it to the corresponding application of the terminal, so that the operation of recording the voice communication information is very complicated.
  • Embodiments of the present invention are directed to a method, a terminal, and a computer storage medium for recording voice communication information, which are convenient for a user to record voice communication information, improve the intelligence level of the terminal, and provide a good user experience.
  • an embodiment of the present invention provides a method for recording voice communication information, where the method includes: listening to a voice stream of a called party during a voice communication, and from the called party according to a preset rule.
  • the voice communication information is extracted from the voice stream; the voice communication information is added to the corresponding application.
  • the method before the listening to the voice stream of the called party, the method further includes: listening to the voice stream of the calling party, identifying the first voice instruction, wherein the first voice instruction And configuring, according to the preset rule, the voice communication information to be extracted from the voice stream of the called party, according to the preset rule, according to the first And extracting the information type in the voice instruction, and extracting the corresponding voice communication information from the called party voice stream.
  • the identifying the first voice instruction comprises: performing voice recognition on the voice stream of the calling party, and when the recognition result is a preset voice command, the preset voice is The instruction is determined to be the first voice instruction; or the voice stream of the calling party is fuzzy matched to obtain the first voice instruction.
  • the method further includes: listening to the voice stream of the calling party, and identifying a second voice instruction, where The second voice instruction is configured to instruct the terminal to stop listening to the called party's voice stream.
  • the adding the voice communication information to the corresponding application comprises: buffering the voice communication information; after the voice communication ends, displaying a confirmation interface including the voice communication information;
  • the confirmation interface is configured to confirm the voice communication information by the user, and after receiving the confirmation operation of the confirmation interface by the user, adding the voice communication information to the corresponding application.
  • the embodiment of the present invention further provides a terminal, where the terminal includes: a listening unit, a voice recognition unit, and an information adding unit; wherein the listening unit is configured to be in voice communication During the process, the voice stream of the called party is intercepted; the voice recognition unit is configured to extract voice communication information from the voice stream of the called party according to a preset rule; the information adding unit is configured to The voice communication information is added to a corresponding application.
  • the listening unit is further configured to listen to the voice stream of the calling party before listening to the voice stream of the called party; correspondingly, the voice recognition unit is further configured to Identifying a first voice instruction in the voice stream of the calling party; wherein the first voice instruction is configured to instruct the terminal to listen to the voice stream of the called party.
  • the voice recognition unit is configured to perform voice recognition on the voice stream of the calling party, and when the recognition result is a preset voice command, determine the preset voice command as The first voice instruction; or, performing fuzzy matching on the voice stream of the calling party to obtain the first voice instruction.
  • the listening unit is further configured to listen to the voice stream of the calling party after the voice recognition unit extracts voice communication information from the voice stream of the called party;
  • the voice recognition unit is further configured to: identify a second voice instruction in the voice stream of the calling party; wherein the second voice instruction is configured to instruct the terminal to stop calling the called party The listening of the voice stream.
  • the terminal further includes: a storage unit configured to buffer the voice communication information; and correspondingly, the information adding unit is configured to display, after the end of the voice communication, a a confirmation interface of the voice communication information; configured to: after receiving the confirmation operation of the confirmation interface by the user, adding the voice communication information to a corresponding application; wherein the confirmation interface is configured to confirm the voice by the user Communication information.
  • an embodiment of the present invention further provides a computer storage medium, where the computer storage medium stores computer executable instructions, where the computer executable instructions are used to perform recording voice communication information according to an embodiment of the present invention.
  • the medium in the process of voice communication, listens to the voice stream of the called party, and extracts voice communication information from the voice stream of the called party according to a preset rule, for example, extracts the phone from the voice stream of the called party.
  • the terminal can automatically extract the voice communication information that needs to be recorded during the voice communication process of the user, and record, so that the user does not need to manually add to the corresponding application after manual recording, so that the user can record the voice communication information and improve the intelligence level of the terminal. Good user experience
  • FIG. 1 is a schematic flowchart of a method for recording voice communication information according to an embodiment of the present invention
  • FIG. 2 is a schematic structural diagram of a terminal according to an embodiment of the present invention.
  • the embodiment of the invention provides a method for recording voice communication information, and the method is applied to terminals such as a smart phone, a tablet computer, a function mobile phone, etc., and the user can perform voices such as making a call, a video call, and an instant voice chat through the terminals. Communication.
  • FIG. 1 is a schematic flowchart of a method for recording voice communication information according to an embodiment of the present invention. Referring to FIG. 1, the method includes:
  • S101 During the voice communication process, listen to the voice stream of the called party, and extract voice communication information from the voice stream of the called party according to a preset rule.
  • the terminal can automatically enable the listening function to listen to the voice stream of the called party; of course, it can also be manually turned on by the user; based on the listening function, The terminal obtains the voice signals input by the called party in real time, and performs these signals on the signals. Speech recognition, and then extracting voice communication information according to preset rules, such as extracting information types, keywords, and the like.
  • the method further includes: listening to the voice flow of the calling party, and identifying the first voice instruction, where the first voice instruction is configured To instruct the terminal to listen to the voice stream of the called party.
  • the terminal may first only listen to the voice stream of the calling party, then perform voice recognition on the voice streams, obtain the recognition result, and then match the recognition result with the preset voice instruction. If the matching is successful, that is, when the recognition result is a preset voice command, the preset voice command is determined as the first voice command; or the voice stream of the calling party is fuzzy matched to obtain the first voice command.
  • the foregoing preset voice command may be, but is not limited to, the following two situations:
  • the preset voice command is some fixed voice command pre-stored in the terminal local or the cloud server, and the preset voice command has a fixed format, such as: “record + phone number”, “record + activity” / Schedule” and so on.
  • the preset voice command pre-stores some fuzzy instructions on the terminal local or the cloud server, that is, the statement containing the preset instruction keyword, which is a common life term, more natural, such as: Start recording the XXX (general user name) phone number, "What is XXX's phone number”, “Please tell me XXX's phone number” and other fuzzy instructions with "phone number” or “number” as keywords; or , “Start recording XXX (generally referred to as the event name) activity", "When is the event on XX day”, “Please tell us how the schedule of tomorrow's meeting is arranged", etc., with "activity” or “schedule” as keywords Fuzzy instructions.
  • the first voice instruction has an extracted information type, that is, " Statements such as "telephone number”, “schedule”, “activity”, etc. So, in determining the first voice finger
  • the step of extracting the voice communication information from the called party's voice stream according to the preset rule may be: according to the extracted information type in the first voice instruction, from the called party voice stream. Extract corresponding voice communication information.
  • the terminal when the extracted information type in the first voice instruction is “telephone number”, the terminal extracts information such as “Zhang San, 139xxxxxxxx” in the voice stream of the called party, and when the first voice instruction is When the extracted information type is "Schedule”, the terminal extracts information such as "Playing Badminton, Sunday 10:00, X Badminton Hall".
  • the method further includes: listening to the voice stream of the calling party, and identifying the second voice instruction, where the second voice instruction It is configured to instruct the terminal to stop listening to the voice stream of the called party.
  • the user may also control the terminal to end the recording by using another voice command.
  • the preset voice command may further include, for example, “end record” and “this Is the XX mobile phone number, the "end recording activity", etc., then the terminal can identify the preset voice command in the voice stream of the calling party, and determine the preset command as the second voice command. And stop listening to the voice stream of the called party.
  • the mapping relationship between the voice communication information and the application is pre-stored in the terminal; in an implementation manner, the voice communication information may include an information type, that is, the information type and the application are pre-stored in the terminal. Mapping relations.
  • the mapping relationship between the voice communication information and the application may include: a mapping relationship between the phone number and the address book application, a mapping relationship between the schedule and the memo or the reminder application, and the like. For example, if the voice communication information is a phone number, the information is added to the address book application; if the voice communication information is a schedule, the information is added to an application such as a calendar, a memo, or a reminder.
  • step of ending the recording may be performed before S102, or may be performed simultaneously with S102, and may be performed after S102, which is not specifically limited in the embodiment of the present invention.
  • the S102 may include: buffering voice communication information; after the voice communication ends, displaying a confirmation interface including the voice communication information; wherein the confirmation interface is configured to confirm the voice communication information by the user; After the confirmation operation of the confirmation interface, the voice communication information is added to the corresponding application.
  • the terminal caches the information, and after the voice communication ends, the voice communication information is displayed to the user through the confirmation interface, and is provided on the confirmation interface.
  • the user can click the “confirm” virtual operation button to confirm the voice communication information; when confirming that the voice communication message obtained by the terminal is incorrect through the confirmation interface, the user can click “edit”
  • the virtual operation button is used to edit the voice communication information, or the "cancel" virtual operation button is touched to delete the cache of the voice communication information.
  • the terminal listens to the voice stream of the called party during the voice communication process, and extracts voice communication information from the voice stream of the called party according to a preset rule, for example, from the voice stream of the called party. Extract phone number, event schedule, schedule, etc., and then add voice communication information to the corresponding application, such as adding a phone number to the address book application, adding location information and/or time information to the memo application, In this way, the terminal can automatically extract the voice communication information that needs to be recorded during the voice communication process of the user, and record, so that the user does not need to manually add to the corresponding application after manual recording, so that the user can record the voice communication information and improve the intelligence level of the terminal. , providing a good user experience.
  • the embodiment of the present invention further provides a computer storage medium, where the computer storage medium stores computer executable instructions, and the computer executable instructions are used to execute the embodiment of the present invention.
  • the method of recording voice communication information is not limited to a computer storage medium.
  • an embodiment of the present invention provides a terminal that is consistent with the terminal described in one or more of the foregoing embodiments.
  • the terminal includes: a listening unit 21, a voice recognition unit 22, and an information adding unit 23; wherein the listening unit 21 is configured.
  • the voice recognition unit 22 is configured to extract voice communication information from the voice stream of the called party according to a preset rule; the information adding unit 23 , configured to add voice communication information to the corresponding application.
  • the intercepting unit 21 is further configured to listen to the voice stream of the calling party before listening to the voice stream of the called party;
  • the voice recognition unit 22 is further configured to: identify the first voice instruction in the voice stream of the calling party; and further configured to extract the corresponding voice from the called party voice stream according to the extracted information type in the first voice instruction. Voice communication information; wherein the first voice instruction is configured to instruct the terminal to listen to the voice stream of the called party.
  • the voice recognition unit 22 is configured to perform voice recognition on the voice stream of the calling party, and when the recognition result is a preset voice command, determine the preset voice command as the first voice instruction; or And performing fuzzy matching on the voice stream of the calling party to obtain the first voice instruction.
  • the listening unit 21 is further configured to: after the voice recognition unit 22 extracts voice communication information from the voice stream of the called party, listen to the voice stream of the calling party;
  • the speech recognition unit 22 is further configured to recognize the second voice instruction in the voice stream of the calling party, wherein the second voice command is configured to instruct the terminal to stop listening to the voice stream of the called party.
  • the terminal further includes: a storage unit configured to cache voice communication Letter information
  • the information adding unit 23 is configured to display a confirmation interface including voice communication information after the end of the voice communication; and after receiving the confirmation operation of the confirmation interface by the user, adding the voice communication information to the corresponding application;
  • the confirmation interface is configured to confirm the voice communication information by the user.
  • the listening unit 21, the voice recognition unit 22, and the information adding unit 23 in the terminal may pass through a central processing unit (CPU) and a digital signal in the terminal in an actual application.
  • CPU central processing unit
  • a DSP Digital Signal Processor
  • FPGA Field-Programmable Gate Array
  • the storage unit in the terminal can be implemented by a memory in the terminal in practical applications.
  • embodiments of the present invention can be provided as a method, system, or computer program product. Accordingly, the present invention can take the form of a hardware embodiment, a software embodiment, or a combination of software and hardware. Moreover, the invention can take the form of a computer program product embodied on one or more computer-usable storage media (including but not limited to disk storage and optical storage, etc.) including computer usable program code.
  • the computer program instructions can also be stored in a computer readable memory that can direct a computer or other programmable data processing device to operate in a particular manner, such that the computer readable memory is
  • the instructions in the memory produce an article of manufacture comprising instruction means that implements the functions specified in one or more blocks of the flow or in a flow or block diagram of the flowchart.
  • These computer program instructions can also be loaded onto a computer or other programmable data processing device such that a series of operational steps are performed on a computer or other programmable device to produce computer-implemented processing for execution on a computer or other programmable device.
  • the instructions provide steps for implementing the functions specified in one or more of the flow or in a block or blocks of a flow diagram.
  • the terminal listens to the voice stream of the called party during the voice communication process, and extracts voice communication information from the voice stream of the called party according to a preset rule, for example, the voice stream from the called party. Extract the phone number, event schedule, schedule, etc., and then add the voice communication information to the corresponding application, such as adding the phone number to the address book application, adding the location information and/or time information to the memo application.
  • the terminal can automatically extract the voice communication information that needs to be recorded during the voice communication process of the user, and record, so that the user does not need to manually add to the corresponding application after the manual recording, so that the user can record the voice communication information and improve the intelligence of the terminal. Degree, providing a good user experience.

Landscapes

  • Engineering & Computer Science (AREA)
  • Signal Processing (AREA)
  • Telephonic Communication Services (AREA)
  • Telephone Function (AREA)

Abstract

本发明实施例公开了一种记录语音通信信息的方法,所述方法包括:在语音通信过程中,侦听被叫方的语音流,并根据预设规则,从所述被叫方的语音流中提取出语音通信信息;将所述语音通信信息添加到对应的应用中。本发明实施例同时还公开了一种终端及计算机存储介质。

Description

一种记录语音通信信息的方法、终端及计算机存储介质 技术领域
本发明涉及终端应用领域,尤其涉及一种记录语音通信信息的方法、终端及计算机存储介质。
背景技术
通常,在人们打电话的过程中,往往会遇到需要记录语音通信内容中一些重要信息的情况。比如:当被叫方要求主叫方记录一个电话号码时,特别是一个较长的手机号码时,一般情况下,主叫方需要单独拿出纸笔记录,再在语音通信结束后,根据纸上的记录添加到通讯录应用中,或者在语音通信结束后,凭记忆记录到通讯录应用中;或者,当语音通信中被叫方提及一个或多个日程活动时,一般情况下,主叫方都需要单独拿出纸笔记录,再在语音通信结束后,根据纸上的记录添加到日历或者备忘录应用中,或者在语音通信结束后,凭记忆把这些日程安排中的时间、地点、活动内容等信息添加到手机的日历或者备忘录应用中,以便到时得到提醒。
在上述情况下,终端都需要用户先在语音通信过程中人工记录这些重要的语音通信信息,再添加到终端相应的应用中,这样就使得记录语音通信信息的操作十分复杂。
发明内容
本发明实施例期望提供一种记录语音通信信息的方法、终端及计算机存储介质,能够方便用户记录语音通信信息,提高终端的智能程度,提供良好的用户体验。
为达到上述目的,本发明实施例的技术方案是这样实现的:
第一方面,本发明实施例提供一种记录语音通信信息的方法,所述方法包括:在语音通信过程中,侦听被叫方的语音流,并根据预设规则,从所述被叫方的语音流中提取出语音通信信息;将所述语音通信信息添加到对应的应用中。
在另一实施例中,在所述侦听被叫方的语音流之前,所述方法还包括:侦听主叫方的语音流,识别出第一语音指令,其中,所述第一语音指令配置为指示所述终端侦听所述被叫方的语音流;相应地,所述根据预设规则,从所述被叫方的语音流中提取出语音通信信息,包括:根据所述第一语音指令中的提取信息类型,从所述被叫方语音流中提取对应的所述语音通信信息。
在另一实施例中,所述识别出第一语音指令,包括:对所述主叫方的语音流进行语音识别,并当所述识别结果为预设语音指令时,将所述预设语音指令确定为所述第一语音指令;或者,对所述主叫方的语音流进行模糊匹配,获得所述第一语音指令。
在另一实施例中,从所述被叫方的语音流中提取出语音通信信息之后,所述方法还包括:侦听所述主叫方的语音流,识别出第二语音指令,其中,所述第二语音指令配置为指示所述终端停止对所述被叫方的语音流的侦听。
在另一实施例中,所述将所述语音通信信息添加到对应的应用中,包括:缓存所述语音通信信息;在所述语音通信结束后,显示包含所述语音通信信息的确认界面;其中,所述确认界面配置为用户确认所述语音通信信息;接收所述用户对所述确认界面的确认操作之后,将所述语音通信信息添加到对应的应用中。
第二方面,本发明实施例还提供一种终端,所述终端包括:侦听单元、语音识别单元及信息添加单元;其中,所述侦听单元,配置为在语音通信 过程中,侦听被叫方的语音流;所述语音识别单元,配置为根据预设规则,从所述被叫方的语音流中提取出语音通信信息;所述信息添加单元,配置为将所述语音通信信息添加到对应的应用中。
在另一实施例中,所述侦听单元,还配置为在侦听所述被叫方的语音流之前,侦听主叫方的语音流;相应地,所述语音识别单元,还配置为在所述主叫方的语音流中识别出第一语音指令;其中,所述第一语音指令配置为指示所述终端侦听所述被叫方的语音流。
在另一实施例中,所述语音识别单元,配置为对所述主叫方的语音流进行语音识别,并当所述识别结果为预设语音指令时,将所述预设语音指令确定为所述第一语音指令;或,对所述主叫方的语音流进行模糊匹配,获得所述第一语音指令。
在另一实施例中,所述侦听单元,还配置为在所述语音识别单元从所述被叫方的语音流中提取出语音通信信息之后,侦听所述主叫方的语音流;相应地,所述语音识别单元,还配置为在所述主叫方的语音流中识别出第二语音指令;其中,所述第二语音指令配置为指示所述终端停止对所述被叫方的语音流的侦听。
在另一实施例中,所述终端,还包括:存储单元,配置为缓存所述语音通信信息;相应地,所述信息添加单元,配置为在所述语音通信结束后,显示一包含所述语音通信信息的确认界面;还配置为接收所述用户对所述确认界面的确认操作之后,将所述语音通信信息添加到对应的应用中;其中,所述确认界面配置为用户确认所述语音通信信息。
第三方面,本发明实施例还提供了一种计算机存储介质,所述计算机存储介质中存储有计算机可执行指令,所述计算机可执行指令用于执行本发明实施例所述的记录语音通信信息的方法。
本发明实施例所提供的记录语音通信信息的方法、终端及计算机存储 介质,终端在语音通信过程中,侦听被叫方的语音流,并根据预设规则,从被叫方的语音流中提取出语音通信信息,比如,从被叫方的语音流中提取电话号码、活动安排、日程安排等信息,然后,将语音通信信息添加到对应的应用中,如将电话号码添加到通讯录应用中、将地点信息和/或时间信息添加到备忘录应用中等,如此,终端就能够自动提取用户语音通信过程中需要记录的语音通信信息,并记录,使得用户无需在人工记录后,手动添加到对应的应用中,方便用户记录语音通信信息,提高终端的智能程度,提供良好的用户体验
附图说明
图1为本发明实施例中的记录语音通信信息的方法流程示意图;
图2为本发明实施例中的终端的结构示意图。
具体实施方式
下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述。
本发明实施例提供一种记录语音通信信息的方法,所述方法应用于如智能手机、平板电脑、功能手机等终端中,用户可以通过这些终端进行如打电话、视频电话、即时语音聊天等语音通信。
图1为本发明实施例中的记录语音通信信息的方法流程示意图,参见图1所示,所述方法包括:
S101:在语音通信过程中,侦听被叫方的语音流,并根据预设规则,从被叫方的语音流中提取出语音通信信息。
具体来说,当用户在进行语音通信,如打电话时,终端可以自动开启侦听功能,来侦听被叫方的语音流;当然,也可以由用户手动开启;基于所述侦听功能,终端实时获得被叫方输入的语音信号,并对这些信号进行 语音识别,再按照预设规则,如提取信息类型,关键字等提取出语音通信信息。
在实际应用中,为了降低终端的数据处理量,节省功耗,在S101之前,所述方法还包括:侦听主叫方的语音流,识别出第一语音指令,其中,第一语音指令配置为指示终端侦听被叫方的语音流。
具体来说,终端在确定用户开始语音通信之后,可以先仅仅侦听主叫方的语音流,然后对这些语音流进行语音识别,获得识别结果,再将识别结果与预设语音指令进行匹配,若匹配成功,即识别结果为预设语音指令时,将预设语音指令确定为第一语音指令;或者,对所述主叫方的语音流进行模糊匹配,获得所述第一语音指令。
在具体实施过程中,上述预设语音指令可以且不限于存在以下两种情况:
第一种,所述预设语音指令为在终端本地或者云端服务器中预先存储的一些固定语音指令,且所述预设语音指令具有固定格式,比如:“记录+电话号码”、“记录+活动/日程”等。
第二种,所述预设语音指令在终端本地或者云端服务器上预先存储的一些模糊指令,即包含预设指令关键字的语句,这种指令就是平常的生活用语,更自然一些,比如:“开始记录XXX(一般用人名)的电话号码”、“XXX的电话号码是什么”、“请你告诉我XXX的电话号码”等以“电话号码”或“号码”作为关键字的模糊指令;或者,“开始记录XXX(一般指活动名)活动”、“XX日的活动是什么时候”、“请你告诉明天的会议日程是怎样安排的”等以“活动”或“日程”作为关键字的模糊指令。
由上述可以看出,在第一语音指令中除了有指示侦听被叫方的语音流的指令,也就是“开始记录”、“记录”等语句之外,还具有提取信息类型,也就是“电话号码”、“日程”、“活动”等语句。所以,在确定第一语音指 令之后,S101中的所述根据预设规则,从被叫方的语音流中提取出语音通信信息的步骤,可以为:根据第一语音指令中的提取信息类型,从被叫方语音流中提取对应的语音通信信息。例如,当所述第一语音指令中的提取信息类型为“电话号码”时,终端就在被叫方的语音流中提取如“张三,139xxxxxxxx”的信息,而当所述第一语音指令中的提取信息类型为“日程”时,终端就提取如“打羽毛球,周日上午10点,X羽毛球馆”的信息。
作为另一实施方式,为了降低终端的数据处理量,节省功耗,在S101之后,所述方法还包括:侦听主叫方的语音流,识别出第二语音指令,其中,第二语音指令配置为指示终端停止对被叫方的语音流的侦听。
具体的,当用户完成语音通信信息的记录时,用户还可以通过另一个语音指令来控制终端结束记录,那么,此时,上述预设语音指令中就还可以包括如“结束记录”、“这是XX的手机号码吧”、“结束记录活动”等,那么,终端就可以在主叫方的语音流中识别出上述预设语音指令后,将所述预设指令确定为第二语音指令,并停止对被叫方的语音流的侦听。
S102:将语音通信信息添加到对应的应用中。
具体的,所述终端中预先存储有语音通信信息与应用的映射关系;在一种实施方式中,所述语音通信信息可以包括信息类型,也即所述终端中预先存储有信息类型与应用的映射关系。所述语音通信信息与应用的映射关系可以包括:电话号码与通讯录应用的映射关系、日程与备忘录或提醒事项应用的映射关系等等。比如,如果语音通信信息为电话号码,那么,就将所述信息添加到通讯录应用中;如果语音通信信息为日程,那么,就将所述信息添加到日历、备忘录或者提醒事项等应用中。
需要说明的是,上述结束记录的步骤可以在S102之前执行,也可以与S102同时执行,还可以在S102之后执行,本发明实施例中不做具体限定。
在另一实施例中,为了保证终端所记录的语音通信信息的准确性,提 供良好的用户体验,S102可以包括:缓存语音通信信息;在语音通信结束后,显示包含所述语音通信信息的确认界面;其中,所述确认界面配置为用户确认语音通信信息;接收用户对所述确认界面的确认操作之后,将所述语音通信信息添加到对应的应用中。
具体来说,终端在通过S101提取出语音通信信息之后,先对所述信息进行缓存,当语音通信结束后,再通过确认界面,将所述语音通信信息显示给用户,并在确认界面上提供如“确认”、“编辑”、“取消”等虚拟操作按键,使得用户可以在结束语音通信后,通过确认界面来确认终端所获得的语音通信信息是否正确;当通过确认界面来确认终端所获得的语音通信消息正确时,用户可以点触“确认”虚拟操作按键,以确认所述语音通信信息;当通过确认界面来确认终端所获得的语音通信消息不正确时,用户可以点触“编辑”虚拟操作按键,以编辑所述语音通信信息,或者点触“取消”虚拟操作按键,以删除所述语音通信信息的缓存。
至此,就完成了终端在用户语音通信过程中,记录语音通信信息的过程。
由上述可知,终端在语音通信过程中,侦听被叫方的语音流,并根据预设规则,从被叫方的语音流中提取出语音通信信息,比如,从被叫方的语音流中提取电话号码、活动安排、日程安排等信息,然后,将语音通信信息添加到对应的应用中,如将电话号码添加到通讯录应用中、将地点信息和/或时间信息添加到备忘录应用中等,如此,终端就能够自动提取用户语音通信过程中需要记录的语音通信信息,并记录,使得用户无需在人工记录后,手动添加到对应的应用中,方便用户记录语音通信信息,提高终端的智能程度,提供良好的用户体验。
本发明实施例还提供了一种计算机存储介质,所述计算机存储介质中存储有计算机可执行指令,所述计算机可执行指令用于执行本发明实施例 所述的记录语音通信信息的方法。
基于同一发明构思,本发明实施例提供一种终端,所述终端与上述一个或者多个实施例中所述的终端一致。
图2为本发明实施例中的终端的结构示意图,参见图2所示,所述终端包括:侦听单元21、语音识别单元22及信息添加单元23;其中,所述侦听单元21,配置为在语音通信过程中,侦听被叫方的语音流;所述语音识别单元22,配置为根据预设规则,从被叫方的语音流中提取出语音通信信息;所述信息添加单元23,配置为将语音通信信息添加到对应的应用中。
作为一种实施方式,所述侦听单元21,还配置为在侦听被叫方的语音流之前,侦听主叫方的语音流;
相应地,语音识别单元22,还配置为在主叫方的语音流中识别出第一语音指令;还配置为根据第一语音指令中的提取信息类型,从被叫方语音流中提取对应的语音通信信息;其中,所述第一语音指令配置为指示终端侦听被叫方的语音流。
作为一种实施方式,所述语音识别单元22,配置为对主叫方的语音流进行语音识别,并当识别结果为预设语音指令时,将预设语音指令确定为第一语音指令;或,对所述主叫方的语音流进行模糊匹配,获得所述第一语音指令。
作为一种实施方式,所述侦听单元21,还配置为在所述语音识别单元22从所述被叫方的语音流中提取出语音通信信息之后,侦听主叫方的语音流;
相应地,语音识别单元22,还配置为在主叫方的语音流中识别出第二语音指令,其中,第二语音指令配置为指示终端停止对被叫方的语音流的侦听。
作为一种实施方式,所述终端还包括:存储单元,配置为缓存语音通 信信息;
相应地,信息添加单元23,配置为在语音通信结束后,显示包含语音通信信息的确认界面;接收用户对所述确认界面的确认操作之后,将所述语音通信信息添加到对应的应用中;其中,所述确认界面配置为用户确认语音通信信息。
在本实施例中,所述终端中的侦听单元21、语音识别单元22及信息添加单元23在实际应用中,可通过所述终端中的中央处理器(CPU,Central Processing Unit)、数字信号处理器(DSP,Digital Signal Processor)或可编程门阵列(FPGA,Field-Programmable Gate Array)实现;所述终端中的存储单元,在实际应用中,可通过所述终端中的存储器实现。
本领域内的技术人员应明白,本发明的实施例可提供为方法、系统、或计算机程序产品。因此,本发明可采用硬件实施例、软件实施例、或结合软件和硬件方面的实施例的形式。而且,本发明可采用在一个或多个其中包含有计算机可用程序代码的计算机可用存储介质(包括但不限于磁盘存储器和光学存储器等)上实施的计算机程序产品的形式。
本发明是参照根据本发明实施例的方法、设备(系统)、和计算机程序产品的流程图和/或方框图来描述的。应理解可由计算机程序指令实现流程图和/或方框图中的每一流程和/或方框、以及流程图和/或方框图中的流程和/或方框的结合。可提供这些计算机程序指令到通用计算机、专用计算机、嵌入式处理机或其他可编程数据处理设备的处理器以产生一个机器,使得通过计算机或其他可编程数据处理设备的处理器执行的指令产生用于实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能的装置。
这些计算机程序指令也可存储在能引导计算机或其他可编程数据处理设备以特定方式工作的计算机可读存储器中,使得存储在所述计算机可读 存储器中的指令产生包括指令装置的制造品,所述指令装置实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能。
这些计算机程序指令也可装载到计算机或其他可编程数据处理设备上,使得在计算机或其他可编程设备上执行一系列操作步骤以产生计算机实现的处理,从而在计算机或其他可编程设备上执行的指令提供用于实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能的步骤。
以上所述,仅为本发明的较佳实施例而已,并非用于限定本发明的保护范围。
工业实用性
本发明实施例通过终端在语音通信过程中,侦听被叫方的语音流,并根据预设规则,从被叫方的语音流中提取出语音通信信息,比如,从被叫方的语音流中提取电话号码、活动安排、日程安排等信息,然后,将语音通信信息添加到对应的应用中,如将电话号码添加到通讯录应用中、将地点信息和/或时间信息添加到备忘录应用中等,如此,终端就能够自动提取用户语音通信过程中需要记录的语音通信信息,并记录,使得用户无需在人工记录后,手动添加到对应的应用中,方便用户记录语音通信信息,提高终端的智能程度,提供良好的用户体验。

Claims (11)

  1. 一种记录语音通信信息的方法,所述方法包括:
    在语音通信过程中,侦听被叫方的语音流,并根据预设规则,从所述被叫方的语音流中提取出语音通信信息;
    将所述语音通信信息添加到对应的应用中。
  2. 根据权利要求1所述的方法,其中,在所述侦听被叫方的语音流之前,所述方法还包括:
    侦听主叫方的语音流,识别出第一语音指令,其中,所述第一语音指令配置为指示所述终端侦听所述被叫方的语音流;
    相应地,所述根据预设规则,从所述被叫方的语音流中提取出语音通信信息,包括:
    根据所述第一语音指令中的提取信息类型,从所述被叫方语音流中提取对应的所述语音通信信息。
  3. 根据权利要求2所述的方法,其中,所述识别出第一语音指令,包括:
    对所述主叫方的语音流进行语音识别,并当所述识别结果为预设语音指令时,将所述预设语音指令确定为所述第一语音指令;或者,对所述主叫方的语音流进行模糊匹配,获得所述第一语音指令。
  4. 根据权利要求1所述的方法,其中,从所述被叫方的语音流中提取出语音通信信息之后,所述方法还包括:
    侦听所述主叫方的语音流,识别出第二语音指令,其中,所述第二语音指令配置为指示所述终端停止对所述被叫方的语音流的侦听。
  5. 根据权利要求1所述的方法,其中,所述将所述语音通信信息添加到对应的应用中,包括:
    缓存所述语音通信信息;
    在所述语音通信结束后,显示包含所述语音通信信息的确认界面;其中,所述确认界面配置为用户确认所述语音通信信息;
    接收所述用户对所述确认界面的确认操作之后,将所述语音通信信息添加到对应的应用中。
  6. 一种终端,所述终端包括:侦听单元、语音识别单元及信息添加单元;其中,
    所述侦听单元,配置为在语音通信过程中,侦听被叫方的语音流;
    所述语音识别单元,配置为根据预设规则,从所述被叫方的语音流中提取出语音通信信息;
    所述信息添加单元,配置为将所述语音通信信息添加到对应的应用中。
  7. 根据权利要求1所述的终端,其中,所述侦听单元,还配置为在侦听所述被叫方的语音流之前,侦听主叫方的语音流;
    相应地,所述语音识别单元,还配置为在所述主叫方的语音流中识别出第一语音指令;其中,所述第一语音指令配置为指示所述终端侦听所述被叫方的语音流。
  8. 根据权利要求7所述的终端,其中,所述语音识别单元,配置为对所述主叫方的语音流进行语音识别,并当所述识别结果为预设语音指令时,将所述预设语音指令确定为所述第一语音指令;或,对所述主叫方的语音流进行模糊匹配,获得所述第一语音指令。
  9. 根据权利要求6所述的终端,其中,所述侦听单元,还配置为在所述语音识别单元从所述被叫方的语音流中提取出语音通信信息之后,侦听所述主叫方的语音流;
    相应地,所述语音识别单元,还配置为在所述主叫方的语音流中识别出第二语音指令;其中,所述第二语音指令配置为指示所述终端停止对所述被叫方的语音流的侦听。
  10. 根据权利要求6所述的终端,其中,所述终端,还包括:存储单元,配置为缓存所述语音通信信息;
    相应地,所述信息添加单元,配置为在所述语音通信结束后,显示包含所述语音通信信息的确认界面;还配置为接收所述用户对所述确认界面的确认操作之后,将所述语音通信信息添加到对应的应用中;其中,所述确认界面配置为用户确认所述语音通信信息。
  11. 一种计算机存储介质,所述计算机存储介质中存储有计算机可执行指令,所述计算机可执行指令用于执行权利要求1至5任一项所述的记录语音通信信息的方法。
PCT/CN2015/076208 2014-12-29 2015-04-09 一种记录语音通信信息的方法、终端及计算机存储介质 WO2016107001A1 (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201410848570.6A CN105812535A (zh) 2014-12-29 2014-12-29 一种记录语音通信信息的方法及终端
CN201410848570.6 2014-12-29

Publications (1)

Publication Number Publication Date
WO2016107001A1 true WO2016107001A1 (zh) 2016-07-07

Family

ID=56284061

Family Applications (2)

Application Number Title Priority Date Filing Date
PCT/CN2015/076208 WO2016107001A1 (zh) 2014-12-29 2015-04-09 一种记录语音通信信息的方法、终端及计算机存储介质
PCT/CN2015/082130 WO2016107104A1 (zh) 2014-12-29 2015-06-23 一种记录语音通信信息的方法及终端、计算机存储介质

Family Applications After (1)

Application Number Title Priority Date Filing Date
PCT/CN2015/082130 WO2016107104A1 (zh) 2014-12-29 2015-06-23 一种记录语音通信信息的方法及终端、计算机存储介质

Country Status (2)

Country Link
CN (1) CN105812535A (zh)
WO (2) WO2016107001A1 (zh)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106598712A (zh) * 2016-11-21 2017-04-26 捷开通讯(深圳)有限公司 一种在通话时启动应用程序的方法及通信终端
CN106357932A (zh) * 2016-11-22 2017-01-25 奇酷互联网络科技(深圳)有限公司 一种通话信息记录方法和移动终端
CN106531158A (zh) * 2016-11-30 2017-03-22 北京理工大学 一种应答语音的识别方法及装置
CN109377998B (zh) * 2018-12-11 2022-02-25 科大讯飞股份有限公司 一种语音交互方法及装置

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8380521B1 (en) * 2007-07-17 2013-02-19 West Corporation System, method and computer-readable medium for verbal control of a conference call
CN103024123A (zh) * 2012-12-25 2013-04-03 广东欧珀移动通信有限公司 基于语音识别技术的电话号码存储装置及方法
CN103200328A (zh) * 2013-04-09 2013-07-10 上海斐讯数据通信技术有限公司 一种手机通话过程中的号码记录装置
CN103873654A (zh) * 2012-12-13 2014-06-18 深圳富泰宏精密工业有限公司 通话内容分析及提取系统及方法

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101282376A (zh) * 2008-06-02 2008-10-08 深圳华为通信技术有限公司 一种移动终端及实现移动终端自动保存数字号码的方法
US20110093266A1 (en) * 2009-10-15 2011-04-21 Tham Krister Voice pattern tagged contacts
CN103167120A (zh) * 2012-07-05 2013-06-19 深圳市金立通信设备有限公司 手机通话过程中快速查找联系人的系统及方法
CN103929551B (zh) * 2013-01-11 2017-10-31 上海掌门科技有限公司 基于通话的辅助方法及系统

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8380521B1 (en) * 2007-07-17 2013-02-19 West Corporation System, method and computer-readable medium for verbal control of a conference call
CN103873654A (zh) * 2012-12-13 2014-06-18 深圳富泰宏精密工业有限公司 通话内容分析及提取系统及方法
CN103024123A (zh) * 2012-12-25 2013-04-03 广东欧珀移动通信有限公司 基于语音识别技术的电话号码存储装置及方法
CN103200328A (zh) * 2013-04-09 2013-07-10 上海斐讯数据通信技术有限公司 一种手机通话过程中的号码记录装置

Also Published As

Publication number Publication date
WO2016107104A1 (zh) 2016-07-07
CN105812535A (zh) 2016-07-27

Similar Documents

Publication Publication Date Title
KR101942308B1 (ko) 메시지 기능을 제공하기 위한 방법 및 그 전자 장치
US9661133B2 (en) Electronic device and method for extracting incoming/outgoing information and managing contacts
JP6618489B2 (ja) 位置ベースのオーディオ・メッセージング
WO2016023317A1 (zh) 一种语音信息的处理方法及终端
TWI611336B (zh) 提示產生方法、行動電子裝置及電腦可讀取媒體
WO2017128991A1 (zh) 一种基于语音识别的即时通信方法和即时通信系统
US20200153963A1 (en) Systems and methods for providing integrated computerized personal assistant services in telephony communications
WO2016110217A1 (zh) 通话过程中保存号码的方法和装置、终端、存储介质
US9172795B1 (en) Phone call context setting
US9444927B2 (en) Methods for voice management, and related devices
CN105183486A (zh) 通知消息的显示方法及装置
US20170064084A1 (en) Method and Apparatus for Implementing Voice Mailbox
WO2016107001A1 (zh) 一种记录语音通信信息的方法、终端及计算机存储介质
CN110708430A (zh) 一种通话管理方法、通信终端及存储介质
US11516169B2 (en) Electronic messaging platform that allows users to change the content and attachments of messages after sending
CN115840841A (zh) 多模态对话方法、装置、设备及存储介质
WO2020103562A1 (zh) 一种语音处理方法和装置
US20140241340A1 (en) System and method for software turret phone capabilities
CN110086941B (zh) 语音播放方法、装置及终端设备
CN104954588A (zh) 语音留言方法和语音留言装置
US11477323B2 (en) Managing queued voice calls
CN115550502A (zh) 日程记录及提示方法、装置、智能设备及存储介质
CN108809894B (zh) 一种网络电话处理的方法及终端
CN104571856A (zh) 一种终端
CN107515666A (zh) 一种数据管理方法及终端

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 15874694

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 15874694

Country of ref document: EP

Kind code of ref document: A1