WO2016107104A1 - Terminal, procédé d'enregistrement d'informations de communication vocale et support de stockage informatique - Google Patents

Terminal, procédé d'enregistrement d'informations de communication vocale et support de stockage informatique Download PDF

Info

Publication number
WO2016107104A1
WO2016107104A1 PCT/CN2015/082130 CN2015082130W WO2016107104A1 WO 2016107104 A1 WO2016107104 A1 WO 2016107104A1 CN 2015082130 W CN2015082130 W CN 2015082130W WO 2016107104 A1 WO2016107104 A1 WO 2016107104A1
Authority
WO
WIPO (PCT)
Prior art keywords
voice
communication information
voice communication
stream
terminal
Prior art date
Application number
PCT/CN2015/082130
Other languages
English (en)
Chinese (zh)
Inventor
陈新
Original Assignee
中兴通讯股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中兴通讯股份有限公司 filed Critical 中兴通讯股份有限公司
Publication of WO2016107104A1 publication Critical patent/WO2016107104A1/fr

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/26Devices for calling a subscriber
    • H04M1/27Devices whereby a plurality of signals may be stored simultaneously
    • H04M1/274Devices whereby a plurality of signals may be stored simultaneously with provision for storing more than one subscriber number at a time, e.g. using toothed disc
    • H04M1/2745Devices whereby a plurality of signals may be stored simultaneously with provision for storing more than one subscriber number at a time, e.g. using toothed disc using static electronic memories, e.g. chips
    • H04M1/275Devices whereby a plurality of signals may be stored simultaneously with provision for storing more than one subscriber number at a time, e.g. using toothed disc using static electronic memories, e.g. chips implemented by means of portable electronic directories

Definitions

  • the present invention relates to the field of terminal applications, and in particular, to a method and terminal for recording voice communication information, and a computer storage medium.
  • the terminal needs the user to manually record these important voice communication information in the voice communication process, and then add it to the corresponding application of the terminal, so that the operation of recording the voice communication information is very complicated.
  • an embodiment of the present invention provides a method for recording voice communication information, a terminal, and a computer storage medium, so as to facilitate the user to record voice communication information, improve the intelligence level of the terminal, and provide a good user experience.
  • an embodiment of the present invention provides a method for recording voice communication information, where the method includes: listening to a voice stream of a called party during a voice communication, and from the called party according to a preset rule.
  • the voice communication information is extracted from the voice stream; the voice communication information is added to the corresponding application.
  • the method before the listening to the voice stream of the called party, the method further includes: listening to the voice stream of the calling party, and identifying the first voice instruction, where the first voice instruction And indicating, by the terminal, the voice stream of the called party; and correspondingly, the voice communication information is extracted from the voice stream of the called party according to a preset rule, including: according to the first And extracting the information type in the voice instruction, and extracting the corresponding voice communication information from the called party voice stream.
  • the identifying the first voice instruction includes: performing voice recognition on the voice stream of the calling party, and when the recognition result is a preset voice command, the preset voice is The instruction is determined to be the first voice instruction.
  • the method further includes: listening to the voice stream of the calling party, and identifying a second voice instruction, wherein the second voice instruction And is used to instruct the terminal to stop listening to the voice stream of the called party.
  • the adding the voice communication information to the corresponding application includes: buffering the voice communication information; after the voice communication ends, displaying a confirmation interface including the voice communication information
  • the confirmation interface is used by the user to confirm the voice communication information; after receiving the confirmation operation of the confirmation interface by the user, the voice communication information is added to the corresponding application.
  • an embodiment of the present invention provides a terminal, where the terminal includes: a listening unit, a voice recognition unit, and an information adding unit.
  • the listening unit is configured to listen to a called party during a voice communication process.
  • a voice streaming unit configured to extract voice communication information from a voice stream of the called party according to a preset rule; the information adding unit, configuring To add the voice communication information to a corresponding application.
  • the listening unit is further configured to listen to the voice stream of the calling party before listening to the voice stream of the called party; correspondingly, the voice recognition unit is further configured to Identifying a first voice instruction in the voice stream of the calling party, where the first voice instruction is used to instruct the terminal to listen to the voice stream of the called party.
  • the voice recognition unit is further configured to perform voice recognition on the voice stream of the calling party, and determine the preset voice command when the recognition result is a preset voice command. For the first voice instruction; or, performing fuzzy matching on the voice stream of the calling party to obtain the first voice instruction.
  • the listening unit is further configured to: after the storage unit saves the voice communication information, listen to the voice stream of the calling party; correspondingly, the voice recognition unit further And configured to identify a second voice instruction in the voice stream of the calling party, where the second voice instruction is used to instruct the terminal to stop listening to the voice stream of the called party.
  • the terminal further includes: a storage unit configured to cache the voice communication information; and correspondingly, the information adding unit is further configured to display an inclusion after the voice communication ends
  • the confirmation interface of the voice communication information wherein the confirmation interface is used by the user to confirm the voice communication information; after receiving the confirmation operation of the confirmation interface by the user, adding the voice communication information to the corresponding application .
  • an embodiment of the present invention provides a computer storage medium storing a computer program for executing the foregoing method for recording voice communication information.
  • the terminal listens to the voice stream of the called party during the voice communication process, and extracts the voice from the voice stream of the called party according to a preset rule.
  • Communication information for example, extracting information such as a phone number, an event schedule, a schedule, and the like from the voice stream of the called party, and then adding the voice communication information to the corresponding application, such as adding the phone number to the address book application, Location information and/or time information added to the note
  • the application is medium, so the terminal can automatically extract the voice communication information that needs to be recorded during the user's voice communication, and record, so that the user does not need to manually add to the corresponding application after manual recording, so that the user can record the voice communication information and improve the user.
  • the intelligence of the terminal provides a good user experience.
  • FIG. 1 is a schematic flowchart of a method for recording voice communication information according to an embodiment of the present invention
  • FIG. 2 is a schematic structural diagram of a terminal according to an embodiment of the present invention.
  • Embodiments of the present invention provide a method for recording voice communication information, which is applied to terminals such as a smart phone, a tablet computer, and a function mobile phone, and the user can perform voice communication such as making a call, a video call, and an instant voice chat through the terminals. .
  • FIG. 1 is a schematic flowchart of a method for recording voice communication information according to an embodiment of the present invention. Referring to FIG. 1, the method includes:
  • S101 Listening, in the process of voice communication, a voice stream of the called party, and extracting voice communication information from the voice stream of the called party according to a preset rule;
  • the terminal can automatically enable the listening function to listen to the voice stream of the called party.
  • the user can also manually turn on the call, and then the terminal obtains the called party in real time.
  • the method further includes: listening to the voice stream of the calling party, and identifying the first voice instruction, wherein the first voice instruction is used to indicate The terminal listens to the voice stream of the called party;
  • the terminal may first only listen to the voice stream of the calling party, then perform voice recognition on the voice streams, obtain the recognition result, and then match the recognition result with the preset voice instruction. If the matching is successful, that is, when the recognition result is a preset voice instruction, the preset voice command is determined as the first voice command.
  • the foregoing preset voice command may be, but is not limited to, the following two situations.
  • the first type, the preset voice command is some fixed voice commands pre-stored in the terminal local or cloud server, and has a fixed format, such as: "record + phone number”, “record + activity / schedule”, and the like.
  • some fuzzy instructions pre-stored by the preset voice command on the terminal local or cloud server that is, the statement containing the preset instruction keyword, which is a common life term, more natural, such as: Record the XXX (general user name) phone number, "What is XXX's phone number”, “Please tell me XXX's phone number” and other fuzzy instructions with "phone number” or “number” as keywords; or, “Start recording XXX (generally referred to as the event name) activity", "when is the event on XX day”, “Please tell me how the schedule of tomorrow's meeting is arranged", etc., with the "activity” or “schedule” as the key to the blur instruction.
  • the step of extracting the voice communication information from the voice stream of the called party according to the preset rule in S101 may be: according to the type of the extracted information in the first voice instruction,
  • the corresponding voice communication information is extracted from the called party voice stream, that is, when the extracted information type in the first voice instruction is “telephone number”, the terminal extracts information such as “Zhang San, 139xxxxxxxx” in the voice stream of the called party.
  • the terminal extracts information such as "playing badminton, 10 am on Sunday, X badminton hall".
  • the method further includes: listening to the voice stream of the calling party, and identifying the second voice instruction, wherein the second voice instruction is used. Instructing the terminal to stop listening to the voice stream of the called party.
  • the user when the user completes the recording of the voice communication information, the user can also control the terminal to end the recording by another voice command.
  • the preset voice command may further include, for example, “end the record” and “ This is the XX mobile phone number, the "end recording activity", etc., then the terminal can identify the preset voice command in the voice stream of the calling party, and determine the preset command as the second voice command. And stop listening to the voice stream of the called party.
  • the voice communication information is a phone number
  • the information is added to the address book application
  • the voice communication information is a schedule
  • the information is added to an application such as a calendar, a memo, or a reminder.
  • step of ending the recording may be performed before S102, or may be performed simultaneously with S102, or may be performed after S102, which is not specifically limited by the present invention.
  • S102 may include: buffering voice communication information; after the voice communication ends, displaying a confirmation interface including voice communication information.
  • the confirmation interface is used for the user to confirm the voice communication information; after receiving the confirmation operation of the confirmation interface by the user, the voice communication information is added to the corresponding application.
  • the terminal first caches the information, and after the voice communication ends, the voice communication information is displayed to the user through a confirmation interface, and is provided on the confirmation interface.
  • the virtual operation buttons such as “confirm”, “edit”, and “cancel” enable the user to confirm whether the voice communication message obtained by the terminal is correct after the end of the voice communication, and if so, the user can click “confirm” virtual Operate the button to confirm the information; if not, the user can click the "Edit” virtual operation button to edit the information. Or touch the "Cancel” virtual action button to delete the cache of this information.
  • the terminal listens to the voice stream of the called party during the voice communication process, and extracts voice communication information from the voice stream of the called party according to a preset rule, for example, from the voice stream of the called party. Extract phone number, event schedule, schedule, etc., and then add voice communication information to the corresponding application, such as adding a phone number to the address book application, adding location information and/or time information to the memo application, In this way, the terminal can automatically extract the voice communication information that needs to be recorded during the voice communication process of the user, and record, so that the user does not need to manually add to the corresponding application after manual recording, so that the user can record the voice communication information and improve the intelligence level of the terminal. To provide a good user experience
  • an embodiment of the present invention provides a terminal that is consistent with the terminal described in one or more of the foregoing embodiments.
  • the terminal includes: a listening unit 21, a voice recognition unit 22, and an information adding unit 23; wherein the listening unit 21 is configured to be in voice During the communication process, the voice stream of the called party is intercepted; the voice recognition unit 22 is configured to extract the voice communication information from the voice stream of the called party according to the preset rule; the information adding unit 23 is configured to set the voice communication information Add to the corresponding app.
  • the listening unit 21 is further configured to listen to the voice stream of the calling party before listening to the voice stream of the called party;
  • the voice recognition unit 22 is further configured to: identify, in the voice stream of the calling party, the first voice command, where the first voice command is used to indicate that the terminal listens to the voice stream of the called party;
  • the extracted information type in a voice command extracts corresponding voice communication information from the called party voice stream.
  • the voice recognition unit 22 is further configured to stream the voice of the calling party.
  • the voice recognition is performed, and when the recognition result is a preset voice command, the preset voice command is determined as the first voice command.
  • the listening unit 21 is further configured to: after the storage unit saves the voice communication information, listen to the voice stream of the calling party;
  • the speech recognition unit 22 is further configured to recognize the second voice instruction in the voice stream of the calling party, wherein the second voice instruction is used to instruct the terminal to stop listening to the voice stream of the called party.
  • the terminal further includes: a storage unit (not shown) configured to cache voice communication information;
  • the information adding unit 23 is further configured to: after the end of the voice communication, display a confirmation interface including voice communication information, wherein the confirmation interface is used for the user to confirm the voice communication information; after receiving the confirmation operation of the confirmation interface by the user, Voice communication information is added to the corresponding application.
  • the above-mentioned listening unit 21, the voice recognition unit 22, and the information adding unit 23 may be disposed in a processor such as a CPU or an ARM, and may be disposed in, for example, a single chip microcomputer or a system level chip, and the present invention is not specifically limited.
  • Each of the above units may be implemented by a central processing unit (CPU, Central Processing Unit), a microprocessor (MPU, MicroProcessor Unit), or a digital signal processor (DSP, Digital Signal Processor) located in the terminal, or Field-Programmable Gate Array (FPGA) implementation.
  • CPU Central Processing Unit
  • MPU Microprocessor
  • DSP Digital Signal Processor
  • FPGA Field-Programmable Gate Array
  • the apparatus for tracking the service signaling may also be stored in a computer readable storage medium if it is implemented in the form of a software function module and sold or used as a separate product.
  • the technical solution of the embodiments of the present invention may be embodied in the form of a software product in essence or in the form of a software product stored in a storage medium, including a plurality of instructions.
  • Make a computer device can be A personal computer, server, or network device, etc.) performs all or part of the methods described in various embodiments of the present invention.
  • the foregoing storage medium includes various media that can store program codes, such as a USB flash drive, a mobile hard disk, a read only memory (ROM), a magnetic disk, or an optical disk.
  • program codes such as a USB flash drive, a mobile hard disk, a read only memory (ROM), a magnetic disk, or an optical disk.
  • an embodiment of the present invention further provides a computer storage medium, wherein a computer program for executing a method for recording voice communication information according to an embodiment of the present invention is stored.
  • embodiments of the present invention can be provided as a method, system, or computer program product. Accordingly, the present invention can take the form of a hardware embodiment, a software embodiment, or a combination of software and hardware. Moreover, the invention can take the form of a computer program product embodied on one or more computer-usable storage media (including but not limited to disk storage and optical storage, etc.) including computer usable program code.
  • the computer program instructions can also be stored in a computer readable memory that can direct a computer or other programmable data processing device to operate in a particular manner, such that the instructions stored in the computer readable memory produce an article of manufacture comprising the instruction device.
  • the apparatus implements the functions specified in one or more blocks of a flow or a flow and/or block diagram of the flowchart.

Landscapes

  • Engineering & Computer Science (AREA)
  • Signal Processing (AREA)
  • Telephonic Communication Services (AREA)
  • Telephone Function (AREA)

Abstract

L'invention concerne un procédé pour enregistrer des informations de communication vocale. Le procédé consiste : dans un processus de communication vocale, à surveiller un flux vocal d'un appelé, et à extraire des informations de communication vocale du flux vocal de l'appelé en fonction d'une règle prédéfinie ; et à ajouter les informations de communication vocale à une application correspondante. L'invention concerne également un terminal et un support de stockage informatique.
PCT/CN2015/082130 2014-12-29 2015-06-23 Terminal, procédé d'enregistrement d'informations de communication vocale et support de stockage informatique WO2016107104A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201410848570.6 2014-12-29
CN201410848570.6A CN105812535A (zh) 2014-12-29 2014-12-29 一种记录语音通信信息的方法及终端

Publications (1)

Publication Number Publication Date
WO2016107104A1 true WO2016107104A1 (fr) 2016-07-07

Family

ID=56284061

Family Applications (2)

Application Number Title Priority Date Filing Date
PCT/CN2015/076208 WO2016107001A1 (fr) 2014-12-29 2015-04-09 Procédé d'enregistrement d'informations de communications vocales, terminal et support de stockage informatique
PCT/CN2015/082130 WO2016107104A1 (fr) 2014-12-29 2015-06-23 Terminal, procédé d'enregistrement d'informations de communication vocale et support de stockage informatique

Family Applications Before (1)

Application Number Title Priority Date Filing Date
PCT/CN2015/076208 WO2016107001A1 (fr) 2014-12-29 2015-04-09 Procédé d'enregistrement d'informations de communications vocales, terminal et support de stockage informatique

Country Status (2)

Country Link
CN (1) CN105812535A (fr)
WO (2) WO2016107001A1 (fr)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106357932A (zh) * 2016-11-22 2017-01-25 奇酷互联网络科技(深圳)有限公司 一种通话信息记录方法和移动终端

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106598712A (zh) * 2016-11-21 2017-04-26 捷开通讯(深圳)有限公司 一种在通话时启动应用程序的方法及通信终端
CN106531158A (zh) * 2016-11-30 2017-03-22 北京理工大学 一种应答语音的识别方法及装置
CN109377998B (zh) * 2018-12-11 2022-02-25 科大讯飞股份有限公司 一种语音交互方法及装置

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101282376A (zh) * 2008-06-02 2008-10-08 深圳华为通信技术有限公司 一种移动终端及实现移动终端自动保存数字号码的方法
CN103024123A (zh) * 2012-12-25 2013-04-03 广东欧珀移动通信有限公司 基于语音识别技术的电话号码存储装置及方法
CN103200328A (zh) * 2013-04-09 2013-07-10 上海斐讯数据通信技术有限公司 一种手机通话过程中的号码记录装置
CN103873654A (zh) * 2012-12-13 2014-06-18 深圳富泰宏精密工业有限公司 通话内容分析及提取系统及方法

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8060366B1 (en) * 2007-07-17 2011-11-15 West Corporation System, method, and computer-readable medium for verbal control of a conference call
US20110093266A1 (en) * 2009-10-15 2011-04-21 Tham Krister Voice pattern tagged contacts
CN103167120A (zh) * 2012-07-05 2013-06-19 深圳市金立通信设备有限公司 手机通话过程中快速查找联系人的系统及方法
CN103929551B (zh) * 2013-01-11 2017-10-31 上海掌门科技有限公司 基于通话的辅助方法及系统

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101282376A (zh) * 2008-06-02 2008-10-08 深圳华为通信技术有限公司 一种移动终端及实现移动终端自动保存数字号码的方法
CN103873654A (zh) * 2012-12-13 2014-06-18 深圳富泰宏精密工业有限公司 通话内容分析及提取系统及方法
CN103024123A (zh) * 2012-12-25 2013-04-03 广东欧珀移动通信有限公司 基于语音识别技术的电话号码存储装置及方法
CN103200328A (zh) * 2013-04-09 2013-07-10 上海斐讯数据通信技术有限公司 一种手机通话过程中的号码记录装置

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106357932A (zh) * 2016-11-22 2017-01-25 奇酷互联网络科技(深圳)有限公司 一种通话信息记录方法和移动终端

Also Published As

Publication number Publication date
WO2016107001A1 (fr) 2016-07-07
CN105812535A (zh) 2016-07-27

Similar Documents

Publication Publication Date Title
RU2694273C2 (ru) Основанная на местоположении передача аудиосообщений
US10827065B2 (en) Systems and methods for providing integrated computerized personal assistant services in telephony communications
US9661133B2 (en) Electronic device and method for extracting incoming/outgoing information and managing contacts
WO2016110217A1 (fr) Procédé, appareil, terminal et support de stockage permettant de sauvegarder un numéro pendant un appel
US9444927B2 (en) Methods for voice management, and related devices
CN110267113B (zh) 视频文件加工方法、系统、介质和电子设备
TW201334498A (zh) 通訊裝置及其控制方法
WO2016023317A1 (fr) Procédé et terminal de traitement d'informations vocales
US11200899B2 (en) Voice processing method, apparatus and device
US11587560B2 (en) Voice interaction method, device, apparatus and server
WO2016107104A1 (fr) Terminal, procédé d'enregistrement d'informations de communication vocale et support de stockage informatique
US9172795B1 (en) Phone call context setting
CN106775969B (zh) 一种应用程序的选择性运行方法及装置
CN110708430A (zh) 一种通话管理方法、通信终端及存储介质
WO2016095386A1 (fr) Procédé de traitement de message court et terminal de traitement de message court
CN110708431A (zh) 一种通话管理方法、通信终端及存储介质
CN110943908A (zh) 语音消息发送方法、电子设备及介质
CN110086941B (zh) 语音播放方法、装置及终端设备
TW201533654A (zh) 語音管理方法及系統,及其電腦程式產品
WO2020103562A1 (fr) Procédé et appareil de traitement vocal
US11477323B2 (en) Managing queued voice calls
KR101643808B1 (ko) 어플리케이션과 서버 간의 연동을 이용한 음성 서비스 제공 방법 및 그 시스템
CN110868347A (zh) 消息提示方法、装置和系统
CN107959720A (zh) 通话录音云存储的方法和系统
CN104571856A (zh) 一种终端

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 15874794

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 15874794

Country of ref document: EP

Kind code of ref document: A1