WO2018170816A1 - 一种呼叫控制处理方法和移动终端 - Google Patents

一种呼叫控制处理方法和移动终端 Download PDF

Info

Publication number
WO2018170816A1
WO2018170816A1 PCT/CN2017/077824 CN2017077824W WO2018170816A1 WO 2018170816 A1 WO2018170816 A1 WO 2018170816A1 CN 2017077824 W CN2017077824 W CN 2017077824W WO 2018170816 A1 WO2018170816 A1 WO 2018170816A1
Authority
WO
WIPO (PCT)
Prior art keywords
call
unit
answering
voiceprint information
current
Prior art date
Application number
PCT/CN2017/077824
Other languages
English (en)
French (fr)
Inventor
李卓希
Original Assignee
李卓希
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 李卓希 filed Critical 李卓希
Priority to PCT/CN2017/077824 priority Critical patent/WO2018170816A1/zh
Publication of WO2018170816A1 publication Critical patent/WO2018170816A1/zh

Links

Images

Definitions

  • the present invention relates to the field of mobile communications technologies, and in particular, to a call control processing method and a mobile terminal.
  • mobile terminals such as mobile phones have entered all aspects of people's lives.
  • the popularity of mobile terminals such as mobile phones has brought great convenience to people's lives and work.
  • people enjoy fast and convenient mobile phones.
  • communication is also suffering from the constant loss of communication privacy and security.
  • the prior art mobile phone is not only a terminal for voice communication, but also integrates different functions such as electronic finance and electronic payment, and the criminals send fraud to the user through malicious communication means such as telephone fraud, telephone intermediation and telephone harassment.
  • Information or harassment information although there are already applications that can block and report the number of the malicious communication means, in view of the large number of such types, the criminals can implement the malicious communication again by a low-cost means of changing the communication number. means. It can be seen that the shielding and reporting function of the above application cannot effectively identify the call received by the mobile terminal for the first time, thereby reducing the security of the user using the mobile phone communication.
  • the technical problem to be solved by the present invention is that the prior art mobile terminal application cannot identify the first received harassment call, and the user usually needs to answer the call to manually distinguish the content of the harassing call. This not only wastes the user's time, but also reduces the privacy and security of the user's communication using the mobile terminal.
  • an embodiment of the present invention provides a call control processing method, where the method includes:
  • S103 Perform matching processing on the extracted voiceprint information and voiceprint feature data in the voiceprint information base, and if the matching processing result is positive, cut off the current call, otherwise, switch the current call to normal answering. mode;
  • the determining whether the current calling number is a strange number is specifically acquiring a number of the current call, and performing matching processing on the current calling number and the number database, if the matching processing result is affirmative, the The number of the current call is a non-unfamiliar number; otherwise, the current call number is a strange number;
  • the virtual manual answering mode is capable of recognizing a voice keyword from a call of the calling party, so as to select a preset voice call content in the response database to perform the virtual call interaction;
  • the extracted voiceprint information is matched with the voiceprint feature data in the voiceprint information database by a template matching method, a nearest neighbor method, a neural network method, a Markov model method, and a polynomial classification method. One or more of them are implemented.
  • the embodiment of the present invention further provides a mobile terminal, where the mobile terminal includes: an operation unit, a storage unit, an answering unit, and a matching unit, wherein:
  • the storage unit maintains a number database and a voiceprint information base, wherein the number database is used to identify whether the number of the current call is a strange number, and the voiceprint information base is used to identify whether the current caller is a caller whose history is forbidden to be answered. ;
  • the answering unit is capable of performing switching between a virtual manual answering mode and a normal answering mode, and performing de-extraction of voiceprint information on the calling party;
  • the matching unit can perform matching processing on the number of the current call with the number database, and perform matching processing on the voiceprint information extracted by the answering unit and the voiceprint information base;
  • the operating unit is configured to control operations of the storage unit, the answering unit, and the matching unit;
  • the answering unit can identify a voice keyword from a call of the calling party, thereby selecting a preset voice call content in the response database to perform Virtual call interaction of the same call scene;
  • the answering unit is capable of feeding back to the operating unit an instruction regarding whether the current call is normal or abnormal;
  • the operating unit controls the answering unit to switch between the virtual manual answering mode and the normal answering mode based on the instruction;
  • the answering unit is further capable of recognizing whether the extracted voiceprint information is from a digital audio synthesized sound
  • the matching unit implements the extracted voiceprint information and the sound by one or more of a template matching method, a nearest neighboring method, a neural network method, a Markov model method, and a polynomial classification method.
  • the texture database performs matching processing.
  • the present invention provides a call control processing method and a mobile terminal by using the above technical solution.
  • the mobile terminal answers a call from a strange number through a virtual manual unit, and the virtual artificial unit can set a corresponding answering scene and acquire a call through the sampling unit.
  • the sound of the square, the analyzing unit receives the sampled sound information, analyzes the voiceprint signal therein, and determines whether the voiceprint signal matches the feature data in the voiceprint signal set in the storage unit, and if matched, cuts the call call And save the strange number to the interception number set; otherwise, the call call is transferred to the normal answer mode, which eliminates the need for the user to personally answer the strange number to identify whether the call is a trouble of harassing the call, thereby improving the mobile The call security of the terminal.
  • FIG. 1 is a schematic flowchart of a call control processing method according to an embodiment of the present invention
  • FIG. 2 is a schematic structural diagram of a mobile terminal according to an embodiment of the present invention.
  • FIG. 1 is a schematic flowchart of a call control processing method according to an embodiment of the present invention.
  • the call control processing method includes:
  • the operation unit acquires the number corresponding to the call, and performs matching processing on the number with the number database in the storage unit, where the number database may be, but is not limited to, the address book number in the mobile terminal.
  • the database of historical calls and short message records of the database or mobile terminal corresponds to the number database.
  • the number matching process may be performed by using a sequential traversal query or a binary tree traversal query.
  • the numbers in the number database may be arranged in the order of input time or numerical order.
  • the operation unit When the operation unit receives an instruction that the current call number is normal, the operation unit sends an instruction to the answering unit to allow the user to answer, and the answering unit transfers the current call to the normal answer mode, and the normal answer mode allows the user to perform the operation. Manually answering or hanging up; when the operating unit receives an instruction regarding the abnormality of the current calling number, the operating unit sends an instruction to prohibit the user from answering the answering unit, and the operating unit cuts off the current call and the normal answering mode.
  • the answering unit transfers the current call to the virtual manual answer mode, which may be a pre-recorded call mode with different call scenarios, the call mode is a plurality of segments of the user's voice call content
  • the answering database the answering unit is capable of interacting with the calling party by selecting a corresponding voice call content from the account database according to the content of the current call call.
  • the answering unit performs voice recognition of the keyword on the call of the calling party, thereby performing voice call content selection in the answering database according to the result of the voice recognition.
  • the answering unit is further capable of sampling and recording the call voice of the calling party, and the sampled recording may be a complete call recording about the calling party or a segment call recording about the calling party, and transmitting the sampled recording content to the analysis.
  • the analysis unit can extract the voiceprint information in the sampled recording content, because each person's respective vocal cords, nasal cavity, oral cavity, tongue, lips and lungs and other related vocal organs have different structures, and each person's speaking habits are different. This leads to the unique characteristics of each person's voice, and this feature can be distinguished and extracted from the sound spectrum, so that it is a unique symbol of each person's voice in the form of voiceprint.
  • the voiceprint recognition may be identified from aspects such as acoustic features or wording features, wherein the acoustic features may include, but are not limited to, a spectrum, a cepstrum, a formant, or a pitch, and the like features may include, but are not limited to, phonemes or intonations, and the like.
  • the caller may not be a real person to make a call, but is generated by means of digital audio synthesis. Since the voiceprint information of the digital audio synthesized sound is generally stable, the fluctuation of the sound is not obvious when the answering unit Determining that the current caller's call is formed by digital audio synthesis, the answering unit may also feed back to the operating unit an instruction regarding the current call talk exception, at which time the operating unit can suspend the progress of the current call call based on the command.
  • the answering unit transmits the voiceprint information extracted from the sample recording to the matching unit, and the matching unit matches the received voiceprint information with the voiceprint feature data of the voiceprint information base in the storage unit.
  • the matching processing method includes a template matching method, a nearest neighboring method, a neural network method, a Markov model method or a polynomial classification method, etc., wherein the template matching method utilizes dynamic time bending to align training and test feature sequences.
  • the method is mainly for fixing a fixed word in the voiceprint information; the most adjacent method is to identify N feature vectors of each feature vector in the voiceprint information that are closest to the training vector, and the calculation amount is usually large;
  • the neural network method uses a multi-layer radial basis function for display training to distinguish the speech target in the sampled recording; the Markov model method usually uses the Gaussian mixture model to analyze the voiceprint information, which has a good effect;
  • the polynomial classification method uses the polynomial model to calculate the voiceprint information, and its calculation accuracy is high, but the calculation amount is large and the calculation speed is higher. slow.
  • a person skilled in the art can select one or more of the above matching processing methods according to actual needs, thereby achieving an effect of improving the speed of the voiceprint signal matching calculation and reducing the memory required for the calculation process.
  • the matching unit When the matching unit identifies the extracted voiceprint information from the voiceprint information base, it indicates that the current call call and the mobile terminal have a historical harassment call from the same person, and the matching unit feeds back to the operation unit about the current Calling an abnormal call instruction, the operating unit suspends and cuts off the current call based on the instruction; when the matching unit cannot recognize the extracted voiceprint information from the voiceprint information base, the matching unit feeds back to the operating unit The current call is normal command, and the operating unit switches the current call call to the normal answer mode based on the command.
  • the user when the user recognizes that the current call call is an harassing call in the normal answer mode, the user may also identify the number of the current call call as a harassment number, and block the call from the harassment number, and the mobile terminal will also The voiceprint information extracted by the current call call is included in the voiceprint information base to implement data update of the voiceprint information base.
  • the call control processing method can set two different answering modes, namely, a virtual manual answering mode and a normal answering mode, to the mobile terminal, and the virtual manual answering mode can prevent the user from answering the first call from the strange number. The time is wasted and the privacy is leaked.
  • the virtual manual answering mode can also sample the call voice of the calling party and extract the voiceprint information therein to realize the voiceprint information identification of the current call call, thereby determining whether the call is moving and moving.
  • the historical harassing call received by the terminal is from the same person. It can be seen that the call control processing method can effectively avoid the user being harassed by strange incoming calls, and can also improve the communication security of the mobile terminal.
  • FIG. 2 is a schematic structural diagram of a mobile terminal according to an embodiment of the present invention.
  • the mobile terminal includes an operation unit, a storage unit, an answering unit, and a matching unit.
  • the storage unit maintains a number database and a voiceprint information database, which may be, but is not limited to, a directory number database in the mobile terminal or a history call and a short message record corresponding number database of the mobile terminal; the voiceprint information base
  • a voiceprint information database which may be, but is not limited to, a directory number database in the mobile terminal or a history call and a short message record corresponding number database of the mobile terminal; the voiceprint information base
  • the voiceprint information in the harassing phone received by the mobile terminal history is recorded, and the voiceprint information library can update the voiceprint data in real time.
  • the answering unit can perform a switching between a virtual manual answering mode and a normal answering mode, wherein the virtual manual answering mode can be a pre-recorded call mode with different call scenarios, and the call mode is a response with multiple segments of the user's voice call content.
  • the answering unit is capable of selecting a corresponding voice call content and a calling party from the account database according to the content of the current call call Line interaction.
  • the answering unit performs voice recognition of the keyword on the call of the calling party, so that the voice call content is selected in the answering database according to the result of the voice recognition; the normal answering mode allows the mobile to be moved.
  • the user of the terminal directly talks to the calling party.
  • the matching unit is capable of matching the number currently called by the mobile terminal with the number database, and matching the voiceprint information extracted by the answering unit with the voiceprint information base.
  • the matching process between the current call number and the number database may be performed by using a sequential traversal query or a binary tree traversal query; the extracted voiceprint information and the voiceprint information database may be matched by a template matching method, and the most adjacent Method, neural network method, Markov model method or polynomial classification method.
  • the operating unit is configured to control the operations of the storage unit, the answering unit and the matching unit, such as according to the matching processing result of the matching unit with respect to the number of the current call and the number database or about the extracted voiceprint information and the voiceprint information base.
  • the processing result is matched to control the switching of the answering unit between the virtual manual answering mode and the normal answering mode.
  • the mobile terminal can be used to perform the call control processing method shown in FIG.
  • FIG. 1 For the meaning and examples of the terms involved in the embodiment, reference may be made to the corresponding embodiment of FIG. 1. I will not repeat them here.
  • the mobile terminal can implement switching between two different answering modes: a virtual manual answering mode and a normal answering mode, and the virtual manual answering mode can avoid time waste caused by the user receiving the first call from the strange number. And the privacy leak, and the virtual manual answering mode can also sample the call voice of the calling party and extract the voiceprint information therein to realize the voiceprint information identification of the current call call, thereby determining whether the call is received with the history of the mobile terminal.
  • the harassing call is from the same person, so the mobile terminal has high communication security.
  • the storage medium may be a magnetic disk, an optical disk, a read-only memory (ROM), or a random access memory (RAM).

Landscapes

  • Telephone Function (AREA)

Abstract

本发明实施例公开了一种呼叫控制处理方法,所述方法包括S101、确定当前呼叫号码是否为陌生号码,若是,则进入虚拟人工接听模式,若否,则进入正常接听模式;S102、在所述虚拟人工接听模式中,获取呼叫方的声音信息,并提取所述声音信息中的声纹信息;S103、将所述提取的声纹信息与声纹信息库中的声纹特征数据进行匹配处理,若所述匹配处理结果为肯定,则切断当前通话,否则,将所述当前通话切换至正常接听模式。本发明实施例还公开了一种移动终端。采用本发明,能提供一种安全性和隐私性都较高的移动通行方法。

Description

一种呼叫控制处理方法和移动终端 技术领域
本发明涉及移动通信技术领域,尤其涉及一种呼叫控制处理方法和移动终端。
背景技术
随着移动通信技术的发展,手机等通信移动终端已经进入人们生活的方方面面,手机等通信移动终端的普及,为人们生活和工作带来极大的便利,然而人们在享受手机带来快速便捷的通信同时,也苦于通信隐私性和安全性不断受到损害。现有技术的手机不仅仅是用于语音通信的终端,手机上还集成有如电子金融和电子支付等不同的功能,不法分子通过电话诈骗、电话中介和电话骚扰等恶意通信手段来向用户发送诈骗信息或者骚扰信息,虽然已经存在能够屏蔽和举报作出该恶意通信手段号码的应用程序,但是鉴于该类型号码的数量庞大,不法分子通过变换通信号码这种低成本的手段就能够再次实施该恶意通信手段。可见,上述应用程序的屏蔽和举报功能不能有效地对移动终端首次接受到的呼叫进行识别,从而降低了用户使用手机通信的安全性。
发明内容
针对上述现有技术的缺陷,本发明所要解决的技术问题在于现有技术的移动终端应用程序不能针对首次接收到的骚扰呼叫进行识别,用户通常需要接听会才能人工分辨出该骚扰呼叫的内容,这不仅浪费用户的时间,同时也降低了用户使用移动终端进行通信的隐私和安全性。
为了解决上述技术问题,本发明实施例提供一种呼叫控制处理方法,其特征在于,所述方法包括:
S101、确定当前呼叫号码是否为陌生号码,若是,则进入虚拟人工接听模式,若否,则进入正常接听模式;
S102、在所述虚拟人工接听模式中,获取呼叫方的声音信息,并提取所述声音信息中的声纹信息;
S103、将所述提取的声纹信息与声纹信息库中的声纹特征数据进行匹配处理,若所述匹配处理结果为肯定,则切断当前通话,否则,将所述当前通话切换至正常接听模式;
进一步,在S101中,所述确定当前呼叫号码是否为陌生号码具体为获取当前呼叫的号码,并将所述当前呼叫的号码与号码数据库进行匹配处理,若所述匹配处理结果为肯定,则所述当前呼叫的号码为非陌生号码,否则,所述当前呼叫的号码为陌生号码;
进一步,在S101中,在进入所述虚拟人工接听模式后,能够通过设定不同的通话场景来实现与所述呼叫方的虚拟通话互动;
其中所述虚拟人工接听模式能够从所述呼叫方的通话中识别语音关键词,以此在应答数据库中选择预设的语音通话内容进行所述虚拟通话互动;
进一步,在S102中,当所述提取的声纹信息被识别为来自数字音频合成的声音后,反馈所述当前通话异常的指令,并中止切断所述当前通话;
进一步,在S103中,所述提取的声纹信息与声纹信息库中的声纹特征数据进行匹配处理通过模板匹配方法、最相邻方法、神经网络法、马尔可夫模型法和多项式分类法中的一种或多种来实现。
相应地,本发明实施例还提供一种移动终端,所述移动终端包括:操作单元、存储单元、接听单元和匹配单元,其特征在于:
所述存储单元保持有号码数据库和声纹信息库,所述号码数据库用于辨别当前呼叫的号码是否为陌生号码,所述声纹信息库用于辨别当前呼叫方是否为历史禁止接听的呼叫方;
所述接听单元能够实施虚拟人工接听模式和正常接听模式的切换,以及对所述呼叫方进行声纹信息的去提取;
所述匹配单元能够将所述当前呼叫的号码与所述号码数据库进行匹配处理,以及将所述接听单元提取的声纹信息与所述声纹信息库进行匹配处理;
所述操作单元用于控制所述存储单元、所述接听单元和所述匹配单元的工作;
进一步,所述接听单元在所述虚拟人工接听模式下,能够从所述呼叫方的通话中识别语音关键词,以此在应答数据库中选择预设的语音通话内容进行不 同的通话场景的虚拟通话互动;
进一步,所述接听单元能够向所述操作单元反馈关于当前呼叫正常或者异常的指令;
所述操作单元基于所述指令控制所述接听单元在所述虚拟人工接听模式和所述正常接听模式之间进行切换;
进一步,所述接听单元还能够识别所述提取的声纹信息是否来自数字音频合成的声音;
进一步,所述匹配单元通过模板匹配方法、最相邻方法、神经网络法、马尔可夫模型法和多项式分类法中的一种或多种来实现将所述提取的声纹信息与所述声纹信息库进行匹配处理。
本发明通过上述技术方案提供一种呼叫控制处理方法和移动终端,该移动终端通过虚拟人工单元来接听来自陌生号码的呼叫,该虚拟人工单元能够设定相应的接听场景,同时通过取样单元获取呼叫方的声音,分析单元接收取样的声音信息后分析其中的声纹信号,并判断该声纹信号是否与存储单元中的声纹信号集合中的特征数据相匹配,若匹配,则切断该呼叫通话,并将该陌生号码保存到拦截号码集合中,否则,将该呼叫通话转接至正常接听模式,其免去了用户需要亲自接听陌生号码才能辨别该通话是否为骚扰呼叫的麻烦,从而提高移动终端的通话安全性。
附图说明
为了更清楚地说明本发明实施例或现有技术中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本发明的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。
图1是本发明实施例提供的一种呼叫控制处理方法的流程示意图;
图2是本发明实施例提供的一种移动终端的结构示意图。
具体实施方式
下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清 楚、完整地描述,显然,所描述的实施例仅仅是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有作出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。
参见图1,为本发明实施例提供的一种呼叫控制处理方法的流程示意图,在本发明实施例中,所述呼叫控制处理方法包括:
S101、确定当前呼叫号码是否为陌生号码,若是,则进入虚拟人工接听模式,若否,则进入正常接听模式;
具体而言,移动终端接收到呼叫后,操作单元获取该呼叫对应的号码,并将该号码与存储单元中的号码数据库进行匹配处理,该号码数据库可为但不限于移动终端中的通信录号码数据库或者移动终端的历史通话和短信记录对应号码数据库。其中该号码匹配处理可采用顺序遍历查询或者二叉树遍历查询的方式进行,该号码数据库中的号码可按照录入时间顺序或者数值大小顺序的形式排布。当匹配单元在该号码数据中查询到当前呼叫的号码时,即该当前呼叫的号码具有正常接入权限,则向该操作单元反馈关于当前呼叫号码正常的指令,当匹配单元在该号码数据中无法查询到当前呼叫的号码时,即该当前呼叫的号码为陌生号码,其不具有正常接入权限,则向该操作单元反馈关于当前呼叫号码异常的指令。
当该操作单元接收到关于当前呼叫号码正常的指令时,该操作单元向接听单元发送允许用户接听的指令,此时接听单元将当前呼叫转接到正常接听模式中,该正常接听模式允许用户进行人工接听或者挂断等操作;当该操作单元接收到关于当前呼叫号码异常的指令时,该操作单元向该接听单元发送禁止用户接听的指令,此时该操作单元切断当前呼叫与该正常接听模式的链路,该接听单元将当前呼叫转接至虚拟人工接听模式中,该虚拟人工接听模式可以是用户预先录制的具有不同通话场景的通话模式,该通话模式为具有多段关于用户的语音通话内容的应答数据库,该接听单元能够根据当前呼叫通话的内容从该应当数据库中选择相应的语音通话内容与呼叫方进行互动。在该虚拟人工接听模式中,该接听单元对呼叫方的通话进行关键词的语音识别,从而根据该语音识别的结果在该应答数据库中进行语音通话内容的选择。
S102、在虚拟人工接听模式中,获取呼叫方的声音信息,并提取声音信息 中的声纹信息;
具体而言,该接听单元还能够对呼叫方得通话声音进行采样录音,该采样录音可以是关于呼叫方的完整通话录音或者是关于呼叫方的片段通话录音,并将该采样录音内容传送到分析单元中。该分析单元能够提取该采样录音内容中的声纹信息,由于每个人各自的声带、鼻腔、口腔、舌头、唇和肺部等相关发声器官的结构并不相同,以及每个人的说话习惯迥异,这导致每个人的说话声音存在独特的特点,并且该特点可从声谱图中分辨和提取出来,从而以声纹的形式作为每个人声音的独特标志。利用声纹这一特点,在仅仅与对方通话的情况下也能分别出对方的身份,这种方式也就做声纹识别。该声纹识别可以从声学特征或者用辞特征等方面进行识别,其中声学特征可包括但不限于频谱、倒频谱、共振峰或者基音等,而用辞特征可包括但不限于音素或者语调等。
实际上,呼叫方可能并不是真实存在的人进行通话,而是通过数字音频合成的方式产生的,由于数字音频合成的声音的声纹信息通常都比较平稳,其起伏变化并不明显当接听单元确定当前呼叫方的通话是通过数字音频合成来形成的,该接听单元也可以向该操作单元反馈关于当前呼叫通话异常的指令,此时该操作单元能够基于该指令中止当前呼叫通话的进行。
S103、将提取的声纹信息与声纹信息库中的声纹特征数据进行匹配处理,若匹配处理结果为肯定,则切断当前通话,否则,将当前通话切换至正常接听模式;
具体而言,该接听单元将从采样录音中提取的声纹信息传送至该匹配单元中,该匹配单元将接收到的声纹信息与该存储单元中声纹信息库的声纹特征数据进行匹配处理。该匹配处理的方式包括模板匹配方法、最相邻方法、神经网络法、马尔可夫模型法或者多项式分类法等,其中,该模板匹配法是利用动态时间弯折以对准训练和测试特征序列,其主要针对声纹信息中的固定用辞进行的;该最相邻方法是识别声纹信息中每个特征矢量中与训练矢量最相近的N个特征矢量,其运算量通常较大;该神经网络法是采用多层径向基函数进行显示训练来区分采样录音中的说话目标;该马尔可夫模型法是通常采用高斯混合模型来进行声纹信息的分析,其具有较好的效果;该多项式分类法是利用多项式模型来进行声纹信息的计算,其计算精度较高,但是计算量大且计算速度较 慢。本领域的技术人员可根据实际需要来选择上述匹配处理方法中的一种或者多种,从而实现提高声纹信号匹配计算速度和降低计算过程所需内存的效果。
当该匹配单元从该声纹信息库中识别出该提取的声纹信息,则表明当前呼叫通话与移动终端某一历史骚扰通话皆出自同一人,此时该匹配单元向该操作单元反馈关于当前呼叫通话异常的指令,该操作单元基于该指令中止和切断当前通话;当该匹配单元不能从该声纹信息库中识别出该提取的声纹信息,此时该匹配单元向该操作单元反馈关于当前呼叫通话正常的指令,该操作单元基于该指令将当前呼叫通话切换至正常接听模式中。
此外,当用户在该正常接听模式中辨别出当前呼叫通话为骚扰电话时,用户还可对当前呼叫通话的号码标识为骚扰号码,并对来自该骚扰号码的呼叫进行屏蔽,同时移动终端还将当前呼叫通话提取的声纹信息收录到该声纹信息库中,以实现该声纹信息库的数据更新。
从上述实施例可以看出,该呼叫控制处理方法能够对移动终端设置虚拟人工接听模式和正常接听模式这两种不同的接听方式,该虚拟人工接听模式能够避免用户因接听来自陌生号码的首次呼叫而造成时间浪费和隐私泄露,同时该虚拟人工接听模式还能够对呼叫方的通话声音进行采样并提取其中的声纹信息,以实现当前呼叫通话的声纹信息识别,从而判断该呼叫是否与移动终端接收的历史骚扰电话来自同一人,可见该呼叫控制处理方法能够有效避免用户受到陌生来电的骚扰,并且还能提高移动终端的通信安全性。
参见图2,为本发明实施例提供的一种移动终端的结构示意图,在本发明实施例中,该移动终端包括操作单元、存储单元、接听单元和匹配单元。
其中,该存储单元保持有号码数据库和声纹信息库,该号码数据库可为但不限于移动终端中的通信录号码数据库或者移动终端的历史通话和短信记录对应号码数据库;该声纹信息库中收录中移动终端历史接收到的骚扰电话中的声纹信息,并能该声纹信息库能够对实时对其中声纹数据进行更新。
该接听单元能够实施虚拟人工接听模式和正常接听模式的切换,其中该虚拟人工接听模式可以是用户预先录制的具有不同通话场景的通话模式,该通话模式为具有多段关于用户的语音通话内容的应答数据库,该接听单元能够根据当前呼叫通话的内容从该应当数据库中选择相应的语音通话内容与呼叫方进 行互动。在该虚拟人工接听模式中,该接听单元对呼叫方的通话进行关键词的语音识别,从而根据该语音识别的结果在该应答数据库中进行语音通话内容的选择;该正常接听模式则是允许移动终端的用户直接与呼叫方进行通话。
该匹配单元能够将移动终端当前呼叫的号码与该号码数据库进行匹配处理,以及将该接听单元提取的声纹信息与该声纹信息库进行匹配处理。其中,当前呼叫的号码与该号码数据库进行匹配处理可采用顺序遍历查询或者二叉树遍历查询的方式进行;该提取的声纹信息与该声纹信息库进行匹配处理可采用模板匹配方法、最相邻方法、神经网络法、马尔可夫模型法或者多项式分类法等方法进行。
该操作单元用于控制存储单元、接听单元和匹配单元的工作,比如根据该匹配单元关于当前呼叫的号码与该号码数据库的匹配处理结果或者关于该提取的声纹信息与该声纹信息库的匹配处理结果来控制接听单元在该虚拟人工接听模式和正常接听模式之间的切换。该移动终端可以用于执行图1所示的呼叫控制处理方法。关于本实施例涉及的术语的含义以及举例,可以参考图1对应的实施例。此处不再赘述。
从上述实施例可以看出,该移动终端能够实现虚拟人工接听模式和正常接听模式这两种不同接听方式的切换,该虚拟人工接听模式能够避免用户因接听来自陌生号码的首次呼叫而造成时间浪费和隐私泄露,同时该虚拟人工接听模式还能够对呼叫方的通话声音进行采样并提取其中的声纹信息,以实现当前呼叫通话的声纹信息识别,从而判断该呼叫是否与移动终端接收的历史骚扰电话来自同一人,因此该移动终端具有较高的通信安全性。
本领域普通技术人员可以理解实现上述实施例方法中的全部或部分流程,是可以通过计算机程序来指令相关的硬件来完成,所述的程序可存储于一计算机可读取存储介质中,该程序在执行时,可包括如上述各方法的实施例的流程。其中,所述的存储介质可为磁碟、光盘、只读存储记忆体(Read-Only Memory,ROM)或随机存储记忆体(Random Access Memory,RAM)等。
以上所揭露的仅为本发明一种较佳实施例而已,当然不能以此来限定本发明之权利范围,本领域普通技术人员可以理解实现上述实施例的全部或部分流程,并依本发明权利要求所作的等同变化,仍属于发明所涵盖的范围。

Claims (10)

  1. 一种呼叫控制处理方法,其特征在于,所述方法包括:
    S101、确定当前呼叫号码是否为陌生号码,若是,则进入虚拟人工接听模式,若否,则进入正常接听模式;
    S102、在所述虚拟人工接听模式中,获取呼叫方的声音信息,并提取所述声音信息中的声纹信息;
    S103、将所述提取的声纹信息与声纹信息库中的声纹特征数据进行匹配处理,若所述匹配处理结果为肯定,则切断当前通话,否则,将所述当前通话切换至正常接听模式。
  2. 根据权利要求1所述的方法,其特征在于,
    在S101中,所述确定当前呼叫号码是否为陌生号码具体为获取当前呼叫的号码,并将所述当前呼叫的号码与号码数据库进行匹配处理,若所述匹配处理结果为肯定,则所述当前呼叫的号码为非陌生号码,否则,所述当前呼叫的号码为陌生号码。
  3. 根据权利要求1所述的方法,其特征在于,
    在S101中,在进入所述虚拟人工接听模式后,能够通过设定不同的通话场景来实现与所述呼叫方的虚拟通话互动;
    其中所述虚拟人工接听模式能够从所述呼叫方的通话中识别语音关键词,以此在应答数据库中选择预设的语音通话内容进行所述虚拟通话互动。
  4. 根据权利要求1所述的方法,其特征在于,
    在S102中,当所述提取的声纹信息被识别为来自数字音频合成的声音后,反馈所述当前通话异常的指令,并中止切断所述当前通话。
  5. 根据权利要求1所述的方法,其特征在于,
    在S103中,所述提取的声纹信息与声纹信息库中的声纹特征数据进行匹 配处理通过模板匹配方法、最相邻方法、神经网络法、马尔可夫模型法和多项式分类法中的一种或多种来实现。
  6. 一种移动终端,所述移动终端包括:操作单元、存储单元、接听单元和匹配单元,其特征在于:
    所述存储单元保持有号码数据库和声纹信息库,所述号码数据库用于辨别当前呼叫的号码是否为陌生号码,所述声纹信息库用于辨别当前呼叫方是否为历史禁止接听的呼叫方;
    所述接听单元能够实施虚拟人工接听模式和正常接听模式的切换,以及对所述呼叫方进行声纹信息的去提取;
    所述匹配单元能够将所述当前呼叫的号码与所述号码数据库进行匹配处理,以及将所述接听单元提取的声纹信息与所述声纹信息库进行匹配处理;
    所述操作单元用于控制所述存储单元、所述接听单元和所述匹配单元的工作。
  7. 根据权利要求6所述的移动终端,其特征在于,
    所述接听单元在所述虚拟人工接听模式下,能够从所述呼叫方的通话中识别语音关键词,以此在应答数据库中选择预设的语音通话内容进行不同的通话场景的虚拟通话互动。
  8. 根据权利要求6所述的移动终端,其特征在于,
    所述接听单元能够向所述操作单元反馈关于当前呼叫正常或者异常的指令;
    所述操作单元基于所述指令控制所述接听单元在所述虚拟人工接听模式和所述正常接听模式之间进行切换。
  9. 根据权利要求6所述的移动终端,其特征在于,
    所述接听单元还能够识别所述提取的声纹信息是否来自数字音频合成的声音。
  10. 根据权利要求6所述的移动终端,其特征在于,
    所述匹配单元通过模板匹配方法、最相邻方法、神经网络法、马尔可夫模型法和多项式分类法中的一种或多种来实现将所述提取的声纹信息与所述声纹信息库进行匹配处理。
PCT/CN2017/077824 2017-03-23 2017-03-23 一种呼叫控制处理方法和移动终端 WO2018170816A1 (zh)

Priority Applications (1)

Application Number Priority Date Filing Date Title
PCT/CN2017/077824 WO2018170816A1 (zh) 2017-03-23 2017-03-23 一种呼叫控制处理方法和移动终端

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2017/077824 WO2018170816A1 (zh) 2017-03-23 2017-03-23 一种呼叫控制处理方法和移动终端

Publications (1)

Publication Number Publication Date
WO2018170816A1 true WO2018170816A1 (zh) 2018-09-27

Family

ID=63586180

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2017/077824 WO2018170816A1 (zh) 2017-03-23 2017-03-23 一种呼叫控制处理方法和移动终端

Country Status (1)

Country Link
WO (1) WO2018170816A1 (zh)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7453992B2 (en) * 2005-04-14 2008-11-18 International Business Machines Corporation System and method for management of call data using a vector based model and relational data structure
CN102572839A (zh) * 2010-12-14 2012-07-11 中国移动通信集团四川有限公司 一种控制语音通信的方法和系统
CN104683606A (zh) * 2015-02-06 2015-06-03 深圳市中兴移动通信有限公司 通话数据处理方法及装置
CN104751848A (zh) * 2013-12-25 2015-07-01 三亚中兴软件有限责任公司 通话声音识别方法及装置
CN105100363A (zh) * 2015-06-29 2015-11-25 小米科技有限责任公司 信息处理方法、装置及终端
CN105120068A (zh) * 2015-07-16 2015-12-02 百度在线网络技术(北京)有限公司 一种语音通信的接听方法及装置

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7453992B2 (en) * 2005-04-14 2008-11-18 International Business Machines Corporation System and method for management of call data using a vector based model and relational data structure
CN102572839A (zh) * 2010-12-14 2012-07-11 中国移动通信集团四川有限公司 一种控制语音通信的方法和系统
CN104751848A (zh) * 2013-12-25 2015-07-01 三亚中兴软件有限责任公司 通话声音识别方法及装置
CN104683606A (zh) * 2015-02-06 2015-06-03 深圳市中兴移动通信有限公司 通话数据处理方法及装置
CN105100363A (zh) * 2015-06-29 2015-11-25 小米科技有限责任公司 信息处理方法、装置及终端
CN105120068A (zh) * 2015-07-16 2015-12-02 百度在线网络技术(北京)有限公司 一种语音通信的接听方法及装置

Similar Documents

Publication Publication Date Title
US20090094029A1 (en) Managing Audio in a Multi-Source Audio Environment
US6219407B1 (en) Apparatus and method for improved digit recognition and caller identification in telephone mail messaging
CN1082759C (zh) 电话应答机
US8391445B2 (en) Caller identification using voice recognition
US6064963A (en) Automatic key word or phrase speech recognition for the corrections industry
CN109873907B (zh) 呼叫处理方法、装置、计算机设备及存储介质
CN111294471B (zh) 一种智能电话应答方法和系统
CN107409061A (zh) 语音总结程序
WO2014069076A1 (ja) 会話分析装置及び会話分析方法
CN103903627A (zh) 一种语音数据的传输方法及装置
CN112188017A (zh) 信息交互方法及信息交互系统、处理设备、存储介质
CN111325039B (zh) 基于实时通话的语言翻译方法、系统、程序和手持终端
US10199035B2 (en) Multi-channel speech recognition
CN110062097B (zh) 骚扰电话处理方法、装置、移动终端以及存储介质
CN110381221A (zh) 呼叫处理方法、装置、系统、设备和计算机存储介质
CN108391020A (zh) 一种通话控制方法、装置、设备及存储介质
US11783804B2 (en) Voice communicator with voice changer
CN110517697A (zh) 用于交互式语音应答的提示音智能打断装置
CN110602334A (zh) 一种基于人机协同的智能外呼方法及系统
CN113744742A (zh) 对话场景下的角色识别方法、装置和系统
CN107680592A (zh) 一种移动终端语音识别方法、及移动终端及存储介质
CN108364638A (zh) 一种语音数据处理方法、装置、电子设备和存储介质
CN109616116B (zh) 通话系统及其通话方法
WO2018170816A1 (zh) 一种呼叫控制处理方法和移动终端
CN110865789A (zh) 基于语音识别的智能开启麦克风的方法及系统

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 17901476

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 17901476

Country of ref document: EP

Kind code of ref document: A1