WO2016062090A1 - Short message filtering method and short message filtering device - Google Patents

Short message filtering method and short message filtering device

Info

Publication number
WO2016062090A1
Authority
WO
WIPO (PCT)
Prior art keywords
pending text
text message
filtering
short message
received
Prior art date
Application number
PCT/CN2015/080344
Other languages
English (en)
French (fr)
Inventor
黄建
王飞
Original Assignee
中兴通讯股份有限公司
Priority date
Filing date
Publication date
Application filed by 中兴通讯股份有限公司
Publication of WO2016062090A1

Classifications

    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04M: TELEPHONIC COMMUNICATION
    • H04M3/00: Automatic or semi-automatic exchanges
    • H04M3/22: Arrangements for supervision, monitoring or testing

Definitions

  • This document relates to the field of mobile communication technologies, and in particular, to a short message filtering method and a short message filtering device.
  • SMS has been welcomed by more and more users in the past few years and has achieved rapid development.
  • Like e-mail, however, mobile text messaging is increasingly plagued by large volumes of spam.
  • At present, China has more than 600 million mobile phone users, who send an average of 800 million short messages per day, and each person receives about 8 spam messages per week.
  • For mobile users, spam messages not only seriously disrupt daily life but also endanger personal privacy.
  • For operators, the proliferation of spam messages wastes a great deal of investment in infrastructure such as SMS centers and increases the risk of malicious attacks on the network.
  • For this reason, the relevant authorities are stepping up the drafting of laws and regulations, and operators, who pay more and more attention to spam, have set up spam filtering systems that use technical means to filter spam messages, striving to create a sustained, orderly, and healthy environment for the development of the SMS service.
  • Besides filtering some messages directly according to its rules, the spam filtering system must send messages that may have been misjudged, such as suspicious messages, for manual review.
  • Although the spam filtering system has already filtered some messages directly, the volume that still has to be sent for manual review remains very large (more than 200,000 per day), which puts great pressure on the manual review work.
  • An embodiment of the invention provides a short message filtering method and a short message filtering device that can solve the technical problem of the excessive workload of manually reviewing spam messages.
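  • A short message filtering method provided by an embodiment of the invention includes the following steps: obtaining a pending text message that does not meet the audit rule of the first auditing system; determining whether the received pending text message meets a preset filtering rule; if so, filtering the pending text message; if not, sending the pending text message to the second auditing system.
  • Optionally, the step of determining whether the received pending text message meets a preset filtering rule comprises: determining whether the character length of the received pending text message is less than a preset value.
  • Optionally, before the pending text message is sent to the second auditing system, the method further includes: determining whether the called number of the received pending text message matches a preset number segment library; if so, filtering the pending text message; if not, performing the step of sending the pending text message to the second auditing system.
  • Optionally, before the step of sending the pending text message to the second auditing system is performed, the method further includes: determining whether the calling number and the short message content of the received pending text message already exist in the memory library; if so, filtering the pending text message; if not, performing the step of sending the pending text message to the second auditing system.
  • Optionally, before the step of determining whether the calling number and the short message content of the received pending text message already exist in the memory library, the method further includes: creating, in the memory library, a memory table that includes caller information and a storage time, where the caller information includes a calling number and the short message content sent by that calling number; and, if the caller information of the received pending text message does not exist in the memory library, inserting the caller information into the memory table.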
  • the embodiment of the invention further provides a short message filtering device, the device comprising:
  • the obtaining module is set to obtain a pending text message that does not meet the first auditing system auditing rule
  • a determining module configured to determine whether the received pending text message meets a preset filtering rule
  • a filtering module configured to: when the received pending text message meets a preset filtering rule, filter the pending text message;
  • the sending module is configured to send the pending text message to the second auditing system when the received pending text message does not meet the preset filtering rule.
  • the determining module includes:
  • the first determining unit is configured to determine whether the length of the received character of the pending text message is less than a preset value.
  • the determining module further includes:
  • a second determining unit configured to determine, when the length of the received character of the pending text message is not less than a preset value, whether the received called number of the pending text message matches the preset number segment library.
  • the determining module further includes:
  • a third determining unit configured to determine, when the called number of the received pending text message does not match the preset number segment library, whether the calling number and the short message content of the received pending text message already exist in the memory library.
  • the short message filtering device further includes:
  • a table building module configured to create, in the memory library, a memory table that includes caller information and a storage time, where the caller information includes a calling number and the short message content sent by that calling number;
  • an information insertion module configured to insert the caller information into the memory table when the caller information of the received pending text message does not exist in the memory library.
  • An embodiment of the invention further provides a computer readable storage medium storing program instructions which, when executed, implement the above method.
  • In the embodiments of the invention, a pending text message that does not meet the audit rule of the first auditing system is obtained, and it is determined whether the received pending text message meets a preset filtering rule; if so, the pending text message is filtered; if not, it is sent to the second auditing system. The first auditing system is a spam short message filtering system and the second auditing system is a manual auditing system, so pending text messages that have passed the spam review are filtered further, which effectively reduces the number of messages sent to the manual review system and the manual review workload.
  • FIG. 1 is a schematic flowchart of the short message filtering method according to Embodiment 1 of the present invention.
  • FIG. 2 is a schematic flowchart of the short message filtering method according to Embodiment 2 of the present invention.
  • FIG. 3 is a schematic flowchart of the short message filtering method according to Embodiment 3 of the present invention.
  • FIG. 4 is a schematic diagram of the functional modules of the short message filtering device according to Embodiment 4 of the present invention.
  • FIG. 5 is a schematic diagram of the refined functional modules of the determining module of the short message filtering device according to Embodiment 5 of the present invention.
  • FIG. 6 is a schematic diagram of the functional modules of the short message filtering device according to Embodiment 6 of the present invention.
  • FIG. 7 is a structural diagram of the short message intelligent arbitration system in the short message filtering device of Application Example 1 of the present invention.
  • FIG. 8 is a general flowchart of short message intelligent arbitration in the short message filtering method of Application Example 2 of the present invention.
  • FIG. 9 is a schematic diagram of the "same caller, same content" arbitration process in the short message filtering method of Application Example 2 of the present invention.
  • This embodiment provides a short message filtering method.
  • FIG. 1 is a schematic flowchart of a short message filtering method according to an embodiment of the present disclosure.
  • the short message filtering method includes:
  • Step S10 Obtain a pending text message that does not meet the first audit system audit rule
  • the first auditing system is a spam filtering system, and the to-be-reviewed short message includes spam messages and non-spam messages, wherein a large part of the non-spam messages does not need to be manually reviewed, so it is necessary to judge according to preset filtering rules.
  • Step S20 determining whether the received pending text message meets a preset filtering rule
  • Step S30 if yes, filtering the pending text message
  • Step S40 if no, sending the pending text message to the second auditing system.
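  • In this embodiment, a pending text message that does not meet the audit rule of the first auditing system is obtained, and it is determined whether it meets a preset filtering rule; if so, it is filtered; if not, it is sent to the second auditing system. The first auditing system is a spam short message filtering system and the second auditing system is a manual auditing system, so pending text messages that have passed the spam review are filtered further, which effectively reduces the number of messages sent to the manual review system and the manual review workload.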
  • FIG. 2 is a schematic flowchart of a short message filtering method according to an embodiment of the present disclosure.
  • the short message filtering method includes:
  • Step S10 Obtain a pending text message that does not meet the first audit system audit rule
  • the first auditing system is a spam filtering system, and the to-be-reviewed short message includes spam messages and non-spam messages, wherein a large part of the non-spam messages does not need to be manually reviewed, so it is necessary to judge according to preset filtering rules.
  • Step S201: determining whether the character length of the received pending text message is less than a preset value.
  • The messages that remain after the spam filtering system has run may still include non-spam messages, for example pending text messages whose character length is less than or equal to the preset value (for example, 10 characters). Such a message is too short to carry much information, so it is generally not spam and does not need to be reviewed again by the manual review system.
  • In addition, the number of pending text messages judged in step S201 under the character-length filtering rule, and the number filtered, are recorded.
  • If so, step S30 is performed to filter the pending text message.
  • If not, step S202 is performed to determine whether the called number of the received pending text message matches the preset number segment library.
  • The preset number segment library stores the service code number segments of all SPs (Service Providers) or of the well-known ones, generally 4-digit or 5-digit segments. It is first determined whether the called number of the received pending text message matches a 4-digit segment; if it does, the message is judged to be non-spam and is filtered directly. If it does not, it is determined whether the called number matches a 5-digit segment; if it does, the message is filtered, and if not, the method goes on to determine whether the calling number and the short message content of the received pending text message already exist in the memory library. In addition, the number of pending text messages judged by character length, and the number filtered, are recorded. In addition, the number of pending text messages judged in step S202 under the number-segment filtering rule, and the number filtered, are recorded.
  • If so, step S30 is performed to filter the pending text message.
  • If not, step S203 is performed to determine whether the calling number and the short message content of the received pending text message already exist in the memory library.
  • If the calling number of the pending text message already exists in the memory library and the content sent by that calling number is the same as before, it is determined that the pending text message has already been sent to the manual auditing system for review, so it is filtered directly and not sent to the manual review system again.
  • If the memory library contains neither the calling number of the pending text message nor the content sent by that calling number, the pending text message is sent to the manual auditing system.
  • In addition, the number of pending text messages judged in step S203 under the filtering rule of whether the calling number and the short message content already exist in the memory library, and the number filtered, are recorded.
  • If so, step S30 is performed to filter the pending text message.
  • If not, step S40 is performed to send the pending text message to the second auditing system.
  • In the second embodiment, the pending text message passes through three filtering rules that filter out non-spam messages, which reduces the number of messages sent to the manual review system and the manual review workload.
  • FIG. 3 is a schematic flowchart of the short message filtering method according to the embodiment.
  • This embodiment is similar to Embodiment 2; the difference is that, before step S203, the method further includes:
  • Step S501: creating, in the memory library, a memory table that includes caller information and a storage time, where the caller information includes a calling number and the short message content sent by that calling number.
  • A memory table with two fields, hashindex (hash index) and insert time, is created in the memory library, where hashindex = hash(calling number, short message content).
  • Step S502: if the caller information of the received pending text message does not exist in the memory library, inserting the caller information into the memory table.
  • Using (calling number, short message content) as the key, the memory library is queried for a record related to the caller information of the pending text message within the past 8 hours. If none exists, the hashindex of the record is inserted into the memory table and the pending text message is sent to the manual review system; if one exists, the pending text message is filtered and counted.
  • In addition, statistics are kept on the number of pending text messages received by the short message filtering device, the number filtered by each of the one or more filtering rules, the total number filtered, and the number sent to the manual review system, providing a data source for generating reports.
  • This embodiment provides a short message filtering apparatus.
  • FIG. 4 is a schematic diagram of functional modules of the short message filtering apparatus according to the embodiment.
  • the short message filtering device includes an obtaining module 60, a determining module 70, a filtering module 80, and a sending module 90, where:
  • the obtaining module 60 is configured to obtain a pending text message that does not meet the first auditing system auditing rule
  • the first auditing system is a spam filtering system, and the to-be-reviewed short message includes spam messages and non-spam messages, wherein a large part of the non-spam messages does not need to be manually reviewed, so it is necessary to judge according to preset filtering rules.
  • the determining module 70 is configured to determine whether the received pending text message meets a preset filtering rule
  • the filtering module 80 is configured to: when the received pending text message meets a preset filtering rule, filter the pending message;
  • the sending module 90 is configured to send the pending text message to the second auditing system when the received pending text message does not meet the preset filtering rule.
  • the first auditing system is a spam short message filtering system
  • the second auditing system is a manual auditing system
  • FIG. 5 is a schematic diagram of a refinement function module of a short message filtering device determining module according to an embodiment of the present invention.
  • the determining module 70 includes a first determining unit 701, a second determining unit 702, and a third determining unit 703, where:
  • the first determining unit 701 is configured to determine whether the character length of the received pending text message is less than a preset value.
  • The messages that remain after the spam filtering system has run may still include non-spam messages, for example pending text messages whose character length is less than or equal to the preset value (for example, 10 characters). Such a message is too short to carry much information, so it is generally not spam and does not need to be reviewed again by the manual review system.
  • In addition, the number of pending text messages judged by the first determining unit 701 under the character-length filtering rule, and the number filtered, are recorded.
  • the second determining unit 702 is configured to determine, when the character length of the received pending text message is not less than the preset value, whether the called number of the received pending text message matches the preset number segment library.
  • The preset number segment library stores the service code number segments of all SPs (Service Providers) or of the well-known ones, generally 4-digit or 5-digit segments. It is first determined whether the called number of the received pending text message matches a 4-digit segment; if it does, the message is judged to be non-spam and is filtered directly. If it does not, it is determined whether the called number matches a 5-digit segment; if it does, the message is filtered, and if not, it is then determined whether the calling number and the short message content of the received pending text message already exist in the memory library. In addition, the number of pending text messages judged by character length, and the number filtered, are recorded. In addition, the number of pending text messages judged by the second determining unit 702 under the number-segment filtering rule, and the number filtered, are recorded.
  • the third determining unit 703 is configured to determine, when the called number of the received pending text message does not match the preset number segment library, whether the calling number and the short message content of the received pending text message already exist in the memory library.
  • If the calling number of the pending text message already exists in the memory library and the content sent by that calling number is the same as before, it is determined that the pending text message has already been sent to the manual auditing system for review, so it is filtered directly and not sent to the manual review system again.
  • If the memory library contains neither the calling number of the pending text message nor the content sent by that calling number, the pending text message is sent to the manual auditing system.
  • In addition, the number of pending text messages judged by the third determining unit 703 under the filtering rule of whether the calling number and the short message content already exist in the memory library, and the number filtered, are recorded.
  • Optionally, the determining module may include only any one or any two of the above three determining units. This embodiment can be combined with Embodiment 4 and with Embodiment 6.
  • FIG. 6 is a schematic diagram of functional modules of a short message filtering apparatus according to an embodiment of the present invention.
  • This embodiment is similar to Embodiment 4. The difference is that the short message filtering device of this embodiment further includes:
  • the table building module 100, configured to create, in the memory library, a memory table that includes caller information and a storage time, where the caller information includes a calling number and the short message content sent by that calling number.
  • A memory table with two fields, hashindex and insert time, is created in the memory library, where hashindex = hash(calling number, short message content).
  • the information insertion module 110, configured to insert the caller information into the memory table when the caller information of the received pending text message does not exist in the memory library.
  • Using (calling number, short message content) as the key, the memory library is queried for a record related to the caller information of the pending text message within the past 8 hours. If none exists, the hashindex of the record is inserted into the memory table and the pending text message is sent to the manual review system; if one exists, the pending text message is filtered and counted.
  • Optionally, the short message filtering device further includes a statistics module configured to count the number of pending text messages received by the short message filtering device, the number filtered by each of the one or more filtering rules, the total number filtered, and the number sent to the manual review system, providing a data source for generating reports.
  • the short message filtering device is a short message intelligent arbitration system.
  • the short message intelligent arbitration system includes an intelligent arbitration module, an arbitration result statistics module, and an arbitration rule setting module, where:
  • the intelligent arbitration module is configured to parse the review-submission file (which contains the pending text messages) and to arbitrate on it according to the arbitration rules (for example, by arbitrating on each pending text message): a record that matches an arbitration rule is deleted directly; records that match no rule go into the generated result file, which is sent for manual review.
  • the arbitration result statistics module is configured to count the total number of messages processed by intelligent arbitration, the number filtered by each rule, and the number finally sent for manual review, providing a data source for reports.
  • the arbitration rule setting module is configured to set the arbitration rules. The arbitration rules supported by the system include one or more of the following: short message length, called number segment, and same caller with same content.
  • The arbitration rules can be extended and changed according to business needs.
  • FIG. 7 depicts the structure of the short message intelligent arbitration system, which consists of an intelligent arbitration module, an arbitration result statistics module, and an arbitration rule setting module. The internal modules communicate over TCP/IP or an inter-process communication mechanism, and the intelligent arbitration system and the spam monitoring system communicate through a file interface.
  • FIG. 8 depicts the general flow of the short message intelligent arbitration module.
  • the flow of the short message filtering method includes the following steps:
  • Step 201: the user sets the short message arbitration rules through the arbitration rule setting module. The arbitration rules may include one or more of the following: short message length, called number segment, and same caller with same content.
  • In this example, the arbitration rules include all three.
  • Step 202: after the arbitration rules are set, the arbitration rule setting module synchronizes them to the intelligent arbitration module.
  • Step 203: length rule filtering: compare the length field of the pending message with the configured length. If it is less than or equal to the configured length, the message is filtered and counted; if it is greater, matching continues with the next condition.
  • Step 204: called number segment rule filtering: determine whether the called number of the pending record matches the configured set of called number segments. 4-digit and 5-digit segments are supported; the 5-digit segments are matched first and, if none matches, the 4-digit segments. If a segment matches, the record is filtered and counted; if segments of neither length match, matching continues with the next condition.
  • Step 205: same calling number and same short message content rule filtering: determine whether the calling number and the short message content of the pending record exist in the memory library. If so, the record is filtered and counted; if not, it is submitted for review. The judgment process is shown in FIG. 9.
  • Step 206: the arbitration result statistics module stores the statistical results of steps 203 to 204 in the database as a report data source.
  • Step 207: the arbitrated records are written to a file and sent for manual review.
  • FIG. 9 details the "same caller, same content" arbitration process.
  • the flow includes the following steps:
  • Step 301: using (calling number, short message content) as the key, query whether a related record exists in the memory library within the past 8 hours; if so, go to step 304; if not, go to step 303.
  • The order of execution of step 302 and step 301 is not limited, as long as step 302 is performed before step 303.
  • Step 303: if no record exists, insert the hashindex of the record into the memory table and send the record for review.
  • Step 304: if a record exists, filter the record and count it.
  • the intelligent arbitration module parses the review-submission file from the spam short message monitoring system and filters each record according to the "short message length", "called number segment", and "same caller, same content" filtering rules.
  • the "short message length" and "called number segment" rules are set directly in the arbitration rule setting module.
  • For the "same caller, same content" rule, the arbitration rule setting module sets the time period within which the same caller and the same content are matched. A record that exists in the memory library within that period is filtered; one that does not is inserted into the memory library. After the filtering rules have been applied, the final review-submission file is generated and sent for manual review.
  • the arbitration result statistics module counts the total number of messages processed by intelligent arbitration, the number filtered by each rule, and the number finally sent for manual review, providing a data source for reports.
  • In the embodiments of the invention, a pending text message that does not meet the audit rule of the first auditing system is obtained, and it is determined whether the received pending text message meets a preset filtering rule; if so, the pending text message is filtered; if not, it is sent to the second auditing system. The first auditing system is a spam short message filtering system and the second auditing system is a manual auditing system, so pending text messages that have passed the spam review are filtered further, which effectively reduces the number of messages sent to the manual review system and the manual review workload.
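The arbitration flow of steps 201 to 207 can be sketched roughly as follows. This is only an illustration under stated assumptions: the names `ArbitrationRules` and `arbitrate`, the tuple layout of a record, and the counter keys are hypothetical and do not come from the patent, and the time window of the "same caller, same content" rule is left out for brevity. Any subset of the three rules can be enabled, matching the statement that the arbitration rules include one or more of them.

```python
from collections import Counter
from dataclasses import dataclass
from typing import FrozenSet, Iterable, List, Optional, Set, Tuple

# Hypothetical record layout: (calling number, called number, content).
Record = Tuple[str, str, str]


@dataclass(frozen=True)
class ArbitrationRules:
    """Illustrative rule set; a rule left at its default is disabled."""
    max_length: Optional[int] = None                  # "short message length" rule
    called_segments: Optional[FrozenSet[str]] = None  # 4- and 5-digit segments
    same_caller_same_content: bool = False            # time window omitted here


def arbitrate(records: Iterable[Record], rules: ArbitrationRules):
    """Return (records sent for manual review, statistics counter)."""
    stats: Counter = Counter()
    seen: Set[Tuple[str, str]] = set()
    for_review: List[Record] = []
    for caller, called, content in records:
        stats["total"] += 1
        # Step 203: length rule (less than or equal to the configured length).
        if rules.max_length is not None and len(content) <= rules.max_length:
            stats["length"] += 1
            continue
        # Step 204: called number segments, 5-digit first, then 4-digit.
        if rules.called_segments is not None and any(
            called[:n] in rules.called_segments for n in (5, 4)
        ):
            stats["called_segment"] += 1
            continue
        # Step 205: same caller and same content already seen.
        if rules.same_caller_same_content:
            if (caller, content) in seen:
                stats["same_caller_same_content"] += 1
                continue
            seen.add((caller, content))
        # Step 207: the record goes into the file for manual review.
        stats["sent_for_review"] += 1
        for_review.append((caller, called, content))
    return for_review, stats
```

The returned counter plays the role of the arbitration result statistics module: it holds the total processed, the amount filtered by each rule, and the amount sent for manual review.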

Landscapes

  • Engineering & Computer Science (AREA)
  • Signal Processing (AREA)
  • Information Transfer Between Computers (AREA)
  • Debugging And Monitoring (AREA)

Abstract

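A short message filtering method and a short message filtering device. The method includes: obtaining a pending text message that does not meet the audit rule of a first auditing system; determining whether the received pending text message meets a preset filtering rule; if so, filtering the pending text message; if not, sending the pending text message to a second auditing system.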

Description

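Short message filtering method and short message filtering device
Technical Field
This document relates to the field of mobile communication technologies, and in particular to a short message filtering method and a short message filtering device.
Background
As a convenient and fast form of contact, SMS has been welcomed by more and more users over the past few years and has developed by leaps and bounds. Like e-mail, however, mobile text messaging is increasingly plagued by large volumes of spam. At present, China has more than 600 million mobile phone users, who send an average of 800 million short messages per day, and each person receives about 8 spam messages per week. For mobile users, spam messages not only seriously disrupt daily life but also endanger personal privacy. For operators, the proliferation of spam messages wastes a great deal of investment in infrastructure such as SMS centers and increases the risk of malicious attacks on the network. For this reason, the relevant authorities are stepping up the drafting of laws and regulations, and operators, who pay more and more attention to spam, have set up spam filtering systems that use technical means to filter spam messages, striving to create a sustained, orderly, and healthy environment for the development of the SMS service.
Besides filtering some messages directly according to its rules, a spam filtering system must send messages that may have been misjudged, such as suspicious messages, for manual review. Although the spam filtering system has already filtered some messages directly, the volume that still has to be sent for manual review remains very large (more than 200,000 per day), which puts great pressure on the manual review work.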
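Summary of the Invention
An embodiment of the invention provides a short message filtering method and a short message filtering device that can solve the technical problem of the excessive workload of manually reviewing spam messages.
A short message filtering method provided by an embodiment of the invention includes the following steps:
obtaining a pending text message that does not meet the audit rule of a first auditing system;
determining whether the received pending text message meets a preset filtering rule;
if so, filtering the pending text message;
if not, sending the pending text message to a second auditing system.
Optionally, the step of determining whether the received pending text message meets a preset filtering rule includes:
determining whether the character length of the received pending text message is less than a preset value.
Optionally, before the pending text message is sent to the second auditing system, the method further includes:
determining whether the called number of the received pending text message matches a preset number segment library;
if so, filtering the pending text message;
if not, performing the step of sending the pending text message to the second auditing system.
Optionally, before the step of sending the pending text message to the second auditing system is performed, the method further includes:
determining whether the calling number and the short message content of the received pending text message already exist in a memory library;
if so, filtering the pending text message;
if not, performing the step of sending the pending text message to the second auditing system.
Optionally, before the step of determining whether the calling number and the short message content of the received pending text message already exist in the memory library, the method further includes:
creating, in the memory library, a memory table that includes caller information and a storage time, where the caller information includes a calling number and the short message content sent by that calling number;
if the caller information of the received pending text message does not exist in the memory library, inserting the caller information into the memory table.
An embodiment of the invention further provides a short message filtering device, which includes:
an obtaining module, configured to obtain a pending text message that does not meet the audit rule of a first auditing system;
a determining module, configured to determine whether the received pending text message meets a preset filtering rule;
a filtering module, configured to filter the pending text message when the received pending text message meets the preset filtering rule; and
a sending module, configured to send the pending text message to a second auditing system when the received pending text message does not meet the preset filtering rule.
Optionally, the determining module includes:
a first determining unit, configured to determine whether the character length of the received pending text message is less than a preset value.
Optionally, the determining module further includes:
a second determining unit, configured to determine, when the character length of the received pending text message is not less than the preset value, whether the called number of the received pending text message matches a preset number segment library.
Optionally, the determining module further includes:
a third determining unit, configured to determine, when the called number of the received pending text message does not match the preset number segment library, whether the calling number and the short message content of the received pending text message already exist in the memory library.
Optionally, the short message filtering device further includes:
a table building module, configured to create, in the memory library, a memory table that includes caller information and a storage time, where the caller information includes a calling number and the short message content sent by that calling number;
an information insertion module, configured to insert the caller information into the memory table when the caller information of the received pending text message does not exist in the memory library.
An embodiment of the invention further provides a computer readable storage medium storing program instructions which, when executed, implement the above method.
In the embodiments of the invention, a pending text message that does not meet the audit rule of the first auditing system is obtained, and it is determined whether the received pending text message meets a preset filtering rule; if so, the pending text message is filtered; if not, it is sent to the second auditing system. The first auditing system is a spam short message filtering system and the second auditing system is a manual auditing system, so pending text messages that have passed the spam review are filtered further, which effectively reduces the number of messages sent to the manual review system and the manual review workload.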
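Brief Description of the Drawings
FIG. 1 is a schematic flowchart of the short message filtering method according to Embodiment 1 of the present invention;
FIG. 2 is a schematic flowchart of the short message filtering method according to Embodiment 2 of the present invention;
FIG. 3 is a schematic flowchart of the short message filtering method according to Embodiment 3 of the present invention;
FIG. 4 is a schematic diagram of the functional modules of the short message filtering device according to Embodiment 4 of the present invention;
FIG. 5 is a schematic diagram of the refined functional modules of the determining module of the short message filtering device according to Embodiment 5 of the present invention;
FIG. 6 is a schematic diagram of the functional modules of the short message filtering device according to Embodiment 6 of the present invention;
FIG. 7 is a structural diagram of the short message intelligent arbitration system in the short message filtering device of Application Example 1 of the present invention;
FIG. 8 is a general flowchart of short message intelligent arbitration in the short message filtering method of Application Example 2 of the present invention;
FIG. 9 is a schematic diagram of the "same caller, same content" arbitration process in the short message filtering method of Application Example 2 of the present invention.
Embodiments of the Invention
It should be noted that, where no conflict arises, the embodiments herein and the features in the embodiments may be combined with one another arbitrarily.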
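Embodiment 1
This embodiment provides a short message filtering method.
Referring to FIG. 1, FIG. 1 is a schematic flowchart of the short message filtering method of this embodiment.
In this embodiment, the short message filtering method includes:
Step S10: obtaining a pending text message that does not meet the audit rule of the first auditing system;
The first auditing system is a spam short message filtering system. The pending text messages include both spam and non-spam messages, and a large part of the non-spam messages need no manual review, so they are judged against preset filtering rules.
Step S20: determining whether the received pending text message meets a preset filtering rule;
Step S30: if so, filtering the pending text message;
Step S40: if not, sending the pending text message to the second auditing system.
In this embodiment, a pending text message that does not meet the audit rule of the first auditing system is obtained, and it is determined whether the received pending text message meets a preset filtering rule; if so, the pending text message is filtered; if not, it is sent to the second auditing system. The first auditing system is a spam short message filtering system and the second auditing system is a manual auditing system, so pending text messages that have passed the spam review are filtered further, which effectively reduces the number of messages sent to the manual review system and the manual review workload.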
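Steps S10 to S40 amount to a single routing decision per message. The following is a minimal sketch, assuming messages are plain strings and the preset filtering rule is passed in as a predicate; the function name `route_pending` and the example rule are hypothetical and are not taken from the patent.

```python
from typing import Callable, Iterable, List, Tuple


def route_pending(
    pending: Iterable[str],
    meets_filter_rule: Callable[[str], bool],
) -> Tuple[List[str], List[str]]:
    """Split pending messages into (filtered, sent to manual review)."""
    filtered: List[str] = []
    to_manual: List[str] = []
    for message in pending:             # S10: messages the first system left pending
        if meets_filter_rule(message):  # S20: check the preset filtering rule
            filtered.append(message)    # S30: filter; no manual review needed
        else:
            to_manual.append(message)   # S40: forward to the second (manual) system
    return filtered, to_manual


# Hypothetical rule for illustration: very short messages need no review.
filtered, to_manual = route_pending(
    ["ok", "Win a prize now, click the link"],
    lambda m: len(m) <= 10,
)
```

Keeping the rule as a parameter reflects that Embodiment 1 leaves the filtering rule open; the later embodiments supply concrete rules.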
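Embodiment 2
Referring to FIG. 2, FIG. 2 is a schematic flowchart of the short message filtering method of this embodiment.
In this embodiment, the short message filtering method includes:
Step S10: obtaining a pending text message that does not meet the audit rule of the first auditing system;
The first auditing system is a spam short message filtering system. The pending text messages include both spam and non-spam messages, and a large part of the non-spam messages need no manual review, so they are judged against preset filtering rules.
Step S201: determining whether the character length of the received pending text message is less than a preset value;
The messages that remain after the spam filtering system has run may still include non-spam messages, for example pending text messages whose character length is less than or equal to the preset value (for example, 10 characters). Such a message is too short to carry much information, so it is generally not spam and does not need to be reviewed again by the manual review system. In addition, the number of pending text messages judged in step S201 under the character-length filtering rule, and the number filtered, are recorded.
If so, step S30 is performed to filter the pending text message;
If not, step S202 is performed to determine whether the called number of the received pending text message matches the preset number segment library;
The preset number segment library stores the service code number segments of all SPs (Service Providers) or of the well-known ones, generally 4-digit or 5-digit segments. It is first determined whether the called number of the received pending text message matches a 4-digit segment; if it does, the message is judged to be non-spam and is filtered directly. If it does not, it is determined whether the called number matches a 5-digit segment; if it does, the message is filtered, and if not, the method goes on to determine whether the calling number and the short message content of the received pending text message already exist in the memory library. In addition, the number of pending text messages judged by character length, and the number filtered, are recorded. In addition, the number of pending text messages judged in step S202 under the number-segment filtering rule, and the number filtered, are recorded.
If so, step S30 is performed to filter the pending text message;
If not, step S203 is performed to determine whether the calling number and the short message content of the received pending text message already exist in the memory library;
If the calling number of the pending text message already exists in the memory library and the content sent by that calling number is the same as before, it is determined that the pending text message has already been sent to the manual auditing system for review, so it is filtered directly and not sent to the manual review system again. If the memory library contains neither the calling number of the pending text message nor the content sent by that calling number, the pending text message is sent to the manual auditing system. In addition, the number of pending text messages judged in step S203 under the filtering rule of whether the calling number and the short message content already exist in the memory library, and the number filtered, are recorded.
If so, step S30 is performed to filter the pending text message;
If not, step S40 is performed to send the pending text message to the second auditing system.
In the second embodiment, the pending text message passes through three filtering rules that filter out non-spam messages, which reduces the number of messages sent to the manual review system and the manual review workload.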
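The three-rule cascade of steps S201 to S203, together with the per-rule counts the text asks to record, might look roughly like the sketch below. The class and field names (`PendingSms`, `CascadeFilter`, `stats`) are hypothetical, the length comparison uses "less than or equal" as in the explanatory text, and an in-process set stands in for the memory library; none of this is prescribed by the patent.

```python
from dataclasses import dataclass, field
from typing import Dict, Set, Tuple


@dataclass
class PendingSms:
    caller: str   # calling number
    called: str   # called number
    content: str  # short message content


def _empty_stats() -> Dict[str, Dict[str, int]]:
    return {name: {"judged": 0, "filtered": 0}
            for name in ("length", "segment", "repeat")}


@dataclass
class CascadeFilter:
    max_short_length: int = 10                       # example preset value from the text
    segments: Set[str] = field(default_factory=set)  # 4- and 5-digit SP code segments
    seen: Set[Tuple[str, str]] = field(default_factory=set)  # stand-in memory library
    stats: Dict[str, Dict[str, int]] = field(default_factory=_empty_stats)

    def _record(self, rule: str, hit: bool) -> bool:
        # Count every message a rule judged and every message it filtered.
        self.stats[rule]["judged"] += 1
        self.stats[rule]["filtered"] += int(hit)
        return hit

    def should_filter(self, sms: PendingSms) -> bool:
        """True means filter (S30); False means send to manual review (S40)."""
        # S201: short messages are assumed to carry too little to be spam.
        if self._record("length", len(sms.content) <= self.max_short_length):
            return True
        # S202: match the called number against 4-digit, then 5-digit segments.
        hit = any(len(sms.called) >= n and sms.called[:n] in self.segments
                  for n in (4, 5))
        if self._record("segment", hit):
            return True
        # S203: same caller with the same content was already sent for review.
        key = (sms.caller, sms.content)
        if self._record("repeat", key in self.seen):
            return True
        self.seen.add(key)
        return False
```

Each rule only sees messages the earlier rules let through, so the "judged" counts shrink down the cascade, which is what makes the per-rule statistics useful for a report.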
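Embodiment 3
Referring to Fig. 3, Fig. 3 is a schematic flowchart of the short message filtering method of this embodiment.
This embodiment is similar to Embodiment 2, except that in this embodiment the following steps are also performed before step S203:
Step S501: create, in the in-memory database, a memory table that includes caller information and an insertion time, where the caller information includes the calling number and the message content sent by that calling number;
A memory table with two fields, hashindex (hash index) and insertion time, is created in the in-memory database, where hashindex = hash(calling number, message content).
Step S502: if the caller information of the received pending short message does not exist in the in-memory database, insert the caller information into the memory table.
Using (calling number, message content) as the key, the in-memory database is queried for a record within the last 8 hours that corresponds to the caller information of the pending short message. If no such record exists, the hashindex of the record is inserted into the memory table and the pending short message is sent to the manual review system; if one exists, the pending short message is filtered out and counted.
In addition, statistics are collected on the number of pending short messages received by the short message filtering device, the number of pending short messages filtered out by one or more filtering rules, the total number of pending short messages filtered out, and the number of pending short messages sent to the manual review system, to serve as a data source for generating reports.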
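A minimal Python sketch of the memory table and the 8-hour lookup follows. The class name `MemoryTable`, the choice of SHA-256 as the hash, and the use of a dictionary as the table are all assumptions; the application only specifies the two fields and hashindex = hash(calling number, message content). How an expired record is treated is not stated either; here it is simply overwritten and the message is sent for review again.

```python
import hashlib
import time
from typing import Dict, Optional

WINDOW_SECONDS = 8 * 60 * 60  # the 8-hour lookback mentioned in the text


class MemoryTable:
    """Two logical fields per row: hashindex and insertion time."""

    def __init__(self) -> None:
        self._rows: Dict[str, float] = {}  # hashindex -> insertion time (epoch seconds)

    @staticmethod
    def hashindex(caller: str, content: str) -> str:
        # SHA-256 is an assumption; any stable hash of the pair would do.
        data = f"{caller}\x00{content}".encode("utf-8")
        return hashlib.sha256(data).hexdigest()

    def seen_recently(self, caller: str, content: str,
                      now: Optional[float] = None) -> bool:
        """True: filter the message. False: record it and send it for review."""
        now = time.time() if now is None else now
        key = self.hashindex(caller, content)
        inserted = self._rows.get(key)
        if inserted is not None and now - inserted <= WINDOW_SECONDS:
            return True
        self._rows[key] = now  # insert (or refresh an expired row)
        return False
```

Storing only the hash keeps each row small, at the cost of a (very unlikely) collision causing a message to be filtered by mistake.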
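Embodiment 4
This embodiment provides a short message filtering device.
Referring to Fig. 4, Fig. 4 is a schematic diagram of the functional modules of the short message filtering device of this embodiment.
In this embodiment, the short message filtering device includes an obtaining module 60, a judging module 70, a filtering module 80, and a sending module 90, where:
The obtaining module 60 is configured to obtain a pending short message that did not satisfy the review rules of a first review system;
The first review system is a spam short message filtering system. The pending short messages include both spam and non-spam messages, and a large share of the non-spam messages need no manual review, so each pending short message is judged against preset filtering rules.
The judging module 70 is configured to judge whether the received pending short message meets a preset filtering rule;
The filtering module 80 is configured to filter out the pending short message when the received pending short message meets the preset filtering rule;
The sending module 90 is configured to send the pending short message to a second review system when the received pending short message does not meet the preset filtering rule.
In this embodiment, a pending short message that did not satisfy the review rules of the first review system is obtained, and it is judged whether the received pending short message meets a preset filtering rule. If so, the pending short message is filtered out; if not, it is sent to the second review system. The first review system is a spam short message filtering system and the second review system is a manual review system, so pending short messages that have passed through spam review are filtered further, which effectively reduces the number of short messages sent to the manual review system and lowers the manual review workload.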
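Embodiment 5
Referring to Fig. 5, Fig. 5 is a schematic diagram of the detailed functional modules of the judging module of the short message filtering device according to this embodiment of the present invention.
In this embodiment, the judging module 70 includes a first judging unit 701, a second judging unit 702, and a third judging unit 703, where:
The first judging unit 701 is configured to judge whether the character length of the received pending short message is less than a preset value;
The short messages that remain after the spam short message filtering system may still include messages that are not spam, for example pending short messages whose character length is less than or equal to a preset value (for example, 10 characters). Such a message is too short to carry much information, so it is generally not spam and does not need to be reviewed again by the manual review system. In addition, the number of pending short messages judged by the first judging unit 701 under the character-length rule and the number of short messages filtered out by it are recorded.
The second judging unit 702 is configured to judge, when the character length of the received pending short message is not less than the preset value, whether the called number of the received pending short message matches a preset number segment library;
The preset number segment library stores the service code segments of all SPs (Service Providers) or of the well-known ones, generally 4-digit or 5-digit segments. The called number of the received pending short message is first checked against the 4-digit segments; if it matches, the pending short message is judged to be non-spam and is filtered out directly. If it does not match, the called number is checked against the 5-digit segments; if it matches, the message is filtered out, and if it does not, the device goes on to judge whether the calling number and message content of the received pending short message already exist in the in-memory database. In addition, the number of pending short messages judged by the second judging unit 702 under the number-segment rule and the number of short messages filtered out by it are recorded.
The third judging unit 703 is configured to judge, when the called number of the received pending short message does not match the preset number segment library, whether the calling number and message content of the received pending short message already exist in the in-memory database.
When the calling number of the pending short message already exists in the in-memory database and the content sent by that calling number is the same as before, the pending short message is judged to have already been sent to the manual review system, so it is filtered out directly and is not sent to the manual review system again. When the in-memory database does not contain the calling number of the pending short message or the content sent by that calling number, the pending short message is sent to the manual review system. In addition, the number of pending short messages judged by the third judging unit 703 under the same-caller, same-content rule and the number of short messages filtered out by it are recorded.
Optionally, the judging module may include only one or only two of the three judging units above. This embodiment may be combined with Embodiment 4 and with Embodiment 6.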
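The two-stage lookup performed by the second judging unit can be sketched in Python as below. The class name `SegmentLibrary` and the sample segments are hypothetical, and treating "matches a 4-digit segment" as "the first four digits of the called number are in the library" is an assumption, since the application does not define the matching operation.

```python
from typing import FrozenSet


class SegmentLibrary:
    """Preset SP service-code segments, split by length."""

    def __init__(self, four_digit: FrozenSet[str], five_digit: FrozenSet[str]) -> None:
        self.four_digit = four_digit
        self.five_digit = five_digit

    def matches(self, called: str) -> bool:
        # Check the 4-digit segments first, as this embodiment describes.
        if called[:4] in self.four_digit:
            return True
        # Fall back to the 5-digit segments only when no 4-digit segment matched.
        return called[:5] in self.five_digit
```

With sets, each check is a constant-time lookup, so the order of the two checks affects only which segment length is credited with the match, not the final result.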
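Embodiment 6
Referring to Fig. 6, Fig. 6 is a schematic diagram of the functional modules of the short message filtering device of this embodiment.
This embodiment is similar to Embodiment 4, except that the short message filtering device of this embodiment further includes:
A table creation module 100, configured to create, in the in-memory database, a memory table that includes caller information and an insertion time, where the caller information includes the calling number and the message content sent by that calling number;
A memory table with two fields, hashindex and insertion time, is created in the in-memory database, where hashindex = hash(calling number, message content).
An information insertion module 110, configured to insert the caller information into the memory table when the caller information of the received pending short message does not exist in the in-memory database.
Using (calling number, message content) as the key, the in-memory database is queried for a record within the last 8 hours that corresponds to the caller information of the pending short message. If no such record exists, the hashindex of the record is inserted into the memory table and the pending short message is sent to the manual review system; if one exists, the pending short message is filtered out and counted.
In addition, optionally, the short message filtering device further includes a statistics module, configured to collect statistics on the number of pending short messages received by the short message filtering device, the number of pending short messages filtered out by one or more filtering rules, the total number of pending short messages filtered out, and the number of pending short messages sent to the manual review system, to serve as a data source for generating reports.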
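Application Example 1
In this example, the short message filtering device is a short message intelligent arbitration system. Referring to Fig. 7, the short message intelligent arbitration system includes an intelligent arbitration module, an arbitration result statistics module, and an arbitration rule setting module, where:
The intelligent arbitration module is configured to parse a submission file (the submission file contains pending short messages) and to arbitrate the submission file intelligently according to the arbitration rules (for example, to arbitrate the pending short messages intelligently): records that match an arbitration rule are deleted directly; for records that do not match, a result file is generated and sent for manual review.
The arbitration result statistics module is configured to count the total number of messages processed by intelligent arbitration, the number filtered by each rule, and the number finally sent for manual review, to serve as a data source for reports.
The arbitration rule setting module is configured to set the arbitration rules. The arbitration rules supported by the system include one or more of the following: short message length, called number segment, and same caller with same content. The arbitration rules can be extended and changed according to business needs.
As shown in Fig. 7, which depicts the structure of the short message intelligent arbitration system, the system consists of the intelligent arbitration module, the arbitration result statistics module, and the arbitration rule setting module. All internal modules communicate with one another over TCP/IP or through an inter-process communication mechanism, and the intelligent arbitration system and the spam short message monitoring system communicate through a file interface.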
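Application Example 2
This example describes a short message filtering method that reduces the workload of manually reviewing spam short messages. As shown in Fig. 8, which depicts the overall flow of the short message intelligent arbitration module, the short message filtering method includes the following steps:
Step 201: the user sets the short message arbitration rules through the arbitration rule setting module. The arbitration rules may include one or more of the following: short message length, called number segment, and same caller with same content. In this example, the arbitration rules include all three;
Step 202: after the arbitration rules are set, the arbitration rule setting module synchronizes them to the intelligent arbitration module;
Step 203, length rule filtering: judge whether the value of the message-length field of the pending record is less than the configured length; if it is less than or equal to the configured length, filter out the record and count it; if it is greater, continue to the next condition;
Step 204, called number segment range filtering: judge whether the called number of the pending record matches the set of called number segments. Both 4-digit and 5-digit segments are supported. The 5-digit segments are matched first; if none matches, the 4-digit segments are matched. If a segment matches, filter out the record and count it; if no segment of either length matches, continue to the next condition;
Step 205, same calling number and same message content rule filtering: judge whether the calling number and message content of the pending record exist in the in-memory database; if so, filter out the record and count it; if not, send it for review. See Fig. 9 for the judging flow;
Step 206: the arbitration result statistics module stores the statistics from steps 203 to 204 in the database as the data source for reports;
Step 207: the records that have gone through arbitration are written to a file and sent for manual review.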
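The counting that feeds the report could be organized as in the following Python sketch. The class `ArbitrationStats`, its field names, and the rule labels are hypothetical; the application states only that the total processed, the amount filtered by each rule, and the amount finally sent for manual review are counted.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Tuple

# Hypothetical labels for the three arbitration rules named in the text.
RULE_NAMES = ("length", "called_segment", "same_caller_same_content")


@dataclass
class ArbitrationStats:
    total: int = 0
    sent_for_review: int = 0
    filtered_by_rule: Dict[str, int] = field(
        default_factory=lambda: {name: 0 for name in RULE_NAMES})

    def record(self, matched_rule: Optional[str]) -> None:
        """Count one arbitrated record; None means no rule matched."""
        self.total += 1
        if matched_rule is None:
            self.sent_for_review += 1
        else:
            self.filtered_by_rule[matched_rule] += 1  # KeyError on an unknown rule

    def report_rows(self) -> List[Tuple[str, int]]:
        """Flatten the counters into rows a report generator could store."""
        rows = [("total", self.total)]
        rows += [(f"filtered:{name}", self.filtered_by_rule[name]) for name in RULE_NAMES]
        rows.append(("sent_for_review", self.sent_for_review))
        return rows
```

A useful sanity check on such counters is that the per-rule filtered counts plus `sent_for_review` always add up to `total`.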
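Fig. 9 describes the same-caller, same-content arbitration process in detail. The flow includes the following steps:
Step 301: using (calling number, message content) as the key, query whether a related record exists in the in-memory database within the last 8 hours; if one exists, go to step 304; if not, go to step 303;
Step 302: create a memory table in the in-memory database with two fields, hashindex and insertion time, where hashindex = hash(calling number, message content);
The order in which steps 302 and 301 are executed is not limited, provided that step 302 completes before step 303 is executed.
Step 303: if no record exists, insert the hashindex of the record into the memory table and send the record for review;
Step 304: if a record exists, filter out the record and count it.
In Application Examples 1 and 2, the intelligent arbitration module parses the submission file from the spam short message monitoring system and filters each record according to the "short message length", "called number segment", and "same caller, same content" filtering rules. The "short message length" and "called number segment" rules are set directly in the arbitration rule setting module. For "same caller, same content", the arbitration rule setting module sets the time window for matching; if the record already exists in the in-memory database within that window, it is filtered out, and if not, it is inserted into the in-memory database. After the filtering rules have been applied, the final submission file is generated and sent for manual review. The arbitration result statistics module counts the total number of messages processed by intelligent arbitration, the number filtered by each rule, and the number finally sent for manual review, to serve as a data source for reports.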
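Those of ordinary skill in the art will understand that all or some of the steps of the above method may be carried out by a program instructing the relevant hardware, and that the program may be stored in a computer-readable storage medium such as a read-only memory, a magnetic disk, or an optical disc. Optionally, all or some of the steps of the above embodiments may also be implemented with one or more integrated circuits. Accordingly, each module/unit in the above embodiments may be implemented in hardware or as a software functional module. The embodiments of the present invention are not limited to any particular combination of hardware and software.
Industrial Applicability
In the embodiments of the present invention, a pending short message that did not satisfy the review rules of the first review system is obtained, and it is judged whether the received pending short message meets a preset filtering rule. If so, the pending short message is filtered out; if not, it is sent to the second review system. The first review system is a spam short message filtering system and the second review system is a manual review system, so pending short messages that have passed through spam review are filtered further, which effectively reduces the number of short messages sent to the manual review system and lowers the manual review workload.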

Claims (11)

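  1. A short message filtering method, comprising the following steps:
    obtaining a pending short message that did not satisfy the review rules of a first review system;
    judging whether the received pending short message meets a preset filtering rule;
    if so, filtering out the pending short message;
    if not, sending the pending short message to a second review system.
  2. The short message filtering method of claim 1, wherein the step of judging whether the received pending short message meets a preset filtering rule comprises:
    judging whether the character length of the received pending short message is less than a preset value.
  3. The short message filtering method of claim 2, wherein before the sending of the pending short message to the second review system, the method further comprises:
    judging whether the called number of the received pending short message matches a preset number segment library;
    if so, filtering out the pending short message;
    if not, performing the step of sending the pending short message to the second review system.
  4. The short message filtering method of claim 3, wherein before the step of sending the pending short message to the second review system is performed, the method further comprises:
    judging whether the calling number and message content of the received pending short message already exist in an in-memory database;
    if so, filtering out the pending short message;
    if not, performing the step of sending the pending short message to the second review system.
  5. The short message filtering method of claim 4, wherein before the step of judging whether the calling number and message content of the received pending short message already exist in the in-memory database, the method further comprises:
    creating, in the in-memory database, a memory table that includes caller information and an insertion time, the caller information including the calling number and the message content sent by that calling number;
    if the caller information of the received pending short message does not exist in the in-memory database, inserting the caller information into the memory table.
  6. A short message filtering device, comprising:
    an obtaining module, configured to obtain a pending short message that did not satisfy the review rules of a first review system;
    a judging module, configured to judge whether the received pending short message meets a preset filtering rule;
    a filtering module, configured to filter out the pending short message when the received pending short message meets the preset filtering rule; and
    a sending module, configured to send the pending short message to a second review system when the received pending short message does not meet the preset filtering rule.
  7. The short message filtering device of claim 6, wherein the judging module comprises:
    a first judging unit, configured to judge whether the character length of the received pending short message is less than a preset value.
  8. The short message filtering device of claim 7, wherein the judging module further comprises:
    a second judging unit, configured to judge, when the character length of the received pending short message is not less than the preset value, whether the called number of the received pending short message matches a preset number segment library.
  9. The short message filtering device of claim 8, wherein the judging module further comprises:
    a third judging unit, configured to judge, when the called number of the received pending short message does not match the preset number segment library, whether the calling number and message content of the received pending short message already exist in an in-memory database.
  10. The short message filtering device of claim 9, wherein the short message filtering device further comprises:
    a table creation module, configured to create, in the in-memory database, a memory table that includes caller information and an insertion time, the caller information including the calling number and the message content sent by that calling number;
    an information insertion module, configured to insert the caller information into the memory table when the caller information of the received pending short message does not exist in the in-memory database.
  11. A computer-readable storage medium storing program instructions which, when executed, implement the method of any one of claims 1 to 5.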
PCT/CN2015/080344 2014-10-20 2015-05-29 短信过滤方法及短信过滤装置 WO2016062090A1 (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201410558422.0A CN105592429A (zh) 2014-10-20 2014-10-20 短信过滤方法及短信过滤装置
CN201410558422.0 2014-10-20

Publications (1)

Publication Number Publication Date
WO2016062090A1 true WO2016062090A1 (zh) 2016-04-28

Family

ID=55760224

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2015/080344 WO2016062090A1 (zh) 2014-10-20 2015-05-29 短信过滤方法及短信过滤装置

Country Status (2)

Country Link
CN (1) CN105592429A (zh)
WO (1) WO2016062090A1 (zh)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113568904A (zh) * 2021-06-28 2021-10-29 平安普惠企业管理有限公司 作品送审方法、装置、电子设备及可读存储介质

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109525951A (zh) * 2018-12-03 2019-03-26 中国联合网络通信集团有限公司 垃圾短信处理方法、装置及设备
CN110475216A (zh) * 2019-08-21 2019-11-19 何翠媚 一种优化信息处理的方法

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1628448A1 (en) * 2004-08-17 2006-02-22 Lucent Technologies Inc. Spam filtering for mobile communication devices
CN1801854A (zh) * 2004-12-21 2006-07-12 朗迅科技公司 不想要的消息(垃圾消息)的检测
CN102096703A (zh) * 2010-12-29 2011-06-15 北京新媒传信科技有限公司 短消息的过滤方法和设备

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101232635B (zh) * 2007-01-25 2011-03-09 上海粱江通信系统股份有限公司 一种基于信令处理技术的短信净化系统
CN101257671B (zh) * 2007-07-06 2010-12-08 浙江大学 基于内容的大规模垃圾短信实时过滤方法
CN101150756B (zh) * 2007-11-08 2010-05-19 电子科技大学 一种垃圾短信过滤方法
CN101335920B (zh) * 2008-07-15 2011-04-13 中国联合网络通信集团有限公司 基于主叫号码位置和发送内容的垃圾短消息识别系统及方法
CN103067896B (zh) * 2013-01-17 2015-08-19 中国联合网络通信集团有限公司 垃圾短信过滤方法及装置

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113568904A (zh) * 2021-06-28 2021-10-29 平安普惠企业管理有限公司 作品送审方法、装置、电子设备及可读存储介质
CN113568904B (zh) * 2021-06-28 2023-10-20 南京地平线网络科技有限公司 作品送审方法、装置、电子设备及可读存储介质

Also Published As

Publication number Publication date
CN105592429A (zh) 2016-05-18

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 15851969

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 15851969

Country of ref document: EP

Kind code of ref document: A1