CN102905236A - Method, device and system for monitoring spam short messages - Google Patents

Info

Publication number
CN102905236A
Authority
CN
China
Prior art keywords
short message
content
message
calling number
called number
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN2011102120339A
Other languages
Chinese (zh)
Other versions
CN102905236B (en)
Inventor
疏星
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huawei Technologies Co Ltd
Original Assignee
Huawei Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huawei Technologies Co Ltd
Priority to CN201110212033.9A
Publication of CN102905236A
Application granted
Publication of CN102905236B
Legal status: Active
Anticipated expiration

Abstract

The invention discloses a method, a device and a system for monitoring spam short messages. The method includes acquiring a short message; determining a short message set corresponding to contents of the short message according to the contents of the short message, and adding a calling number and a called number of the short message into the short message set; and judging whether short messages in the message set are spam messages or not according to propagation tracks of the short messages in the short message set when the transmission quantity of the short messages in the short message set is larger than or equal to a set first threshold value. The invention further discloses the device and the system for monitoring the spam short messages. According to the technical scheme, the method, the device and the system have the advantage that the technical problem that short message transmitters evade system monitoring by reducing short message transmission quantities of single numbers can be solved.

Description

Method, device and system for monitoring spam short messages
Technical Field
The present invention relates to spam short message identification technology, and in particular to a method, device and system for monitoring spam short messages.
Background
Short messages are popular with consumers because they are cheap, simple to use and convenient for communication. Because a short message can be sent to arbitrarily chosen recipients at low cost, the service is also very convenient for senders of spam short messages. Spam short messages carrying reactionary content, advertising, fraud and the like are increasingly common and seriously disturb consumers' daily lives.
Spam short messages are usually blocked in one of two ways: blocking on the mobile phone side and interception on the network side. Phone-side blocking is limited by the phone's computing power and can only perform simple blacklist filtering and keyword filtering. Network-side interception relies on powerful back-end processing to perform complex analysis, identification and handling of spam short messages, and has become the main way of blocking them.
Network-side identification and interception of spam short messages is currently implemented mainly by the following means:
Blacklist filtering: the system maintains a list of blacklisted users and directly intercepts short messages sent by users on the blacklist.
Keyword filtering: the system maintains a keyword library and intercepts a short message when it contains a sensitive word from the library; "keyword + frequency" interception can be regarded as a subset of this approach.
Sending-behavior monitoring: monitoring models centered on the calling number are established to check, along dimensions such as sending volume, called-number patterns, short message content and time period, whether a single calling number shows suspicious behavior within a unit of time; further measures are then taken, such as stopping delivery of subsequent short messages or adding the number to the blacklist.
In the course of implementing the present invention, the inventor found at least the following problems in the prior art. The identification techniques used by conventional spam sending-behavior analysis models analyze and identify the content and sending behavior of a single calling number. If the number of short messages sent by a number within a unit of time is below the threshold set by the existing behavior analysis model, the system cannot identify it. A spam sender can probe the thresholds of the various behavior analysis models by repeated trial and then keep the sending volume of each number per time slice below the threshold to evade monitoring. In addition, if the spam sender uses a batch of Subscriber Identity Module (SIM) cards at the same time and sends the same spam short message from them in rotation, the number of short messages sent by any single number per time slice can be kept very small, far below the threshold preset by the system, so existing monitoring systems cannot effectively identify and intercept them.
Summary of the Invention
Embodiments of the present invention provide a method, device and system for monitoring spam short messages, which can solve the technical problem that a short message sender evades system monitoring by reducing the sending volume of each single number.
One aspect of the present invention provides a method for monitoring spam short messages, the method comprising:
Acquiring a short message;
Determining, according to the content of the short message, the short message set corresponding to that content, and adding the calling number and the called number of the short message to the short message set;
When the number of short messages sent in the short message set is greater than or equal to a set first threshold, determining, according to the propagation track of the short messages in the short message set, whether the short messages in the set are spam messages.
Another aspect of the present invention provides a device for monitoring spam short messages, the device comprising:
A message collection module, configured to acquire a short message;
A data preprocessing module, configured to determine, according to the content of the short message, the short message set corresponding to that content, and to add the calling number and the called number of the short message to the short message set;
A message identification module, configured to determine, when the number of short messages sent in the short message set is greater than or equal to a set first threshold, whether the short messages in the set are spam messages according to the propagation track of the short messages in the set.
The present invention further provides a system for monitoring spam short messages, comprising:
A short message data source device, configured to provide short message data to the spam short message monitoring device for spam identification, and to receive the identification result; and
The spam short message monitoring device described above, configured to: acquire a short message; determine, according to the content of the short message, the short message set corresponding to that content, and add the calling number and the called number of the short message to the short message set; and, when the number of short messages sent in the short message set is greater than or equal to a set first threshold, determine, according to the propagation track of the short messages in the set, whether the short messages in the set are spam messages.
As can be seen from the technical solutions provided by the above embodiments, whether the short messages in a message set are spam messages is determined according to the propagation track of the short messages in the set. This not only solves the problem that a sender evades system monitoring by reducing the sending volume of each single number, but also, by analyzing the association between the sending numbers and receiving numbers of short messages with identical or similar content, makes it possible to identify a batch of spam-sending numbers in one pass.
Brief Description of the Drawings
To describe the technical solutions of the embodiments of the present invention more clearly, the drawings used in the description of the embodiments are briefly introduced below. The drawings described below show only some embodiments of the present invention; a person of ordinary skill in the art can derive other drawings from them without creative effort.
Fig. 1 is a flowchart of a method for monitoring spam short messages according to an embodiment of the present invention;
Fig. 2 is a propagation track diagram of a popular short message according to an embodiment of the present invention;
Fig. 3 is a propagation track diagram of a spam short message according to an embodiment of the present invention;
Fig. 4 is a schematic structural diagram of a spam short message monitoring device according to an embodiment of the present invention;
Fig. 5 is a schematic structural diagram of a spam short message monitoring system according to an embodiment of the present invention.
Detailed Description of the Embodiments
The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the drawings. The described embodiments are only some, not all, of the embodiments of the present invention. All other embodiments obtained by a person of ordinary skill in the art based on these embodiments without creative effort fall within the protection scope of the present invention.
In the method, device and system for monitoring spam short messages of the embodiments of the present invention, a short message is acquired, the short message set corresponding to its content is determined according to that content, and the calling number and called number of the short message are added to the set. The spam identification process is carried out only when the number of short messages sent in the set is greater than or equal to the set first threshold. In this way the large volume of short message records sent by ordinary users can be excluded, which saves system resources and improves analysis efficiency.
For short messages that enter the spam identification process, whether the short messages in the message set are spam messages is determined according to the propagation track of the short messages in the set. This identification approach can effectively discover the association between different spam-sending numbers, so that a batch of spam-sending numbers can be identified in one pass.
As shown in Fig. 1, a method for monitoring spam short messages according to an embodiment of the present invention comprises:
101. Acquire a short message;
102. Determine, according to the content of the short message, the short message set corresponding to that content, and add the calling number and the called number of the short message to the short message set;
103. When the number of short messages sent in the short message set is greater than or equal to a set first threshold, determine, according to the propagation track of the short messages in the set, whether the short messages in the set are spam messages.
The first threshold can be set as required, for example 500, 1000, 2000 or 10000.
In one embodiment of the present invention, the short message in step 101 may be acquired in real time or in non-real time.
Acquiring the short message in real time comprises:
Interfacing with the short message data source device through Short Message Peer to Peer (SMPP) or a proprietary protocol to acquire the messages to be analyzed in real time, where the short message data source device may be a short message service center or a short message gateway.
Specific examples are as follows:
1) The spam short message monitoring system collects short messages from the short message service center in real time through a real-time interface, which may specifically be an SMPP interface or the like.
2) A signal process machine (SPM) captures short message signaling on Signaling System No. 7 (SS7) network links and then forwards the short messages to the spam short message monitoring system through a custom Transmission Control Protocol (TCP)/Internet Protocol (IP) interface.
Acquiring the short message in non-real time comprises:
Obtaining short message ticket (call detail record) data from the short message data source device through a non-real-time interface and analyzing it, where the non-real-time interface may be a File Transfer Protocol (FTP) interface or the like.
Specifically, the spam short message monitoring system may collect mobile originated (MO) tickets from the short message service center through the FTP interface, thereby acquiring short message data in non-real time.
Optionally, after a short message is acquired, its content, calling number, called number and sending time may be extracted, and anti-interference cleaning may be applied to the content; specifically, special characters in the short message content may be removed.
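The extraction and cleaning step can be sketched as follows. This is a minimal illustration, not the patented implementation: the `ShortMessage` record, the `normalize_content` helper and the definition of "special characters" (here, everything that is not a letter or digit) are assumptions, since the text does not specify them.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class ShortMessage:
    """Hypothetical record holding the four fields extracted from a message."""
    content: str
    calling: str
    called: str
    sent_at: str


# Assumption: a "special character" is anything that is not a letter or a digit
# (whitespace, punctuation, symbols and underscores are all removed).
_SPECIAL = re.compile(r"[\W_]+")


def normalize_content(content: str) -> str:
    """Remove interference characters so that disguised copies compare equal."""
    return _SPECIAL.sub("", content)


msg = ShortMessage("W*i*n a pr-ize!! 中 奖", "13700000000", "13700000001", "2011-07-27 10:00:00")
cleaned = normalize_content(msg.content)
```

Cleaning before hashing means that copies of one spam text padded with different symbols fall into the same short message set.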
In an optional embodiment of the present invention, for the extracted short message content, or the content from which special characters have been removed, determining the short message set corresponding to the content of the short message in step 102 comprises:
Compressing the content of the short message with a data compression algorithm to obtain a short message content value;
Determining the short message set corresponding to the content of the short message according to the short message content value.
Specifically, the data compression algorithm in the embodiments of the present invention may be the Tianlhash algorithm, the Strhash algorithm, the ELFhash algorithm, the Hflp algorithm or the like. Compressing the short message content with the data compression algorithm may specifically mean computing a hash value of the content according to the algorithm; this hash value can be used directly as the short message content value.
Further optionally, determining the short message set corresponding to the content according to the content of the short message specifically comprises:
Looking up the corresponding short message set according to the message content value. If a corresponding short message set is found, the set that was found is used as the short message set corresponding to the content; if no corresponding set is found, a short message set corresponding to the content is created. In the embodiments of the present invention, each short message content value has one corresponding short message set.
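The content value and the find-or-create lookup can be sketched as below, using ELFhash, one of the algorithms the text names. The 32-bit masking, the UTF-8 encoding and the dictionary `message_sets` are assumptions made for the illustration; the sketch also ignores hash collisions, which would place different contents in the same set.

```python
from typing import Dict, List, Tuple


def elf_hash(data: bytes) -> int:
    """Classic ELF (PJW-style) string hash, kept to 32 bits."""
    h = 0
    for byte in data:
        h = ((h << 4) + byte) & 0xFFFFFFFF
        g = h & 0xF0000000
        if g:
            h ^= g >> 24  # fold the top four bits back into the low bits
        h &= ~g & 0xFFFFFFFF  # then clear the top four bits
    return h


# Hypothetical index: content value -> list of (calling number, called number).
message_sets: Dict[int, List[Tuple[str, str]]] = {}


def find_or_create_set(content: str) -> List[Tuple[str, str]]:
    """Return the set for this content, creating an empty one if none exists."""
    content_value = elf_hash(content.encode("utf-8"))
    return message_sets.setdefault(content_value, [])
```

Calling `find_or_create_set` twice with the same content returns the same list object, so every copy of one message accumulates in a single set.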
Specifically, in one embodiment of the present invention, the data structure of the short message set may be as shown in the following table:
[Figure BDA0000079016230000061: table image, not reproduced in the text]
Optionally, in another embodiment of the present invention, the above table may additionally record the short message sending volume: each time an element is added to the table, the sending volume is incremented by 1. It will be understood that the sending volume may also be stored elsewhere, for example in another table indexed by the short message content value that stores the correspondence between content values and sending volumes.
Optionally, adding the calling number and the called number of the short message to the short message set in step 102 comprises:
Adding an element to the short message set, with the calling number and the called number as the information of that element.
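A short message set that stores one element per message and keeps its own sending volume might look like the sketch below. The class name `MessageSet`, its fields and the choice of 500 for the first threshold (one of the example values in the text) are assumptions for illustration only.

```python
from dataclasses import dataclass, field
from typing import List, Tuple

FIRST_THRESHOLD = 500  # example value from the text; set as required


@dataclass
class MessageSet:
    """Hypothetical container for all messages sharing one content value."""
    content_value: int
    elements: List[Tuple[str, str]] = field(default_factory=list)
    send_count: int = 0

    def add(self, calling: str, called: str) -> bool:
        """Add one (calling, called) element and bump the sending volume.

        Returns True once the set has reached the first threshold, i.e. when
        propagation-track analysis should be run on it.
        """
        self.elements.append((calling, called))
        self.send_count += 1
        return self.send_count >= FIRST_THRESHOLD


ms = MessageSet(content_value=1650)
results = [ms.add("13700000000", str(13700000001 + i)) for i in range(500)]
```

Keeping the counter next to the elements avoids recounting the list on every insertion; the text notes the counter could equally live in a separate table keyed by content value.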
In an optional embodiment of the present invention, step 103 may comprise:
Counting the calling numbers in the short message set whose out-degree is greater than 0; calculating the ratio of that count to the count of all numbers in the short message set, where all numbers include both calling numbers and called numbers; and, when the ratio is less than a set second threshold, determining that the short messages in the set are suspected spam messages and that the calling numbers with out-degree greater than 0 are suspected spam-sending numbers. Specifically, the calling numbers with out-degree greater than 0 may be counted as follows: a calling number set is created; the elements of the short message set are taken in order, and for each element it is determined whether its calling number is already stored in the calling number set. If it is, the out-degree of that calling number in the calling number set is incremented by 1; if it is not, the calling number is added to the calling number set with its out-degree set to 1. After all elements of the short message set have been traversed, the count of calling numbers with out-degree greater than 0 is known.
The count of calling numbers with out-degree greater than 0 in the short message set can be denoted t, and the ratio can be expressed as r = t/T, where T is the count of all numbers. The second threshold is an empirical value and can also be set as required, for example 1%, 5% or 10%. When r is less than the second threshold, the short messages in the message set are determined to be suspected spam short messages, and the calling numbers with out-degree greater than 0 are the sending numbers of the suspected spam.
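The traversal and the ratio r = t/T described above can be sketched as follows. The function names and the default of 5% for the second threshold are assumptions; the text only says the threshold is empirical.

```python
from typing import Dict, Iterable, List, Set, Tuple


def out_degree_ratio(elements: Iterable[Tuple[str, str]]) -> Tuple[float, List[str]]:
    """Return (r, senders), where r = t / T.

    t is the count of calling numbers with out-degree > 0 and T is the count
    of all distinct numbers (calling and called) in the short message set.
    """
    out_degree: Dict[str, int] = {}  # the "calling number set" with out-degrees
    numbers: Set[str] = set()
    for calling, called in elements:
        # Already present: out-degree + 1; not present: insert with out-degree 1.
        out_degree[calling] = out_degree.get(calling, 0) + 1
        numbers.add(calling)
        numbers.add(called)
    t, total = len(out_degree), len(numbers)
    r = t / total if total else 0.0
    return r, sorted(out_degree)


def is_suspected_spam(elements: Iterable[Tuple[str, str]],
                      second_threshold: float = 0.05) -> bool:
    """Flag the set when the share of sending numbers is below the threshold."""
    r, _ = out_degree_ratio(list(elements))
    return r < second_threshold


# One sender, 99 distinct recipients: t = 1, T = 100, r = 0.01.
spam_like = [("13700000000", str(13700000001 + i)) for i in range(99)]
```

With these inputs `is_suspected_spam(spam_like)` is true, and the single calling number is returned as the suspected sender.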
A popular short message, such as a greeting or a joke, is usually forwarded again and again by users, so the share of numbers that forward this content readily reaches the set second threshold. Fig. 2 shows the propagation track of a popular short message: after 13800000000 sends a short message to 13800000001, 13800000002 and 13800000003, 13800000002 forwards it to 13800000003 and 13800000004, and 13800000004, after receiving it, also forwards it to 13800000003.
A spam short message, by contrast, is usually not forwarded by the users who receive it. Its calling numbers are few and its called numbers are many, so the propagation track is simple and the share of numbers that forward this content is usually below the set second threshold. Fig. 3 shows the propagation track of a spam short message: after 13700000000 sends a spam short message to 13700000001, 13700000002, 13700000003, 13700000004 and 13700000005, none of these five numbers forwards it; likewise, after 13800000000 sends a spam short message to 13800000001, 13800000002, 13800000003, 13800000004 and 13800000005, none of these five numbers forwards it. In this case, 13700000000 and 13800000000 are the numbers suspected of sending spam short messages. Note that short messages sent by service providers (SPs), companies and the like can show the same propagation track as spam but should not be regarded as spam. A whitelist can therefore be configured so that short messages sent by such SPs or companies are not placed in a short message set and are not processed.
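As a worked example, the ratio r = t/T can be computed for the two tracks described for Fig. 2 and Fig. 3. The helper `sender_ratio` and its `whitelist` parameter are hypothetical names; the edges are exactly those listed in the text. The toy graphs are far too small for thresholds such as 1% to 10% to apply, so only the ordering of the two ratios is meaningful here.

```python
from fractions import Fraction
from typing import FrozenSet, List, Optional, Tuple

Edge = Tuple[int, int]  # (calling number, called number)


def sender_ratio(edges: List[Edge],
                 whitelist: FrozenSet[int] = frozenset()) -> Optional[Fraction]:
    """r = t / T after dropping messages sent by whitelisted numbers."""
    kept = [(a, b) for a, b in edges if a not in whitelist]
    callers = {a for a, _ in kept}
    numbers = callers | {b for _, b in kept}
    return Fraction(len(callers), len(numbers)) if numbers else None


p = 13800000000
fig2 = [(p, p + 1), (p, p + 2), (p, p + 3),   # original sender
        (p + 2, p + 3), (p + 2, p + 4),       # first forwarder
        (p + 4, p + 3)]                       # second forwarder

a, b = 13700000000, 13800000000
fig3 = [(a, a + k) for k in range(1, 6)] + [(b, b + k) for k in range(1, 6)]

r_popular = sender_ratio(fig2)  # 3 senders out of 5 numbers
r_spam = sender_ratio(fig3)     # 2 senders out of 12 numbers
```

Here `r_popular` is 3/5 and `r_spam` is 1/6. In a real set with thousands of recipients per sender, the spam ratio would fall much further, which is what the second threshold relies on.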
The above identification method can therefore effectively discover the association between different spam-sending numbers and identify a batch of spam-sending numbers in one pass.
Specifically, in the embodiments of the present invention, the calling numbers with out-degree greater than 0 in the short message set may be counted using the propagation track of the short messages in the message set, which can be represented by a directed graph G(V, E), where V is the set of all numbers appearing in the elements of the message set (each element consisting of a calling number and a called number) and E is the set of all short messages;
n = |V| denotes the count of all numbers in V, and e = |E| denotes the number of short messages sent in E;
A directed edge (i, j) between number i and number j denotes that number i sent a short message to number j, where i and j are taken from the set V;
Let $d_i^{in}$ denote the in-degree of number i, that is, the number of short messages for which i is the called number, and let $d_i^{out}$ denote the out-degree of number i, that is, the number of short messages for which i is the calling number; then $\sum_{i=1}^{n} d_i^{in} = \sum_{i=1}^{n} d_i^{out} = e$;
For a directed edge (i, j), number i is said to be adjacent to number j, and number j is said to be adjacent from number i. An adjacency list represents the set of numbers that a given number is adjacent to or adjacent from; in the embodiments of the present invention it represents the set of numbers that a given number has called or been called by.
In the above embodiment, counting the calling numbers with out-degree greater than 0 was described using out-degree alone as an example. If a linked list of adjacency relations is applied in that scenario, then when the calling number in the information of the current element is found to be already in the calling number set, the out-degree of that calling number in the calling number set is incremented by 1 and the called number is added to the adjacency list; if it is not in the set, the calling number is added to the calling number set, its out-degree is set to 1, and the called number is added to the adjacency list;
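The directed-graph view with an adjacency list can be sketched as follows. The function `build_graph` and its return shape are assumptions for illustration; the sketch uses Python dictionaries of lists in place of the linked lists mentioned in the text.

```python
from collections import defaultdict
from typing import DefaultDict, Iterable, List, Tuple


def build_graph(elements: Iterable[Tuple[str, str]]):
    """Build adjacency lists and degree tables from (calling, called) elements."""
    adjacency: DefaultDict[str, List[str]] = defaultdict(list)  # i -> numbers i sent to
    out_degree: DefaultDict[str, int] = defaultdict(int)
    in_degree: DefaultDict[str, int] = defaultdict(int)
    for calling, called in elements:
        adjacency[calling].append(called)  # directed edge (calling, called)
        out_degree[calling] += 1
        in_degree[called] += 1
    return adjacency, out_degree, in_degree


# Edges of the popular-message track described for Fig. 2.
edges = [("13800000000", "13800000001"), ("13800000000", "13800000002"),
         ("13800000000", "13800000003"), ("13800000002", "13800000003"),
         ("13800000002", "13800000004"), ("13800000004", "13800000003")]
adjacency, out_degree, in_degree = build_graph(edges)
```

Every edge contributes one unit of out-degree and one unit of in-degree, so both degree tables sum to the number of short messages e, which is a quick consistency check on the structure.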
Specifically, in the embodiments of the present invention, the data structure of each element in the set V, described in linked-list form, may be as follows:
[Figure BDA0000079016230000091: image of the data structure, not reproduced in the text]
Because popular short messages are forwarded between different users while recipients of spam short messages essentially never forward them, in another optional embodiment of the present invention step 103 may also be implemented in another way, comprising:
Counting the called numbers in the short message set whose in-degree is greater than 0;
Calculating the ratio of that count to the count of all numbers in the short message set, where all numbers include both calling numbers and called numbers;
When the ratio is greater than a set third threshold, determining that the short messages in the short message set are suspected spam messages and that the calling numbers that sent the short messages in the set are suspected spam-sending numbers. The third threshold is also an empirical value and can be set as required, for example 99%, 95% or 90%.
In an optional embodiment of the present invention, adding the calling number and the called number of the short message to the short message set in step 102 comprises:
Adding an element to the short message set, with the calling number and the called number as the information of that element;
Counting the called numbers in the short message set whose in-degree is greater than 0 comprises:
Obtaining the information of the elements in the short message set in order;
Determining whether the called number in the information of the current element is already in the called number set. If it is, the in-degree of that called number in the called number set is incremented by 1; if it is not, the called number is added to the called number set with its in-degree set to 1.
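The in-degree variant can be sketched in the same way. The function names and the default of 95% for the third threshold are assumptions; the text gives 99%, 95% and 90% only as examples of an empirical value.

```python
from typing import Dict, Iterable, List, Set, Tuple


def in_degree_ratio(elements: Iterable[Tuple[str, str]]) -> float:
    """Share of all numbers in the set that have in-degree greater than 0."""
    in_degree: Dict[str, int] = {}  # the "called number set" with in-degrees
    numbers: Set[str] = set()
    for calling, called in elements:
        # Already present: in-degree + 1; not present: insert with in-degree 1.
        in_degree[called] = in_degree.get(called, 0) + 1
        numbers.add(calling)
        numbers.add(called)
    return len(in_degree) / len(numbers) if numbers else 0.0


def suspected_senders(elements: List[Tuple[str, str]],
                      third_threshold: float = 0.95) -> List[str]:
    """Return all calling numbers of the set if the ratio exceeds the threshold."""
    if in_degree_ratio(elements) > third_threshold:
        return sorted({calling for calling, _ in elements})
    return []


# One sender, 99 distinct recipients: 99 of 100 numbers have in-degree > 0.
spam_like = [("13700000000", str(13700000001 + i)) for i in range(99)]
```

For `spam_like` the ratio is 0.99, so the single calling number is reported as a suspected sender under the assumed 95% threshold.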
Note that this embodiment describes the short message identification process using in-degree alone as an example. The principle is the same as that of the embodiment above that uses out-degree alone, so it is not described again here; for details, refer to the specific scheme of the above embodiment.
In an optional embodiment of the present invention, when the short messages in the message set are determined to be spam messages according to the propagation track of the short messages in the set, the method may further comprise:
Handling the spam short messages and the numbers that sent them;
Specifically, common ways of handling suspected spam short messages include at least one of the following three:
1) Adding the numbers determined to have sent suspected spam short messages to a blacklist and synchronizing the blacklist to external systems, such as the SMSC (Short Message Service Center) and the BOSS (Business and Operation Support System);
2) Submitting information such as the numbers determined to have sent suspected spam short messages, the short message content and the sending volume to a manual review platform for a second, manual confirmation;
3) Adding the corresponding short message content as a keyword entry for interception.
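The three handling options can be sketched as a small dispatcher over in-memory structures. Everything here is hypothetical: the `Handling` enum, the function signature and the containers stand in for real blacklist, review-platform and keyword-library interfaces, and synchronization to external systems such as the SMSC or BOSS is left out.

```python
from enum import Enum
from typing import Dict, Iterable, List, Set


class Handling(Enum):
    BLACKLIST = 1      # option 1: add sending numbers to the blacklist
    MANUAL_REVIEW = 2  # option 2: hand over for manual secondary confirmation
    KEYWORD = 3        # option 3: add the content as a keyword entry


def handle_suspects(senders: Iterable[str], content: str, send_count: int,
                    actions: Set[Handling], blacklist: Set[str],
                    review_queue: List[Dict[str, object]],
                    keywords: Set[str]) -> None:
    """Apply at least one of the three handling options to a suspected set."""
    sender_list = sorted(senders)
    if Handling.BLACKLIST in actions:
        blacklist.update(sender_list)
    if Handling.MANUAL_REVIEW in actions:
        review_queue.append({"senders": sender_list, "content": content,
                             "send_count": send_count})
    if Handling.KEYWORD in actions:
        keywords.add(content)


blacklist: Set[str] = set()
review_queue: List[Dict[str, object]] = []
keywords: Set[str] = set()
handle_suspects(["13800000000", "13700000000"], "Winaprize", 1200,
                {Handling.BLACKLIST, Handling.MANUAL_REVIEW},
                blacklist, review_queue, keywords)
```

Passing the actions as a set mirrors the wording "at least one of the following three": any combination can be enabled per deployment.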
As shown in Fig. 4, a spam short message monitoring device according to an embodiment of the present invention comprises:
A message collection module 21, configured to acquire a short message;
A data preprocessing module 22, configured to determine, according to the content of the short message, the short message set corresponding to that content, and to add the calling number and the called number of the short message to the short message set;
A message identification module 23, configured to determine, when the number of short messages sent in the short message set is greater than or equal to a set first threshold, whether the short messages in the set are spam messages according to the propagation track of the short messages in the set.
Optionally, the message collection module 21 may specifically be configured to:
Interface with the short message data source device through a real-time interface to acquire the messages to be analyzed in real time; or
Obtain, in non-real time through a non-real-time interface, the ticket data of the short messages in the short message data source device for analysis.
Further optionally, the message collection module sends the acquired short message data packets to the data preprocessing module. When short message traffic is heavy and the processing capacity of a single server is limited, the data preprocessing module needs to be deployed as a distributed cluster. In that case, the traditional load-sharing mode that distributes by calling-number rules cannot guarantee that short messages with the same content but different calling numbers are dispatched to the same server, so this scheme provides two load-sharing modes:
One mode distributes load according to the length of the short message content. For example, short messages with content shorter than 20 bytes are sent to server 1, those of 20 to 39 bytes to server 2, those of 40 to 70 bytes to server 3, and those longer than 70 bytes to server 4;
The other mode first converts the short message content into a short message content value (such as an integer) with some algorithm and then applies a traditional load-sharing mode, for example balancing load according to the trailing digits of the content value.
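Both load-sharing modes can be sketched in a few lines. The byte ranges are those given in the text; measuring length in UTF-8 bytes and interpreting "trailing digits of the content value" as a remainder modulo the server count are assumptions made for the illustration.

```python
def server_by_length(content: str) -> int:
    """Mode 1: pick a server (numbered 1 to 4) from the content length in bytes.

    Assumption: length is measured on the UTF-8 encoding; a real deployment
    would use whatever encoding the short messages arrive in.
    """
    n = len(content.encode("utf-8"))
    if n < 20:
        return 1
    if n < 40:
        return 2
    if n <= 70:
        return 3
    return 4


def server_by_content_value(content_value: int, server_count: int) -> int:
    """Mode 2: pick a server index (0-based) from the content value's remainder."""
    return content_value % server_count
```

Either mode sends identical content to the same server regardless of the calling number, which is the property the calling-number-based distribution lacks. The second mode usually spreads load more evenly, since message lengths tend to cluster.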
In one embodiment of the invention, described message identification module 23 specifically is used for: add up described short message set out-degree greater than the quantity of 0 calling number; Calculate described out-degree and account for institute's number ratio in the described short message set greater than the quantity of 0 calling number, described all numbers comprise calling number and called number; When described ratio during less than the Second Threshold set, determine that then the short message in the described short message set is the suspicion rubbish message, described out-degree is that the suspicion rubbish message sends number greater than 0 calling number.
In one embodiment of the invention, described data preprocessing module 22 increases calling number and the called number of described short message in the short message set, specifically can comprise: in the set of described short message, increase an element, with described calling number and the called number information as described element;
Out-degree is greater than the quantity of 0 calling number in the described short message set of described message identification module 23 statistics, and comprising: order is obtained the information of element in the described short message set; Judge whether calling number is included in the calling number set in the information of currentElement; If comprise, then in calling number set with the information of currentElement in the out-degree of calling number add 1; If do not comprise, the calling number in the information of newly-increased currentElement in the set of described calling number then, and the out-degree of calling number in the information of currentElement is made as 1.
In another embodiment of the invention, the message identification module 23 is specifically configured to: count the called numbers in the short message set whose in-degree is greater than 0; calculate the ratio of that count to the count of all numbers in the short message set, where all numbers comprise the calling numbers and the called numbers; and, when the ratio is greater than a set third threshold, determine that the short messages in the short message set are suspected spam messages and that the calling numbers that sent the short messages in the short message set are suspected spam sending numbers.
In another embodiment of the invention, the data preprocessing module 22 adding the calling number and the called number of the short message to the short message set specifically comprises: adding an element to the short message set, with the calling number and the called number as the information of that element;
The message identification module 23 counting the called numbers in the short message set whose in-degree is greater than 0 comprises: reading the information of the elements in the short message set in order; judging whether the called number in the information of the current element is included in a called number set; if it is, incrementing by 1 the in-degree of that called number in the called number set; if it is not, adding the called number in the information of the current element to the called number set and setting its in-degree to 1.
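The in-degree variant can be sketched in the same way. Again this is only a sketch: elements are modeled as `(calling number, called number)` tuples, "all numbers" is assumed to mean the distinct numbers in the set, and the function names and the threshold value 0.8 are hypothetical.

```python
from typing import Dict, List, Set, Tuple

Element = Tuple[str, str]  # (calling number, called number)


def count_in_degrees(message_set: List[Element]) -> Dict[str, int]:
    """Walk the elements in order and build the called number set."""
    in_degree: Dict[str, int] = {}
    for _calling, called in message_set:
        if called in in_degree:
            in_degree[called] += 1   # already present: add 1
        else:
            in_degree[called] = 1    # newly added: in-degree is 1
    return in_degree


def check_by_in_degree(message_set: List[Element],
                       third_threshold: float) -> Tuple[bool, Set[str]]:
    """Return (suspected spam?, suspected sending numbers)."""
    if not message_set:
        return False, set()
    in_degree = count_in_degrees(message_set)
    # Assumption: "all numbers" means distinct calling and called numbers.
    all_numbers = {number for pair in message_set for number in pair}
    ratio = len(in_degree) / len(all_numbers)
    if ratio > third_threshold:
        # Every calling number in the set is a suspected sender.
        return True, {calling for calling, _called in message_set}
    return False, set()


# Nine recipients out of ten distinct numbers: ratio 0.9.
broadcast = [("A", f"user{i}") for i in range(9)]
flagged, senders = check_by_in_degree(broadcast, third_threshold=0.8)

# Three senders, one recipient: ratio 0.25, below the threshold.
funnel = [(f"s{i}", "X") for i in range(3)]
funnel_flagged, _ = check_by_in_degree(funnel, third_threshold=0.8)
```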
In an optional embodiment of the invention, the data preprocessing module 22 determining the short message set corresponding to the content of the short message according to that content specifically comprises: compressing the content of the short message using a data compression algorithm to obtain a short message content value; and determining the short message set corresponding to the content of the short message according to the short message content value.
In an optional embodiment of the invention, the data preprocessing module 22 determining the short message set corresponding to the content according to the content of the short message further comprises: searching for the corresponding short message set according to the short message content value; if a corresponding short message set is found, using the found short message set as the short message set corresponding to the content; if no corresponding short message set is found, generating a short message set corresponding to the content.
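The look-up-or-create step can be sketched as follows. The text does not name the data compression algorithm, so this sketch substitutes a SHA-256 digest as an assumed way of reducing content to a fixed-size content value; the dictionary `message_sets` and the function names are hypothetical.

```python
import hashlib
from typing import Dict, List, Tuple

Element = Tuple[str, str]  # (calling number, called number)

# Maps a content value to its short message set.
message_sets: Dict[str, List[Element]] = {}


def content_value(content: str) -> str:
    """Reduce content to a fixed-size value (SHA-256 is an assumed stand-in)."""
    return hashlib.sha256(content.encode("utf-8")).hexdigest()


def add_message(calling: str, called: str, content: str) -> List[Element]:
    """Find the set for this content, creating it if absent, and add an element."""
    key = content_value(content)
    message_set = message_sets.get(key)
    if message_set is None:
        # No set found for this content value: generate a new one.
        message_set = []
        message_sets[key] = message_set
    message_set.append((calling, called))
    return message_set


add_message("A", "B", "Cheap loans")
add_message("C", "D", "Cheap loans")
add_message("A", "E", "Lunch at noon?")
```

An exact digest only groups messages with identical content; grouping similar content would need a different content value function.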
It should be noted that the spam short message monitoring device embodiments are derived directly from the method embodiments of the invention and contain technical solutions identical or corresponding to those of the method embodiments. Each module in the device embodiments corresponds to a step in the method embodiments; for details, refer to the description of the method embodiments.
As shown in Figure 5, a spam short message monitoring system according to an embodiment of the invention comprises:
A short message data source device 31, configured to provide short message data to the spam short message monitoring device for spam short message identification, and to receive the spam short message identification results;
And the spam short message monitoring device 32 provided by the embodiments of the invention.
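The interaction between the two devices can be sketched as follows. This is a simplified illustration, not the patented implementation: the class `SpamMonitor`, its `submit` method, the use of raw content as the set key, and both threshold values are assumptions. It applies the first threshold to the size of the short message set and then the out-degree ratio check.

```python
from collections import defaultdict
from typing import Dict, List, Optional, Set, Tuple


class SpamMonitor:
    """Hypothetical monitoring device fed by a data source device."""

    def __init__(self, first_threshold: int, second_threshold: float):
        self.first_threshold = first_threshold
        self.second_threshold = second_threshold
        # Simplification: the raw content itself is used as the set key.
        self.sets: Dict[str, List[Tuple[str, str]]] = defaultdict(list)

    def submit(self, calling: str, called: str,
               content: str) -> Optional[Set[str]]:
        """Return suspected sending numbers, or None if nothing is flagged."""
        pairs = self.sets[content]
        pairs.append((calling, called))
        if len(pairs) < self.first_threshold:
            return None  # too few messages with this content to judge
        callers = {c for c, _ in pairs}
        everyone = callers | {d for _, d in pairs}
        if len(callers) / len(everyone) < self.second_threshold:
            return callers
        return None


# Two numbers each send only three copies of the same content.
monitor = SpamMonitor(first_threshold=6, second_threshold=0.3)
results = [monitor.submit(f"s{i % 2}", f"u{i}", "Buy now") for i in range(6)]
```

Neither number sends many messages on its own, but the sixth message brings the set to the first threshold and the ratio of 2 senders to 8 distinct numbers (0.25) falls below the second threshold, so both numbers are returned together.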
The spam short message monitoring method, device and system provided by the embodiments of the invention not only solve the problem that a short message sender who evades system monitoring by reducing the sending volume of each single number cannot be detected in time; by analyzing the relationships between the sending numbers and receiving numbers of short messages with identical or similar content, they can also identify a batch of spam short message sending numbers in a single pass.
A person of ordinary skill in the art will understand that all or part of the processes in the above method embodiments may be implemented by a computer program instructing the relevant hardware. The program may be stored in a computer-readable storage medium and, when executed, may include the processes of the method embodiments described above. The storage medium may be a magnetic disk, an optical disc, a read-only memory (ROM), a random access memory (RAM), or the like.
The above is only a preferred embodiment of the invention, but the scope of protection of the invention is not limited to it. Any variation or replacement that a person skilled in the art could readily conceive within the technical scope disclosed by the invention shall fall within the scope of protection of the invention. Therefore, the scope of protection of the invention shall be determined by the scope of protection of the claims.

Claims (15)

1. A spam short message monitoring method, characterized in that it comprises:
Obtaining a short message;
Determining the short message set corresponding to the content of the short message according to that content, and adding the calling number and the called number of the short message to the short message set;
When the number of short messages sent in the short message set is greater than or equal to a set first threshold, determining whether the short messages in the short message set are spam messages according to the propagation trajectory of the short messages in the short message set.
2. The method of claim 1, characterized in that determining whether the short messages in the short message set are spam messages according to the propagation trajectory of the short messages in the short message set comprises:
Counting the calling numbers in the short message set whose out-degree is greater than 0;
Calculating the ratio of the count of calling numbers whose out-degree is greater than 0 to the count of all numbers in the short message set, where all numbers comprise the calling numbers and the called numbers;
When the ratio is less than a set second threshold, determining that the short messages in the short message set are suspected spam messages and that the calling numbers whose out-degree is greater than 0 are suspected spam sending numbers.
3. The method of claim 2, characterized in that adding the calling number and the called number of the short message to the short message set comprises:
Adding an element to the short message set, with the calling number and the called number as the information of the element;
And counting the calling numbers in the short message set whose out-degree is greater than 0 comprises:
Reading the information of the elements in the short message set in order;
Judging whether the calling number in the information of the current element is included in a calling number set; if it is, incrementing by 1 the out-degree of that calling number in the calling number set; if it is not, adding the calling number in the information of the current element to the calling number set and setting its out-degree to 1.
4. The method of claim 1, characterized in that determining whether the short messages in the short message set are spam messages according to the propagation trajectory of the short messages in the short message set comprises:
Counting the called numbers in the short message set whose in-degree is greater than 0;
Calculating the ratio of the count of called numbers whose in-degree is greater than 0 to the count of all numbers in the short message set, where all numbers comprise the calling numbers and the called numbers;
When the ratio is greater than a set third threshold, determining that the short messages in the short message set are suspected spam messages and that the calling numbers that sent the short messages in the short message set are suspected spam sending numbers.
5. The method of claim 4, characterized in that adding the calling number and the called number of the short message to the short message set comprises:
Adding an element to the short message set, with the calling number and the called number as the information of the element;
And counting the called numbers in the short message set whose in-degree is greater than 0 comprises:
Reading the information of the elements in the short message set in order;
Judging whether the called number in the information of the current element is included in a called number set; if it is, incrementing by 1 the in-degree of that called number in the called number set; if it is not, adding the called number in the information of the current element to the called number set and setting its in-degree to 1.
6. The method of any one of claims 1 to 5, characterized in that determining the short message set corresponding to the content of the short message according to that content comprises:
Compressing the content of the short message using a data compression algorithm to obtain a short message content value;
Determining the short message set corresponding to the content of the short message according to the short message content value.
7. The method of claim 6, characterized in that determining the short message set corresponding to the content according to the content of the short message is specifically:
Searching for the corresponding short message set according to the short message content value;
If a corresponding short message set is found according to the short message content value, using the found short message set as the short message set corresponding to the content;
If no corresponding short message set is found according to the short message content value, generating a short message set corresponding to the content.
8. A spam short message monitoring device, characterized in that it comprises:
A message collection module, configured to obtain a short message;
A data preprocessing module, configured to determine the short message set corresponding to the content of the short message according to that content, and to add the calling number and the called number of the short message to the short message set;
A message identification module, configured to determine, when the number of short messages sent in the short message set is greater than or equal to a set first threshold, whether the short messages in the short message set are spam messages according to the propagation trajectory of the short messages in the short message set.
9. The device of claim 8, characterized in that the message identification module is specifically configured to:
Count the calling numbers in the short message set whose out-degree is greater than 0;
Calculate the ratio of the count of calling numbers whose out-degree is greater than 0 to the count of all numbers in the short message set, where all numbers comprise the calling numbers and the called numbers;
When the ratio is less than a set second threshold, determine that the short messages in the short message set are suspected spam messages and that the calling numbers whose out-degree is greater than 0 are suspected spam sending numbers.
10. The device of claim 9, characterized in that the data preprocessing module adding the calling number and the called number of the short message to the short message set specifically comprises:
Adding an element to the short message set, with the calling number and the called number as the information of the element;
The message identification module counting the calling numbers in the short message set whose out-degree is greater than 0 specifically comprises: reading the information of the elements in the short message set in order; judging whether the calling number in the information of the current element is included in a calling number set; if it is, incrementing by 1 the out-degree of that calling number in the calling number set; if it is not, adding the calling number in the information of the current element to the calling number set and setting its out-degree to 1.
11. The device of claim 8, characterized in that the message identification module is specifically configured to:
Count the called numbers in the short message set whose in-degree is greater than 0;
Calculate the ratio of the count of called numbers whose in-degree is greater than 0 to the count of all numbers in the short message set, where all numbers comprise the calling numbers and the called numbers;
When the ratio is greater than a set third threshold, determine that the short messages in the short message set are suspected spam messages and that the calling numbers that sent the short messages in the short message set are suspected spam sending numbers.
12. The device of claim 11, characterized in that the data preprocessing module adding the calling number and the called number of the short message to the short message set specifically comprises:
Adding an element to the short message set, with the calling number and the called number as the information of the element;
The message identification module counting the called numbers in the short message set whose in-degree is greater than 0 specifically comprises: reading the information of the elements in the short message set in order; judging whether the called number in the information of the current element is included in a called number set; if it is, incrementing by 1 the in-degree of that called number in the called number set; if it is not, adding the called number in the information of the current element to the called number set and setting its in-degree to 1.
13. The device of any one of claims 8 to 12, characterized in that the data preprocessing module determining the short message set corresponding to the content of the short message according to that content specifically comprises:
Compressing the content of the short message using a data compression algorithm to obtain a short message content value;
Determining the short message set corresponding to the content of the short message according to the short message content value.
14. The device of claim 13, characterized in that the data preprocessing module determining the short message set corresponding to the content according to the content of the short message specifically comprises:
Searching for the corresponding short message set according to the short message content value;
If a corresponding short message set is found according to the short message content value, using the found short message set as the short message set corresponding to the content;
If no corresponding short message set is found according to the short message content value, generating a short message set corresponding to the content.
15. A spam short message monitoring system, characterized in that it comprises:
A short message data source device, configured to provide short message data to the spam short message monitoring device for spam short message identification, and to receive the spam short message identification results; and,
The spam short message monitoring device of any one of claims 8 to 14.
CN201110212033.9A 2011-07-27 2011-07-27 A kind of junk short message monitoring method, Apparatus and system Active CN102905236B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201110212033.9A CN102905236B (en) 2011-07-27 2011-07-27 A kind of junk short message monitoring method, Apparatus and system

Publications (2)

Publication Number Publication Date
CN102905236A true CN102905236A (en) 2013-01-30
CN102905236B CN102905236B (en) 2016-08-17

Family

ID=47577233

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201110212033.9A Active CN102905236B (en) 2011-07-27 2011-07-27 A kind of junk short message monitoring method, Apparatus and system

Country Status (1)

Country Link
CN (1) CN102905236B (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101321365A (en) * 2008-07-17 2008-12-10 浙江大学 Rubbish message sending user identification method by message reply frequency
CN101335920A (en) * 2008-07-15 2008-12-31 中国联合通信有限公司 Rubbish short message recognition system and method based on calling number location and transmitted content
CN101355728A (en) * 2008-05-06 2009-01-28 中国移动通信集团江苏有限公司 SMS life energy system and judging method thereof
CN101572870A (en) * 2008-05-03 2009-11-04 祁勇 Method for monitoring junk information in communication network
WO2010145403A1 (en) * 2009-10-30 2010-12-23 中兴通讯股份有限公司 Method, system, control console and management machine for determining spam messages
CN101977360A (en) * 2010-09-30 2011-02-16 北京新媒传信科技有限公司 Junk short message filter method

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104219672A (en) * 2014-10-14 2014-12-17 北京奇虎科技有限公司 Incoming call or message identification method and device
CN104219672B (en) * 2014-10-14 2017-08-22 北京奇虎科技有限公司 Incoming call or short message recognition methods and device
CN106162584A (en) * 2015-01-27 2016-11-23 北京奇虎科技有限公司 Identify the method for refuse messages, client, cloud server and system
CN106162584B (en) * 2015-01-27 2020-04-24 北京奇虎科技有限公司 Method, client, cloud server and system for identifying spam messages
CN106454818A (en) * 2015-08-06 2017-02-22 中国移动通信集团四川有限公司 Data information service credit control method and data information service credit control device
CN106815200A (en) * 2015-11-30 2017-06-09 任子行网络技术股份有限公司 Objectionable text detection method and device based on keyword
CN114302351A (en) * 2022-03-09 2022-04-08 太平金融科技服务(上海)有限公司深圳分公司 Short message service processing method and device, computer equipment and storage medium
CN114302351B (en) * 2022-03-09 2022-06-17 太平金融科技服务(上海)有限公司深圳分公司 Short message service processing method and device, computer equipment and storage medium

Also Published As

Publication number Publication date
CN102905236B (en) 2016-08-17

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant