CN102111767A - Method for improving correct rate of identifying junk short message number based on called dispersed degree - Google Patents

Info

Publication number
CN102111767A
CN102111767A CN2009102006514A CN200910200651A CN102111767A CN 102111767 A CN102111767 A CN 102111767A CN 2009102006514 A CN2009102006514 A CN 2009102006514A CN 200910200651 A CN200910200651 A CN 200910200651A CN 102111767 A CN102111767 A CN 102111767A
Authority
CN
China
Prior art keywords
note
called
threshold value
refuse messages
calling number
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN2009102006514A
Other languages
Chinese (zh)
Inventor
肖克华
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
LIANGJIANG COMMUNICATIONS SYSTEM CO Ltd
Original Assignee
LIANGJIANG COMMUNICATIONS SYSTEM CO Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by LIANGJIANG COMMUNICATIONS SYSTEM CO Ltd
Priority to CN2009102006514A
Publication of CN102111767A
Legal status: Pending

Landscapes

  • Data Exchanges In Wide-Area Networks (AREA)

Abstract

The invention discloses a method for improving the accuracy of identifying junk short message numbers based on the called dispersed degree. The method comprises the following steps: setting a called dispersed degree threshold; within a set time range, recording each short message sent by a calling number and adding the called number to the attributes of each message; when the number of messages sent by the calling number within the set time range exceeds the flow threshold, calculating the called dispersed degree of the messages sent by that calling number, namely the number of distinct called numbers as a percentage of the total number of messages sent; when the called dispersed degree exceeds the set threshold, regarding the calling number as a junk short message number; and when it does not, regarding the calling number as not being a junk short message number. With this method, when a number's short message traffic exceeds the frequency limit, the subscriber's sending behavior is analyzed according to the characteristics of the called numbers, thereby excluding normal short message numbers, reducing the false-interception rate of junk short message interception, and improving the accuracy of identifying junk short messages.

Description

Method for improving the accuracy of junk short message number identification based on called dispersion
Technical field
The present invention relates to a method for improving the accuracy of junk short message number identification when junk short messages are detected through traffic statistics, and in particular to a method for improving that accuracy based on called dispersion.
Background art
The Short Message Service (SMS) is a basic service of mobile communication networks. While it gives users a convenient way to exchange messages, it also provides a channel for spreading junk information. Junk short messages are becoming increasingly prevalent; they not only lead to customer complaints but also raise the problem of malicious arrears (deliberately unpaid charges), so junk short messages need to be monitored and intercepted in real time.
The time, frequency, and traffic characteristics of junk short messages include: 1. continuous sending over a very long period; 2. a large volume per unit of time; 3. dispersed called numbers.
At present, based on the time characteristics of junk short messages, an effective approach is to count the short message traffic sent by each number within a time range, identify numbers that send a large volume of short messages within that period, list them as junk short message numbers, and restrict them from sending short messages.
This approach detects junk short message numbers promptly from the perspective of their time characteristics. However, normal users who send short messages frequently also have a high sending rate, so their numbers are likewise listed as junk short message numbers and their messages fail to be delivered.
The original traffic calculation computes how frequently a calling number sends short messages within a time period; a number that exceeds the flow threshold is treated as a junk short message number.
Referring to Fig. 1, a schematic diagram of the prior-art statistics queue without called numbers: the flow threshold is set to 10 messages per minute, and a number that exceeds this value is treated as a junk short message number. Number 8613988888888 reached 10 messages within the 45 seconds from 08:00:00 to 08:00:45, so it is listed as a junk short message number.
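The prior-art check described above can be sketched as a sliding-window count. This is only an illustration under stated assumptions: the function name `exceeds_flow_threshold`, the even 5-second spacing of the messages, and the calendar date are all made up here, and the comparison uses "at least 10" because the Fig. 1 example flags the number once it has reached 10 messages.

```python
from datetime import datetime, timedelta


def exceeds_flow_threshold(send_times, window=timedelta(minutes=1), limit=10):
    """Return True if any span of length `window` contains at least `limit` sends.

    Assumption: "reaching" the limit counts as over the threshold, as in Fig. 1.
    """
    times = sorted(send_times)
    start = 0
    for end, current in enumerate(times):
        # Slide the left edge forward until the span fits inside the window.
        while current - times[start] > window:
            start += 1
        if end - start + 1 >= limit:
            return True
    return False


# Hypothetical timestamps: 10 messages from 08:00:00 to 08:00:45 (date is arbitrary).
base = datetime(2000, 1, 1, 8, 0, 0)
sends = [base + timedelta(seconds=5 * i) for i in range(10)]
flagged = exceeds_flow_threshold(sends)
```

Because this check looks only at volume, a normal user who sends ten messages to one friend in under a minute is flagged just like a bulk sender, which is the weakness the invention addresses.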
Summary of the invention
The object of the present invention is to overcome the defects of the prior art by providing a method for improving the accuracy of junk short message number identification based on called dispersion. When a number's short message traffic exceeds the frequency limit, the method analyzes the user's sending behavior according to the characteristics of the called numbers, excludes normal short message numbers, reduces the false-interception rate of junk short message interception, and improves the accuracy of junk short message identification.
The technical solution that achieves this object is a method for improving the accuracy of junk short message number identification based on called dispersion, in which:
A called dispersion threshold is set;
Within a set time range, every short message sent by a calling number is recorded, and the called number is added to the attributes of each short message;
When the number of short messages sent by the calling number within the set time range exceeds the flow threshold, the called dispersion of the short messages sent by that calling number is calculated, namely the number of distinct (non-repeated) called numbers as a percentage of the total number of short messages sent;
When the called dispersion exceeds the set called dispersion threshold, the calling number is considered a junk short message number;
When the called dispersion does not reach the set called dispersion threshold, the calling number is not considered a junk short message number.
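The dispersion calculation and the two-stage decision can be sketched in a few lines. The function names and default values are illustrative assumptions: the flow threshold of 10 follows the Fig. 1 example, the 0.8 dispersion threshold follows the 80% example value given in the embodiment, and treating "reached" as over the flow threshold is a choice made here.

```python
def called_dispersion(called_numbers):
    """Distinct called numbers divided by total messages, as a fraction in [0, 1]."""
    if not called_numbers:
        return 0.0
    return len(set(called_numbers)) / len(called_numbers)


def is_junk_number(called_numbers, flow_threshold=10, dispersion_threshold=0.8):
    """Apply the flow check first, then the called dispersion check.

    `called_numbers` holds one entry per message sent within the time range.
    """
    if len(called_numbers) < flow_threshold:
        # Below the flow threshold: dispersion is never evaluated.
        return False
    return called_dispersion(called_numbers) > dispersion_threshold
```

A sender whose ten messages all go to one recipient has a dispersion of 1/10 and passes, while a sender whose ten messages go to ten different recipients has a dispersion of 10/10 and is flagged.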
The above method for improving the accuracy of junk short message number identification based on called dispersion comprises the following steps:
First, the sending time and called number of every short message sent by each calling number are recorded, and each short message sent by the calling number is checked to determine whether the flow threshold within the specified time period has been reached; processing then depends on the resulting state:
If the flow threshold is not exceeded, the expired short messages of this calling number are removed, the short messages within the specified time period are kept, the time and called number of every short message are recorded, and monitoring continues;
If the flow threshold is exceeded but the set called dispersion is not reached, the expired short messages of this calling number are removed, the short messages within the specified time period are kept, the time and called number of every short message are recorded, the number is not treated as a junk short message number, and monitoring continues;
If the flow threshold is exceeded and the set called dispersion is also exceeded, the expired short messages of this calling number are removed, the short messages within the specified time period are kept, the time and called number of every short message are recorded, and the number is added to the list of junk short message numbers.
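The three states can be sketched as a small per-caller monitor that drops expired records before each decision. Everything named here (`DispersionMonitor`, `Verdict`, the parameter defaults) is hypothetical, timestamps are assumed to be seconds arriving in non-decreasing order, and the middle state follows the rule stated earlier that a number whose dispersion does not reach the threshold is not treated as junk.

```python
from collections import defaultdict, deque
from enum import Enum


class Verdict(Enum):
    NORMAL = "flow threshold not reached"
    HIGH_VOLUME_NOT_JUNK = "flow threshold reached, dispersion not exceeded"
    JUNK = "flow threshold reached, dispersion exceeded"


class DispersionMonitor:
    def __init__(self, window_seconds=60, flow_threshold=10, dispersion_threshold=0.8):
        self.window_seconds = window_seconds
        self.flow_threshold = flow_threshold
        self.dispersion_threshold = dispersion_threshold
        # calling number -> deque of (timestamp, called number)
        self.records = defaultdict(deque)

    def observe(self, calling, called, timestamp):
        """Record one message and return the verdict for the calling number."""
        queue = self.records[calling]
        queue.append((timestamp, called))
        # Remove expired messages; keep only those inside the time period.
        while queue and timestamp - queue[0][0] > self.window_seconds:
            queue.popleft()
        if len(queue) < self.flow_threshold:
            return Verdict.NORMAL
        distinct = len({number for _, number in queue})
        if distinct / len(queue) > self.dispersion_threshold:
            return Verdict.JUNK
        return Verdict.HIGH_VOLUME_NOT_JUNK


monitor = DispersionMonitor()
verdicts = [monitor.observe("8613988888888", "8613911111111", 5 * i) for i in range(10)]
```

A deque is used so that expired records can be removed from the front cheaply; in all three states the window is trimmed and the new record is kept, and only the returned verdict differs.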
The beneficial effects of the invention are as follows. The invention is a method for improving the accuracy of junk short message number identification when junk short messages are detected through traffic statistics. Unlike the traditional approach, which relies on traffic statistics alone, it performs a second calculation once the frequency limit has been exceeded. With this method, the called dispersion is computed when traffic exceeds the frequency limit, so normal point-to-point short messages can be filtered out and normal numbers are not mistakenly listed as junk short message numbers.
Description of the drawings
Fig. 1 is a schematic diagram of the prior-art statistics queue without called numbers;
Fig. 2 is a flow chart of one embodiment of the invention;
Fig. 3 is a schematic diagram of the statistics queue with called numbers in an embodiment of the invention.
Detailed description of the embodiments
A method for improving the accuracy of junk short message number identification based on called dispersion is implemented as follows: a called dispersion threshold is set; within a set time range, every short message sent by a calling number is recorded, and the called number is added to the attributes of each short message; when the number of short messages sent by the calling number within the set time range exceeds the flow threshold, the called dispersion of the short messages sent by that calling number is calculated, namely the number of distinct (non-repeated) called numbers as a percentage of the total number of short messages sent; when the called dispersion exceeds the set called dispersion threshold, the calling number is considered a junk short message number; when the called dispersion does not reach the set called dispersion threshold, the calling number is not considered a junk short message number.
The method comprises the following steps:
First, the sending time and called number of every short message sent by each calling number are recorded, and each short message sent by the calling number is checked to determine whether the flow threshold within the specified time period has been reached; processing then depends on the resulting state:
If the flow threshold is not exceeded, the expired short messages of this calling number are removed, the short messages within the specified time period are kept, the time and called number of every short message are recorded, and monitoring continues;
If the flow threshold is exceeded but the set called dispersion is not reached, the expired short messages of this calling number are removed, the short messages within the specified time period are kept, the time and called number of every short message are recorded, the number is not treated as a junk short message number, and monitoring continues;
If the flow threshold is exceeded and the set called dispersion is also exceeded, the expired short messages of this calling number are removed, the short messages within the specified time period are kept, the time and called number of every short message are recorded, and the number is added to the list of junk short message numbers.
The invention is further described below with reference to an embodiment.
Referring to Fig. 2, the flow chart of one embodiment of the invention comprises the following steps:
Step S1: receive a short message;
Step S2: store it in the short message queue by calling number;
Step S3: determine whether this calling number exceeds the flow threshold;
If so, go to step S4;
If not, return to step S1;
Step S4: determine whether the called dispersion of this calling number exceeds the threshold;
If so, go to step S5;
If not, return to step S1;
Step S5: confirm the number as a junk short message number.
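Steps S1 to S5 map naturally onto a single loop over incoming messages. The sketch below is an assumption-laden illustration, not the patented implementation: `run_flow`, its parameters, the second calling number, and its recipients are invented for the example, and the window trimming is borrowed from the expiry rule described earlier because the flow chart itself does not show it.

```python
from collections import defaultdict


def run_flow(messages, window=60, flow_threshold=10, dispersion_threshold=0.8):
    """Return the set of calling numbers confirmed as junk senders.

    `messages` is an iterable of (timestamp_seconds, calling, called) in time order.
    """
    queues = defaultdict(list)
    confirmed = set()
    for timestamp, calling, called in messages:           # S1: receive a message
        queue = queues[calling]
        queue.append((timestamp, called))                 # S2: store by calling number
        # Keep only records inside the time window (expiry rule, assumed here).
        queue[:] = [(t, c) for t, c in queue if timestamp - t <= window]
        if len(queue) < flow_threshold:                   # S3: flow threshold reached?
            continue                                      #     no: back to S1
        dispersion = len({c for _, c in queue}) / len(queue)
        if dispersion <= dispersion_threshold:            # S4: dispersion exceeded?
            continue                                      #     no: back to S1
        confirmed.add(calling)                            # S5: confirm junk number
    return confirmed


# Hypothetical traffic: one caller texting a single friend, one caller texting ten numbers.
chatty = [(5 * i, "8613988888888", "8613911111111") for i in range(10)]
bulk = [(5 * i, "8613900000000", f"86139000001{i:02d}") for i in range(10)]
result = run_flow(sorted(chatty + bulk))
```

Both callers reach the flow threshold, but only the second one, whose recipients are all different, gets past step S4.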
Referring to Fig. 3, a schematic diagram of the statistics queue with called numbers according to the invention: the invention first adds a called-number field to the original statistics and then calculates the dispersion of the called numbers. A suitable dispersion value (for example, 80%) is set, and a number whose dispersion exceeds this value is judged to be a junk short message number.
In Fig. 3, the short messages sent by the calling number contain only one distinct called number, 8613911111111, and the number of short messages is 10, so the number dispersion is: distinct called numbers / number of short messages = 1/10 = 10%.
Therefore, although this calling number exceeds the flow threshold, it is not treated as a junk short message number. Filtering by called dispersion thus increases the accuracy of junk short message number identification.
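The Fig. 3 arithmetic can be written out compactly. The symbols are labels introduced here for illustration and do not appear in the source: $U$ is the number of distinct called numbers, $N$ the number of short messages in the time range, $D$ the called dispersion, and $T$ the dispersion threshold (the 80% example value mentioned above).

```latex
D = \frac{U}{N} \times 100\% = \frac{1}{10} \times 100\% = 10\% < T = 80\%
```

For contrast, a hypothetical sender (not taken from the figures) whose 10 messages went to 10 different called numbers would have $D = 10/10 = 100\%$, which exceeds $T$, so that sender would be confirmed as a junk short message number.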
In summary, once the short messages sent within a time period exceed the flow threshold, the invention goes on to analyze the distribution of the called subscribers. It calculates the ratio of the number of called numbers to the total number of short messages sent by the user within the monitoring period, and lists users who exceed the set percentage as junk short message users. In other words, when a number exceeds the frequency limit during the monitoring period, the called dispersion is then calculated to improve accuracy.
Specifically, when the number of short messages a number sends within the specified time range exceeds the preset threshold, the dispersion of the called numbers in the sent short messages (distinct called numbers / number of short messages) is calculated to filter out normal short message numbers and identify junk short message numbers, improving the accuracy of junk short message number identification.
The invention can be applied to short message optimization systems; it can significantly reduce the false-interception rate for junk short messages and improve the hit rate of junk short message identification.
The invention has been described in detail above with reference to the accompanying drawings and an embodiment, and those skilled in the art can devise many variations of the invention on the basis of this description. Accordingly, certain details of the embodiment should not be construed as limiting the invention, whose scope of protection is defined by the appended claims.

Claims (2)

1. A method for improving the accuracy of junk short message number identification based on called dispersion, characterized in that:
A called dispersion threshold is set;
Within a set time range, every short message sent by a calling number is recorded, and the called number is added to the attributes of each short message;
When the number of short messages sent by the calling number within the set time range exceeds the flow threshold, the called dispersion of the short messages sent by that calling number is calculated, namely the number of distinct (non-repeated) called numbers as a percentage of the total number of short messages sent;
When the called dispersion exceeds the set called dispersion threshold, the calling number is considered a junk short message number;
When the called dispersion does not reach the set called dispersion threshold, the calling number is not considered a junk short message number.
2. The method for improving the accuracy of junk short message number identification based on called dispersion according to claim 1, characterized in that it comprises the following steps:
First, the sending time and called number of every short message sent by each calling number are recorded, and each short message sent by the calling number is checked to determine whether the flow threshold within the specified time period has been reached; processing then depends on the resulting state:
If the flow threshold is not exceeded, the expired short messages of this calling number are removed, the short messages within the specified time period are kept, the time and called number of every short message are recorded, and monitoring continues;
If the flow threshold is exceeded but the set called dispersion is not reached, the expired short messages of this calling number are removed, the short messages within the specified time period are kept, the time and called number of every short message are recorded, the number is not treated as a junk short message number, and monitoring continues;
If the flow threshold is exceeded and the set called dispersion is also exceeded, the expired short messages of this calling number are removed, the short messages within the specified time period are kept, the time and called number of every short message are recorded, and the number is added to the list of junk short message numbers.
CN2009102006514A 2009-12-24 2009-12-24 Method for improving correct rate of identifying junk short message number based on called dispersed degree Pending CN102111767A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2009102006514A CN102111767A (en) 2009-12-24 2009-12-24 Method for improving correct rate of identifying junk short message number based on called dispersed degree

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2009102006514A CN102111767A (en) 2009-12-24 2009-12-24 Method for improving correct rate of identifying junk short message number based on called dispersed degree

Publications (1)

Publication Number Publication Date
CN102111767A true CN102111767A (en) 2011-06-29

Family

ID=44175763

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2009102006514A Pending CN102111767A (en) 2009-12-24 2009-12-24 Method for improving correct rate of identifying junk short message number based on called dispersed degree

Country Status (1)

Country Link
CN (1) CN102111767A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103139730A (en) * 2011-11-23 2013-06-05 上海粱江通信系统股份有限公司 Method used for identifying situation of mass numbers sending junk short messages at low frequency
CN103139730B (en) * 2011-11-23 2016-03-30 上海粱江通信系统股份有限公司 For identifying that a large amount of number low frequency sends the method for refuse messages situation
CN103167501A (en) * 2011-12-15 2013-06-19 上海粱江通信系统股份有限公司 Method for improving identification accuracy rates of crank call number based on called dispersion degree
CN109104702A (en) * 2017-06-20 2018-12-28 中兴通讯股份有限公司 Information intercepting method, device and storage medium

Similar Documents

Publication Publication Date Title
CN101790142B (en) Method and system for identifying spam message sources by combining message contents and transmission frequency
CN101771966B (en) Keywords and frequency based method for identifying spam message sources
CN102111731A (en) Method based on content similarity for improving recognition accuracy of spam message numbers
WO2016065908A1 (en) Method, device and system for detecting fraudulent user
WO2010031294A1 (en) De-massing method of position advertising service based on regional strategy and system thereof
CN101217820A (en) An identification system and identification method on disturbance telephone numbers
CN101150762A (en) A spam real time interception method and system
CN102761872A (en) Spam message intercepting method
CN101321070B (en) Monitoring system and method for suspicious user
WO2010054564A1 (en) Flow control method and corresponding system for cell short message
CN101909261A (en) Method and system for monitoring spam
CN101472247A (en) Method and system for controlling rubbish short message
CN102231888A (en) Monitoring method and device
CN102111767A (en) Method for improving correct rate of identifying junk short message number based on called dispersed degree
CN102111723B (en) Method for identifying spam short message user by analyzing short message frequency and content
CN110072251B (en) Method and device for analyzing user communication behavior and managing user
CN103733581B (en) Message processing method and base station
WO2011140874A1 (en) Method and apparatus for evaluating behavior of user equipment in standby state
CN103139730B (en) For identifying that a large amount of number low frequency sends the method for refuse messages situation
CN101827328A (en) Device and method for monitoring short-message
CN108259363B (en) Method and device for controlling stepped service flow
CN102572746B (en) A kind of method sending behavioural characteristic identification junk short message source based on the frequency and user
CN103167501A (en) Method for improving identification accuracy rates of crank call number based on called dispersion degree
CN104168547A (en) A system and method for processing fraud short messages based on signaling technology
CN101340693B (en) System and implementing method for monitoring rubbish short message based on content length

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20110629