CN102801859A - Method and device for identifying junk short message, and mobile communication terminal with device - Google Patents

Method and device for identifying junk short message, and mobile communication terminal with device Download PDF

Info

Publication number
CN102801859A
CN102801859A CN2012102751576A CN201210275157A CN102801859A CN 102801859 A CN102801859 A CN 102801859A CN 2012102751576 A CN2012102751576 A CN 2012102751576A CN 201210275157 A CN201210275157 A CN 201210275157A CN 102801859 A CN102801859 A CN 102801859A
Authority
CN
China
Prior art keywords
regular expression
note
short message
message content
refuse messages
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN2012102751576A
Other languages
Chinese (zh)
Other versions
CN102801859B (en
Inventor
陈伟
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CN201210275157.6A priority Critical patent/CN102801859B/en
Publication of CN102801859A publication Critical patent/CN102801859A/en
Application granted granted Critical
Publication of CN102801859B publication Critical patent/CN102801859B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Telephonic Communication Services (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

The invention discloses a method and a device for identifying junk short messages, and a mobile communication terminal with the device. The identifying method comprises: extracting content of a short message; matching the short message content with a preset regular expression; and determining the short message to be a junk short message when the short message content is successfully matched with the preset regular expression. The method can improve identification accuracy of the junk short messages, thereby effectively shielding the junk short messages.

Description

The recognition methods of refuse messages, install and have the mobile communication terminal of this device
Technical field
The present invention relates to the communications field, in particular to a kind of recognition methods, recognition device of refuse messages with have the mobile communication terminal of this device.
Background technology
According to statistics, Chinese cellphone subscriber's quantity reaches several hundred million, and network surveying has 98.1% cellphone subscriber to be harassed by refuse messages, and 58.2% user receives 1 to 3 refuse messages to I haven't seen you for ages every day, and 19.6% user can receive 3 to 5 refuse messages every day.Though the method for regulation refuse messages emerges in an endless stream, and can't stop the propagation of refuse messages all the time.
At present, filtering junk short messages generally all is to adopt preset keyword match filtering, and this method need be gathered a large amount of refuse messages samples, therefrom extracts responsive keyword and sets up the keyword dictionary and filter.
In said method, there are a lot of problems, the first, need to safeguard a large amount of keyword dictionaries, need the refuse messages sample and gather keyword.The second, dictionary also need be brought in constant renewal in additional, when producing new refuse messages, gather new refuse messages keyword.The 3rd; Transmit leg can be avoided various responsive keywords fully; Adopt various means to evade falling keyword, as: interspersed spcial character in the middle of keyword, adopt the Chinese character replacement with the keyword unisonance; Do not influence the readability of refuse messages like this, the addressee can guess the true content that note through homonym fully.Generally speaking, refuse messages recognition methods of the prior art can not effectively identify refuse messages.
Problem to refuse messages processing method shielding rubbish short message weak effect in the correlation technique does not propose effective solution at present as yet.
Summary of the invention
Main purpose of the present invention is to provide a kind of recognition methods, recognition device of refuse messages and has the mobile communication terminal of this device, to solve the problem of refuse messages processing method shielding rubbish short message weak effect.
To achieve these goals, according to an aspect of the present invention, a kind of recognition methods of refuse messages is provided.
Recognition methods according to refuse messages of the present invention comprises: the short message content that extracts note; Matching sms content and preset regular expression; And when short message content matees successfully with preset regular expression, confirm that note is a refuse messages.
Further; Preset regular expression comprises first regular expression and second regular expression; Wherein, Matching sms content and preset regular expression when short message content matees successfully with preset regular expression, confirm that note is that refuse messages comprises: the matching sms content and first regular expression; When short message content and first regular expression mate successfully, confirm that note is a refuse messages; When short message content and the failure of first regular expression coupling, the matching sms content and second regular expression; And mate successfully when short message content and second regular expression, confirm that note is a refuse messages.
Further, preset regular expression comprises following any one or more regular expressions: the regular expression that is used to mate phone number; Be used to mate the regular expression of the telephone number of landline telephone; Be used to mate the regular expression of Bank Account Number; The regular expression that is used for matching web site URL; The regular expression that is used for Match IP Address; And the regular expression that is used for matching network ID number.
Further, the matching sms content comprises with preset regular expression: text conversion identical with the Arabic numerals pronunciation in the short message content for corresponding Arabic numerals, is obtained converted contents; Coupling converted contents and preset regular expression.
Further, before the short message content that extracts note, this method also comprises: the signal code of coming of extracting note; And according to coming signal code to judge whether note is strange note, wherein, the short message content that extracts note comprises: when note is strange note, extract the short message content of note.
Further; Judge according to coming signal code whether note is that strange note comprises: judge signal code whether in contact number tabulation and call history record; Wherein, when coming signal code not in contact number tabulation and call history record, this note is strange note.
Further, before the short message content that extracts note, this method also comprises: the signal code of coming of extracting note; Judge signal code whether to satisfy preset number filtering condition, wherein, the short message content that extracts note comprises: when coming the preset number filtering condition of the discontented foot of signal code, extract the short message content of note.
To achieve these goals, according to a further aspect in the invention, a kind of recognition device of refuse messages is provided, this recognition device is used to carry out the recognition methods of any refuse messages that the invention described above provides.
To achieve these goals, according to a further aspect in the invention, a kind of recognition device of refuse messages is provided, has comprised: extraction module is used to extract the short message content of note; Matching module is used for matching sms content and preset regular expression; And determination module, be used for when short message content matees successfully with preset regular expression, confirming that note is a refuse messages.
Further, preset regular expression comprises first regular expression and second regular expression, and wherein, matching module comprises: first matched sub-block is used for the matching sms content and first regular expression; And second matched sub-block; Be used for when short message content and the failure of first regular expression coupling, the matching sms content and second regular expression, determination module comprises: first confirms submodule; Be used for when short message content and first regular expression mate successfully, confirming that note is a refuse messages; And second confirm submodule, is used for when short message content and second regular expression mate successfully, confirming that note is a refuse messages.
Further, preset regular expression comprises following any one or more regular expressions: the regular expression that is used to mate phone number; Be used to mate the regular expression of the telephone number of landline telephone; Be used to mate the regular expression of Bank Account Number; The regular expression that is used for matching web site URL; The regular expression that is used for Match IP Address; And the regular expression that is used for matching network ID number.
Further, matching module comprises: the conversion submodule, be used for the text conversion that short message content is identical with the Arabic numerals pronunciation for corresponding Arabic numerals, and obtain converted contents; And the 3rd matched sub-block, be used to mate converted contents and preset regular expression.
To achieve these goals, in accordance with a further aspect of the present invention, a kind of mobile communication terminal is provided, this mobile communication terminal comprises the recognition device of any one refuse messages provided by the invention.
Through the present invention, adopt the recognition methods of the refuse messages that may further comprise the steps: the short message content that extracts note; Matching sms content and preset regular expression; And when short message content matees successfully with preset regular expression; Confirm that note is a refuse messages; Can improve the recognition accuracy of refuse messages, solve the problem of refuse messages processing method shielding rubbish short message weak effect, and then reach the effect of effective shielding rubbish short message.
Description of drawings
The accompanying drawing that constitutes the application's a part is used to provide further understanding of the present invention, and illustrative examples of the present invention and explanation thereof are used to explain the present invention, do not constitute improper qualification of the present invention.In the accompanying drawings:
Fig. 1 is the flow chart according to the recognition methods of the refuse messages of first embodiment of the invention;
Fig. 2 is the flow chart according to the recognition methods of the refuse messages of second embodiment of the invention;
Fig. 3 is the flow chart according to the recognition methods of the refuse messages of third embodiment of the invention;
Fig. 4 is the block diagram according to the recognition device of the refuse messages of first embodiment of the invention; And
Fig. 5 is the block diagram according to the recognition device of the refuse messages of second embodiment of the invention.
Embodiment
Need to prove that under the situation of not conflicting, embodiment and the characteristic among the embodiment among the application can make up each other.Below with reference to accompanying drawing and combine embodiment to specify the present invention.
Fig. 1 is the flow chart according to the recognition methods of the refuse messages of first embodiment of the invention, and is as shown in Figure 1, and this method comprises that following step S102 is to step S106:
Step S102: extract the short message content of note, obtain the short message content text.
Step S104: matching sms content and preset regular expression; Wherein, Preset regular expression is provided with according to the short message content characteristics of refuse messages; For example generally can comprise contents such as contact details, account information in the refuse messages, represent this type content, then can realize the coupling of contents such as short message content and contact details, account information through this step through preset regular expression.
Step S106: when short message content matees successfully with preset regular expression; Confirm that note is a refuse messages, correspondingly, when short message content is failed with preset regular expression coupling; Confirm that note is normal note; Also promptly, when including information such as contact details, account information in the short message content, this note is confirmed as refuse messages.
Adopt the recognition methods of the refuse messages that this embodiment provides; Filter note through preset regular expression and short message content matching mode; Compare with the method that predetermined keyword in the prior art is filtered; Greatly increased filter densities, thus accurate recognition refuse messages more, thus shielding rubbish short message effectively.
Fig. 2 is the flow chart according to the recognition methods of the refuse messages of second embodiment of the invention, and is as shown in Figure 2, comprises following step:
Step (1): when new message, extract the detailed content of note, what comprise note comes signal code and short message content.
Step (2): the signal code of coming according to note judges whether note is strange note; Preferably, judge signal code whether in contact number tabulation and call history record, if contact number tabulate and call history record in all do not exist this to come signal code; Then this note is regarded as strange note; If in contact number tabulation or call history record, exist this to come signal code, then this note is regarded as normal note, do not do subsequent treatment.Through this step, before judging whether, at first the note source is judged that the note that can avoid the customer contact people is sent is as refuse messages to refuse messages.
Step (3): if when note is strange note, judge whether the signal code of coming of note satisfies preset number filtering condition, when coming signal code to satisfy preset number filtering condition, then directly this note is regarded as refuse messages, end note identifying.Thereby can all strange notes that satisfies the number filtering condition all be regarded as refuse messages.
Preferably, can adopt following any or adopt following dual mode performing step (3) simultaneously:
First: judge signal code whether in preset reject region; When coming signal code to belong to preset reject region, explain signal code to satisfy preset number filtering condition, then this note is regarded as refuse messages; Wherein, preset reject region can comprise one or more zones.Through this mode, can further increase the flexibility of refuse messages recognition methods, can will fixedly come the note of source region to be regarded as refuse messages automatically according to user's needs.
Second: judge signal code whether in preset region of acceptance; When coming signal code not belong to preset region of acceptance, explain signal code to satisfy preset number filtering condition, then this note is regarded as refuse messages; Wherein, preset region of acceptance can comprise one or more zones.Through this mode, can further increase the flexibility of refuse messages recognition methods, can automatically will be except that fixedly coming all the strange notes the source region all to be regarded as refuse messages according to user's needs.
The the 3rd: at first judge whether comprise country code in the signal code; If comprise country code; As :+86, then in the future the country code in the signal code removes, and judges further whether the number length that removes behind the country code satisfies preset refuse messages number length rule; If do not comprise country code; Judge directly then whether the letter number length satisfies preset refuse messages number length rule, as: preset refuse messages number length rule is set is regarded as refuse messages for number length surpasses the X position, it is regular that number length after removing country code or the letter number length that does not comprise country code satisfy preset refuse messages number length; Explain signal code to satisfy preset number filtering condition, then this note is regarded as refuse messages.Through this mode, can be further in the future the signal code note that do not satisfy proper communication number length rule be regarded as refuse messages, increased filtering junk short messages intensity, for example, signal code is that the note of non-moving telephone number is regarded as refuse messages in the future.
Step (4): when coming the preset number filtering condition of the discontented foot of signal code; The short message text content is mated with preset a plurality of regular expressions one by one; If wherein arbitrary expression formula is mated successfully, then be regarded as refuse messages, otherwise this note is regarded as normal note.
Adopt this embodiment that the recognition methods of refuse messages is provided; With short message content be used to represent that the regular expression of information such as Bank Account Number and contact method matees; As long as mate successfully, the content of depositing number of the account or contact method in the short message content in the bank can be described, thereby can stranger's note of carrying contents such as Bank Account Number, contact method be judged as refuse messages; Preset magnanimity crucial word problem, shielding rubbish short message have effectively been solved.In addition, before carrying out short message content identification, carry out the number filtering condition judgment, comprise number source place, number length rule etc., make that the refuse messages recognition methods is more flexible, satisfy user's personalized requirement.
Need to prove above-mentioned step (3) and the interchangeable execution sequence of step (2).
Preferably; In above-mentioned step (4), when mating one by one, as long as just stop coupling after mating successfully with a regular expression according to preset a plurality of regular expressions; This note is regarded as refuse messages; When with a regular expression coupling failure, carry out the coupling of next regular expression, finish until all regular expression couplings.
Preferably, preset a plurality of regular expressions comprise following any one or more regular expressions:
The regular expression of coupling Email address: ([+.]) * ([.]) * w+ ([.]) *;
The regular expression of matching web site URL: [a-zA-z]+: // [s]+;
The regular expression of coupling link: [] (. [])+;
The regular expression of matching strip area code fixed telephone number: (d{3,4})-d{7,8};
Coupling is not with the regular expression of area code fixed telephone number:, 8};
The regular expression one of 11 phone numbers of coupling: d{11};
Be used to mate the regular expression two of 11 phone numbers: [1-9] [0-9] { 10};
The regular expression of coupling Tencent QQ number: [1-9] [0-9] { 4, };
The regular expression of coupling Bank Account Number: d{16,19};
The regular expression of coupling ip address: d{1,3} (, 3}) { 3}.
Need to prove that the form of the above-mentioned regular expression of enumerating only illustrates, and the invention is not restricted to this, regular expression can have multiple literary style.
Further preferably; In above-mentioned steps (4); Cited regular expression is to the various numbers that utilize Arabic numerals to represent under the normal condition, in addition, also exists with various means and evades by the refuse messages of numeral expression formula identification; As: interspersed spcial character in the middle of the number that Arab representes; As: in telephone number, add space or other characters, middle with forms such as intervals, space, all can adopt more complicated regular expression to mate efficiently rapidly, with the refuse messages of identification distortion at Bank Account Number.
Preferably, a plurality of couplings of presetting are evaded the means regular expression and are comprised following any one or more regular expressions:
The regular expression one of the telephone number of the interspersed blank character of coupling: d (D*) 6,7};
Coupling is interted the regular expression two of telephone number of blank character: d (D d) 6,7};
Coupling is interted the regular expression three of telephone number of blank character band area code: () 9,11};
Coupling is interted the regular expression one of phone number of blank character: d (D d) { 10};
The regular expression two of the phone number of the interspersed blank character of coupling: d (D*) { 10};
Coupling is interted the regular expression of blank character Bank Account Number: d (D d) 15,18}.
Further preferably; In refuse messages, use and represent numeral with the Chinese character that Arabic numerals are sent out unisonance or similar pronunciation and when can not get effectively shielding, in step (4), adopt step as shown in Figure 3 to realize the coupling of short message content and preset regular expression; Particularly; At first convert the Chinese character of sending out unisonance with Arab in the short message text content to Arabic numerals, and then content after will changing and regular expression coupling, mate successfully and then this note is regarded as refuse messages; Otherwise note is normal note.
The embodiment of the invention also provides the recognition device of refuse messages, below the recognition device of the refuse messages that the embodiment of the invention provided is introduced.Need to prove; Recognition methods at the refuse messages of the embodiment of the invention can be carried out through the recognition device of the refuse messages that the embodiment of the invention provided, and the recognition device of the refuse messages of the embodiment of the invention also can be used to carry out the recognition methods of the refuse messages that the embodiment of the invention provides.
Fig. 4 is the block diagram according to the recognition device of the refuse messages of first embodiment of the invention, and is as shown in Figure 4, and the recognition device of this refuse messages comprises extraction module 20, matching module 40 and determination module 60.
Extraction module 20 is used to extract the short message content of note, obtains the short message content text.Matching module 40 is used for matching sms content and preset regular expression; Wherein, Preset regular expression is provided with according to the short message content characteristics of refuse messages; For example generally can comprise contents such as contact details, account information in the refuse messages, represent this type content, then can realize the coupling of contents such as short message content and contact details, account information through this step through preset regular expression.Determination module 60 is used for when short message content matees successfully with preset regular expression; Confirm that note is a refuse messages, correspondingly, when short message content is failed with preset regular expression coupling; Confirm that note is normal note; Also promptly, when including information such as contact details, account information in the short message content, this note is confirmed as refuse messages.
Adopt the recognition device of the refuse messages that this embodiment provides; Filter note through preset regular expression and short message content matching mode; Compare with the method that predetermined keyword in the prior art is filtered; Greatly increased filter densities, thus accurate recognition refuse messages more, thus shielding rubbish short message effectively.
Fig. 5 is the block diagram according to the recognition device of the refuse messages of second embodiment of the invention; As shown in Figure 5, the recognition device of this refuse messages comprises note extraction module, stranger's note determination module, letter number rule determination module, note ownership place determination module and regular expression matching module.
When new message, the note extraction module extracts the detailed content of note, and what comprise note comes signal code, short letter content.
Extract the detailed content of note at the note extraction module after; Stranger's note determination module judges according to the signal code of coming of note whether note is strange note; Preferably, whether stranger's note determination module judge and come signal code in contact number tabulation and call history record, if contact number tabulate and call history record in all do not exist this to come signal code; Then this note is regarded as strange note; If in contact number tabulation or call history record, exist this to come signal code, then this note is regarded as normal note, do not do subsequent treatment.Through stranger's note determination module, before judging whether, at first the note source is judged that the note that can avoid the customer contact people is sent is as refuse messages to refuse messages.
After stranger's note determination module confirms that this note is strange note; Letter number rule determination module at first judges to come whether to comprise in the signal code country code; If comprise country code, as :+86, then the country code in the signal code in future removes; Whether the number length that further judgement is removed behind the country code satisfies preset refuse messages number length rule; If do not comprise country code, judge directly then whether the letter number length satisfies preset refuse messages number length rule, as: preset refuse messages number length rule is set is regarded as refuse messages for number length surpasses the X position; Number length after removing country code or the letter number length that does not comprise country code satisfy preset refuse messages number length rule, then this note are regarded as refuse messages.Through this letter number rule determination module; Can be further in the future the signal code note that do not satisfy proper communication number length rule be regarded as refuse messages; Increased filtering junk short messages intensity, for example, in the future signal code is that the note of non-moving telephone number is regarded as refuse messages.
If letter number rule determination module confirms to come the preset refuse messages number length rule of the discontented foot of signal code; Then whether note ownership place determination module is judged and is come signal code in preset reject region; When coming signal code to belong to preset reject region, then directly this note is regarded as refuse messages, finish the note identifying; Wherein, preset reject region can comprise one or more zones; Perhaps, judge signal code whether in preset region of acceptance,, then directly this note is regarded as refuse messages, finish the note identifying, thereby can the strange note in all strange lands all be regarded as refuse messages when coming signal code not belong to preset region of acceptance.Through note ownership place determination module, can further increase the flexibility of refuse messages recognition methods, can be regarded as refuse messages according to the note that user's needs will be originated to the subregion automatically.
If note ownership place determination module degree confirms to come signal code not belong to preset reject region; When perhaps coming signal code to belong to preset region of acceptance; The short message content that the regular expression matching module extracts the note extraction module matees with preset regular expression, and the preset regular expression at this place can be provided with according to the characteristic of refuse messages, eight big types of for example common swindle refuse messages: provide the sim card do not have the card science, directly the remittance type, change number of the account remittance type, cry out father and mother's type, the stolen consumption-orientation of interchanger, high salary recruitment type, the type of providing low interest loans and draw the Grand Prix type everywhere; The particular content of note has following general character: have Bank Account Number or contact method; Wherein, contact method comprises information such as fixed telephone number, Mobile Directory Number, network address, email address, immediate communication tool number again, thereby; Preset regular expression is set to represent the rule of Bank Account Number or contact method; When mating successfully, explain in the short message content of this note and deposit number of the account or contact method in the bank, belong to refuse messages; When the coupling failure, explain that this note is normal note.
Adopt this embodiment that the recognition device of refuse messages is provided, at first, through various determination modules the information of extracting is judged then, to satisfy user's personalized requirement neatly through the details of note extraction module extraction note.When various determination modules all are not judged to be refuse messages with note; And when this note is stranger's note; The regular expression matching module matees short message content and preset regular expression; Thereby can stranger's note of carrying contents such as Bank Account Number, contact method be judged as refuse messages, solve preset magnanimity crucial word problem, shielding rubbish short message effectively.
Preferably; Preset regular expression comprises a plurality of regular expressions, and the regular expression matching module matees the short message text content one by one with a plurality of regular expressions, as long as just stop coupling after mating successfully with a regular expression; This note is regarded as refuse messages; When with a regular expression coupling failure, carry out the coupling of next regular expression, finish until all regular expression couplings.
Wherein, preset a plurality of regular expressions comprise in the preceding text, any several regular expressions in recognition methods embodiment describes, and this place repeats no more.
Further preferably; In refuse messages, use and represent numeral with the Chinese character that Arabic numerals are sent out unisonance or similar pronunciation and when can not get effectively shielding; The regular expression matching module comprises conversion submodule and matched sub-block, and wherein, the text conversion that the conversion submodule is used for short message content and Arabic numerals are sent out unisonance is corresponding Arabic numerals; Obtain converted contents; Matched sub-block is converted contents and regular expression coupling, and mating successfully when converted contents and regular expression is to explain that the regular expression matching module matees successfully.
The recognition device of any one refuse messages that the embodiment of the invention provided can be arranged at mobile communication terminal; The sms center that also can be arranged at communication common carrier is disposed; When recognition device is arranged at the sms center deployment; For the ease of the judgement of stranger's note, can set up the tabulation of number call history at sms center, whether be stranger's note through call history tabulation identification.
From above description, can find out that the present invention has realized following technique effect: improved the recognition accuracy of refuse messages, thereby shielded refuse messages effectively.
Need to prove; Can in computer system, carry out in the step shown in the flow chart of accompanying drawing such as a set of computer-executable instructions; And; Though logical order has been shown in flow chart, in some cases, can have carried out step shown or that describe with the order that is different from here.
Obviously, it is apparent to those skilled in the art that above-mentioned each module of the present invention or each step can realize with the general calculation device; They can concentrate on the single calculation element; Perhaps be distributed on the network that a plurality of calculation element forms, alternatively, they can be realized with the executable program code of calculation element; Thereby; Can they be stored in the storage device and carry out, perhaps they are made into each integrated circuit modules respectively, perhaps a plurality of modules in them or step are made into the single integrated circuit module and realize by calculation element.Like this, the present invention is not restricted to any specific hardware and software combination.
More than be merely the preferred embodiments of the present invention, be not limited to the present invention, for a person skilled in the art, the present invention can have various changes and variation.All within spirit of the present invention and principle, any modification of being done, be equal to replacement, improvement etc., all should be included within protection scope of the present invention.

Claims (12)

1. the recognition methods of a refuse messages is characterized in that, comprising:
Extract the short message content of note;
Mate said short message content and preset regular expression; And
When said short message content and said preset regular expression mate successfully, confirm that said note is a refuse messages.
2. the recognition methods of refuse messages according to claim 1; It is characterized in that; Said preset regular expression comprises first regular expression and second regular expression, wherein, matees said short message content and preset regular expression; When said short message content and said preset regular expression mate successfully, confirm that said note is that refuse messages comprises:
Mate said short message content and said first regular expression;
When said short message content and said first regular expression mate successfully, confirm that said note is a refuse messages;
When said short message content and the failure of said first regular expression coupling, mate said short message content and said second regular expression; And
When said short message content and said second regular expression mate successfully, confirm that said note is a refuse messages.
3. the recognition methods of refuse messages according to claim 1 is characterized in that, said preset regular expression comprises following any one or more regular expressions:
Be used to mate the regular expression of phone number;
Be used to mate the regular expression of the telephone number of landline telephone;
Be used to mate the regular expression of Bank Account Number;
The regular expression that is used for matching web site URL;
The regular expression that is used for Match IP Address; And
Be used for matching network ID number regular expression.
4. the recognition methods of refuse messages according to claim 1 is characterized in that, matees said short message content and comprises with preset regular expression:
With being corresponding Arabic numerals with the identical text conversion of Arabic numerals pronunciations in the said short message content, obtain converted contents;
Mate said converted contents and said preset regular expression.
5. according to the recognition methods of each described refuse messages in the claim 1 to 4, it is characterized in that before the short message content that extracts note, said method also comprises:
Extract the signal code of coming of said note; And
Come signal code to judge whether said note is strange note according to said,
Wherein, the short message content of extraction note comprises: when said note is strange note, extract the short message content of said note.
6. the recognition methods of refuse messages according to claim 5 is characterized in that, comes signal code to judge whether said note is that strange note comprises according to said:
Judge and saidly come signal code whether in contact number tabulation and call history record,
Wherein, when coming signal code not in said contact number tabulation and said call history record, said note is strange note when said.
7. according to the recognition methods of each described refuse messages in the claim 1 to 4, it is characterized in that before the short message content that extracts note, said method also comprises:
Extract the signal code of coming of said note;
Judge saidly come signal code whether to satisfy preset number filtering condition,
Wherein, the short message content that extracts note comprises: when coming the said preset number filtering condition of the discontented foot of signal code, extract the short message content of said note when said.
8. the recognition device of a refuse messages is characterized in that, comprising:
Extraction module is used to extract the short message content of note;
Matching module is used to mate said short message content and preset regular expression; And
Determination module is used for when said short message content and said preset regular expression mate successfully, confirming that said note is a refuse messages.
9. the recognition device of refuse messages according to claim 8 is characterized in that, said preset regular expression comprises first regular expression and second regular expression, wherein,
Said matching module comprises: first matched sub-block is used to mate said short message content and said first regular expression; And second matched sub-block, be used for when said short message content and the failure of said first regular expression coupling, mating said short message content and said second regular expression,
Said determination module comprises: first confirms submodule, is used for when said short message content and said first regular expression mate successfully, confirming that said note is a refuse messages; And second confirm submodule, is used for when said short message content and said second regular expression mate successfully, confirming that said note is a refuse messages.
10. the recognition device of refuse messages according to claim 8 is characterized in that, said preset regular expression comprises following any one or more regular expressions:
Be used to mate the regular expression of phone number;
Be used to mate the regular expression of the telephone number of landline telephone;
Be used to mate the regular expression of Bank Account Number;
The regular expression that is used for matching web site URL;
The regular expression that is used for Match IP Address; And
Be used for matching network ID number regular expression.
11. the recognition device of refuse messages according to claim 8 is characterized in that, said matching module comprises:
The conversion submodule is used for the text conversion that said short message content is identical with the Arabic numerals pronunciation for corresponding Arabic numerals, obtains converted contents; And
The 3rd matched sub-block is used to mate said converted contents and said preset regular expression.
12. a mobile communication terminal is characterized in that, comprises the recognition device of each described refuse messages in the claim 8 to 11.
CN201210275157.6A 2012-08-03 2012-08-03 Method and device for identifying junk short message, and mobile communication terminal with device Expired - Fee Related CN102801859B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210275157.6A CN102801859B (en) 2012-08-03 2012-08-03 Method and device for identifying junk short message, and mobile communication terminal with device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201210275157.6A CN102801859B (en) 2012-08-03 2012-08-03 Method and device for identifying junk short message, and mobile communication terminal with device

Publications (2)

Publication Number Publication Date
CN102801859A true CN102801859A (en) 2012-11-28
CN102801859B CN102801859B (en) 2014-05-07

Family

ID=47200816

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210275157.6A Expired - Fee Related CN102801859B (en) 2012-08-03 2012-08-03 Method and device for identifying junk short message, and mobile communication terminal with device

Country Status (1)

Country Link
CN (1) CN102801859B (en)

Cited By (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103067610A (en) * 2013-01-23 2013-04-24 广东欧珀移动通信有限公司 Method and system and mobile terminal of interception of junk short message
CN103313248A (en) * 2013-04-28 2013-09-18 北京小米科技有限责任公司 Method and device for identifying junk information
CN103415004A (en) * 2013-07-26 2013-11-27 中国联合网络通信集团有限公司 Method and device for detecting junk short message
CN103856944A (en) * 2012-12-03 2014-06-11 上海粱江通信系统股份有限公司 Fraud short message recognizing method with numerical characteristics and sending frequency combined
CN104539624A (en) * 2015-01-08 2015-04-22 北京奇虎科技有限公司 Safety monitoring method and device for number information in text
CN104580725A (en) * 2014-12-31 2015-04-29 广东欧珀移动通信有限公司 Method for hinting fraud calls and communication terminal
CN105187646A (en) * 2015-09-08 2015-12-23 小米科技有限责任公司 Short message intercepting method and device
CN105187632A (en) * 2015-08-06 2015-12-23 北京金山安全软件有限公司 Method and device for determining mobile phone number
CN105718477A (en) * 2014-12-03 2016-06-29 中国移动通信集团重庆有限公司 Method and device for obtaining target files
CN105721697A (en) * 2016-02-18 2016-06-29 吴伟东 Mobile phone short message shielding method and system
CN105893615A (en) * 2016-04-27 2016-08-24 厦门市美亚柏科信息股份有限公司 Owner feature attribute excavation method based on mobile phone forensics data and system thereof
CN106332027A (en) * 2016-09-26 2017-01-11 惠州Tcl移动通信有限公司 Message analysis method and intelligent terminal capable of performing message analysis
CN106452859A (en) * 2016-09-29 2017-02-22 南京邮电大学 Automatic cell phone number characteristic keyword extraction method under fixed network WiFi environment
CN106961513A (en) * 2016-01-12 2017-07-18 深圳中兴力维技术有限公司 The auxiliary answering method and device of short message
WO2017139955A1 (en) * 2016-02-18 2017-08-24 吴伟东 Method and system for blocking text messages
CN109446527A (en) * 2018-10-26 2019-03-08 广东小天才科技有限公司 A kind of analysis method and system of meaningless corpus

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR950011064B1 (en) * 1992-12-29 1995-09-27 재단법인한국전자통신연구소 Output inspecting method for checking the unregistered message output
CN101534261A (en) * 2009-04-10 2009-09-16 阿里巴巴集团控股有限公司 A method, device and system of recognizing spam information
CN101888445A (en) * 2010-04-30 2010-11-17 南京邮电大学 Integrated method for filtering short message by introducing query software
CN101902523A (en) * 2010-07-09 2010-12-01 中兴通讯股份有限公司 Mobile terminal and filtering method of short messages thereof
CN102231873A (en) * 2011-06-22 2011-11-02 中兴通讯股份有限公司 Method and system for monitoring garbage message and monitor processing apparatus

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR950011064B1 (en) * 1992-12-29 1995-09-27 재단법인한국전자통신연구소 Output inspecting method for checking the unregistered message output
CN101534261A (en) * 2009-04-10 2009-09-16 阿里巴巴集团控股有限公司 A method, device and system of recognizing spam information
CN101888445A (en) * 2010-04-30 2010-11-17 南京邮电大学 Integrated method for filtering short message by introducing query software
CN101902523A (en) * 2010-07-09 2010-12-01 中兴通讯股份有限公司 Mobile terminal and filtering method of short messages thereof
CN102231873A (en) * 2011-06-22 2011-11-02 中兴通讯股份有限公司 Method and system for monitoring garbage message and monitor processing apparatus

Cited By (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103856944A (en) * 2012-12-03 2014-06-11 上海粱江通信系统股份有限公司 Fraud short message recognizing method with numerical characteristics and sending frequency combined
CN103067610A (en) * 2013-01-23 2013-04-24 广东欧珀移动通信有限公司 Method and system and mobile terminal of interception of junk short message
CN103313248A (en) * 2013-04-28 2013-09-18 北京小米科技有限责任公司 Method and device for identifying junk information
CN103313248B (en) * 2013-04-28 2017-04-12 小米科技有限责任公司 Method and device for identifying junk information
CN103415004A (en) * 2013-07-26 2013-11-27 中国联合网络通信集团有限公司 Method and device for detecting junk short message
CN105718477A (en) * 2014-12-03 2016-06-29 中国移动通信集团重庆有限公司 Method and device for obtaining target files
CN105718477B (en) * 2014-12-03 2019-05-24 中国移动通信集团重庆有限公司 A kind of method and device obtaining file destination
CN104580725A (en) * 2014-12-31 2015-04-29 广东欧珀移动通信有限公司 Method for hinting fraud calls and communication terminal
CN104539624B (en) * 2015-01-08 2019-06-04 北京奇虎科技有限公司 The safety monitoring method and device of number information in text
CN104539624A (en) * 2015-01-08 2015-04-22 北京奇虎科技有限公司 Safety monitoring method and device for number information in text
CN105187632A (en) * 2015-08-06 2015-12-23 北京金山安全软件有限公司 Method and device for determining mobile phone number
CN105187646B (en) * 2015-09-08 2019-02-12 小米科技有限责任公司 SMS interception method and device
CN105187646A (en) * 2015-09-08 2015-12-23 小米科技有限责任公司 Short message intercepting method and device
CN106961513A (en) * 2016-01-12 2017-07-18 深圳中兴力维技术有限公司 The auxiliary answering method and device of short message
WO2017139955A1 (en) * 2016-02-18 2017-08-24 吴伟东 Method and system for blocking text messages
CN105721697A (en) * 2016-02-18 2016-06-29 吴伟东 Mobile phone short message shielding method and system
CN105893615A (en) * 2016-04-27 2016-08-24 厦门市美亚柏科信息股份有限公司 Owner feature attribute excavation method based on mobile phone forensics data and system thereof
CN105893615B (en) * 2016-04-27 2019-06-14 厦门市美亚柏科信息股份有限公司 Owner's characteristic attribute method for digging and its system based on Mobile Phone Forensics data
CN106332027A (en) * 2016-09-26 2017-01-11 惠州Tcl移动通信有限公司 Message analysis method and intelligent terminal capable of performing message analysis
CN106452859A (en) * 2016-09-29 2017-02-22 南京邮电大学 Automatic cell phone number characteristic keyword extraction method under fixed network WiFi environment
CN109446527A (en) * 2018-10-26 2019-03-08 广东小天才科技有限公司 A kind of analysis method and system of meaningless corpus
CN109446527B (en) * 2018-10-26 2023-10-20 广东小天才科技有限公司 Nonsensical corpus analysis method and system

Also Published As

Publication number Publication date
CN102801859B (en) 2014-05-07

Similar Documents

Publication Publication Date Title
CN102801859B (en) Method and device for identifying junk short message, and mobile communication terminal with device
CN102968439B (en) A kind of method and device pushing microblogging
CN101534261B (en) A method, device and system of recognizing spam information
CN101938565A (en) Short message processing method and mobile terminal
CN103488796B (en) Based on context the method and mobile terminal inputted
CN104462509A (en) Review spam detection method and device
CN102609460A (en) Method and system for microblog data acquisition
CN107633081A (en) A kind of querying method and system of user profile of breaking one's promise
CN102769691A (en) Prompt method of new message and communication terminal
CN105335354A (en) Cheat information recognition method and device
CN104615585A (en) Text information processing method and device
CN101692682A (en) Method and mobile terminal for processing numbers in content of short message
CN104883671A (en) Junk message determining method and system
CN103389976A (en) Searching method and searching system for terminal
CN101651938A (en) Telephone number recognition system for mobile terminal and application method thereof
CN103002103A (en) Short message group sending method and device
CN106685799A (en) Multi-platform WeChat service notification sending method based on CoreSeek
CN102004788A (en) Method and system for intelligently positioning linkman of social networking services
CN104580725A (en) Method for hinting fraud calls and communication terminal
CN103365934A (en) Extracting method and device of complex named entity
CN105574112A (en) Comment information processing method and system of communication process
CN103327178B (en) Automatically method and the device of extension telephone in the page is dialed
CN104346151B (en) A kind of information processing method and electronic equipment
CN105681523A (en) Method and apparatus for sending birthday blessing short message automatically
CN102256255A (en) Detection method for parallel-used-card proof based on time and geographic location collisions

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CP02 Change in the address of a patent holder

Address after: 116021 No. 4, unit 3, unit 33, Ting Cui yuan, Ganjingzi District, Dalian, Liaoning, China.

Patentee after: Chen Wei

Address before: 161314 2 group of Fuhe village, Longhe Town, Nehe City, Qigihar, Heilongjiang

Patentee before: Chen Wei

CP02 Change in the address of a patent holder
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20140507

Termination date: 20180803

CF01 Termination of patent right due to non-payment of annual fee