CN104348712B - Spam filtering method and device - Google Patents

Spam filtering method and device

Info

Publication number
CN104348712B
Authority
CN
China
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201410545491.8A
Other languages
Chinese (zh)
Other versions
CN104348712A (en)
Inventor
宋健
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sina Technology China Co Ltd
Original Assignee
Sina Technology China Co Ltd

Abstract

The invention discloses a spam filtering method and device for improving the effect of spam filtering. The method takes the sender identifiers corresponding to the received pending mails as current identifiers and the sender identifiers corresponding to previously recorded sent mails as historical identifiers. When the number of current identifiers not contained in the identifier set formed by the historical identifiers is greater than a first set threshold, the method refuses to send some of the pending mails. By comparing the sender identifiers of the pending mails with those of the sent mails, the method determines whether a user is sending spam through multiple accounts. If many current identifiers are absent from the identifier set formed by the historical identifiers, the method concludes that such a user is present and filters the pending mails, so spam can be filtered effectively.

Description

Spam filtering method and device
Technical field
The present invention relates to anti-spam technology, and in particular to a spam filtering method and device.
Background
With the development of e-commerce and network technology, e-mail has become one of the communication tools that users commonly use, and more and more spam appears in users' mailboxes. Spam refers to any e-mail that is forcibly sent to a user's mailbox without the user's permission.
While it is being sent, spam occupies a large amount of network resources for transmission, storage, and computation, which wastes those resources, and it may also inconvenience the users who receive it. In addition, if a receiving server receives a large amount of spam from a sending server, it is likely to add that sending server to a blacklist and reject all mail from it, which affects the delivery of normal mail.
To avoid these problems, a spam filtering policy needs to be configured on the sending server to prevent spam from being sent.
In the prior art, the spam filtering policy is generally as follows: when the sending server receives a user's request to send mail, it determines the number of mails the user has sent within a unit of time (e.g., one minute); if that number is greater than a preset threshold, it refuses to send mail for the user.
However, if a user who sends spam does so through multiple accounts, and the number of mails each account sends within the unit of time is below the preset threshold, the prior-art method fails. That is, the prior-art spam filtering method cannot filter spam effectively.
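The weakness of the prior-art policy can be illustrated with a minimal sketch. The function name `over_limit_senders`, the account names, and the limit of 20 mails per unit of time are hypothetical values chosen for illustration; they do not come from the patent.

```python
from collections import Counter


def over_limit_senders(requests, limit):
    """Return the accounts whose request count in one unit of time exceeds the limit."""
    counts = Counter(requests)
    return {sender for sender, n in counts.items() if n > limit}


# One account sending 100 mails within the unit of time is caught by the limit.
single = ["spammer"] * 100
# The same 100 mails spread over 10 alternate accounts stay under the limit.
spread = [f"alt{i}" for i in range(10) for _ in range(10)]

blocked_single = over_limit_senders(single, limit=20)
blocked_spread = over_limit_senders(spread, limit=20)
```

In this sketch the per-account counter never sees more than 10 mails from any alternate account, which is the gap the method described in this document aims to close.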
Summary of the invention
Embodiments of the present invention provide a spam filtering method and device for improving the effect of spam filtering.
A spam filtering method provided by an embodiment of the present invention includes:
receiving the pending mails;
determining the sender identifier corresponding to each pending mail, as a current identifier;
determining the sender identifiers corresponding to previously recorded sent mails, as historical identifiers;
determining, according to the identifier set formed by the historical identifiers, the number of current identifiers not contained in the identifier set;
when the number is greater than a first set threshold, refusing to send some of the pending mails.
A spam filtering device provided by an embodiment of the present invention includes:
a pending mail receiving module, for receiving the pending mails;
a current identifier determining module, for determining the sender identifier corresponding to each pending mail, as a current identifier;
a historical identifier determining module, for determining the sender identifiers corresponding to previously recorded sent mails, as historical identifiers;
a quantity determining module, for determining, according to the identifier set formed by the historical identifiers, the number of current identifiers not contained in the identifier set;
a mail processing module, for refusing to send some of the pending mails when the number is greater than the first set threshold.
In the spam filtering method provided by an embodiment of the present invention, the sender identifiers corresponding to the received pending mails are taken as current identifiers, and the sender identifiers corresponding to previously recorded sent mails are taken as historical identifiers. When the number of current identifiers not contained in the identifier set formed by the historical identifiers is greater than the first set threshold, the method refuses to send some of the pending mails. By comparing the sender identifiers of the pending mails with those of the sent mails, the method determines whether a user is sending spam through multiple accounts. If many current identifiers are absent from the identifier set, the method concludes that such a user is present and filters the pending mails, so spam can be filtered effectively.
Brief description of the drawings
The accompanying drawings described here provide a further understanding of the present invention and form a part of it. The exemplary embodiments of the present invention and their descriptions are used to explain the invention and do not unduly limit it. In the drawings:
Fig. 1 shows the spam filtering process provided by an embodiment of the present invention;
Fig. 2 shows the detailed spam filtering process provided by an embodiment of the present invention;
Fig. 3 is a schematic structural diagram of the spam filtering device provided by an embodiment of the present invention.
Detailed description
In the prior art, if a user who sends spam does so through multiple accounts (commonly called "alternate accounts"), and the number of mails each account sends within a unit of time is below the set threshold, the spam filtering method fails. To filter spam effectively, embodiments of the present invention compare the sender identifiers corresponding to the current pending mails with the sender identifiers corresponding to sent mails to determine whether a user is sending mail through multiple alternate accounts and, if so, filter the mail accordingly.
To make the objectives, technical solutions, and advantages of the present invention clearer, the technical solutions are described clearly and completely below with reference to specific embodiments and the corresponding drawings. The described embodiments are only some of the embodiments of the present invention, not all of them. All other embodiments obtained by a person of ordinary skill in the art on the basis of these embodiments without creative effort fall within the scope of protection of the present invention.
The spam filtering method provided by an embodiment of the present invention is described in detail below.
Referring to Fig. 1, the spam filtering process provided by an embodiment of the present invention includes:
S101: Receive the pending mails.
In this embodiment, after a user finishes editing a mail and sends it, the mail is first delivered to the sending server; the user-edited mail received by the sending server is a pending mail. Generally, after receiving a pending mail, the sending server first uses a preset policy to judge whether the mail is legitimate and, if it is, sends it according to the recipient identifier it carries.
S102: Determine the sender identifier corresponding to each pending mail, as a current identifier.
In this embodiment, the sender identifier of a pending mail can be the sender's e-mail address carried in the mail, that is, the sender's account. Specifically, after receiving the pending mails, the sending server can extract the sender identifier from each of them, de-duplicate the extracted identifiers, and use the de-duplicated sender identifiers as the current identifiers.
For example, the sending server receives 5 pending mails: the 1st and 2nd are sent by account A, the 3rd and 4th by account B, and the 5th by account C. The sender identifiers extracted from these 5 mails are account A twice, account B twice, and account C once. De-duplication yields three accounts, A, B, and C, which are used as the current identifiers.
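The de-duplication in step S102 can be sketched with a Python set. Representing a mail as a dictionary with a `"sender"` key is an assumption made for illustration; the text only says that the sender identifier can be the sender's e-mail address carried in the mail.

```python
def current_identifiers(pending_mails):
    """Return the de-duplicated sender identifiers of the pending mails."""
    return {mail["sender"] for mail in pending_mails}


# The five pending mails of the example: two from A, two from B, one from C.
pending = [
    {"sender": "A"}, {"sender": "A"},
    {"sender": "B"}, {"sender": "B"},
    {"sender": "C"},
]
current = current_identifiers(pending)
```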
S103: Determine the sender identifiers corresponding to previously recorded sent mails, as historical identifiers.
In this embodiment, each time the sending server sends a mail, it saves a log entry for that mail in a historical record. The log entry contains at least the time the mail was sent, the sender identifier, and the recipient identifier. The sending server can therefore obtain the sender identifier of every sent mail from the historical record, de-duplicate the identifiers, and use the de-duplicated sender identifiers as the historical identifiers.
Continuing the example, suppose the sending server has previously sent 4 mails: the 1st was sent by account A, the 2nd and 3rd by account D, and the 4th by account E. From the send logs of these 4 mails in the historical record, the sending server extracts the sender accounts and de-duplicates them, obtaining account A, account D, and account E as the historical identifiers.
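A minimal sketch of step S103 follows. The log entry layout (keys `sent_at`, `sender`, `recipient`) mirrors the three fields the text says a log entry contains, but the key names, the timestamps, and the recipient addresses are hypothetical.

```python
from datetime import datetime


def historical_identifiers(sent_log):
    """Return the de-duplicated sender identifiers found in the send log."""
    return {entry["sender"] for entry in sent_log}


# Four sent mails as in the example: one from A, two from D, one from E.
# The timestamps and recipients are made up for illustration.
sent_log = [
    {"sent_at": datetime(2014, 10, 15, 9, 0), "sender": "A", "recipient": "r1@example.com"},
    {"sent_at": datetime(2014, 10, 15, 9, 5), "sender": "D", "recipient": "r2@example.com"},
    {"sent_at": datetime(2014, 10, 15, 9, 6), "sender": "D", "recipient": "r3@example.com"},
    {"sent_at": datetime(2014, 10, 15, 9, 9), "sender": "E", "recipient": "r4@example.com"},
]
history = historical_identifiers(sent_log)
```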
S104: According to the identifier set formed by the historical identifiers, determine the number of current identifiers not contained in the identifier set.
Continuing the example, the identifier set determined by the sending server is {account A, account D, account E}, and the current identifiers are account A, account B, and account C. The current identifiers not contained in the set are therefore account B and account C, so the number of current identifiers not contained in the identifier set is 2.
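Step S104 amounts to the size of a set difference. The sketch below uses the identifiers from the example; the function name `count_new_identifiers` is an illustrative choice.

```python
def count_new_identifiers(current, history):
    """Count the current identifiers that do not appear in the historical set."""
    return len(set(current) - set(history))


new_count = count_new_identifiers({"A", "B", "C"}, {"A", "D", "E"})
```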
S105: When the number is greater than the first set threshold, refuse to send some of the pending mails.
That is, the sending server judges whether the number of current identifiers not contained in the identifier set, as determined in step S104, is greater than the first set threshold. If so, a user is currently sending mail through multiple alternate accounts, so the pending mails are filtered: the server refuses to send some of them and may send the rest. Otherwise, no user is currently sending mail through multiple alternate accounts, and all pending mails can be sent directly.
Continuing the example, suppose the first set threshold is 1. Since the number of current identifiers not contained in the identifier set, as determined in step S104, is 2, which is greater than the first set threshold, the sending server refuses to send some of the 5 pending mails. The mails to refuse can be selected at random from the pending mails.
Specifically, a probability P of refusing to send can be preset for each pending mail, with the same P for every pending mail. For each pending mail, the sending server then refuses to send it with probability P and allows it to be sent with probability (1-P). Because P is the same for every pending mail, the sending server can also directly compute the product of the number of pending mails and a preset percentage, round the product to an integer, randomly select that many pending mails, refuse to send the selected mails, and send the remaining ones. The preset percentage is equal to the refusal probability P set for each pending mail.
The rounding method can be rounding up, rounding down, rounding half up, or another method; the present invention does not limit it.
Continuing the example, suppose the refusal probability P set for each pending mail is 0.7, so the preset percentage is 70%. The sending server computes the product of the number of pending mails, 5, and the preset percentage: 5 × 70% = 3.5. Rounding 3.5 half up gives 4. The server randomly selects 4 of the 5 pending mails, refuses to send those 4, and sends the remaining 1.
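The percentage-based refusal can be sketched as follows. Expressing the percentage as an integer (`reject_percent=70`) and rounding half up with integer arithmetic is a design choice of this sketch, made to avoid floating-point edge cases; the text allows other rounding methods. The function name and the `rng` parameter are hypothetical.

```python
import random


def split_rejected(pending, reject_percent, rng=random):
    """Randomly refuse round_half_up(len(pending) * reject_percent / 100) mails.

    Returns (rejected, accepted), both in the original order.
    """
    # Integer round-half-up: 5 mails at 70% -> (350 + 50) // 100 = 4.
    n_reject = min(len(pending), (len(pending) * reject_percent + 50) // 100)
    rejected_idx = set(rng.sample(range(len(pending)), n_reject))
    rejected = [m for i, m in enumerate(pending) if i in rejected_idx]
    accepted = [m for i, m in enumerate(pending) if i not in rejected_idx]
    return rejected, accepted


rejected, accepted = split_rejected(
    ["m1", "m2", "m3", "m4", "m5"], 70, random.Random(0)
)
```

With 5 pending mails and 70%, the split is always 4 refused and 1 sent; which mails are refused depends on the random generator.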
The method shown in Fig. 1 compares the sender identifiers of the pending mails with those of the sent mails to determine whether a user is sending mail through multiple alternate accounts. Even if a spammer sends spam through multiple alternate accounts and each alternate account sends only a few mails within a unit of time, the method can still determine that such a user is present and filter the pending mails, so spam can be filtered effectively.
In practice, spam is usually sent within specific periods, for example between 1:00 and 3:00 in the early morning. To make the method shown in Fig. 1 more targeted and to avoid wasting resources, a preset time period can be configured so that the method shown in Fig. 1 is used to filter spam only within that period.
Specifically, before refusing to send some of the pending mails in step S105, the sending server needs to determine that the current time is within the preset time period. The sending server can first judge whether the current time is within the preset time period; if so, it filters spam with the method shown in Fig. 1, and otherwise it can filter spam with other methods.
Further, in this embodiment, multiple periods can be set as the preset time period. For example, because spam is usually sent during two periods each day, 1:00~3:00 in the early morning and 12:00~13:00 at noon, these two periods can be set as the preset time period. After receiving the pending mails, the sending server first judges whether the current time is within 1:00~3:00 or 12:00~13:00. If it is, the server performs the spam filtering method shown in Fig. 1; otherwise it can filter spam with other methods.
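A check against multiple preset periods can be sketched as below. The two periods are the ones named in the text; treating both endpoints as inclusive is an assumption of this sketch, as are the names `PRESET_PERIODS` and `in_preset_period`.

```python
from datetime import time

# The two daily periods named in the text; inclusive endpoints are an assumption.
PRESET_PERIODS = [(time(1, 0), time(3, 0)), (time(12, 0), time(13, 0))]


def in_preset_period(t, periods=PRESET_PERIODS):
    """Return True if time-of-day t falls inside any preset period."""
    return any(start <= t <= end for start, end in periods)
```

The sketch assumes no period wraps past midnight; a period such as 23:00~1:00 would need to be split into two entries.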
Accordingly, in step S103, the historical identifiers can be determined as follows: in the historical record, determine the sender identifiers corresponding to the sent mails whose send time is not within the preset time period, and use them as the historical identifiers.
For example, suppose the preset time period is 1:00~3:00 every day, and the historical record holds the send logs of 4 sent mails, sent by account A, account D, and account E. The mail sent by account A has a send time of 9:00, which is not within the preset time period (1:00~3:00), while the mails sent by account D and account E were sent within the preset time period. When determining the historical identifiers, the sending server therefore finds that the only sent mail whose send time is not within the preset time period is the one sent by account A, and determines account A as the historical identifier.
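The rule that only mail sent outside the preset time period contributes historical identifiers can be sketched as follows. The accounts `X` and `Y`, the dates, and the log entry keys are hypothetical and chosen for illustration.

```python
from datetime import datetime, time


def outside_period(t, periods):
    """Return True if time-of-day t is not inside any preset period."""
    return not any(start <= t <= end for start, end in periods)


def historical_identifiers(sent_log, periods):
    """Sender identifiers of sent mails whose send time is outside the preset periods."""
    return {
        entry["sender"]
        for entry in sent_log
        if outside_period(entry["sent_at"].time(), periods)
    }


periods = [(time(1, 0), time(3, 0))]
sent_log = [
    {"sender": "X", "sent_at": datetime(2014, 10, 15, 9, 0)},  # outside 1:00~3:00
    {"sender": "Y", "sent_at": datetime(2014, 10, 15, 2, 0)},  # inside 1:00~3:00
]
history = historical_identifiers(sent_log, periods)
```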
In practice, users do not send mail at all hours. In different time periods, it is quite normal for the users who send mail to be partly or even entirely different, which means it is also normal for the sender identifiers of the pending mails received by the sending server to differ greatly from one period to another. Therefore, even when the number of current identifiers not contained in the identifier set (formed by the historical identifiers) is large (greater than the first set threshold), this does not prove that a user is sending spam through multiple alternate accounts. The current identifiers not contained in the identifier set may also belong to normal users. If some pending mails were refused whenever the number of such current identifiers exceeded the first set threshold, a large amount of normal mail would inevitably fail to be sent.
Therefore, to protect the delivery of normal mail as far as possible, in this embodiment the sending server can, before refusing to send some of the pending mails, also determine the ratio of the number of pending mails to the number of sender identifiers corresponding to those mails, and judge whether the ratio is greater than a second set threshold. If so, a user is currently sending mail through multiple alternate accounts, so the pending mails are filtered: the server refuses to send some of them and may send the rest. Otherwise, no user is currently sending mail through multiple alternate accounts, and all pending mails can be sent directly.
For example, suppose the number of pending mails is 500, the number of sender identifiers corresponding to them is 3, and the second set threshold is 200. After determining that the number of current identifiers not contained in the identifier set is greater than the first set threshold, the sending server computes the ratio of the number of pending mails to the number of corresponding sender identifiers, which is 500/3 (about 166.7). Because the ratio is less than the second set threshold of 200, no user is currently sending mail through multiple alternate accounts, and all pending mails can be sent directly. This protects the delivery of normal mail. If the ratio were greater than the second set threshold, the server would determine that a user is sending mail through multiple alternate accounts and refuse to send some of the pending mails.
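The second-threshold check reduces to one comparison. The sketch below uses the numbers from the example; the function name and the guard for an empty batch are illustrative additions.

```python
def exceeds_mail_per_sender_ratio(num_pending, num_senders, second_threshold):
    """Return True if pending mails per distinct sender exceed the second set threshold."""
    if num_senders == 0:
        return False  # nothing pending, so nothing to filter
    return num_pending / num_senders > second_threshold


# 500 pending mails from 3 senders: 500 / 3 is about 166.7, below the threshold of 200.
should_filter = exceeds_mail_per_sender_ratio(500, 3, 200)
```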
When setting the second set threshold, one can collect statistics on the maximum number of mails that a user sending normal mail sends within a unit of time and use that maximum as the second set threshold.
Preferably, the spam filtering method shown in Fig. 1 can be combined with one or more other spam filtering policies to further improve the filtering effect and reduce the likelihood of sending spam as far as possible.
For example, one or more of a flow control policy, a blacklist policy, and a text analysis policy can be combined with the method shown in Fig. 1, where:
Filtering mail with the flow control policy generally works as follows: after receiving the pending mails, the sending server, for each sender identifier, refuses to send the pending mails corresponding to that sender identifier when their number is greater than a third set threshold.
Note that if the pending mails are first filtered with the flow control policy and then with the method shown in Fig. 1, the second set threshold must be less than the third set threshold for the method shown in Fig. 1 to be effective.
Filtering mail with the blacklist policy generally works as follows: after receiving the pending mails, the sending server, for each sender identifier, refuses to send the pending mails corresponding to that sender identifier when the identifier is contained in a preset blacklist.
Filtering mail with the text analysis policy generally works as follows: after receiving the pending mails, the sending server performs text analysis on the content of each pending mail to judge whether it contains preset keywords. If it does, the server refuses to send the mail; otherwise the mail can be sent.
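The three auxiliary policies can be sketched as simple list filters. The mail layout (`"sender"` and `"body"` keys), the plain substring match used for keyword detection, and all sample values are assumptions of this sketch; the text does not specify how text analysis is performed.

```python
from collections import Counter


def flow_control_filter(pending, third_threshold):
    """Drop all mails of any sender whose pending mail count exceeds the third threshold."""
    counts = Counter(m["sender"] for m in pending)
    return [m for m in pending if counts[m["sender"]] <= third_threshold]


def blacklist_filter(pending, blacklist):
    """Drop mails whose sender identifier is in the preset blacklist."""
    return [m for m in pending if m["sender"] not in blacklist]


def text_filter(pending, keywords):
    """Drop mails whose content contains any preset keyword (substring match)."""
    return [m for m in pending if not any(k in m["body"] for k in keywords)]


pending = [
    {"sender": "A", "body": "meeting at 10"},
    {"sender": "B", "body": "win a free prize"},
    {"sender": "C", "body": "hello"},
    {"sender": "C", "body": "hello again"},
    {"sender": "C", "body": "third hello"},
    {"sender": "D", "body": "report attached"},
]
after_flow = flow_control_filter(pending, third_threshold=2)  # C has 3 mails
after_blacklist = blacklist_filter(after_flow, blacklist={"D"})
after_text = text_filter(after_blacklist, keywords=["free prize"])
```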
Suppose the flow control policy, the blacklist policy, and the text analysis policy are all combined with the method shown in Fig. 1. The detailed spam filtering process provided by an embodiment of the present invention is then as shown in Fig. 2.
Referring to Fig. 2, the spam filtering method provided by an embodiment of the present invention includes the following steps:
S201: Receive the pending mails.
S202: Filter the pending mails with the flow control policy.
S203: Filter the remaining pending mails with the blacklist policy.
The remaining pending mails in step S203 are the pending mails left after filtering with the flow control policy.
S204: Filter the remaining pending mails with the text analysis policy.
The remaining pending mails in step S204 are the pending mails left after filtering with the blacklist policy.
Note that steps S202, S203, and S204 can be performed in any order.
S205: Judge whether the current time is within the preset time period. If so, perform step S206; otherwise, perform step S214.
S206: Determine the sender identifiers corresponding to the remaining pending mails, as the current identifiers.
The remaining pending mails in step S206 are the pending mails left after filtering with the flow control policy, the blacklist policy, and the text analysis policy.
S207: In the historical record, determine the sender identifiers corresponding to the sent mails whose send time is not within the preset time period, as the historical identifiers.
S208: Determine the number of current identifiers not contained in the identifier set formed by the historical identifiers.
S209: Judge whether the number is greater than the first set threshold. If so, perform step S210; otherwise, perform step S214.
S210: Determine the ratio of the number of pending mails to the number of sender identifiers corresponding to the pending mails.
S211: Judge whether the ratio is greater than the second set threshold. If so, perform step S212; otherwise, perform step S214.
S212: Determine the product of the number of pending mails and the preset percentage, and round the product to obtain an integer value.
S213: Randomly select that integer number of pending mails, refuse to send them, and send the pending mails that were not selected.
S214: Send the remaining pending mails.
The remaining pending mails in step S214 are the pending mails left after filtering with the flow control policy, the blacklist policy, and the text analysis policy.
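Steps S205 to S214 can be put together in one sketch. It assumes the pending mails have already passed steps S202 to S204, and that mails and log entries are dictionaries with `"sender"` and `"sent_at"` keys. All names, the integer `reject_percent` with round-half-up, and the inclusive period endpoints are assumptions of this sketch rather than details given in the text.

```python
import random
from datetime import datetime, time


def in_period(t, periods):
    """Return True if time-of-day t falls inside any preset period (inclusive)."""
    return any(start <= t <= end for start, end in periods)


def filter_pending(pending, sent_log, now, periods, first_threshold,
                   second_threshold, reject_percent, rng=random):
    """Return (to_send, rejected) for mails that already passed S202-S204."""
    # S205: outside the preset period, send everything (S214).
    if not pending or not in_period(now.time(), periods):
        return list(pending), []
    # S206: de-duplicated sender identifiers of the pending mails.
    current = {m["sender"] for m in pending}
    # S207: senders of logged mails that were sent outside the preset period.
    history = {e["sender"] for e in sent_log
               if not in_period(e["sent_at"].time(), periods)}
    # S208-S209: count current identifiers missing from the historical set.
    if len(current - history) <= first_threshold:
        return list(pending), []
    # S210-S211: pending mails per distinct sender against the second threshold.
    if len(pending) / len(current) <= second_threshold:
        return list(pending), []
    # S212: product of mail count and percentage, rounded half up.
    n_reject = min(len(pending), (len(pending) * reject_percent + 50) // 100)
    # S213: refuse a random selection of that size, send the rest.
    rejected_idx = set(rng.sample(range(len(pending)), n_reject))
    to_send = [m for i, m in enumerate(pending) if i not in rejected_idx]
    rejected = [m for i, m in enumerate(pending) if i in rejected_idx]
    return to_send, rejected
```

For instance, 30 pending mails from three previously unseen accounts at 2:00, with a first threshold of 1, a second threshold of 5, and a percentage of 70, lead to 21 refused mails and 9 sent mails in this sketch.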
The above is the spam filtering method provided by an embodiment of the present invention. Based on the same idea, an embodiment of the present invention also provides a spam filtering device which, as shown in Fig. 3, includes:
a pending mail receiving module 31, for receiving the pending mails;
a current identifier determining module 32, for determining the sender identifier corresponding to each pending mail, as a current identifier;
a historical identifier determining module 33, for determining the sender identifiers corresponding to previously recorded sent mails, as historical identifiers;
a quantity determining module 34, for determining, according to the identifier set formed by the historical identifiers, the number of current identifiers not contained in the identifier set;
a mail processing module 35, for refusing to send some of the pending mails when the number is greater than the first set threshold.
Optionally, the device also includes:
a time determining module 36, for determining, before some of the pending mails are refused, that the current time is within the preset time period.
Optionally, the historical identifier determining module 33 is specifically used for: determining, in the historical record, the sender identifiers corresponding to the sent mails whose send time is not within the preset time period, as the historical identifiers.
Optionally, the device also includes:
a comparison module 37, for determining, before some of the pending mails are refused, that the ratio of the number of pending mails to the number of sender identifiers corresponding to the pending mails is greater than the second set threshold.
The mail processing module 35 is specifically used for:
determining the product of the number of pending mails and the preset percentage;
rounding the product to obtain an integer value;
randomly selecting that integer number of pending mails from the pending mails;
refusing to send the selected pending mails.
In the spam filtering method provided by an embodiment of the present invention, the sender identifiers corresponding to the received pending mails are taken as current identifiers, and the sender identifiers corresponding to previously recorded sent mails are taken as historical identifiers. When the number of current identifiers not contained in the identifier set formed by the historical identifiers is greater than the first set threshold, the method refuses to send some of the pending mails. By comparing the sender identifiers of the pending mails with those of the sent mails, the method determines whether a user is sending spam through multiple accounts. If many current identifiers are absent from the identifier set, the method concludes that such a user is present and filters the pending mails, so spam can be filtered effectively.
It should be understood by those skilled in the art that, embodiments of the invention can be provided as method, system or computer program Product.Therefore, the present invention can be using the reality in terms of complete hardware embodiment, complete software embodiment or combination software and hardware Apply the form of example.Moreover, the present invention can be used in one or more computers for wherein including computer usable program code The computer program production that usable storage medium is implemented on (including but is not limited to magnetic disk storage, CD-ROM, optical memory etc.) The form of product.
The present invention is described with reference to flowcharts and/or block diagrams of the method, device (system), and computer program product according to embodiments of the present invention. It should be understood that each flow and/or block in the flowcharts and/or block diagrams, and combinations of flows and/or blocks in the flowcharts and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to the processor of a general-purpose computer, a special-purpose computer, an embedded processor, or another programmable data processing device to produce a machine, such that the instructions executed by the processor of the computer or other programmable data processing device produce means for implementing the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing device to operate in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means, the instruction means implementing the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams.
These computer program instructions may also be loaded onto a computer or other programmable data processing device, so that a series of operational steps is performed on the computer or other programmable device to produce computer-implemented processing, whereby the instructions executed on the computer or other programmable device provide steps for implementing the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams.
In a typical configuration, a computing device includes one or more processors (CPUs), input/output interfaces, network interfaces, and memory.
Memory may include volatile memory in computer-readable media, such as random access memory (RAM), and/or non-volatile memory, such as read-only memory (ROM) or flash memory (flash RAM). Memory is an example of a computer-readable medium.
Computer-readable media include permanent and non-permanent, removable and non-removable media, and may store information by any method or technology. The information may be computer-readable instructions, data structures, program modules, or other data. Examples of computer storage media include, but are not limited to, phase-change memory (PRAM), static random access memory (SRAM), dynamic random access memory (DRAM), other types of random access memory (RAM), read-only memory (ROM), electrically erasable programmable read-only memory (EEPROM), flash memory or other memory technologies, compact disc read-only memory (CD-ROM), digital versatile disc (DVD) or other optical storage, magnetic cassettes, magnetic tape or disk storage or other magnetic storage devices, or any other non-transmission medium that can be used to store information accessible by a computing device. As defined herein, computer-readable media do not include transitory computer-readable media (transitory media), such as modulated data signals and carrier waves.
It should also be noted that the terms "comprise", "include", or any other variant thereof are intended to cover non-exclusive inclusion, so that a process, method, article, or device that comprises a series of elements includes not only those elements but also other elements not expressly listed, or elements inherent to such a process, method, article, or device. Without further limitation, an element preceded by "comprising a ..." does not exclude the presence of additional identical elements in the process, method, article, or device that comprises the element.
Those skilled in the art will understand that embodiments of the present application may be provided as a method, a system, or a computer program product. Accordingly, the present application may take the form of an entirely hardware embodiment, an entirely software embodiment, or an embodiment combining software and hardware aspects. Moreover, the present application may take the form of a computer program product implemented on one or more computer-usable storage media (including but not limited to magnetic disk storage, CD-ROM, optical storage, etc.) that contain computer-usable program code.
The above are merely embodiments of the present application and are not intended to limit the present application. For those skilled in the art, the present application may have various modifications and variations. Any modification, equivalent substitution, improvement, or the like made within the spirit and principle of the present application shall fall within the scope of the claims of the present application.

Claims (10)

1. A spam filtering method, characterized by comprising:
receiving mails to be sent;
determining the sender identifier corresponding to each mail to be sent, as a current identifier;
determining the sender identifier corresponding to each pre-recorded sent mail, as a historical identifier;
determining, according to the identifier set formed by the historical identifiers, the number of current identifiers not included in the identifier set;
when the number is greater than a first set threshold, refusing to send some of the mails to be sent.
2. The method according to claim 1, characterized in that, before refusing to send some of the mails to be sent, the method further comprises:
determining that the current time is within a preset time period.
3. The method according to claim 1 or 2, characterized in that determining the sender identifier corresponding to each pre-recorded sent mail specifically comprises:
determining, in a history record, the sender identifiers corresponding to sent mails whose sending time is not within the preset time period.
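The time-based conditions of claims 2 and 3 can be illustrated with a short Python sketch. It assumes, purely for illustration, that the preset time period is a daily time-of-day window and that the history record is a list of (sender identifier, sending time) pairs; the claims do not fix either representation, and all names here are hypothetical.

```python
from datetime import datetime, time


def in_window(moment, start, end):
    """Return True if the time of day of `moment` lies in [start, end).

    Assumption: the preset time period is a daily window; a window with
    start > end is treated as spanning midnight.
    """
    t = moment.time()
    if start <= end:
        return start <= t < end
    return t >= start or t < end


def historical_senders(sent_log, start, end):
    """Collect sender identifiers of sent mails whose sending time is
    NOT within the preset time period.

    `sent_log` is a hypothetical history record: an iterable of
    (sender_id, sent_at) pairs.
    """
    return {sender for sender, sent_at in sent_log
            if not in_window(sent_at, start, end)}
```

The same `in_window` helper could be applied to the current time to express the check of claim 2 before any mail is refused.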
4. The method according to claim 1, characterized in that, before refusing to send some of the mails to be sent, the method further comprises:
determining that the ratio of the number of mails to be sent to the number of sender identifiers corresponding to the mails to be sent is greater than a second set threshold.
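As a worked illustration of the ratio in claim 4, the following Python sketch divides the number of mails to be sent by the number of distinct sender identifiers among them. The input representation (one sender identifier per pending mail) and the handling of an empty batch are assumptions made for the example.

```python
def exceeds_ratio(pending_senders, second_threshold):
    """Return True if (number of mails to be sent) / (number of distinct
    sender identifiers) is greater than the second set threshold.

    `pending_senders` holds one sender identifier per pending mail.
    Assumption: an empty batch never exceeds the threshold.
    """
    distinct = len(set(pending_senders))
    if distinct == 0:
        return False
    return len(pending_senders) / distinct > second_threshold
```

For instance, four pending mails from two distinct senders give a ratio of 2.0, which is greater than a second threshold of 1.5 but not greater than a threshold of 2.0.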
5. The method according to claim 1, characterized in that refusing to send some of the mails to be sent specifically comprises:
determining the product of the number of mails to be sent and a preset percentage;
rounding the product to an integer to obtain a rounded value;
randomly selecting, from the mails to be sent, a number of mails to be sent equal to the rounded value;
refusing to send the selected mails to be sent.
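The selection steps of claim 5 can be sketched in Python as follows. The claim does not state the rounding direction, so rounding down is an assumption here, as are the function name, the representation of the preset percentage as a fraction between 0 and 1, and the optional `rng` parameter used to make the selection reproducible.

```python
import math
import random


def select_rejected(pending_mails, preset_percentage, rng=random):
    """Randomly pick the mails to be refused.

    `preset_percentage` is a fraction in [0, 1] (assumption).
    The product is rounded down (assumption; the rounding direction
    is not specified in the claim).
    """
    product = len(pending_mails) * preset_percentage
    count = min(math.floor(product), len(pending_mails))
    # random.sample draws `count` distinct items without replacement.
    return rng.sample(pending_mails, count)
```

A caller might pass `random.Random(seed)` as `rng` in tests; the remaining mails (those not returned) would then be sent as usual.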
6. A spam filtering device, characterized by comprising:
a to-be-sent mail receiving module, configured to receive mails to be sent;
a current identifier determining module, configured to determine the sender identifier corresponding to each mail to be sent, as a current identifier;
a historical identifier determining module, configured to determine the sender identifier corresponding to each pre-recorded sent mail, as a historical identifier;
a quantity determining module, configured to determine, according to the identifier set formed by the historical identifiers, the number of current identifiers not included in the identifier set;
a mail processing module, configured to refuse to send some of the mails to be sent when the number is greater than a first set threshold.
7. The device according to claim 6, characterized in that the device further comprises:
a time determining module, configured to determine, before some of the mails to be sent are refused, that the current time is within a preset time period.
8. The device according to claim 6 or 7, characterized in that
the historical identifier determining module is specifically configured to: determine, in a history record, the sender identifiers corresponding to sent mails whose sending time is not within the preset time period, as historical identifiers.
9. The device according to claim 6, characterized in that the device further comprises:
a comparison module, configured to determine, before some of the mails to be sent are refused, that the ratio of the number of mails to be sent to the number of sender identifiers corresponding to the mails to be sent is greater than a second set threshold.
10. The device according to claim 6, characterized in that the mail processing module is specifically configured to:
determine the product of the number of mails to be sent and a preset percentage;
round the product to an integer to obtain a rounded value;
randomly select, from the mails to be sent, a number of mails to be sent equal to the rounded value;
refuse to send the selected mails to be sent.
CN201410545491.8A 2014-10-15 2014-10-15 A kind of rubbish mail filtering method and device Active CN104348712B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410545491.8A CN104348712B (en) 2014-10-15 2014-10-15 A kind of rubbish mail filtering method and device

Publications (2)

Publication Number Publication Date
CN104348712A CN104348712A (en) 2015-02-11
CN104348712B true CN104348712B (en) 2017-10-27

Family

ID=52503565

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410545491.8A Active CN104348712B (en) 2014-10-15 2014-10-15 A kind of rubbish mail filtering method and device

Country Status (1)

Country Link
CN (1) CN104348712B (en)

Also Published As

Publication number Publication date
CN104348712A (en) 2015-02-11

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
TR01 Transfer of patent right
TR01 Transfer of patent right

Effective date of registration: 20230308

Address after: Room 501-502, 5/F, Sina Headquarters Scientific Research Building, Block N-1 and N-2, Zhongguancun Software Park, Dongbei Wangxi Road, Haidian District, Beijing, 100193

Patentee after: Sina Technology (China) Co.,Ltd.

Address before: 100080, International Building, No. 58 West Fourth Ring Road, Haidian District, Beijing, 20 floor

Patentee before: Sina.com Technology (China) Co.,Ltd.