EP1604293A2 - Procede de filtrage de messages de courrier electronique - Google Patents

Procede de filtrage de messages de courrier electronique

Info

Publication number
EP1604293A2
EP1604293A2 EP04718564A EP04718564A EP1604293A2 EP 1604293 A2 EP1604293 A2 EP 1604293A2 EP 04718564 A EP04718564 A EP 04718564A EP 04718564 A EP04718564 A EP 04718564A EP 1604293 A2 EP1604293 A2 EP 1604293A2
Authority
EP
European Patent Office
Prior art keywords
origin
message
whitelist
final
mail
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
EP04718564A
Other languages
German (de)
English (en)
Inventor
Steven T. Kirsch
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Abaca Technology Corp
Original Assignee
Propel Software Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US10/384,278 external-priority patent/US20040177120A1/en
Priority claimed from US10/683,598 external-priority patent/US7206814B2/en
Priority claimed from US10/685,090 external-priority patent/US20050091320A1/en
Priority claimed from US10/683,599 external-priority patent/US20050091319A1/en
Priority claimed from US10/682,941 external-priority patent/US7366761B2/en
Priority claimed from US10/683,951 external-priority patent/US20050080857A1/en
Application filed by Propel Software Corp filed Critical Propel Software Corp
Publication of EP1604293A2 publication Critical patent/EP1604293A2/fr
Withdrawn legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L51/00User-to-user messaging in packet-switching networks, transmitted according to store-and-forward or real-time protocols, e.g. e-mail
    • H04L51/21Monitoring or handling of messages
    • H04L51/212Monitoring or handling of messages using filtering or selective blocking

Definitions

  • This invention relates to data communications and, in particular, to processing e-mail messages.
  • e-mail messages contain a header having routing information (including IP addresses), a sender's address, recipient's address, and a subject line, among other things.
  • the information in the message header may be used to filter messages.
  • One approach is to filter e-mails based on words that appear in the subject line of the message. For instance, an e- mail user could specify that all e-mail messages containing the word "mortgage" be deleted or posted to a file. An e-mail user can also request that all messages from a certain domain be deleted or placed in a separate folder, or that only messages from specified senders be sent to the user's mailbox.
  • spammers frequently use subject lines that do not indicate the subject matter of the message (subject lines such as "Hi” or "Your request for information" are common) .
  • spammers are capable of forging addresses, so limiting e-mails based solely on domains or e-mail addresses might not result in a decrease of junk mail and might filter out e-mails of actual interest to the user.
  • blacklist i.e., identifying certain senders or content, etc., as spam
  • the Mailshell TM SpamCatcher Network creates a digital fingerprint of each received e-mail and compares the fingerprint to other fingerprints of e-mails received throughout the network to determine whether the received e-mail is spam.
  • Each user's rating of a particular e- mail or sender may be provided to the network, where the user's ratings will be combined with other ratings from other network members to identify spam.
  • Mailfrontier TM Matador TM offers a plug-in that can be used with Microsoft OUTLOOK to filter e-mail messages.
  • Matador TM uses whitelists (which identify certain senders or content as being acceptable to the user), blacklists, scoring, community filters, and a challenge system (where an unrecognized sender of an e- mail message must reply to a message from the filtering software before the e-mail message is passed on to the recipient) to filter e-mails.
  • Cloudmark distributes SpamNet, a software product that seeks to block spam.
  • SpamNet a software product that seeks to block spam.
  • the message is spam and is removed from the user's inbox.
  • Other users of SpamNet may report spam messages to the server. These users are rated for their trustworthiness and these messages are fingerprinted and, if the users are considered trustworthy, the reported messages blocked for other users in the SpamNet community.
  • Spammers are still able to get past many filter systems. Legitimate e-mail addresses may be harvested from websites and spammers may pose as the owners of these e-mail addresses when sending messages. Spammers may also get e-mail users to send them their e-mail addresses (for instance, if e-mail users reference the "opt-out" link in unsolicited e-mail messages) , which are then used by the spammers to send messages. In addition, many spammers forge their IP address in an attempt to conceal which domain they are using to send messages.
  • Information about the origin of the message (as indicated by the identifying information discussed above) is collected and statistics about the origin of the message are compiled at at least one database and used to categorize whether the received message is solicited or unsolicited. These statistics are then used to determine whether or not the received message is spam.
  • Fig. 1 is a block diagram of the network environment in which the invention operates .
  • Fig. 2 is a flowchart showing how e-mail is processed in accordance with the invention.
  • Fig. 3 is a diagram showing the establishment of an SMTP session for sending an e-mail message in the prior art.
  • Fig. 4a is a message header in the prior art.
  • Fig. 4b is a message header in the prior art.
  • Fig. 5 is a flowchart showing how the final IP address is determined in accordance with the invention.
  • Fig. 6 is a flowchart showing how e-mail is processed in accordance with the invention.
  • Fig. 7 is a flowchart showing how a whitelist is created in accordance with the invention.
  • Fig. 8 is a flowchart showing how e-mail is categorized in accordance with the invention.
  • Fig. 9 is a flowchart showing how a lookup of information is handled in accordance with the invention.
  • one embodiment of the invention has a sending device 10, for instance, a personal computer though the sending device could be any computer device capable of sending messages in a network, which is running an e-mail software program 12, such as OUTLOOK, EUDORA, etc.
  • software is a computer-readable storage medium (including compact disc, computer diskette, and computer memory, etc.) with code, or instructions, which, when read and executed by a computer, causes the computer to perform a process or task.
  • the sending device 10 is operated by a user.
  • the sending device 10 is connected to the sending device's e-mail server 16 via a network 14, such as the Internet.
  • the sending device's e-mail server 16 is running software 26 for handling e-mail messages sent by the sending device 10.
  • SMTP is generally used to send messages, while another protocol such as POP3 or IMAP is used for receiving messages; these protocols may run on different servers and the sending device's 10 e-mail program 12 generally specifies both an SMTP server or a POP3 or IMAP server for handling messages.
  • the sending device's 10 e-mail messages are sent through a network 14 from the sending device's e-mail server 16 to the recipient's e-mail server 18.
  • the recipient's e-mail server 18 is running software 24 to handle incoming messages and relay them, via a network 14 connection, to the recipient's 20 e-mail program 22 such as OUTLOOK, EUDORA, etc.
  • the recipient 20 in this embodiment is a personal computer though in other embodiments it could be any computer device capable of receiving messages. (As with the sending device, the recipient may be operated by a user.)
  • Filtering software 64 is associated with the recipient's 20 e-mail program 22. In other embodiments, the filtering software may be located at the recipient's e-mail server 18 or at another device in the network. In some embodiments, the recipient device has a database associated with the filtering software 64.
  • the recipient 20 is a member of an e-mail network consisting of other e-mail users employing the same approach to filtering e- mail messages.
  • Central database 66 stores information and compiles statistics about e-mail messages and their origin (for instance, the origin may be a site from where the message was sent, a specific sender sending a message from the site, and/or may be indicated by the IP path used to send the message) . (As will be discussed in greater detail below, there may be more than one database in other embodiments; each database would store different types of information.
  • the separate databases are not necessarily stored on the same machine but would be maintained by a central server.) This information and the statistics are used to assess the origin's reputation for sending unsolicited e-mail (discussed below in Figs. 2, 6, and 7) .
  • Software for managing the database and managing the e-mail network is associated with the database.
  • the database 66 is located at a third party server 88 which may be accessed over the network 14 by software 24, 64 at both the recipient's e- mail server 18 and the recipient 20.
  • the central database 66 may be located elsewhere in the network 14, such as at the recipient's e-mail server 18 or in direct connection with the recipient's e-mail server 18.
  • the central database 66 receives updates about e-mail messages and information about the origin of messages (for instance, senders, sites, etc.) sent at intervals by e-mail users, such as the recipient 20, within the e-mail network. (In embodiments employing separate databases, the updates and information are received at the central server, which then sends the received material out to the appropriate databases.) This information is normally sent after installation and when a new message is categorized. Updates also may be sent by the users (via the software 64 at their computers) either at regular, programmed intervals (for instance, every hour, though another time interval may be specified by the user or system administrator in other embodiments) or at irregular intervals as determined by the user. Information from the central database 66 (or databases) may be sent to recipients 20 either at regular intervals (for instance, every hour, though another time interval may be specified by the user or system administrator in other embodiments) or in response to a request from the recipient 20.
  • the recipient receives an e-mail message (block 100) .
  • a whitelist created by the recipient to indicate messages which are considered to be solicited, is checked to see if the sender or site is listed (block 102) .
  • the whitelist may contain just e-mail addresses, the e-mail address may be combined with at least one other piece of information from the message header or SMTP session. This information includes fields such as the display name, the final IP address, x-mailer, final domain name, user-agent, information about the client software used by the sender, time zone, source IP address, the sendmail version used by a first receiver, and the MAIL FROM address.
  • IP path Single pieces of information that are difficult to forge, such as the display name, final IP address, final domain name (a hostname, which may be normalized, which obtained by a reverse Domain Name System (“DNS") lookup of the final IP address) , or IP path may be used instead of an e-mail address to list and check senders or sites in other embodiments; in these embodiments, if an incoming message has the information that the user has included on a whitelist, for instance, a final domain name, that message would pass the whitelist test.
  • DNS Domain Name System
  • a whitelist may be created by specialized software (which may be associated with filtering software) running at the recipient's computer.
  • a whitelist may be constructed from the "Contacts" or "Address Book” section (i.e., any area where the recipient stores a list of e-mail addresses the recipient uses to contact others) of the recipient's e- mail program as well as using the To:, Cc : , and Bcc : information of e-mails that the recipient has sent (this may be done, for instance, by scanning the recipient's "Sent Items” folder in the e-mail program) .
  • the whitelist is constructed based on information about other e-mail users to whom the recipient has sent at least one e-mail or who have been explicitly added to the recipient's "Contacts" /"Address Book.”
  • Subject lines may also be used to determine if a sender should be included on the whitelist.
  • the subject line of a received message, stripped of any prefix such as re: and fwd:, is checked to see if it matches the subject line of a message recently sent by the user.
  • the user or administrator may set a parameter to determine the time frame for which the subject line is checked, for instance, messages sent over the last 3 days, 30 days, etc.
  • the user or administrator may also set a character or phrase limitation for adding senders to the whitelist.
  • the phrase "hi" may be used by both the user's acquaintances as well as spammers; the user or system and administrator may determine that messages from senders containing the subject line "hi" should not automatically be added to the whitelist.
  • the whitelist may contain just e-mail addresses or the e-mail address may be combined with at least one other piece of information from the message header or SMTP session. This information includes fields such as the display name, the final IP address, x-mailer, final domain name, user-agent, information about the client software used by the sender, time zone, source IP address, the sendmail version used by a first receiver, and the MAIL FROM address.
  • a blacklist created by the recipient to indicate messages which will not be accepted, is checked (block 106) .
  • Senders on the blacklist may be listed by e-mail address, e-mail address plus at least one piece of information from the message header, or other single pieces of information like the display name, final IP address, final domain name (or normalized reverse DNS lookup of the IP address) , IP path, etc.
  • the sender (or site) is on the blacklist (block 106) , the message is processed according to the recipient's instructions (block 108). For instance, the message could be deleted or sent to a spam folder (i.e., any folder designated as holding suspected unsolicited e- mail) .
  • the spam folder is located at the recipient although it could be located at the incoming mail server in other embodiments.
  • the sender or site is not on the blacklist (block 106) , the actual sender of the message is determined (block 110) .
  • other information identifying the origin (sender and/or the site) such as final IP address, final domain name, normalized reverse DNS of the final IP address, IP path, etc. may be used.
  • the origin of the message may be determined by an e-mail address or IP address.
  • a more trustworthy identifier or signature, indicating an actual sender which identifies a site and/or a specific sender at a site by combining pieces of information in the message header (discussed below) and/or information obtained from the SMTP (or some similar protocol) session used to send the message, at least one of which is not easily forged.
  • a range of IP addresses (where the top numbers of the IP address are identical but the last N bits are variable, indicating machines belonging to the same service provider or organization (for instance, the top 3 numbers may be the same but the last byte is variable) may also be combined with at least one piece of information from the message header or SMTP session to create the signature.
  • ISPs Internet Service Providers
  • a source IP the computer used to send the message
  • a final domain name the domain name corresponding to the IP address of the server which handed the e-mail message off to the recipient's trusted infrastructure
  • final IP address the IP address of the server which handed the e-mail message off to a recipient's trusted infrastructure (for instance, the recipient's mail server or a server associated with a recipient's forwarder or e- mail alias)
  • to identify an actual sender may be preferable since an unauthorized user probably would not know the source IP address and probably could not dial into the ISP and be assigned a machine with the same source IP address.
  • the sending computer 210 when a sender sends a message using the SMTP protocol, the sending computer 210 (for instance, a mail server used by the sender with which the sending device has a network connection) connects to the receiving site or computer 212 (for instance, a mail server for the ultimate recipient) on port 25 214.
  • the receiving site 212 will respond with a code indicating whether it will accept the connection 216.
  • the sending computer sends a HELO command (or EHLO command) to the receiving site 212 followed by the name of the sending computer (for instance, the command may read: HELO hostname.sendnet.com) 218.
  • the receiving site acknowledges the HELO command; at this point, the receiving site also has the IP address of the sending computer 220.
  • the sending computer specifies who the message is from in the MAIL FROM command 222 (here, the message is from sender@sendnet . com) .
  • the receiving computer then performs some tests on the address and either accepts or rejects it; in this case, the address is accepted 224. Some of the tests that may be performed include basic syntax, ensuring the domain exists and has a valid MX entry (i.e., a mail server is associated with the domain) , etc .
  • the sending computer then indicates the recipient's address (recipient@recipientnet.com) 226.
  • the receiving computer will then accept or reject the address 228; assuming the receiving computer is not an open relay, the receiving computer will reject any address that is not local to the receiving computer.
  • the sending computer then indicates it is ready to send the message with the DATA command 230.
  • the receiving computer responds indicating the sending computer may send the message 232.
  • the sending computer then sends the message 234 which the receiving computer acknowledges receiving 236.
  • the sending computer then indicates it wants to close the connection with the QUIT command 238 and the receiving computer indicates it is closing the connection 240.
  • the sending computer gives the receiving computer the following information while the connection is established: the sending computer's IP address and the name of the sending computer as indicated by the HELO (or EHLO) string. This information and/or other information extrapolated from this information may be used to identify the sender or site.
  • message headers 50, 56 are known in the prior art.
  • Message headers 50, 56 detail how an e-mail message arrived at the recipient's mailbox by listing the various relays 52, 84, 90, 86, 58 used to send the e-mail message to its destination.
  • the sender 68, 72, recipient 70, 74, and date 80, 82 (when the message was written as determined by the sender's computer, including the sender's timezone 160, 162) are also listed.
  • a unique Message-ID 76, 78 is created for each message.
  • IP path indicates the IP addresses of devices which handled the message as it was sent from the sender to the recipient. For instance, in Fig. 4a the IP path is 456.12.3.123, 111.22.3.444.
  • the actual sender may be identified by the sender's e-mail address or by creating a signature based on two or more pieces of information from the message header and/or the SMTP session used to send the message.
  • This information includes, but is not limited to: the display name of the sender; the sender's e-mail address; the sender's domain name; the final IP address; the final domain name (which may be normalized); the name of client software used by the actual sender; the user-agent; the timezone of the sender; the source IP address; the sendmail version used by a first receiver; the IP path used to route the message; the HELO or EHLO string; the normalized reverse Domain Name System ("nrDNS") lookup of the final IP address; the address identified in the MAILFROM line; and the IP address identified in the SMTP session.
  • nrDNS normalized reverse Domain Name System
  • the signature identifying the actual sender may also be created by combining a range of IP addresses with at least one piece of information from the message header and/or the SMTP session.
  • the final IP address may be determined by examining the message header of an e- mail message (block 40) . Starting at the top of the message header, the common "received" lines indicating receipt by the recipient's internal infrastructure are stripped off (block 42) . If no forwarder is used by the recipient (block 44) , the topmost remaining IP address corresponds to the server which handed off the message to the recipient's trusted infrastructure (block 48).
  • the receipt lines for the recipient's mail forwarder (i.e., the receipt lines indicating receipt after the message was received at the domain specified in the "To" section of the header) are stripped off (block 46) .
  • the topmost remaining IP address is the final IP address (block 48) .
  • IP address from the message header are as follows. Where no forwarder is used, the message header identifies devices local to the recipient, i.e., the recipient's e- mail infrastructure, and devices that are remote to the recipient, presumably the sender's e-mail infrastructure. Therefore, if the message header identifies the various devices as follows: local local local remote ⁇ - this is the final IP address remote remote remote the final IP address is the last remote server identified before the message is received by a local server. If a forwarding service is used, the message header might appear as follows: local local local forwarder forwarder remote • €- this is the final IP address remote remote.
  • the final IP address in this situation is the last remote server identified before the message is received by the forwarding server.
  • no forwarder is used.
  • the final IP address 54 indicates the server, mail.domainone.com, that handed off to the recipient's server, domaintwo.com.
  • a forwarder is used.
  • the receipt line 58 associated with the forwarder has to be stripped away to indicate the final IP address 62.
  • a final domain name is determined by performing a reverse DNS lookup of the final IP address.
  • the final domain name may be normalized.
  • Various normalizations are possible. For instance, numbers may be converted to a token, e.g. host64.domainone .com becomes host# .domainone . com.
  • a final domain name can be normalized using a handcrafted, special case lookup. For example, if the final domain name ends with "mx.domainone.com," the final domain name is normalized to ⁇ first three characters> + "mx.domainone.com.” Using this approach, if the reverse DNS ("rDNS”) of the final IP address is imo-dOl .mx. domainone.
  • the nrDNS value is imo.mx.domainone.com.
  • any number, or none, of the subdomains found in the rDNS lookup of the final IP address may be stripped away.
  • the rDNS of the final IP address is f63.machinel0.ispmail.com
  • the possible final domains are: f63.machinel0.ispmail.com; machinelO.ispmail.com; or ispmail.com.
  • the final domain name may also be identified by a numerical representation, for instance, a hash code, of the final domain code. Other normalizations may be used in other embodiments.
  • the decision of how to represent the final domain name is made according to settings determined by the system administrator or user.
  • the actual sender can be identified several ways. One way to identify the actual sender is to combine the display name with the final IP address (based on the information in Fig. 4a, Joe Sender/111.22.3.444) . Another way to identify the actual sender is to combine the display name, the e-mail address, and the final IP address
  • the signature identifying the actual sender can contain two or more pieces of information from the message header or SMTP session.
  • the actual sender may be identified by combining the display name, the domain name in the e-mail address, and nrDNS of the final IP address (in this embodiment, the rDNS of the final IP address is imo- d01.mx.domainone.com and the normalized rDNS is imo.mx.domainone.com) (e.g. Joe
  • the actual sender can also be identified by using a MAIL FROM address (which may be normalized, for instance, the hostname only or ⁇ first three letters> + domain name; as with the final domain name, various normalizations are possible) .
  • the domain name is not used to identify the actual sender, and the display name, MAIL FROM address (which may be normalized) , and e-mail address or username identified in the e-mail address are combined with the nrDNS of the final IP address.
  • Other ways to identify the actual sender include combining a domain name (such as the domain name of the sender from the From: line in the e-mail headers) with the final IP address.
  • the signature combines a range of IP addresses with at least one piece of information from the message header or SMTP session, a possible identification of the actual sender could combine the range of IP addresses with the domain name.
  • Other information identifying the origin of the message may be used in other embodiments.
  • the nrDNS name may be used, if it exists, to identify the site (or the sender); if nrDNS is not available, the final IP address, a netblock (a range of consecutive IP addresses) , or owner data stored in databases such as the American Registry of Internet Numbers ( "ARIN" ) may be used instead. In other embodiments, the final IP address, netblock, or owner data may be used to identify the sender or site regardless of whether nrDNS is available.
  • the e-mail message is categorized based on other information about the actual sender (block 112) .
  • a message can be categorized based on other information about the origin of the message, including the site, based on identifying information such as final IP address, final domain name (or nrDNS of the final IP address) or IP path based on the same approach described below) .
  • the information about the actual sender, as well as the recipient's determination of whether the message was solicited is collected at a central database in the network.
  • a score indicating the likelihood that a message from a particular actual sender is unsolicited may be determined, for example, by calculating the number of messages sent by the actual sender which have been whitelisted and comparing that number to the number of messages sent by the actual sender which have been blacklisted or are unknown (no. whitelist/ (no. blacklist + no. unknown) ) .
  • the score may be calculated and applied to a message by either database software or the filtering software.
  • thresholds set by either the user or system administrator determine which messages are passed through the filter and which messages are not passed by the e-mail filter and are instead sent to the spam folder or deleted. The thresholds may be based either on raw statistics or on scores.
  • the threshold should be set so that messages having origins with good reputations should be allowed through the filter while messages having origins with bad or unknown reputations are not allowed through the filter (mechanisms for dealing with origins with unknown reputations are discussed below) . For instance, if more than ninety-nine percent of an actual sender's total number of messages sent or total number of messages sent to unique users go to recipients who wish to receive the message, it is likely that the actual sender is not sending spam. Therefore, a threshold may be set where an actual sender has a good reputation if greater than fifty percent of his or her (or its, in the case of a site) messages are wanted by the recipients. Messages from actual senders whose reputations exceed the fifty percent threshold may be passed on to the recipient.
  • a list of senders with good reputations is compiled at the database. Senders may be added to or removed from the database if their reputation changes.
  • a threshold based on the statistics compiled at the database determines a "good" reputation and is set by either the user or system administrator. Recipients of messages from unknown senders can check the list at the database to see whether the sender has a good reputation, in which case the message will be passed through the filter. If the sender does not have a good reputation and instead possesses a bad or unknown reputation, the message is sent to the spam folder. (Other information about the origin of the message, such as the site sending the message, may be compiled and checked in a similar fashion. )
  • Fig. 6 after the message has been categorized (Fig. 2, block 112), information about the actual sender and the disposition (i.e., status values indicating whether the message was solicited or not) is sent to the central database to be stored using a key (the e-mail address, the actual sender, final IP address, final domain name or nrDNS of the final IP address, IP path, etc.) (block 132).
  • a key the e-mail address, the actual sender, final IP address, final domain name or nrDNS of the final IP address, IP path, etc.
  • the origin key is the final IP address, final domain name or nrDNS of the final IP address, and/or the IP path.
  • Information sent to the central database includes: information about the actual sender; whether the actual sender is included on the recipient's whitelist; whether the actual sender is included on the recipient's blacklist; whether the message could be categorized locally; and whether the recipient changed the whitelist/blacklist status of the message (i.e., changed the status of the sender in the message) .
  • the same information about the final IP address, final domain name or nrDNS of the final IP address, or IP path is sent to the central database.
  • information about the actual sender, final IP address, final domain name or nrDNS of the final IP address, and IP path, or any combination thereof may be sent to the central database.
  • at least two pieces of information are sent to the central database. In one embodiment, this information is sent as soon as the message is categorized; however, the information may be sent at different intervals (for instance, when user activity is observed) set by either the user or the system administrator in different embodiments.
  • the same information sent to the central database is also stored at the recipient device.
  • counts such as the number of messages from each actual sender, final IP address, final domain name or nrDNS of the final IP address, etc. are sent to the central database while a local copy is kept at a database at the recipient device. This gives the recipient access to a set of personal statistics and information based on messages received by the recipient as well as global statistics and information stored at the central database which is based on information about messages received by users in the network.
  • the next message is analyzed (block 206) to see if it was sent by a whitelisted sender (or site) (block 204) . If the message was sent by a sender (or site) on the whitelist (block 204) , information about the sender (or site) , such as the e-mail address, signature, actual sender, final domain name or nrDNS of the final IP address, final IP address, IP path, or any combination of these items, are sent to the central database; in addition, a local copy of the information is kept at the recipient device (block 208) .
  • counts such as the number of messages from each sender (or site) , final IP address, final domain name or nrDNS of the final IP address, etc. are sent to the central database while a local copy may be kept at the recipient device.
  • the next message is then processed accordingly (block 206) . This process may occur at or subsequent to initialization.
  • the central database maintains the statistics about actual senders (or other information sent about the origin such as sender and/or site, IP path, etc. in other embodiments) (block 134) .
  • the recipient's database has the same functionality for storing information and compiling statistics as the central database, discussed below.
  • the central database collects information from users that is used to establish raw counts, for instance: the number of messages sent by an actual sender (identified by a signature combining information from the message header and/or an SMTP session) ; the number of messages sent by an actual sender over a time interval set by a user or system administrator; the total number of messages an actual sender sent to recipients who know the actual sender
  • the sender has been included on the recipient's whitelist through any of the mechanisms discussed herein based on information in the message header: e-mail address, final IP address, domain name, subject line, etc.); the number of messages an actual sender sent to recipients who know the actual sender in the network over a time interval set by the user or system administrator; the number of recipients who know the actual sender; the total number of times a recipient changed an actual sender's whitelist/blacklist status; the number of times a recipient changes an actual sender's whitelist/ blacklist status over a time interval set by a user or system administrator; the total number of messages sent to recipients in the network who don't know the actual sender (i.e., the sender is not on the whitelist); the number of messages sent to recipients in the network who don't know the actual sender over a time interval set by the user or system administrator; and the total number of unique recipients in the network who have received at least one message from the actual sender.
  • the same information may also be compiled for other indicators of the message's origin, for instance, messages' final IP addresses, final domain names (or nrDNS of the final IP address), and/or IP paths.
  • information on the final IP address and all possible final domain names is collected (as noted above, if the reverse DNS lookup of the final IP address results in the domain name f63.machine 10.ispmail.com, the possible final domains are f63.machinel0.ispmail.com, machinelO.ispmail.com, or ispmail.com. Therefore, in this embodiment, information on all these potential final domain names is collected.)
  • separate databases may be maintained for storing different information about the origin of a message. For instance, there may be one database to track information on senders identified by a combination of e-mail address and signature and another for collecting information identified by a combination of the sender's display name, final domain name (or nrDNS of the final IP address), and final IP address. Another database may store information about sites identified by the nrDNS of the final IP address. The types of information stored and number of databases used to store that information are set by the system administrator. While the separate databases may be stored on separate machines, they are maintained by one central server which receives information from the users and sends it to the relevant databases .
  • the central database can use the collected information to compute statistics that may be used to indicate the likelihood that a message having a particular origin is spam.
  • these statistics show whether most of the e-mail sent from an origin (in this example, the actual sender) is sent to recipients who wish to see the contents of those messages.
  • the following statistics may be accumulated for each actual sender:
  • the ratio over a time interval (in one embodiment, 24 hours, though another time interval may be set by the user or system administrator in other embodiments) of the number of e-mails sent to recipients who know the actual sender (i.e., the actual sender, final IP, final domain name, nrDNS of the final IP address, or IP path, etc. was on the recipient's whitelist) in the e-mail network divided by the total number of e-mail messages sent to users in the e-mail network during the time interval;
  • the ratio over a time interval (in one embodiment, 24 hours, though another time interval may be set by the user or system administrator in other embodiments) of the number of unique recipients in the e-mail network who know the actual sender divided by the total number of unique recipients in the network who received e-mails from the actual sender during the time interval;
  • the ratio over a time interval (in one embodiment, 24 hours, though another time interval may be set by the user or system administrator in other embodiments) of the number of times a message from the actual sender was moved from a recipient's whitelist to the blacklist divided by the total number of times a message from the actual sender was moved either from a whitelist to a blacklist or from a blacklist to a whitelist;
  • ratios showing the actual sender mostly sends messages to recipients who know the actual sender may also be used. These ratios will return high values if the actual sender sends to recipients who know the actual sender and low values if the actual sender sends messages to recipients who do not know the actual sender and are not willing to whitelist the message. In other embodiments, these ratios may be calculated for other indicators of the origin of the message, such as final IP addresses, final domain names (or nrDNS of the final IP address) , and/or IP paths as required. Other metrics that are not ratios, for instance, differences, may also be calculated.
  • the difference between the number of expected messages (i.e., messages on the whitelist) versus the number of unexpected messages (i.e., messages not on the whitelist) or the number of times a user moves a message to the whitelist compared to the number of times a user moves a message to the blacklist may be useful in determining whether a message is wanted.
  • the ratios or differences may also be converted to a score and applied to the message (for instance, in the spam folder) to let the recipient know whether the message is likely spam.
  • the score may also be used to sort messages, for instance if they are placed in a spam folder.
  • the score may be a number between 0 and 100.
  • the equation [max(logl0 (ratio) , -4) +4/6] *100 yields a number between 0 and 100.
  • Differences may be converted to a score by determining a percentage.
  • the message score may also be obtained by determining the average, product, or some other function of two or more scores for the message, for instance, the score based on the reputation of the sender as identified by the sender's e-mail address and signature and the score based on the combination of the sender's e-mail address/final domain name/final IP address.
  • a low threshold may be set to differentiate "good" messages from spam. For instance, if more than one percent of an actual sender's total number of messages sent or total number messages sent to unique users, go to recipients who wish to receive the message, it is likely that the actual sender is not sending spam since spam would likely have an approval rate of far less than 1% of the recipients, e.g., ⁇ .01%.
  • the threshold may be set to another, higher percentage by either a user or system administrator
  • the messages are probably not spam and may be passed to the recipient .
  • Each member of the network has the option to set personal "delete" and "spam" thresholds. Assuming that a message with a low rating or score indicates a greater likelihood the message is unsolicited, if a message's rating or score drops below the spam threshold, the message is placed in the spam folder; if the message's score drop below the delete threshold, the message is deleted.
  • the initial rating may be (0,25) where the first number represents the "good” element and the second number represents the "bad” element (the ratings may also be in ratio form, such as 0:25) .
  • Implicit good or bad ratings i.e., those based on a whitelist or blacklist, count as one point while explicit good or bad ratings, where a user manually moves a message to the whitelist or blacklist, count as 25 points.
  • the reputation/ rating is reevaluated, the last entry is reversed and the new entry is entered. For instance, if the last entry is (0,25), indicating a user manually blacklisted a message, and the new entry reflects that one other user has whitelisted the message, the new reputation is (25,25) .
  • Other embodiments may use any rating system, with different weights given to implicit or explicit ratings, chosen by the user or system administrator.
  • multiple values for each origin are maintained at the central database (s) in order to determine the origin's reputation.
  • These values include: the number of messages which were explicitly ranked "good;” the number of messages which were implicitly ranked “good;” the number of messages whose ranking is unknown; the number of messages which were explicitly ranked "bad;” and the number of messages which were implicitly ranked “bad.” Any number of these values may be stored; in one embodiment, as many as five of these values may be maintained for an actual sender, final IP address, final domain name (or nrDNS of the final IP address) , and/or IP path, depending on the embodiment. The values may represent either message counts or ratings of unique users within the network, depending on the embodiment. This approach allows the weighting algorithm of explicit vs. implicit, discussed above, to be changed at any time.
  • a value of four for the number of unknown messages would indicate that four unique users in the network received a message from the origin and none of the unique users has viewed the message. Once a user has viewed the message, it will be given a good or bad explicit or implicit score and the remaining unviewed messages may be processed accordingly.
  • the central database may return up to five of these values to the recipient in order to give the recipient the ability to apply different weights to the message.
  • new, unknown senders may be rated or scored based on information about the final IP address used by that sender.
  • the rating or score for the final IP address should be multiplied by some number less than one, for instance 0.51, to get a score for the new sender.
  • This same approach may also be used to determine a rating or score for an unknown sender with a known final domain name (or nrDNS of the final IP address) .
  • This approach allows senders from trusted domains (those domains whose senders send an overwhelming number of good messages, for instance, 99% of messages sent from the domain are rated as "good”) to pass through the filter even if the sender is not known.
  • new, unknown senders using known final IP addresses or final domain names may be rated based on the rating record of other new senders (i.e., recently- encountered e-mail addresses) that have recently used the final IP address or final domain name (or nrDNS of the final IP address) . For instance, if the majority of new senders using the final IP address or final domain name (or nrDNS of the final IP address) are whitelisted by other recipients in the network, other new senders from that final domain name (or nrDNS of the final IP address) or final IP address are also trusted on their initial e- mail.
  • other new senders from that final domain name (or nrDNS of the final IP address) or final IP address are also trusted on their initial e- mail.
  • the message from the new sender is placed in a spam folder (or, in one embodiment, as "suspected" spam folder where messages which are not easily categorized, for instance because of lack of information, are placed for the recipient to view and rate) .
  • Senders using different IP addresses may get passed through the filter provided they send to known recipients.
  • the sender's reputation for messages from that IP address (assuming that the actual sender here is identified by the e-mail address and final IP address) will be based on 0 messages sent to known recipients and 1 message sent to a recipient in the network - a ratio of 0:1. (In this example, the ratio being used is based on the number of messages sent to known recipients compared to the number of messages sent to unknown recipients . Other ratios may be used in other embodiments.) Therefore, this e-mail message is placed in a spam folder.
  • the sender sends a message to a known recipient
  • the ratio of messages sent to known recipients compared to messages sent to unknown recipients has improved to 1:1. Since most users' thresholds are set to one percent, or a ratio of 1:100, the first message can be released from the spam folder since the threshold for this sender has been exceeded.
  • the same sender dials into an ISP, gets a unique IP number, and sends messages to two unknown recipients.
  • the sender's reputation is based on 0 messages sent to known recipients and 2 messages sent to unique recipients in the network - a ratio of 0:2.
  • the ratio improves to 1 message sent to a known recipient compared to 2 messages sent - the ratio has improved to 1:2. This ratio exceeds the one percent threshold and the message that remains in the spam folder may also be released.
  • the message is added to the whitelist.
  • a new final IP address (or, in other embodiments, a new final domain name (or nrDNS of the final IP address) ) may be given an implicit "good” count of one or more - for instance, its initial rating could be (1,0) (as noted above, the first number represents the "good” element while the second number indicates the "bad” element) .
  • a sender with a new final IP address will have his or her first message passed through the filter. Provided subsequent e-mails are not blacklisted, those e-mail messages will also be passed through and increase the reputation of the sender and the final IP address. However, is the sender is sending unsolicited e-mails, his or reputation will quickly drop and the sender's messages will be stopped by the filter.
  • This approach enables legitimate new sites, as indicated by the final IP address (or final domain name) to establish and maintain a positive reputation within the e-mail network.
  • This approach may also be employed in embodiments where a message score is obtained by determining the average, product, or some other function of two scores for the message.
  • the sender's score and the final IP address score are determined by dividing the number of good messages received by the total number of messages (good + bad) received and multiplying by 100
  • the message score is determined by the product of the sender's score and the final IP address's score
  • the first message from a new sender and a new final IP address are each given an implicit good rating (i.e., a rating of 1)
  • the message score for a new message sent by a new sender from a new final IP address is (1/(1+0) * 1/(1+0)) * 100, or 100.
  • the sender sends 4 unsolicited messages to other users in the network
  • the next message from the sender will receive a score of (1/(1+4) * 1/(1+4)) * 100, or 4.
  • This new message score which reflects the fact that the new sender at the new IP address has sent more unsolicited e-mail than wanted messages, is sufficient to place the newest message in the spam folder.
  • messages from new senders will not be placed in the recipient's inbox because the message score is (1/(1+0) * 1/(1 + large number of unsolicited messages sent from a suspect final IP address)) * 100, which will give a number close to 0.
  • "bad" domain reputations as measured by final IP address or final domain name (or nrDNS of the final IP address) may be reset at some interval, for instance, once a week, in case the final IP address has been reassigned.
  • a message from a new sender may be scored by relying exclusively on the other factor. For instance, in embodiments where the message score is determined by multiplying the sender's reputation and the final IP address reputation, a message from a new sender who is using an established final IP address may be scored by relying only on the final IP address .
  • different initial ratings for new senders, etc. may be used.
  • a new final IP address may be given a rating of (1,1) when the network is fairly new and, after a few months, new final IP addresses may be given a rating of (1,2) .
  • the initial rating is (1,1)
  • the message from the new final IP address will be placed at the top of the spam folder, where the recipient may decide whether to whitelist or blacklist it.
  • the software could send a challenge or notification e-mail to the sender using the new final IP address indicating that the message was placed in a spam folder and the sender should contact the recipient in some other fashion.
  • This approach may also be used for new final domain names.
  • a "most respected rater" scheme may be used in another embodiment.
  • Each new member of the network is given a number when joining. Members with lower numbers (indicating longer membership in the network) have more "clout" and can overwrite members with higher numbers. (Member numbers are recognized when the member logs in to the network and the system can associate each member with his or her number when information is sent to the central database.) Ratings may be monitored and if a new member's ratings are inconsistent with other members' ratings, the new members' ratings are overwritten.
  • Another rating approach requires the release of small numbers of a sender's messages into the inboxes of recipients. The released messages are monitored and the frequency with which these messages are blacklisted is determined. If a small percentage of the released messages is added to blacklists, a larger random sample of a sender' s messages is released and the frequency with which these messages are blacklisted is determined. This process is repeated until all the sender's messages are released or the frequency with which the messages in the sample are blacklisted indicates the sender's message is unwanted.
  • One rating approach requires other members of the network to "outvote" a rating decision made by another member in order to change the rating. For instance, if one member decides to place a message in the Inbox, two other members will have to "vote” to place it in the spam folder in order for the message to be placed in the spam folder. If four members vote to release a message from the spam folder, eight members would have to vote to put it back in the spam folder in order for the message to be returned to the spam folder. The rating eventually stabilizes since there are more good members rating the messages than bad members.
  • the recipient may have to request information from the central database.
  • the statistics and scores about the origin i.e., actual senders, final IP addresses, final domain names (or nrDNS of the final IP addresses) , or IP paths are sent from the central database to the recipient, either upon request, after which they are stored locally at the recipient device in a table or database dedicated to "global" statistics (as opposed to personal statistics based exclusively on messages sent to the recipient) , or at regular intervals (for instance, updated statistics about actual senders, sites, final IP addresses, final domain names (or nrDNS of the final IP addresses) , and/or IP paths known to the recipient may be sent every day, though in other embodiments different intervals may be set by either the user or the system administrator) .
  • the ratios or scores are used to determine whether a message is likely good or spam.
  • information about the actual sender is used to categorize the e-mail. If the reputation of the actual sender (as measured by the ratios and statistics) passes the threshold, i.e., the actual sender has a good reputation, the message may be processed accordingly (for instance, the message may be placed in the recipient's inbox).
  • a list of actual senders with good reputations is checked at the database and the message is processed accordingly and a message from an actual sender with a good reputation is placed in the recipient's inbox.
  • the message may be categorized locally (block 152) .
  • these statistics are checked first before checking the global statistics stored at the recipient device.
  • information about the actual sender is not available locally (block 150)
  • information may be requested from the central database (block 154) .
  • requests are sent to the central database which then retrieves the information from the relevant databases and sends it to the recipient device.
  • the central database will send the recipient information, including raw counts, ratios, and scores, about the actual sender (block 158) .
  • the central database will send the recipient some other information about the origin, such as final IP address, final domain name (or nrDNS of the final IP address) , or IP path in the message (block 160) .
  • raw counts about the final IP address, final domain name (or nrDNS of the final IP address) , or IP path may be sent regardless of the information available about the actual sender; these raw counts may be used by the recipient to determine ratios, etc.
  • the central database may return two or more values or scores to the recipient instead of just one.
  • the central database may return values or scores based on final domain name/final IP address and e-mail address/signature. (Values and scores based on other types of origin- identifying information may be sent in other embodiments.) If the recipient has a value or score from the personal database, the value or score from the personal database may be used instead of the value or score from the global database.
  • information about the final IP address, final domain name (or nrDNS of the final IP address) , and/or the IP path is used to categorize the message.
  • the information is used to determine if senders and/or sites using the final IP address, final domain name (or nrDNS of the final IP address) , and/or IP path have sent spam messages (provided this option is set by either the system administrator or the user) .
  • the information may be looked up for each final IP address, final domain name (or nrDNS of the final IP address), etc., on an individual basis, in another embodiment various pieces of information may be used during the lookup to determine the closest match to information in the central database.
  • the final IP address was found to be 64.12.136.5 and the possible final domains were f63.machinel0.ispmail.com ("final domain 1"); machinelO.ispmail.com ("final domain 2"); or ispmail.com ("final domain 3").
  • a lookup request containing the final IP address and the possible final domains is sent to the central database (block 170) .
  • the central database checks to see if there is information about the final IP address (block 172) . If information about the final IP address is available (block 172) , it is sent to the recipient (block 174) .
  • the central database checks to see if information about final domain 1 is available (block 176) . If so, that information is sent to the recipient (block 174) ; if no information is available for final domain 1 (block 176) , final domain 2 is checked (block 178) . If information is available for final domain 2 (block 178) , it is sent to the recipient
  • the central database checks to see if information about final domain 3 is available (block 180) . If information is available
  • block 180 it is sent to the recipient (block 174) ; otherwise, since no information about the final IP addresses or final domain names is available to be sent to the recipient, the message will be placed in the recipient's spam folder (block 182). On future lookups, the IP address and final domain names are checked in the same order to determine the best possible match.
  • the message is passed only if the final IP address, final domain name (or nrDNS of the final IP address) , or IP path have never been used to pass unwanted messages.
  • other thresholds may be set by the user or system administrator in other embodiments which would allow messages to be passed provided the information about the final IP address, final domain name (or nrDNS of the final IP address) , or IP path passes the threshold.
  • the message is sent to the recipient (for instance, the message is sent to the recipient's inbox) (block 104).
  • the spam folder may be located at either the recipient device or at the incoming mail server.
  • the spam folder may be reviewed by a recipient to determine whether he or she wishes to view any of these messages.
  • a recipient may manually release a message from the spam folder. If a message is released from the spam folder, it is placed on the whitelist unless the recipient decides otherwise.
  • scores from the central database or recipient's database may be applied to messages in the spam folder to indicate likelihood the messages are spam or may be used to sort the messages (for instance, messages that are almost certainly spam are placed at the bottom of the list while messages that are more likely to be of interest to the recipient are placed near the top of the list) . Since the reputations the of origin, indicated by actual senders, final IP addresses, final domain names (or nrDNS of the final IP addresses) , and IP paths, can change over time, the spam folder should be re-evaluated periodically to determine whether a message should be released from the spam folder and sent to the recipient (block 118) .
  • the central database will update the raw counts and statistics for the actual sender as it receives information from each recipient in the network (the statistics for other indicators of the origin such as final IP addresses, final domain names (or nrDNS of the final IP addresses) , and/or IP paths are also updated when this occurs) .
  • the statistics for other indicators of the origin such as final IP addresses, final domain names (or nrDNS of the final IP addresses)
  • IP paths are also updated when this occurs.
  • messages may automatically be removed from the spam folder if messages from the actual sender (or other indicators of origin such as final IP address or final domain name (or nrDNS of the final IP address) ) exceed the threshold.
  • a message that can't be rated locally is put in a spam folder and rating is delayed until user activity (i.e., any interaction (sending a message, viewing a folder, etc.) with the e-mail program) is observed.
  • This "just in time” rating ensures that messages are categorized using the most recent data before the messages are read.
  • the "just in time” rating can work as follows: when the reputation of a sender or site changes (good to bad, bad to good, good to suspect, etc.), the central database (s) tracking global statistics will send, or push, this information to all recipients in the network. The recipients can then check all messages received over the previous 24 hours (another time period may be specified by the user or system administrator in another embodiment) and updating the rating or categorization of that message as necessary.
  • a message's whitelist/blacklist status changes i.e., a message is moved from the whitelist to the blacklist or vice versa
  • the central database is notified and the statistics are updated (block 138) .
  • higher weight is given to manual (explicit) reversals of whitelist/blacklist status than implicit rankings (where, for instance, a sender or site is automatically placed on a whitelist because of the sender's or site's reputation rather than a user explicitly placing the sender or site on the whitelist) .
  • Reversals may be weighed at 100 times a regular vote (different weights may be used in other embodiments) .
  • the ratio of good/total messages is 0/1000. However, if 10 customers (one percent of the recipients) reverse, the ratio becomes 1000/1000, which greatly exceeds the threshold of a one percent favorable response required to release the other messages from the spam folder.
  • the recipients' spam folders are monitored (block 140) .
  • the actual sender's reputation is readjusted as discussed above (block 144). If the actual sender's reputation now exceeds the threshold (block 146) , other messages from the actual sender are automatically released from spam folders (block 148) . This is done by the software at the recipient's computer after receiving updates from the central database.
  • updated information is requested from the central database when the user opens the spam folder. When the information is received, it should be applied to the messages in the spam folder, allowing the user to use the most current information to make decisions about messages in the spam folder.
  • the spam folder is located at the incoming mail server
  • software at the mail server requests information from the central database and manages the spam folder accordingly. If the actual sender's reputation does not exceed the threshold (block 146) , or if no messages were released from the spam folder (block 142) , no further action is taken other than to continue to maintain statistics about actual senders
  • the Inbox as well as the spam folder is also periodically reevaluated to determine if the rating of any of the origins of messages in the Inbox has changed. If the origin's reputation is no longer "good, " and the origin has not been explicitly whitelisted by the recipient, the message can be removed to a spam folder and processed accordingly or deleted, depending on the rating and the recipient's settings. In some embodiments, different formulas may be used each time a message is rated.
  • the first time a message from an unknown sender is rated part of the criteria for rating the message may employ the number of messages recently sent by the unknown sender (if the unknown sender is a spammer, it is likely that he or she will send a high volume of messages in a short time period) .
  • a user or system administrator can set the time period (one hour, one day, etc.) which is checked.
  • the unknown sender's rating will have been established within the network and therefore the number of messages sent recently will not be as determinative of the message's rating as it previously was .
  • the frequency with which the Inbox and/or spam folder is reevaluated may be determined by the user or the system administrator.
  • computer viruses may be detected using an analogous approach.
  • attachments instead of only of identifying and keeping statistics about the origin (i.e., the sender or site sending the message), attachments are also identified, for instance by computing a checksum value of the attachment or using the name of the attachment, and statistics about the attachment identifier are kept at the central database. These statistics may be sent to a database by other users in the e-mail network or may be obtained in some other fashion, for instance by network software which tracks activity within the network.
  • Sample statistics used to assess the reputation of the attachment include: the number of unique senders of an attachment with a particular checksum/name of the attachment over a predetermined amount of time (for instance, the last 3 hours - this period of time may be set by the user or the system administrator) ; the average number of messages sent per sender over a predetermined amount of time (again set by the user or system administrator) ; the rate of growth of the number of messages with a particular checksum/name of the attachment; and the rate of growth of the number of unique senders sending messages with the particular checksum/name of the attachment. (Similar statistics for attachments sent from an origin (a sender or site) may also be tracked.) Other statistics and metrics may also be stored and used to determine whether a message is a virus.
  • the message can be marked as a virus and dealt with according to the user's preferences. For each of the statistics listed above employing a predetermined amount of time, this amount of time is arbitrary and should be set according to the user's or system administrator's needs .

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

La présente invention a trait à un procédé et un système de filtrage de courrier électronique qui classe en catégories des messages de courrier électronique reçus en fonction d'une information concernant l'origine contenue dans l'en-tête du message et la session de protocole de transfert de courrier simple employé pour l'envoi du message. Cette donnée et une autre information concernant le message est ensuite transmise par chaque membre d'un réseau de courrier électronique à une ou plusieurs base(s) de données centrale(s) qui mémorisent l'information et compilent de statistiques concernant des courriers électroniques transmis depuis l'origine pour indiquer la probabilité que le courrier électronique est non sollicité et déterminer la réputation de l'origine (une bonne réputation indique que l'expéditeur envoie des messages sollicités tandis qu'une mauvaise réputation indique que l'expéditeur envoie des messages de courrier électronique non sollicités). L'information en provenance de la base de données centrale est ensuite transmise aux destinataires (20) en vue de la détermination de la probabilité qu'un message de courrier électronique reçu est un courriel poubelle.
EP04718564A 2003-03-07 2004-03-08 Procede de filtrage de messages de courrier electronique Withdrawn EP1604293A2 (fr)

Applications Claiming Priority (13)

Application Number Priority Date Filing Date Title
US384278 1989-07-24
US10/384,278 US20040177120A1 (en) 2003-03-07 2003-03-07 Method for filtering e-mail messages
US682941 2003-10-09
US10/683,598 US7206814B2 (en) 2003-10-09 2003-10-09 Method and system for categorizing and processing e-mails
US10/685,090 US20050091320A1 (en) 2003-10-09 2003-10-09 Method and system for categorizing and processing e-mails
US683598 2003-10-09
US683951 2003-10-09
US10/683,599 US20050091319A1 (en) 2003-10-09 2003-10-09 Database for receiving, storing and compiling information about email messages
US683599 2003-10-09
US10/682,941 US7366761B2 (en) 2003-10-09 2003-10-09 Method for creating a whitelist for processing e-mails
US10/683,951 US20050080857A1 (en) 2003-10-09 2003-10-09 Method and system for categorizing and processing e-mails
US685090 2003-10-09
PCT/US2004/007034 WO2004081734A2 (fr) 2003-03-07 2004-03-08 Procede de filtrage de messages de courrier electronique

Publications (1)

Publication Number Publication Date
EP1604293A2 true EP1604293A2 (fr) 2005-12-14

Family

ID=32996640

Family Applications (1)

Application Number Title Priority Date Filing Date
EP04718564A Withdrawn EP1604293A2 (fr) 2003-03-07 2004-03-08 Procede de filtrage de messages de courrier electronique

Country Status (2)

Country Link
EP (1) EP1604293A2 (fr)
WO (1) WO2004081734A2 (fr)

Families Citing this family (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060015942A1 (en) 2002-03-08 2006-01-19 Ciphertrust, Inc. Systems and methods for classification of messaging entities
US7870203B2 (en) * 2002-03-08 2011-01-11 Mcafee, Inc. Methods and systems for exposing messaging reputation to an end user
US20030172291A1 (en) 2002-03-08 2003-09-11 Paul Judge Systems and methods for automated whitelisting in monitored communications
US8214438B2 (en) 2004-03-01 2012-07-03 Microsoft Corporation (More) advanced spam detection features
WO2005116851A2 (fr) * 2004-05-25 2005-12-08 Postini, Inc. Systeme de reputation d'informations de source de messages electroniques
FI123195B (fi) * 2004-11-22 2012-12-14 Mavenir Systems Oy Tietoliikenneverkkojen yli lähetettävien sanomien käsittely
FI121773B (fi) * 2005-06-13 2011-03-31 Deltagon Group Oy Menetelmä ja palvelin sähköpostiviestin lähettäjän autentikoimiseksi ja sähköpostiviestin välitystavan ilmaisemiseksi
US8010609B2 (en) 2005-06-20 2011-08-30 Symantec Corporation Method and apparatus for maintaining reputation lists of IP addresses to detect email spam
US7962766B2 (en) 2005-07-14 2011-06-14 Atmel Corporation Method and system for encryption-based design obfuscation for an integrated circuit
US7516184B2 (en) * 2005-11-22 2009-04-07 Cisco Technology, Inc. Method and system for a method for evaluating a message based in part on a registrar reputation
US8763114B2 (en) 2007-01-24 2014-06-24 Mcafee, Inc. Detecting image spam
US8214497B2 (en) 2007-01-24 2012-07-03 Mcafee, Inc. Multi-dimensional reputation scoring
US8756691B2 (en) * 2010-11-10 2014-06-17 Symantec Corporation IP-based blocking of malware

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5619648A (en) * 1994-11-30 1997-04-08 Lucent Technologies Inc. Message filtering techniques
US6453327B1 (en) * 1996-06-10 2002-09-17 Sun Microsystems, Inc. Method and apparatus for identifying and discarding junk electronic mail
US6275850B1 (en) * 1998-07-24 2001-08-14 Siemens Information And Communication Networks, Inc. Method and system for management of message attachments
US6330590B1 (en) * 1999-01-05 2001-12-11 William D. Cotten Preventing delivery of unwanted bulk e-mail
US6321267B1 (en) * 1999-11-23 2001-11-20 Escom Corporation Method and apparatus for filtering junk email
US6460050B1 (en) * 1999-12-22 2002-10-01 Mark Raymond Pace Distributed content identification system
US6691156B1 (en) * 2000-03-10 2004-02-10 International Business Machines Corporation Method for restricting delivery of unsolicited E-mail
US6757830B1 (en) * 2000-10-03 2004-06-29 Networks Associates Technology, Inc. Detecting unwanted properties in received email messages
US6769016B2 (en) * 2001-07-26 2004-07-27 Networks Associates Technology, Inc. Intelligent SPAM detection system using an updateable neural analysis engine

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See references of WO2004081734A2 *

Also Published As

Publication number Publication date
WO2004081734A2 (fr) 2004-09-23
WO2004081734A3 (fr) 2004-12-16

Similar Documents

Publication Publication Date Title
US7366761B2 (en) Method for creating a whitelist for processing e-mails
US7206814B2 (en) Method and system for categorizing and processing e-mails
US20050198159A1 (en) Method and system for categorizing and processing e-mails based upon information in the message header and SMTP session
US20050091319A1 (en) Database for receiving, storing and compiling information about email messages
US20050080857A1 (en) Method and system for categorizing and processing e-mails
US20050091320A1 (en) Method and system for categorizing and processing e-mails
US20040177120A1 (en) Method for filtering e-mail messages
US10699246B2 (en) Probability based whitelist
US7873695B2 (en) Managing connections and messages at a server by associating different actions for both different senders and different recipients
US8527592B2 (en) Reputation-based method and system for determining a likelihood that a message is undesired
EP1476819B1 (fr) Services de gestion de courrier electronique
US7945674B2 (en) Degrees of separation for handling communications
US8583787B2 (en) Zero-minute virus and spam detection
US7849142B2 (en) Managing connections, messages, and directory harvest attacks at a server
US20050076241A1 (en) Degrees of separation for handling communications
WO2004081734A2 (fr) Procede de filtrage de messages de courrier electronique
US7958187B2 (en) Systems and methods for managing directory harvest attacks via electronic messages

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

17P Request for examination filed

Effective date: 20051007

AK Designated contracting states

Kind code of ref document: A2

Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LI LU MC NL PL PT RO SE SI SK TR

AX Request for extension of the european patent

Extension state: AL LT LV MK

DAX Request for extension of the european patent (deleted)
RAP1 Party data changed (applicant data changed or rights of an application transferred)

Owner name: ABACA TECHNOLOGY CORPORATION

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION HAS BEEN WITHDRAWN

18W Application withdrawn

Effective date: 20090109