EP2811699B1 - System and method for spam filtering using shingles - Google Patents

System and method for spam filtering using shingles

Info

Publication number
EP2811699B1
Authority
EP
European Patent Office
Prior art keywords
shingles
spam
message
text
received message
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
EP13194359.9A
Other languages
German (de)
French (fr)
Other versions
EP2811699A1 (en)
Inventor
Sergey G. Zagorsky
Darya V. Loseva
Vladimir A. Skvortsov
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Kaspersky Lab AO
Original Assignee
Kaspersky Lab AO
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from RU2013125984/08A (RU2583713C2)
Application filed by Kaspersky Lab AO
Publication of EP2811699A1
Application granted
Publication of EP2811699B1
Legal status: Active
Anticipated expiration

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L51/00 User-to-user messaging in packet-switching networks, transmitted according to store-and-forward or real-time protocols, e.g. e-mail
    • H04L51/21 Monitoring or handling of messages
    • H04L51/212 Monitoring or handling of messages using filtering or selective blocking

Definitions

  • Computer system 5 further includes a plurality of serial ports 40, such as Universal Serial Bus (USB), for connecting data input device(s) 75, such as a keyboard, mouse, touch pad and others.
  • Serial ports 40 may also be used to connect data output device(s) 80, such as a printer, scanner and others, as well as other peripheral device(s) 85, such as external data storage devices and the like.
  • System 5 may also include graphics card 45, such as NVIDIA® GeForce® GT 240M or other video card, for interfacing with a monitor 60 or other video reproduction device.
  • System 5 may also include an audio card 50 for reproducing sound via internal or external speakers 65.
  • System 5 may include network card(s) 55, such as Ethernet, WiFi, GSM, Bluetooth or other wired, wireless, or cellular network interface for connecting computer system 5 to network 70, such as the Internet.
  • The systems and methods described herein may be implemented in hardware, software, firmware, or any combination thereof. If implemented in software, the methods may be stored as one or more instructions or code on a non-transitory computer-readable medium.
  • Computer-readable medium includes data storage.
  • Such computer-readable medium can comprise RAM, ROM, EEPROM, CD-ROM, Flash memory or other types of electric, magnetic, or optical storage medium, or any other medium that can be used to carry or store desired program code in the form of instructions or data structures and that can be accessed by a processor of a general purpose computer.
  • The term "module" means a real-world device, component, or arrangement of components implemented using hardware, such as by an application specific integrated circuit (ASIC) or field-programmable gate array (FPGA), for example, or as a combination of hardware and software, such as by a microprocessor system and a set of instructions to implement the module's functionality, which (while being executed) transform the microprocessor system into a special-purpose device.
  • A module can also be implemented as a combination of the two, with certain functions facilitated by hardware alone, and other functions facilitated by a combination of hardware and software.
  • A module can be executed on the processor of a general purpose computer (such as the one described in greater detail in Fig. 4 above). Accordingly, each module can be realized in a variety of suitable configurations, and should not be limited to any particular implementation exemplified herein.

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Information Transfer Between Computers (AREA)

Description

    Technical Field
  • The disclosure relates generally to the field of information security, and more specifically to systems and methods for spam filtering using shingles.
    Background
  • Advertising on the Internet is one of the cheapest forms of advertising. Spam is a mass mailing of advertising or other information to people who have not expressed a desire to receive it. Spam includes messages sent by electronic mail, instant messaging protocols, in social networks, blogs, dating sites and forums, as well as SMS and MMS messages. Spam messages have become the main and most large-scale form of advertising in the modern world, taking up around 70-90% of the total volume of global mail traffic.
  • Given the continual growth in volumes of spam mailing, problems of a technical, economic and criminal nature arise. The excess network traffic caused by spam messages may overload data transmission channels and network equipment, the review and handling of spam messages wastes users' time, and spam messages are used to perpetrate fraud and theft. These and other aspects show the acute need to combat spam.
  • Many methods exist for counteracting spam mailings. One of the most effective is the use of anti-spam applications, which detect, filter and remove unwanted spam messages. One of the key conditions for spam filtering is to avoid false spam detections, which may involve the blocking of legitimate messages. For example, the method using black lists, which essentially involves removing messages arriving from addresses contained in a black list, provides 100% filtering of messages from blacklisted addresses. However, when addresses of ordinary users mistakenly end up on the black list, a false spam detection may occur and legitimate messages may be filtered out and not delivered to their destination.
  • Another method of counteracting spam is using content filtering, which involves the use of special spam filters that analyze the constituent parts of messages, including graphics. From the results of the analysis, a lexical vector or a spam weight of the message may be calculated, which can be used to determine whether the message is spam or not.
  • Another spam detection technique is message clustering, which allows the detection in the mail flow of mass messages that are identical or that differ only slightly. The drawback of this method is that the majority of legitimate services, such as news subscription or update services, also employ mass mailing and, consequently, can be incorrectly recognized as sources of a spam mailing when this method is used.
  • Anti-spam laboratories are engaged in creating and improving the filtering rules used by spam filters. At the same time, the people engaged in spam mailing are constantly making attempts to bypass the protection of spam filters. The existing methods of counteracting spam have a number of shortcomings and cannot fully resolve the problem.
  • From US 2011/0055332 A1 a technique for efficiently determining similarity between documents is known. The technique comprises generating a set of reference data items by processing a reference document, and obtaining a similarity index representing similarity between a candidate document and the reference document by counting segments of the candidate document matching the reference data items. The candidate document is a message transmitted in a communication system where the message is compared against one or more reference documents representing unwanted messages.
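  • The counting idea behind such a similarity index can be illustrated with a minimal sketch. It is not the method of the cited application: the three-word segment size, the function names `segments` and `similarity_index`, and the normalization by the number of candidate segments are all assumptions made for illustration.

```python
def segments(text: str, size: int = 3) -> list[str]:
    """Split text into overlapping word segments of the given size."""
    words = text.lower().split()
    if len(words) < size:
        # Text shorter than one segment: treat it as a single segment.
        return [" ".join(words)] if words else []
    return [" ".join(words[i:i + size]) for i in range(len(words) - size + 1)]


def similarity_index(candidate: str, reference: str, size: int = 3) -> float:
    """Fraction of candidate segments that also occur in the reference."""
    reference_items = set(segments(reference, size))
    candidate_segments = segments(candidate, size)
    if not candidate_segments:
        return 0.0
    matches = sum(1 for s in candidate_segments if s in reference_items)
    return matches / len(candidate_segments)


# Hypothetical reference (known unwanted message) and candidate message.
reference = "buy cheap watches online today with free shipping"
candidate = "buy cheap watches online now with free shipping"
score = similarity_index(candidate, reference)
```

  • In this example a single changed word breaks three of the six candidate segments, so half of the candidate's segments still match the reference.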
  • Therefore, there is a need to improve spam detection techniques.
    Summary
  • Disclosed are a system, method and a computer program product for detecting spam using shingles as set forth in claims 1, 8 and 15. In one aspect, the system receives an electronic message including at least a text portion. The system identifies in the received message insignificant text portions. The system then removes identified insignificant text portions to generate an abridged message. The system then generates a set of shingles from the abridged message. The system then identifies in the generated set of shingles one or more shingles that occur only in messages not containing spam. The system then removes one or more identified shingles from the generated set of shingles to generate a reduced set of shingles. The system then performs spam filtering of the reduced set of shingles to determine whether the message contains spam.
  • The above simplified summary of example aspects serves to provide a basic understanding of the invention. This summary is not an extensive overview of all contemplated aspects, and is intended to neither identify key or critical elements of all aspects nor delineate the scope of any or all aspects of the invention. Its sole purpose is to present one or more aspects in a simplified form as a prelude to the more detailed description of the invention that follows. To the accomplishment of the foregoing, the one or more aspects of the invention include the features described and particularly pointed out in the claims.
    Brief Description of the Drawings
  • The accompanying drawings, which are incorporated into and constitute a part of this specification, illustrate one or more example aspects of the invention and, together with the detailed description, serve to explain their principles and implementations.
    • Fig. 1 illustrates examples of insignificant text in the form of signatures and automatic signatures.
    • Fig. 2 illustrates a diagram of an example system for spam filtering using shingles according to aspects of the present invention.
    • Fig. 3 illustrates a flow diagram of an example method of spam filtering using shingles according to aspects of the present invention.
    • Fig. 4 is a block diagram illustrating an example general-purpose computer system in accordance with aspects of the present invention.
    Detailed Description
  • Example aspects of the present invention are described herein in the context of system, method and computer program product for detection and filtering of spam messages using shingles. Those of ordinary skill in the art will realize that the following description is illustrative only and is not intended to be in any way limiting. Other aspects will readily suggest themselves to those skilled in the art having the benefit of this disclosure. Reference will now be made in detail to implementations of the example aspects as illustrated in the accompanying drawings. The same reference indicators will be used to the extent possible throughout the drawings and the following description to refer to the same or like items.
  • For example, an anti-spam application may process electronic mail messages using a message clustering technique, in which sets of objects are constructed from the text of the message. The anti-spam application then compares these sets with already known sets of objects containing spam and sets of objects not containing spam. Based on the reputation of the objects, the anti-spam application determines whether the analyzed message is spam. The objects used in the comparison may be words, phrasing, shingles, convolutions, and so on. As used herein, the term "shingle" refers to a contiguous sequence of symbols, letters or words from the analyzed message text, also known as an n-gram, for which, for example, a hash or control sum may be computed using known methods, and which is used to gauge the similarity of significant message text portions with shingles associated with spam messages.
  • In order to generate a set of shingles, the text of the analyzed message may first be divided into sequences of words. As known to those of ordinary skill in the art, the algorithm used for comparison of the shingles may depend on the method of counting the shingles and the length of each shingle. In the course of processing the message, the anti-spam application may identify certain text portions that do not have significance in identifying spam, but which are generally not removed when generating the shingles. One such insignificant text portion may, for example, be a signature, such as an auto-signature. An auto-signature is part of an electronic text message that contains information about the sender or any service information about the server or the message. Auto-signatures are typically generated and inserted in messages automatically. Fig. 1 shows examples of insignificant text portions, such as signatures and auto-signatures. Other examples of insignificant text may include, but are not limited to, parts of the message containing the mailing address and telephone number, the postscript, and so on, which are found in business correspondence and are of a formal nature. Removing such insignificant text makes it possible to get rid of parts of the message that have no significance in spam filtering, which increases the efficiency and accuracy of further message processing.
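  • A minimal sketch of shingle generation as word n-grams is given below. The description only says that "a hash or control sum" may be computed using known methods; MD5 is assumed here because the example shingles in Table 2 are 32 hexadecimal digits long. The function name `make_shingles` and the default shingle length of three words are likewise illustrative assumptions.

```python
import hashlib


def make_shingles(text: str, size: int = 3) -> set[str]:
    """Return MD5 hex digests of every run of `size` consecutive words."""
    words = text.split()
    if len(words) < size:
        # Text shorter than one shingle: hash whatever words exist.
        windows = [words] if words else []
    else:
        windows = [words[i:i + size] for i in range(len(words) - size + 1)]
    # MD5 is an assumption; any hash or checksum could be substituted.
    return {
        hashlib.md5(" ".join(window).encode("utf-8")).hexdigest()
        for window in windows
    }


# Five words with a window of three give three overlapping shingles.
shingles = make_shingles("win a free prize now", size=3)
```

  • Because the windows overlap, changing one word alters at most `size` shingles, which is what lets slightly modified mass mailings still share most of their shingles.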
  • Fig. 2 shows an example implementation of a system for spam filtering. The system 200 may be configured to remove shingles from insignificant parts of a message during spam filtering. The system 200 may be implemented in an anti-spam application. The system 200 may include a text processing module 220, a shingles processing module 230, a filtering module 260, a text pattern database 240 and a shingles database 250. The text processing module 220 is configured to receive text messages 210, search for insignificant parts of the text that are to be removed by analyzing the text of the message 210 for the presence of known insignificant text patterns kept in the text pattern database 240, remove the found insignificant text portions from the message 210, canonize the abridged message text and send the abridged message text to the shingles processing module 230. The shingles processing module 230 is configured to generate, using known methods, a set of shingles from the abridged message text, search for shingles that are to be removed by comparing the shingles from the generated set of shingles to the shingles stored in the shingles database 250, remove the shingles that match, and send the abridged set of shingles to the filtering module 260. The filtering module 260 may be configured to filter the abridged set of shingles by one of the filtering methods, such as the clustering method, the Bayes method, or other methods known to those of ordinary skill in the art. The text pattern database 240 is configured to store text patterns that occurred only in messages not containing spam and that are characteristic of insignificant parts of a message. The shingles database 250 is configured to store information on the shingles that occurred only in messages not containing spam and that are characteristic of insignificant parts of a message.
  • The insignificant text patterns stored in database 240 may include words, phrases, symbols or collections of words, phrases and symbols that occurred only in messages not containing spam. The distinguishing nature and significance of the text patterns are determined by expert appraisal and on the basis of statistical computations.
  • The canonization of text includes reducing the text of the abridged message to a unified standardized form. The text may be cleared of prepositions, conjunctions, punctuation marks, HTML tags, and other "garbage" that should not take part in the generation of shingles. Also during the canonization, substantives may be placed in the nominative case in the singular form, or reduced to their word roots.
  • For the text pattern database 240 and the shingles database 250, various types of databases can be used, such as: hierarchical (IMS, TDMS, System 2000), network (Cerebrum, Cronospro, DBVist), relational (DB2, Informix, Microsoft SQL Server), object-oriented (Jasmine, Versant, POET), object-relational (Oracle Database, PostgreSQL, FirstSQL/J), functional, etc.
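  • The description names the Bayes method as one option for the filtering module 260 without specifying it. The sketch below is one possible simplified naive Bayes scorer over shingles, not the patented filter: the class name, the "spam"/"ham" labels, the add-one smoothing and the choice to score only shingles present in the message are all assumptions.

```python
import math
from collections import Counter


class NaiveBayesShingleFilter:
    """Simplified naive Bayes over the shingles present in a message."""

    def __init__(self) -> None:
        # Number of training messages of each class containing each shingle.
        self.counts = {"spam": Counter(), "ham": Counter()}
        self.messages = {"spam": 0, "ham": 0}

    def train(self, shingles: set[str], label: str) -> None:
        self.counts[label].update(shingles)
        self.messages[label] += 1

    def spam_log_odds(self, shingles: set[str]) -> float:
        # Prior odds from class frequencies, with add-one smoothing.
        score = math.log((self.messages["spam"] + 1) / (self.messages["ham"] + 1))
        for shingle in shingles:
            # Smoothed probability that a message of each class contains it.
            p_spam = (self.counts["spam"][shingle] + 1) / (self.messages["spam"] + 2)
            p_ham = (self.counts["ham"][shingle] + 1) / (self.messages["ham"] + 2)
            score += math.log(p_spam / p_ham)
        return score

    def is_spam(self, shingles: set[str]) -> bool:
        return self.spam_log_odds(shingles) > 0.0


# Short strings stand in for shingle hashes to keep the example readable.
bayes = NaiveBayesShingleFilter()
bayes.train({"a", "b"}, "spam")
bayes.train({"a", "c"}, "spam")
bayes.train({"x", "y"}, "ham")
bayes.train({"x", "z"}, "ham")
```

  • Shingles that occur equally often in both classes contribute nothing to the score, which is one reason removing signature shingles before this stage reduces noise rather than changing the verdict on its own.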
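  • A minimal sketch of canonization under these rules follows. The stop-word list is a small hypothetical sample, since the description names only the word classes (prepositions, conjunctions), and the reduction of substantives to nominative singular or word roots is omitted because it would need a language-specific stemmer.

```python
import re

# Hypothetical stop list; a real one would be language-specific and larger.
STOP_WORDS = {
    "a", "an", "the", "and", "or", "but",
    "of", "in", "on", "at", "to", "for", "with",
}


def canonize(text: str) -> str:
    """Reduce message text to a normalized, space-separated word sequence."""
    text = re.sub(r"<[^>]+>", " ", text)   # drop HTML tags
    text = text.lower()                    # unify letter case
    text = re.sub(r"[^\w\s]", " ", text)   # drop punctuation marks
    words = [word for word in text.split() if word not in STOP_WORDS]
    return " ".join(words)


canonical = canonize("<p>Buy the BEST watches, and save!</p>")
```

  • Canonizing before shingling means that cosmetic changes such as extra punctuation or markup do not produce different shingles for otherwise identical text.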
  • For example, the text pattern database 240 may have the form of Table 1.

    Table 1
    ID of pattern   Text pattern     Text identification        Frequency
    1               best+regards     Auto-signature             7594
    2               st+«number»      Address                    51442
    ...             ...              ...                        ...
    n               Text pattern n   Identification of Text n   Frequency n
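  • The description does not define the syntax of the text patterns in Table 1. The sketch below assumes that "+" joins tokens that must appear consecutively and that «number» stands for a run of digits, and it further assumes that the insignificant portion to remove is the whole line containing a match. The names `pattern_to_regex` and `abridge` are illustrative.

```python
import re

# Rows modelled on Table 1: (text pattern, text identification).
TEXT_PATTERNS = [
    ("best+regards", "Auto-signature"),
    ("st+«number»", "Address"),
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a Table 1 style pattern under the assumed syntax."""
    parts = []
    for token in pattern.split("+"):
        if token == "«number»":
            parts.append(r"\d+")            # assumed: placeholder for digits
        else:
            parts.append(re.escape(token))  # literal word
    # Tokens must follow each other, separated by spaces or punctuation.
    return re.compile(r"\b" + r"\W+".join(parts) + r"\b", re.IGNORECASE)


COMPILED = [pattern_to_regex(pattern) for pattern, _ in TEXT_PATTERNS]


def abridge(message: str) -> str:
    """Drop every line that matches a known insignificant text pattern."""
    kept = [
        line for line in message.splitlines()
        if not any(regex.search(line) for regex in COMPILED)
    ]
    return "\n".join(kept)


message = "Please review the attached report.\nBest regards,\nJohn\nMain St. 42"
abridged = abridge(message)
```

  • Here the closing line and the address line are removed, while the body and the bare name remain; removing the name as well would need additional patterns or a rule about text following a signature marker.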
  • For example, the shingles database 250 may have the form of Table 2.

    Table 2
    ID of shingle   Shingle                            Text portion     Frequency
    1               1bc29b36f623ba82aaf6724fd3b16718   Auto-signature   88451
    2               026f8e459c8f89ef75fa7a78265a0025   Post script      54644
    ...             ...                                ...              ...
    m               Shingle m                          Text portion m   Frequency m
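  • As an illustration only, Table 2 can be modelled as a relational table and used to reduce a message's shingle set. SQLite, the table name `shingles`, the column names and the function `reduce_shingles` are assumptions; the description lists several database types without prescribing one.

```python
import sqlite3

# In-memory stand-in for the shingles database 250, laid out like Table 2.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE shingles ("
    "id INTEGER PRIMARY KEY, shingle TEXT UNIQUE, "
    "text_portion TEXT, frequency INTEGER)"
)
conn.executemany(
    "INSERT INTO shingles (id, shingle, text_portion, frequency) "
    "VALUES (?, ?, ?, ?)",
    [
        (1, "1bc29b36f623ba82aaf6724fd3b16718", "Auto-signature", 88451),
        (2, "026f8e459c8f89ef75fa7a78265a0025", "Post script", 54644),
    ],
)


def reduce_shingles(conn: sqlite3.Connection, shingles: set[str]) -> set[str]:
    """Drop every shingle that has a row in the shingles table."""
    reduced = set()
    for shingle in shingles:
        row = conn.execute(
            "SELECT 1 FROM shingles WHERE shingle = ?", (shingle,)
        ).fetchone()
        if row is None:
            reduced.add(shingle)
    return reduced


# One known auto-signature shingle plus one made-up unknown shingle.
message_shingles = {
    "1bc29b36f623ba82aaf6724fd3b16718",
    "ffffffffffffffffffffffffffffffff",
}
reduced = reduce_shingles(conn, message_shingles)
```

  • The UNIQUE constraint gives the lookup column an index, so each membership check stays cheap even when the table holds many shingles.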
  • The information for text pattern database 240 and the shingles database 250, including text patterns and shingles from messages not containing spam sent from legitimate users, may be provided from a remote server 270 of an antivirus company.
  • Fig. 3 shows an example method 300 of spam filtering. At step 310, an incoming electronic message is received by the text processing module 220. At step 320, the text processing module 220 performs a search for insignificant parts of the text that are to be removed by analyzing the message text for presence of text patterns kept in the text pattern database 240. At step 330, the text processing module 220 removes parts of the message in which known text patterns were found from the text pattern database 240, and sends an abridged message text to the shingles processing module 230. Also in step 330, the text processing module 220 may perform canonization of the text of the abridged message if necessary. At step 340, the shingles processing module 230 generates a set of shingles from on the abridged message text. At step 350, the shingles processing module 230 performs a search for shingles to be removed by comparing the shingles from the generated set of shingles to known shingles that are kept in the shingles database 250. At step 360, the shingles processing module 230 removes the matching shingles and sends the abridged set of shingles to the filtering module 260. At step 370, the filtering module 260 performs a filtering of the abridged set of shingles by one of the filtering methods, such as the clustering method, the Bayes method, and so on.
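  • The steps of method 300 can be sketched end to end as follows. This is an illustrative outline under stated assumptions, not the patented implementation: the single regular expression stands in for the text pattern database 240, plain Python sets stand in for the shingles database 250 and for known spam shingles, MD5 over two-word windows is an assumed shingle format, and step 370 is replaced by a simple overlap threshold rather than a real clustering or Bayes filter.

```python
import hashlib
import re

# Hypothetical stand-in for the text pattern database 240.
INSIGNIFICANT_PATTERNS = [re.compile(r"\bbest\W+regards\b", re.IGNORECASE)]


def shingle_hashes(text: str, size: int = 2) -> set[str]:
    """Hash every run of `size` consecutive words (MD5 is an assumption)."""
    words = text.split()
    windows = [words[i:i + size] for i in range(max(len(words) - size + 1, 0))]
    return {hashlib.md5(" ".join(w).encode("utf-8")).hexdigest() for w in windows}


def filter_message(message: str, clean_shingles: set[str],
                   spam_shingles: set[str], threshold: float = 0.5) -> bool:
    """Return True if the message is judged to be spam."""
    # Steps 320-330: drop lines matching a known insignificant pattern.
    lines = [
        line for line in message.splitlines()
        if not any(p.search(line) for p in INSIGNIFICANT_PATTERNS)
    ]
    # Step 330 (optional): canonize by lowercasing and stripping punctuation.
    abridged = re.sub(r"[^\w\s]", " ", " ".join(lines).lower())
    # Step 340: generate shingles from the abridged text.
    shingles = shingle_hashes(abridged)
    # Steps 350-360: remove shingles known only from non-spam messages.
    reduced = shingles - clean_shingles
    # Step 370: placeholder filter, share of remaining shingles seen in spam.
    if not reduced:
        return False
    return len(reduced & spam_shingles) / len(reduced) >= threshold


# Hypothetical reference data and an incoming message (step 310).
spam_shingles = shingle_hashes("buy cheap watches now")
clean_shingles = shingle_hashes("sent from my phone")
message = "Buy cheap watches now!\nSent from my phone\nBest regards, Bob"
verdict = filter_message(message, clean_shingles, spam_shingles)
```

  • In this example the message is flagged only because the "sent from my phone" shingles are removed first: with them left in, the spam shingles would be 3 of 7 and fall below the threshold, whereas after removal they are 3 of 4.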
  • Fig. 4 depicts an example configuration of a general-purpose computer 5 that can be used to implement the disclosed system and methods for spam filtering. The computer system 5 may include, but is not limited to, a personal computer, a notebook, a tablet computer, a smart phone, a network server, a router, or another type of computing device. As shown, computer system 5 may include one or more hardware processors 15, system memory 20, one or more hard disk drive(s) 30, optical drive(s) 35, serial port(s) 40, graphics card 45, audio card 50 and network card(s) 55 connected by system bus 10. System bus 10 may be any of several types of bus structures including a memory bus or memory controller, a peripheral bus and a local bus using any of a variety of known bus architectures. Processor 15 may include one or more Intel® Core 2 Quad 2.33 GHz processors or another type of microprocessor.
  • System memory 20 may include a read-only memory (ROM) 21 and random access memory (RAM) 23. Memory 20 may be implemented as DRAM (dynamic RAM), EPROM, EEPROM, Flash or another type of memory architecture. ROM 21 stores a basic input/output system 22 (BIOS), containing the basic routines that help to transfer information between the components of computer system 5, such as during start-up. RAM 23 stores operating system 24 (OS), such as Windows® XP Professional or another type of operating system, that is responsible for management and coordination of processes and allocation and sharing of hardware resources in computer system 5. Memory 20 also stores applications and programs 25. Memory 20 also stores various runtime data 26 used by programs 25.
  • Computer system 5 may further include hard disk drive(s) 30, such as SATA HDD, and optical disk drive(s) 35 for reading from or writing to a removable optical disk, such as a CD-ROM, DVD-ROM or other optical media. Drives 30 and 35 and their associated computer-readable media provide non-volatile storage of computer readable instructions, data structures, applications and program modules/subroutines that implement algorithms and methods disclosed herein. Although the exemplary computer system 5 employs magnetic and optical disks, it should be appreciated by those skilled in the art that other types of computer readable media that can store data accessible by a computer system 5, such as magnetic cassettes, flash memory cards, digital video disks, RAMs, ROMs, EPROMs and other types of memory may also be used in alternative aspects of the computer system 5.
  • Computer system 5 further includes a plurality of serial ports 40, such as Universal Serial Bus (USB), for connecting data input device(s) 75, such as a keyboard, mouse, touch pad and others. Serial ports 40 may also be used to connect data output device(s) 80, such as a printer, scanner and others, as well as other peripheral device(s) 85, such as external data storage devices and the like. System 5 may also include graphics card 45, such as nVidia® GeForce® GT 240M or other video card, for interfacing with a monitor 60 or other video reproduction device. System 5 may also include an audio card 50 for reproducing sound via internal or external speakers 65. In addition, system 5 may include network card(s) 55, such as Ethernet, WiFi, GSM, Bluetooth or other wired, wireless, or cellular network interface for connecting computer system 5 to network 70, such as the Internet.
  • In various aspects, the systems and methods described herein may be implemented in hardware, software, firmware, or any combination thereof. If implemented in software, the methods may be stored as one or more instructions or code on a non-transitory computer-readable medium. Computer-readable medium includes data storage. By way of example, and not limitation, such computer-readable medium can comprise RAM, ROM, EEPROM, CD-ROM, Flash memory or other types of electric, magnetic, or optical storage medium, or any other medium that can be used to carry or store desired program code in the form of instructions or data structures and that can be accessed by a processor of a general purpose computer.
  • In various aspects, the systems and methods are described in the present disclosure in terms of modules. The term "module" as used herein means a real-world device, component, or arrangement of components implemented using hardware, such as by an application specific integrated circuit (ASIC) or field-programmable gate array (FPGA), for example, or as a combination of hardware and software, such as by a microprocessor system and a set of instructions to implement the module's functionality, which (while being executed) transform the microprocessor system into a special-purpose device. A module can also be implemented as a combination of the two, with certain functions facilitated by hardware alone, and other functions facilitated by a combination of hardware and software. In certain implementations, at least a portion, and in some cases, all, of a module can be executed on the processor of a general purpose computer (such as the one described in greater detail in Fig. 4 above). Accordingly, each module can be realized in a variety of suitable configurations, and should not be limited to any particular implementation exemplified herein.
  • In the interest of clarity, not all of the routine features of the aspects are disclosed herein. It will be appreciated that in the development of any actual implementation of the invention, numerous implementation-specific decisions must be made in order to achieve the developer's specific goals, and that these specific goals will vary for different implementations and different developers. It will be appreciated that such a development effort might be complex and time-consuming, but would nevertheless be a routine undertaking of engineering for those of ordinary skill in the art having the benefit of this disclosure.
  • Furthermore, it is to be understood that the phraseology or terminology used herein is for the purpose of description and not of restriction, such that the terminology or phraseology of the present specification is to be interpreted by those skilled in the art in light of the teachings and guidance presented herein, in combination with the knowledge of those skilled in the relevant art(s). Moreover, it is not intended for any term in the specification or claims to be ascribed an uncommon or special meaning unless explicitly set forth as such.
  • The various aspects disclosed herein encompass present and future known equivalents to the known components referred to herein by way of illustration. Moreover, while aspects and applications have been shown and described, it would be apparent to those skilled in the art having the benefit of this disclosure that many more modifications than mentioned above are possible without departing from the inventive concepts disclosed herein.

Claims (15)

  1. A computer-implemented method for detecting spam, the method comprising:
    receiving an electronic message, the message including at least a text portion;
    identifying in the received message one or more insignificant text portions;
    removing one or more identified insignificant text portions from the message to generate an abridged message;
    canonizing text of the abridged message into a unified standardized form;
    generating a set of shingles from the abridged and canonized message;
    identifying in the generated set of shingles one or more shingles that occur only in messages not containing spam;
    removing the one or more identified shingles from the generated set of shingles to generate a reduced set of shingles; and
    performing spam filtering of the reduced set of shingles to determine whether the received message contains spam.
  2. The method of claim 1, wherein words, phrases, symbols that occur only in messages not containing spam are stored in a text pattern data store.
  3. The method of claim 2 further comprising:
    collecting from known users messages not containing spam;
    identifying in the collected messages insignificant text patterns and shingles;
    storing the collected insignificant text patterns and shingles in a data store.
  4. The method of claim 3, wherein identifying in the received message one or more insignificant text portions, includes comparing the plurality of insignificant text patterns identified in the received message with the insignificant text patterns stored in the data store.
  5. The method of claim 3, wherein identifying in the received message one or more shingles that occur only in messages not containing spam, includes comparing the plurality of shingles generated for the received message with the shingles stored in the data store.
  6. The method of claim 1, wherein performing spam filtering of the reduced set of shingles to determine whether the received message contains spam includes performing Bayes spam filtering of the reduced set of shingles.
  7. The method of claim 1, wherein performing spam filtering of the reduced set of shingles to determine whether the received message contains spam includes performing cluster filtering of the reduced set of shingles.
  8. A computer system (5) for detecting spam, the system (5) comprising:
    a processor (15) configured to:
    receive an electronic message, the message including at least a text portion;
    identify in the received message one or more insignificant text portions;
    remove one or more identified insignificant text portions from the message to generate an abridged message;
    canonize text of the abridged message into a unified standardized form;
    generate a set of shingles from the abridged and canonized message;
    identify in the generated set of shingles one or more shingles that occur only in messages not containing spam;
    remove the one or more identified shingles from the generated set of shingles to generate a reduced set of shingles; and
    perform spam filtering of the reduced set of shingles to determine whether the received message contains spam.
  9. The system (5) of claim 8, wherein a text pattern data store stores words, phrases, symbols that occur only in messages not containing spam.
  10. The system (5) of claim 9, wherein the processor (15) is further configured to:
    collect from known users messages not containing spam;
    identify in the collected messages insignificant text patterns and shingles;
    store the collected insignificant text patterns and shingles in a data store.
  11. The system (5) of claim 10, wherein to identify in the received message one or more insignificant text portions, the processor (15) is further configured to compare the plurality of insignificant text patterns identified in the received message with the insignificant text patterns stored in the data store.
  12. The system (5) of claim 10, wherein to identify in the received message one or more shingles that occur only in messages not containing spam, the processor (15) is further configured to compare the plurality of shingles generated for the received message with the shingles stored in the data store.
  13. The system (5) of claim 8, wherein to perform spam filtering of the reduced set of shingles to determine whether the received message contains spam, the processor (15) is further configured to perform Bayes spam filtering of the reduced set of shingles.
  14. The system (5) of claim 8, wherein to perform spam filtering of the reduced set of shingles to determine whether the received message contains spam, the processor (15) is further configured to perform cluster filtering of the reduced set of shingles.
  15. A computer program product comprising one or more instructions for performing all the steps of any of claims 1 to 7.
EP13194359.9A 2013-06-06 2013-11-26 System and method for spam filtering using shingles Active EP2811699B1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
RU2013125984/08A RU2583713C2 (en) 2013-06-06 2013-06-06 System and method of eliminating shingles from insignificant parts of messages when filtering spam
US14/069,889 US8996638B2 (en) 2013-06-06 2013-11-01 System and method for spam filtering using shingles

Publications (2)

Publication Number Publication Date
EP2811699A1 EP2811699A1 (en) 2014-12-10
EP2811699B1 EP2811699B1 (en) 2015-11-25

Family

ID=49674193

Family Applications (1)

Application Number Title Priority Date Filing Date
EP13194359.9A Active EP2811699B1 (en) 2013-06-06 2013-11-26 System and method for spam filtering using shingles

Country Status (1)

Country Link
EP (1) EP2811699B1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9954804B2 (en) 2015-07-30 2018-04-24 International Business Machines Corporation Method and system for preemptive harvesting of spam messages

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040073617A1 (en) * 2000-06-19 2004-04-15 Milliken Walter Clark Hash-based systems and methods for detecting and preventing transmission of unwanted e-mail
US20080219495A1 (en) * 2007-03-09 2008-09-11 Microsoft Corporation Image Comparison
US9123027B2 (en) * 2010-10-19 2015-09-01 QinetiQ North America, Inc. Social engineering protection appliance
US8874663B2 (en) * 2009-08-28 2014-10-28 Facebook, Inc. Comparing similarity between documents for filtering unwanted documents

Also Published As

Publication number Publication date
EP2811699A1 (en) 2014-12-10

Similar Documents

Publication Publication Date Title
US10817603B2 (en) Computer security system with malicious script document identification
CN109271512B (en) Emotion analysis method, device and storage medium for public opinion comment information
US9692762B2 (en) Systems and methods for efficient detection of fingerprinted data and information
US8738721B1 (en) System and method for detecting spam using clustering and rating of E-mails
CN103336766B (en) Short text garbage identification and modeling method and device
JP5990284B2 (en) Spam detection system and method using character histogram
CA2859135C (en) System and methods for spam detection using frequency spectra of character strings
CN111586695B (en) Short message identification method and related equipment
Sajedi et al. Sms spam filtering using machine learning techniques: A survey
CN113806370B (en) Environmental data supervision method, device, equipment and storage medium based on big data
US9391936B2 (en) System and method for spam filtering using insignificant shingles
WO2023035362A1 (en) Polluted sample data detecting method and apparatus for model training
CN113343228B (en) Event credibility analysis method and device, electronic equipment and readable storage medium
CN108462624A (en) A kind of recognition methods of spam, device and electronic equipment
EP2811699B1 (en) System and method for spam filtering using shingles
Prilepok et al. Spam detection using data compression and signatures
CN114021116B (en) Construction method of homologous analysis knowledge base, homologous analysis method and device
CN113254672B (en) Method, system, equipment and readable storage medium for identifying abnormal account
CN114398887A (en) Text classification method and device and electronic equipment
CN111507368B (en) Campus network intrusion detection method and system
CN115842645A (en) UMAP-RF-based network attack traffic detection method and device and readable storage medium
AbuAlghanam et al. Android Malware Detection System Based on Ensemble Learning
CN113691525A (en) Traffic data processing method, device, equipment and storage medium
CN105095752B (en) The recognition methods of viral data packet, apparatus and system
CN112073360A (en) Detection method, device, terminal equipment and medium for hypertext transmission data

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

17P Request for examination filed

Effective date: 20131126

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR

AX Request for extension of the european patent

Extension state: BA ME

GRAP Despatch of communication of intention to grant a patent

Free format text: ORIGINAL CODE: EPIDOSNIGR1

INTG Intention to grant announced

Effective date: 20150702

GRAS Grant fee paid

Free format text: ORIGINAL CODE: EPIDOSNIGR3

GRAA (expected) grant

Free format text: ORIGINAL CODE: 0009210

REG Reference to a national code

Ref country code: FR

Ref legal event code: PLFP

Year of fee payment: 3

AK Designated contracting states

Kind code of ref document: B1

Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR

REG Reference to a national code

Ref country code: GB

Ref legal event code: FG4D

REG Reference to a national code

Ref country code: CH

Ref legal event code: EP

REG Reference to a national code

Ref country code: AT

Ref legal event code: REF

Ref document number: 763070

Country of ref document: AT

Kind code of ref document: T

Effective date: 20151215

REG Reference to a national code

Ref country code: IE

Ref legal event code: FG4D

REG Reference to a national code

Ref country code: DE

Ref legal event code: R096

Ref document number: 602013003954

Country of ref document: DE

REG Reference to a national code

Ref country code: LT

Ref legal event code: MG4D

REG Reference to a national code

Ref country code: NL

Ref legal event code: MP

Effective date: 20160225

REG Reference to a national code

Ref country code: AT

Ref legal event code: MK05

Ref document number: 763070

Country of ref document: AT

Kind code of ref document: T

Effective date: 20151125

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: LT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20151125

Ref country code: ES

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20151125

Ref country code: NO

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20160225

Ref country code: IS

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20160325

Ref country code: NL

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20151125

Ref country code: HR

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20151125

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: AT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20151125

Ref country code: GR

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20160226

Ref country code: BE

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20151130

Ref country code: FI

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20151125

Ref country code: PL

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20151125

Ref country code: RS

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20151125

Ref country code: SE

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20151125

Ref country code: LV

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20151125

Ref country code: PT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20160325

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: IT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20151125

Ref country code: CZ

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20151125

REG Reference to a national code

Ref country code: IE

Ref legal event code: MM4A

REG Reference to a national code

Ref country code: DE

Ref legal event code: R097

Ref document number: 602013003954

Country of ref document: DE

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: EE

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20151125

Ref country code: RO

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20151125

Ref country code: SM

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20151125

Ref country code: DK

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20151125

Ref country code: SK

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20151125

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: MC

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20151125

PLBE No opposition filed within time limit

Free format text: ORIGINAL CODE: 0009261

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT

REG Reference to a national code

Ref country code: FR

Ref legal event code: PLFP

Year of fee payment: 4

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: IE

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20151126

26N No opposition filed

Effective date: 20160826

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: SI

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20151125

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: BE

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20151125

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: BG

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20151125

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: HU

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT; INVALID AB INITIO

Effective date: 20131126

Ref country code: CY

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20151125

REG Reference to a national code

Ref country code: CH

Ref legal event code: PL

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: LI

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20161130

Ref country code: CH

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20161130

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: MT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20151125

REG Reference to a national code

Ref country code: FR

Ref legal event code: PLFP

Year of fee payment: 5

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: LU

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20151126

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: MK

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20151125

REG Reference to a national code

Ref country code: FR

Ref legal event code: PLFP

Year of fee payment: 6

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: AL

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20151125

Ref country code: TR

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20151125

REG Reference to a national code

Ref country code: DE

Ref legal event code: R079

Ref document number: 602013003954

Country of ref document: DE

Free format text: PREVIOUS MAIN CLASS: H04L0012580000

Ipc: H04L0051000000

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: FR

Payment date: 20230929

Year of fee payment: 11

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: GB

Payment date: 20231006

Year of fee payment: 11

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: DE

Payment date: 20230929

Year of fee payment: 11