WO2001015405A2 - Systeme et procede d'etablissement de profils sur internet - Google Patents

Systeme et procede d'etablissement de profils sur internet Download PDF

Info

Publication number
WO2001015405A2
WO2001015405A2 PCT/IB2000/001159 IB0001159W WO0115405A2 WO 2001015405 A2 WO2001015405 A2 WO 2001015405A2 IB 0001159 W IB0001159 W IB 0001159W WO 0115405 A2 WO0115405 A2 WO 0115405A2
Authority
WO
WIPO (PCT)
Prior art keywords
individual
previously identified
internet
information
unknown
Prior art date
Application number
PCT/IB2000/001159
Other languages
English (en)
Other versions
WO2001015405A3 (fr
Inventor
Yaron Buznach
Original Assignee
Adwise Ltd.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Adwise Ltd. filed Critical Adwise Ltd.
Priority to AU65867/00A priority Critical patent/AU6586700A/en
Publication of WO2001015405A2 publication Critical patent/WO2001015405A2/fr
Publication of WO2001015405A3 publication Critical patent/WO2001015405A3/fr

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/50Network services
    • H04L67/535Tracking the activity of the user
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L69/00Network arrangements, protocols or services independent of the application payload and not provided for in the other groups of this subclass
    • H04L69/30Definitions, standards or architectural aspects of layered protocol stacks
    • H04L69/32Architecture of open systems interconnection [OSI] 7-layer type protocol stacks, e.g. the interfaces between the data link level and the physical level
    • H04L69/322Intralayer communication protocols among peer entities or protocol data unit [PDU] definitions
    • H04L69/329Intralayer communication protocols among peer entities or protocol data unit [PDU] definitions in the application layer [OSI layer 7]

Definitions

  • An Internet profiling method and system for client/user identification by statistical examination of streaming data
  • This invention relates, in general, to a method and system for identifying an Internet or network user who is engaged in repetitive Internet or network inter-sessions. More specifically it relates to an expandable method for a network device, which identifies and associates Internet or network inter-sessions conducted by a user using information sent/received to/from the user's computer to/from any remote host on the Internet or the network.
  • the method is adaptable to many Internet and network users.
  • Identifying an Internet user has become a major issue since the Internet became a media for advertisement and targeted content.
  • Remote content suppliers e.g., web sites
  • the web sites may use resources on the client computer, e.g., cookies or client software, and second, the web sites may ask the client to identify himself/herself when the client asks for the content. These two methods are commonly used in client server interactions.
  • clients connect to a public network, e.g., the Internet, through an ISP (Internet Service Provider).
  • ISP Internet Service Provider
  • the ISP identifies its clients by their login information, e.g., username and password.
  • Another method for the ISP to identify a client is to track their unique IP address, which is assigned to a client after login. The first method allows the ISP to associate a current user Internet session to a specific user and can
  • C0NFIRMATI0N COPY create a user log file.
  • the second method provides the ISP with a way to track the client Internet session in real time until the client disconnects. After disconnecting, the LP address is taken from the client and generally given to another client, so this method is effective only for the current Internet session.
  • the system and method presented herein allows for identifying Internet users by using information extracted from the users' network sessions and from the users' past activity pattern.
  • the system and method allows associating various Internet inter-sessions that are conducted by the same user.
  • the system and method is not dependent on login information or any other registry information.
  • a method and system for determining the identity of an unknown individual participating in a network session is provided.
  • a database is maintained which includes records of each previously identified individual.
  • the records include unique strings associated with each individual which were extracted from prior network sessions of that individual.
  • the data stream of information transmitted to and from that individual is read, and known data elements are identified in the information.
  • a subset of information, which includes at least one unique string associated with a known data element, is extracted from the data stream.
  • the subset of information is analyzed to determine if the individual participating in the current network session is a previously identified individual from the database. The analysis includes comparing the unique strings extracted for that individual with unique strings associated with previously identified individuals.
  • One object of the present invention is to provide a generic, intelligent point which associates and identifies Internet sessions conducted by the same user using information and patterns, or a fingerprint, extracted from prior user sessions.
  • a further object of the present invention is to provide to an ISP a statistical method for monitoring and associating an anonymous user participating in a current Internet session using only the streaming data passing to/form the user to/from a remote server and a database of past Internet session information to compare to this streaming data.
  • Another object of the present invention is to provide a way for an ISP to share information with clients and remote content suppliers, e.g., web sites.
  • the invention accordingly comprises the several steps and relation of one or more of such steps with respect to each of the others, and the system embodying features of construction, combinations of elements and arrangements of parts which are adapted to effect such steps, all as exemplified in the following detailed disclosure, and the scope of the invention will be indicated in the claims.
  • FIG. 1 is a flowchart representation of a typical global computer network in accordance with the prior art
  • FIG. 2 is a flowchart representation of a global computer network in accordance with a preferred embodiment of the present invention
  • FIG. 3 is a detailed flowchart representation of the profiling system of FIG. 2 constructed in accordance with the present invention
  • FIG. 4 is a flowchart representation depicting the steps performed during the profiling and identifying process according to a preferred embodiment of the present invention.
  • FIG. 1 depicts a typical ISP junction in accordance with the prior art.
  • the main ISP site generally indicated at 10 includes an ISP access device 18 which allows, for example, a dial-in access through a modem or the like as indicated at 14, direct access through a router or any other communication means, generally indicated at 16, thereby enabling a client 12, or a network 13 of clients 12a, 12b, 12c to connect to ISP junction 10.
  • the site also includes a hub 22, a domain name server (DNS) 20, client access control such as a Radius 24, an e-mail server 25, hosted servers 26, and a router 30 which connects the ISP junction to a global computer network such as Internet 32.
  • DNS domain name server
  • ISP devices are connected together via a network such as a local area network (LAN).
  • LAN local area network
  • ISP network configurations can be used with the present invention.
  • the arrangement and set up of such configurations are well known to those skilled in the art.
  • the present invention as described below in detail can be used in conjunction with any of these possible configurations.
  • Each client 12 is generally a computer such as a PC or laptop with video and audio capabilities, having a processor and programs or applications associated therewith.
  • Internet 32 is a networked collection of clients and servers which are adapted through software and communication links to communicate with one another. The clients, typically through a browser program, can send a request message to a server and await a response. The response is displayed or presented by the browser.
  • FIG. 2 depicts the network configuration of FIG. 1 in which a profiling system, in the form of a session identifier, generally indicated at 40, and arranged and constructed in accordance with the present invention, has been installed.
  • a profiling system in the form of a session identifier, generally indicated at 40, and arranged and constructed in accordance with the present invention, has been installed.
  • session identifier 40 is provided in ISP junction 10 in this example
  • the profiling system and method desc ⁇ bed herein may be used at other points on the network, such as at the hub of a web site.
  • FIG 3 depicts a flowchart representation of the profiling system 40 of FIG. 2 arranged and constructed in accordance with the present invention.
  • the profiling system preferably includes the following modules: system administrator 104, which enables a system operator to set the profiler policy, storage medium 100 including a database, which stores the profiling records, and sniffer 102, which collect the streaming data passing through hub 22 of the ISP.
  • the profiling process includes the following steps. First, the system extracts information from an Internet session and creates a session fingerprint pattern based on the user browsing activity and data stream. Second, the system tnes to match previously created profile records in a database to the current browsing session. At this stage, the current Internet session may be identified by its umque IP address.
  • FIG. 4 depicts a flowchart of the profiling and identifying process.
  • Task 110 reads the entire data stream passing through hub 22.
  • the sniffer 102 distinguishes between different current user Internet sessions by extracting the umque IP address which is assigned to each client while connecting to the Internet.
  • Task 112 extracts information from the Internet session data stream.
  • the Internet session fingerprint pattern is a collection of umque st ⁇ ngs which the user sends/receives to/from Internet servers during the Internet session.
  • the umque st ⁇ ngs are embedded m the application protocols stream, e.g., HTTP, SMTP, etc.
  • the profiling system uses a profiling policy.
  • the profiling policy includes known data elements to look for and additionally determines the sources from which the umque strmg will be extracted.
  • the source includes, but is not limited to: HTTP URL and domain, e.g., private home page, which includes special URL strings, user SMTP connection streams, cookies sent by an Internet host to a client, and information about the client computer and browser (e.g., OS type and version, browser type and version).
  • HTTP URL and domain e.g., private home page, which includes special URL strings
  • user SMTP connection streams e.g., user SMTP connection streams
  • cookies sent by an Internet host to a client e.g., OS type and version, browser type and version
  • Task 114 analyzes and classifies the extracted umque st ⁇ ngs.
  • the task stores the unique st ⁇ ngs in patterns accordmg to the profiler policy.
  • Task 118 creates and updates a profile record in the database for an Internet session conducted by a user
  • the task assigns a umque se ⁇ al number to the record in order to identify it later on
  • the record contains a group of umque stnngs which was extracted from the user Internet session and later will serve as a reference
  • Task 122 saves the profile record in the database of storage medium 100
  • Task 116 checks if the umque strings extracted from the Internet session match one of the profile records stored in the database contained m the storage medium 100. If a match is found, the Internet session is assigned, in task 120, to the matched profile The match will be dropped when another better match is found and contradicts the first match which was found m the current Internet session.
  • the profiling system uses two tables for the profiling process, Current Session Table and Reference Session table
  • Current Session Table stores information about an Internet session in real time Each session is identified by its umque LP address
  • An example of the fields in this table is set forth below
  • the table can store any information passmg to/from the Internet user du ⁇ ng an Internet session
  • the table fields are determined by the profiling policy.
  • the Reference Session table stores information about previous Internet sessions The information stored is used to associate current sessions with previous sessions This is done by compa ⁇ ng umque keywords which are stored in the table fields.
  • the table includes all the fields that are used in the Current Session Table except the LP address which is replaced by the umque se ⁇ al number for the user The table is updated every time a new umque subset of information is found and associates to a previous session
  • the profiling process occurs at the ISP site where the profiling system can read the information passmg from/to the Internet user
  • An Example of the profiling process is set forth below:
  • Client connects to the ISP and gets a unique EP address.
  • the profiling system extracts information from the browsing stream and stores it in the Cu ⁇ ent Session Table using the LP address as an Internet session ED.
  • the Profiling system tries to find a match between a Current Session Table Entry and a Reference Session Table Entry.
  • the profiling system continues to extract information until a match is found or creates a new record in the database.
  • a method and system for determining the identity of an unknown individual participating in a network session is provided.
  • a database of records of each previously identified individual is maintained.
  • the records include unique strings associated with individuals which were extracted from prior network sessions.
  • the data stream of information transmitted to and from that individual is read by the sniffer, and known data elements are identified in the information.
  • a subset of information, which includes at least one unique string associated with a known data element, is extracted from the data stream.
  • the subset of information is then analyzed by comparing the information extracted from the current network session to information stored in the database to determine if the individual participating in the network session is a previously identified individual.
  • the analysis generally includes comparing the unique strings extracted for that individual with unique strings associated with previously identified individuals in the database.
  • the individual is determined to be a previously identified individual, his or her identity is matched with the identity of the previously identified individual, and the record in the database of the previously identified individuals is updated with the new subset of information extracted from the cu ⁇ ent network session. Otherwise, if the individual is not identified, his or her identity is set as a new individual, and a new record is created in the database for the new individual which includes the subset of information extracted from the current network session.
  • the system and method according to the present invention allows identifying and associating a user's previous Internet sessions with the user's current one.
  • the current session data stream is used to create the user's Internet session fingerprint pattern, and includes unique strings extracted from the data stream.
  • the fingerprint pattern for a current session is compared with fingerprint pattems from previous sessions maintained in a database to associate Internet inter-session data streams conducted by the same user at different times.

Abstract

L'invention concerne un système et un procédé d'établissement de profils sur Internet permettant d'identifier et d'associer les sessions Internet précédentes d'un utilisateur à la session en cours de cet utilisateur. Un flux de données de session sert à créer une empreinte digitale de session Internet comprenant des chaînes uniques. L'empreinte digitale d'une session en cours est comparée aux empreintes digitales conservées dans une base de données de façon à associer les flux de données intersessions Internet générés par le même utilisateur à des moments différents.
PCT/IB2000/001159 1999-08-23 2000-08-23 Systeme et procede d'etablissement de profils sur internet WO2001015405A2 (fr)

Priority Applications (1)

Application Number Priority Date Filing Date Title
AU65867/00A AU6586700A (en) 1999-08-23 2000-08-23 Internet profiling system and method

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US15021799P 1999-08-23 1999-08-23
US60/150,217 1999-08-23

Publications (2)

Publication Number Publication Date
WO2001015405A2 true WO2001015405A2 (fr) 2001-03-01
WO2001015405A3 WO2001015405A3 (fr) 2001-09-20

Family

ID=22533552

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/IB2000/001159 WO2001015405A2 (fr) 1999-08-23 2000-08-23 Systeme et procede d'etablissement de profils sur internet

Country Status (2)

Country Link
AU (1) AU6586700A (fr)
WO (1) WO2001015405A2 (fr)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2002103968A1 (fr) * 2001-06-15 2002-12-27 Beep Science As Dispositif et procede de controle de regles de contenu dans un systeme de messagerie multimedia mobile
US7822639B2 (en) 2000-11-28 2010-10-26 Almondnet, Inc. Added-revenue off-site targeted internet advertising
US8180892B2 (en) 2008-12-22 2012-05-15 Kindsight Inc. Apparatus and method for multi-user NAT session identification and tracking
US11611623B2 (en) * 2021-03-19 2023-03-21 At&T Intellectual Property I, L.P. Trusted system for providing customized content to internet service provider subscribers

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1997026729A2 (fr) * 1995-12-27 1997-07-24 Robinson Gary B Filtrage cooperatif automatise dans la publicite sur le world wide web
US5848396A (en) * 1996-04-26 1998-12-08 Freedom Of Information, Inc. Method and apparatus for determining behavioral profile of a computer user

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1997026729A2 (fr) * 1995-12-27 1997-07-24 Robinson Gary B Filtrage cooperatif automatise dans la publicite sur le world wide web
US5848396A (en) * 1996-04-26 1998-12-08 Freedom Of Information, Inc. Method and apparatus for determining behavioral profile of a computer user

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7822639B2 (en) 2000-11-28 2010-10-26 Almondnet, Inc. Added-revenue off-site targeted internet advertising
US8244586B2 (en) 2000-11-28 2012-08-14 Almondnet, Inc. Computerized systems for added-revenue off-site targeted internet advertising
US10026100B2 (en) 2000-11-28 2018-07-17 Almondnet, Inc. Methods and apparatus for facilitated off-site targeted internet advertising
US10628857B2 (en) 2000-11-28 2020-04-21 Almondnet, Inc. Methods and apparatus for facilitated off-site targeted internet advertising
WO2002103968A1 (fr) * 2001-06-15 2002-12-27 Beep Science As Dispositif et procede de controle de regles de contenu dans un systeme de messagerie multimedia mobile
US8180892B2 (en) 2008-12-22 2012-05-15 Kindsight Inc. Apparatus and method for multi-user NAT session identification and tracking
US11611623B2 (en) * 2021-03-19 2023-03-21 At&T Intellectual Property I, L.P. Trusted system for providing customized content to internet service provider subscribers

Also Published As

Publication number Publication date
WO2001015405A3 (fr) 2001-09-20
AU6586700A (en) 2001-03-19

Similar Documents

Publication Publication Date Title
JP4358188B2 (ja) インターネット検索エンジンにおける無効クリック検出装置
US9307036B2 (en) Web access using cross-domain cookies
US7600020B2 (en) System and program product for tracking web user sessions
US10547691B2 (en) System and method for main page identification in web decoding
US8176557B2 (en) Remote collection of computer forensic evidence
US6401118B1 (en) Method and computer program product for an online monitoring search engine
ATE461566T1 (de) System und verfahren zur analysierung von netzprotokollen
JP2004509413A (ja) ロボット・プルーフ・ウェブ・サイトを実現するためのシステム及び方法
JP2003233623A (ja) フィルタリングの適応化システムおよび適応化方法
JP2003263529A (ja) 付加価値サービスのオンライン個人化のためのオフライン行動分析
CN103399909A (zh) 在提供访问联网内容文件中分配访问控制级的方法和设备
US7032017B2 (en) Identifying unique web visitors behind proxy servers
EP1561327A1 (fr) Procedes et systemes pour l'acheminement de demandes au niveau d'un commutateur de reseau
Suresh et al. An overview of data preprocessing in data and web usage mining
WO2001015405A2 (fr) Systeme et procede d'etablissement de profils sur internet
WO2017177590A1 (fr) Procédé d'association de nom de domaine à un comportement d'accès à un site web
JPH08320846A (ja) 対話管理型情報提供方法及び装置
JPH0950422A (ja) コンピュータネットワーク上の対話継承型アクセス制御方法及びそのサーバコンピュータ
JP5061316B1 (ja) 通信パケット解析装置
KR100619179B1 (ko) 인터넷 검색 엔진에 있어서의 무효 클릭 검출 방법 및 장치
US20070245029A1 (en) Method for Determining Validity of Command and System Thereof
JP6105797B1 (ja) 情報処理装置、情報処理方法及びプログラム
US20030186211A1 (en) Training support program, application installation support program, and training support method
Shafagat Study and Comparative Analysis of Log Files
JPH11306160A (ja) サービス利用履歴からのサービス単位の抽出方法、抽出装置及び抽出プログラムを記録した記録媒体

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
AK Designated states

Kind code of ref document: A3

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A3

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

122 Ep: pct application non-entry in european phase
NENP Non-entry into the national phase

Ref country code: JP