WO2001015405A2 - Systeme et procede d'etablissement de profils sur internet - Google Patents
Systeme et procede d'etablissement de profils sur internet Download PDFInfo
- Publication number
- WO2001015405A2 WO2001015405A2 PCT/IB2000/001159 IB0001159W WO0115405A2 WO 2001015405 A2 WO2001015405 A2 WO 2001015405A2 IB 0001159 W IB0001159 W IB 0001159W WO 0115405 A2 WO0115405 A2 WO 0115405A2
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- individual
- previously identified
- internet
- information
- unknown
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q30/00—Commerce
- G06Q30/02—Marketing; Price estimation or determination; Fundraising
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L67/00—Network arrangements or protocols for supporting network services or applications
- H04L67/50—Network services
- H04L67/535—Tracking the activity of the user
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L69/00—Network arrangements, protocols or services independent of the application payload and not provided for in the other groups of this subclass
- H04L69/30—Definitions, standards or architectural aspects of layered protocol stacks
- H04L69/32—Architecture of open systems interconnection [OSI] 7-layer type protocol stacks, e.g. the interfaces between the data link level and the physical level
- H04L69/322—Intralayer communication protocols among peer entities or protocol data unit [PDU] definitions
- H04L69/329—Intralayer communication protocols among peer entities or protocol data unit [PDU] definitions in the application layer [OSI layer 7]
Definitions
- An Internet profiling method and system for client/user identification by statistical examination of streaming data
- This invention relates, in general, to a method and system for identifying an Internet or network user who is engaged in repetitive Internet or network inter-sessions. More specifically it relates to an expandable method for a network device, which identifies and associates Internet or network inter-sessions conducted by a user using information sent/received to/from the user's computer to/from any remote host on the Internet or the network.
- the method is adaptable to many Internet and network users.
- Identifying an Internet user has become a major issue since the Internet became a media for advertisement and targeted content.
- Remote content suppliers e.g., web sites
- the web sites may use resources on the client computer, e.g., cookies or client software, and second, the web sites may ask the client to identify himself/herself when the client asks for the content. These two methods are commonly used in client server interactions.
- clients connect to a public network, e.g., the Internet, through an ISP (Internet Service Provider).
- ISP Internet Service Provider
- the ISP identifies its clients by their login information, e.g., username and password.
- Another method for the ISP to identify a client is to track their unique IP address, which is assigned to a client after login. The first method allows the ISP to associate a current user Internet session to a specific user and can
- C0NFIRMATI0N COPY create a user log file.
- the second method provides the ISP with a way to track the client Internet session in real time until the client disconnects. After disconnecting, the LP address is taken from the client and generally given to another client, so this method is effective only for the current Internet session.
- the system and method presented herein allows for identifying Internet users by using information extracted from the users' network sessions and from the users' past activity pattern.
- the system and method allows associating various Internet inter-sessions that are conducted by the same user.
- the system and method is not dependent on login information or any other registry information.
- a method and system for determining the identity of an unknown individual participating in a network session is provided.
- a database is maintained which includes records of each previously identified individual.
- the records include unique strings associated with each individual which were extracted from prior network sessions of that individual.
- the data stream of information transmitted to and from that individual is read, and known data elements are identified in the information.
- a subset of information, which includes at least one unique string associated with a known data element, is extracted from the data stream.
- the subset of information is analyzed to determine if the individual participating in the current network session is a previously identified individual from the database. The analysis includes comparing the unique strings extracted for that individual with unique strings associated with previously identified individuals.
- One object of the present invention is to provide a generic, intelligent point which associates and identifies Internet sessions conducted by the same user using information and patterns, or a fingerprint, extracted from prior user sessions.
- a further object of the present invention is to provide to an ISP a statistical method for monitoring and associating an anonymous user participating in a current Internet session using only the streaming data passing to/form the user to/from a remote server and a database of past Internet session information to compare to this streaming data.
- Another object of the present invention is to provide a way for an ISP to share information with clients and remote content suppliers, e.g., web sites.
- the invention accordingly comprises the several steps and relation of one or more of such steps with respect to each of the others, and the system embodying features of construction, combinations of elements and arrangements of parts which are adapted to effect such steps, all as exemplified in the following detailed disclosure, and the scope of the invention will be indicated in the claims.
- FIG. 1 is a flowchart representation of a typical global computer network in accordance with the prior art
- FIG. 2 is a flowchart representation of a global computer network in accordance with a preferred embodiment of the present invention
- FIG. 3 is a detailed flowchart representation of the profiling system of FIG. 2 constructed in accordance with the present invention
- FIG. 4 is a flowchart representation depicting the steps performed during the profiling and identifying process according to a preferred embodiment of the present invention.
- FIG. 1 depicts a typical ISP junction in accordance with the prior art.
- the main ISP site generally indicated at 10 includes an ISP access device 18 which allows, for example, a dial-in access through a modem or the like as indicated at 14, direct access through a router or any other communication means, generally indicated at 16, thereby enabling a client 12, or a network 13 of clients 12a, 12b, 12c to connect to ISP junction 10.
- the site also includes a hub 22, a domain name server (DNS) 20, client access control such as a Radius 24, an e-mail server 25, hosted servers 26, and a router 30 which connects the ISP junction to a global computer network such as Internet 32.
- DNS domain name server
- ISP devices are connected together via a network such as a local area network (LAN).
- LAN local area network
- ISP network configurations can be used with the present invention.
- the arrangement and set up of such configurations are well known to those skilled in the art.
- the present invention as described below in detail can be used in conjunction with any of these possible configurations.
- Each client 12 is generally a computer such as a PC or laptop with video and audio capabilities, having a processor and programs or applications associated therewith.
- Internet 32 is a networked collection of clients and servers which are adapted through software and communication links to communicate with one another. The clients, typically through a browser program, can send a request message to a server and await a response. The response is displayed or presented by the browser.
- FIG. 2 depicts the network configuration of FIG. 1 in which a profiling system, in the form of a session identifier, generally indicated at 40, and arranged and constructed in accordance with the present invention, has been installed.
- a profiling system in the form of a session identifier, generally indicated at 40, and arranged and constructed in accordance with the present invention, has been installed.
- session identifier 40 is provided in ISP junction 10 in this example
- the profiling system and method desc ⁇ bed herein may be used at other points on the network, such as at the hub of a web site.
- FIG 3 depicts a flowchart representation of the profiling system 40 of FIG. 2 arranged and constructed in accordance with the present invention.
- the profiling system preferably includes the following modules: system administrator 104, which enables a system operator to set the profiler policy, storage medium 100 including a database, which stores the profiling records, and sniffer 102, which collect the streaming data passing through hub 22 of the ISP.
- the profiling process includes the following steps. First, the system extracts information from an Internet session and creates a session fingerprint pattern based on the user browsing activity and data stream. Second, the system tnes to match previously created profile records in a database to the current browsing session. At this stage, the current Internet session may be identified by its umque IP address.
- FIG. 4 depicts a flowchart of the profiling and identifying process.
- Task 110 reads the entire data stream passing through hub 22.
- the sniffer 102 distinguishes between different current user Internet sessions by extracting the umque IP address which is assigned to each client while connecting to the Internet.
- Task 112 extracts information from the Internet session data stream.
- the Internet session fingerprint pattern is a collection of umque st ⁇ ngs which the user sends/receives to/from Internet servers during the Internet session.
- the umque st ⁇ ngs are embedded m the application protocols stream, e.g., HTTP, SMTP, etc.
- the profiling system uses a profiling policy.
- the profiling policy includes known data elements to look for and additionally determines the sources from which the umque strmg will be extracted.
- the source includes, but is not limited to: HTTP URL and domain, e.g., private home page, which includes special URL strings, user SMTP connection streams, cookies sent by an Internet host to a client, and information about the client computer and browser (e.g., OS type and version, browser type and version).
- HTTP URL and domain e.g., private home page, which includes special URL strings
- user SMTP connection streams e.g., user SMTP connection streams
- cookies sent by an Internet host to a client e.g., OS type and version, browser type and version
- Task 114 analyzes and classifies the extracted umque st ⁇ ngs.
- the task stores the unique st ⁇ ngs in patterns accordmg to the profiler policy.
- Task 118 creates and updates a profile record in the database for an Internet session conducted by a user
- the task assigns a umque se ⁇ al number to the record in order to identify it later on
- the record contains a group of umque stnngs which was extracted from the user Internet session and later will serve as a reference
- Task 122 saves the profile record in the database of storage medium 100
- Task 116 checks if the umque strings extracted from the Internet session match one of the profile records stored in the database contained m the storage medium 100. If a match is found, the Internet session is assigned, in task 120, to the matched profile The match will be dropped when another better match is found and contradicts the first match which was found m the current Internet session.
- the profiling system uses two tables for the profiling process, Current Session Table and Reference Session table
- Current Session Table stores information about an Internet session in real time Each session is identified by its umque LP address
- An example of the fields in this table is set forth below
- the table can store any information passmg to/from the Internet user du ⁇ ng an Internet session
- the table fields are determined by the profiling policy.
- the Reference Session table stores information about previous Internet sessions The information stored is used to associate current sessions with previous sessions This is done by compa ⁇ ng umque keywords which are stored in the table fields.
- the table includes all the fields that are used in the Current Session Table except the LP address which is replaced by the umque se ⁇ al number for the user The table is updated every time a new umque subset of information is found and associates to a previous session
- the profiling process occurs at the ISP site where the profiling system can read the information passmg from/to the Internet user
- An Example of the profiling process is set forth below:
- Client connects to the ISP and gets a unique EP address.
- the profiling system extracts information from the browsing stream and stores it in the Cu ⁇ ent Session Table using the LP address as an Internet session ED.
- the Profiling system tries to find a match between a Current Session Table Entry and a Reference Session Table Entry.
- the profiling system continues to extract information until a match is found or creates a new record in the database.
- a method and system for determining the identity of an unknown individual participating in a network session is provided.
- a database of records of each previously identified individual is maintained.
- the records include unique strings associated with individuals which were extracted from prior network sessions.
- the data stream of information transmitted to and from that individual is read by the sniffer, and known data elements are identified in the information.
- a subset of information, which includes at least one unique string associated with a known data element, is extracted from the data stream.
- the subset of information is then analyzed by comparing the information extracted from the current network session to information stored in the database to determine if the individual participating in the network session is a previously identified individual.
- the analysis generally includes comparing the unique strings extracted for that individual with unique strings associated with previously identified individuals in the database.
- the individual is determined to be a previously identified individual, his or her identity is matched with the identity of the previously identified individual, and the record in the database of the previously identified individuals is updated with the new subset of information extracted from the cu ⁇ ent network session. Otherwise, if the individual is not identified, his or her identity is set as a new individual, and a new record is created in the database for the new individual which includes the subset of information extracted from the current network session.
- the system and method according to the present invention allows identifying and associating a user's previous Internet sessions with the user's current one.
- the current session data stream is used to create the user's Internet session fingerprint pattern, and includes unique strings extracted from the data stream.
- the fingerprint pattern for a current session is compared with fingerprint pattems from previous sessions maintained in a database to associate Internet inter-session data streams conducted by the same user at different times.
Abstract
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
AU65867/00A AU6586700A (en) | 1999-08-23 | 2000-08-23 | Internet profiling system and method |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US15021799P | 1999-08-23 | 1999-08-23 | |
US60/150,217 | 1999-08-23 |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2001015405A2 true WO2001015405A2 (fr) | 2001-03-01 |
WO2001015405A3 WO2001015405A3 (fr) | 2001-09-20 |
Family
ID=22533552
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/IB2000/001159 WO2001015405A2 (fr) | 1999-08-23 | 2000-08-23 | Systeme et procede d'etablissement de profils sur internet |
Country Status (2)
Country | Link |
---|---|
AU (1) | AU6586700A (fr) |
WO (1) | WO2001015405A2 (fr) |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2002103968A1 (fr) * | 2001-06-15 | 2002-12-27 | Beep Science As | Dispositif et procede de controle de regles de contenu dans un systeme de messagerie multimedia mobile |
US7822639B2 (en) | 2000-11-28 | 2010-10-26 | Almondnet, Inc. | Added-revenue off-site targeted internet advertising |
US8180892B2 (en) | 2008-12-22 | 2012-05-15 | Kindsight Inc. | Apparatus and method for multi-user NAT session identification and tracking |
US11611623B2 (en) * | 2021-03-19 | 2023-03-21 | At&T Intellectual Property I, L.P. | Trusted system for providing customized content to internet service provider subscribers |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1997026729A2 (fr) * | 1995-12-27 | 1997-07-24 | Robinson Gary B | Filtrage cooperatif automatise dans la publicite sur le world wide web |
US5848396A (en) * | 1996-04-26 | 1998-12-08 | Freedom Of Information, Inc. | Method and apparatus for determining behavioral profile of a computer user |
-
2000
- 2000-08-23 WO PCT/IB2000/001159 patent/WO2001015405A2/fr active Application Filing
- 2000-08-23 AU AU65867/00A patent/AU6586700A/en not_active Abandoned
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1997026729A2 (fr) * | 1995-12-27 | 1997-07-24 | Robinson Gary B | Filtrage cooperatif automatise dans la publicite sur le world wide web |
US5848396A (en) * | 1996-04-26 | 1998-12-08 | Freedom Of Information, Inc. | Method and apparatus for determining behavioral profile of a computer user |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7822639B2 (en) | 2000-11-28 | 2010-10-26 | Almondnet, Inc. | Added-revenue off-site targeted internet advertising |
US8244586B2 (en) | 2000-11-28 | 2012-08-14 | Almondnet, Inc. | Computerized systems for added-revenue off-site targeted internet advertising |
US10026100B2 (en) | 2000-11-28 | 2018-07-17 | Almondnet, Inc. | Methods and apparatus for facilitated off-site targeted internet advertising |
US10628857B2 (en) | 2000-11-28 | 2020-04-21 | Almondnet, Inc. | Methods and apparatus for facilitated off-site targeted internet advertising |
WO2002103968A1 (fr) * | 2001-06-15 | 2002-12-27 | Beep Science As | Dispositif et procede de controle de regles de contenu dans un systeme de messagerie multimedia mobile |
US8180892B2 (en) | 2008-12-22 | 2012-05-15 | Kindsight Inc. | Apparatus and method for multi-user NAT session identification and tracking |
US11611623B2 (en) * | 2021-03-19 | 2023-03-21 | At&T Intellectual Property I, L.P. | Trusted system for providing customized content to internet service provider subscribers |
Also Published As
Publication number | Publication date |
---|---|
WO2001015405A3 (fr) | 2001-09-20 |
AU6586700A (en) | 2001-03-19 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP4358188B2 (ja) | インターネット検索エンジンにおける無効クリック検出装置 | |
US9307036B2 (en) | Web access using cross-domain cookies | |
US7600020B2 (en) | System and program product for tracking web user sessions | |
US10547691B2 (en) | System and method for main page identification in web decoding | |
US8176557B2 (en) | Remote collection of computer forensic evidence | |
US6401118B1 (en) | Method and computer program product for an online monitoring search engine | |
ATE461566T1 (de) | System und verfahren zur analysierung von netzprotokollen | |
JP2004509413A (ja) | ロボット・プルーフ・ウェブ・サイトを実現するためのシステム及び方法 | |
JP2003233623A (ja) | フィルタリングの適応化システムおよび適応化方法 | |
JP2003263529A (ja) | 付加価値サービスのオンライン個人化のためのオフライン行動分析 | |
CN103399909A (zh) | 在提供访问联网内容文件中分配访问控制级的方法和设备 | |
US7032017B2 (en) | Identifying unique web visitors behind proxy servers | |
EP1561327A1 (fr) | Procedes et systemes pour l'acheminement de demandes au niveau d'un commutateur de reseau | |
Suresh et al. | An overview of data preprocessing in data and web usage mining | |
WO2001015405A2 (fr) | Systeme et procede d'etablissement de profils sur internet | |
WO2017177590A1 (fr) | Procédé d'association de nom de domaine à un comportement d'accès à un site web | |
JPH08320846A (ja) | 対話管理型情報提供方法及び装置 | |
JPH0950422A (ja) | コンピュータネットワーク上の対話継承型アクセス制御方法及びそのサーバコンピュータ | |
JP5061316B1 (ja) | 通信パケット解析装置 | |
KR100619179B1 (ko) | 인터넷 검색 엔진에 있어서의 무효 클릭 검출 방법 및 장치 | |
US20070245029A1 (en) | Method for Determining Validity of Command and System Thereof | |
JP6105797B1 (ja) | 情報処理装置、情報処理方法及びプログラム | |
US20030186211A1 (en) | Training support program, application installation support program, and training support method | |
Shafagat | Study and Comparative Analysis of Log Files | |
JPH11306160A (ja) | サービス利用履歴からのサービス単位の抽出方法、抽出装置及び抽出プログラムを記録した記録媒体 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A2 Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG UZ VN YU ZA ZW |
|
AL | Designated countries for regional patents |
Kind code of ref document: A2 Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
AK | Designated states |
Kind code of ref document: A3 Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG UZ VN YU ZA ZW |
|
AL | Designated countries for regional patents |
Kind code of ref document: A3 Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG |
|
REG | Reference to national code |
Ref country code: DE Ref legal event code: 8642 |
|
122 | Ep: pct application non-entry in european phase | ||
NENP | Non-entry into the national phase |
Ref country code: JP |