CN101145936B - A method and system for adding tags in Web pages - Google Patents

A method and system for adding tags in Web pages Download PDF

Info

Publication number
CN101145936B
CN101145936B CN2006101272136A CN200610127213A CN101145936B CN 101145936 B CN101145936 B CN 101145936B CN 2006101272136 A CN2006101272136 A CN 2006101272136A CN 200610127213 A CN200610127213 A CN 200610127213A CN 101145936 B CN101145936 B CN 101145936B
Authority
CN
China
Prior art keywords
page
client
web
code
web server
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN2006101272136A
Other languages
Chinese (zh)
Other versions
CN101145936A (en
Inventor
李江华
张金银
李昂
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Alibaba Group Holding Ltd
Original Assignee
Alibaba Group Holding Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Alibaba Group Holding Ltd filed Critical Alibaba Group Holding Ltd
Priority to CN2006101272136A priority Critical patent/CN101145936B/en
Publication of CN101145936A publication Critical patent/CN101145936A/en
Priority to HK08107741.9A priority patent/HK1114486A1/en
Application granted granted Critical
Publication of CN101145936B publication Critical patent/CN101145936B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Abstract

The invention discloses a method for adding a label to a Web page, which includes the following steps: a Web server acquiring a plurality of pages needed to be delivered to a client end according to the request of the client end; scanning the code of the plurality of pages to position the designated location for inserting a label and embedding the label code into the designated location; delivering the page embedded with the label code to the client end; and a log file server receiving visiting information delivered from the client end according to the label. In this way, the invention resolves the problem that under the prior technology, some of the Web pages delivered by the Web servers to the client ends have no embedded labels so that the analysis results of the log file data are inaccurate. The inventive method not only maintains the advantage of generating visiting log via labels but also prevents the omission of some of Web pages to be embedded with labels, and also reduces the workload of website engineers. The invention also discloses a Web server and a website system.

Description

A kind of method and system thereof that in the Web page, adds label
Technical field
The present invention relates to the internet data analysis field, particularly a kind of method and system thereof that in the Web page, adds label.
Background technology
Along with the development of internet information service, many government departments, company, universities and colleges, scientific research institutions etc. have all had or have built the website of oneself.A Web server is all being moved in the back of each website, and Web server is also referred to as WWW (World Wide Web) server, and major function provides network information and browses service.WWW is the multimedia information inquiry instrument of the Internet Internet, is the service that Internet just grows up in recent years, also is with fastest developing speed and present most widely used service.What WWW adopted is client terminal/server structure; The effect of WEB server is arrangement and stores various WWW resources; And the request of customer in response end, be sent to the required resource of user on the platforms such as Windows95, Windows98, WindowsNT, UNIX or Linux; Be in charge of the Web page like Web server, after receiving user's request, the corresponding page be sent to the client Web browser through local network or Internet and use for the user.Web server commonly used now comprises the Enterprise server of Apache Server, Microsoft IIS server and Iplanet.
Management to the website; Require not only to pay close attention to the Web server throughput of every day; Also to understand the visit situation of each page of website; According to the click frequency of each page improve webpage content and quality, improve the readability of information; Trace packet contains the step of business transaction and the back-end data of managing web etc., is the network company of business with ecommerce or search engine especially, need carry out detailed and thorough analysis to follow the tracks of user's behavior to the operation and the visit situation of Web server especially; And these requirements can be carried out statistics and analysis and accomplish through the journal file to Web server.
The generating mode that two kinds of comparatively popular journal files are arranged on the market at present.
A kind of mode is directly to generate journal file by Web server.Consult shown in Figure 1ly, the user sends the request of Web page browsing to Web server through client computer; After Web server received and should ask, the Web page that user's request is browsed returned to client, and the corresponding access log of record; At last be dispersed in the access log on each Web server and analyze, put in order by unified collection of journal file server.
The major defect of this method is, journal file is dispersed on each Web server, if analyze, needs the collector journal file and handles.General is main website with the ecommerce, if day pageview surpasses 50,000,000, just needs up to a hundred Web servers for it service to be provided; Collect the access log of up to a hundred machines, need take a long time, can influence the load of Web server again, simultaneously, because Web server quantity is too many, the journal file server is easy to the holiday access log, causes that the data analysis result's is unreliable.
Another kind of mode is to adopt label (Tag) to generate journal file.The user sends the request of Web page browsing to Web server through client; The Web page that Web server will embed Tag accordingly sends to client, and said Tag is a bit of code, is generally the Javascript code; Client generates corresponding visit information according to the Tag that embeds on the said Web page, and this visit information is sent to the journal file server; The journal file server generates corresponding access log and returns request results to client according to said visit information, then said access log is analyzed, is put in order.
The access log that this mode directly will be analyzed is concentrated and is created on the journal file server, has reduced the process of log collection.But for the generating mode of Tag, also there is weak point in prior art.
In general, the Web page of user capture is divided into two kinds of static page and dynamic pages.Said static page is meant the only corresponding fixing content of pages of this page; Said dynamic page is meant that this page can return corresponding data according to user's difference request according to predefined page program.
For static page, the website engineer confirms the appointment embedded location of Tag earlier in page code, Tag is embedded on the said position by hand again, has what static page website engineers just to need the manual Tag of embedding code how many times; If there is more static page a website; Workload that the so manual Tag of embedding needs and the workload of in the future safeguarding will be very large; If some static page is omitted when embedding Tag by hand; The visit of these pages statistics will can not be transferred into the journal file server so, thereby cause journal file data analysis result inaccurate.
For dynamic page; The website engineer makes public template with the Tag code; And the assigned address in page code adds call instruction; Like this, the dynamic application on the Web server just according to the public template of said command calls correspondence, embeds Tag in the dynamic page when generating dynamic page; A general large-scale website has a lot of dynamic applications; If will in all dynamic pages, embed Tag; Just need all dynamic applications of free hand modification; Its workload and the workload of in the future safeguarding also will be very large, and network engineers may forget the dynamic application that modification is newly reached the standard grade, and make not embed Tag in the newly-generated dynamic page; The visit of these pages statistics will can not be transferred into the journal file server equally so, thereby cause journal file data analysis result inaccurate.
Summary of the invention
The present invention provides a kind of method and system thereof that in the Web page, adds label, in order to solve in the prior art problem that there is the big and easy error of workload in Web page embedded tags.
Technical scheme of the present invention is following:
A kind of method of in the Web page, adding label comprises:
Web server sends to some pages of this client according to the acquisition request needs of client;
The code that said Web server scans said some pages to be navigating to the assigned address that inserts label, and at this assigned address embedded tags code;
Said Web server sends to said client with the page of embedded tags code;
The journal file server receives the visit information that said client is sent according to this label.
According to said method:
The page that said Web server obtains comprises static page and dynamic page.
Said journal file server generates corresponding access log according to said visit information.
A kind of Web server comprises:
Be used to receive the unit of the Web page browsing request that client sends;
Be used to obtain the unit of some pages that need send to client;
Be used to scan said some page code and insert the assigned address of label with the location, and in the unit of this assigned address embedded tags code;
Be used for sending the page of embedded tags code, according to this label corresponding visit information sent to the unit of journal file server with the indication client to client.
A kind of web station system is characterized in that, comprising:
Client is used for the page to the embedded tags of the request of Web server transmission Web page browsing, the transmission of reception Web server, and sends corresponding visit information according to said label to the journal file server;
Web server; Be used for sending to some pages of this client according to the acquisition request needs of client; The code that scans said some pages to be navigating to the assigned address that inserts label and at this assigned address embedded tags code, and the page of embedded tags code is sent to said client.
The journal file server is used for generating corresponding access log according to the visit information that client is sent.
Beneficial effect of the present invention is following:
The present invention scans the assigned address of the code of this page with definite embedded tags earlier through Web server before sending the Web page, and adds tag identification code in said position.Like this, both kept the advantage that generates access log through label, and prevented that again the part Web page from being omitted when embedded tags, thereby guaranteed journal file data analysis result's accuracy; Simultaneously,, no longer need the manual label that inserts, no longer need the free hand modification program code to call common template for dynamic page, thereby reduced workload significantly for static page.
Description of drawings
Fig. 1 directly generates the system architecture figure of journal file for adopting Web server;
Fig. 2 A is a web station system system assumption diagram in the embodiment of the invention;
Fig. 2 B is a Web server structure chart in the embodiment of the invention;
Fig. 3 is the flow chart of Web server embedded tags in the Web page in the embodiment of the invention.
Embodiment
The present invention scans the code of the said page with definite assigned address that inserts label Tag through Web server in elder generation before client is sent the Web page, and at this position embedded tags code, to reduce workload and to avoid page omission label.
Consult shown in Fig. 2 A, the web station system in the embodiment of the invention comprises client 20, Web server 21 and journal file server 22.Said client 20 is used for sending to Web server 21 page of the embedding Tag of the request of Web page browsing, 21 transmissions of reception Web server, and sends corresponding visit information according to said Tag to journal file server 22; Said Web server 21 is used for sending to according to the acquisition request needs of client 20 page of client 20; The code that scans the said page to be navigating to the assigned address that inserts Tag and at this assigned address embedded tags code, and the page that will embed Tag sends to said client 20; Said journal file server is used for generating corresponding access log according to the visit information that client is sent.
Consult shown in Fig. 2 B, said Web server 21 comprises receiving element 210, chooses unit 211, processing unit 212 and transmitting element 213.Said receiving element 210 is used to receive the Web page browsing request that client is sent; The said unit 211 of choosing is used for according to said Web page browsing request and chooses the corresponding static page or dynamic page; Said processing unit 212 is used to scan the assigned address of the code of said static page of choosing or dynamic page with definite Tag of embedding, and adds tag identification code in this position; Said transmitting element 213 is used for the static page or the dynamic page that embed Tag are sent to client 20, and said tag identification code is defined in advance and is stored on the Web server 21 by the website engineer, and regularly upgrades according to actual needs.
Inserting Tag in the Web page can specify according to actual needs, like front or back (" <body>" through among the scanning agency positions), perhaps other positions in the page of <body>part in the page code.
Consult shown in Figure 3ly, Web server 21 embeds the detailed process of Tag in the Web page following:
Step 300: the user sends the request of Web page browsing through client 20 to Web server 21.
Step 301:Web server 21 is chosen the corresponding static page according to described request or in database, is chosen related data and generates the corresponding dynamic page in this locality.
The page code of step 302:Web server 21 said static pages of scanning or dynamic page is confirmed <body>position partly in this page code, and tag identification code is embedded the front or the back of said <body>part.
Static page or dynamic page that step 303:Web server 21 will contain Tag send to client 20.
After client 20 receives the static page or dynamic page of Web server 21 transmissions; Through Web browser the said page is presented to the user; And said user's visit information is sent to journal file server 22 according to the Tag that embeds in this page, generate corresponding journal file and unified by journal file server 22 according to said visit information and carry out data analysis.
For example: the code of a Web page is following:
<!DOCTYPEHTMLPUBLIC″-//W3C/DTD?HTML4.01//EN″″http://www.w3.org/TR/htm14/strict.dtd″>
<html><head><meta?http-equiv=″Content-Type″content=″text/html″;charset=gb2312″><title>Alibaba</title>
</head>
<body>
Alibaba?Alibaba?Alibaba?Alibaba?Alibaba?Alibaba
</body></html>
After Web server 21 scans < Body>part in the code of this page, the Tag code is embedded into back of < Body>part, it is following to embed behind the Tag code code of the Web page:
<!DOCTYPE?HTMLPUBLIC″-//W3C//DTD?HTML4.01//EN″″http://www.w3.org/TR/html4/strict.dtd″>
<html><head><meta?http-equiv=″Content-Type″content=″text/>html;charset=gb2312″<title>Alibaba</title>
</head>
<body><script?language=″JavaScript″type=″text/javascript″>
<!-
var?r=Math.random();
document.write(″img?alt=\″\″width=\″1\″height=\″1\″style=\″1\″height=\″1\″style=\″display:none\″″);
document.write(″src=\″http://dmtracking.alibaba.com/b.jpg?cD0x.JnU9ey93d3cuYWxpYmFiYS5jb20vfSZtPXtHRVR9JnM9ezIwMH0mcj17LX0mYT17LX0mYj17LX0mYz17LX0=&rand=″+r+″\″>″);
->
</script>
Alibaba?Alibaba?Alibaba?Alibaba?Alibaba?Alibaba
</body><html>
Like this, static page that all are sent by Web server 21 or dynamic page all can be embedded into Tag and can not be omitted, and its corresponding visit information also can be sent to journal file server 22 by client 20.
Add tag identification code through Web server 21 assigned address at the page when sending the Web page; Both kept the advantage that generates access log through label; Prevented that again the part Web page from being omitted when embedded tags, thereby guaranteed journal file data analysis result's accuracy; Simultaneously,, no longer need the manual label that inserts, no longer need the free hand modification program code to call common template for dynamic page, thereby reduced workload significantly for static page.
Obviously, those skilled in the art can carry out various changes and modification to the present invention and not break away from the spirit and scope of the present invention.Like this, belong within the scope of claim of the present invention and equivalent technologies thereof if of the present invention these are revised with modification, then the present invention also is intended to comprise these changes and modification interior.

Claims (7)

1. a method of in the Web page, adding label is characterized in that, comprising:
Web server sends to some pages of this client according to the acquisition request needs of client;
The code that said Web server scans said some pages to be navigating to the assigned address that inserts label, and at this assigned address embedded tags code;
Said Web server sends to said client with the page of embedded tags code;
The journal file server receives the visit information that said client is sent according to this label.
2. the method for claim 1 is characterized in that, the page that said Web server obtains comprises static page and dynamic page.
3. method as claimed in claim 3 is characterized in that, said journal file server generates corresponding access log according to said visit information.
4. a Web server is characterized in that, comprising:
Be used to receive the unit of the Web page browsing request that client sends;
Be used to obtain the unit of some pages that need send to client;
The code that is used to scan said some pages inserts the assigned address of label with the location, and in the unit of this assigned address embedded tags code;
Be used for sending the page of embedded tags code, according to this label corresponding visit information sent to the unit of journal file server with the indication client to client.
5. Web server as claimed in claim 4 is characterized in that the page that said Web server obtains comprises static page and dynamic page.
6. a web station system is characterized in that, comprising:
Client is used for the page to the embedded tags of the request of Web server transmission Web page browsing, the transmission of reception Web server, and sends corresponding visit information according to said label to the journal file server;
Web server; Be used for sending to some pages of this client according to the acquisition request needs of client; The code that scans said some pages to be navigating to the assigned address that inserts label and at this assigned address embedded tags code, and the page of embedded tags code is sent to said client.
7. web station system as claimed in claim 6 is characterized in that, also comprises:
The journal file server is used for generating corresponding access log according to the visit information that client is sent.
CN2006101272136A 2006-09-12 2006-09-12 A method and system for adding tags in Web pages Active CN101145936B (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN2006101272136A CN101145936B (en) 2006-09-12 2006-09-12 A method and system for adding tags in Web pages
HK08107741.9A HK1114486A1 (en) 2006-09-12 2008-07-14 A method for adding tags to web pages and system thereof

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2006101272136A CN101145936B (en) 2006-09-12 2006-09-12 A method and system for adding tags in Web pages

Publications (2)

Publication Number Publication Date
CN101145936A CN101145936A (en) 2008-03-19
CN101145936B true CN101145936B (en) 2012-04-18

Family

ID=39208250

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2006101272136A Active CN101145936B (en) 2006-09-12 2006-09-12 A method and system for adding tags in Web pages

Country Status (2)

Country Link
CN (1) CN101145936B (en)
HK (1) HK1114486A1 (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103684823A (en) * 2012-09-12 2014-03-26 阿里巴巴集团控股有限公司 Weblog recording method, network access path determining method and related devices
CN102999360A (en) * 2012-11-23 2013-03-27 奥维通信股份有限公司 Realization method for dynamic clock based on embedded WEB-OMT (object modeling technology) page
CN105337930B (en) * 2014-06-30 2019-02-19 北京新媒传信科技有限公司 The method and device that a kind of pair of data are handled
CN105119955B (en) * 2015-07-09 2018-10-09 中国科学院计算技术研究所 The method and system that catalogue multipage is supported are read in a kind of distributed file system

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1026608A2 (en) * 1999-01-28 2000-08-09 Lucent Technologies Inc. Method and apparatus for logging web user activity using client-side support
CN1519722A (en) * 2003-01-20 2004-08-11 鼎新电脑股份有限公司 System and method for processing events on java web page
CN1744498A (en) * 2004-08-31 2006-03-08 华为技术有限公司 Network server and data interacting method for network server and browser

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1026608A2 (en) * 1999-01-28 2000-08-09 Lucent Technologies Inc. Method and apparatus for logging web user activity using client-side support
CN1519722A (en) * 2003-01-20 2004-08-11 鼎新电脑股份有限公司 System and method for processing events on java web page
CN1744498A (en) * 2004-08-31 2006-03-08 华为技术有限公司 Network server and data interacting method for network server and browser

Also Published As

Publication number Publication date
HK1114486A1 (en) 2008-10-31
CN101145936A (en) 2008-03-19

Similar Documents

Publication Publication Date Title
CN101211364B (en) Method and system for social bookmarking of resources exposed in web pages
CN101427229B (en) Technique for modifying presentation of information displayed to end users of a computer system
CN100399290C (en) Gethering enriched server activity data of cached web content
CN101131747B (en) Method, device and system for catching and/or analyzing Web page events at client terminal
US8065620B2 (en) System and method for defining and presenting a composite web page
CN1941786B (en) Method and apparatus for publishing content through blog
US20040254935A1 (en) Method and apparatus for automatic consolidation of personalized dynamic data
CN101364979B (en) Downloaded material parsing and processing system and method
EP1258819A2 (en) System and method for providing a file in multiple languages
US20030093400A1 (en) Method for updating a database from a browser
US20120011431A1 (en) Method and System of Retrieving Ajax Web Page Content
CN103685604B (en) A kind of domain name pre-parsed method and device
US20100199169A1 (en) Web browser extension for simplified utilization of web services
US9311303B2 (en) Interpreted language translation system and method
US8090702B2 (en) Analyzing web site traffic
US8195762B2 (en) Locating a portion of data on a computer network
US7069292B2 (en) Automatic display method and apparatus for update information, and medium storing program for the method
US7623741B2 (en) System and method for transferring scanned imaging data to a personal imaging repository
WO2002065359A1 (en) Electronic information management system
CN101145936B (en) A method and system for adding tags in Web pages
CN1960371B (en) Method and system for accessing file of Web application program
US20130019152A1 (en) Method and Apparatus for Tracking Exit Destinations of Web Page
US20040205490A1 (en) Method and system for automated simulation of dynamic URL-based Web application
CN101772196A (en) Method and system for processing message sent by mobile terminal and acting server
EP1384147A2 (en) Method and apparatus for processing of internet forms

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
REG Reference to a national code

Ref country code: HK

Ref legal event code: DE

Ref document number: 1114486

Country of ref document: HK

C14 Grant of patent or utility model
GR01 Patent grant
REG Reference to a national code

Ref country code: HK

Ref legal event code: GR

Ref document number: 1114486

Country of ref document: HK