CN101145936A - A method and system for adding tags in Web pages - Google Patents

A method and system for adding tags in Web pages Download PDF

Info

Publication number
CN101145936A
CN101145936A CNA2006101272136A CN200610127213A CN101145936A CN 101145936 A CN101145936 A CN 101145936A CN A2006101272136 A CNA2006101272136 A CN A2006101272136A CN 200610127213 A CN200610127213 A CN 200610127213A CN 101145936 A CN101145936 A CN 101145936A
Authority
CN
China
Prior art keywords
page
client
web
code
web server
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CNA2006101272136A
Other languages
Chinese (zh)
Other versions
CN101145936B (en
Inventor
李江华
张金银
李昂
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Alibaba Group Holding Ltd
Original Assignee
Alibaba Group Holding Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Alibaba Group Holding Ltd filed Critical Alibaba Group Holding Ltd
Priority to CN2006101272136A priority Critical patent/CN101145936B/en
Publication of CN101145936A publication Critical patent/CN101145936A/en
Priority to HK08107741.9A priority patent/HK1114486A1/en
Application granted granted Critical
Publication of CN101145936B publication Critical patent/CN101145936B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Information Transfer Between Computers (AREA)

Abstract

The invention discloses a method for adding a label to a Web page, which includes that a Web server acquires the page needed to be delivered to a client end according to the request of the client end; scan the code of the page to position the designated location for inserting a label and embed the label code into the designated location; and deliver the page embedded with the identification code to the client end. In this way, the invention resolves the problem that under the prior technology, some of the Web pages delivered by Web servers to client ends have no embedded labels so that the analysis results of the log file data are inaccurate; the invention not only maintains the advantage of generating visit log via labels but also prevents the omission of some of Web pages to be embedded with labels, and also reduces the workload of website engineers. The invention also discloses a Web server and a website system.

Description

A kind of method and system thereof that in the Web page, adds label
Technical field
The present invention relates to the internet data analysis field, particularly a kind of method and system thereof that in the Web page, adds label.
Background technology
Along with the development of internet information service, many government departments, company, universities and colleges, scientific research institutions etc. have all had or have built the website of oneself.A Web server is all being moved in the back of each website, and Web server is also referred to as WWW (World Wide Web) server, and major function provides network information and browses service.WWW is the multimedia information inquiry instrument of the Internet Internet, is the service that Internet just grows up in recent years, also is with fastest developing speed and present most widely used service.What WWW adopted is client terminal/server structure, the effect of WEB server is arrangement and stores various WWW resources, and the request of customer in response end, the required resource of user is sent on the platforms such as Windows95, Windows98, WindowsNT, UNIX or Linux; Be in charge of the Web page as Web server, after receiving user's request, the corresponding page be sent to the client Web browser by local network or Internet and use for the user.Web server commonly used now comprises the Enterprise server of Apache Server, Microsoff IIS server and Iplanet.
Management to the website, require not only to pay close attention to the Web server throughput of every day, also to understand the visit situation of each page of website, improve the readability of the content of webpage and quality, raising information according to the click frequency of each page, trace packet contains the step of business transaction and the back-end data of managing web etc., especially be the network company of business with ecommerce or search engine, need especially the operation and the visit situation of Web server are carried out detailed and thorough analysis to follow the tracks of user's behavior; And these requirements can be carried out statistics and analysis and accomplish by the journal file to Web server.
The generating mode that two kinds of comparatively popular journal files are arranged on the market at present.
A kind of mode is directly to generate journal file by Web server.Consult shown in Figure 1ly, the user sends the request of Web page browsing to Web server by client computer; After Web server received this request, the Web page that user's request is browsed returned to client, and the corresponding access log of record; At last be dispersed in the access log on each Web server and analyze, put in order by unified collection of journal file server.
The major defect of this method is, journal file is dispersed on each Web server, if analyze, needs the collector journal file and handles.The general website based on ecommerce if day pageview surpasses 50,000,000, just needs up to a hundred Web servers to provide service for it; Collect the access log of up to a hundred machines, need take a long time, can influence the load of Web server again, simultaneously, because Web server quantity is too many, the journal file server is easy to the holiday access log, causes that the data analysis result's is unreliable.
Another kind of mode is to adopt label (Tag) to generate journal file.The user sends the request of Web page browsing to Web server by client; The Web page that Web server will embed Tag accordingly sends to client, and described Tag is a bit of code, is generally the Javascript code; Client generates corresponding visit information according to the Tag that embeds on the described Web page, and this visit information is sent to the journal file server; The journal file server generates corresponding access log and returns request results to client according to described visit information, then described access log is analyzed, is put in order.
This mode directly will need the access log analyzed to concentrate to be created on the journal file server, to have reduced the process of log collection.But for the generating mode of Tag, also there is weak point in prior art.
In general, the Web page of user capture is divided into two kinds of static page and dynamic pages.Described static page is meant the only corresponding fixing content of pages of this page; Described dynamic page is meant that this page can return corresponding data according to user's difference request according to predefined page program.
For static page, the website engineer determines the appointment embedded location of Tag earlier in page code, Tag is embedded on the described position by hand again, has what static page website engineers just to need the manual Tag of embedding code how many times; If there is more static page a website, workload that the so manual Tag of embedding needs and the workload of in the future safeguarding will be very large, if some static page is omitted when embedding Tag by hand, the visit of these pages statistics will can not be transferred into the journal file server so, thereby cause journal file data analysis result inaccurate.
For dynamic page, the website engineer makes public template with the Tag code, and the assigned address in page code adds call instruction, like this, dynamic application on the Web server just according to the public template of described command calls correspondence, embeds Tag in the dynamic page when generating dynamic page; A general large-scale website has a lot of dynamic applications, if will in all dynamic pages, embed Tag, just need manual all dynamic applications of revising, its workload and the workload of in the future safeguarding also will be very large, and network engineers may forget the dynamic application that modification is newly reached the standard grade, make and do not embed Tag in the newly-generated dynamic page, the visit of these pages statistics will can not be transferred into the journal file server equally so, thereby cause journal file data analysis result inaccurate.
Summary of the invention
The invention provides a kind of method and system thereof that in the Web page, adds label, exist workload greatly and the problem of makeing mistakes easily in order to solve in the prior art in the Web page embedded tags.
Technical solution of the present invention is as follows:
A kind of method of adding label in the Web page comprises:
Web server sends to the page of this client according to the acquisition request needs of client;
The code that described Web server scans the described page to be navigating to the assigned address that inserts label, and at this assigned address embedded tags code;
The page that described Web server will embed authentication code sends to described client.
According to said method:
The page that described Web server obtains comprises static page and dynamic page.
Described client sends to the journal file server according to this label with corresponding visit information after receiving the page of embedded tags.
Described journal file server generates corresponding access log according to described visit information.
A kind of Web server comprises:
Be used to receive the unit of the Web page browsing request that client sends;
Be used to obtain the unit of the page that need send to client;
Be used to scan described page code and insert the assigned address of label with the location, and in the unit of this assigned address embedded tags code;
Be used for sending the unit of the page that embeds authentication code to client.
A kind of web station system is characterized in that, comprising:
Client is used for the page to the embedded tags of the request of Web server transmission Web page browsing, the transmission of reception Web server, and sends corresponding visit information according to described label to the journal file server.
Web server, be used for sending to the page of this client according to the acquisition request needs of client, the code that scans the described page to be navigating to the assigned address that inserts label and at this assigned address embedded tags code, and the page that will embed authentication code sends to described client.
The journal file server is used for generating corresponding access log according to the visit information that client sends.
Beneficial effect of the present invention is as follows:
The present invention scans the assigned address of the code of this page with definite embedded tags earlier by Web server before sending the Web page, and adds tag identification code in described position.Like this, both kept the advantage that generates access log by label, and prevented that again the part Web page from being omitted when embedded tags, thereby guaranteed journal file data analysis result's accuracy; Simultaneously,, no longer need the manual label that inserts, no longer need manual update routine code to call common template for dynamic page, thereby reduced workload significantly for static page.
Description of drawings
Fig. 1 directly generates the system architecture figure of journal file for adopting Web server;
Fig. 2 A is a web station system system assumption diagram in the embodiment of the invention;
Fig. 2 B is a Web server structure chart in the embodiment of the invention;
Fig. 3 is the flow chart of Web server embedded tags in the Web page in the embodiment of the invention.
Embodiment
The present invention scans the code of the described page with definite assigned address that inserts label Tag by Web server in elder generation before client sends the Web page, and at this position embedded tags code, to reduce workload and to avoid page omission label.
Consult shown in Fig. 2 A, the web station system in the embodiment of the invention comprises client 20, Web server 21 and journal file server 22.Described client 20 is used for sending to Web server 21 page of the embedding Tag of the request of Web page browsing, 21 transmissions of reception Web server, and sends corresponding visit information according to described Tag to journal file server 22; Described Web server 21 is used for sending to according to the acquisition request needs of client 20 page of client 20, the code that scans the described page to be navigating to the assigned address that inserts Tag and at this assigned address embedded tags code, and the page that will embed Tag sends to described client 20; Described journal file server is used for generating corresponding access log according to the visit information that client sends.
Consult shown in Fig. 2 B, described Web server 21 comprises receiving element 210, chooses unit 211, processing unit 212 and transmitting element 213.Described receiving element 210 is used to receive the Web page browsing request that client sends; The described unit 211 of choosing is used for according to described Web page browsing request and chooses the corresponding static page or dynamic page; Described processing unit 212 is used to scan the assigned address of the code of described static page of choosing or dynamic page with definite Tag of embedding, and adds tag identification code in this position; Static page or dynamic page that described transmitting element 213 is used for embedding Tag send to client 20, and described tag identification code is pre-defined and be stored on the Web server 21 by the website engineer, and regularly upgrades according to actual needs.
Inserting Tag in the Web page can specify according to actual needs, as in the page code<body the front of part or back (positioning), perhaps other positions in the page by "<body〉" among the scanning agency.
Consult shown in Figure 3ly, Web server 21 embeds the detailed process of Tag in the Web page as follows:
Step 300: the user sends the request of Web page browsing by client 20 to Web server 21.
Step 301:Web server 21 is chosen the corresponding static page according to described request or choose related data in database and generates the corresponding dynamic page in this locality.
The page code of step 302:Web server 21 described static pages of scanning or dynamic page is determined in this page code<body〉position of part, and tag identification code embedded described<body〉front or the back of part.
Static page or dynamic page that step 303:Web server 21 will contain Tag send to client 20.
After client 20 receives the static page or dynamic page of Web server 21 transmissions, by Web browser the described page is presented to the user, and described user's visit information is sent to journal file server 22 according to the Tag that embeds in this page, generate corresponding journal file and unifiedly carry out data analysis according to described visit information by journal file server 22.
For example: the code of a Web page is as follows:
<!DOCIYPEHTMLPUBLIC″-//W3C//DTDHTML4.01//EN″″http://www.w3.org/TR/html4/strict.dtd″>
<html><head><metahttp-equiv=″Content-Type″content=″text/html;charset=gb2312″><title>Alibaba</title>
</head>
<body>
Alibaba?Alibaba?Alibaba?Alibaba?Alibaba?Alibaba
</body></html>
Web server 21 scans in the code of this page<Body〉after the part, the Tag code is embedded into<Body the back of part, it is as follows to embed behind the Tag code code of the Web page:
<!DOCTYPEHTMLPUBLIC″-//W3C//DTDHTML4.01//EN″″http://www.w3.org/TR/html4/strict.dtd″>
<html><head><metahttp-equiv=″Content-Type″content=″text/html;charset=gb2312″><title>Alibaba</title>
</head>
<body><scriptlanguage=″JavaScript″type=″text/javascript″>
<!-
varr=Math.random();
document.write(″<imgalt=\″\″width=\″1\″height=\″1\″style=\″disphay:none\″″);
documentwrite(″src=\″http://dmtracking.alibaba.com/b.jpg?cD0x.JnU9ey93d3cuYWxpYmFiYS5jb20vfSZ
tPXtHRVR9JnM9ezIwMH0mcj17LX0mYT17LX0mYj17LX0mYz17LX0=&rand=″+r+″\″>″);
->
</script>
AlibabaAlibabaAlibabaAlibabaAlibabaAlibaba
</body></html>
Like this, static page that all are sent by Web server 21 or dynamic page all can be embedded into Tag and can not be omitted, and its corresponding visit information also can be sent to journal file server 22 by client 20.
Add tag identification code by Web server 21 assigned address at the page when sending the Web page, both kept the advantage that generates access log by label, prevented that again the part Web page from being omitted when embedded tags, thereby guaranteed journal file data analysis result's accuracy; Simultaneously,, no longer need the manual label that inserts, no longer need manual update routine code to call common template for dynamic page, thereby reduced workload significantly for static page.
Obviously, those skilled in the art can carry out various changes and modification to the present invention and not break away from the spirit and scope of the present invention.Like this, if of the present invention these are revised and modification belongs within the scope of claim of the present invention and equivalent technologies thereof, then the present invention also is intended to comprise these changes and modification interior.

Claims (9)

1. a method of adding label in the Web page is characterized in that, comprising:
Web server sends to the page of this client according to the acquisition request needs of client;
The code that described Web server scans the described page to be navigating to the assigned address that inserts label, and at this assigned address embedded tags code;
The page that described Web server will embed authentication code sends to described client.
2. the method for claim 1 is characterized in that, the page that described Web server obtains comprises static page and dynamic page.
3. method as claimed in claim 1 or 2 is characterized in that, described client sends to the journal file server according to this label with corresponding visit information after receiving the page of embedded tags.
4. method as claimed in claim 3 is characterized in that, described journal file server generates corresponding access log according to described visit information.
5. a Web server is characterized in that, comprises
Be used to receive the unit of the Web page browsing request that client sends;
Be used to obtain the unit of the page that need send to client;
The code that is used to scan the described page inserts the assigned address of label with the location, and in the unit of this assigned address embedded tags code;
Be used for sending the unit of the page that embeds authentication code to client.
6. Web server as claimed in claim 5 is characterized in that the page that described Web server obtains comprises static page and dynamic page.
7. a web station system is characterized in that, comprising:
Client is used for the page to the embedded tags of the request of Web server transmission Web page browsing, the transmission of reception Web server, and sends corresponding visit information according to described label to the journal file server;
Web server, be used for sending to the page of this client according to the acquisition request needs of client, the code that scans the described page to be navigating to the assigned address that inserts label and at this assigned address embedded tags code, and the page that will embed authentication code sends to described client.
8. web station system as claimed in claim 7 is characterized in that, also comprises:
The journal file server is used for generating corresponding access log according to the visit information that client sends.
9. web station system as claimed in claim 7 is characterized in that, described Web server comprises:
Be used to receive the unit of the Web page browsing request that client sends;
Be used to obtain the unit of the page that need send to client;
Be used to scan described page code and insert the assigned address of label with the location, and in the unit of this assigned address embedded tags code;
Be used for sending the unit of the page that embeds authentication code to client.
CN2006101272136A 2006-09-12 2006-09-12 A method and system for adding tags in Web pages Active CN101145936B (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN2006101272136A CN101145936B (en) 2006-09-12 2006-09-12 A method and system for adding tags in Web pages
HK08107741.9A HK1114486A1 (en) 2006-09-12 2008-07-14 A method for adding tags to web pages and system thereof

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2006101272136A CN101145936B (en) 2006-09-12 2006-09-12 A method and system for adding tags in Web pages

Publications (2)

Publication Number Publication Date
CN101145936A true CN101145936A (en) 2008-03-19
CN101145936B CN101145936B (en) 2012-04-18

Family

ID=39208250

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2006101272136A Active CN101145936B (en) 2006-09-12 2006-09-12 A method and system for adding tags in Web pages

Country Status (2)

Country Link
CN (1) CN101145936B (en)
HK (1) HK1114486A1 (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102999360A (en) * 2012-11-23 2013-03-27 奥维通信股份有限公司 Realization method for dynamic clock based on embedded WEB-OMT (object modeling technology) page
CN103684823A (en) * 2012-09-12 2014-03-26 阿里巴巴集团控股有限公司 Weblog recording method, network access path determining method and related devices
CN105119955A (en) * 2015-07-09 2015-12-02 中国科学院计算技术研究所 Method and system for supporting reading of multi-page directory in distributed file system
CN105337930A (en) * 2014-06-30 2016-02-17 北京新媒传信科技有限公司 Data processing method and apparatus

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CA2296140A1 (en) * 1999-01-28 2000-07-28 Lucent Technologies, Inc. Methods and apparatus for logging web user activity using client-side support
CN1519722A (en) * 2003-01-20 2004-08-11 鼎新电脑股份有限公司 System and method for processing events on java web page
CN100362788C (en) * 2004-08-31 2008-01-16 华为技术有限公司 Network server and data interacting method for network server and browser

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103684823A (en) * 2012-09-12 2014-03-26 阿里巴巴集团控股有限公司 Weblog recording method, network access path determining method and related devices
CN102999360A (en) * 2012-11-23 2013-03-27 奥维通信股份有限公司 Realization method for dynamic clock based on embedded WEB-OMT (object modeling technology) page
CN105337930A (en) * 2014-06-30 2016-02-17 北京新媒传信科技有限公司 Data processing method and apparatus
CN105337930B (en) * 2014-06-30 2019-02-19 北京新媒传信科技有限公司 The method and device that a kind of pair of data are handled
CN105119955A (en) * 2015-07-09 2015-12-02 中国科学院计算技术研究所 Method and system for supporting reading of multi-page directory in distributed file system

Also Published As

Publication number Publication date
CN101145936B (en) 2012-04-18
HK1114486A1 (en) 2008-10-31

Similar Documents

Publication Publication Date Title
CN101427229B (en) Technique for modifying presentation of information displayed to end users of a computer system
CN101211364B (en) Method and system for social bookmarking of resources exposed in web pages
US8065620B2 (en) System and method for defining and presenting a composite web page
US8423587B2 (en) System and method for real-time content aggregation and syndication
US8060518B2 (en) System and methodology for extraction and aggregation of data from dynamic content
US8260844B2 (en) Information messaging and collaboration system
CN100399290C (en) Gethering enriched server activity data of cached web content
CN101131747B (en) Method, device and system for catching and/or analyzing Web page events at client terminal
US20040254935A1 (en) Method and apparatus for automatic consolidation of personalized dynamic data
CN103685604B (en) A kind of domain name pre-parsed method and device
US20010039540A1 (en) Method and structure for dynamic conversion of data
EP1258819A2 (en) System and method for providing a file in multiple languages
US9311303B2 (en) Interpreted language translation system and method
KR20090016745A (en) Message catalogs for remote modules
US20030050969A1 (en) Information integration system
MX2009001640A (en) Enabling web analytics for interactive web applications.
CN101217507A (en) Method, device and system for providing and altering data on network page
CN101146040B (en) Analyzing web site traffic method and device
US8195762B2 (en) Locating a portion of data on a computer network
US7623741B2 (en) System and method for transferring scanned imaging data to a personal imaging repository
WO2002065359A1 (en) Electronic information management system
US20130019152A1 (en) Method and Apparatus for Tracking Exit Destinations of Web Page
CN101145936B (en) A method and system for adding tags in Web pages
US20020029257A1 (en) Method of aiding portal site making and method of providing services
KR20000049986A (en) Messaging service system and method for processing the same

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
REG Reference to a national code

Ref country code: HK

Ref legal event code: DE

Ref document number: 1114486

Country of ref document: HK

C14 Grant of patent or utility model
GR01 Patent grant
REG Reference to a national code

Ref country code: HK

Ref legal event code: GR

Ref document number: 1114486

Country of ref document: HK