CN110020339A - Web page data collection method and device based on codeless tracking - Google Patents
- Publication number
- CN110020339A (application CN201710708948.6A)
- Authority
- CN
- China
- Prior art keywords
- click
- target webpage
- data
- webpage
- webpage element
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/958—Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Information Transfer Between Computers (AREA)
Abstract
The invention discloses a web page data collection method and device based on codeless tracking (collection without manually embedded tracking points), and relates to the field of network technology. Its main purpose is to improve the accuracy of the collection results and the richness of the collected content when collecting web page data with codeless tracking. The method comprises: after receiving trigger information of a target web page element, obtaining click data of the target web page element; judging, according to a preset click event list, whether a click event corresponding to the target web page element exists, wherein the click event list records the click events corresponding to different web page elements; and if so, saving the click data to the record table corresponding to the target web page element. The invention is used for collecting web page data based on codeless tracking.
Description
Technical field
The present invention relates to the field of network technology, and in particular to a web page data collection method and device based on codeless tracking.
Background
With the rapid development of the Internet, codeless tracking has gradually become an important means for websites to collect user behavior. With this technique, a website's technical staff can define, at the web front end, a corresponding "click event" for the click behavior on a web page element, and determine how users click that element while visiting the page by counting the number of times the click event is triggered. This realizes the collection of web page data and lays the foundation for the subsequent analysis of user behavior. Because the collection results directly affect that analysis, collected data that is inaccurate or insufficiently rich will degrade it. For advertisers, enterprises, and webmasters who want to analyze user behavior well, it is therefore very important to collect the click conditions of web page elements well.
At present, programs such as GrowingIO and Heap are often used to collect data on a website. In actual use, however, GrowingIO defines events by the text or links in a web page and collects web page data according to how those events are triggered, so a change to the text or link often invalidates the defined event and affects the accuracy of the collection results. Take an advertising slot that operators frequently need to monitor as an example: a campaign is defined according to the link in the slot, and when the campaign goes offline the event must be redefined. Heap usually defines a web page element by its element path and collects the click conditions of the element on that basis, but the data collected this way is rather limited, which affects subsequent analysis. How to guarantee the accuracy of the collection results and the richness of the collected content during web page data collection has therefore become an urgent problem in the industry.
Summary of the invention
In view of the above problems, the present invention provides a web page data collection method and device based on codeless tracking, the main purpose of which is to improve the accuracy of the collection results and ensure the richness of the collected data when collecting web page data based on codeless tracking.
To solve the above technical problems, in a first aspect, the present invention provides a web page data collection method based on codeless tracking, the method comprising:
after receiving trigger information of a target web page element, obtaining click data of the target web page element;
judging, according to a preset click event list, whether a click event corresponding to the target web page element exists, wherein the click event list records the click events corresponding to different web page elements;
if so, saving the click data to the record table corresponding to the target web page element.
Optionally, obtaining the click data of the target web page element after receiving the trigger information of the target web page element comprises:
parsing the name of the target web page element from the trigger information of the target web page element;
obtaining, according to the name of the target web page element, the click log of the target web page element, the click log being used to record the click data of a web page element;
extracting the click data of the target web page element from the click log of the target web page element.
Optionally, the method further comprises:
deploying listener code in the web page, so that when the target web page element is triggered, the trigger information of the target web page element is obtained through the listener code.
Optionally, before obtaining the click log of the target web page element according to the name of the target web page element, the method further comprises:
deploying acquisition code in the web page, so that after the trigger information of the web page element is obtained, the click log of the web page element is obtained and sent by the acquisition code, the click log being generated when the web page element is triggered;
receiving the click log sent by the acquisition code.
Optionally, the click data of the target web page element includes one or more of the trigger time, the element path, the element text, the element link, and the Uniform Resource Locator (URL) of the web page.
Optionally, saving the click data to the record table corresponding to the target web page element comprises:
sending a save request to the user, so that the user sends a corresponding feedback instruction according to the save request;
determining, according to the content of the user's feedback instruction, the content to be saved from among the trigger time, element path, element text, element link, and web page URL in the click data of the target web page element;
saving the content to be saved to the record table corresponding to the target web page element.
In a second aspect, the present invention further provides a web page data collection device based on codeless tracking, the device comprising:
an acquiring unit, configured to obtain click data of a target web page element after receiving trigger information of the target web page element;
a judging unit, configured to judge, according to a preset click event list, whether a click event corresponding to the target web page element exists, wherein the click event list records the click events corresponding to different web page elements;
a storage unit, configured to save the click data to the record table corresponding to the target web page element if the judging unit judges, according to the preset click event list, that a click event corresponding to the target web page element exists.
Optionally, the acquiring unit comprises:
a parsing module, configured to parse the name of the target web page element from the trigger information of the target web page element;
an obtaining module, configured to obtain the click log of the target web page element according to the name of the target web page element, the click log being used to record the click data of a web page element;
an extraction module, configured to extract the click data of the target web page element from the click log of the target web page element.
Optionally, the device further comprises:
a deployment unit, configured to deploy listener code in the web page, so that when the target web page element is triggered, the trigger information of the target web page element is obtained through the listener code.
Optionally, the acquiring unit further comprises:
a deployment module, configured to deploy acquisition code in the web page, so that after the trigger information of the web page element is obtained, the click log of the web page element is obtained and sent by the acquisition code, the click log being generated when the web page element is triggered;
a receiving module, configured to receive the click log sent by the acquisition code.
Optionally, the click data of the target web page element includes one or more of the trigger time, the element path, the element text, the element link, and the Uniform Resource Locator (URL) of the web page.
Optionally, the storage unit comprises:
a sending module, configured to send a save request to the user, so that the user sends a corresponding feedback instruction according to the save request;
a determining module, configured to determine, according to the content of the user's feedback instruction, the content to be saved from among the trigger time, element path, element text, element link, and web page URL in the click data of the target web page element;
a saving module, configured to save the content determined by the determining module to the record table corresponding to the target web page element.
To achieve the above object, according to a third aspect of the present invention, a storage medium is provided. The storage medium includes a stored program, wherein, when the program runs, the device on which the storage medium resides is controlled to execute the web page data collection method based on codeless tracking described above.
To achieve the above object, according to a fourth aspect of the present invention, a processor is provided. The processor is configured to run a program, wherein the program, when running, executes the web page data collection method based on codeless tracking described above.
With the above technical solution, the web page data collection method and device based on codeless tracking provided by the present invention address the problems in the prior art that, when web page data is collected with codeless tracking, the accuracy of the collection results is low and the collected content is limited. After receiving the trigger information of a web page element, the present invention obtains the click log of the web page element and extracts the click data of the element from that log. When it is determined that a click event corresponding to the element exists in the click event list of the web page, the extracted click data is saved to the corresponding record table. The click data of the elements covered by the click event list is thereby collected, realizing web page data collection based on codeless tracking. Because the click event list of the web page is used to determine whether a click event corresponding to the web page element exists, click data is collected only for elements with defined events. Collection is therefore targeted, which ensures the accuracy of the collection results. Meanwhile, because collection is based on the click data obtained for the target web page element, the data produced when a web page element is clicked can be captured more comprehensively, which makes the collected content richer.
The above is only an overview of the technical solution of the present invention. So that the technical means of the invention can be understood more clearly and implemented according to the specification, and so that the above and other objects, features, and advantages of the invention are more readily apparent, specific embodiments of the invention are set out below.
Brief description of the drawings
Various other advantages and benefits will become clear to those of ordinary skill in the art from reading the following detailed description of the preferred embodiments. The drawings serve only to illustrate the preferred embodiments and are not to be considered limiting of the invention. Throughout the drawings, the same reference numerals denote the same parts. In the drawings:
Fig. 1 shows a flow chart of a web page data collection method based on codeless tracking provided by an embodiment of the present invention;
Fig. 2 shows a flow chart of another web page data collection method based on codeless tracking provided by an embodiment of the present invention;
Fig. 3 shows a block diagram of a web page data collection device based on codeless tracking provided by an embodiment of the present invention;
Fig. 4 shows a block diagram of another web page data collection device based on codeless tracking provided by an embodiment of the present invention.
Detailed description of the embodiments
Exemplary embodiments of the present invention are described in more detail below with reference to the accompanying drawings. Although the drawings show exemplary embodiments of the invention, it should be understood that the invention may be implemented in various forms and should not be limited by the embodiments set forth here. Rather, these embodiments are provided so that the invention can be understood thoroughly and its scope conveyed fully to those skilled in the art.
To improve the accuracy and richness of web page data collection based on codeless tracking, an embodiment of the present invention provides a web page data collection method based on codeless tracking. As shown in Fig. 1, the method comprises:
101. After receiving the trigger information of a target web page element, obtain the click data of the target web page element.
In general, when visiting a website page a user performs operations on the page, such as dragging and clicking. Each such operation acts on a corresponding web page element. A web page element can be understood as an element that makes up the web page, and each page contains a large number of them. Specifically, a web page element may be one or more of text, a picture, audio, animation, video, a link, and so on. For some elements, the page responds after an operation is performed on them; for example, clicking a link element causes a page jump. When such an operable element is triggered, the system generates a click log for that element locally, which records data related to the trigger, such as the trigger time and the number of triggers. In addition, the click data of the target web page element may be obtained by placing a data collection script in the target web page and using configured parameters to adjust how the script collects click data, which categories of data it obtains, and how much data it collects. Alternatively, through later integration, data shared by users may be obtained from a shared database, and the click data and related information for the target web page element filtered out of it. The way click data is obtained can be chosen according to actual needs and is not limited here.
Since the click conditions of web page elements are important data to be collected in codeless tracking, this step may specifically proceed as follows. When a user click is detected on the page, the web page element on which the click acts is identified, that is, the target web page element of this embodiment, and the path information of the element is obtained for subsequent element identification. Meanwhile, the click log corresponding to the element is looked up in the element's local system and obtained once found. Note that, after the trigger information of a web page element is received, the click log can be obtained by deploying acquisition code in the target web page, by using a script plug-in with the same function, or by embedding corresponding function instructions or programs in the code of the target web page. The approach can be chosen as needed and is not limited here.
102. Judge, according to the preset click event list, whether a click event corresponding to the target web page element exists.
At present, when data is collected with codeless tracking, an event is defined for each element to be collected, and how often the element is triggered is counted from the number of events. The embodiment of the present invention uses the same approach, so this step needs to judge whether the triggered element has a defined event. Specifically, the judgment is made through the click event list. The preset click event list in this embodiment is a list that records the click events corresponding to different elements of the web page. If the list contains a click event for an element, the event of clicking that element has been defined and clicks on the element need to be collected. If the list contains no click event for the element, the event of clicking that element has not been defined and clicks on the element do not need to be collected.
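The judgment in step 102 can be illustrated with a minimal sketch. It assumes the click event list is held as a mapping from element path to event name; the mapping, the paths, the event names, and the function names are all hypothetical and are not specified by the embodiment.

```python
# Hypothetical click event list: element path -> name of the defined click event.
CLICK_EVENT_LIST = {
    "body > div#banner > a.promo": "banner_promo_click",
    "body > nav > button#signup": "signup_button_click",
}


def find_click_event(element_path, event_list=CLICK_EVENT_LIST):
    """Return the click event defined for the element, or None if none is defined."""
    return event_list.get(element_path)


def should_collect(element_path, event_list=CLICK_EVENT_LIST):
    """Clicks are collected only for elements that have a defined click event."""
    return find_click_event(element_path, event_list) is not None
```

Under these assumptions, a click on an element whose path is absent from the list is simply ignored, which is what keeps the collection targeted.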
103. If it is judged that a click event corresponding to the target web page element exists, save the click data to the record table corresponding to the target web page element.
If step 102 determines that a click event corresponding to the target web page element exists, the click event of the element is a defined event and its click conditions need to be collected. In this step, the click data of the target web page element is therefore stored in the corresponding record table, so that the click conditions of the element can later be analyzed from the click data in the record table.
The web page data collection method based on codeless tracking provided by this embodiment addresses the low accuracy of collection results in prior-art codeless tracking. By using the preset click event list to determine whether a click event corresponding to the target web page element exists, the invention collects click data only for target web page elements with defined events. Collection is therefore targeted, which ensures the accuracy of the collection results. Meanwhile, because collection is based on the click data obtained for the target web page element, the data produced when a web page element is clicked can be captured more comprehensively, which makes the collected content richer.
Further, as a refinement and extension of the embodiment shown in Fig. 1, an embodiment of the present invention also provides another web page data collection method based on codeless tracking. As shown in Fig. 2, its specific steps include:
201. Deploy listener code in the web page, so that when the target web page element is triggered, the trigger information of the target web page element is obtained through the listener code.
In this step, deploying listener code in the web page ensures that the trigger information of a web page element can be obtained when the element is triggered. The listener code may be a web crawler or other listener code, selected as needed and not limited here. Whatever is selected must guarantee that the trigger information of a web page element is obtained promptly when the element is triggered. Deploying the listener code thus allows the web page to be monitored and trigger information to be obtained as soon as an element is triggered, which realizes timely acquisition of trigger information and real-time monitoring.
202. After receiving the trigger information of the target web page element, obtain the click data of the target web page element.
In this embodiment, the descriptions of the target web page element and of the click log of a web page element are the same as in step 101 of the preceding embodiment and are not repeated here.
In this step, the name of the web page element can first be parsed from the trigger information of the target web page element. The click log corresponding to that name is then obtained from the local system, and finally the click data of the target web page element is extracted from the click log. Besides the element name, the click log can also be obtained by an identifier of the web page element. Determining the click log by name, as described in this step, is only a preferred embodiment, and other ways may be chosen as needed.
Specifically, the click log of a web page element can also be obtained by deploying acquisition code in the web page. When the trigger information of a web page element is received, the acquisition code is activated, and it obtains and sends the click log of the element. The click log sent by the acquisition code is then received, which completes the acquisition of the click log. The acquisition code may be a web crawler or other code and is not limited here. In addition, the click log in this step may be a log generated for the current operation when the element is triggered, or a log accumulated by continuous recording over multiple operations, which is not limited here.
With the method of this step, obtaining the click log by the name of the web page element ensures that the correct click log is acquired, which in turn ensures the accuracy of the collection results of web page data collection based on codeless tracking.
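The three sub-steps of step 202 (parse the element name, obtain the click log by name, extract the click data) can be sketched as follows. The sketch assumes the click logs are stored locally as JSON lines keyed by element name and that the trigger information carries an `element_name` field; the store, the field names, and the sample entry are hypothetical.

```python
import json

# Hypothetical local log store: element name -> list of raw JSON log lines.
LOCAL_CLICK_LOGS = {
    "buy_button": [
        '{"trigger_time": "2017-08-17T10:00:00", '
        '"element_path": "body > button#buy", '
        '"element_text": "Buy now", "user_agent": "UA-1"}',
    ],
}

# The click-data types named in the embodiment, under assumed field names.
CLICK_FIELDS = ("trigger_time", "element_path", "element_text",
                "element_link", "page_url")


def parse_element_name(trigger_info):
    """Sub-step 1: parse the element name out of the trigger information."""
    return trigger_info["element_name"]


def get_click_log(element_name, store=LOCAL_CLICK_LOGS):
    """Sub-step 2: fetch the click log recorded under that element name."""
    return store.get(element_name, [])


def extract_click_data(log_lines):
    """Sub-step 3: keep only the click-data fields from each log entry."""
    records = []
    for line in log_lines:
        entry = json.loads(line)
        records.append({k: entry[k] for k in CLICK_FIELDS if k in entry})
    return records
```

Other data in the log, such as the assumed `user_agent` field, is dropped during extraction because the log may contain many types of data while only the click data is of interest here.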
Further, the click log of a web page contains many types of data, and what matters most for this embodiment is the click data of the web page element. The click data of a web page element may include one or more of the trigger time, the element path, the element text, the element link, and the Uniform Resource Locator (URL) of the web page. A URL is a character string that represents the location of an Internet resource and how to access it. It can be understood as the standard address of a resource on the Internet, and each file on the Internet has a unique URL.
Note that the specific extraction method can be chosen from the prior art according to the actual situation, provided that it ensures the accuracy of the extracted click data and is able to extract the trigger time, element path, element text, element link, and web page URL from the click data.
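As an illustration of the click data described above, the following sketch models one click record with the five named data types. The class name, the field names, and the choice to make every field optional (reflecting "one or more of") are assumptions made for this example only.

```python
from dataclasses import asdict, dataclass
from typing import Optional


@dataclass
class ClickData:
    """One click record; any subset of the five data types may be present."""
    trigger_time: Optional[str] = None
    element_path: Optional[str] = None
    element_text: Optional[str] = None
    element_link: Optional[str] = None
    page_url: Optional[str] = None

    def present_fields(self):
        """Return only the data types that were actually captured."""
        return {k: v for k, v in asdict(self).items() if v is not None}
```

A record built as `ClickData(trigger_time="2017-08-17T10:00:00", element_text="Home")` would then report only those two data types from `present_fields()`.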
203. Judge, according to the preset click event list, whether a click event corresponding to the target web page element exists.
The click event list in this step is the same as in step 102 of the preceding embodiment and is not described again here.
In this step, it is judged whether the event of clicking the target web page element is present in the click event list of the web page. If it is present, the click conditions of the element need to be collected; if it is not, they do not.
204. If it is judged that a click event corresponding to the target web page element exists, save the click data to the record table corresponding to the target web page element.
If the judgment in step 203 finds that the click event of the web page element is present in the click event list of the web page, the click conditions of the element need to be counted and collected. In summary, this step needs to save the click data of the element to the corresponding record table.
Further, this step may specifically be carried out as follows:
First, a save request is sent to the user, so that the user sends a corresponding feedback instruction according to the save request. The save request may contain the name of the element and the types of data in its click data, such as one or more of the trigger time, element path, element text, element link, and web page URL. The specific types and number depend on the actual situation and are not limited here. The request lets the user choose as needed which data to save.
Then, according to the content of the user's feedback instruction, the content to be saved is determined from among the trigger time, element path, element text, element link, and web page URL in the click data of the target web page element. Because the user's feedback instruction is determined from the click data types offered in the save request, the instruction can indicate which types of click data are to be saved.
Finally, the content to be saved is saved to the record table corresponding to the web page element.
The method of this step is illustrated below with an application scenario:
Suppose the click event list of the web page contains a click event corresponding to web page element A. The click conditions of element A therefore need to be collected, that is, element A is a target web page element. Suppose also that the click data of element A is found to include four types of click data: trigger time, element text, element link, and web page URL. A save request is first sent to the user, containing the name of element A and these four types of click data. The user then decides to save three of them (trigger time, element text, and web page URL) and returns the corresponding save instruction. According to the three click data types specified in the save instruction, the trigger time, element text, and web page URL are then stored in the record table corresponding to element A.
With the method of this step, obtaining a save instruction from the user allows the types and amount of click data to be saved to be controlled at the front end, so that click data can be saved according to the user's needs. This solves the earlier problem that saving click data was inflexible, and improves the flexibility of the web page data collection method based on codeless tracking.
Further, as an implementation of the method shown in Fig. 1 above, an embodiment of the present invention also provides a non-buried-point webpage data acquisition device for implementing the method shown in Fig. 1. This device embodiment corresponds to the foregoing method embodiment. For ease of reading, this device embodiment does not repeat the details of the method embodiment one by one, but it should be clear that the device in this embodiment can implement everything in the foregoing method embodiment. As shown in Fig. 3, the device includes an acquiring unit 31, a judging unit 32, and a storage unit 33, wherein:
The acquiring unit 31 may be used to obtain the click data of a target webpage element after the trigger information of the target webpage element is received.
The judging unit 32 may be used to judge, according to a preset click event list, whether a click event corresponding to the target webpage element exists, wherein the click event list records the click events corresponding to different webpage elements.
The storage unit 33 may be used to save the click data obtained by the acquiring unit 31 into the record table corresponding to the target webpage element if the judging unit 32 judges, according to the preset click event list, that a click event corresponding to the target webpage element exists.
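The cooperation of the three units can be sketched as below. This is a minimal sketch under stated assumptions: the class name `Collector`, the dictionary form of the trigger information, and the in-memory record tables are hypothetical and are not taken from the patent.

```python
from dataclasses import dataclass, field


@dataclass
class Collector:
    # Preset click event list: element name -> click event name (assumed shape).
    click_event_list: dict[str, str]
    # Record tables: element name -> list of saved click data rows.
    record_tables: dict[str, list[dict]] = field(default_factory=dict)

    def acquire(self, trigger_info: dict) -> dict:
        """Acquiring unit: here the click data is assumed to travel with the trigger info."""
        return dict(trigger_info["click_data"])

    def has_click_event(self, element_name: str) -> bool:
        """Judging unit: is a click event defined for this element?"""
        return element_name in self.click_event_list

    def handle(self, trigger_info: dict) -> bool:
        """Run acquire -> judge -> store; return True if the data was saved."""
        click_data = self.acquire(trigger_info)
        element_name = trigger_info["element_name"]
        if not self.has_click_event(element_name):
            return False  # no click event defined, so nothing is saved
        # Storage unit: append to the record table of the target element.
        self.record_tables.setdefault(element_name, []).append(click_data)
        return True


collector = Collector(click_event_list={"buy_button": "purchase_click"})
stored = collector.handle(
    {"element_name": "buy_button", "click_data": {"element_text": "Buy now"}}
)
ignored = collector.handle(
    {"element_name": "footer_logo", "click_data": {"element_text": "Logo"}}
)
```

Elements without an entry in the click event list are ignored, which is the targeted collection the text describes.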
Further, as an implementation of the method shown in Fig. 2 above, an embodiment of the present invention also provides another non-buried-point webpage data acquisition device for implementing the method shown in Fig. 2. This device embodiment corresponds to the foregoing method embodiment. For ease of reading, this device embodiment does not repeat the details of the method embodiment one by one, but it should be clear that the device in this embodiment can implement everything in the foregoing method embodiment.
The non-buried-point webpage data acquisition device described in the embodiments of the present invention may be deployed on the server side where the website pages are hosted, on the user terminal, or on a separate third-party server connected to both the server side and the user terminal. No specific limitation is imposed here; the device can be deployed as needed.
The other non-buried-point webpage data acquisition device provided in this embodiment may be as shown in Fig. 4. The device includes an acquiring unit 41, a judging unit 42, and a storage unit 43, wherein:
The acquiring unit 41 may be used to obtain the click data of a target webpage element after the trigger information of the target webpage element is received.
The judging unit 42 may be used to judge, according to a preset click event list, whether a click event corresponding to the target webpage element exists, wherein the click event list records the click events corresponding to different webpage elements.
The storage unit 43 may be used to save the click data obtained by the acquiring unit 41 into the record table corresponding to the target webpage element if the judging unit 42 judges, according to the preset click event list, that a click event corresponding to the target webpage element exists.
Further, the acquiring unit 41 includes:
A parsing module 411, which may be used to parse the name of the target webpage element from the trigger information of the target webpage element.
An obtaining module 412, which may be used to obtain the click log of the target webpage element according to the name parsed by the parsing module 411, the click log being used to record the click data of webpage elements.
An extraction module 413, which may be used to extract the click data of the target webpage element from the click log obtained by the obtaining module 412.
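The three modules of the acquiring unit can be sketched as a small chain of functions. The formats used here are assumptions chosen only to make the example concrete: the trigger information is taken to be a `key=value;key=value` string and each click log a JSON string keyed by element name. The patent does not specify either format.

```python
import json


def parse_element_name(trigger_info: str) -> str:
    """Parsing module: pull the element name out of the trigger information."""
    pairs = dict(
        part.split("=", 1) for part in trigger_info.split(";") if "=" in part
    )
    return pairs["element"]


def get_click_log(click_logs: dict[str, str], element_name: str) -> str:
    """Obtaining module: look up the click log by element name."""
    return click_logs[element_name]


def extract_click_data(click_log: str) -> dict:
    """Extraction module: decode the click data recorded in the log."""
    return json.loads(click_log)


# Hypothetical click logs, keyed by element name.
click_logs = {
    "buy_button": '{"trigger_time": "2017-08-17T10:00:00", "element_text": "Buy now"}',
}

name = parse_element_name("element=buy_button;event=click")
data = extract_click_data(get_click_log(click_logs, name))
```

Looking the log up by element name is what ties the extracted click data to the element that was actually triggered.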
Further, the device also includes:
A deployment unit 44, which may be used to deploy monitoring code in the webpage, so that when the target webpage element is triggered, the trigger information of the target webpage element is obtained by the monitoring code and sent to the acquiring unit 41.
Further, the acquiring unit 41 also includes:
A deployment module 414, which may be used to deploy acquisition code in the webpage, so that after the trigger information of the webpage element is obtained, the acquisition code obtains and sends the click log of the webpage element, the click log being generated when the webpage element is triggered.
A receiving module 415, which may be used to receive the click log sent by the acquisition code.
Further, the click data of the target webpage element includes one or more of the trigger time, the element path, the element text, the element link, and the uniform resource locator (URL) of the webpage.
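One way to model "one or more of" these five items is a record type in which every field is optional. The sketch below is illustrative only; the class name, field names, and sample values are assumptions.

```python
from dataclasses import asdict, dataclass
from typing import Optional


@dataclass
class ClickData:
    # Each field is optional because the click data may carry any subset.
    trigger_time: Optional[str] = None
    element_path: Optional[str] = None
    element_text: Optional[str] = None
    element_link: Optional[str] = None
    page_url: Optional[str] = None

    def to_record(self) -> dict[str, str]:
        """Return only the fields that are actually present."""
        return {key: value for key, value in asdict(self).items()
                if value is not None}


sample = ClickData(
    trigger_time="2017-08-17T10:00:00",
    element_text="Buy now",
    page_url="https://example.com/item",
)
record = sample.to_record()
```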
Further, the storage unit 43 includes:
An issuing module 431, which may be used to send a save request to the user, so that the user sends a corresponding feedback instruction according to the save request.
A determining module 432, which may be used to determine, according to the content of the user's feedback instruction, the content to be saved from among the trigger time, element path, element text, element link, and uniform resource locator (URL) of the webpage in the click data of the target webpage element.
A saving module 433, which may be used to save the content determined by the determining module 432 into the record table corresponding to the target webpage element.
Through the above technical solution, the embodiments of the present invention provide a non-buried-point webpage data acquisition method and device. To address the low accuracy of collection results in prior-art non-buried-point webpage data acquisition, the present invention uses the click event list of the webpage to determine whether a click event corresponding to the target webpage element exists. Click data is thus collected only for target webpage elements for which an event has been defined, so the click data corresponding to an element is collected in a targeted manner, which ensures the accuracy of the collection results. In addition, extracting the click data of a webpage element from its click log captures the data generated when the element is clicked more comprehensively, which enriches the collected content. Obtaining the click log by the name of the webpage element ensures that the correct click log is obtained, which in turn ensures the accuracy of the collection results. Furthermore, by obtaining a save instruction from the user, the front end can control which types of click data content are saved and how many, so the corresponding click data content can be saved according to the user's needs; this solves the inflexibility of earlier approaches to saving click data and improves the flexibility of the method. Finally, deploying monitoring code in the webpage makes it possible to monitor the webpage: when a webpage element is triggered, the trigger information can be obtained promptly, which achieves timely acquisition of the trigger information and real-time monitoring.
The non-buried-point webpage data acquisition device includes a processor and a memory. The above acquiring unit, extraction unit, judging unit, storage unit, and so on are stored in the memory as program units, and the processor executes these program units stored in the memory to realize the corresponding functions.
The processor contains a kernel, which fetches the corresponding program unit from the memory. One or more kernels may be provided, and the accuracy and richness of non-buried-point webpage data acquisition are improved by adjusting kernel parameters.
The memory may include non-permanent memory in computer-readable media, random access memory (RAM), and/or non-volatile memory such as read-only memory (ROM) or flash memory (flash RAM); the memory includes at least one memory chip.
An embodiment of the present invention provides a storage medium on which a program is stored; when the program is executed by a processor, the non-buried-point webpage data acquisition method is implemented.
An embodiment of the present invention provides a processor for running a program, wherein the program, when running, performs the non-buried-point webpage data acquisition method.
An embodiment of the present invention provides a device comprising a processor, a memory, and a program stored in the memory and executable on the processor. When executing the program, the processor performs the following steps: after receiving the trigger information of a target webpage element, obtaining the click data of the target webpage element;
judging, according to a preset click event list, whether a click event corresponding to the target webpage element exists, wherein the click event list records the click events corresponding to different webpage elements;
if so, saving the click data into the record table corresponding to the target webpage element.
Further, obtaining the click data of the target webpage element after receiving the trigger information of the target webpage element includes:
parsing the name of the target webpage element from the trigger information of the target webpage element;
obtaining the click log of the target webpage element according to the name of the target webpage element, the click log being used to record the click data of webpage elements;
extracting the click data of the target webpage element from the click log of the target webpage element.
Further, the method also includes:
deploying monitoring code in the webpage, so that when the target webpage element is triggered, the trigger information of the target webpage element is obtained by the monitoring code.
Further, before obtaining the click log of the target webpage element according to the name of the target webpage element, the method also includes:
deploying acquisition code in the webpage, so that after the trigger information of the webpage element is obtained, the acquisition code obtains and sends the click log of the webpage element, the click log being generated when the webpage element is triggered;
receiving the click log sent by the acquisition code.
Further, the click data of the target webpage element includes one or more of the trigger time, the element path, the element text, the element link, and the uniform resource locator (URL) of the webpage.
Further, saving the click data into the record table corresponding to the target webpage element includes:
sending a save request to the user, so that the user sends a corresponding feedback instruction according to the save request;
determining, according to the content of the user's feedback instruction, the content to be saved from among the trigger time, element path, element text, element link, and uniform resource locator (URL) of the webpage in the click data of the target webpage element;
saving the determined content into the record table corresponding to the target webpage element.
The device in the embodiments of the present invention may be a server, a PC, a PAD, a mobile phone, or the like.
An embodiment of the present invention also provides a computer program product which, when executed on a data processing device, is adapted to execute a program initialized with the following method steps: after receiving the trigger information of a target webpage element, obtaining the click data of the target webpage element;
judging, according to a preset click event list, whether a click event corresponding to the target webpage element exists, wherein the click event list records the click events corresponding to different webpage elements;
if so, saving the click data into the record table corresponding to the target webpage element.
Further, obtaining the click data of the target webpage element after receiving the trigger information of the target webpage element includes:
parsing the name of the target webpage element from the trigger information of the target webpage element;
obtaining the click log of the target webpage element according to the name of the target webpage element, the click log being used to record the click data of webpage elements;
extracting the click data of the target webpage element from the click log of the target webpage element.
Further, the method also includes:
deploying monitoring code in the webpage, so that when the target webpage element is triggered, the trigger information of the target webpage element is obtained by the monitoring code.
Further, before obtaining the click log of the target webpage element according to the name of the target webpage element, the method also includes:
deploying acquisition code in the webpage, so that after the trigger information of the webpage element is obtained, the acquisition code obtains and sends the click log of the webpage element, the click log being generated when the webpage element is triggered;
receiving the click log sent by the acquisition code.
Further, the click data of the target webpage element includes one or more of the trigger time, the element path, the element text, the element link, and the uniform resource locator (URL) of the webpage.
Further, saving the click data into the record table corresponding to the target webpage element includes:
sending a save request to the user, so that the user sends a corresponding feedback instruction according to the save request;
determining, according to the content of the user's feedback instruction, the content to be saved from among the trigger time, element path, element text, element link, and uniform resource locator (URL) of the webpage in the click data of the target webpage element;
saving the determined content into the record table corresponding to the target webpage element.
Those skilled in the art should understand that the embodiments of the present application may be provided as a method, a system, or a computer program product. Therefore, the present application may take the form of an entirely hardware embodiment, an entirely software embodiment, or an embodiment combining software and hardware. Moreover, the present application may take the form of a computer program product implemented on one or more computer-usable storage media (including but not limited to magnetic disk storage, CD-ROM, optical storage, etc.) containing computer-usable program code.
The present application is described with reference to flowcharts and/or block diagrams of the method, device (system), and computer program product according to the embodiments of the present application. It should be understood that each flow and/or block in the flowcharts and/or block diagrams, and combinations of flows and/or blocks in the flowcharts and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to the processor of a general-purpose computer, special-purpose computer, embedded processor, or other programmable data processing device to produce a machine, so that the instructions executed by the processor of the computer or other programmable data processing device produce means for implementing the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams.
These computer program instructions may also be stored in a computer-readable memory capable of directing a computer or other programmable data processing device to operate in a particular manner, so that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means, the instruction means implementing the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams.
These computer program instructions may also be loaded onto a computer or other programmable data processing device, so that a series of operational steps are performed on the computer or other programmable device to produce computer-implemented processing, whereby the instructions executed on the computer or other programmable device provide steps for implementing the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams.
In a typical configuration, a computing device includes one or more processors (CPUs), input/output interfaces, network interfaces, and memory.
The memory may include non-permanent memory in computer-readable media, random access memory (RAM), and/or non-volatile memory, such as read-only memory (ROM) or flash memory (flash RAM). Memory is an example of a computer-readable medium.
Computer-readable media include permanent and non-permanent, removable and non-removable media, which can store information by any method or technology. The information may be computer-readable instructions, data structures, program modules, or other data. Examples of computer storage media include, but are not limited to, phase-change memory (PRAM), static random access memory (SRAM), dynamic random access memory (DRAM), other types of random access memory (RAM), read-only memory (ROM), electrically erasable programmable read-only memory (EEPROM), flash memory or other memory technologies, compact disc read-only memory (CD-ROM), digital versatile disc (DVD) or other optical storage, magnetic cassettes, magnetic tape and disk storage or other magnetic storage devices, or any other non-transmission medium that can be used to store information accessible by a computing device. As defined herein, computer-readable media do not include transitory computer-readable media (transitory media), such as modulated data signals and carrier waves.
It should also be noted that the terms "include", "comprise", or any other variant thereof are intended to cover non-exclusive inclusion, so that a process, method, commodity, or device that includes a series of elements includes not only those elements but also other elements not explicitly listed, or elements inherent to such a process, method, commodity, or device. Without further limitation, an element defined by the phrase "including a ..." does not exclude the existence of other identical elements in the process, method, commodity, or device that includes the element.
The above are merely embodiments of the present application and are not intended to limit the present application. For those skilled in the art, various modifications and changes may be made to the present application. Any modification, equivalent replacement, improvement, etc. made within the spirit and principles of the present application shall fall within the scope of the claims of the present application.
Claims (10)
1. A non-buried-point webpage data acquisition method, characterized by comprising:
after receiving the trigger information of a target webpage element, obtaining the click data of the target webpage element;
judging, according to a preset click event list, whether a click event corresponding to the target webpage element exists, wherein the click event list records the click events corresponding to different webpage elements;
if so, saving the click data into the record table corresponding to the target webpage element.
2. The method according to claim 1, characterized in that obtaining the click data of the target webpage element after receiving the trigger information of the target webpage element comprises:
parsing the name of the target webpage element from the trigger information of the target webpage element;
obtaining the click log of the target webpage element according to the name of the target webpage element, the click log being used to record the click data of webpage elements;
extracting the click data of the target webpage element from the click log of the target webpage element.
3. The method according to claim 1, characterized in that the method further comprises:
deploying monitoring code in the webpage, so that when the target webpage element is triggered, the trigger information of the target webpage element is obtained by the monitoring code.
4. The method according to claim 2, characterized in that, before obtaining the click log of the target webpage element according to the name of the target webpage element, the method further comprises:
deploying acquisition code in the webpage, so that after the trigger information of the webpage element is obtained, the acquisition code obtains and sends the click log of the webpage element, the click log being generated when the webpage element is triggered;
receiving the click log sent by the acquisition code.
5. The method according to any one of claims 1 to 4, characterized in that the click data of the target webpage element includes one or more of the trigger time, the element path, the element text, the element link, and the uniform resource locator (URL) of the webpage.
6. The method according to claim 5, characterized in that saving the click data into the record table corresponding to the target webpage element comprises:
sending a save request to the user, so that the user sends a corresponding feedback instruction according to the save request;
determining, according to the content of the user's feedback instruction, the content to be saved from among the trigger time, element path, element text, element link, and uniform resource locator (URL) of the webpage in the click data of the target webpage element;
saving the determined content into the record table corresponding to the target webpage element.
7. A non-buried-point webpage data acquisition device, characterized by comprising:
an acquiring unit, for obtaining the click data of a target webpage element after the trigger information of the target webpage element is received;
a judging unit, for judging, according to a preset click event list, whether a click event corresponding to the target webpage element exists, wherein the click event list records the click events corresponding to different webpage elements;
a storage unit, for saving the click data obtained by the acquiring unit into the record table corresponding to the target webpage element if the judging unit judges, according to the preset click event list, that a click event corresponding to the target webpage element exists.
8. The device according to claim 7, characterized in that the acquiring unit comprises:
a parsing module, for parsing the name of the target webpage element from the trigger information of the target webpage element;
an obtaining module, for obtaining the click log of the target webpage element according to the name of the target webpage element, the click log being used to record the click data of webpage elements;
an extraction module, for extracting the click data of the target webpage element from the click log obtained by the obtaining module.
9. A storage medium, characterized in that the storage medium includes a stored program, wherein, when the program runs, the device on which the storage medium is located is controlled to perform the non-buried-point webpage data acquisition method according to any one of claims 1 to 6.
10. A processor, characterized in that the processor is used to run a program, wherein the program, when running, performs the non-buried-point webpage data acquisition method according to any one of claims 1 to 6.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710708948.6A CN110020339B (en) | 2017-08-17 | 2017-08-17 | Webpage data acquisition method and device based on non-buried point |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710708948.6A CN110020339B (en) | 2017-08-17 | 2017-08-17 | Webpage data acquisition method and device based on non-buried point |
Publications (2)
Publication Number | Publication Date |
---|---|
CN110020339A true CN110020339A (en) | 2019-07-16 |
CN110020339B CN110020339B (en) | 2022-03-18 |
Family
ID=67186096
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710708948.6A Active CN110020339B (en) | 2017-08-17 | 2017-08-17 | Webpage data acquisition method and device based on non-buried point |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110020339B (en) |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110674426A (en) * | 2019-08-30 | 2020-01-10 | 腾讯科技(深圳)有限公司 | Webpage behavior reporting method and device |
CN111274574A (en) * | 2020-01-16 | 2020-06-12 | 恩亿科(北京)数据科技有限公司 | Webpage event anti-shaking method and device, server and computer readable storage medium |
CN111523064A (en) * | 2020-04-16 | 2020-08-11 | 山东贝赛信息科技有限公司 | Webpage collection technology based on Jxbrowser |
CN111581069A (en) * | 2020-04-30 | 2020-08-25 | 北京三快在线科技有限公司 | Data processing method and device |
CN112199263A (en) * | 2020-09-30 | 2021-01-08 | 北京字节跳动网络技术有限公司 | Method, device, equipment and medium for recording page |
CN112799946A (en) * | 2021-01-29 | 2021-05-14 | 长沙市到家悠享网络科技有限公司 | Method, equipment and storage medium for embedding points and collecting data |
CN114036426A (en) * | 2021-11-25 | 2022-02-11 | 深圳视界信息技术有限公司 | Webpage data acquisition method, device, equipment and medium |
Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR100835905B1 (en) * | 2007-04-02 | 2008-06-19 | (주)비즈스프링 | Apparatus for visualizing website visitor's click distribution in webpage and method using the same |
CN103246661A (en) * | 2012-02-07 | 2013-08-14 | 阿里巴巴集团控股有限公司 | Visual user behavior collecting system and method |
CN103309884A (en) * | 2012-03-13 | 2013-09-18 | 阿里巴巴集团控股有限公司 | User behavior data collecting method and system |
CN103631699A (en) * | 2012-08-28 | 2014-03-12 | 纽海信息技术(上海)有限公司 | Log management system and method for log monitoring, acquiring and querying |
CN105975599A (en) * | 2016-05-11 | 2016-09-28 | 北京京东尚博广益投资管理有限公司 | Method and device monitoring website page event tracking |
CN106571949A (en) * | 2016-09-23 | 2017-04-19 | 北京五八信息技术有限公司 | Event tracking point processing method and apparatus |
CN106933722A (en) * | 2017-03-06 | 2017-07-07 | 腾云天宇科技(北京)有限公司 | A kind of web application monitoring method, server and system |
CN106933472A (en) * | 2017-05-20 | 2017-07-07 | 南京西桥科技有限公司 | A kind of user behavior data acquisition system and its control method based on mobile phone A PP |
CN107018046A (en) * | 2017-06-06 | 2017-08-04 | 上海鋆创信息技术有限公司 | A kind of collecting method, device, terminal and storage medium |
Non-Patent Citations (1)
Title |
---|
宋星: ""无埋点实现监测的真相——革新还是噱头? _ 互联网分析在中国——从基础到前沿"", 《无埋点实现监测的真相——革新还是噱头? _ 互联网分析在中国——从基础到前沿》 * |
Cited By (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110674426A (en) * | 2019-08-30 | 2020-01-10 | 腾讯科技(深圳)有限公司 | Webpage behavior reporting method and device |
CN110674426B (en) * | 2019-08-30 | 2024-03-22 | 腾讯科技(深圳)有限公司 | Webpage behavior reporting method and device |
CN111274574A (en) * | 2020-01-16 | 2020-06-12 | 恩亿科(北京)数据科技有限公司 | Webpage event anti-shaking method and device, server and computer readable storage medium |
CN111523064A (en) * | 2020-04-16 | 2020-08-11 | 山东贝赛信息科技有限公司 | Webpage collection technology based on Jxbrowser |
CN111581069A (en) * | 2020-04-30 | 2020-08-25 | 北京三快在线科技有限公司 | Data processing method and device |
CN112199263A (en) * | 2020-09-30 | 2021-01-08 | 北京字节跳动网络技术有限公司 | Method, device, equipment and medium for recording page |
CN112799946A (en) * | 2021-01-29 | 2021-05-14 | 长沙市到家悠享网络科技有限公司 | Method, equipment and storage medium for embedding points and collecting data |
CN114036426A (en) * | 2021-11-25 | 2022-02-11 | 深圳视界信息技术有限公司 | Webpage data acquisition method, device, equipment and medium |
Also Published As
Publication number | Publication date |
---|---|
CN110020339B (en) | 2022-03-18 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110020339A (en) | Based on without the webpage data acquiring method and device buried a little | |
US20170255706A1 (en) | Methods and apparatus to track web browsing sessions | |
CN107609135B (en) | Page element determining method and device, and user behavior path determining method and device | |
WO2016066046A1 (en) | Information acquisition method and apparatus | |
US20110238723A1 (en) | Systems and methods for web decoding | |
CN108256888B (en) | Landing page acquisition method, website server and network advertisement monitoring system | |
CN103401835A (en) | Method and device for presenting safety detection results of microblog page | |
CN111177519B (en) | Webpage content acquisition method, device, storage medium and equipment | |
CN114417197A (en) | Access record processing method and device and storage medium | |
CN110069683A (en) | A kind of method and device crawling data based on browser | |
CN109428776B (en) | Website traffic monitoring method and device | |
CN105160027B (en) | Advertisement data processing method and device | |
CN110020044A (en) | A kind of crawling method and device of crawler | |
CN102831218A (en) | Method and device for determining data in thermodynamic chart | |
CN107294918B (en) | Phishing webpage detection method and device | |
CN109948074A (en) | Website data interconnection method, device, storage medium, processor and electronic equipment | |
CN102870118A (en) | Access method, device and system to user behavior | |
CN104158697B (en) | A kind of dead chain detection method and device | |
US8929667B1 (en) | Analysis of web application state | |
US9396259B1 (en) | Capture of web application state | |
CN109471639A (en) | The monitoring method and device in a kind of application downloading source | |
US20130290939A1 (en) | Dynamic data for producing a script | |
CN110020297A (en) | A kind of loading method of web page contents, apparatus and system | |
CN110969469B (en) | Data acquisition method and device | |
US10372513B2 (en) | Classification of application events using call stacks |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
CB02 | Change of applicant information | Address after: 100083 No. 401, 4th Floor, Haitai Building, 229 North Fourth Ring Road, Haidian District, Beijing. Applicant after: Beijing Guoshuang Technology Co.,Ltd. Address before: 100086 Beijing city Haidian District Shuangyushu Area No. 76 Zhichun Road cuigongfandian 8 layer A. Applicant before: Beijing Guoshuang Technology Co.,Ltd. |
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||