CN110020339A - Based on without the webpage data acquiring method and device buried a little - Google Patents

Based on without the webpage data acquiring method and device buried a little Download PDF

Info

Publication number
CN110020339A
CN110020339A CN201710708948.6A CN201710708948A CN110020339A CN 110020339 A CN110020339 A CN 110020339A CN 201710708948 A CN201710708948 A CN 201710708948A CN 110020339 A CN110020339 A CN 110020339A
Authority
CN
China
Prior art keywords
click
target webpage
data
webpage
webpage element
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201710708948.6A
Other languages
Chinese (zh)
Other versions
CN110020339B (en
Inventor
沈思辰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Gridsum Technology Co Ltd
Original Assignee
Beijing Gridsum Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Gridsum Technology Co Ltd filed Critical Beijing Gridsum Technology Co Ltd
Priority to CN201710708948.6A priority Critical patent/CN110020339B/en
Publication of CN110020339A publication Critical patent/CN110020339A/en
Application granted granted Critical
Publication of CN110020339B publication Critical patent/CN110020339B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/958Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

The invention discloses a kind of based on without the webpage data acquiring method and device buried a little, is related to network technique field, main purpose is to improve the accuracy of collection result during based on without collecting webpage data a little is buried and improves the rich of acquisition content.The method comprise the steps that obtaining the click data of the target webpage element after receiving the triggering information of target webpage element;According to pre-set click list of thing, the click event of the corresponding target webpage element is judged whether there is, wherein record has the corresponding click event of different web pages element in the click list of thing;If it exists, then the click data is saved into the record sheet of the correspondence target webpage element.The present invention is used for based on without the collecting webpage data buried a little.

Description

Based on without the webpage data acquiring method and device buried a little
Technical field
The present invention relates to network technique field more particularly to a kind of webpage data acquiring methods and dress buried a little based on nothing It sets.
Background technique
With the rapid development of internet, become the important means of website acquisition user behavior without a technology is buried gradually.Its In, the technical staff of website can by without burying a technology, the click behavior that webpage front-end is web page element define one with Corresponding " click event ", and the number of bursts by counting the click event determines user when accessing webpage to this yuan The click condition of element to realize the acquisition to web data, and then lays the foundation for the analysis work of subsequent user behavior.By Subsequent analysis work will be directly affected in collection result, if collected data it is not accurate enough or acquisition data content it is inadequate It is abundant, then undesirable influence can be caused on subsequent analysis work, therefore, for advertiser, enterprise and website webmaster, In order to preferably analyze user behavior, it is highly important for how preferably acquiring the click condition of web page element.
Currently, coming frequently with programs such as GrowingIO, Heap to the number in website when being acquired to web data According to being acquired.However, in actual use, GrowingIO by the text or link definition event in webpage, and according to Event triggering situation realizes the acquisition of web data, but this mode is often because the variation of text or link leads to the event of definition Failure, to influence the accuracy of collection result.By taking the advertisement position that operator often needs to detect as an example, according to link in advertisement position Certain activity is defined, when the activity is offline, it is necessary to redefine;When using Heap to be acquired, member is usually utilized Plain path is acquired, but this mode is adopted to define web page element according to click condition of the which to web page element The data content of collection is more single, influences subsequent analysis.Therefore, how during collecting webpage data to guarantee collection result Accuracy and ensure acquire content it is rich, become urgent problem to be solved in the industry.
Summary of the invention
In view of the above problems, the present invention provides a kind of based on without the webpage data acquiring method and device buried a little, main mesh In order to during based on without collecting webpage data a little is buried, improve the accuracy of collection result and ensure the rich of acquisition data Fu Xing.
In order to solve the above technical problems, in a first aspect, the present invention provides a kind of based on without the collecting webpage data buried a little Method, this method comprises:
After receiving the triggering information of target webpage element, the click data of the target webpage element is obtained;
According to pre-set click list of thing, the click thing of the corresponding target webpage element is judged whether there is Part, wherein record has the corresponding click event of different web pages element in the click list of thing;
If it exists, then the click data is saved into the record sheet of the correspondence target webpage element.
Optionally, described after receiving the triggering information of target webpage element, obtain the point of the target webpage element Hitting data includes:
The title of the target webpage element is parsed from the triggering information of the target webpage element;
According to the title of the target webpage element, the click logs of the target webpage element, the click day are obtained Will is used to record the click data of web page element;
The click data of the target webpage element is extracted from the click logs of the target webpage element.
Optionally, the method also includes:
Monitoring code is disposed in the webpage, to pass through the monitoring when the target webpage element is triggered The triggering information of target webpage element described in Code obtaining.
Optionally, in the title according to the target webpage element, obtain the target webpage element click logs it Before, the method also includes:
Deployment obtains code in the webpage, to pass through institute after getting the triggering information of the web page element The click logs for obtaining Code obtaining and sending the web page element are stated, the click logs are that the web page element is being triggered Shi Shengcheng's;
Receive the click logs that the acquisition code is sent.
Optionally, the click data of the target webpage element includes triggered time, element path, element text, element One of uniform resource position mark URL of link and the webpage is a variety of.
Optionally, described save the click data includes: into the record sheet of the correspondence target webpage element
Preservation request is issued the user with, so that user sends corresponding feedback command according to preservation request;
According to the content in the feedback command of the user, from when triggering in the click data of the target webpage element Between, determined in the uniform resource position mark URL of element path, element text, element link and the webpage and save content;
The preservation content is saved into the record sheet of the correspondence target webpage element.
Second aspect, the present invention also provides a kind of collecting webpage data device buried a little based on nothing, which includes:
Acquiring unit, for obtaining the target webpage element after receiving the triggering information of target webpage element Click data;
Judging unit, for judging whether there is the corresponding target webpage according to pre-set click list of thing The click event of element, wherein recording the corresponding click event of different web pages element in the click list of thing;
Storage unit, if judging there is corresponding institute for the judging unit according to pre-set click list of thing The click event of target webpage element is stated, then is saved the click data to the record sheet of the correspondence target webpage element In.
Optionally, the acquiring unit includes:
Parsing module, for parsing the name of the target webpage element from the triggering information of the target webpage element Claim;
It obtains module and obtains the click day of the target webpage element for the title according to the target webpage element Will, the click logs are used to record the click data of web page element;
Extraction module, for extracting the click of the target webpage element from the click logs of the target webpage element Data.
Optionally, described device further include:
Deployment unit, for disposing monitoring code in the webpage, so as to when the target webpage element is triggered, The triggering information of the target webpage element is obtained by the monitoring code.
Optionally, the acquiring unit further include:
Deployment module, for the deployment acquisition code in the webpage, so as to when the triggering for getting the web page element After information, by the click logs for obtaining Code obtaining and sending the web page element, the click logs are the net Page element is generated when being triggered;
Receiving module, the click logs sent for receiving the acquisition code.
Optionally, the click data of the target webpage element includes triggered time, element path, element text, element One of uniform resource position mark URL of link and the webpage is a variety of.
Optionally, the storage unit includes:
Module is issued, saves request for issuing the user with, so that user is corresponding anti-according to preservation request transmission Feedback instruction;
Determining module, for the content in the feedback command according to the user, from the click of the target webpage element In the uniform resource position mark URL in triggered time, element path, element text, element link and the webpage in data It determines and saves content;
Preserving module, the preservation content for determining the determining module are saved to the correspondence target webpage element In record sheet.
To achieve the goals above, according to the third aspect of the invention we, a kind of storage medium, the storage medium are provided Program including storage, wherein equipment where controlling the storage medium in described program operation executes base described above In without the webpage data acquiring method buried a little.
To achieve the goals above, according to the fourth aspect of the invention, a kind of processor is provided, the processor is used for Run program, wherein described program executes described above based on without the webpage data acquiring method buried a little when running.
It is provided by the invention that webpage data acquiring method and device a little are buried based on nothing by above-mentioned technical proposal, for For the prior art during based on without collecting webpage data a little is buried, the accuracy of collection result is lower, acquisition content is more single One the problem of, the present invention is after receiving the triggering information of web page element, by obtaining the click logs of web page element, and from point It hits in log and extracts the click data of web page element, then determine there is the corresponding element in the click list of thing of webpage Click event when, the click data of extraction is saved into corresponding record sheet, will click on list of thing so as to realize The acquisition of the click data of middle corresponding element, to realize based on without the collecting webpage data buried a little.Wherein, by webpage Click list of thing determine whether there is the click event of corresponding web page element, may be implemented in webpage to being defined event The acquisition of the click data of element, and then can targetedly acquire the click data of element, it is ensured that it is buried a little based on nothing The accuracy of collection result during collecting webpage data.Meanwhile by the click data of the target webpage element of acquisition come real The data acquisition function of existing target webpage element, more can comprehensively obtain data of the web page element when being clicked, thus Make based on without acquisition content more horn of plenty during the collecting webpage data buried a little.
The above description is only an overview of the technical scheme of the present invention, in order to better understand the technical means of the present invention, And it can be implemented in accordance with the contents of the specification, and in order to allow above and other objects of the present invention, feature and advantage can It is clearer and more comprehensible, the followings are specific embodiments of the present invention.
Detailed description of the invention
By reading the following detailed description of the preferred embodiment, various other advantages and benefits are common for this field Technical staff will become clear.The drawings are only for the purpose of illustrating a preferred embodiment, and is not considered as to the present invention Limitation.And throughout the drawings, the same reference numbers will be used to refer to the same parts.In the accompanying drawings:
Fig. 1 shows provided in an embodiment of the present invention a kind of based on without the webpage data acquiring method flow chart buried a little;
Fig. 2 shows provided in an embodiment of the present invention another based on without the webpage data acquiring method flow chart buried a little;
Fig. 3 shows provided in an embodiment of the present invention a kind of based on without the composition frame for burying collecting webpage data device a little Figure;
Fig. 4 shows provided in an embodiment of the present invention another based on without the composition frame for burying collecting webpage data device a little Figure.
Specific embodiment
The exemplary embodiment that the present invention will be described in more detail below with reference to accompanying drawings.Although showing the present invention in attached drawing Exemplary embodiment, it being understood, however, that may be realized in various forms the present invention without should be by embodiments set forth here It is limited.It is to be able to thoroughly understand the present invention on the contrary, providing these embodiments, and can be by the scope of the present invention It is fully disclosed to those skilled in the art.
In order to improve based on without the accuracy buried during collecting webpage data a little and rich, the embodiment of the present invention is mentioned A kind of webpage data acquiring method buried a little based on nothing has been supplied, as shown in Figure 1, this method comprises:
101, after receiving the triggering information of target webpage element, the click data of the target webpage element is obtained.
In general, user can carry out some operation behaviors when accessing Website page on webpage, such as drag, point Hit equal behaviors.When carrying out these operation behaviors, all there is the web page element of the corresponding operation behavior.Wherein, web page element can To be interpreted as the element for forming webpage, there are a large amount of elements for each webpage.Specifically, webpage elements can Think that text, picture, audio, animation, video, link etc. are one or more.There are some elements being carried out some operation behaviors Afterwards, webpage can occur to operate corresponding response therewith.For example, when the link in webpage clicking element, it may occur that the jump of the page Turn.Meanwhile when these operable web page elements are triggered, system can all generate the point of the corresponding web page element in local Log is hit, for recording the related data information when element is triggered, for example, triggered time, triggering times etc..In addition, When obtaining the click data of the target webpage element, it can also be passed through by the way that data acquisition script is arranged in target webpage The parameter of setting is come the classification and acquisition that adjust acquisition modes of the data acquisition script when obtaining click data, obtain data The quantity of data;Or in such a way that the day after tomorrow is accessed, the data information of user sharing is obtained using shared data bank, so as to It is screened out from it the click data and its relevant data information about the target webpage element.Specifically, obtaining hits According to mode can voluntarily select according to actual needs, it is not limited here.
Since during based on without collecting webpage data a little is buried, the click condition of web page element is the weight for needing to acquire Data are wanted, therefore, the method according to this step specifically can be, when detecting the click row for occurring user in webpage For when, in webpage identify user click behavior effect web page element, i.e., target webpage described in the embodiment of the present invention member Then element obtains the routing information of the element, so as to the operation of subsequent elemental recognition.Meanwhile being by the local of the web page element The click logs of the corresponding element are inquired in system, and are obtained after finding.It should be noted that receiving web page element Triggering information after acquisition click logs mode, can by target webpage dispose obtain code it is relevant to realize Acquisition behavior, or the data of needs are obtained using the script plug-in unit comprising identical function, or in target webpage generation Corresponding function command or program are implanted into code to realize relevant acquisition behavior.It is chosen specifically, can according to need, It is not limited here.
102, according to pre-set click list of thing, the click of the corresponding target webpage element is judged whether there is Event.
Currently, when acquire based on the data without a technology of burying, can all determining for event be carried out to collected element Justice, the case where being triggered by the number of event come statistical elements.Similarly, identical side is also used in embodiments of the present invention Therefore method needs to judge whether the element being triggered is defined in this step, specifically, needing by clicking thing Part list is judged.Wherein pre-set click list of thing described in the embodiment of the present invention is one for recording net The list of the corresponding click event of difference element in page.When the event for clicking some element is present in the pre-set click thing When part list, illustrate that event that this element is clicked is the event being defined, needs to acquire this yuan in collection process The case where element is clicked.As a result, when there is the click event of certain corresponding element in the click list of thing, then illustrate to click The case where event of the element is defined, which is clicked is to need to be collected;And work as the click list of thing In there is no when the click event of certain corresponding element, then illustrate that the event for clicking the element is not defined, the element is by point The case where hitting does not need collected.
If 103, judgement has the click event of the corresponding target webpage element, the click data is saved to right In the record sheet for answering the target webpage element.
After step 102 is judged, when determining the click event that there is the corresponding target webpage element, illustrate this The click event of web page element be the event being defined, its click condition be need it is collected.As a result, according to this step The click data of the correspondence target webpage element is stored in corresponding record sheet, so as to according to record by the method Click data in table carries out subsequent analysis to the click condition of element.
It is provided in an embodiment of the present invention that webpage data acquiring method a little is buried based on nothing, the prior art is buried based on nothing During the collecting webpage data of point, the lower problem of the accuracy of collection result, the present invention passes through in preset click List of thing determines whether there is the click event of corresponding target webpage element, may be implemented in webpage to the mesh for the event that defines The acquisition of the click data of web page element is marked, and then can targetedly acquire the corresponding click data of element, it is ensured that base The accuracy of collection result during without the collecting webpage data buried a little.Meanwhile the point of the target webpage element by acquisition Data are hit to realize the data acquisition function of target webpage element, more can comprehensively obtain web page element when being clicked Data, to make based on without acquisition content more horn of plenty during the collecting webpage data buried a little.
Further, as the refinement and extension to embodiment illustrated in fig. 1, the embodiment of the invention also provides another bases Webpage data acquiring method a little is buried in nothing, as shown in Fig. 2, its specific steps includes:
201, monitoring code is disposed, in webpage to pass through the monitoring when the target webpage element is triggered The triggering information of target webpage element described in Code obtaining.
In this step, by being monitored the deployment of code in webpage, it can be ensured that when web page element is triggered, energy Enough obtain the triggering information of the web page element.The monitoring code can choose web crawlers or other monitoring codes, tool Body, it can according to need to be selected, it is not limited here.But selected monitoring code will guarantee in net When page element is triggered, the triggering information of the web page element is obtained in time.It, can as a result, by disposing the monitoring code Webpage is monitored, and when web page element is triggered, can timely obtain triggering information, and then realize to webpage The timely acquisition of the triggering information of element and real time monitoring function.
202, after receiving the triggering information of target webpage element, the click data of the target webpage element is obtained.
In the present invention is implemented, the description Yu aforementioned implementation of the click logs of the target webpage element, web page element Description in example step 101 is identical, and this will not be repeated here.
Therefore, in this step, the web page element can be parsed from the triggering information of target webpage element first Title, then obtains the click logs of the corresponding title according to the title from system local, last according to from the target webpage The click data of the target webpage element is extracted in the click logs of element.Certainly, in addition to the element term of target webpage element Except, can also carry out the acquisition of click logs by the identity of web page element, described in this step according to title come The mode for determining click logs is only preferably embodiment, other specific modes in various ways, can according to need from Row is chosen.
Specifically, when obtaining the click logs of web page element code can also be obtained by disposing in webpage, when connecing When receiving the triggering information of web page element, the acquisition code activation, so as to using the acquisition Code obtaining and send webpage member The click logs of element.Then, the click logs for obtaining code and sending are received, realize that the acquisition to the click logs is grasped Make.Wherein, the acquisition code can use web crawlers or other codes, specifically it is not limited here.In addition, this step What the rapid click logs can generate for the web page element when being triggered is directed to when time click logs of operation, can also Think and be formed by log after constantly recording after multi-pass operation, it is not limited here.
According to the method for this step, corresponding click logs are obtained by the title of web page element, it can be ensured that click The accuracy of log acquisition, and then ensure based on the accuracy without the collection result for burying collecting webpage data a little.
Further, due to including number of types of data in the click logs of webpage, and the embodiment of the present invention is come It says, most importantly the click data of web page element, wherein the click data of web page element may include triggered time, element Path, element text, element link and the webpage one of uniform resource position mark URL or a variety of.Unified resource Finger URL (Uniform Resource Locator, abbreviation URL) is a kind of for characterizing the position and the visit that interconnect internet resource Ask the character string of method, it can be understood as the address information of standard resource on internet.Each file on internet has one A unique URL.
It should be noted that specific extracting mode can be chosen from the prior art according to the actual situation, still The accuracy for the click data content for ensuring to extract, and ensure that the extracting method chosen can be by above-mentioned click data The middle triggered time, element path, element text, element link and the webpage uniform resource position mark URL extract.
203, according to pre-set click list of thing, the click of the corresponding target webpage element is judged whether there is Event.
Wherein, click list of thing described in this step is identical as step 102 in previous embodiment, does not do herein superfluous It states.
Therefore, the method according to this step, the event that the target webpage element is currently clicked in judgement whether there is In the click list of thing of the webpage, when it is present, it is collected to illustrate that the click condition of the element needs;When being not present When, illustrate that the click condition of the element does not need acquisition.
If 204, judgement has the click event of the corresponding target webpage element, the target click data is saved Into the record sheet of the correspondence target webpage element.
After method described in step 203, through judging, it is described exist in the click list of thing of the webpage it is described The click event of web page element illustrates that the click condition of the element is counted and acquired.Therefore, it is converged in this step Always, it needs to save the corresponding click data of the element into corresponding record sheet.
Further, the specific executive mode that method is executed as this step, can be with are as follows:
First choice issues the user with preservation request, so that user sends corresponding feedback command according to preservation request;Its In, it may include the data class in the title of the element and the click data of the element in saving request, when such as triggering Between, one of the uniform resource position mark URL of element path, element text, element link and the webpage or a variety of, tool The type and quantity of body determines according to actual conditions, it is not limited here.By the request, family can be used can be as needed It is chosen, to be saved to the data of selection.
Then, the triggering according to the content in the feedback command of user, from the click data of the target webpage element Time, element path, element text, element link and the webpage uniform resource position mark URL in determine and save content. According to saving what the click data type provided in request determined when due to, the feedback command of user, therefore, the instruction can be with Including to the instruction which kind of click data is saved in click data.
Finally, saving content is saved into the record sheet of the correspondence web page element.
The method according to this step, connected applications scene, is exemplified below:
When the click event of webpage clicked in Event Log Table in the presence of corresponding web page element A, illustrate web page element A Click condition be need collected, i.e. target webpage element, meanwhile, determine to include touching in the click data of web page element A When sending out time, tetra- kinds of element text, element link and webpage URL click datas, preservation is issued the user with first and is requested, in request Title and triggered time, element text, element link and tetra- kinds of click datas of webpage URL comprising web page element A.Then user Tri- kinds of triggered time in web page element A, element text, webpage URL click datas are saved, and fed back in determination It is corresponding to save instruction, then according to the three kinds of click data types determined in instruction are saved, by triggered time, element text, net The data content of page tri- kinds of click datas of URL is stored in the record sheet of corresponding web page element A.
The method according to this step saves instruction by obtaining to user, may be implemented in front end to click data Content saves type and saves the control of quantity, and then can save corresponding click data content according to the needs of users, solves Determined previous preservation click data when, inflexible problem is then improved based on without burying webpage data acquiring method a little Flexibility.
Further, as the realization to method shown in above-mentioned Fig. 1, the embodiment of the invention also provides one kind to be buried based on nothing The collecting webpage data device of point, for being realized to above-mentioned method shown in FIG. 1.The Installation practice and preceding method are real It is corresponding to apply example, be it is easy to read, present apparatus embodiment no longer repeats the detail content in preceding method embodiment one by one, It should be understood that the device in the present embodiment can correspond to the full content realized in preceding method embodiment.As shown in figure 3, The device includes: acquiring unit 31, judging unit 32, storage unit 33, wherein
Acquiring unit 31 can be used for obtaining the target webpage after receiving the triggering information of target webpage element The click data of element.
Judging unit 32 can be used for judging whether there is the corresponding mesh according to pre-set click list of thing The click event of web page element is marked, wherein recording the corresponding click event of different web pages element in the click list of thing.
Storage unit 33, if can be used for the judging unit 32 according to pre-set click list of thing, judgement is deposited In the click event of the correspondence target webpage element, then the click data that the acquiring unit 31 obtains is saved to corresponding institute In the record sheet for stating target webpage element.
Further, as the realization to method shown in above-mentioned Fig. 2, the embodiment of the invention also provides another kinds to be based on nothing Collecting webpage data device a little is buried, for realizing to above-mentioned method shown in Fig. 2.The Installation practice and preceding method Embodiment is corresponding, be it is easy to read, present apparatus embodiment no longer goes to live in the household of one's in-laws on getting married one by one to the detail content in preceding method embodiment It states, it should be understood that the device in the present embodiment can correspond to the full content realized in preceding method embodiment.
Wherein, Website page can be deployed in by burying collecting webpage data device a little based on nothing described in the embodiment of the present invention The server-side at place, user terminal, or it is deployed in the third end service that other one is connected with the server-side and user terminal In device.It does not do specific restriction herein, can according to need and disposed.
The another kind provided in this embodiment buries collecting webpage data device a little based on nothing, can be as shown in figure 4, should Device includes: acquiring unit 41, judging unit 42, storage unit 43, wherein
Acquiring unit 41 can be used for obtaining the target webpage after receiving the triggering information of target webpage element The click data of element.
Judging unit 42 can be used for judging whether there is the corresponding mesh according to pre-set click list of thing The click event of web page element is marked, wherein recording the corresponding click event of different web pages element in the click list of thing.
Storage unit 43, if can be used for the judging unit 42 according to pre-set click list of thing, judgement is deposited In the click event of the correspondence target webpage element, then the click data that the acquiring unit 41 obtains is saved to corresponding institute In the record sheet for stating target webpage element.
Further, the acquiring unit 41 includes:
Parsing module 411 can be used for parsing the target webpage from the triggering information of the target webpage element The title of element.
Module 412 is obtained, the title for the target webpage element that can be used for parsing according to the parsing module 411 obtains The click logs of the target webpage element are taken, the click logs are used to record the click data of web page element;
Extraction module, 413, it can be used for from the click logs for the target webpage element that the acquisition module 412 obtains Extract the click data of the target webpage element
Further, described device further include:
Deployment unit 44 can be used for disposing monitoring code in the webpage, to work as the target webpage element quilt When triggering, the triggering information of the target webpage element is obtained by the monitoring code and is sent in acquiring unit 41.
Further, the acquiring unit 41 further include:
Deployment module 414 can be used for disposing obtaining code in the webpage, get the web page element to work as Triggering information after, by the acquisition Code obtaining and send the click logs of the web page element, the click logs are What the web page element was generated when being triggered.
Receiving module 415 can be used for receiving the click logs that the acquisition code is sent.
Further, the click data of the target webpage element includes triggered time, element path, element text, member One of uniform resource position mark URL of element link and the webpage is a variety of.
Further, the storage unit 43 includes:
Module 431 is issued, can be used for issuing the user with preservation request, so that user is according to preservation request transmission pair The feedback command answered.
Determining module 432 can be used for the content in the feedback command according to the user, state web page element from institute's target Click data in triggered time, element path, element text, element link and the webpage uniform resource locator It is determined in URL and saves content.
Preserving module 433 can be used for saving the preservation content that the determining module 442 determines to the correspondence target In the record sheet of web page element.
By above-mentioned technical proposal, the embodiment of the present invention provides a kind of based on without the webpage data acquiring method and dress buried a little It sets.For the prior art during based on without collecting webpage data a little is buried, the lower problem of the accuracy of collection result, this Invention determines whether there is the click event of corresponding target webpage element by the click list of thing in webpage, and net may be implemented To the acquisition of the click data of the target webpage element for the event that defines in page, and then it is corresponding targetedly to acquire element Click data, it is ensured that based on without the accuracy for burying collection result during collecting webpage data a little.Meanwhile utilizing webpage The click logs of element extract the click data of web page element, more can comprehensively obtain web page element when being clicked Data, to make based on without acquisition content more horn of plenty during the collecting webpage data buried a little.In addition, passing through web page element Title obtains corresponding click logs, it can be ensured that the accuracy that click logs obtain, and then ensure based on without burying a little The accuracy of the collection result of collecting webpage data.Meanwhile instruction is saved by obtaining to user, it may be implemented in front end to point It hits data content to save type and save the control of quantity, and then can save according to the needs of users in corresponding click data Hold, when solving previous preservation click data, inflexible problem is then improved based on without the collecting webpage data buried a little The flexibility of method.In addition, can achieve the effect that be monitored webpage, and work as by disposing monitoring code in webpage When web page element is triggered, triggering information can be timely obtained, and then realizes the timely of triggering information to web page element Acquisition and real time monitoring function.
It is described based on including processor and memory, above-mentioned acquiring unit, extraction without the collecting webpage data device for burying a little Unit, judging unit and storage unit etc. store in memory as program unit, are stored in storage by processor execution Above procedure unit in device realizes corresponding function.
Include kernel in processor, is gone in memory to transfer corresponding program unit by kernel.Kernel can be set one Or more, it improves by adjusting kernel parameter based on without the accuracy buried during collecting webpage data a little and rich.
Memory may include the non-volatile memory in computer-readable medium, random access memory (RAM) and/ Or the forms such as Nonvolatile memory, if read-only memory (ROM) or flash memory (flash RAM), memory include that at least one is deposited Store up chip.
The embodiment of the invention provides a kind of storage mediums, are stored thereon with program, real when which is executed by processor It is existing described based on without the webpage data acquiring method buried a little.
The embodiment of the invention provides a kind of processor, the processor is for running program, wherein described program operation Based on without the webpage data acquiring method buried a little described in Shi Zhihang.
The embodiment of the invention provides a kind of equipment, equipment include processor, memory and storage on a memory and can The program run on a processor, processor are performed the steps of when executing program when the triggering for receiving target webpage element After information, the click data of the target webpage element is obtained;
According to pre-set click list of thing, the click thing of the corresponding target webpage element is judged whether there is Part, wherein record has the corresponding click event of different web pages element in the click list of thing;
If it exists, then the click data is saved into the record sheet of the correspondence target webpage element.
Further, described after receiving the triggering information of target webpage element, obtain the target webpage element Click data includes:
The title of the target webpage element is parsed from the triggering information of the target webpage element;
According to the title of the target webpage element, the click logs of the target webpage element, the click day are obtained Will is used to record the click data of web page element;
The click data of the target webpage element is extracted from the click logs of the target webpage element.
Further, the method also includes:
Monitoring code is disposed in the webpage, to pass through the monitoring when the target webpage element is triggered The triggering information of target webpage element described in Code obtaining.
Further, in the title according to the target webpage element, the click logs of the target webpage element are obtained Before, the method also includes:
Deployment obtains code in the webpage, to pass through institute after getting the triggering information of the web page element The click logs for obtaining Code obtaining and sending the web page element are stated, the click logs are that the web page element is being triggered Shi Shengcheng's;
Receive the click logs that the acquisition code is sent.
Further, the click data of the target webpage element includes triggered time, element path, element text, member One of uniform resource position mark URL of element link and the webpage is a variety of.
Further, described save the click data includes: into the record sheet of the correspondence target webpage element
Preservation request is issued the user with, so that user sends corresponding feedback command according to preservation request;
According to the content in the feedback command of the user, from when triggering in the click data of the target webpage element Between, determined in the uniform resource position mark URL of element path, element text, element link and the webpage and save content;
The preservation content is saved into the record sheet of the correspondence target webpage element.
Equipment in the embodiment of the present invention can be server, PC, PAD, mobile phone etc..
The embodiment of the invention also provides a kind of computer program products, when executing on data processing equipment, are suitable for It executes the program of initialization there are as below methods step: after receiving the triggering information of target webpage element, obtaining the target The click data of web page element;
According to pre-set click list of thing, the click thing of the corresponding target webpage element is judged whether there is Part, wherein record has the corresponding click event of different web pages element in the click list of thing;
If it exists, then the click data is saved into the record sheet of the correspondence target webpage element.
Further, described after receiving the triggering information of target webpage element, obtain the target webpage element Click data includes:
The title of the target webpage element is parsed from the triggering information of the target webpage element;
According to the title of the target webpage element, the click logs of the target webpage element, the click day are obtained Will is used to record the click data of web page element;
The click data of the target webpage element is extracted from the click logs of the target webpage element.
Further, the method also includes:
Monitoring code is disposed in the webpage, to pass through the monitoring when the target webpage element is triggered The triggering information of target webpage element described in Code obtaining.
Further, in the title according to the target webpage element, the click logs of the target webpage element are obtained Before, the method also includes:
Deployment obtains code in the webpage, to pass through institute after getting the triggering information of the web page element The click logs for obtaining Code obtaining and sending the web page element are stated, the click logs are that the web page element is being triggered Shi Shengcheng's;
Receive the click logs that the acquisition code is sent.
Further, the click data of the target webpage element includes triggered time, element path, element text, member One of uniform resource position mark URL of element link and the webpage is a variety of.
Further, described save the click data includes: into the record sheet of the correspondence target webpage element
Preservation request is issued the user with, so that user sends corresponding feedback command according to preservation request;
According to the content in the feedback command of the user, from when triggering in the click data of the target webpage element Between, determined in the uniform resource position mark URL of element path, element text, element link and the webpage and save content;
The preservation content is saved into the record sheet of the correspondence target webpage element.
It should be understood by those skilled in the art that, embodiments herein can provide as method, system or computer program Product.Therefore, complete hardware embodiment, complete software embodiment or reality combining software and hardware aspects can be used in the application Apply the form of example.Moreover, it wherein includes the computer of computer usable program code that the application, which can be used in one or more, The computer program implemented in usable storage medium (including but not limited to magnetic disk storage, CD-ROM, optical memory etc.) produces The form of product.
The application is referring to method, the process of equipment (system) and computer program product according to the embodiment of the present application Figure and/or block diagram describe.It should be understood that every one stream in flowchart and/or the block diagram can be realized by computer program instructions The combination of process and/or box in journey and/or box and flowchart and/or the block diagram.It can provide these computer programs Instruct the processor of general purpose computer, special purpose computer, Embedded Processor or other programmable data processing devices to produce A raw machine, so that being generated by the instruction that computer or the processor of other programmable data processing devices execute for real The device for the function of being specified in present one or more flows of the flowchart and/or one or more blocks of the block diagram.
These computer program instructions, which may also be stored in, is able to guide computer or other programmable data processing devices with spy Determine in the computer-readable memory that mode works, so that it includes referring to that instruction stored in the computer readable memory, which generates, Enable the manufacture of device, the command device realize in one box of one or more flows of the flowchart and/or block diagram or The function of being specified in multiple boxes.
These computer program instructions also can be loaded onto a computer or other programmable data processing device, so that counting Series of operation steps are executed on calculation machine or other programmable devices to generate computer implemented processing, thus in computer or The instruction executed on other programmable devices is provided for realizing in one or more flows of the flowchart and/or block diagram one The step of function of being specified in a box or multiple boxes.
In a typical configuration, calculating equipment includes one or more processors (CPU), input/output interface, net Network interface and memory.
Memory may include the non-volatile memory in computer-readable medium, random access memory (RAM) and/ Or the forms such as Nonvolatile memory, such as read-only memory (ROM) or flash memory (flash RAM).Memory is computer-readable Jie The example of matter.
Computer-readable medium includes permanent and non-permanent, removable and non-removable media can be by any method Or technology come realize information store.Information can be computer readable instructions, data structure, the module of program or other data. The example of the storage medium of computer includes, but are not limited to phase change memory (PRAM), static random access memory (SRAM), moves State random access memory (DRAM), other kinds of random access memory (RAM), read-only memory (ROM), electric erasable Programmable read only memory (EEPROM), flash memory or other memory techniques, read-only disc read only memory (CD-ROM) (CD-ROM), Digital versatile disc (DVD) or other optical storage, magnetic cassettes, tape magnetic disk storage or other magnetic storage devices Or any other non-transmission medium, can be used for storage can be accessed by a computing device information.As defined in this article, it calculates Machine readable medium does not include temporary computer readable media (transitory media), such as the data-signal and carrier wave of modulation.
It should also be noted that, the terms "include", "comprise" or its any other variant are intended to nonexcludability It include so that the process, method, commodity or the equipment that include a series of elements not only include those elements, but also to wrap Include other elements that are not explicitly listed, or further include for this process, method, commodity or equipment intrinsic want Element.In the absence of more restrictions, the element limited by sentence "including a ...", it is not excluded that including element There is also other identical elements in process, method, commodity or equipment.
It will be understood by those skilled in the art that embodiments herein can provide as method, system or computer program product. Therefore, complete hardware embodiment, complete software embodiment or embodiment combining software and hardware aspects can be used in the application Form.It is deposited moreover, the application can be used to can be used in the computer that one or more wherein includes computer usable program code The shape for the computer program product implemented on storage media (including but not limited to magnetic disk storage, CD-ROM, optical memory etc.) Formula.
The above is only embodiments herein, are not intended to limit this application.To those skilled in the art, Various changes and changes are possible in this application.It is all within the spirit and principles of the present application made by any modification, equivalent replacement, Improve etc., it should be included within the scope of the claims of this application.

Claims (10)

1. a kind of based on without the webpage data acquiring method buried a little characterized by comprising
After receiving the triggering information of target webpage element, the click data of the target webpage element is obtained;
According to pre-set click list of thing, the click event of the corresponding target webpage element is judged whether there is, Described in click list of thing in record have the corresponding click event of different web pages element;
If it exists, then the click data is saved into the record sheet of the correspondence target webpage element.
2. the method according to claim 1, wherein described when the triggering information for receiving target webpage element Afterwards, the click data for obtaining the target webpage element includes:
The title of the target webpage element is parsed from the triggering information of the target webpage element;
According to the title of the target webpage element, the click logs of the target webpage element are obtained, the click logs are used In the click data of record web page element;
The click data of the target webpage element is extracted from the click logs of the target webpage element.
3. the method according to claim 1, wherein the method also includes:
Monitoring code is disposed in the webpage, to pass through the monitoring code when the target webpage element is triggered Obtain the triggering information of the target webpage element.
4. according to the method described in claim 2, it is characterized in that, obtaining institute in the title according to the target webpage element Before the click logs for stating target webpage element, the method also includes:
Deployment obtains code in the webpage, to be obtained after getting the triggering information of the web page element by described It takes Code obtaining and sends the click logs of the web page element, the click logs are that the web page element is raw when being triggered At;
Receive the click logs that the acquisition code is sent.
5. method according to claim 1-4, which is characterized in that the click data packet of the target webpage element Include one of triggered time, element path, element text, element link and uniform resource position mark URL of the webpage Or it is a variety of.
6. according to the method described in claim 5, it is characterized in that, described save the click data to the correspondence target Include: in the record sheet of web page element
Preservation request is issued the user with, so that user sends corresponding feedback command according to preservation request;
According to the content in the feedback command of the user, from the click data of the target webpage element triggered time, Element path, element text, element link and the webpage uniform resource position mark URL in determine and save content;
The preservation content is saved into the record sheet of the correspondence target webpage element.
7. a kind of based on without the collecting webpage data device buried a little characterized by comprising
Acquiring unit, for obtaining the click of the target webpage element after receiving the triggering information of target webpage element Data;
Judging unit, for judging whether there is the corresponding target webpage element according to pre-set click list of thing Click event, wherein recording the corresponding click event of different web pages element in the click list of thing;
Storage unit, if judging there is the corresponding mesh for the judging unit according to pre-set click list of thing The click event for marking web page element, then save the click data that the acquiring unit obtains to the correspondence target webpage element Record sheet in.
8. device according to claim 7, which is characterized in that the acquiring unit includes:
Parsing module, for parsing the title of the target webpage element from the triggering information of the target webpage element;
It obtains module and obtains the click logs of the target webpage element, institute for the title according to the target webpage element Click logs are stated for recording the click data of web page element;
Extraction module, for extracting the target webpage from the click logs for the target webpage element that the acquisition module obtains The click data of element.
9. a kind of storage medium, which is characterized in that the storage medium includes the program of storage, wherein run in described program When control the storage medium where equipment perform claim require 1 to described in any one of claim 6 based on without burying a little Webpage data acquiring method.
10. a kind of processor, which is characterized in that the processor is for running program, wherein right of execution when described program is run Benefit require 1 to described in any one of claim 6 based on without the webpage data acquiring method buried a little.
CN201710708948.6A 2017-08-17 2017-08-17 Webpage data acquisition method and device based on non-buried point Active CN110020339B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710708948.6A CN110020339B (en) 2017-08-17 2017-08-17 Webpage data acquisition method and device based on non-buried point

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710708948.6A CN110020339B (en) 2017-08-17 2017-08-17 Webpage data acquisition method and device based on non-buried point

Publications (2)

Publication Number Publication Date
CN110020339A true CN110020339A (en) 2019-07-16
CN110020339B CN110020339B (en) 2022-03-18

Family

ID=67186096

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710708948.6A Active CN110020339B (en) 2017-08-17 2017-08-17 Webpage data acquisition method and device based on non-buried point

Country Status (1)

Country Link
CN (1) CN110020339B (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110674426A (en) * 2019-08-30 2020-01-10 腾讯科技(深圳)有限公司 Webpage behavior reporting method and device
CN111274574A (en) * 2020-01-16 2020-06-12 恩亿科(北京)数据科技有限公司 Webpage event anti-shaking method and device, server and computer readable storage medium
CN111523064A (en) * 2020-04-16 2020-08-11 山东贝赛信息科技有限公司 Webpage collection technology based on Jxbrowser
CN111581069A (en) * 2020-04-30 2020-08-25 北京三快在线科技有限公司 Data processing method and device
CN112199263A (en) * 2020-09-30 2021-01-08 北京字节跳动网络技术有限公司 Method, device, equipment and medium for recording page
CN112799946A (en) * 2021-01-29 2021-05-14 长沙市到家悠享网络科技有限公司 Method, equipment and storage medium for embedding points and collecting data
CN114036426A (en) * 2021-11-25 2022-02-11 深圳视界信息技术有限公司 Webpage data acquisition method, device, equipment and medium

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100835905B1 (en) * 2007-04-02 2008-06-19 (주)비즈스프링 Apparatus for visualizing website visitor's click distribution in webpage and method using the same
CN103246661A (en) * 2012-02-07 2013-08-14 阿里巴巴集团控股有限公司 Visual user behavior collecting system and method
CN103309884A (en) * 2012-03-13 2013-09-18 阿里巴巴集团控股有限公司 User behavior data collecting method and system
CN103631699A (en) * 2012-08-28 2014-03-12 纽海信息技术(上海)有限公司 Log management system and method for log monitoring, acquiring and querying
CN105975599A (en) * 2016-05-11 2016-09-28 北京京东尚博广益投资管理有限公司 Method and device monitoring website page event tracking
CN106571949A (en) * 2016-09-23 2017-04-19 北京五八信息技术有限公司 Event tracking point processing method and apparatus
CN106933722A (en) * 2017-03-06 2017-07-07 腾云天宇科技(北京)有限公司 A kind of web application monitoring method, server and system
CN106933472A (en) * 2017-05-20 2017-07-07 南京西桥科技有限公司 A kind of user behavior data acquisition system and its control method based on mobile phone A PP
CN107018046A (en) * 2017-06-06 2017-08-04 上海鋆创信息技术有限公司 A kind of collecting method, device, terminal and storage medium

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100835905B1 (en) * 2007-04-02 2008-06-19 (주)비즈스프링 Apparatus for visualizing website visitor's click distribution in webpage and method using the same
CN103246661A (en) * 2012-02-07 2013-08-14 阿里巴巴集团控股有限公司 Visual user behavior collecting system and method
CN103309884A (en) * 2012-03-13 2013-09-18 阿里巴巴集团控股有限公司 User behavior data collecting method and system
CN103631699A (en) * 2012-08-28 2014-03-12 纽海信息技术(上海)有限公司 Log management system and method for log monitoring, acquiring and querying
CN105975599A (en) * 2016-05-11 2016-09-28 北京京东尚博广益投资管理有限公司 Method and device monitoring website page event tracking
CN106571949A (en) * 2016-09-23 2017-04-19 北京五八信息技术有限公司 Event tracking point processing method and apparatus
CN106933722A (en) * 2017-03-06 2017-07-07 腾云天宇科技(北京)有限公司 A kind of web application monitoring method, server and system
CN106933472A (en) * 2017-05-20 2017-07-07 南京西桥科技有限公司 A kind of user behavior data acquisition system and its control method based on mobile phone A PP
CN107018046A (en) * 2017-06-06 2017-08-04 上海鋆创信息技术有限公司 A kind of collecting method, device, terminal and storage medium

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
宋星: ""无埋点实现监测的真相——革新还是噱头? _ 互联网分析在中国——从基础到前沿"", 《无埋点实现监测的真相——革新还是噱头? _ 互联网分析在中国——从基础到前沿》 *

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110674426A (en) * 2019-08-30 2020-01-10 腾讯科技(深圳)有限公司 Webpage behavior reporting method and device
CN110674426B (en) * 2019-08-30 2024-03-22 腾讯科技(深圳)有限公司 Webpage behavior reporting method and device
CN111274574A (en) * 2020-01-16 2020-06-12 恩亿科(北京)数据科技有限公司 Webpage event anti-shaking method and device, server and computer readable storage medium
CN111523064A (en) * 2020-04-16 2020-08-11 山东贝赛信息科技有限公司 Webpage collection technology based on Jxbrowser
CN111581069A (en) * 2020-04-30 2020-08-25 北京三快在线科技有限公司 Data processing method and device
CN112199263A (en) * 2020-09-30 2021-01-08 北京字节跳动网络技术有限公司 Method, device, equipment and medium for recording page
CN112799946A (en) * 2021-01-29 2021-05-14 长沙市到家悠享网络科技有限公司 Method, equipment and storage medium for embedding points and collecting data
CN114036426A (en) * 2021-11-25 2022-02-11 深圳视界信息技术有限公司 Webpage data acquisition method, device, equipment and medium

Also Published As

Publication number Publication date
CN110020339B (en) 2022-03-18

Similar Documents

Publication Publication Date Title
CN110020339A (en) Based on without the webpage data acquiring method and device buried a little
US20170255706A1 (en) Methods and apparatus to track web browsing sessions
CN107609135B (en) Page element determining method and device, and user behavior path determining method and device
WO2016066046A1 (en) Information acquisition method and apparatus
US20110238723A1 (en) Systems and methods for web decoding
CN108256888B (en) Landing page acquisition method, website server and network advertisement monitoring system
CN103401835A (en) Method and device for presenting safety detection results of microblog page
CN111177519B (en) Webpage content acquisition method, device, storage medium and equipment
CN114417197A (en) Access record processing method and device and storage medium
CN110069683A (en) A kind of method and device crawling data based on browser
CN109428776B (en) Website traffic monitoring method and device
CN105160027B (en) Advertisement data processing method and device
CN110020044A (en) A kind of crawling method and device of crawler
CN102831218A (en) Method and device for determining data in thermodynamic chart
CN107294918B (en) Phishing webpage detection method and device
CN109948074A (en) Website data interconnection method, device, storage medium, processor and electronic equipment
CN102870118A (en) Access method, device and system to user behavior
CN104158697B (en) A kind of dead chain detection method and device
US8929667B1 (en) Analysis of web application state
US9396259B1 (en) Capture of web application state
CN109471639A (en) The monitoring method and device in a kind of application downloading source
US20130290939A1 (en) Dynamic data for producing a script
CN110020297A (en) A kind of loading method of web page contents, apparatus and system
CN110969469B (en) Data acquisition method and device
US10372513B2 (en) Classification of application events using call stacks

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
CB02 Change of applicant information

Address after: 100083 No. 401, 4th Floor, Haitai Building, 229 North Fourth Ring Road, Haidian District, Beijing

Applicant after: Beijing Guoshuang Technology Co.,Ltd.

Address before: 100086 Beijing city Haidian District Shuangyushu Area No. 76 Zhichun Road cuigongfandian 8 layer A

Applicant before: Beijing Guoshuang Technology Co.,Ltd.

CB02 Change of applicant information
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant