WO2020151241A1 - Information non-trace exposure collection method and device - Google Patents

Information non-trace exposure collection method and device Download PDF

Info

Publication number
WO2020151241A1
WO2020151241A1 PCT/CN2019/106783 CN2019106783W WO2020151241A1 WO 2020151241 A1 WO2020151241 A1 WO 2020151241A1 CN 2019106783 W CN2019106783 W CN 2019106783W WO 2020151241 A1 WO2020151241 A1 WO 2020151241A1
Authority
WO
WIPO (PCT)
Prior art keywords
exposure
information
traceless
memory
page
Prior art date
Application number
PCT/CN2019/106783
Other languages
French (fr)
Chinese (zh)
Inventor
胡晓
李勇睿
许夏骏
胡孟亮
Original Assignee
苏宁云计算有限公司
苏宁易购集团股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 苏宁云计算有限公司, 苏宁易购集团股份有限公司 filed Critical 苏宁云计算有限公司
Priority to CA3168957A priority Critical patent/CA3168957A1/en
Publication of WO2020151241A1 publication Critical patent/WO2020151241A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/958Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Definitions

  • the present invention relates to the technical field of information collection, in particular to a method and device for seamless exposure and collection of information.
  • the embodiments of the present invention provide a method and device for seamless exposure and collection of information, so as to overcome the problems of high intrusion of exposure buried points and high-frequency HTTP requests in the prior art.
  • the technical solution adopted by the present invention is:
  • a method for seamless exposure and collection of information includes the following steps:
  • S1 Obtain the traceless exposure configuration information from the server and write it into the memory, search for the content of the area where traceless exposure is enabled according to the traceless exposure configuration information, and set relevant exposure rules;
  • S2 Searching for all areas on the page where the seamless exposure has been turned on, capturing the exposure element onload events of the areas where the seamless exposure has been turned on, and obtaining exposure information according to the exposure elements;
  • S4 Send and report the exposure information written in the memory according to the exposure rule.
  • step S3 specifically includes:
  • step S4 specifically includes:
  • step S4.1 Query whether there is exposure information in the memory, if not, end the process, otherwise, go to step S4.2;
  • step S4.2 Query whether the timer is turned on, if yes, proceed directly to step S4.3, otherwise, proceed to step S4.3 after turning on the timer;
  • S4.3 Splicing and reporting the exposure information according to the flow control rules in the exposure rules, and delete the sent exposure information from the memory;
  • step S4.4 Query whether there is exposure information in the memory, if so, return to step S4.3, otherwise, the process ends after closing the timer.
  • the method further includes:
  • a device for seamless exposure and collection of information which includes:
  • a setting module which is used to obtain the traceless exposure configuration information from the server and write it into the memory, search for the content of the area where traceless exposure is turned on according to the traceless exposure configuration information, and set relevant exposure rules;
  • the search module is configured to search for all areas on the page where the seamless exposure has been turned on, capture the exposure element onload events of the areas where the seamless exposure has been turned on, and obtain exposure information according to the exposure elements;
  • the storage module is used to write the exposure information into the memory in a simulated stack mode
  • the reporting module is configured to send and report the exposure information written in the memory according to the exposure rule.
  • search module includes:
  • the judging unit is used to judge whether to turn on the traceless exposure of the page, if it is enabled, capture the exposure element onload event of the area where the traceless exposure has been turned on, and obtain the exposure information according to the exposure element, otherwise no processing is performed .
  • the storage module is specifically used for:
  • reporting module includes:
  • the first query unit is used to query whether there is exposure information in the memory
  • the second query unit is used to query whether the timer is on;
  • the information reporting unit is configured to splice and report the exposure information according to the flow control rules in the exposure rules, and delete the sent exposure information from the memory.
  • the device further includes:
  • the configuration module is used to configure the traceless exposure switch of the corresponding page, and turn on or off traceless exposure collection in real time.
  • the business side by capturing the onload event of the element in the automatic exposure area, the business side only needs to configure the switch of whether to open the exposure and the exposure in the corresponding background management page.
  • the frequency is sufficient, and there is no need to manually call the exposure method of the corresponding collection script, thereby eliminating the intrusion of your own business, and almost achieving the collection of page information exposure under the condition that the business has no buried points, thus greatly reducing the information exposure collection threshold;
  • the method and device for seamless exposure collection of information provided by the embodiments of the present invention send HTTP requests in a first-in-first-out order by simulating stack storage, which solves the problem of HTTP request blocking caused by high concurrency, web page jams, crashes, etc. problem.
  • the rules of the exposure information reporting module can be configured in real time to deal with different business scenarios.
  • Fig. 1 is a flow chart showing a method for collecting information traceless exposure according to an exemplary embodiment
  • Fig. 2 is another flowchart showing a method for collecting information traceless exposure according to an exemplary embodiment
  • Fig. 3 is a schematic diagram showing the structure of an information traceless exposure collection device according to an exemplary embodiment.
  • the method and device for non-trace exposure collection of information provided by the embodiments of the present invention circumvent and perform unified management and control in response to the high intrusiveness of exposure buried points and high-frequency HTTP requests existing in the current WEB information exposure technology.
  • the method and device for seamless exposure collection of information provided by the embodiments of the present invention there will be no case where the business code and the code of the exposure buried point are coupled together in the implementation of the traditional exposure collection technology.
  • the HTTP request is sent in an orderly first-in, first-out manner through the simulated stack storage method, which solves the problems of HTTP request blocking and other problems that occur with traditional exposure technology.
  • the flow control for reporting the exposure information is added to the embodiment of the present invention to meet the needs of various business scenarios such as big promotion and non-big promotion.
  • Fig. 1 is a flowchart of a method for collecting information traceless exposure according to an exemplary embodiment. Referring to Fig. 1, the method includes the following steps:
  • S1 Obtain the traceless exposure configuration information from the server and write it into the memory, search for the content of the region where the traceless exposure is enabled according to the traceless exposure configuration information, and set related exposure rules.
  • the traceless exposure configuration information is obtained from the server at the first time when the page is loaded and written into the memory. Then search for the content of the area where the seamless exposure is turned on according to the seamless exposure configuration information, and set the relevant exposure rule Rn according to the seamless exposure configuration information.
  • relevant exposure rules include at least flow control rules, collection rules, etc. It should be noted here that in the embodiment of the present invention, the exposure rule supports real-time configuration, that is, the user can configure the exposure rule in real time according to actual business scenarios.
  • S2 Search for all areas on the page where the seamless exposure has been turned on, capture the exposure element onload events of the areas where the seamless exposure has been turned on, and obtain exposure information according to the exposure elements.
  • the page when the page is loaded, search for the area on the page that has been opened for seamless exposure, and capture the onload event of the exposed element downward (the onload event will be triggered in a specific scene, such as the page or image immediately after it is loaded), Then obtain the exposure information according to the exposure element.
  • the corresponding collection method is triggered, and the corresponding exposure information is collected according to the collection rule. That is, according to the captured exposure elements, search for various attributes carried by itself, and position information in the current page DOM node, and bubble upward to query the pre-set rule information on the parent node, record and report.
  • writing the exposure information simulation stack mode to the memory specifically includes:
  • the callback is triggered to search for the information of the exposure element that currently triggers the onload event, and bubble upward to search for the exposure information carried by the corresponding exposure element, and then write the exposure information into the memory In storage.
  • S4 Send and report the exposure information written in the memory according to the exposure rule.
  • the HTTP request is issued in an orderly first-in-first-out manner through the simulated stack storage method, and the exposure information is reported in this section. Flow control rules are added, and the exposure information is sent out in batches, regularly and quantitatively.
  • the following further includes:
  • the basis for judging whether to open the traceless exposure of the page can be: traversing the document object model (DOM) nodes of the page, searching for areas with the traceless exposure attribute (SAP-MODID), and It is considered that the area of non-marking exposure is turned on. This operation is to turn on the traceless exposure front desk or main switch. If the SAP-MODID cannot be queried, there will be no subsequent exposure operation.
  • the user can determine whether to open the seamless exposure of the page according to actual needs.
  • the step S4 specifically includes:
  • step S4.1 Query whether there is exposure information in the memory, if not, end the process, otherwise, perform step S4.2.
  • the cache record of exposure data (ie, exposure information) is sent and reported in batches, and before the exposure information is reported, it is necessary to query whether there is data (ie, exposure information) in the memory.
  • step S4.2 Query whether the timer is turned on, if yes, proceed directly to step S4.3, otherwise, proceed to step S4.3 after turning on the timer.
  • the time interval of the timer can be set according to the actual needs of the user, for example, the time interval of the timer is set to 100ms.
  • the set exposure rules include flow control rules, and the exposure information is spliced and reported according to the flow control rules. For example, every 100ms the exposure information in the previous memory is sent out, and a maximum of 10 HTTP requests are sent. After the exposure information is sent, the sent exposure information will be deleted from the memory.
  • step S4.4 Query whether there is exposure information in the memory, if so, return to step S4.3, otherwise, the process ends after closing the timer.
  • each time the exposure information is sent out check again whether there is exposure information in the memory. If so, return to step S4.3, continue to splice and report the exposure information according to the flow control rules, otherwise, first turn off the timer and then end Process.
  • the method further includes:
  • a traceless exposure switch corresponding to the page can be configured in the background management page, and the traceless exposure switch can be turned on or off in real time according to user requirements.
  • Fig. 2 is another flow chart of a method for collecting information traceless exposure according to an exemplary embodiment. The following illustrates the specific implementation of the present invention with reference to Fig. 2 as an example:
  • the collection method will search for ID1 and capture all DOM elements (ie exposure elements) that support the onload event under the ID1 area, assuming DOMS1.
  • the acquisition method continues to search for the automatic exposure rule R1 in the page.
  • the product card loaded in the current page (that is, being browsed by the user) will automatically trigger the corresponding The onload event triggers the corresponding collection method, and the exposure diary collection starts. Then the exposure information is collected according to different collection rules.
  • the collected exposure information of P1 may include: P1 parent node A, A.herf, A.title... after encoding, it is _infoA.
  • P1's _infoA log report format G1 is as follows: //www.xxx.xxx/salog.gif? _infoA.
  • the exposure information of P1 is reported according to G1, for example, 10 pieces are sent every 100ms.
  • flow control rules can be configured in real time to report exposure information. For example, set to send 10 messages every 1000ms, etc.
  • Fig. 3 is a schematic structural diagram of an information traceless exposure collection device according to an exemplary embodiment. Referring to Fig. 3, the device includes:
  • a setting module configured to obtain the traceless exposure configuration information from the server and write it into the memory, search for the content of the region where traceless exposure is enabled according to the traceless exposure configuration information, and set relevant exposure rules;
  • the search module is configured to search for all areas on the page where the seamless exposure has been turned on, capture the exposure element onload events of the areas where the seamless exposure has been turned on, and obtain exposure information according to the exposure elements;
  • the storage module is used to write the exposure information into the memory in a simulated stack mode
  • the reporting module is configured to send and report the exposure information written in the memory according to the exposure rule.
  • the reporting module sends HTTP requests in an orderly first-in-first-out manner by simulating stack storage, and reports the exposure information.
  • This block has joined the flow control rules, and the exposure information is sent out in batches, regularly and quantitatively.
  • the search module includes:
  • the judging unit is used to judge whether to turn on the traceless exposure of the page, if it is enabled, capture the exposure element onload event of the area where the traceless exposure has been turned on, and obtain the exposure information according to the exposure element, otherwise no processing is performed .
  • the storage module is specifically used for:
  • the reporting module includes:
  • the first query unit is used to query whether there is exposure information in the memory
  • the second query unit is used to query whether the timer is on;
  • the information reporting unit is configured to splice and report the exposure information according to the flow control rules in the exposure rules, and delete the sent exposure information from the memory.
  • the device further includes:
  • the configuration module is used to configure the traceless exposure switch of the corresponding page, and enable or disable traceless exposure collection in real time.
  • the business side by capturing the onload event of the element in the automatic exposure area, the business side only needs to configure the switch of whether to open the exposure and the exposure in the corresponding background management page.
  • the frequency is sufficient, and there is no need to manually call the exposure method of the corresponding collection script, thereby eliminating the intrusion of your own business, and almost achieving the collection of page information exposure under the condition that the business has no buried points, thus greatly reducing the information exposure collection threshold;
  • the method and device for seamless exposure collection of information provided by the embodiments of the present invention send HTTP requests in a first-in-first-out order by simulating stack storage, which solves the problem of HTTP request blocking caused by high concurrency, web page jams, crashes, etc. problem.
  • the rules of the exposure information reporting module can be configured in real time to deal with different business scenarios.
  • the information traceless exposure collection device provided in the above embodiment triggers the traceless exposure collection service
  • only the division of the above-mentioned functional modules is used as an example for illustration.
  • the above-mentioned functions can be allocated by Different functional modules are completed, that is, the internal structure of the device is divided into different functional modules to complete all or part of the functions described above.
  • the information traceless exposure collection device provided by the above-mentioned embodiment belongs to the same concept as the embodiment of the information traceless exposure collection method, that is, the method is based on the information traceless exposure collection device.
  • the specific implementation process please refer to the method embodiments. I won't repeat it here.
  • the program can be stored in a computer-readable storage medium.
  • the storage medium mentioned can be a read-only memory, a magnetic disk or an optical disk, etc.

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Accounting & Taxation (AREA)
  • Development Economics (AREA)
  • Finance (AREA)
  • Databases & Information Systems (AREA)
  • Strategic Management (AREA)
  • General Physics & Mathematics (AREA)
  • Entrepreneurship & Innovation (AREA)
  • General Business, Economics & Management (AREA)
  • Marketing (AREA)
  • Economics (AREA)
  • Game Theory and Decision Science (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • Debugging And Monitoring (AREA)

Abstract

An information non-trace exposure collection method and device, the method comprising the following steps: S1, acquiring non-trace exposure configuration information from a server and writing same into a memory, and searching, according to the non-trace exposure configuration information, for the content of an area where non-trace exposure is started and setting a relevant exposure rule; S2, searching for all the areas, where the non-trace exposure has been started, on a page, capturing an exposure element onload event of the areas where the non-trace exposure has been started, and acquiring exposure information according to exposure elements; S3, writing the exposure information to the memory by means of an analog stack; and S4, sending and reporting, according to the exposure rule, the exposure information written into the memory. The current intrusion to a service thereof in an exposure point burying process can be solved, and it is almost possible to collect page information exposure without a service buried point, thereby greatly reducing the threshold of information exposure collection; and the process of reporting exposure information is regulated, thereby solving problems, such as HTTP request blocking and web page freezing and crashing, caused by high concurrency.

Description

一种信息无痕曝光采集方法及装置Method and device for seamless exposure and collection of information 技术领域Technical field
本发明涉及信息采集技术领域,特别涉及一种信息无痕曝光采集方法及装置。The present invention relates to the technical field of information collection, in particular to a method and device for seamless exposure and collection of information.
背景技术Background technique
目前关于web端信息曝光多采用的是通过JS手动埋点的方式,即当需要曝光的节点或者信息达到业务规定的相关场景后,调用预先设定义好的方法(此方法一般为JS中提供的全局方法)完成实时上报。这样的做法存在以下问题:At present, the most common method for web-side information exposure is to manually embed points through JS, that is, when the node or information that needs to be exposed reaches the relevant scene of the business regulations, the pre-defined method is called (this method is generally provided in JS) Global method) complete real-time reporting. This approach has the following problems:
1、信息曝光的相关操作侵入到页面的业务逻辑中,给正常的业务逻辑的开发带来诸多不便;1. The related operations of information exposure invade the business logic of the page, which brings a lot of inconvenience to the development of normal business logic;
2、由于曝光信息的上报需要发送HTTP请求(通常情况下预先设定义好的方法的调用即会产生一次HTTP请求),在实践中我们发现预先设定义好的方法被调用的场景及次数很混乱,这导致在某一个具体的时间点HTTP请求暴增,超过浏览器能够承受的并发,从而导致HTTP请求阻塞、页面崩溃、闪退等严重的问题。2. Since the reporting of exposure information needs to send an HTTP request (usually the call of a pre-defined method will generate an HTTP request), in practice we found that the scene and number of calls of the pre-defined method are very confusing This leads to a surge in HTTP requests at a specific point in time, which exceeds the concurrency that the browser can withstand, resulting in serious problems such as HTTP request blocking, page crashes, and flashbacks.
发明内容Summary of the invention
为了解决现有技术的问题,本发明实施例提供了一种信息无痕曝光采集方法及装置,以克服现有技术中存在的曝光埋点侵入性高,以及高频的HTTP请求等问题。In order to solve the problems of the prior art, the embodiments of the present invention provide a method and device for seamless exposure and collection of information, so as to overcome the problems of high intrusion of exposure buried points and high-frequency HTTP requests in the prior art.
为解决上述一个或多个技术问题,本发明采用的技术方案是:In order to solve one or more of the above technical problems, the technical solution adopted by the present invention is:
一方面,提供了一种信息无痕曝光采集方法,所述方法包括如下步骤:In one aspect, a method for seamless exposure and collection of information is provided. The method includes the following steps:
S1:从服务端获取无痕曝光配置信息并写入内存中,根据所述无痕曝光配置信息搜索开启无痕曝光的区域的内容并设置相关曝光规则;S1: Obtain the traceless exposure configuration information from the server and write it into the memory, search for the content of the area where traceless exposure is enabled according to the traceless exposure configuration information, and set relevant exposure rules;
S2:搜索页面上所有已开启无痕曝光的区域,捕获所述已开启无痕曝光的区域的曝光元素onload事件,根据所述曝光元素获取曝光信息;S2: Searching for all areas on the page where the seamless exposure has been turned on, capturing the exposure element onload events of the areas where the seamless exposure has been turned on, and obtaining exposure information according to the exposure elements;
S3:将所述曝光信息模拟堆栈方式写入内存;S3: Write the exposure information into the memory in a simulated stack mode;
S4:将所述写入内存的曝光信息根据所述曝光规则进行发送上报。S4: Send and report the exposure information written in the memory according to the exposure rule.
进一步的,所述搜索页面上所有已开启无痕曝光的区域后还包括:Further, after all the areas on the search page that have been enabled for seamless exposure, it also includes:
判断是否开启所述页面的无痕曝光,若开启,则捕获所述已开启无痕曝光的区域的曝光元素onload事件,并根据所述曝光元素获取曝光信息,否则不做处理。It is determined whether the traceless exposure of the page is turned on, and if it is turned on, the onload event of the exposure element in the area where the traceless exposure is turned on is captured, and the exposure information is obtained according to the exposure element, otherwise no processing is performed.
进一步的,所述步骤S3具体包括:Further, the step S3 specifically includes:
搜索所述onload事件的曝光元素的信息,并向上冒泡搜索所述曝光元素携带的曝光信息,将所述曝光信息写入内存。Search for the information of the exposure element of the onload event, and bubble upward to search for the exposure information carried by the exposure element, and write the exposure information into the memory.
进一步的,所述步骤S4具体包括:Further, the step S4 specifically includes:
S4.1:查询所述内存中是否有曝光信息,若没有,则结束流程,否则,执行步骤S4.2;S4.1: Query whether there is exposure information in the memory, if not, end the process, otherwise, go to step S4.2;
S4.2:查询定时器是否开启,若是,则直接执行步骤S4.3,否则,开启定时器后执行步骤S4.3;S4.2: Query whether the timer is turned on, if yes, proceed directly to step S4.3, otherwise, proceed to step S4.3 after turning on the timer;
S4.3:将所述曝光信息根据所述曝光规则中的流控规则拼接上报,并将已发送的曝光信息从所述内存中删除;S4.3: Splicing and reporting the exposure information according to the flow control rules in the exposure rules, and delete the sent exposure information from the memory;
S4.4:查询所述内存中是否有曝光信息,若有,则返回步骤S4.3,否则关闭所述定时器后结束流程。S4.4: Query whether there is exposure information in the memory, if so, return to step S4.3, otherwise, the process ends after closing the timer.
进一步的,所述方法还包括:Further, the method further includes:
配置对应页面的无痕曝光开关,实时开启或者关闭无痕曝光采集。Configure the traceless exposure switch on the corresponding page to enable or disable traceless exposure collection in real time.
另一方面,提供了一种信息无痕曝光采集装置,所述装置包括:In another aspect, a device for seamless exposure and collection of information is provided, which includes:
设置模块,用于从服务端获取无痕曝光配置信息并写入内存中,根据所述 无痕曝光配置信息搜索开启无痕曝光的区域的内容并设置相关曝光规则;A setting module, which is used to obtain the traceless exposure configuration information from the server and write it into the memory, search for the content of the area where traceless exposure is turned on according to the traceless exposure configuration information, and set relevant exposure rules;
搜索模块,用于搜索页面上所有已开启无痕曝光的区域,捕获所述已开启无痕曝光的区域的曝光元素onload事件,根据所述曝光元素获取曝光信息;The search module is configured to search for all areas on the page where the seamless exposure has been turned on, capture the exposure element onload events of the areas where the seamless exposure has been turned on, and obtain exposure information according to the exposure elements;
存储模块,用于将所述曝光信息模拟堆栈方式写入内存;The storage module is used to write the exposure information into the memory in a simulated stack mode;
上报模块,用于将所述写入内存的曝光信息根据所述曝光规则进行发送上报。The reporting module is configured to send and report the exposure information written in the memory according to the exposure rule.
进一步的,所述搜索模块包括:Further, the search module includes:
判断单元,用于判断是否开启所述页面的无痕曝光,若开启,则捕获所述已开启无痕曝光的区域的曝光元素onload事件,并根据所述曝光元素获取曝光信息,否则不做处理。The judging unit is used to judge whether to turn on the traceless exposure of the page, if it is enabled, capture the exposure element onload event of the area where the traceless exposure has been turned on, and obtain the exposure information according to the exposure element, otherwise no processing is performed .
进一步的,所述存储模块具体用于:Further, the storage module is specifically used for:
搜索所述onload事件的曝光元素的信息,并向上冒泡搜索所述曝光元素携带的曝光信息,将所述曝光信息写入内存。Search for the information of the exposure element of the onload event, and bubble upward to search for the exposure information carried by the exposure element, and write the exposure information into the memory.
进一步的,所述上报模块包括:Further, the reporting module includes:
第一查询单元,用于查询所述内存中是否有曝光信息;The first query unit is used to query whether there is exposure information in the memory;
第二查询单元,用于查询定时器是否开启;The second query unit is used to query whether the timer is on;
信息上报单元,用于将所述曝光信息根据所述曝光规则中的流控规则拼接上报,并将已发送的曝光信息从所述内存中删除。The information reporting unit is configured to splice and report the exposure information according to the flow control rules in the exposure rules, and delete the sent exposure information from the memory.
进一步的,所述装置还包括:Further, the device further includes:
配置模块,用于配置对应页面的无痕曝光开关,实时开启或者关闭无痕曝光采集。The configuration module is used to configure the traceless exposure switch of the corresponding page, and turn on or off traceless exposure collection in real time.
本发明实施例提供的技术方案带来的有益效果是:The beneficial effects brought about by the technical solutions provided by the embodiments of the present invention are:
1、本发明实施例提供的信息无痕曝光采集方法及装置,通过捕获开启自动曝光区域内的元素onload事件,业务方只需要在对应的后台管理页面中,配置是否开启曝光的开关,以及曝光频率即可,无需在手动调用对应采集脚本的曝光方法,从而去除了对自身业务的入侵,几乎可做到业务无埋点的情况下进行 页面信息曝光的采集,从而大大降低了信息曝光采集的门槛;1. According to the method and device for seamless exposure collection of information provided by the embodiments of the present invention, by capturing the onload event of the element in the automatic exposure area, the business side only needs to configure the switch of whether to open the exposure and the exposure in the corresponding background management page. The frequency is sufficient, and there is no need to manually call the exposure method of the corresponding collection script, thereby eliminating the intrusion of your own business, and almost achieving the collection of page information exposure under the condition that the business has no buried points, thus greatly reducing the information exposure collection threshold;
2、本发明实施例提供的信息无痕曝光采集方法及装置,通过模拟堆栈存储方式将HTTP请求先进先出有序发送,解决了高并发导致的HTTP请求阻塞,web端页面卡死、崩溃等问题。而且可以实时的对曝光信息上报模块的规则进行配置,以应对不同业务场景。2. The method and device for seamless exposure collection of information provided by the embodiments of the present invention send HTTP requests in a first-in-first-out order by simulating stack storage, which solves the problem of HTTP request blocking caused by high concurrency, web page jams, crashes, etc. problem. Moreover, the rules of the exposure information reporting module can be configured in real time to deal with different business scenarios.
附图说明Description of the drawings
为了更清楚地说明本发明实施例中的技术方案,下面将对实施例描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本发明的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to explain the technical solutions in the embodiments of the present invention more clearly, the following will briefly introduce the accompanying drawings used in the description of the embodiments. Obviously, the accompanying drawings in the following description are only some embodiments of the present invention. For those of ordinary skill in the art, other drawings can be obtained from these drawings without creative work.
图1是根据一示例性实施例示出的信息无痕曝光采集方法的流程图;Fig. 1 is a flow chart showing a method for collecting information traceless exposure according to an exemplary embodiment;
图2是根据一示例性实施例示出的信息无痕曝光采集方法的另一流程图;Fig. 2 is another flowchart showing a method for collecting information traceless exposure according to an exemplary embodiment;
图3是根据一示例性实施例示出的信息无痕曝光采集装置的结构示意图。Fig. 3 is a schematic diagram showing the structure of an information traceless exposure collection device according to an exemplary embodiment.
具体实施方式detailed description
为使本发明的目的、技术方案和优点更加清楚,下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。In order to make the objectives, technical solutions, and advantages of the present invention clearer, the following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only Some embodiments of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative work shall fall within the protection scope of the present invention.
本发明实施例提供的信息无痕曝光采集方法及装置,针对目前WEB信息曝光的技术中存在的曝光埋点侵入性高以及高频的HTTP请求等问题,做了规避和统一管控。首先,本发明实施例提供的信息无痕曝光采集方法及装置中,传统曝光采集技术实现方式中业务代码和曝光埋点的代码耦合在一起的情况将不存在。然后通过模拟堆栈存储方式将HTTP请求先进先出有序发送,很好的解 决了关于传统曝光技术出现的HTTP请求阻塞等问题。而且本发明实施例中加入了对曝光信息上报这块的流控,以应对大促及非大促等各种业务场景的需要。The method and device for non-trace exposure collection of information provided by the embodiments of the present invention circumvent and perform unified management and control in response to the high intrusiveness of exposure buried points and high-frequency HTTP requests existing in the current WEB information exposure technology. First of all, in the method and device for seamless exposure collection of information provided by the embodiments of the present invention, there will be no case where the business code and the code of the exposure buried point are coupled together in the implementation of the traditional exposure collection technology. Then, the HTTP request is sent in an orderly first-in, first-out manner through the simulated stack storage method, which solves the problems of HTTP request blocking and other problems that occur with traditional exposure technology. Moreover, the flow control for reporting the exposure information is added to the embodiment of the present invention to meet the needs of various business scenarios such as big promotion and non-big promotion.
图1是根据一示例性实施例示出的信息无痕曝光采集方法的流程图,参照图1所示,该方法包括如下步骤:Fig. 1 is a flowchart of a method for collecting information traceless exposure according to an exemplary embodiment. Referring to Fig. 1, the method includes the following steps:
S1:从服务端获取无痕曝光配置信息并写入内存中,根据所述无痕曝光配置信息搜索开启无痕曝光的区域的内容并设置相关曝光规则。S1: Obtain the traceless exposure configuration information from the server and write it into the memory, search for the content of the region where the traceless exposure is enabled according to the traceless exposure configuration information, and set related exposure rules.
具体的,页面加载开始的第一时间从服务端获取无痕曝光配置信息并将其写入内存中。然后根据无痕曝光配置信息搜索开启无痕曝光的区域的内容,并且根据无痕曝光配置信息设置相关曝光规则Rn。其中,相关曝光规则至少包括流控规则、采集规则等。这里需要说明的是,在本发明实施例中,曝光规则支持实时配置,即用户可根据实际业务场景对曝光规则进行实时配置。Specifically, the traceless exposure configuration information is obtained from the server at the first time when the page is loaded and written into the memory. Then search for the content of the area where the seamless exposure is turned on according to the seamless exposure configuration information, and set the relevant exposure rule Rn according to the seamless exposure configuration information. Among them, relevant exposure rules include at least flow control rules, collection rules, etc. It should be noted here that in the embodiment of the present invention, the exposure rule supports real-time configuration, that is, the user can configure the exposure rule in real time according to actual business scenarios.
S2:搜索页面上所有已开启无痕曝光的区域,捕获所述已开启无痕曝光的区域的曝光元素onload事件,根据所述曝光元素获取曝光信息。S2: Search for all areas on the page where the seamless exposure has been turned on, capture the exposure element onload events of the areas where the seamless exposure has been turned on, and obtain exposure information according to the exposure elements.
具体的,当页面加载完成以后,搜索页面上已开启无痕曝光的区域,向下捕获曝光元素的onload事件(onload事件会在特定场景被触发,如页面或图像加载完成后立即被触发),然后根据曝光元素获取曝光信息。当监测到曝光元素对应的onload事件被触发后,进而触发对应的采集方法,根据采集规则采集相应的曝光信息。即根据捕获到的曝光元素,搜索其自身携带的各种属性,及在当前页面DOM节点中的位置信息,并向上冒泡查询其父辈节点的上符合预先设定的规则信息,记录和上报。Specifically, when the page is loaded, search for the area on the page that has been opened for seamless exposure, and capture the onload event of the exposed element downward (the onload event will be triggered in a specific scene, such as the page or image immediately after it is loaded), Then obtain the exposure information according to the exposure element. When it is detected that the onload event corresponding to the exposure element is triggered, the corresponding collection method is triggered, and the corresponding exposure information is collected according to the collection rule. That is, according to the captured exposure elements, search for various attributes carried by itself, and position information in the current page DOM node, and bubble upward to query the pre-set rule information on the parent node, record and report.
作为一种较优的实施方式,本发明实施例中,还需要设置onload事件的回调方法,在该回调方法中根据上述相关曝光规则设置对应曝光方案。As a preferred implementation manner, in the embodiment of the present invention, it is also necessary to set a callback method for the onload event, in which the corresponding exposure scheme is set according to the above-mentioned relevant exposure rules.
S3:将所述曝光信息模拟堆栈方式写入内存。S3: Write the exposure information into the memory in a simulated stack mode.
作为一种较优的实施方式,本发明实施例中,将曝光信息模拟堆栈方式写入内存具体包括:As a preferred implementation manner, in the embodiment of the present invention, writing the exposure information simulation stack mode to the memory specifically includes:
搜索所述onload事件的曝光元素的信息,并向上冒泡搜索所述曝光元素携 带的曝光信息,将所述曝光信息写入内存。Search for the information of the exposure element of the onload event, and bubble upward to search for the exposure information carried by the exposure element, and write the exposure information into the memory.
具体的,在监测到曝光元素对应的onload事件触发后,触发执行回调,搜索当前触发onload事件的曝光元素的信息,并向上冒泡搜索对应曝光元素携带的曝光信息,然后将曝光信息写入内存中存储。Specifically, after detecting that the onload event corresponding to the exposure element is triggered, the callback is triggered to search for the information of the exposure element that currently triggers the onload event, and bubble upward to search for the exposure information carried by the corresponding exposure element, and then write the exposure information into the memory In storage.
S4:将所述写入内存的曝光信息根据所述曝光规则进行发送上报。S4: Send and report the exposure information written in the memory according to the exposure rule.
具体的,为了解决传统曝光技术中出现的HTTP请求阻塞页面崩溃、闪退等问题,本发明实施例中,通过模拟堆栈存储方式将HTTP请求先进先出有序发出,并且在曝光信息上报这块加入了流控规则,将曝光信息分批定时定量的发出。Specifically, in order to solve the problems of HTTP request blocking page crashes and crashes that occur in the traditional exposure technology, in the embodiment of the present invention, the HTTP request is issued in an orderly first-in-first-out manner through the simulated stack storage method, and the exposure information is reported in this section. Flow control rules are added, and the exposure information is sent out in batches, regularly and quantitatively.
作为一种较优的实施方式,本发明实施例中,所述搜索页面上所有已开启无痕曝光的区域后还包括:As a preferred implementation manner, in the embodiment of the present invention, after all the areas on the search page that have been enabled for seamless exposure, the following further includes:
判断是否开启所述页面的无痕曝光,若开启,则捕获所述已开启无痕曝光的区域的曝光元素onload事件,并根据所述曝光元素获取曝光信息,否则不做处理。这里需要说明的是,本发明实施例中,判断是否开启页面的无痕曝光的依据可以为:遍历页面文档对象模型(DOM)节点,寻找存在无痕曝光属性(SAP-MODID)的区域,并认为开启无痕曝光的区域。这个操作是开启无痕曝光前台或者说是主开关,如果查询不到SAP-MODID将没有后续的曝光操作。It is determined whether the traceless exposure of the page is turned on, and if it is turned on, the onload event of the exposure element in the area where the traceless exposure is turned on is captured, and the exposure information is obtained according to the exposure element, otherwise no processing is performed. It should be noted here that in the embodiment of the present invention, the basis for judging whether to open the traceless exposure of the page can be: traversing the document object model (DOM) nodes of the page, searching for areas with the traceless exposure attribute (SAP-MODID), and It is considered that the area of non-marking exposure is turned on. This operation is to turn on the traceless exposure front desk or main switch. If the SAP-MODID cannot be queried, there will be no subsequent exposure operation.
具体的,用户可以根据实际需求判断是否开启页面的无痕曝光。Specifically, the user can determine whether to open the seamless exposure of the page according to actual needs.
作为一种较优的实施方式,本发明实施例中,所述步骤S4具体包括:As a preferred implementation manner, in the embodiment of the present invention, the step S4 specifically includes:
S4.1:查询所述内存中是否有曝光信息,若没有,则结束流程,否则,执行步骤S4.2。S4.1: Query whether there is exposure information in the memory, if not, end the process, otherwise, perform step S4.2.
具体的,本发明实施例中,曝光数据(即曝光信息)缓存记录采用分批的方式发送上报,且曝光信息上报前,需先查询内存中是否有数据(即曝光信息)。Specifically, in the embodiment of the present invention, the cache record of exposure data (ie, exposure information) is sent and reported in batches, and before the exposure information is reported, it is necessary to query whether there is data (ie, exposure information) in the memory.
S4.2:查询定时器是否开启,若是,则直接执行步骤S4.3,否则,开启定时器后执行步骤S4.3。S4.2: Query whether the timer is turned on, if yes, proceed directly to step S4.3, otherwise, proceed to step S4.3 after turning on the timer.
具体的,定时器的时间间隔可以根据用户的实际需求进行设置,例如设置 定时器的时间间隔为100ms。Specifically, the time interval of the timer can be set according to the actual needs of the user, for example, the time interval of the timer is set to 100ms.
S4.3:将所述曝光信息根据所述曝光规则中的流控规则拼接上报,并将已发送的曝光信息从所述内存中删除。S4.3: The exposure information is spliced and reported according to the flow control rules in the exposure rules, and the sent exposure information is deleted from the memory.
具体的,在本发明实施例中,设置的曝光规则中包括流控规则,曝光信息根据流控规则拼接上报。例如,每100ms将之前内存中的曝光信息发送出去,最多发10条HTTP请求。曝光信息发送出去后将已发送的曝光信息从内存中删除。Specifically, in the embodiment of the present invention, the set exposure rules include flow control rules, and the exposure information is spliced and reported according to the flow control rules. For example, every 100ms the exposure information in the previous memory is sent out, and a maximum of 10 HTTP requests are sent. After the exposure information is sent, the sent exposure information will be deleted from the memory.
S4.4:查询所述内存中是否有曝光信息,若有,则返回步骤S4.3,否则关闭所述定时器后结束流程。S4.4: Query whether there is exposure information in the memory, if so, return to step S4.3, otherwise, the process ends after closing the timer.
具体的,每次曝光信息发送出去后,再次查询内存中是否有曝光信息,若有,则返回步骤S4.3,继续将曝光信息根据流控规则拼接上报,否则,先关闭定时器,然后结束流程。Specifically, each time the exposure information is sent out, check again whether there is exposure information in the memory. If so, return to step S4.3, continue to splice and report the exposure information according to the flow control rules, otherwise, first turn off the timer and then end Process.
作为一种较优的实施方式,本发明实施例中,所述方法还包括:As a preferred implementation manner, in an embodiment of the present invention, the method further includes:
配置对应页面的无痕曝光开关,实时开启或者关闭无痕曝光采集。Configure the traceless exposure switch on the corresponding page to enable or disable traceless exposure collection in real time.
具体的,本发明实施例中,可在后台管理页面中配置对应页面的无痕曝光开关,通过该无痕曝光开关可根据用户需求实时开启或者关闭无痕曝光采集。Specifically, in the embodiment of the present invention, a traceless exposure switch corresponding to the page can be configured in the background management page, and the traceless exposure switch can be turned on or off in real time according to user requirements.
图2是根据一示例性实施例示出的信息无痕曝光采集方法的另一流程图,以下结合图2示例性举例说明本发明的具体实施方式:Fig. 2 is another flow chart of a method for collecting information traceless exposure according to an exemplary embodiment. The following illustrates the specific implementation of the present invention with reference to Fig. 2 as an example:
例如,假设某页面A中三块区域需要开启无痕曝光,分别是:‘人气好货’、‘生活部落’、‘排行榜’且对应自动曝光的ID标识为“99348015073”,即ID1为“99348015073”。当页面加载完成以后,采集方法会搜索到ID1,并捕获ID1区域下的所有支持onload事件的DOM元素(即曝光元素),假设为DOMS1。采集方法继续在页面中搜索自动曝光规则R1。设置DOMS1的onload事件的回调方法,在此回调方法中根据曝光规则R1设置对应的曝光方案。For example, suppose that three areas of a page A need to be exposed without trace, namely:'popular goods','life tribes', and'ranking lists', and the ID corresponding to the automatic exposure is "99348015073", that is, ID1 is " 99348015073". When the page is loaded, the collection method will search for ID1 and capture all DOM elements (ie exposure elements) that support the onload event under the ID1 area, assuming DOMS1. The acquisition method continues to search for the automatic exposure rule R1 in the page. Set the callback method of the onload event of DOMS1. In this callback method, set the corresponding exposure scheme according to the exposure rule R1.
当用户浏览到页面中的ID1区域时,假设ID1区域中需要曝光的产品卡包括:P1、P2、P3、P4……产品卡中在当前页面中被加载(即被用户浏览)会自 动触发对应的onload事件,进而触发对应的采集方法,曝光日记采集开始。然后根据不同的采集规则采集曝光信息。采集的P1的曝光信息可包括:P1父节点A,A.herf,A.title…编码以后为_infoA。P1的_infoA日志上报格式G1如下://www.xxx.xxx/salog.gif?_infoA。然后根据流控规则按G1上报P1的曝光信息,例如,每100ms发送10条。When the user browses to the ID1 area on the page, assume that the product cards that need to be exposed in the ID1 area include: P1, P2, P3, P4... The product card loaded in the current page (that is, being browsed by the user) will automatically trigger the corresponding The onload event triggers the corresponding collection method, and the exposure diary collection starts. Then the exposure information is collected according to different collection rules. The collected exposure information of P1 may include: P1 parent node A, A.herf, A.title... after encoding, it is _infoA. P1's _infoA log report format G1 is as follows: //www.xxx.xxx/salog.gif? _infoA. Then according to the flow control rules, the exposure information of P1 is reported according to G1, for example, 10 pieces are sent every 100ms.
当页面上有很多类似P1产品卡Pn时,可实时配置流控规则上报曝光信息。例如,设置每1000ms发送10条等。When there are many similar P1 product cards Pn on the page, flow control rules can be configured in real time to report exposure information. For example, set to send 10 messages every 1000ms, etc.
图3是根据一示例性实施例示出的信息无痕曝光采集装置的结构示意图,参照图3所示,该装置包括:Fig. 3 is a schematic structural diagram of an information traceless exposure collection device according to an exemplary embodiment. Referring to Fig. 3, the device includes:
设置模块,用于从服务端获取无痕曝光配置信息并写入内存中,根据所述无痕曝光配置信息搜索开启无痕曝光的区域的内容并设置相关曝光规则;A setting module, configured to obtain the traceless exposure configuration information from the server and write it into the memory, search for the content of the region where traceless exposure is enabled according to the traceless exposure configuration information, and set relevant exposure rules;
搜索模块,用于搜索页面上所有已开启无痕曝光的区域,捕获所述已开启无痕曝光的区域的曝光元素onload事件,根据所述曝光元素获取曝光信息;The search module is configured to search for all areas on the page where the seamless exposure has been turned on, capture the exposure element onload events of the areas where the seamless exposure has been turned on, and obtain exposure information according to the exposure elements;
存储模块,用于将所述曝光信息模拟堆栈方式写入内存;The storage module is used to write the exposure information into the memory in a simulated stack mode;
上报模块,用于将所述写入内存的曝光信息根据所述曝光规则进行发送上报。The reporting module is configured to send and report the exposure information written in the memory according to the exposure rule.
具体的,为了解决传统曝光技术中出现的HTTP请求阻塞页面崩溃、闪退等问题,本发明实施例中,上报模块通过模拟堆栈存储方式将HTTP请求先进先出有序发出,并且在曝光信息上报这块加入了流控规则,将曝光信息分批定时定量的发出。Specifically, in order to solve the problems of HTTP request blocking page crashes and crashes that occur in the traditional exposure technology, in the embodiment of the present invention, the reporting module sends HTTP requests in an orderly first-in-first-out manner by simulating stack storage, and reports the exposure information. This block has joined the flow control rules, and the exposure information is sent out in batches, regularly and quantitatively.
作为一种较优的实施方式,本发明实施例中,所述搜索模块包括:As a preferred implementation manner, in this embodiment of the present invention, the search module includes:
判断单元,用于判断是否开启所述页面的无痕曝光,若开启,则捕获所述已开启无痕曝光的区域的曝光元素onload事件,并根据所述曝光元素获取曝光信息,否则不做处理。The judging unit is used to judge whether to turn on the traceless exposure of the page, if it is enabled, capture the exposure element onload event of the area where the traceless exposure has been turned on, and obtain the exposure information according to the exposure element, otherwise no processing is performed .
作为一种较优的实施方式,本发明实施例中,所述存储模块具体用于:As a preferred implementation manner, in the embodiment of the present invention, the storage module is specifically used for:
搜索所述onload事件的曝光元素的信息,并向上冒泡搜索所述曝光元素携 带的曝光信息,将所述曝光信息写入内存。Search for the information of the exposure element of the onload event, and bubble upward to search for the exposure information carried by the exposure element, and write the exposure information into the memory.
作为一种较优的实施方式,本发明实施例中,所述上报模块包括:As a preferred implementation manner, in the embodiment of the present invention, the reporting module includes:
第一查询单元,用于查询所述内存中是否有曝光信息;The first query unit is used to query whether there is exposure information in the memory;
第二查询单元,用于查询定时器是否开启;The second query unit is used to query whether the timer is on;
信息上报单元,用于将所述曝光信息根据所述曝光规则中的流控规则拼接上报,并将已发送的曝光信息从所述内存中删除。The information reporting unit is configured to splice and report the exposure information according to the flow control rules in the exposure rules, and delete the sent exposure information from the memory.
作为一种较优的实施方式,本发明实施例中,所述装置还包括:As a preferred implementation manner, in an embodiment of the present invention, the device further includes:
配置模块,用于配置对应页面的无痕曝光开关,实时开启或者关闭无痕曝光采集。The configuration module is used to configure the traceless exposure switch of the corresponding page, and enable or disable traceless exposure collection in real time.
综上所述,本发明实施例提供的技术方案带来的有益效果是:In summary, the beneficial effects brought about by the technical solutions provided by the embodiments of the present invention are:
1、本发明实施例提供的信息无痕曝光采集方法及装置,通过捕获开启自动曝光区域内的元素onload事件,业务方只需要在对应的后台管理页面中,配置是否开启曝光的开关,以及曝光频率即可,无需在手动调用对应采集脚本的曝光方法,从而去除了对自身业务的入侵,几乎可做到业务无埋点的情况下进行页面信息曝光的采集,从而大大降低了信息曝光采集的门槛;1. According to the method and device for seamless exposure collection of information provided by the embodiments of the present invention, by capturing the onload event of the element in the automatic exposure area, the business side only needs to configure the switch of whether to open the exposure and the exposure in the corresponding background management page. The frequency is sufficient, and there is no need to manually call the exposure method of the corresponding collection script, thereby eliminating the intrusion of your own business, and almost achieving the collection of page information exposure under the condition that the business has no buried points, thus greatly reducing the information exposure collection threshold;
2、本发明实施例提供的信息无痕曝光采集方法及装置,通过模拟堆栈存储方式将HTTP请求先进先出有序发送,解决了高并发导致的HTTP请求阻塞,web端页面卡死、崩溃等问题。而且可以实时的对曝光信息上报模块的规则进行配置,以应对不同业务场景。2. The method and device for seamless exposure collection of information provided by the embodiments of the present invention send HTTP requests in a first-in-first-out order by simulating stack storage, which solves the problem of HTTP request blocking caused by high concurrency, web page jams, crashes, etc. problem. Moreover, the rules of the exposure information reporting module can be configured in real time to deal with different business scenarios.
需要说明的是:上述实施例提供的信息无痕曝光采集装置在触发无痕曝光采集业务时,仅以上述各功能模块的划分进行举例说明,实际应用中,可以根据需要而将上述功能分配由不同的功能模块完成,即将装置的内部结构划分成不同的功能模块,以完成以上描述的全部或者部分功能。另外,上述实施例提供的信息无痕曝光采集装置与信息无痕曝光采集方法实施例属于同一构思,即该方法是基于该信息无痕曝光采集装置的,其具体实现过程详见方法实施例,这里不再赘述。It should be noted that when the information traceless exposure collection device provided in the above embodiment triggers the traceless exposure collection service, only the division of the above-mentioned functional modules is used as an example for illustration. In practical applications, the above-mentioned functions can be allocated by Different functional modules are completed, that is, the internal structure of the device is divided into different functional modules to complete all or part of the functions described above. In addition, the information traceless exposure collection device provided by the above-mentioned embodiment belongs to the same concept as the embodiment of the information traceless exposure collection method, that is, the method is based on the information traceless exposure collection device. For the specific implementation process, please refer to the method embodiments. I won't repeat it here.
本领域普通技术人员可以理解实现上述实施例的全部或部分步骤可以通过硬件来完成,也可以通过程序来指令相关的硬件完成,所述的程序可以存储于一种计算机可读存储介质中,上述提到的存储介质可以是只读存储器,磁盘或光盘等。Those of ordinary skill in the art can understand that all or part of the steps in the above embodiments can be implemented by hardware, or by a program instructing related hardware to complete. The program can be stored in a computer-readable storage medium. The storage medium mentioned can be a read-only memory, a magnetic disk or an optical disk, etc.
以上所述仅为本发明的较佳实施例,并不用以限制本发明,凡在本发明的精神和原则之内,所作的任何修改、等同替换、改进等,均应包含在本发明的保护范围之内。The above descriptions are only preferred embodiments of the present invention and are not intended to limit the present invention. Any modification, equivalent replacement, improvement, etc. made within the spirit and principle of the present invention shall be included in the protection of the present invention. Within range.

Claims (10)

  1. 一种信息无痕曝光采集方法,其特征在于,所述方法包括如下步骤:A method for seamless exposure and collection of information, characterized in that the method includes the following steps:
    S1:从服务端获取无痕曝光配置信息并写入内存中,根据所述无痕曝光配置信息搜索开启无痕曝光的区域的内容并设置相关曝光规则;S1: Obtain the traceless exposure configuration information from the server and write it into the memory, search for the content of the area where traceless exposure is enabled according to the traceless exposure configuration information, and set relevant exposure rules;
    S2:搜索页面上所有已开启无痕曝光的区域,捕获所述已开启无痕曝光的区域的曝光元素onload事件,根据所述曝光元素获取曝光信息;S2: Searching for all areas on the page where the seamless exposure has been turned on, capturing the exposure element onload events of the areas where the seamless exposure has been turned on, and obtaining exposure information according to the exposure elements;
    S3:将所述曝光信息模拟堆栈方式写入内存;S3: Write the exposure information into the memory in a simulated stack mode;
    S4:将所述写入内存的曝光信息根据所述曝光规则进行发送上报。S4: Send and report the exposure information written in the memory according to the exposure rule.
  2. 根据权利要求1所述的信息无痕曝光采集方法,其特征在于,所述搜索页面上所有已开启无痕曝光的区域后还包括:The method for collecting information traceless exposure according to claim 1, characterized in that, after searching all areas on the page for which traceless exposure has been turned on, the method further comprises:
    判断是否开启所述页面的无痕曝光,若开启,则捕获所述已开启无痕曝光的区域的曝光元素onload事件,并根据所述曝光元素获取曝光信息,否则不做处理。It is determined whether the traceless exposure of the page is turned on, and if it is turned on, the onload event of the exposure element in the area where the traceless exposure is turned on is captured, and the exposure information is obtained according to the exposure element, otherwise no processing is performed.
  3. 根据权利要求1或2所述的信息无痕曝光采集方法,其特征在于,所述步骤S3具体包括:The method according to claim 1 or 2, wherein the step S3 specifically includes:
    搜索所述onload事件的曝光元素的信息,并向上冒泡搜索所述曝光元素携带的曝光信息,将所述曝光信息写入内存。Search for the information of the exposure element of the onload event, and bubble upward to search for the exposure information carried by the exposure element, and write the exposure information into the memory.
  4. 根据权利要求1或2所述的信息无痕曝光采集方法,其特征在于,所述步骤S4具体包括:The method for collecting information traceless exposure according to claim 1 or 2, wherein the step S4 specifically includes:
    S4.1:查询所述内存中是否有曝光信息,若没有,则结束流程,否则,执行步骤S4.2;S4.1: Query whether there is exposure information in the memory, if not, end the process, otherwise, go to step S4.2;
    S4.2:查询定时器是否开启,若是,则直接执行步骤S4.3,否则,开启定时器后执行步骤S4.3;S4.2: Query whether the timer is turned on, if yes, proceed directly to step S4.3, otherwise, proceed to step S4.3 after turning on the timer;
    S4.3:将所述曝光信息根据所述曝光规则中的流控规则拼接上报,并将已发送的曝光信息从所述内存中删除;S4.3: Splicing and reporting the exposure information according to the flow control rules in the exposure rules, and delete the sent exposure information from the memory;
    S4.4:查询所述内存中是否有曝光信息,若有,则返回步骤S4.3,否则关闭所述定时器后结束流程。S4.4: Query whether there is exposure information in the memory, if so, return to step S4.3, otherwise, the process ends after closing the timer.
  5. 根据权利要求1或2所述的信息无痕曝光采集方法,其特征在于,所述方法还包括:The method for collecting information traceless exposure according to claim 1 or 2, wherein the method further comprises:
    配置对应页面的无痕曝光开关,实时开启或者关闭无痕曝光采集。Configure the traceless exposure switch on the corresponding page to enable or disable traceless exposure collection in real time.
  6. 一种信息无痕曝光采集装置,其特征在于,所述装置包括:An information seamless exposure collection device, characterized in that the device comprises:
    设置模块,用于从服务端获取无痕曝光配置信息并写入内存中,根据所述无痕曝光配置信息搜索开启无痕曝光的区域的内容并设置相关曝光规则;A setting module, configured to obtain the traceless exposure configuration information from the server and write it into the memory, search for the content of the region where traceless exposure is enabled according to the traceless exposure configuration information, and set relevant exposure rules;
    搜索模块,用于搜索页面上所有已开启无痕曝光的区域,捕获所述已开启无痕曝光的区域的曝光元素onload事件,根据所述曝光元素获取曝光信息;The search module is configured to search for all areas on the page where the seamless exposure has been turned on, capture the exposure element onload events of the areas where the seamless exposure has been turned on, and obtain exposure information according to the exposure elements;
    存储模块,用于将所述曝光信息模拟堆栈方式写入内存;The storage module is used to write the exposure information into the memory in a simulated stack mode;
    上报模块,用于将所述写入内存的曝光信息根据所述曝光规则进行发送上报。The reporting module is configured to send and report the exposure information written in the memory according to the exposure rule.
  7. 根据权利要求6所述的信息无痕曝光采集装置,其特征在于,所述搜索模块包括:The information seamless exposure collection device according to claim 6, wherein the search module comprises:
    判断单元,用于判断是否开启所述页面的无痕曝光,若开启,则捕获所述已开启无痕曝光的区域的曝光元素onload事件,并根据所述曝光元素获取曝光信息,否则不做处理。The judging unit is used to judge whether to turn on the traceless exposure of the page, if it is enabled, capture the exposure element onload event of the area where the traceless exposure has been turned on, and obtain the exposure information according to the exposure element, otherwise no processing is performed .
  8. 根据权利要求6或7所述的信息无痕曝光采集装置,其特征在于,所述存储模块具体用于:The information traceless exposure collection device according to claim 6 or 7, wherein the storage module is specifically used for:
    搜索所述onload事件的曝光元素的信息,并向上冒泡搜索所述曝光元素携带的曝光信息,将所述曝光信息写入内存。Search for the information of the exposure element of the onload event, and bubble upward to search for the exposure information carried by the exposure element, and write the exposure information into the memory.
  9. 根据权利要求6或7所述的信息无痕曝光采集装置,其特征在于,所述上报模块包括:The information seamless exposure collection device according to claim 6 or 7, wherein the reporting module comprises:
    第一查询单元,用于查询所述内存中是否有曝光信息;The first query unit is used to query whether there is exposure information in the memory;
    第二查询单元,用于查询定时器是否开启;The second query unit is used to query whether the timer is on;
    信息上报单元,用于将所述曝光信息根据所述曝光规则中的流控规则拼接上报,并将已发送的曝光信息从所述内存中删除。The information reporting unit is configured to splice and report the exposure information according to the flow control rules in the exposure rules, and delete the sent exposure information from the memory.
  10. 根据权利要求6或7所述的信息无痕曝光采集装置,其特征在于,所述装置还包括:The information traceless exposure collection device according to claim 6 or 7, wherein the device further comprises:
    配置模块,用于配置对应页面的无痕曝光开关,实时开启或者关闭无痕曝光采集。The configuration module is used to configure the traceless exposure switch of the corresponding page, and enable or disable traceless exposure collection in real time.
PCT/CN2019/106783 2019-01-23 2019-09-19 Information non-trace exposure collection method and device WO2020151241A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CA3168957A CA3168957A1 (en) 2019-01-23 2019-09-19 Incognito-exposure information acquiring method and apparatus thereof

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201910066981.2A CN111475756B (en) 2019-01-23 2019-01-23 Information traceless exposure acquisition method and device
CN201910066981.2 2019-01-23

Publications (1)

Publication Number Publication Date
WO2020151241A1 true WO2020151241A1 (en) 2020-07-30

Family

ID=71735894

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/106783 WO2020151241A1 (en) 2019-01-23 2019-09-19 Information non-trace exposure collection method and device

Country Status (3)

Country Link
CN (1) CN111475756B (en)
CA (1) CA3168957A1 (en)
WO (1) WO2020151241A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116668506A (en) * 2023-07-26 2023-08-29 北京搜狐互联网信息服务有限公司 Feed exposure reporting and recording method, server, equipment and medium

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113721803B (en) * 2021-08-18 2023-07-11 腾讯科技(深圳)有限公司 Data generation method, device, equipment and storage medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130179767A1 (en) * 2012-01-05 2013-07-11 Microsoft Corporation Reducing Ad Impact to Browser Onload Event
CN103268562A (en) * 2013-05-23 2013-08-28 中国科学院计算机网络信息中心 Internet advertisement audience population ascribed characteristic monitoring method and system
CN107871242A (en) * 2016-09-28 2018-04-03 杭州顺网科技股份有限公司 Advertisement delivery system and method
CN108230024A (en) * 2017-12-29 2018-06-29 北京皮尔布莱尼软件有限公司 Automotive engine system is launched based on the advertisement that clue is collected
CN108365971A (en) * 2018-01-10 2018-08-03 深圳市金立通信设备有限公司 Daily record analytic method, equipment and computer-readable medium
CN109218272A (en) * 2017-07-07 2019-01-15 腾讯科技(北京)有限公司 Media content method for pushing, device, server, system and storage medium

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2012011151A1 (en) * 2010-07-21 2012-01-26 Empire Technology Development Llc Information processing apparatus, server-client system, and computer program product
CN105450698B (en) * 2014-08-25 2019-03-12 苏宁易购集团股份有限公司 A kind of exposure Log Collect System and method
CN108846116A (en) * 2018-06-26 2018-11-20 北京京东金融科技控股有限公司 Page Impression collecting method, system, electronic equipment and storage medium

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130179767A1 (en) * 2012-01-05 2013-07-11 Microsoft Corporation Reducing Ad Impact to Browser Onload Event
CN103268562A (en) * 2013-05-23 2013-08-28 中国科学院计算机网络信息中心 Internet advertisement audience population ascribed characteristic monitoring method and system
CN107871242A (en) * 2016-09-28 2018-04-03 杭州顺网科技股份有限公司 Advertisement delivery system and method
CN109218272A (en) * 2017-07-07 2019-01-15 腾讯科技(北京)有限公司 Media content method for pushing, device, server, system and storage medium
CN108230024A (en) * 2017-12-29 2018-06-29 北京皮尔布莱尼软件有限公司 Automotive engine system is launched based on the advertisement that clue is collected
CN108365971A (en) * 2018-01-10 2018-08-03 深圳市金立通信设备有限公司 Daily record analytic method, equipment and computer-readable medium

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116668506A (en) * 2023-07-26 2023-08-29 北京搜狐互联网信息服务有限公司 Feed exposure reporting and recording method, server, equipment and medium
CN116668506B (en) * 2023-07-26 2023-10-03 北京搜狐互联网信息服务有限公司 Feed exposure reporting and recording method, server, equipment and medium

Also Published As

Publication number Publication date
CA3168957A1 (en) 2020-07-30
CN111475756A (en) 2020-07-31
CN111475756B (en) 2023-04-14

Similar Documents

Publication Publication Date Title
CN107506451B (en) Abnormal information monitoring method and device for data interaction
CN103246595B (en) Application management method, device, server and terminating unit
CN111352921A (en) ELK-based slow query monitoring method and device, computer equipment and storage medium
CN110851308A (en) Test method, test device, electronic equipment and storage medium
CN103716690A (en) Multimedia live broadcast whistle-blowing method, terminal, server and system
CN111881011A (en) Log management method, platform, server and storage medium
WO2020151241A1 (en) Information non-trace exposure collection method and device
CN109710439B (en) Fault processing method and device
CN102957550A (en) System and method for alarming based on log detection
CN111367873A (en) Log data storage method and device, terminal and computer storage medium
CN109635210B (en) Behavior data reporting method, device, equipment and storage medium
CN114547076A (en) Data processing method and data processing system
CN112217684A (en) API gateway monitoring method, device and system
CN108520052B (en) Slow query information retrieval method and device, server and readable storage medium
WO2020238567A1 (en) Method and apparatus for resource detection
CN110717130B (en) Dotting method, dotting device, dotting terminal and storage medium
CN109240664B (en) Method and terminal for collecting user behavior information
CN113987393A (en) Web page operation recorder, system, device and method
CN104281581A (en) Method and system for monitoring exposure of content at recommendation position of webpage
CN113688106B (en) File monitoring method and device
CN111753141A (en) Data management method and related equipment
CN104252453A (en) Detection method and system for write operation in webpage recommendation location content access track
CN112860720A (en) Storage capacity updating method and device
CN115203063B (en) Playback method and system of production flow re-running risk program based on real-time recording
CN108052648B (en) Website picture deleting method and device and electronic equipment

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19911377

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 19911377

Country of ref document: EP

Kind code of ref document: A1

ENP Entry into the national phase

Ref document number: 3168957

Country of ref document: CA