WO2023169165A1 - Access data processing method and apparatus, electronic device, and computer readable medium - Google Patents

Access data processing method and apparatus, electronic device, and computer readable medium Download PDF

Info

Publication number
WO2023169165A1
WO2023169165A1 PCT/CN2023/076143 CN2023076143W WO2023169165A1 WO 2023169165 A1 WO2023169165 A1 WO 2023169165A1 CN 2023076143 W CN2023076143 W CN 2023076143W WO 2023169165 A1 WO2023169165 A1 WO 2023169165A1
Authority
WO
WIPO (PCT)
Prior art keywords
page
access
application
track
terminal device
Prior art date
Application number
PCT/CN2023/076143
Other languages
French (fr)
Chinese (zh)
Inventor
吴轶伦
Original Assignee
北京京东拓先科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京京东拓先科技有限公司 filed Critical 北京京东拓先科技有限公司
Publication of WO2023169165A1 publication Critical patent/WO2023169165A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F21/00Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F21/60Protecting data
    • G06F21/604Tools and structures for managing or administering access control systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/903Querying
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/906Clustering; Classification

Definitions

  • the present disclosure relates to the field of computer technology, specifically to data analysis and other technical fields, and in particular to access data processing methods and devices, electronic equipment, computer-readable media and computer program products.
  • Embodiments of the present disclosure provide access data processing methods and devices, electronic devices, computer-readable media, and computer program products.
  • an access data processing method includes: collecting access logs of pages of the application accessed by different terminal devices through buried point data of the application; based on the access log, obtaining at least The access track of each terminal device to the application page in at least one visit in a preset time period; aggregate statistics on terminal devices with the same access track to obtain an aggregation cluster of corresponding access tracks and terminal device information; based on each aggregation The access track and terminal device information corresponding to the cluster are used to optimize the application page and obtain the optimized application.
  • the above method further includes: receiving a query track, where the query track includes at least one page of the application; and matching the query track with the access track corresponding to each aggregation cluster.
  • the traces are matched; in response to determining that the query trace matches the access trace corresponding to the aggregation cluster, the terminal device information corresponding to the aggregation cluster is obtained and displayed.
  • the above method further includes: using a Sankey diagram to display the terminal device information and access trajectories corresponding to each aggregation cluster.
  • the above method further includes: labeling all pages of the application; and in response to receiving the label of the page, performing statistics on access trajectories where the page is located to obtain access trajectory statistical results.
  • the above method also includes: marking the landing page for the first page of the access track corresponding to each aggregation cluster; in response to receiving a query condition using one page as the landing page, obtaining and displaying all the pages as the landing page.
  • the access track of the landing page in response to receiving a query condition using one page as the landing page, obtaining and displaying all the pages as the landing page.
  • the above method also includes: marking the exit page for the last page of the access track corresponding to each aggregation cluster; in response to receiving a query condition using one page as the exit page, obtaining and displaying all the pages as landing pages. Page access track.
  • the pages of the above-mentioned application include: at least one landing page, and the access track of each aggregation cluster is provided with a bounce node adjacent to the last page of the access track.
  • the above-mentioned access based on each aggregation cluster corresponds to Trajectory and terminal device information, optimize the application page, and obtain the optimized application, including: targeting at least one page as a landing page, when the next node of the landing page in the access trajectory is a bounce node, calculating the bounce node
  • the above-mentioned optimization of application pages based on the access trajectories and terminal device information corresponding to each aggregation cluster to obtain an optimized application includes: querying the preset access trajectories from the access trajectories corresponding to all aggregation clusters; Based on the terminal device information of each page in the preset access track, calculate the conversion rate of each page in the preset access track; in response to the fact that the conversion rate of a page in the preset access track is less than that of pages other than the page in the preset access track conversion rate, optimize the page, and get the optimized application.
  • the access track of each aggregation cluster is provided with a bounce node adjacent to the last page of the access track, and the application pages are optimized based on the access track and terminal device information corresponding to each aggregation cluster.
  • optimized applications include: Traverse the access trajectories corresponding to all aggregated clusters, and calculate the number of terminal devices in all access trajectories that jump out of the page before the node; in response to the number of terminal devices that jump out of the page before the node being greater than the preset threshold, optimize the page before jumping out of the node , get the optimized application.
  • an access data processing device includes: a collection unit configured to collect access logs of pages of the application accessed by different terminal devices through buried point data of the application;
  • the acquisition unit is configured to obtain, based on the access log, access trajectories of each terminal device to the page of the application in at least one visit in at least one preset time period;
  • the aggregation unit is configured to perform an aggregation operation on terminal devices with the same access trajectories.
  • the optimization unit is configured to optimize application pages based on access trajectories and terminal device information corresponding to each aggregation cluster to obtain an optimized application.
  • the above device further includes: a receiving unit configured to receive a query track, where the query track includes at least one page of the application; a matching unit configured to match the query track with access tracks corresponding to each aggregation cluster. ; The obtaining unit is configured to obtain and display the terminal device information corresponding to the aggregation cluster in response to determining that the query trace matches the access trace corresponding to the aggregation cluster.
  • the above apparatus further includes: a display unit configured to use a Sankey diagram to display the terminal device information and access trajectories corresponding to each aggregation cluster.
  • the above-mentioned device further includes: a page labeling unit configured to label all pages of the application; a statistics unit configured to respond to receiving the label of the page and perform statistics on the access track where the page is located, Obtain the access trajectory statistics results.
  • the above device further includes: a landing annotation unit configured to annotate the landing page for the first page of the access track corresponding to each aggregation cluster; a landing query unit configured to respond to receiving a page as the landing page. When querying the page, get and display all the access tracks of this page as a landing page.
  • the above device further includes: an exit labeling unit configured to label the last page of the access track corresponding to each aggregation cluster as an exit page; an exit query unit configured to respond to receiving a page as the exit page When the query conditions are Go to and display all access tracks of this page as a landing page.
  • the pages of the above-mentioned application include: at least one landing page, and the access track of each aggregation cluster is provided with a bounce node adjacent to the last page of the access track.
  • the above-mentioned optimization unit includes: a computing module, It is configured to target at least one page as a landing page. When the next node of the landing page in the access track is a bounce node, calculate the number of terminal devices of the bounce node and the number of terminal devices of the landing page at the first node of the access track. The proportion; the page optimization module is configured to respond to the proportion being greater than the average exit rate of all landing pages, optimize the landing page, and obtain an optimized application.
  • the above-mentioned optimization unit includes: a query module, configured to query preset access trajectories from access trajectories corresponding to all aggregated clusters; a conversion module, configured to based on the terminal device information of each page of the preset access trajectories , calculate the conversion rate of each page in the preset access track; the application optimization module is configured to respond to the conversion rate of a page in the preset access track being less than the conversion rate of pages other than the page in the preset access track, This page is optimized and the optimized application is obtained.
  • the access track of each aggregation cluster is provided with a jump node adjacent to the last page of the access track.
  • the above-mentioned optimization unit includes: a traversal module configured to traverse the access tracks corresponding to all aggregation clusters. , calculate the number of terminal devices in all access trajectories that jump out of the page before the node; the node optimization module is configured to respond to the number of terminal devices that jump out of the page before the node is greater than the preset threshold, optimize the page before jumping out of the node, Get optimized applications.
  • an electronic device includes: one or more processors; a storage device with one or more programs stored thereon; when one or more programs Executed by one or more processors, causing the one or more processors to implement the method described in any of the above implementations.
  • a computer-readable medium is provided, with a computer program stored thereon.
  • the program is executed by a processor, the method described in any of the above implementations is implemented.
  • a computer program product including a computer program.
  • the computer program When executed by a processor, the computer program implements the method described in any of the above implementations.
  • Figure 1 is an exemplary system architecture diagram in which an embodiment of the present disclosure may be applied;
  • Figure 2 is a flow chart of an embodiment of an access data processing method according to the present disclosure
  • Figure 3 is a flow chart of another embodiment of an access data processing method according to the present disclosure.
  • Figure 4 is a schematic diagram using a Sankey diagram to display access trajectories according to the present disclosure
  • Figure 5 is a schematic structural diagram of an embodiment of an access data processing device according to the present disclosure.
  • FIG. 6 is a schematic structural diagram of an electronic device suitable for implementing embodiments of the present disclosure.
  • FIG. 1 illustrates an exemplary system architecture 100 to which the access data processing method of the present disclosure may be applied.
  • the system architecture 100 may include terminal devices 101, 102, 103, a network 104 and a server 105.
  • the network 104 is a medium used to provide communication links between the terminal devices 101, 102, 103 and the server 105.
  • Network 104 may include various connection types and may typically include wireless communication links and the like.
  • the terminal devices 101, 102, 103 interact with the server 105 through the network 104 to receive or send messages, etc.
  • Various communication client applications can be installed on the terminal devices 101, 102, and 103, such as instant messaging tools, email clients, etc.
  • the terminal devices 101, 102, and 103 may be hardware or software. When the terminal devices 101, 102, and 103 are hardware, they may be devices with communication and control functions. superior The device can communicate with the server 105. When the terminal devices 101, 102, and 103 are software, they can be installed in the above-mentioned terminals. It may be implemented as multiple software or software modules (such as software or software modules used to access applications), or as a single software or software module. There are no specific limitations here.
  • the server 105 may be a server that provides various services, such as an application server that provides support for applications on the terminal devices 101, 102, and 103.
  • the application server can analyze and process the relevant information of each terminal in the network, and feed back the processing results (such as optimized application installation programs, etc.) to the terminal device.
  • the server can be hardware or software.
  • the server can be implemented as a distributed server cluster composed of multiple servers or as a single server.
  • the server is software, it can be implemented as multiple software or software modules (for example, software or software modules used to provide distributed services), or it can be implemented as a single software or software module. There are no specific limitations here.
  • the access data processing method provided by the embodiment of the present disclosure is generally executed by the server 105.
  • the access data processing method includes the following steps:
  • Step 201 Collect access logs of pages of the application accessed by different terminal devices through the embedded data of the application.
  • the hidden data of the application is obtained by burying data on the application page.
  • Buried data is a common data collection method for page analysis. Specifically, it refers to injecting a paragraph or paragraph into the source code corresponding to each page of the application.
  • Several scripts are used to obtain the behavior data corresponding to the operation events of the terminal device operation page. The behavior data of multiple different time periods are combined together to obtain the access log of the terminal device.
  • the buried point data of the application includes the behavior data of at least one terminal device operating the application.
  • the terminal device information in the buried point data can be used to determine the behavior data of the terminal device in at least one terminal device.
  • the terminal device The behavioral data are arranged together according to the timestamp, and the access log of the terminal can be obtained.
  • Access logs can be obtained through multiple channels, for example, by using tracking tools to obtain tracking data of applications installed on terminal devices (such as terminals 101, 102, and 103 shown in Figure 1) in real time, or by obtaining pre-set tracking data from a database. Buried data after burying the application.
  • the access log may be a browsing log of the terminal device browsing the application page.
  • the tracking data reported to the execution subject by the tracking tool or script may include: reporting site, The unique identifier of the page, the number that identifies the same visit, the device number, the request time period, etc. Every time the terminal device browses, the tracking tool or script will generate a piece of tracking data and store the tracking data.
  • the page of the application can be a page opened through a browser, or a native page of the application, or a page embedded in the application that needs to be invoked; the access log is the data generated by multiple device terminals in different visits.
  • a visit refers to a device terminal accessing an application.
  • the user opening the application through the device terminal is the beginning of a visit.
  • Step 202 Based on the access log, obtain the access track of each terminal device to the page of the application in at least one visit in at least one preset time period.
  • the access log is a collection of behavior data of multiple terminal devices operating on the application page.
  • the access log can be used to obtain the information about the terminal device's operation of the application page. Visit trajectories during visits.
  • the application has multiple pages, and the access track is a record of the user accessing the application through the terminal device and operating each page in the application.
  • the application includes a home page, an activity page, a search page, and a details page.
  • the user accesses the application through the device.
  • the terminal browses the homepage, activity page, homepage, and search page of the application in sequence, and the access track corresponding to the device terminal is homepage-activity page-homepage-search page.
  • the access track includes multiple nodes, each node corresponding to a page of the application. Any two nodes among the multiple nodes can be the same or different.
  • the access track above has two home pages, that is, in the access track In the track, there are two nodes corresponding to the same page.
  • the preset time period can be set according to application development requirements.
  • the preset time period is one day, or half a day, etc.
  • Daily scheduled tasks can be used to calculate the access track of each terminal device to the application's page in at least one visit through the basic traffic data obtained.
  • the calculation task is executed at 3 a.m. every day.
  • the access track of each terminal device to the application page in one visit in the preset time period refers to the collection of pages viewed by the same terminal device in the same visit sequence in the preset time period.
  • the above-mentioned obtaining the access track of each terminal to the application page in at least one visit in at least one preset time period includes: taking the preset time period as the cycle, sorting the pages browsed by the same user in the same visit according to the timestamp from first to last. , perform adjacent deduplication according to the unique identifier of the page (two adjacent nodes in the access trajectory are the same and remove one of the nodes), and obtain the access trajectory of the terminal device in the preset time period.
  • an exit identifier can be added as a unified APP exit after the last page in the access track of all terminal devices, that is, exit is the exit node of the access track.
  • the access track of terminal A is home page -> activity page -> home page -> search page -> exit.
  • Step 203 Aggregate statistics on terminal devices with the same access trajectories to obtain aggregation clusters corresponding to access trajectories and terminal device information.
  • the same access trajectory corresponds to at least one terminal device.
  • an aggregation cluster corresponding to the access trajectory can be obtained.
  • the obtained aggregation cluster can include at least one terminal device. Terminal device information.
  • the terminal device information is information related to the terminal device.
  • the terminal device information includes: a unique identifier of the terminal device, a device number of the terminal device, and the number of terminal devices.
  • the number of terminal devices is the sum of the number of at least one terminal with the same access trajectory.
  • the access trajectory of terminal A is home page -> activity page -> home page -> search page -> exit, and only on that day
  • the access track of terminal B is exactly the same as that of terminal A.
  • the access track of terminal B is also home page -> activity page -> home page -> search page -> exit.
  • the number of terminal devices on this access track that day is 2.
  • the access track is a page or node arranged in forward chronological order.
  • the access track clearly identifies the pages accessed from the first step to the Nth.
  • the aggregation table of the aggregation cluster is shown in Table 1.
  • the first to fourth steps are the accessed pages of the access track, or the set nodes (such as exit nodes).
  • the accessed pages include: activity page, details page, search page, homepage, list page, These access pages are related to the application. When the application is different, the pages contained are different, and the names of the pages are also different accordingly.
  • Step 204 Based on the access trajectories and terminal device information corresponding to each aggregation cluster, optimize the application page to obtain an optimized application.
  • the application may have one or more pages. Based on the access trajectories and terminal device information corresponding to each aggregation cluster, the application pages may be optimized. This may be to optimize one page of the application or to optimize multiple pages. Page optimization. Among them, the optimization of the page can include: deleting the page, rearranging the page, modifying the content of the page and other optimization methods.
  • an aggregation cluster corresponds to an access track in one visit.
  • An aggregation cluster may have terminal device information of one or more terminal devices, and the one or more terminal devices correspond to an access track in the same visit. .
  • the pages of the above application include: at least one landing page, and the access track of each aggregate cluster is provided with a jump node adjacent to the last page of the access track.
  • optimize the application pages Based on each Aggregate the access trajectories and terminal device information corresponding to the cluster, optimize the application pages, and obtain the optimized application, including: targeting at least one page as a landing page, when the next node of the landing page in the access trajectory is a bounce node, Calculate the ratio of the number of terminal devices that jump out of the node to the number of terminal devices that are at the first node of the access trajectory for the landing page; in response to the ratio being greater than the average exit rate of all landing pages, optimize the landing page to obtain the optimized application .
  • the landing page refers to a specific activity with a clear theme that the visitor sees in other places outside the application, such as attractive discount information published through emails, social media or advertisements, etc., and is linked to after clicking.
  • This landing page can be Any page in the application, for example, the landing page is the details page in the application, or the landing page is the home page in the application, etc.
  • the exit node is the last node in all access trajectories. Through the exit node, it can be determined that the terminal device has finished accessing the application.
  • exit rate the number of terminal devices whose next node on the landing page is the exit node, as a proportion of the number of terminal devices on the first page of the landing page's access track.
  • the average exit rate is calculated by counting the traffic of all landing pages. The obtained value, specifically, the average exit rate has three calculation methods: weighted average, arithmetic average and geometric average:
  • Weighted average The sum of the number of terminal devices that jump out of the node after the landing page in all access trajectories in the application is used as the numerator, and the sum of the number of terminal devices that have the landing page on the first page of the access trajectory is used as the denominator. The proportion value obtained by comparison .
  • Geometric mean The exit rates of all landing pages are multiplied and raised to the power N. N is equal to the number of landing pages.
  • landing page there is not only one landing page, but there can be many landing pages.
  • the definition of landing page is that the first page of the application is called the landing page.
  • the access situation of the landing page can be effectively analyzed, providing a reliable basis for improving the landing page. in accordance with.
  • the application pages are optimized based on the access trajectories and terminal device information corresponding to each aggregation cluster, and the optimized application is obtained, including: from the access trajectories corresponding to all aggregation clusters Query the preset access track; based on the terminal device information of each page of the preset access track, calculate the conversion rate of each page in the preset access track; in response to the conversion rate of a page in the preset access track being less than the preset access track except The conversion rate of pages other than this page, the page is optimized, and the optimized application is obtained.
  • the conversion rate is related to the number of terminal device visits to each page. For example, if a visit track is from the search page to the product details page, then the conversion rate of the search page is: the ratio with the number of terminal devices on the search page as the denominator and the number of terminal devices from the search page to the product details page as the numerator.
  • the page is a page to be optimized and needs to be Consider removing or changing this page.
  • the conversion rate of each page of the application is calculated to ensure the optimization effect of the core pages in the application and improve the efficiency of application optimization.
  • the access track of each aggregation cluster is provided with a jump-out node adjacent to the last page of the access track.
  • the above is based on the access track and terminal device information corresponding to each aggregation cluster.
  • optimize the application pages to obtain the optimized application including: traversing the access trajectories corresponding to all aggregation clusters, calculating the number of terminal devices in all access trajectories that jump out of the page before the node; responding to the terminal devices that jump out of the page before the node If the number of devices is greater than the preset threshold, the page before jumping out of the node will be optimized to obtain an optimized application.
  • the traffic of all jump pages (the number of terminal devices) is counted.
  • the jump page with the highest traffic is the high-frequency jump page.
  • the high-frequency jumping pages in the application are obtained, which provides a reliable basis for optimizing the application's pages.
  • the access data processing method provided by the embodiments of the present disclosure can view the path distribution of users when using products through cluster information without configuration, and supports global observation of users' hot pages and main access trajectories in products.
  • users can be grouped according to the terminal device information and access trajectories in the cluster, supporting the comparison of the differences in behavioral paths of different types of users.
  • landing pages in the access track you can also filter different landing pages as a starting point to view the user's subsequent path distribution. It can not only view the global user path distribution with zero configuration, but also meet the needs of configurable analysis to a certain extent.
  • the access data processing method first collects the access logs of the pages of the application accessed by different terminal devices through the embedded data of the application; secondly, based on the access data, Query the log to obtain the access track of each terminal device to the application page in at least one visit in at least one preset time period; thirdly, perform aggregate statistics on the terminal devices with the same access track to obtain the corresponding access track and terminal device information aggregation clusters; finally, based on the access trajectories and terminal device information corresponding to each aggregation cluster, the application page is optimized to obtain an optimized application. Therefore, based on the access trajectory of the terminal device to the application page in one visit, clustering the terminal devices can determine all terminal devices with the same access trajectory, providing an effective optimization basis for the application page and improving application optimization. Efficiency and improved user experience.
  • the above-mentioned access data processing method also includes: receiving query trajectories, where the query trajectories include at least one page of the application; and corresponding query trajectories to each aggregation cluster.
  • the access traces are matched; in response to determining that the query trace matches the access trace corresponding to the aggregation cluster, the terminal device information corresponding to the aggregation cluster is obtained and displayed.
  • the query track sent by the developer can be received.
  • the query track can be the same as the access track corresponding to the aggregation cluster, or it can be different from the access track corresponding to the aggregation cluster.
  • Trajectory when the query trajectory is the same as the access trajectory, it is determined that the query trajectory matches the access trajectory corresponding to the aggregation cluster. Accordingly, the terminal device information of the aggregation cluster corresponding to the query trajectory can be obtained, such as the number of terminal devices, so that the query trajectory can be analyzed. Data access traffic.
  • the information of the terminals of the aggregation cluster is queried through the query trajectory, which provides a reliable query basis for the data access status of the application page and ensures the reliability of the application access data analysis.
  • the above access data processing method also includes: labeling all pages of the application; in response to receiving the label of the page, labeling the page where the page is located. Perform statistics on access trajectories to obtain access trajectories statistical results.
  • all pages of the application are tagged, and the generated access trajectories also have tagged tags.
  • Timing can determine the access track where the page is located and the aggregation cluster where the page is located, and then the number of access tracks, the number of aggregation clusters and other information can be obtained.
  • the access trajectory statistics results include: access trajectory name, access trajectory number, aggregation cluster corresponding to the access trajectory, number of terminal devices corresponding to the access trajectory, etc.
  • tags can be text, symbols, codes, etc.
  • the tag of the received page can uniquely indicate the page of the application.
  • independent query of the page can be supported, as well as the access track of the received page.
  • Statistics thereby realizing an interleaved query method with access trajectories as horizontal queries and pages as vertical queries.
  • each page is tagged, and the user path is tagged according to the included page.
  • the user can view the access track through the page after inputting the unique identifier of the page of interest. Distribution, you can also effectively query the access trajectory statistics such as the number of terminal devices in the aggregate cluster, which improves the reliability of page retrieval.
  • the above method further includes: marking the landing page for the first page of the access track corresponding to each aggregation cluster; in response to receiving a page as When querying the landing page, obtain and display all the access tracks of this page as a landing page.
  • query conditions for the access trajectories can be added; for example, when an access trajectory includes: details page and search page; then the details page is used as the landing page.
  • the query conditions are: when the details page is used as the landing page, query and display all access trajectories with the details page as the landing page.
  • the Sankey diagram can be used to display and query the access trajectories.
  • the landing page is marked for the first page of the access track corresponding to each aggregation cluster.
  • the query condition of the landing page all access tracks of this page as the landing page are obtained and displayed, which improves access Richness of trajectory queries.
  • the above method also includes: marking the last page of the access track corresponding to each aggregation cluster as an exit page; in response to receiving a page as the exit Page query conditions are obtained and Display all access tracks of this page as a landing page.
  • the exit page is marked on the last page of all access trajectories, and query conditions for the access trajectories can be added; for example, when an access trajectory includes: details page and search page; then the search page is used as the exit page.
  • querying The conditions are: when the search page is used as the exit page, query and display the access trajectories in all access trajectories that use the search page as the exit page. It should be noted that a Sankey diagram can be used to display the access trajectories.
  • the exit page is marked for the last page of the access track corresponding to each aggregation cluster.
  • the query condition for the exit page all the access tracks for this page as the exit page are obtained and displayed, which improves the access track. The richness of the query.
  • FIG. 3 shows a process 300 of another embodiment of the access data processing method provided by the present disclosure.
  • the access data processing method may include the following steps:
  • Step 301 Collect access logs of pages of the application accessed by different terminal devices through the embedded data of the application.
  • Step 302 Based on the access log, obtain the access track of each terminal device to the page of the application in at least one visit in at least one preset time period.
  • Step 303 Perform aggregation statistics on terminal devices with the same access trajectories to obtain aggregation clusters corresponding to access trajectories and terminal device information.
  • Step 304 Use a Sankey diagram to display the terminal device information and access trajectories corresponding to each aggregation cluster.
  • Sankey diagram Sankey energy distribution diagram, also called Sankey energy balance diagram. It is a specific type of flowchart in which the width of the extended branches corresponds to the size of the data flow.
  • the Sankey diagram can show the access trajectory distribution of an application without configuration.
  • the number of terminal devices passing through the page is prompted; when the mouse slides between any two steps of the access track, the prompt is The number of terminal devices that go from one page to another.
  • FIG 4 it is a schematic diagram showing access trajectories using Sankey diagram.
  • S, T, W, V, M, N, and U represent different pages in the application.
  • terminal device information such as the number of terminal devices, ID, etc.
  • Step 305 Based on the access trajectories and terminal device information corresponding to each aggregation cluster, optimize the application page to obtain an optimized application.
  • the access data processing method provided in this embodiment uses a Sankey diagram to display the terminal device information and access trajectories before optimizing the application page, which can visually represent the aggregation clusters and provide a vivid trajectory display effect for application improvement. .
  • the present disclosure provides an embodiment of an access data processing device.
  • the device embodiment corresponds to the method embodiment shown in Figure 2.
  • the device can be specifically applied in various electronic devices.
  • an embodiment of the present disclosure provides an access data processing device 500.
  • the device 500 includes: a collection unit 501, an acquisition unit 502, an aggregation unit 503, and an optimization unit 504.
  • the above-mentioned collection unit 501 may be configured to collect access logs of pages of the application accessed by different terminal devices through buried point data of the application.
  • the above-mentioned obtaining unit 502 may be configured to obtain, based on the access log, the access track of each terminal device to the page of the application in at least one visit in at least one preset time period.
  • the above-mentioned aggregation unit 503 may be configured to perform aggregation statistics on terminal devices with the same access trajectories, and obtain aggregation clusters corresponding to access trajectories and terminal device information.
  • the above-mentioned optimization unit 504 may be configured to optimize application pages based on access trajectories and terminal device information corresponding to each aggregation cluster to obtain an optimized application.
  • the specific processing of the collection unit 501, the acquisition unit 502, the aggregation unit 503, and the optimization unit 504 and the technical effects they bring can be referred to the steps in the corresponding embodiment of Figure 2 respectively. 201, step 202, step 203, Step 204.
  • the above-mentioned device 500 further includes: a receiving unit (not shown in the figure), a matching unit (not shown in the figure), and a obtaining unit (not shown in the figure).
  • the above-mentioned receiving unit may be configured to receive a query track, where the query track includes at least one page of the application.
  • the above-mentioned matching unit may be configured to match the query trajectories with the access trajectories corresponding to each aggregation cluster.
  • the above obtaining unit may be configured to obtain and display the terminal device information corresponding to the aggregation cluster in response to determining that the query trajectory matches the access trajectory corresponding to the aggregation cluster.
  • the above-mentioned device 500 further includes: a display unit (not shown in the figure).
  • the above display unit may be configured to use a Sankey diagram to display the terminal device information and access trajectories corresponding to each aggregation cluster.
  • the above-mentioned device 500 also includes: a page annotation unit (not shown in the figure) and a statistics unit (not shown in the figure).
  • the above-mentioned labeling unit can be configured to label all pages of the application.
  • the above statistics unit may be configured to, in response to receiving the tag of the page, perform statistics on the access track where the page is located, and obtain access track statistics results.
  • the above-mentioned device 500 also includes: a landing annotation unit (not shown in the figure) and a landing query unit (not shown in the figure).
  • the above-mentioned landing annotation unit may be configured to perform landing page annotation on the first page of the access track corresponding to each aggregation cluster.
  • the landing query unit is configured to, in response to receiving a query condition using a page as a landing page, obtain and display all access tracks of the page as a landing page.
  • the above-mentioned device 500 also includes: an exit annotation unit (not shown in the figure) and an exit query unit (not shown in the figure).
  • the above-mentioned exit labeling unit can be configured to perform exit page labeling on the last page of the access track corresponding to each aggregation cluster.
  • the above-mentioned exit query unit may be configured to, in response to receiving a query condition that uses a page as an exit page, obtain and display all access trajectories of this page as a landing page.
  • the pages of the above-mentioned application include: at least one landing page, and the access track of each aggregation cluster is provided with a bounce node adjacent to the last page of the access track.
  • the above-mentioned optimization unit 504 includes: a calculation module (not shown in the figure), page optimization module (not shown in the figure).
  • the above-mentioned computing module can be configured to target at least one A page is used as a landing page.
  • the next node of the landing page in the access trajectory is a bounce node
  • the ratio of the number of terminal devices of the bounce node to the number of terminal devices of the landing page at the first node of the access trajectory is calculated.
  • the above page optimization module can be configured to optimize the landing page in response to a ratio greater than the average exit rate of all landing pages to obtain an optimized application.
  • the above-mentioned optimization unit 504 includes: a query module (not shown in the figure), a conversion module (not shown in the figure), and an application optimization module (not shown in the figure).
  • the above query module can be configured to query the preset access trajectories from the access trajectories corresponding to all aggregation clusters.
  • the above-mentioned conversion module may be configured to calculate the conversion rate of each page in the preset access track based on the terminal device information of each page in the preset access track.
  • the above-mentioned application optimization module may be configured to optimize the page in response to the conversion rate of a page in the preset access track being less than the conversion rate of pages other than the page in the preset access track to obtain an optimized application.
  • the access track of each aggregation cluster is provided with a jump node adjacent to the last page of the access track.
  • the above-mentioned optimization unit 504 includes: a traversal module (not shown in the figure), a node optimization module (not shown in the figure).
  • the above traversal module can be configured to traverse the access trajectories corresponding to all aggregation clusters, and calculate the number of terminal devices in all access trajectories that jump out of the page before the node.
  • the above-mentioned node optimization module may be configured to respond to the number of terminal devices that jump out of the page before the node is greater than the preset threshold, optimize the page before jumping out of the node, and obtain an optimized application.
  • the collection unit 501 first collects the access logs of the pages of the application accessed by different terminal devices through the embedded data of the application; secondly, the acquisition unit 502 obtains at least one preset time based on the access log. The access track of each terminal device to the application page in at least one visit during the cycle; again, the aggregation unit 503 performs aggregate statistics on the terminal devices with the same access track, and obtains an aggregation cluster corresponding to the access track and terminal device information; finally, The optimization unit 504 optimizes the application pages based on the access trajectories and terminal device information corresponding to each aggregation cluster to obtain an optimized application. Therefore, based on the access trajectory of the terminal device to the application page in one visit, clustering the terminal devices can determine all terminal devices with the same access trajectory, providing an effective optimization basis for the application page and improving application optimization. efficiency, improved user experience.
  • FIG. 6 a schematic structural diagram of an electronic device 600 suitable for implementing embodiments of the present disclosure is shown.
  • the electronic device 600 may include a processing device (eg, central processing unit, graphics processor, etc.) 601, which may be loaded into a random access device according to a program stored in a read-only memory (ROM) 602 or from a storage device 608.
  • the program in the memory (RAM) 603 executes various appropriate actions and processes.
  • various programs and data required for the operation of the electronic device 600 are also stored.
  • the processing device 601, ROM 602 and RAM 603 are connected to each other via a bus 604.
  • An input/output (I/O) interface 605 is also connected to bus 604.
  • the following devices can be connected to the I/O interface 605: input devices 606 including, for example, a touch screen, touch pad, keyboard, mouse, etc.; output devices including, for example, a liquid crystal display (LCD, Liquid Crystal Display), speakers, vibrators, etc. 607; a storage device 608 including, for example, a magnetic tape, a hard disk, etc.; and a communication device 609.
  • Communication device 609 may allow electronic device 600 to communicate wirelessly or wiredly with other devices to exchange data.
  • FIG. 6 illustrates electronic device 600 with various means, it should be understood that implementation or availability of all illustrated means is not required. More or fewer means may alternatively be implemented or provided. Each block shown in Figure 6 may represent one device, or may represent multiple devices as needed.
  • embodiments of the present disclosure include a computer program product including a computer program carried on a computer-readable medium, the computer program containing program code for performing the method illustrated in the flowchart.
  • the computer program may be downloaded and installed from the network via communication device 609, or from storage device 608, or from ROM 602.
  • the processing device 601 the above-described functions defined in the method of the embodiment of the present disclosure are performed.
  • Computer-readable medium in the embodiments of the present disclosure may be a computer-readable signal medium or a computer-readable storage medium, or any combination of the above two.
  • Computer-readable storage media may be, for example, but not limited to, electronic, magnetic, optical, electromagnetic, Infrared, or semiconductor systems, devices or devices, or any combination of the above. More specific examples of computer readable storage media may include, but are not limited to: an electrical connection having one or more wires, a portable computer disk, a hard drive, random access memory (RAM), read only memory (ROM), removable Programmd read-only memory (EPROM or flash memory), fiber optics, portable compact disk read-only memory (CD-ROM), optical storage device, magnetic storage device, or any suitable combination of the above.
  • a computer-readable storage medium may be any tangible medium that contains or stores a program for use by or in connection with an instruction execution system, apparatus, or device.
  • the computer-readable signal medium may include a data signal propagated in baseband or as part of a carrier wave, in which computer-readable program code is carried. Such propagated data signals may take many forms, including but not limited to electromagnetic signals, optical signals, or any suitable combination of the above.
  • a computer-readable signal medium may also be any computer-readable medium other than a computer-readable storage medium that can send, propagate, or transmit a program for use by or in connection with an instruction execution system, apparatus, or device .
  • Program code contained on a computer-readable medium can be transmitted using any appropriate medium, including but not limited to: wires, optical cables, RF (Radio Frequency, Radio Frequency), etc., or any suitable combination of the above.
  • the above-mentioned computer-readable medium may be included in the above-mentioned server; it may also exist separately without being assembled into the server.
  • the above-mentioned computer-readable medium carries one or more programs.
  • the server collects the access logs of the pages of the application accessed by different terminal devices through the buried point data of the application; based on Access log, obtain the access track of each terminal device to the application page in at least one visit in at least one preset time period; perform aggregate statistics on terminal devices with the same access track, and obtain an aggregation of the corresponding access track and terminal device information Clusters; based on the access trajectories and terminal device information corresponding to each aggregated cluster, the application page is optimized to obtain an optimized application.
  • Computer program code for performing operations of embodiments of the present disclosure may be written in one or more programming languages, including object-oriented programming languages—such as Java, Smalltalk, C++, and A conventional procedural programming language—such as "C" or a similar programming language.
  • Program code can be completely Execute partly on the user's computer, execute partly on the user's computer, execute as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server.
  • the remote computer can be connected to the user's computer through any kind of network, including a local area network (LAN) or a wide area network (WAN), or it can be connected to an external computer (such as an Internet service provider through Internet connection).
  • LAN local area network
  • WAN wide area network
  • Internet service provider such as an Internet service provider through Internet connection
  • each block in the flowchart or block diagram may represent a module, segment, or portion of code that contains one or more logic functions that implement the specified executable instructions.
  • the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown one after another may actually execute substantially in parallel, or they may sometimes execute in the reverse order, depending on the functionality involved.
  • each block of the block diagram and/or flowchart illustration, and combinations of blocks in the block diagram and/or flowchart illustration can be implemented by special purpose hardware-based systems that perform the specified functions or operations. , or can be implemented using a combination of specialized hardware and computer instructions.
  • the units involved in the embodiments of the present disclosure may be implemented in software or hardware.
  • the described unit can also be provided in a processor.
  • a processor including a collection unit, an acquisition unit, an aggregation unit, and an optimization unit.
  • the names of these units do not constitute a limitation on the unit itself under certain circumstances.
  • the collection unit can also be described as "configured to collect the pages of the application accessed by different terminal devices through the embedded data of the application.” Access Log" unit.

Abstract

The present invention relates to the technical fields of data analysis and the like, and disclosed are an access data processing method and apparatus. A specific embodiment of the method comprises: collecting, by means of buried point data of an application, access logs of different terminal devices accessing a page of the application; on the basis of the access logs, acquiring an access trajectory of each terminal device to the page of the application in at least one access in at least one preset period of time; performing aggregation statistics on the terminal devices having the same access trajectory to obtain aggregation clusters corresponding to the access trajectory and terminal device information; and optimizing the page of the application on the basis of the access trajectory and terminal device information corresponding to each aggregation cluster to obtain the optimized application.

Description

访问数据处理方法和装置、电子设备、计算机可读介质Access data processing methods and devices, electronic equipment, computer-readable media
相关申请的交叉引用Cross-references to related applications
本专利申请要求于2022年3月10日提交的、申请号为202210231228.6、发明名称为“访问数据处理方法和装置、电子设备、计算机可读介质”的中国专利申请的优先权,该申请的全文以引用的方式并入本申请中。This patent application claims priority to the Chinese patent application submitted on March 10, 2022, with the application number 202210231228.6 and the invention title "Access data processing method and device, electronic equipment, computer-readable medium". The full text of the application incorporated by reference into this application.
技术领域Technical field
本公开涉及计算机技术领域,具体涉及数据分析等技术领域,尤其涉及访问数据处理方法和装置、电子设备、计算机可读介质以及计算机程序产品。The present disclosure relates to the field of computer technology, specifically to data analysis and other technical fields, and in particular to access data processing methods and devices, electronic equipment, computer-readable media and computer program products.
背景技术Background technique
对于网页或者应用来说,每天都有大量的用户进行访问,点击页面的每个部位,如果能正确获取用户在页面的行为轨迹,对互联网产品的提升以及用户操作的便捷性都是非常有帮助的。For web pages or applications, a large number of users visit every day and click on every part of the page. If the user's behavior track on the page can be accurately obtained, it will be very helpful for the improvement of Internet products and the convenience of user operations. of.
发明内容Contents of the invention
本公开的实施例提出了访问数据处理方法和装置、电子设备、计算机可读介质以及计算机程序产品。Embodiments of the present disclosure provide access data processing methods and devices, electronic devices, computer-readable media, and computer program products.
在本公开的一个或多个实施例中,提供了一种访问数据处理方法,该方法包括:通过应用的埋点数据,采集不同终端设备访问应用的页面的访问日志;基于访问日志,获取至少一个预设时间周期中各个终端设备对应用的页面在至少一个访次中的访问轨迹;对具有相同访问轨迹的终端设备进行聚合统计,得到对应访问轨迹以及终端设备信息的聚合簇;基于各个聚合簇对应的访问轨迹和终端设备信息,对应用的页面进行优化,得到优化后的应用。In one or more embodiments of the present disclosure, an access data processing method is provided. The method includes: collecting access logs of pages of the application accessed by different terminal devices through buried point data of the application; based on the access log, obtaining at least The access track of each terminal device to the application page in at least one visit in a preset time period; aggregate statistics on terminal devices with the same access track to obtain an aggregation cluster of corresponding access tracks and terminal device information; based on each aggregation The access track and terminal device information corresponding to the cluster are used to optimize the application page and obtain the optimized application.
在一些实施例中,上述方法还包括:接收查询轨迹,查询轨迹包括应用的至少一个页面;将查询轨迹分别与各个聚合簇对应的访问轨 迹进行匹配;响应于确定查询轨迹与聚合簇对应的访问轨迹相匹配,得到并显示该聚合簇对应的终端设备信息。In some embodiments, the above method further includes: receiving a query track, where the query track includes at least one page of the application; and matching the query track with the access track corresponding to each aggregation cluster. The traces are matched; in response to determining that the query trace matches the access trace corresponding to the aggregation cluster, the terminal device information corresponding to the aggregation cluster is obtained and displayed.
在一些实施例中,上述方法还包括:采用桑基图对各个聚合簇对应的终端设备信息和访问轨迹进行展示。In some embodiments, the above method further includes: using a Sankey diagram to display the terminal device information and access trajectories corresponding to each aggregation cluster.
在一些实施例中,上述方法还包括:为应用的所有页面进行标签标注;响应于接收到页面的标签,对该页面所在的访问轨迹进行统计,得到访问轨迹统计结果。In some embodiments, the above method further includes: labeling all pages of the application; and in response to receiving the label of the page, performing statistics on access trajectories where the page is located to obtain access trajectory statistical results.
在一些实施例中,上述方法还包括:为各个聚合簇对应的访问轨迹的第一页面进行落地页标注;响应于接收到以一个页面作为落地页的查询条件时,得到并显示所有该页面作为落地页的访问轨迹。In some embodiments, the above method also includes: marking the landing page for the first page of the access track corresponding to each aggregation cluster; in response to receiving a query condition using one page as the landing page, obtaining and displaying all the pages as the landing page. The access track of the landing page.
在一些实施例中,上述方法还包括:为各个聚合簇对应的访问轨迹的最后页面进行退出页标注;响应于接收到以一个页面作为退出页的查询条件时,得到并显示所有该页面作为落地页的访问轨迹。In some embodiments, the above method also includes: marking the exit page for the last page of the access track corresponding to each aggregation cluster; in response to receiving a query condition using one page as the exit page, obtaining and displaying all the pages as landing pages. Page access track.
在一些实施例中,上述应用的页面包括:至少一个落地页,每个聚合簇的访问轨迹上均设置有与该访问轨迹的最后一个页面相邻的跳出节点,上述基于各个聚合簇对应的访问轨迹和终端设备信息,对应用的页面进行优化,得到优化后的应用,包括:针对至少一个页面作为落地页,当该落地页在访问轨迹中的下一节点是跳出节点,计算该跳出节点的终端设备的数量与该落地页处于访问轨迹首节点的终端设备的数量的比例;响应于比例大于所有落地页的退出率均值,对该落地页进行优化,得到优化后的应用。In some embodiments, the pages of the above-mentioned application include: at least one landing page, and the access track of each aggregation cluster is provided with a bounce node adjacent to the last page of the access track. The above-mentioned access based on each aggregation cluster corresponds to Trajectory and terminal device information, optimize the application page, and obtain the optimized application, including: targeting at least one page as a landing page, when the next node of the landing page in the access trajectory is a bounce node, calculating the bounce node The ratio of the number of terminal devices to the number of terminal devices for which the landing page is at the first node of the access trajectory; in response to the ratio being greater than the average exit rate of all landing pages, the landing page is optimized to obtain an optimized application.
在一些实施例中,上述基于各个聚合簇对应的访问轨迹和终端设备信息,对应用的页面进行优化,得到优化后的应用,包括:从所有聚合簇对应的访问轨迹中查询预设访问轨迹;基于预设访问轨迹的各个页面的终端设备信息,计算预设访问轨迹中各个页面的转化率;响应于预设访问轨迹中一页面的转化率小于预设访问轨迹中除该页面之外的页面的转化率,对该页面进行优化,得到优化后的应用。In some embodiments, the above-mentioned optimization of application pages based on the access trajectories and terminal device information corresponding to each aggregation cluster to obtain an optimized application includes: querying the preset access trajectories from the access trajectories corresponding to all aggregation clusters; Based on the terminal device information of each page in the preset access track, calculate the conversion rate of each page in the preset access track; in response to the fact that the conversion rate of a page in the preset access track is less than that of pages other than the page in the preset access track conversion rate, optimize the page, and get the optimized application.
在一些实施例中,每个聚合簇的访问轨迹上均设置有与该访问轨迹的最后一个页面相邻的跳出节点,基于各个聚合簇对应的访问轨迹和终端设备信息,对应用的页面进行优化,得到优化后的应用,包括: 遍历所有聚合簇对应的访问轨迹,计算所有访问轨迹中跳出节点之前的页面的终端设备的数量;响应于跳出节点之前的页面的终端设备的数量大于预设阈值,对跳出节点之前的页面进行优化,得到优化后的应用。In some embodiments, the access track of each aggregation cluster is provided with a bounce node adjacent to the last page of the access track, and the application pages are optimized based on the access track and terminal device information corresponding to each aggregation cluster. , optimized applications include: Traverse the access trajectories corresponding to all aggregated clusters, and calculate the number of terminal devices in all access trajectories that jump out of the page before the node; in response to the number of terminal devices that jump out of the page before the node being greater than the preset threshold, optimize the page before jumping out of the node , get the optimized application.
在本公开的一个或多个实施例中,提供了一种访问数据处理装置,该装置包括:采集单元,被配置成通过应用的埋点数据,采集不同终端设备访问应用的页面的访问日志;获取单元,被配置成基于访问日志,获取至少一个预设时间周期中各个终端设备对应用的页面在至少一个访次中的访问轨迹;聚合单元,被配置成对具有相同访问轨迹的终端设备进行聚合统计,得到对应访问轨迹以及终端设备信息的聚合簇;优化单元,被配置成基于各个聚合簇对应的访问轨迹和终端设备信息,对应用的页面进行优化,得到优化后的应用。In one or more embodiments of the present disclosure, an access data processing device is provided. The device includes: a collection unit configured to collect access logs of pages of the application accessed by different terminal devices through buried point data of the application; The acquisition unit is configured to obtain, based on the access log, access trajectories of each terminal device to the page of the application in at least one visit in at least one preset time period; the aggregation unit is configured to perform an aggregation operation on terminal devices with the same access trajectories. Aggregate statistics to obtain aggregation clusters corresponding to access trajectories and terminal device information; the optimization unit is configured to optimize application pages based on access trajectories and terminal device information corresponding to each aggregation cluster to obtain an optimized application.
在一些实施例中,上述装置还包括:接收单元,被配置成接收查询轨迹,查询轨迹包括应用的至少一个页面;匹配单元,被配置成将查询轨迹分别与各个聚合簇对应的访问轨迹进行匹配;得到单元,被配置成响应于确定查询轨迹与聚合簇对应的访问轨迹相匹配,得到并显示该聚合簇对应的终端设备信息。In some embodiments, the above device further includes: a receiving unit configured to receive a query track, where the query track includes at least one page of the application; a matching unit configured to match the query track with access tracks corresponding to each aggregation cluster. ; The obtaining unit is configured to obtain and display the terminal device information corresponding to the aggregation cluster in response to determining that the query trace matches the access trace corresponding to the aggregation cluster.
在一些实施例中,上述装置还包括:展示单元,被配置成采用桑基图对各个聚合簇对应的终端设备信息和访问轨迹进行展示。In some embodiments, the above apparatus further includes: a display unit configured to use a Sankey diagram to display the terminal device information and access trajectories corresponding to each aggregation cluster.
在一些实施例中,上述装置还包括:页面标注单元,被配置成为应用的所有页面进行标签标注;统计单元,被配置成响应于接收到页面的标签,对该页面所在的访问轨迹进行统计,得到访问轨迹统计结果。In some embodiments, the above-mentioned device further includes: a page labeling unit configured to label all pages of the application; a statistics unit configured to respond to receiving the label of the page and perform statistics on the access track where the page is located, Obtain the access trajectory statistics results.
在一些实施例中,上述装置还包括:落地标注单元,被配置成为各个聚合簇对应的访问轨迹的第一页面进行落地页标注;落地查询单元,被配置成响应于接收到以一个页面作为落地页的查询条件时,得到并显示所有该页面作为落地页的访问轨迹。In some embodiments, the above device further includes: a landing annotation unit configured to annotate the landing page for the first page of the access track corresponding to each aggregation cluster; a landing query unit configured to respond to receiving a page as the landing page. When querying the page, get and display all the access tracks of this page as a landing page.
在一些实施例中,上述装置还包括:退出标注单元,被配置成为各个聚合簇对应的访问轨迹的最后页面进行退出页标注;退出查询单元,被配置成响应于接收到以一个页面作为退出页的查询条件时,得 到并显示所有该页面作为落地页的访问轨迹。In some embodiments, the above device further includes: an exit labeling unit configured to label the last page of the access track corresponding to each aggregation cluster as an exit page; an exit query unit configured to respond to receiving a page as the exit page When the query conditions are Go to and display all access tracks of this page as a landing page.
在一些实施例中,上述应用的页面包括:至少一个落地页,每个聚合簇的访问轨迹上均设置有与该访问轨迹的最后一个页面相邻的跳出节点,上述优化单元包括:计算模块,被配置成针对至少一个页面作为落地页,当该落地页在访问轨迹中的下一节点是跳出节点,计算该跳出节点的终端设备的数量与该落地页处于访问轨迹首节点的终端设备的数量的比例;页面优化模块,被配置成响应于比例大于所有落地页的退出率均值,对该落地页进行优化,得到优化后的应用。In some embodiments, the pages of the above-mentioned application include: at least one landing page, and the access track of each aggregation cluster is provided with a bounce node adjacent to the last page of the access track. The above-mentioned optimization unit includes: a computing module, It is configured to target at least one page as a landing page. When the next node of the landing page in the access track is a bounce node, calculate the number of terminal devices of the bounce node and the number of terminal devices of the landing page at the first node of the access track. The proportion; the page optimization module is configured to respond to the proportion being greater than the average exit rate of all landing pages, optimize the landing page, and obtain an optimized application.
在一些实施例中,上述优化单元包括:查询模块,被配置成从所有聚合簇对应的访问轨迹中查询预设访问轨迹;转化模块,被配置成基于预设访问轨迹的各个页面的终端设备信息,计算预设访问轨迹中各个页面的转化率;应用优化模块,被配置成响应于预设访问轨迹中一页面的转化率小于预设访问轨迹中除该页面之外的页面的转化率,对该页面进行优化,得到优化后的应用。In some embodiments, the above-mentioned optimization unit includes: a query module, configured to query preset access trajectories from access trajectories corresponding to all aggregated clusters; a conversion module, configured to based on the terminal device information of each page of the preset access trajectories , calculate the conversion rate of each page in the preset access track; the application optimization module is configured to respond to the conversion rate of a page in the preset access track being less than the conversion rate of pages other than the page in the preset access track, This page is optimized and the optimized application is obtained.
在一些实施例中,每个聚合簇的访问轨迹上均设置有与该访问轨迹的最后一个页面相邻的跳出节点,上述优化单元包括:遍历模块,被配置成遍历所有聚合簇对应的访问轨迹,计算所有访问轨迹中跳出节点之前的页面的终端设备的数量;节点优化模块,被配置成响应于跳出节点之前的页面的终端设备的数量大于预设阈值,对跳出节点之前的页面进行优化,得到优化后的应用。In some embodiments, the access track of each aggregation cluster is provided with a jump node adjacent to the last page of the access track. The above-mentioned optimization unit includes: a traversal module configured to traverse the access tracks corresponding to all aggregation clusters. , calculate the number of terminal devices in all access trajectories that jump out of the page before the node; the node optimization module is configured to respond to the number of terminal devices that jump out of the page before the node is greater than the preset threshold, optimize the page before jumping out of the node, Get optimized applications.
在本公开的一个或多个实施例中,提供了一种电子设备,该电子设备包括:一个或多个处理器;存储装置,其上存储有一个或多个程序;当一个或多个程序被一个或多个处理器执行,使得一个或多个处理器实现如上述任一实现方式描述的方法。In one or more embodiments of the present disclosure, an electronic device is provided. The electronic device includes: one or more processors; a storage device with one or more programs stored thereon; when one or more programs Executed by one or more processors, causing the one or more processors to implement the method described in any of the above implementations.
在本公开的一个或多个实施例中,提供了一种计算机可读介质,其上存储有计算机程序,该程序被处理器执行时实现如上述任一实现方式描述的方法。In one or more embodiments of the present disclosure, a computer-readable medium is provided, with a computer program stored thereon. When the program is executed by a processor, the method described in any of the above implementations is implemented.
在本公开的一个或多个实施例中,提供了一种计算机程序产品,包括计算机程序,计算机程序在被处理器执行时实现如上述任一实现方式描述的方法。 In one or more embodiments of the present disclosure, a computer program product is provided, including a computer program. When executed by a processor, the computer program implements the method described in any of the above implementations.
附图说明Description of the drawings
通过阅读参照以下附图所作的对非限制性实施例所作的详细描述,本公开的其它特征、目的和优点将会变得更明显:Other features, objects and advantages of the present disclosure will become more apparent upon reading the detailed description of the non-limiting embodiments with reference to the following drawings:
图1是本公开的一个实施例可以应用于其中的示例性系统架构图;Figure 1 is an exemplary system architecture diagram in which an embodiment of the present disclosure may be applied;
图2是根据本公开的访问数据处理方法的一个实施例的流程图;Figure 2 is a flow chart of an embodiment of an access data processing method according to the present disclosure;
图3是根据本公开的访问数据处理方法的另一个实施例的流程图;Figure 3 is a flow chart of another embodiment of an access data processing method according to the present disclosure;
图4是根据本公开中采用桑基图展示访问轨迹的一种示意图;Figure 4 is a schematic diagram using a Sankey diagram to display access trajectories according to the present disclosure;
图5是根据本公开的访问数据处理装置的实施例的结构示意图;Figure 5 is a schematic structural diagram of an embodiment of an access data processing device according to the present disclosure;
图6是适于用来实现本公开的实施例的电子设备的结构示意图。FIG. 6 is a schematic structural diagram of an electronic device suitable for implementing embodiments of the present disclosure.
具体实施方式Detailed ways
下面结合附图和实施例对本公开作进一步的详细说明。可以理解的是,此处所描述的具体实施例仅仅用于解释相关发明,而非对该发明的限定。另外还需要说明的是,为了便于描述,附图中仅示出了与有关发明相关的部分。The present disclosure will be further described in detail below in conjunction with the accompanying drawings and examples. It can be understood that the specific embodiments described here are only used to explain the relevant invention, but not to limit the invention. It should also be noted that, for convenience of description, only the parts related to the invention are shown in the drawings.
需要说明的是,在不冲突的情况下,本公开中的实施例及实施例中的特征可以相互组合。下面将参考附图并结合实施例来详细说明本公开。It should be noted that, as long as there is no conflict, the embodiments and features in the embodiments of the present disclosure can be combined with each other. The present disclosure will be described in detail below in conjunction with embodiments with reference to the accompanying drawings.
图1示出了可以应用本公开的访问数据处理方法的示例性系统架构100。FIG. 1 illustrates an exemplary system architecture 100 to which the access data processing method of the present disclosure may be applied.
如图1所示,系统架构100可以包括终端设备101、102、103,网络104和服务器105。网络104用以在终端设备101、102、103和服务器105之间提供通信链路的介质。网络104可以包括各种连接类型,通常可以包括无线通信链路等等。As shown in Figure 1, the system architecture 100 may include terminal devices 101, 102, 103, a network 104 and a server 105. The network 104 is a medium used to provide communication links between the terminal devices 101, 102, 103 and the server 105. Network 104 may include various connection types and may typically include wireless communication links and the like.
终端设备101、102、103通过网络104与服务器105交互,以接收或发送消息等。终端设备101、102、103上可以安装有各种通讯客户端应用,例如即时通信工具、邮箱客户端等。The terminal devices 101, 102, 103 interact with the server 105 through the network 104 to receive or send messages, etc. Various communication client applications can be installed on the terminal devices 101, 102, and 103, such as instant messaging tools, email clients, etc.
终端设备101、102、103可以是硬件,也可以是软件。当终端设备101、102、103为硬件时,可以是具有通信和控制功能的设备。上 述设备可以与服务器105进行通信。当终端设备101、102、103为软件时,可以安装在上述终端中。其可以实现成多个软件或软件模块(例如用来访问应用的软件或软件模块),也可以实现成单个软件或软件模块。在此不做具体限定。The terminal devices 101, 102, and 103 may be hardware or software. When the terminal devices 101, 102, and 103 are hardware, they may be devices with communication and control functions. superior The device can communicate with the server 105. When the terminal devices 101, 102, and 103 are software, they can be installed in the above-mentioned terminals. It may be implemented as multiple software or software modules (such as software or software modules used to access applications), or as a single software or software module. There are no specific limitations here.
服务器105可以是提供各种服务的服务器,例如为终端设备101、102、103上应用提供支持的应用服务器。应用服务器可以对网络中各终端的相关信息进行分析处理,并将处理结果(如优化后的应用安装程序等)反馈给终端设备。The server 105 may be a server that provides various services, such as an application server that provides support for applications on the terminal devices 101, 102, and 103. The application server can analyze and process the relevant information of each terminal in the network, and feed back the processing results (such as optimized application installation programs, etc.) to the terminal device.
需要说明的是,服务器可以是硬件,也可以是软件。当服务器为硬件时,可以实现成多个服务器组成的分布式服务器集群,也可以实现成单个服务器。当服务器为软件时,可以实现成多个软件或软件模块(例如用来提供分布式服务的软件或软件模块),也可以实现成单个软件或软件模块。在此不做具体限定。It should be noted that the server can be hardware or software. When the server is hardware, it can be implemented as a distributed server cluster composed of multiple servers or as a single server. When the server is software, it can be implemented as multiple software or software modules (for example, software or software modules used to provide distributed services), or it can be implemented as a single software or software module. There are no specific limitations here.
需要说明的是,本公开的实施例所提供的访问数据处理方法一般由服务器105执行。It should be noted that the access data processing method provided by the embodiment of the present disclosure is generally executed by the server 105.
如图2,示出了根据本公开的访问数据处理方法的一个实施例的流程200,该访问数据处理方法包括以下步骤:As shown in Figure 2, a process 200 of an embodiment of an access data processing method according to the present disclosure is shown. The access data processing method includes the following steps:
步骤201,通过应用的埋点数据,采集不同终端设备访问应用的页面的访问日志。Step 201: Collect access logs of pages of the application accessed by different terminal devices through the embedded data of the application.
本实施例中,应用的埋点数据通过在应用的页面埋点得到,埋点是页面分析的一种常用的数据采集方法,具体是指在应用的每个页面对应的源代码中注入一段或几段脚本,这些脚本用于获取终端设备操作页面的操作事件对应的行为数据,多个不同时间段的行为数据组合在一起得到该终端设备的访问日志。In this embodiment, the hidden data of the application is obtained by burying data on the application page. Buried data is a common data collection method for page analysis. Specifically, it refers to injecting a paragraph or paragraph into the source code corresponding to each page of the application. Several scripts are used to obtain the behavior data corresponding to the operation events of the terminal device operation page. The behavior data of multiple different time periods are combined together to obtain the access log of the terminal device.
本实施例中,应用的埋点数据包括至少一个终端设备对应用进行操作的行为数据,通过埋点数据中的终端设备信息可以确定至少一个终端设备中的终端设备的行为数据,将该终端设备的行为数据按照时间戳排列在一起,可以得到该终端的访问日志。In this embodiment, the buried point data of the application includes the behavior data of at least one terminal device operating the application. The terminal device information in the buried point data can be used to determine the behavior data of the terminal device in at least one terminal device. The terminal device The behavioral data are arranged together according to the timestamp, and the access log of the terminal can be obtained.
访问数据处理方法运行于其上的执行主体(例如图1所示的服务 器)可以通过多种途径得到访问日志,例如,通过埋点工具实时获取终端设备(例如图1所示的终端101、102、103)上安装的应用的埋点数据,或者从数据库中获取预先对应用进行埋点后的埋点数据。Access the execution principal on which the data processing method runs (such as the service shown in Figure 1 Access logs can be obtained through multiple channels, for example, by using tracking tools to obtain tracking data of applications installed on terminal devices (such as terminals 101, 102, and 103 shown in Figure 1) in real time, or by obtaining pre-set tracking data from a database. Buried data after burying the application.
本实施例中,访问日志可以是终端设备对应用的页面进行浏览的浏览日志,用户通过终端设备浏览应用的页面时,埋点工具或者脚本向执行主体上报的埋点数据可以包括:上报站点、页面的唯一标识、识别同一访次的编号、设备号、请求时间段等,终端设备每浏览一次,埋点工具或者脚本就会生成一条埋点数据,并存储该埋点数据。In this embodiment, the access log may be a browsing log of the terminal device browsing the application page. When the user browses the application page through the terminal device, the tracking data reported to the execution subject by the tracking tool or script may include: reporting site, The unique identifier of the page, the number that identifies the same visit, the device number, the request time period, etc. Every time the terminal device browses, the tracking tool or script will generate a piece of tracking data and store the tracking data.
本实施例中,应用的页面可以是通过浏览器打开的页面,或者是应用的原生页面,也可以是内嵌在应用中需要被唤起的页面;访问日志是多个设备终端在不同访次中对应用的页面的操作的行为数据集合,其中,访次是指设备终端访问一次应用,用户通过设备终端打开应用是一个访次的开始,用户关闭应用或者将应用退至后台运行设定时间(例如30min)未操作。一个用户在预设时间段中多次访问一个应用,则会被执行主体记录多个访次。In this embodiment, the page of the application can be a page opened through a browser, or a native page of the application, or a page embedded in the application that needs to be invoked; the access log is the data generated by multiple device terminals in different visits. A collection of behavioral data for operations on application pages. A visit refers to a device terminal accessing an application. The user opening the application through the device terminal is the beginning of a visit. The user closes the application or returns the application to the background to run for a set time ( For example, 30min) without operation. If a user visits an application multiple times within a preset time period, the execution entity will record multiple visits.
步骤202,基于访问日志,获取至少一个预设时间周期中各个终端设备对应用的页面在至少一个访次中的访问轨迹。Step 202: Based on the access log, obtain the access track of each terminal device to the page of the application in at least one visit in at least one preset time period.
本实施例中,访问日志为多个终端设备对应用的页面进行操作的行为数据的集合,针对多个终端设备中的一个终端设备,可以通过访问日志,得到该终端设备对应用的页面在一个访次中的访问轨迹。In this embodiment, the access log is a collection of behavior data of multiple terminal devices operating on the application page. For one terminal device among the multiple terminal devices, the access log can be used to obtain the information about the terminal device's operation of the application page. Visit trajectories during visits.
本实施例中,应用的页面有多个,访问轨迹是用户通过终端设备访问应用对应用中的各个页面进行操作的记录,例如,应用包括首页、活动页、搜索页、详情页,用户通过设备终端依次浏览了应用的首页、活动页、首页、搜索页,则设备终端对应的访问轨迹为首页-活动页-首页-搜索页。In this embodiment, the application has multiple pages, and the access track is a record of the user accessing the application through the terminal device and operating each page in the application. For example, the application includes a home page, an activity page, a search page, and a details page. The user accesses the application through the device. The terminal browses the homepage, activity page, homepage, and search page of the application in sequence, and the access track corresponding to the device terminal is homepage-activity page-homepage-search page.
在本实施例中,访问轨迹包括多个节点,每个节点对应应用的一个页面,多个节点中任意两个节点可以相同也可以不同,例如,上面访问轨迹具有两个首页,即在该访问轨迹中,具有两个对应同一页面的节点。In this embodiment, the access track includes multiple nodes, each node corresponding to a page of the application. Any two nodes among the multiple nodes can be the same or different. For example, the access track above has two home pages, that is, in the access track In the track, there are two nodes corresponding to the same page.
本实施例中,预设时间周期可以根据应用开发需求而设置的时间, 例如预设时间周期为一天,或者半天等。可以使用每日定时任务,通过埋点得到的基础流量数据计算得到各个终端设备对应用的页面在至少一个访次中的访问轨迹。例如每天凌晨3点执行该计算任务。预设时间周期中各个终端设备对应用的页面在一个访次中的访问轨迹是指:在预设时间周期中同一终端设备同一访次顺序浏览过的页面的集合。上述获取至少一个预设时间周期中各个终端对应用的页面在至少一个访次中的访问轨迹包括:以预设时间段为周期,将同一用户同一访次浏览页面按照时间戳从先到后排序,根据页面唯一标识进行相邻去重(在访问轨迹中的相邻的两个节点相同去掉其中的一个节点),得到预设时间段该终端设备的访问轨迹。In this embodiment, the preset time period can be set according to application development requirements. For example, the preset time period is one day, or half a day, etc. Daily scheduled tasks can be used to calculate the access track of each terminal device to the application's page in at least one visit through the basic traffic data obtained. For example, the calculation task is executed at 3 a.m. every day. The access track of each terminal device to the application page in one visit in the preset time period refers to the collection of pages viewed by the same terminal device in the same visit sequence in the preset time period. The above-mentioned obtaining the access track of each terminal to the application page in at least one visit in at least one preset time period includes: taking the preset time period as the cycle, sorting the pages browsed by the same user in the same visit according to the timestamp from first to last. , perform adjacent deduplication according to the unique identifier of the page (two adjacent nodes in the access trajectory are the same and remove one of the nodes), and obtain the access trajectory of the terminal device in the preset time period.
可选地,可以在所有终端设备的访问轨迹中的最后页面之后,加一个exit跳出标识作为统一的APP出口,也即exit为访问轨迹的跳出节点。如A终端设备的访问轨迹为首页->活动页->首页->搜索页->exit。Optionally, an exit identifier can be added as a unified APP exit after the last page in the access track of all terminal devices, that is, exit is the exit node of the access track. For example, the access track of terminal A is home page -> activity page -> home page -> search page -> exit.
步骤203,对具有相同访问轨迹的终端设备进行聚合统计,得到对应访问轨迹以及终端设备信息的聚合簇。Step 203: Aggregate statistics on terminal devices with the same access trajectories to obtain aggregation clusters corresponding to access trajectories and terminal device information.
本实施例中,同一访问轨迹对应至少一个终端设备,当具有相同访问轨迹的终端设备聚合在一起,可以得到与访问轨迹一一对应的聚合簇,该得到的聚合簇可以包括至少一个终端设备的终端设备信息。In this embodiment, the same access trajectory corresponds to at least one terminal device. When terminal devices with the same access trajectory are aggregated together, an aggregation cluster corresponding to the access trajectory can be obtained. The obtained aggregation cluster can include at least one terminal device. Terminal device information.
本实施例中,终端设备信息是与终端设备相关的信息,例如,终端设备信息包括:终端设备的唯一标志、终端设备的设备号、终端设备的数量。需要说明的是,终端设备的数量是由访问轨迹相同的至少一个终端的数量之和,例如,A终端设备的访问轨迹为首页->活动页->首页->搜索页->exit,当天仅B终端设备与A终端设备访问轨迹完全相同,B终端设备的访问轨迹也为首页->活动页->首页->搜索页->exit,这条访问轨迹当天的终端设备的数量为2。In this embodiment, the terminal device information is information related to the terminal device. For example, the terminal device information includes: a unique identifier of the terminal device, a device number of the terminal device, and the number of terminal devices. It should be noted that the number of terminal devices is the sum of the number of at least one terminal with the same access trajectory. For example, the access trajectory of terminal A is home page -> activity page -> home page -> search page -> exit, and only on that day The access track of terminal B is exactly the same as that of terminal A. The access track of terminal B is also home page -> activity page -> home page -> search page -> exit. The number of terminal devices on this access track that day is 2.
本实施例中,访问轨迹是在时间顺序正向排列的页面或节点,访问轨迹明确标识有第一步至第N访问的页面,具体地,聚合簇的聚合表如表1所示。In this embodiment, the access track is a page or node arranged in forward chronological order. The access track clearly identifies the pages accessed from the first step to the Nth. Specifically, the aggregation table of the aggregation cluster is shown in Table 1.
表1

Table 1

在表1中,第一步至第四步是访问轨迹的访问的页面,或设置的节点(如exit跳出节点),访问的页面包括:活动页、详情页、搜索页、首页、列表页,这些访问页面与应用相关,当应用不同时,包含的页面不同,页面的名称也相应不同。In Table 1, the first to fourth steps are the accessed pages of the access track, or the set nodes (such as exit nodes). The accessed pages include: activity page, details page, search page, homepage, list page, These access pages are related to the application. When the application is different, the pages contained are different, and the names of the pages are also different accordingly.
步骤204,基于各个聚合簇对应的访问轨迹和终端设备信息,对应用的页面进行优化,得到优化后的应用。Step 204: Based on the access trajectories and terminal device information corresponding to each aggregation cluster, optimize the application page to obtain an optimized application.
本实施例中,应用的页面可以有一个或多个,基于各个聚合簇对应的访问轨迹和终端设备信息,对应用的页面进行优化,可以是对应用的一个页面进行优化,也可以是对多个页面进行的优化。其中,对页面的优化可以包括:删除页面、为页面重新排版、修改页面的内容等优化方式。In this embodiment, the application may have one or more pages. Based on the access trajectories and terminal device information corresponding to each aggregation cluster, the application pages may be optimized. This may be to optimize one page of the application or to optimize multiple pages. Page optimization. Among them, the optimization of the page can include: deleting the page, rearranging the page, modifying the content of the page and other optimization methods.
本实施例中,一个聚合簇对应一个访次中的访问轨迹,一个聚合簇中可以具有一个或多个终端设备的终端设备信息,该一个或多个终端设备对应同一个访次中的访问轨迹。In this embodiment, an aggregation cluster corresponds to an access track in one visit. An aggregation cluster may have terminal device information of one or more terminal devices, and the one or more terminal devices correspond to an access track in the same visit. .
在本实施例的一些可选实现方式中,上述应用的页面包括:至少一个落地页,每个聚合簇的访问轨迹上均设置有与该访问轨迹的最后一个页面相邻的跳出节点,基于各个聚合簇对应的访问轨迹和终端设备信息,对应用的页面进行优化,得到优化后的应用,包括:针对至少一个页面作为落地页,当该落地页在访问轨迹中的下一节点是跳出节点,计算该跳出节点的终端设备的数量与该落地页处于访问轨迹首节点的终端设备的数量的比例;响应于比例大于所有落地页的退出率均值,对该落地页进行优化,得到优化后的应用。In some optional implementations of this embodiment, the pages of the above application include: at least one landing page, and the access track of each aggregate cluster is provided with a jump node adjacent to the last page of the access track. Based on each Aggregate the access trajectories and terminal device information corresponding to the cluster, optimize the application pages, and obtain the optimized application, including: targeting at least one page as a landing page, when the next node of the landing page in the access trajectory is a bounce node, Calculate the ratio of the number of terminal devices that jump out of the node to the number of terminal devices that are at the first node of the access trajectory for the landing page; in response to the ratio being greater than the average exit rate of all landing pages, optimize the landing page to obtain the optimized application .
本实施例中,落地页是指访问者在应用之外的其他地方看到发出的某个具有明确主题的特定活动通过邮件、社交媒体或广告发布的诱人优惠信息等,点击后被链接到应用的第一个页面。该落地页可以应 用中的任意一个页面,例如,落地页是应用中的详情页,或者落地页是应用中的首页等。In this embodiment, the landing page refers to a specific activity with a clear theme that the visitor sees in other places outside the application, such as attractive discount information published through emails, social media or advertisements, etc., and is linked to after clicking. The first page of the application. This landing page can be Any page in the application, for example, the landing page is the details page in the application, or the landing page is the home page in the application, etc.
本可选实现方式中,跳出节点是所有访问轨迹中的最后的一个节点,通过跳出节点可以确定终端设备已经结束对应用的访问。In this optional implementation, the exit node is the last node in all access trajectories. Through the exit node, it can be determined that the terminal device has finished accessing the application.
本可选实现方式中,退出率:落地页下一个节点为跳出节点的终端设备数量,占落地页处于访问轨迹第一页面的终端设备数量的比例,退出率均值是通过统计所有落地页的流量得到的值,具体地,退出率均值有加权平均、算数平均以及几何平均三种计算方式:In this optional implementation, exit rate: the number of terminal devices whose next node on the landing page is the exit node, as a proportion of the number of terminal devices on the first page of the landing page's access track. The average exit rate is calculated by counting the traffic of all landing pages. The obtained value, specifically, the average exit rate has three calculation methods: weighted average, arithmetic average and geometric average:
加权平均:应用中所有访问轨迹中落地页后的跳出节点的终端设备的数量之和作为分子,所有落地页处于访问轨迹第一页面的终端设备的数量之和作为分母,相比得到的比例值。Weighted average: The sum of the number of terminal devices that jump out of the node after the landing page in all access trajectories in the application is used as the numerator, and the sum of the number of terminal devices that have the landing page on the first page of the access trajectory is used as the denominator. The proportion value obtained by comparison .
算数平均:全部落地页的退出率直接加和后,除以落地页的数量。Arithmetic average: The exit rates of all landing pages are directly added up and divided by the number of landing pages.
几何平均:全部落地页的退出率相乘,再开N次方,N等于落地页的数量。Geometric mean: The exit rates of all landing pages are multiplied and raised to the power N. N is equal to the number of landing pages.
本实施例中,落地页不是只有一个,落地页可以有很多,落地页的定义是应用的第一个页面叫落地页。In this embodiment, there is not only one landing page, but there can be many landing pages. The definition of landing page is that the first page of the application is called the landing page.
应用中所有访问轨迹中落地页后的跳出节点的终端设备的数量与所有落地页处于访问轨迹首节点的终端设备的数量的比例的均值。The mean value of the ratio of the number of terminal devices that jump out of the node after the landing page in all access trajectories in the application to the number of terminal devices that have the landing page at the first node of the access trajectory.
本可选实现方式中,通过计算跳出节点的终端设备的数量与落地页处于访问轨迹首节点的终端设备的数量的比例,可以有效地分析落地页的访问情况,为改进落地页提供了可靠的依据。In this optional implementation, by calculating the ratio of the number of terminal devices that jump out of the node to the number of terminal devices with the landing page at the first node of the access trajectory, the access situation of the landing page can be effectively analyzed, providing a reliable basis for improving the landing page. in accordance with.
在本实施例的一些可选实现方式中,上述基于各个聚合簇对应的访问轨迹和终端设备信息,对应用的页面进行优化,得到优化后的应用,包括:从所有聚合簇对应的访问轨迹中查询预设访问轨迹;基于预设访问轨迹的各个页面的终端设备信息,计算预设访问轨迹中各个页面的转化率;响应于预设访问轨迹中一页面的转化率小于预设访问轨迹中除该页面之外的页面的转化率,对该页面进行优化,得到优化后的应用。In some optional implementations of this embodiment, the application pages are optimized based on the access trajectories and terminal device information corresponding to each aggregation cluster, and the optimized application is obtained, including: from the access trajectories corresponding to all aggregation clusters Query the preset access track; based on the terminal device information of each page of the preset access track, calculate the conversion rate of each page in the preset access track; in response to the conversion rate of a page in the preset access track being less than the preset access track except The conversion rate of pages other than this page, the page is optimized, and the optimized application is obtained.
本可选实现方式中,转化率与各个页面的终端设备的访问数量相关,例如,一访问轨迹是搜索页到商品详情页,那么搜索页的转化率 为:以搜索页的终端设备的数量为分母,以搜索页到商品详情页的终端设备的数量为分子的比例。In this optional implementation, the conversion rate is related to the number of terminal device visits to each page. For example, if a visit track is from the search page to the product details page, then the conversion rate of the search page is: the ratio with the number of terminal devices on the search page as the denominator and the number of terminal devices from the search page to the product details page as the numerator.
本可选实现方式中,通过关注核心页面作为流程中的任意页的转化率,当整个访问轨迹中某一页面的转化率低于剩余其他页面的转化率时,该页面为待优化页面,需要考虑删除或变更该页面。In this optional implementation, by focusing on the core page as the conversion rate of any page in the process, when the conversion rate of a certain page in the entire access track is lower than the conversion rate of the remaining other pages, the page is a page to be optimized and needs to be Consider removing or changing this page.
本可选实现方式中,通过对应用的各个页面的转化率进行计算,保证了对应用中核心页面的优化效果,提高了应用优化效率。In this optional implementation method, the conversion rate of each page of the application is calculated to ensure the optimization effect of the core pages in the application and improve the efficiency of application optimization.
在本实施例的一些可选实现方式中,每个聚合簇的访问轨迹上均设置有与该访问轨迹的最后一个页面相邻的跳出节点,上述基于各个聚合簇对应的访问轨迹和终端设备信息,对应用的页面进行优化,得到优化后的应用,包括:遍历所有聚合簇对应的访问轨迹,计算所有访问轨迹中跳出节点之前的页面的终端设备的数量;响应于跳出节点之前的页面的终端设备的数量大于预设阈值,对跳出节点之前的页面进行优化,得到优化后的应用。In some optional implementations of this embodiment, the access track of each aggregation cluster is provided with a jump-out node adjacent to the last page of the access track. The above is based on the access track and terminal device information corresponding to each aggregation cluster. , optimize the application pages to obtain the optimized application, including: traversing the access trajectories corresponding to all aggregation clusters, calculating the number of terminal devices in all access trajectories that jump out of the page before the node; responding to the terminal devices that jump out of the page before the node If the number of devices is greater than the preset threshold, the page before jumping out of the node will be optimized to obtain an optimized application.
本可选实现方式中,通过遍历全部的访问轨迹中跳出节点之前的页面(跳出页),统计全部跳出页的流量(终端设备的数量),流量最高的跳出页为高频跳出页面,通过优化高频跳出页面,考虑该页面流失用户的原因及优化方向,可以有效地优化应用。In this optional implementation, by traversing the pages before the jump node (bounce page) in all access trajectories, the traffic of all jump pages (the number of terminal devices) is counted. The jump page with the highest traffic is the high-frequency jump page. Through optimization Frequently jumping out of the page, considering the reasons for losing users on the page and the optimization direction, can effectively optimize the application.
本可选实现方式中,通过分析所有访问轨迹中跳出节点之前的页面的终端设备的数量,得出应用中的高频跳出页面,为应用的页面的优化提供了可靠依据。In this optional implementation, by analyzing the number of terminal devices in all access trajectories that jump out of the page before the node, the high-frequency jumping pages in the application are obtained, which provides a reliable basis for optimizing the application's pages.
本公开的实施例提供的访问数据处理方法无需配置即可通过聚类簇的信息查看用户在使用产品时的路径分布情况,支持全局观测用户在产品中的热点页面和主要访问轨迹。同时,可以根据聚类簇中终端设备信息和访问轨迹对用户进行分群,支持对比不同类型用户在行为路径差异。当访问轨迹中有落地页时,也可以通过筛选不同落地页作为起点,查看用户后续的路径分布。实现既可以零配置的查看全局用户路径分布,也可以一定程度的满足可配置分析的需求。The access data processing method provided by the embodiments of the present disclosure can view the path distribution of users when using products through cluster information without configuration, and supports global observation of users' hot pages and main access trajectories in products. At the same time, users can be grouped according to the terminal device information and access trajectories in the cluster, supporting the comparison of the differences in behavioral paths of different types of users. When there are landing pages in the access track, you can also filter different landing pages as a starting point to view the user's subsequent path distribution. It can not only view the global user path distribution with zero configuration, but also meet the needs of configurable analysis to a certain extent.
本公开的实施例提供的访问数据处理方法,首先通过应用的埋点数据,采集不同终端设备访问应用的页面的访问日志;其次,基于访 问日志,获取至少一个预设时间周期中各个终端设备对应用的页面在至少一个访次中的访问轨迹;再次,对具有相同访问轨迹的终端设备进行聚合统计,得到对应访问轨迹以及终端设备信息的聚合簇;最后,基于各个聚合簇对应的访问轨迹和终端设备信息,对应用的页面进行优化,得到优化后的应用。由此,基于终端设备对应用的页面在一个访次中的访问轨迹,聚类终端设备,可以确定具有相同访问轨迹的所有终端设备,为应用的页面提供了有效的优化依据,提高了应用优化效率,提高了用户体验。The access data processing method provided by the embodiment of the present disclosure first collects the access logs of the pages of the application accessed by different terminal devices through the embedded data of the application; secondly, based on the access data, Query the log to obtain the access track of each terminal device to the application page in at least one visit in at least one preset time period; thirdly, perform aggregate statistics on the terminal devices with the same access track to obtain the corresponding access track and terminal device information aggregation clusters; finally, based on the access trajectories and terminal device information corresponding to each aggregation cluster, the application page is optimized to obtain an optimized application. Therefore, based on the access trajectory of the terminal device to the application page in one visit, clustering the terminal devices can determine all terminal devices with the same access trajectory, providing an effective optimization basis for the application page and improving application optimization. Efficiency and improved user experience.
为了对访问轨迹进行更好的分析,在本公开的另一个实施例中,上述访问数据处理方法还包括:接收查询轨迹,查询轨迹包括应用的至少一个页面;将查询轨迹分别与各个聚合簇对应的访问轨迹进行匹配;响应于确定查询轨迹与聚合簇对应的访问轨迹相匹配,得到并显示该聚合簇对应的终端设备信息。In order to better analyze access trajectories, in another embodiment of the present disclosure, the above-mentioned access data processing method also includes: receiving query trajectories, where the query trajectories include at least one page of the application; and corresponding query trajectories to each aggregation cluster. The access traces are matched; in response to determining that the query trace matches the access trace corresponding to the aggregation cluster, the terminal device information corresponding to the aggregation cluster is obtained and displayed.
本实施例中,为了更好的查询访问轨迹,可以接收开发人员发送的查询轨迹,查询轨迹可以是与聚合簇对应的访问轨迹相同的轨迹,也可以是与聚合簇对应的访问轨迹不相同的轨迹,当查询轨迹与访问轨迹相同时,确定查询轨迹与聚合簇对应的访问轨迹相匹配,相应可以得到于查询轨迹对应的聚合簇的终端设备信息,例如终端设备数量,从而可以分析查询轨迹的数据访问流量。In this embodiment, in order to better query the access track, the query track sent by the developer can be received. The query track can be the same as the access track corresponding to the aggregation cluster, or it can be different from the access track corresponding to the aggregation cluster. Trajectory, when the query trajectory is the same as the access trajectory, it is determined that the query trajectory matches the access trajectory corresponding to the aggregation cluster. Accordingly, the terminal device information of the aggregation cluster corresponding to the query trajectory can be obtained, such as the number of terminal devices, so that the query trajectory can be analyzed. Data access traffic.
本实施例中,通过查询轨迹查询聚合簇的终端的信息,为应用的页面的数据访问情况提供了可靠的查询依据,保证了应用访问数据分析的可靠性。In this embodiment, the information of the terminals of the aggregation cluster is queried through the query trajectory, which provides a reliable query basis for the data access status of the application page and ensures the reliability of the application access data analysis.
为了对应用页面的访问情况进行统计,在本公开的另一个实施例中,上述访问数据处理方法还包括:为应用的所有页面进行标签标注;响应于接收到页面的标签,对该页面所在的访问轨迹进行统计,得到访问轨迹统计结果。In order to collect statistics on access to application pages, in another embodiment of the present disclosure, the above access data processing method also includes: labeling all pages of the application; in response to receiving the label of the page, labeling the page where the page is located. Perform statistics on access trajectories to obtain access trajectories statistical results.
本实施例中,为应用的所有页面进行标签标注,则生成的访问轨迹也具有标注的标签,在对所有页面中任一页面进行访问轨迹进行统 计时,可以确定该页面所在的访问轨迹以及该页面所在的聚合簇,进而可以通过出访问轨迹的数量、聚合簇的数量等信息。In this embodiment, all pages of the application are tagged, and the generated access trajectories also have tagged tags. After the access trajectories of any page among all pages are unified, Timing can determine the access track where the page is located and the aggregation cluster where the page is located, and then the number of access tracks, the number of aggregation clusters and other information can be obtained.
本实施例中,访问轨迹统计结果包括:访问轨迹名称、访问轨迹数量、访问轨迹对应的聚合簇、访问轨迹对应的终端设备的数量等。In this embodiment, the access trajectory statistics results include: access trajectory name, access trajectory number, aggregation cluster corresponding to the access trajectory, number of terminal devices corresponding to the access trajectory, etc.
本实施例中,标签可以是文字、符号、代码等,接收的页面的标签可以唯一指示应用的页面,通过为页面设置标签,可以支持对页面的独立查询,以及对接收到的页面的访问轨迹的统计,从而实现了以访问轨迹为横向、页面为纵向查询的交织查询方式。In this embodiment, tags can be text, symbols, codes, etc. The tag of the received page can uniquely indicate the page of the application. By setting tags for the page, independent query of the page can be supported, as well as the access track of the received page. Statistics, thereby realizing an interleaved query method with access trajectories as horizontal queries and pages as vertical queries.
本实施例中,通过为各个页面进行标签标注,将用户路径依据所包含的页面进行标签标注,在前端配置上,可以满足用户输入关注页面的唯一标识后,可查看到经过该页面的访问轨迹分布,还可以有效查询到聚合簇中终端设备的数量等访问轨迹统计结果,提高了页面检索的可靠性。In this embodiment, each page is tagged, and the user path is tagged according to the included page. In the front-end configuration, the user can view the access track through the page after inputting the unique identifier of the page of interest. Distribution, you can also effectively query the access trajectory statistics such as the number of terminal devices in the aggregate cluster, which improves the reliability of page retrieval.
为了对应用页面的落地页进行统计,在本公开的另一个实施例中,上述方法还包括:为各个聚合簇对应的访问轨迹的第一页面进行落地页标注;响应于接收到以一个页面作为落地页的查询条件时,得到并显示所有该页面作为落地页的访问轨迹。In order to count the landing pages of application pages, in another embodiment of the present disclosure, the above method further includes: marking the landing page for the first page of the access track corresponding to each aggregation cluster; in response to receiving a page as When querying the landing page, obtain and display all the access tracks of this page as a landing page.
本实施例中,对所有访问轨迹的第一页面进行落地页标注,可以增加对访问轨迹的查询条件;例如,当一个访问轨迹包括:详情页、搜索页;则将详情页作为落地页,当查询条件为:以详情页为落地页时,查询并显示所有访问轨迹中以详情页作为落地页的访问轨迹,需要说明的是,可以采用桑基图对访问轨迹进行显示与查询。In this embodiment, by marking the landing page on the first page of all access trajectories, query conditions for the access trajectories can be added; for example, when an access trajectory includes: details page and search page; then the details page is used as the landing page. The query conditions are: when the details page is used as the landing page, query and display all access trajectories with the details page as the landing page. It should be noted that the Sankey diagram can be used to display and query the access trajectories.
本实施例中,为各个聚合簇对应的访问轨迹的第一页面进行落地页标注,在以一个页面作为落地页的查询条件时,得到并显示所有该页面作为落地页的访问轨迹,提高了访问轨迹查询的丰富度。In this embodiment, the landing page is marked for the first page of the access track corresponding to each aggregation cluster. When one page is used as the query condition of the landing page, all access tracks of this page as the landing page are obtained and displayed, which improves access Richness of trajectory queries.
为了对应用页面的落地页进行统计,在本公开的另一个实施例中,上述方法还包括:为各个聚合簇对应的访问轨迹的最后页面进行退出页标注;响应于接收到以一个页面作为退出页的查询条件时,得到并 显示所有该页面作为落地页的访问轨迹。In order to collect statistics on landing pages of application pages, in another embodiment of the present disclosure, the above method also includes: marking the last page of the access track corresponding to each aggregation cluster as an exit page; in response to receiving a page as the exit Page query conditions are obtained and Display all access tracks of this page as a landing page.
本实施例中,对所有访问轨迹的最后页面进行退出页标注,可以增加对访问轨迹的查询条件;例如,当一个访问轨迹包括:详情页、搜索页;则将搜索页作为退出页,当查询条件为:以搜索页为退出页时,查询并显示所有访问轨迹中以搜素页作为退出页的访问轨迹,需要说明的是,可以采用桑基图对访问轨迹进行显示。In this embodiment, the exit page is marked on the last page of all access trajectories, and query conditions for the access trajectories can be added; for example, when an access trajectory includes: details page and search page; then the search page is used as the exit page. When querying The conditions are: when the search page is used as the exit page, query and display the access trajectories in all access trajectories that use the search page as the exit page. It should be noted that a Sankey diagram can be used to display the access trajectories.
本实施例中,为各个聚合簇对应的访问轨迹的最后页面进行退出页标注,在以一个页面作为退出页的查询条件时,得到并显示所有该页面作为退出页的访问轨迹,提高了访问轨迹查询的丰富度。In this embodiment, the exit page is marked for the last page of the access track corresponding to each aggregation cluster. When one page is used as the query condition for the exit page, all the access tracks for this page as the exit page are obtained and displayed, which improves the access track. The richness of the query.
请参见图3,其示出了本公开提供的访问数据处理方法的另一个实施例的流程300。该访问数据处理方法可以包括以下步骤:Please refer to FIG. 3 , which shows a process 300 of another embodiment of the access data processing method provided by the present disclosure. The access data processing method may include the following steps:
步骤301,通过应用的埋点数据,采集不同终端设备访问应用的页面的访问日志。Step 301: Collect access logs of pages of the application accessed by different terminal devices through the embedded data of the application.
步骤302,基于访问日志,获取至少一个预设时间周期中各个终端设备对应用的页面在至少一个访次中的访问轨迹。Step 302: Based on the access log, obtain the access track of each terminal device to the page of the application in at least one visit in at least one preset time period.
步骤303,对具有相同访问轨迹的终端设备进行聚合统计,得到对应访问轨迹以及终端设备信息的聚合簇。Step 303: Perform aggregation statistics on terminal devices with the same access trajectories to obtain aggregation clusters corresponding to access trajectories and terminal device information.
步骤304,采用桑基图对各个聚合簇对应的终端设备信息和访问轨迹进行展示。Step 304: Use a Sankey diagram to display the terminal device information and access trajectories corresponding to each aggregation cluster.
桑基图(Sankey diagram):即桑基能量分流图,也叫桑基能量平衡图。它是一种特定类型的流程图,图中延伸的分支的宽度对应数据流量的大小。Sankey diagram: Sankey energy distribution diagram, also called Sankey energy balance diagram. It is a specific type of flowchart in which the width of the extended branches corresponds to the size of the data flow.
具体地,使用可视化工具,调用应用层数据,通过拖拽字段的形式就可以完成桑基图的搭建。将各个访问轨迹中的第1步、第2步……第N(N>1)步字段依次拖拽到维度,指标设置为用户数字段,再设置排序条件(例如用户数降序)。Specifically, you can use visualization tools to call application layer data and drag and drop fields to complete the construction of the Sankey diagram. Drag the first, second, and Nth (N>1) step fields in each access track to the dimension in sequence, set the indicator to the user number field, and then set the sorting conditions (for example, descending order of user number).
在没有任何条件筛选的情况下,桑基图可以展示的是无需配置的某应用的访问轨迹分布情况。鼠标滑动到某一步上时,提示经过该页面的终端设备数量;鼠标滑动到访问轨迹任一两步之间时,提示的是 从某一页面去到另一页面的终端设备数量。Without any conditional filtering, the Sankey diagram can show the access trajectory distribution of an application without configuration. When the mouse slides over a certain step, the number of terminal devices passing through the page is prompted; when the mouse slides between any two steps of the access track, the prompt is The number of terminal devices that go from one page to another.
如图4所示,为采用桑基图展示访问轨迹的一种示意图,在图4中,S、T、W、V、M、N、U代表应用中的不同页面,相应地,还可以在桑基图对应位置显示终端设备信息(如终端设备数量、ID等)。As shown in Figure 4, it is a schematic diagram showing access trajectories using Sankey diagram. In Figure 4, S, T, W, V, M, N, and U represent different pages in the application. Correspondingly, you can also The corresponding position of the Sankey diagram displays terminal device information (such as the number of terminal devices, ID, etc.).
步骤305,基于各个聚合簇对应的访问轨迹和终端设备信息,对应用的页面进行优化,得到优化后的应用。Step 305: Based on the access trajectories and terminal device information corresponding to each aggregation cluster, optimize the application page to obtain an optimized application.
应当理解,上述步骤301-步骤303、步骤305中的操作和特征,分别与步骤201-步骤204中的操作和特征相对应,因此,上述步骤201-步骤204中对于操作和特征的描述,同样适用于步骤301-步骤303、步骤305,在此不再赘述。It should be understood that the operations and features in the above steps 301 to 303 and 305 respectively correspond to the operations and features in steps 201 to 204. Therefore, the descriptions of the operations and features in the above steps 201 to 204 are also the same. It is applicable to step 301 to step 303 and step 305, and will not be described again here.
本实施例提供的访问数据处理方法,在对应用的页面进行优化之前,采用桑基图对终端设备信息和访问轨迹进行展示,可以形象化的示意聚合簇,为应用改进提供形象的轨迹展示效果。The access data processing method provided in this embodiment uses a Sankey diagram to display the terminal device information and access trajectories before optimizing the application page, which can visually represent the aggregation clusters and provide a vivid trajectory display effect for application improvement. .
进一步参考图5,作为对上述各图所示方法的实现,本公开提供了访问数据处理装置的一个实施例,该装置实施例与图2所示的方法实施例相对应,该装置具体可以应用于各种电子设备中。With further reference to Figure 5, as an implementation of the methods shown in the above figures, the present disclosure provides an embodiment of an access data processing device. The device embodiment corresponds to the method embodiment shown in Figure 2. The device can be specifically applied in various electronic devices.
如图5所示,本公开的实施例提供了一种访问数据处理装置500,该装置500包括:采集单元501、获取单元502、聚合单元503、优化单元504。其中,上述采集单元501,可以被配置成通过应用的埋点数据,采集不同终端设备访问应用的页面的访问日志。上述获取单元502,可以被配置成基于访问日志,获取至少一个预设时间周期中各个终端设备对应用的页面在至少一个访次中的访问轨迹。上述聚合单元503,可以被配置成对具有相同访问轨迹的终端设备进行聚合统计,得到对应访问轨迹以及终端设备信息的聚合簇。上述优化单元504,可以被配置成基于各个聚合簇对应的访问轨迹和终端设备信息,对应用的页面进行优化,得到优化后的应用。As shown in Figure 5, an embodiment of the present disclosure provides an access data processing device 500. The device 500 includes: a collection unit 501, an acquisition unit 502, an aggregation unit 503, and an optimization unit 504. The above-mentioned collection unit 501 may be configured to collect access logs of pages of the application accessed by different terminal devices through buried point data of the application. The above-mentioned obtaining unit 502 may be configured to obtain, based on the access log, the access track of each terminal device to the page of the application in at least one visit in at least one preset time period. The above-mentioned aggregation unit 503 may be configured to perform aggregation statistics on terminal devices with the same access trajectories, and obtain aggregation clusters corresponding to access trajectories and terminal device information. The above-mentioned optimization unit 504 may be configured to optimize application pages based on access trajectories and terminal device information corresponding to each aggregation cluster to obtain an optimized application.
在本实施例中,访问数据处理装置500中,采集单元501、获取单元502、聚合单元503、优化单元504的具体处理及其所带来的技术效果可分别参考图2对应实施例中的步骤201、步骤202、步骤203、 步骤204。In this embodiment, in the access data processing device 500, the specific processing of the collection unit 501, the acquisition unit 502, the aggregation unit 503, and the optimization unit 504 and the technical effects they bring can be referred to the steps in the corresponding embodiment of Figure 2 respectively. 201, step 202, step 203, Step 204.
在一些实施例中,上述装置500还包括:接收单元(图中未示出)、匹配单元(图中未示出)、得到单元(图中未示出)。其中,上述接收单元,可以被配置成接收查询轨迹,查询轨迹包括应用的至少一个页面。上述匹配单元,可以被配置成将查询轨迹分别与各个聚合簇对应的访问轨迹进行匹配。上述得到单元,可以被配置成响应于确定查询轨迹与聚合簇对应的访问轨迹相匹配,得到并显示该聚合簇对应的终端设备信息。In some embodiments, the above-mentioned device 500 further includes: a receiving unit (not shown in the figure), a matching unit (not shown in the figure), and a obtaining unit (not shown in the figure). Wherein, the above-mentioned receiving unit may be configured to receive a query track, where the query track includes at least one page of the application. The above-mentioned matching unit may be configured to match the query trajectories with the access trajectories corresponding to each aggregation cluster. The above obtaining unit may be configured to obtain and display the terminal device information corresponding to the aggregation cluster in response to determining that the query trajectory matches the access trajectory corresponding to the aggregation cluster.
在一些实施例中,上述装置500还包括:展示单元(图中未示出)。其中,上述展示单元,可以被配置成采用桑基图对各个聚合簇对应的终端设备信息和访问轨迹进行展示。In some embodiments, the above-mentioned device 500 further includes: a display unit (not shown in the figure). The above display unit may be configured to use a Sankey diagram to display the terminal device information and access trajectories corresponding to each aggregation cluster.
在一些实施例中,上述装置500还包括:页面标注单元(图中未示出)、统计单元(图中未示出)。其中,上述标注单元,可以被配置成为应用的所有页面进行标签标注。上述统计单元,可以被配置成响应于接收到页面的标签,对该页面所在的访问轨迹进行统计,得到访问轨迹统计结果。In some embodiments, the above-mentioned device 500 also includes: a page annotation unit (not shown in the figure) and a statistics unit (not shown in the figure). Among them, the above-mentioned labeling unit can be configured to label all pages of the application. The above statistics unit may be configured to, in response to receiving the tag of the page, perform statistics on the access track where the page is located, and obtain access track statistics results.
在一些实施例中,上述装置500还包括:落地标注单元(图中未示出)、落地查询单元(图中未示出)。其中,上述落地标注单元,可以被配置成为各个聚合簇对应的访问轨迹的第一页面进行落地页标注。落地查询单元,被配置成响应于接收到以一个页面作为落地页的查询条件时,得到并显示所有该页面作为落地页的访问轨迹。In some embodiments, the above-mentioned device 500 also includes: a landing annotation unit (not shown in the figure) and a landing query unit (not shown in the figure). Wherein, the above-mentioned landing annotation unit may be configured to perform landing page annotation on the first page of the access track corresponding to each aggregation cluster. The landing query unit is configured to, in response to receiving a query condition using a page as a landing page, obtain and display all access tracks of the page as a landing page.
在一些实施例中,上述装置500还包括:退出标注单元(图中未示出)、退出查询单元(图中未示出)。其中,上述退出标注单元,可以被配置成为各个聚合簇对应的访问轨迹的最后页面进行退出页标注。上述退出查询单元,可以被配置成响应于接收到以一个页面作为退出页的查询条件时,得到并显示所有该页面作为落地页的访问轨迹。In some embodiments, the above-mentioned device 500 also includes: an exit annotation unit (not shown in the figure) and an exit query unit (not shown in the figure). Among them, the above-mentioned exit labeling unit can be configured to perform exit page labeling on the last page of the access track corresponding to each aggregation cluster. The above-mentioned exit query unit may be configured to, in response to receiving a query condition that uses a page as an exit page, obtain and display all access trajectories of this page as a landing page.
在一些实施例中,上述应用的页面包括:至少一个落地页,每个聚合簇的访问轨迹上均设置有与该访问轨迹的最后一个页面相邻的跳出节点,上述优化单元504包括:计算模块(图中未示出)、页面优化模块(图中未示出)。其中,上述计算模块,可以被配置成针对至少一 个页面作为落地页,当该落地页在访问轨迹中的下一节点是跳出节点,计算该跳出节点的终端设备的数量与该落地页处于访问轨迹首节点的终端设备的数量的比例。上述页面优化模块,可以被配置成响应于比例大于所有落地页的退出率均值,对该落地页进行优化,得到优化后的应用。In some embodiments, the pages of the above-mentioned application include: at least one landing page, and the access track of each aggregation cluster is provided with a bounce node adjacent to the last page of the access track. The above-mentioned optimization unit 504 includes: a calculation module (not shown in the figure), page optimization module (not shown in the figure). Wherein, the above-mentioned computing module can be configured to target at least one A page is used as a landing page. When the next node of the landing page in the access trajectory is a bounce node, the ratio of the number of terminal devices of the bounce node to the number of terminal devices of the landing page at the first node of the access trajectory is calculated. The above page optimization module can be configured to optimize the landing page in response to a ratio greater than the average exit rate of all landing pages to obtain an optimized application.
在一些实施例中,上述优化单元504包括:查询模块(图中未示出)、转化模块(图中未示出)、应用优化模块(图中未示出)。上述查询模块,可以被配置成从所有聚合簇对应的访问轨迹中查询预设访问轨迹。上述转化模块,可以被配置成基于预设访问轨迹的各个页面的终端设备信息,计算预设访问轨迹中各个页面的转化率。上述应用优化模块,可以被配置成响应于预设访问轨迹中一页面的转化率小于预设访问轨迹中除该页面之外的页面的转化率,对该页面进行优化,得到优化后的应用。In some embodiments, the above-mentioned optimization unit 504 includes: a query module (not shown in the figure), a conversion module (not shown in the figure), and an application optimization module (not shown in the figure). The above query module can be configured to query the preset access trajectories from the access trajectories corresponding to all aggregation clusters. The above-mentioned conversion module may be configured to calculate the conversion rate of each page in the preset access track based on the terminal device information of each page in the preset access track. The above-mentioned application optimization module may be configured to optimize the page in response to the conversion rate of a page in the preset access track being less than the conversion rate of pages other than the page in the preset access track to obtain an optimized application.
在一些实施例中,每个聚合簇的访问轨迹上均设置有与该访问轨迹的最后一个页面相邻的跳出节点,上述优化单元504包括:遍历模块(图中未示出)、节点优化模块(图中未示出)。其中,上述遍历模块,可以被配置成遍历所有聚合簇对应的访问轨迹,计算所有访问轨迹中跳出节点之前的页面的终端设备的数量。上述节点优化模块,可以被配置成响应于跳出节点之前的页面的终端设备的数量大于预设阈值,对跳出节点之前的页面进行优化,得到优化后的应用。In some embodiments, the access track of each aggregation cluster is provided with a jump node adjacent to the last page of the access track. The above-mentioned optimization unit 504 includes: a traversal module (not shown in the figure), a node optimization module (not shown in the figure). Wherein, the above traversal module can be configured to traverse the access trajectories corresponding to all aggregation clusters, and calculate the number of terminal devices in all access trajectories that jump out of the page before the node. The above-mentioned node optimization module may be configured to respond to the number of terminal devices that jump out of the page before the node is greater than the preset threshold, optimize the page before jumping out of the node, and obtain an optimized application.
本公开的实施例提供的访问数据处理装置,首先采集单元501通过应用的埋点数据,采集不同终端设备访问应用的页面的访问日志;其次,获取单元502基于访问日志,获取至少一个预设时间周期中各个终端设备对应用的页面在至少一个访次中的访问轨迹;再次,聚合单元503对具有相同访问轨迹的终端设备进行聚合统计,得到对应访问轨迹以及终端设备信息的聚合簇;最后,优化单元504基于各个聚合簇对应的访问轨迹和终端设备信息,对应用的页面进行优化,得到优化后的应用。由此,基于终端设备对应用的页面在一个访次中的访问轨迹,聚类终端设备,可以确定具有相同访问轨迹的所有终端设备,为应用的页面提供了有效的优化依据,提高了应用优化效率,提高了 用户体验。In the access data processing device provided by the embodiment of the present disclosure, the collection unit 501 first collects the access logs of the pages of the application accessed by different terminal devices through the embedded data of the application; secondly, the acquisition unit 502 obtains at least one preset time based on the access log. The access track of each terminal device to the application page in at least one visit during the cycle; again, the aggregation unit 503 performs aggregate statistics on the terminal devices with the same access track, and obtains an aggregation cluster corresponding to the access track and terminal device information; finally, The optimization unit 504 optimizes the application pages based on the access trajectories and terminal device information corresponding to each aggregation cluster to obtain an optimized application. Therefore, based on the access trajectory of the terminal device to the application page in one visit, clustering the terminal devices can determine all terminal devices with the same access trajectory, providing an effective optimization basis for the application page and improving application optimization. efficiency, improved user experience.
下面参考图6,其示出了适于用来实现本公开的实施例的电子设备600的结构示意图。Referring now to FIG. 6 , a schematic structural diagram of an electronic device 600 suitable for implementing embodiments of the present disclosure is shown.
如图6所示,电子设备600可以包括处理装置(例如中央处理器、图形处理器等)601,其可以根据存储在只读存储器(ROM)602中的程序或者从存储装置608加载到随机访问存储器(RAM)603中的程序而执行各种适当的动作和处理。在RAM 603中,还存储有电子设备600操作所需的各种程序和数据。处理装置601、ROM 602以及RAM603通过总线604彼此相连。输入/输出(I/O)接口605也连接至总线604。As shown in FIG. 6, the electronic device 600 may include a processing device (eg, central processing unit, graphics processor, etc.) 601, which may be loaded into a random access device according to a program stored in a read-only memory (ROM) 602 or from a storage device 608. The program in the memory (RAM) 603 executes various appropriate actions and processes. In the RAM 603, various programs and data required for the operation of the electronic device 600 are also stored. The processing device 601, ROM 602 and RAM 603 are connected to each other via a bus 604. An input/output (I/O) interface 605 is also connected to bus 604.
通常,以下装置可以连接至I/O接口605:包括例如触摸屏、触摸板、键盘、鼠标、等的输入装置606;包括例如液晶显示器(LCD,Liquid Crystal Display)、扬声器、振动器等的输出装置607;包括例如磁带、硬盘等的存储装置608;以及通信装置609。通信装置609可以允许电子设备600与其他设备进行无线或有线通信以交换数据。虽然图6示出了具有各种装置的电子设备600,但是应理解的是,并不要求实施或具备所有示出的装置。可以替代地实施或具备更多或更少的装置。图6中示出的每个方框可以代表一个装置,也可以根据需要代表多个装置。Generally, the following devices can be connected to the I/O interface 605: input devices 606 including, for example, a touch screen, touch pad, keyboard, mouse, etc.; output devices including, for example, a liquid crystal display (LCD, Liquid Crystal Display), speakers, vibrators, etc. 607; a storage device 608 including, for example, a magnetic tape, a hard disk, etc.; and a communication device 609. Communication device 609 may allow electronic device 600 to communicate wirelessly or wiredly with other devices to exchange data. Although FIG. 6 illustrates electronic device 600 with various means, it should be understood that implementation or availability of all illustrated means is not required. More or fewer means may alternatively be implemented or provided. Each block shown in Figure 6 may represent one device, or may represent multiple devices as needed.
特别地,根据本公开的实施例,上文参考流程图描述的过程可以被实现为计算机软件程序。例如,本公开的实施例包括一种计算机程序产品,其包括承载在计算机可读介质上的计算机程序,该计算机程序包含用于执行流程图所示的方法的程序代码。在这样的实施例中,该计算机程序可以通过通信装置609从网络上被下载和安装,或者从存储装置608被安装,或者从ROM 602被安装。在该计算机程序被处理装置601执行时,执行本公开的实施例的方法中限定的上述功能。In particular, according to embodiments of the present disclosure, the processes described above with reference to the flowcharts may be implemented as computer software programs. For example, embodiments of the present disclosure include a computer program product including a computer program carried on a computer-readable medium, the computer program containing program code for performing the method illustrated in the flowchart. In such embodiments, the computer program may be downloaded and installed from the network via communication device 609, or from storage device 608, or from ROM 602. When the computer program is executed by the processing device 601, the above-described functions defined in the method of the embodiment of the present disclosure are performed.
需要说明的是,本公开的实施例的计算机可读介质可以是计算机可读信号介质或者计算机可读存储介质或者是上述两者的任意组合。计算机可读存储介质例如可以是——但不限于——电、磁、光、电磁、 红外线、或半导体的系统、装置或器件,或者任意以上的组合。计算机可读存储介质的更具体的例子可以包括但不限于:具有一个或多个导线的电连接、便携式计算机磁盘、硬盘、随机访问存储器(RAM)、只读存储器(ROM)、可擦式可编程只读存储器(EPROM或闪存)、光纤、便携式紧凑磁盘只读存储器(CD-ROM)、光存储器件、磁存储器件、或者上述的任意合适的组合。在本公开的实施例中,计算机可读存储介质可以是任何包含或存储程序的有形介质,该程序可以被指令执行系统、装置或者器件使用或者与其结合使用。而在本公开的实施例中,计算机可读信号介质可以包括在基带中或者作为载波一部分传播的数据信号,其中承载了计算机可读的程序代码。这种传播的数据信号可以采用多种形式,包括但不限于电磁信号、光信号或上述的任意合适的组合。计算机可读信号介质还可以是计算机可读存储介质以外的任何计算机可读介质,该计算机可读信号介质可以发送、传播或者传输用于由指令执行系统、装置或者器件使用或者与其结合使用的程序。计算机可读介质上包含的程序代码可以用任何适当的介质传输,包括但不限于:电线、光缆、RF(Radio Frequency,射频)等等,或者上述的任意合适的组合。It should be noted that the computer-readable medium in the embodiments of the present disclosure may be a computer-readable signal medium or a computer-readable storage medium, or any combination of the above two. Computer-readable storage media may be, for example, but not limited to, electronic, magnetic, optical, electromagnetic, Infrared, or semiconductor systems, devices or devices, or any combination of the above. More specific examples of computer readable storage media may include, but are not limited to: an electrical connection having one or more wires, a portable computer disk, a hard drive, random access memory (RAM), read only memory (ROM), removable Programmed read-only memory (EPROM or flash memory), fiber optics, portable compact disk read-only memory (CD-ROM), optical storage device, magnetic storage device, or any suitable combination of the above. In embodiments of the present disclosure, a computer-readable storage medium may be any tangible medium that contains or stores a program for use by or in connection with an instruction execution system, apparatus, or device. In embodiments of the present disclosure, the computer-readable signal medium may include a data signal propagated in baseband or as part of a carrier wave, in which computer-readable program code is carried. Such propagated data signals may take many forms, including but not limited to electromagnetic signals, optical signals, or any suitable combination of the above. A computer-readable signal medium may also be any computer-readable medium other than a computer-readable storage medium that can send, propagate, or transmit a program for use by or in connection with an instruction execution system, apparatus, or device . Program code contained on a computer-readable medium can be transmitted using any appropriate medium, including but not limited to: wires, optical cables, RF (Radio Frequency, Radio Frequency), etc., or any suitable combination of the above.
上述计算机可读介质可以是上述服务器中所包含的;也可以是单独存在,而未装配入该服务器中。上述计算机可读介质承载有一个或者多个程序,当上述一个或者多个程序被该服务器执行时,使得该服务器:通过应用的埋点数据,采集不同终端设备访问应用的页面的访问日志;基于访问日志,获取至少一个预设时间周期中各个终端设备对应用的页面在至少一个访次中的访问轨迹;对具有相同访问轨迹的终端设备进行聚合统计,得到对应访问轨迹以及终端设备信息的聚合簇;基于各个聚合簇对应的访问轨迹和终端设备信息,对应用的页面进行优化,得到优化后的应用。The above-mentioned computer-readable medium may be included in the above-mentioned server; it may also exist separately without being assembled into the server. The above-mentioned computer-readable medium carries one or more programs. When the above-mentioned one or more programs are executed by the server, the server: collects the access logs of the pages of the application accessed by different terminal devices through the buried point data of the application; based on Access log, obtain the access track of each terminal device to the application page in at least one visit in at least one preset time period; perform aggregate statistics on terminal devices with the same access track, and obtain an aggregation of the corresponding access track and terminal device information Clusters; based on the access trajectories and terminal device information corresponding to each aggregated cluster, the application page is optimized to obtain an optimized application.
可以以一种或多种程序设计语言或其组合来编写用于执行本公开的实施例的操作的计算机程序代码,程序设计语言包括面向对象的程序设计语言—诸如Java、Smalltalk、C++,还包括常规的过程式程序设计语言—诸如“C”语言或类似的程序设计语言。程序代码可以完全地 在用户计算机上执行、部分地在用户计算机上执行、作为一个独立的软件包执行、部分在用户计算机上部分在远程计算机上执行、或者完全在远程计算机或服务器上执行。在涉及远程计算机的情形中,远程计算机可以通过任意种类的网络——包括局域网(LAN)或广域网(WAN)—连接到用户计算机,或者,可以连接到外部计算机(例如利用因特网服务提供商来通过因特网连接)。Computer program code for performing operations of embodiments of the present disclosure may be written in one or more programming languages, including object-oriented programming languages—such as Java, Smalltalk, C++, and A conventional procedural programming language—such as "C" or a similar programming language. Program code can be completely Execute partly on the user's computer, execute partly on the user's computer, execute as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server. In situations involving remote computers, the remote computer can be connected to the user's computer through any kind of network, including a local area network (LAN) or a wide area network (WAN), or it can be connected to an external computer (such as an Internet service provider through Internet connection).
附图中的流程图和框图,图示了按照本公开的各种实施例的系统、方法和计算机程序产品的可能实现的体系架构、功能和操作。在这点上,流程图或框图中的每个方框可以代表一个模块、程序段、或代码的一部分,该模块、程序段、或代码的一部分包含一个或多个用于实现规定的逻辑功能的可执行指令。也应当注意,在有些作为替换的实现中,方框中所标注的功能也可以以不同于附图中所标注的顺序发生。例如,两个接连地表示的方框实际上可以基本并行地执行,它们有时也可以按相反的顺序执行,这依所涉及的功能而定。也要注意的是,框图和/或流程图中的每个方框、以及框图和/或流程图中的方框的组合,可以用执行规定的功能或操作的专用的基于硬件的系统来实现,或者可以用专用硬件与计算机指令的组合来实现。The flowcharts and block diagrams in the figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods, and computer program products in accordance with various embodiments of the present disclosure. In this regard, each block in the flowchart or block diagram may represent a module, segment, or portion of code that contains one or more logic functions that implement the specified executable instructions. It should also be noted that, in some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown one after another may actually execute substantially in parallel, or they may sometimes execute in the reverse order, depending on the functionality involved. It will also be noted that each block of the block diagram and/or flowchart illustration, and combinations of blocks in the block diagram and/or flowchart illustration, can be implemented by special purpose hardware-based systems that perform the specified functions or operations. , or can be implemented using a combination of specialized hardware and computer instructions.
描述于本公开的实施例中所涉及到的单元可以通过软件的方式实现,也可以通过硬件的方式来实现。所描述的单元也可以设置在处理器中,例如,可以描述为:一种处理器,包括采集单元、获取单元、聚合单元、优化单元。其中,这些单元的名称在某种情况下并不构成对该单元本身的限定,例如,采集单元还可以被描述为“被配置成通过应用的埋点数据,采集不同终端设备访问应用的页面的访问日志”的单元。The units involved in the embodiments of the present disclosure may be implemented in software or hardware. The described unit can also be provided in a processor. For example, it can be described as: a processor including a collection unit, an acquisition unit, an aggregation unit, and an optimization unit. Among them, the names of these units do not constitute a limitation on the unit itself under certain circumstances. For example, the collection unit can also be described as "configured to collect the pages of the application accessed by different terminal devices through the embedded data of the application." Access Log" unit.
以上描述仅为本公开的较佳实施例以及对所运用技术原理的说明。本领域技术人员应当理解,本公开的实施例中所涉及的发明范围,并不限于上述技术特征的特定组合而成的技术方案,同时也应涵盖在不脱离上述发明构思的情况下,由上述技术特征或其等同特征进行任意组合而形成的其它技术方案。例如上述特征与本公开的实施例中公开的(但不限于)具有类似功能的技术特征进行互相替换而形成的技术方 案。 The above description is only a description of the preferred embodiments of the present disclosure and the technical principles applied. Persons skilled in the art should understand that the scope of the invention involved in the embodiments of the present disclosure is not limited to technical solutions composed of specific combinations of the above technical features, and should also cover the above-mentioned technical solutions without departing from the above-mentioned inventive concept. Other technical solutions formed by any combination of technical features or their equivalent features. For example, the above features are formed by mutually replacing technical features with similar functions disclosed in the embodiments of the present disclosure (but not limited to). case.

Claims (13)

  1. 一种访问数据处理方法,其中,所述方法包括:An access data processing method, wherein the method includes:
    通过应用的埋点数据,采集不同终端设备访问所述应用的页面的访问日志;Collect the access logs of pages of the application accessed by different terminal devices through the embedded data of the application;
    基于所述访问日志,获取至少一个预设时间周期中各个终端设备对所述应用的页面在至少一个访次中的访问轨迹;Based on the access log, obtain the access track of each terminal device to the page of the application in at least one visit in at least one preset time period;
    对具有相同访问轨迹的终端设备进行聚合统计,得到对应所述访问轨迹以及终端设备信息的聚合簇;Perform aggregation statistics on terminal devices with the same access trajectories to obtain aggregation clusters corresponding to the access trajectories and terminal device information;
    基于各个聚合簇对应的访问轨迹和终端设备信息,对所述应用的页面进行优化,得到优化后的应用。Based on the access trajectories and terminal device information corresponding to each aggregation cluster, the page of the application is optimized to obtain an optimized application.
  2. 根据权利要求1所述的方法,其中,所述方法还包括:The method of claim 1, further comprising:
    接收查询轨迹,所述查询轨迹包括所述应用的至少一个页面;Receive a query track, the query track including at least one page of the application;
    将所述查询轨迹分别与各个聚合簇对应的访问轨迹进行匹配;Match the query trajectories with the access trajectories corresponding to each aggregation cluster;
    响应于确定所述查询轨迹与聚合簇对应的访问轨迹相匹配,得到并显示该聚合簇对应的终端设备信息。In response to determining that the query trace matches the access trace corresponding to the aggregation cluster, terminal device information corresponding to the aggregation cluster is obtained and displayed.
  3. 根据权利要求1所述的方法,其中,所述方法还包括:The method of claim 1, further comprising:
    采用桑基图对各个聚合簇对应的终端设备信息和访问轨迹进行展示。A Sankey diagram is used to display the terminal device information and access trajectories corresponding to each aggregation cluster.
  4. 根据权利要求1所述的方法,其中,所述方法还包括:The method of claim 1, further comprising:
    为所述应用的所有页面进行标签标注;Label all pages of the application;
    响应于接收到页面的标签,对该页面所在的访问轨迹进行统计,得到访问轨迹统计结果。In response to receiving the tag of the page, statistics are performed on the access track where the page is located, and the access track statistics result is obtained.
  5. 根据权利要求1所述的方法,其中,所述方法还包括:The method of claim 1, further comprising:
    为各个聚合簇对应的访问轨迹的第一页面进行落地页标注;Mark the landing page for the first page of the access track corresponding to each aggregation cluster;
    响应于接收到以一个页面作为落地页的查询条件时,得到并显示 所有该页面作为落地页的访问轨迹。In response to receiving a query condition that uses a page as the landing page, obtain and display All this page serves as the access track of the landing page.
  6. 根据权利要求1所述的方法,其中,所述方法还包括:The method of claim 1, further comprising:
    为各个聚合簇对应的访问轨迹的最后页面进行退出页标注;Mark the exit page for the last page of the access track corresponding to each aggregation cluster;
    响应于接收到以一个页面作为退出页的查询条件时,得到并显示所有该页面作为落地页的访问轨迹。In response to receiving a query condition that uses a page as an exit page, obtain and display all access tracks of this page as a landing page.
  7. 根据权利要求1-6之一所述的方法,其中,所述应用的页面包括:至少一个落地页,每个聚合簇的访问轨迹上均设置有与该访问轨迹的最后一个页面相邻的跳出节点,所述基于各个聚合簇对应的访问轨迹和终端设备信息,对所述应用的页面进行优化,得到优化后的应用,包括:The method according to any one of claims 1 to 6, wherein the application pages include: at least one landing page, and the access track of each aggregate cluster is provided with a jumpout adjacent to the last page of the access track. Node, the page of the application is optimized based on the access trajectory and terminal device information corresponding to each aggregation cluster, and the optimized application is obtained, including:
    针对至少一个页面作为落地页,当该落地页在访问轨迹中的下一节点是跳出节点,计算该跳出节点的终端设备的数量与该落地页处于访问轨迹首节点的终端设备的数量的比例;For at least one page as a landing page, when the next node of the landing page in the access trajectory is a bounce node, calculate the ratio of the number of terminal devices of the bounce node to the number of terminal devices of the landing page at the first node of the access trajectory;
    响应于所述比例大于所有落地页的退出率均值,对该落地页进行优化,得到优化后的应用。In response to the ratio being greater than the average exit rate of all landing pages, the landing page is optimized to obtain an optimized application.
  8. 根据权利要求1-6之一所述的方法,其中,所述基于各个聚合簇对应的访问轨迹和终端设备信息,对所述应用的页面进行优化,得到优化后的应用,包括:The method according to any one of claims 1 to 6, wherein the page of the application is optimized based on the access trajectories and terminal device information corresponding to each aggregation cluster to obtain an optimized application, including:
    从所有聚合簇对应的访问轨迹中查询预设访问轨迹;Query the preset access trajectories from the access trajectories corresponding to all aggregated clusters;
    基于所述预设访问轨迹的各个页面的终端设备信息,计算所述预设访问轨迹中各个页面的转化率;Calculate the conversion rate of each page in the preset access trajectory based on the terminal device information of each page in the preset access trajectory;
    响应于所述预设访问轨迹中一页面的转化率小于所述预设访问轨迹中除该页面之外的页面的转化率,对该页面进行优化,得到优化后的应用。In response to the conversion rate of a page in the preset access track being less than the conversion rate of pages other than the page in the preset access track, the page is optimized to obtain an optimized application.
  9. 根据权利要求1-6之一所述的方法,其中,每个聚合簇的访问轨迹上均设置有与该访问轨迹的最后一个页面相邻的跳出节点,所述 基于各个聚合簇对应的访问轨迹和终端设备信息,对所述应用的页面进行优化,得到优化后的应用,包括:The method according to any one of claims 1 to 6, wherein the access track of each aggregation cluster is provided with a jump node adjacent to the last page of the access track, said Based on the access trajectories and terminal device information corresponding to each aggregation cluster, the page of the application is optimized to obtain an optimized application, including:
    遍历所有聚合簇对应的访问轨迹,计算所有访问轨迹中所述跳出节点之前的页面的终端设备的数量;Traverse the access trajectories corresponding to all aggregation clusters, and calculate the number of terminal devices in all access trajectories that jump out of the page before the node;
    响应于所述跳出节点之前的页面的终端设备的数量大于预设阈值,对所述跳出节点之前的页面进行优化,得到优化后的应用。In response to the number of terminal devices on the page before the jumping out node being greater than the preset threshold, the page before the jumping out node is optimized to obtain an optimized application.
  10. 一种访问数据处理装置,其中,所述装置包括:An access data processing device, wherein the device includes:
    采集单元,被配置成通过应用的埋点数据,采集不同终端设备访问所述应用的页面的访问日志;The collection unit is configured to collect the access logs of different terminal devices accessing the pages of the application through the embedded data of the application;
    获取单元,被配置成基于所述访问日志,获取至少一个预设时间周期中各个终端设备对所述应用的页面在至少一个访次中的访问轨迹;The acquisition unit is configured to acquire, based on the access log, the access track of each terminal device to the page of the application in at least one visit in at least one preset time period;
    聚合单元,被配置成对具有相同访问轨迹的终端设备进行聚合统计,得到对应所述访问轨迹以及终端设备信息的聚合簇;an aggregation unit configured to perform aggregation statistics on terminal devices with the same access trajectory, and obtain an aggregation cluster corresponding to the access trajectory and terminal device information;
    优化单元,被配置成基于各个聚合簇对应的访问轨迹和终端设备信息,对所述应用的页面进行优化,得到优化后的应用。The optimization unit is configured to optimize the page of the application based on the access trajectories and terminal device information corresponding to each aggregation cluster to obtain an optimized application.
  11. 一种电子设备,包括:An electronic device including:
    一个或多个处理器;one or more processors;
    存储装置,其上存储有一个或多个程序;A storage device on which one or more programs are stored;
    其中,当所述一个或多个程序被所述一个或多个处理器执行,使得所述一个或多个处理器实现如权利要求1-9中任一所述的方法。Wherein, when the one or more programs are executed by the one or more processors, the one or more processors implement the method as described in any one of claims 1-9.
  12. 一种计算机可读介质,其上存储有计算机程序,其中,该程序被处理器执行时实现如权利要求1-9中任一所述的方法。A computer-readable medium with a computer program stored thereon, wherein when the program is executed by a processor, the method according to any one of claims 1-9 is implemented.
  13. 一种计算机程序产品,包括计算机程序,其中,所述计算机程序在被处理器执行时实现权利要求1-9中任一项所述的方法。 A computer program product comprising a computer program, wherein the computer program implements the method of any one of claims 1-9 when executed by a processor.
PCT/CN2023/076143 2022-03-10 2023-02-15 Access data processing method and apparatus, electronic device, and computer readable medium WO2023169165A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202210231228.6A CN114595473A (en) 2022-03-10 2022-03-10 Access data processing method and device, electronic equipment and computer readable medium
CN202210231228.6 2022-03-10

Publications (1)

Publication Number Publication Date
WO2023169165A1 true WO2023169165A1 (en) 2023-09-14

Family

ID=81808906

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2023/076143 WO2023169165A1 (en) 2022-03-10 2023-02-15 Access data processing method and apparatus, electronic device, and computer readable medium

Country Status (2)

Country Link
CN (1) CN114595473A (en)
WO (1) WO2023169165A1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114595473A (en) * 2022-03-10 2022-06-07 北京京东拓先科技有限公司 Access data processing method and device, electronic equipment and computer readable medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040133671A1 (en) * 2003-01-08 2004-07-08 David Taniguchi Click stream analysis
CN103823904A (en) * 2014-03-19 2014-05-28 广东绿瘦健康信息咨询有限公司 Webpage browsing path optimization method and system
CN106909567A (en) * 2015-12-23 2017-06-30 北京国双科技有限公司 Data processing method and device
CN109242164A (en) * 2018-08-22 2019-01-18 中国平安人寿保险股份有限公司 Optimize method and device, the computer storage medium, electronic equipment in product path
CN113590974A (en) * 2021-09-29 2021-11-02 北京每日优鲜电子商务有限公司 Recommendation page configuration method and device, electronic equipment and computer readable medium
CN114595473A (en) * 2022-03-10 2022-06-07 北京京东拓先科技有限公司 Access data processing method and device, electronic equipment and computer readable medium

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040133671A1 (en) * 2003-01-08 2004-07-08 David Taniguchi Click stream analysis
CN103823904A (en) * 2014-03-19 2014-05-28 广东绿瘦健康信息咨询有限公司 Webpage browsing path optimization method and system
CN106909567A (en) * 2015-12-23 2017-06-30 北京国双科技有限公司 Data processing method and device
CN109242164A (en) * 2018-08-22 2019-01-18 中国平安人寿保险股份有限公司 Optimize method and device, the computer storage medium, electronic equipment in product path
CN113590974A (en) * 2021-09-29 2021-11-02 北京每日优鲜电子商务有限公司 Recommendation page configuration method and device, electronic equipment and computer readable medium
CN114595473A (en) * 2022-03-10 2022-06-07 北京京东拓先科技有限公司 Access data processing method and device, electronic equipment and computer readable medium

Also Published As

Publication number Publication date
CN114595473A (en) 2022-06-07

Similar Documents

Publication Publication Date Title
US11106442B1 (en) Information technology networked entity monitoring with metric selection prior to deployment
US11244247B2 (en) Facilitating concurrent forecasting of multiple time series
US11947556B1 (en) Computerized monitoring of a metric through execution of a search query, determining a root cause of the behavior, and providing a notification thereof
US11593400B1 (en) Automatic triage model execution in machine data driven monitoring automation apparatus
US11620300B2 (en) Real-time measurement and system monitoring based on generated dependency graph models of system components
US11693895B1 (en) Graphical user interface with chart for event inference into tasks
US11379475B2 (en) Analyzing tags associated with high-latency and error spans for instrumented software
US10942960B2 (en) Automatic triage model execution in machine data driven monitoring automation apparatus with visualization
US20190095478A1 (en) Information technology networked entity monitoring with automatic reliability scoring
US20220269741A1 (en) Generating sequential segments with pre-sequence or post-sequence analytics data
US10657146B2 (en) Techniques for generating structured metrics from ingested events
Zheng et al. Service-generated big data and big data-as-a-service: an overview
Lee et al. The unified logging infrastructure for data analytics at Twitter
US9171319B2 (en) Analysis system and method used to construct social structures based on data collected from monitored web pages
US20170220672A1 (en) Enhancing time series prediction
US20070271519A1 (en) System and Method for Collecting User Interest Data
WO2020199662A1 (en) Method and device for pushing information
US8639560B2 (en) Brand analysis using interactions with search result items
CN110020273B (en) Method, device and system for generating thermodynamic diagram
WO2023169165A1 (en) Access data processing method and apparatus, electronic device, and computer readable medium
US11676345B1 (en) Automated adaptive workflows in an extended reality environment
CN111488386A (en) Data query method and device
US11663109B1 (en) Automated seasonal frequency identification
US11861767B1 (en) Streaming data visualizations
WO2022212724A1 (en) Delivery of data-driven and cross-platform experiences based on behavioral cohorts and identity resolution

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 23765743

Country of ref document: EP

Kind code of ref document: A1