CN111415200A - Data processing method and device - Google Patents

Data processing method and device Download PDF

Info

Publication number
CN111415200A
CN111415200A CN202010200785.2A CN202010200785A CN111415200A CN 111415200 A CN111415200 A CN 111415200A CN 202010200785 A CN202010200785 A CN 202010200785A CN 111415200 A CN111415200 A CN 111415200A
Authority
CN
China
Prior art keywords
user
retention
users
period
sparse
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202010200785.2A
Other languages
Chinese (zh)
Other versions
CN111415200B (en
Inventor
赵蕊
李深远
张李伟
刘旭芬
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tencent Music Entertainment Technology Shenzhen Co Ltd
Original Assignee
Tencent Music Entertainment Technology Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tencent Music Entertainment Technology Shenzhen Co Ltd filed Critical Tencent Music Entertainment Technology Shenzhen Co Ltd
Priority to CN202010200785.2A priority Critical patent/CN111415200B/en
Publication of CN111415200A publication Critical patent/CN111415200A/en
Application granted granted Critical
Publication of CN111415200B publication Critical patent/CN111415200B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising
    • G06Q30/0201Market modelling; Market analysis; Collecting market data
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Business, Economics & Management (AREA)
  • Strategic Management (AREA)
  • Engineering & Computer Science (AREA)
  • Accounting & Taxation (AREA)
  • Development Economics (AREA)
  • Finance (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Game Theory and Decision Science (AREA)
  • Data Mining & Analysis (AREA)
  • Economics (AREA)
  • Marketing (AREA)
  • Physics & Mathematics (AREA)
  • General Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Debugging And Monitoring (AREA)

Abstract

The application discloses a data processing method and a data processing device, wherein the data processing method comprises the following steps: acquiring sparse coded data of a preset user and acquiring preset retention analysis parameters, wherein the retention analysis parameters comprise initial operation related parameters and retention operation related parameters, determining a user to be checked, in which the sparse coded data meets the initial operation related parameters, in the preset user, and determining a retention user, in which the sparse coded data meets the retention operation related parameters, in the user to be checked; and determining a user retention rate based on the number of the users to be considered and the number of the retained users. By adopting the technical scheme, the retention rate of the user can be quickly calculated through sparse coding data, and the calculation efficiency of the retention rate of the user is improved.

Description

Data processing method and device
Technical Field
The present invention relates to the field of internet technologies, and in particular, to a data processing method and apparatus.
Background
In the internet industry, owners of application programs pay more attention to user retention rate of the application programs, and the user retention rate can reflect the quality of the application programs and the capability of retaining users on one hand, and reflect the function utilization rate of the application programs and determine the optimization iteration direction of the application programs on the other hand.
In the prior art, in order to obtain the retention rate of users, an original log needs to be traversed first to determine users to be checked in a first time period, then the original log is traversed again to screen out retention users which are subjected to specific subsequent operations in a second time period from the users to be checked, so that the retention rate of the users is calculated, and the method for calculating the retention rate of the users is very low in efficiency under the condition that the number of users is huge.
Disclosure of Invention
The embodiment of the invention provides a data processing method and device, which can be used for quickly calculating the user retention rate through sparse coding data and improving the calculation efficiency of the user retention rate.
In a first aspect, an embodiment of the present invention provides a data processing method, including:
acquiring sparse coded data of a preset user, wherein the sparse coded data is used for recording the operation of the preset user on a target application program;
obtaining preset retention analysis parameters, wherein the retention analysis parameters comprise initial operation related parameters and retention operation related parameters; wherein the start operation-related parameter is used to represent a start operation performed on the target application, and the leave operation-related parameter is used to represent a leave operation performed on the target application;
determining a user to be checked of which the sparse coded data meets the initial operation related parameters in the preset users, and determining a retention user of which the sparse coded data meets the retention operation related parameters in the user to be checked;
and determining a user retention rate based on the number of the users to be considered and the number of the retained users.
In one possible implementation example, the generating step of the sparse coding data of the preset user includes:
generating a bitmap code corresponding to a single recording period of the preset user according to at least one operation executed on the target application program by the preset user in the single recording period, wherein the bitmap code comprises at least one code value, and the code value is used for indicating whether the preset user executes the operation corresponding to the code value on the target application program;
obtaining an operation behavior value corresponding to the bitmap coding of the recording period;
obtaining a period index value corresponding to the recording period;
and generating sparse coding data corresponding to the preset user according to the operation behavior value and the cycle index value.
In an embodiment of a possible implementation, the obtaining a period index value corresponding to the recording period includes:
if the operation behavior value corresponding to the bitmap coding of the recording period is not null, obtaining a period index value corresponding to the recording period;
and if the operation behavior value corresponding to the bitmap coding of the recording period is null, not executing the step of obtaining the period index value corresponding to the recording period.
In one possible implementation, the obtaining the preset retention analysis parameter includes:
if the user retention calculation instruction is detected, outputting at least one retention parameter option;
and receiving the operation of the user on the at least one retention parameter option, and obtaining a preset retention analysis parameter according to the operation of the user on the at least one retention parameter option.
In one possible implementation, the preset user includes a plurality of users, one user corresponds to one sparsely encoded data, and the starting operation related parameter includes: an initial operation behavior, an initial operation time period;
the step of determining, among the preset users, a user to be examined whose sparsely encoded data satisfy the initial operation-related parameter includes:
determining a first cycle index value corresponding to the starting operational time period;
determining R sparse coded data comprising a first periodic index value from a plurality of sparse coded data corresponding to the plurality of users, wherein R is an integer greater than or equal to 1;
according to the initial operation behavior, determining N sparse coding data from the R sparse coding data, and determining N users corresponding to the N sparse coding data as users to be inspected, wherein operation behavior values in the N sparse coding data indicate that the users to be inspected perform the initial operation behavior on the target application program in a recording period corresponding to the first period index value.
In one possible implementation, the persistence operation-related parameter comprises: keeping operation behavior and keeping operation time period;
the determining, among the users to be examined, a surviving user for which sparsely encoded data satisfies the surviving operation-related parameter includes:
determining a second cycle index value corresponding to the retention operation time period;
determining M sparse coded data comprising a second period index value from N sparse coded data corresponding to the N users, wherein M is an integer greater than or equal to 1, and M is a value less than or equal to N;
according to the retention operation behavior, S sparse coding data are determined from the M sparse coding data, and S users to be checked corresponding to the S sparse coding data are determined as retention users, wherein operation behavior values in the S sparse coding data indicate that the retention users perform the retention operation behavior on the target application program in a recording period corresponding to the second period index value.
In a second aspect, an embodiment of the present invention provides a data processing apparatus, including:
the system comprises a first obtaining module, a second obtaining module and a third obtaining module, wherein the first obtaining module is used for obtaining sparse coded data of a preset user, and the sparse coded data is used for recording operation of the preset user on a target application program;
the second obtaining module is used for obtaining preset retention analysis parameters, and the retention analysis parameters comprise initial operation related parameters and retention operation related parameters; wherein the start operation-related parameter is used to represent a start operation performed on the target application, and the leave operation-related parameter is used to represent a leave operation performed on the target application;
the first determining module is used for determining a user to be checked of which the sparsely encoded data meet the initial operation related parameters in the preset users and determining a remaining user of which the sparsely encoded data meet the remaining operation related parameters in the user to be checked;
and the second determining module is used for determining the user retention rate based on the number of the users to be examined and the number of the retained users.
In one possible implementation, the first obtaining module includes:
a first generating unit, configured to generate a bitmap code corresponding to a single recording period of the preset user according to at least one operation performed on the target application program by the preset user in the single recording period, where the bitmap code includes at least one code value, and the code value is used to indicate whether the preset user performs the operation corresponding to the code value on the target application program;
a first obtaining unit, configured to obtain an operation behavior value corresponding to a bitmap encoding of the recording period;
a second obtaining unit, configured to obtain a period index value corresponding to the recording period;
and the second generating unit is used for generating sparse coded data corresponding to the preset user according to the operation behavior value and the cycle index value.
In an embodiment of a possible implementation, the second obtaining unit is specifically configured to:
if the operation behavior value corresponding to the bitmap coding of the recording period is not null, obtaining a period index value corresponding to the recording period;
and if the operation behavior value corresponding to the bitmap coding of the recording period is null, not executing the step of obtaining the period index value corresponding to the recording period.
In one possible implementation, the second obtaining module includes:
the output unit is used for outputting at least one retention parameter option if the user retention calculation instruction is detected;
and the receiving unit is used for receiving the operation of the user on the at least one retention parameter option and obtaining the preset retention analysis parameter according to the operation of the user on the at least one retention parameter option.
In one possible implementation, the preset user includes a plurality of users, one user corresponds to one sparsely encoded data, and the starting operation related parameter includes: an initial operation behavior, an initial operation time period;
the first determining module includes:
a first determination unit for determining a first cycle index value corresponding to the start operation period;
a second determining unit configured to determine R pieces of sparsely encoded data including a first cycle index value from among a plurality of sparsely encoded data corresponding to the plurality of users, the R being an integer greater than or equal to 1;
and a third determining unit, configured to determine, according to the starting operation behavior, N pieces of sparse coding data from the R pieces of sparse coding data, and determine, as a user to be examined, N users corresponding to the N pieces of sparse coding data, where an operation behavior value in the N pieces of sparse coding data indicates that the user to be examined has performed the starting operation behavior on the target application program in a recording cycle corresponding to the first cycle index value.
In one possible implementation, the persistence operation-related parameter comprises: keeping operation behavior and keeping operation time period;
the second determining module includes:
a fourth determining unit configured to determine a second cycle index value corresponding to the leave-on operation period;
a fifth determining unit, configured to determine M sparse coded data including a second period index value from N sparse coded data corresponding to the N users, where M is an integer greater than or equal to 1, and M is a value less than or equal to N;
a sixth determining unit, configured to determine, according to the retention operation behavior, S sparse coded data from the M sparse coded data, and determine S to-be-inspected users corresponding to the S sparse coded data as retention users, where operation behavior values in the S sparse coded data indicate that the retention user performed the retention operation behavior on the target application program in a recording cycle corresponding to the second cycle index value.
In a third aspect, an embodiment of the present invention provides a data processing apparatus, including a processor, a memory, and a communication interface, where the processor, the memory, and the communication interface are connected to each other, where the communication interface is used to receive and send data, the memory is used to store program codes, and the processor is used to call the program codes to execute the method according to the first aspect.
In a fourth aspect, an embodiment of the present invention provides a computer storage medium storing a computer program comprising program instructions that, when executed by a processor, perform the method of the first aspect.
In the embodiment of the invention, sparse coded data of a preset user is obtained, and a preset retention analysis parameter is obtained, wherein the retention analysis parameter comprises an initial operation related parameter and a retention operation related parameter, in the preset user, a user to be checked of which the sparse coded data meets the initial operation related parameter is determined, and in the user to be checked, a user to be checked of which the sparse coded data meets the retention operation related parameter is determined; and determining a user retention rate based on the number of the users to be considered and the number of the retained users. By adopting the technical scheme, each user in the target application program is subjected to sparse coding, the retention rate of the user is rapidly calculated according to the sparse coding data, and the calculation efficiency of the retention rate of the user is improved.
Drawings
In order to illustrate embodiments of the present invention or technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below.
Fig. 1 is a schematic flow chart of a data processing method according to an embodiment of the present invention;
fig. 2 is a schematic diagram of a method for generating sparse coding data corresponding to a preset user according to an embodiment of the present invention;
fig. 3 is a schematic diagram of a method for storing bitmap coding by sparse coding according to an embodiment of the present invention;
fig. 4 is a schematic diagram of a method for generating sparse coding data according to bitmap coding according to an embodiment of the present invention;
fig. 5 is a schematic diagram of a method for updating sparse coding data according to an embodiment of the present invention;
FIG. 6 is a schematic diagram of a method for obtaining retention analysis parameters according to an embodiment of the present invention;
fig. 7 is a schematic diagram of a retention rate corresponding to a user to be examined in multiple start times according to an embodiment of the present invention;
FIG. 8 is a schematic illustration of a retention rate line graph analysis provided by an embodiment of the present invention;
fig. 9 is a schematic structural diagram of a data processing apparatus according to an embodiment of the present invention;
fig. 10 is a schematic structural diagram of another data processing apparatus according to an embodiment of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be described below with reference to the drawings in the embodiments of the present invention.
The data processing method provided by the embodiment of the invention will be described in detail below with reference to fig. 1 to 8.
Fig. 1 is a schematic flow chart of a data processing method according to an embodiment of the present invention. As shown in fig. 1, the data processing method of an embodiment of the present invention may include the following steps S101 to S104.
And S101, acquiring sparse coded data of a preset user.
The sparse coding data is used for recording the operation of a preset user on a target application program, the preset user comprises a plurality of users, and one user corresponds to one sparse coding data. And each user enters the target application program, the operation behavior of each user on the target application program is recorded by using the sparse coding, and the sparse coding data with the recorded user operation behavior is stored.
Optionally, as shown in fig. 2, a schematic diagram of a method for generating sparse coding data corresponding to a preset user according to an embodiment of the present invention is provided, where the method for generating sparse coding data of the preset user may include steps S21-S24.
And S21, generating a bitmap code corresponding to the single recording period of the preset user according to at least one operation executed by the preset user on the target application program in the single recording period.
Firstly, a recording period is preset, wherein the recording period refers to a basic period for recording user operation behaviors when a user performs sparse coding after entering a target application program, and the recording period includes, but is not limited to, hours, days and the like. Note that the recording period is generally set to the analysis time granularity of the retention analysis, or a sub-unit of the analysis time granularity.
One recording period corresponds to one piece of recording data of the user, and one piece of recording data may include one or more pieces of recording sub data, where the recording sub data is used to indicate an operation behavior performed on the target application by the user in the recording period. One implementation of recording is a bitmap mode, where the recorded data includes one or more bits, and different bits correspond to different operation behaviors. If the user executes a certain operation behavior on the target application program in the recording period, the bit corresponding to the operation behavior in the recorded data of the user is 1, and conversely, if the user does not execute a certain operation behavior on the target application program in the recording period, the bit corresponding to the operation behavior in the recorded data of the user is 0. The bitmap mode is bitmap coding (bitmap coding), the bitmap coding includes at least one coding value, the coding value is used for indicating whether a preset user executes an operation corresponding to the coding value on a target application program, the at least one coding value is at least one bit, and one bit corresponds to an operation behavior of the preset user on the target application program.
Specifically, each new user obtains an ID after entering the target application, and performs bitmap encoding on the operation behavior of each new user on the target application in each recording period, where the bitmap encoding includes at least one bit, and one bit is used to indicate an operation behavior of one user in one recording period. For example, the first bit in the bitmap code is used to indicate the operation behavior of the user to log in the target application program, if the user logs in the target application program in the recording period corresponding to the bitmap code, the value of the first bit in the bitmap code is 1, and if the user does not log in the target application program in the recording period corresponding to the bitmap code, the value of the first bit in the bitmap code is 0. If the user listens to the song in the target application program in the recording period corresponding to the bitmap code, the value of the second bit in the bitmap code is 1, and if the user does not listen to the song in the target application program in the recording period corresponding to the bitmap code, the value of the second bit in the bitmap code is 0, so that the operation behavior of each new user on the target application program in each recording period is subjected to bitmap coding storage.
It should be noted that at least one bit in the bitmap coding corresponds to at least one operation behavior, and one bit corresponds to one operation behavior, and a user can customize the setting. Through the process, various operation behaviors of the user on the target application program in a plurality of different recording periods can be recorded. For example, which operation behaviors the user has performed on the target application respectively every day within one year are recorded.
And S22, obtaining an operation behavior value corresponding to the bitmap coding of the recording period.
S23, obtaining a period index value corresponding to the recording period;
and S24, generating sparse coding data corresponding to the preset user according to the operation behavior value and the cycle index value.
As shown in fig. 3, for a schematic diagram of a method for performing sparse coding storage on bitmap coding according to an embodiment of the present invention, as shown in fig. 3, after performing bitmap coding storage on an operation behavior of each user in each recording period in a target application program, performing sparse coding storage on a bitmap code corresponding to each user, where a structural schematic diagram of the sparse coding data is 4 rows of data, i.e., uid, the number of periods, a period index value, and an operation behavior value, as shown in fig. 3, and fig. 3 may be sparse coding data corresponding to one user. In FIG. 3 uid represents the user identity id; the number of cycles represents the number of recording cycles, for example, if 365 days of operation behaviors of the user are recorded in total, the number of cycles is 365; the period index value set specifically refers to a set of period index values of a recording period, where one period index value corresponds to one recording period.
If the operation behavior value corresponding to the bitmap code of the recording period is not null, the period index value corresponding to the recording period is obtained, and if the operation behavior value corresponding to the bitmap code of the recording period is null, the step of obtaining the period index value corresponding to the recording period is not executed. In order to reduce the waste of the storage space, the period index value set may only store the period index values of the recording periods in which the bitmap encoding is not empty, where the bitmap encoding is empty means that each bit in the bitmap encoding is 0, that is, there is no operation action performed on the target application program by the user in the recording period. That is, a user has an operation behavior on a target application program in a recording period, and a period index value of the recording period exists in the sparse coding data corresponding to the user. For example, if the user operates the target application program in the first recording period (for example, No. 1/month), the sparse coded data corresponding to the user identifier includes a period index value 1, that is, the period index value set includes the period index value 1, and if the user does not perform any operation on the target application program in the second recording period (for example, No. 1/month 2), the period index value set does not include the period index value 2.
The operation behavior value corresponding to the period index value in the sparse coded data represents a set of bitmap coded data, which may also be referred to as an operation behavior value set, and specifically includes bitmap coded data of a recording period corresponding to each period index value in the period index value set. In one implementation, the operation behavior value is an explicit representation of the bitmap encoded data, where the explicit representation refers to a value obtained by performing some encoding operation on the bitmap encoding. For example, as shown in fig. 3, the bitmap code corresponding to the operation behavior of a certain user on a certain day is 11000000111000001010000011101100 … …, and the display obtained through the encoding operation is represented as the first bitmap code value, that is, 0.1.
As shown in fig. 4, for a schematic diagram of generating a sparse coding method bitmap according to bitmap coding provided in an embodiment of the present invention, as shown in fig. 4, bitmap coding is performed on an operation behavior of each user in each recording period, and then sparse coding is performed according to bitmap coding corresponding to each user. Each bit in the bitmap code is used for representing one operation behavior of the user, and the whole sparse code storage represents the operation behaviors of all history recording periods of the user. The operation behaviors of all the recording periods of each user are represented by sparse coding, and the bitmap coding of a single recording period represents the operation behaviors of the user in one recording period. The sparse coding data only stores the cycle index value corresponding to the recording cycle in which the user corresponding to the user identification has the operation behavior on the target application program, and if the user corresponding to the user identification does not have the operation behavior on the target application program in a certain recording cycle, the cycle index value corresponding to a certain time cycle is not stored, so that a large amount of storage and calculation cost can be saved.
Therefore, after the bitmap code corresponding to each recording period of the user is obtained, the operation behavior value corresponding to the bitmap code of the recording period is obtained, the period index value corresponding to the recording period is obtained, and the sparse coded data corresponding to the user is generated according to the operation behavior value and the period index value.
Referring to fig. 5, a schematic diagram of a method for updating sparse coding data according to an embodiment of the present invention is shown in fig. 5, where after a user performs an operation on a target application in a newly added recording period, the operation of the user in the newly added recording period is first bitmap coded, and then the sparse coding data corresponding to the user is updated, and for specific implementation, reference is made to the following embodiments.
For example, the first bit in the bitmap coding represents login, the second bit represents listening to a song, the third bit represents watching MV, the user with ID 2019010122 logs in QQ music No. 1 month and listens to a song, logs in QQ music No. 1 month 2, listens to a song and watches MV, logs in QQ music No. 1 month 3 and watches MV, does not log in QQ music No. 1 month 4, and does not perform any operation. Firstly, after a user logs in QQ music in No. 1 month, the operation behavior executed by the user on the QQ music on the same day (period: day) is subjected to bitmap coding, and then the bitmap coding of No. 1 month is 11000000. After bitmap coding is carried out, sparse coding is carried out on bitmap coding corresponding to the No. 1 month, the number of cycles in the sparse coding data is 1, the cycle index value set comprises 1, and the operation behavior value comprises 0.1. After a user executes an operation behavior on the QQ music in month 1 and 2, bitmap coding is firstly carried out on the operation behavior in month 1 and 2, namely the bitmap coding in month 1 and 2 is 11100000, then historical sparse coded data of the user are obtained, the historical sparse coded data are updated according to the bitmap coding in month 1 and 2, the number of cycles in the updated sparse coded data is 2, the cycle index value set is {1,2}, and the operation behavior value set is {0.1,0.2 }. Similarly, the bitmap coding data of the user in the No. 1/month No. 3 is 10100000, the number of cycles in the sparse coding data is 3, the cycle index value set is {1,2,3}, and the operation behavior value set is {0.1,0.2,0.3 }; the bitmap encoding of the user at 1 month 4 is 00000000, and since the user does not perform any operation on the QQ music at 1 month 4, the cycle index value corresponding to 1 month 4 and the operation behavior value are not stored in the sparsely encoded data corresponding to the user.
S102, obtaining preset retention analysis parameters, wherein the retention analysis parameters comprise initial operation related parameters and retention operation related parameters.
Wherein the start operation related parameter is used for representing a start operation executed on the target application program, and the retention operation related parameter is used for representing a retention operation executed on the target application program. And if the user retention calculation instruction is detected, outputting at least one retention parameter option, receiving the operation of the user on the at least one retention parameter option, and obtaining a preset retention analysis parameter according to the operation of the user on the at least one retention parameter option.
Referring to fig. 6, which is a schematic diagram of a retention analysis configuration interface according to an embodiment of the present invention, as shown in fig. 6, a user may set a retention analysis table in the retention analysis configuration interface, and obtain preset retention analysis parameters according to the retention analysis table. In the retention analysis table, the user may designate the user to be examined as all users, a new user, a reflow user, or the like, or may designate the user to be examined as a user corresponding to the specific number packet ID.
The user can also customize an initial operation and a retention operation which need to analyze the retention rate of the user through the retention analysis table, wherein the initial operation refers to a certain operation behavior performed by the inspected user in an inspection initial time period, and the initial operation can also be called as a first target operation behavior; the retention operation refers to a certain operation behavior of the inspected user after the initial behavior is performed in a subsequent inspection time period, and the retention operation may also be referred to as a second target operation behavior. Both the start operation and the leave operation may be a specific behavior, for example, the start operation may be: "my" page-my-activity center-browse, the leave operation can be: and (4) listening to the songs, wherein the corresponding user retention rate is the proportion of users who listen to the songs for a period of time in the users who browse the activity center on the same day. The default state may persist for logins of the user group. It should be noted that the start operation and the leave operation may be set to any operation behaviors, such as a registration behavior, a login behavior, and the like.
The user may also set the start operation time period by using the retention analysis table, and as shown in fig. 6, the user sets start time dates 2019-01-01 to 2019-01-08, and acquires the time period as the start operation time period. It should be noted that the initial operation time period may also be a fixed time point or other arbitrary time period defined by a user.
The initial operation-related parameters include: the initial operation behavior, the initial operation time period, the initial operation, and the initial operation time period are used to determine an initial user, and specifically, a user who has performed the initial operation behavior in the initial operation time period in the user group may be determined as the initial user. The range user of the user group may also be set through the retention analysis table, as shown in fig. 6, the user group is set as a new user in the large disk users, of course, the user group is only an example, and may also be set as a user group in another range according to actual requirements. It should be noted that the initial user may also be referred to as a user to be checked.
The user may also set the analysis granularity, i.e. the size of the time period for which the user retention rate needs to be analyzed, which may also be referred to as the retention period, through the retention analysis table. The retention period means that whether the user performs the retention operation within the time period is determined. As shown in fig. 6, the analysis granularity (retention period) is day, which indicates that it is necessary to determine whether the user has performed a song listening behavior within one day. It should be noted that, the analysis granularity can be set as, but not limited to: natural day, natural week, natural month, 7 days, 30 days, respectively corresponding to day retention, week retention, month retention, retention within 7 days, and retention within 30 days.
The persistence operation-related parameters include: the retention operation behavior, the retention operation time period, the retention operation behavior, and the retention operation time period are used to determine a retention user, and specifically, a user who performs a retention operation within a retention operation time period after the initial operation time period among the initial users is determined as a retention user. With the retention operation period also referred to as the retention period.
After the initial user and the retained user are determined, the user retention rate can be calculated. The user retention rate is the ratio of the retained user to the initial user, and is generally used for measuring the product value. Generally, the user sees the startup and login retention of a large disk, but in the embodiment of the invention, the user retention rate of some fine scenes can be obtained through user-defined retention operation, for example, the user retention rate can be efficiently analyzed for the retention of a certain page song listening, a certain page video playing, the retention of core behaviors, the retention of a certain operation activity number packet and the like.
It should be noted that there may be a plurality of retention periods after the initial operation time period, and the user retention rate may be calculated for each retention period. As shown in FIG. 6, after the initial operating period 2019-01-08, the retention rate for the day may be calculated separately for each day. The user retention rate of how many retention periods are calculated can be determined by the retention analysis period, which can also be referred to as a retention observation period, and is used to represent the number of retention periods for which the user retention rate needs to be analyzed. As shown in fig. 6, the user sets the retention analysis period to 30 through the retention analysis table, which means that the daily user retention rate in 30 days after 2019-01-08 needs to be calculated.
The user can input retention analysis parameters through a retention analysis configuration interface provided by the front end, store the corresponding retention analysis parameters, trigger a background computing task, and generate the retention analysis computing task by the background according to the retention analysis parameters.
S103, in the preset users, determining to-be-checked users with sparse coded data meeting the initial operation related parameters, and in the to-be-checked users, determining to-be-checked users with sparse coded data meeting the retention operation related parameters.
And after the user to be checked and the reserved user are determined, the number of the user to be checked and the number of the reserved user can be obtained according to the stored sparse coding data.
Optionally, after obtaining the start operation related parameter according to the retention analysis table, the first cycle index value may be determined according to the start operation time period in the start operation related parameter. R sparse coded data including a first period index value are determined from a plurality of sparse coded data corresponding to a plurality of users, wherein R is an integer greater than or equal to 1. And determining N sparse coded data from the R sparse coded data according to the initial operation behavior in the initial operation related parameters, and determining N users corresponding to the N sparse coded data as users to be inspected, wherein the operation behavior values in the N sparse coded data indicate that the users to be inspected perform initial operation behaviors on the target application program in the recording period corresponding to the first period index value.
For example, the initial operation time period is 20181231 and 20190106 are obtained from the retention analysis table, and the initial operation behavior is login behavior. If the recording period is set to be days, determining a period index value corresponding to the time period according to the initial operation time period 20181231 and 20190106, wherein the first period index value comprises a plurality of period index values, namely 20181231, 20190101, 20190102, 20190103, 20190104, 20190105 and 20190106. Then R sparse coded data including the plurality of period index values are determined from a plurality of sparse coded data corresponding to a plurality of users, then operation behavior values corresponding to a first period index value, such as 0.1,0.2,0.3, 0.4, 0.5, 0.6, 0.7, are obtained from the R sparse coded data, and bitmap coding of a recording period corresponding to the first period index value is obtained according to the plurality of operation behavior values. If the first bit in the bitmap coding is set as login, according to the initial operation behavior of login, acquiring N sparse coding data with the value of 1 of the first bit in the bitmap coding from the R sparse coding data, and determining N users corresponding to the N sparse coding data as users to be inspected, namely the operation behavior values in the N sparse coding data indicate that the users to be inspected log in the target application program in the recording period corresponding to the first period index value.
And after determining the user to be inspected according to the initial operation related parameters, determining the retention user meeting the retention operation related parameters in the sparse coding data corresponding to the user to be inspected. The second cycle index value corresponding to the retention operation time period may be determined according to the retention operation time period, and then M sparse coded data including the second cycle index value are determined from the sparse coded data corresponding to the N users to be examined, where M is an integer greater than or equal to 1, and M is a value less than or equal to N. And determining S sparse coded data from the M sparse coded data according to the retention operation behavior in the retention related parameter, and determining S users to be checked corresponding to the S sparse coded data as retention users, wherein the operation behavior values in the S sparse coded data indicate that the retention users execute the retention operation behavior on the target application program in the recording period corresponding to the second period index value.
For example, if the retention operation time period is obtained from the retention analysis table as 20190107 plus 20190217, the corresponding second cycle index value is determined according to the retention operation time period, and the second cycle index value is the date of each day in which 20190107 plus 20190217, and M sparse coded data including the second cycle index value are determined from the N sparse coded data. And then obtaining the operation behavior value corresponding to the second period index value from the M sparse coding data, thereby obtaining the bitmap coding of the recording period corresponding to the second period index value. If the second bit in the bitmap coding is set as a song listening behavior, and if the reserved operation behavior is a song listening behavior, S sparse coding data with the value of the second bit being 1 in the bitmap coding are obtained from the M sparse coding data, and S users corresponding to the S sparse coding data are determined as reserved users, namely the operation behavior values in the S sparse coding data indicate that the reserved users execute the song listening operation behavior on the target application program in the recording period corresponding to the second period index value.
And S104, determining the user retention rate based on the number of the users to be considered and the number of the retained users.
And determining the user retention rate of each retention period according to the acquired number of the users to be checked and the number of the retained users in each retention period.
As shown in fig. 7, a schematic diagram of analysis of retention rates corresponding to users to be checked in multiple start times provided by the embodiment of the present invention is shown in fig. 7, where 230398 users who have performed the start operation behavior "listen to songs" on the target application in the start operation time period 20181231 and 20190106 are shown in fig. 7, and a user who performs the retention operation behavior "listen to songs" on the target application in the current week is also 230398, so the retention rate of the user in the current week is 100%. The number of users who listen to songs in the first week and carry out retention operation on the target application program is reduced to 188603, and the user retention rate after 1 week is 81.86% according to the proportion of the number of users to be checked 230398 and the number of users retained after 1 week 188603. Therefore, the user retention rate of the retention period after 2 weeks, 3 weeks and the like can be obtained through calculation, and the user retention rate of other initial operation time periods can also be calculated, and the description is not repeated.
As shown in fig. 8, a schematic diagram of a retention rate line graph analysis provided in the embodiment of the present invention is, as shown in fig. 8, line graphs of retention rates of 230398 users who have performed the initial operation behavior "listen to a song" on the target application program in the initial operation time period 20181231 and 20190106 after 1 week, 2 weeks, and 3 weeks.
In the embodiment of the invention, sparse coded data of a preset user is obtained, and a preset retention analysis parameter is obtained, wherein the retention analysis parameter comprises an initial operation related parameter and a retention operation related parameter, in the preset user, a user to be checked of which the sparse coded data meets the initial operation related parameter is determined, and in the user to be checked, a retention user of which the sparse coded data meets the retention operation related parameter is determined; and determining a user retention rate based on the number of the users to be considered and the number of the retained users. By adopting the technical scheme, compared with a complex and tedious calculation mode in the prior art, the user retention rate in a single period or multiple periods is decoded and summed only once, so that the calculation speed and efficiency are obviously improved, the problem that the calculation mode of the user retention rate in the prior art is slow in speed and low in efficiency is solved, and the effects of improving the calculation speed and the calculation efficiency are further achieved; meanwhile, the retention rate of any user can be calculated, retention analysis of the user on various behavior operations can be met by using at least one bit in bitmap encoding data, multi-cycle analysis can be met by using sparse encoding data storage, a large amount of analysis cost can be saved, the calculation speed and efficiency are obviously improved, and the problems of low speed and low efficiency of a calculation mode of the user retention rate in the prior art are solved.
Referring to fig. 9, a schematic structural diagram of a data processing apparatus is provided for an embodiment of the present invention. As shown in fig. 9, the data processing apparatus according to the embodiment of the present invention may include:
the first obtaining module 11 is configured to obtain sparse coded data of a preset user, where the sparse coded data is used to record an operation performed on a target application by the preset user;
a second obtaining module 12, configured to obtain preset retention analysis parameters, where the retention analysis parameters include an initial operation related parameter and a retention operation related parameter; wherein the start operation-related parameter is used to represent a start operation performed on the target application, and the leave operation-related parameter is used to represent a leave operation performed on the target application;
a first determining module 13, configured to determine, among the preset users, a user to be checked for which the sparsely encoded data satisfies the initial operation related parameter, and determine, among the user to be checked, a remaining user for which the sparsely encoded data satisfies the remaining operation related parameter;
a second determining module 14, configured to determine a user retention rate based on the number of users to be considered and the number of retained users.
Optionally, the first obtaining module includes a first generating unit, a first obtaining unit, a second obtaining unit, and a second generating unit.
A first generating unit, configured to generate a bitmap code corresponding to a single recording period of the preset user according to at least one operation performed on the target application program by the preset user in the single recording period, where the bitmap code includes at least one code value, and the code value is used to indicate whether the preset user performs the operation corresponding to the code value on the target application program;
a first obtaining unit configured to obtain an operation behavior value corresponding to a bitmap encoding of the recording period;
a second obtaining unit, configured to obtain a period index value corresponding to the recording period;
and the second generating unit is used for generating sparse coded data corresponding to the preset user according to the operation behavior value and the cycle index value.
Wherein the second obtaining unit is specifically configured to:
if the operation behavior value corresponding to the bitmap coding of the recording period is not null, obtaining a period index value corresponding to the recording period;
and if the operation behavior value corresponding to the bitmap coding of the recording period is null, not executing the step of obtaining the period index value corresponding to the recording period.
Optionally, the second obtaining module includes an output unit and an accepting unit.
The output unit is used for outputting at least one retention parameter option if the user retention calculation instruction is detected;
and the receiving unit is used for receiving the operation of the user on the at least one retention parameter option and obtaining the preset retention analysis parameter according to the operation of the user on the at least one retention parameter option.
Wherein the preset user comprises a plurality of users, one user corresponds to one sparse coding data, and the initial operation related parameters comprise: an initial operation behavior, an initial operation time period;
optionally, the first determining module includes a first determining unit, a second determining unit, and a third determining unit.
A first determination unit for determining a first cycle index value corresponding to the start operation period;
a second determining unit configured to determine R pieces of sparsely encoded data including a first cycle index value from among a plurality of sparsely encoded data corresponding to the plurality of users, the R being an integer greater than or equal to 1;
and a third determining unit, configured to determine, according to the starting operation behavior, N pieces of sparse coding data from the R pieces of sparse coding data, and determine, as a user to be examined, N users corresponding to the N pieces of sparse coding data, where an operation behavior value in the N pieces of sparse coding data indicates that the user to be examined has performed the starting operation behavior on the target application program in a recording cycle corresponding to the first cycle index value.
Wherein the persistence operation-related parameters include: retention operation behavior, retention operation time period.
The second determination module comprises a fourth determination unit, a fifth determination unit and a sixth determination unit.
A fourth determining unit configured to determine a second cycle index value corresponding to the leave-on operation period;
a fifth determining unit, configured to determine M sparse coded data including a second period index value from N sparse coded data corresponding to the N users, where M is an integer greater than or equal to 1, and M is a value less than or equal to N;
a sixth determining unit, configured to determine, according to the retention operation behavior, S sparse coded data from the M sparse coded data, and determine S to-be-inspected users corresponding to the S sparse coded data as retention users, where operation behavior values in the S sparse coded data indicate that the retention user performed the retention operation behavior on the target application program in a recording cycle corresponding to the second cycle index value.
It should be noted that, for a specific execution process, reference may be made to the specific description of the above data processing method embodiment, which is not described herein again.
In the embodiment of the invention, sparse coded data of a preset user is obtained, and a preset retention analysis parameter is obtained, wherein the retention analysis parameter comprises an initial operation related parameter and a retention operation related parameter, in the preset user, a user to be checked of which the sparse coded data meets the initial operation related parameter is determined, and in the user to be checked, a retention user of which the sparse coded data meets the retention operation related parameter is determined; and determining a user retention rate based on the number of the users to be considered and the number of the retained users. By adopting the technical scheme, compared with a complex and tedious calculation mode in the prior art, the user retention rate in a single period or multiple periods is decoded and summed only once, so that the calculation speed and efficiency are obviously improved, the problem that the calculation mode of the user retention rate in the prior art is slow in speed and low in efficiency is solved, and the effects of improving the calculation speed and the calculation efficiency are further achieved; meanwhile, the retention rate of any user can be calculated, retention analysis of the user on various behavior operations can be met by using at least one bit in bitmap encoding data, multi-cycle analysis can be met by using sparse encoding data storage, a large amount of analysis cost can be saved, the calculation speed and efficiency are obviously improved, and the problems of low speed and low efficiency of a calculation mode of the user retention rate in the prior art are solved.
Referring to fig. 10, which is a schematic structural diagram of another data processing apparatus according to an embodiment of the present invention, as shown in fig. 10, the data processing apparatus 1000 may include: at least one processor 1001, such as a CPU, at least one communication interface 1003, memory 1004, at least one communication bus 1002. Wherein a communication bus 1002 is used to enable connective communication between these components. The communication interface 1003 may optionally include a standard wired interface, a wireless interface (e.g., WI-FI interface). The memory 1004 may be a high-speed RAM memory or a non-volatile memory (e.g., at least one disk memory). The memory 1004 may optionally be at least one storage device located remotely from the processor 1001. As shown in fig. 10, memory 1004, which is a type of computer storage medium, may include an operating system, a network communication module, and program instructions.
In the data processing apparatus 1000 shown in fig. 10, the processor 1001 may be configured to load program instructions stored in the memory 1004 and specifically perform the following operations:
acquiring sparse coded data of a preset user, wherein the sparse coded data is used for recording the operation of the preset user on a target application program;
obtaining preset retention analysis parameters, wherein the retention analysis parameters comprise initial operation related parameters and retention operation related parameters; wherein the start operation-related parameter is used to represent a start operation performed on the target application, and the leave operation-related parameter is used to represent a leave operation performed on the target application;
determining a user to be checked of which the sparse coded data meets the initial operation related parameters in the preset users, and determining a retention user of which the sparse coded data meets the retention operation related parameters in the user to be checked;
and determining a user retention rate based on the number of the users to be considered and the number of the retained users.
In one possible implementation example, the generating step of the sparse coding data of the preset user includes:
generating a bitmap code corresponding to a single recording period of the preset user according to at least one operation executed on the target application program by the preset user in the single recording period, wherein the bitmap code comprises at least one code value, and the code value is used for indicating whether the preset user executes the operation corresponding to the code value on the target application program;
obtaining an operation behavior value corresponding to the bitmap coding of the recording period;
obtaining a period index value corresponding to the recording period;
and generating sparse coding data corresponding to the preset user according to the operation behavior value and the cycle index value.
In an embodiment of a possible implementation, the obtaining a period index value corresponding to the recording period includes:
if the operation behavior value corresponding to the bitmap coding of the recording period is not null, obtaining a period index value corresponding to the recording period;
if the operation behavior value corresponding to the bitmap encoding of the recording period is null, the step of obtaining the period index value corresponding to the recording period is not executed;
in one possible implementation, the obtaining the preset retention analysis parameter includes:
if the user retention calculation instruction is detected, outputting at least one retention parameter option;
and receiving the operation of the user on the at least one retention parameter option, and obtaining a preset retention analysis parameter according to the operation of the user on the at least one retention parameter option.
In one possible implementation, the preset user includes a plurality of users, one user corresponds to one sparsely encoded data, and the starting operation related parameter includes: an initial operation behavior, an initial operation time period;
the step of determining, among the preset users, a user to be examined whose sparsely encoded data satisfy the initial operation-related parameter includes:
determining a first cycle index value corresponding to the starting operational time period;
determining R sparse coded data comprising a first periodic index value from a plurality of sparse coded data corresponding to the plurality of users, wherein R is an integer greater than or equal to 1;
according to the initial operation behavior, determining N sparse coding data from the R sparse coding data, and determining N users corresponding to the N sparse coding data as users to be inspected, wherein operation behavior values in the N sparse coding data indicate that the users to be inspected perform the initial operation behavior on the target application program in a recording period corresponding to the first period index value.
In one possible implementation, the persistence operation-related parameter comprises: keeping operation behavior and keeping operation time period;
the determining, among the users to be examined, a surviving user for which sparsely encoded data satisfies the surviving operation-related parameter includes:
determining a second cycle index value corresponding to the retention operation time period;
determining M sparse coded data comprising a second period index value from N sparse coded data corresponding to the N users, wherein M is an integer greater than or equal to 1, and M is a value less than or equal to N;
according to the retention operation behavior, S sparse coding data are determined from the M sparse coding data, and S users to be checked corresponding to the S sparse coding data are determined as retention users, wherein operation behavior values in the S sparse coding data indicate that the retention users perform the retention operation behavior on the target application program in a recording period corresponding to the second period index value.
It should be noted that, for a specific implementation process, reference may be made to specific descriptions of the method embodiment shown in fig. 1, which are not described herein again.
An embodiment of the present invention further provides a computer storage medium, where the computer storage medium may store a plurality of instructions, where the instructions are suitable for being loaded by a processor and executing the method steps in the embodiment shown in fig. 1, and a specific execution process may refer to a specific description of the embodiment shown in fig. 1, which is not described herein again.
It will be understood by those skilled in the art that all or part of the processes of the methods of the embodiments described above can be implemented by a computer program, which can be stored in a computer-readable storage medium, and includes processes of the embodiments of the methods described above when the computer program is executed. The storage medium may be a magnetic disk, an optical disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), or the like.

Claims (14)

1. A data processing method, comprising:
acquiring sparse coded data of a preset user, wherein the sparse coded data is used for recording the operation of the preset user on a target application program;
obtaining preset retention analysis parameters, wherein the retention analysis parameters comprise initial operation related parameters and retention operation related parameters; wherein the start operation-related parameter is used to represent a start operation performed on the target application, and the leave operation-related parameter is used to represent a leave operation performed on the target application;
determining a user to be checked of which the sparse coded data meets the initial operation related parameters in the preset users, and determining a retention user of which the sparse coded data meets the retention operation related parameters in the user to be checked;
and determining a user retention rate based on the number of the users to be considered and the number of the retained users.
2. The method of claim 1, wherein the generating of the sparse coding data of the preset user comprises:
generating a bitmap code corresponding to a single recording period of the preset user according to at least one operation executed on the target application program by the preset user in the single recording period, wherein the bitmap code comprises at least one code value, and the code value is used for indicating whether the preset user executes the operation corresponding to the code value on the target application program;
obtaining an operation behavior value corresponding to the bitmap coding of the recording period;
obtaining a period index value corresponding to the recording period;
and generating sparse coding data corresponding to the preset user according to the operation behavior value and the cycle index value.
3. The method of claim 2, wherein the obtaining a period index value corresponding to the recording period comprises:
if the operation behavior value corresponding to the bitmap coding of the recording period is not null, obtaining a period index value corresponding to the recording period;
and if the operation behavior value corresponding to the bitmap coding of the recording period is null, not executing the step of obtaining the period index value corresponding to the recording period.
4. The method of claim 1 or 3, wherein said obtaining preset retention analysis parameters comprises:
if the user retention calculation instruction is detected, outputting at least one retention parameter option;
and receiving the operation of the user on the at least one retention parameter option, and obtaining a preset retention analysis parameter according to the operation of the user on the at least one retention parameter option.
5. The method of claim 1, wherein the predetermined users include a plurality of users, one user corresponding to one sparsely encoded data, and the starting operation related parameters include: an initial operation behavior, an initial operation time period;
the step of determining, among the preset users, a user to be examined whose sparsely encoded data satisfy the initial operation-related parameter includes:
determining a first cycle index value corresponding to the starting operational time period;
determining R sparse coded data comprising a first periodic index value from a plurality of sparse coded data corresponding to the plurality of users, wherein R is an integer greater than or equal to 1;
according to the initial operation behavior, determining N sparse coding data from the R sparse coding data, and determining N users corresponding to the N sparse coding data as users to be inspected, wherein operation behavior values in the N sparse coding data indicate that the users to be inspected perform the initial operation behavior on the target application program in a recording period corresponding to the first period index value.
6. The method of claim 1, wherein said persisting operation-related parameters comprises: keeping operation behavior and keeping operation time period;
the determining, among the users to be examined, a surviving user for which sparsely encoded data satisfies the surviving operation-related parameter includes:
determining a second cycle index value corresponding to the retention operation time period;
determining M sparse coded data comprising a second period index value from N sparse coded data corresponding to the N users, wherein M is an integer greater than or equal to 1, and M is a value less than or equal to N;
according to the retention operation behavior, S sparse coding data are determined from the M sparse coding data, and S users to be checked corresponding to the S sparse coding data are determined as retention users, wherein operation behavior values in the S sparse coding data indicate that the retention users perform the retention operation behavior on the target application program in a recording period corresponding to the second period index value.
7. A data processing apparatus, comprising:
the system comprises a first obtaining module, a second obtaining module and a third obtaining module, wherein the first obtaining module is used for obtaining sparse coded data of a preset user, and the sparse coded data is used for recording operation of the preset user on a target application program;
the second obtaining module is used for obtaining preset retention analysis parameters, and the retention analysis parameters comprise initial operation related parameters and retention operation related parameters; wherein the start operation-related parameter is used to represent a start operation performed on the target application, and the leave operation-related parameter is used to represent a leave operation performed on the target application;
the first determining module is used for determining a user to be checked of which the sparsely encoded data meet the initial operation related parameters in the preset users and determining a remaining user of which the sparsely encoded data meet the remaining operation related parameters in the user to be checked;
and the second determining module is used for determining the user retention rate based on the number of the users to be examined and the number of the retained users.
8. The apparatus of claim 7, wherein the first obtaining module comprises:
a first generating unit, configured to generate a bitmap code corresponding to a single recording period of the preset user according to at least one operation performed on the target application program by the preset user in the single recording period, where the bitmap code includes at least one code value, and the code value is used to indicate whether the preset user performs the operation corresponding to the code value on the target application program;
a first obtaining unit configured to obtain an operation behavior value corresponding to a bitmap encoding of the recording period;
a second obtaining unit, configured to obtain a period index value corresponding to the recording period;
and the second generating unit is used for generating sparse coded data corresponding to the preset user according to the operation behavior value and the cycle index value.
9. The apparatus according to claim 8, wherein the second obtaining unit is specifically configured to:
if the operation behavior value corresponding to the bitmap coding of the recording period is not null, obtaining a period index value corresponding to the recording period;
and if the operation behavior value corresponding to the bitmap coding of the recording period is null, not executing the step of obtaining the period index value corresponding to the recording period.
10. The apparatus of claim 7 or 9, wherein the second obtaining module comprises:
the output unit is used for outputting at least one retention parameter option if the user retention calculation instruction is detected;
and the receiving unit is used for receiving the operation of the user on the at least one retention parameter option and obtaining the preset retention analysis parameter according to the operation of the user on the at least one retention parameter option.
11. The apparatus of claim 7, wherein the predetermined users comprise a plurality of users, one user for each sparsely encoded data, and the initial operation-related parameters comprise: an initial operation behavior, an initial operation time period;
the first determining module includes:
a first determination unit for determining a first cycle index value corresponding to the start operation period;
a second determining unit configured to determine R pieces of sparsely encoded data including a first cycle index value from among a plurality of sparsely encoded data corresponding to the plurality of users, the R being an integer greater than or equal to 1;
and a third determining unit, configured to determine, according to the starting operation behavior, N pieces of sparse coding data from the R pieces of sparse coding data, and determine, as a user to be examined, N users corresponding to the N pieces of sparse coding data, where an operation behavior value in the N pieces of sparse coding data indicates that the user to be examined has performed the starting operation behavior on the target application program in a recording cycle corresponding to the first cycle index value.
12. The apparatus of claim 7, wherein the persistence operation-related parameter comprises: keeping operation behavior and keeping operation time period;
the second determining module includes:
a fourth determining unit configured to determine a second cycle index value corresponding to the leave-on operation period;
a fifth determining unit, configured to determine M sparse coded data including a second period index value from N sparse coded data corresponding to the N users, where M is an integer greater than or equal to 1, and M is a value less than or equal to N;
a sixth determining unit, configured to determine, according to the retention operation behavior, S sparse coded data from the M sparse coded data, and determine S to-be-inspected users corresponding to the S sparse coded data as retention users, where operation behavior values in the S sparse coded data indicate that the retention user performed the retention operation behavior on the target application program in a recording cycle corresponding to the second cycle index value.
13. A data processing apparatus comprising a processor, a memory and a communication interface, the processor, the memory and the communication interface being interconnected, wherein the communication interface is configured to receive and transmit data, the memory is configured to store program code, and the processor is configured to invoke the program code to perform a method according to any one of claims 1 to 6.
14. A computer-readable storage medium, characterized in that the computer-readable storage medium stores a computer program which is executed by a processor to implement the method of any one of claims 1 to 6.
CN202010200785.2A 2020-03-20 2020-03-20 Data processing method and device Active CN111415200B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010200785.2A CN111415200B (en) 2020-03-20 2020-03-20 Data processing method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010200785.2A CN111415200B (en) 2020-03-20 2020-03-20 Data processing method and device

Publications (2)

Publication Number Publication Date
CN111415200A true CN111415200A (en) 2020-07-14
CN111415200B CN111415200B (en) 2023-05-26

Family

ID=71491265

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010200785.2A Active CN111415200B (en) 2020-03-20 2020-03-20 Data processing method and device

Country Status (1)

Country Link
CN (1) CN111415200B (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112269726A (en) * 2020-10-22 2021-01-26 腾讯音乐娱乐科技(深圳)有限公司 Data processing method and device
CN113269370A (en) * 2021-06-18 2021-08-17 腾讯科技(成都)有限公司 Active user prediction method and device, electronic equipment and readable storage medium

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105630972A (en) * 2015-12-24 2016-06-01 网易(杭州)网络有限公司 Data processing method and device
CN106997431A (en) * 2016-01-22 2017-08-01 阿里巴巴集团控股有限公司 A kind of data processing method and device

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105630972A (en) * 2015-12-24 2016-06-01 网易(杭州)网络有限公司 Data processing method and device
CN106997431A (en) * 2016-01-22 2017-08-01 阿里巴巴集团控股有限公司 A kind of data processing method and device

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112269726A (en) * 2020-10-22 2021-01-26 腾讯音乐娱乐科技(深圳)有限公司 Data processing method and device
CN113269370A (en) * 2021-06-18 2021-08-17 腾讯科技(成都)有限公司 Active user prediction method and device, electronic equipment and readable storage medium
CN113269370B (en) * 2021-06-18 2023-12-12 腾讯科技(成都)有限公司 Active user prediction method and device, electronic equipment and readable storage medium

Also Published As

Publication number Publication date
CN111415200B (en) 2023-05-26

Similar Documents

Publication Publication Date Title
CN108647329B (en) User behavior data processing method and device and computer readable storage medium
CN111415200B (en) Data processing method and device
CN111431926A (en) Data association analysis method, system, equipment and readable storage medium
CN111679979A (en) Destructive testing method and device
CN107729375B (en) Log data sorting method and device
CN113434396A (en) Interface test method, device, equipment, storage medium and program product
CN110955438A (en) Method, device and equipment for monitoring performance of small program and storage medium
CN110728118B (en) Cross-data-platform data processing method, device, equipment and storage medium
CN109558403B (en) Data aggregation method and device, computer device and computer readable storage medium
CN115688853A (en) Process mining method and system
CN113485864B (en) Abnormality detection, abnormality analysis method, abnormality detection device, abnormality analysis device, electronic apparatus, and storage medium
CN117131059A (en) Report data processing method, device, equipment and storage medium
CN107508705A (en) The resource tree constructing method and computing device of a kind of HTTP elements
CN114661571A (en) Model evaluation method, model evaluation device, electronic equipment and storage medium
CN113326237A (en) Log data processing method and device, terminal device and storage medium
CN113806231A (en) Code coverage rate analysis method, device, equipment and medium
CN112184027A (en) Task progress updating method and device and storage medium
CN113760497A (en) Scheduling task configuration method and device
CN110674839A (en) Abnormal user identification method and device, storage medium and electronic equipment
CN111353860A (en) Product information pushing method and system
CN117493777B (en) Ultrasonic flowmeter data cleaning method, system and device based on Internet of things
CN111309623B (en) Coordinate class data classification test method and device
CN117748498A (en) Method, device, equipment and storage medium for predicting electric load
CN117455124B (en) Environment-friendly equipment monitoring method, system, medium and electronic equipment for enterprises
CN117195568B (en) Simulation engine performance analysis method and device based on discrete event

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant