CN106484688A - A kind of data processing method and system - Google Patents

A kind of data processing method and system Download PDF

Info

Publication number
CN106484688A
CN106484688A CN201510522784.9A CN201510522784A CN106484688A CN 106484688 A CN106484688 A CN 106484688A CN 201510522784 A CN201510522784 A CN 201510522784A CN 106484688 A CN106484688 A CN 106484688A
Authority
CN
China
Prior art keywords
data
exposure
channel
news
list
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201510522784.9A
Other languages
Chinese (zh)
Other versions
CN106484688B (en
Inventor
黄艳香
向宇
徐钊
张文郁
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tencent Technology Shenzhen Co Ltd
Original Assignee
Tencent Technology Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tencent Technology Shenzhen Co Ltd filed Critical Tencent Technology Shenzhen Co Ltd
Priority to CN201510522784.9A priority Critical patent/CN106484688B/en
Publication of CN106484688A publication Critical patent/CN106484688A/en
Application granted granted Critical
Publication of CN106484688B publication Critical patent/CN106484688B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/957Browsing optimisation, e.g. caching or content distillation
    • G06F16/9574Browsing optimisation, e.g. caching or content distillation of access to content, e.g. by caching

Abstract

Data processing method and system that the present invention is provided, configure the ID of first data and second data and the one-to-one relation of the 3rd data and using first data as data cached, inquire about according to the ID that to obtain the ID corresponding described data cached in the relation, according to second data query for being operated data cached described in corresponding 3rd data of the second data for being operated, corresponding for the second clicked data the 3rd data are used as True Data for backstage when judging that corresponding 3rd data of the second data for being operated meet pre-conditioned, True Data is only counted, improve the accuracy of back-end data process.

Description

A kind of data processing method and system
Technical field
The present invention relates to field of mobile terminals, more particularly to a kind of data processing method and system.
Background technology
With the development of the intelligent mobile product such as mobile phone, mobile terminal application is developed on an unprecedented scale, such as wechat, mobile phone terminal Tengxun news etc..In these mobile terminal applications, generally for Consumer's Experience is improved, can there is certain " pulling in advance " process.Describe for convenience, a user interface at mobile phone news end is illustrated in conjunction with Fig. 1, for enabling user to read the news of different channel when sliding to the left and right glibly, mobile phone news end can be when user clicks on " video " channel, just in advance the news of the channel such as " Guangdong ", " finance and economics ", " amusement " is loaded in subscription client, so as to produce one group of exposure data to user.This pre- pulling processes the exposure data for producing we term it pseudo- exposure data, because actually user does not also see the news of the channels such as " Guangdong " in example, " finance and economics ", " amusement ", in the exposure for counting news and click data, if being not added with distinguishing, this puppet exposure data can cause the statistics of mistake.
In order to solve above-mentioned problem, common data statistics is using offline storage and the mode of regular computing, first the user behavior data in one period is concentrated and be transferred in a distributed file system offline, then periodically off-line data is counted, in this case, whole behaviors of user are all visible, pseudo- exposure data and real exposure data can be distinguished by behavior of the user after pulling in advance, news under " video " and " amusement " channel of user's actual click generates real exposure, and " Guangdong " that user does not click on, the pre- exposure data that pulls under channels such as " finance and economicss " will be dropped, it is not involved in statistics.
As off-line data statistical project is using the calculation of batch processing, first stores data in disk, then periodically processed, this computation schema can not produce real-time statistics, bring larger time delay, it is impossible to meet current real-time requirement.
Content of the invention
In view of this, a kind of data processing method and system are embodiments provided.
It is an object of the present invention to provide a kind of data processing method, including:
Obtain the first data that user operation client is produced, wherein, the user has the ID for identity, first data include ID, multiple second data corresponding with the ID and multiple 3rd data, and second data are corresponded with the 3rd data;
Configure the ID of first data and second data and the one-to-one relation of the 3rd data and using first data as data cached;
Obtain the Object Operations data of user operation client generation, the second data that the Object Operations data include the ID, operated, wherein, the object that operated is one in the plurality of second data;
Inquire about according to the ID that to obtain the ID corresponding described data cached in the relation;
According to second data query for being operated data cached described in corresponding 3rd data of the second data for being operated;
Corresponding for the second clicked data the 3rd data are used as True Data for backstage when judging that corresponding 3rd data of the second data for being operated meet pre-conditioned.
Further, first data are original exposure data, second data are exposure channel, 3rd data are exposure channel news ID list, the Object Operations data are channel click data, first data for obtaining the generation of user operation client, wherein, the user has the ID for identity, first data include ID, multiple second data corresponding with the ID and multiple 3rd data, second data are corresponded with the 3rd data, including:
Obtain the original exposure data that user operation client is produced, wherein, the user has the ID for identity, the original exposure data include ID, multiple exposure channels corresponding with the ID and multiple exposure channel news ID lists, and the exposure channel is corresponded with exposure channel news ID list;
The ID of the configuration first data and second data and the one-to-one relation of the 3rd data using first data as data cached, including:
By the original exposure data with the ID as key, be stored in the KV storage system based on internal memory as value and cache as exposure to expose channel and exposure channel news ID list;
Obtain the Object Operations data of user operation client generation, the second data that the Object Operations data include the ID, operated, wherein, the object that operated is one in the plurality of second data, including:
The channel click data of user operation client generation is obtained, the channel click data includes the ID, clicked exposure channel, wherein, the clicked channel is in the plurality of exposure channel;
Inquire about according to the ID that to obtain the ID corresponding described data cached in the relation, including:
The corresponding exposure caching of the ID is inquired about in the KV storage system according to the ID;
According to second data query for being operated data cached described in corresponding 3rd data of the second data for being operated, including:
The corresponding exposure channel news ID list of clicked exposure channel described in exposure caching according to the clicked exposure channel query;
Corresponding for the second clicked data the 3rd data are used as True Data for backstage when judging that corresponding 3rd data of the second data for being operated meet pre-conditioned, including:
Corresponding for clicked exposure channel exposure channel news ID list is used as true exposure data for backstage when judging that the corresponding exposure channel news ID list of clicked exposure channel meets pre-conditioned.
Further, the exposure channel news ID list includes time for exposure and effective time,
Before corresponding for clicked exposure channel exposure channel news ID list is used for backstage when meeting pre-conditioned by the corresponding exposure channel news ID list of the clicked exposure channel of the judgement as true exposure data, also include:
Obtain the click behavior time of origin of clicked exposure channel;
Calculate the time difference between the time for exposure of the clicked exposure channel news ID list and the click behavior time of origin;
It is described that to judge that the corresponding exposure channel news ID list of clicked exposure channel meets pre-conditioned, including:
The corresponding exposure channel news ID list of then clicked exposure channel when being not more than the effective time of the time difference meets pre-conditioned.
Further, after the corresponding exposure channel news ID list of clicked described in exposure caching exposure channel according to the clicked exposure channel query, also include:
Corresponding for clicked exposure channel exposure channel news ID list is abandoned as pseudo- exposure data when judging that the corresponding exposure channel news ID list of clicked exposure channel does not meet pre-conditioned.
Further, described according to the ID inquire about in the KV storage system ID corresponding described exposure caching before, also include:
The news click data of user operation client generation is obtained, the news is clicked on packet and the ID, news are located exposure channel and news ID is included, wherein, exposure channel that the news is located is in the plurality of exposure channel;
Described according to the ID inquire about in the KV storage system ID corresponding described exposure caching after, also include:
According to the corresponding exposure channel news ID list of exposure channel that news described in exposure caching described in exposure channel query that the news is located is located;
The corresponding positional information of news ID is obtained according to news ID in the news is located the corresponding exposure channel news ID list of exposure channel, so that the positional information is available for backstage use.
Further, the effective time is random time in 1 minute to 10 minutes.
It is a further object to provide a kind of data handling system, including:
First acquisition unit, for obtaining the first data of user operation client generation, wherein, the user has the ID for identity, first data include ID, multiple second data corresponding with the ID and multiple 3rd data, and second data are corresponded with the 3rd data;
Memory cell, for configuring the ID of first data and second data and the one-to-one relation of the 3rd data and using first data as data cached;
Second acquisition unit, for obtaining the Object Operations data of user operation client generation, the second data that the Object Operations data include the ID, operated, wherein, the object that operated is one in the plurality of second data;
First query unit, for being inquired about in the relation according to the ID, to obtain the ID corresponding described data cached;
Second query unit, corresponding 3rd data of the second data for being operated described in data cached according to second data query for being operated;
Corresponding for the second clicked data the 3rd data are used for backstage when corresponding 3rd data of the second data for judging to be operated meet pre-conditioned by the first judging unit as True Data.
Further, first acquisition unit, for obtaining the original exposure data of user operation client generation, wherein, the user has the ID for identity, the original exposure data include ID, multiple exposure channels corresponding with the ID and multiple exposure channel news ID lists, and the exposure channel is corresponded with exposure channel news ID list;
Memory cell, for by the original exposure data with the ID as key, be stored in the KV storage system based on internal memory as value and cache as exposure to expose channel and exposure channel news ID list;
Second acquisition unit, for the channel click data that the user operation client is produced, the channel click data includes the first user ID, clicked exposure channel, and wherein, the clicked channel is in the plurality of exposure channel;
First query unit, for inquiring about the corresponding exposure caching of the ID according to the ID in the KV storage system;
Second query unit, for exposing the corresponding exposure channel news ID list of clicked exposure channel described in caching according to the clicked exposure channel query;
First judging unit, for judging when the corresponding exposure channel news ID list of clicked exposure channel meets pre-conditioned to use corresponding for clicked exposure channel exposure channel news ID list as true exposure data for backstage.
Further, the exposure channel news ID list includes time for exposure and effective time, and the data handling system also includes:
3rd acquiring unit, for obtaining the click behavior time of origin of clicked exposure channel;
Computing unit, for calculating the time difference between the time for exposure of the clicked exposure channel news ID list and the click behavior time of origin;
First judging unit be additionally operable to the corresponding exposure channel news ID list of the then clicked exposure channel when the time difference is not more than the effective time meet pre-conditioned.
Further, the data handling system also includes:
Second judging unit, for judging when the corresponding exposure channel news ID list of clicked exposure channel does not meet pre-conditioned to abandon corresponding for clicked exposure channel exposure channel news ID list as pseudo- exposure data.
Further, the data handling system also includes:
4th acquiring unit, for the news click data that the user operation client is produced, the news is clicked on packet and includes the ID, news are located exposure channel and news ID, and wherein, exposure channel that the news is located is in the plurality of exposure channel;
3rd query unit, for according to the corresponding exposure channel news ID list of exposure channel that news described in exposure caching described in exposure channel query that the news is located is located;
4th query unit, for obtaining the corresponding positional information of news ID according to news ID in the news is located the corresponding exposure channel news ID list of exposure channel, so that the positional information is available for backstage use.
As can be seen from the above technical solutions, the embodiment of the present invention has advantages below:
Data processing method and system that the present invention is provided, configure the ID of first data and second data and the one-to-one relation of the 3rd data and using first data as data cached, inquire about according to the ID that to obtain the ID corresponding described data cached in the relation, according to second data query for being operated data cached described in corresponding 3rd data of the second data for being operated, corresponding for the second clicked data the 3rd data are used as True Data for backstage when judging that corresponding 3rd data of the second data for being operated meet pre-conditioned, True Data is only counted, improve the accuracy of back-end data process.
Description of the drawings
Fig. 1 is the schematic diagram at prior art mobile phone news end;
Fig. 2 a is a kind of flow chart of embodiment of the data processing method that the present invention is provided;
Fig. 2 b is the flow chart of another kind of embodiment of the data processing method that the present invention is provided;
Fig. 3 is the flow chart of another kind of embodiment of the data processing method that the present invention is provided;
Fig. 4 is the flow chart of another kind of embodiment of the data processing method that the present invention is provided;
Fig. 5 is a kind of structure chart of embodiment of the data handling system that the present invention is provided.
Specific embodiment
In order that those skilled in the art more fully understand the present invention program, below in conjunction with the accompanying drawing in the embodiment of the present invention, technical scheme in the embodiment of the present invention is clearly and completely described, obviously, described embodiment is only the embodiment of a present invention part, rather than whole embodiments.Based on the embodiment in the present invention, the every other embodiment obtained under the premise of creative work is not made by those of ordinary skill in the art, should all belong to the scope of protection of the invention.
Term " first ", " second ", " the 3rd " " the 4th " in description and claims of this specification and above-mentioned accompanying drawing etc. be for distinguishing similar object, without for describing specific order or precedence.It should be appreciated that the data for so using can be exchanged in the appropriate case, so as to the embodiments described herein can with except illustrate here or the content of description in addition to order implement.In addition, term " comprising " and " having " and their any deformation, it is intended to cover non-exclusive including, for example, process, method, system, product or the equipment for containing series of steps or unit is not necessarily limited to those steps that clearly lists or unit, but may include clearly not list or for other intrinsic steps of these processes, method, product or equipment or unit.
In conjunction with shown in Fig. 2 a, a kind of embodiment of the data processing method that the present invention is provided, including:
The first data that S1, acquisition user operation client are produced, wherein, the user has the ID for identity, first data include ID, multiple second data corresponding with the ID and multiple 3rd data, and second data are corresponded with the 3rd data.
In the present embodiment, first data can be original exposure data, second data can be exposure channel, 3rd data can be exposure channel news ID list, the Object Operations data can be channel click data, user is when using Client browse web page news, client can carry out prestretching extract operation, the news ID list of adjacent channel is got out for client in advance, this partial data is used as original exposure data, each user can have an ID, for identifying the identity of user, each ID can produce original exposure data, each ID can correspond to several exposure channels, each exposure channel corresponds to an exposure channel news ID list.
S2, configure the ID of first data and second data and the one-to-one relation of the 3rd data and using first data as data cached.
Referred to herein as relation be in order to set up the relation that is inquired about using ID,Can be carried out using KV storage system,It is the storage system using key-value for form for KV storage,Corresponding value can be inquired by key,Those of ordinary skill in the art are understood that,So the implication to KV storage is not specifically introduced,By the use of ID as key,I.e. in key value list,Can there are multiple IDs,It is not limited only to first user ID,When KV storage system is built,Using first user ID as key,First user ID is then operated the exposure channel that correspondence is pulled in advance and the list of exposure channel news ID as value,Exposure channel and the list of exposure channel news ID are corresponded,The corresponding pre- exposure channel for pulling and exposure channel news ID list can be inquired using first user ID.
S3, the Object Operations data that the user operation client is produced are obtained, the second data that the Object Operations data include the ID, operated, wherein, the object that operated is one in the plurality of second data.
The second data for being operated can be clicked exposure channel, that is the channel of user's practical operation, after setting up KV storage system, when the channel click data for receiving first user ID operation client generation, that is first user ID produces channel click data by clicking certain channel, now the user of explanation first user ID is browsing the news under the channel, so it is considered that being that user sees to the news ID list that the channel is pulled in advance, can use as true exposure data, the channel click data includes the first user ID, clicked exposure channel, wherein, the clicked channel is in the plurality of exposure channel, clicked exposure channel is the channel that the user of first user ID currently browses.
S4, inquire about according to the ID that to obtain the ID corresponding described data cached in the relation.
Because in the relation ID and data cached be one-to-one, can be according to ID inquiry to corresponding data cached, characteristic using KV storage system, can be inquired about for key according to first user ID and obtain exposure caching corresponding with the first user ID, i.e., the exposure channel that first user ID is pulled in advance and exposure channel news ID list in database.
S5, data cached according to second data query for being operated described in corresponding 3rd data of the second data for being operated.
After obtaining the corresponding exposure channel of first user ID and exposure channel news ID list, the information in clicked exposure channel query exposure caching is recycled, the corresponding exposure channel news ID list of clicked exposure channel can be obtained.
S6, corresponding for the second clicked data the 3rd data are used as True Data for backstage when judging that corresponding 3rd data of the second data for being operated meet pre-conditioned.
Validation verification is carried out for the 3rd data for obtaining, just use for backstage when meeting pre-conditioned, but when the 3rd data are for exposure channel news list, can be using the generation time of record exposure channel news ID list, and set effective time, the time of the click behavior of channel is defined by receiving user, calculate whether time interval is not more than effective time, if being not more than, this can determine that the data for obtaining are limited, corresponding for clicked exposure channel exposure channel news ID list can be used as true exposure data for backstage, i.e. user has really browsed the information of the channel, backstage is using can count to true exposure data, as the reference that news is recommended, improve the precision of news recommendation.
The data processing method that the present invention is provided, the original exposure data for being pulled generation in advance carry out KV storage, recycle the ID in channel click data to inquire about in the database that KV is stored and obtain exposure caching, exposure channel news ID list will be inquired in exposure caching using clicked channel, the exposure channel news ID list obtained by the use of clicked channel query is used as true exposure data for backstage, so that backstage is when being counted, the exposure channel news ID list that only clicks on through channel in statistics original exposure data, true exposure data is only counted, and original exposure data are screened with pre-conditioned using KV storage, improve the accuracy of back-end data process.
In conjunction with shown in Fig. 2 b, the data processing method of the present invention additionally provides a kind of embodiment, wherein, first data are original exposure data, and second data are exposure channel, and the 3rd data are exposure channel news ID list, the Object Operations data are channel click data, and methods described includes:
The original exposure data that S101, acquisition user operation client are produced, wherein, the user has the ID for identity, the original exposure data include ID, multiple exposure channels corresponding with the ID and multiple exposure channel news ID lists, and the exposure channel is corresponded with exposure channel news ID list.
User is when using Client browse web page news,Client can carry out prestretching extract operation,The news ID list of adjacent channel is got out for client in advance,This partial data is used as original exposure data,Each user can have an ID,For identifying the identity of user,Each ID can produce original exposure data,Each ID can correspond to several exposure channels,Each exposure channel corresponds to an exposure channel news ID list,And each exposure channel news ID list is just had determined when original exposure data are produced,The corresponding exposure channel of multiple IDs and exposure channel news ID list can be included in initial data data,Corresponding original exposure data can be all produced when different IDs is operated,For example,The user of ID browses " military channel ",The news ID list of adjacent to " military channel " " social channel " and the news ID list of " political situation of the time channel " can be carried out prestretching and be taken as the original exposure data for the ID by client,And the user for working as another ID browses " social channel ",The news ID list of adjacent to " social channel " " military channel " and the news ID list of " entertainment channel " can be carried out prestretching and pick and place original exposure data corresponding as this ID by client,Herein for convenient explanation,It is introduced with the operation of ID,This is not hereinafter repeated.
S102, by the original exposure data with the ID as key, be stored in the KV storage system based on internal memory as value and cache as exposure to expose channel and exposure channel news ID list.
It is the storage system using key-value for form for KV storage,Corresponding value can be inquired by key,Those of ordinary skill in the art are understood that,So the implication to KV storage is not specifically introduced,By the use of ID as key,I.e. in key value list,Can there are multiple IDs,It is not limited only to first user ID,When KV storage system is built,Using first user ID as key,First user ID is then operated the exposure channel that correspondence is pulled in advance and the list of exposure channel news ID as value,Exposure channel and the list of exposure channel news ID are corresponded,The corresponding pre- exposure channel for pulling and exposure channel news ID list can be inquired using first user ID,In the same manner,Second user ID can be utilized,Do not repeated,In this manner initial data is stored in the database of KV storage system,Original exposure data storage in KV system after be referred to as exposure caching,Hereinafter occur not repeated.
The channel click data that S103, acquisition are produced with first user ID operation client, the channel click data include the first user ID, clicked exposure channel, and wherein, the clicked channel is in the plurality of exposure channel.
After setting up KV storage system,When the channel click data for receiving first user ID operation client generation,That is first user ID produces channel click data by clicking certain channel,Now the user of explanation first user ID is browsing the news under the channel,So it is considered that being that user sees to the news ID list that the channel is pulled in advance,Can use as true exposure data,The channel click data includes the first user ID、Clicked exposure channel,Wherein,The clicked channel is in the plurality of exposure channel,Clicked exposure channel is the channel that the user of first user ID currently browses,The user of for example current first user ID is browsing " military channel ",Then clicked exposure channel is " military channel ",And multiple exposure channels " political situation of the time channel " that this clicked exposure channel " military channel " is exactly pulled before in advance、One in " social channel " and " military channel ".
S104, inquired about in the KV storage system according to the first user ID first user ID corresponding described exposure caching.
Characteristic using KV storage system, can be inquired about for key according to first user ID and obtain exposure caching corresponding with the first user ID in database, the exposure channel that i.e. first user ID is pulled in advance and exposure channel news ID list, for example, news ID list under " military channel " and " military channel " of the current operation being mentioned above, the news ID list under the list of news ID and " social channel " and " social channel " under " political situation of the time channel " and " political situation of the time channel ", need below to do is to from the corresponding pre- exposure channel for pulling of first user ID and expose the channel that channel news ID list determines that user has browsed, determine true exposure data.
S105, the corresponding exposure channel news ID list of clicked described in exposure caching exposure channel according to the clicked exposure channel query.
After obtaining the corresponding exposure channel of first user ID and exposure channel news ID list, recycle the information in clicked exposure channel query exposure caching, the corresponding exposure channel news ID list of clicked exposure channel can be obtained, the user being for example mentioned above clicks " military channel ", the news ID list of " military channel " then can be inquired about according to " military channel " in exposure caching, and the channel that the news ID list exactly user of this " military channel " browses is can be used as True Data after checking.
S106, corresponding for clicked exposure channel exposure channel news ID list is used as true exposure data for backstage when judging that the corresponding exposure channel news ID list of clicked exposure channel meets pre-conditioned.
After having obtained the corresponding exposure channel news ID list of clicked exposure channel, need to carry out validation verification, for example, data to exposure caching carry out time restriction, can be dropped more than the exposure caching of certain time, to save memory space, specifically can be using the generation time of record exposure channel news ID list, and set effective time, the time of the click behavior of channel is defined by receiving user, calculate whether time interval is not more than effective time, if being not more than, this can determine that the data for obtaining are limited, corresponding for clicked exposure channel exposure channel news ID list can be used as true exposure data for backstage, i.e. user has really browsed the information of the channel, backstage is using can count to true exposure data, as the reference that news is recommended, improve the precision of news recommendation.
The data processing method that the present invention is provided, the original exposure data for being pulled generation in advance carry out KV storage, recycle the ID in channel click data to inquire about in the database that KV is stored and obtain exposure caching, exposure channel news ID list will be inquired in exposure caching using clicked channel, the exposure channel news ID list obtained by the use of clicked channel query is used as true exposure data for backstage, so that backstage is when being counted, the exposure channel news ID list that only clicks on through channel in statistics original exposure data, true exposure data is only counted, and original exposure data are screened with pre-conditioned using KV storage, improve the accuracy of back-end data process.
In conjunction with shown in Fig. 3, the data processing method of the present invention additionally provides another kind of embodiment, including:
The original exposure data that S201, acquisition user operation client are produced, wherein, the user has the ID for identity, the original exposure data include ID, multiple exposure channels corresponding with the ID and multiple exposure channel news ID lists, the exposure channel is corresponded with exposure channel news ID list, and the ID at least includes first user ID.
Similar with S101 in a upper embodiment in step 201, do not repeat herein.
S202, by the original exposure data with the ID as key, be stored in the KV storage system based on internal memory as value and cache as exposure to expose channel and exposure channel news ID list.
The exposure channel news ID list includes time for exposure and effective time, stored in KV storage system in the lump, the list time for exposure can be obtained after exposure channel news ID list is inquired, that is the generation time of list, effective time is used for verifying the information that inquiry is obtained, the data for exceeding effective time are abandoned, with save space, improve the accuracy of data statistics, the effective time flexibly can be set, random time in 1 minute to 10 minutes can be for example set to, specifically can be selected as needed, here is not defined.
The channel click data that S203, acquisition are produced with first user ID operation client, the channel click data include the first user ID, clicked exposure channel, and wherein, the clicked channel is in the plurality of exposure channel.
Similar with S103 in a upper embodiment in step 203, do not repeat herein.
S204, inquired about in the KV storage system according to the first user ID first user ID corresponding described exposure caching.
Similar with S104 in a upper embodiment in step 204, do not repeat herein.
S205, the corresponding exposure channel news ID list of clicked described in exposure caching exposure channel according to the clicked exposure channel query.
Similar with S105 in a upper embodiment in step 205, do not repeat herein.
The click behavior time of origin of the clicked exposure channel of S206, acquisition.
The time at that time is recorded when user clicks on channel, will click on time of origin and put in channel click data, receive channel click data and click time of origin can be parsed, be used as judging the datum mark whether time for exposure exceeds effective time.
Time difference between S207, the time for exposure for calculating the clicked exposure channel news ID list and the click behavior time of origin.
The time for exposure can be generated when exposure channel news ID list is pulled in advance, it is used for the generation time for pointing out to expose channel news ID list, the duration of exposure channel news ID list can be calculated using click behavior time of origin, i.e. time difference, compares time difference and the size of effective time can determine whether exposure channel news ID list can be used as true exposure data.
S208, then clicked exposure channel corresponding exposure channel news ID list when judging that the time difference is not more than the effective time, if so, then execute S109, if it is not, then executing S110.
The corresponding exposure channel news ID list of then clicked exposure channel when being not more than the effective time of time difference meets pre-conditioned, the corresponding exposure channel news ID list of then clicked exposure channel when being more than the effective time of time difference is then unsatisfactory for pre-conditioned, those of ordinary skill in the art are not it is to be appreciated that repeated herein.
S209, using corresponding for the clicked exposure channel exposure list of channel news ID use as true exposure data for backstage.
Similar with S106 in a upper embodiment in step 209, do not repeat herein.
S210, using corresponding for the clicked exposure channel exposure list of channel news ID abandon as pseudo- exposure data.
Backstage needs true exposure data is carried out the operation such as counting, it is therefore desirable to exclude pseudo- exposure data, in order to pseudo- exposure data can be abandoned by the space for saving KV storage system, will puppet exposure data deleted, with save space.
By accurately effective exposure data is provided in real time, the analysis of more accurate news data is obtained, reduced the mistake of statistics that pseudo- exposure data is caused in original exposure data.
In order to preferably provide accurate recommending data, for operation of the user to concrete news, the data processing method of the present invention additionally provides a kind of embodiment, illustrates with reference to Fig. 4.
The original exposure data that S301, acquisition user operation client are produced, wherein, the user has the ID for identity, the original exposure data include ID, multiple exposure channels corresponding with the ID and multiple exposure channel news ID lists, the exposure channel is corresponded with exposure channel news ID list, and the ID at least includes first user ID.
Step S301 is similar with step S201, is not repeated herein.
S302, by the original exposure data with the ID as key, be stored in the KV storage system based on internal memory as value and cache as exposure to expose channel and exposure channel news ID list.
Step S302 is similar with step S202, is not repeated herein.
The channel click data that S303, acquisition are produced with first user ID operation client, the channel click data include the first user ID, clicked exposure channel, and wherein, the clicked channel is in the plurality of exposure channel.
Step S303 is similar with step S203, is not repeated herein.
The news click data that S304, acquisition are produced with first user ID operation client, the news is clicked on packet and includes the first user ID, news are located exposure channel and news ID, wherein, exposure channel that the news is located is in the plurality of exposure channel.
The click behavior of user and news location have much relations, the different attractions to different user in news present position are different, therefore news present position when user clicks on news is obtained, all extremely important to precisely recommendation or data analysis, after the news click data of the user for receiving first user ID, the exposure caching of first user ID is obtained according to first user ID inquiry based on the KV storage of internal memory, exposure channel news ID list according to news place channel in news place channel extraction exposure caching, because news ID in exposure channel news ID list is arranged in sequence, therefore can inquire about and obtain news ID this time clicked on present position in lists, that is the news recommended location residing when user is exposed to.
S305, inquired about in the KV storage system according to the first user ID first user ID corresponding described exposure caching.
Step S305 is similar with step S204, is not repeated herein.
S306, the corresponding exposure channel news ID list of clicked described in exposure caching exposure channel according to the clicked exposure channel query.
Step S306 is similar with step S205, is not repeated herein.
S307 obtains the click behavior time of origin of clicked exposure channel.
Step S307 is similar with step S206, is not repeated herein.
S308 calculates the time difference between the time for exposure of the clicked exposure channel news ID list and the click behavior time of origin.
Step S308 is similar with step S207, is not repeated herein.
S309, then clicked exposure channel corresponding exposure channel news ID list when judging that the time difference is not more than the effective time, if so, then execute S310, if it is not, then executing S313.
The corresponding exposure channel news ID list of then clicked exposure channel when being not more than the effective time of time difference meets pre-conditioned, the corresponding exposure channel news ID list of then clicked exposure channel when being more than the effective time of time difference is then unsatisfactory for pre-conditioned, those of ordinary skill in the art are not it is to be appreciated that repeated herein.
S310, using corresponding for the clicked exposure channel exposure list of channel news ID use as true exposure data for backstage.
Similar with S209 in a upper embodiment in step 301, do not repeat herein.
S311, news described in exposure caching according to news place exposure channel query are located and expose the corresponding exposure channel news ID list of channel.
The exposure caching of first user ID is obtained according to first user ID inquiry based on the KV storage of internal memory, according to the exposure channel news ID list of news place channel in news place channel extraction exposure caching.
S312, according to news ID the news be located exposure channel corresponding exposure channel news ID list in obtain the corresponding positional information of news ID so that the positional information be available for backstage use.
Exposure channel news ID list according to news place channel in news place channel extraction exposure caching, because news ID in exposure channel news ID list is arranged in sequence, therefore can inquire about and obtain news ID this time clicked on present position in lists, that is the news recommended location residing when user is exposed to, by clicking on the working process of behavior to user's news, obtain clicked news present position, there is provided recommending the fine granularity CTR (Chinese of position aspect:Click-through-rate, English:Click Through Rate) data analysis condition.
S313, using corresponding for the clicked exposure channel exposure list of channel news ID abandon as pseudo- exposure data.
Backstage needs true exposure data is carried out the operation such as counting, it is therefore desirable to exclude pseudo- exposure data, in order to pseudo- exposure data can be abandoned by the space for saving KV storage system, will puppet exposure data deleted, with save space.
Business is recommended to demonstrate the method by clicking on the working process of behavior to user's news by actual news, obtain clicked news present position, the attraction of different recommendation positions is distinguished, has more accurately understood user interest, effectively increased the precision that recommends in news proposed algorithm.
For the ease of understanding the data processing method of the application, a kind of application scenarios are provided below and readily appreciate scheme.
User browses web page news on the client, the current channel for browsing is B channel, adjacent with B channel is A channel and C channel, ID is first user ID, news ID list in A channel or C channel can quickly being checked for the ease of user when horizontally slipping, need to pull the news ID list of the channel list of A channel and C channel in advance, the time for now carrying out prestretching extract operation is the time for exposure, it is determined as:00 point when 10,Assume that the news ID list of A channel includes a1、a2、a3,The news ID list of B channel is b1、b2、b3,The news ID list of C channel is c1、c2、c3,The now news ID list of A channel、The news ID list of B channel and the news ID list of C channel are used as original exposure data,By these original exposure light data with first user ID as key、The news ID list of the news ID list C channel and C channel of the news ID list B channel of A channel and A channel and B channel and time for exposure are for as exposure caching in value storage KV storage system,When first user ID operates A channel,Generate the channel click data of A channel,Channel click data includes first user ID、Clicked A channel,And click on time of origin (05 point when being defined as 10) and effective time (being defined as 10 minutes),Exposure caching corresponding with first user ID is inquired in KV storage system using first user ID,Recycle the news ID list a1 of corresponding A channel in clicked A channel query exposure caching、a2、a3,It it is 5 minutes according to time of origin and time for exposure calculating time difference is clicked on,Time difference is not more than effective time,So the news ID list of the A channel and A channel in original exposure data is true exposure data,Can use for backstage statistics,And the user of first user ID clicks on the C channel moment (time difference is more than 10 points 10 points when being later than 10,Beyond effective time) or do not click on C channel,Then the news ID list that C channel is C channel can be deleted as pseudo- exposure data with save space.
A kind of data processing method presented hereinabove, in conjunction with shown in Fig. 5, accordingly, present invention also offers a kind of embodiment of data handling system, including:
First acquisition unit 401, for obtaining the first data of user operation client generation, wherein, the user has the ID for identity, first data include ID, multiple second data corresponding with the ID and multiple 3rd data, and second data are corresponded with the 3rd data;
Memory cell 402, for configuring the ID of first data and second data and the one-to-one relation of the 3rd data and using first data as data cached;
Second acquisition unit 403, for obtaining the Object Operations data of user operation client generation, the second data that the Object Operations data include the ID, operated, wherein, the object that operated is one in the plurality of second data;
First query unit 404, for being inquired about in the relation according to the ID, to obtain the ID corresponding described data cached;
Second query unit 405, corresponding 3rd data of the second data for being operated described in data cached according to second data query for being operated;
Corresponding for the second clicked data the 3rd data are used for backstage when corresponding 3rd data of the second data for judging to be operated meet pre-conditioned by the first judging unit 406 as True Data.
Alternatively, first acquisition unit 401, for obtaining the original exposure data of user operation client generation, wherein, the user has the ID for identity, the original exposure data include ID, multiple exposure channels corresponding with the ID and multiple exposure channel news ID lists, and the exposure channel is corresponded with exposure channel news ID list, and the ID at least includes first user ID.
Memory cell 402, for by the original exposure data with the ID as key, be stored in the KV storage system based on internal memory as value and cache as exposure to expose channel and exposure channel news ID list.
Second acquisition unit 403, for obtaining the channel click data produced with first user ID operation client, the channel click data includes the first user ID, clicked exposure channel, and wherein, the clicked channel is in the plurality of exposure channel.
First query unit 404, for inquiring about the corresponding exposure caching of the first user ID according to the first user ID in the KV storage system.
Second query unit 405, for exposing the corresponding exposure channel news ID list of clicked exposure channel described in caching according to the clicked exposure channel query.
First judging unit 406, for judging when the corresponding exposure channel news ID list of clicked exposure channel meets pre-conditioned to use corresponding for clicked exposure channel exposure channel news ID list as true exposure data for backstage.
The data handling system that the present invention is provided, the original exposure data for being pulled generation in advance carry out KV storage, recycle the ID in channel click data to inquire about in the database that KV is stored and obtain exposure caching, exposure channel news ID list will be inquired in exposure caching using clicked channel, the exposure channel news ID list obtained by the use of clicked channel query is used as true exposure data for backstage, so that backstage is when being counted, the exposure channel news ID list that only clicks on through channel in statistics original exposure data, true exposure data is only counted, and original exposure data are screened with pre-conditioned using KV storage, improve the accuracy of back-end data process.
Further, the exposure channel news ID list includes time for exposure and effective time, and the data handling system also includes:
3rd acquiring unit 407, for obtaining the click behavior time of origin of clicked exposure channel;
Computing unit 408, for calculating the time difference between the time for exposure of the clicked exposure channel news ID list and the click behavior time of origin;
First judging unit be additionally operable to the corresponding exposure channel news ID list of the then clicked exposure channel when the time difference is not more than the effective time meet pre-conditioned.
Further, the data handling system also includes:
Second judging unit 409, for judging when the corresponding exposure channel news ID list of clicked exposure channel does not meet pre-conditioned to abandon corresponding for clicked exposure channel exposure channel news ID list as pseudo- exposure data.
Further, the data handling system also includes:
4th acquiring unit 410, for obtaining the news click data produced with first user ID operation client, the news is clicked on packet and includes the first user ID, news are located exposure channel and news ID, and wherein, exposure channel that the news is located is in the plurality of exposure channel;
3rd query unit 411, for according to the corresponding exposure channel news ID list of exposure channel that news described in exposure caching described in exposure channel query that the news is located is located;
4th query unit 412, for obtaining the corresponding positional information of news ID according to news ID in the news is located the corresponding exposure channel news ID list of exposure channel, so that the positional information is available for backstage use.
The data handling system being mentioned above, there is also provided the terminal as data handling system carrier.
A kind of terminal, including previously described data handling system, certain terminal also needs to include necessary hardware configuration, is specifically introduced below.
The terminal can be to include the arbitrarily terminal device such as mobile phone, panel computer, PDA (Personal Digital Assistant, personal digital assistant), vehicle-mounted computer, so that terminal is as mobile phone as an example:
Mobile phone includes:Radio frequency (Radio Frequency, RF) circuit, memory, input block, touch display screen, sensor, voicefrequency circuit, Wireless Fidelity (wireless fidelity, the WiFi) part such as module, processor and power supply.
Each component parts to mobile phone is specifically introduced below:
RF circuit can be used to receiving and sending messages or communication process in, the reception of signal and transmission, especially, after the downlink information of base station is received, process to processor;In addition, up data is activation will be designed to base station.Generally, RF circuit includes but is not limited to antenna, at least one amplifier, transceiver, coupler, low-noise amplifier (Low Noise Amplifier, LNA), duplexer etc..Additionally, RF circuit can also be communicated with network and other equipment by radio communication.Above-mentioned radio communication can be using arbitrary communication standard or agreement,Including but not limited to global system for mobile communications (Global System of Mobile communication,GSM)、General packet radio service (General Packet Radio Service,GPRS)、CDMA (Code Division Multiple Access,CDMA)、WCDMA (Wideband Code Division Multiple Access,WCDMA)、Long Term Evolution (Long Term Evolution,LTE)、Email、Short Message Service (Short Messaging Service,SMS) etc..
Memory can be used to store software program and module, and processor is stored in software program and the module of memory by operation, so as to execute various function application and the data processing of mobile phone.Memory can mainly include storing program area and storage data field, and wherein, storing program area can storage program area, application program (such as sound-playing function, image player function etc.) needed at least one function etc.;Storage data field can be stored and use created data (such as voice data, phone directory etc.) etc. according to mobile phone.Additionally, memory can include high-speed random access memory, nonvolatile memory, for example, at least one disk memory, flush memory device or other volatile solid-state parts can also be included.
Input block can be used for the numeral of receives input or character information, and produce the key signals input relevant with the user setup of mobile phone and function control.Specifically, input block may include contact panel and other input equipments.Contact panel, also referred to as touch-screen, user can be collected thereon or neighbouring touch operation (operation of the such as user using any suitable object such as finger, stylus or annex on contact panel or near the contact panel), and corresponding attachment means are driven according to formula set in advance.Optionally, contact panel may include two parts of touch detecting apparatus and touch controller.Wherein, the touch orientation of touch detecting apparatus detection user, and the signal that touch operation brings is detected, transmit a signal to touch controller;Touch controller receives touch information from touch detecting apparatus, and is converted into contact coordinate, then gives processor, and the order sent of receiving processor can be executed.Furthermore, it is possible to realize contact panel using polytypes such as resistance-type, condenser type, infrared ray and surface acoustic waves.Except contact panel, input block can also include other input equipments.Specifically, other input equipments can include but is not limited to one or more in physical keyboard, function key (such as volume control button, switch key etc.), trace ball, mouse, action bars etc..
Touch display screen can be used for display by the information of user input or be supplied to the information of user and the various menus of mobile phone.Touch display screen may include display floater, optionally, can be using liquid crystal display (English:Liquid Crystal Display, referred to as:LCD), Organic Light Emitting Diode (English:Organic Light-Emitting Diode, referred to as:) etc. OLED form is configuring display floater.Further, contact panel can cover display floater, when contact panel is detected thereon or after neighbouring touch operation, processor is sent to determine the type of touch event, provide corresponding visual output with preprocessor on a display panel according to the type of touch event.
Mobile phone may also include at least one sensor, such as optical sensor, motion sensor and other sensors.Specifically, optical sensor may include ambient light sensor and proximity transducer, and wherein, ambient light sensor can adjust the brightness of display floater according to the light and shade of ambient light, and proximity transducer can cut out display floater and/or backlight when mobile phone is moved in one's ear.One kind as motion sensor, accelerometer sensor can detect the size of (generally three axles) acceleration in all directions, size and the direction of gravity is can detect that when static, can be used to recognize the application (such as horizontal/vertical screen switching, dependent game, magnetometer pose calibrating) of mobile phone attitude, Vibration identification correlation function (such as pedometer, percussion) etc.;The other sensors such as the gyroscope that can also configure as mobile phone, barometer, hygrometer, thermometer, infrared ray sensor, will not be described here.
Electric signal after the voice data for receiving conversion can be transferred to loudspeaker by voicefrequency circuit, be converted to voice signal output by loudspeaker;On the other hand, the voice signal of collection is converted to electric signal by microphone, is converted to voice data by voicefrequency circuit after being received, then after voice data output processor is processed, through RF circuit being sent to such as another mobile phone, or voice data is exported to memory to process further.
WiFi belongs to short range wireless transmission technology, and mobile phone can help user to send and receive e-mail by WiFi module, browse webpage and access streaming video etc., and it has provided the user wireless broadband internet and has accessed.
Processor is the control centre of mobile phone, using various interfaces and the various pieces of connection whole mobile phone, by running or executing software program and/or the module being stored in memory, and call the data being stored in memory, various functions and the processing data of mobile phone is executed, so as to integral monitoring be carried out to mobile phone.Optionally, processor may include one or more processing units;Preferably, processor can integrated application processor and modem processor, wherein, application processor mainly processes operating system, user interface and application program etc., and modem processor mainly processes radio communication.It is understood that above-mentioned modem processor can not also be integrated in processor.
Mobile phone also includes the power supply (such as battery) that powers to all parts, it is preferred that power supply can be logically contiguous with processor by power-supply management system, so as to realize the functions such as management charging, electric discharge and power managed by power-supply management system.
Although not shown, mobile phone can also include camera, bluetooth module etc., will not be described here.
Those skilled in the art can be understood that, for convenience and simplicity of description, the specific work process of the system, apparatus, and unit of foregoing description, the corresponding process in preceding method embodiment is may be referred to, be will not be described here.
In several embodiments provided herein, it should be understood that disclosed system, apparatus and method, can realize by another way.For example, device embodiment described above is only schematic, for example, the division of the unit, be only a kind of division of logic function, can have when actually realizing other dividing mode, for example multiple units or component can in conjunction with or be desirably integrated into another system, or some features can be ignored, or do not execute.Another, shown or discussed coupling each other or direct-coupling or communication connection can be the INDIRECT COUPLING or communication connection of device or unit by some interfaces, can be electrical, mechanical or other forms.
The unit that illustrates as separating component can be or may not be physically separate, as the part that unit shows can be or may not be physical location, you can be located at a place, or can also be distributed on multiple NEs.Some or all of unit therein can be selected according to the actual needs to realize the purpose of this embodiment scheme.
In addition, each functional unit in each embodiment of the present invention can be integrated in a processing unit, or unit is individually physically present, it is also possible to which two or more units are integrated in a unit.Above-mentioned integrated unit both can be realized in the form of hardware, it would however also be possible to employ the form of SFU software functional unit is realized.
One of ordinary skill in the art will appreciate that all or part of step in the various methods of above-described embodiment can be by program completing to instruct the hardware of correlation, the program can be stored in a computer-readable recording medium, and storage medium can include:Read-only storage (ROM, Read Only Memory), random access memory (RAM, Random Access Memory), disk or CD etc..
One of ordinary skill in the art will appreciate that the hardware that all or part of step that realizes in above-described embodiment method can be by program to instruct correlation is completed, described program can be stored in a kind of computer-readable recording medium, storage medium mentioned above can be read-only storage, disk or CD etc..
It is that relevant device is described in detail to a kind of data processing method provided by the present invention above, for one of ordinary skill in the art, thought according to the embodiment of the present invention, all will change in specific embodiments and applications, in sum, this specification content should not be construed as limiting the invention.

Claims (11)

1. a kind of data processing method, it is characterised in that include:
Obtain the first data that user operation client is produced, wherein, the user has the ID for identity, first data include ID, multiple second data corresponding with the ID and multiple 3rd data, and second data are corresponded with the 3rd data;
Configure the ID of first data and second data and the one-to-one relation of the 3rd data and using first data as data cached;
Obtain the Object Operations data of user operation client generation, the second data that the Object Operations data include the ID, operated, wherein, the object that operated is one in the plurality of second data;
Inquire about according to the ID that to obtain the ID corresponding described data cached in the relation;
According to second data query for being operated data cached described in corresponding 3rd data of the second data for being operated;
Corresponding for the second clicked data the 3rd data are used as True Data for backstage when judging that corresponding 3rd data of the second data for being operated meet pre-conditioned.
2. method according to claim 1, it is characterized in that, first data are original exposure data, second data are exposure channel, 3rd data are exposure channel news ID list, the Object Operations data are channel click data, first data for obtaining the generation of user operation client, wherein, the user has the ID for identity, first data include ID, multiple second data corresponding with the ID and multiple 3rd data, second data are corresponded with the 3rd data, including:
Obtain the original exposure data that user operation client is produced, wherein, the user has the ID for identity, the original exposure data include ID, multiple exposure channels corresponding with the ID and multiple exposure channel news ID lists, and the exposure channel is corresponded with exposure channel news ID list;
The ID of the configuration first data and second data and the one-to-one relation of the 3rd data using first data as data cached, including:
By the original exposure data with the ID as key, be stored in the KV storage system based on internal memory as value and cache as exposure to expose channel and exposure channel news ID list;
Obtain the Object Operations data of user operation client generation, the second data that the Object Operations data include the ID, operated, wherein, the object that operated is one in the plurality of second data, including:
The channel click data of user operation client generation is obtained, the channel click data includes the ID, clicked exposure channel, wherein, the clicked channel is in the plurality of exposure channel;
Inquire about according to the ID that to obtain the ID corresponding described data cached in the relation, including:
The corresponding exposure caching of the ID is inquired about in the KV storage system according to the ID;
According to second data query for being operated data cached described in corresponding 3rd data of the second data for being operated, including:
The corresponding exposure channel news ID list of clicked exposure channel described in exposure caching according to the clicked exposure channel query;
Corresponding for the second clicked data the 3rd data are used as True Data for backstage when judging that corresponding 3rd data of the second data for being operated meet pre-conditioned, including:
Corresponding for clicked exposure channel exposure channel news ID list is used as true exposure data for backstage when judging that the corresponding exposure channel news ID list of clicked exposure channel meets pre-conditioned.
3. data processing method according to claim 2, it is characterised in that the exposure channel news ID list includes time for exposure and effective time,
Before corresponding for clicked exposure channel exposure channel news ID list is used for backstage when meeting pre-conditioned by the corresponding exposure channel news ID list of the clicked exposure channel of the judgement as true exposure data, also include:
Obtain the click behavior time of origin of clicked exposure channel;
Calculate the time difference between the time for exposure of the clicked exposure channel news ID list and the click behavior time of origin;
It is described that to judge that the corresponding exposure channel news ID list of clicked exposure channel meets pre-conditioned, including:
The corresponding exposure channel news ID list of then clicked exposure channel when being not more than the effective time of the time difference meets pre-conditioned.
4. data processing method according to claim 2, it is characterised in that after the corresponding exposure channel news ID list of clicked exposure channel described in exposure caching according to the clicked exposure channel query, also include:
Corresponding for clicked exposure channel exposure channel news ID list is abandoned as pseudo- exposure data when judging that the corresponding exposure channel news ID list of clicked exposure channel does not meet pre-conditioned.
5. data processing method according to claim 2, it is characterised in that described the corresponding exposure caching of the ID is inquired about in the KV storage system according to the ID before, also include:
The news click data of user operation client generation is obtained, the news is clicked on packet and the ID, news are located exposure channel and news ID is included, wherein, exposure channel that the news is located is in the plurality of exposure channel;
Described according to the user inquire about in the KV storage system ID corresponding described exposure caching after, also include:
According to the corresponding exposure channel news ID list of exposure channel that news described in exposure caching described in exposure channel query that the news is located is located;
The corresponding positional information of news ID is obtained according to news ID in the news is located the corresponding exposure channel news ID list of exposure channel, so that the positional information is available for backstage use.
6. data processing method according to claim 3, it is characterised in that the effective time is random time in 1 minute to 10 minutes.
7. a kind of data handling system, it is characterised in that include:
First acquisition unit, for obtaining the first data of user operation client generation, wherein, the user has the ID for identity, first data include ID, multiple second data corresponding with the ID and multiple 3rd data, and second data are corresponded with the 3rd data;
Memory cell, for configuring the ID of first data and second data and the one-to-one relation of the 3rd data and using first data as data cached;
Second acquisition unit, for obtaining the Object Operations data of user operation client generation, the second data that the Object Operations data include the ID, operated, wherein, the object that operated is one in the plurality of second data;
First query unit, for being inquired about in the relation according to the ID, to obtain the ID corresponding described data cached;
Second query unit, corresponding 3rd data of the second data for being operated described in data cached according to second data query for being operated;
Corresponding for the second clicked data the 3rd data are used for backstage when corresponding 3rd data of the second data for judging to be operated meet pre-conditioned by the first judging unit as True Data.
8. data handling system according to claim 7, it is characterised in that
First acquisition unit is additionally operable to obtain the original exposure data that user operation client is produced, wherein, the user has the ID for identity, the original exposure data include ID, multiple exposure channels corresponding with the ID and multiple exposure channel news ID lists, and the exposure channel is corresponded with exposure channel news ID list;
Memory cell is additionally operable to the original exposure data with the ID as key, be stored in the KV storage system based on internal memory as value and cache as exposure to expose channel and exposure channel news ID list;
Second acquisition unit is additionally operable to obtain the channel click data of user operation client generation, and the channel click data includes the ID, clicked exposure channel, and wherein, the clicked channel is in the plurality of exposure channel;
First query unit is additionally operable to inquire about the corresponding exposure caching of the ID in the KV storage system according to the ID;
Second query unit is additionally operable to the corresponding exposure channel news ID list of clicked described in exposure caching exposure channel according to the clicked exposure channel query;
First judging unit is additionally operable to when judging that the corresponding exposure channel news ID list of clicked exposure channel meets pre-conditioned use corresponding for clicked exposure channel exposure channel news ID list as true exposure data for backstage.
9. data handling system according to claim 8, it is characterised in that the exposure channel news ID list includes time for exposure and effective time, and the data handling system also includes:
3rd acquiring unit, for obtaining the click behavior time of origin of clicked exposure channel;
Computing unit, for calculating the time difference between the time for exposure of the clicked exposure channel news ID list and the click behavior time of origin;
First judging unit be additionally operable to the corresponding exposure channel news ID list of the then clicked exposure channel when the time difference is not more than the effective time meet pre-conditioned.
10. data handling system according to claim 8, it is characterised in that the data handling system also includes:
Second judging unit, for judging when the corresponding exposure channel news ID list of clicked exposure channel does not meet pre-conditioned to abandon corresponding for clicked exposure channel exposure channel news ID list as pseudo- exposure data.
11. data handling systems according to claim 8, it is characterised in that the data handling system also includes:
4th acquiring unit, for obtaining the news click data of user operation client generation, the news is clicked on packet and includes the ID, news are located exposure channel and news ID, and wherein, exposure channel that the news is located is in the plurality of exposure channel;
3rd query unit, for according to the corresponding exposure channel news ID list of exposure channel that news described in exposure caching described in exposure channel query that the news is located is located;
4th query unit, for obtaining the corresponding positional information of news ID according to news ID in the news is located the corresponding exposure channel news ID list of exposure channel, so that the positional information is available for backstage use.
CN201510522784.9A 2015-08-24 2015-08-24 Data processing method and system Active CN106484688B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510522784.9A CN106484688B (en) 2015-08-24 2015-08-24 Data processing method and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510522784.9A CN106484688B (en) 2015-08-24 2015-08-24 Data processing method and system

Publications (2)

Publication Number Publication Date
CN106484688A true CN106484688A (en) 2017-03-08
CN106484688B CN106484688B (en) 2020-07-24

Family

ID=58233028

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510522784.9A Active CN106484688B (en) 2015-08-24 2015-08-24 Data processing method and system

Country Status (1)

Country Link
CN (1) CN106484688B (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110460902A (en) * 2018-05-08 2019-11-15 腾讯科技(深圳)有限公司 Playing method and device, storage medium, the electronic device of media information
CN110968488A (en) * 2018-09-30 2020-04-07 北京国双科技有限公司 User data storage method and device
CN111460285A (en) * 2020-03-17 2020-07-28 北京百度网讯科技有限公司 Information processing method, device, electronic equipment and storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101159592A (en) * 2007-08-10 2008-04-09 北大方正集团有限公司 Statistical method and device of internet data information clicking rates
CN101271562A (en) * 2008-05-12 2008-09-24 腾讯科技(深圳)有限公司 Collection processing method and system for network advertisement operation event information
CN102135873A (en) * 2010-01-26 2011-07-27 腾讯科技(深圳)有限公司 Method and device for creating user interface
CN103729446A (en) * 2013-12-30 2014-04-16 广州金山网络科技有限公司 Processing method and device for user operation data and server
US20150235261A1 (en) * 2009-03-25 2015-08-20 Google Inc. Advertisement effectiveness measurement

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101159592A (en) * 2007-08-10 2008-04-09 北大方正集团有限公司 Statistical method and device of internet data information clicking rates
CN101271562A (en) * 2008-05-12 2008-09-24 腾讯科技(深圳)有限公司 Collection processing method and system for network advertisement operation event information
US20150235261A1 (en) * 2009-03-25 2015-08-20 Google Inc. Advertisement effectiveness measurement
CN102135873A (en) * 2010-01-26 2011-07-27 腾讯科技(深圳)有限公司 Method and device for creating user interface
CN103729446A (en) * 2013-12-30 2014-04-16 广州金山网络科技有限公司 Processing method and device for user operation data and server

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110460902A (en) * 2018-05-08 2019-11-15 腾讯科技(深圳)有限公司 Playing method and device, storage medium, the electronic device of media information
CN110968488A (en) * 2018-09-30 2020-04-07 北京国双科技有限公司 User data storage method and device
CN111460285A (en) * 2020-03-17 2020-07-28 北京百度网讯科技有限公司 Information processing method, device, electronic equipment and storage medium
EP3882792A1 (en) * 2020-03-17 2021-09-22 Beijing Baidu Netcom Science And Technology Co. Ltd. Method and apparatus for processing information, electronic device and storage medium
JP2021149963A (en) * 2020-03-17 2021-09-27 ベイジン バイドゥ ネットコム サイエンス アンド テクノロジー カンパニー リミテッド Information processing method, device, electronic apparatus, and storage medium
US11250066B2 (en) 2020-03-17 2022-02-15 Beijing Baidu Netcom Science And Technology Co., Ltd. Method for processing information, electronic device and storage medium
JP7261827B2 (en) 2020-03-17 2023-04-20 阿波▲羅▼智▲聯▼(北京)科技有限公司 Information processing method, device, electronic device and storage medium
CN111460285B (en) * 2020-03-17 2023-11-03 阿波罗智联(北京)科技有限公司 Information processing method, apparatus, electronic device and storage medium

Also Published As

Publication number Publication date
CN106484688B (en) 2020-07-24

Similar Documents

Publication Publication Date Title
CN103473011B (en) A kind of mobile terminal performance detection method, device and mobile terminal
CN103530115B (en) Application program display method and device and terminal equipment
CN107146616A (en) Apparatus control method and Related product
CN103327102A (en) Application program recommending method and device
CN104426919A (en) Page sharing method, device and system
CN105260087A (en) Information display method and terminal
CN104571529A (en) Application wake method and mobile terminal
CN105447583A (en) User churn prediction method and device
CN104519262A (en) Method, device for acquiring video data, and terminal
CN103945241A (en) Streaming data statistical method, system and related device
CN104424211A (en) Microblog-based service data release method, device and system
CN103399705A (en) Method, device and equipment for remotely controlling terminal equipment
CN104424278A (en) Method and device for acquiring hotspot information
CN107817988A (en) The management method and Related product of PUSH message
CN105512150A (en) Method and device for information search
CN104951637A (en) Method and device for obtaining training parameters
CN104699501A (en) Method and device for running application program
CN105550316A (en) Pushing method and device of audio list
CN103455602A (en) Video URL (Uniform Resource Locator) capturing method and device and terminal equipment
CN103399706A (en) Page interaction method, device and terminal
CN105047185A (en) Method, device and system for obtaining audio frequency of accompaniment
CN106484688A (en) A kind of data processing method and system
CN105743773A (en) Weather data acquisition method and device
CN104918130A (en) Methods for transmitting and playing multimedia information, devices and system
CN104702643A (en) A webpage access method, device and system

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant