CN106446171A - Method and device for generating statistics indexes - Google Patents

Method and device for generating statistics indexes Download PDF

Info

Publication number
CN106446171A
CN106446171A CN201610852083.6A CN201610852083A CN106446171A CN 106446171 A CN106446171 A CN 106446171A CN 201610852083 A CN201610852083 A CN 201610852083A CN 106446171 A CN106446171 A CN 106446171A
Authority
CN
China
Prior art keywords
index
field
daily record
data
statistical
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201610852083.6A
Other languages
Chinese (zh)
Inventor
孔矾建
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
TVM Beijing Technology Co Ltd
Original Assignee
TVM Beijing Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by TVM Beijing Technology Co Ltd filed Critical TVM Beijing Technology Co Ltd
Priority to CN201610852083.6A priority Critical patent/CN106446171A/en
Publication of CN106446171A publication Critical patent/CN106446171A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/901Indexing; Data structures therefor; Storage structures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • Information Transfer Between Computers (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a method and device for generating statistics indexes, so as to achieve the goal of flexible statistics of a page view amount. The method comprises the steps of generating log data corresponding to a page browse event when the page browse event is monitored; obtaining a self-defined index structure, wherein the index structure is at least composed of a query dimension and index rules; matching the index rules with the log data, and matching one or more fields in the log data; and generating the statistics indexes based on the index structure and data corresponding to each matched field.

Description

A kind of method and device for generating statistical index
Technical field
The present invention relates to technical field of internet application, more particularly to a kind of method and device for generating statistical index.
Background technology
PV (Page View) i.e. page browsing amount, typically weighs an Internet news channel or a website even net The leading indicator of network news.Web page browsing number is to evaluate one of the most frequently used index of website traffic, referred to as PV.Monitoring site PV Variation tendency and analyze its reason of changes be a lot of head of a station work periodically to be done.Page in PV generally refers to common Html webpage, the html content for also producing comprising the dynamic such as php, jsp.Html content requests from browser can be seen Make a PV, gradually add up to become PV sum.
Content of the invention
The present invention provides a kind of method and device for generating statistical index, in order to realize neatly carrying out page browsing amount The purpose of statistics.
The present invention provides a kind of method for generating statistical index, including:
When page browsing event is monitored, the corresponding daily record data of the page browsing event is generated;
Self-defining index structure is obtained, wherein, the index structure is at least made up of inquiry dimension, index rule;
Mated regular for the index with the daily record data, matched one or more in the daily record data Field;
Based on the index structure and the corresponding data of each field for matching, statistical index is generated.
In an embodiment of the present invention, the inquiry dimension includes:Inquiry field and the value of the inquiry field.
In an embodiment of the present invention, the daily record data includes at least one following field:
ID, user location, channel, time point, advertiser, IP address, the operation system of user place terminal System, operator, sex.
In an embodiment of the present invention, described index rule in comprising with the daily record data in field identical index Field.
In an embodiment of the present invention, based on the index structure and the corresponding data of each field for matching, generate Statistical index, including:
In the index structure, using corresponding for each field for matching data as the regular number of corresponding index According to the key of generation statistical index;
The value of record statistical items is used as the key assignments of the statistical index.
The present invention also provides a kind of device for generating statistical index, including:
Daily record generation module, for when page browsing event is monitored, generating the page browsing event corresponding day Will data;
Acquisition module, for obtaining self-defining index structure, wherein, the index structure is at least by inquiry dimension, rope Draw regular composition;
Matching module, for being mated regular for the index with the daily record data, matches the daily record data In one or more fields;
Index generation module, for based on the index structure and the corresponding data of each field for matching, generating system Meter index.
In an embodiment of the present invention, the inquiry dimension includes:Inquiry field and the value of the inquiry field.
In an embodiment of the present invention, the daily record data includes at least one following field:
ID, user location, channel, time point, advertiser, IP address, the operation system of user place terminal System, operator, sex.
In an embodiment of the present invention, described index rule in comprising with the daily record data in field identical index Field.
In an embodiment of the present invention, the index generation module is additionally operable to:
In the index structure, using corresponding for each field for matching data as the regular number of corresponding index According to the key of generation statistical index;
The value of record statistical items is used as the key assignments of the statistical index.
In embodiments of the present invention, when page browsing event is monitored, the page browsing event corresponding day is generated Will data;Self-defining index structure is obtained, wherein, the index structure is at least made up of inquiry dimension, index rule;By institute State index regular mated with the daily record data, match the one or more fields in the daily record data;Based on institute State index structure and the corresponding data of each field for matching, generate statistical index.As can be seen here, the embodiment of the present invention can Realize the purpose for neatly page browsing amount being counted.
Other features and advantages of the present invention will be illustrated in the following description, also, partly be become from description Obtain it is clear that or being understood by implementing the present invention.The purpose of the present invention and other advantages can pass through in the explanation that is write In book, claims and accompanying drawing, specifically noted structure is realizing and obtain.
Below by drawings and Examples, technical scheme is described in further detail.
Description of the drawings
Accompanying drawing is used for providing a further understanding of the present invention, and constitutes a part for description, the reality with the present invention Example is applied together for the present invention is explained, be not construed as limiting the invention.In the accompanying drawings:
The flow chart that Fig. 1 show in one embodiment of the invention the method for generating statistical index;
The flow chart that Fig. 2 show in another embodiment of the present invention the method for generating statistical index;And
Fig. 3 show in one embodiment of the invention the structural representation of the device for generating statistical index.
Specific embodiment
Below in conjunction with accompanying drawing, the preferred embodiments of the present invention are illustrated, it will be appreciated that preferred reality described herein Apply example to be merely to illustrate and the present invention is explained, be not intended to limit the present invention.
The flow chart of the method for generating statistical index is illustrated in figure 1 in one embodiment of the invention, and the method can include Following steps S11-S14.
Step S11, when page browsing event is monitored, generates the corresponding daily record data of page browsing event.
In this step, user browses a page, will trigger the corresponding daily record data of generation page browsing event.This In, the field in daily record data can be:ID, user location, channel, time point, advertiser, IP address, use The operating system of family place terminal, operator, sex, plan ID etc. is thrown in, the invention is not restricted to this.
Step S12, obtains self-defining index structure, and wherein, the index structure is at least by inquiry dimension, index rule sets Become.
In this step, inquiry dimension can include the value of inquiry field and the inquiry field.
Step S13, index rule is mated with daily record data, is matched the one or more words in daily record data Section.
Step S14, based on index structure and the corresponding data of each field for matching, generates statistical index.
In the step, the statistical index of generation can be as follows:
/ idx (representing this order for index)/ts (representing that key is based on seasonal effect in time series)/hh (united with hour by expression Meter)/daily record name/inquiry dimension (key, the field including inquiry and the value of field)/event/ index field (only one of which)/rope Draw the date that the value/daily record of field is generated.
Additionally, the statistical index for generating can also be as follows:
/ idx (representing this order for index)/ts (representing that key is based on seasonal effect in time series)/hh (united with hour by expression Meter)/event/unicode (carries out only/daily record name/inquiry dimension (field including inquiry and the value of field) to certain field One index, index word segment value is unique)/timezone (utc time zone)/timezone-area (one day, one week, one month)/rope Draw the date that the value/daily record of field/index field is generated.
In embodiments of the present invention, when page browsing event is monitored, the page browsing event corresponding day is generated Will data;Self-defining index structure is obtained, wherein, the index structure is at least made up of inquiry dimension, index rule;By institute State index regular mated with the daily record data, match the one or more fields in the daily record data;Based on institute State index structure and the corresponding data of each field for matching, generate statistical index.As can be seen here, the embodiment of the present invention can Realize the purpose for neatly page browsing amount being counted.
In the alternative embodiment of the present invention, index rule in comprising with daily record data in field identical index word Section.
Based on index structure and the corresponding data of each field for matching in above step S14, statistical index is generated, this Inventive embodiments provide a kind of optional scheme, and the program may comprise steps of A1-A2.
Step A1, in index structure, using corresponding for each field for matching data as corresponding index rule Data, generate the key of statistical index.
Step A2, records the key assignments of the value as statistical index of statistical items.
In embodiments of the present invention, the structural generation statistical index of key-value can be adopted, and wherein, key is statistics rope The key for drawing, value is the key assignments of statistical index.
Be discussed in detail below by a specific embodiment present invention generation statistical index method realize process.Such as The flow chart that Fig. 2 show in another embodiment of the present invention the method for generating statistical index, the method may comprise steps of S21-S25.
Step S21, when page browsing event is monitored, generates the corresponding daily record data of page browsing event.
Step S22, obtains self-defining index structure, and wherein, the index structure is at least by inquiry dimension, index rule sets Become.
In this step, inquiry dimension can include the value of inquiry field and the inquiry field.
Step S23, index rule is mated with daily record data, is matched the one or more words in daily record data Section.
Step S24, in index structure, using corresponding for each field for matching data as corresponding index rule Data, generate the key of statistical index.
Step S25, records the key assignments of the value as statistical index of statistical items.
In the step, the statistical index of generation can be as follows:
/ idx (representing this order for index)/ts (representing that key is based on seasonal effect in time series)/hh (united with hour by expression Meter)/daily record name/inquiry dimension (key, the field including inquiry and the value of field)/event/ index field (only one of which)/rope Draw the date that the value/daily record of field is generated.
Additionally, the statistical index for generating can also be as follows:
/ idx (representing this order for index)/ts (representing that key is based on seasonal effect in time series)/hh (united with hour by expression Meter)/event/unicode (carries out only/daily record name/inquiry dimension (field including inquiry and the value of field) to certain field One index, index word segment value is unique)/timezone (utc time zone)/timezone-area (one day, one week, one month)/rope Draw the date that the value/daily record of field/index field is generated.
In the alternative embodiment of the present invention, generated after statistical index according to daily record data and index structure, the head of a station is permissible Inquiry request is initiated, querying command is generated, querying command is mated with index, obtains statistical result.That is, can be with inquiry Several parameters, such as instance name, filtering rule field, index field, index word segment value, statistical etc., be exactly in a word above The ingredient (except last time series) of key, then combination producing key prefix, holds key prefix and goes data base's Scanning, you can obtain statistical data.
It should be noted that in actual applications, above-mentioned all optional embodiments can be any in the way of using combining Combination, forms the alternative embodiment of the present invention, and this is no longer going to repeat them.
Corresponding to the method for the generation statistical index in above-described embodiment, the present invention also provides a kind of generation statistical index Device.The structural representation of the device for generating statistical index is illustrated in figure 3 in one embodiment of the invention, including:
Daily record generation module 31, for when page browsing event is monitored, generating the page browsing event corresponding Daily record data;
Acquisition module 32, for obtaining self-defining index structure, wherein, the index structure at least by inquiry dimension, Index rule composition;
Matching module 33, for being mated regular for the index with the daily record data, matches the daily record number According in one or more fields;
Index generation module 34, for based on the index structure and the corresponding data of each field for matching, generating Statistical index.
In an embodiment of the present invention, the inquiry dimension includes:Inquiry field and the value of the inquiry field.
In an embodiment of the present invention, the daily record data includes at least one following field:
ID, user location, channel, time point, advertiser, IP address, the operation system of user place terminal System, operator, sex.
In an embodiment of the present invention, described index rule in comprising with the daily record data in field identical index Field.
In an embodiment of the present invention, index generation module 34 is additionally operable to:
In the index structure, using corresponding for each field for matching data as the regular number of corresponding index According to the key of generation statistical index;
The value of record statistical items is used as the key assignments of the statistical index.
The said apparatus of the embodiment of the present invention:When page browsing event is monitored, the page browsing event pair is generated The daily record data that answers;Self-defining index structure is obtained, wherein, the index structure is at least by inquiry dimension, index rule sets Become;Mated regular for the index with the daily record data, matched the one or more fields in the daily record data; Based on the index structure and the corresponding data of each field for matching, statistical index is generated.As can be seen here, the present invention is implemented Example can realize the purpose for neatly page browsing amount being counted.
Those skilled in the art are it should be appreciated that embodiments of the invention can be provided as method, system or computer program Product.Therefore, the present invention can adopt complete hardware embodiment, complete software embodiment or with reference to software and hardware in terms of reality Apply the form of example.And, the present invention can be adopted in one or more computers for wherein including computer usable program code The shape of the upper computer program that implements of usable storage medium (including but not limited to disk memory and optical memory etc.) Formula.
The present invention is the flow process with reference to method according to embodiments of the present invention, equipment (system) and computer program Figure and/or block diagram are describing.It should be understood that can be by computer program instructions flowchart and/or each stream in block diagram Journey and/or the combination of square frame and flow chart and/or the flow process in block diagram and/or square frame.These computer programs can be provided The processor of general purpose computer, special-purpose computer, Embedded Processor or other programmable data processing device is instructed to produce A raw machine so that produced for reality by the instruction of computer or the computing device of other programmable data processing device The device of the function of specifying in present one flow process of flow chart or multiple flow processs and/or one square frame of block diagram or multiple square frames.
These computer program instructions may be alternatively stored in and can guide computer or other programmable data processing device with spy Determine in the computer-readable memory that mode works so that the instruction being stored in the computer-readable memory is produced to be included to refer to Make the manufacture of device, the command device realize in one flow process of flow chart or multiple flow processs and/or one square frame of block diagram or The function of specifying in multiple square frames.
These computer program instructions can be also loaded in computer or other programmable data processing device so that in meter Series of operation steps is executed on calculation machine or other programmable devices to produce computer implemented process, so as in computer or The instruction for executing on other programmable devices is provided for realizing in one flow process of flow chart or multiple flow processs and/or block diagram one The step of function of specifying in individual square frame or multiple square frames.
Obviously, those skilled in the art can carry out the essence of various changes and modification without deviating from the present invention to the present invention God and scope.So, if these modifications of the present invention and modification belong to the scope of the claims in the present invention and its equivalent technologies Within, then the present invention is also intended to comprising these changes and modification.

Claims (10)

1. a kind of generate statistical index method, it is characterised in that include:
When page browsing event is monitored, the corresponding daily record data of the page browsing event is generated;
Self-defining index structure is obtained, wherein, the index structure is at least made up of inquiry dimension, index rule;
Mated regular for the index with the daily record data, matched the one or more words in the daily record data Section;
Based on the index structure and the corresponding data of each field for matching, statistical index is generated.
2. method according to claim 1, it is characterised in that the inquiry dimension includes:Inquiry field and the inquiry The value of field.
3. method according to claim 1 and 2, it is characterised in that the daily record data includes at least one following Field:
ID, user location, channel, time point, advertiser, IP address, the operating system of user place terminal, fortune Battalion business, sex.
4. method according to claim 3, it is characterised in that in the index rule comprising with the daily record data in Field identical index field.
5. method according to claim 1, it is characterised in that based on the index structure and each field pair for matching The data that answers, generate statistical index, including:
In the index structure, using corresponding for each field for matching data as the regular data of corresponding index, raw Become the key of statistical index;
The value of record statistical items is used as the key assignments of the statistical index.
6. a kind of generate statistical index device, it is characterised in that include:
Daily record generation module, for when page browsing event is monitored, generating the corresponding daily record number of the page browsing event According to;
Acquisition module, for obtaining self-defining index structure, wherein, the index structure is at least by inquiry dimension, index rule Then constitute;
Matching module, for being mated regular for the index with the daily record data, is matched in the daily record data One or more fields;
Index generation module, for based on the index structure and the corresponding data of each field for matching, generating statistics rope Draw.
7. device according to claim 6, it is characterised in that the inquiry dimension includes:Inquiry field and the inquiry The value of field.
8. the device according to claim 6 or 7, it is characterised in that the daily record data includes at least one following Field:
ID, user location, channel, time point, advertiser, IP address, the operating system of user place terminal, fortune Battalion business, sex.
9. device according to claim 8, it is characterised in that in the index rule comprising with the daily record data in Field identical index field.
10. device according to claim 6, it is characterised in that the index generation module is additionally operable to:
In the index structure, using corresponding for each field for matching data as the regular data of corresponding index, raw Become the key of statistical index;
The value of record statistical items is used as the key assignments of the statistical index.
CN201610852083.6A 2016-09-26 2016-09-26 Method and device for generating statistics indexes Pending CN106446171A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610852083.6A CN106446171A (en) 2016-09-26 2016-09-26 Method and device for generating statistics indexes

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610852083.6A CN106446171A (en) 2016-09-26 2016-09-26 Method and device for generating statistics indexes

Publications (1)

Publication Number Publication Date
CN106446171A true CN106446171A (en) 2017-02-22

Family

ID=58170270

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610852083.6A Pending CN106446171A (en) 2016-09-26 2016-09-26 Method and device for generating statistics indexes

Country Status (1)

Country Link
CN (1) CN106446171A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109034894A (en) * 2018-07-20 2018-12-18 武汉斗鱼网络科技有限公司 Advertisement page pageview statistical method, device, electronic equipment and storage medium
CN110019200A (en) * 2017-09-30 2019-07-16 阿里巴巴集团控股有限公司 A kind of foundation of index, application method and device
CN110673965A (en) * 2019-09-11 2020-01-10 杭州火小二科技有限公司 Method and device for capturing second-killing commodities

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080049641A1 (en) * 2006-08-22 2008-02-28 Edwards Stephen K System and method for displaying a graph representative of network performance over a time period
CN101271562A (en) * 2008-05-12 2008-09-24 腾讯科技(深圳)有限公司 Collection processing method and system for network advertisement operation event information
CN102298551A (en) * 2010-06-23 2011-12-28 中兴通讯股份有限公司 Variable-rule method and system for counting real-time performance based on events
CN104346458A (en) * 2014-10-31 2015-02-11 易准科技发展(上海)有限公司 Data storage method and device

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080049641A1 (en) * 2006-08-22 2008-02-28 Edwards Stephen K System and method for displaying a graph representative of network performance over a time period
CN101271562A (en) * 2008-05-12 2008-09-24 腾讯科技(深圳)有限公司 Collection processing method and system for network advertisement operation event information
CN102298551A (en) * 2010-06-23 2011-12-28 中兴通讯股份有限公司 Variable-rule method and system for counting real-time performance based on events
CN104346458A (en) * 2014-10-31 2015-02-11 易准科技发展(上海)有限公司 Data storage method and device

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110019200A (en) * 2017-09-30 2019-07-16 阿里巴巴集团控股有限公司 A kind of foundation of index, application method and device
CN110019200B (en) * 2017-09-30 2023-05-09 阿里巴巴集团控股有限公司 Index establishing and using method and device
CN109034894A (en) * 2018-07-20 2018-12-18 武汉斗鱼网络科技有限公司 Advertisement page pageview statistical method, device, electronic equipment and storage medium
CN109034894B (en) * 2018-07-20 2021-07-23 武汉斗鱼网络科技有限公司 Advertisement page browsing amount statistical method and device, electronic equipment and storage medium
CN110673965A (en) * 2019-09-11 2020-01-10 杭州火小二科技有限公司 Method and device for capturing second-killing commodities

Similar Documents

Publication Publication Date Title
US11444856B2 (en) Systems and methods for configuring a resource for network traffic analysis
CN104268082B (en) The method for testing pressure and device of browser
CN107733972A (en) A kind of short linking analytic method, device and equipment
CN105589956B (en) A kind of method and device of user's portrait
CN103218412B (en) Public feelings information processing method and device
US9607081B2 (en) Ontology based categorization of users
CN106446171A (en) Method and device for generating statistics indexes
EP3295409A1 (en) Systems and methods for providing call context to content providers
WO2013106595A2 (en) Processing store visiting data
DE112015006537T5 (en) DEVICE-BASED FILTRATION OF CONTENT ELEMENTS ASSOCIATED WITH MOBILE APPLICATIONS
CN103729380A (en) Data processing method, system and device
CN106878397A (en) A kind of WEB user behaviors feedback method and system
CN109977312A (en) A kind of knowledge base recommender system based on content tab
CN103870452A (en) Method and method for recommending data
JP5988345B1 (en) Evaluation device, evaluation method, evaluation program, recommendation device, recommendation method, and recommendation program
CN106484819A (en) A kind of method and device of counting user amount
WO2015013718A1 (en) Curating chat transcripts into webpages
CN106202357A (en) A kind of website browsing data analysing method and device
CN105450462B (en) The monitoring method and system of presence
CN101261643A (en) Website page information statistical method and apparatus
CN111064996B (en) Method, system and storage medium for identifying user watching video content preference
CN110019786A (en) Topic sending method, the topic list ordering method and device of Web Community
CN103227791A (en) Method and device for wireless data collection
CN108229993B (en) Data processing method and device
US9367583B1 (en) Systems and methods of generating content performance metrics

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20170222