CN103414608B - Rapid web flow collection statistical system and method - Google Patents

Rapid web flow collection statistical system and method Download PDF

Info

Publication number
CN103414608B
CN103414608B CN201310357520.3A CN201310357520A CN103414608B CN 103414608 B CN103414608 B CN 103414608B CN 201310357520 A CN201310357520 A CN 201310357520A CN 103414608 B CN103414608 B CN 103414608B
Authority
CN
China
Prior art keywords
flow
data
node
service
service traffics
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201310357520.3A
Other languages
Chinese (zh)
Other versions
CN103414608A (en
Inventor
洪珂
邹宁勇
张芽
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Wangsu Science and Technology Co Ltd
Original Assignee
Wangsu Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Wangsu Science and Technology Co Ltd filed Critical Wangsu Science and Technology Co Ltd
Priority to CN201310357520.3A priority Critical patent/CN103414608B/en
Publication of CN103414608A publication Critical patent/CN103414608A/en
Application granted granted Critical
Publication of CN103414608B publication Critical patent/CN103414608B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Data Exchanges In Wide-Area Networks (AREA)

Abstract

The invention discloses a rapid web flow collection statistical system and method. Based on processing on one-machine web server real-time statistical service flow, the web flow real-time statistical aim is achieved with less resource cost. According to the technical scheme, the rapid web flow collection statistical system comprises a flow collection analysis device, a flow transferring merging device and a flow final-merging device. The flow collection analysis device is deployed on a content distribution network node and collects and analyzes a flow log in real time, service flow data for generating a single node are counted, and the service flow data of the single node are sent to the flow transferring merging device. The flow transferring merging device is deployed on a flow transferring merging node, merging counting is carried out on the service flow data of the content distribution network node, middle flow data of a service are generated in real time, and then the middle flow data are transmitted to the flow final-merging device. The flow final-merging device is deployed on a flow final-merging node, merging counting is carried out on the middle flow data of the service in real time, and final flow data of the service are generated.

Description

Quick web flow collection statistical system and method
Technical field
The present invention relates to web flow amount statistical technique, more particularly to content distributing network(CDN)The web flow amount of service provider Collecting statistical system and method.
Background technology
CDN service business typically can dispose large-scale CDN to accelerate client to provide the service that effectively accelerate to CDN Node, these coverage wide ranges, scope even covering the whole world, and for monitor in real time client and node application service Various datas on flows change, need rapidly to be acquired statistics to the web flow amount on these nodes.
Existing web flow amount statistical technique, the traffic statistics due to being related to service layer will for example count client's domain name Traffic related data, be to count the flow number of client and node application service by collecting and analyzing web access logs According to.
This traditional web flow amount statistical technique, the web access logs being distributed across on each node to be processed, day The scale of will amount is directly proportional with the scale of web visit capacities, and the scale of the daily web access logs amount of CDN service business typically all reaches TB ranks even PB ranks, i.e. massive logs, it may be said that this technology is suffered from the drawback that:
1) in order to process magnanimity web access log, and these daily records or scattered, need to consume substantial amounts of resource, Including the consumption of bandwidth for transmission and machinery equipment.
2) statistical data delay is larger, because the time of collection, process and analysis web access logs is long, and due to frame Level on structure is more, even if increasing resource, it is also difficult to realize real-time statistics.
The content of the invention
It is an object of the invention to solve the above problems, there is provided a kind of quick web flow collection statistical system and side Method, is the process carried out based on unit web server real-time statistics service traffics, and with less Resources Consumption web flow amount is realized The problem of real-time statistics.
The technical scheme is that:Present invention is disclosed a kind of quick web flow collection statistical system, including flow Acquisition and analysis device, flow transfer merge device and flow finally merges device, wherein:
Flow collection analytical equipment, is deployed on content delivery network node, Real-time Collection analysis traffic log, statistics life Into the service traffics data of single-point, the service traffics data is activation of single-point to flow transfer is merged into device;
Flow transfer merges device, is deployed in flow transfer merge node, by the business on content delivery network node Data on flows merges statistics, the middle data on flows of business is generated in real time, then is transferred to flow and finally merge device;
Flow finally merges device, is deployed in the final merge node of flow, and the middle flow of statistical service is merged in real time Data, generate the final data on flows of business.
One embodiment of quick web flow collection statistical system of the invention, flow transfer merge node is root Choose according to the Node distribution scope of content distributing network.
One embodiment of quick web flow collection statistical system of the invention, the final merge node of flow is root The coverage of flow transfer merge node is chosen according to selected node.
One embodiment of quick web flow collection statistical system of the invention, flow transfer merge node according to The node scale of content distributing network carries out extending transversely.
One embodiment of quick web flow collection statistical system of the invention, data on flows include the Connection Time, Number of request and request method.
Present invention further teaches a kind of quick web flow amount gathers statistical method, on single content delivery network node Implement, including:
Loading business statistics rule;
Obtain untreated traffic log queue;
Judge that whether traffic log queue is empty, previous step is then returned if it is empty, one is then taken out from queue if not empty Individual traffic log;
Obtain the corresponding time point of the traffic log;
Parse the traffic log;
According to the service traffics data of the business statistics rule-statistical time point;
The traffic log is deleted after the service traffics data for exporting the time point.
Quick web flow amount of the invention gathers an embodiment of statistical method, parses in traffic log name To the corresponding time point of the traffic log.
Present invention contrast prior art has following beneficial effect:The present invention system include flow collection analytical equipment, Flow transfer merges device and flow finally merges device, by arranging an extendible middle merging layer, by network Merge machine in the middle of detection random selection preferably, the service traffics data transfer of web server is gone out, due to service traffics Data are simultaneously little, and network routing has been done again, can substantially accomplish real-time Transmission.
Description of the drawings
Fig. 1 shows the structure chart of the preferred embodiment of the quick web flow collection statistical system of the present invention.
Fig. 2 shows the data flow diagram of the quick web flow collection statistical system of the present invention.
Fig. 3 shows the flow chart of the preferred embodiment of the quick web flow amount collection statistical method of the present invention.
Specific embodiment
With reference to the accompanying drawings and examples the invention will be further described.
The embodiment of quick web flow collection statistical system
Fig. 1 shows the structure of the preferred embodiment of the quick web flow collection statistical system of the present invention.Refer to figure 1, the system of the present embodiment includes that flow collection analytical equipment 1, flow transfer merge device 2 and flow finally merges device 3.
Flow collection analytical equipment 1 is deployed on content delivery network node, on a web server Real-time Collection analysis stream Amount daily record, the essential information that the daily record only needs comprising service traffics data statisticss, and format(Web server needs According to this journal format record traffic log)As long as doing simple dissection process, it is possible to quick obtaining service traffics data, The traffic log that flow collection analytical equipment 1 is collected is the lattice of lightweight for general web access logs The data of formula, this part works for the analyzing and processing of general web access logs, and workload is considerably less, substantially The performance of web server is not affected.The statistics of flow analysis device 1 generates the service traffics data of single-point, by the Business Stream of single-point Amount data is activation to flow transfer merges device 2.Here data on flows is not sense stricto data on flows, contains diversification Data, such as including Connection Time, number of request and request method etc..
Certain Centroid is transferred to after the service traffics data of statistics generation unit and merges statistics, due to CDN sections Point machine is very more, usually thousand ranks or into ten thousand ranks, disperses in different geographical locations, to be here provided with one Individual extendible middle merging layer, by merging machine in the middle of network detection random selection preferably, the business of web server Data on flows is transferred out, and due to service traffics data and less, network routing has been done again, is essentially ensures that the real-time biography of data The load balancing of defeated and flow transfer merge node.
Flow transfer merges device 2 and is deployed in flow transfer merge node, will be interior according to the business statistics rule of configuration Service traffics data on content distributing network node merge statistics, and the middle data on flows of business is generated in real time, then are transferred to stream Amount is final to merge device 3.Chosen according to the Node distribution scope of content distributing network in the present embodiment, that is, inquire after network choosing Preferably flow transfer merge node is taken, the network coverage of general selected node is preferable.Flow transfer merge node assume responsibility for industry The part of business data on flows merges statistical work, while also serving as content delivery network node between the final merge node of flow One terminal of network, and can be carried out according to the node scale of content distributing network extending transversely.
Flow finally merges device 3 and is deployed in the final merge node of flow, real-time according to the business statistics rule of configuration Merge the middle data on flows of statistical service, generate the final data on flows of business.The final merge node of flow is according to selected Node is chosen to the coverage of flow transfer merge node.
Fig. 2 shows the data flow of web flow collection statistical system.Fig. 2 is referred to, flow collection analytical equipment 1 is real-time Collection and analysis traffic log, and the single-point service traffics data is activation for generating is merged into device 2 to flow transfer.Flow transfer Merge device 2 and analyze each node traffic data on flows in real time, flow data is activation in the middle of the business of generation is finally closed to flow And device 3.
Quickly web flow amount gathers the embodiment of statistical method
Fig. 3 shows the flow process of the preferred embodiment of the quick web flow amount collection statistical method of the present invention.Refer to figure 3, details are as follows for the implementation steps of the method for the present embodiment.
Step S10:Loading business statistics rule.
Which field is business statistics rule mainly configuration object of statistics will count, such as using domain name as object of statistics, Each domain name flow per minute and number of request are counted, number of request of each domain name each conditional code per minute etc. is counted.
Step S11:Obtain untreated traffic log queue.
Once traffic log catalogue has newly-increased traffic log, flow collection analytical equipment then scans the untreated stream of generation Amount journal queue.
Step S12:Judge that whether traffic log queue is empty, if it is empty then return to step S11, then enters if not empty step Rapid S13.
Step S13:A traffic log is taken out from queue.
Step S14:Obtain the corresponding time point of the traffic log.
The parsing from traffic log name obtains the corresponding time point of the traffic log, and service traffics data statisticss are typically minimum In units of one minute, so traffic log was recorded by minute.
Step S15:Parse the traffic log.
Flow collection analytical equipment is analyzed process to traffic log by predefined form, obtains the number of each field According to.
Step S16:According to the service traffics data of the business statistics rule-statistical time point.
Flow collection analytical equipment counts the time point service traffics data according to business statistics rule, such as statistics should The flow and number of request of each domain name of time point.
Step S17:Export the service traffics data of the time point.
Flow collection analytical equipment is by the service traffics data persistence of the time point to hard disk.
Step S18:Delete the traffic log.
Flow collection analytical equipment is analyzing and processing the traffic log, then the traffic log is deleted, subsequently into step S12。
Above-described embodiment is available to those of ordinary skill in the art to realize and use the present invention, the common skill in this area Art personnel can make various modifications or change to above-described embodiment without departing from the present invention in the case of the inventive idea, thus Protection scope of the present invention is not limited by above-described embodiment, and should meet the inventive features that claims are previously mentioned Maximum magnitude.

Claims (3)

1. a kind of quick web flow collection statistical system, including flow collection analytical equipment, flow transfer merge device and Flow finally merges device, wherein:
Flow collection analytical equipment, is deployed on content delivery network node, and Real-time Collection analysis only includes service traffics data The essential information and formatted traffic log of statistics needs, statistics generates the service traffics data of single-point, by single-point Service traffics data is activation to flow transfer merges device, and the service traffics data of wherein single-point contain Connection Time, request The data of the diversification of book and request method;
Flow transfer merges device, the flow transfer merge node being deployed according to selected by the range of nodes of content distributing network On, the service traffics data on content delivery network node are merged into statistics, the middle data on flows of business is generated in real time, then pass It is defeated finally to merge device to flow, and flow transfer merge node laterally expanded according to the node scale of content distributing network Exhibition;
Flow finally merges device, is deployed in the final merge node of flow, and the middle data on flows of statistical service is merged in real time, The final data on flows of generation business.
2. quick web flow collection statistical system according to claim 1, it is characterised in that flow finally merges section Point is chosen according to the coverage of flow transfer merge node.
3. a kind of quick web flow amount gathers statistical method, including:
The flow collection analytical equipment Real-time Collection analysis on content delivery network node is deployed in only comprising service traffics data The essential information and formatted traffic log of statistics needs, statistics generates the service traffics data of single-point, by single-point Service traffics data is activation to flow transfer merges device, and the service traffics data of wherein single-point contain Connection Time, request The data of the diversification of book and request method;
Flow transfer merging device is deployed in the flow transfer merge node according to selected by the range of nodes of content distributing network On, the service traffics data on content delivery network node are merged into statistics, the middle data on flows of business is generated in real time, then pass It is defeated finally to merge device to flow, and flow transfer merge node laterally expanded according to the node scale of content distributing network Exhibition;
The flow being deployed in the final merge node of flow finally merges the middle data on flows that device merges in real time statistical service, The final data on flows of generation business.
CN201310357520.3A 2013-08-15 2013-08-15 Rapid web flow collection statistical system and method Active CN103414608B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310357520.3A CN103414608B (en) 2013-08-15 2013-08-15 Rapid web flow collection statistical system and method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310357520.3A CN103414608B (en) 2013-08-15 2013-08-15 Rapid web flow collection statistical system and method

Publications (2)

Publication Number Publication Date
CN103414608A CN103414608A (en) 2013-11-27
CN103414608B true CN103414608B (en) 2017-05-17

Family

ID=49607594

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310357520.3A Active CN103414608B (en) 2013-08-15 2013-08-15 Rapid web flow collection statistical system and method

Country Status (1)

Country Link
CN (1) CN103414608B (en)

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103905428B (en) * 2014-01-28 2017-06-23 北京奇虎科技有限公司 The method and system of visiting information in a kind of network
CN104202759A (en) * 2014-09-05 2014-12-10 上海斐讯数据通信技术有限公司 WIFI (Wireless Fidelity) traffic statistical method
CN105791247B (en) * 2014-12-25 2019-02-05 中国移动通信集团公司 A kind of flow system pair service implementation method and relevant device and system
CN106027272A (en) * 2016-04-26 2016-10-12 乐视控股(北京)有限公司 CDN (Content Delivery Network) node server traffic time deduction method and system
CN105791150B (en) * 2016-05-09 2018-11-30 中国联合网络通信集团有限公司 CDN network dispositions method and system
CN106302020B (en) * 2016-08-18 2019-08-16 上海帝联信息科技股份有限公司 Network bandwidth statistical method and device
CN106656616A (en) * 2016-12-29 2017-05-10 北京天元创新科技有限公司 Whole network flow analysis method of computer network
CN109561051A (en) * 2017-09-26 2019-04-02 中兴通讯股份有限公司 Content distributing network safety detection method and system
CN107707414A (en) * 2017-11-22 2018-02-16 北京搜狐新媒体信息技术有限公司 The monitoring system and method for CDN
CN110401657B (en) * 2019-07-24 2020-09-25 网宿科技股份有限公司 Processing method and device for access log

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20060012134A (en) * 2004-08-02 2006-02-07 주식회사 케이티 Realtime service management system for enterprise and a method thereof
CN1925423A (en) * 2005-08-30 2007-03-07 飞塔信息科技(北京)有限公司 Log device, system and method with function of analyzing network traffic
CN201018524Y (en) * 2007-03-26 2008-02-06 南京邮电大学 Mobile communications network based flux remote monitoring system
CN101159636A (en) * 2007-11-23 2008-04-09 中国电信股份有限公司 System and method for detecting illegal access

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102571856B (en) * 2010-12-17 2015-04-22 中国移动通信集团公司 Method, device and system for selecting transition node

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20060012134A (en) * 2004-08-02 2006-02-07 주식회사 케이티 Realtime service management system for enterprise and a method thereof
CN1925423A (en) * 2005-08-30 2007-03-07 飞塔信息科技(北京)有限公司 Log device, system and method with function of analyzing network traffic
CN201018524Y (en) * 2007-03-26 2008-02-06 南京邮电大学 Mobile communications network based flux remote monitoring system
CN101159636A (en) * 2007-11-23 2008-04-09 中国电信股份有限公司 System and method for detecting illegal access

Also Published As

Publication number Publication date
CN103414608A (en) 2013-11-27

Similar Documents

Publication Publication Date Title
CN103414608B (en) Rapid web flow collection statistical system and method
CN105490854B (en) Real-time logs collection method, system and application server cluster
CN105357054B (en) Website traffic analysis method, device and electronic equipment
CN104488231B (en) Method, apparatus and system for selectively monitoring flow
CN104778188B (en) A kind of distributed apparatus log collection method
US9531620B2 (en) Control plane packet traffic statistics
CN104424229B (en) A kind of calculation method and system that various dimensions are split
WO2017198227A1 (en) Interactive internet protocol television system and real-time acquisition method for user data
CN103870297B (en) The performance data collection system and method for virtual machine in cloud computing environment
EP2544408A1 (en) Parallel processing for multiple instance real-time monitoring
CN109962790A (en) A kind of network quality monitoring method, device, electronic equipment and storage medium
KR20170106648A (en) High-capacity network data processing techniques
US9729563B2 (en) Data transfer for network interaction fraudulence detection
CN103617287A (en) Log management method and device in distributed environment
CN104753732A (en) Distribution based network traffic analysis system and method
CN104504006B (en) The method and system of data acquisition and parsing to news client
CN103166980B (en) Internet data pulls method and system
CN108900374A (en) A kind of data processing method and device applied to DPI equipment
CN107147535A (en) A kind of distributed network measurement data statistical analysis technique
CN106656616A (en) Whole network flow analysis method of computer network
CN107579874A (en) The method and device that a kind of detection flows collecting device data acquisition is failed to report
CN107332685A (en) A kind of method based on big data O&M daily record applied in state's net cloud
CN104486116A (en) Multidimensional query method and multidimensional query system of flow data
CN111459986A (en) Data computing system and method
CN102664789A (en) Method and system for processing large-scale data

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant