CN103488774A - Processing method for big data log analysis - Google Patents

Processing method for big data log analysis Download PDF

Info

Publication number
CN103488774A
CN103488774A CN201310453556.1A CN201310453556A CN103488774A CN 103488774 A CN103488774 A CN 103488774A CN 201310453556 A CN201310453556 A CN 201310453556A CN 103488774 A CN103488774 A CN 103488774A
Authority
CN
China
Prior art keywords
task
data
tasks
analysis
query
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201310453556.1A
Other languages
Chinese (zh)
Inventor
曾敬鸿
邓翔宇
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
GUIZHOU BROADCASTING&TV INFORMATION NETWORK Co Ltd
Original Assignee
GUIZHOU BROADCASTING&TV INFORMATION NETWORK Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by GUIZHOU BROADCASTING&TV INFORMATION NETWORK Co Ltd filed Critical GUIZHOU BROADCASTING&TV INFORMATION NETWORK Co Ltd
Priority to CN201310453556.1A priority Critical patent/CN103488774A/en
Publication of CN103488774A publication Critical patent/CN103488774A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2455Query execution
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/34Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation ; Recording or statistical evaluation of user activity, e.g. usability assessment
    • G06F11/3466Performance evaluation by tracing or monitoring
    • G06F11/3476Data logging

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a processing method for big data log analysis. The method includes the steps that first, a user inputs a user identifier, a service identifier, a query time point and an extension time period through a foreground page; second, a background system acquires the parameter information input through the foreground page and decomposes tasks with fixed time intervals as unit; third, each decomposed task is compared with completed tasks stored in a database, and the completed tasks are filtered out; fourth, the uncompleted tasks after comparison are issued to all client-sides through a server-side; fifth, an inquiry program is started after all the client-sides receive the tasks, and inquiry result data are reported to a cache; six, the inquiry result data are sent to the foreground page to be read by the user after all the inquiry tasks are completed. The processing method for big data log analysis solves the problem that because a traditional data analysis method adopted for log analysis of data of a broadcasting and TV service system has no universality, complex algorithms can exist, and an analysis result acquired on each time can not provide available data for analysis at later stages.

Description

A kind of disposal route of analyzing for large data logging
Technical field
The invention belongs to broadcasting and TV service monitoring system technology, relate in particular to a kind of disposal route of analyzing for large data logging.
Background technology
Sharply increase along with broadcasting and TV business system user quantity, system occurs that wrong and location mistake becomes increasingly complex, conventional location mistake is generally to go to location by the profiling error daily record, it is generally the time according to customer complaint generation problem, then find journal file from the journal file of magnanimity, and then the abnormal log that finds the report user to go wrong from journal file, but because the user is many, concurrent large, system all generates a large amount of daily records all the time, location and the reparation of the problem of giving have brought very large difficulty, thereby traditional data analysis generally reaches the purpose of accelerating efficiency by specific algorithm optimization for specific data structure, do not possess versatility and the algorithm of more complicated may occur, and each analysis result can not provide available data for post analysis, the while treatment effeciency is low etc.
Summary of the invention
The technical problem to be solved in the present invention: a kind of disposal route of analyzing for large data logging is provided, thereby adopt traditional data analysis generally for specific data structure, by specific algorithm optimization, to reach the purpose of accelerating efficiency to solve broadcasting and TV business system data log analysis, do not possess versatility and the algorithm of more complicated may occur, and each analysis result can not provide available data and the problem such as query processing efficiency is low for post analysis.
Technical solution of the present invention:
A kind of disposal route of analyzing for large data logging, it comprises the steps:
Step 1, user input user's sign, business sign, query time point and expansion time section by front page layout;
Step 2, background system obtain front page layout input parameter information, are spaced apart the Partition of Unity task with regular time;
The completed task of storing in step 3, each task after decomposing and database is compared, and filters out completed task;
Step 4, the uncompleted task afterwards of comparing are issued to each client by server end;
Step 5, each client, after task, start polling routine, and the Query Result data upload is arrived to buffer memory, and announcement server end corresponding task completes;
After step 6, all query tasks complete, server end obtains the Query Result data from buffer memory and delivers to database, and the Query Result data are delivered to front page layout and read for the user.
Step 2 is described with the regular time interval, and its time is spaced apart 20 minutes.
In step 5, be reported to the Query Result data of buffer memory after all query tasks complete, the unified server end of delivering to is stored.
Beneficial effect of the present invention:
By the present invention, take the time as unit carries out the task cutting, be issued to each client Agent after comparing by the multithread analyzing daily record, accelerate log analysis efficiency; By the task cutting, shorten the time cycle of each task, improve success ratio, the data that analysis completes can be to analyze next time provides data available, the task cutting time interval is made as 20 minutes, after considering that the client finds fault, the reaction time of reporting fault is generally about 20 minutes, therefore the time interval is made as 20 minutes, to improve search efficiency, reduce client's stand-by period, the result data in buffering is the unified server of delivering to after all query tasks complete, be in order to reduce the burden of server, accelerate the treatment effeciency in processing procedure; Thereby the invention solves broadcasting and TV business system data log analysis adopts traditional data analysis generally for specific data structure, by specific algorithm optimization, to reach the purpose of accelerating efficiency, do not possess versatility and the algorithm of more complicated may occur, and each analysis result can not provide data available and the problem such as query processing efficiency is low for post analysis.
embodiment:
A kind of disposal route of analyzing for large data logging, it comprises the steps:
Step 1, user input user's sign, business sign, query time point and expansion time section by front page layout; Be generally that to take the number of minutes of time point and expansion time section be that 3 integers are optional value: 20 minutes, 40 minutes, 1 hour, number of seconds was 0, and this is mainly after considering that the rating end subscriber is found fault, the reaction time of reporting fault.
Step 2, background system obtain front page layout input parameter information, are spaced apart the Partition of Unity task with regular time; Its time is spaced apart 20 minutes optimums, also can arbitrarily set interval.
The completed task of storing in each task after step 3, background system will decompose and database is compared, and filters out completed task; Having completed of task does not need to repeat inquiry, and the result that directly output is stored in server gets final product.
Step 4, the uncompleted task afterwards of comparing are issued to each client by server end;
Step 5, each client, after task, start polling routine, and the Query Result data are reported to buffer memory, kept in, and this task of announcement server complete;
Step 6, by heartbeat, judge, after all query tasks complete, server end obtains the Query Result data from buffer memory and delivers to database, and the Query Result data are delivered to front page layout and read for the user.

Claims (3)

1. a disposal route of analyzing for large data logging, it comprises the steps:
Step 1, user input user's sign, business sign, query time point and expansion time section by front page layout;
Step 2, background system obtain front page layout input parameter information, are spaced apart the Partition of Unity task with regular time;
The completed task of storing in step 3, each task after decomposing and database is compared, and filters out completed task;
Step 4, the uncompleted task afterwards of comparing are issued to each client by server end;
Step 5, each client, after task, start polling routine, and the Query Result data upload is arrived to buffer memory, and announcement server end corresponding task completes;
After step 6, all query tasks complete, server end obtains the Query Result data from buffer memory and delivers to database, and the Query Result data are delivered to front page layout and read for the user.
2. according to claim 1 a kind of for the disposal route that data logging is analyzed greatly, it is characterized in that: step 2 is described with the regular time interval, and its time is spaced apart 20 minutes.
3. according to claim 1 a kind of for the disposal route that data logging is analyzed greatly, it is characterized in that: in step 5, be reported to the Query Result data of buffer memory after all query tasks complete, the unified server end of delivering to is stored.
CN201310453556.1A 2013-09-29 2013-09-29 Processing method for big data log analysis Pending CN103488774A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310453556.1A CN103488774A (en) 2013-09-29 2013-09-29 Processing method for big data log analysis

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310453556.1A CN103488774A (en) 2013-09-29 2013-09-29 Processing method for big data log analysis

Publications (1)

Publication Number Publication Date
CN103488774A true CN103488774A (en) 2014-01-01

Family

ID=49829000

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310453556.1A Pending CN103488774A (en) 2013-09-29 2013-09-29 Processing method for big data log analysis

Country Status (1)

Country Link
CN (1) CN103488774A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108847981A (en) * 2018-06-26 2018-11-20 咸宁职业技术学院 Distributed computer cloud computing processing method
CN110334109A (en) * 2019-05-08 2019-10-15 重庆猪八戒知识产权服务有限公司 Relational database data query method, system, medium and electronic equipment
CN112765228A (en) * 2020-12-22 2021-05-07 珠海格力电器股份有限公司 IEMS data problem troubleshooting system and method

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101496012A (en) * 2006-07-26 2009-07-29 微软公司 Data processing over very large databases
WO2009153687A1 (en) * 2008-06-18 2009-12-23 Petascan Ltd Distributed hardware-based data querying
CN101930472A (en) * 2010-09-09 2010-12-29 南京中兴特种软件有限责任公司 Parallel query method for distributed database

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101496012A (en) * 2006-07-26 2009-07-29 微软公司 Data processing over very large databases
WO2009153687A1 (en) * 2008-06-18 2009-12-23 Petascan Ltd Distributed hardware-based data querying
CN101930472A (en) * 2010-09-09 2010-12-29 南京中兴特种软件有限责任公司 Parallel query method for distributed database

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108847981A (en) * 2018-06-26 2018-11-20 咸宁职业技术学院 Distributed computer cloud computing processing method
CN110334109A (en) * 2019-05-08 2019-10-15 重庆猪八戒知识产权服务有限公司 Relational database data query method, system, medium and electronic equipment
CN110334109B (en) * 2019-05-08 2020-07-24 重庆猪八戒知识产权服务有限公司 Relational database data query method, system, medium and electronic device
CN112765228A (en) * 2020-12-22 2021-05-07 珠海格力电器股份有限公司 IEMS data problem troubleshooting system and method

Similar Documents

Publication Publication Date Title
CN107590277B (en) Data synchronization method and device, electronic equipment and storage medium
CN111125260A (en) Data synchronization method and system based on SQL Server
CN108073625B (en) System and method for metadata information management
CN107038162A (en) Real time data querying method and system based on database journal
CN107247811B (en) SQL statement performance optimization method and device based on Oracle database
CN103034735A (en) Big data distributed file export method
CN105405070A (en) Distributed memory power grid system construction method
CN112506743A (en) Log monitoring method and device and server
CN112948492A (en) Data processing system, method and device, electronic equipment and storage medium
CN108268468B (en) Big data analysis method and system
CN111209467A (en) Data real-time query system under multi-concurrency multi-channel environment
CN103488774A (en) Processing method for big data log analysis
CN113901078A (en) Business order association query method, device, equipment and storage medium
CN114356692A (en) Visual processing method and device for application monitoring link and storage medium
CN113779094B (en) Batch-flow-integration-based data processing method and device, computer equipment and medium
CN105338107A (en) Stronghold operation synchronous management system and stronghold operation synchronous management method
CN111125109A (en) Real-time statistical report system based on time grouping accumulation algorithm
CN113342826A (en) Method, storage medium and system for uniformly managing data operations of different data acquisition engines
CN111343269B (en) Data downloading method, device, computer equipment and storage medium
CN104346378B (en) A kind of method, apparatus and system for realizing complex data processing
CN102594889B (en) Data-call-based data synchronization and analysis system
CN110968592A (en) Metadata acquisition method and device, computer equipment and computer-readable storage medium
CN116049188A (en) Maintenance method and device for equipment information table
CN110427399A (en) Real-time data acquisition method, system, device and storage medium
CN112800064B (en) Real-time big data application development method and system based on Confluent community open source version

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20140101