CN110245158A - Multi-source heterogeneous data real-time processing system and method based on Flink stream computing technology - Google Patents

Multi-source heterogeneous data real-time processing system and method based on Flink stream computing technology

Info

Publication number
CN110245158A
Authority
CN
China
Prior art keywords
data
flink
kafka
source
mode
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910495241.0A
Other languages
Chinese (zh)
Inventor
肖荣
马思峻
陆晋军
郑荣
丁富强
姚磊
孙海
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Ideal Information Industry Group Co Ltd
Original Assignee
Shanghai Ideal Information Industry Group Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Ideal Information Industry Group Co Ltd
Priority to CN201910495241.0A
Publication of CN110245158A
Legal status: Pending

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10 File systems; File servers
    • G06F16/18 File system types
    • G06F16/1805 Append-only file systems, e.g. using logs or journals to store data
    • G06F16/1815 Journaling file systems
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24 Querying
    • G06F16/245 Query processing
    • G06F16/2455 Query execution
    • G06F16/24568 Data stream processing; Continuous queries


Abstract

The invention discloses a multi-source heterogeneous data real-time processing system and method based on Flink stream computing technology. The system comprises: a data acquisition side, which obtains heterogeneous data scattered across multiple system components by a log mode and/or an SDK mode and/or an MQ mode and, after preliminary processing, sends it to Kafka as a continuous stream; a task management platform side, which configures the data source type, the cleaning and segmentation rules for the heterogeneous data, and the dimensions and metrics of the data sets, starts real-time data processing tasks based on Flink stream computing technology once all configuration is complete, and stores the results of the real-time computation according to the data set definitions; and a data presentation and output side, which obtains and outputs the results of the data sets. The invention can analyze data of multiple sources and different structures output by existing business systems and find correlations between log events and business, thereby helping operations and maintenance personnel work more efficiently and supplementing existing business analysis systems.

Description

Multi-source heterogeneous data real-time processing system and method based on Flink stream computing technology
Technical field
The present invention relates to a multi-source heterogeneous data real-time processing system and method, and more particularly to a multi-source heterogeneous data real-time processing system and method based on Flink stream computing technology.
Background art
In the Internet+ era, to meet demands such as rapid development and elastic scaling of business, enterprise IT system architectures are evolving toward Docker container clusters and microservices. This architecture improves resource utilization, brings greater flexibility, and supports high-concurrency scenarios.
However, as business scale expands and the call relationships between services grow more complex, log output keeps increasing, and failures and performance problems become harder to analyze. How to analyze the mass of data output by the system, find the valuable information in it, help operations and maintenance personnel work more efficiently, and supplement existing business analysis systems is therefore a problem that urgently needs solving. Multi-source heterogeneous data real-time processing technology has emerged in response.
At present, multi-source heterogeneous data real-time processing technology addresses this problem mainly by creating custom real-time data processing tasks that clean and segment data output by multiple sources such as file logs, Agent output, and message queues; flexibly configure the dimensions and metrics of data sets to generate time series data; display the data in the data sets graphically; and notify contacts according to configured alarm rules. This relieves pressure on operations and maintenance personnel and surfaces valuable business-related information.
However, current multi-source heterogeneous data real-time processing solutions still have some shortcomings:
1. No visual graphical interface is provided for building data cleaning and segmentation rules; the rules are instead defined through configuration files.
2. Log data is not converted into structured time series data, so metric values cannot be grouped and computed by time and dimension.
3. No configurable, interactive chart interface is provided for displaying data.
Flink is a distributed data processing platform for efficient in-memory computation and is one of Apache's top-level projects. Its core is a streaming dataflow engine that provides data distribution, communication, and fault tolerance for distributed computations over data streams. It is efficient, reliable, and scalable, and it is well compatible with the Hadoop ecosystem. Flink uses DataSet to describe data sets for parallel computation and provides a rich set of data processing interfaces on them, such as map, reduce, join, and group. However, no multi-source heterogeneous data real-time processing technology based on Flink stream computing technology has yet appeared.
Summary of the invention
To overcome the above shortcomings of the prior art, the purpose of the present invention is to provide a multi-source heterogeneous data real-time processing system and method based on Flink stream computing technology that analyzes the mass of data of multiple sources and different structures output by existing business systems, matches all the system processing nodes a business transaction passes through by means of key information, and finds correlations between log events and business, thereby helping operations and maintenance personnel work more efficiently and supplementing existing business analysis systems.
To achieve the above purpose, the present invention proposes a multi-source heterogeneous data real-time processing system based on Flink stream computing technology, comprising:
A data acquisition side, for simultaneously acquiring heterogeneous data scattered across multiple system components by a log mode and/or an SDK mode and/or an MQ mode and, after preliminary processing, sending it to Kafka as a continuous stream;
A task management platform side, for configuring the data source type, the cleaning and segmentation rules for the heterogeneous data, and the dimensions and metrics of the data sets, starting real-time data processing tasks based on Flink stream computing technology once all configuration is complete, and storing the results of the real-time computation in a storage unit according to the data set definitions;
A data presentation and output side, for obtaining the results in the data sets and either presenting them as charts or outputting them through an interface.
Preferably, in the log mode, a log data collector reads newly appended content of specified log files in real time and sends it to a log collection module, and the collected data enters Kafka after being filtered by the log collection module. In the SDK mode, an Agent embedded in an application or container uploads data as the data source: the Agent uploads data to a background service and the data enters Kafka after being processed by the background service, or the Agent sends the data directly to Kafka. In the MQ mode, a Kafka message queue serves as the data source and the data is sent directly to Kafka.
Preferably, the task management platform side includes:
A configuration unit, for configuring the data source type, the cleaning and segmentation rules for the heterogeneous data, and the dimensions and metrics of the data sets;
A data processing unit, for starting real-time data processing tasks based on Flink stream computing technology once all configuration is complete and storing the results of the real-time computation in a storage unit according to the data set definitions. Each real-time data processing task corresponds to one Flink data segmentation task; a task may contain multiple data sets, and each data set corresponds to one Flink data set computation task.
Preferably, when the data source type is configured: if log data is selected as the data source, the log path must be entered; if data reported by an embedded SDK Agent is selected as the data source, the AccessKeys of the SDK Agent must be entered; and if MQ is selected as the data source, the Kafka Topic must be entered.
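The per-type input requirement above can be illustrated with a minimal Python sketch. It is not the platform's actual implementation: the type names (`log`, `sdk`, `mq`), the field names (`log_path`, `access_keys`, `kafka_topic`), and the `validate` helper are hypothetical and chosen only for illustration. The source states only which input each data source type requires.

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical mapping from data source type to the one field it requires.
REQUIRED_FIELD = {
    "log": "log_path",      # log data: the log path must be entered
    "sdk": "access_keys",   # embedded SDK Agent: its AccessKeys must be entered
    "mq": "kafka_topic",    # MQ: the Kafka Topic must be entered
}


@dataclass
class DataSourceConfig:
    source_type: str
    log_path: Optional[str] = None
    access_keys: Optional[str] = None
    kafka_topic: Optional[str] = None


def validate(cfg: DataSourceConfig) -> None:
    """Raise ValueError if the field required by cfg.source_type is missing."""
    if cfg.source_type not in REQUIRED_FIELD:
        raise ValueError(f"unknown data source type: {cfg.source_type!r}")
    field = REQUIRED_FIELD[cfg.source_type]
    if not getattr(cfg, field):
        raise ValueError(f"{cfg.source_type} data source requires {field}")
```

For example, `validate(DataSourceConfig("log", log_path="/var/log/app.log"))` passes, while `validate(DataSourceConfig("mq"))` raises a `ValueError` because no Topic was supplied.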
Preferably, when the cleaning and segmentation rules for the heterogeneous data are configured, the data segmentation rules are built graphically by dragging building blocks, and after captured real-time data is obtained, a segmentation preview performs a trial segmentation of that data according to the defined rules.
Preferably, when the dimensions and metrics of a data set are configured, the data set is defined on the segmented data, and parameters such as filter conditions, aggregation dimensions, statistical metrics, and the time field must be entered.
Preferably, the data processing unit further comprises:
A Flink cleaning and segmentation unit, for consuming the data in Kafka, segmenting and logically processing the data according to the segmentation rules, and writing the segmented data back to Kafka;
A Flink computing unit, for consuming data from Kafka, aggregating it in real time by time and dimension, and storing the computed results in the storage unit;
A storage unit, including an Elasticsearch search server and an InfluxDB time series database, wherein the Elasticsearch search server stores the raw data and the InfluxDB time series database stores the time series data produced by the aggregation of the Flink computing unit.
Preferably, the configuration unit is also used to configure custom alarm rules, and the data processing unit further includes a Flink alarm processing unit, which consumes data from the Kafka Topic, judges in real time according to the alarm rules whether an alarm is needed, generates alarm records, and notifies contacts.
Preferably, the task management platform side further includes a query unit, for querying the data whose computation has completed in each data set according to the input conditions obtained.
To achieve the above purpose, the present invention also provides a multi-source heterogeneous data real-time processing method based on Flink stream computing technology, including the following steps:
Step S1, on the data acquisition side, heterogeneous data scattered across multiple system components is simultaneously acquired by a log mode, an SDK mode, or an MQ mode and, after preliminary processing, sent to Kafka as a continuous stream;
Step S2, on the task management platform side, the data source type, the cleaning and segmentation rules for the heterogeneous data, and the dimensions and metrics of the data sets are configured; once all configuration is complete, real-time data processing tasks are started based on Flink stream computing technology, and the results of the real-time computation are stored in a time series database according to the data set definitions;
Step S3, the results in the data sets on the task management platform side are obtained and either presented as charts or output through an interface.
Compared with the prior art, the multi-source heterogeneous data real-time processing system and method based on Flink stream computing technology of the present invention analyze the mass of data of multiple sources and different structures output by existing business systems, match all the system processing nodes a business transaction passes through by means of key information (such as TraceID or order number), and find correlations between log events and business, thereby helping operations and maintenance personnel work more efficiently and supplementing existing business analysis systems.
Brief description of the drawings
Fig. 1 is a structural diagram of a multi-source heterogeneous data real-time processing system based on Flink stream computing technology according to the present invention;
Fig. 2 is a flowchart of the steps of a multi-source heterogeneous data real-time processing method based on Flink stream computing technology according to the present invention;
Fig. 3 is a functional block diagram of the multi-source heterogeneous data real-time processing system based on Flink stream computing technology in a specific embodiment of the present invention;
Fig. 4 is a logical architecture diagram of the multi-source heterogeneous data real-time processing system based on Flink stream computing technology in a specific embodiment of the present invention.
Detailed description of the embodiments
The embodiments of the present invention are described below through specific examples and with reference to the drawings; those skilled in the art can readily understand other advantages and effects of the invention from the content disclosed in this specification. The invention can also be implemented or applied through other specific examples, and the details in this specification can be modified and changed in various ways based on different viewpoints and applications without departing from the spirit of the invention.
Before the present invention is introduced, several open-source components it involves are introduced first:
1. Elasticsearch
Serves as the final storage component for logs. It provides distributed storage, real-time search, and analysis of massive data, and can persist logs to disk. It also provides an open RESTful API interface through which logs can be imported directly.
2. Logstash
Can collect data in various formats and from various sources; parsing scripts can be written for the required storage format to give the data a unified output format.
3. Filebeat
A lightweight log collection component that can be installed on servers to continuously collect and ship logs.
4. Kafka
A high-throughput distributed publish-subscribe messaging system that stores log streams durably and fault-tolerantly and can resolve mismatches between the speed of log collection and the speed of log processing.
Fig. 1 is a structural diagram of a multi-source heterogeneous data real-time processing system based on Flink stream computing technology according to the present invention. As shown in Fig. 1, the multi-source heterogeneous data real-time processing system based on Flink stream computing technology of the present invention comprises:
Data acquisition side 10, for simultaneously acquiring heterogeneous data scattered across multiple system components by a log mode, an SDK (Software Development Kit) mode, or an MQ (Message Queue, also called message middleware) mode and, after preliminary processing, sending it to Kafka as a continuous stream. In a specific embodiment of the invention, the heterogeneous data collected by data acquisition side 10 includes, for example, in addition to system information such as timestamps, call-chain tracking TraceId, error stacks, and application services, business-related information such as order numbers, mobile phone numbers, and product identifiers; the invention is not limited thereto.
In the log mode, a log data collector reads newly appended content of specified log files in real time and sends it to a log collection module, and the collected data enters Kafka after being filtered by the log collection module. In a specific embodiment of the invention, the lightweight log collector Filebeat is installed on the servers of data acquisition side 10. It reads and forwards log lines, can resume from the position where it was interrupted, and uploads the data it reads to Logstash. Data filtering rules are configured in Logstash, and the data uploaded by multiple Filebeat instances enters Kafka after being filtered by Logstash. In other words, data acquisition side 10 supports log files as a data source: Filebeat reads newly appended content of specified log files in real time and sends it to Logstash, and the data enters Kafka after Logstash filtering;
In the SDK mode, an Agent embedded in an application or container uploads data as the data source: the Agent uploads data to a background service, and the data enters Kafka after being processed by the background service. The Agent can also send data directly to Kafka;
In the MQ mode, a Kafka message queue serves as the data source and the data is sent directly to Kafka.
Task management platform side 20, for configuring the data source type, the cleaning and segmentation rules for the heterogeneous data, and the dimensions and metrics of the data sets, starting real-time data processing tasks based on Flink stream computing once all configuration is complete, and storing the results of the real-time computation in a storage unit according to the data set definitions.
Specifically, task management platform side 20 includes:
Configuration unit 201, for configuring the data source type: if log data is selected as the data source, the log path must be entered; if data reported by an embedded SDK Agent is selected as the data source, the AccessKeys of the SDK Agent must be entered; and if MQ is selected as the data source, the Kafka Topic must be entered. In a specific embodiment of the invention, configuration unit 201 supports configuring multiple data sources and trial-captures uploaded data to check whether the configured data source is correct. Configuration unit 201 is also used to configure the cleaning and segmentation rules for the heterogeneous data. In a specific embodiment of the invention, segmentation rules can be defined in the web management console interface using graphical building blocks, which support splitting on single or multiple delimiters and simple logical operations, and test data can be entered to preview the cleaning and segmentation results. Specifically, configuration unit 201 can build the data segmentation rules through the web management console interface by dragging building blocks, and after captured real-time data is obtained, the segmentation preview performs a trial segmentation according to the defined rules, to help the user judge whether the segmentation rule configuration is correct. Configuration unit 201 is also used to configure the dimensions and metrics of the data sets: a data set is defined on the segmented data, and parameters such as filter conditions, aggregation dimensions, statistical metrics, and the time field must be entered. One real-time data computation task can define multiple data sets, and the data sets are computed in real time from the segmented data.
Data processing unit 202, for starting real-time data processing tasks based on Flink stream computing technology once all configuration is complete and storing the results of the real-time computation in a time series database according to the data set definitions. In the present invention, each real-time data processing task corresponds to one Flink data segmentation task; a task may contain multiple data sets, and each data set corresponds to one Flink data set computation task. When a task is started, its segmentation task and the computation tasks of its data sets are all started. Newly added data from the data sources all eventually enters Kafka. The Flink data segmentation task consumes the data in Kafka, segments and logically processes it according to the segmentation rules, and writes the segmentation results back to Kafka. The Flink data set computation task consumes the data from Kafka, aggregates it in real time by time and dimension, and stores the results in the time series database.
Specifically, data processing unit 202 further comprises:
Flink cleaning and segmentation unit 2021, for consuming the data in Kafka, segmenting and logically processing the data according to the segmentation rules, and writing the segmented data back to Kafka;
Flink computing unit 2022, for consuming data from Kafka, aggregating it in real time by time and dimension, and storing the computed results in storage unit 2023. Specifically, after consuming data from the Kafka Topic, Flink computing unit 2022 groups and aggregates the parameters by time and dimension according to the data set definition, computes the metric values, and stores the results in the time series database. It should be noted that each data set has its own Flink Job; these jobs can consume the segmented data in the same Kafka Topic and generate different data sets.
Storage unit 2023, for storing the related data. In a specific embodiment of the invention, storage unit 2023 includes an Elasticsearch search server and an InfluxDB time series database, wherein the Elasticsearch search server stores the raw data and the InfluxDB time series database stores the time series data produced by the aggregation of Flink computing unit 2022.
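The grouping by time and dimension performed by the Flink computing unit can be illustrated with a small batch sketch in plain Python. This is a stand-in for the idea only, not Flink code and not the patented implementation: the record fields (`ts` in seconds, `service`, `latency`), the fixed 60-second window, and the choice of output statistics are assumptions made for illustration. Each output point has the shape of a time series row that could be written to a time series database.

```python
from collections import defaultdict
from typing import Any, Dict, Iterable, List


def aggregate(records: Iterable[Dict[str, Any]], dimension: str, metric: str,
              window_seconds: int = 60) -> List[Dict[str, Any]]:
    """Group records by (window start, dimension value) and summarize the metric.

    Assumes every record has a numeric "ts" field (seconds) plus the named
    dimension and metric fields; these field names are hypothetical.
    The dimension name must not collide with the output keys below.
    """
    buckets = defaultdict(list)
    for r in records:
        # Align the timestamp to the start of its fixed-size window.
        window_start = int(r["ts"]) // window_seconds * window_seconds
        buckets[(window_start, r[dimension])].append(float(r[metric]))

    points = []
    for (window_start, dim_value), values in sorted(buckets.items()):
        points.append({
            "time": window_start,
            dimension: dim_value,
            "count": len(values),
            "sum": sum(values),
            "avg": sum(values) / len(values),
            "max": max(values),
            "min": min(values),
        })
    return points
```

With three records for one service at `ts` 0, 30, and 61, the function returns two points: one for the window starting at 0 (two values) and one for the window starting at 60 (one value). A streaming engine would emit such points continuously as windows close, rather than over a finished list.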
In a specific embodiment of the invention, the heterogeneous data collected by data acquisition side 10 contains, in addition to system information such as timestamps, call-chain tracking TraceId, error stacks, and application services, business-related information such as order numbers, mobile phone numbers, and product identifiers. On task management platform side 20, according to the configured cleaning and segmentation rules for the heterogeneous data and the dimensions and metrics of the aggregated data sets, data such as the order number, timestamp, and call-chain tracking TraceId are stored in the data sets. After the Flink real-time computation task is started, data that associates business and system in both directions enters the data sets, so that all the system processing nodes a business transaction passes through can be matched by key information (such as the system tracking TraceID or the order number).
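Matching processing nodes by key information can be sketched as a simple grouping step. The Python below is illustrative only: the record fields `ts`, `service`, and `trace_id` and the helper name `nodes_by_key` are hypothetical, and the real system performs this association over data sets rather than in-memory lists.

```python
from collections import defaultdict
from typing import Any, Dict, Iterable, List


def nodes_by_key(records: Iterable[Dict[str, Any]],
                 key: str = "trace_id") -> Dict[Any, List[str]]:
    """Map each key value (e.g. a TraceID or order number) to the services
    that handled it, in timestamp order. Records lacking the key are skipped."""
    grouped = defaultdict(list)
    for r in sorted(records, key=lambda rec: rec["ts"]):
        if r.get(key) is not None:
            grouped[r[key]].append(r["service"])
    return dict(grouped)
```

Passing `key="order_no"` instead would group the same records by order number, which is the business-side view of the same association.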
Preferably, configuration unit 201 can also configure custom alarm rules. Each new alarm requires entering the data set defined in the previous step, the alarm notification method and recipients, and the alarm rules. In a specific embodiment of the invention, each alarm rule requires the following parameters:
The most recent number of minutes
The metric in the data set
Average, total, maximum, or minimum
Greater than or equal to, less than or equal to, period-over-period rise/fall %, or rise/fall % compared with yesterday
The threshold
Multiple alarm rules can be defined in one alarm task.
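The alarm rule parameters listed above can be sketched as a small evaluator. This is a hedged illustration, not the Flink alarm processing unit: the class name `AlarmRule`, the comparison codes (`gte`, `lte`, `rise_pct`, `fall_pct`), the `(timestamp, values)` point layout, and the way the baseline for percentage comparisons is supplied are all assumptions.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple

# Hypothetical names for the four aggregate choices.
AGGREGATES = {
    "avg": lambda v: sum(v) / len(v),
    "sum": sum,
    "max": max,
    "min": min,
}


@dataclass
class AlarmRule:
    minutes: int       # the most recent number of minutes
    metric: str        # the metric in the data set
    aggregate: str     # "avg", "sum", "max", or "min"
    comparison: str    # "gte", "lte", "rise_pct", or "fall_pct"
    threshold: float


def evaluate(rule: AlarmRule, points: List[Tuple[int, Dict[str, float]]],
             now: int, baseline: Optional[float] = None) -> bool:
    """Return True if the rule fires for points within the last rule.minutes.

    points holds (timestamp_seconds, {metric: value}) pairs. baseline is the
    comparison value for percentage rules (previous period or yesterday),
    which the caller is assumed to have computed.
    """
    start = now - rule.minutes * 60
    values = [p[rule.metric] for ts, p in points
              if start < ts <= now and rule.metric in p]
    if not values:
        return False
    current = AGGREGATES[rule.aggregate](values)
    if rule.comparison == "gte":
        return current >= rule.threshold
    if rule.comparison == "lte":
        return current <= rule.threshold
    if rule.comparison in ("rise_pct", "fall_pct"):
        if not baseline:  # no baseline, or a zero baseline: cannot compute a percentage
            return False
        change = (current - baseline) / baseline * 100
        return change >= rule.threshold if rule.comparison == "rise_pct" else -change >= rule.threshold
    raise ValueError(f"unknown comparison: {rule.comparison!r}")
```

An alarm task holding several rules could call `evaluate` once per rule and generate an alarm record for each rule that returns `True`.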
Correspondingly, data processing unit 202 further includes a Flink alarm processing unit, which consumes data from the Kafka Topic, judges in real time according to the alarm rules whether an alarm is needed, generates alarm records, and notifies contacts, for example by SMS or email.
Preferably, after the task and alarm rule definitions are completed on task management platform side 20, tasks can be started and stopped; that is, task management platform side 20 can start and stop the real-time data processing Flink tasks. The data segmentation task of Flink cleaning and segmentation unit 2021, the data computation task of each data set (Flink computing unit 2022), and the alarm task (Flink alarm processing unit) each run as a separate Flink Job. The data computed for each data set is written to the InfluxDB time series database. If an alarm rule is met, the alarm task generates an alarm record and notifies contacts.
Preferably, task management platform side 20 further includes a query unit, for querying the data whose computation has completed in each data set. Specifically, task management platform side 20 can obtain an input time range and time interval through the web management console interface and query the computed data in each data set. In other words, task management platform side 20 also provides an external query function for the data in the data sets: an external HTTP query request with parameters is sent to the background service, and the query result is returned in the Response as a JSON string.
Preferably, task management platform side 20 can also define, in the web management console interface, how the data in a data set is presented as a chart: after the data set number and the metrics to display are entered and a chart type and its configuration items are selected, the data in the data set is displayed as a chart according to the configuration. In a specific embodiment of the invention, the web management console interface uses the Ant Design React.js front-end framework to read the task configuration data from PostgreSQL and uses BizCharts.js to display, as charts, the time series data of the data sets stored in InfluxDB.
Data presentation and output side 30, for obtaining the results in the data sets and either presenting them as charts or outputting them through an interface. In other words, the present invention provides an external interface that outputs the time series data stored in the data sets, and users can design their own charts to display the data after obtaining it.
Fig. 2 is a flowchart of the steps of a multi-source heterogeneous data real-time processing method based on Flink stream computing technology according to the present invention. As shown in Fig. 2, the multi-source heterogeneous data real-time processing method based on Flink stream computing technology of the present invention includes the following steps:
Step S1, on the data acquisition side, heterogeneous data scattered across multiple system components is simultaneously acquired by a log mode, an SDK mode, or an MQ mode and, after preliminary processing, sent to Kafka as a continuous stream.
In the log mode, a log data collector reads newly appended content of specified log files in real time and sends it to a log collection module, and the collected data enters Kafka after being filtered by the log collection module. In a specific embodiment of the invention, the lightweight log collector Filebeat is installed on the servers of data acquisition side 10. It reads and forwards log lines, can resume from the position where it was interrupted, and uploads the data it reads to Logstash. Data filtering rules are configured in Logstash, and the data uploaded by multiple Filebeat instances enters Kafka after being filtered by Logstash. In other words, data acquisition side 10 supports log files as a data source: Filebeat reads newly appended content of specified log files in real time and sends it to Logstash, and the data enters Kafka after Logstash filtering;
In the SDK mode, an Agent embedded in an application or container uploads data as the data source: the Agent uploads data to a background service, and the data enters Kafka after being processed by the background service. The Agent can also send data directly to Kafka;
In the MQ mode, a Kafka message queue serves as the data source and the data is sent directly to Kafka.
Step S2, on the task management platform side, the data source type, the cleaning and segmentation rules for the heterogeneous data, and the dimensions and metrics of the data sets are configured; once all configuration is complete, real-time data processing tasks are started based on Flink stream computing technology, and the results of the real-time computation are stored in a time series database according to the data set definitions. That is, after configuration is complete, the data in Kafka is read in real time; based on Flink stream computing, the required data is extracted by real-time segmentation according to the segmentation rules and written back to Kafka; then, based on Flink stream computing, the data is aggregated by time and dimension according to the data set definitions, and the computed results are stored in the time series database.
In a specific embodiment of the invention, for configuring the data source type: if log data is selected as the data source, the log path must be entered; if data reported by an embedded SDK Agent is selected as the data source, the AccessKeys of the SDK Agent must be entered; and if MQ is selected as the data source, the Kafka Topic must be entered. That is, the invention supports configuring multiple data sources and trial-captures uploaded data to check whether the configured data source is correct. For configuring the cleaning and segmentation rules for the heterogeneous data: in a specific embodiment of the invention, segmentation rules can be defined in the web management console interface using graphical building blocks, which support splitting on single or multiple delimiters and simple logical operations, and test data can be entered to preview the cleaning and segmentation results. Specifically, the data segmentation rules can be built through the web management console interface by dragging building blocks, and after captured real-time data is obtained, the segmentation preview performs a trial segmentation according to the defined rules, to help the user judge whether the segmentation rule configuration is correct. For configuring the dimensions and metrics of a data set: the segmented data is grouped and aggregated by time and dimension to compute metric values; that is, the data set is defined on the segmented data, and parameters such as filter conditions, aggregation dimensions, statistical metrics, and the time field must be entered. One real-time data computation task can define multiple data sets, and the data sets are computed in real time from the segmented data.
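The delimiter-based segmentation and trial preview described above can be sketched in a few lines of Python. The sketch assumes a rule is just a list of delimiters plus an ordered list of field names; the function names `split_line` and `preview`, and the sample line format, are hypothetical. The graphical building-block editor and the logical operations it supports are not modeled.

```python
import re
from typing import Dict, List, Optional


def split_line(line: str, delimiters: List[str],
               fields: List[str]) -> Optional[Dict[str, str]]:
    """Split a raw line on any of the delimiters and name the pieces.

    Returns None when the number of pieces does not match the field list,
    so a preview can flag lines that the rule fails to parse.
    """
    if not delimiters:
        raise ValueError("at least one delimiter is required")
    # Longest delimiters first, so a delimiter that is a prefix of another
    # does not shadow it in the alternation.
    ordered = sorted(delimiters, key=len, reverse=True)
    pattern = "|".join(re.escape(d) for d in ordered)
    parts = [p.strip() for p in re.split(pattern, line)]
    if len(parts) != len(fields):
        return None
    return dict(zip(fields, parts))


def preview(lines: List[str], delimiters: List[str],
            fields: List[str]) -> List[Optional[Dict[str, str]]]:
    """Trial-segment captured sample lines; None marks a line the rule rejects."""
    return [split_line(line, delimiters, fields) for line in lines]
```

For instance, splitting `"2019-06-10 12:00:01|ORDER-1001|trace-abc;ERROR"` on `["|", ";"]` with fields `["timestamp", "order_no", "trace_id", "level"]` yields a four-field record, while a line without those delimiters comes back as `None` in the preview.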
Once all configuration is complete, the real-time data processing tasks are started, and the results of the real-time computation are stored in the time series database according to the data set definitions. In the present invention, each real-time data processing task corresponds to one Flink data segmentation task; a task may contain multiple data sets, and each data set corresponds to one Flink data set computation task. When a task is started, its segmentation task and the computation tasks of its data sets are all started. Newly added data from the data sources all eventually enters Kafka. The Flink data segmentation task consumes the data in Kafka, segments and logically processes it according to the segmentation rules, and writes the results back to Kafka. The Flink data set computation task consumes the data from Kafka, aggregates it in real time by time and dimension, and stores the results in the time series database.
Preferably, in step S2, custom alarm rules are also configured. After data is consumed from the Kafka Topic, the alarm rules are evaluated in real time to decide whether an alarm is needed; if so, an alarm record is generated and the contact person is notified, for example by SMS or email.
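A custom alarm rule of this kind can be modeled as a simple threshold comparison. The sketch below is only an assumption about what such a rule might look like, since the source does not define the rule format; `AlarmRule`, `check`, and the field names are hypothetical, and actual notification by SMS or email is left out.

```python
import operator
from dataclasses import dataclass

# Comparison operators a rule may use (an assumed, minimal set).
_OPS = {">": operator.gt, ">=": operator.ge, "<": operator.lt,
        "<=": operator.le, "==": operator.eq}


@dataclass
class AlarmRule:
    field: str        # record field to test
    op: str           # one of the keys of _OPS
    threshold: float
    contact: str      # who to notify when the rule fires


def check(rule, record):
    """Return an alarm record (dict) if the rule fires for this record, else None."""
    value = record.get(rule.field)
    if value is None or not _OPS[rule.op](value, rule.threshold):
        return None
    return {
        "rule": f"{rule.field} {rule.op} {rule.threshold}",
        "value": value,
        "notify": rule.contact,
    }


rule = AlarmRule(field="error_count", op=">", threshold=5, contact="ops@example.com")
fired = check(rule, {"error_count": 9})
quiet = check(rule, {"error_count": 3})
```

In a streaming job the same check would run on every consumed record, and each non-`None` result would be persisted as an alarm record.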
Step S3: the results in the data sets on the task management platform side are obtained and either displayed as charts or exported through an interface. That is, the invention also provides an external interface that exports the time series data stored in the data sets, so that users can retrieve the data and display it with charts of their own design.
The processing flow of the multi-source heterogeneous real-time data processing system based on Flink stream computing technology is described below through a specific embodiment. Fig. 3 is a functional block diagram of the system in this embodiment, and Fig. 4 is its logical architecture diagram. As shown in Figs. 3 and 4, the processing flow of the system is as follows:
Data acquisition side: log data is supported as a data source. FileBeat reads the newly appended content of the specified log files in real time and sends it to Logstash, and the data enters Kafka after being filtered by Logstash. Data uploaded by an SDK embedded in an application or container is supported as a data source: the SDK uploads data to a background service, and the data enters Kafka after processing. A Kafka message queue is supported as a data source: the data is sent directly to Kafka.
Management console side: configuring the data source type is supported. If log data is selected as the data source, the log path must be entered; if data reported by an embedded SDK Agent is selected as the data source, the AccessKeys of the SDK Agent must be entered; if MQ is selected as the data source, the Kafka Topic must be entered. The management console side also supports configuring the cleaning and splitting rules for heterogeneous data: the splitting rules are defined in the interface with graphical building blocks, and test data can be entered to preview the cleaning and splitting result. It also supports configuring the dimensions and metrics of a data set: the split data is grouped and aggregated by time and dimension to compute the metric values. In this embodiment of the invention, the management console side includes:
Data receiver layer acquires the data that side uploads for receiving data;
Data analysis layer: after all configuration is complete, the real-time data processing task is started. Each real-time data processing task corresponds to one Flink data splitting task; a task can contain multiple data sets, and each data set corresponds to one Flink data set computation task. When a task is started, its splitting task and all of its data set computation tasks are started. New data from every data source ultimately enters Kafka. The Flink splitting task consumes the data in Kafka, splits and logically processes it according to the splitting rules, and writes the result back to Kafka. The Flink data set computation task consumes that data from Kafka, aggregates it in real time by time and dimension, and stores the result in the storage layer. The storage layer includes an ElasticSearch search server and an InfluxDb time series database: the ElasticSearch search server stores the raw data, and the InfluxDb time series database stores the time series data produced by the aggregation.
The management console side also supports querying the data in a data set by entering a time range and a time granularity, and it supports configuring the chart type and its options for displaying the data in a data set. If the chart display interface does not meet the user's needs, a service for querying data set data is provided externally through an HTTP interface: the time range, time granularity, metrics, and dimension list are supplied as query conditions, and multiple time series from the data set are returned.
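To make the query conditions concrete, the sketch below shows how a client might assemble a request URL for such an HTTP interface. The source does not specify the endpoint or parameter names, so the URL path, the parameter keys, and the `build_query_url` helper are all hypothetical; only the four kinds of query condition (time range, time granularity, metrics, dimension list) come from the description.

```python
from urllib.parse import quote, urlencode


def build_query_url(base_url, dataset, start, end, granularity, metrics, dimensions):
    """Build a query URL for a hypothetical data set query endpoint.

    The path layout and parameter names are assumptions for illustration only.
    """
    params = {
        "start": start,                      # time range start (epoch seconds)
        "end": end,                          # time range end (epoch seconds)
        "granularity": granularity,          # time granularity, e.g. "1m"
        "metrics": ",".join(metrics),        # metrics to return
        "dimensions": ",".join(dimensions),  # dimensions to group by
    }
    return f"{base_url}/datasets/{quote(dataset)}/query?{urlencode(params)}"


url = build_query_url(
    "http://console.example", "orders",
    start=0, end=3600, granularity="1m",
    metrics=["count"], dimensions=["service", "level"],
)
print(url)
```

A response would then carry one time series per combination of dimension values, which the caller can chart however it likes.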
In this embodiment, the management console side uses the Ant Design React or Bizchart front-end framework to render the time series data in the database as charts.
Console, for providing task data source definition, task splitting rule definition, task data set definition, task management (start/stop), the external interface service, data set query, interactive charts, and the alarm service.
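The data source definition handled by the console pairs each source type with one required input (log path, AccessKeys, or Kafka Topic). A minimal validation sketch, assuming a dictionary-based configuration whose key names (`type`, `log_path`, `access_keys`, `topic`) are hypothetical, might look like this:

```python
# Required input per data source type, following the description:
# log -> log path, SDK Agent -> AccessKeys, MQ -> Kafka Topic.
# The key names themselves are assumptions made for this sketch.
REQUIRED_FIELD = {"log": "log_path", "sdk": "access_keys", "mq": "topic"}


def validate_source(config):
    """Return a list of problems; an empty list means the definition is acceptable."""
    kind = config.get("type")
    if kind not in REQUIRED_FIELD:
        return [f"unknown source type: {kind!r}"]
    needed = REQUIRED_FIELD[kind]
    if not config.get(needed):
        return [f"{kind} source requires '{needed}'"]
    return []


ok = validate_source({"type": "mq", "topic": "app-logs"})
missing = validate_source({"type": "log"})
```

Checking the definition this way is only a first step; the description additionally fetches real uploaded data to confirm that the source works.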
In conclusion the present invention is a kind of based on the multi-source heterogeneous generating date system of Flink stream calculation technology and side Method is analyzed by the mass data to a variety of source different structures exported in existing business system, since data acquire side In addition to system informations such as having time stamp, calling D-chain trace TraceId, error stack, application services in collected isomeric data Outside, there are also and the information such as the relevant order number of business, cell-phone number, commodity sign, the configuration according to task management platform side Cleaning segmentation rules, the dimension and index of aggregated data collection of isomeric data order number, timestamp, call D-chain trace TraceId data are stored in a data and concentrate, after starting the real-time calculating task of Flink, the number of business and system bi-directional association According to data set will be entered, it can thus pass through key message (such as system tracks TraceID, order number) matching business warp All systems processing node crossed, finds the correlation between log event and business, helps operation maintenance personnel to improve efficiency, be existing Some business diagnosis systems provide supplement.
The above embodiments merely illustrate the principles and effects of the present invention and are not intended to limit it. Any person skilled in the art may modify and change the above embodiments without departing from the spirit and scope of the present invention. Therefore, the scope of protection of the present invention shall be as set out in the claims.

Claims (10)

1. A multi-source heterogeneous real-time data processing system based on Flink stream computing technology, comprising:
a data acquisition side, for simultaneously obtaining, by log mode and/or SDK mode and/or MQ mode, heterogeneous data scattered across multiple system components, and sending it to Kafka as a continuous stream after preliminary processing;
a task management platform side, for configuring the data source type, configuring the cleaning and splitting rules for heterogeneous data, and configuring the dimensions and metrics of data sets, and, after all configuration is complete, starting a real-time data processing task based on Flink stream computing technology and storing the results computed in real time into a storage unit according to the data set definitions;
a data display and output side, for obtaining the results in the data sets and displaying them as charts or exporting them through an interface.
2. The multi-source heterogeneous real-time data processing system based on Flink stream computing technology as claimed in claim 1, characterized in that: in the log mode, a log data collector reads the newly appended content of specified log files in real time and sends it to a log collection module, and the collected data is sent to Kafka after being filtered by the log collection module; in the SDK mode, an Agent embedded in an application or container uploads data as the data source, the Agent either uploading the data to a background service, with the data entering Kafka after processing by the background service, or sending the data directly to Kafka; in the MQ mode, a Kafka message queue serves as the data source, and the data is sent directly to Kafka.
3. The multi-source heterogeneous real-time data processing system based on Flink stream computing technology as claimed in claim 2, characterized in that the task management platform side includes:
a configuration unit, for configuring the data source type, configuring the cleaning and splitting rules for heterogeneous data, and configuring the dimensions and metrics of data sets;
a data processing unit, for starting a real-time data processing task based on Flink stream computing technology after all configuration is complete, and storing the results computed in real time into the storage unit according to the data set definitions, wherein each real-time data processing task corresponds to one Flink data splitting task, a task can contain multiple data sets, and each data set corresponds to one Flink data set computation task.
4. The multi-source heterogeneous real-time data processing system based on Flink stream computing technology as claimed in claim 3, characterized in that: when the data source type is configured, if log data is selected as the data source, the log path must be entered; if data reported by an embedded SDK Agent is selected as the data source, the AccessKeys of the SDK Agent must be entered; and if MQ is selected as the data source, the Kafka Topic must be entered.
5. The multi-source heterogeneous real-time data processing system based on Flink stream computing technology as claimed in claim 3, characterized in that: when the cleaning and splitting rules for heterogeneous data are configured, the data splitting rules are built by dragging graphical building blocks, and once captured real-time data is available, the splitting preview performs a trial split according to the defined data splitting rules.
6. The multi-source heterogeneous real-time data processing system based on Flink stream computing technology as claimed in claim 3, characterized in that: when the dimensions and metrics of a data set are configured, the data set is defined on the split data, and parameters such as filter conditions, aggregation dimensions, statistical metrics, and the time field must be entered.
7. The multi-source heterogeneous real-time data processing system based on Flink stream computing technology as claimed in claim 3, characterized in that the data processing unit further comprises:
a Flink cleaning and splitting unit, for consuming the data in Kafka, splitting and logically processing the data according to the splitting rules, and writing the split data back to Kafka;
a Flink computing unit, for consuming data from Kafka, aggregating it in real time by time and dimension, and storing the computed result in the storage unit;
a storage unit, including an ElasticSearch search server and an InfluxDb time series database, wherein the ElasticSearch search server stores the raw data and the InfluxDb time series database stores the time series data aggregated by the Flink computing unit.
8. The multi-source heterogeneous real-time data processing system based on Flink stream computing technology as claimed in claim 7, characterized in that the configuration unit is also used to configure custom alarm rules, and the data processing unit further includes a Flink alarm processing unit, for judging in real time, according to the alarm rules, whether an alarm is needed after data is consumed from the Kafka Topic, generating an alarm record, and notifying the contact person.
9. The multi-source heterogeneous real-time data processing system based on Flink stream computing technology as claimed in claim 7, characterized in that: the task management platform side further includes a query unit, for querying, according to the input conditions obtained, the data whose computation has been completed in each data set.
10. A multi-source heterogeneous real-time data processing method based on Flink stream computing technology, including the following steps:
Step S1: on the data acquisition side, simultaneously obtaining, by log mode or SDK mode or MQ mode, heterogeneous data scattered across multiple system components, and sending it to Kafka as a continuous stream after preliminary processing;
Step S2: on the task management platform side, configuring the data source type, configuring the cleaning and splitting rules for heterogeneous data, and configuring the dimensions and metrics of data sets, and, after all configuration is complete, starting a real-time data processing task based on Flink stream computing technology and storing the results computed in real time into a time series database according to the data set definitions;
Step S3: obtaining the results in the data sets on the task management platform side and displaying them as charts or exporting them through an interface.
CN201910495241.0A 2019-06-10 2019-06-10 A kind of multi-source heterogeneous generating date system and method based on Flink stream calculation technology Pending CN110245158A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910495241.0A CN110245158A (en) 2019-06-10 2019-06-10 A kind of multi-source heterogeneous generating date system and method based on Flink stream calculation technology

Publications (1)

Publication Number Publication Date
CN110245158A true CN110245158A (en) 2019-09-17

Family

ID=67886263

Country Status (1)

Country Link
CN (1) CN110245158A (en)

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20190917