CN109412970B - Data transfer system, data transfer method, electronic device, and storage medium - Google Patents

Data transfer system, data transfer method, electronic device, and storage medium Download PDF

Info

Publication number
CN109412970B
CN109412970B CN201811158015.5A CN201811158015A CN109412970B CN 109412970 B CN109412970 B CN 109412970B CN 201811158015 A CN201811158015 A CN 201811158015A CN 109412970 B CN109412970 B CN 109412970B
Authority
CN
China
Prior art keywords
data
module
layer
processing module
interface
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201811158015.5A
Other languages
Chinese (zh)
Other versions
CN109412970A (en
Inventor
迟轩
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Jiangsu Manyun Software Technology Co Ltd
Original Assignee
Jiangsu Manyun Software Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Jiangsu Manyun Software Technology Co Ltd filed Critical Jiangsu Manyun Software Technology Co Ltd
Priority to CN201811158015.5A priority Critical patent/CN109412970B/en
Publication of CN109412970A publication Critical patent/CN109412970A/en
Application granted granted Critical
Publication of CN109412970B publication Critical patent/CN109412970B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L47/00Traffic control in data switching networks
    • H04L47/10Flow control; Congestion control
    • H04L47/24Traffic characterised by specific attributes, e.g. priority or QoS
    • H04L47/2408Traffic characterised by specific attributes, e.g. priority or QoS for supporting different services, e.g. a differentiated services [DiffServ] type of service
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00Arrangements for monitoring or testing data switching networks
    • H04L43/02Capturing of monitoring data
    • H04L43/028Capturing of monitoring data by filtering
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/50Network services
    • H04L67/60Scheduling or organising the servicing of application requests, e.g. requests for application data transmissions using the analysis and optimisation of the required network resources

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

The invention provides a data flow system, a data flow method, an electronic device and a storage medium, wherein the system comprises: the extensible data processing module is used for acquiring and filtering data and outputting the filtered data; the configurable scheduling module is used for scheduling data of each layer of the data processing module; the configurable flow control module is used for managing the data flow of each layer of the data processing module; the interface layer provides an expansion interface at least for the data processing module to expand; and a configuration module providing a configuration interface for at least the configuration of the scheduling module and the flow control module. The data processing module is expanded through the expansion interface, so that different data sources can be conveniently butted; the scheduling module and the flow control module are configured through a configuration interface, so that dynamic management of data flow is realized; the modules are communicated through interfaces, are independent from each other, and flexibly and efficiently realize the circulation processing of mass data.

Description

Data transfer system, data transfer method, electronic device, and storage medium
Technical Field
The present invention relates to the field of data processing technologies, and in particular, to a data streaming system, a data streaming method, an electronic device, and a storage medium.
Background
Large systems, such as distributed systems, always have large amounts of data to collect, filter, and send to downstream service processes. How to effectively collect data and efficiently transmit the data to downstream is a problem which must be considered when processing massive data.
Existing data streamer tools are more or less flawed. Some are excellent but have a single goal, such as to primarily address synchronization between database data. There are also a wide range of goals, but for design reasons, there is a sacrifice in performance.
For example, log data tends to be massive in large systems, typically. Common data flow tools for log data are flash and Logstash. The Flume provides log synchronization service, so that the performance is greatly sacrificed to avoid data exception; if the performance is expected to be improved, a large number of Flume nodes are required to be started, the operation and maintenance cost and the management cost are increased invisibly, and a new problem is brought to the log storage of the back end. Due to the language characteristics, Logstash faces various difficulties in extension and JVM fine adjustment.
It is to be noted that the information disclosed in the above background section is only for enhancement of understanding of the background of the present invention and therefore may include information that does not constitute prior art known to a person of ordinary skill in the art.
Disclosure of Invention
In view of this, the present invention provides a data transfer system, a data transfer method, an electronic device, and a storage medium, which solve the problems in the prior art that a data transfer tool has poor expandability, cannot effectively control the number of nodes, and is not flexible enough in scheduling and management of data transfer.
According to an aspect of the present invention, there is provided a data streaming system including: the extensible data processing module is used for acquiring and filtering data and outputting the filtered data; the configurable scheduling module is communicated with the scheduling module through an interface and schedules data of each layer of the data processing module; the configurable flow control module is communicated with the data processing module through an interface and manages the data flow of each layer of the data processing module; the interface layer provides an expansion interface at least for the data processing module to expand; and a configuration module providing a configuration interface for at least the configuration of the scheduling module and the flow control module.
Preferably, in the above data flow system, the data processing module includes: the data acquisition layer comprises one or more collectors for acquiring data; a filtering layer comprising one or more filters to filter the acquired data; the data writing layer comprises one or more output interfaces and is used for outputting the filtered data; and all layers of the data processing module are communicated through an abstract interface.
Preferably, in the above data flow system, the data processing module further includes: one or more buffer layers between the data acquisition layer and the filter layer, and/or between the filter layer and the data writing layer; and the data processing modules perform asynchronous data transfer among the layers through the buffer layers.
Preferably, in the data flow system, the data acquisition layer has an acquisition progress feedback interface, and is configured to submit a data acquisition progress to at least the scheduling module and the flow control module.
Preferably, in the data flow system, the collection progress feedback interface includes a synchronous feedback interface and an asynchronous feedback interface.
Preferably, in the above data streaming system, the scheduling module executes: scheduling one or more filters of the filtering layer, controlling data flow from the data acquisition layer through the scheduled filters, respectively; and controlling the filtered data to be output from the data write layer.
Preferably, in the above data streaming system, the scheduling module further executes: unique identification IDs are added to the data of the circulation with the same behavior.
Preferably, in the data flow system, the flow control module manages the data reading number and the data reading frequency of each layer of the data processing module.
Preferably, the data flow system further includes: and the configurable abnormality detection module is used for monitoring the data in circulation.
According to another aspect of the present invention, there is provided a data flow method, which is performed by the data flow system described above, and includes: the data processing module collects and filters data and outputs the filtered data; the scheduling module is communicated with the data processing module through an interface and schedules data of each layer of the data processing module; the flow control module is communicated with the data processing module through an interface to manage the data flow of each layer of the data processing module; the data processing module is used for processing one or more data sources through expansion of an expansion interface; the scheduling module configures a scheduling mode of data through a configuration interface; and the flow control module configures a management mode of data flow through a configuration interface.
According to an aspect of the present invention, there is provided an electronic apparatus including: a processor; and a memory for storing executable instructions; wherein the processor is configured to perform the steps of the data flow method described above via execution of the executable instructions.
According to an aspect of the present invention, a computer-readable storage medium is provided, on which a computer program is stored, which computer program, when being executed by a processor, carries out the steps of the above-mentioned data streaming method.
Compared with the prior art, the invention has the beneficial effects that:
the data processing module can be expanded through the expansion interface, so that different data sources can be conveniently butted; the scheduling module can be configured through the configuration interface, and takes effect dynamically, so that system resources are convenient to integrate and schedule; the flow control module can be configured through a configuration interface, dynamically takes effect, and performs flow control management on data under the condition of real-time data flow transfer; the whole system has good flexibility and is convenient to use;
and the data processing module, the scheduling module and the flow control module are communicated through interfaces, all the modules are mutually independent, the number of nodes can be effectively controlled, and the whole system can efficiently transfer a large amount of data.
It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of the invention, as claimed.
Drawings
The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments consistent with the invention and together with the description, serve to explain the principles of the invention. It is obvious that the drawings in the following description are only some embodiments of the invention, and that for a person skilled in the art, other drawings can be derived from them without inventive effort.
FIG. 1 is a block diagram of a data flow system according to an embodiment of the present invention;
FIG. 2 is a schematic diagram of an architecture of a data flow system according to an embodiment of the present invention;
FIG. 3 is a block diagram of an architecture of a data processing module according to an embodiment of the present invention;
FIG. 4 is a diagram illustrating steps of a data flow method according to an embodiment of the present invention;
FIG. 5 shows a schematic diagram of an electronic device in an embodiment of the invention;
FIG. 6 shows a schematic diagram of a computer-readable storage medium in an embodiment of the invention.
Detailed Description
Example embodiments will now be described more fully with reference to the accompanying drawings. Example embodiments may, however, be embodied in many different forms and should not be construed as limited to the embodiments set forth herein. Rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the concept of example embodiments to those skilled in the art. The same reference numerals in the drawings denote the same or similar structures, and thus their repetitive description will be omitted.
The data transfer system of the invention can efficiently transfer and process a large amount of data, and the whole system has good flexibility and is very convenient for users to use. As shown in connection with fig. 1 and 2, in some embodiments of the invention, the data flow system 1 includes, but is not limited to, the following modules:
and the extensible data processing module 11 is used for acquiring and filtering data by the data processing module 11 and outputting the filtered data.
In a preferred embodiment, the data processing module 11 further comprises:
the data acquisition layer 111 includes one or more collectors for acquiring data. As shown in fig. 2, the data collection layer 111 may include Kafka collectors (Kafka is a high throughput distributed publish-subscribe messaging system), Log File collectors (Log File collectors) and other extensible collectors, which may be extended as needed to interface with different data sources. Each collector in the data collection layer 111 collects original data, and abstractly outputs the original data by matching with a frame interface. Each collector in the data acquisition layer 111 communicates through an interface, and each collector is independent from each other and can control a specific collector to operate as required.
The data acquisition layer 11 further has an acquisition progress feedback interface for submitting a data acquisition progress to at least the scheduling module 12 and the flow control module 13. Preferably, the acquisition progress feedback interface of the data acquisition layer 11 includes a synchronous feedback interface for synchronously submitting the data acquisition progress and an asynchronous feedback interface for asynchronously submitting the data acquisition progress. The data acquisition progress of each acquisition device in the data acquisition layer 11 can be mastered in real time, and the situations of repeated data acquisition and data loss are avoided.
A filtering layer 112, including one or more filters, is used to filter the acquired data. Filtration layer 112 may include filters 1121, 1122, 1123, and other expandable filters. Each filter in filter layer 112 communicates via an interface, and each filter is independent of the other and can be programmed and plugged as desired.
And a data writing layer 113 including one or more output interfaces for outputting the filtered data. The data writing layer 113 may include a DB (Database) interface, an Elasticsearch interface and other extensible output interfaces, and may be extended and interfaced to different systems as needed to implement targeted output of data.
Further, the data processing module 11 may further include one or more buffer layers 114 between the data acquisition layer 111 and the filter layer 112, and/or between the filter layer 112 and the data writing layer 113, where the buffer layers 114 perform asynchronous data flow between the layers of the data processing module 11. Specifically, if the buffer layer 114 is not configured, the scheduling module 12 controls data flow between the upper layer and the lower layer of the data processing module 11 in a synchronous manner; the buffer layer 114 allows the data acquisition layer 111 to perform the next data acquisition as soon as possible, and also to package the data flowing in many times, and the next processor needs no lock contention as much as possible.
The buffer layer 114 includes a plurality of buffers, such as ring buffers, blocking queues and other expandable buffers shown in fig. 2, and different buffers can be expanded according to needs to implement the caching processing of data.
In a preferred embodiment, the layers of the data processing module 11 communicate with each other via an abstract interface, so that the layers of the data processing module 11 are independent of each other, can operate independently, and can be applied to other environments.
The data processing module 11 communicates with the scheduling module 12 through an interface, and the scheduling module 12 schedules data of each layer of the data processing module 11.
Specifically, in some embodiments, scheduling module 12 performs: the one or more collectors of the data collection layer 111 are first controlled to collect data. Then schedule one or more filters of filter layer 112, and control data flows through the scheduled filters for filtering, respectively; the scheduling module 12 can program the filters of the filtering layer 112, and data will flow through all the filters in a stream form (as shown by arrows in fig. 2, data flows from the Filter 1121 to the Filter 1123), and the filters can be plugged and unplugged at will through interfaces according to the filtering requirements. Scheduling module 12 may then control the output of the filtered data from one or more output interfaces of data write layer 113.
The scheduling module 12 can integrate resources of each layer of the data processing module 11, and because each layer communicates through an interface, data processing modules with different configurations can be built as required. In addition, the scheduling module 12 may also schedule internal threads of each layer of the data processing module 11, so that the single process uses system resources as much as possible.
In a preferred embodiment, the scheduling module 12 may further perform: unique identification IDs are added to the data of the circulation with the same behavior. Specifically, for rotating data abstract ID fields, the same ID indicates the same behavior, which makes it easier to guarantee idempotent data at the data write layer 113.
Further, the scheduling module 12 may be configured through a configuration interface provided by the configuration module 15, for example, configured through a browser, so as to adapt to different data flow scheduling requirements.
The flow control module 13 is configurable, the data processing module 11 communicates with the flow control module 13 through an interface, and the flow control module 13 manages data flow of each layer of the data processing module 11.
Taking the log data flow processing as an example, due to the large data volume, the network card of the whole physical machine may be affected by excessive data, and even during the peak period, the buffer layer 114 may be improperly configured, which may cause memory overflow. The flow control module 13 configures parameters such as the number, frequency, and interval of data read by the data processing module 11, and the parameters take effect dynamically, so that the flow control module 13 performs flow control management on data under the condition of real-time data flow transfer.
The flow control module 13 can be configured through a configuration interface provided by the configuration module 15, for example, configured through a browser, so as to adapt to different data traffic management control requirements.
The interface layer 14 provides an expansion interface for at least expanding the data processing module 11, so that the data processing module 11 can be flexibly expanded under the condition of realizing real-time data stream transfer, and different data sources can be conveniently butted.
The interface layer 14 also provides an abstract interface for the above modules (including the data processing module 11, the scheduling module 12, the flow control module 13, and the like) to communicate with each other. The mutual independence of acquisition, filtering, sending, scheduling and management is realized by abstracting a plurality of main nodes of data flow respectively. By realizing an abstract interface, the core of the data transfer system only needs to pay attention to the general logic, and the extension party pays attention to the specific application logic. The modules are mutually independent and combined according to needs to adapt to different data circulation requirements.
Further, in a preferred embodiment, the data flow system 1 further includes: and a configurable anomaly detection module 16 for monitoring the data of the flow. Due to the diversity of data, a part of filters may be in a dead state due to data abnormality, or the problem that the data in the memory of the current processor is too much to be processed is caused, and the like, the data can be monitored by the abnormality detection module 16, and an alarm is given when the abnormality is found. The anomaly detection module 16 may also be configured through a configuration interface provided by the configuration module 15, such as a browser, to meet different data anomaly monitoring requirements.
The data circulation system in the embodiment expands the data processing module through the expansion interface, so that different data sources can be conveniently butted; the management modules such as the scheduling module, the flow control module and the anomaly detection module are configured through the configuration interface, the dynamic effect is achieved, the data can be conveniently managed under the condition of real-time data flow transfer, and the whole system has good flexibility and is convenient to use. All modules are communicated through interfaces and are independent from each other, the number of nodes can be effectively controlled, and the whole system can efficiently transfer a large amount of data. And because the collection, the filtration, the sending, the scheduling, the management and the like are mutually independent, when a problem occurs, the problem can be checked for a small number of modules, the computing resources are saved, and the resource utilization rate is improved.
Referring to fig. 3, in a specific application example, the data processing module 11 of any of the above embodiments is used to collect, filter and output log data.
The data processing module 11 in this embodiment comprises a data acquisition layer 111, a filter layer 112, a data writing layer 113, and two buffer layers 114, respectively located between the data acquisition layer 111 and the filter layer 112, and between the filter layer 112 and the data writing layer 113.
The data acquisition layer 111 acquires log data including service logs, gateway logs, Metrics data, Agent logs and other log data through a Kafka collector.
Filter layer 112 filters the collected log data through a number of filters, including format filtering, traffic filtering, gateway filtering, and the like.
The data writing layer 113 writes the filtered log data into the Elasticsearch cluster through the Elasticsearch interface.
The buffer layer 114 implements multithread sharing of data through blocking queue. Through bulk queue, the number of requests for the elastic search is effectively reduced, and a single process can process various data circulation requirements.
Through the data processing module 11, the peak period single machine can process 30000 log data per second and write the log data into the Elasticsearch cluster. Even if a certain collector is restarted in a failure, the log data can be effectively ensured not to be repeated and lost.
Embodiments of the present invention further provide a data streaming method, where the data streaming method is executed by the data streaming system described in any of the above embodiments. Referring to FIG. 4, in some embodiments, the data streaming method includes, but is not limited to, the following steps:
and S10, the data processing module collects and filters data and outputs the filtered data.
And S20, the scheduling module communicates with the data processing module through the interface to schedule the data of each layer of the data processing module.
And S30, the flow control module communicates with the data processing module through the interface to manage the data flow of each layer of the data processing module.
And S40, the data processing module processes one or more data sources through the expansion interface expansion.
And S50, the scheduling module configures the scheduling mode of the data through the configuration interface.
And S60, configuring a management mode of the data flow through a configuration interface by the flow control module.
The reference numbers before the steps are only used for distinguishing different steps, and do not limit the execution sequence among the steps.
Further, the data flow method further includes steps executed by sub-modules of the data flow system described in any of the above embodiments, and the description is not repeated here.
An embodiment of the present invention further provides an electronic device, which includes a processor and a memory, where the memory stores executable instructions, and the processor is configured to execute the steps of the data flow method in the foregoing embodiment by executing the executable instructions.
As described above, the electronic device of the present invention can expand the data processing module through the expansion interface, thereby facilitating the docking of different data sources; configuring each management module through a configuration interface to realize dynamic management of data flow; all modules are communicated through interfaces and are independent of each other, the number of nodes can be effectively controlled, computing resources are saved, and the resource utilization rate is improved.
Fig. 5 is a schematic structural diagram of an electronic device in an embodiment of the present invention, and it should be understood that fig. 5 only schematically illustrates various modules, and these modules may be virtual software modules or actual hardware modules, and the combination, the splitting, and the addition of the remaining modules of these modules are within the scope of the present invention.
As will be appreciated by one skilled in the art, aspects of the present invention may be embodied as a system, method or program product. Thus, various aspects of the invention may be embodied in the form of: an entirely hardware embodiment, an entirely software embodiment (including firmware, microcode, etc.) or an embodiment combining hardware and software aspects that may all generally be referred to herein as a "circuit," module "or" platform.
An electronic device 400 according to this embodiment of the invention is described below with reference to fig. 5. The electronic device 400 shown in fig. 4 is only an example and should not bring any limitation to the function and the scope of use of the embodiments of the present invention.
As shown in fig. 4, electronic device 400 is embodied in the form of a general purpose computing device. The components of electronic device 400 may include, but are not limited to: at least one processing unit 410, at least one memory unit 420, a bus 430 connecting different platform components (including memory unit 420 and processing unit 410), a display unit 440, and the like.
Wherein the storage unit stores program code, which can be executed by the processing unit 410, so that the processing unit 410 performs the steps according to various exemplary embodiments of the present invention described in the above data flow method section of the present specification. For example, the processing unit 410 may perform the steps shown in fig. 4, respectively.
The storage unit 420 may include readable media in the form of volatile storage units, such as a random access memory unit (RAM)4201 and/or a cache memory unit 4202, and may further include a read only memory unit (ROM) 4203.
The storage unit 420 may also include a program/utility 4204 having a set (at least one) of program modules 4205, such program modules 4205 including, but not limited to: an operating system, one or more application programs, other program modules, and program data, each of which, or some combination thereof, may comprise an implementation of a network environment.
Bus 430 may be any bus representing one or more of several types of bus structures, including a memory unit bus or memory unit controller, a peripheral bus, an accelerated graphics port, a processing unit, or a local bus using any of a variety of bus architectures.
The electronic device 400 may also communicate with one or more external devices 500 (e.g., keyboard, pointing device, bluetooth device, etc.), with one or more devices that enable a user to interact with the electronic device 400, and/or with any devices (e.g., router, modem, etc.) that enable the electronic device 400 to communicate with one or more other computing devices. Such communication may occur via input/output (I/O) interfaces 450. Also, the electronic device 400 may communicate with one or more networks (e.g., a Local Area Network (LAN), a Wide Area Network (WAN), and/or a public network, such as the internet) via the network adapter 460. The network adapter 460 may communicate with other modules of the electronic device 400 via the bus 430. It should be appreciated that although not shown in the figures, other hardware and/or software modules may be used in conjunction with electronic device 400, including but not limited to: microcode, device drivers, redundant processing units, external disk drive arrays, RAID systems, tape drives, and data backup storage platforms, to name a few.
The embodiment of the present invention further provides a computer-readable storage medium, which is used for storing a computer program, and when the computer program is executed, the steps of the data streaming method of the above embodiment are implemented. In some possible embodiments, the various aspects of the invention may also be implemented in the form of a program product comprising program code means for causing a terminal device to carry out the steps according to various exemplary embodiments of the invention described in the above-mentioned data flow method section of the present description, when the program product is run on the terminal device.
As described above, the computer-readable storage medium of the present invention can expand the data processing module through the expansion interface, thereby facilitating the docking of different data sources; configuring each management module through a configuration interface to realize dynamic management of data flow; all modules are communicated through interfaces and are independent of each other, the number of nodes can be effectively controlled, computing resources are saved, and the resource utilization rate is improved.
Fig. 6 is a schematic structural diagram of a computer-readable storage medium of the present invention. Referring to fig. 6, a program product 600 for implementing the above method according to an embodiment of the present invention is described, which may employ a portable compact disc read only memory (CD-ROM) and include program code, and may be run on a terminal device, such as a personal computer. However, the program product of the present invention is not limited in this regard and, in the present document, a readable storage medium may be any tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device.
The program product may employ any combination of one or more readable media. The readable medium may be a readable signal medium or a readable storage medium. A readable storage medium may be, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination of the foregoing. More specific examples (a non-exhaustive list) of the readable storage medium include: an electrical connection having one or more wires, a portable disk, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing.
A computer readable storage medium may include a propagated data signal with readable program code embodied therein, for example, in baseband or as part of a carrier wave. Such a propagated data signal may take many forms, including, but not limited to, electro-magnetic, optical, or any suitable combination thereof. A readable storage medium may also be any readable medium that is not a readable storage medium and that can communicate, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device. Program code embodied on a readable storage medium may be transmitted using any appropriate medium, including but not limited to wireless, wireline, optical fiber cable, RF, etc., or any suitable combination of the foregoing.
Program code for carrying out operations for aspects of the present invention may be written in any combination of one or more programming languages, including an object oriented programming language such as Java, C + + or the like and conventional procedural programming languages, such as the "C" programming language or similar programming languages. The program code may execute entirely on the user's computing device, partly on the user's device, as a stand-alone software package, partly on the user's computing device and partly on a remote computing device, or entirely on the remote computing device or server. In the case of a remote computing device, the remote computing device may be connected to the user computing device through any kind of network, including a Local Area Network (LAN) or a Wide Area Network (WAN), or may be connected to an external computing device (e.g., through the internet using an internet service provider).
The foregoing is a more detailed description of the invention in connection with specific preferred embodiments and it is not intended that the invention be limited to these specific details. For those skilled in the art to which the invention pertains, several simple deductions or substitutions can be made without departing from the spirit of the invention, and all shall be considered as belonging to the protection scope of the invention.

Claims (8)

1. A data streaming system, comprising:
an extensible data processing module comprising: the data acquisition layer comprises one or more collectors for acquiring data; a filtering layer comprising one or more filters for filtering the acquired data; the data writing layer comprises one or more output interfaces and is used for outputting the filtered data; all layers of the data processing module are communicated through an abstract interface;
a configurable scheduling module, said data processing module communicating with said scheduling module through an interface, said scheduling module scheduling resources of each layer of said data processing module, comprising: controlling one or more collectors of the data collection layer to collect data; scheduling one or more filters of the filter layer, arranging the scheduled filters so that data respectively flow through the scheduled filters in a stream form for filtering, and controlling the filtered data to be output from one or more output interfaces of the data writing layer;
the flow control module is configured, and is used for configuring the data reading quantity and the data reading frequency of the data processing module so as to manage the data reading flow and the data reading frequency of each layer of the data processing module;
the data acquisition layer is provided with an acquisition progress feedback interface and is used for submitting data acquisition progress to at least the scheduling module and the flow control module;
the interface layer provides an expansion interface at least for the data processing module to expand; and
and the configuration module provides a configuration interface at least for the configuration of the scheduling module and the flow control module.
2. The data flow system of claim 1, wherein the data processing module further comprises:
one or more buffer layers between the data acquisition layer and the filter layer, and/or between the filter layer and the data writing layer;
and the data processing modules perform asynchronous data transfer among the layers through the buffer layers.
3. The data flow system of claim 1, wherein the acquisition progress feedback interface includes a synchronous feedback interface and an asynchronous feedback interface.
4. The data flow system of claim 1, wherein the scheduling module further performs:
unique identification IDs are added to the data of the circulation with the same behavior.
5. The data flow system of claim 1, further comprising:
and the configurable abnormality detection module is used for monitoring the data in circulation.
6. A data flow method, performed by the data flow system of any one of claims 1-5, comprising:
the data processing module collects and filters data and outputs the filtered data;
the scheduling module is communicated with the data processing module through an interface and schedules data of each layer of the data processing module;
the flow control module is communicated with the data processing module through an interface to manage the data flow of each layer of the data processing module;
the data processing module is used for processing one or more data sources through expansion of an expansion interface;
the scheduling module configures a scheduling mode of data through a configuration interface; and
and the flow control module configures a management mode of data flow through a configuration interface.
7. An electronic device, comprising:
a processor; and
a memory for storing executable instructions;
wherein the processor is configured to perform the steps of the data flow method of claim 6 via execution of the executable instructions.
8. A computer-readable storage medium, on which a computer program is stored, the computer program, when being executed by a processor, implementing the steps of the data flow method of claim 6.
CN201811158015.5A 2018-09-30 2018-09-30 Data transfer system, data transfer method, electronic device, and storage medium Active CN109412970B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811158015.5A CN109412970B (en) 2018-09-30 2018-09-30 Data transfer system, data transfer method, electronic device, and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811158015.5A CN109412970B (en) 2018-09-30 2018-09-30 Data transfer system, data transfer method, electronic device, and storage medium

Publications (2)

Publication Number Publication Date
CN109412970A CN109412970A (en) 2019-03-01
CN109412970B true CN109412970B (en) 2022-07-05

Family

ID=65466667

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811158015.5A Active CN109412970B (en) 2018-09-30 2018-09-30 Data transfer system, data transfer method, electronic device, and storage medium

Country Status (1)

Country Link
CN (1) CN109412970B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109947837A (en) * 2019-03-21 2019-06-28 上海富数科技有限公司 Realize the system and method for intelligent scheduling data grabber function
CN110071873A (en) * 2019-04-09 2019-07-30 华为技术有限公司 A kind of method, apparatus and relevant device sending data

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105978960A (en) * 2016-05-06 2016-09-28 武汉烽火众智数字技术有限责任公司 Cloud scheduling system and method based on mass video structured processing
CN107317724A (en) * 2017-06-06 2017-11-03 中证信用增进股份有限公司 Data collecting system and method based on cloud computing technology

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6047323A (en) * 1995-10-19 2000-04-04 Hewlett-Packard Company Creation and migration of distributed streams in clusters of networked computers
CN101567845B (en) * 2009-04-27 2011-12-07 成都科来软件有限公司 Real-time processing system and method of universal network data packet
CN102685801B (en) * 2012-05-11 2014-10-22 中国联合网络通信集团有限公司 Network management equipment and method
CN104090823A (en) * 2014-06-09 2014-10-08 中国建设银行股份有限公司 Flow control method and device for computer system
CN106060149A (en) * 2016-06-24 2016-10-26 北京交通大学 Mobile internet mass data analysis and audit technical architecture
CN106354795B (en) * 2016-08-26 2019-09-17 南威软件股份有限公司 A kind of Distributed data exchange system

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105978960A (en) * 2016-05-06 2016-09-28 武汉烽火众智数字技术有限责任公司 Cloud scheduling system and method based on mass video structured processing
CN107317724A (en) * 2017-06-06 2017-11-03 中证信用增进股份有限公司 Data collecting system and method based on cloud computing technology

Also Published As

Publication number Publication date
CN109412970A (en) 2019-03-01

Similar Documents

Publication Publication Date Title
CN113742031B (en) Node state information acquisition method and device, electronic equipment and readable storage medium
KR101137073B1 (en) Proactive power management in a parallel computer
US10031683B2 (en) Providing service address space for diagnostics collection
CN101652750A (en) Data processing device, distributed processing system, data processing method, and data processing program
CN111966465B (en) Method, system, equipment and medium for modifying host configuration parameters in real time
CN109412970B (en) Data transfer system, data transfer method, electronic device, and storage medium
CN108959139A (en) A kind of CPLD pin multiplexing method and device
CN110868324A (en) Service configuration method, device, equipment and storage medium
US8977752B2 (en) Event-based dynamic resource provisioning
CN113658351B (en) Method and device for producing product, electronic equipment and storage medium
EP4024761A1 (en) Communication method and apparatus for multiple management domains
CN116401116A (en) High availability scheduler event tracking
CN112328184B (en) Cluster capacity expansion method, device, equipment and storage medium
US9772877B2 (en) Managing I/O operations in a shared file system
CN117112121A (en) Distributed task processing system, method, apparatus and computer program product
CN113923100A (en) Alarm acquisition method, alarm acquisition system, storage medium and electronic device
CN113986097A (en) Task scheduling method and device and electronic equipment
CN113031969A (en) Equipment deployment inspection method and device, computer equipment and storage medium
CN112953810A (en) Network request processing method and device
CN111367875B (en) Ticket file processing method, system, equipment and medium
Park et al. Design and implementation of connected DataLake system for reliable data transmission
Pugdeethosapol et al. Dynamic configuration of the computing nodes of the ALICE O 2 system
CN111666156B (en) Container arranging method, device, equipment and storage medium
CN113900898B (en) Data processing system, equipment and medium
CN111953506B (en) Message processing method and processing system

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
EE01 Entry into force of recordation of patent licensing contract
EE01 Entry into force of recordation of patent licensing contract

Application publication date: 20190301

Assignee: Nanjing Manyun Cold Chain Technology Co.,Ltd.

Assignor: JIANGSU MANYUN SOFTWARE TECHNOLOGY Co.,Ltd.

Contract record no.: X2023980038397

Denomination of invention: Data flow system, data flow methods, electronic devices, and storage media

Granted publication date: 20220705

License type: Common License

Record date: 20230724