CN113469857A - Data processing method and device, electronic equipment and storage medium - Google Patents

Data processing method and device, electronic equipment and storage medium Download PDF

Info

Publication number
CN113469857A
CN113469857A CN202110811739.0A CN202110811739A CN113469857A CN 113469857 A CN113469857 A CN 113469857A CN 202110811739 A CN202110811739 A CN 202110811739A CN 113469857 A CN113469857 A CN 113469857A
Authority
CN
China
Prior art keywords
target data
data
abnormal
type
target
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202110811739.0A
Other languages
Chinese (zh)
Inventor
祖启航
许义军
毛佳茗
秦小钟
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shencai Technology Co Ltd
Original Assignee
Shencai Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shencai Technology Co Ltd filed Critical Shencai Technology Co Ltd
Priority to CN202110811739.0A priority Critical patent/CN113469857A/en
Publication of CN113469857A publication Critical patent/CN113469857A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/10Services
    • G06Q50/26Government or public services

Landscapes

  • Business, Economics & Management (AREA)
  • Tourism & Hospitality (AREA)
  • Marketing (AREA)
  • Primary Health Care (AREA)
  • Health & Medical Sciences (AREA)
  • Economics (AREA)
  • General Health & Medical Sciences (AREA)
  • Human Resources & Organizations (AREA)
  • Development Economics (AREA)
  • Educational Administration (AREA)
  • Strategic Management (AREA)
  • Physics & Mathematics (AREA)
  • General Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Debugging And Monitoring (AREA)

Abstract

The invention discloses a data processing method and device, electronic equipment and a storage medium, and belongs to the technical field of data processing. The method comprises the following steps: detecting target data based on a preset data detection rule, and determining whether the target data is abnormal; and if so, processing the target data according to an exception handling mode associated with the exception type to which the target data belongs. According to the technical scheme, abnormal data in the pollution source online monitoring system can be rapidly found, the abnormal data can be flexibly processed, human resources are saved, and a new thought is provided for finding and processing the abnormal data in the pollution source online monitoring system.

Description

Data processing method and device, electronic equipment and storage medium
Technical Field
The embodiment of the invention relates to the technical field of data processing, in particular to a data processing method and device, electronic equipment and a storage medium.
Background
The pollution source online monitoring system is used as an important means for environment supervision, and has an important effect on improving the environment supervision level. The effectiveness of the pollution source online monitoring data is directly related to the exertion of the function of the pollution source online monitoring system. The on-line monitoring system uploads the data generated by various analytical instruments, sensors and the like to the environmental protection law enforcement department. Because the monitoring equipment is influenced by uncertain factors such as environment, equipment loss, human intervention and the like, the online monitoring data acquired by the environmental protection department often has the conditions of data error, repetition, abnormality and the like. Abnormal monitoring data cannot truly reflect the actual pollution discharge situation of an enterprise, and meanwhile, the monitoring alarm, administrative law enforcement and punishment judgment of an environmental protection department can cause wrong influence. Therefore, an effective data management method is needed to find and process abnormal online monitoring data.
Disclosure of Invention
The invention provides a data processing method, a data processing device, electronic equipment and a storage medium, which are used for realizing automatic identification and timely processing of abnormal data.
In a first aspect, an embodiment of the present invention provides a data processing method, which is applied to a pollution source monitoring system, and the method includes:
detecting target data based on a preset data detection rule, and determining whether the target data is abnormal;
and if so, processing the target data according to an exception handling mode associated with the exception type to which the target data belongs.
In a second aspect, an embodiment of the present invention further provides a data processing apparatus configured in a pollution source monitoring system, where the apparatus includes:
the anomaly determination module is used for detecting target data based on a preset data detection rule and determining whether the target data is abnormal or not;
and if so, processing the target data according to an exception handling mode associated with the exception type to which the target data belongs.
In a third aspect, an embodiment of the present invention further provides an electronic device, including:
one or more processors;
a memory for storing one or more programs;
when the one or more programs are executed by the one or more processors, the one or more processors are caused to implement the data processing method provided by any embodiment of the present invention.
In a fourth aspect, the embodiment of the present invention further provides a computer-readable storage medium, on which a computer program is stored, where the computer program, when executed by a processor, implements the data processing method provided in any embodiment of the present invention.
According to the technical scheme of the embodiment of the invention, the target data is detected based on the preset data detection rule, whether the target data is abnormal or not is determined, and if yes, the target data is processed according to the abnormal processing mode associated with the abnormal type of the target data. According to the technical scheme, abnormal data in the pollution source online monitoring system can be rapidly found, the abnormal data can be flexibly processed, human resources are saved, and a new thought is provided for finding and processing the abnormal data in the pollution source online monitoring system.
Drawings
Fig. 1 is a flowchart of a data processing method according to an embodiment of the present invention;
fig. 2 is a flowchart of a data processing method according to a second embodiment of the present invention;
fig. 3 is a schematic structural diagram of a data processing apparatus according to a third embodiment of the present invention;
fig. 4 is a schematic structural diagram of an electronic device according to a fourth embodiment of the present invention.
Detailed Description
The present invention will be described in further detail with reference to the accompanying drawings and examples. It is to be understood that the specific embodiments described herein are merely illustrative of the invention and are not limiting of the invention. It should be further noted that, for the convenience of description, only some of the structures related to the present invention are shown in the drawings, not all of the structures.
Example one
Fig. 1 is a flowchart of a data processing method according to an embodiment of the present invention, where the embodiment is applicable to a situation where abnormal data in an online pollution source monitoring system is detected and processed, and the method may be executed by a data processing apparatus, where the apparatus may be implemented by software and/or hardware, and incorporated into an electronic device that carries a data processing function, such as a server.
As shown in fig. 1, the method may specifically include:
s110, detecting the target data based on a preset data detection rule, and determining whether the target data is abnormal.
In this embodiment, the target data refers to data that needs to be subjected to anomaly detection in the pollution source online monitoring system. The preset data detection rule may include a data common detection rule and a data service detection rule, where the data common detection rule is used to detect data itself, and the data service detection rule is used to detect a service attribute to which the data belongs.
The so-called common detection rule of data can comprise null value check, unique check, code check, value range check, data type check, repeatability check, normative check, referential integrity check, outlier check and the like; the null value check is to check the values of a specific table and a specific column, such as whether the enterprise address, the legal person, the longitude and latitude and the like are null values; the unique check is a unique check on the field provided, such as a contamination source unique number; the code check is to check whether the field value is in a code table (defined in advance by the system), and the field default with the code application in the source data is checked; the value range check is to judge whether the data to be detected meets the defined value range of the index metadata; the data type check means whether the data to be detected meets the definition type of the metadata; the repeatability check means that a set is formed by a specific table and a column or a combination of columns, and whether repeated values exist or not is found, or a repeated set is found; the normative check refers to checking specific normative on a specific table and a specific column, such as constraint check of rules of identity cards, mobile phone numbers, mailboxes, postal codes, fixed phones, IP addresses, MAC addresses, dates and the like; the referential integrity check refers to the check of the integrity of the referential between the main external keys of the data and the discovery of the empty referential of the external keys to the main keys; the outlier inspection refers to the step of performing distribution analysis on the content of the numerical field, verifying the distribution of data according to the Lauda method or the Grubbs method, and feeding back the outlier found to a corresponding business department.
The data service detection rule is a corresponding detection rule specified for the service field to which the data belongs, for example, for index data of environmental quality class, the evaluation result of the environmental quality is formulated according to the service attribute rules of national standard and line standard mainly from the relevant monitoring standard and evaluation standard of the environmental quality. For example, the relevant service indexes of the atmospheric environment may refer to the technical specification of Air Quality Index (AQI), and the rules of service limit values are formulated for the Air Quality Index, the pollutant item concentration, and the like. For the abnormal detection of the environmental supervision data, business rules can be formulated for industry attributes, region attributes, name attributes, management type attributes and the like of the pollution source according to administrative permission and administrative management specifications.
Optionally, a target type of the target data may be determined; and detecting the target data based on the data detection rule associated with the target type to determine whether the target data is abnormal. Wherein, one target type can be associated with a plurality of data detection rules.
Specifically, the target type of the target data is determined according to a predefined data standard, and then the target data is detected based on a data detection rule associated with the target type to determine whether the target data is abnormal. For example, if the target data is table data, null value check may be used to detect whether each field in the table has a null value, and value range check may also be used to detect whether the value of each field data in the table meets the defined value range of the index metadata; a repeatability check may also be employed to detect whether duplicate values or overlapping sets exist in the data of a set of characteristic columns or combinations of columns in the table data, and so forth.
Before the target data is detected, data standards are formulated, for example, data asset combing is performed according to business domains and data types of online monitoring of enterprises, and relevant standards of production, management and application of the data assets are formulated. Firstly, defining a uniform identifier, specifically comprising a data identifier, an equipment identifier, an organization identifier, a type identifier and the like, and forming a standard center for online monitoring data by referring to national and local standards. Secondly, defining metadata standards related to online monitoring business data by researching and combing the online monitoring business and data, wherein the metadata standards comprise: code specification of online monitoring data assets, core metadata contents (including Chinese paraphrases, remarks, logic primary keys, quality detection rules, data sources, data types, gate department authority, analysis rules and the like), and thirdly, standardizing a data model for online monitoring, including: naming rules of each service main data model, field naming rules, dictionary table reference relations, data types, field lengths, logic grouping and the like; fourthly, the standardization of data operation mainly comprises: data cleaning processing rules, data integration naming rules, scheduling task rules, data development rules and the like.
And S120, if so, processing the target data according to the exception handling mode associated with the exception type to which the target data belongs.
In this embodiment, if it is determined that the target data is abnormal, the abnormal type to which the target data belongs is determined, and the target data is processed according to the abnormal processing mode associated with the abnormal type described in the target data.
According to the technical scheme of the embodiment of the invention, the target data is detected based on the preset data detection rule, whether the target data is abnormal or not is determined, and if yes, the target data is processed according to the abnormal processing mode associated with the abnormal type of the target data. According to the technical scheme, abnormal data in the pollution source online monitoring system can be rapidly found, the abnormal data can be flexibly processed, human resources are saved, and a new thought is provided for finding and processing the abnormal data in the pollution source online monitoring system.
On the basis of the above embodiment, as an optional manner of the embodiment of the present invention, if the target data is abnormal, the target data may be marked, and the abnormality warning information may be sent to the terminal.
Optionally, the target data may be marked in a highlight mode for the abnormal target data, and the address information and the abnormal information of the abnormal target data may be sent to the terminal through voice or a short message.
It can be understood that by marking the abnormal target data, the abnormal target data can be clearly marked for the administrator to check; and the prediction early warning information is sent to the terminal, and related personnel can be informed to process the abnormal data in time.
Example two
Fig. 2 is a flowchart of a data processing method according to a second embodiment of the present invention; on the basis of the above embodiment, optimization is carried out, and an alternative implementation scheme is provided.
As shown in fig. 2, the method may specifically include:
s210, based on the business scene to which the target data belongs and/or a preset standardization rule, standardizing the target data.
In this embodiment, because different online pollution source monitoring data have different dimensions and dimension units, such data may affect detection and analysis of the data, and in order to ensure that the influence of the dimensions on the monitoring data is reduced, the target data may be standardized based on a service scenario to which the target data belongs, and different processing algorithms and processing modes are adopted for the data standardization processing aiming at different information fields based on different service scenarios. For example, if the target data belongs to the air environment type data, the target data may be normalized based on the relevant business standard of the atmospheric environment. For another example, for index monitoring data acquired by the monitoring device automatically, based on the requirement of data analysis modeling, a standardized algorithm may be used for automatic preprocessing. The standardization of partial data fields needs to be mainly carried out in a manual mode, for example, for an enterprise pollution source coordinate information field of 1-3-5 kilometers of pollution source data, the coordinates expressed in different units can be uniformly converted in an automatic calculation mode, and the uniform standard of a numerical value expression format needs manual intervention processing.
Optionally, the target data may be normalized based on a preset normalization rule. For example, the target data can be set to the same dimensional level by min-max normalization, Z-score normalization, and the like.
Optionally, the target data may be standardized based on a service scenario to which the target data belongs and a preset standardization rule.
S220, detecting the target data based on a preset data detection rule, and determining whether the target data is abnormal.
In this embodiment, based on a preset data detection rule, the target data may be detected by identifying whether the target data includes outlier data; the outlier data refers to data of a boundary outlier in the data, or noise data. Further, if the target data includes outlier data, determining that the abnormal type of the target data is a noise data type.
Optionally, based on a preset data detection rule, detecting the target data may be to identify whether there is a deviation between an actual receiving time point of the target data and an ideal receiving time point corresponding to the target data; the actual receiving time point refers to the time point when the target data are transmitted to the pollution source online monitoring system; the ideal reception time point refers to a time point at which the target data should theoretically be transmitted to the pollution source online monitoring system. And if the actual receiving time point of the target data is different from the ideal receiving time point corresponding to the target data, determining that the abnormal type of the target data is the jitter data type.
Optionally, based on a preset data detection rule, detecting the target data may be detecting the integrity of the target data. Specifically, if the target data is incomplete, the abnormal type of the target data is determined to be the missing data type.
Optionally, based on a preset data detection rule, the detection of the target data may be to identify whether the target data is within a set value range. Specifically, if the value of the target data is not within the set value range, the exception type of the target data is determined to be a logical error exception type.
Optionally, based on a preset data detection rule, the detection of the target data may be consistency comparison between the target data and reported data corresponding to the target data; wherein, the reported data refers to the data of reporting records by the enterprise to the supervision department before pollution discharge; the target data refers to data which is actually detected by sensing equipment at the sewage discharge port of the enterprise and uploaded to the pollution source online monitoring system in real time. Specifically, if the target data is inconsistent with the reported data corresponding to the target data, the abnormal type of the target data is determined to be an inconsistent data type.
Optionally, the target data may be detected based on a preset data detection rule, where the detection may be repeated detection of the target data. Specifically, the target data may be repeatedly identified based on a sorting and merging method, grouping by key field, identifying context information, identifying based on domain and business knowledge, identifying data characteristics, and the like, and if the target data has repeated data, the abnormal type of the target data is determined to be a repeated data type.
And S230, if so, processing the target data according to the exception handling mode associated with the exception type to which the target data belongs.
Optionally, if the abnormal type of the target data is a noise data type, denoising processing is performed on the target data. Specifically, methods such as linear regression, binning, outlier analysis, and the like may be used to process the abnormal target data. Linear regression is to find the direct optimal straight line fitting two variables, and when a specific value of one variable is given, the linear regression can be used for predicting the other variable; when the data variable is plural, multiple linear regression may be employed. Binning refers to analyzing data by using a "neighborhood" method, and the data is smooth and effective, and the data is distributed into some bins, which may include a bin mean smoothing method, a bin number smoothing method, a bin boundary smoothing method, and the like.
Optionally, if the heterogeneous type of the target data is a jitter data type, performing jitter offset processing on the target data. Specifically, an appropriate smoothing method may be constructed based on a moving average method, an exponential average method, an SG filtering method, or the like, and the target data may be processed by the dither offset processing to improve the usability of the data. The moving average method is based on an average method, new data are gradually added or old data are reduced according to the sequence, a moving average value is calculated to eliminate accidental variation factors, data are predicted, and the method is suitable for the conditions that the real value is not large, linear variation exists, and the noise average value is zero data; the exponential averaging method is moving average with exponential descending weighting, and the exponential moving averaging method has stronger real-time performance than the sliding averaging method and is closer to the observed value at the current moment; the SG (Savitzky-Golay) filtering method is a polynomial smoothing algorithm based on the least square principle, and the core is to carry out weighted filtering on data, so that the information of data signal change can be more effectively reserved.
Optionally, if the abnormal type of the target data is the missing data type, the target data is padded. Specifically, the target data may be subjected to padding processing by an average value method, a median method, a mode method, a regression padding method, a near padding method, a trend scoring method, or the like.
Optionally, if the exception type of the target data is a logical error data type, performing logical verification on the target data. Specifically, a historical data comparison method can be used for observing the change trend, characteristics and rules of the data through analysis of historical data, so that the correctness of the data is further verified. And the data with larger increase (or decrease) of the concentration (content) of the same proportion can be mainly checked during evaluation according to the characteristics of the target data, and the correct data needs to be reported again by related units if the data is confirmed to be wrong after the verification.
Optionally, if the abnormal type of the target data is an inconsistent data type, an error prompt of the target data is sent to the user terminal. Specifically, the error prompt of the target data can be sent to the user terminal in the forms of short message, telephone, voice and the like. The error prompt message may include error data.
Optionally, if the abnormal type of the target data is a repeated data type, deleting the target data. Specifically, the duplicated target data is deleted.
According to the technical scheme of the embodiment of the invention, the target data is subjected to standardized processing based on the business scene to which the target data belongs and/or a preset standardized rule, then the target data is detected based on a preset data detection rule, whether the target data is abnormal or not is determined, and if yes, the target data is processed according to an abnormal processing mode related to the abnormal type to which the target data belongs. According to the technical scheme, abnormal data in the pollution source online monitoring system can be rapidly found, the abnormal data can be flexibly processed, human resources are saved, and a new thought is provided for finding and processing the abnormal data in the pollution source online monitoring system.
EXAMPLE III
Fig. 3 is a schematic structural diagram of a data processing apparatus according to a third embodiment of the present invention; the device can be realized in a software and/or hardware mode and is integrated in electronic equipment bearing a data processing function, such as a server.
As shown in fig. 3, the apparatus may include an anomaly determination module 310 and a data processing module 320, wherein,
an anomaly determination module 310, configured to detect target data based on a preset data detection rule, and determine whether the target data is anomalous;
and the data processing module 320 is configured to, if yes, process the target data according to an exception handling manner associated with the exception type to which the target data belongs.
According to the technical scheme of the embodiment of the invention, the target data is detected based on the preset data detection rule, whether the target data is abnormal or not is determined, and if yes, the target data is processed according to the abnormal processing mode associated with the abnormal type of the target data. According to the technical scheme, abnormal data in the pollution source online monitoring system can be rapidly found, the abnormal data can be flexibly processed, human resources are saved, and a new thought is provided for finding and processing the abnormal data in the pollution source online monitoring system.
Further, the device further comprises a standardization processing module, and the standardization processing module is specifically configured to:
and standardizing the target data based on the service scene to which the target data belongs and/or a preset standardization rule.
Further, the anomaly determination module 310 includes a target type determination unit and an anomaly determination unit, wherein,
a target type determination unit for determining a target type of the target data;
and the abnormity determining unit is used for detecting the target data based on the data detection rule associated with the target type and determining whether the target data is abnormal.
Further, the anomaly determination module 310 is specifically configured to:
identifying whether the target data comprises outlier data;
identifying whether the actual receiving time point of the target data and the ideal receiving time point corresponding to the target data have deviation or not;
detecting the integrity of the target data;
identifying whether the target data is within a set numerical range;
carrying out consistency comparison on the target data and reported data corresponding to the target data;
and detecting the repeatability of the target data.
Further, the data processing module 320 is specifically configured to:
if the abnormal type of the target data is a noise data type, denoising the target data;
if the different type of the target data is the jitter data type, performing jitter offset processing on the target data;
if the abnormal type of the target data is the missing data type, filling the target data;
if the abnormal type of the target data is a logic error data type, performing logic check on the target data;
if the abnormal type of the target data is the inconsistent data type, sending an error prompt of the target data to the user terminal;
and if the abnormal type of the target data is the repeated data type, deleting the target data.
Further, the device further comprises an early warning information sending module, wherein the early warning information sending module is specifically used for:
and if the target data is abnormal, marking the target data and sending abnormal early warning information to the terminal.
The data processing device can execute the data processing method provided by any embodiment of the invention, and has corresponding functional modules and beneficial effects of the execution method.
Example four
Fig. 4 is a schematic structural diagram of an electronic device according to a fourth embodiment of the present invention, and fig. 4 shows a block diagram of an exemplary device suitable for implementing the embodiment of the present invention. The device shown in fig. 4 is only an example and should not bring any limitation to the function and the scope of use of the embodiments of the present invention.
As shown in FIG. 4, electronic device 12 is embodied in the form of a general purpose computing device. The components of electronic device 12 may include, but are not limited to: one or more processors or processing units 16, a system memory 28, and a bus 18 that couples various system components including the system memory 28 and the processing unit 16.
Bus 18 represents one or more of any of several types of bus structures, including a memory bus or memory controller, a peripheral bus, an accelerated graphics port, and a processor or local bus using any of a variety of bus architectures. By way of example, such architectures include, but are not limited to, Industry Standard Architecture (ISA) bus, micro-channel architecture (MAC) bus, enhanced ISA bus, Video Electronics Standards Association (VESA) local bus, and Peripheral Component Interconnect (PCI) bus.
Electronic device 12 typically includes a variety of computer system readable media. Such media may be any available media that is accessible by electronic device 12 and includes both volatile and nonvolatile media, removable and non-removable media.
The system memory 28 may include computer system readable media in the form of volatile memory, such as Random Access Memory (RAM)30 and/or cache memory (cache 32). The electronic device 12 may further include other removable/non-removable, volatile/nonvolatile computer system storage media. By way of example only, storage system 34 may be used to read from and write to non-removable, nonvolatile magnetic media (not shown in FIG. 4, and commonly referred to as a "hard drive"). Although not shown in FIG. 4, a magnetic disk drive for reading from and writing to a removable, nonvolatile magnetic disk (e.g., a "floppy disk") and an optical disk drive for reading from or writing to a removable, nonvolatile optical disk (e.g., a CD-ROM, DVD-ROM, or other optical media) may be provided. In these cases, each drive may be connected to bus 18 by one or more data media interfaces. System memory 28 may include at least one program product having a set (e.g., at least one) of program modules that are configured to carry out the functions of embodiments of the invention.
A program/utility 40 having a set (at least one) of program modules 42 may be stored, for example, in system memory 28, such program modules 42 including, but not limited to, an operating system, one or more application programs, other program modules, and program data, each of which examples or some combination thereof may comprise an implementation of a network environment. Program modules 42 generally carry out the functions and/or methodologies of embodiments described herein.
Electronic device 12 may also communicate with one or more external devices 14 (e.g., keyboard, pointing device, display 24, etc.), with one or more devices that enable a user to interact with electronic device 12, and/or with any devices (e.g., network card, modem, etc.) that enable electronic device 12 to communicate with one or more other computing devices. Such communication may be through an input/output (I/O) interface 22. Also, the electronic device 12 may communicate with one or more networks (e.g., a Local Area Network (LAN), a Wide Area Network (WAN), and/or a public network, such as the Internet) via the network adapter 20. As shown, the network adapter 20 communicates with other modules of the electronic device 12 via the bus 18. It should be understood that although not shown in the figures, other hardware and/or software modules may be used in conjunction with electronic device 12, including but not limited to: microcode, device drivers, redundant processing units, external disk drive arrays, RAID systems, tape drives, and data backup storage systems, among others.
The processing unit 16 executes various functional applications and data processing, for example, implementing a data processing method provided by an embodiment of the present invention, by executing a program stored in the system memory 28.
EXAMPLE five
Fifth, an embodiment of the present invention further provides a computer-readable storage medium, on which a computer program (or referred to as computer-executable instructions) is stored, where the computer program is used to execute a data processing method provided in the embodiment of the present invention when executed by a processor, and the method includes:
detecting target data based on a preset data detection rule, and determining whether the target data is abnormal;
and if so, processing the target data according to the exception handling mode associated with the exception type to which the target data belongs.
Computer storage media for embodiments of the invention may employ any combination of one or more computer-readable media. The computer readable medium may be a computer readable signal medium or a computer readable storage medium. A computer readable storage medium may be, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination of the foregoing. More specific examples (a non-exhaustive list) of the computer readable storage medium would include the following: an electrical connection having one or more wires, a portable computer diskette, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing. In the context of this document, a computer readable storage medium may be any tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device.
A computer readable signal medium may include a propagated data signal with computer readable program code embodied therein, for example, in baseband or as part of a carrier wave. Such a propagated data signal may take many forms, including, but not limited to, electro-magnetic, optical, or any suitable combination thereof. A computer readable signal medium may also be any computer readable medium that is not a computer readable storage medium and that can communicate, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device.
Program code embodied on a computer readable medium may be transmitted using any appropriate medium, including but not limited to wireless, wireline, optical fiber cable, RF, etc., or any suitable combination of the foregoing.
Computer program code for carrying out operations for embodiments of the present invention may be written in any combination of one or more programming languages, including an object oriented programming language such as Java, Smalltalk, C + +, and conventional procedural programming languages, such as the "C" programming language or similar programming languages. The program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server. In the case of a remote computer, the remote computer may be connected to the user's computer through any type of network, including a Local Area Network (LAN) or a Wide Area Network (WAN), or the connection may be made to an external computer (for example, through the Internet using an Internet service provider).
It is to be noted that the foregoing is only illustrative of the preferred embodiments of the present invention and the technical principles employed. It will be understood by those skilled in the art that the present invention is not limited to the particular embodiments described herein, but is capable of various obvious changes, rearrangements and substitutions as will now become apparent to those skilled in the art without departing from the scope of the invention. Therefore, although the embodiments of the present invention have been described in more detail through the above embodiments, the embodiments of the present invention are not limited to the above embodiments, and many other equivalent embodiments may be included without departing from the spirit of the present invention, and the scope of the present invention is determined by the scope of the appended claims.

Claims (10)

1. A data processing method is applied to a pollution source monitoring system, and comprises the following steps:
detecting target data based on a preset data detection rule, and determining whether the target data is abnormal;
and if so, processing the target data according to an exception handling mode associated with the exception type to which the target data belongs.
2. The method according to claim 1, wherein before detecting the target data based on the preset data detection rule and determining whether the target data is abnormal, the method further comprises:
and standardizing the target data based on the service scene to which the target data belongs and/or a preset standardization rule.
3. The method according to claim 1, wherein the detecting target data based on a preset data detection rule to determine whether the target data is abnormal comprises:
determining a target type of target data;
and detecting the target data based on a data detection rule associated with the target type, and determining whether the target data is abnormal.
4. The method according to claim 1, wherein the detecting the target data based on the preset data detection rule comprises at least one of the following:
identifying whether outlier data is included in the target data;
identifying whether the actual receiving time point of the target data and the ideal receiving time point corresponding to the target data have deviation or not;
detecting the integrity of the target data;
identifying whether the target data is within a set numerical range;
carrying out consistency comparison on the target data and reported data corresponding to the target data;
and detecting the repeatability of the target data.
5. The method according to claim 1, wherein the processing the target data according to the exception handling manner associated with the exception type to which the target data belongs includes:
if the abnormal type of the target data is a noise data type, denoising the target data;
if the different type of the target data is a jitter data type, performing jitter offset processing on the target data;
if the abnormal type of the target data is a missing data type, filling the target data;
if the abnormal type of the target data is a logic error data type, performing logic check on the target data;
if the abnormal type of the target data is an inconsistent data type, sending an error prompt of the target data to a user terminal;
and if the abnormal type of the target data is the repeated data type, deleting the target data.
6. The method of claim 1, further comprising:
and if the target data is abnormal, marking the target data and sending abnormal early warning information to a terminal.
7. A data processing apparatus configured for use in a pollution source monitoring system, the apparatus comprising:
the anomaly determination module is used for detecting target data based on a preset data detection rule and determining whether the target data is abnormal or not;
and if so, processing the target data according to an exception handling mode associated with the exception type to which the target data belongs.
8. The apparatus of claim 7, further comprising:
and the standardization processing module is used for carrying out standardization processing on the target data based on the business scene to which the target data belongs and/or a preset standardization rule.
9. An electronic device, comprising:
one or more processors;
a memory for storing one or more programs;
when executed by the one or more processors, cause the one or more processors to implement a data processing method as claimed in any one of claims 1-6.
10. A computer-readable storage medium, on which a computer program is stored, which, when being executed by a processor, carries out the data processing method of any one of claims 1 to 6.
CN202110811739.0A 2021-07-19 2021-07-19 Data processing method and device, electronic equipment and storage medium Pending CN113469857A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110811739.0A CN113469857A (en) 2021-07-19 2021-07-19 Data processing method and device, electronic equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110811739.0A CN113469857A (en) 2021-07-19 2021-07-19 Data processing method and device, electronic equipment and storage medium

Publications (1)

Publication Number Publication Date
CN113469857A true CN113469857A (en) 2021-10-01

Family

ID=77880956

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110811739.0A Pending CN113469857A (en) 2021-07-19 2021-07-19 Data processing method and device, electronic equipment and storage medium

Country Status (1)

Country Link
CN (1) CN113469857A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113901043A (en) * 2021-12-13 2022-01-07 西安中电环通数字科技有限公司 Pollution source intelligent supervision and data fusion analysis method and system
CN114567536A (en) * 2022-02-24 2022-05-31 北京百度网讯科技有限公司 Abnormal data processing method and device, electronic equipment and storage medium
CN114897516A (en) * 2022-07-12 2022-08-12 山东乐习信息科技有限公司 Method for overall process data management of special equipment
CN117312295A (en) * 2023-11-29 2023-12-29 中国电力科学研究院有限公司 Method, device, medium and equipment for managing power grid material spot check data

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103617360A (en) * 2013-12-03 2014-03-05 力合科技(湖南)股份有限公司 Early-warning event processing method and device
CN108830765A (en) * 2018-04-18 2018-11-16 中国地质大学(武汉) A kind of checking method and system of pollution entering the water monitoring data
CN109783552A (en) * 2018-11-28 2019-05-21 中国电力科学研究院有限公司 A kind of data cleansing restorative procedure
CN111752936A (en) * 2020-06-30 2020-10-09 中国科学院西北生态环境资源研究院 Data detection management method, device, server and readable storage medium
CN111814203A (en) * 2020-09-08 2020-10-23 江苏神彩科技股份有限公司 Method, device and equipment for processing environmental data and storage medium
CN112379656A (en) * 2020-10-09 2021-02-19 爱普(福建)科技有限公司 Processing method, device, equipment and medium for detecting abnormal data of industrial system

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103617360A (en) * 2013-12-03 2014-03-05 力合科技(湖南)股份有限公司 Early-warning event processing method and device
CN108830765A (en) * 2018-04-18 2018-11-16 中国地质大学(武汉) A kind of checking method and system of pollution entering the water monitoring data
CN109783552A (en) * 2018-11-28 2019-05-21 中国电力科学研究院有限公司 A kind of data cleansing restorative procedure
CN111752936A (en) * 2020-06-30 2020-10-09 中国科学院西北生态环境资源研究院 Data detection management method, device, server and readable storage medium
CN111814203A (en) * 2020-09-08 2020-10-23 江苏神彩科技股份有限公司 Method, device and equipment for processing environmental data and storage medium
CN112379656A (en) * 2020-10-09 2021-02-19 爱普(福建)科技有限公司 Processing method, device, equipment and medium for detecting abnormal data of industrial system

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
乌兰: "环境监测中的异常数据处理研究", 《资源节约与环保》 *
雷敏珊等: "关于环境监测数据中的异常数据分析与处理", 《当代化工研究》 *
马琳等: "基于离群点诊断的污染源在线监测异常数据识别方法探究", 《油气田环境保护》 *

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113901043A (en) * 2021-12-13 2022-01-07 西安中电环通数字科技有限公司 Pollution source intelligent supervision and data fusion analysis method and system
CN114567536A (en) * 2022-02-24 2022-05-31 北京百度网讯科技有限公司 Abnormal data processing method and device, electronic equipment and storage medium
CN114567536B (en) * 2022-02-24 2024-02-23 北京百度网讯科技有限公司 Abnormal data processing method, device, electronic equipment and storage medium
CN114897516A (en) * 2022-07-12 2022-08-12 山东乐习信息科技有限公司 Method for overall process data management of special equipment
CN117312295A (en) * 2023-11-29 2023-12-29 中国电力科学研究院有限公司 Method, device, medium and equipment for managing power grid material spot check data
CN117312295B (en) * 2023-11-29 2024-03-12 中国电力科学研究院有限公司 Method, device, medium and equipment for managing power grid material spot check data

Similar Documents

Publication Publication Date Title
CN113469857A (en) Data processing method and device, electronic equipment and storage medium
CN111343161B (en) Abnormal information processing node analysis method, abnormal information processing node analysis device, abnormal information processing node analysis medium and electronic equipment
US20120102361A1 (en) Heuristic policy analysis
CN113489713B (en) Network attack detection method, device, equipment and storage medium
CN110751451B (en) Laboratory big data management system
TWI734466B (en) Risk assessment method and device for leakage of privacy data
US11968162B1 (en) Message content cleansing
CN109241084B (en) Data query method, terminal equipment and medium
CN110688536A (en) Label prediction method, device, equipment and storage medium
CN110471912B (en) Employee attribute information verification method and device and terminal equipment
CN111209153B (en) Abnormity detection processing method and device and electronic equipment
CN113806370A (en) Environmental data supervision method, device, equipment and storage medium based on big data
CN114595765A (en) Data processing method and device, electronic equipment and storage medium
CN114443437A (en) Alarm root cause output method, apparatus, device, medium, and program product
CN109145609B (en) Data processing method and device
CN110866700A (en) Method and device for determining enterprise employee information disclosure source
CN110795308A (en) Server inspection method, device, equipment and storage medium
CN115758389A (en) Vulnerability processing result checking method and device, electronic equipment and storage medium
CN115016929A (en) Data processing method, device, equipment and storage medium
CN114785616A (en) Data risk detection method and device, computer equipment and storage medium
CN115083030A (en) Service inspection method and device and electronic equipment
KR20140054913A (en) Apparatus and method for processing data error for distributed system
CN112687030A (en) Vehicle condition information processing method and device
CN111782967A (en) Information processing method, information processing device, electronic equipment and computer readable storage medium
CN112989403A (en) Method, device and equipment for detecting database destruction and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20211001