CN115640931A - Fixed source data management method and system - Google Patents

Fixed source data management method and system Download PDF

Info

Publication number
CN115640931A
CN115640931A CN202211364351.1A CN202211364351A CN115640931A CN 115640931 A CN115640931 A CN 115640931A CN 202211364351 A CN202211364351 A CN 202211364351A CN 115640931 A CN115640931 A CN 115640931A
Authority
CN
China
Prior art keywords
basic information
pollution source
information
business
pollution
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202211364351.1A
Other languages
Chinese (zh)
Inventor
康庆
彭道发
常伟
张升
梁必文
万鹏
易枭奇
曾智勇
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Bowo Wisdom Technology Co ltd
Original Assignee
Shenzhen Bowo Wisdom Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Bowo Wisdom Technology Co ltd filed Critical Shenzhen Bowo Wisdom Technology Co ltd
Priority to CN202211364351.1A priority Critical patent/CN115640931A/en
Publication of CN115640931A publication Critical patent/CN115640931A/en
Pending legal-status Critical Current

Links

Images

Abstract

The invention relates to the technical field of pollution source treatment, in particular to a fixed source data treatment method and system. The fixed source data treatment method comprises the steps of firstly obtaining pollution source data, extracting basic information of the pollution source data, then classifying the basic information of the pollution sources, and combining the basic information of the same kind of pollution sources to obtain basic information of a main pollution source; meanwhile, the service project data are obtained, the basic information of the service project data is extracted to obtain the basic information of the service project, and finally the basic information of the main pollution source is associated with the basic information of the service project, so that the data source can be directly found when the pollution source is inquired, the pollution source can be controlled in a targeted manner aiming at the associated service project, the control difficulty of the pollution source is reduced, the control cost is reduced, and the control efficiency is improved.

Description

Fixed source data management method and system
Technical Field
The invention relates to the technical field of pollution source treatment, in particular to a fixed source data treatment method and system.
Background
The fixed source is also called as a fixed pollution source, and refers to various industrial enterprises, places, production facilities, fixed equipment and the like which discharge or release harmful substances to the environment or have harmful effects on the environment, and is called as a fixed source for short.
With the rapid development of the digital reform, the requirement of the environmental protection industry on data is higher and higher, single service data cannot meet the development of environmental protection services, and the data of each environmental service needs to be integrated to perform application innovation of the environmental services. Therefore, in the eco-friendly service, the fixed source as the most important main data thereof needs to be well managed. However, at present, each business department is a self-built business system, and the fixed source is also an independent system, which cannot uniformly manage the fixed source and the business, so that information between the business and the fixed source is difficult to share, and data of the fixed source does not have a reliable data source with reliability, thereby increasing the difficulty in treating the fixed source and lowering the treatment efficiency.
Disclosure of Invention
The fixed source data treatment method and the fixed source data treatment system provided by the embodiment of the invention solve the problems of high treatment difficulty and low efficiency of a fixed source in the prior art.
According to a first aspect, an embodiment provides a fixed-source data governance method, comprising;
acquiring a plurality of pollution source data, wherein the pollution source data at least comprises an enterprise name or a unified social credit code;
respectively extracting multiple types of enterprise names, unified social credit codes, legal persons, legal person contact telephones, enterprise addresses and business items from the multiple pollution source data to obtain multiple pollution source basic information; the basic information of the pollution source at least comprises an enterprise name or a unified social credit code, and further comprises the following steps: at least one of a corporate person, a corporate contact phone, a business address, and one or more business items;
classifying the basic information of the plurality of pollution sources, and combining the basic information of the pollution sources of the same type to obtain the basic information of the main pollution source;
acquiring service project data, wherein the service project data at least comprise service projects, and the service projects are used for representing project types for monitoring pollution sources;
extracting a plurality of business names, unified social credit codes, legal persons, legal person contact phones, business addresses and business items from the business item data to obtain basic information of the business items, wherein the basic information of the business items at least comprises the business items, and the method further comprises the following steps: at least one of a business name, a uniform social credit code, a corporate person, a corporate contact phone, and a business address;
and calculating the association degree of the basic information of the primary pollution source and the basic information of the service project, and associating the basic information of the primary pollution source with the basic information of the service project, wherein the association degree of the basic information of the primary pollution source exceeds a preset value.
As a possible implementation manner, the extracting multiple kinds of enterprise names, unified social credit codes, legal persons, legal person contact phones, enterprise addresses and business items from the multiple pollution source data respectively to obtain multiple pollution source basic information includes:
respectively extracting various field information corresponding to an enterprise name, a unified social credit code, a legal person contact telephone, an enterprise address and a business project from the plurality of pollution source data;
mapping various field information extracted from pollution source data to a preset pollution source basic information identifier to obtain a plurality of pollution source basic information; wherein, the pollution source basic information identification at least comprises an enterprise name identification or a unified social credit code identification, and the method further comprises the following steps: at least one of a corporate identity, a corporate contact telephone identity, a business address identity, and one or more business item identities.
As a possible implementation manner, the classifying the plurality of pollution source basic information includes:
and classifying the basic information of the plurality of pollution sources according to a multi-dimensional comparison algorithm.
As a possible implementation manner, the enterprise name of the pollution source basic information includes field information corresponding to an enterprise name, the unified social credit code of the pollution source basic information includes field information corresponding to the unified social credit code, the legal person of the pollution source basic information includes field information corresponding to the legal person, the legal person contact phone of the pollution source basic information includes field information corresponding to a legal person contact phone, the enterprise address of the pollution source basic information includes field information corresponding to an enterprise address, and the business item of the pollution source basic information includes field information corresponding to a business item; the merging of the basic information of the similar pollution sources to obtain the basic information of the main pollution source comprises the following steps:
selecting the pollution source basic information with the most kinds of field information contained in the same type of pollution source basic information as main pollution source basic information;
and determining missing field information in the basic information of the main pollution source, acquiring the missing field information from the other similar pollution source basic information, and supplementing the missing field information to the basic information of the main pollution source.
As a possible implementation manner, after the basic information of the pollution sources of the same type is combined to obtain the basic information of the main pollution source, the method further includes:
editing field information in the basic information of the main pollution source based on editing operation of a user, wherein the editing comprises at least one of supplementing new field information, modifying original field information and deleting original field information; and/or the presence of a gas in the gas,
and if the integrity degree of the field information of the rest similar pollution source basic information is higher, replacing the corresponding field information in the main pollution source basic information with the field information with the higher integrity degree.
As a possible implementation manner, if the integrity of the field information of the remaining similar pollution source basic information is higher, replacing the corresponding field information in the primary pollution source basic information with the field information with the higher integrity includes:
displaying a merging interface of the same type of the basic information of the plurality of pollution sources, wherein the merging interface displays the identification and the corresponding field information of the basic information of the main pollution source, and the identifications and the corresponding field information of the basic information of the other pollution sources;
receiving an instruction for selecting the identification, and replacing the field information corresponding to the identification in the primary pollution source basic information with the selected field information corresponding to the identification of the rest pollution source basic information in response to the instruction.
As a possible implementation manner, the extracting multiple of the enterprise name, the unified social credit code, the legal person contact phone, the enterprise address and the business item from the business item data to obtain basic information of the business item includes:
extracting enterprise names, unified social credit codes, legal persons, legal person contact phones, enterprise addresses and various field information corresponding to the business items from the business item data;
mapping various field information extracted from the service project data to a preset service project basic information identifier to obtain service project basic information; the basic information identifier of the service item at least comprises the service item, and further comprises: at least one of a business name, a unified social credit code, a legal person contact phone, and a business address.
As a possible implementation manner, after the calculating a relevance degree between the primary pollution source basic information and the business item basic information, and associating the primary pollution source basic information with a relevance degree exceeding a preset value with the business item basic information, the method further includes:
displaying basic information of the unassociated business items;
and receiving an instruction for adding the basic information of the pollution source, and responding to the instruction, and adding the basic information of the pollution source corresponding to the basic information of the unassociated service project.
As a possible implementation manner, after associating the primary pollution source basic information with the business item basic information, the associating degree of which exceeds a preset value, the method further includes:
and storing the basic information of the main pollution source associated with the basic information of the business project, and supervising the associated basic information of the main pollution source.
According to a second aspect, there is provided in one embodiment a fixed source data governance system comprising:
the system comprises a first acquisition module, a second acquisition module and a third acquisition module, wherein the first acquisition module is used for acquiring a plurality of pollution source data, and the pollution source data at least comprises enterprise names or unified social credit codes;
the first information extraction module is used for extracting a plurality of enterprise names, unified social credit codes, legal persons, legal person contact telephones, enterprise addresses and business items from the plurality of pollution source data respectively to obtain a plurality of pollution source basic information; the basic information of the pollution source at least comprises an enterprise name or a unified social credit code, and further comprises the following steps: at least one of a corporate person, a corporate contact phone, an enterprise address, and one or more business items;
the merging module is used for classifying the basic information of the plurality of pollution sources and merging the basic information of the same type of pollution sources to obtain the basic information of the main pollution source;
the second acquisition module is used for acquiring service project data, wherein the service project data at least comprises service projects, and the service projects are used for representing project types for monitoring pollution sources;
a second information extraction module, configured to extract multiple types of an enterprise name, a unified social credit code, a legal person contact phone, an enterprise address, and a business item from the business item data to obtain basic information of the business item, where the basic information of the business item at least includes the business item, and the second information extraction module further includes: at least one of a business name, a uniform social credit code, a legal person contact phone, and a business address;
and the association module is used for calculating the association degree of the basic information of the pollution source and the basic information of the service project, and associating the basic information of the main pollution source with the basic information of the service project, wherein the association degree of the basic information of the main pollution source exceeds a preset value.
According to the fixed source data treatment method and system provided by the embodiment of the application, firstly, pollution source data are obtained, basic information of the pollution source data is extracted, then the pollution source basic information is classified, and the similar pollution source basic information is combined to obtain main pollution source basic information; meanwhile, the service project data are acquired, the basic information of the service project data is extracted to obtain the basic information of the service project, and finally the basic information of the main pollution source and the basic information of the service project are associated, so that when the pollution source is inquired, the data source can be directly found, and the pollution source can be controlled in a targeted manner aiming at the associated service project, so that the control difficulty is reduced, the control cost is reduced, and the control efficiency is improved.
Drawings
Fig. 1 is a flowchart of a fixed source data governance method provided in this embodiment;
FIG. 2 is a flowchart for obtaining basic information of a pollution source according to the present embodiment;
FIG. 3 is an interface effect diagram of the basic information of the pollution source provided by the present embodiment;
FIG. 4 is an interface effect diagram of details of basic information of a pollution source provided in this embodiment;
FIG. 5 is a flowchart for obtaining basic information of a main pollution source according to the present embodiment;
FIG. 6 is a diagram of the effect of the primary pollution source basic information interface provided by the present embodiment;
fig. 7 is a first flowchart of completing basic information of a main pollution source according to the present embodiment;
fig. 8 is a second flowchart of completing the basic information of the main pollution source according to the present embodiment;
FIG. 9 is an interface effect diagram of completing basic information of a main pollution source according to this embodiment;
fig. 10 is a flowchart for obtaining basic information of a service project according to this embodiment;
fig. 11 is an interface effect diagram of basic information of a service project provided by this embodiment;
FIG. 12 is a flowchart of adding basic information of pollution sources corresponding to basic information of unassociated business items according to the present embodiment;
fig. 13 is an interface effect diagram of configuration of information of a main pollution source according to the embodiment;
FIG. 14 is an interface effect diagram of another configuration of basic information of a main pollution source provided in this embodiment;
fig. 15 is an effect diagram of an operation log interface of the pollution source basic information list provided in the present embodiment;
FIG. 16 is an interface effect diagram of an operation log of the basic information of the pollution source provided by the embodiment;
fig. 17 is a block diagram of the fixed-source data governance system provided in this embodiment.
Reference numerals: 100. a first acquisition module; 200. a first information extraction module; 300. a merging module; 400. a second acquisition module; 500. a second information extraction module; 600. and (5) an association module.
Detailed Description
The present invention will be described in further detail with reference to the following detailed description and accompanying drawings. Wherein like elements in different embodiments are numbered with like associated elements. In the following description, numerous details are set forth in order to provide a better understanding of the present application. However, those skilled in the art will readily recognize that some of the features may be omitted or replaced with other elements, materials, methods in different instances. In some instances, certain operations related to the present application have not been shown or described in detail in order to avoid obscuring the core of the present application from excessive description, and it is not necessary for those skilled in the art to describe these operations in detail, so that they may be fully understood from the description in the specification and the general knowledge in the art.
Furthermore, the features, operations, or characteristics described in the specification may be combined in any suitable manner to form various embodiments. Also, the various steps or actions in the method descriptions may be transposed or transposed in order, as will be apparent to one of ordinary skill in the art. Thus, the various sequences in the specification and drawings are for the purpose of describing certain embodiments only and are not intended to imply a required sequence unless otherwise indicated where such sequence must be followed.
The numbering of the components as such, e.g., "first", "second", etc., is used herein only to distinguish the objects as described, and does not have any sequential or technical meaning. The term "connected" and "coupled" when used in this application, unless otherwise indicated, includes both direct and indirect connections (couplings).
At present, the treatment scheme for the fixed source mainly comprises the following steps: data transformation mapping, assuming master data management functions through an application system, or introducing a separate master data management platform. The implementation cost of the three schemes is higher, and the personalized and targeted treatment function cannot be provided according to the characteristics of the fixed source. Therefore, according to the characteristics of the fixed source, the fixed source data treatment method and the fixed source data treatment system are designed, so that the fixed source is associated with the business data, and the fixed source can be treated in a targeted manner.
The method for realizing fixed source governance by associating fixed source and service data is described in detail with the accompanying drawings.
Referring to fig. 1, an embodiment provides a fixed-source data governance method, including;
step 1: a plurality of pollution source data is obtained, the pollution source data including at least a business name or a uniform social credit code.
The pollution source data treatment system, that is, the fixed source data treatment system, can obtain a plurality of pollution source data from the second national pollution source census data, and the specifically obtained pollution source data include an enterprise name, a unified social credit code, a legal person contact telephone, an enterprise address, a business item, a pollution source address, a pollution source name, a pollution source number, a pollution discharge license number, a business license registration number, and the like, wherein the obtained pollution source data at least include the enterprise name or the unified social credit code, so as to classify the pollution source in the following process.
Step 2: respectively extracting multiple types of enterprise names, unified social credit codes, legal persons, legal person contact phones, enterprise addresses and business items from the multiple pollution source data to obtain multiple pollution source basic information; the basic information of the pollution source at least comprises a business name or a unified social credit code, and further comprises the following steps: at least one of a corporate person, a corporate contact phone, a business address, and one or more business items.
In the acquired multiple pollution source data, the pollution source data treatment system respectively extracts multiple information in information such as enterprise names, unified social credit codes, legal persons, legal person contact phones, enterprise addresses and business projects in each pollution source data to obtain multiple pollution source basic information. The basic information of the pollution sources obtained by extracting the data of each pollution source mainly comprises various information such as enterprise names, unified social credit codes, legal persons, contact telephones of the legal persons, enterprise addresses and business items.
And step 3: and classifying the basic information of the plurality of pollution sources, and combining the basic information of the same type of pollution sources to obtain the basic information of the main pollution source.
The pollution source data treatment system adopts a multi-dimensional comparison algorithm to compare and classify a plurality of pollution source basic information, the compared and classified results are displayed in the form of information cards as shown in fig. 3, each information card displays the pollution source basic information corresponding to the pollution source data, cleaning personnel can visually see the pollution source basic information corresponding to each pollution source data and judge which pollution source basic information belongs to the same type, and then the pollution source basic information of the same type is combined to one pollution source basic information through the combination function of the system to obtain the main pollution source basic information.
In addition, when the basic information of the pollution source cannot be classified through the extracted information, the cleaning personnel opens a detail button on the information card, and the pollution source data management system displays a detail page after receiving a command for triggering the detail button, wherein as shown in fig. 4, the detail page is used for displaying information such as a pollution source address, a pollution source name, a pollution source number, a pollution discharge license number, a business license registration number and the like. In addition, when repeated pollution source basic information appears, names and unified social credit codes of pollution sources can be inquired through the existing websites inquired by enterprises or other enterprises, and whether the pollution sources are overlapped or not is judged or which pollution source basic information is the latest pollution source basic information corresponding to the pollution source is judged.
And 4, step 4: and acquiring service project data, wherein the service project data at least comprise service projects, and the service projects are used for representing project types for monitoring pollution sources.
Meanwhile, the pollution source data treatment system can acquire service project data from a service system of a government or an enterprise. The pollution source data is generally obtained from the second national pollution source general survey data, and therefore, the system for obtaining the pollution source data and the service project data belongs to two completely different and unrelated systems, so that the two data need to be searched and matched one by one in the process of treating the pollution source at present, time and labor are wasted, and the accuracy is low. In the embodiment, the pollution source data and the service project data are respectively acquired and then associated, so that the pollution source can be better managed.
In this embodiment, the pollution source data management system obtains service item data from the service system, where the service item data at least includes service items, and the service items are used to represent item types for monitoring various pollution sources, for example, construction item information, environment letter and visit information, fixed waste, administrative penalty information, pollution discharge license information, pollution source online data, VOC pollution sources, and the like all belong to service items.
And 5: extracting a plurality of enterprise names, unified social credit codes, legal persons, legal person contact phones, enterprise addresses and business items from the business item data to obtain basic information of the business items, wherein the basic information of the business items at least comprises the business items, and the method further comprises the following steps: at least one of a business name, a unified social credit code, a legal person contact phone, and a business address.
In this embodiment, the extraction method of the basic information of the business project is similar to that of the basic information of the pollution source, and specifically, the pollution source data management system extracts various information in the information such as the enterprise name, the unified social credit code, the legal person, the contact telephone of the legal person, the enterprise address, the business project and the like in each business project data to obtain the basic information of the business project. The basic information of the business project obtained by extracting the data of each business project mainly comprises various information such as the business project, the enterprise name, the unified social credit code, the legal person, the contact telephone of the legal person, the enterprise address and the like.
Step 6: and calculating the association degree of the basic information of the main pollution source and the basic information of the service project, and associating the basic information of the main pollution source with the basic information of the service project, wherein the association degree of the basic information of the main pollution source exceeds a preset value.
The system calculates the association degree of the combined basic information of the main pollution source and the basic information of the service project through a multi-dimensional comparison algorithm, and then displays the association degree result, for example, the association degree result is displayed on a basic information card of the main pollution source of a display interface. Specifically, the cleaning personnel can select the service item data and then check the association degree on the primary pollution source basic information card, that is, the comprehensive similarity displayed on the information card in fig. 6, when the comprehensive similarity exceeds a preset value, the cleaning personnel can input an association instruction to associate the primary pollution source basic information with the service item basic information, and also can directly judge through the system that when the comprehensive similarity exceeds the preset value, the primary pollution source basic information is directly associated with the service item basic information.
As shown in fig. 6, the basic information of the main pollution source associated with the basic information of the business item is displayed on the business data association interface, and the business data association interface is further provided with an input box, the input box is used for a cleaning person to search the key information, and the searched basic information of the main pollution source associated with the basic information of the business item is displayed on the display interface in the form of card information. Through the operation on the interface, cleaning personnel can conveniently inquire basic information of the pollution source possibly associated with target business project data, and can carry out targeted treatment on the pollution source aiming at the associated business project, so that the treatment difficulty is reduced, the treatment cost is reduced, and the treatment efficiency is improved.
As one way that can be implemented, referring to fig. 2, the first information extraction module extracts multiple kinds of enterprise names, unified social credit codes, legal persons, legal person contact phones, enterprise addresses, and business items from multiple pollution source data, respectively, to obtain multiple pollution source basic information, including:
step 21: and respectively extracting various field information corresponding to the enterprise name, the unified social credit code, the legal person contact telephone, the enterprise address and the business item from the plurality of pollution source data.
Step 22: mapping various field information extracted from pollution source data to a preset pollution source basic information identifier to obtain a plurality of pollution source basic information; wherein, the pollution source basic information mark at least comprises an enterprise name mark or a unified social credit code mark, and the method further comprises the following steps: at least one of a corporate identity, a corporate contact telephone identity, a business address identity, and one or more business item identities. In practical applications, in the pollution source basic information identifier of this embodiment, an enterprise name identifier, a unified social credit code identifier, a corporate contact telephone identifier, an enterprise address identifier, a plurality of business item identifiers, and the like are all displayed in a pollution source configuration page.
In this embodiment, as shown in fig. 13 and 14, specifically, when entering a pollution source configuration page, the first information extraction module extracts corresponding field information required in information such as an enterprise name, a unified social credit code, a legal person contact phone, an enterprise address, a business project, and the like from a plurality of pollution source data, and maps the corresponding field information of each pollution source data to a preset pollution source basic information identifier in a system in a mode of an intermediate table or a source table, so that each identifier corresponds to the corresponding field information, and finally, a plurality of pollution source basic information are obtained respectively. Correspondingly, the pollution source basic information identifier preset in the system at least needs to comprise an enterprise name identifier or a unified social credit code identifier, and further comprises a legal person identifier, a legal person contact telephone identifier, an enterprise address identifier, one or more business project identifiers and the like in order to ensure the integrity of the information; and may also include at least one of a corporate identity, a corporate contact telephone identity, a business address identity, and one or more business item identities, etc.
The intermediate table or the source table is selected according to a service project data structure of the service system, the mode of the intermediate table writes the association relation into the corresponding intermediate table, and the association relation is mapped into a pollution source basic information identifier preset in the system through the intermediate table; the source table mode is to directly write the association relationship into the source table, that is, to directly map the field information into the basic information identifier of the pollution source preset in the system. It should be noted that, in the intermediate table or the source table, an item of the pollution source basic information needs to be configured, and the name of the item is fixed as "pollution source basic information" to ensure that a field corresponding to pollution source data information in the intermediate table or the source table can be accurately mapped to a preset pollution source basic information identifier in the system when matching. Referring to fig. 14, for the intermediate table, data label SQL is used to edit pollution source data or service item data, specifically, a SQL statement is written to query the service item data of the current pollution source, a plurality of service item data are connected by UNION ALL, and # { WRYBH } is used as a query condition placeholder of the pollution source.
It should be noted that in the embodiment, in the basic information of the pollution source and the basic information of the business project, the enterprise name, the unified social credit code, the legal person, the contact telephone of the legal person, the enterprise address, the business project, and the like are the same, and when the basic information of the pollution source and the basic information of the business project are associated, the field information corresponding to the identifier is specifically matched or the association degree is calculated.
As one way to realize, classifying the basic information of a plurality of pollution sources includes: and classifying the basic information of the plurality of pollution sources according to a multi-dimensional comparison algorithm. The multi-dimensional comparison algorithm adopted in the embodiment is the prior art, and the embodiment does not make excessive requirements on the algorithm, and only needs to classify the basic information of a plurality of pollution sources.
As an implementation manner, please refer to fig. 5, the enterprise name of the pollution source basic information includes field information corresponding to the enterprise name, the unified social credit code of the pollution source basic information includes field information corresponding to the unified social credit code, the legal person of the pollution source basic information includes field information corresponding to the legal person, the legal person contact phone of the pollution source basic information includes field information corresponding to the legal person contact phone, the enterprise address of the pollution source basic information includes field information corresponding to the enterprise address, and the business item of the pollution source basic information includes field information corresponding to the business item; merging the basic information of the pollution sources of the same type through a merging module to obtain the basic information of the main pollution source, wherein the merging module comprises the following steps:
step 31: and selecting the pollution source basic information with the most field information types contained in the same type of multiple pollution source basic information as the main pollution source basic information.
Step 32: and determining missing field information in the basic information of the main pollution source, acquiring the missing field information from the other similar basic information of the pollution source, and supplementing the missing field information to the basic information of the main pollution source.
In this embodiment, more specifically, when the merging module merges basic information of similar pollution sources, each piece of basic information of the similar pollution sources corresponds to corresponding field information of the pollution sources, specifically, with reference to fig. 3, taking one piece of basic information of the pollution sources as an example, an enterprise name (i.e., a company name in the figure) identifier is used to identify an enterprise name, which may be a chinese field, an english field or abbreviation, a picture, or the like, the enterprise name identifier in fig. 3 is an "enterprise name" field, and the "XXX company" followed by the identifier is the field information corresponding to the enterprise name. By analogy, the unified social credit code identification is used for identifying the unified social credit code, the legal person identification is used for identifying the name of the legal person, the legal person contact telephone identification is used for identifying the contact telephone of the legal person, the enterprise address identification is used for identifying the address of the corresponding enterprise of the pollution source, and the business item identification is used for identifying the business item. The system selects the basic information of the pollution source with the most kinds of field information corresponding to the above multiple identifications in the same kind of basic information of the pollution source as the basic information of the main pollution source, for example, in a basic information card of the same kind of pollution source displayed on a merging interface of the pollution source shown in fig. 3, selects one basic information card of the pollution source with the most kinds of field information correspondingly written in an enterprise name, a unified social credit code, a legal person contact telephone, an enterprise address and a business project as the basic information of the main pollution source, then determines the kind of field information missing in the basic information of the main pollution source, and supplements the field information corresponding to the kind of field information in the basic information cards of other pollution sources to the basic information of the main pollution source, so that the basic information of the main pollution source is more complete.
As a possible implementation manner, please refer to fig. 7, after the basic information of the pollution sources of the same type is merged to obtain the basic information of the main pollution source, the method further includes:
and 7: editing the field information in the basic information of the main pollution source based on the editing operation of the user, wherein the editing comprises at least one of supplementing new field information, modifying original field information and deleting original field information. And/or the presence of a gas in the gas,
and 8: and if the integrity degree of the field information of the rest similar pollution source basic information is higher, replacing the corresponding field information in the main pollution source basic information with the field information with the higher integrity degree.
In this embodiment, after the basic information of the similar pollution sources is combined to obtain the basic information of the main pollution source, the situation that the field information is incomplete or incomplete may also exist in each item of field information in the basic information of the main pollution source, and at this time, the corresponding field information in each item of information needs to be complemented. Specifically, completing field information corresponding to each item of information in the basic information of the main pollution source can be achieved through two modes, one mode is that a cleaning person observes the basic information of the main pollution source, when a certain item of field information in other basic information of the pollution source is found to be more complete, a completing button of the basic information of the main pollution source is clicked, the field information in the basic information of the main pollution source is edited by referring to the other basic information of the pollution source, and the specifically edited information comprises new field information supplementing, original field information modifying and original field information deleting. Another method is that a cleaning person observes basic information of a main pollution source, and as shown in fig. 8, when it is found that the integrity of some item of field information in the basic information of other pollution sources is higher, the cleaning person clicks a completion button (as shown by C in fig. 8, the completion button is called) of the basic information of other pollution sources, and after receiving a click operation, the system directly replaces the field information with higher integrity into corresponding field information in the basic information of the main pollution source, so as to achieve automatic information completion. For example, the data completion dialog box shown in fig. 8 includes a main pollution source basic information column and other pollution source basic information columns, and the main pollution source basic information column and the other pollution source basic information columns are correspondingly arranged, corresponding field information is filled below the "enterprise name a" identifier of the main pollution source basic information side, corresponding field information B is also filled below the "enterprise name" identifier of the other pollution source basic information sides, but the field information below the "enterprise name" identifier of the other pollution source basic information sides is more complete, at this time, when the completion button C is clicked, the field information of the other pollution source basic information sides automatically replaces the field information B of the main pollution source basic information side, so as to supplement the field information corresponding to each identifier of the main pollution source basic information more completely.
As a possible implementation manner, referring to fig. 9, if the field information of the remaining similar pollution source basic information is more complete, replacing the corresponding field information in the primary pollution source basic information with the more complete field information includes:
step 81: and displaying a merging interface of the same type of basic information of the plurality of pollution sources, wherein the merging interface displays the identification and the corresponding field information of the basic information of the main pollution source, and the identifications and the corresponding field information of the basic information of the other pollution sources.
Step 82: and receiving an instruction for selecting the identification, and replacing the field information corresponding to the identification of the selected rest pollution source basic information with the field information corresponding to the identification of the selected rest pollution source basic information in response to the instruction.
In this embodiment, when the field information with a higher integrity is used to replace the corresponding field information in the primary pollution source basic information, specifically, the primary pollution source basic information and the other multiple pollution source basic information of the same class are displayed on an interface, where an identifier of the primary pollution source basic information and the corresponding field information, identifiers of the other pollution source basic information and the corresponding field information are displayed on the interface, then a cleaning worker operates a certain identifier of the pollution source basic information with a higher integrity by clicking, and after the system receives a click operation instruction, the complete field information corresponding to the identifier directly replaces the field information corresponding to the identifier of the primary pollution source basic information, thereby completing completion of the primary pollution source basic information and facilitating more accurate subsequent matching of the primary pollution source basic information and the service project basic information.
As a possible implementation manner, referring to fig. 10, the second obtaining module extracts a plurality of business names, unified social credit codes, juridical persons, juridical person contact phones, business addresses, and business items from the business item data to obtain basic information of the business items, including:
step 51: and extracting the enterprise name, the unified social credit code, the legal person contact telephone, the enterprise address and various field information corresponding to the business item from the business item data.
Step 52: mapping various field information extracted from the service project data to a preset service project basic information identifier to obtain service project basic information; the basic information identifier of the service item at least comprises the service item, and further comprises: at least one of a business name, a unified social credit code, a legal person contact phone, and a business address.
In this embodiment, for extracting the basic information of the service item, the second obtaining module of the pollution source service data management system obtains the service item data from the external service system, and extracts field information corresponding to the enterprise name, the unified social credit code, the legal contact phone, the enterprise address, the service item, and the like of the service item data, and maps the field information corresponding to each service item data to the basic information identifier of the service item preset in the system in a mode of the intermediate table or the source table, so that each identifier corresponds to the corresponding field information, and finally obtains the basic information of the service item.
Specifically, referring to fig. 11, a service data configuration page is entered, where a corresponding service item name (including solid waste data, secondary pollution data, construction item information, environmental credit information, fixed waste, administrative penalty information, pollution discharge license information, and the like) acquired by the system, a data source to which the service item belongs, a table name corresponding to the service item, and a list of service item operations are displayed in the page, then the second acquisition module edits field information of the extracted enterprise name, unified social credit code, corporate contact phone, corporate address, service item, and the like of each service item in an edit window in a service item operation list, so as to form service item basic information corresponding to the service item data, and the edited service item basic information can be previewed through a preview window in the service item operation list, so that a cleaning person can check whether the extracted information is accurate.
As a possible implementation manner, please refer to fig. 12, after the calculating of the association degree of the primary pollution source basic information and the business item basic information by the association module, and associating the primary pollution source basic information with the business item basic information, the method further includes:
and step 9: and displaying the basic information of the unassociated business items.
Step 10: and receiving an instruction for adding the basic information of the pollution source, and responding to the instruction to add the basic information of the pollution source corresponding to the basic information of the unassociated business project.
In this embodiment, after the association module associates the primary pollution source basic information with the service item basic information, for the service item basic information that is not associated with the pollution source basic information, that is, the service data is not associated with a proper pollution source, at this time, one pollution source basic information may be added in the page on which the pollution source basic information is displayed, specifically, each item of information corresponding to the pollution source basic information corresponds to each item of information in the unassociated service item basic information, so that all the service item data can be associated with the corresponding pollution source data, and the service item corresponding to each pollution source can be found visually, thereby facilitating the subsequent provision of a targeted abatement measure for each pollution source.
As a possible implementation manner, after associating the primary pollution source basic information with the business item basic information, the associating degree of which exceeds a preset value, the method further includes: and storing the basic information of the main pollution source associated with the basic information of the business project, and supervising the associated basic information of the main pollution source.
In this embodiment, after associating the basic information of the pollution source with the basic information of the service item, in an actual application, the basic information of the main pollution source associated with the basic information of the service item may be stored, when the associated basic information of the main pollution source needs to be queried, a cleaner enters an operation log interface shown in fig. 15 in the system, each item of information of the basic information of each main pollution source, a service item associated with the basic information of each main pollution source, and various operation process data are displayed in the system, a query dialog box is displayed on the operation log interface, the cleaner only needs to input a query object (such as a name of the pollution source, a code of the pollution source, and the like) in the dialog box, so that an operation record (including process data of the operation) for the pollution source can be directly found, different operation types have different process data, any one button of the basic information of the main pollution source is clicked, the operation detail interface shown in fig. 16 is entered, and details of the basic information of the main pollution source in the basic information can be displayed, and details of the basic information of the main pollution source before the operation of the main pollution source can be traced conveniently.
According to a second aspect, referring to FIG. 17, an embodiment provides a fixed-source data governance system, comprising:
a first obtaining module 100, configured to obtain a plurality of pollution source data, where the pollution source data at least includes an enterprise name or a unified social credit code;
the first information extraction module 200 is configured to extract multiple types of enterprise names, unified social credit codes, legal persons, legal person contact phones, enterprise addresses and business items from multiple pollution source data, respectively, to obtain multiple pollution source basic information; the basic information of the pollution source at least comprises a business name or a unified social credit code, and further comprises the following steps: at least one of a corporate person, a corporate contact phone, an enterprise address, and one or more business items;
the merging module 300 is configured to classify the multiple pollution source basic information and merge the same kind of pollution source basic information to obtain main pollution source basic information;
a second obtaining module 400, configured to obtain service item data, where the service item data at least includes a service item, and the service item is used to represent an item type for monitoring a pollution source;
the second information extracting module 500 is configured to extract a plurality of business names, unified social credit codes, legal persons, legal person contact phones, business addresses, and business items from the business item data to obtain basic information of the business items, where the basic information of the business items at least includes the business items, and further includes: at least one of a business name, a uniform social credit code, a legal person contact phone, and a business address;
and the association module 600 is configured to perform association degree calculation on the pollution source basic information and the service item basic information, and associate the main pollution source basic information with an association degree exceeding a preset value with the service item basic information.
The fixed source data management system in this embodiment includes a first obtaining module 100, a first information extracting module 200, a merging module 300, a second obtaining module 400, a second information extracting module 500, and an association module 600, where for the first obtaining module 100, the pollution source data management system obtains multiple pollution source data from second national pollution source census data through the first obtaining module 100, and specifically, the obtained pollution source data includes an enterprise name, a unified social credit code, a legal person contact phone number, an enterprise address, a business project, a pollution source address, a pollution source name, a pollution source number, a pollution discharge license number, a business license registration number, and the like, where the obtained pollution source data at least includes the enterprise name or the unified social credit code, so as to classify the pollution sources in the following process.
And then, the first information extraction module 200 is used for respectively extracting various information in information such as enterprise names, unified social credit codes, legal persons, legal person contact telephones, enterprise addresses, business items and the like in each pollution source data to obtain a plurality of pollution source basic information. The basic information of the pollution sources obtained by extracting the data of each pollution source mainly comprises various information such as enterprise names, unified social credit codes, legal persons, contact telephones of the legal persons, enterprise addresses and business items.
The pollution source data treatment system compares and classifies a plurality of pollution source basic information by adopting a multi-dimensional comparison algorithm through a merging module 300, and displays the compared and classified results in the form of information cards as shown in fig. 3, each information card displays the pollution source basic information corresponding to the pollution source data, a cleaning person can visually see the pollution source basic information corresponding to each pollution source data and judge which pollution source basic information belongs to the same type, and then the pollution source basic information of the same type is merged to one pollution source basic information through the merging function of the system to obtain the main pollution source basic information. In addition, when the basic information of the pollution source cannot be classified through the extracted information, the cleaning personnel opens a detail button on the information card, and the pollution source data management system displays a detail page after receiving a command for triggering the detail button, wherein as shown in fig. 4, the detail page is used for displaying information such as a pollution source address, a pollution source name, a pollution source number, a pollution discharge license number, a business license registration number and the like. In addition, when repeated pollution source basic information appears, names and unified social credit codes of pollution sources can be inquired through the existing websites inquired by enterprises or other enterprises, and whether the pollution sources are overlapped or not is judged or which pollution source basic information is the latest pollution source basic information corresponding to the pollution source is judged.
The pollution source data management system then obtains the business project data from the business system of the government or the enterprise through the second obtaining module 400. The service project data at least comprises service projects, and the service projects are used for representing project types for monitoring various pollution sources, for example, construction project information, environment petition information, fixed waste, administrative penalty information, pollution discharge license information, pollution source online data, VOC pollution sources and the like belong to the service projects.
The pollution source data is generally obtained from the second national pollution source general survey data, so that the system for obtaining the pollution source data and the service project data belongs to two completely different and unrelated systems, so that the two data are required to be searched and matched one by one in the process of treating the pollution source at present, time and labor are wasted, and the accuracy is low. In this embodiment, after the second obtaining module 400 obtains the service item data from the service system of a government or an enterprise, the service item data is associated with the pollution source data, so as to facilitate better management of the pollution source.
Then, the second information extraction module 500 extracts various information in the information such as the enterprise name, the unified social credit code, the legal person contact telephone, the enterprise address, the business item and the like in each business item data to obtain the basic information of the business item. The basic information of the business project obtained by extracting the data of each business project mainly comprises various information such as the business project, the enterprise name, the unified social credit code, the legal person, the contact telephone of the legal person, the enterprise address and the like.
Finally, the association module 600 calculates the association degree of the combined basic information of the main pollution source and the basic information of the business project by using a multidimensional comparison algorithm, and then displays the association degree result, for example, the association degree result is displayed on a basic information card of the main pollution source of a display interface. Specifically, the cleaning personnel can select the service item data and then check the association degree on the primary pollution source basic information card, that is, the comprehensive similarity displayed on the information card in fig. 6, when the comprehensive similarity exceeds a preset value, the cleaning personnel can input an association instruction to associate the primary pollution source basic information with the service item basic information, and also can directly judge through the system that when the comprehensive similarity exceeds the preset value, the primary pollution source basic information is directly associated with the service item basic information.
In addition, for specific limitations on functions of each module in the system, reference may be made to the above limitations on the module corresponding to each step in the fixed source data governance method, and this embodiment is not described herein again.
The present invention has been described in terms of specific examples, which are provided to aid understanding of the invention and are not intended to be limiting. For a person skilled in the art to which the invention pertains, several simple deductions, modifications or substitutions may be made according to the idea of the invention.

Claims (10)

1. A fixed source data governance method is characterized by comprising the following steps:
acquiring a plurality of pollution source data, wherein the pollution source data at least comprises an enterprise name or a unified social credit code;
respectively extracting multiple types of enterprise names, unified social credit codes, legal persons, legal person contact telephones, enterprise addresses and business items from the multiple pollution source data to obtain multiple pollution source basic information; the basic information of the pollution source at least comprises an enterprise name or a unified social credit code, and further comprises the following steps: at least one of a corporate person, a corporate contact phone, an enterprise address, and one or more business items;
classifying the basic information of the plurality of pollution sources, and combining the basic information of the same type of pollution sources to obtain the basic information of the main pollution source;
acquiring service project data, wherein the service project data at least comprises service projects, and the service projects are used for representing project types for monitoring pollution sources;
extracting a plurality of business names, unified social credit codes, legal persons, legal person contact phones, business addresses and business items from the business item data to obtain basic information of the business items, wherein the basic information of the business items at least comprises the business items, and the method further comprises the following steps: at least one of a business name, a uniform social credit code, a corporate person, a corporate contact phone, and a business address;
and calculating the association degree of the basic information of the primary pollution source and the basic information of the service project, and associating the basic information of the primary pollution source with the basic information of the service project, wherein the association degree of the basic information of the primary pollution source exceeds a preset value.
2. The fixed-source data governance method according to claim 1, wherein said extracting a plurality of enterprise names, unified social credit codes, jurisdictions, jurisdictional telephones, enterprise addresses, and business items from said plurality of pollution source data, respectively, to obtain a plurality of pollution source base information comprises:
respectively extracting various field information corresponding to an enterprise name, a unified social credit code, a legal person contact telephone, an enterprise address and a business project from the plurality of pollution source data;
mapping various field information extracted from pollution source data to a preset pollution source basic information identifier to obtain a plurality of pollution source basic information; wherein, the pollution source basic information mark at least comprises an enterprise name mark or a unified social credit code mark, and the method further comprises the following steps: at least one of a corporate identity, a corporate contact telephone identity, a business address identity, and one or more business item identities.
3. The fixed-source data governance method according to claim 1, wherein said classifying said plurality of pollution source base information comprises:
and classifying the basic information of the plurality of pollution sources according to a multi-dimensional comparison algorithm.
4. The fixed-source data governance method according to claim 1, wherein the enterprise name of the pollution source basic information comprises field information corresponding to an enterprise name, the unified social credit code of the pollution source basic information comprises field information corresponding to a unified social credit code, the legal person of the pollution source basic information comprises field information corresponding to a legal person, the legal person contact phone of the pollution source basic information comprises field information corresponding to a legal person contact phone, the enterprise address of the pollution source basic information comprises field information corresponding to an enterprise address, and the business item of the pollution source basic information comprises field information corresponding to a business item; the merging of the basic information of the similar pollution sources to obtain the basic information of the main pollution source comprises the following steps:
selecting the pollution source basic information with the most kinds of field information contained in the same type of pollution source basic information as main pollution source basic information;
and determining missing field information in the basic information of the main pollution source, acquiring the missing field information from the other similar pollution source basic information, and supplementing the missing field information to the basic information of the main pollution source.
5. The fixed source data governance method according to claim 4, wherein after merging the same kind of pollution source basic information to obtain the main pollution source basic information, the method further comprises:
editing field information in the basic information of the main pollution source based on editing operation of a user, wherein the editing comprises at least one of supplementing new field information, modifying original field information and deleting original field information; and/or the presence of a gas in the gas,
and if the integrity degree of the field information of the rest similar pollution source basic information is higher, replacing the corresponding field information in the main pollution source basic information with the field information with the higher integrity degree.
6. The fixed-source data governance method according to claim 5, wherein if the field information of the remaining similar pollution source basic information is more complete, replacing the corresponding field information in the primary pollution source basic information with the field information with the more complete field information comprises:
displaying a merging interface of the same type of the basic information of the plurality of pollution sources, wherein the merging interface displays the identification and the corresponding field information of the basic information of the main pollution source, and the identifications and the corresponding field information of the basic information of the other pollution sources;
receiving an instruction for selecting the identification, and replacing the field information corresponding to the identification in the primary pollution source basic information with the selected field information corresponding to the identification of the rest pollution source basic information in response to the instruction.
7. The fixed-source data governance method of claim 1, wherein said extracting a plurality of business names, uniform social credit codes, juridical persons, juridical person contact phones, business addresses, and business items from said business item data to obtain business item base information comprises:
extracting enterprise names, unified social credit codes, legal persons, legal person contact phones, enterprise addresses and various field information corresponding to the business items from the business item data;
mapping various field information extracted from the service project data to a preset service project basic information identifier to obtain service project basic information; the basic information identifier of the service item at least comprises the service item, and further comprises: at least one of a business name, a unified social credit code, a legal person contact phone, and a business address.
8. The fixed-source data governance method according to claim 1, wherein after calculating the degree of association between the primary pollution source basic information and the business project basic information and associating the primary pollution source basic information with the degree of association exceeding a preset value with the business project basic information, the method further comprises:
displaying basic information of the unassociated business items;
and receiving an instruction for adding the basic information of the pollution source, and responding to the instruction, and adding the basic information of the pollution source corresponding to the basic information of the unassociated service project.
9. The fixed-source data governance method according to claim 1, wherein after associating the primary pollution source basic information whose association degree exceeds a preset value with the business project basic information, further comprising:
and storing the basic information of the main pollution source associated with the basic information of the business project, and supervising the associated basic information of the main pollution source.
10. A fixed-source data governance system, comprising:
the system comprises a first acquisition module, a second acquisition module and a third acquisition module, wherein the first acquisition module is used for acquiring a plurality of pollution source data, and the pollution source data at least comprises enterprise names or unified social credit codes;
the first information extraction module is used for extracting a plurality of enterprise names, unified social credit codes, legal persons, legal person contact telephones, enterprise addresses and business items from the plurality of pollution source data respectively to obtain a plurality of pollution source basic information; the basic information of the pollution source at least comprises an enterprise name or a unified social credit code, and further comprises the following steps: at least one of a corporate person, a corporate contact phone, an enterprise address, and one or more business items;
the merging module is used for classifying the basic information of the plurality of pollution sources and merging the basic information of the same type of pollution sources to obtain the basic information of the main pollution source;
a second obtaining module, configured to obtain service item data, where the service item data at least includes a service item, and the service item is used to represent an item type for monitoring a pollution source;
a second information extraction module, configured to extract multiple types of an enterprise name, a unified social credit code, a legal person contact phone, an enterprise address, and a business item from the business item data to obtain basic information of the business item, where the basic information of the business item at least includes the business item, and the second information extraction module further includes: at least one of a business name, a uniform social credit code, a legal person contact phone, and a business address;
and the association module is used for calculating the association degree of the basic information of the pollution source and the basic information of the service project, and associating the basic information of the main pollution source with the basic information of the service project, wherein the association degree of the basic information of the main pollution source exceeds a preset value.
CN202211364351.1A 2022-11-02 2022-11-02 Fixed source data management method and system Pending CN115640931A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202211364351.1A CN115640931A (en) 2022-11-02 2022-11-02 Fixed source data management method and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202211364351.1A CN115640931A (en) 2022-11-02 2022-11-02 Fixed source data management method and system

Publications (1)

Publication Number Publication Date
CN115640931A true CN115640931A (en) 2023-01-24

Family

ID=84947693

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202211364351.1A Pending CN115640931A (en) 2022-11-02 2022-11-02 Fixed source data management method and system

Country Status (1)

Country Link
CN (1) CN115640931A (en)

Similar Documents

Publication Publication Date Title
CN101231651A (en) Computer apparatus and method, for calculating importance of electronic document on computer network
CN103810212A (en) Automated database index creation method and system
WO2008105611A1 (en) Database auto-building method for link of search data in gis system using cad drawings
CN111814472A (en) Text recognition method, device, equipment and storage medium
CN110704880A (en) Correlation method of engineering drawings
CN112231417A (en) Data classification method and device, electronic equipment and storage medium
CN110362596A (en) A kind of control method and device of text Extracting Information structural data processing
CN111143370B (en) Method, apparatus and computer-readable storage medium for analyzing relationships between a plurality of data tables
CN111191153A (en) Information technology consultation service display device
CN107291951B (en) Data processing method, device, storage medium and processor
CN112416992A (en) Industry type identification method, system and equipment based on big data and keywords
KR20100037325A (en) System and method for construction automatic bibliography based pattern, and recording medium therefor
CN111967437A (en) Text recognition method, device, equipment and storage medium
CN109902148B (en) Automatic enterprise name completion method for address book contacts
CN109388648B (en) Method for extracting personnel information and relation person from electronic record
CN104240107A (en) Community data screening system and method thereof
CN115640931A (en) Fixed source data management method and system
CN110019237B (en) System and method for analyzing criminal whereabouts based on map
CN110705297A (en) Enterprise name-identifying method, system, medium and equipment
CN102663205B (en) Software realization method and system for vehicle assembly shop tooling site management
CN112612817B (en) Data processing method, device, terminal equipment and computer readable storage medium
CN114840519A (en) Data labeling method, equipment and storage medium
CN114495138A (en) Intelligent document identification and feature extraction method, device platform and storage medium
CN114118944A (en) Forensic laboratory grading management method, terminal device and storage medium
CN113836181A (en) Data query method and device combining RPA and AI, electronic equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination