CN117371531A - Carbon policy knowledge graph construction system - Google Patents
Carbon policy knowledge graph construction system Download PDFInfo
- Publication number
- CN117371531A CN117371531A CN202311418018.9A CN202311418018A CN117371531A CN 117371531 A CN117371531 A CN 117371531A CN 202311418018 A CN202311418018 A CN 202311418018A CN 117371531 A CN117371531 A CN 117371531A
- Authority
- CN
- China
- Prior art keywords
- carbon
- policy
- data
- carbon policy
- knowledge
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 229910052799 carbon Inorganic materials 0.000 title claims abstract description 201
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 title claims abstract description 195
- 238000010276 construction Methods 0.000 title claims abstract description 43
- 238000012545 processing Methods 0.000 claims abstract description 30
- 238000009960 carding Methods 0.000 claims abstract description 12
- 238000013480 data collection Methods 0.000 claims abstract description 11
- 230000000007 visual effect Effects 0.000 claims abstract description 9
- 230000011218 segmentation Effects 0.000 claims description 39
- 238000004422 calculation algorithm Methods 0.000 claims description 28
- 238000012216 screening Methods 0.000 claims description 26
- 238000000605 extraction Methods 0.000 claims description 13
- 230000004927 fusion Effects 0.000 claims description 12
- 238000012423 maintenance Methods 0.000 claims description 12
- 238000000034 method Methods 0.000 claims description 9
- 238000013459 approach Methods 0.000 claims description 6
- 238000007635 classification algorithm Methods 0.000 claims description 6
- 238000005065 mining Methods 0.000 claims description 6
- 238000004140 cleaning Methods 0.000 claims description 5
- 238000005516 engineering process Methods 0.000 claims description 5
- 238000011835 investigation Methods 0.000 claims description 5
- 238000010606 normalization Methods 0.000 claims description 5
- 238000002372 labelling Methods 0.000 claims description 3
- 230000037361 pathway Effects 0.000 claims description 3
- 230000001681 protective effect Effects 0.000 claims description 3
- 238000007670 refining Methods 0.000 claims description 3
- 230000009286 beneficial effect Effects 0.000 abstract description 6
- 238000011161 development Methods 0.000 abstract description 5
- 238000011160 research Methods 0.000 abstract description 4
- 238000006243 chemical reaction Methods 0.000 abstract description 2
- 238000011156 evaluation Methods 0.000 abstract description 2
- 238000010586 diagram Methods 0.000 description 4
- 238000012544 monitoring process Methods 0.000 description 4
- 230000018109 developmental process Effects 0.000 description 3
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012800 visualization Methods 0.000 description 2
- 238000013473 artificial intelligence Methods 0.000 description 1
- 239000000470 constituent Substances 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computing arrangements using knowledge-based models
- G06N5/02—Knowledge representation; Symbolic representation
- G06N5/022—Knowledge engineering; Knowledge acquisition
- G06N5/025—Extracting rules from data
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/34—Browsing; Visualisation therefor
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/35—Clustering; Classification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/36—Creation of semantic tools, e.g. ontology or thesauri
- G06F16/367—Ontology
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/279—Recognition of textual entities
- G06F40/289—Phrasal analysis, e.g. finite state techniques or chunking
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q50/00—Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
- G06Q50/10—Services
- G06Q50/26—Government or public services
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- General Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Business, Economics & Management (AREA)
- Computational Linguistics (AREA)
- Databases & Information Systems (AREA)
- Tourism & Hospitality (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- General Health & Medical Sciences (AREA)
- Evolutionary Computation (AREA)
- Educational Administration (AREA)
- Animal Behavior & Ethology (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Life Sciences & Earth Sciences (AREA)
- Software Systems (AREA)
- Development Economics (AREA)
- Computing Systems (AREA)
- Mathematical Physics (AREA)
- Economics (AREA)
- Human Resources & Organizations (AREA)
- Marketing (AREA)
- Primary Health Care (AREA)
- Strategic Management (AREA)
- General Business, Economics & Management (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
The invention discloses a carbon policy knowledge graph construction system, which relates to the technical field of knowledge graph construction and comprises a data collection module; a data processing module; a carbon policy database; a map construction module; carbon policy knowledge graph; a service application module; and maintaining an updating module. The carbon policy knowledge graph construction system carries out carding and conversion on carbon policy data according to the format of entity-relation-entity, is beneficial to stroking out the carbon policy business venation, is beneficial to the directional development of subsequent thematic application research, is constructed and formed to facilitate the visual display and efficient search and query of carbon policy information, provides powerful support for the establishment and evaluation of government, enterprise and company double-carbon policies, and provides the service of carbon policy hotspot supervision, carbon policy index pair mark and thematic analysis, not only helps the low-carbon high-quality development of the company, but also helps the decision analysis of government, enterprise and company double-carbon targets.
Description
Technical Field
The invention relates to the technical field of knowledge graph construction, in particular to a carbon policy knowledge graph construction system.
Background
Knowledge maps are structured semantic knowledge bases that describe concepts and their interrelationships in the physical world in symbolic form. The 'entity-relation-entity' triples are taken as basic constituent units, and the entities form a net knowledge structure through relations. The policy knowledge graph refers to a structured graph formed by integrating relevant knowledge and information in the policy domain, so as to facilitate policy makers and executives to find and utilize the knowledge information. The electric carbon policy can be systematically combed through the carbon policy knowledge graph, so that powerful guidance is made for enterprises to apply the electric carbon policy, and in recent years, along with the continuous development of technologies such as big data, artificial intelligence and the like, the research of the carbon policy knowledge graph is also receiving attention.
For example, patent document 202211286741.1 discloses a method, a device, equipment and a storage medium for constructing an electric carbon policy knowledge graph, wherein the method comprises the steps of obtaining an electric carbon policy text and a pre-constructed policy ontology model, wherein the policy ontology in the policy ontology model comprises policy entities, policy attributes and policy relations; based on the policy ontology model, adopting a trained policy ontology extraction model to extract policy ontology information of each policy ontology in the electric carbon policy text; and importing the policy ontology information into a target graphic database to generate an electric carbon policy knowledge graph.
However, existing atlas-building systems similar to the above-mentioned documents still suffer from the following disadvantages:
the existing map construction system cannot well comb and convert carbon policy data, is not beneficial to stroking and clearing carbon policy business venues, influences the map construction efficiency and logic, and the constructed carbon policy knowledge map lacks service application support, is not convenient for visual display and efficient search and query of carbon policy information, cannot provide carbon policy related services, and cannot provide good assistance for governments, enterprises and companies.
Therefore, there is an urgent need to improve the shortcomings, and the present invention is to study and improve the existing structure and shortcomings, and provide a carbon policy knowledge graph construction system.
Disclosure of Invention
The invention aims to provide a carbon policy knowledge graph construction system for solving the problems in the background technology.
In order to achieve the above purpose, the present invention provides the following technical solutions: a carbon policy knowledge graph construction system, comprising:
the data collection module is used for collecting carbon policy data from various approaches, integrating the collected data and then sending the integrated data to the appointed module;
the data processing module is used for processing the carbon policy data and processing the carbon policy data into a data format meeting the construction requirements through word segmentation, screening and classification operations which are sequentially carried out;
the carbon policy database is used for storing the processed carbon policy data, guaranteeing the integrity of the data through encryption or other protective measures, and guaranteeing the safety of data access and retrieval through data access authority control;
a map construction module for performing the steps of:
extracting unstructured data in the collected carbon policy files from a data source, identifying the carbon policy file names of the extracted text data, linking the extracted entity relationship and attribute relationship through a constructed knowledge graph algorithm, establishing an association relationship, visually displaying through a graph database, fusing the extracted text data, eliminating repeated and conflicting information, and combing and converting carbon policy knowledge;
a carbon policy knowledge graph, wherein the carbon policy knowledge graph is a structured graph formed by refining, extracting, correlating and integrating related knowledge and information in the electric carbon policy field;
the service application module is used for utilizing the constructed carbon policy knowledge graph, opening an information retrieval function, carrying out accurate or fuzzy matching based on the names of the carbon policy files, quickly and accurately checking the corresponding association relations among the files, facilitating the visual display and efficient retrieval and inquiry of the carbon policy information, and supporting the mining analysis of the carbon policy information;
and the maintenance updating module is used for maintaining the framework and the rules of the carbon policy knowledge graph and updating the content of the carbon policy knowledge graph.
Further, the collection pathways of the data collection module include the internet, expert research, internal documentation, and government platforms, and the carbon policy data collected includes, but is not limited to, policy type, policy hierarchy, release area, release year, and carbon policy keywords.
Further, the data processing module comprises a text word segmentation unit, a manual screening unit and a text classification unit, wherein the output end of the text word segmentation unit is connected with the manual screening unit, and the output end of the manual screening unit is connected with the text classification unit.
Further, the text word segmentation unit is used for segmenting continuous Chinese text into words or phrases with semantic units according to rules and algorithms, spaces or other boundary marks are added between the words, the manual screening unit is used for manually screening the text after word segmentation according to screening rules, and the text classification unit is used for classifying the screened text in sequence based on text labeling and text classification algorithms.
Further, the word segmentation algorithm of the text word segmentation unit comprises a jiaba word segmentation algorithm and a TextRank algorithm, wherein the jiaba word segmentation algorithm is used for segmenting continuous Chinese text into meaningful words, the statistics and rules are based, the dictionary-based word segmentation and the statistics-based word segmentation method are combined, the TextRank algorithm is used for constructing a network according to the co-occurrence relation among words, and edges in the constructed network are undirected and authorized edges.
Further, the map construction module comprises a knowledge extraction unit, a knowledge link unit and a knowledge fusion unit, wherein the output end of the knowledge extraction unit is connected with the knowledge link unit, and the output end of the knowledge link unit is connected with the knowledge fusion unit.
Further, the knowledge extraction unit is used for extracting unstructured data in the collected carbon policy file from a data source, carrying out entity identification on the extracted text data, wherein the entity is a carded carbon policy file name, the knowledge linking unit is used for linking the extracted entity relationship and attribute relationship through a constructed knowledge graph algorithm, establishing an association relationship between the entity and between the entity and the attribute, and carrying out the association condition corresponding to the entity and the attribute through a graph database visualization, the knowledge fusion unit is used for fusing the extracted text data, eliminating repeated and conflicting information through data cleaning, normalization processing and unification processing, manually carding carbon policy knowledge triplet information, and carding and converting the carbon policy according to the format of an entity-relationship-entity triplet.
Further, the service application module comprises carbon policy hotspot monitoring, carbon policy index benchmarking and thematic analysis, wherein the carbon policy hotspot monitoring is performed in three dimensions of a time dimension, a region dimension and a hierarchy dimension, the carbon policy index benchmarking comprises three indexes of carbon emission, carbon transaction and carbon emission reduction, and the thematic analysis comprises carbon sink analysis and carbon technology analysis.
Further, one output end of the maintenance updating module is connected with the data collecting module, and the other output end of the maintenance updating module is connected with the carbon policy knowledge graph.
Further, the use flow of the carbon policy knowledge graph construction system is as follows:
the method comprises the steps that firstly, a data collection module collects carbon policy data such as policy types, policy levels, release areas, release years, carbon policy keywords and the like through Internet, expert investigation, internal files and government platform approaches, and the carbon policy data is integrated and then sent to a data processing module;
step two, the data processing module cuts continuous Chinese texts into words or phrases with semantic units according to rules and algorithms through a text word segmentation unit, spaces or other boundary marks are added between the words, then a manual screening unit manually screens the text after word segmentation according to screening rules, and a text classification unit sequentially classifies the screened texts based on the text marks and the text classification algorithm and stores the classified texts into a carbon policy database;
extracting unstructured data in the collected carbon policy files from a carbon policy database by a map construction module through a knowledge extraction unit, identifying the names of the carbon policy files of the extracted text data, linking the extracted entity relations and attribute relations through a knowledge linking unit, establishing association relations among the entities and attribute through a constructed knowledge map algorithm, visually displaying the association conditions corresponding to the entities and the attribute through a map database, fusing the extracted text data through a knowledge fusion unit, eliminating repeated and conflicting information through data cleaning, normalization processing and unification processing, manually carding carbon policy knowledge triplet information, and carding and converting carbon policy knowledge to form a carbon policy knowledge map;
and fourthly, the service application module utilizes the constructed carbon policy knowledge graph to open an information retrieval function, performs accurate or fuzzy matching based on the names of the carbon policy documents, rapidly and accurately checks the corresponding association relation among the documents, facilitates visual display and efficient retrieval and inquiry of the carbon policy information, supports mining and analysis of the carbon policy information, and maintains the framework and rules of the carbon policy knowledge graph through the maintenance and update module to update the content of the carbon policy knowledge graph.
The invention provides a carbon policy knowledge graph construction system, which has the following beneficial effects:
the invention carries out carding and conversion on the carbon policy data according to the format of entity-relation-entity, is beneficial to stroking out the carbon policy business venation, is beneficial to the directional development of the subsequent thematic application research, constructs and forms the carbon policy knowledge graph to facilitate the visual display and efficient retrieval and inquiry of the carbon policy information, provides powerful support for the establishment and evaluation of the double-carbon policies of governments, enterprises and companies, and provides the service of carbon policy hotspot supervision, carbon policy index pair mark and thematic analysis, not only helps the low-carbon high-quality development of the enterprises, but also helps the decision analysis of the double-carbon targets of the governments, the enterprises and the companies.
Drawings
FIG. 1 is a schematic diagram of a construction flow of a carbon policy knowledge graph construction system according to the present invention;
FIG. 2 is a schematic diagram of the overall architecture of a carbon policy knowledge graph construction system according to the present invention;
FIG. 3 is a flowchart illustrating the operation of a data processing module of the carbon policy knowledge graph construction system according to the present invention;
FIG. 4 is a flowchart illustrating the operation of a graph construction module of the carbon policy knowledge graph construction system according to the present invention;
FIG. 5 is a schematic diagram of knowledge fusion of a carbon policy knowledge graph construction system according to the present invention;
fig. 6 is a service application block diagram of a carbon policy knowledge graph construction system according to the present invention.
Detailed Description
Embodiments of the present invention are described in further detail below with reference to the accompanying drawings and examples. The following examples are illustrative of the invention but are not intended to limit the scope of the invention.
As shown in fig. 1 to 6, a carbon policy knowledge graph construction system includes:
the data collection module is used for collecting carbon policy data from various approaches, integrating the collected data and then sending the integrated data to the appointed module; the collection pathways of the data collection module include the internet, expert investigation, internal documentation, and government platforms, and the collected carbon policy data includes, but is not limited to, policy type, policy hierarchy, release area, release year, and carbon policy keywords;
the data processing module is used for processing the carbon policy data and processing the carbon policy data into a data format meeting the construction requirement through word segmentation, screening and classification operations which are sequentially carried out; the data processing module comprises a text word segmentation unit, a manual screening unit and a text classification unit, wherein the output end of the text word segmentation unit is connected with the manual screening unit, and the output end of the manual screening unit is connected with the text classification unit; the text word segmentation unit is used for segmenting continuous Chinese text into words or phrases with semantic units according to rules and algorithms, spaces or other boundary marks are added between the words, the manual screening unit is used for manually screening the text after word segmentation according to screening rules, and the text classification unit is used for classifying the screened text in sequence based on text labeling and text classification algorithms; the word segmentation algorithm of the text word segmentation unit comprises a jiaba word segmentation algorithm and a TextRank algorithm, wherein the jiaba word segmentation algorithm is used for segmenting continuous Chinese text into meaningful words, the word segmentation algorithm is based on statistics and rules and combines a word segmentation method based on dictionary and a word segmentation method based on statistics, the TextRank algorithm is used for constructing a network according to the co-occurrence relation among words, and edges in the constructed network are undirected and authorized edges;
the carbon policy database is used for storing the processed carbon policy data, guaranteeing the integrity of the data through encryption or other protective measures, and guaranteeing the safety of data access and retrieval through data access authority control;
the map construction module is used for executing the following steps:
extracting unstructured data in the collected carbon policy files from a data source, identifying the carbon policy file names of the extracted text data, linking the extracted entity relationship and attribute relationship through a constructed knowledge graph algorithm, establishing an association relationship, visually displaying through a graph database, fusing the extracted text data, eliminating repeated and conflicting information, and combing and converting carbon policy knowledge;
the map construction module comprises a knowledge extraction unit, a knowledge link unit and a knowledge fusion unit, wherein the output end of the knowledge extraction unit is connected with the knowledge link unit, and the output end of the knowledge link unit is connected with the knowledge fusion unit; the knowledge extraction unit is used for extracting unstructured data in the collected carbon policy files from a data source, carrying out entity identification on the extracted text data, wherein an entity is a carded carbon policy file name, the knowledge linking unit is used for linking the extracted entity relationship and attribute relationship through a constructed knowledge graph algorithm, establishing an association relationship between the entity and an association condition corresponding to the attribute, carrying out visualization of a graph database, and carrying out association conditions corresponding to the entity and the attribute, the knowledge fusion unit is used for fusing the extracted text data, eliminating repeated and conflicting information through data cleaning, normalization processing and unification processing, manually carding carbon policy knowledge triplet information, and carding and converting carbon policy knowledge according to an entity-relationship-entity triplet format;
a carbon policy knowledge graph, which is a structured graph formed by refining, extracting, correlating and integrating related knowledge and information in the electric carbon policy field;
the service application module is used for utilizing the constructed carbon policy knowledge graph, opening an information retrieval function, carrying out accurate or fuzzy matching based on the names of the carbon policy files, quickly and accurately checking the corresponding association relations among the files, facilitating the visual display and efficient retrieval and inquiry of the carbon policy information, and supporting the mining analysis of the carbon policy information; the service application module comprises carbon policy hotspot monitoring, carbon policy index benchmarking and thematic analysis, wherein the carbon policy hotspot monitoring is performed in three dimensions of time dimension, region dimension and hierarchy dimension, the carbon policy index benchmarking comprises three indexes of carbon emission, carbon transaction and carbon emission reduction, and the thematic analysis comprises carbon sink analysis and carbon technology analysis;
the maintenance updating module is used for maintaining the framework and rules of the carbon policy knowledge graph and updating the content of the carbon policy knowledge graph; one output end of the maintenance updating module is connected with the data collecting module, and the other output end of the maintenance updating module is connected with the carbon policy knowledge graph.
In summary, with reference to fig. 1 to 6, the carbon policy knowledge graph construction system has the following usage flow when in use:
the method comprises the steps that firstly, a data collection module collects carbon policy data such as policy types, policy levels, release areas, release years, carbon policy keywords and the like through Internet, expert investigation, internal files and government platform approaches, and the carbon policy data is integrated and then sent to a data processing module;
step two, the data processing module cuts continuous Chinese texts into words or phrases with semantic units according to rules and algorithms through a text word segmentation unit, spaces or other boundary marks are added between the words, then a manual screening unit manually screens the text after word segmentation according to screening rules, and a text classification unit sequentially classifies the screened texts based on the text marks and the text classification algorithm and stores the classified texts into a carbon policy database;
extracting unstructured data in the collected carbon policy files from a carbon policy database by a map construction module through a knowledge extraction unit, identifying the names of the carbon policy files of the extracted text data, linking the extracted entity relations and attribute relations through a knowledge linking unit, establishing association relations among the entities and attribute through a constructed knowledge map algorithm, visually displaying the association conditions corresponding to the entities and the attribute through a map database, fusing the extracted text data through a knowledge fusion unit, eliminating repeated and conflicting information through data cleaning, normalization processing and unification processing, manually carding carbon policy knowledge triplet information, and carding and converting carbon policy knowledge to form a carbon policy knowledge map;
and fourthly, the service application module utilizes the constructed carbon policy knowledge graph to open an information retrieval function, performs accurate or fuzzy matching based on the names of the carbon policy documents, rapidly and accurately checks the corresponding association relation among the documents, facilitates visual display and efficient retrieval and inquiry of the carbon policy information, supports mining and analysis of the carbon policy information, and maintains the framework and rules of the carbon policy knowledge graph through the maintenance and update module to update the content of the carbon policy knowledge graph.
The embodiments of the invention have been presented for purposes of illustration and description, and are not intended to be exhaustive or limited to the invention in the form disclosed. Many modifications and variations will be apparent to those of ordinary skill in the art. The embodiments were chosen and described in order to best explain the principles of the invention and the practical application, and to enable others of ordinary skill in the art to understand the invention for various embodiments with various modifications as are suited to the particular use contemplated.
Claims (10)
1. A carbon policy knowledge graph construction system, comprising:
the data collection module is used for collecting carbon policy data from various approaches, integrating the collected data and then sending the integrated data to the appointed module;
the data processing module is used for processing the carbon policy data and processing the carbon policy data into a data format meeting the construction requirements through word segmentation, screening and classification operations which are sequentially carried out;
the carbon policy database is used for storing the processed carbon policy data, guaranteeing the integrity of the data through encryption or other protective measures, and guaranteeing the safety of data access and retrieval through data access authority control;
a map construction module for performing the steps of:
extracting unstructured data in the collected carbon policy files from a data source, identifying the carbon policy file names of the extracted text data, linking the extracted entity relationship and attribute relationship through a constructed knowledge graph algorithm, establishing an association relationship, visually displaying through a graph database, fusing the extracted text data, eliminating repeated and conflicting information, and combing and converting carbon policy knowledge;
a carbon policy knowledge graph, wherein the carbon policy knowledge graph is a structured graph formed by refining, extracting, correlating and integrating related knowledge and information in the electric carbon policy field;
the service application module is used for utilizing the constructed carbon policy knowledge graph, opening an information retrieval function, carrying out accurate or fuzzy matching based on the names of the carbon policy files, quickly and accurately checking the corresponding association relations among the files, facilitating the visual display and efficient retrieval and inquiry of the carbon policy information, and supporting the mining analysis of the carbon policy information;
and the maintenance updating module is used for maintaining the framework and the rules of the carbon policy knowledge graph and updating the content of the carbon policy knowledge graph.
2. The carbon policy knowledge graph construction system of claim 1, wherein collection pathways of said data collection module include internet, expert investigation, internal documentation, and government platforms, and the collected carbon policy data includes, but is not limited to, policy type, policy hierarchy, release area, release year, and carbon policy keywords.
3. The carbon policy knowledge graph construction system of claim 1, wherein the data processing module comprises a text word segmentation unit, a manual screening unit and a text classification unit, wherein the output end of the text word segmentation unit is connected with the manual screening unit, and the output end of the manual screening unit is connected with the text classification unit.
4. The system according to claim 1, wherein the text word segmentation unit is configured to segment continuous chinese text into words or phrases with semantic units according to rules and algorithms, and to add spaces or other boundary marks between words, the manual screening unit is configured to manually screen the text after word segmentation according to screening rules, and the text classification unit is configured to sequentially classify the screened text based on text labeling and text classification algorithms.
5. The system according to claim 4, wherein the word segmentation algorithm of the text word segmentation unit includes a jiaba word segmentation algorithm for segmenting continuous chinese text into meaningful words, based on statistics and rules, and a TextRank algorithm for constructing a network according to co-occurrence relations between words, and edges in the constructed network are undirected weighted edges, in combination with dictionary-based word segmentation and statistical-based word segmentation methods.
6. The carbon policy knowledge graph construction system according to claim 1, wherein the graph construction module comprises a knowledge extraction unit, a knowledge linking unit and a knowledge fusion unit, wherein an output end of the knowledge extraction unit is connected with the knowledge linking unit, and an output end of the knowledge linking unit is connected with the knowledge fusion unit.
7. The system according to claim 6, wherein the knowledge extraction unit is configured to extract unstructured data in the collected carbon policy file from a data source, identify the extracted text data as an entity, and comb the carbon policy knowledge triplet information, and comb and convert the carbon policy knowledge according to a format of a "entity-relation-entity" triplet.
8. The system of claim 1, wherein the service application module includes a carbon policy hotspot monitor, a carbon policy index pair label and a topic analysis, the carbon policy hotspot monitor is divided into three dimensions of a time dimension, a region dimension and a hierarchy dimension, the carbon policy index pair label includes three indexes of carbon emission, carbon transaction and carbon emission reduction, and the topic analysis includes a carbon sink analysis and a carbon technology analysis.
9. The system of claim 1, wherein one output of the maintenance update module is connected to the data collection module and the other output of the maintenance update module is connected to the carbon policy knowledge graph.
10. The carbon policy knowledge graph construction system of any one of claims 1-9, wherein the carbon policy knowledge graph construction system comprises the following steps:
the method comprises the steps that firstly, a data collection module collects carbon policy data such as policy types, policy levels, release areas, release years, carbon policy keywords and the like through Internet, expert investigation, internal files and government platform approaches, and the carbon policy data is integrated and then sent to a data processing module;
step two, the data processing module cuts continuous Chinese texts into words or phrases with semantic units according to rules and algorithms through a text word segmentation unit, spaces or other boundary marks are added between the words, then a manual screening unit manually screens the text after word segmentation according to screening rules, and a text classification unit sequentially classifies the screened texts based on the text marks and the text classification algorithm and stores the classified texts into a carbon policy database;
extracting unstructured data in the collected carbon policy files from a carbon policy database by a map construction module through a knowledge extraction unit, identifying the names of the carbon policy files of the extracted text data, linking the extracted entity relations and attribute relations through a knowledge linking unit, establishing association relations among the entities and attribute through a constructed knowledge map algorithm, visually displaying the association conditions corresponding to the entities and the attribute through a map database, fusing the extracted text data through a knowledge fusion unit, eliminating repeated and conflicting information through data cleaning, normalization processing and unification processing, manually carding carbon policy knowledge triplet information, and carding and converting carbon policy knowledge to form a carbon policy knowledge map;
and fourthly, the service application module utilizes the constructed carbon policy knowledge graph to open an information retrieval function, performs accurate or fuzzy matching based on the names of the carbon policy documents, rapidly and accurately checks the corresponding association relation among the documents, facilitates visual display and efficient retrieval and inquiry of the carbon policy information, supports mining and analysis of the carbon policy information, and maintains the framework and rules of the carbon policy knowledge graph through the maintenance and update module to update the content of the carbon policy knowledge graph.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202311418018.9A CN117371531A (en) | 2023-10-30 | 2023-10-30 | Carbon policy knowledge graph construction system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202311418018.9A CN117371531A (en) | 2023-10-30 | 2023-10-30 | Carbon policy knowledge graph construction system |
Publications (1)
Publication Number | Publication Date |
---|---|
CN117371531A true CN117371531A (en) | 2024-01-09 |
Family
ID=89394368
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202311418018.9A Pending CN117371531A (en) | 2023-10-30 | 2023-10-30 | Carbon policy knowledge graph construction system |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN117371531A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN117708350A (en) * | 2024-02-06 | 2024-03-15 | 成都草根有智创新科技有限公司 | Enterprise policy information association method and device and electronic equipment |
-
2023
- 2023-10-30 CN CN202311418018.9A patent/CN117371531A/en active Pending
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN117708350A (en) * | 2024-02-06 | 2024-03-15 | 成都草根有智创新科技有限公司 | Enterprise policy information association method and device and electronic equipment |
CN117708350B (en) * | 2024-02-06 | 2024-05-14 | 成都草根有智创新科技有限公司 | Enterprise policy information association method and device and electronic equipment |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN111708773B (en) | Multi-source scientific and creative resource data fusion method | |
CN108364124B (en) | International capacity cooperative risk assessment and decision service system based on big data | |
CN112380318A (en) | Enterprise policy matching method based on label similarity | |
CN107147639A (en) | A kind of actual time safety method for early warning based on Complex event processing | |
CN110533212A (en) | Urban waterlogging public sentiment monitoring and pre-alarming method based on big data | |
CN112288247B (en) | Soil heavy metal risk identification method based on space interaction relationship | |
CN111538741B (en) | Deep learning analysis method and system for big data of alarm condition | |
CN109657058A (en) | A kind of abstracting method of notice information | |
CN117371531A (en) | Carbon policy knowledge graph construction system | |
KR102396771B1 (en) | A method for extracting disaster cause automatically | |
CN116384889A (en) | Intelligent analysis method for information big data based on natural language processing technology | |
CN110188092B (en) | System and method for mining new type contradiction dispute in people mediation | |
CN114860882A (en) | Fair competition review auxiliary method based on text classification model | |
CN113239208A (en) | Mark training model based on knowledge graph | |
CN110472075A (en) | A kind of isomeric data classification storage method and system based on machine learning | |
CN114693906A (en) | Travel reimbursement abnormal behavior detection method and system based on space-time rule | |
CN110532492A (en) | A kind of forum data management classification system and method | |
CN115545437A (en) | Financial enterprise operation risk early warning method based on multi-source heterogeneous data fusion | |
CN117473512A (en) | Vulnerability risk assessment method based on network mapping | |
Memon et al. | Harvesting covert networks: a case study of the iMiner database | |
CN117436729A (en) | Government system based data management and data analysis method | |
Bondoc et al. | An intelligent road traffic information system using text analysis in the most congested roads in Metro Manila | |
González-Conejero et al. | Organized crime structure modelling for european law enforcement agencies interoperability through ontologies | |
CN115757832A (en) | Case detecting and handling model system based on knowledge graph technology | |
WO2022092497A1 (en) | System for providing similar case information, and method therefor |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |