CN117371531A - Carbon policy knowledge graph construction system - Google Patents

Carbon policy knowledge graph construction system Download PDF

Info

Publication number
CN117371531A
CN117371531A CN202311418018.9A CN202311418018A CN117371531A CN 117371531 A CN117371531 A CN 117371531A CN 202311418018 A CN202311418018 A CN 202311418018A CN 117371531 A CN117371531 A CN 117371531A
Authority
CN
China
Prior art keywords
carbon
policy
data
carbon policy
knowledge
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202311418018.9A
Other languages
Chinese (zh)
Inventor
仝翠芝
张惠
刘洪斌
刘彦志
王之昕
王冲
高岩
武文鹏
田伟
李肖
李顺杰
梁雨婷
陈泽坤
王静芝
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
State Grid Jibei Power Co ltd Smart Distribution Network Center
State Grid Corp of China SGCC
Original Assignee
State Grid Jibei Power Co ltd Smart Distribution Network Center
State Grid Corp of China SGCC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by State Grid Jibei Power Co ltd Smart Distribution Network Center, State Grid Corp of China SGCC filed Critical State Grid Jibei Power Co ltd Smart Distribution Network Center
Priority to CN202311418018.9A priority Critical patent/CN117371531A/en
Publication of CN117371531A publication Critical patent/CN117371531A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computing arrangements using knowledge-based models
    • G06N5/02Knowledge representation; Symbolic representation
    • G06N5/022Knowledge engineering; Knowledge acquisition
    • G06N5/025Extracting rules from data
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/34Browsing; Visualisation therefor
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/36Creation of semantic tools, e.g. ontology or thesauri
    • G06F16/367Ontology
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/289Phrasal analysis, e.g. finite state techniques or chunking
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/10Services
    • G06Q50/26Government or public services

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Business, Economics & Management (AREA)
  • Computational Linguistics (AREA)
  • Databases & Information Systems (AREA)
  • Tourism & Hospitality (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • General Health & Medical Sciences (AREA)
  • Evolutionary Computation (AREA)
  • Educational Administration (AREA)
  • Animal Behavior & Ethology (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Software Systems (AREA)
  • Development Economics (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Economics (AREA)
  • Human Resources & Organizations (AREA)
  • Marketing (AREA)
  • Primary Health Care (AREA)
  • Strategic Management (AREA)
  • General Business, Economics & Management (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses a carbon policy knowledge graph construction system, which relates to the technical field of knowledge graph construction and comprises a data collection module; a data processing module; a carbon policy database; a map construction module; carbon policy knowledge graph; a service application module; and maintaining an updating module. The carbon policy knowledge graph construction system carries out carding and conversion on carbon policy data according to the format of entity-relation-entity, is beneficial to stroking out the carbon policy business venation, is beneficial to the directional development of subsequent thematic application research, is constructed and formed to facilitate the visual display and efficient search and query of carbon policy information, provides powerful support for the establishment and evaluation of government, enterprise and company double-carbon policies, and provides the service of carbon policy hotspot supervision, carbon policy index pair mark and thematic analysis, not only helps the low-carbon high-quality development of the company, but also helps the decision analysis of government, enterprise and company double-carbon targets.

Description

Carbon policy knowledge graph construction system
Technical Field
The invention relates to the technical field of knowledge graph construction, in particular to a carbon policy knowledge graph construction system.
Background
Knowledge maps are structured semantic knowledge bases that describe concepts and their interrelationships in the physical world in symbolic form. The 'entity-relation-entity' triples are taken as basic constituent units, and the entities form a net knowledge structure through relations. The policy knowledge graph refers to a structured graph formed by integrating relevant knowledge and information in the policy domain, so as to facilitate policy makers and executives to find and utilize the knowledge information. The electric carbon policy can be systematically combed through the carbon policy knowledge graph, so that powerful guidance is made for enterprises to apply the electric carbon policy, and in recent years, along with the continuous development of technologies such as big data, artificial intelligence and the like, the research of the carbon policy knowledge graph is also receiving attention.
For example, patent document 202211286741.1 discloses a method, a device, equipment and a storage medium for constructing an electric carbon policy knowledge graph, wherein the method comprises the steps of obtaining an electric carbon policy text and a pre-constructed policy ontology model, wherein the policy ontology in the policy ontology model comprises policy entities, policy attributes and policy relations; based on the policy ontology model, adopting a trained policy ontology extraction model to extract policy ontology information of each policy ontology in the electric carbon policy text; and importing the policy ontology information into a target graphic database to generate an electric carbon policy knowledge graph.
However, existing atlas-building systems similar to the above-mentioned documents still suffer from the following disadvantages:
the existing map construction system cannot well comb and convert carbon policy data, is not beneficial to stroking and clearing carbon policy business venues, influences the map construction efficiency and logic, and the constructed carbon policy knowledge map lacks service application support, is not convenient for visual display and efficient search and query of carbon policy information, cannot provide carbon policy related services, and cannot provide good assistance for governments, enterprises and companies.
Therefore, there is an urgent need to improve the shortcomings, and the present invention is to study and improve the existing structure and shortcomings, and provide a carbon policy knowledge graph construction system.
Disclosure of Invention
The invention aims to provide a carbon policy knowledge graph construction system for solving the problems in the background technology.
In order to achieve the above purpose, the present invention provides the following technical solutions: a carbon policy knowledge graph construction system, comprising:
the data collection module is used for collecting carbon policy data from various approaches, integrating the collected data and then sending the integrated data to the appointed module;
the data processing module is used for processing the carbon policy data and processing the carbon policy data into a data format meeting the construction requirements through word segmentation, screening and classification operations which are sequentially carried out;
the carbon policy database is used for storing the processed carbon policy data, guaranteeing the integrity of the data through encryption or other protective measures, and guaranteeing the safety of data access and retrieval through data access authority control;
a map construction module for performing the steps of:
extracting unstructured data in the collected carbon policy files from a data source, identifying the carbon policy file names of the extracted text data, linking the extracted entity relationship and attribute relationship through a constructed knowledge graph algorithm, establishing an association relationship, visually displaying through a graph database, fusing the extracted text data, eliminating repeated and conflicting information, and combing and converting carbon policy knowledge;
a carbon policy knowledge graph, wherein the carbon policy knowledge graph is a structured graph formed by refining, extracting, correlating and integrating related knowledge and information in the electric carbon policy field;
the service application module is used for utilizing the constructed carbon policy knowledge graph, opening an information retrieval function, carrying out accurate or fuzzy matching based on the names of the carbon policy files, quickly and accurately checking the corresponding association relations among the files, facilitating the visual display and efficient retrieval and inquiry of the carbon policy information, and supporting the mining analysis of the carbon policy information;
and the maintenance updating module is used for maintaining the framework and the rules of the carbon policy knowledge graph and updating the content of the carbon policy knowledge graph.
Further, the collection pathways of the data collection module include the internet, expert research, internal documentation, and government platforms, and the carbon policy data collected includes, but is not limited to, policy type, policy hierarchy, release area, release year, and carbon policy keywords.
Further, the data processing module comprises a text word segmentation unit, a manual screening unit and a text classification unit, wherein the output end of the text word segmentation unit is connected with the manual screening unit, and the output end of the manual screening unit is connected with the text classification unit.
Further, the text word segmentation unit is used for segmenting continuous Chinese text into words or phrases with semantic units according to rules and algorithms, spaces or other boundary marks are added between the words, the manual screening unit is used for manually screening the text after word segmentation according to screening rules, and the text classification unit is used for classifying the screened text in sequence based on text labeling and text classification algorithms.
Further, the word segmentation algorithm of the text word segmentation unit comprises a jiaba word segmentation algorithm and a TextRank algorithm, wherein the jiaba word segmentation algorithm is used for segmenting continuous Chinese text into meaningful words, the statistics and rules are based, the dictionary-based word segmentation and the statistics-based word segmentation method are combined, the TextRank algorithm is used for constructing a network according to the co-occurrence relation among words, and edges in the constructed network are undirected and authorized edges.
Further, the map construction module comprises a knowledge extraction unit, a knowledge link unit and a knowledge fusion unit, wherein the output end of the knowledge extraction unit is connected with the knowledge link unit, and the output end of the knowledge link unit is connected with the knowledge fusion unit.
Further, the knowledge extraction unit is used for extracting unstructured data in the collected carbon policy file from a data source, carrying out entity identification on the extracted text data, wherein the entity is a carded carbon policy file name, the knowledge linking unit is used for linking the extracted entity relationship and attribute relationship through a constructed knowledge graph algorithm, establishing an association relationship between the entity and between the entity and the attribute, and carrying out the association condition corresponding to the entity and the attribute through a graph database visualization, the knowledge fusion unit is used for fusing the extracted text data, eliminating repeated and conflicting information through data cleaning, normalization processing and unification processing, manually carding carbon policy knowledge triplet information, and carding and converting the carbon policy according to the format of an entity-relationship-entity triplet.
Further, the service application module comprises carbon policy hotspot monitoring, carbon policy index benchmarking and thematic analysis, wherein the carbon policy hotspot monitoring is performed in three dimensions of a time dimension, a region dimension and a hierarchy dimension, the carbon policy index benchmarking comprises three indexes of carbon emission, carbon transaction and carbon emission reduction, and the thematic analysis comprises carbon sink analysis and carbon technology analysis.
Further, one output end of the maintenance updating module is connected with the data collecting module, and the other output end of the maintenance updating module is connected with the carbon policy knowledge graph.
Further, the use flow of the carbon policy knowledge graph construction system is as follows:
the method comprises the steps that firstly, a data collection module collects carbon policy data such as policy types, policy levels, release areas, release years, carbon policy keywords and the like through Internet, expert investigation, internal files and government platform approaches, and the carbon policy data is integrated and then sent to a data processing module;
step two, the data processing module cuts continuous Chinese texts into words or phrases with semantic units according to rules and algorithms through a text word segmentation unit, spaces or other boundary marks are added between the words, then a manual screening unit manually screens the text after word segmentation according to screening rules, and a text classification unit sequentially classifies the screened texts based on the text marks and the text classification algorithm and stores the classified texts into a carbon policy database;
extracting unstructured data in the collected carbon policy files from a carbon policy database by a map construction module through a knowledge extraction unit, identifying the names of the carbon policy files of the extracted text data, linking the extracted entity relations and attribute relations through a knowledge linking unit, establishing association relations among the entities and attribute through a constructed knowledge map algorithm, visually displaying the association conditions corresponding to the entities and the attribute through a map database, fusing the extracted text data through a knowledge fusion unit, eliminating repeated and conflicting information through data cleaning, normalization processing and unification processing, manually carding carbon policy knowledge triplet information, and carding and converting carbon policy knowledge to form a carbon policy knowledge map;
and fourthly, the service application module utilizes the constructed carbon policy knowledge graph to open an information retrieval function, performs accurate or fuzzy matching based on the names of the carbon policy documents, rapidly and accurately checks the corresponding association relation among the documents, facilitates visual display and efficient retrieval and inquiry of the carbon policy information, supports mining and analysis of the carbon policy information, and maintains the framework and rules of the carbon policy knowledge graph through the maintenance and update module to update the content of the carbon policy knowledge graph.
The invention provides a carbon policy knowledge graph construction system, which has the following beneficial effects:
the invention carries out carding and conversion on the carbon policy data according to the format of entity-relation-entity, is beneficial to stroking out the carbon policy business venation, is beneficial to the directional development of the subsequent thematic application research, constructs and forms the carbon policy knowledge graph to facilitate the visual display and efficient retrieval and inquiry of the carbon policy information, provides powerful support for the establishment and evaluation of the double-carbon policies of governments, enterprises and companies, and provides the service of carbon policy hotspot supervision, carbon policy index pair mark and thematic analysis, not only helps the low-carbon high-quality development of the enterprises, but also helps the decision analysis of the double-carbon targets of the governments, the enterprises and the companies.
Drawings
FIG. 1 is a schematic diagram of a construction flow of a carbon policy knowledge graph construction system according to the present invention;
FIG. 2 is a schematic diagram of the overall architecture of a carbon policy knowledge graph construction system according to the present invention;
FIG. 3 is a flowchart illustrating the operation of a data processing module of the carbon policy knowledge graph construction system according to the present invention;
FIG. 4 is a flowchart illustrating the operation of a graph construction module of the carbon policy knowledge graph construction system according to the present invention;
FIG. 5 is a schematic diagram of knowledge fusion of a carbon policy knowledge graph construction system according to the present invention;
fig. 6 is a service application block diagram of a carbon policy knowledge graph construction system according to the present invention.
Detailed Description
Embodiments of the present invention are described in further detail below with reference to the accompanying drawings and examples. The following examples are illustrative of the invention but are not intended to limit the scope of the invention.
As shown in fig. 1 to 6, a carbon policy knowledge graph construction system includes:
the data collection module is used for collecting carbon policy data from various approaches, integrating the collected data and then sending the integrated data to the appointed module; the collection pathways of the data collection module include the internet, expert investigation, internal documentation, and government platforms, and the collected carbon policy data includes, but is not limited to, policy type, policy hierarchy, release area, release year, and carbon policy keywords;
the data processing module is used for processing the carbon policy data and processing the carbon policy data into a data format meeting the construction requirement through word segmentation, screening and classification operations which are sequentially carried out; the data processing module comprises a text word segmentation unit, a manual screening unit and a text classification unit, wherein the output end of the text word segmentation unit is connected with the manual screening unit, and the output end of the manual screening unit is connected with the text classification unit; the text word segmentation unit is used for segmenting continuous Chinese text into words or phrases with semantic units according to rules and algorithms, spaces or other boundary marks are added between the words, the manual screening unit is used for manually screening the text after word segmentation according to screening rules, and the text classification unit is used for classifying the screened text in sequence based on text labeling and text classification algorithms; the word segmentation algorithm of the text word segmentation unit comprises a jiaba word segmentation algorithm and a TextRank algorithm, wherein the jiaba word segmentation algorithm is used for segmenting continuous Chinese text into meaningful words, the word segmentation algorithm is based on statistics and rules and combines a word segmentation method based on dictionary and a word segmentation method based on statistics, the TextRank algorithm is used for constructing a network according to the co-occurrence relation among words, and edges in the constructed network are undirected and authorized edges;
the carbon policy database is used for storing the processed carbon policy data, guaranteeing the integrity of the data through encryption or other protective measures, and guaranteeing the safety of data access and retrieval through data access authority control;
the map construction module is used for executing the following steps:
extracting unstructured data in the collected carbon policy files from a data source, identifying the carbon policy file names of the extracted text data, linking the extracted entity relationship and attribute relationship through a constructed knowledge graph algorithm, establishing an association relationship, visually displaying through a graph database, fusing the extracted text data, eliminating repeated and conflicting information, and combing and converting carbon policy knowledge;
the map construction module comprises a knowledge extraction unit, a knowledge link unit and a knowledge fusion unit, wherein the output end of the knowledge extraction unit is connected with the knowledge link unit, and the output end of the knowledge link unit is connected with the knowledge fusion unit; the knowledge extraction unit is used for extracting unstructured data in the collected carbon policy files from a data source, carrying out entity identification on the extracted text data, wherein an entity is a carded carbon policy file name, the knowledge linking unit is used for linking the extracted entity relationship and attribute relationship through a constructed knowledge graph algorithm, establishing an association relationship between the entity and an association condition corresponding to the attribute, carrying out visualization of a graph database, and carrying out association conditions corresponding to the entity and the attribute, the knowledge fusion unit is used for fusing the extracted text data, eliminating repeated and conflicting information through data cleaning, normalization processing and unification processing, manually carding carbon policy knowledge triplet information, and carding and converting carbon policy knowledge according to an entity-relationship-entity triplet format;
a carbon policy knowledge graph, which is a structured graph formed by refining, extracting, correlating and integrating related knowledge and information in the electric carbon policy field;
the service application module is used for utilizing the constructed carbon policy knowledge graph, opening an information retrieval function, carrying out accurate or fuzzy matching based on the names of the carbon policy files, quickly and accurately checking the corresponding association relations among the files, facilitating the visual display and efficient retrieval and inquiry of the carbon policy information, and supporting the mining analysis of the carbon policy information; the service application module comprises carbon policy hotspot monitoring, carbon policy index benchmarking and thematic analysis, wherein the carbon policy hotspot monitoring is performed in three dimensions of time dimension, region dimension and hierarchy dimension, the carbon policy index benchmarking comprises three indexes of carbon emission, carbon transaction and carbon emission reduction, and the thematic analysis comprises carbon sink analysis and carbon technology analysis;
the maintenance updating module is used for maintaining the framework and rules of the carbon policy knowledge graph and updating the content of the carbon policy knowledge graph; one output end of the maintenance updating module is connected with the data collecting module, and the other output end of the maintenance updating module is connected with the carbon policy knowledge graph.
In summary, with reference to fig. 1 to 6, the carbon policy knowledge graph construction system has the following usage flow when in use:
the method comprises the steps that firstly, a data collection module collects carbon policy data such as policy types, policy levels, release areas, release years, carbon policy keywords and the like through Internet, expert investigation, internal files and government platform approaches, and the carbon policy data is integrated and then sent to a data processing module;
step two, the data processing module cuts continuous Chinese texts into words or phrases with semantic units according to rules and algorithms through a text word segmentation unit, spaces or other boundary marks are added between the words, then a manual screening unit manually screens the text after word segmentation according to screening rules, and a text classification unit sequentially classifies the screened texts based on the text marks and the text classification algorithm and stores the classified texts into a carbon policy database;
extracting unstructured data in the collected carbon policy files from a carbon policy database by a map construction module through a knowledge extraction unit, identifying the names of the carbon policy files of the extracted text data, linking the extracted entity relations and attribute relations through a knowledge linking unit, establishing association relations among the entities and attribute through a constructed knowledge map algorithm, visually displaying the association conditions corresponding to the entities and the attribute through a map database, fusing the extracted text data through a knowledge fusion unit, eliminating repeated and conflicting information through data cleaning, normalization processing and unification processing, manually carding carbon policy knowledge triplet information, and carding and converting carbon policy knowledge to form a carbon policy knowledge map;
and fourthly, the service application module utilizes the constructed carbon policy knowledge graph to open an information retrieval function, performs accurate or fuzzy matching based on the names of the carbon policy documents, rapidly and accurately checks the corresponding association relation among the documents, facilitates visual display and efficient retrieval and inquiry of the carbon policy information, supports mining and analysis of the carbon policy information, and maintains the framework and rules of the carbon policy knowledge graph through the maintenance and update module to update the content of the carbon policy knowledge graph.
The embodiments of the invention have been presented for purposes of illustration and description, and are not intended to be exhaustive or limited to the invention in the form disclosed. Many modifications and variations will be apparent to those of ordinary skill in the art. The embodiments were chosen and described in order to best explain the principles of the invention and the practical application, and to enable others of ordinary skill in the art to understand the invention for various embodiments with various modifications as are suited to the particular use contemplated.

Claims (10)

1. A carbon policy knowledge graph construction system, comprising:
the data collection module is used for collecting carbon policy data from various approaches, integrating the collected data and then sending the integrated data to the appointed module;
the data processing module is used for processing the carbon policy data and processing the carbon policy data into a data format meeting the construction requirements through word segmentation, screening and classification operations which are sequentially carried out;
the carbon policy database is used for storing the processed carbon policy data, guaranteeing the integrity of the data through encryption or other protective measures, and guaranteeing the safety of data access and retrieval through data access authority control;
a map construction module for performing the steps of:
extracting unstructured data in the collected carbon policy files from a data source, identifying the carbon policy file names of the extracted text data, linking the extracted entity relationship and attribute relationship through a constructed knowledge graph algorithm, establishing an association relationship, visually displaying through a graph database, fusing the extracted text data, eliminating repeated and conflicting information, and combing and converting carbon policy knowledge;
a carbon policy knowledge graph, wherein the carbon policy knowledge graph is a structured graph formed by refining, extracting, correlating and integrating related knowledge and information in the electric carbon policy field;
the service application module is used for utilizing the constructed carbon policy knowledge graph, opening an information retrieval function, carrying out accurate or fuzzy matching based on the names of the carbon policy files, quickly and accurately checking the corresponding association relations among the files, facilitating the visual display and efficient retrieval and inquiry of the carbon policy information, and supporting the mining analysis of the carbon policy information;
and the maintenance updating module is used for maintaining the framework and the rules of the carbon policy knowledge graph and updating the content of the carbon policy knowledge graph.
2. The carbon policy knowledge graph construction system of claim 1, wherein collection pathways of said data collection module include internet, expert investigation, internal documentation, and government platforms, and the collected carbon policy data includes, but is not limited to, policy type, policy hierarchy, release area, release year, and carbon policy keywords.
3. The carbon policy knowledge graph construction system of claim 1, wherein the data processing module comprises a text word segmentation unit, a manual screening unit and a text classification unit, wherein the output end of the text word segmentation unit is connected with the manual screening unit, and the output end of the manual screening unit is connected with the text classification unit.
4. The system according to claim 1, wherein the text word segmentation unit is configured to segment continuous chinese text into words or phrases with semantic units according to rules and algorithms, and to add spaces or other boundary marks between words, the manual screening unit is configured to manually screen the text after word segmentation according to screening rules, and the text classification unit is configured to sequentially classify the screened text based on text labeling and text classification algorithms.
5. The system according to claim 4, wherein the word segmentation algorithm of the text word segmentation unit includes a jiaba word segmentation algorithm for segmenting continuous chinese text into meaningful words, based on statistics and rules, and a TextRank algorithm for constructing a network according to co-occurrence relations between words, and edges in the constructed network are undirected weighted edges, in combination with dictionary-based word segmentation and statistical-based word segmentation methods.
6. The carbon policy knowledge graph construction system according to claim 1, wherein the graph construction module comprises a knowledge extraction unit, a knowledge linking unit and a knowledge fusion unit, wherein an output end of the knowledge extraction unit is connected with the knowledge linking unit, and an output end of the knowledge linking unit is connected with the knowledge fusion unit.
7. The system according to claim 6, wherein the knowledge extraction unit is configured to extract unstructured data in the collected carbon policy file from a data source, identify the extracted text data as an entity, and comb the carbon policy knowledge triplet information, and comb and convert the carbon policy knowledge according to a format of a "entity-relation-entity" triplet.
8. The system of claim 1, wherein the service application module includes a carbon policy hotspot monitor, a carbon policy index pair label and a topic analysis, the carbon policy hotspot monitor is divided into three dimensions of a time dimension, a region dimension and a hierarchy dimension, the carbon policy index pair label includes three indexes of carbon emission, carbon transaction and carbon emission reduction, and the topic analysis includes a carbon sink analysis and a carbon technology analysis.
9. The system of claim 1, wherein one output of the maintenance update module is connected to the data collection module and the other output of the maintenance update module is connected to the carbon policy knowledge graph.
10. The carbon policy knowledge graph construction system of any one of claims 1-9, wherein the carbon policy knowledge graph construction system comprises the following steps:
the method comprises the steps that firstly, a data collection module collects carbon policy data such as policy types, policy levels, release areas, release years, carbon policy keywords and the like through Internet, expert investigation, internal files and government platform approaches, and the carbon policy data is integrated and then sent to a data processing module;
step two, the data processing module cuts continuous Chinese texts into words or phrases with semantic units according to rules and algorithms through a text word segmentation unit, spaces or other boundary marks are added between the words, then a manual screening unit manually screens the text after word segmentation according to screening rules, and a text classification unit sequentially classifies the screened texts based on the text marks and the text classification algorithm and stores the classified texts into a carbon policy database;
extracting unstructured data in the collected carbon policy files from a carbon policy database by a map construction module through a knowledge extraction unit, identifying the names of the carbon policy files of the extracted text data, linking the extracted entity relations and attribute relations through a knowledge linking unit, establishing association relations among the entities and attribute through a constructed knowledge map algorithm, visually displaying the association conditions corresponding to the entities and the attribute through a map database, fusing the extracted text data through a knowledge fusion unit, eliminating repeated and conflicting information through data cleaning, normalization processing and unification processing, manually carding carbon policy knowledge triplet information, and carding and converting carbon policy knowledge to form a carbon policy knowledge map;
and fourthly, the service application module utilizes the constructed carbon policy knowledge graph to open an information retrieval function, performs accurate or fuzzy matching based on the names of the carbon policy documents, rapidly and accurately checks the corresponding association relation among the documents, facilitates visual display and efficient retrieval and inquiry of the carbon policy information, supports mining and analysis of the carbon policy information, and maintains the framework and rules of the carbon policy knowledge graph through the maintenance and update module to update the content of the carbon policy knowledge graph.
CN202311418018.9A 2023-10-30 2023-10-30 Carbon policy knowledge graph construction system Pending CN117371531A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202311418018.9A CN117371531A (en) 2023-10-30 2023-10-30 Carbon policy knowledge graph construction system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202311418018.9A CN117371531A (en) 2023-10-30 2023-10-30 Carbon policy knowledge graph construction system

Publications (1)

Publication Number Publication Date
CN117371531A true CN117371531A (en) 2024-01-09

Family

ID=89394368

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202311418018.9A Pending CN117371531A (en) 2023-10-30 2023-10-30 Carbon policy knowledge graph construction system

Country Status (1)

Country Link
CN (1) CN117371531A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117708350A (en) * 2024-02-06 2024-03-15 成都草根有智创新科技有限公司 Enterprise policy information association method and device and electronic equipment

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117708350A (en) * 2024-02-06 2024-03-15 成都草根有智创新科技有限公司 Enterprise policy information association method and device and electronic equipment
CN117708350B (en) * 2024-02-06 2024-05-14 成都草根有智创新科技有限公司 Enterprise policy information association method and device and electronic equipment

Similar Documents

Publication Publication Date Title
CN111708773B (en) Multi-source scientific and creative resource data fusion method
CN108364124B (en) International capacity cooperative risk assessment and decision service system based on big data
CN112380318A (en) Enterprise policy matching method based on label similarity
CN107147639A (en) A kind of actual time safety method for early warning based on Complex event processing
CN110533212A (en) Urban waterlogging public sentiment monitoring and pre-alarming method based on big data
CN112288247B (en) Soil heavy metal risk identification method based on space interaction relationship
CN111538741B (en) Deep learning analysis method and system for big data of alarm condition
CN109657058A (en) A kind of abstracting method of notice information
CN117371531A (en) Carbon policy knowledge graph construction system
KR102396771B1 (en) A method for extracting disaster cause automatically
CN116384889A (en) Intelligent analysis method for information big data based on natural language processing technology
CN110188092B (en) System and method for mining new type contradiction dispute in people mediation
CN114860882A (en) Fair competition review auxiliary method based on text classification model
CN113239208A (en) Mark training model based on knowledge graph
CN110472075A (en) A kind of isomeric data classification storage method and system based on machine learning
CN114693906A (en) Travel reimbursement abnormal behavior detection method and system based on space-time rule
CN110532492A (en) A kind of forum data management classification system and method
CN115545437A (en) Financial enterprise operation risk early warning method based on multi-source heterogeneous data fusion
CN117473512A (en) Vulnerability risk assessment method based on network mapping
Memon et al. Harvesting covert networks: a case study of the iMiner database
CN117436729A (en) Government system based data management and data analysis method
Bondoc et al. An intelligent road traffic information system using text analysis in the most congested roads in Metro Manila
González-Conejero et al. Organized crime structure modelling for european law enforcement agencies interoperability through ontologies
CN115757832A (en) Case detecting and handling model system based on knowledge graph technology
WO2022092497A1 (en) System for providing similar case information, and method therefor

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination