CN115544336A - Industrial chain generation method, device, computer equipment and storage medium - Google Patents

Industrial chain generation method, device, computer equipment and storage medium Download PDF

Info

Publication number
CN115544336A
CN115544336A CN202211529390.2A CN202211529390A CN115544336A CN 115544336 A CN115544336 A CN 115544336A CN 202211529390 A CN202211529390 A CN 202211529390A CN 115544336 A CN115544336 A CN 115544336A
Authority
CN
China
Prior art keywords
industry
target
industries
target industry
data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202211529390.2A
Other languages
Chinese (zh)
Inventor
龚起航
周尚礼
郑楷洪
李胜
刘玉仙
曾璐琨
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
CSG Digital Power GRID Research Institute Co Ltd
Original Assignee
Southern Power Grid Digital Grid Research Institute Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Southern Power Grid Digital Grid Research Institute Co Ltd filed Critical Southern Power Grid Digital Grid Research Institute Co Ltd
Priority to CN202211529390.2A priority Critical patent/CN115544336A/en
Publication of CN115544336A publication Critical patent/CN115544336A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/904Browsing; Visualisation therefor
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/26Visual data mining; Browsing structured data
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28Databases characterised by their database models, e.g. relational or object models
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/901Indexing; Data structures therefor; Storage structures
    • G06F16/9024Graphs; Linked lists
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/906Clustering; Classification
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/289Phrasal analysis, e.g. finite state techniques or chunking
    • G06F40/295Named entity recognition

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Software Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

本申请涉及一种产业链生成方法、装置、计算机设备和存储介质。所述方法包括:根据各个目标产业的产业电量数据和产业生产数据,确定所述各个目标产业之间的数据关联度;根据所述各个目标产业的名称信息之间的距离,确定所述各个目标产业之间的名称关联度;根据所述各个目标产业的产业关联信息中的实体关系信息,确定所述各个目标产业之间的关系关联度;根据所述各个目标产业之间的所述数据关联度、所述名称关联度和所述关系关联度,生成所述各个目标产业对应的产业链图谱。采用本方法能够提高产业链构建时效性。

Figure 202211529390

The present application relates to an industrial chain generation method, device, computer equipment and storage medium. The method includes: according to the industrial electricity data and industrial production data of each target industry, determining the data correlation degree between the various target industries; according to the distance between the name information of the various target industries, determining the The degree of name association between industries; according to the entity relationship information in the industry association information of each target industry, determine the degree of relationship association between each target industry; according to the data association between each target industry degree, the degree of name association, and the degree of relationship association to generate an industrial chain map corresponding to each target industry. Adopting this method can improve the timeliness of industrial chain construction.

Figure 202211529390

Description

产业链生成方法、装置、计算机设备和存储介质Industrial chain generation method, device, computer equipment and storage medium

技术领域technical field

本申请涉及大数据技术领域,特别是涉及一种产业链生成方法、装置、计算机设备、存储介质和计算机程序产品。The present application relates to the field of big data technology, and in particular to an industrial chain generation method, device, computer equipment, storage medium and computer program product.

背景技术Background technique

产业链的上下游能够反映不同行业之间的关联关系,为指导宏观经济生产和指导行业发展等提供了研究基础。The upstream and downstream of the industrial chain can reflect the relationship between different industries, and provide a research basis for guiding macroeconomic production and industry development.

目前,现有产业链主要是通过人工手动建立可视化的产业链,然而这种手动建立产业链的方式需要依据专家经验进行产业链构建,往往需要花费大量的时间,因此,这种方式存在时效性较差的问题。At present, the existing industrial chain mainly builds a visual industrial chain manually. However, this method of manually establishing an industrial chain needs to build an industrial chain based on expert experience, which often takes a lot of time. Therefore, this method is time-sensitive. Poor question.

发明内容Contents of the invention

基于此,有必要针对上述技术问题,提供一种能够提高产业链构建时效性的产业链生成方法、装置、计算机设备、计算机可读存储介质和计算机程序产品。Based on this, it is necessary to provide an industrial chain generation method, device, computer equipment, computer readable storage medium and computer program product that can improve the timeliness of industrial chain construction in view of the above technical problems.

第一方面,本申请提供了一种产业链生成方法。所述方法包括:In a first aspect, the present application provides a method for generating an industrial chain. The methods include:

根据各个目标产业的产业电量数据和产业生产数据,确定所述各个目标产业之间的数据关联度;According to the industrial electricity data and industrial production data of each target industry, determine the data correlation between the various target industries;

根据所述各个目标产业的名称信息之间的距离,确定所述各个目标产业之间的名称关联度;According to the distance between the name information of the various target industries, determine the degree of name association between the various target industries;

根据所述各个目标产业的产业关联信息中的实体关系信息,确定所述各个目标产业之间的关系关联度;According to the entity relationship information in the industry association information of each target industry, determine the degree of relationship association between each target industry;

根据所述各个目标产业之间的所述数据关联度、所述名称关联度和所述关系关联度,生成所述各个目标产业对应的产业链图谱。According to the data association degree, the name association degree and the relationship association degree among the various target industries, an industry chain map corresponding to each target industry is generated.

在其中一个实施例中,根据各个目标产业的产业电量数据和产业生产数据,确定所述各个目标产业之间的数据关联度,包括:In one of the embodiments, according to the industrial electricity data and industrial production data of each target industry, the data correlation degree between the various target industries is determined, including:

根据所述各个目标产业的产业电量数据,得到所述各个目标产业的历史电量增长率;According to the industrial electricity data of each target industry, the historical electricity growth rate of each target industry is obtained;

根据所述各个目标产业的产业生产数据,得到所述各个目标产业的历史生产增长率;According to the industrial production data of each target industry, the historical production growth rate of each target industry is obtained;

根据所述历史电量增长率和所述历史生产增长率,对所述各个目标产业进行聚类处理,得到所述各个目标产业的产业分类结果;According to the historical electricity growth rate and the historical production growth rate, perform clustering processing on the target industries to obtain the industry classification results of the target industries;

根据所述各个目标产业的产业分类结果,得到所述各个目标产业之间的数据关联度。According to the industry classification results of the various target industries, the degree of data association between the various target industries is obtained.

在其中一个实施例中,根据所述各个目标产业的名称信息之间的距离,确定所述各个目标产业之间的名称关联度,包括:In one of the embodiments, according to the distance between the name information of the various target industries, determining the degree of name association between the various target industries includes:

在检测到第一目标产业的名称信息与第二目标产业的名称信息不同的情况下,获取将所述第一目标产业的名称信息更新为所述第二目标产业的名称信息所需的文本更新次数;所述第一目标产业为所述各个目标产业中任意一个目标产业;所述第二目标产业为所述各个目标产业中除所述第一目标产业以外的任意一个目标产业;In a case where it is detected that the name information of the first target industry is different from the name information of the second target industry, obtaining a text update required to update the name information of the first target industry to the name information of the second target industry times; the first target industry is any target industry among the various target industries; the second target industry is any target industry among the various target industries except the first target industry;

根据所述文本更新次数,得到所述第一目标产业的名称信息和所述第二目标产业的名称信息之间的距离;Obtaining the distance between the name information of the first target industry and the name information of the second target industry according to the number of text updates;

根据所述距离,得到所述第一目标产业和所述第二目标产业之间的名称关联度,作为所述各个目标产业之间的名称关联度。According to the distance, the degree of name association between the first target industry and the second target industry is obtained as the degree of name association between the respective target industries.

在其中一个实施例中,根据所述各个目标产业的产业关联信息中的实体关系信息,确定所述各个目标产业之间的关系关联度,包括:In one of the embodiments, according to the entity relationship information in the industry association information of each target industry, determining the degree of relationship between the various target industries includes:

对所述各个目标产业的产业关联信息进行实体关系提取处理,得到各个所述产业关联信息中的实体关系信息;Perform entity relationship extraction processing on the industry-related information of each target industry to obtain entity relationship information in each of the industry-related information;

根据产业关系字典库中的产业之间的上下游关系,对所述实体关系信息中的目标产业进行上游概率判断和下游概率判断,得到所述实体关系信息中的目标产业的上游概率和下游概率;According to the upstream and downstream relationships between industries in the industrial relationship dictionary database, perform upstream probability judgment and downstream probability judgment on the target industry in the entity relationship information, and obtain the upstream probability and downstream probability of the target industry in the entity relationship information ;

根据所述实体关系信息中的目标产业的所述上游概率和所述下游概率,得到所述各个目标产业之间的关系关联度。According to the upstream probability and the downstream probability of the target industry in the entity relationship information, the degree of relationship association between the various target industries is obtained.

在其中一个实施例中,根据所述各个目标产业之间的所述数据关联度、所述名称关联度和所述关系关联度,生成所述各个目标产业对应的产业链图谱,包括:In one of the embodiments, according to the data association degree, the name association degree and the relationship association degree among the various target industries, the industry chain map corresponding to each target industry is generated, including:

根据所述各个目标产业之间的所述数据关联度、所述名称关联度和所述关系关联度,确定所述各个目标产业之间的产业关联概率;determining the industry association probability between the various target industries according to the data association degree, the name association degree and the relationship association degree among the various target industries;

根据所述上游概率、所述下游概率和所述产业关联概率,确定所述各个目标产业对应的上下游产业;determining the upstream and downstream industries corresponding to the respective target industries according to the upstream probability, the downstream probability and the industry association probability;

对所述各个目标产业对应的上下游产业进行可视化图形处理,得到所述产业链图谱。Visual graphics processing is performed on the upstream and downstream industries corresponding to each target industry to obtain the industrial chain map.

在其中一个实施例中,在根据所述各个目标产业之间的所述数据关联度、所述名称关联度和所述关系关联度,生成所述各个目标产业对应的产业链图谱之后,还包括:In one of the embodiments, after generating the industry chain map corresponding to each target industry according to the data association degree, the name association degree, and the relationship association degree among the various target industries, it further includes :

获取所述产业链图谱中各个目标产业在当前时间段的当前电量数据;Obtain the current power data of each target industry in the current time period in the industrial chain map;

根据所述各个目标产业的当前电量数据,得到所述各个目标产业的当前电量增长率;According to the current electricity data of each target industry, the current electricity growth rate of each target industry is obtained;

对所述各个目标产业的所述当前电量增长率和所述历史电量增长率进行同比评估和环比评估,得到所述各个目标产业的评估结果;conducting year-on-year and ring-to-ring assessments of the current electricity growth rate and the historical electricity growth rate of each target industry to obtain the evaluation results of each target industry;

在所述评估结果为异常的情况下,在所述产业链图谱中对所述评估结果对应的目标产业进行上下游更新,得到所述当前时间段的产业链图谱。If the evaluation result is abnormal, the target industry corresponding to the evaluation result is updated upstream and downstream in the industry chain map to obtain the industry chain map of the current time period.

第二方面,本申请还提供了一种产业链生成装置。所述装置包括:In the second aspect, the present application also provides an industrial chain generation device. The devices include:

第一关联度确定模块,用于根据各个目标产业的产业电量数据和产业生产数据,确定所述各个目标产业之间的数据关联度;The first correlation degree determination module is used to determine the data correlation degree between the various target industries according to the industrial electricity data and industrial production data of each target industry;

第二关联度确定模块,用于根据所述各个目标产业的名称信息之间的距离,确定所述各个目标产业之间的名称关联度;The second association degree determination module is used to determine the name association degree between the various target industries according to the distance between the name information of the various target industries;

第三关联度确定模块,用于根据所述各个目标产业的产业关联信息中的实体关系信息,确定所述各个目标产业之间的关系关联度;The third association degree determination module is used to determine the relationship association degree between the various target industries according to the entity relationship information in the industry association information of the various target industries;

产业链图谱生成模块,用于根据所述各个目标产业之间的所述数据关联度、所述名称关联度和所述关系关联度,生成所述各个目标产业对应的产业链图谱。An industry chain graph generation module, configured to generate an industry chain graph corresponding to each target industry according to the data correlation degree, the name correlation degree, and the relationship correlation degree among the various target industries.

第三方面,本申请还提供了一种计算机设备。所述计算机设备包括存储器和处理器,所述存储器存储有计算机程序,所述处理器执行所述计算机程序时实现以下步骤:In a third aspect, the present application also provides a computer device. The computer device includes a memory and a processor, the memory stores a computer program, and the processor implements the following steps when executing the computer program:

根据各个目标产业的产业电量数据和产业生产数据,确定所述各个目标产业之间的数据关联度;According to the industrial electricity data and industrial production data of each target industry, determine the data correlation between the various target industries;

根据所述各个目标产业的名称信息之间的距离,确定所述各个目标产业之间的名称关联度;According to the distance between the name information of the various target industries, determine the degree of name association between the various target industries;

根据所述各个目标产业的产业关联信息中的实体关系信息,确定所述各个目标产业之间的关系关联度;According to the entity relationship information in the industry association information of each target industry, determine the degree of relationship association between each target industry;

根据所述各个目标产业之间的所述数据关联度、所述名称关联度和所述关系关联度,生成所述各个目标产业对应的产业链图谱。According to the data association degree, the name association degree and the relationship association degree among the various target industries, an industry chain map corresponding to each target industry is generated.

第四方面,本申请还提供了一种计算机可读存储介质。所述计算机可读存储介质,其上存储有计算机程序,所述计算机程序被处理器执行时实现以下步骤:In a fourth aspect, the present application also provides a computer-readable storage medium. The computer-readable storage medium has a computer program stored thereon, and when the computer program is executed by a processor, the following steps are implemented:

根据各个目标产业的产业电量数据和产业生产数据,确定所述各个目标产业之间的数据关联度;According to the industrial electricity data and industrial production data of each target industry, determine the data correlation between the various target industries;

根据所述各个目标产业的名称信息之间的距离,确定所述各个目标产业之间的名称关联度;According to the distance between the name information of the various target industries, determine the degree of name association between the various target industries;

根据所述各个目标产业的产业关联信息中的实体关系信息,确定所述各个目标产业之间的关系关联度;According to the entity relationship information in the industry association information of each target industry, determine the degree of relationship association between each target industry;

根据所述各个目标产业之间的所述数据关联度、所述名称关联度和所述关系关联度,生成所述各个目标产业对应的产业链图谱。According to the data association degree, the name association degree and the relationship association degree among the various target industries, an industry chain map corresponding to each target industry is generated.

第五方面,本申请还提供了一种计算机程序产品。所述计算机程序产品,包括计算机程序,该计算机程序被处理器执行时实现以下步骤:In a fifth aspect, the present application also provides a computer program product. The computer program product includes a computer program, and when the computer program is executed by a processor, the following steps are implemented:

根据各个目标产业的产业电量数据和产业生产数据,确定所述各个目标产业之间的数据关联度;According to the industrial electricity data and industrial production data of each target industry, determine the data correlation between the various target industries;

根据所述各个目标产业的名称信息之间的距离,确定所述各个目标产业之间的名称关联度;According to the distance between the name information of the various target industries, determine the degree of name association between the various target industries;

根据所述各个目标产业的产业关联信息中的实体关系信息,确定所述各个目标产业之间的关系关联度;According to the entity relationship information in the industry association information of each target industry, determine the degree of relationship association between each target industry;

根据所述各个目标产业之间的所述数据关联度、所述名称关联度和所述关系关联度,生成所述各个目标产业对应的产业链图谱。According to the data association degree, the name association degree and the relationship association degree among the various target industries, an industry chain map corresponding to each target industry is generated.

上述产业链生成方法、装置、计算机设备、存储介质和计算机程序产品,根据各个目标产业的产业电量数据和产业生产数据,确定各个目标产业之间的数据关联度;根据各个目标产业的名称信息之间的距离,确定各个目标产业之间的名称关联度;根据各个目标产业的产业关联信息中的实体关系信息,确定各个目标产业之间的关系关联度;根据各个目标产业之间的数据关联度、名称关联度和关系关联度,生成各个目标产业对应的产业链图谱。采用本方法,通过各个目标产业的产业电量数据和产业生产数据来确定各个目标产业之间的数据关联度,通过各个目标产业的产业关联信息来确定各个目标产业之间的关系关联度,并结合各个目标产业之间的名称关联度来综合生成产业链图谱,解决了传统技术中构建产业链的数据维度单一的问题,使得生成得到的产业链图谱能够更全面的展示出各个目标产业在经济因素、生产因素和文本因素等多方面因素上的关联,还无需依靠人工经验建立产业链图谱,有效地提高了生成得到的产业链图谱的时效性。The above-mentioned industrial chain generation method, device, computer equipment, storage medium, and computer program product determine the data correlation between each target industry according to the industrial electricity data and industrial production data of each target industry; according to the name information of each target industry According to the distance between each target industry, determine the degree of name association between each target industry; according to the entity relationship information in the industry association information of each target industry, determine the relationship association degree between each target industry; according to the data association degree between each target industry , name correlation degree and relationship correlation degree, and generate an industrial chain map corresponding to each target industry. Using this method, the data correlation degree between each target industry is determined through the industrial electricity data and industrial production data of each target industry, and the relationship correlation degree between each target industry is determined through the industry correlation information of each target industry, and combined with The name correlation between each target industry is used to comprehensively generate an industrial chain map, which solves the problem of single data dimension in building an industrial chain in traditional technology, so that the generated industrial chain map can more comprehensively display the economic factors of each target industry , production factors, and textual factors, etc., and does not need to rely on manual experience to establish an industrial chain map, which effectively improves the timeliness of the generated industrial chain map.

附图说明Description of drawings

图1为一个实施例中产业链生成方法的流程示意图;Fig. 1 is a schematic flow diagram of an industry chain generating method in an embodiment;

图2为一个实施例中产业链图谱的示意图;Fig. 2 is a schematic diagram of an industrial chain map in an embodiment;

图3为一个实施例中得到当前时间段的产业链图谱步骤的流程示意图;Fig. 3 is a schematic flow chart of the steps of obtaining the industry chain map of the current time period in one embodiment;

图4为另一个实施例中产业链生成方法的流程示意图;Fig. 4 is a schematic flow chart of an industry chain generation method in another embodiment;

图5为一个实施例中产业链生成装置的结构框图;Fig. 5 is a structural block diagram of an industrial chain generation device in an embodiment;

图6为一个实施例中计算机设备的内部结构图。Figure 6 is an internal block diagram of a computer device in one embodiment.

具体实施方式detailed description

为了使本申请的目的、技术方案及优点更加清楚明白,以下结合附图及实施例,对本申请进行进一步详细说明。应当理解,此处描述的具体实施例仅仅用以解释本申请,并不用于限定本申请。In order to make the purpose, technical solution and advantages of the present application clearer, the present application will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present application, and are not intended to limit the present application.

在一个实施例中,如图1所示,提供了一种产业链生成方法,本实施例以该方法应用于终端进行举例说明,可以理解的是,该方法也可以应用于服务器,还可以应用于包括终端和服务器的系统,并通过终端和服务器的交互实现。其中,终端可以但不限于是各种个人计算机、笔记本电脑、智能手机、平板电脑、物联网设备和便携式可穿戴设备。本实施例中,该方法包括以下步骤:In one embodiment, as shown in FIG. 1 , a method for generating an industry chain is provided. In this embodiment, the method is applied to a terminal for illustration. It can be understood that this method can also be applied to a server, or It is based on a system including a terminal and a server, and is realized through the interaction between the terminal and the server. Among them, the terminal can be, but not limited to, various personal computers, notebook computers, smart phones, tablet computers, Internet of Things devices and portable wearable devices. In this embodiment, the method includes the following steps:

步骤S101,根据各个目标产业的产业电量数据和产业生产数据,确定各个目标产业之间的数据关联度。Step S101, according to the industrial electricity data and industrial production data of each target industry, determine the data correlation degree between each target industry.

其中,产业电量数据是指描述目标产业的用电情况的数据;例如,产业电量数据包括但不限于是目标产业的实时用电量和历年的用电量;产业电量数据用于反映目标产业的一线生产因素。产业生产数据是指描述目标产业的生产经济的数据;例如,产业生产数据包括但不限于是目标产业的生产总值;产业生产数据反映目标产业的经济因素。Among them, the industrial electricity data refers to the data describing the electricity consumption of the target industry; for example, the industrial electricity data includes but not limited to the real-time electricity consumption of the target industry and the electricity consumption of the past years; the industrial electricity data is used to reflect the target industry’s electricity consumption. First-line production factors. Industrial production data refers to data describing the production economy of the target industry; for example, industrial production data includes but is not limited to the total output value of the target industry; industrial production data reflects the economic factors of the target industry.

其中,数据关联度是指用于度量目标产业两两之间在生产因素和经济因素方面的关联程度的指标。目标产业是指用于构建产业链图谱的产业;目标产业的数量至少为两个。Among them, the degree of data correlation refers to an index used to measure the degree of correlation between two target industries in terms of production factors and economic factors. The target industry refers to the industry used to construct the industrial chain map; the number of target industries is at least two.

具体地,若用户需要分析不同产业之间的关联关系,可以向终端发送产业链图谱查看请求,则终端可以从计量电网平台的计量自动化系统中,获得与产业链图谱查看请求关联的一个或多个目标产业的产业用电数据。在获取到的产业用电数据为结构化的数据库表形式的情况下,终端可以对该产业用电数据进行数据清洗和数据校验处理,得到处理后的产业用电数据。终端还可以对记载有产业生产数据的统计网站进行爬虫处理,以从该统计网站上获取到至少两个目标产业的产业生产数据;在获取到的产业生产数据为半结构化形式的情况下,将该半结构化形式的产业生产数据转换为结构化的产业生产数据,然后对该结构化的产业生产数据进行数据清洗和数据校验处理,得到处理后的产业生产数据。进而终端根据各个目标产业的处理后的产业用电数据和处理后的产业生产数据,对至少两个目标产业进行聚类,得到至少两个目标产业的产业分类结果,根据至少两个目标产业在产业分类结果中类间或类内的距离的远近程度,来确定各个目标产业两两之间的数据关联度。Specifically, if the user needs to analyze the relationship between different industries, he can send an industry chain graph viewing request to the terminal, and the terminal can obtain one or more information associated with the industrial chain graph viewing request from the metering automation system of the metering grid platform. Industrial electricity consumption data of a target industry. When the acquired industrial power consumption data is in the form of a structured database table, the terminal can perform data cleaning and data verification processing on the industrial power consumption data to obtain processed industrial power consumption data. The terminal can also perform crawler processing on the statistical website that records the industrial production data, so as to obtain the industrial production data of at least two target industries from the statistical website; when the obtained industrial production data is in a semi-structured form, Convert the semi-structured industrial production data into structured industrial production data, and then perform data cleaning and data verification processing on the structured industrial production data to obtain processed industrial production data. Furthermore, the terminal clusters at least two target industries according to the processed industrial power consumption data and processed industrial production data of each target industry, and obtains the industry classification results of at least two target industries. The degree of distance between classes or within classes in the industry classification results is used to determine the degree of data correlation between each pair of target industries.

步骤S102,根据各个目标产业的名称信息之间的距离,确定各个目标产业之间的名称关联度。Step S102, according to the distance between the name information of each target industry, determine the degree of name association between each target industry.

其中,名称信息是指目标产业的产业名称。名称关联度是指度量目标产业两两之间在名称方面的关联程度的指标。Wherein, the name information refers to the industry name of the target industry. The degree of name association refers to an indicator to measure the degree of association between two target industries in terms of names.

具体地,终端根据各个目标产业的名称信息,计算得到各个目标产业的名称信息之间的距离。各个目标产业的名称信息之间的距离越小,各个目标产业之间的名称关联度越大;反之,各个目标产业的名称信息之间的距离越大,各个目标产业之间的名称关联度越小。Specifically, the terminal calculates the distance between the name information of each target industry according to the name information of each target industry. The smaller the distance between the name information of each target industry, the greater the name correlation between each target industry; on the contrary, the larger the distance between the name information of each target industry, the greater the name correlation between each target industry. Small.

步骤S103,根据各个目标产业的产业关联信息中的实体关系信息,确定各个目标产业之间的关系关联度。Step S103, according to the entity relationship information in the industry association information of each target industry, determine the relationship degree between each target industry.

其中,产业关联信息是可能包含有产业的上下游关系的文本数据。实体关系信息是指用于描述实体间(例如两个产业之间)的上下游关系的信息。关系关联度指度量目标产业两两之间在产业链中的上下游关系方面的关联程度的指标。Among them, the industry-related information is text data that may include upstream and downstream relationships of industries. Entity relationship information refers to information used to describe the upstream and downstream relationship between entities (for example, between two industries). The degree of relational relation refers to the index that measures the degree of relation between two target industries in terms of the upstream and downstream relations in the industrial chain.

具体地,终端可以分别对与各个目标产业关联的行业年报、新闻报告、政策指南等文本进行文本提取处理,得到各个目标产业的产业关联信息;进而终端对各个目标产业的产业关联信息进行实体关系抽取处理,得到各个产业关联信息中的实体关系信息;根据各个实体关系信息之间的上游概率和下游概率,得到各个目标产业之间的关系关联度。Specifically, the terminal can perform text extraction processing on texts such as industry annual reports, news reports, and policy guidelines associated with each target industry, to obtain industry-related information of each target industry; The extraction process obtains the entity relationship information in each industry association information; according to the upstream probability and downstream probability between each entity relationship information, the relationship association degree between each target industry is obtained.

步骤S104,根据各个目标产业之间的数据关联度、名称关联度和关系关联度,生成各个目标产业对应的产业链图谱。Step S104, according to the data correlation degree, name correlation degree and relationship correlation degree between each target industry, generate an industry chain map corresponding to each target industry.

其中,产业链图谱是指用于可视化展示产业链的上游和下游的图形数据。Among them, the industrial chain map refers to the graphic data used to visually display the upstream and downstream of the industrial chain.

具体地,终端各个目标产业之间的数据关联度、名称关联度和关系关联度,确定各个产业之间的产业关联度;其中,产业关联度是指综合了数据关联度、名称关联度和关系关联度,用于综合评价不同产业之间的关联程度的指标。进而终端根据各个产业之间的产业关联度,确定各个目标产业对应的上游产业和下游产业,然后对各个目标产业对应的上游产业和下游产业进行可视化处理,得到与上述步骤S101中的产业链图谱查看请求对应的产业链图谱。终端在生成产业链图谱后,还可以展示产业链图谱,以供用户实时查看产业链图谱。Specifically, the degree of data association, name association, and relationship association between each target industry of the terminal determines the industry association between various industries; where, the industry association refers to the combination of data association, name association, and relationship Correlation degree is an index used to comprehensively evaluate the degree of correlation between different industries. Furthermore, the terminal determines the upstream industry and downstream industry corresponding to each target industry according to the degree of industrial correlation between each industry, and then performs visualization processing on the upstream industry and downstream industry corresponding to each target industry to obtain the industrial chain map in the above step S101 View the industrial chain map corresponding to the request. After the terminal generates the industrial chain map, it can also display the industrial chain map for users to view the industrial chain map in real time.

上述产业链生成方法中,根据各个目标产业的产业电量数据和产业生产数据,确定各个目标产业之间的数据关联度;根据各个目标产业的名称信息之间的距离,确定各个目标产业之间的名称关联度;根据各个目标产业的产业关联信息中的实体关系信息,确定各个目标产业之间的关系关联度;根据各个目标产业之间的数据关联度、名称关联度和关系关联度,生成各个目标产业对应的产业链图谱。采用本方法,通过各个目标产业的产业电量数据和产业生产数据来确定各个目标产业之间的数据关联度,通过各个目标产业的产业关联信息来确定各个目标产业之间的关系关联度,并结合各个目标产业之间的名称关联度来综合生成产业链图谱,解决了传统技术中构建产业链的数据维度单一的问题,使得生成得到的产业链图谱能够更全面的展示出各个目标产业之间在经济因素、生产因素等多方面因素上的关联,还无需依靠人工经验来建立产业链图谱,有效地提高了生成得到的产业链图谱的时效性。In the above industrial chain generation method, according to the industrial power data and industrial production data of each target industry, the data correlation degree between each target industry is determined; according to the distance between the name information of each target industry, the relationship between each target industry is determined. Name association degree; according to the entity relationship information in the industry association information of each target industry, determine the relationship association degree between each target industry; according to the data association degree, name association degree and relationship association degree between each target industry, generate each The industrial chain map corresponding to the target industry. Using this method, the data correlation degree between each target industry is determined through the industrial electricity data and industrial production data of each target industry, and the relationship correlation degree between each target industry is determined through the industry correlation information of each target industry, and combined with The name correlation between each target industry is used to comprehensively generate an industrial chain map, which solves the problem of single data dimension in building an industrial chain in traditional technology, so that the generated industrial chain map can more comprehensively show the relationship between various target industries. There is no need to rely on manual experience to establish the industrial chain map, which effectively improves the timeliness of the generated industrial chain map.

在一个实施例中,上述步骤S101,根据各个目标产业的产业电量数据和产业生产数据,确定各个目标产业之间的数据关联度,具体包括如下内容:根据各个目标产业的产业电量数据,得到各个目标产业的历史电量增长率;根据各个目标产业的产业生产数据,得到各个目标产业的历史生产增长率;根据历史电量增长率和历史生产增长率,对各个目标产业进行聚类处理,得到各个目标产业的产业分类结果;根据各个目标产业的产业分类结果,得到各个目标产业之间的数据关联度。In one embodiment, the above step S101, according to the industrial power data and industrial production data of each target industry, determines the data correlation between each target industry, specifically including the following content: according to the industrial power data of each target industry, get each The historical power growth rate of the target industry; according to the industrial production data of each target industry, the historical production growth rate of each target industry is obtained; according to the historical power growth rate and historical production growth rate, each target industry is clustered to obtain each target Industry classification results of industries: According to the industry classification results of each target industry, the data correlation between each target industry is obtained.

其中,历史电量增长率是指描述目标产业的电量数据在历史时间段的增长情况的数据;历史电量增长率包括但不限于是历史电量环比增长率和历史电量同比增长率。历史生产增长率是指描述目标产业的生产数据在历史时间段的增长情况的数据;历史生产增长率包括但不限于是历史生产环比增长率和历史生产同比增长率。Among them, the historical electricity growth rate refers to the data describing the growth of the electricity data of the target industry in the historical time period; the historical electricity growth rate includes but is not limited to the historical electricity chain growth rate and the historical electricity year-on-year growth rate. The historical production growth rate refers to the data describing the growth of the production data of the target industry in the historical time period; the historical production growth rate includes but is not limited to the historical production chain growth rate and the historical production year-on-year growth rate.

具体地,终端根据各个目标产业的历史产业电量数据,得到各个目标产业的历史电量环比增长率和历史电量同比增长率;根据各个目标产业的历史产业生产数据,得到各个目标产业的历史生产环比增长率和历史生产同比增长率;进而终端对各个目标产业的历史电量环比增长率、历史电量同比增长率、历史生产环比增长率和历史生产同比增长率进行聚类处理,可以是对各个目标产业的历史电量环比增长率、历史电量同比增长率、历史生产环比增长率和历史生产同比增长率进行K均值(K-means)聚类处理,则终端得到各个目标产业的产业分类结果。终端根据各个目标产业的产业分类结果,得到各个目标产业之间的数据关联度,可以是首先为各个目标产业之间设置相同的初始数据关联度,然后对处于同一产业分类中的两两目标产业之间的初始数据关联度进行增加,对处于不同产业分类中的两两目标产业之间的初始数据关联度进行减少,得到两两目标产业之间的更新后数据关联度;接着获取处于同一产业分类中的目标产业两两之间的距离,按照距离从大到小的顺序,对处于同一产业分类中的目标产业两两之间的更新后数据关联度进行递增,得到目标产业两两之间的数据关联度,即处于同一产业分类中的目标产业两两之间的距离越小,目标产业两两之间的数据关联度越大。Specifically, the terminal obtains the historical power consumption growth rate and the historical power consumption growth rate of each target industry according to the historical industry power data of each target industry; according to the historical industrial production data of each target industry, the historical production chain growth rate of each target industry is obtained Rate and historical production year-on-year growth rate; and then the terminal clusters the historical power chain growth rate, historical power year-on-year growth rate, historical production chain growth rate and historical production year-on-year growth rate of each target industry, which can be a method for each target industry The historical power chain growth rate, historical power year-on-year growth rate, historical production chain growth rate and historical production year-on-year growth rate are clustered by K-means, and the terminal obtains the industry classification results of each target industry. According to the industry classification results of each target industry, the terminal obtains the data correlation degree between each target industry. It can first set the same initial data correlation degree for each target industry, and then set the same initial data correlation degree for each target industry, and then pair two target industries in the same industry classification Increase the initial data correlation degree between two target industries in different industry classifications, reduce the initial data correlation degree between two target industries in different industry classifications, and obtain the updated data correlation degree between two target industries; The distance between two pairs of target industries in the classification, according to the order of the distance from large to small, the updated data correlation degree between the target industries in the same industry classification is increased, and the distance between the target industries is obtained. The data correlation degree of the target industry, that is, the smaller the distance between the target industries in the same industry classification, the greater the data correlation degree between the target industries.

本实施例中,通过对各个目标产业的历史电量增长率和历史生产增长率进行聚类处理,得到各个目标产业的产业分类结果;进而根据各个目标产业的产业分类结果,得到各个目标产业之间的数据关联度,结合目标产业的历史电量增长率和历史生产增长率,来分析出目标产业之间的数据关联度,能够挖掘出产业链的上下游产业在电量和生产方面的规律,使得生成得到的产业链图谱能够更全面、更精细化的展示出各个目标产业之间在经济因素、生产因素等多方面因素上的关联。In this embodiment, the industry classification results of each target industry are obtained by clustering the historical electricity growth rate and historical production growth rate of each target industry; Combined with the historical power growth rate and historical production growth rate of the target industry to analyze the data correlation degree between the target industries, it can dig out the laws of the upstream and downstream industries in the industrial chain in terms of power consumption and production, so that the generated The obtained industrial chain map can more comprehensively and finely show the relationship between various target industries in terms of economic factors, production factors and other factors.

在一个实施例中,上述步骤S102,根据各个目标产业的名称信息之间的距离,确定各个目标产业之间的名称关联度,具体包括如下内容:在检测到第一目标产业的名称信息与第二目标产业的名称信息不同的情况下,获取将第一目标产业的名称信息更新为第二目标产业的名称信息所需的文本更新次数;第一目标产业为各个目标产业中任意一个目标产业;第二目标产业为各个目标产业中除第一目标产业以外的任意一个目标产业;根据文本更新次数,得到第一目标产业的名称信息和第二目标产业的名称信息之间的距离;根据距离,得到第一目标产业和第二目标产业之间的名称关联度,作为各个目标产业之间的名称关联度。In one embodiment, the above step S102, according to the distance between the name information of each target industry, determines the degree of name association between each target industry, which specifically includes the following content: after detecting the name information of the first target industry and the second If the name information of the two target industries is different, obtain the number of text updates required to update the name information of the first target industry to the name information of the second target industry; the first target industry is any one of the target industries; The second target industry is any target industry in each target industry except the first target industry; according to the number of text updates, the distance between the name information of the first target industry and the name information of the second target industry is obtained; according to the distance, The name association degree between the first target industry and the second target industry is obtained as the name association degree between each target industry.

其中,文本更新次数是指按照第二目标产业的名称信息对将第一目标产业的名称信息进行更新处理,更新处理最小所需次数。Wherein, the number of times of updating the text refers to updating the name information of the first target industry according to the name information of the second target industry, and the minimum number of times required for update processing.

具体地,终端将所有的目标产业中任意一个目标产业作为第一目标产业,同时将所有的目标产业中除第一目标产业以外的任意一个目标产业作为第二目标产业;进而检测获取到的第一目标产业的产业名称和获取到的第二目标产业的产业名称是否相同;在检测到第一目标产业的名称信息与第二目标产业的名称信息不同的情况下,终端按照第二目标产业的名称信息中的每个字符,对第一目标产业的名称信息中与第二目标产业的名称信息不同的字符进行字符转换,其中,第一目标产业的名称信息中与第二目标产业的名称信息相同的字符无需进行字符转换,得到将第一目标产业的名称信息完全更新为第二目标产业的名称信息最小所需的字符转换次数,将字符转换次数作为文本更新次数。Specifically, the terminal takes any target industry in all target industries as the first target industry, and at the same time takes any target industry in all target industries except the first target industry as the second target industry; and then detects the obtained second target industry Whether the industry name of the first target industry is the same as the acquired industry name of the second target industry; if it is detected that the name information of the first target industry is different from the name information of the second target industry, the terminal For each character in the name information, character conversion is performed on the characters in the name information of the first target industry that are different from the name information of the second target industry, wherein the name information of the first target industry is different from the name information of the second target industry The same characters do not need to be converted, and the minimum number of character conversions required to completely update the name information of the first target industry to the name information of the second target industry is obtained, and the number of character conversions is used as the number of text updates.

进一步地,终端按照文本更新次数从小到大的顺序,对第一目标产业的名称信息和第二目标产业的名称信息之间的初始距离进行递增,得到第一目标产业的名称信息和第二目标产业的名称信息之间的距离,即第一目标产业的名称信息和第二目标产业的名称信息之间的文本更新次数越少,第一目标产业的名称信息和第二目标产业的名称信息之间的距离越近。终端按照第一目标产业的名称信息和第二目标产业的名称信息之间的距离从大到小的顺序,对第一目标产业和第二目标产业之间的初始名称关联度进行递增,得到第一目标产业和第二目标产业之间的名称关联度,即第一目标产业的名称信息和第二目标产业的名称信息之间的距离越小,第一目标产业和第二目标产业之间的名称关联度越大,各个目标产业之间的名称关联度。Further, the terminal increments the initial distance between the name information of the first target industry and the name information of the second target industry in ascending order of the number of text updates to obtain the name information of the first target industry and the name information of the second target industry The distance between the name information of the industry, that is, the less the number of text updates between the name information of the first target industry and the name information of the second target industry, the greater the distance between the name information of the first target industry and the name information of the second target industry. The closer the distance between. According to the descending order of the distance between the name information of the first target industry and the name information of the second target industry, the terminal increases the initial name association degree between the first target industry and the second target industry, and obtains the first The name correlation degree between the first target industry and the second target industry, that is, the smaller the distance between the name information of the first target industry and the name information of the second target industry, the smaller the distance between the first target industry and the second target industry. The greater the degree of name association, the degree of name association between various target industries.

举例说明,假设目标产业A的名称信息是黑色金属矿采选业,目标产业B的名称信息是黑色金属冶炼与压延加工业,目标产业C的名称信息是热力生产和供应业;若是将目标产业A的名称信息转换为目标产业B的名称信息,则需将目标产业A的“矿采选”转换为“冶炼与压延加工”,最少需要对7个字符进行字符转换,因此,目标产业B的名称信息与目标产业A的名称信息之间的距离为7;若是将目标产业C的名称信息转换为目标产业B的名称信息,则需将目标产业C中的“热力生产和供应”转换为“黑色金属冶炼与压延加工”,最少需要对11个字符进行字符转换,因此,目标产业B的名称信息与目标产业C的名称信息之间的距离为11,所以目标产业B与目标产业A之间的名称关联度,大于目标产业B与目标产业C之间的名称关联度。For example, assume that the name information of target industry A is ferrous metal mining and dressing industry, the name information of target industry B is ferrous metal smelting and rolling processing industry, and the name information of target industry C is heat production and supply industry; if the target industry To convert the name information of target industry B into the name information of target industry B, it is necessary to convert the "mining and dressing" of target industry A into "smelting and rolling processing". At least 7 characters need to be converted. Therefore, the target industry B's The distance between the name information and the name information of the target industry A is 7; if the name information of the target industry C is converted into the name information of the target industry B, the "heat production and supply" in the target industry C needs to be converted into " Ferrous metal smelting and rolling processing", at least 11 characters need to be converted. Therefore, the distance between the name information of target industry B and the name information of target industry C is 11, so the distance between target industry B and target industry A The name association degree of is greater than the name association degree between target industry B and target industry C.

在本实施例中,在检测到第一目标产业的名称信息与第二目标产业的名称信息不同的情况下,获取将第一目标产业的名称信息更新为第二目标产业的名称信息最小所需的文本更新次数;进而根据文本更新次数,得到第一目标产业的名称信息和第二目标产业的名称信息之间的距离;根据距离,得到第一目标产业和第二目标产业之间的名称关联度,作为各个目标产业之间的名称关联度,依据相近产业在名称上也较为相似的特性,获取到了各个目标产业之间的名称关联度,从而后续能够利用名称关联度来确定各个目标产业之间的产业关联度,有利于提高生成的产业链图谱的准确性。In this embodiment, when it is detected that the name information of the first target industry is different from the name information of the second target industry, the minimum required information for updating the name information of the first target industry to the name information of the second target industry is acquired. The number of text updates; then according to the number of text updates, the distance between the name information of the first target industry and the name information of the second target industry is obtained; according to the distance, the name association between the first target industry and the second target industry is obtained As the degree of name association between various target industries, based on the characteristics that similar industries have relatively similar names, the degree of name association between various target industries is obtained, so that the degree of name association can be used to determine the relationship between each target industry. The degree of industrial correlation between them is conducive to improving the accuracy of the generated industrial chain map.

在一个实施例中,上述步骤S103,根据各个目标产业的产业关联信息中的实体关系信息,确定各个目标产业之间的关系关联度,具体包括如下内容:对各个目标产业的产业关联信息进行实体关系提取处理,得到各个产业关联信息中的实体关系信息;根据产业关系字典库中的产业之间的上下游关系,对实体关系信息中的目标产业进行上游概率判断和下游概率判断,得到实体关系信息中的目标产业的上游概率和下游概率;根据实体关系信息中的目标产业的上游概率和下游概率,得到各个目标产业之间的关系关联度。In one embodiment, the above step S103, according to the entity relationship information in the industry association information of each target industry, determines the degree of relationship association between each target industry, specifically including the following content: Entity is carried out on the industry association information of each target industry Relational extraction process to obtain the entity relationship information in the associated information of each industry; according to the upstream and downstream relationship between industries in the industrial relationship dictionary database, carry out upstream probability judgment and downstream probability judgment on the target industry in the entity relationship information, and obtain the entity relationship The upstream probability and downstream probability of the target industry in the information; according to the upstream probability and downstream probability of the target industry in the entity relationship information, the relationship degree between each target industry is obtained.

其中,产业关系字典库是指根据常规的产业之间的上下游关系构建得到的字典库。上游概率是指产业位于产业链中的上游的概率。下游概率是指产业位于产业链中的下游的概率。Wherein, the industrial relationship dictionary refers to a dictionary constructed according to the conventional upstream and downstream relationships between industries. Upstream probability refers to the probability that an industry is located upstream in the industry chain. Downstream probability refers to the probability that an industry is located downstream in the industry chain.

具体地,终端对各个目标产业的产业关联信息进行实体提取处理,得到各个产业关联信息中的实体产业信息;进而终端对实体产业信息进行实体关系提取处理,可以是对实体产业信息进行基于规则的实体关系提取,则终端得到实体关系信息。终端根据产业关系字典库中产业之间的上下游关系,对实体关系信息中的目标产业进行上游概率判断,得到实体关系信息中的目标产业的上游概率,对实体关系信息中的目标产业进行下游概率判断,得到实体关系信息中的目标产业的下游概率;根据实体关系信息中的目标产业的上游概率和下游概率,可以是综合实体关系信息中两个目标产业的上游概率和下游概率,来得到目标产业两两之间的关系关联度。Specifically, the terminal performs entity extraction processing on the industry-related information of each target industry to obtain the entity industry information in each industry-related information; furthermore, the terminal performs entity relationship extraction processing on the entity industry information, which may be a rule-based Entity relationship extraction, the terminal obtains entity relationship information. According to the upstream and downstream relationship between industries in the industrial relationship dictionary, the terminal judges the upstream probability of the target industry in the entity relationship information, obtains the upstream probability of the target industry in the entity relationship information, and performs downstream analysis of the target industry in the entity relationship information. Probability judgment, to obtain the downstream probability of the target industry in the entity relationship information; according to the upstream probability and downstream probability of the target industry in the entity relationship information, the upstream probability and downstream probability of the two target industries in the entity relationship information can be combined to obtain The degree of correlation between the target industries.

举例说明,假设获取到的产业关联信息“新闻报道印尼2021全年出口煤炭4亿吨,其中有超过1.8亿吨销往中国,占到总出口量61%以上,占中国总进口煤炭的75%。事件主要涉及关联煤炭开采和洗选业,致使1-3月期间能耗指标增长234MW/万元,同比3%;次级关联行业电力、热力生产和供应业,致使1-3月期间能耗指标增长164MW/万元,同比5%”,则可以对该产业关联信息进行实体提取处理,得到实体产业信息为“煤炭开采和洗选业”和“行业电力、热力生产和供应业”,且“煤炭开采和洗选业”和“行业电力、热力生产和供应业”之间存在实体关系,根据产业关系字典库中产业之间的上下游关系,对“煤炭开采和洗选业”和“行业电力、热力生产和供应业”进行上游概率判断和下游概率判断,得到“行业电力、热力生产和供应业”的下游概率高于“煤炭开采和洗选业”,同时“煤炭开采和洗选业”的上游概率高于“行业电力、热力生产和供应业”,即可认为“行业电力、热力生产和供应业”有较大概率是“煤炭开采和洗选业”的下游。For example, assume that the obtained industry-related information "News reports that Indonesia will export 400 million tons of coal in 2021, of which more than 180 million tons will be sold to China, accounting for more than 61% of the total export volume and 75% of China's total coal imports The incident mainly involved related coal mining and washing industries, resulting in an increase of 234MW/10,000 yuan in energy consumption indicators during the period from January to March, a year-on-year increase of 3%; If the consumption index increased by 164MW/10,000 yuan, 5% year-on-year", then the entity extraction process of the industry-related information can be carried out, and the entity industry information can be obtained as "coal mining and washing industry" and "industrial power, heat production and supply industry". Moreover, there is an entity relationship between "coal mining and washing industry" and "industry electric power, heat production and supply industry". The upstream probability judgment and downstream probability judgment of "industry electric power, heat power production and supply industry" are judged, and the downstream probability of "industry electric power, heat power production and supply industry" is higher than that of "coal mining and washing industry", while "coal mining and washing industry The upstream probability of the “selection industry” is higher than that of the “industrial electricity, heat production and supply industry”, which means that the “industrial electricity, heat production and supply industry” has a higher probability of being the downstream of the “coal mining and washing industry”.

在本实施例中,通过根据产业关系字典库中的产业之间的上下游关系,对各个目标产业的产业关联信息中提取出的实体关系信息中的目标产业进行上游概率判断和下游概率判断,得到实体关系信息中的目标产业的上游概率和下游概率;进而根据实体关系信息中的目标产业的上游概率和下游概率,得到各个目标产业之间的关系关联度,无需专家等人员依靠人工经验判断各个目标产业的上下游关系,能够从产业关联信息中科学的提取出各个产业之间的关系关联度,有效地提高了产业链图谱的生成效率。In this embodiment, according to the upstream and downstream relationships between industries in the industrial relationship dictionary, the target industry in the entity relationship information extracted from the industry association information of each target industry is judged by upstream probability and downstream probability, Obtain the upstream probability and downstream probability of the target industry in the entity relationship information; then, according to the upstream probability and downstream probability of the target industry in the entity relationship information, obtain the relationship correlation between each target industry, without the need for experts and other personnel to rely on manual experience to judge The upstream and downstream relationships of each target industry can scientifically extract the degree of relationship between industries from the industry-related information, effectively improving the efficiency of generating industrial chain maps.

在一个实施例中,上述步骤S104,根据各个目标产业之间的数据关联度、名称关联度和关系关联度,生成各个目标产业对应的产业链图谱,具体包括如下内容:根据各个目标产业之间的数据关联度、名称关联度和关系关联度,确定各个目标产业之间的产业关联概率;根据上游概率、下游概率和产业关联概率,确定各个目标产业对应的上下游产业;对各个目标产业对应的上下游产业进行可视化图形处理,得到产业链图谱。In one embodiment, the above step S104 generates an industry chain map corresponding to each target industry according to the data association degree, name association degree, and relationship association degree among each target industry, specifically including the following content: According to the relationship between each target industry According to the data correlation degree, name correlation degree and relationship correlation degree, determine the industry correlation probability between each target industry; determine the upstream and downstream industries corresponding to each target industry according to the upstream probability, downstream probability and industry correlation probability; The upstream and downstream industries are visualized and processed to obtain the industrial chain map.

其中,产业关联概率是指描述两个目标产业之间存在关联的概率。Among them, the industry correlation probability refers to the probability of describing the correlation between two target industries.

具体地,终端综合各个目标产业之间的数据关联度、名称关联度和关系关联度,可以是对各个目标产业之间的数据关联度、名称关联度和关系关联度进行加权,还可以是仅依据数据关联度、名称关联度和关系关联度中任意一个单方面的关联度,则终端各个目标产业之间的产业关联概率;终端对各个目标产业之间的产业关联概率、上游概率和下游概率进行矩阵处理,得到关联度矩阵;根据关联度矩阵,确定各个目标产业对应的上下游产业;对各个目标产业对应的上下游产业进行可视化图形处理,可以是利用图形数据库构建各个目标产业的可视化节点,并按照各个目标产业对应的上下游产业,对可视化节点进行连接,得到产业链图谱;例如,利用neo4j(一种NOSQL图形数据库)构建各个目标产业的可视化节点,并按照各个目标产业对应的上下游产业,对可视化节点进行连接,得到产业链图谱,产业链图谱的示意图如图2所示。Specifically, the terminal integrates the data association degree, name association degree and relationship association degree between each target industry, and may weight the data association degree, name association degree and relationship association degree between each target industry, or may only According to any unilateral correlation degree among data correlation degree, name correlation degree and relationship correlation degree, the industry correlation probability between each target industry of the terminal; the industry correlation probability, upstream probability and downstream probability of each target industry between the terminal Perform matrix processing to obtain a correlation matrix; determine the upstream and downstream industries corresponding to each target industry according to the correlation matrix; perform visual graphics processing on the upstream and downstream industries corresponding to each target industry, which can be used to build visual nodes for each target industry by using a graph database , and connect the visualization nodes according to the upstream and downstream industries corresponding to each target industry to obtain the industrial chain map; for example, use neo4j (a NOSQL graph database) to construct the visualization nodes of each target industry, and according to the upstream For downstream industries, connect the visualized nodes to obtain the industrial chain map. The schematic diagram of the industrial chain map is shown in Figure 2.

在本实施例中,通过各个目标产业之间的数据关联度、名称关联度和关系关联度,确定各个目标产业之间的产业关联概率;进而根据上游概率、下游概率和产业关联概率,确定各个目标产业对应的上下游产业;对各个目标产业对应的上下游产业进行可视化图形处理,得到产业链图谱,使得获取到的产业链图谱能够反映出各个目标产业之间的数据关联度、名称关联度和关系关联度,提高了获取到的产业链图谱的关系面,还无需依靠人工经验建立产业链图谱,有效地提高了产业链图谱的生成效率。In this embodiment, the industry association probability between each target industry is determined through the data association degree, name association degree and relationship association degree between each target industry; and then according to the upstream probability, downstream probability and industry association probability, each The upstream and downstream industries corresponding to the target industry; Visual graphics processing is performed on the upstream and downstream industries corresponding to each target industry to obtain an industrial chain map, so that the obtained industrial chain map can reflect the data correlation degree and name correlation degree between each target industry The degree of correlation with the relationship improves the relationship surface of the obtained industrial chain map, and does not need to rely on manual experience to establish the industrial chain map, which effectively improves the generation efficiency of the industrial chain map.

在一个实施例中,如图3所示,在根据各个目标产业之间的数据关联度、名称关联度和关系关联度,生成各个目标产业对应的产业链图谱之后,还包括:In one embodiment, as shown in Figure 3, after generating the industry chain map corresponding to each target industry according to the data association degree, name association degree and relationship association degree between each target industry, it also includes:

步骤S301,获取产业链图谱中各个目标产业在当前时间段的当前电量数据。Step S301, obtaining the current electricity data of each target industry in the industry chain map in the current time period.

步骤S302,根据各个目标产业的当前电量数据,得到各个目标产业的当前电量增长率。Step S302, according to the current power data of each target industry, the current power growth rate of each target industry is obtained.

步骤S303,对各个目标产业的当前电量增长率和历史电量增长率进行同比评估和环比评估,得到各个目标产业的评估结果。In step S303, year-on-year and ring-to-ring evaluations are performed on the current electricity growth rate and the historical electricity growth rate of each target industry to obtain the evaluation results of each target industry.

步骤S304,在评估结果为异常的情况下,在产业链图谱中对评估结果对应的目标产业进行上下游更新,得到当前时间段的产业链图谱。Step S304, if the evaluation result is abnormal, update the target industry corresponding to the evaluation result upstream and downstream in the industry chain map to obtain the industry chain map of the current time period.

其中,当前电量增长率是指描述目标产业的电量数据在历史时间段的增长情况的数据。当前电量增长率包括但不限于是当前电量环比增长率和当前电量环比增长率。Among them, the current electricity growth rate refers to the data describing the growth of the electricity data of the target industry in the historical time period. The current power growth rate includes but is not limited to the current power chain growth rate and the current power chain chain growth rate.

具体地,终端在上述步骤S104中生成得到产业链图谱之后,终端获取产业链图谱中各个目标产业在当前时间段的当前电量数据,然后根据各个目标产业的当前电量数据和在历史时间段的产业电量数据,计算得到各个目标产业在当前时间段的当前电量环比增长率和当前电量环比增长率。终端对各个目标产业的当前电量环比增长率和历史电量环比增长率进行环比评估,得到各个目标产业的电量环比评估结果;对各个目标产业的当前电量同比增长率和历史电量同比增长率进行同比评估,得到各个目标产业的电量同比评估结果;终端综合电量环比评估结果和电量环比评估结果得到各个目标产业的评估结果,或者根据电量环比评估结果或电量环比评估结果中任意一个评估结果,确定各个目标产业的评估结果;在评估结果为异常的情况下,在产业链图谱中对评估结果为异常的目标产业进行上下游更新,得到当前时间段更新后的产业链图谱。Specifically, after the terminal generates the industry chain map in the above step S104, the terminal obtains the current power data of each target industry in the current time period in the industry chain map, and then according to the current power data of each target industry and the industry in the historical time period Power data, calculate the current power chain growth rate and the current power chain growth rate of each target industry in the current time period. The terminal conducts a quarter-on-quarter evaluation of the current power chain growth rate and historical power chain growth rate of each target industry, and obtains the power chain evaluation results of each target industry; conducts a year-on-year evaluation of the current power year-on-year growth rate and historical power year-on-year growth rate of each target industry , to get the year-on-year evaluation results of each target industry; the terminal comprehensive power chain evaluation results and the power chain evaluation results get the evaluation results of each target industry, or determine each target according to the power chain evaluation results or any one of the power chain evaluation results The evaluation result of the industry; if the evaluation result is abnormal, update the upstream and downstream of the target industry whose evaluation result is abnormal in the industrial chain map, and obtain the updated industrial chain map of the current time period.

在本实施例中,根据产业链图谱中各个目标产业在当前时间段的当前电量数据,得到各个目标产业的当前电量增长率;进而对各个目标产业的当前电量增长率和历史电量增长率进行同比评估和环比评估,得到各个目标产业的评估结果;在评估结果为异常的情况下,在产业链图谱中对评估结果对应的目标产业进行上下游更新,得到当前时间段的产业链图谱,能够及时发现产业链图谱中存在电量波动异常的产业,实现了对产业链图谱的实时筛查和自动更新,从而进一步地提高了产业链图谱的时效性,还能提高产业链图谱的准确性。In this embodiment, according to the current power data of each target industry in the current time period in the industry chain map, the current power growth rate of each target industry is obtained; and then the current power growth rate and historical power growth rate of each target industry are compared Evaluation and ring-to-ring evaluation to obtain the evaluation results of each target industry; when the evaluation results are abnormal, update the target industry corresponding to the evaluation results in the industrial chain map to obtain the industrial chain map of the current time period, which can be timely It is found that there are industries with abnormal power fluctuations in the industrial chain map, which realizes real-time screening and automatic update of the industrial chain map, thereby further improving the timeliness of the industrial chain map and improving the accuracy of the industrial chain map.

在一个实施例中,如图4所示,提供了另一种产业链生成方法,以该方法应用于终端为例进行说明,包括以下步骤:In one embodiment, as shown in FIG. 4 , another industry chain generation method is provided. The method is applied to a terminal as an example for illustration, including the following steps:

步骤S401,根据各个目标产业的产业电量数据,得到各个目标产业的历史电量增长率;根据各个目标产业的产业生产数据,得到各个目标产业的历史生产增长率。Step S401, according to the industrial electricity data of each target industry, obtain the historical electricity growth rate of each target industry; according to the industrial production data of each target industry, obtain the historical production growth rate of each target industry.

步骤S402,根据历史电量增长率和历史生产增长率,对各个目标产业进行聚类处理,得到各个目标产业的产业分类结果;根据各个目标产业的产业分类结果,得到各个目标产业之间的数据关联度。Step S402, according to the historical electricity growth rate and historical production growth rate, perform clustering processing on each target industry to obtain the industry classification results of each target industry; according to the industry classification results of each target industry, obtain the data association between each target industry Spend.

步骤S403,在检测到第一目标产业的名称信息与第二目标产业的名称信息不同的情况下,获取将第一目标产业的名称信息更新为第二目标产业的名称信息所需的文本更新次数。Step S403, if it is detected that the name information of the first target industry is different from the name information of the second target industry, obtain the number of text updates required to update the name information of the first target industry to the name information of the second target industry .

其中,第一目标产业为各个目标产业中任意一个目标产业;第二目标产业为各个目标产业中除第一目标产业以外的任意一个目标产业。Among them, the first target industry is any target industry in each target industry; the second target industry is any target industry in each target industry except the first target industry.

步骤S404,根据文本更新次数,得到第一目标产业的名称信息和第二目标产业的名称信息之间的距离;根据距离,得到第一目标产业和第二目标产业之间的名称关联度,作为各个目标产业之间的名称关联度。Step S404, according to the number of text updates, the distance between the name information of the first target industry and the name information of the second target industry is obtained; according to the distance, the name correlation degree between the first target industry and the second target industry is obtained as The degree of name association between each target industry.

步骤S405,对各个目标产业的产业关联信息进行实体关系提取处理,得到各个产业关联信息中的实体关系信息。In step S405, the entity relationship extraction process is performed on the industry-related information of each target industry, and the entity relationship information in each industry-related information is obtained.

步骤S406,根据产业关系字典库中的产业之间的上下游关系,对实体关系信息中的目标产业进行上游概率判断和下游概率判断,得到实体关系信息中的目标产业的上游概率和下游概率。Step S406, according to the upstream and downstream relationship between industries in the industrial relationship dictionary database, perform upstream probability judgment and downstream probability judgment on the target industry in the entity relationship information, and obtain the upstream probability and downstream probability of the target industry in the entity relationship information.

步骤S407,根据实体关系信息中的目标产业的上游概率和下游概率,得到各个目标产业之间的关系关联度。Step S407, according to the upstream probability and downstream probability of the target industry in the entity relationship information, the degree of relational correlation between each target industry is obtained.

步骤S408,根据各个目标产业之间的数据关联度、名称关联度和关系关联度,确定各个目标产业之间的产业关联概率;根据上游概率、下游概率和产业关联概率,确定各个目标产业对应的上下游产业。Step S408, according to the data correlation degree, name correlation degree and relationship correlation degree between each target industry, determine the industry correlation probability between each target industry; according to the upstream probability, downstream probability and industry correlation probability, determine the corresponding upstream and downstream industries.

步骤S409,对各个目标产业对应的上下游产业进行可视化图形处理,得到产业链图谱。Step S409, performing visual graphic processing on the upstream and downstream industries corresponding to each target industry to obtain an industrial chain map.

上述产业链生成方法,能够实现以下有益效果:通过各个目标产业的产业电量数据和产业生产数据来确定各个目标产业之间的数据关联度,通过各个目标产业的产业关联信息来确定各个目标产业之间的关系关联度,并结合各个目标产业之间的名称关联度来综合生成产业链图谱,解决了传统技术中构建产业链的数据维度单一的问题,使得生成得到的产业链图谱能够更全面的展示出各个目标产业在经济因素、生产因素和文本因素等多方面因素上的关联,还无需依靠人工经验建立产业链图谱,有效地提高了生成得到的产业链图谱的时效性。The above industrial chain generation method can achieve the following beneficial effects: determine the data correlation between each target industry through the industrial power data and industrial production data of each target industry, and determine the relationship between each target industry through the industry related information of each target industry. The degree of relationship between them, combined with the degree of name association between each target industry to comprehensively generate an industrial chain map, which solves the problem of single data dimensions for building an industrial chain in traditional technologies, and enables the generated industrial chain map to be more comprehensive. It shows the correlation of various target industries in terms of economic factors, production factors and textual factors, and does not need to rely on manual experience to establish an industrial chain map, which effectively improves the timeliness of the generated industrial chain map.

为了更清晰阐明本公开实施例提供的产业链生成方法,以下以一个具体的实施例对上述产业链生成方法进行具体说明。提供了又一种产业链生成方法,可以应用于终端,具体包括如下内容:In order to clarify the industry chain generation method provided by the embodiments of the present disclosure more clearly, a specific embodiment will be used below to describe the above industry chain generation method in detail. Another industry chain generation method is provided, which can be applied to terminals, and specifically includes the following content:

第一是数据获取环节:从计量自动化系统获取实时的产业电量数据,包括133类典型产业的用电量,复工复产情况等;从统计局网站通过爬虫获取133类典型产业的产业生产数据;从行业年报、新闻报告、政策指南等文本中获取133类典型产业133类典型产业的产业关联信息。The first is the data acquisition link: obtain real-time industrial electricity data from the metering automation system, including the electricity consumption of 133 typical industries, the resumption of work and production, etc.; obtain the industrial production data of 133 typical industries through crawlers from the website of the Bureau of Statistics; Obtain the industry-related information of 133 typical industries in 133 types of typical industries from industry annual reports, news reports, policy guidelines and other texts.

第二是数据处理环节:1)从计量自动化系统中获得的产业电量数据为结构化库表数据,对产业电量数据进行数据清洗和数据校验,得到处理后的产业电量数据,以确保处理后的产业电量数据的质量;通过neo4j图形数据库中的neo4j-import模块将处理后的产业电量数据转换为产业链图谱中的实体。2)从统计局网站获得的产业生产数据为半结构化数据,将产业生产数据转化为结构化形式产业生产数据,然后对产业生产数据进行数据清洗和数据校验,得到处理后的产业生产数据,以确保处理后的产业生产数据的质量;通过neo4j图形数据库中的neo4j-import模块将处理后的产业生产数据转换为产业链图谱中的实体。3)对产业关联信息进行产业特征提取处理,得到产业关联信息中的产业特征信息;根据产业关系字典库,对产业特征信息进行实体产业匹配,得到产业特征信息对应的实体产业信息。The second is the data processing link: 1) The industrial electricity data obtained from the metering automation system is structured database table data, data cleaning and data verification are performed on the industrial electricity data, and the processed industrial electricity data is obtained to ensure that the industrial electricity data after processing The quality of the industrial power data; through the neo4j-import module in the neo4j graph database, the processed industrial power data is converted into entities in the industrial chain map. 2) The industrial production data obtained from the website of the Bureau of Statistics is semi-structured data, and the industrial production data is transformed into a structured form of industrial production data, and then data cleaning and data verification are performed on the industrial production data to obtain the processed industrial production data , to ensure the quality of the processed industrial production data; through the neo4j-import module in the neo4j graph database, the processed industrial production data is converted into entities in the industrial chain map. 3) Extract the industrial characteristics of the industry-related information to obtain the industry characteristic information in the industry-related information; according to the industrial relationship dictionary database, perform entity industry matching on the industry characteristic information, and obtain the entity industry information corresponding to the industry characteristic information.

第三是综合产业关联度计算环节:1)终端根据各个目标产业的历史产业电量数据,得到各个目标产业的历史电量环比增长率和历史电量同比增长率;根据各个目标产业的历史产业生产数据,得到各个目标产业的历史生产环比增长率和历史生产同比增长率;对各个目标产业的历史电量环比增长率、历史电量同比增长率、历史生产环比增长率和历史生产同比增长率进行K均值聚类,则终端得到各个目标产业的产业分类结果;终端根据各个目标产业的产业分类结果,得到各个目标产业之间的数据关联度。2)根据获取到的各个目标产业的名称信息,确定将一个目标产业的名称信息转换为另一个目标产业的名称信息所需的最小文本更新次数,根据所需的最小文本更新次数,计算得到各个目标产业的名称信息之间的距离。根据距离计算得到这两个产业之间的名称关联度。3)对各个目标产业的产业关联信息进行实体关系提取处理,得到各个产业关联信息中的实体关系信息;根据实体关系信息中的目标产业的上游概率和下游概率,得到各个目标产业之间的关系关联度。The third is the calculation of the comprehensive industry correlation degree: 1) The terminal obtains the historical electricity chain growth rate and the historical electricity year-on-year growth rate of each target industry according to the historical industry electricity data of each target industry; according to the historical industry production data of each target industry, Get the historical production chain growth rate and historical production year-on-year growth rate of each target industry; perform K-means clustering on the historical power chain growth rate, historical power year-on-year growth rate, historical production chain growth rate and historical production year-on-year growth rate of each target industry , the terminal obtains the industry classification results of each target industry; the terminal obtains the data correlation degree between each target industry according to the industry classification results of each target industry. 2) According to the obtained name information of each target industry, determine the minimum number of text updates required to convert the name information of one target industry into the name information of another target industry, and calculate each The distance between the name information of the target industry. According to the distance calculation, the name association degree between the two industries is obtained. 3) Extract the entity relationship of the industry-related information of each target industry to obtain the entity relationship information in each industry-related information; according to the upstream probability and downstream probability of the target industry in the entity relationship information, obtain the relationship between each target industry Correlation.

第四是产业链生成环节:根据各个目标产业之间的数据关联度、名称关联度和关系关联度,确定各个目标产业之间的产业关联概率;对各个目标产业之间的产业关联概率、上游概率和下游概率进行矩阵处理,得到关联度矩阵,作为综合产业关联度;根据综合产业关联度确定各个目标产业对应的上下游产业;利用neo4j(一种NOSQL图形数据库)构建各个目标产业的可视化节点,并按照各个目标产业对应的上下游产业,对可视化节点进行连接,得到产业链图谱。The fourth is the industrial chain generation link: according to the data correlation degree, name correlation degree and relationship correlation degree between each target industry, determine the industry correlation probability between each target industry; Probability and downstream probability are processed in a matrix to obtain a correlation matrix as a comprehensive industry correlation; determine the upstream and downstream industries corresponding to each target industry according to the comprehensive industry correlation; use neo4j (a NOSQL graph database) to build visual nodes for each target industry , and according to the upstream and downstream industries corresponding to each target industry, connect the visualized nodes to obtain the industrial chain map.

在本实施例中,通过各个目标产业的产业电量数据和产业生产数据来确定各个目标产业之间的数据关联度,通过各个目标产业的产业关联信息来确定各个目标产业之间的关系关联度,并结合各个目标产业之间的名称关联度来综合生成产业链图谱,解决了传统技术中构建产业链的数据维度单一的问题,使得生成得到的产业链图谱能够更全面的展示出各个目标产业在经济因素、生产因素和文本因素等多方面因素上的关联,还无需依靠人工经验建立产业链图谱,有效地提高了生成得到的产业链图谱的时效性。In this embodiment, the data association degree between each target industry is determined through the industrial electricity data and industrial production data of each target industry, and the relationship association degree between each target industry is determined through the industry association information of each target industry. Combined with the name correlation between each target industry to comprehensively generate an industrial chain map, it solves the problem of single data dimension in building an industrial chain in traditional technology, so that the generated industrial chain map can more comprehensively display the status of each target industry. There is no need to rely on manual experience to establish an industrial chain map, which effectively improves the timeliness of the generated industrial chain map.

应该理解的是,虽然如上所述的各实施例所涉及的流程图中的各个步骤按照箭头的指示依次显示,但是这些步骤并不是必然按照箭头指示的顺序依次执行。除非本文中有明确的说明,这些步骤的执行并没有严格的顺序限制,这些步骤可以以其它的顺序执行。而且,如上所述的各实施例所涉及的流程图中的至少一部分步骤可以包括多个步骤或者多个阶段,这些步骤或者阶段并不必然是在同一时刻执行完成,而是可以在不同的时刻执行,这些步骤或者阶段的执行顺序也不必然是依次进行,而是可以与其它步骤或者其它步骤中的步骤或者阶段的至少一部分轮流或者交替地执行。It should be understood that although the steps in the flow charts involved in the above embodiments are shown sequentially according to the arrows, these steps are not necessarily executed sequentially in the order indicated by the arrows. Unless otherwise specified herein, there is no strict order restriction on the execution of these steps, and these steps can be executed in other orders. Moreover, at least some of the steps in the flow charts involved in the above-mentioned embodiments may include multiple steps or stages, and these steps or stages are not necessarily executed at the same time, but may be performed at different times For execution, the execution order of these steps or stages is not necessarily performed sequentially, but may be executed in turn or alternately with other steps or at least a part of steps or stages in other steps.

基于同样的发明构思,本申请实施例还提供了一种用于实现上述所涉及的产业链生成方法的产业链生成装置。该装置所提供的解决问题的实现方案与上述方法中所记载的实现方案相似,故下面所提供的一个或多个产业链生成装置实施例中的具体限定可以参见上文中对于产业链生成方法的限定,在此不再赘述。Based on the same inventive concept, an embodiment of the present application further provides an industry chain generation device for implementing the above-mentioned method for generating an industry chain. The solution to the problem provided by the device is similar to the implementation described in the above-mentioned method, so the specific limitations in one or more embodiments of the industrial chain generating device provided below can be referred to above for the industrial chain generating method limited and will not be repeated here.

在一个实施例中,如图5所示,提供了一种产业链生成装置500,包括:第一关联度确定模块501、第二关联度确定模块502、第三关联度确定模块503和产业链图谱生成模块504,其中:In one embodiment, as shown in FIG. 5 , an industry chain generation device 500 is provided, including: a first association degree determination module 501, a second association degree determination module 502, a third association degree determination module 503 and an industry chain Map generation module 504, wherein:

第一关联度确定模块501,用于根据各个目标产业的产业电量数据和产业生产数据,确定各个目标产业之间的数据关联度。The first correlation degree determination module 501 is configured to determine the data correlation degree between each target industry according to the industrial electricity data and industrial production data of each target industry.

第二关联度确定模块502,用于根据各个目标产业的名称信息之间的距离,确定各个目标产业之间的名称关联度。The second association degree determination module 502 is configured to determine the name association degree between each target industry according to the distance between the name information of each target industry.

第三关联度确定模块503,用于根据各个目标产业的产业关联信息中的实体关系信息,确定各个目标产业之间的关系关联度。The third association degree determination module 503 is configured to determine the relationship association degree between each target industry according to the entity relationship information in the industry association information of each target industry.

产业链图谱生成模块504,用于根据各个目标产业之间的数据关联度、名称关联度和关系关联度,生成各个目标产业对应的产业链图谱。The industry chain graph generation module 504 is used to generate an industry chain graph corresponding to each target industry according to the data correlation degree, name correlation degree and relationship correlation degree between each target industry.

在一个实施例中,第一关联度确定模块501,还用于根据各个目标产业的产业电量数据,得到各个目标产业的历史电量增长率;根据各个目标产业的产业生产数据,得到各个目标产业的历史生产增长率;根据历史电量增长率和历史生产增长率,对各个目标产业进行聚类处理,得到各个目标产业的产业分类结果;根据各个目标产业的产业分类结果,得到各个目标产业之间的数据关联度。In one embodiment, the first correlation degree determination module 501 is also used to obtain the historical electricity growth rate of each target industry according to the industrial electricity data of each target industry; Historical production growth rate; according to the historical electricity growth rate and historical production growth rate, each target industry is clustered to obtain the industry classification results of each target industry; according to the industry classification results of each target industry, the relationship between each target industry is obtained Data relevance.

在一个实施例中,第二关联度确定模块502,还用于在检测到第一目标产业的名称信息与第二目标产业的名称信息不同的情况下,获取将第一目标产业的名称信息更新为第二目标产业的名称信息所需的文本更新次数;第一目标产业为各个目标产业中任意一个目标产业;第二目标产业为各个目标产业中除第一目标产业以外的任意一个目标产业;根据文本更新次数,得到第一目标产业的名称信息和第二目标产业的名称信息之间的距离;根据距离,得到第一目标产业和第二目标产业之间的名称关联度,作为各个目标产业之间的名称关联度。In one embodiment, the second association degree determination module 502 is further configured to acquire and update the name information of the first target industry when it is detected that the name information of the first target industry is different from the name information of the second target industry. The number of text updates required for the name information of the second target industry; the first target industry is any target industry in each target industry; the second target industry is any target industry in each target industry except the first target industry; According to the number of text updates, the distance between the name information of the first target industry and the name information of the second target industry is obtained; according to the distance, the name correlation degree between the first target industry and the second target industry is obtained as each target industry The relationship between names.

在一个实施例中,第三关联度确定模块503,还用于对各个目标产业的产业关联信息进行实体关系提取处理,得到各个产业关联信息中的实体关系信息;根据产业关系字典库中的产业之间的上下游关系,对实体关系信息中的目标产业进行上游概率判断和下游概率判断,得到实体关系信息中的目标产业的上游概率和下游概率;根据实体关系信息中的目标产业的上游概率和下游概率,得到各个目标产业之间的关系关联度。In one embodiment, the third association degree determination module 503 is also used to extract the entity relationship from the industry association information of each target industry to obtain the entity relationship information in each industry association information; The upstream and downstream relationship between the target industry in the entity relationship information is judged by the upstream probability and downstream probability, and the upstream probability and downstream probability of the target industry in the entity relationship information are obtained; according to the upstream probability of the target industry in the entity relationship information and downstream probability to obtain the degree of relationship between each target industry.

在一个实施例中,产业链图谱生成模块504,还用于根据各个目标产业之间的数据关联度、名称关联度和关系关联度,确定各个目标产业之间的产业关联概率;根据上游概率、下游概率和产业关联概率,确定各个目标产业对应的上下游产业;对各个目标产业对应的上下游产业进行可视化图形处理,得到产业链图谱。In one embodiment, the industry chain map generation module 504 is also used to determine the industry association probability among each target industry according to the data association degree, name association degree and relationship association degree among each target industry; according to the upstream probability, The downstream probability and industry association probability determine the upstream and downstream industries corresponding to each target industry; perform visual graphic processing on the upstream and downstream industries corresponding to each target industry to obtain an industrial chain map.

在一个实施例中,产业链生成装置500还包括产业链更新模块,用于获取产业链图谱中各个目标产业在当前时间段的当前电量数据;根据各个目标产业的当前电量数据,得到各个目标产业的当前电量增长率;对各个目标产业的当前电量增长率和历史电量增长率进行同比评估和环比评估,得到各个目标产业的评估结果;在评估结果为异常的情况下,在产业链图谱中对评估结果对应的目标产业进行上下游更新,得到当前时间段的产业链图谱。In one embodiment, the industrial chain generation device 500 also includes an industrial chain update module, which is used to obtain the current electricity data of each target industry in the industry chain map in the current time period; according to the current electricity data of each target industry, obtain the The current electricity growth rate of each target industry; conduct year-on-year and ring-to-chain evaluations on the current electricity growth rate and historical electricity growth rate of each target industry, and obtain the evaluation results of each target industry; The target industry corresponding to the evaluation result is updated upstream and downstream to obtain the industrial chain map of the current time period.

上述产业链生成装置中的各个模块可全部或部分通过软件、硬件及其组合来实现。上述各模块可以硬件形式内嵌于或独立于计算机设备中的处理器中,也可以以软件形式存储于计算机设备中的存储器中,以便于处理器调用执行以上各个模块对应的操作。Each module in the above-mentioned industrial chain generating device can be fully or partially realized by software, hardware and a combination thereof. The above-mentioned modules can be embedded in or independent of the processor in the computer device in the form of hardware, and can also be stored in the memory of the computer device in the form of software, so that the processor can invoke and execute the corresponding operations of the above-mentioned modules.

在一个实施例中,提供了一种计算机设备,该计算机设备可以是终端,其内部结构图可以如图6所示。该计算机设备包括处理器、存储器、输入/输出接口、通信接口、显示单元和输入装置。其中,处理器、存储器和输入/输出接口通过系统总线连接,通信接口、显示单元和输入装置通过输入/输出接口连接到系统总线。其中,该计算机设备的处理器用于提供计算和控制能力。该计算机设备的存储器包括非易失性存储介质和内存储器。该非易失性存储介质存储有操作系统和计算机程序。该内存储器为非易失性存储介质中的操作系统和计算机程序的运行提供环境。该计算机设备的输入/输出接口用于处理器与外部设备之间交换信息。该计算机设备的通信接口用于与外部的终端进行有线或无线方式的通信,无线方式可通过WIFI、移动蜂窝网络、NFC(近场通信)或其他技术实现。该计算机程序被处理器执行时以实现一种产业链生成方法。该计算机设备的显示单元用于形成视觉可见的画面,可以是显示屏、投影装置或虚拟现实成像装置。显示屏可以是液晶显示屏或者电子墨水显示屏,该计算机设备的输入装置可以是显示屏上覆盖的触摸层,也可以是计算机设备外壳上设置的按键、轨迹球或触控板,还可以是外接的键盘、触控板或鼠标等。In one embodiment, a computer device is provided. The computer device may be a terminal, and its internal structure may be as shown in FIG. 6 . The computer device includes a processor, a memory, an input/output interface, a communication interface, a display unit and an input device. Wherein, the processor, the memory and the input/output interface are connected through the system bus, and the communication interface, the display unit and the input device are connected to the system bus through the input/output interface. Wherein, the processor of the computer device is used to provide calculation and control capabilities. The memory of the computer device includes a non-volatile storage medium and an internal memory. The non-volatile storage medium stores an operating system and computer programs. The internal memory provides an environment for the operation of the operating system and computer programs in the non-volatile storage medium. The input/output interface of the computer device is used for exchanging information between the processor and external devices. The communication interface of the computer device is used for wired or wireless communication with an external terminal, and the wireless mode can be realized through WIFI, mobile cellular network, NFC (Near Field Communication) or other technologies. When the computer program is executed by the processor, an industrial chain generation method is realized. The display unit of the computer equipment is used to form a visually visible picture, which may be a display screen, a projection device or a virtual reality imaging device. The display screen may be a liquid crystal display screen or an electronic ink display screen, and the input device of the computer device may be a touch layer covered on the display screen, or a button, a trackball or a touch pad set on the casing of the computer device, or a External keyboard, touchpad or mouse etc.

本领域技术人员可以理解,图6中示出的结构,仅仅是与本申请方案相关的部分结构的框图,并不构成对本申请方案所应用于其上的计算机设备的限定,具体的计算机设备可以包括比图中所示更多或更少的部件,或者组合某些部件,或者具有不同的部件布置。Those skilled in the art can understand that the structure shown in FIG. 6 is only a block diagram of a part of the structure related to the solution of this application, and does not constitute a limitation on the computer equipment to which the solution of this application is applied. The specific computer equipment can be More or fewer components than shown in the figures may be included, or some components may be combined, or have a different arrangement of components.

在一个实施例中,还提供了一种计算机设备,包括存储器和处理器,存储器中存储有计算机程序,该处理器执行计算机程序时实现上述各方法实施例中的步骤。In one embodiment, there is also provided a computer device, including a memory and a processor, where a computer program is stored in the memory, and the processor implements the steps in the above method embodiments when executing the computer program.

在一个实施例中,提供了一种计算机可读存储介质,其上存储有计算机程序,该计算机程序被处理器执行时实现上述各方法实施例中的步骤。In one embodiment, a computer-readable storage medium is provided, on which a computer program is stored, and when the computer program is executed by a processor, the steps in the foregoing method embodiments are implemented.

在一个实施例中,提供了一种计算机程序产品,包括计算机程序,该计算机程序被处理器执行时实现上述各方法实施例中的步骤。In one embodiment, a computer program product is provided, including a computer program, and when the computer program is executed by a processor, the steps in the foregoing method embodiments are implemented.

需要说明的是,本申请所涉及的用户信息(包括但不限于用户设备信息、用户个人信息等)和数据(包括但不限于用于分析的数据、存储的数据、展示的数据等),均为经用户授权或者经过各方充分授权的信息和数据,且相关数据的收集、使用和处理需要遵守相关国家和地区的相关法律法规和标准。It should be noted that the user information (including but not limited to user device information, user personal information, etc.) and data (including but not limited to data used for analysis, stored data, displayed data, etc.) involved in this application are all It is information and data authorized by the user or fully authorized by all parties, and the collection, use and processing of relevant data need to comply with relevant laws, regulations and standards of relevant countries and regions.

本领域普通技术人员可以理解实现上述实施例方法中的全部或部分流程,是可以通过计算机程序来指令相关的硬件来完成,所述的计算机程序可存储于一非易失性计算机可读取存储介质中,该计算机程序在执行时,可包括如上述各方法的实施例的流程。其中,本申请所提供的各实施例中所使用的对存储器、数据库或其它介质的任何引用,均可包括非易失性和易失性存储器中的至少一种。非易失性存储器可包括只读存储器(Read-OnlyMemory,ROM)、磁带、软盘、闪存、光存储器、高密度嵌入式非易失性存储器、阻变存储器(ReRAM)、磁变存储器(Magnetoresistive Random Access Memory,MRAM)、铁电存储器(Ferroelectric Random Access Memory,FRAM)、相变存储器(Phase Change Memory,PCM)、石墨烯存储器等。易失性存储器可包括随机存取存储器(Random Access Memory,RAM)或外部高速缓冲存储器等。作为说明而非局限,RAM可以是多种形式,比如静态随机存取存储器(Static Random Access Memory,SRAM)或动态随机存取存储器(Dynamic RandomAccess Memory,DRAM)等。本申请所提供的各实施例中所涉及的数据库可包括关系型数据库和非关系型数据库中至少一种。非关系型数据库可包括基于区块链的分布式数据库等,不限于此。本申请所提供的各实施例中所涉及的处理器可为通用处理器、中央处理器、图形处理器、数字信号处理器、可编程逻辑器、基于量子计算的数据处理逻辑器等,不限于此。Those of ordinary skill in the art can understand that all or part of the processes in the methods of the above embodiments can be implemented through computer programs to instruct related hardware, and the computer programs can be stored in a non-volatile computer-readable memory In the medium, when the computer program is executed, it may include the processes of the embodiments of the above-mentioned methods. Wherein, any reference to storage, database or other media used in the various embodiments provided in the present application may include at least one of non-volatile and volatile storage. Non-volatile memory can include read-only memory (Read-Only Memory, ROM), magnetic tape, floppy disk, flash memory, optical memory, high-density embedded non-volatile memory, resistive variable memory (ReRAM), magnetic variable memory (Magnetoresistive Random Access Memory, MRAM), Ferroelectric Random Access Memory (FRAM), Phase Change Memory (Phase Change Memory, PCM), graphene memory, etc. The volatile memory may include a random access memory (Random Access Memory, RAM) or an external cache memory and the like. As an illustration but not a limitation, the RAM can be in various forms, such as static random access memory (Static Random Access Memory, SRAM) or dynamic random access memory (Dynamic Random Access Memory, DRAM). The databases involved in the various embodiments provided in this application may include at least one of a relational database and a non-relational database. The non-relational database may include a blockchain-based distributed database, etc., but is not limited thereto. The processors involved in the various embodiments provided by this application can be general-purpose processors, central processing units, graphics processors, digital signal processors, programmable logic devices, data processing logic devices based on quantum computing, etc., and are not limited to this.

以上实施例的各技术特征可以进行任意的组合,为使描述简洁,未对上述实施例中的各个技术特征所有可能的组合都进行描述,然而,只要这些技术特征的组合不存在矛盾,都应当认为是本说明书记载的范围。The technical features of the above embodiments can be combined arbitrarily. To make the description concise, all possible combinations of the technical features in the above embodiments are not described. However, as long as there is no contradiction in the combination of these technical features, they should be It is considered to be within the range described in this specification.

以上所述实施例仅表达了本申请的几种实施方式,其描述较为具体和详细,但并不能因此而理解为对本申请专利范围的限制。应当指出的是,对于本领域的普通技术人员来说,在不脱离本申请构思的前提下,还可以做出若干变形和改进,这些都属于本申请的保护范围。因此,本申请的保护范围应以所附权利要求为准。The above-mentioned embodiments only express several implementation modes of the present application, and the description thereof is relatively specific and detailed, but should not be construed as limiting the patent scope of the present application. It should be noted that those skilled in the art can make several modifications and improvements without departing from the concept of the present application, and these all belong to the protection scope of the present application. Therefore, the protection scope of the present application should be determined by the appended claims.

Claims (10)

1. An industrial chain generation method, comprising:
determining data association degree between each target industry according to industrial electric quantity data and industrial production data of each target industry;
determining the name association degree among the target industries according to the distance among the name information of the target industries;
determining the relationship association degree among the target industries according to the entity relationship information in the industry association information of the target industries;
and generating an industry chain map corresponding to each target industry according to the data association degree, the name association degree and the relation association degree among the target industries.
2. The method of claim 1, wherein the determining the data association degree between the target industries according to the industry power data and the industry production data of the target industries comprises:
obtaining historical electric quantity growth rate of each target industry according to the industrial electric quantity data of each target industry;
obtaining historical production growth rate of each target industry according to the industrial production data of each target industry;
clustering each target industry according to the historical electric quantity growth rate and the historical production growth rate to obtain an industry classification result of each target industry;
and obtaining the data association degree among the target industries according to the industry classification result of each target industry.
3. The method according to claim 1, wherein the determining the name association degree between the target industries according to the distance between the name information of the target industries comprises:
under the condition that the name information of a first target industry is detected to be different from the name information of a second target industry, acquiring the number of times of text updating required for updating the name information of the first target industry into the name information of the second target industry; the first target industry is any one of the target industries; the second target industry is any one of the target industries except the first target industry;
obtaining the distance between the name information of the first target industry and the name information of the second target industry according to the text updating times;
and obtaining the name association degree between the first target industry and the second target industry according to the distance, wherein the name association degree is used as the name association degree between the target industries.
4. The method according to claim 1, wherein the determining the relationship association degree between the target industries according to entity relationship information in the industry association information of the target industries comprises:
performing entity relationship extraction processing on the industry associated information of each target industry to obtain entity relationship information in the industry associated information;
according to the upstream and downstream relation among industries in an industry relation dictionary library, performing upstream probability judgment and downstream probability judgment on the target industry in the entity relation information to obtain the upstream probability and the downstream probability of the target industry in the entity relation information;
and obtaining the relationship association degree between the target industries according to the upstream probability and the downstream probability of the target industries in the entity relationship information.
5. The method according to claim 4, wherein the generating an industry chain graph corresponding to each target industry according to the data association degree, the name association degree and the relationship association degree among the target industries comprises:
determining an industry association probability among the target industries according to the data association degree, the name association degree and the relationship association degree among the target industries;
determining an upstream industry and a downstream industry corresponding to each target industry according to the upstream probability, the downstream probability and the industry association probability;
and carrying out visual graphic processing on the upstream industry and the downstream industry corresponding to each target industry to obtain the industry chain map.
6. The method according to any one of claims 1 to 5, further comprising, after generating an industry chain map corresponding to each target industry according to the data association degree, the name association degree, and the relationship association degree between the target industries:
acquiring current electric quantity data of each target industry in the industry chain map in the current time period;
obtaining the current electric quantity growth rate of each target industry according to the current electric quantity data of each target industry;
performing a same-ratio evaluation and a ring-ratio evaluation on the current electric quantity growth rate and the historical electric quantity growth rate of each target industry to obtain an evaluation result of each target industry;
and under the condition that the evaluation result is abnormal, performing upstream and downstream updating on the target industry corresponding to the evaluation result in the industry chain map to obtain the industry chain map of the current time period.
7. An industrial chain generation apparatus, characterized in that the apparatus comprises:
the first association degree determining module is used for determining the data association degree between each target industry according to the industry electric quantity data and the industry production data of each target industry;
the second association degree determining module is used for determining the name association degree between the target industries according to the distance between the name information of the target industries;
a third association degree determining module, configured to determine, according to entity relationship information in the industry association information of each target industry, a relationship association degree between the target industries;
and the industry chain map generation module is used for generating an industry chain map corresponding to each target industry according to the data association degree, the name association degree and the relationship association degree among all target industries.
8. The apparatus according to claim 7, wherein the first association degree determining module is further configured to obtain a historical electricity quantity growth rate of each target industry according to the industry electricity quantity data of each target industry; obtaining the historical production growth rate of each target industry according to the industrial production data of each target industry; clustering each target industry according to the historical electric quantity growth rate and the historical production growth rate to obtain an industry classification result of each target industry; and obtaining the data association degree among the target industries according to the industry classification result of each target industry.
9. A computer device comprising a memory and a processor, the memory storing a computer program, characterized in that the processor, when executing the computer program, implements the steps of the method of any of claims 1 to 6.
10. A computer-readable storage medium, on which a computer program is stored, which, when being executed by a processor, carries out the steps of the method of any one of claims 1 to 6.
CN202211529390.2A 2022-12-01 2022-12-01 Industrial chain generation method, device, computer equipment and storage medium Pending CN115544336A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202211529390.2A CN115544336A (en) 2022-12-01 2022-12-01 Industrial chain generation method, device, computer equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202211529390.2A CN115544336A (en) 2022-12-01 2022-12-01 Industrial chain generation method, device, computer equipment and storage medium

Publications (1)

Publication Number Publication Date
CN115544336A true CN115544336A (en) 2022-12-30

Family

ID=84721772

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202211529390.2A Pending CN115544336A (en) 2022-12-01 2022-12-01 Industrial chain generation method, device, computer equipment and storage medium

Country Status (1)

Country Link
CN (1) CN115544336A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116738031A (en) * 2023-06-07 2023-09-12 广州市西美信息科技有限公司 An industry relationship analysis method and system

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111104522A (en) * 2019-12-20 2020-05-05 武汉理工大学 Regional industry association effect trend prediction method based on knowledge graph
CN111159426A (en) * 2019-12-30 2020-05-15 武汉理工大学 An industrial graph fusion method based on graph convolutional neural network
CN113051365A (en) * 2020-12-10 2021-06-29 深圳证券信息有限公司 Industrial chain map construction method and related equipment
CN114117065A (en) * 2021-11-12 2022-03-01 国网福建省电力有限公司经济技术研究院 Knowledge graph construction method and system based on power production statistical service
CN114417020A (en) * 2022-03-29 2022-04-29 浙江省标准化研究院(金砖国家标准化(浙江)研究中心 浙江省物品编码中心) Industrial chain map construction system and method

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111104522A (en) * 2019-12-20 2020-05-05 武汉理工大学 Regional industry association effect trend prediction method based on knowledge graph
CN111159426A (en) * 2019-12-30 2020-05-15 武汉理工大学 An industrial graph fusion method based on graph convolutional neural network
CN113051365A (en) * 2020-12-10 2021-06-29 深圳证券信息有限公司 Industrial chain map construction method and related equipment
CN114117065A (en) * 2021-11-12 2022-03-01 国网福建省电力有限公司经济技术研究院 Knowledge graph construction method and system based on power production statistical service
CN114417020A (en) * 2022-03-29 2022-04-29 浙江省标准化研究院(金砖国家标准化(浙江)研究中心 浙江省物品编码中心) Industrial chain map construction system and method

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116738031A (en) * 2023-06-07 2023-09-12 广州市西美信息科技有限公司 An industry relationship analysis method and system

Similar Documents

Publication Publication Date Title
CN110335009A (en) Report generation method, device, computer equipment and storage medium
CN112306835A (en) User data monitoring and analyzing method, device, equipment and medium
CN114757602B (en) Supply side electric power carbon emission risk early warning method and device and computer equipment
CN115408280A (en) Flow distribution method, distribution model configuration method, equipment, media and products
CN116050856A (en) Carbon emission data processing method, device, equipment and storage medium
CN115544336A (en) Industrial chain generation method, device, computer equipment and storage medium
CN115757373A (en) Data warehouse cleaning method, device, computer equipment and storage medium
CN116304251A (en) Label processing method, device, computer equipment and storage medium
CN115204607A (en) Power grid engineering quality evaluation method and device, computer equipment and storage medium
CN115759018A (en) Report generating method, device, computer equipment and storage medium
Byun Enabling time-centric computation for efficient temporal graph traversals from multiple sources
CN116243168A (en) Method, device, equipment, medium and product for determining battery health
CN117312268B (en) Stream-batch integrated master data management method and device based on multi-source and multi-database
CN118627756A (en) Substation engineering digital twin model maturity detection method, device, computer equipment, readable storage medium and program product
CN118568202A (en) Query statement generation model processing method and device and computer equipment
CN113947334B (en) Configurable power trading risk monitoring method, device and computer equipment
CN117196394A (en) Evaluation index processing method, device, computer equipment and storage medium
CN115115246A (en) Data processing method, data processing device, computer equipment and storage medium
Oliazadeh et al. A note on the strong consistency of nonparametric estimation of shannon entropy in length-biased sampling
CN112733383B (en) Power data analysis method, device, computer equipment and storage medium
CN119442033A (en) Leakage point identification method, device, computer equipment, storage medium and program product
CN116090629A (en) Power consumption data processing method, device, computer equipment and storage medium
CN116029710A (en) Electricity bill data processing method, device, computer equipment and storage medium
CN116011671A (en) Orderly electricity industry screening method, device, computer equipment and storage medium
CN117495131A (en) Power consumption data prediction method, device, computer equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20221230