CN115455304A - A method for matching supply and demand of scientific and technological achievements based on big data - Google Patents


Info

Publication number
CN115455304A
Authority
CN
China
Prior art keywords
scientific
group
data
matching
technological
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202211250898.9A
Other languages
Chinese (zh)
Inventor
张建胜
雷晓辉
叶琰琰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Renren Crowdsourcing Technology Co ltd
Original Assignee
Beijing Renren Crowdsourcing Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Renren Crowdsourcing Technology Co ltd
Priority to CN202211250898.9A
Publication of CN115455304A
Legal status: Pending

Classifications

    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06F - ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 - Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 - Details of database functions independent of the retrieved data types
    • G06F16/95 - Retrieval from the web
    • G06F16/953 - Querying, e.g. by the use of web search engines
    • G06F16/9535 - Search customisation based on user profiles and personalisation
    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06Q - INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00 - Commerce
    • G06Q30/06 - Buying, selling or leasing transactions
    • G06Q30/0601 - Electronic shopping [e-shopping]
    • G06Q30/0605 - Supply or demand aggregation

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Business, Economics & Management (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Accounting & Taxation (AREA)
  • Finance (AREA)
  • Physics & Mathematics (AREA)
  • Economics (AREA)
  • General Business, Economics & Management (AREA)
  • Strategic Management (AREA)
  • Marketing (AREA)
  • Development Economics (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a method for matching the supply and demand of scientific and technological achievements based on big data, comprising the following steps: data extraction, in which document materials are read, extracted, and entered; like-item processing and grouping, in which the type of each entered document is identified and the documents are grouped automatically by format matching; data labeling, in which the keywords stored in a media library are retrieved and searched for in the full text of the entered documents, and the matched keywords are listed automatically; and data label matching, which covers general labels and characteristic labels, matches several keywords from the list, and uses a calculation method and weights to obtain the supply-demand matching similarity, so as to determine a unique label. Built on a system for transforming scientific and technological achievements, the method constructs and applies a supply-demand matching model for such achievements, improves their matching degree and matching efficiency through big data, and promotes their transformation and application.

Description

A method for matching supply and demand of scientific and technological achievements based on big data

Technical field

The invention relates to an automatic matching system for scientific and technological achievements, and in particular to a method for matching supply and demand of scientific and technological achievements based on big data.

Background

At present, the wave of digitalization is still gathering momentum, informatization and digitalization are evolving together, the digital economy has shown strong resilience, and digital technology innovation has accelerated changes in economic and social structures and operating models. Deepening the application of digital technology in production, operation, management, marketing, and other links enables rapid development at both the enterprise and the industry level.

In the field of scientific and technological innovation, the digital transformation of scientific and technological achievements started slowly. Because the matching rate of scientific and technological achievements is low, technology brokers cannot effectively capture suitable supply and demand data, which slows the transformation of achievements and results in a low transformation rate of scientific and technological achievements in China.

Summary of the invention

The purpose of the present invention is to provide a method for matching supply and demand of scientific and technological achievements based on big data.

To achieve the above purpose, the present invention provides the following technical solution: a method for matching supply and demand of scientific and technological achievements based on big data, comprising:

Data extraction: reading, extracting, and entering document materials;

Like-item processing and grouping of data: identifying the type of the entered extracted document materials, and grouping them automatically according to format matching of the document materials;

Data labeling: retrieving the keywords stored in the media library, searching the full text of the entered document materials for them, and automatically listing the matched keywords;

Data label matching, which covers general labels and characteristic labels: matching a plurality of the keywords according to the list, and using a calculation method and weights to obtain the supply-demand matching similarity, so as to determine a unique label.

Preferably, the data extraction includes entry and analysis of the document materials, processed in the following steps:

S001. Parse the text and extract keywords;

S002. Generate the corresponding text information for the scientific and technological achievement, including:

Basic information on the scientific and technological achievement;

Research field of the scientific and technological achievement;

Innovation level of the scientific and technological achievement;

Research team situation;

Technical indicators of the scientific and technological achievement;

Scientific and technological achievement trading center.
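The text information generated in step S002 can be pictured as one record per achievement. The following is a minimal sketch only: the class name `AchievementRecord`, the field names, and the helper `build_record` are hypothetical and do not come from the patent, which lists the six information items but not how they are stored.

```python
from __future__ import annotations

from dataclasses import dataclass, field, fields


@dataclass
class AchievementRecord:
    """One achievement's text information (hypothetical field names)."""
    basic_info: str = ""
    research_field: str = ""
    innovation_level: str = ""
    research_team: str = ""
    technical_indicators: str = ""
    trading_center: str = ""
    # Keywords produced by step S001 (text parsing).
    keywords: list[str] = field(default_factory=list)


def build_record(keywords: list[str], sections: dict[str, str]) -> AchievementRecord:
    """Build a record, rejecting section names outside the six listed items."""
    known = {f.name for f in fields(AchievementRecord)} - {"keywords"}
    unknown = set(sections) - known
    if unknown:
        raise ValueError(f"unknown sections: {sorted(unknown)}")
    return AchievementRecord(keywords=list(keywords), **sections)
```

Sections that the parser could not fill simply stay empty, which keeps later stages free of missing-key checks.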

Preferably, the automatic grouping covers utility model patents, invention patents, software copyrights, and papers.
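As an illustration of grouping by format, the sketch below sorts documents by the shape of their identifier. Everything specific here is an assumption rather than the patent's method: the regular expressions, the function names, the reading of the fifth digit of a Chinese application number as a type code (1 for invention, 2 for utility model), the `SR` pattern for software copyright registration numbers, and the use of a DOI to recognize papers.

```python
import re

# Assumed shape of a Chinese application number: year, type digit, serial, check digit.
_CN_APPLICATION = re.compile(r"^(?:CN|ZL)?(\d{4})(\d)(\d{7})\.?[\dX]$", re.IGNORECASE)
_CN_TYPE_DIGIT = {"1": "invention patent", "2": "utility model patent"}

# Assumed shapes for the remaining groups.
_OTHER_PATTERNS = [
    ("software copyright", re.compile(r"^\d{4}SR\d+$", re.IGNORECASE)),
    ("paper", re.compile(r"^10\.\d{4,9}/\S+$")),  # DOI-style identifier
]


def classify_document(identifier: str) -> str:
    """Return the group name for an identifier, or "unknown" if nothing matches."""
    ident = identifier.strip().replace(" ", "")
    match = _CN_APPLICATION.match(ident)
    if match:
        return _CN_TYPE_DIGIT.get(match.group(2), "unknown")
    for label, pattern in _OTHER_PATTERNS:
        if pattern.match(ident):
            return label
    return "unknown"


def group_documents(identifiers):
    """Group identifiers by their detected type, preserving input order."""
    groups = {}
    for ident in identifiers:
        groups.setdefault(classify_document(ident), []).append(ident)
    return groups
```

Identifiers that match no pattern land in an explicit "unknown" group instead of being dropped, so they can be reviewed by hand.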

Preferably, the reading includes reading a patent number, a paper number, or a registration number; the corresponding document material is retrieved from an agreed search URL and converted into an editable Word text format by the following code:

[Figure BDA0003887257820000021: code listing, published as an image]

Preferably, the keywords include key characters (single characters) and key words of at least two and at most five characters;

The key words and key characters are characters and words that are extracted from the general labels and the characteristic labels and are sufficient to distinguish features.
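The keyword rule and the full-text search of the data labeling step can be sketched together as follows. This is a sketch under assumptions: the function names are hypothetical, the media library is modeled as a plain list of strings, and matching is a simple substring count, which the patent does not specify.

```python
def valid_keyword(term: str) -> bool:
    """Accept single characters and words of two to five characters."""
    return 1 <= len(term) <= 5


def find_keywords(text: str, library: list[str]) -> list[tuple[str, int]]:
    """Return (keyword, occurrence count) for each library entry found in text."""
    hits = []
    for term in dict.fromkeys(library):  # drop duplicates, keep library order
        if not valid_keyword(term):
            continue
        count = text.count(term)  # non-overlapping substring matches
        if count:
            hits.append((term, count))
    return hits
```

For example, `find_keywords("激光焊接设备用于激光切割", ["激光", "焊接", "机器人"])` lists "激光" with two occurrences and "焊接" with one, and omits "机器人" because it does not appear in the text. Python's `len` counts code points, so each Chinese character counts as one.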

Preferably, the general labels include the basic information, research field, innovation level, research team situation, technical field, and maturity assessment of the scientific and technological achievement;

The characteristic labels cover the application, production, and effective extraction of the scientific and technological achievement.

Preferably, the data label matching uses the supply-demand matching similarity, which is used to judge the accuracy of the matching model. The specific processing steps are as follows:

General groups = the sets of label data obtained by merging like items among the general data labels of the scientific and technological achievements and grouping them; the first group is denoted group 1 and the n-th group is denoted group n;

Special groups = the sets of label data obtained by merging like items among the special data labels of the scientific and technological achievements and grouping them; the first group is denoted group A and the n-th group is denoted group N;

General group comparison value = the degree of comparison between a general group and the industry demand data; the value for the first group is the group 1 comparison value, and the value for the n-th group is the group n comparison value;

Special group comparison value = the degree of comparison between a special group and the industry demand data; the value for the first group is the group A comparison value, and the value for the n-th group is the group N comparison value;

Approximate value 1 = group 1 comparison value + ... + group n comparison value;

Approximate value 2 = group A comparison value + ... + group N comparison value;

Supply-demand matching similarity = (approximate value 1) × weight 1 + (approximate value 2) × weight 2.
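The calculation above can be sketched in a few lines. Two things here are assumptions, not part of the patent: the comparison value is not defined in the text, so Jaccard overlap between a group's labels and the demand labels stands in for it, and the default weights 0.6 and 0.4 are placeholders. The function and parameter names are likewise hypothetical.

```python
def comparison_value(group_tags: set, demand_tags: set) -> float:
    """Jaccard overlap, a stand-in for the comparison value left undefined in the text."""
    union = group_tags | demand_tags
    if not union:
        return 0.0
    return len(group_tags & demand_tags) / len(union)


def matching_similarity(general_groups, special_groups, demand_tags,
                        weight_general=0.6, weight_special=0.4) -> float:
    """Weighted sum of the two approximate values, as in the formula above."""
    # Approximate value 1: sum of comparison values over the general groups.
    approx_1 = sum(comparison_value(g, demand_tags) for g in general_groups)
    # Approximate value 2: sum of comparison values over the special groups.
    approx_2 = sum(comparison_value(g, demand_tags) for g in special_groups)
    return approx_1 * weight_general + approx_2 * weight_special
```

Because the approximate values are sums rather than averages, the result grows with the number of groups and can exceed 1; comparing scores across achievements with different group counts would need a normalization step that the text does not describe.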

With the above technical solution, the method for matching supply and demand of scientific and technological achievements based on big data provided by the present invention has the following beneficial effects: scientific and technological achievement data is effectively extracted through text parsing and then labeled; industry demand data is effectively extracted from the current state of marketed products and then labeled; a supply-demand matching model is built, weights are designed comprehensively, and grouping and like-item merging rules are formulated to identify the matching degree; and the supply-demand matching degree gives the probability that an achievement will be transformed, thereby improving the efficiency of transforming scientific and technological achievements.

Description of drawings

To explain the technical solutions in the embodiments of the present application or in the prior art more clearly, the drawings used in the embodiments are briefly introduced below. Obviously, the drawings described below are only some of the embodiments recorded in the present invention, and those of ordinary skill in the art can obtain other drawings from them.

Fig. 1 is a schematic diagram of how data extraction parses document materials, provided by an embodiment of the present invention;

Fig. 2 is a schematic structural diagram of the key data of industry demand, provided by an embodiment of the present invention;

Fig. 3 is a schematic structural diagram of the plate connecting body and the pressing member, provided by an embodiment of the present invention;

Fig. 4 is a schematic structural diagram of the plate connecting body and the pressing member, provided by an embodiment of the present invention.

Detailed description

The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the drawings. Obviously, the described embodiments are only some, not all, of the embodiments of the present invention. All other embodiments obtained by those of ordinary skill in the art on the basis of these embodiments without creative effort fall within the protection scope of the present invention.

As shown in Figs. 1-4, a method for matching supply and demand of scientific and technological achievements based on big data comprises:

Data extraction: reading, extracting, and entering document materials;

Like-item processing and grouping of data: identifying the type of the entered extracted document materials, and grouping them automatically according to format matching of the document materials;

Data labeling: retrieving the keywords stored in the media library, searching the full text of the entered document materials for them, and automatically listing the matched keywords;

Data label matching, which covers general labels and characteristic labels: matching a plurality of keywords according to the list, and using a calculation method and weights to obtain the supply-demand matching similarity, so as to determine a unique label.

Specifically, the automatic grouping covers utility model patents, invention patents, software copyrights, and papers.

Furthermore, the keywords include key characters (single characters) and key words of at least two and at most five characters;

The key words and key characters are characters and words that are extracted from the general labels and the characteristic labels and are sufficient to distinguish features.

Further, in the above embodiment the general labels include the basic information, research field, innovation level, research team situation, technical field, and maturity assessment of the scientific and technological achievement;

The characteristic labels cover the application, production, and effective extraction of the scientific and technological achievement.

In the above embodiment, scientific and technological achievement data is effectively extracted through text parsing and then labeled; industry demand data is effectively extracted from the current state of marketed products and then labeled; a supply-demand matching model is built, weights are designed comprehensively, and grouping and like-item merging rules are formulated to identify the matching degree; and the supply-demand matching degree gives the probability that an achievement will be transformed, thereby improving the efficiency of transforming scientific and technological achievements.

In a further embodiment of the present invention, the data extraction includes entry and analysis of the document materials, processed in the following steps:

S001. Parse the text and extract keywords;

S002. Generate the corresponding text information for the scientific and technological achievement, including:

Basic information on the scientific and technological achievement;

Research field of the scientific and technological achievement;

Innovation level of the scientific and technological achievement;

Research team situation;

Technical indicators of the scientific and technological achievement;

Scientific and technological achievement trading center.

In yet another embodiment of the present invention, the reading includes reading a patent number, a paper number, or a registration number; the corresponding document material is retrieved from an agreed search URL and converted into an editable Word text format by the following code:

[Figure BDA0003887257820000051: code listing, published as an image]

In still another embodiment of the present invention, the data label matching uses the supply-demand matching similarity, which is used to judge the accuracy of the matching model. The specific processing steps are as follows:

General groups = the sets of label data obtained by merging like items among the general data labels of the scientific and technological achievements and grouping them; the first group is denoted group 1 and the n-th group is denoted group n;

Special groups = the sets of label data obtained by merging like items among the special data labels of the scientific and technological achievements and grouping them; the first group is denoted group A and the n-th group is denoted group N;

General group comparison value = the degree of comparison between a general group and the industry demand data; the value for the first group is the group 1 comparison value, and the value for the n-th group is the group n comparison value;

Special group comparison value = the degree of comparison between a special group and the industry demand data; the value for the first group is the group A comparison value, and the value for the n-th group is the group N comparison value;

Approximate value 1 = group 1 comparison value + ... + group n comparison value;

Approximate value 2 = group A comparison value + ... + group N comparison value;

Supply-demand matching similarity = (approximate value 1) × weight 1 + (approximate value 2) × weight 2.
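Written compactly, with a worked instance: the symbols $c_i$, $d_j$, $A_1$, $A_2$, $w_1$, $w_2$, and $S$ are notation introduced here for the quantities named above, and the numbers are purely hypothetical, chosen only to show the arithmetic.

```latex
% c_i: comparison value of general group i;  d_j: comparison value of special group j
A_1 = \sum_{i=1}^{n} c_i, \qquad
A_2 = \sum_{j=1}^{N} d_j, \qquad
S = w_1 A_1 + w_2 A_2 .

% Hypothetical values: c = (0.5,\ 0.25),\ d = (0.4),\ w_1 = 0.6,\ w_2 = 0.4
A_1 = 0.5 + 0.25 = 0.75, \qquad A_2 = 0.4, \qquad
S = 0.6 \times 0.75 + 0.4 \times 0.4 = 0.45 + 0.16 = 0.61 .
```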

Those skilled in the art should understand that the embodiments of the present invention may be provided as a method, a system, or a computer program product. Accordingly, the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment, or an embodiment combining software and hardware. Furthermore, the present invention may take the form of a computer program product implemented on one or more computer-usable storage media (including but not limited to disk storage, CD-ROM, and optical storage) containing computer-usable program code.

The present invention is described with reference to flowcharts and/or block diagrams of methods, devices (systems), and computer program products according to embodiments of the invention. It should be understood that each flow and/or block in the flowcharts and/or block diagrams, and combinations of flows and/or blocks, can be implemented by computer program instructions. These instructions may be provided to the processor of a general-purpose computer, a special-purpose computer, an embedded processor, or other programmable data processing equipment to produce a machine, such that the instructions executed by that processor produce means for implementing the functions specified in one or more flows of a flowchart and/or one or more blocks of a block diagram.

These computer program instructions may also be stored in a computer-readable memory capable of directing a computer or other programmable data processing equipment to operate in a specific manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means that implement the functions specified in one or more flows of a flowchart and/or one or more blocks of a block diagram.

These computer program instructions may also be loaded onto a computer or other programmable data processing equipment, so that a series of operational steps is performed on it to produce a computer-implemented process; the instructions executed on the computer or other programmable equipment thereby provide steps for implementing the functions specified in one or more flows of a flowchart and/or one or more blocks of a block diagram.

Specific embodiments are used herein to explain the principles and implementation of the present invention. The description of the above embodiments is intended only to help in understanding the method of the present invention and its core idea. Those of ordinary skill in the art may, following the idea of the present invention, make changes to the specific implementation and the scope of application. In summary, the content of this specification should not be construed as limiting the present invention.

The embodiments of the present application also provide a specific implementation of an electronic device capable of implementing all the steps of the method in the above embodiments. The electronic device specifically includes:

A processor, a memory, a communications interface, and a bus;

The processor, the memory, and the communications interface communicate with one another through the bus;

The processor is configured to call the computer program in the memory. When executing the computer program, the processor implements all the steps of the method in the above embodiments, for example the following steps:

Displaying the data extraction interface;

Performing like-item processing and grouping of the data;

Displaying the data labeling interface;

Displaying the data label matching interface.

The embodiments of the present application also provide a computer-readable storage medium capable of implementing all the steps of the method in the above embodiments. A computer program is stored on the computer-readable storage medium, and when executed by a processor it implements all the steps of the method in the above embodiments, for example the following steps:

Displaying the data extraction interface;

Performing like-item processing and grouping of the data;

Displaying the data labeling interface;

Displaying the data label matching interface.

The embodiments in this specification are described in a progressive manner. For parts that are the same or similar, the embodiments may be referred to one another, and each embodiment focuses on its differences from the others. In particular, the hardware-plus-program embodiments are basically similar to the method embodiments, so they are described relatively briefly; for related details, refer to the corresponding parts of the method embodiments. Although the embodiments of this specification provide the method steps described in the embodiments or flowcharts, more or fewer steps may be included on the basis of conventional or non-inventive means. The order of steps listed in the embodiments is only one of many possible execution orders and is not the only one. When an actual apparatus or terminal product runs, the steps may be executed sequentially or in parallel according to the methods shown in the embodiments or drawings (for example, in a parallel-processor or multi-threaded environment, or even a distributed data processing environment).

The terms "include", "comprise", and any other variants are intended to cover non-exclusive inclusion, so that a process, method, product, or device comprising a series of elements includes not only those elements but also other elements not expressly listed, or elements inherent to that process, method, product, or device. Without further limitation, the presence of additional identical or equivalent elements in the process, method, product, or device comprising the stated elements is not excluded.

For convenience of description, the above apparatus is described in terms of separate functional modules. When implementing the embodiments of this specification, the functions of the modules may be implemented in one or more pieces of software and/or hardware, and a module implementing a single function may be implemented as a combination of several submodules or subunits. The apparatus embodiments described above are only illustrative. For example, the division into units is only a division by logical function; in an actual implementation other divisions are possible, such as combining several units or components, integrating them into another system, or omitting or not executing some features. In addition, the mutual coupling, direct coupling, or communication connections shown or discussed may be indirect couplings or communication connections through interfaces, apparatuses, or units, and may be electrical, mechanical, or of another form.

本领域技术人员应明白,本说明书的实施例可提供为方法、系统或计算机程序产品。因此,本说明书实施例可采用完全硬件实施例、完全软件实施例或结合软件和硬件方面的实施例的形式。而且,本说明书实施例可采用在一个或多个其中包含有计算机可用程序代码的计算机可用存储介质(包括但不限于磁盘存储器、CD-ROM、光学存储器等)上实施的计算机程序产品的形式。本说明书中的各个实施例均采用递进的方式描述,各个实施例之间相同相似的部分互相参见即可,每个实施例重点说明的都是与其他实施例的不同之处。尤其,对于系统实施例而言,由于其基本相似于方法实施例,所以描述的比较简单,相关之处参见方法实施例的部分说明即可。在本说明书的描述中,参考术语“一个实施例”、“一些实施例”、“示例”、“具体示例”、或“一些示例”等的描述意指结合该实施例或示例描述的具体特征、结构、材料或者特点包含于本说明书实施例的至少一个实施例或示例中。Those skilled in the art should understand that the embodiments of this specification may be provided as methods, systems or computer program products. Accordingly, the embodiments of the present description may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, embodiments of the present description may take the form of a computer program product embodied on one or more computer-usable storage media (including but not limited to disk storage, CD-ROM, optical storage, etc.) having computer-usable program code embodied therein. Each embodiment in this specification is described in a progressive manner, the same and similar parts of each embodiment can be referred to each other, and each embodiment focuses on the differences from other embodiments. In particular, for the system embodiment, since it is basically similar to the method embodiment, the description is relatively simple, and for relevant parts, refer to part of the description of the method embodiment. In the description of this specification, descriptions referring to the terms "one embodiment", "some embodiments", "example", "specific examples", or "some examples" mean that specific features described in connection with the embodiment or example , structures, materials or features are included in at least one embodiment or example of the embodiments of this specification.

In this specification, schematic uses of the above terms do not necessarily refer to the same embodiment or example. In addition, provided they do not contradict one another, those skilled in the art may combine the different embodiments or examples described in this specification, as well as the features of different embodiments or examples. The foregoing descriptions are merely examples of the embodiments of this specification and are not intended to limit them. For those skilled in the art, various modifications and changes may be made to the embodiments of this specification. Any modification, equivalent replacement, or improvement made within the spirit and principles of the embodiments of this specification shall fall within the scope of the claims of the embodiments of this specification.

Claims (9)

1. A method for matching supply and demand of scientific and technological achievements based on big data, characterized by comprising the following steps:
data extraction: reading, extracting and recording literature materials;
data processing and same-category grouping: identifying the type of the recorded and extracted literature materials, and automatically grouping them according to format matching of the literature materials;
data labeling: calling the keyword terms stored in a media library to perform a full-text search on the entered literature materials, and automatically listing the matched keyword terms;
data label matching: the data labels comprise general labels and characteristic labels; a plurality of keyword terms are matched according to the list, and a supply and demand matching similarity is obtained by using a calculation method and weights, so as to determine a unique label.
2. The method for matching supply and demand of scientific and technological achievements based on big data according to claim 1, wherein the data extraction comprises entering and analyzing document materials, and the processing comprises the following steps:
S001, analyzing the text to extract keywords;
S002, generating corresponding scientific and technological achievement text information, including:
basic information of the scientific and technological achievement;
the research field of the scientific and technological achievement;
the innovation level of the scientific and technological achievement;
the scientific research team situation;
the technical indexes of the scientific and technological achievement;
the scientific and technological achievement trading center.
3. The method for matching supply and demand of scientific and technological achievements based on big data according to claim 1, wherein the automatic grouping comprises utility model patents, invention patents, software copyrights and papers.
4. The method for matching supply and demand of scientific and technological achievements based on big data according to claim 1, wherein the reading comprises reading a patent number, a paper number and a registration number, and searching an agreed web address to obtain the corresponding literature material.
5. The method for matching supply and demand of scientific and technological achievements based on big data according to claim 1, wherein the keyword terms comprise keywords and key phrases, each key phrase comprising at least two and no more than five words;
the keywords and key phrases comprise words and phrases that are extracted from the general labels and the characteristic labels and are sufficient to distinguish features.
6. The method for matching supply and demand of scientific and technological achievements based on big data according to claim 1, wherein the general labels comprise basic information of scientific and technological achievements, the research field of scientific and technological achievements, the innovation level of scientific and technological achievements, the scientific research team situation, the technical field of scientific and technological achievements, and the maturity assessment of scientific and technological achievements;
the characteristic labels comprise application, production and effective extraction of scientific and technological achievements.
7. The method for matching supply and demand of scientific and technological achievements based on big data according to claim 1, wherein the data label matching uses a supply and demand matching similarity, the supply and demand matching similarity is used to judge the accuracy of the matching model, and the specific processing steps are as follows:
general groups = the general data labels of scientific and technological achievements, merged into label data sets of like items and grouped, the first group being denoted group 1 and the nth group being denoted group n;
characteristic groups = the characteristic data labels of scientific and technological achievements, merged into label data sets of like items and grouped, the first group being denoted group A and the Nth group being denoted group N;
general group contrast value = contrast ratio of a general group to the industry demand data, the contrast ratio of the first group being the group 1 contrast value and that of the nth group being the group n contrast value;
characteristic group contrast value = contrast ratio of a characteristic group to the industry demand data, the contrast ratio of the first group being the group A contrast value and that of the Nth group being the group N contrast value;
approximation 1 = group 1 contrast value + ... + group n contrast value;
approximation 2 = group A contrast value + ... + group N contrast value;
(approximation 1) × weight 1 + (approximation 2) × weight 2 = supply and demand matching similarity.
8. An electronic device comprising a memory, a processor, and a computer program stored in the memory and executable on the processor, wherein the processor, when executing the program, implements the steps of the method for matching supply and demand of scientific and technological achievements based on big data according to any one of claims 1 to 7.
9. A computer-readable storage medium on which a computer program is stored, wherein the computer program, when executed by a processor, implements the steps of the method for matching supply and demand of scientific and technological achievements based on big data according to any one of claims 1 to 7.
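
The labeling step of claim 1 and the similarity calculation of claim 7 can be illustrated with a minimal Python sketch. It is not the applicant's implementation. The claims do not define how a "contrast value" is computed, so the sketch assumes it is the share of a group's labels that also appear in the industry demand data. The function names, the sample labels, and the weights 0.4 and 0.6 are all hypothetical.

```python
from typing import Dict, List, Set


def list_matched_keywords(text: str, keyword_library: List[str]) -> List[str]:
    """Full-text search: return the library keywords found in the text, in library order."""
    lowered = text.lower()
    return [kw for kw in keyword_library if kw.lower() in lowered]


def contrast_value(group_labels: Set[str], demand_labels: Set[str]) -> float:
    """ASSUMPTION: contrast ratio = share of the group's labels present in the demand data."""
    if not group_labels:
        return 0.0
    return len(group_labels & demand_labels) / len(group_labels)


def matching_similarity(
    general_groups: Dict[str, Set[str]],
    characteristic_groups: Dict[str, Set[str]],
    demand_labels: Set[str],
    weight_1: float,
    weight_2: float,
) -> float:
    """Claim 7: (approximation 1) x weight 1 + (approximation 2) x weight 2."""
    # approximation 1 = group 1 contrast value + ... + group n contrast value
    approximation_1 = sum(
        contrast_value(labels, demand_labels) for labels in general_groups.values()
    )
    # approximation 2 = group A contrast value + ... + group N contrast value
    approximation_2 = sum(
        contrast_value(labels, demand_labels) for labels in characteristic_groups.values()
    )
    return approximation_1 * weight_1 + approximation_2 * weight_2


if __name__ == "__main__":
    # Hypothetical sample data, for illustration only.
    library = ["battery", "coating", "solar"]
    print(list_matched_keywords("A lithium Battery coating process", library))

    general = {"group 1": {"energy", "materials"}, "group 2": {"prototype"}}
    characteristic = {"group A": {"battery", "coating"}}
    demand = {"energy", "battery", "coating", "prototype"}
    print(matching_similarity(general, characteristic, demand, 0.4, 0.6))
```

Because the claimed formula sums the per-group contrast values without normalizing them, the resulting similarity in this sketch is not bounded by 1; dividing each approximation by its number of groups would be one way to normalize, but the claims do not state this.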
CN202211250898.9A 2022-10-12 2022-10-12 A method for matching supply and demand of scientific and technological achievements based on big data Pending CN115455304A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202211250898.9A CN115455304A (en) 2022-10-12 2022-10-12 A method for matching supply and demand of scientific and technological achievements based on big data

Publications (1)

Publication Number Publication Date
CN115455304A true CN115455304A (en) 2022-12-09

Family

ID=84309030

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202211250898.9A Pending CN115455304A (en) 2022-10-12 2022-10-12 A method for matching supply and demand of scientific and technological achievements based on big data

Country Status (1)

Country Link
CN (1) CN115455304A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116955538A (en) * 2023-08-16 2023-10-27 成都医星科技有限公司 Medical dictionary data matching method and device, electronic equipment and storage medium
CN118134079A (en) * 2024-01-30 2024-06-04 北京人人众包科技有限公司 A method and system for matching technical achievements and technical requirements

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111078862A (en) * 2019-12-06 2020-04-28 武汉理工大学 Active pushing method and device for scientific and technological achievements of colleges and universities
CN112380318A (en) * 2020-11-12 2021-02-19 中国科学技术大学智慧城市研究院(芜湖) Enterprise policy matching method based on label similarity
CN112528155A (en) * 2020-12-23 2021-03-19 广州博士信息技术研究院有限公司 Data processing and pushing method, device and medium based on scientific and technological achievement conversion
CN113918707A (en) * 2021-12-14 2022-01-11 中关村科技软件股份有限公司 Policy convergence and enterprise image matching recommendation method

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116955538A (en) * 2023-08-16 2023-10-27 成都医星科技有限公司 Medical dictionary data matching method and device, electronic equipment and storage medium
CN116955538B (en) * 2023-08-16 2024-03-19 成都医星科技有限公司 Medical dictionary data matching method and device, electronic equipment and storage medium
CN118134079A (en) * 2024-01-30 2024-06-04 北京人人众包科技有限公司 A method and system for matching technical achievements and technical requirements

Similar Documents

Publication Publication Date Title
JP6894534B2 (en) Information processing method and terminal, computer storage medium
CN112256762B (en) Enterprise portrait method, system, equipment and medium based on industrial map
CN110399339A (en) File classifying method, device, equipment and the storage medium of knowledge base management system
CN111125086B (en) Method, device, storage medium and processor for acquiring data resources
CN115455304A (en) A method for matching supply and demand of scientific and technological achievements based on big data
CN111522950B (en) A Rapid Identification System for Unstructured Massive Text Sensitive Data
Deselaers et al. Automatic medical image annotation in ImageCLEF 2007: Overview, results, and discussion
CN103425740A (en) IOT (Internet Of Things) faced material information retrieval method based on semantic clustering
CN111563382A (en) Text information acquisition method and device, storage medium and computer equipment
CN112035626A (en) A method, apparatus and electronic device for rapid identification of large-scale intent
CN112199937A (en) Short text similarity analysis method and system, computer equipment and medium
CN110263021B (en) Theme library generation method based on personalized label system
Nama et al. Implementation of K-Means Technique in Data Mining to Cluster Researchers Google Scholar Profile
Hanshal et al. Retracted article: Hybrid deep learning model for automatic fake news detection
CN115186151A (en) Resume screening method, device, equipment and storage medium
CN116150367A (en) An aspect-based sentiment analysis method and system
CN117708759B (en) Method and device for positioning industry link of enterprise
CN118779458A (en) A sensitive information analysis and identification method, system, device and readable storage medium
CN112487160A (en) Technical document tracing method and device, computer equipment and computer storage medium
CN103678355A (en) Text mining method and text mining device
CN114706927B (en) Data batch labeling method based on artificial intelligence and related equipment
CN117150046A (en) Method and system for automatic task decomposition based on contextual semantics
Fan et al. Literature review on Big Data and its application fields
KR20190100533A (en) Database module using artificial intelligence, economic data providing system and method using the same
CN114860898A (en) A software development knowledge base construction and application method

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination