CN112287074A - Patent information prediction system based on data mining - Google Patents

Patent information prediction system based on data mining Download PDF

Info

Publication number
CN112287074A
CN112287074A CN202011351495.4A CN202011351495A CN112287074A CN 112287074 A CN112287074 A CN 112287074A CN 202011351495 A CN202011351495 A CN 202011351495A CN 112287074 A CN112287074 A CN 112287074A
Authority
CN
China
Prior art keywords
data
unit
analysis
data mining
module
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
CN202011351495.4A
Other languages
Chinese (zh)
Inventor
曹亮
李湘丽
刘双印
徐龙琴
郭鹏飞
付志文
徐浩根
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhongkai University of Agriculture and Engineering
Original Assignee
Zhongkai University of Agriculture and Engineering
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhongkai University of Agriculture and Engineering filed Critical Zhongkai University of Agriculture and Engineering
Priority to CN202011351495.4A priority Critical patent/CN112287074A/en
Publication of CN112287074A publication Critical patent/CN112287074A/en
Withdrawn legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/205Parsing
    • G06F40/216Parsing using statistical methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/30Semantic analysis

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Probability & Statistics with Applications (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

A patent information prediction system based on data mining comprises a data screening module, a data mining module, a data analysis module and a result uploading module which are electrically connected in sequence; wherein: the data screening module screens key data information associated with the keyword requirement from the massive big data, the data mining module performs data mining on the key data information based on a preset rule, the data analysis module analyzes a mining result to obtain a keyword requirement analysis result, and the result uploading module uploads the analysis result to the service platform to be displayed. The whole navigation process realizes intellectualization, the intelligent system replaces manpower, the operation is relatively simple, the time and resources consumed by manpower are reduced, the processing speed is accelerated, the efficiency is improved, the statistical analysis result of the system is more accurate, and the error rate is reduced.

Description

Patent information prediction system based on data mining
Technical Field
The invention relates to the technical field of data analysis, in particular to a patent information prediction system based on data mining.
Background
With the rapid development of science and technology in China and the enhancement of protection consciousness of intellectual property rights of people, more and more enterprises, organizations and individuals are willing to protect their technologies, products, brands and works by laws and obtain protection by applying for patents, trademarks and copyrights.
Most of intellectual property rights are manually searched by the prior patent engineers according to the related fields and keywords in the process of application protection, while the patent engineers often only search a few authorized patents and rarely search invalid patents, particularly search the invalid patents after being reviewed, because the patents after being reviewed are relatively insufficient in technical innovation degree, but because the patents after being reviewed are capable of entering the invalid state, the patent engineers still have a certain market value space, namely the technology of the patent engineers is close to the practical application, the technology, products, processes or formulas and the like which are closest to the current state of the industry can be known from the patents, the industry is generally held, then key technical words (including processes, parameters or data and the like) are screened from the patents, and reverse search is carried out in reverse, searching blank areas of the key technical words, collecting cross-overlapped areas among the key technical words, and analyzing the blank areas and the cross-overlapped areas, wherein the blank areas are not in the prior art, and the cross-overlapped areas are more practical in the industry.
Through search and discovery, the invention patent of patent application number CN201110432218.0 discloses a patent information presentation method and system, comprising: s10, creating a patent list window and a plurality of sub-windows which are associated with the patent list window to respectively present different patent information; s20, downloading and storing patent information, wherein all patents in the patent information are presented in a patent list window in a list manner; s30, selecting the patent and the sub-window to be checked in the patent list window; and S40, the sub-window acquires the information content to be presented from the proprietary information database and presents the information content therein so as to facilitate browsing and viewing of the user.
Patent application No.: the invention patent of CN201010217459.9 discloses a method and a device for analyzing patent information, which are used to realize expandability of patent information analysis. The patent information analysis method comprises the following steps: receiving an analysis instruction initiated by a user and aiming at a specific analysis template; extracting the specific analysis template from at least one configured analysis template according to the analysis instruction, wherein the analysis template is used for defining analysis items and measurement indexes; searching data content corresponding to an analysis item defined by the specific analysis template by accessing a data source, searching a measurement index value meeting the data content according to a measurement index defined by the specific analysis template, and taking the searched data content and the measurement index value corresponding to the data content as an obtained analysis result; and presenting the obtained analysis result to a user.
Patent application No.: CN201210579351.3, discloses an information presentation method and apparatus, wherein in the information presentation method, attribute information of a data set is extracted; selecting three dimensions in the attribute information as an X axis, a Y axis and a Z axis of a three-dimensional space respectively; determining a corresponding three-dimensional coordinate for each sample data in the data set; and displaying each sample data in a three-dimensional space formed by the X axis, the Y axis and the Z axis. Since the plane data is presented in a three-dimensional manner, the user can know the relevance and aggregation among the sample data conveniently. The perception effect of the user is improved.
Patent application No.: the invention patent of CN200910216835.X discloses a patent retrieval method and a system, and the patent retrieval method comprises the following steps: acquiring a patent retrieval request of a user; reading a patent retrieval condition of a user from the patent retrieval request; judging whether the patent retrieval condition is a patent retrieval condition used by the system for carrying out background retrieval regularly; if yes, providing a retrieval result of the system for performing background retrieval regularly for the user; otherwise, searching is carried out according to the patent searching conditions of the user, and the searching result is provided for the user. According to the technical scheme, the relevant patent retrieval conditions are preset for the system, and the background retrieval is carried out regularly, so that when a retrieval request of a user is received, a retrieval result obtained by carrying out the background retrieval regularly by the system according to the preset relevant retrieval conditions can be provided for the user, and the waiting time of the user is greatly shortened.
In summary, we can see that the market is still relatively lack of data in the aspect of patent mining, so we need to solve this kind of problem to facilitate patent technology mining by patent technicians.
Disclosure of Invention
In order to solve the problems in the prior art, the invention provides a patent information prediction system based on data mining.
The technical scheme adopted by the invention for solving the technical problems is as follows:
a patent information prediction system based on data mining comprises a data screening module, a data mining module, a data analysis module and a result uploading module which are electrically connected in sequence. Wherein: the data screening module screens key data information associated with the keyword requirement from the massive big data, the data mining module performs data mining on the key data information based on a preset rule, the data analysis module analyzes a mining result to obtain a keyword requirement analysis result, and the result uploading module uploads the analysis result to the service platform to be displayed.
The invention also has the following additional technical features:
the technical scheme of the invention is further specifically optimized as follows: the data screening module comprises a keyword input unit, a patent retrieval unit, a patent screening unit, a patent file extraction unit, a patent classification unit, a technology capturing unit and a basic model unit.
Wherein:
and the keyword input unit is used for inputting the key words of the target technology.
And the patent retrieval unit is used for retrieving the input key words in the patent database.
And the patent screening unit is used for screening the patents which accord with the patent review invalid information of the key words from the patent database.
And the patent file extracting unit is used for extracting the patent file of the patent review invalidation information from the database.
And the patent classification unit is used for classifying the extracted patent documents according to patent types.
And the technical grabbing unit is used for grabbing technical parts in various patent documents.
And the basic model unit is used for generating a data basic model diagram from the classified patent documents and the captured technical parts.
The technical scheme of the invention is further specifically optimized as follows: the data mining module comprises a keyword input unit and a data mining calculation unit. Wherein:
the keyword input unit comprises basic keyword input and advanced keyword input, wherein the basic keyword input is used for inputting patent keywords according to one of options of patent keywords, technical fields, application units, patent right units, inventor, application time and authorization time. The advanced keyword input selects various combinations according to patent keywords, technical fields, application units, patent right units, inventors, application time and authorization time options to input the patent keywords.
And a data mining calculation unit using Apriori data mining algorithm. In the Apriori data mining algorithm, the minimum support among basic keyword input options in the selected keyword input unit (1) is set to S, and S is 40%, the minimum confidence is set to P, and P is 80%. In the Apriori data mining algorithm, the minimum support among the high-level keyword input options in the selected keyword input unit (1) is set to S, and S is 50%, the minimum confidence is set to P, and P is 60%.
The technical scheme of the invention is further specifically optimized as follows: the data analysis module comprises a word frequency analysis unit, a semantic analysis unit, a patent analysis unit, a document analysis unit, an analysis result processing unit and a data mining unit. Wherein:
and the word frequency analysis unit is used for carrying out word frequency analysis on the patent data and importing the patent data subjected to the word frequency analysis into the cloud computing patent database.
And the semantic analysis unit is used for performing semantic analysis on the document data and importing the document data subjected to the semantic analysis into the cloud computing document database.
And the patent analysis unit is used for outputting a patent analysis report after analyzing the patent data.
And the document analysis unit is used for outputting a document analysis report after analyzing the document data.
And the analysis result processing unit is used for processing the patent analysis report and the literature analysis report to obtain the technical data range of the non-reiteratable patent formed by the applied patent and the published technology.
And the data mining unit is used for merging the technical data of the non-reissuable patent and the international patent classification data and mining the patentable technical data of the cloud computing.
The technical scheme of the invention is further specifically optimized as follows: and the result uploading module uploads the result by adopting a Hash database asymmetric encryption algorithm and uploads the result to the server. The server is set as a cloud server, and the cloud server is connected with the patent database through a network. And the cloud server outputs a patent technology navigation report through cloud computing.
Compared with the prior art, the invention has the advantages that:
advantage (1): according to the method, through analysis methods such as semantic analysis and word frequency analysis, patent and literature data of a cloud are automatically searched, a patent analysis report and a literature analysis report are obtained after processing, meanwhile, the cloud computing patentable technology data are merged with IPC data and mined, finally, a cloud computing patentable technology analysis report is obtained, and a user can obtain patentable navigation information from related reports. The whole navigation process realizes intellectualization, the intelligent system replaces manpower, the operation is relatively simple, the time and resources consumed by manpower are reduced, the processing speed is accelerated, the efficiency is improved, the statistical analysis result of the system is more accurate, and the error rate is reduced.
Advantage (2): apriori data mining is carried out on the keywords of the local patent file claim claims in the patent database file claim by using an Apriori data mining algorithm of a data mining calculation unit, and if Apriori data mining shows that the technology is infringed, an early warning reminding module in an early warning module sends patent early warning reminding information to the local patent file to remind a user of carrying out patent file retrieval for patent infringement behaviors, so that the user is reminded of carrying out technical modification to avoid infringement behaviors, and the functions of keyword deep retrieval and patent data mining early warning reminding of the patent technology are realized.
Additional aspects and advantages of the invention will be set forth in part in the description which follows and, in part, will be obvious from the description, or may be learned by practice of the invention.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, it is obvious that the drawings in the following description are only some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to the drawings without creative efforts.
FIG. 1 is a schematic structural diagram of the present invention.
Detailed Description
Exemplary embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings, in order that the present disclosure may be more fully understood and fully conveyed to those skilled in the art. While the exemplary embodiments of the present disclosure are shown in the drawings, it should be understood that the invention is not limited to the embodiments set forth herein.
A patent information prediction system based on data mining comprises a data screening module, a data mining module, a data analysis module and a result uploading module which are electrically connected in sequence. Wherein: the data screening module screens key data information associated with the keyword requirement from the massive big data, the data mining module performs data mining on the key data information based on a preset rule, the data analysis module analyzes a mining result to obtain a keyword requirement analysis result, and the result uploading module uploads the analysis result to the service platform to be displayed.
The data screening module comprises a keyword input unit, a patent retrieval unit, a patent screening unit, a patent file extraction unit, a patent classification unit, a technology capturing unit and a basic model unit. Wherein:
and the keyword input unit is used for inputting the key words of the target technology.
And the patent retrieval unit is used for retrieving the input key words in the patent database.
And the patent screening unit is used for screening the patents which accord with the patent review invalid information of the key words from the patent database.
And the patent file extracting unit is used for extracting the patent file of the patent review invalidation information from the database.
And the patent classification unit is used for classifying the extracted patent documents according to patent types.
And the technical grabbing unit is used for grabbing technical parts in various patent documents.
And the basic model unit is used for generating a data basic model diagram from the classified patent documents and the captured technical parts.
The data mining module comprises a keyword input unit and a data mining calculation unit. Wherein:
the keyword input unit comprises basic keyword input and advanced keyword input, wherein the basic keyword input is used for inputting patent keywords according to one of options of patent keywords, technical fields, application units, patent right units, inventor, application time and authorization time. The advanced keyword input selects various combinations according to patent keywords, technical fields, application units, patent right units, inventors, application time and authorization time options to input the patent keywords.
And a data mining calculation unit using Apriori data mining algorithm. In the Apriori data mining algorithm, the minimum support among basic keyword input options in the selected keyword input unit (1) is set to S, and S is 40%, the minimum confidence is set to P, and P is 80%. In the Apriori data mining algorithm, the minimum support among the high-level keyword input options in the selected keyword input unit (1) is set to S, and S is 50%, the minimum confidence is set to P, and P is 60%.
The data analysis module comprises a word frequency analysis unit, a semantic analysis unit, a patent analysis unit, a document analysis unit, an analysis result processing unit and a data mining unit. Wherein:
and the word frequency analysis unit is used for carrying out word frequency analysis on the patent data and importing the patent data subjected to the word frequency analysis into the cloud computing patent database.
And the semantic analysis unit is used for performing semantic analysis on the document data and importing the document data subjected to the semantic analysis into the cloud computing document database.
And the patent analysis unit is used for outputting a patent analysis report after analyzing the patent data.
And the document analysis unit is used for outputting a document analysis report after analyzing the document data.
And the analysis result processing unit is used for processing the patent analysis report and the literature analysis report to obtain the technical data range of the non-reiteratable patent formed by the applied patent and the published technology.
And the data mining unit is used for merging the technical data of the non-reissuable patent and the international patent classification data and mining the patentable technical data of the cloud computing.
And the result uploading module uploads the result by adopting a Hash database asymmetric encryption algorithm and uploads the result to the server. The server is set as a cloud server, and the cloud server is connected with the patent database through a network. And the cloud server outputs a patent technology navigation report through cloud computing.
In order to make the objects, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention are clearly and completely described above with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are a part of the embodiments of the present invention, but not all of the embodiments. The components of embodiments of the present invention generally described and illustrated in the figures herein may be arranged and designed in a wide variety of different configurations.
Thus, the above detailed description of the embodiments of the invention presented in the drawings is not intended to limit the scope of the invention as claimed, but is merely representative of selected embodiments of the invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

Claims (5)

1. A patent information prediction system based on data mining is characterized by comprising a data screening module, a data mining module, a data analysis module and a result uploading module which are electrically connected in sequence; wherein: the data screening module screens key data information associated with the keyword requirement from the massive big data, the data mining module performs data mining on the key data information based on a preset rule, the data analysis module analyzes a mining result to obtain a keyword requirement analysis result, and the result uploading module uploads the analysis result to the service platform to be displayed.
2. The patent information prediction system based on data mining of claim 1, characterized in that: the data screening module comprises a keyword input unit, a patent retrieval unit, a patent screening unit, a patent file extraction unit, a patent classification unit, a technology grabbing unit and a basic model unit; wherein:
a keyword input unit for inputting a keyword of a target technology;
the patent retrieval unit is used for retrieving the input key terms in the patent database;
the patent screening unit is used for screening out patents which accord with the patent review invalid information of the key words from the patent database;
the patent file extracting unit is used for extracting the patent file of the patent review invalid information from the database;
a patent classification unit for classifying the extracted patent documents according to patent types;
the technical grabbing unit is used for grabbing technical parts in various patent documents;
and the basic model unit is used for generating a data basic model diagram from the classified patent documents and the captured technical parts.
3. The patent information prediction system based on data mining of claim 1, characterized in that: the data mining module comprises a keyword input unit and a data mining calculation unit; wherein:
the key word input unit comprises basic key word input and advanced key word input, wherein the basic key word input is used for inputting the patent key words according to one of options of patent key words, technical fields, application units, patent right units, inventors, application time and authorization time; the advanced keyword input selects various combinations to input the patent keywords according to the options of the patent keywords, the technical field, the application units, the patent right units, the inventor, the application time and the authorization time;
a data mining calculation unit using Apriori data mining algorithm; in the Apriori data mining algorithm, the minimum support degree in basic keyword input options in a selected keyword input unit (1) is set to be S, wherein S is 40%, the minimum confidence degree is set to be P, and P is 80%; in the Apriori data mining algorithm, the minimum support among the high-level keyword input options in the selected keyword input unit (1) is set to S, and S is 50%, the minimum confidence is set to P, and P is 60%.
4. The patent information prediction system based on data mining of claim 1, characterized in that: the data analysis module comprises a word frequency analysis unit, a semantic analysis unit, a patent analysis unit, a document analysis unit, an analysis result processing unit and a data mining unit; wherein:
the word frequency analysis unit is used for carrying out word frequency analysis on the patent data and importing the patent data subjected to the word frequency analysis into a cloud computing patent database;
the semantic analysis unit is used for performing semantic analysis on the document data and importing the document data subjected to the semantic analysis into a cloud computing document database;
the patent analysis unit is used for outputting a patent analysis report after analyzing the patent data;
the document analysis unit is used for outputting a document analysis report after analyzing the document data;
the analysis result processing unit is used for processing the patent analysis report and the literature analysis report to obtain the technical data range of the non-reiteratable patent formed by the applied patent and the published technology;
and the data mining unit is used for merging the technical data of the non-reissuable patent and the international patent classification data and mining the patentable technical data of the cloud computing.
5. The patent information prediction system based on data mining of claim 1, characterized in that: the result uploading module uploads the result by adopting a Hash database asymmetric encryption algorithm and uploads the result to the server; the server is set as a cloud server, and the cloud server is connected with the patent database through a network; and the cloud server outputs a patent technology navigation report through cloud computing.
CN202011351495.4A 2020-11-26 2020-11-26 Patent information prediction system based on data mining Withdrawn CN112287074A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011351495.4A CN112287074A (en) 2020-11-26 2020-11-26 Patent information prediction system based on data mining

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011351495.4A CN112287074A (en) 2020-11-26 2020-11-26 Patent information prediction system based on data mining

Publications (1)

Publication Number Publication Date
CN112287074A true CN112287074A (en) 2021-01-29

Family

ID=74425519

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011351495.4A Withdrawn CN112287074A (en) 2020-11-26 2020-11-26 Patent information prediction system based on data mining

Country Status (1)

Country Link
CN (1) CN112287074A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116821319A (en) * 2023-08-30 2023-09-29 环球数科集团有限公司 Quick screening type processing system based on AIGC

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116821319A (en) * 2023-08-30 2023-09-29 环球数科集团有限公司 Quick screening type processing system based on AIGC
CN116821319B (en) * 2023-08-30 2023-10-27 环球数科集团有限公司 Quick screening type processing system based on AIGC

Similar Documents

Publication Publication Date Title
US9459950B2 (en) Leveraging user-to-tool interactions to automatically analyze defects in IT services delivery
US20150286896A1 (en) Image Analysis Device, Image Analysis System, and Image Analysis Method
WO2017097231A1 (en) Topic processing method and device
CN107872454B (en) Threat information monitoring and analyzing system and method for ultra-large Internet platform
JP5827208B2 (en) Document management system, document management method, and document management program
US10824915B2 (en) Artificial intelligence system for inspecting image reliability
EP3270303A1 (en) An automated monitoring and archiving system and method
CN111913860B (en) Operation behavior analysis method and device
CN108073681A (en) Retrieve device, search method and search program
CN106815605B (en) Data classification method and equipment based on machine learning
US20240176798A1 (en) Generating and presenting a searchable graph based on a graph query
CN107391684B (en) Method and system for generating threat information
JP6025487B2 (en) Forensic analysis system, forensic analysis method, and forensic analysis program
CN112287074A (en) Patent information prediction system based on data mining
WO2014084141A1 (en) Document management system, document management method, and document management program
EP4002152A1 (en) Data tagging and synchronisation system
CN112506930B (en) Data insight system based on machine learning technology
CN107562753B (en) Index word-based analysis method and device
US10296990B2 (en) Verifying compliance of a land parcel to an approved usage
CN107463570B (en) Document retrieval/analysis method and device
KR102676525B1 (en) Method for retreiving information related to policy using public data and apparauts thereof
CN112187768B (en) Method, device and equipment for detecting bad information website and readable storage medium
US20220269745A1 (en) System and Methods for Scrubbing Social Media Content
CN112084296A (en) Patent data mining system and method
CN114254081B (en) Enterprise big data search system, method and electronic equipment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
WW01 Invention patent application withdrawn after publication

Application publication date: 20210129

WW01 Invention patent application withdrawn after publication