CN111339123A - Double-retrieval patent database establishing method and device - Google Patents

Double-retrieval patent database establishing method and device Download PDF

Info

Publication number
CN111339123A
CN111339123A CN202010098419.0A CN202010098419A CN111339123A CN 111339123 A CN111339123 A CN 111339123A CN 202010098419 A CN202010098419 A CN 202010098419A CN 111339123 A CN111339123 A CN 111339123A
Authority
CN
China
Prior art keywords
database
keyword
patent database
patent document
obtaining
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
CN202010098419.0A
Other languages
Chinese (zh)
Inventor
邓梅
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Jiangsu Rainpat Data Service Co ltd
Original Assignee
Jiangsu Rainpat Data Service Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Jiangsu Rainpat Data Service Co ltd filed Critical Jiangsu Rainpat Data Service Co ltd
Priority to CN202010098419.0A priority Critical patent/CN111339123A/en
Publication of CN111339123A publication Critical patent/CN111339123A/en
Withdrawn legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/242Query formulation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/25Integrating or interfacing systems involving database management systems
    • G06F16/256Integrating or interfacing systems involving database management systems in federated or virtual databases
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/332Query formulation
    • G06F16/3322Query formulation using system suggestions
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/10Services
    • G06Q50/18Legal services
    • G06Q50/184Intellectual property management

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • General Physics & Mathematics (AREA)
  • Business, Economics & Management (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • Technology Law (AREA)
  • Mathematical Physics (AREA)
  • Computational Linguistics (AREA)
  • Tourism & Hospitality (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Operations Research (AREA)
  • Health & Medical Sciences (AREA)
  • Economics (AREA)
  • General Health & Medical Sciences (AREA)
  • Human Resources & Organizations (AREA)
  • Marketing (AREA)
  • Primary Health Care (AREA)
  • Strategic Management (AREA)
  • General Business, Economics & Management (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention provides a method and a device for establishing a patent database of double retrieval, which are used for acquiring a first patent document, wherein the first patent document has a first keyword; acquiring a first patent database from a patent retrieval data platform according to a first keyword; obtaining a second patent document, the first patent document and the second patent document both being target patent documents; judging whether the second patent document exists in the first patent database; when a second patent document exists, obtaining a second keyword according to the second patent document; acquiring a second patent database from the patent retrieval data platform according to the second keyword; and obtaining a third patent database according to the first patent database and the second patent database. The technical problems that the existing double search needs to be processed by manual operation, the process is complicated, and the search database is incomplete are solved. The technical effects of processing the double patents according to the retrieval requirements, realizing automatic establishment of double retrieval result databases and ensuring comprehensiveness of retrieval results are achieved.

Description

Double-retrieval patent database establishing method and device
Technical Field
The invention relates to the technical field of data processing, in particular to a method and a device for establishing a patent database with double retrieval.
Background
Patent document search is to search for patents and patent documents. Chinese Patent Retrieval System (CPRS): the patent retrieval and full text browsing system is only used in a local area network of the national intellectual property office. The system comprises: the full text of the data recorded in the three Chinese patents and the invention and the utility model since 1985; bibliographic data and full text descriptions of U.S. patents since 1975; the entire descriptions of the patents and utility models have been filed since 1993. The patent literature retrieval is the basic work that enterprises comprehensively know the prior art, improves the research and development starting point and avoids intellectual property risks. Because original patent data disclosed on the internet is incomplete, language is obscure, and the original patent data is long and difficult to understand, enterprises have difficulty in searching if professional searching methods and skills are not mastered. With the continuous development and improvement of social systems, the number of patent documents is rapidly increased, so that the protection of the patent rights of enterprises in various countries is more and more important. For an enterprise, how to accurately retrieve and analyze information meeting the needs of the enterprise from a large amount of patent documents is very important for the development of the whole enterprise. With the diversification of search requirements according to the difference in search requirements, it is sometimes necessary to search both patent documents together, not only for a single patent document but also for both patent documents.
However, the applicant of the present invention finds that the prior art has at least the following technical problems:
in the prior art, the processing required by double-patent retrieval is manual operation, and the technical problems of complex process, time and labor consumption, low integrity and effectiveness of the retrieval database exist.
Disclosure of Invention
The embodiment of the invention provides a method and a device for establishing a patent database for double retrieval, which solve the technical problems that in the prior art, the processing for double-patent retrieval needs manual operation, the process is complicated, the time and the labor are consumed, and the integrity and the effectiveness of the retrieval database are not high.
In view of the above problems, the present application is proposed to provide a patent database building method and apparatus with double search.
In a first aspect, the present invention provides a method for building a patent database with double search, where the method includes: obtaining a first patent document, wherein the first patent document has a first keyword; obtaining a first patent database from the patent retrieval data platform according to the first keyword; obtaining a second patent document, the first patent document and the second patent document both being target patent documents; determining whether the second patent document exists in the first patent database; when the second patent literature exists in the first patent database, obtaining a second keyword according to the second patent literature; obtaining a second patent database from the patent retrieval data platform according to the second keyword; and obtaining a third patent database according to the first patent database and the second patent database.
Preferably, the obtaining a third patent database according to the first patent database and the second patent database includes: judging whether the first keyword and the second keyword are words of the same type; and when the first keyword and the second keyword are words of the same type, obtaining a third patent database according to the first patent database and the second patent database.
Preferably, after the determining whether the first keyword and the second keyword are words of the same type, the method includes: when the first keyword and the second keyword are not words of the same type, obtaining the number of the same patents in the first patent database and the second patent database; judging whether the number of the same patents in the first patent database and the second patent database meets a preset condition or not; and when the number meets the preset condition, obtaining a third patent database according to the first patent database and the second patent database.
Preferably, when the number satisfies a predetermined condition, the method includes: the number is up to 50% or more of the first patent database, and/or the number is up to 50% or more of the second patent database.
Preferably, the same type means that keywords can be mutually replaced based on the first patent document and the second patent document in the patent data retrieval platform.
In a second aspect, the present invention provides a double-search patent database building apparatus, including:
a first obtaining unit configured to obtain a first patent document, wherein the first patent document has a first keyword;
a second obtaining unit, configured to obtain a first patent database from the patent retrieval data platform according to the first keyword;
a third obtaining unit configured to obtain a second patent document, the first patent document and the second patent document both being target patent documents;
a first judgment unit configured to judge whether the second patent document exists in the first patent database;
a fourth obtaining unit configured to obtain a second keyword from the second patent document when the second patent document exists in the first patent database;
a fifth obtaining unit, configured to obtain a second patent database from the patent retrieval data platform according to the second keyword;
a sixth obtaining unit, configured to obtain a third patent database according to the first patent database and the second patent database.
Preferably, the apparatus further comprises:
a second judging unit, configured to judge whether the first keyword and the second keyword are words of the same type;
a seventh obtaining unit configured to obtain a third patent database from the first patent database and the second patent database when the first keyword and the second keyword are words of the same type.
Preferably, the apparatus further comprises:
an eighth obtaining unit, configured to obtain, when the first keyword and the second keyword are not words of the same type, the number of patents in the first patent database and the number of patents in the second patent database that are the same;
a third judging unit, configured to judge whether the same number of patents in the first patent database and the second patent database satisfies a predetermined condition;
a ninth obtaining unit configured to obtain a third patent database from the first patent database and the second patent database when the number satisfies the predetermined condition.
Preferably, the number is up to 50% or more of the first patent database, and/or the number is up to 50% or more of the second patent database.
Preferably, the same type means that keywords can be mutually replaced based on the first patent document and the second patent document in the patent data retrieval platform.
In a third aspect, the present invention provides a double-search patent database building apparatus, including a memory, a processor, and a computer program stored in the memory and executable on the processor, where the processor implements the steps of any one of the above methods when executing the program.
In a fourth aspect, the invention provides a computer-readable storage medium having stored thereon a computer program which, when executed by a processor, performs the steps of any of the methods described above.
One or more technical solutions in the embodiments of the present application have at least one or more of the following technical effects:
according to the patent database establishing method and device for double retrieval provided by the embodiment of the invention, a first patent document is obtained, wherein the first patent document has a first keyword; obtaining a first patent database from the patent retrieval data platform according to the first keyword; obtaining a second patent document, the first patent document and the second patent document both being target patent documents; determining whether the second patent document exists in the first patent database; when the second patent literature exists in the first patent database, obtaining a second keyword according to the second patent literature; obtaining a second patent database from the patent retrieval data platform according to the second keyword; and obtaining a third patent database according to the first patent database and the second patent database. The technical effects of processing the double patents according to the retrieval requirements, realizing automatic establishment of a double retrieval result database, ensuring comprehensiveness of the retrieval results and improving the retrieval efficiency are achieved. Therefore, the technical problems that in the prior art, the processing required by double-patent retrieval is manual operation, the process is complicated, time and labor are consumed, and the integrity and the effectiveness of the retrieval database are not high are solved.
The foregoing description is only an overview of the technical solutions of the present invention, and the embodiments of the present invention are described below in order to make the technical means of the present invention more clearly understood and to make the above and other objects, features, and advantages of the present invention more clearly understandable.
Drawings
FIG. 1 is a schematic flow chart illustrating a method for building a patent database with double search according to an embodiment of the present invention;
FIG. 2 is a schematic structural diagram of a patent database building apparatus for double search according to an embodiment of the present invention;
fig. 3 is a schematic structural diagram of another double-search patent database creation apparatus according to an embodiment of the present invention.
Description of reference numerals: a first obtaining unit 11, a second obtaining unit 12, a third obtaining unit 13, a first judging unit 14, a fourth obtaining unit 15, a fifth obtaining unit 16, a sixth obtaining unit 17, a bus 300, a receiver 301, a processor 302, a transmitter 303, a memory 304, and a bus interface 306.
Detailed Description
The embodiment of the invention provides a method and a device for establishing a patent database for double retrieval, which are used for solving the technical problems that in the prior art, the processing for double-patent retrieval needs is manual operation, the process is complicated, the time and the labor are consumed, and the integrity and the effectiveness of the retrieval database are not high.
The technical scheme provided by the invention has the following general idea:
obtaining a first patent document, wherein the first patent document has a first keyword; obtaining a first patent database from the patent retrieval data platform according to the first keyword; obtaining a second patent document, the first patent document and the second patent document both being target patent documents; determining whether the second patent document exists in the first patent database; when the second patent literature exists in the first patent database, obtaining a second keyword according to the second patent literature; obtaining a second patent database from the patent retrieval data platform according to the second keyword; and obtaining a third patent database according to the first patent database and the second patent database. The technical effects of processing the double patents according to the retrieval requirements, realizing automatic establishment of a double retrieval result database, ensuring comprehensiveness of the retrieval results and improving the retrieval efficiency are achieved.
The technical solutions of the present invention are described in detail below with reference to the drawings and specific embodiments, and it should be understood that the specific features in the embodiments and examples of the present invention are described in detail in the technical solutions of the present application, and are not limited to the technical solutions of the present application, and the technical features in the embodiments and examples of the present application may be combined with each other without conflict.
The term "and/or" herein is merely an association describing an associated object, meaning that three relationships may exist, e.g., a and/or B, may mean: a exists alone, A and B exist simultaneously, and B exists alone. In addition, the character "/" herein generally indicates that the former and latter related objects are in an "or" relationship.
Example one
Fig. 1 is a schematic flow chart of a method for establishing a patent database with double search according to an embodiment of the present invention. As shown in fig. 1, an embodiment of the present invention provides a method for building a patent database with double search, where the method includes:
step 110: a first patent document is obtained, wherein the first patent document has a first keyword.
Specifically, the first patent document is a target patent document, that is, a patent document to be searched, and a keyword thereof is determined by analyzing the content, title, and classification number of the first patent document, and the keyword is an information content input by a user when using a search engine, and can maximally summarize the information content that the user desires to search for. Keywords that are spoken in the search engine optimization SEO industry often refer to the core and main content of a web page. For a search engine, your web page is mainly about what aspect is that aspect can be attributed one (more often multiple) keyword. For a patent document, a keyword is a core description content of a patent document description subject, and a word capable of locating a main body and important invention content of the patent document.
Step 120: and obtaining a first patent database from the patent retrieval data platform according to the first keyword.
Specifically, after determining keywords of a first patent document, the keywords are used for searching a patent document database, the existing mature patent search data platform is mainly used for searching, such as enterprise search, patent office web pages and the like, the determined keywords are recorded through linking the patent search data platform, and a corresponding keyword patent document database is obtained, wherein the patent document database is a first patent database.
Step 130: a second patent document is obtained, both of which are the target patent documents.
Specifically, the second patent document is also a target patent document, that is, a patent document requiring a search, as in the first patent document, that is, the present embodiment has a double search request, and it is necessary to perform search processing on both the first patent document and the second patent document, and satisfy the search request concerning both the patent documents.
Step 140: determining whether the second patent document exists in the first patent database.
Specifically, the first patent database is obtained after the search processing of the first patent document, and the first patent database is a set of all patent documents which are obtained by searching the patent data platform through the first keyword of the first patent document and meet the requirement of the first keyword, namely are related to the first keyword. Whether the second patent document is in the first patent database or not, that is, whether the second patent document and the first patent document have a certain correlation or not is determined. The determination of whether the second patent document exists in the first patent database may be performed by searching using the title and/or the patentee information of the second patent document to determine whether the second patent document exists in the first patent document.
Step 150: and when the second patent document exists in the first patent database, obtaining a second keyword according to the second patent document.
Specifically, if the second patent document exists in the first patent database, it is described that the second patent document is related to the first patent document, and at least the second patent document has the same first keyword as the first patent document. In this case, the second keyword of the second patent document is determined according to the content, title, classification number information, and the like of the second patent document, the second keyword is a core word of the second patent document, and is a word capable of expressing the core content of the second patent document, and the second keyword may be the same as or different from the first keyword. And if the second keyword is the same as the first keyword, determining that the first patent database is a final patent search result, and if the second keyword is different from the first keyword, further performing search processing on the second keyword. In addition, in some patents, there may be a plurality of keywords, and the keywords may be further analyzed to determine the keywords that better meet the content of the invention of the second patent document as the second keywords, or the keywords may be searched separately. Of course, if there are a plurality of keywords, in which case the keywords are the same as the first keyword, the keywords may be removed and the search may be performed using another keyword other than the same keyword.
Step 160: and obtaining a second patent database from the patent retrieval data platform according to the second keyword.
Specifically, since the second patent database is obtained by performing data search again using the second keyword, which is the keyword of the second patent document, by using the patent search data platform, the second patent database is a collection of all patent documents including the second keyword in the patent documents that have been published in the past.
Step 170: and obtaining a third patent database according to the first patent database and the second patent database.
Specifically, a first patent database and a second patent database are obtained through analysis and search of a first patent document and analysis and search of a second patent document, in order to meet the double search requirements of the two patent documents, the two databases are merged, namely, the first patent database and the second patent database are merged to obtain a third patent database, the third patent database comprises all search results of the first patent document and the second patent document, and needless to say, repeated patents are possibly existed, the repeated patents are merged, so that the technical effects of processing the double patents according to the search requirements respectively, realizing automatic establishment of the double search result databases, ensuring the comprehensiveness of the search results and improving the search efficiency are achieved. Therefore, the technical problems that in the prior art, manual operation is needed to be carried out for double retrieval, the process is complicated, time and labor are consumed, and a retrieval database is incomplete are solved.
Further, the obtaining a third patent database according to the first patent database and the second patent database includes: judging whether the first keyword and the second keyword are words of the same type; and when the first keyword and the second keyword are words of the same type, obtaining a third patent database according to the first patent database and the second patent database.
Further, the same type means that keywords can be mutually replaced based on the first patent document and the second patent document in the patent data retrieval platform.
Specifically, when the first patent database and the second patent database are merged, firstly, whether the first keyword and the second keyword are words of the same type is judged, the same type judgment can be performed from multiple aspects such as the attribute, the word meaning and the used context of the first keyword, for example, the ratio given to the parameter is used for scoring, the similarity value is obtained through weighting calculation, then, the comparison is performed according to the set similarity threshold, and if the requirements are met, the first keyword and the second keyword are determined to be words of the same type. And when the first keyword and the second keyword meet the requirements of the same type of words, merging the first patent database and the second patent database, combining the first patent database and the second patent database and deleting the words repeatedly to obtain a finally determined retrieval database. The method realizes effective processing of double-patent retrieval results and ensures the integrity and the effectiveness of the retrieval database.
Further, after the determining whether the first keyword and the second keyword are words of the same type, the method includes: when the first keyword and the second keyword are not words of the same type, obtaining the number of the same patents in the first patent database and the second patent database; judging whether the number of the same patents in the first patent database and the second patent database meets a preset condition or not; and when the number meets the preset condition, obtaining a third patent database according to the first patent database and the second patent database.
Further, when the number meets a predetermined condition, the method includes: the number is up to 50% or more of the first patent database, and/or the number is up to 50% or more of the second patent database.
Specifically, if it is determined that the first keyword and the second keyword do not belong to the same type of word, that is, the first keyword and the second keyword cannot be replaced, the patents in the first patent database and the second patent database are further analyzed, the determination is performed according to the number of the same patents in the first patent database and the second patent database as a standard, when the number of the repeated patents in the first patent database and the second patent database is large, the content relevance of the two patents is also large, and when the number requirement is met, the first patent database and the second patent database are combined to obtain a finally determined search result, that is, a third patent database. If the number of the repeated patents in the first patent database and the second patent database is not large or is not large, the correlation between the first patent database and the second patent database is low or not correlated, and the correlation between the corresponding first patent document and the corresponding second patent document is also low or not correlated, at this time, the two patent databases cannot be merged, and the value of retrieval reference is not available. When the similarity of the number of the patent databases is judged, the number of the repeated patents is at least 50% of the number of the first patent database or/and the number of the repeated patents in the second patent database, and of course, the number of the repeated patents can be adjusted as required according to the reference requirement of the retrieval data result to ensure the effectiveness of the retrieval database.
Example two
Based on the same inventive concept as the double-search patent database establishing method in the foregoing embodiment, the present invention further provides a double-search patent database establishing method and apparatus, as shown in fig. 2, the apparatus includes:
a first obtaining unit 11, the first obtaining unit 11 being configured to obtain a first patent document, wherein the first patent document has a first keyword;
a second obtaining unit 12, where the second obtaining unit 12 is configured to obtain a first patent database from the patent retrieval data platform according to the first keyword;
a third obtaining unit 13, the third obtaining unit 13 being configured to obtain a second patent document, the first patent document and the second patent document both being target patent documents;
a first judgment unit 14, the first judgment unit 14 being configured to judge whether the second patent document exists in the first patent database;
a fourth obtaining unit 15, wherein the fourth obtaining unit 15 is configured to obtain a second keyword according to the second patent document when the second patent document exists in the first patent database;
a fifth obtaining unit 16, where the fifth obtaining unit 16 is configured to obtain a second patent database from the patent retrieval data platform according to the second keyword;
a sixth obtaining unit 17, where the sixth obtaining unit 17 is configured to obtain a third patent database according to the first patent database and the second patent database.
Further, the apparatus further comprises:
a second judging unit, configured to judge whether the first keyword and the second keyword are words of the same type;
a seventh obtaining unit configured to obtain a third patent database from the first patent database and the second patent database when the first keyword and the second keyword are words of the same type.
Further, the apparatus further comprises:
an eighth obtaining unit, configured to obtain, when the first keyword and the second keyword are not words of the same type, the number of patents in the first patent database and the number of patents in the second patent database that are the same;
a third judging unit, configured to judge whether the same number of patents in the first patent database and the second patent database satisfies a predetermined condition;
a ninth obtaining unit configured to obtain a third patent database from the first patent database and the second patent database when the number satisfies the predetermined condition.
Further, the number is up to 50% or more of the first patent database, and/or the number is up to 50% or more of the second patent database.
Further, the same type means that keywords can be mutually replaced based on the first patent document and the second patent document in the patent data retrieval platform.
Various modifications and specific examples of the double-search patent database creation method in the first embodiment of fig. 1 are also applicable to the double-search patent database creation apparatus in this embodiment, and a person skilled in the art can clearly know the implementation method of the double-search patent database creation apparatus in this embodiment through the foregoing detailed description of the double-search patent database creation method, so for the brevity of the description, detailed descriptions are omitted here.
EXAMPLE III
Based on the same inventive concept as the double-search patent database building method in the foregoing embodiment, the present invention further provides a double-search patent database building apparatus, as shown in fig. 3, including a memory 304, a processor 302, and a computer program stored in the memory 304 and executable on the processor 302, wherein the processor 302 implements the steps of any one of the double-search patent database building methods described above when executing the program.
Where in fig. 3 a bus architecture (represented by bus 300), bus 300 may include any number of interconnected buses and bridges, bus 300 linking together various circuits including one or more processors, represented by processor 302, and memory, represented by memory 304. The bus 300 may also link together various other circuits such as peripherals, voltage regulators, power management circuits, and the like, which are well known in the art, and therefore, will not be described any further herein. A bus interface 306 provides an interface between the bus 300 and the receiver 301 and transmitter 303. The receiver 301 and the transmitter 303 may be the same element, i.e., a transceiver, providing a means for communicating with various other apparatus over a transmission medium. The processor 302 is responsible for managing the bus 300 and general processing, and the memory 304 may be used for storing data used by the processor 302 in performing operations.
Example four
Based on the same inventive concept as the double-search patent database building method in the foregoing embodiment, the present invention also provides a computer-readable storage medium having a computer program stored thereon, which when executed by a processor implements the following steps: obtaining a first patent document, wherein the first patent document has a first keyword; obtaining a first patent database from the patent retrieval data platform according to the first keyword; obtaining a second patent document, the first patent document and the second patent document both being target patent documents; determining whether the second patent document exists in the first patent database; when the second patent literature exists in the first patent database, obtaining a second keyword according to the second patent literature; obtaining a second patent database from the patent retrieval data platform according to the second keyword; and obtaining a third patent database according to the first patent database and the second patent database.
In a specific implementation, when the program is executed by a processor, any method step in the first embodiment may be further implemented.
One or more technical solutions in the embodiments of the present application have at least one or more of the following technical effects:
according to the patent database establishing method and device for double retrieval provided by the embodiment of the invention, a first patent document is obtained, wherein the first patent document has a first keyword; obtaining a first patent database from the patent retrieval data platform according to the first keyword; obtaining a second patent document, the first patent document and the second patent document both being target patent documents; determining whether the second patent document exists in the first patent database; when the second patent literature exists in the first patent database, obtaining a second keyword according to the second patent literature; obtaining a second patent database from the patent retrieval data platform according to the second keyword; and obtaining a third patent database according to the first patent database and the second patent database. The technical effects of processing the double patents according to the retrieval requirements, realizing automatic establishment of a double retrieval result database, ensuring comprehensiveness of the retrieval results and improving the retrieval efficiency are achieved. Therefore, the technical problems that in the prior art, the processing required by double-patent retrieval is manual operation, the process is complicated, time and labor are consumed, and the integrity and the effectiveness of the retrieval database are not high are solved.
As will be appreciated by one skilled in the art, embodiments of the present invention may be provided as a method, system, or computer program product. Accordingly, the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, the present invention may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.
The present invention is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems) and computer program products according to embodiments of the invention. It will be understood that each flow and/or block of the flow diagrams and/or block diagrams, and combinations of flows and/or blocks in the flow diagrams and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
It will be apparent to those skilled in the art that various changes and modifications may be made in the present invention without departing from the spirit and scope of the invention. Thus, if such modifications and variations of the present invention fall within the scope of the claims of the present invention and their equivalents, the present invention is also intended to include such modifications and variations.

Claims (9)

1. A double-retrieval patent database building method is characterized by comprising the following steps:
obtaining a first patent document, wherein the first patent document has a first keyword;
obtaining a first patent database from the patent retrieval data platform according to the first keyword;
obtaining a second patent document, the first patent document and the second patent document both being target patent documents;
determining whether the second patent document exists in the first patent database;
when the second patent literature exists in the first patent database, obtaining a second keyword according to the second patent literature;
obtaining a second patent database from the patent retrieval data platform according to the second keyword;
and obtaining a third patent database according to the first patent database and the second patent database.
2. The method of claim 1, wherein said obtaining a third patent database from the first patent database and the second patent database comprises:
judging whether the first keyword and the second keyword are words of the same type;
and when the first keyword and the second keyword are words of the same type, obtaining a third patent database according to the first patent database and the second patent database.
3. The method of claim 2, wherein said determining whether the first keyword and the second keyword are of a same type of word comprises:
when the first keyword and the second keyword are not words of the same type, obtaining the number of the same patents in the first patent database and the second patent database;
judging whether the number of the same patents in the first patent database and the second patent database meets a preset condition or not;
and when the number meets the preset condition, obtaining a third patent database according to the first patent database and the second patent database.
4. The method of claim 3, wherein when the number satisfies a predetermined condition, comprising:
the number is up to 50% or more of the first patent database, and/or the number is up to 50% or more of the second patent database.
5. The method according to claim 2, wherein the same type refers to that keywords can be mutually replaced based on the first patent document and the second patent document in the patent data retrieval platform.
6. A double-search patent database building apparatus, comprising:
a first obtaining unit configured to obtain a first patent document, wherein the first patent document has a first keyword;
a second obtaining unit, configured to obtain a first patent database from the patent retrieval data platform according to the first keyword;
a third obtaining unit configured to obtain a second patent document, the first patent document and the second patent document both being target patent documents;
a first judgment unit configured to judge whether the second patent document exists in the first patent database;
a fourth obtaining unit configured to obtain a second keyword from the second patent document when the second patent document exists in the first patent database;
a fifth obtaining unit, configured to obtain a second patent database from the patent retrieval data platform according to the second keyword;
a sixth obtaining unit, configured to obtain a third patent database according to the first patent database and the second patent database.
7. The apparatus of claim 6, wherein the apparatus further comprises:
a second judging unit, configured to judge whether the first keyword and the second keyword are words of the same type;
a seventh obtaining unit configured to obtain a third patent database from the first patent database and the second patent database when the first keyword and the second keyword are words of the same type.
8. A dual-search patent database building apparatus comprising a memory, a processor and a computer program stored on the memory and executable on the processor, wherein the processor implements the steps of the method of any one of claims 1-5 when executing the program.
9. A computer-readable storage medium, on which a computer program is stored which, when being executed by a processor, carries out the steps of the method according to any one of claims 1 to 5.
CN202010098419.0A 2020-02-18 2020-02-18 Double-retrieval patent database establishing method and device Withdrawn CN111339123A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010098419.0A CN111339123A (en) 2020-02-18 2020-02-18 Double-retrieval patent database establishing method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010098419.0A CN111339123A (en) 2020-02-18 2020-02-18 Double-retrieval patent database establishing method and device

Publications (1)

Publication Number Publication Date
CN111339123A true CN111339123A (en) 2020-06-26

Family

ID=71181708

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010098419.0A Withdrawn CN111339123A (en) 2020-02-18 2020-02-18 Double-retrieval patent database establishing method and device

Country Status (1)

Country Link
CN (1) CN111339123A (en)

Similar Documents

Publication Publication Date Title
CN106649260B (en) Product characteristic structure tree construction method based on comment text mining
WO2020164276A1 (en) Webpage data crawling method, apparatus and system, and computer-readable storage medium
CN112988969A (en) Method, device, equipment and storage medium for text retrieval
CN111522905A (en) Document searching method and device based on database
CN113190687B (en) Knowledge graph determining method and device, computer equipment and storage medium
CN112883030A (en) Data collection method and device, computer equipment and storage medium
CN105373546A (en) Information processing method and system for knowledge services
CN110321446A (en) Related data recommended method, device, computer equipment and storage medium
CN108388556B (en) Method and system for mining homogeneous entity
CN112115252A (en) Intelligent auxiliary writing processing method and device, electronic equipment and storage medium
CN109388690A (en) Text searching method, inverted list generation method and system for text retrieval
CN111444312A (en) Method and device for multi-platform combined patent retrieval
CN111339123A (en) Double-retrieval patent database establishing method and device
CN111274364A (en) Automatic denoising method and device based on keyword retrieval data
CN113407678B (en) Knowledge graph construction method, device and equipment
CN114780700A (en) Intelligent question-answering method, device, equipment and medium based on machine reading understanding
CN113868510A (en) Data processing method and device and computer readable storage medium
CN111368055A (en) Retrieval method and device for patent database combined enterprise information platform
CN113360517A (en) Data processing method and device, electronic equipment and storage medium
CN112989163A (en) Vertical search method and system
US7657417B2 (en) Method, system and machine readable medium for publishing documents using an ontological modeling system
CN111353023A (en) Target database optimization method and device based on keyword retrieval
CN111309895A (en) Automatic denoising method and device for retrieval data
CN111324726A (en) Method and device for automatically drying patent database
CN111368062A (en) Verification method and device for denoising patent retrieval database

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
WW01 Invention patent application withdrawn after publication

Application publication date: 20200626

WW01 Invention patent application withdrawn after publication