CN111339127A - Patent database processing method and device for continuous iterative retrieval - Google Patents

Patent database processing method and device for continuous iterative retrieval

Info

Publication number
CN111339127A
Authority
CN
China
Prior art keywords
keyword
patent database
database
obtaining
meets
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
CN202010131699.0A
Other languages
Chinese (zh)
Inventor
邓梅
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Jiangsu Rainpat Data Service Co ltd
Original Assignee
Jiangsu Rainpat Data Service Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Jiangsu Rainpat Data Service Co ltd
Priority to CN202010131699.0A
Publication of CN111339127A

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24 Querying
    • G06F16/242 Query formulation
    • G06F16/2425 Iterative querying; Query formulation based on the results of a preceding query
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24 Querying
    • G06F16/245 Query processing
    • G06F16/2452 Query translation
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06Q INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00 Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/10 Services
    • G06Q50/18 Legal services
    • G06Q50/184 Intellectual property management

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Technology Law (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Mathematical Physics (AREA)
  • Tourism & Hospitality (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Operations Research (AREA)
  • Health & Medical Sciences (AREA)
  • Economics (AREA)
  • General Health & Medical Sciences (AREA)
  • Human Resources & Organizations (AREA)
  • Marketing (AREA)
  • Primary Health Care (AREA)
  • Strategic Management (AREA)
  • General Business, Economics & Management (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention provides a patent database processing method and device for continuous iterative retrieval. The method comprises the following steps: obtaining a first patent document, wherein the first patent document has a first keyword; obtaining a first patent database from a patent retrieval data platform according to the first keyword; obtaining a second keyword from the first patent database, wherein the second keyword and the first keyword are different words and have a first relationship; judging whether the first relationship meets a predetermined condition; when the predetermined condition is met, obtaining a second patent database from the patent retrieval data platform according to the second keyword, wherein the second patent database comprises patent documents having the second keyword; and obtaining a third patent database according to the second patent database and the first patent database. Iterative retrieval thereby provides the user with patent documents whose research direction is the same as and/or similar to that of the keyword, making the retrieval result more comprehensive.

Description

Patent database processing method and device for continuous iterative retrieval
Technical Field
The invention relates to the technical field of data retrieval, in particular to a patent database processing method and device for continuous iterative retrieval.
Background
Patents play an increasingly prominent role in research and development. By searching a patent database, research and development personnel can find patents related to their own research direction. Analyzing and studying these patents lets them learn new technology, avoid repeating existing work, and choose a direction for later research that avoids infringement.
However, the applicant of the present invention has found that the prior art has at least the following technical problem:
In the prior art, a patent search performed on a patent search platform with a given keyword as the search condition returns only patent documents containing that keyword. Patent documents containing closely related keywords are not returned, so a more comprehensive patent search service cannot be provided to the user.
Disclosure of Invention
The embodiments of the invention provide a patent database processing method and device for continuous iterative retrieval. They solve the technical problem in the prior art that a keyword search on a patent data retrieval platform returns only patent documents containing that keyword, so that the retrieval result is narrow, and they achieve the technical effect of using iterative retrieval to provide the user with patent documents whose research direction is the same as and/or similar to that of the keyword, so that the retrieval result is more comprehensive.
In view of the above problems, the present application is proposed to provide a patent database processing method and apparatus for continuous iterative search.
In a first aspect, the present invention provides a patent database processing method for continuous iterative search, including: obtaining a first patent document having a first keyword; obtaining a first patent database from a patent retrieval data platform according to the first keyword, wherein the first patent database comprises patent documents with the first keyword; obtaining a second keyword from the first patent database, wherein the second keyword and the first keyword are different words; and the first keyword and the second keyword have a first relationship; judging whether the first relation meets a preset condition or not; when a preset condition is met, obtaining a second patent database from the patent retrieval data platform according to the second keyword, wherein the second patent database comprises a patent document with the second keyword; and obtaining a third patent database according to the second patent database and the first patent database.
Preferably, the method comprises: obtaining a keyword list from the first patent database, wherein the keyword list comprises the first keyword and the second keyword and ranks keywords by their frequency of occurrence in the first patent database; the first keyword and the second keyword are keywords with similar occurrence frequencies.
Preferably, the method comprises: obtaining a third keyword from the keyword list, wherein the third keyword is different from the first keyword and the second keyword; obtaining a fourth patent database according to the third keyword and the first patent database; obtaining a fifth patent database according to the third keyword and the second patent database; judging the number of patent documents in the fourth patent database and/or the fifth patent database; and when the number meets a predetermined condition, obtaining a third patent database according to the fourth patent database and the fifth patent database.
Preferably, judging the number of patent documents in the fourth patent database and/or the fifth patent database specifically includes: judging whether the third number is smaller than the first number; when the third number is smaller than the first number, determining that the third number meets the requirement; judging whether the fourth number is smaller than the second number; when the fourth number is smaller than the second number, determining that the fourth number meets the requirement; and when at least one of the third number and the fourth number meets the requirement, determining that the number of patent documents in the fourth patent database and/or the fifth patent database meets the predetermined condition. Here, the first number is the number of patent documents in the first patent database; the second number is the number of patent documents in the second patent database; the third number is the number of patent documents in the fourth patent database; and the fourth number is the number of patent documents in the fifth patent database.
In a second aspect, the present invention provides a patent database processing apparatus for continuous iterative search, the apparatus comprising:
a first obtaining unit configured to obtain a first patent document having a first keyword;
a second obtaining unit, configured to obtain a first patent database from a patent retrieval data platform according to the first keyword, where the first patent database includes a patent document with the first keyword;
a third obtaining unit, configured to obtain a second keyword from the first patent database, where the second keyword and the first keyword are different words; and the first keyword and the second keyword have a first relationship;
a first judging unit, configured to judge whether the first relationship satisfies a predetermined condition;
a fourth obtaining unit, configured to obtain a second patent database from the patent retrieval data platform according to the second keyword when a predetermined condition is satisfied, where the second patent database includes a patent document with the second keyword;
and a fifth obtaining unit, configured to obtain a third patent database according to the second patent database and the first patent database.
Preferably, the apparatus further comprises:
a sixth obtaining unit, configured to obtain a keyword list from the first patent database, where the keyword list includes the first keyword and the second keyword and ranks keywords by their frequency of occurrence in the first patent database; the first keyword and the second keyword are keywords with similar occurrence frequencies.
Preferably, the apparatus further comprises:
a seventh obtaining unit, configured to obtain a third keyword from the keyword list, where the third keyword is different from the first keyword and the second keyword;
an eighth obtaining unit, configured to obtain a fourth patent database according to the third keyword and the first patent database;
a ninth obtaining unit, configured to obtain a fifth patent database according to the third keyword and the second patent database;
a second judging unit, configured to judge the number of patent documents in the fourth patent database and/or the fifth patent database;
a tenth obtaining unit, configured to obtain a third patent database according to the fourth patent database and the fifth patent database when the number satisfies a predetermined condition.
Preferably, the second judging unit specifically includes:
a third judging unit, configured to judge whether the third number is smaller than the first number;
a first determination unit, configured to determine that the third number meets the requirement when the third number is smaller than the first number;
a fourth judging unit, configured to judge whether the fourth number is smaller than the second number;
a second determination unit, configured to determine that the fourth number meets the requirement when the fourth number is smaller than the second number;
a third determination unit, configured to determine that the number of patent documents in the fourth patent database and/or the fifth patent database meets the predetermined condition when at least one of the third number and the fourth number meets the requirement.
Wherein the first number is the number of patent documents in the first patent database;
the second number is the number of patent documents in the second patent database;
the third number is the number of patent documents in the fourth patent database;
the fourth number is the number of patent documents in the fifth patent database.
In a third aspect, the present invention provides a patent database processing apparatus for continuous iterative search, including a memory, a processor, and a computer program stored in the memory and executable on the processor, wherein the processor implements the steps of any one of the above methods when executing the computer program.
In a fourth aspect, the invention provides a computer-readable storage medium having stored thereon a computer program which, when executed by a processor, performs the steps of any of the methods described above.
One or more technical solutions in the embodiments of the present application have at least one or more of the following technical effects:
The embodiments of the invention provide a patent database processing method and device for continuous iterative retrieval, wherein the method comprises the following steps. A first patent document having a first keyword is obtained, which identifies the research direction the user is interested in. A first patent database is obtained from a patent retrieval data platform according to the first keyword, wherein the first patent database comprises patent documents having the first keyword; this provides a base database. A second keyword is obtained from the first patent database, wherein the second keyword and the first keyword are different words and have a first relationship; this yields a second keyword similar to the first keyword and a basis for iterative retrieval. Whether the first relationship meets a predetermined condition is judged, which checks whether the second keyword is suitable and improves the accuracy of the iterative retrieval. When the predetermined condition is met, a second patent database is obtained from the patent retrieval data platform according to the second keyword, wherein the second patent database comprises patent documents having the second keyword; the second database is thus obtained iteratively. A third patent database is obtained according to the second patent database and the first patent database, which expands the first database. The method thereby uses iterative retrieval to provide the user with patent documents whose research direction is the same as and/or similar to that of the keyword, so that the retrieval result is more comprehensive.
The foregoing is only an overview of the technical solutions of the present invention. Specific embodiments are described below so that the technical means of the invention can be understood more clearly and its above and other objects, features, and advantages become more apparent.
Drawings
FIG. 1 is a schematic flow chart illustrating a patent database processing method for continuous iterative search according to an embodiment of the present invention;
FIG. 2 is a schematic structural diagram of a patent database processing apparatus for continuous iterative search according to an embodiment of the present invention;
FIG. 3 is a schematic structural diagram of another patent database processing apparatus for continuous iterative search according to an embodiment of the present invention.
Description of reference numerals: a first obtaining unit 11, a second obtaining unit 12, a third obtaining unit 13, a first judging unit 14, a fourth obtaining unit 15, a fifth obtaining unit 16, a bus 300, a receiver 301, a processor 302, a transmitter 303, a memory 304, and a bus interface 306.
Detailed Description
The embodiments of the invention provide a patent database processing method and device for continuous iterative retrieval. They solve the technical problem in the prior art that a keyword search on a patent data retrieval platform returns only patent documents containing that keyword, so that the retrieval result is narrow, and they achieve the technical effect of using iterative retrieval to provide the user with patent documents whose research direction is the same as and/or similar to that of the keyword, so that the retrieval result is more comprehensive.
The general idea of the technical solution provided by the invention is as follows: obtaining a first patent document having a first keyword; obtaining a first patent database from a patent retrieval data platform according to the first keyword, wherein the first patent database comprises patent documents having the first keyword; obtaining a second keyword from the first patent database, wherein the second keyword and the first keyword are different words and have a first relationship; judging whether the first relationship meets a predetermined condition; when the predetermined condition is met, obtaining a second patent database from the patent retrieval data platform according to the second keyword, wherein the second patent database comprises patent documents having the second keyword; and obtaining a third patent database according to the second patent database and the first patent database. This solves the prior-art problem that a keyword search on a patent data retrieval platform returns only patent documents containing that keyword, and it uses iterative retrieval to provide the user with patent documents whose research direction is the same as and/or similar to that of the keyword, so that the retrieval result is more comprehensive.
The technical solutions of the present invention are described in detail below with reference to the drawings and specific embodiments. It should be understood that the specific features in the embodiments and examples are detailed explanations of the technical solutions of the present application rather than limitations on them, and that the technical features in the embodiments and examples may be combined with each other where they do not conflict.
The term "and/or" herein merely describes an association between the associated objects and means that three relationships may exist. For example, "A and/or B" may mean: A exists alone, A and B exist simultaneously, or B exists alone. In addition, the character "/" herein generally indicates that the objects before and after it are in an "or" relationship.
Example one
Fig. 1 is a schematic flow chart of a patent database processing method for continuous iterative search according to an embodiment of the present invention. As shown in fig. 1, an embodiment of the present invention provides a patent database processing method for continuous iterative search, where the method includes:
Step 110: obtaining a first patent document having a first keyword;
Specifically, the first patent document is a patent document closely related to the user's research direction. It contains a plurality of keywords, and the first keyword is the one that best reflects the research direction and research content of the first patent document.
Step 120: obtaining a first patent database from a patent retrieval data platform according to the first keyword, wherein the first patent database comprises patent documents with the first keyword;
Specifically, the patent retrieval data platform contains all patent documents published before the search date, including granted patents, patents that are no longer in force, and published applications that have not been granted. The platform is searched with the first keyword as the search condition, and the patent documents found are placed into the first patent database, so that every patent document in the first patent database contains the first keyword.
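As a minimal sketch of step 120, the snippet below treats the patent retrieval data platform as an in-memory list and builds the first patent database with a case-insensitive substring match. The `PatentDocument` record, the `search_platform` function, and the sample documents are hypothetical names introduced for illustration; the text does not specify how the platform is queried.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PatentDocument:
    """Hypothetical record for one published patent document."""
    publication_number: str
    full_text: str


def search_platform(platform, keyword):
    """Return every document on the platform whose text contains the keyword."""
    needle = keyword.lower()
    return [doc for doc in platform if needle in doc.full_text.lower()]


# Toy stand-in for the patent retrieval data platform (assumed data).
platform = [
    PatentDocument("DOC-1", "A lithium battery with a ceramic separator"),
    PatentDocument("DOC-2", "An electrode coating for a lithium cell"),
    PatentDocument("DOC-3", "A bicycle frame made of carbon fiber"),
]

# Step 120: the first patent database holds the documents containing the first keyword.
first_database = search_platform(platform, "lithium")
```

Here `first_database` holds DOC-1 and DOC-2, the two documents that mention the first keyword.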
Step 130: obtaining a second keyword from the first patent database, wherein the second keyword and the first keyword are different words; and the first keyword and the second keyword have a first relationship;
Further, the method comprises: obtaining a keyword list from the first patent database, wherein the keyword list comprises the first keyword and the second keyword and ranks keywords by their frequency of occurrence in the first patent database;
the first keyword and the second keyword are keywords with similar occurrence frequencies.
Specifically, the full text of the patent documents in the first patent database is statistically analyzed to obtain a list of frequently repeated words, which are sorted by occurrence frequency from high to low to form the keyword list; the first keyword is also in the keyword list. The words with the highest occurrence frequency are generally common words that cannot represent the research direction of a patent document. A second keyword closely related to the first keyword generally occurs about as often as the first keyword, so a word whose frequency is similar to that of the first keyword is selected as the second keyword.
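The frequency-ranked keyword list and the selection of a second keyword can be sketched as follows. This is an illustration under assumptions: words are split with a simple regular expression, "similar frequency" is read as the smallest absolute difference in counts, and the function names and sample texts are hypothetical.

```python
import re
from collections import Counter


def build_keyword_list(texts, top_n=50):
    """Rank words by occurrence frequency across all texts, highest first."""
    counts = Counter()
    for text in texts:
        counts.update(re.findall(r"[a-z]+", text.lower()))
    return counts.most_common(top_n)


def pick_second_keyword(keyword_list, first_keyword, excluded=()):
    """Pick the word whose frequency is closest to that of the first keyword."""
    freq = dict(keyword_list)
    target = freq[first_keyword]
    candidates = [
        # Sort key: frequency gap, then higher count, then alphabetical order.
        (abs(count - target), -count, word)
        for word, count in keyword_list
        if word != first_keyword and word not in excluded
    ]
    if not candidates:
        return None
    return min(candidates)[2]


# Assumed full texts of the documents in the first patent database.
texts = [
    "the lithium battery uses the lithium anode",
    "the battery and the anode of the battery",
]
keyword_list = build_keyword_list(texts)
second_keyword = pick_second_keyword(keyword_list, "lithium")
```

In this toy data the most frequent word is "the", a common word, while "anode" occurs as often as "lithium" and is therefore chosen. The `excluded` parameter lets a caller skip candidates that were already rejected.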
Step 140: judging whether the first relation meets a preset condition or not;
Specifically, it is judged whether the first keyword and the second keyword belong to the same technical research direction. If they do, the first relationship meets the predetermined condition; if not, the first relationship does not meet the predetermined condition, and step 130 is repeated to select another second keyword until one belonging to the same research direction as the first keyword is found.
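The repeat-until-satisfied loop of steps 130 and 140 might look like the sketch below. The text does not say how "same technical research direction" is decided, so the judgment is passed in as a function; the hand-written `DIRECTIONS` table and all names here are assumptions made only for illustration.

```python
def find_related_keyword(candidates, first_keyword, same_direction):
    """Try candidates in order until one satisfies the predetermined condition."""
    for candidate in candidates:
        if candidate == first_keyword:
            continue  # the second keyword must be a different word
        if same_direction(first_keyword, candidate):
            return candidate
    return None  # no candidate met the condition


# Hypothetical judgment: a hand-written table of terms per research direction.
DIRECTIONS = {"battery": {"lithium", "anode", "electrolyte"}}


def same_direction(a, b):
    """Return True when both words are listed under the same research direction."""
    return any(a in terms and b in terms for terms in DIRECTIONS.values())


second_keyword = find_related_keyword(
    ["the", "lithium", "frame", "anode"], "lithium", same_direction
)
```

Unlike the description, which repeats step 130 until a suitable keyword is found, this sketch stops and returns `None` once the candidates are exhausted, so that it always terminates.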
Step 150: when a preset condition is met, obtaining a second patent database from the patent retrieval data platform according to the second keyword, wherein the second patent database comprises a patent document with the second keyword;
Specifically, after a second keyword satisfying the predetermined condition has been found through steps 130 and 140, the patent retrieval data platform is searched with the second keyword as the search condition, and the patent documents found are placed into the second patent database, yielding a database of patent documents whose research direction is the same as or similar to that of the first keyword.
Step 160: and obtaining a third patent database according to the second patent database and the first patent database.
Specifically, the first patent database and the second patent database are combined to form the third patent database, that is, a database of patent documents whose research direction is the same as and/or similar to that of the first keyword. By analyzing the patent documents in the third patent database, the user can follow technical developments in that research direction.
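Combining the two databases in step 160 can be sketched as a union. The text only says the databases are "combined"; removing duplicates by publication number is an assumption added here, since a document containing both keywords would otherwise appear twice. The dictionary layout and the `merge_databases` name are hypothetical.

```python
def merge_databases(*databases):
    """Union several databases, keeping the first copy of each document."""
    merged = {}
    for database in databases:
        for doc in database:
            # Assumption: the publication number uniquely identifies a document.
            merged.setdefault(doc["publication_number"], doc)
    return list(merged.values())


first_database = [
    {"publication_number": "DOC-1", "text": "lithium battery"},
    {"publication_number": "DOC-2", "text": "lithium anode"},
]
second_database = [
    {"publication_number": "DOC-2", "text": "lithium anode"},
    {"publication_number": "DOC-4", "text": "graphite anode"},
]

# Step 160: the third patent database combines the first and second databases.
third_database = merge_databases(first_database, second_database)
```

DOC-2 contains both keywords and appears in both inputs, but only once in `third_database`.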
Further, the method comprises:
obtaining a third keyword from the keyword list, wherein the third keyword is different from the first keyword and the second keyword;
obtaining a fourth patent database according to the third keyword and the first patent database;
obtaining a fifth patent database according to the third keyword and the second patent database;
Specifically, the third keyword is obtained from the keyword list. Apart from the first keyword and the second keyword, it may be a keyword whose frequency is close to that of the first and second keywords, or the keyword that best represents the research direction of the first patent document.
The patent documents in the first patent database are searched with the third keyword as the search target, and those containing the third keyword are placed into the fourth patent database. The patent documents in the fourth patent database therefore contain both the first keyword and the third keyword, which selects from the first patent database the patent documents that better match the research direction of the first patent document.
Similarly, the patent documents in the second patent database are searched with the third keyword as the search target, and those containing the third keyword are placed into the fifth patent database. The patent documents in the fifth patent database therefore contain both the second keyword and the third keyword, which selects from the second patent database the patent documents that better match the research direction of the first patent document.
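The two filtering passes can be sketched as one helper applied to each database. Documents are represented here as plain strings and matched by case-insensitive substring; both choices, along with the sample data and the `filter_by_keyword` name, are assumptions for illustration.

```python
def filter_by_keyword(database, keyword):
    """Keep only the documents whose text contains the keyword."""
    needle = keyword.lower()
    return [text for text in database if needle in text.lower()]


# Assumed contents: every entry of the first database contains "lithium",
# and every entry of the second database contains "anode".
first_database = [
    "lithium battery with ceramic separator",
    "lithium anode coating",
    "lithium mining equipment",
]
second_database = [
    "graphite anode for a battery",
    "anode for electroplating",
]

third_keyword = "battery"
fourth_database = filter_by_keyword(first_database, third_keyword)
fifth_database = filter_by_keyword(second_database, third_keyword)
```

Each filtered database ends up with one document, the one that also mentions the third keyword.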
Judging the number of patent documents in the fourth patent database and/or the fifth patent database;
and when the number meets a predetermined condition, obtaining a third patent database according to the fourth patent database and the fifth patent database.
Further, judging the number of patent documents in the fourth patent database and/or the fifth patent database specifically includes:
judging whether the third quantity is smaller than the first quantity or not;
when the third number is smaller than the first number, judging that the third number meets the requirement;
judging whether the fourth quantity is smaller than the second quantity;
when the fourth number is smaller than the second number, judging that the fourth number meets the requirement;
and when at least one of the third quantity and the fourth quantity meets the requirement, judging that the quantity of the fourth patent database and/or the fifth patent database meets a preset condition.
Wherein the first number is the number of patent documents in the first patent database;
the second number is the number of patent documents in the second patent database;
the third number is the number of patent documents in the fourth patent database;
the fourth number is the number of patent documents in the fifth patent database.
Specifically, when the number of patent documents in the fourth patent database is smaller than the number of patent documents in the first patent database, the third keyword has a filtering effect on the patent documents in the first patent database, and the third number is judged to meet the requirement; otherwise, the third keyword has no filtering effect on the patent documents in the first patent database, and the third number is judged not to meet the requirement.
Similarly, when the number of patent documents in the fifth patent database is smaller than the number of patent documents in the second patent database, the third keyword has a filtering effect on the patent documents in the second patent database, and the fourth number is judged to meet the requirement; otherwise, the third keyword has no filtering effect on the patent documents in the second patent database, and the fourth number is judged not to meet the requirement.
When at least one of the third number and the fourth number meets the requirement, the number of patent documents in the fourth patent database and/or the fifth patent database is judged to meet the predetermined condition; otherwise, it is judged not to meet the predetermined condition, and another third keyword must be obtained from the keyword list.
When the number of patent documents in the fourth patent database and/or the fifth patent database meets the predetermined condition, the fourth patent database and the fifth patent database are combined to form the third patent database.
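The count check and the retry with another third keyword can be sketched as below. The function names, the string-based documents, and the duplicate removal in the final merge are assumptions; the comparison itself follows the rule stated above (strictly smaller, at least one of the two).

```python
def counts_meet_condition(first_count, second_count, third_count, fourth_count):
    """Return True when the third keyword filtered at least one of the two databases."""
    third_ok = third_count < first_count      # fourth database vs first database
    fourth_ok = fourth_count < second_count   # fifth database vs second database
    return third_ok or fourth_ok


def refine(first_db, second_db, keyword_candidates):
    """Try third-keyword candidates until one passes the count check."""
    for third_keyword in keyword_candidates:
        fourth_db = [t for t in first_db if third_keyword in t]
        fifth_db = [t for t in second_db if third_keyword in t]
        if counts_meet_condition(
            len(first_db), len(second_db), len(fourth_db), len(fifth_db)
        ):
            # Merge the fourth and fifth databases, skipping duplicates.
            merged = fourth_db + [t for t in fifth_db if t not in fourth_db]
            return third_keyword, merged
    return None, []


first_db = ["a lithium battery", "a lithium anode"]
second_db = ["a graphite anode", "a lithium anode"]

# "a" appears in every document and filters nothing, so it is rejected.
chosen_keyword, third_db = refine(first_db, second_db, ["a", "anode"])
```

One consequence of the rule as stated is that a keyword matching no document at all also passes, because zero is smaller than any nonempty count; the text does not address that case, and a practical implementation might want to reject it.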
The patent database processing method for continuous iterative retrieval in this embodiment solves the prior-art problem that a keyword search on a patent data retrieval platform returns only patent documents containing that keyword, so that the retrieval result is narrow. It uses iterative retrieval to provide the user with patent documents whose research direction is the same as and/or similar to that of the keyword, so that the retrieval result is more comprehensive.
Example two
Based on the same inventive concept as the patent database processing method of continuous iterative search in the foregoing embodiment, the present invention further provides a patent database processing apparatus of continuous iterative search, where the apparatus is applied to a first query platform, and as shown in fig. 2, the apparatus includes:
a first obtaining unit 11 for obtaining a first patent document having a first keyword;
a second obtaining unit 12, configured to obtain a first patent database from a patent retrieval data platform according to the first keyword, where the first patent database includes a patent document with the first keyword;
a third obtaining unit 13, configured to obtain a second keyword from the first patent database, where the second keyword and the first keyword are different words; and the first keyword and the second keyword have a first relationship;
a first judging unit 14, configured to judge whether the first relationship satisfies a predetermined condition;
a fourth obtaining unit 15, configured to obtain, when a predetermined condition is satisfied, a second patent database from the patent retrieval data platform according to the second keyword, where the second patent database includes a patent document with the second keyword;
a fifth obtaining unit 16, configured to obtain a third patent database according to the second patent database and the first patent database.
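One way to picture how units 11 to 16 cooperate is a small class whose fields stand in for the units. This is only a sketch under assumptions: the class name, the callable signatures, and the toy platform are hypothetical, and the apparatus described here is not limited to this arrangement.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional

Database = List[str]


@dataclass
class IterativeRetrievalApparatus:
    get_first_keyword: Callable[[str], str]                       # first obtaining unit 11
    search: Callable[[str], Database]                             # obtaining units 12 and 15
    get_second_keyword: Callable[[Database, str], Optional[str]]  # third obtaining unit 13
    relation_ok: Callable[[str, str], bool]                       # first judging unit 14

    def run(self, first_document: str) -> Database:
        first_keyword = self.get_first_keyword(first_document)
        first_db = self.search(first_keyword)
        second_keyword = self.get_second_keyword(first_db, first_keyword)
        if second_keyword is None or not self.relation_ok(first_keyword, second_keyword):
            # Simplification: fall back to the first database instead of retrying.
            return first_db
        second_db = self.search(second_keyword)
        # Fifth obtaining unit 16: combine both databases without duplicates.
        return first_db + [d for d in second_db if d not in first_db]


# Toy platform and stand-in units (assumed for illustration).
platform = ["lithium battery", "lithium anode", "graphite anode", "carbon frame"]
apparatus = IterativeRetrievalApparatus(
    get_first_keyword=lambda doc: doc.split()[0],
    search=lambda kw: [d for d in platform if kw in d],
    get_second_keyword=lambda db, kw: "anode",
    relation_ok=lambda a, b: True,
)
result = apparatus.run("lithium cell")
```

Passing the units in as callables keeps each one replaceable, which mirrors the way the units are listed separately above.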
Preferably, the method comprises:
a sixth obtaining unit, configured to obtain a keyword list from the first patent database, where the keyword list includes a first keyword and a second keyword, and the keyword list is a keyword list with a ranked appearance frequency in the first patent database; the first keywords and the second keywords are keywords with similar occurrence frequencies.
Preferably, the apparatus further comprises:
a seventh obtaining unit, configured to obtain a third keyword from the keyword list, where the third keyword is different from the first keyword and the second keyword;
an eighth obtaining unit, configured to obtain a fourth patent database according to the third keyword and the first patent database;
a ninth obtaining unit, configured to obtain a fifth patent database according to the third keyword and the second patent database;
a second judging unit, configured to judge the number of patent documents in the fourth patent database and/or the fifth patent database;
a tenth obtaining unit, configured to obtain a third patent database according to the fourth patent database and the fifth patent database when the number satisfies a predetermined condition.
Preferably, for judging the number of patent documents in the fourth patent database and/or the fifth patent database, the apparatus specifically includes:
a third judging unit configured to judge whether the third number is smaller than the first number;
a first determination unit, configured to determine that the third number meets a requirement when the third number is smaller than the first number;
a fourth judging unit configured to judge whether the fourth number is smaller than the second number;
a second determination unit configured to determine that the fourth number satisfies a requirement when the fourth number is smaller than the second number;
and a third determination unit, configured to determine that the number of patent documents in the fourth patent database and/or the fifth patent database satisfies the predetermined condition when at least one of the third number and the fourth number satisfies the requirement.
Wherein the first number is the number of patent documents in the first patent database;
the second number is the number of patent documents in the second patent database;
the third number is the number of patent documents in the fourth patent database;
the fourth number is the number of patent documents in the fifth patent database.
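The count check performed by these units can be sketched in Python as below. The sketch assumes that the fourth and fifth patent databases are obtained by keeping only those documents of the first and second patent databases that contain the third keyword, and that the refined third patent database is the union of the fourth and fifth; both are illustrative assumptions, and `docs`, `narrow` and `refine` are hypothetical names.

```python
from typing import Dict, Optional, Set


def narrow(docs: Dict[str, str], database: Set[str], keyword: str) -> Set[str]:
    """Keep only the documents of the database whose text contains the keyword."""
    needle = keyword.lower()
    return {doc_id for doc_id in database if needle in docs[doc_id].lower()}


def count_condition_met(first_n: int, second_n: int, third_n: int, fourth_n: int) -> bool:
    """The condition holds when at least one of the two counts has decreased."""
    return third_n < first_n or fourth_n < second_n


def refine(
    docs: Dict[str, str],
    first_db: Set[str],
    second_db: Set[str],
    third_keyword: str,
) -> Optional[Set[str]]:
    """Return the refined third database, or None if the condition is not met."""
    fourth_db = narrow(docs, first_db, third_keyword)   # third number = len(fourth_db)
    fifth_db = narrow(docs, second_db, third_keyword)   # fourth number = len(fifth_db)
    if count_condition_met(len(first_db), len(second_db), len(fourth_db), len(fifth_db)):
        return fourth_db | fifth_db  # assumed combination rule
    return None
```

Under these assumptions, a third keyword that leaves both databases unchanged is rejected, because it does not narrow the result.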
The variations and specific examples of the patent database processing method for continuous iterative retrieval described in the first embodiment with reference to fig. 1 apply equally to the patent database processing apparatus of this embodiment. From the foregoing detailed description of the method, those skilled in the art can clearly understand how the apparatus of this embodiment is implemented, so the details are omitted here for brevity.
Example three
Based on the same inventive concept as the patent database processing method for continuous iterative retrieval in the foregoing embodiments, the present invention further provides a patent database processing apparatus for continuous iterative retrieval which, as shown in fig. 3, includes a memory 304, a processor 302, and a computer program stored in the memory 304 and executable on the processor 302, wherein the processor 302 implements the steps of any one of the above-mentioned patent database processing methods for continuous iterative retrieval when executing the program.
In fig. 3, the bus architecture is represented by bus 300. The bus 300 may include any number of interconnected buses and bridges, and links together various circuits including one or more processors, represented by processor 302, and memory, represented by memory 304. The bus 300 may also link together various other circuits such as peripherals, voltage regulators, and power management circuits, which are well known in the art and are therefore not described further here. A bus interface 306 provides an interface between the bus 300 and the receiver 301 and transmitter 303. The receiver 301 and the transmitter 303 may be the same element, i.e., a transceiver, providing a means for communicating with various other apparatus over a transmission medium. The processor 302 is responsible for managing the bus 300 and for general processing, and the memory 304 may be used to store data used by the processor 302 in performing operations.
Example four
Based on the same inventive concept as the patent database processing method of continuous iterative search in the foregoing embodiment, the present invention also provides a computer-readable storage medium on which a computer program is stored, which when executed by a processor implements the following steps:
obtaining a first patent document having a first keyword;
obtaining a first patent database from a patent retrieval data platform according to the first keyword, wherein the first patent database comprises patent documents with the first keyword;
obtaining a second keyword from the first patent database, wherein the second keyword and the first keyword are different words; and the first keyword and the second keyword have a first relationship;
judging whether the first relation meets a preset condition or not;
when a preset condition is met, obtaining a second patent database from the patent retrieval data platform according to the second keyword, wherein the second patent database comprises a patent document with the second keyword;
and obtaining a third patent database according to the second patent database and the first patent database.
In a specific implementation, when the program is executed by a processor, any method step in the first embodiment may be further implemented.
One or more technical solutions in the embodiments of the present application have at least one or more of the following technical effects:
the embodiment of the invention provides a patent database processing method and device for continuous iterative retrieval, wherein the method comprises the following steps:
obtaining a first patent document having a first keyword, which identifies the research direction the user is concerned with;
obtaining a first patent database from a patent retrieval data platform according to the first keyword, wherein the first patent database comprises patent documents with the first keyword, which provides the base database;
obtaining a second keyword from the first patent database, wherein the second keyword and the first keyword are different words and have a first relationship, which yields a second keyword similar to the first keyword as the basis for iterative retrieval;
judging whether the first relationship meets a preset condition, which checks whether the second keyword meets the requirement and improves the accuracy of the iterative retrieval;
when the preset condition is met, obtaining a second patent database from the patent retrieval data platform according to the second keyword, wherein the second patent database comprises patent documents with the second keyword, which obtains the second database iteratively;
and obtaining a third patent database according to the second patent database and the first patent database, which expands the first database.
Together, these steps provide the user with patent documents in the same and/or similar research directions as the keyword, so that the retrieval result is more comprehensive.
As will be appreciated by one skilled in the art, embodiments of the present invention may be provided as a method, system, or computer program product. Accordingly, the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, the present invention may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.
The present invention is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems) and computer program products according to embodiments of the invention. It will be understood that each flow and/or block of the flow diagrams and/or block diagrams, and combinations of flows and/or blocks in the flow diagrams and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
It will be apparent to those skilled in the art that various changes and modifications may be made in the present invention without departing from the spirit and scope of the invention. Thus, if such modifications and variations of the present invention fall within the scope of the claims of the present invention and their equivalents, the present invention is also intended to include such modifications and variations.

Claims (7)

1. A patent database processing method for continuous iterative search, the method comprising:
obtaining a first patent document having a first keyword;
obtaining a first patent database from a patent retrieval data platform according to the first keyword, wherein the first patent database comprises patent documents with the first keyword;
obtaining a second keyword from the first patent database, wherein the second keyword and the first keyword are different words; and the first keyword and the second keyword have a first relationship;
judging whether the first relation meets a preset condition or not;
when a preset condition is met, obtaining a second patent database from the patent retrieval data platform according to the second keyword, wherein the second patent database comprises a patent document with the second keyword;
and obtaining a third patent database according to the second patent database and the first patent database.
2. The method of claim 1, wherein the method further comprises:
obtaining a keyword list from the first patent database, wherein the keyword list comprises the first keyword and the second keyword, and the keyword list ranks the keywords by their frequency of occurrence in the first patent database;
the first keyword and the second keyword are keywords with similar occurrence frequencies.
3. The method of claim 2, wherein the method further comprises:
obtaining a third keyword from the keyword list, wherein the third keyword is different from the first keyword and the second keyword;
obtaining a fourth patent database according to the third keyword and the first patent database;
obtaining a fifth patent database according to the third keyword and the second patent database;
judging the number of patent documents in the fourth patent database and/or the fifth patent database;
and when the number meets a preset condition, obtaining a third patent database according to the fourth patent database and the fifth patent database.
4. The method according to claim 3, wherein judging the number of patent documents in the fourth patent database and/or the fifth patent database specifically comprises:
judging whether the third number is smaller than the first number;
when the third number is smaller than the first number, judging that the third number meets the requirement;
judging whether the fourth number is smaller than the second number;
when the fourth number is smaller than the second number, judging that the fourth number meets the requirement;
when at least one of the third number and the fourth number meets the requirement, judging that the number of patent documents in the fourth patent database and/or the fifth patent database meets the preset condition;
wherein the first number is the number of patent documents in the first patent database;
the second number is the number of patent documents in the second patent database;
the third number is the number of patent documents in the fourth patent database;
the fourth number is the number of patent documents in the fifth patent database.
5. A patent database processing apparatus for continuous iterative search, the apparatus comprising:
a first obtaining unit configured to obtain a first patent document having a first keyword;
a second obtaining unit, configured to obtain a first patent database from a patent retrieval data platform according to the first keyword, where the first patent database includes a patent document with the first keyword;
a third obtaining unit, configured to obtain a second keyword from the first patent database, where the second keyword and the first keyword are different words; and the first keyword and the second keyword have a first relationship;
a first judgment unit configured to judge whether the first relationship satisfies a predetermined condition;
a fourth obtaining unit, configured to obtain a second patent database from the patent retrieval data platform according to the second keyword when a predetermined condition is satisfied, where the second patent database includes a patent document with the second keyword;
and a fifth obtaining unit, configured to obtain a third patent database according to the second patent database and the first patent database.
6. A patent database processing apparatus for continuous iterative search, comprising a memory, a processor and a computer program stored on the memory and executable on the processor, wherein the processor implements the steps of the method according to any one of claims 1 to 4 when executing the program.
7. A computer-readable storage medium, on which a computer program is stored which, when being executed by a processor, carries out the steps of the method according to any one of claims 1 to 4.

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010131699.0A CN111339127A (en) 2020-02-29 2020-02-29 Patent database processing method and device for continuous iterative retrieval

Publications (1)

Publication Number Publication Date
CN111339127A true CN111339127A (en) 2020-06-26

Family

ID=71183931

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010131699.0A Withdrawn CN111339127A (en) 2020-02-29 2020-02-29 Patent database processing method and device for continuous iterative retrieval

Country Status (1)

Country Link
CN (1) CN111339127A (en)

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
WW01 Invention patent application withdrawn after publication

Application publication date: 20200626
