CN106777103B - Patent file indexing method and device - Google Patents

Patent file indexing method and device Download PDF

Info

Publication number
CN106777103B
CN106777103B CN201611157229.1A CN201611157229A CN106777103B CN 106777103 B CN106777103 B CN 106777103B CN 201611157229 A CN201611157229 A CN 201611157229A CN 106777103 B CN106777103 B CN 106777103B
Authority
CN
China
Prior art keywords
indexing
search
chinese
user
keywords
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201611157229.1A
Other languages
Chinese (zh)
Other versions
CN106777103A (en
Inventor
赵大川
景俊杰
黄菲
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Kehua Wanxiang Technology Co ltd
Original Assignee
Beijing Kehua Wanxiang Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Kehua Wanxiang Technology Co ltd filed Critical Beijing Kehua Wanxiang Technology Co ltd
Priority to CN201611157229.1A priority Critical patent/CN106777103B/en
Publication of CN106777103A publication Critical patent/CN106777103A/en
Application granted granted Critical
Publication of CN106777103B publication Critical patent/CN106777103B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/38Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2216/00Indexing scheme relating to additional aspects of information retrieval not explicitly covered by G06F16/00 and subgroups
    • G06F2216/11Patent retrieval

Abstract

The invention provides a patent document indexing method and device. Wherein, the method comprises the following steps: acquiring a search formula; forming indexing patent search conditions through search keywords in the search formula; searching patent files meeting the indexing patent searching conditions; and indexing the patent files through the indexing information options matched with the patent files. By the patent document indexing method and device provided by the embodiment of the invention, the indexing information can more intuitively and simply embody the field and technology related to the patent document.

Description

Patent file indexing method and device
Technical Field
The invention relates to the technical field of data processing, in particular to a patent file indexing method and device.
Background
At present, in order to better show the content related to the patent document, the patent retrieval website needs to summarize the information hidden, not explicitly shown or not highlighted in the patent document to obtain the indexing information of the patent document. When a user searches the patent document, the indexing information of the patent document is displayed together, so that the user can more comprehensively know the content covered by the patent document.
In the related art, in order to obtain the indexing information of the patent documents, a user inputs the patent information (such as the applicant, the application number, the patent name, and the like) of each patent document into the patent retrieval website to index each patent.
In the process of implementing the invention, the inventor finds that at least the following problems exist in the prior art:
in the process of indexing the patent documents, the patent retrieval website can only index the patent information of each patent document, the indexing content is less, the indexing content cannot intuitively and simply embody the fields and technologies related to the patent documents, and the help brought to users is very limited.
Disclosure of Invention
In view of this, an object of the embodiments of the present invention is to provide a method and an apparatus for indexing a patent document, so that indexing information can more intuitively and simply embody the fields and technologies related to the patent document.
In a first aspect, an embodiment of the present invention provides a patent document indexing method, including:
obtaining a search formula, wherein the search formula comprises: searching for a keyword;
forming indexing patent search conditions through search keywords in the search formula;
searching patent files meeting the indexing patent searching conditions;
and indexing the patent file through the indexing information option matched with the patent file.
With reference to the first aspect, an embodiment of the present invention provides a first possible implementation manner of the first aspect, where: forming indexing patent search conditions through the search keywords in the search formula, wherein the indexing patent search conditions comprise:
acquiring a search keyword in the search formula;
extracting Chinese participles from the search keywords through a participle algorithm;
and forming the indexing patent search condition according to the extracted Chinese word segmentation.
With reference to the first aspect, an embodiment of the present invention provides a second possible implementation manner of the first aspect, where: forming the indexing patent search condition according to the extracted Chinese word segmentation, comprising:
inquiring indexing information options comprising the Chinese participles from a preset indexing information set;
determining the indexing information options comprising the Chinese participles as indexing information options matched with the Chinese participles, and displaying the Chinese participles and the matched indexing information options to the user;
when the situation that the user selects the Chinese participle is monitored, determining an indexing search keyword comprising the Chinese participle selected by the user from the search keywords, and forming the indexing patent search condition according to the indexing search keyword and the search formula.
With reference to the first aspect, an embodiment of the present invention provides a third possible implementation manner of the first aspect, where: when the situation that the user selects the Chinese participle is monitored, determining an indexing search keyword comprising the Chinese participle selected by the user from the search keywords, and forming an indexing patent search condition according to the indexing search keyword and the search formula, wherein the indexing patent search condition comprises the following steps:
monitoring the Chinese word segmentation selected by the user;
querying the search formula through the Chinese participles selected by the user, and determining search keywords in the search formula, which comprise the Chinese participles selected by the user, as index search keywords;
when a plurality of indexing search keywords are determined, combining the plurality of indexing search keywords with logical operators among the plurality of indexing search keywords in the search formula to form an indexing limited search expression;
and combining the search expression and the indexing limit search expression through a logic and operator to form an indexing patent search condition.
With reference to the first aspect, an embodiment of the present invention provides a fourth possible implementation manner of the first aspect, where: indexing the patent file through an indexing information option matched with the patent file, including:
determining the indexing information option matched with the Chinese word segmentation selected by the user as the indexing information option matched with the patent document;
and performing indexing operation on the patent file through the determined indexing information option.
In a second aspect, an embodiment of the present invention further provides a patent document indexing apparatus, including:
an obtaining module, configured to obtain a search formula, where the search formula includes: searching for a keyword;
the forming module is used for forming indexing patent searching conditions through the searching keywords in the searching formula;
the searching module is used for searching patent files meeting the indexing patent searching conditions;
and the indexing module is used for indexing the patent files through indexing information options matched with the patent files.
With reference to the second aspect, an embodiment of the present invention provides a first possible implementation manner of the second aspect, where: the forming module includes:
the obtaining sub-module is used for obtaining the search keywords in the search formula;
the extraction sub-module is used for extracting Chinese participles from the search keywords through a participle algorithm;
and the forming submodule is used for forming the indexing patent search condition according to the extracted Chinese word segmentation.
With reference to the second aspect, embodiments of the present invention provide a second possible implementation manner of the second aspect, where: the forming sub-module includes:
the query unit is used for querying the indexing information options comprising the Chinese participles from a preset indexing information set;
the display unit is used for determining the indexing information options comprising the Chinese participles as indexing information options matched with the Chinese participles and displaying the Chinese participles and the matched indexing information options to the user;
and the forming unit is used for determining indexing search keywords comprising the Chinese participles selected by the user from the search keywords when the condition that the Chinese participles are selected by the user is monitored, and forming indexing patent search conditions according to the indexing search keywords and the search formula.
In combination with the second aspect, embodiments of the present invention provide a third possible implementation manner of the second aspect, where: the forming unit is specifically configured to:
monitoring the Chinese word segmentation selected by the user;
querying the search formula through the Chinese participles selected by the user, and determining search keywords in the search formula, which comprise the Chinese participles selected by the user, as index search keywords;
when a plurality of indexing search keywords are determined, combining the plurality of indexing search keywords with logical operators among the plurality of indexing search keywords in the search formula to form an indexing limited search expression;
and combining the search expression and the indexing limit search expression through a logic and operator to form an indexing patent search condition.
In combination with the second aspect, an embodiment of the present invention provides a fourth possible implementation manner of the second aspect, where: the indexing module comprises:
the determining submodule is used for determining the indexing information option matched with the Chinese word segmentation selected by the user as the indexing information option matched with the patent document;
and the indexing submodule is used for indexing the patent file according to the determined indexing information option.
The patent document indexing method and device provided by the embodiment of the invention can obtain the indexing patent search condition by searching the keywords in the formula, then search the patent documents meeting the indexing patent search condition, index the patent documents by the indexing information options matched with the patent documents, and index the patent documents by comparing with the prior art which only can index the patent information (such as applicant, application number, patent name and the like) in the patent documents, so that the patent documents meeting the indexing patent search condition can be indexed in different aspects by the indexing information options matched with the patent documents, and the fields and the technologies related to the patent documents can be indexed according to the different indexing patent search conditions, so that the indexing information of the patent documents can comprehensively, intuitively and simply embody the fields and the technologies related to the patent documents, and a user who refers to the patent documents can more intuitively know the fields and the technologies related to the patent documents, the user experience is improved.
In order to make the aforementioned and other objects, features and advantages of the present invention comprehensible, preferred embodiments accompanied with figures are described in detail below.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings needed to be used in the embodiments will be briefly described below, it should be understood that the following drawings only illustrate some embodiments of the present invention and therefore should not be considered as limiting the scope, and for those skilled in the art, other related drawings can be obtained according to the drawings without inventive efforts.
FIG. 1 is a schematic diagram illustrating an application environment of a patent document indexing method provided by an embodiment of the invention;
fig. 2 shows a flowchart of a patent document indexing method provided in embodiment 1 of the present invention;
fig. 3 is a schematic structural diagram of a patent document indexing device provided in embodiment 2 of the present invention;
fig. 4 is a schematic structural diagram illustrating a module formed in a patent document indexing device according to embodiment 2 of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. The components of embodiments of the present invention generally described and illustrated in the figures herein may be arranged and designed in a wide variety of different configurations. Thus, the following detailed description of the embodiments of the present invention, presented in the figures, is not intended to limit the scope of the invention, as claimed, but is merely representative of selected embodiments of the invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments of the present invention without making any creative effort, shall fall within the protection scope of the present invention.
Fig. 1 is a block diagram showing a configuration of a server that can be applied to an embodiment of the present invention to perform a patent document indexing method. As shown in fig. 1, the server 100 includes: memory 101, processor 102, and network module 103.
The memory 101 may be used to store software programs and modules, such as program instructions/modules corresponding to the patent document indexing method and apparatus in the embodiment of the present invention, and the processor 102 executes various functional applications and data processing by running the software programs and modules stored in the memory 201, so as to implement the patent document indexing method in the embodiment of the present invention. Memory 101 may include high speed random access memory and may also include non-volatile memory, such as one or more magnetic storage devices, flash memory, or other non-volatile solid-state memory. Further, the software programs and modules may further include: an operating system 121 and a service module 122. The operating system 121, which may be LINUX, UNIX, WINDOWS, for example, may include various software components and/or drivers for managing system tasks (e.g., memory management, storage device control, power management, etc.), and may communicate with various hardware or software components to provide an operating environment for other software components. The service module 122 runs on the basis of the operating system 121, monitors a request from the network through a network service of the operating system 121, completes corresponding data processing according to the request, and returns a processing result to the client. That is, the service module 122 is used to provide network services to clients.
The network module 103 is used for receiving and transmitting network signals. The network signal may include a wireless signal or a wired signal.
It will be appreciated that the configuration shown in fig. 1 is merely illustrative and that server 100 may include more or fewer components than shown in fig. 1 or have a different configuration than shown in fig. 1. The components shown in fig. 1 may be implemented in hardware, software, or a combination thereof. In addition, the server in the embodiment of the present invention may further include a plurality of servers with different specific functions.
At present, in order to obtain the indexing information of patent documents, a patent retrieval website records the patent information (such as applicant, application number, patent name, etc.) of each patent document into the patent retrieval website by a user so as to index each patent. In the process of indexing the patent documents, the patent retrieval website can only index the patent information of each patent document, the indexing content is less, the indexing content cannot intuitively and simply embody the fields and technologies related to the patent documents, and the help brought to users is very limited. Based on the method and the device, the patent document indexing method and the device are provided.
Example 1
The embodiment provides a patent document indexing method, and an execution main body of the embodiment is a server. After the indexing patent searching condition is determined, the patent files meeting the indexing patent searching condition can be searched, and the indexing information items matched with the patent files are used for indexing the patent files, so that the indexing information of the patent files can comprehensively, intuitively and simply embody the fields and the technologies related to the patent files.
Referring to fig. 2, a flowchart of a patent document indexing method according to an embodiment of the present invention is shown, where the method includes the following steps:
and 200, acquiring a search formula.
Wherein, the search formula comprises: and searching for the keyword.
The search formula is a patent file search expression, and is input by a user and used for searching out a patent file which the user wants to index. Such as: the search formula may be:
the search formula shows that a user wants to index patent documents under the patent name (ti) of an IPC (ic) classification a61k35/78, which includes a load-sensitive hydraulic valve and a heavy-load flow valve.
The user refers to an indexing person who indexes the patent document.
The search keywords are words appearing in the search formula, and words consisting of Chinese participles appear in the patent name, abstract, claims and specification of the patent document. Therefore, the patent names, abstracts, claims and words appearing in the specification of the patent documents can be searched for patent documents to be indexed.
After the search expression including the keyword is obtained through the above step 200, the following step 202 may be continued to form the indexed patent search condition.
Step 202, forming indexing patent search conditions through the search keywords in the search formula.
The indexing patent search condition is a search expression formed by combining the search expression with an indexing limit search expression obtained by processing the search keyword, and is used for searching the patent files which need to be indexed by the user.
And step 204, searching for the patent files meeting the indexing patent searching conditions.
And step 206, performing indexing operation on the patent files through the indexing information options matched with the patent files.
The indexing information option is used for embodying the field and technology related to the patent documents meeting the indexing patent search conditions.
To sum up, the patent document indexing method provided in this embodiment obtains the indexed patent search condition by searching the keywords in the formula, then searches for the patent documents meeting the indexed patent search condition, indexes the patent documents by the indexing information options matching with the patent documents, and compared with the prior art that only the patent information (such as the applicant, the application number, the patent name, and the like) in the patent documents can be indexed, the patent documents meeting the indexed patent search condition can be indexed by the indexing information options matching with the patent documents, so that the fields and technologies related to the patent documents can be indexed in different aspects according to the difference of the indexed patent search condition, and thus the indexing information of the patent documents can embody the fields and technologies related to the patent documents comprehensively, intuitively and simply, so that a user looking up the patent documents can know the fields and technologies related to the patent documents more intuitively, the user experience is improved.
In the related art, a user is not a person skilled in each technical field and cannot fully and accurately know and familiarize each technical field, so that a search keyword input by the user sometimes cannot fully express a patent document that the user wants to index. In order to comprehensively retrieve patent documents which a user wants to index, the indexing patent search condition is formed by the search keywords in the search formula, and the indexing patent search condition comprises the following steps (1) to (3):
(1) acquiring search keywords in the search formula;
(2) extracting Chinese participles from the search keywords through a participle algorithm;
(3) and forming the indexing patent search condition according to the extracted Chinese word segmentation.
In the step (2), the chinese segmentation is a plurality of independent words obtained by splitting the search keyword. In one embodiment, according to the search keyword "load-sensitive hydraulic valve and heavy-load flow valve" in the above search formula, the following independent words can be obtained through the word segmentation algorithm: "load", "sensitive", "hydraulic", "heavy load", "flow", and "valve".
Moreover, the word segmentation algorithm described in the step (2) may adopt any existing Chinese word segmentation algorithm capable of splitting the search keyword, and is not described in detail herein.
In summary, the indexing patent search condition is formed by extracting Chinese word segmentation instead of the search keyword input by the user, so that the patent document which the user wants to index can be comprehensively retrieved, and the indexing accuracy is improved.
In the related technology, if the relevance of a plurality of keywords input by a user is not high, the search result contains too many patent documents which do not need to be indexed, and the user needs to manually screen the search result to determine the patent documents which need to be indexed, so that the indexing efficiency of the patent documents is reduced. In order to improve the indexing efficiency of the patent document, the indexing patent search condition is formed according to the extracted Chinese participles, and the indexing patent search condition comprises the following steps (1) to (3):
(1) inquiring indexing information options comprising the Chinese participles from a preset indexing information set;
(2) determining the indexing information options comprising the Chinese participles as indexing information options matched with the Chinese participles, and displaying the Chinese participles and the matched indexing information options to the user;
(3) when the fact that the user selects the Chinese segmentation is monitored, determining indexing search keywords comprising the Chinese segmentation selected by the user from the search keywords, and forming indexing patent search conditions according to the indexing search keywords and the search formula.
In the step (1), if the split chinese participle is "traffic", the chinese participle including "traffic" may be queried from a preset indexing information set, such as: flow valves, flow meters, etc. indexing information options.
The step (3) includes the following specific steps (31) to (35):
(31) monitoring the Chinese word segmentation selected by the user;
(32) inquiring the search formula according to the Chinese word segmentation selected by the user, and determining the search keyword comprising the Chinese word segmentation selected by the user in the search formula as an indexing search keyword;
(33) when a plurality of indexing search keywords are determined, combining a plurality of indexing search keywords with logical operators among the indexing search keywords in the search formula to form an indexing limited search expression;
(34) combining the search expression and the index limit search expression through a logic and operator to form an index patent search condition;
(35) and when only one indexing search keyword is determined, combining the search formula and the indexing search keyword through a logic and operator to form an indexing patent search condition.
Alternatively, the flow executed in the above step (32) to step (34) is described by the following embodiments, assuming that the search expression is: if and ic ═ ti ((one-two) not (three-one-four) and five-five) and ic ═ 123or 321), then the server determines the chinese participles from the search keywords "one-two, three-four-five" of the search formula as: first, second, third, fourth and fifth, if the Chinese word segmentation selected by the user is two and third, the server will determine the search keywords 'two and three' corresponding to the Chinese word segmentation two and three as indexing search keywords, and combine the search keywords 'two' and 'three and three' through the logical non-operator between the search keywords 'two' and 'three and three' in the search formula to form the indexing restriction search expression 'two three and three'. Then, combining the obtained indexing restriction search expression 'two nots three' with the search expression through a logical and operator to obtain indexing patent search conditions:
(ti) ((one-two) -not (three-four) and five-five) and ic ═ 123or 321) and ti ═ two-not-three.
In conclusion, the search formula is combined with the search keyword comprising the Chinese segmentation selected by the user to form the final indexing patent search condition, so that the relevance of the retrieved patent file is higher, the user does not need to manually determine the patent file, and the indexing efficiency of the patent file is improved.
In the related art, a user is not a technician in each technical field and cannot comprehensively and accurately know and familiarize each technical field, so search keywords input by the user often have the situations of non-speciality, non-standardization and inaccuracy, if patent documents are indexed through the search keywords, the indexing result is inaccurate, and the indexing effect on the patent documents is reduced. In order to improve the indexing effect of the patent file, the indexing operation is performed on the patent file through an indexing information option matched with the patent file, and the method comprises the following steps (1) to (2):
(1) determining the indexing information option matched with the Chinese word segmentation selected by the user as the indexing information option matched with the patent document;
(2) and performing indexing operation on the patent document through the determined indexing information option.
In conclusion, the patent documents are indexed through the indexing information option matched with the Chinese participle selected by the user, and the indexing is not directly indexed through the keyword used by the user, so that the standardization of the indexing information option can be obviously improved, the accuracy of an indexing result and the indexing effect on the patent documents are improved, and the later-stage patent document retrieval, reading and analysis work is facilitated.
Example 2
The embodiment provides a patent document indexing device, which is used for executing the patent document indexing method.
Referring to fig. 3, a schematic structural diagram of a patent document indexing device according to an embodiment of the present invention includes:
an obtaining module 300, configured to obtain a search formula, where the search formula includes: searching for a keyword;
a forming module 302, configured to form a patent indexing search condition according to the search keyword in the search formula;
a searching module 304, configured to search for a patent file meeting the indexed patent searching condition;
and the indexing module 306 is configured to perform an indexing operation on the patent document through an indexing information option matched with the patent document.
To sum up, the patent document indexing device provided in this embodiment obtains the indexing patent search condition by searching the keywords in the formula, then searches for the patent documents meeting the indexing patent search condition, indexes the patent documents by the indexing information options matching with the patent documents, and compared with the prior art that only the patent information (such as the applicant, the application number, the patent name, and the like) in the patent documents can be indexed, the patent documents meeting the indexing patent search condition can be indexed by the indexing information options matching with the patent documents, so that the fields and technologies related to the patent documents can be indexed in different aspects according to the difference of the indexing patent search condition, and thus the indexing information of the patent documents can embody the fields and technologies related to the patent documents comprehensively, intuitively and simply, so that a user looking up the patent documents can know the fields and technologies related to the patent documents more intuitively, the user experience is improved.
In the related art, a user is not a person skilled in each technical field and cannot fully and accurately know and familiarize each technical field, so that a search keyword input by the user sometimes cannot fully express a patent document that the user wants to index. In order to comprehensively retrieve the patent document that the user wants to index, referring to fig. 4, the forming module 302 includes:
an obtaining submodule 3020 configured to obtain a search keyword in the search formula;
an extraction submodule 3022, configured to extract a chinese word segmentation from the search keyword through a word segmentation algorithm;
a forming sub-module 3024 for forming the indexing patent search condition according to the extracted chinese participle.
In summary, the indexing patent search condition is formed by extracting Chinese word segmentation instead of the search keyword input by the user, so that the patent document which the user wants to index can be comprehensively retrieved, and the indexing accuracy is improved.
In the related technology, if the relevance of a plurality of keywords input by a user is not high, the search result contains too many patent documents which do not need to be indexed, and the user needs to manually screen the search result to determine the patent documents which need to be indexed, so that the indexing efficiency of the patent documents is reduced. In order to improve the indexing efficiency of the patent document, the sub-module 3024 is formed as described above, and includes:
the query unit is used for querying index information options comprising the Chinese participles from a preset index information set;
the display unit is used for determining the indexing information options comprising the Chinese participles as indexing information options matched with the Chinese participles and displaying the Chinese participles and the matched indexing information options to the user;
and the forming unit is used for determining an indexing search keyword comprising the Chinese participle selected by the user from the search keywords when the condition that the Chinese participle is selected by the user is monitored, and forming the indexing patent search condition according to the indexing search keyword and the search formula.
The forming unit is specifically configured to:
monitoring the Chinese word segmentation selected by the user;
inquiring the search formula according to the Chinese word segmentation selected by the user, and determining the search keyword comprising the Chinese word segmentation selected by the user in the search formula as an indexing search keyword;
when a plurality of indexing search keywords are determined, combining a plurality of indexing search keywords with logical operators among the indexing search keywords in the search formula to form an indexing limited search expression;
and combining the search expression and the index limit search expression through a logic and operator to form an index patent search condition.
In conclusion, the search formula is combined with the search keyword comprising the Chinese segmentation selected by the user to form the final indexing patent search condition, so that the relevance of the retrieved patent file is higher, the user does not need to manually determine the patent file, and the indexing efficiency of the patent file is improved.
In the related art, a user is not a technician in each technical field and cannot comprehensively and accurately know and familiarize each technical field, so search keywords input by the user often have the situations of non-speciality, non-standardization and inaccuracy, if patent documents are indexed through the search keywords, the indexing result is inaccurate, and the indexing effect on the patent documents is reduced. In order to improve the indexing effect of the patent document, the indexing module 306 includes:
the determining submodule is used for determining the indexing information option matched with the Chinese word segmentation selected by the user as the indexing information option matched with the patent document;
and the indexing submodule is used for indexing the patent files through the determined indexing information options.
In conclusion, the patent documents are indexed through the indexing information option matched with the Chinese participle selected by the user, and the indexing is not directly indexed through the keyword used by the user, so that the standardization of the indexing information option can be obviously improved, the accuracy of an indexing result and the indexing effect on the patent documents are improved, and the later-stage patent document retrieval, reading and analysis work is facilitated.
The computer program product for performing the patent document indexing method provided in the embodiment of the present invention includes a computer-readable storage medium storing a program code, where instructions included in the program code may be used to execute the method in the foregoing method embodiment, and specific implementation may refer to the method embodiment, which is not described herein again.
It is clear to those skilled in the art that, for convenience and brevity of description, the specific working processes of the above-described systems, apparatuses and units may refer to the corresponding processes in the foregoing method embodiments, and are not described herein again.
In the several embodiments provided in the present application, it should be understood that the disclosed system, apparatus and method may be implemented in other ways. The above-described embodiments of the apparatus are merely illustrative, and for example, the above-described division of the units is only one type of division of logical functions, and there may be other divisions when actually implemented, and for example, a plurality of units or components may be combined or integrated into another system, or some features may be omitted, or not executed. In addition, the shown or discussed mutual coupling or direct coupling or communication connection may be an indirect coupling or communication connection of devices or units through some communication interfaces, and may be in an electrical, mechanical or other form.
The units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the units can be selected according to actual needs to achieve the purpose of the solution of the embodiment.
In addition, functional units in the embodiments of the present invention may be integrated into one processing unit, or each unit may exist alone physically, or two or more units are integrated into one unit.
The above functions, if implemented in the form of software functional units and sold or used as a separate product, may be stored in a computer-readable storage medium. Based on such understanding, the technical solution of the present invention may be embodied in the form of a software product, which is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the steps of the above method according to the embodiments of the present invention. And the aforementioned storage medium includes: a U-disk, a removable hard disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk or an optical disk, and other various media capable of storing program codes.
The above description is only for the specific embodiments of the present invention, but the scope of the present invention is not limited thereto, and any person skilled in the art can easily conceive of the changes or substitutions within the technical scope of the present invention, and all the changes or substitutions should be covered within the scope of the present invention. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.

Claims (6)

1. A patent document indexing method, comprising:
obtaining a search formula, wherein the search formula comprises: searching for a keyword;
forming indexing patent search conditions through search keywords in the search formula;
searching patent files meeting the indexing patent searching conditions;
indexing the patent file through an indexing information option matched with the patent file;
forming indexing patent search conditions through the search keywords in the search formula, wherein the indexing patent search conditions comprise:
acquiring a search keyword in the search formula;
extracting Chinese participles from the search keywords through a participle algorithm;
forming the indexing patent search condition according to the extracted Chinese word segmentation;
forming the indexing patent search condition according to the extracted Chinese word segmentation, comprising:
inquiring indexing information options comprising the Chinese participles from a preset indexing information set;
determining the indexing information options comprising the Chinese participles as indexing information options matched with the Chinese participles, and displaying the Chinese participles and the matched indexing information options to a user;
when the situation that the user selects the Chinese participle is monitored, determining an indexing search keyword comprising the Chinese participle selected by the user from the search keywords, and forming the indexing patent search condition according to the indexing search keyword and the search formula.
2. The method of claim 1, wherein when it is monitored that the user selects a chinese participle, determining an indexed search keyword including the user selected chinese participle from the search keywords, and forming the indexed patent search condition according to the indexed search keyword and the search formula comprises:
monitoring the Chinese word segmentation selected by the user;
querying the search formula through the Chinese participles selected by the user, and determining search keywords in the search formula, which comprise the Chinese participles selected by the user, as index search keywords;
when a plurality of indexing search keywords are determined, combining the plurality of indexing search keywords with logical operators among the plurality of indexing search keywords in the search formula to form an indexing limited search expression;
and combining the search expression and the indexing limit search expression through a logic and operator to form an indexing patent search condition.
3. The method of claim 2, wherein indexing the patent document with an indexing information option matching the patent document comprises:
determining the indexing information option matched with the Chinese word segmentation selected by the user as the indexing information option matched with the patent document;
and performing indexing operation on the patent file through the determined indexing information option.
4. A patent document indexing device, comprising:
a first obtaining module, configured to obtain a search formula, where the search formula includes: searching for a keyword;
the first forming module is used for forming indexing patent searching conditions through the searching keywords in the searching formula;
the searching module is used for searching patent files meeting the indexing patent searching conditions;
the indexing module is used for indexing the patent files through indexing information options matched with the patent files;
forming indexing patent search conditions through the search keywords in the search formula, wherein the indexing patent search conditions comprise:
the second acquisition module is used for acquiring the search keywords in the search formula;
the extraction module is used for extracting Chinese participles from the search keywords through a participle algorithm;
the second forming module is used for forming the indexing patent searching condition according to the extracted Chinese word segmentation;
forming the indexing patent search condition according to the extracted Chinese word segmentation, comprising:
the query module is used for querying indexing information options comprising the Chinese participles from a preset indexing information set;
the determining module is used for determining the indexing information options comprising the Chinese participles as indexing information options matched with the Chinese participles and displaying the Chinese participles and the matched indexing information options to a user;
and the third forming module is used for determining indexing search keywords comprising the Chinese participles selected by the user from the search keywords when the Chinese participles selected by the user are monitored, and forming the indexing patent search conditions according to the indexing search keywords and the search formula.
5. The apparatus according to claim 4, characterized in that the third forming module is specifically configured to:
monitoring the Chinese word segmentation selected by the user;
querying the search formula through the Chinese participles selected by the user, and determining search keywords in the search formula, which comprise the Chinese participles selected by the user, as index search keywords;
when a plurality of indexing search keywords are determined, combining the plurality of indexing search keywords with logical operators among the plurality of indexing search keywords in the search formula to form an indexing limited search expression;
and combining the search expression and the indexing limit search expression through a logic and operator to form an indexing patent search condition.
6. The apparatus of claim 5, wherein the indexing module comprises:
the determining submodule is used for determining the indexing information option matched with the Chinese word segmentation selected by the user as the indexing information option matched with the patent document;
and the indexing submodule is used for indexing the patent file according to the determined indexing information option.
CN201611157229.1A 2016-12-15 2016-12-15 Patent file indexing method and device Active CN106777103B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201611157229.1A CN106777103B (en) 2016-12-15 2016-12-15 Patent file indexing method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201611157229.1A CN106777103B (en) 2016-12-15 2016-12-15 Patent file indexing method and device

Publications (2)

Publication Number Publication Date
CN106777103A CN106777103A (en) 2017-05-31
CN106777103B true CN106777103B (en) 2020-07-07

Family

ID=58888248

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201611157229.1A Active CN106777103B (en) 2016-12-15 2016-12-15 Patent file indexing method and device

Country Status (1)

Country Link
CN (1) CN106777103B (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TW200915107A (en) * 2007-09-28 2009-04-01 Hon Hai Prec Ind Co Ltd System and method for creating index of patent full text search
CN101692228A (en) * 2009-05-31 2010-04-07 上海汉光知识产权数据科技有限公司 Accurate and rapid automatic indexing method of patent documents
CN101692240A (en) * 2009-08-14 2010-04-07 北京中献电子技术开发中心 Rule-based method for patent abstract automatic extraction and keyword indexing

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7716226B2 (en) * 2005-09-27 2010-05-11 Patentratings, Llc Method and system for probabilistically quantifying and visualizing relevance between two or more citationally or contextually related data objects
CN1818906A (en) * 2006-03-10 2006-08-16 上海汉光知识产权数据科技有限公司 Indexing method of patent document
CN101661469A (en) * 2008-09-09 2010-03-03 山东科技大学 System and method for indexing and retrieving keywords of academic documents
US9940363B2 (en) * 2011-10-03 2018-04-10 Black Hills Ip Holdings, Llc Systems, methods and user interfaces in a patent management system
CN102929925A (en) * 2012-09-20 2013-02-13 百度在线网络技术(北京)有限公司 Search method and device based on browsing content

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TW200915107A (en) * 2007-09-28 2009-04-01 Hon Hai Prec Ind Co Ltd System and method for creating index of patent full text search
CN101692228A (en) * 2009-05-31 2010-04-07 上海汉光知识产权数据科技有限公司 Accurate and rapid automatic indexing method of patent documents
CN101692240A (en) * 2009-08-14 2010-04-07 北京中献电子技术开发中心 Rule-based method for patent abstract automatic extraction and keyword indexing

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
关于关键词标引的讨论;汤才祥;《http://wenku.baidu.com/view/f63b173d5901020206409c12.html》;20130814;42-47 *

Also Published As

Publication number Publication date
CN106777103A (en) 2017-05-31

Similar Documents

Publication Publication Date Title
US9008433B2 (en) Object tag metadata and image search
US20170132638A1 (en) Relevant information acquisition method and apparatus, and storage medium
CN105653537B (en) Paging query method and device for database application system
US8595229B2 (en) Search query generator apparatus
US10956469B2 (en) System and method for metadata correlation using natural language processing
CN103136228A (en) Image search method and image search device
US20200257695A1 (en) Keyword Reporting for Mobile Applications
EP2862104A1 (en) Search method and apparatus
EP2812815A1 (en) Web page retrieval method and device
US9558245B1 (en) Automatic discovery of relevant data in massive datasets
US20230044789A1 (en) Indexing Native Application Data
US20170147652A1 (en) Search servers, end devices, and search methods for use in a distributed network
Skluzacek et al. Skluma: An extensible metadata extraction pipeline for disorganized data
CN109684402A (en) One kind being based on big data platform metadata genetic connection implementation method
JP2007164633A (en) Content retrieval method, system thereof, and program thereof
CN116910374B (en) Knowledge graph-based health care service recommendation method, device and storage medium
CN106777103B (en) Patent file indexing method and device
US20180173787A1 (en) Data search method and device
JP6399514B2 (en) Method and system for performing search queries using and building block level indexes
JP6480991B2 (en) SEARCH DEVICE, SEARCH METHOD, AND SEARCH PROGRAM
CN112749258A (en) Data searching method and device, electronic equipment and storage medium
CN112434195A (en) Data analysis method and device, electronic equipment and computer readable storage medium
WO2017206604A1 (en) Processing and interaction method for use in data recommendation, device, and system
EP3432154A1 (en) Method and apparatus for providing search recommendation information
CN115563356B (en) Method and device for dynamically collecting and inquiring system interaction information based on monitoring service

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant