CN110209779B - Client, server, retrieval method and system thereof - Google Patents


Info

Publication number
CN110209779B
Authority
CN
China
Prior art keywords
result data
document
retrieval result
document retrieval
data set
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201810323759.1A
Other languages
Chinese (zh)
Other versions
CN110209779A (en
Inventor
裘钢
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Suoyi Interactive Beijing Information Technology Co ltd
Original Assignee
Suoyi Interactive Beijing Information Technology Co ltd


Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation

Abstract

A client, comprising: a first receiving unit configured to receive a first document retrieval condition; a second receiving unit configured to receive a second document retrieval condition, wherein the first document retrieval condition corresponds to a first document retrieval result data set, the second document retrieval condition corresponds to a second document retrieval result data set, at least one piece of document retrieval result data in the first document retrieval result data set does not belong to the second document retrieval result data set, and at least one piece of document retrieval result data in the second document retrieval result data set does not belong to the first document retrieval result data set; and an output unit configured to output a document retrieval result data combination including at least a first piece of document retrieval result data and a second piece of document retrieval result data, wherein a citation relationship exists between a first document corresponding to the first piece of document retrieval result data and a second document corresponding to the second piece of document retrieval result data. The disclosure thereby directly outputs a group of retrieved documents that have an association relationship.

Description

Client, server, retrieval method and system thereof
Technical Field
The present disclosure relates to the field of information processing, for example to a client, a server, a retrieval method, and a system thereof.
Background
In the prior art, document retrieval can only provide an initial retrieval result for a given search expression, or a result of further filtering based on that initial result; retrieval results that are related to one another cannot be obtained directly during retrieval.
Disclosure of Invention
In order to solve the above problem, the present disclosure provides a document retrieval method including:
step S100: receiving a first document retrieval condition and a second document retrieval condition, wherein the first document retrieval condition corresponds to a first document retrieval result data set, the second document retrieval condition corresponds to a second document retrieval result data set, and:
at least one piece of document retrieval result data in the first document retrieval result data set does not belong to the second document retrieval result data set, and at least one piece of document retrieval result data in the second document retrieval result data set does not belong to the first document retrieval result data set;
step S200: in response to the first document retrieval condition and the second document retrieval condition, performing retrieval in at least one retrieval data set to obtain the first document retrieval result data set and the second document retrieval result data set, wherein the retrieval data set includes the first document retrieval result data set and the second document retrieval result data set;
step S300: outputting a document retrieval result data combination, wherein the document retrieval result data combination includes at least a first piece of document retrieval result data and a second piece of document retrieval result data;
wherein the first piece of document retrieval result data is from the first document retrieval result data set, the second piece of document retrieval result data is from the second document retrieval result data set, and:
a citation relationship exists between a first document corresponding to the first piece of document retrieval result data and a second document corresponding to the second piece of document retrieval result data.
In addition, the present disclosure also provides a client, including:
a first receiving unit configured to receive a first document retrieval condition;
a second receiving unit configured to receive a second document retrieval condition;
wherein the first document retrieval condition corresponds to a first document retrieval result data set, the second document retrieval condition corresponds to a second document retrieval result data set, and:
at least one piece of document retrieval result data in the first document retrieval result data set does not belong to the second document retrieval result data set, and at least one piece of document retrieval result data in the second document retrieval result data set does not belong to the first document retrieval result data set;
an output unit configured to output a document retrieval result data combination including at least a first piece of document retrieval result data and a second piece of document retrieval result data;
wherein the first piece of document retrieval result data is from the first document retrieval result data set, the second piece of document retrieval result data is from the second document retrieval result data set, and:
a citation relationship exists between a first document corresponding to the first piece of document retrieval result data and a second document corresponding to the second piece of document retrieval result data.
In addition, the present disclosure also provides a server, including:
a first receiving unit configured to receive a first document retrieval condition;
a second receiving unit configured to receive a second document retrieval condition;
wherein the first document retrieval condition corresponds to a first document retrieval result data set, the second document retrieval condition corresponds to a second document retrieval result data set, and:
at least one piece of document retrieval result data in the first document retrieval result data set does not belong to the second document retrieval result data set, and at least one piece of document retrieval result data in the second document retrieval result data set does not belong to the first document retrieval result data set;
a retrieval unit configured to, in response to the first document retrieval condition and the second document retrieval condition, perform retrieval in at least one retrieval data set and obtain the first document retrieval result data set and the second document retrieval result data set, wherein the retrieval data set includes the first document retrieval result data set and the second document retrieval result data set;
an output unit configured to output a document retrieval result data combination including at least a first piece of document retrieval result data and a second piece of document retrieval result data;
wherein the first piece of document retrieval result data is from the first document retrieval result data set, the second piece of document retrieval result data is from the second document retrieval result data set, and:
a citation relationship exists between a first document corresponding to the first piece of document retrieval result data and a second document corresponding to the second piece of document retrieval result data.
In addition, the present disclosure also provides a retrieval system, which performs any of the above-described methods.
In addition, the disclosure also provides a retrieval system, which comprises any one of the clients and any one of the servers.
Thus, the present disclosure can directly output a group of retrieved documents that have an association relationship.
Drawings
FIG. 1 is a schematic illustration of a method according to one embodiment of the present disclosure;
FIG. 2 is a schematic diagram of a client according to one embodiment of the present disclosure;
FIG. 3 is a schematic diagram of a server according to one embodiment of the present disclosure.
Detailed Description
To help those skilled in the art understand the technical solutions of the present disclosure, the technical solutions of the various embodiments are described below with reference to the embodiments and the related drawings. The described embodiments are some, but not all, of the embodiments of the present disclosure. The terms "first," "second," and the like as used in this disclosure distinguish between different objects and do not describe a particular order. Furthermore, "include" and "have," as well as any variations thereof, are intended to cover a non-exclusive inclusion. For example, a process, method, system, article, or apparatus that comprises a list of steps or elements is not limited to those steps or elements but may optionally include other steps or elements that are not expressly listed or that are inherent to such process, method, system, article, or apparatus.
Reference herein to "an embodiment" means that a particular feature, structure, or characteristic described in connection with the embodiment can be included in at least one embodiment of the disclosure. The appearances of the phrase in various places in the specification are not necessarily all referring to the same embodiment, nor are separate or alternative embodiments mutually exclusive of other embodiments. It will be appreciated by those skilled in the art that the embodiments described herein may be combined with other embodiments.
Referring to FIG. 1, in one embodiment, the present disclosure provides a document retrieval method, comprising:
step S100: receiving a first document retrieval condition and a second document retrieval condition, wherein the first document retrieval condition corresponds to a first document retrieval result data set, the second document retrieval condition corresponds to a second document retrieval result data set, and:
at least one piece of document retrieval result data in the first document retrieval result data set does not belong to the second document retrieval result data set, and at least one piece of document retrieval result data in the second document retrieval result data set does not belong to the first document retrieval result data set;
step S200: in response to the first document retrieval condition and the second document retrieval condition, performing retrieval in at least one retrieval data set to obtain the first document retrieval result data set and the second document retrieval result data set, wherein the retrieval data set includes the first document retrieval result data set and the second document retrieval result data set;
step S300: outputting a document retrieval result data combination, wherein the document retrieval result data combination includes at least a first piece of document retrieval result data and a second piece of document retrieval result data;
wherein the first piece of document retrieval result data is from the first document retrieval result data set, the second piece of document retrieval result data is from the second document retrieval result data set, and:
a citation relationship exists between a first document corresponding to the first piece of document retrieval result data and a second document corresponding to the second piece of document retrieval result data.
For ease of understanding, this embodiment is described below with specific retrieval conditions. Those skilled in the art will recognize that this description does not limit the retrieval method. A specific example follows:
assume that the retrieval data set is implemented in a database. Viewed as sets, the retrieval data set may be understood as the union of the first document retrieval result data set and the second document retrieval result data set, or as a set larger than that union, even the universal set;
assume that the first retrieval condition is a condition for retrieving the following result: documents of organization A in the field of communications. All document retrieval result data in that result constitute the first document retrieval result data set. For example, the first retrieval condition may be a Boolean search over two fields, "organization to which the document belongs" and "full text", where the content of the "organization to which the document belongs" field is "organization A" and the content of the "full text" field is "communication"; alternatively, the first retrieval condition may be a semantic search for documents of organization A in the field of communications;
similarly, assume that the second retrieval condition is a condition for retrieving the following result: documents relating to LTE. All document retrieval result data in that result constitute the second document retrieval result data set;
based on this specific example, steps S100 and S200 above are readily understood by those skilled in the art.
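As a minimal sketch of steps S100 and S200 under stated assumptions, the following Python fragment runs two retrieval conditions against a toy in-memory retrieval data set and checks the requirement that each result set contains at least one item absent from the other. The records, the field names `organization` and `full_text`, and the function names are all hypothetical and are not taken from the disclosure.

```python
# Toy retrieval data set; every record and field name here is hypothetical.
RETRIEVAL_DATA_SET = [
    {"id": "d1", "organization": "A", "full_text": "a communication method for handover"},
    {"id": "d2", "organization": "A", "full_text": "communication between base stations"},
    {"id": "d3", "organization": "B", "full_text": "LTE uplink scheduling"},
    {"id": "d4", "organization": "A", "full_text": "LTE communication with carrier aggregation"},
    {"id": "d5", "organization": "C", "full_text": "battery chemistry"},
]


def search(data_set, predicate):
    """Return the ids of all records that satisfy a retrieval condition."""
    return {record["id"] for record in data_set if predicate(record)}


def first_condition(record):
    # Boolean search: organization field is "A" AND full text contains "communication".
    return record["organization"] == "A" and "communication" in record["full_text"]


def second_condition(record):
    # Full text mentions LTE.
    return "LTE" in record["full_text"]


def conditions_are_valid(first_set, second_set):
    """Each result set must hold at least one item that is absent from the other."""
    return bool(first_set - second_set) and bool(second_set - first_set)


first_set = search(RETRIEVAL_DATA_SET, first_condition)
second_set = search(RETRIEVAL_DATA_SET, second_condition)
print(sorted(first_set), sorted(second_set), conditions_are_valid(first_set, second_set))
```

In this toy data the two result sets overlap in one record, which the requirement permits; it only rules out one result set being wholly contained in the other.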
Step S300, by contrast, shows that this embodiment focuses on outputting document retrieval result data in combined form, as described below:
suppose that the first document retrieval result data set comprises 5 pieces of document retrieval result data for communications documents of organization A, namely 1-1, 1-2, 1-3, 1-4 and 1-5;
suppose that the second document retrieval result data set comprises 3 pieces of document retrieval result data for LTE documents, namely 2-1, 2-2 and 2-3;
suppose that the 3 documents corresponding to document retrieval result data 2-1, 2-2 and 2-3 all cite the document corresponding to document retrieval result data 1-1; the cited index of document retrieval result data 1-1 may then be considered to be 3;
suppose that the 2 documents corresponding to document retrieval result data 2-2 and 2-3 both cite the document corresponding to document retrieval result data 1-2; the cited index of document retrieval result data 1-2 may then be considered to be 2;
suppose that none of the 3 documents corresponding to document retrieval result data 2-1, 2-2 and 2-3 cites the document corresponding to document retrieval result data 1-3;
then, for step S300, the document retrieval result data combinations that it outputs can be expressed, by way of example, as:
combination 1: {document retrieval result data 1-1; document retrieval result data 2-1, 2-2, 2-3};
combination 2: {document retrieval result data 1-2; document retrieval result data 2-2, 2-3};
it can be understood that the above embodiment outputs combinations of document retrieval result data that have a citation relationship, which makes it easier for document researchers to identify related documents and to study and compare them further. A citation relationship is one kind of association relationship between documents. How the client displays the document retrieval result data combination is not limited by the present disclosure.
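The worked example above can be sketched in Python as follows. The citation table mirrors the assumptions of the example; the function name `build_combinations` and the dictionary layout are hypothetical, and the sketch handles only the direction in which second-set documents cite first-set documents.

```python
first_result_set = ["1-1", "1-2", "1-3", "1-4", "1-5"]
second_result_set = ["2-1", "2-2", "2-3"]

# citations[x] is the set of documents that x cites (taken from the example).
citations = {
    "2-1": {"1-1"},
    "2-2": {"1-1", "1-2"},
    "2-3": {"1-1", "1-2"},
}


def build_combinations(first_set, second_set, cites):
    """Group each cited first-set item with the second-set items that cite it."""
    combinations = []
    for cited in first_set:
        citing = [doc for doc in second_set if cited in cites.get(doc, set())]
        if citing:  # keep only items that take part in a citation relationship
            combinations.append(
                {"cited": cited, "citing": citing, "cited_index": len(citing)}
            )
    return combinations


combos = build_combinations(first_result_set, second_result_set, citations)
for combo in combos:
    print(combo)
```

Items 1-3, 1-4 and 1-5 produce no combination in this sketch because the table records no citation of them.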
Incidentally, the above-described embodiments can be applied not only to patent documents and academic journal documents, but also to web documents and any other documents having a citation relationship.
A retrieval condition may be received directly or indirectly through various receiving means such as an input box or a menu. In the input-box mode, the retrieval condition may be any of various search expressions, for example one typed into the input box with a keyboard; in the menu mode, the retrieval condition may be selected text, for example text selected with a mouse, with the retrieval triggered from a menu popped up by a left or right click.
In another embodiment,
the full text of the first document corresponding to the first piece of document retrieval result data and the full text of the second document corresponding to the second piece of document retrieval result data include at least one identical keyword.
This embodiment can further reflect the association between documents through keywords in their full text.
Taking combination 1: {document retrieval result data 1-1; document retrieval result data 2-1, 2-2, 2-3} as an example, assume the following:
the full text of the document corresponding to document retrieval result data 1-1 includes the keyword: mobile terminal;
the full text of each of the 2 documents corresponding to document retrieval result data 2-1 and 2-2 also includes the keyword: mobile terminal;
then sub-combination 1.1 can be derived from combination 1: {document retrieval result data 1-1; document retrieval result data 2-1, 2-2}. Its documents have not only a citation relationship but also a keyword association, so the association among the 3 documents corresponding to document retrieval result data 1-1, 2-1 and 2-2 is stronger and easier to identify.
It should be noted that "the same keyword" does not require the words or phrases to be literally identical, because synonyms and terms in other languages exist. For example, those skilled in the art understand that in the communications field a mobile terminal may also be referred to by its equivalent term in another language, as user equipment, or as UE, the abbreviation of user equipment.
It will be appreciated that, where multiple documents have a citation relationship, a keyword index may also be defined in the same way as the cited index of the earlier embodiment. The more identical keywords are involved, the stronger the association between the documents.
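A minimal sketch of the keyword embodiment, assuming keywords have already been extracted from each full text: synonyms are normalized through a small lookup table before comparison, and the keyword index is taken to be the number of shared keywords. The synonym table, the keyword lists (including the "handover" and "antenna" entries), and the function names are hypothetical.

```python
# Hypothetical synonym table mapping surface forms to one canonical keyword.
SYNONYMS = {
    "mobile terminal": "mobile terminal",
    "user equipment": "mobile terminal",
    "ue": "mobile terminal",
}

# Hypothetical keyword lists extracted from each document's full text.
keywords = {
    "1-1": ["mobile terminal", "handover"],
    "2-1": ["user equipment"],
    "2-2": ["UE", "handover"],
    "2-3": ["antenna"],
}


def canonical(words):
    """Normalize keywords so that synonyms compare equal."""
    return {SYNONYMS.get(w.lower(), w.lower()) for w in words}


def keyword_index(doc_a, doc_b):
    """Number of keywords the two documents share after normalization."""
    return len(canonical(keywords[doc_a]) & canonical(keywords[doc_b]))


# Combination 1 from the citation example: cited item, then citing items.
cited, citing = "1-1", ["2-1", "2-2", "2-3"]
sub_combination = (cited, [d for d in citing if keyword_index(cited, d) > 0])
print(sub_combination)
```

With these assumed keyword lists the filter keeps 2-1 and 2-2 and drops 2-3, matching sub-combination 1.1 in the text.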
In another embodiment,
the full text of the first document corresponding to the first piece of document retrieval result data and the full text of the second document corresponding to the second piece of document retrieval result data include at least one identical semantic concept or at least one approximate semantic concept.
It can be understood that, unlike the keywords of the previous embodiment, this embodiment reflects the association between documents through semantic concepts in their full text. Keywords are typically associated with traditional Boolean search, whereas the more recent semantic search works with identical or approximate semantic concepts.
Taking combination 2: {document retrieval result data 1-2; document retrieval result data 2-2, 2-3} as an example, assume the following:
the full text of the document corresponding to document retrieval result data 1-2 includes the semantic concept: smartphone;
the full text of the document corresponding to document retrieval result data 2-2 includes the same semantic concept: iPhone. It can be understood that iPhone is a specific smartphone brand and can be considered to belong to the same semantic concept as smartphone;
the full text of the document corresponding to document retrieval result data 2-3 includes an approximate semantic concept: iPad. It can be understood that an iPad, although not a smartphone, is a smart tablet and therefore belongs to an approximate semantic concept.
Then sub-combination 2.1: {document retrieval result data 1-2; document retrieval result data 2-2} and sub-combination 2.2: {document retrieval result data 1-2; document retrieval result data 2-3} can be derived from combination 2. The corresponding documents have not only a citation relationship but also a semantic association, so the association among the 3 documents corresponding to document retrieval result data 1-2, 2-2 and 2-3 is stronger and easier for document researchers to identify.
It will be appreciated that, where multiple documents have a citation relationship, a semantic concept index may also be defined in the same way as the cited index of the earlier embodiment. The more identical or approximate semantic concepts are involved, the stronger the association between the documents.
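The distinction between identical and approximate semantic concepts can be illustrated with a small lookup-based sketch. A real semantic search would typically rely on an ontology or learned embeddings; here the term-to-concept mapping, the table of approximate concept pairs, and the function names are hypothetical stand-ins.

```python
# Hypothetical mapping from a term to its semantic concept.
CONCEPT_OF = {"smartphone": "smartphone", "iPhone": "smartphone", "iPad": "tablet"}
# Hypothetical pairs of concepts treated as approximate to each other.
APPROXIMATE = {frozenset({"smartphone", "tablet"})}

# Terms assumed to occur in each document's full text (from the example).
terms = {"1-2": ["smartphone"], "2-2": ["iPhone"], "2-3": ["iPad"]}


def concepts(doc):
    """Semantic concepts covered by a document's terms."""
    return {CONCEPT_OF[t] for t in terms[doc] if t in CONCEPT_OF}


def semantic_relation(doc_a, doc_b):
    """Return 'same', 'approximate', or None for two documents."""
    a, b = concepts(doc_a), concepts(doc_b)
    if a & b:
        return "same"
    if any(frozenset({x, y}) in APPROXIMATE for x in a for y in b):
        return "approximate"
    return None


for citing in ["2-2", "2-3"]:
    print("1-2", citing, semantic_relation("1-2", citing))
```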
In another embodiment,
the full text of the first document corresponding to the first piece of document retrieval result data and the full text of the second document corresponding to the second piece of document retrieval result data include at least one similar picture.
Unlike the keywords and semantic concepts of the previous embodiments, this embodiment reflects the association between documents through the similarity of their pictures. All prior-art techniques for searching images by image (reverse image search) can be used in this embodiment, such as www.tineye.com, the Baidu image search function, or other similar technologies. Similarly, a picture index may also be defined in this embodiment. The more similar pictures are involved, the stronger the association between the documents.
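The disclosure leaves the picture comparison technique open. As one illustrative stand-in, the sketch below uses average hashing, a simple perceptual hashing technique, and counts picture pairs whose hashes differ in at most a threshold number of bits. The tiny 2x2 grayscale "pictures", the threshold, and the function names are assumptions for illustration only.

```python
def average_hash(pixels):
    """Bit tuple: 1 where a grayscale pixel is brighter than the image mean."""
    flat = [value for row in pixels for value in row]
    mean = sum(flat) / len(flat)
    return tuple(1 if value > mean else 0 for value in flat)


def hamming(hash_a, hash_b):
    """Number of bit positions in which two equal-length hashes differ."""
    return sum(a != b for a, b in zip(hash_a, hash_b))


def picture_index(pictures_a, pictures_b, max_distance=1):
    """Count picture pairs whose hashes differ in at most max_distance bits."""
    return sum(
        1
        for pa in pictures_a
        for pb in pictures_b
        if hamming(average_hash(pa), average_hash(pb)) <= max_distance
    )


# Hypothetical 2x2 grayscale pictures (values 0-255).
figure_a = [[10, 200], [220, 15]]
figure_b = [[12, 190], [210, 20]]  # slightly re-exposed copy of figure_a
figure_c = [[200, 10], [15, 220]]  # inverted pattern

print(picture_index([figure_a], [figure_b, figure_c]))
```

Practical systems hash larger, downscaled images and tune the threshold; the structure of the comparison stays the same.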
It can be understood that the citation relationships, identical keywords, identical semantic concepts, approximate semantic concepts, and similar pictures referred to in the above embodiments are different kinds of association relationship. The more kinds of association relationship are involved, and the higher the index of each kind, the stronger the association between the documents.
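One way to make this comparison concrete is sketched below. The disclosure does not specify how the kinds and their indexes are weighed against each other; this sketch assumes that the number of kinds present is compared first and the sum of the indexes second. The function name and the sample values are hypothetical.

```python
def association_strength(indexes):
    """indexes maps an association kind to its index, e.g. {'citation': 3}.

    Returns (number of kinds present, sum of indexes). Tuples compare the
    kind count first, which is an assumption of this sketch.
    """
    present = {kind: value for kind, value in indexes.items() if value > 0}
    return (len(present), sum(present.values()))


# Hypothetical index values for two document pairs.
pair_x = {"citation": 1, "keyword": 2, "semantic": 0, "picture": 0}
pair_y = {"citation": 1, "keyword": 0, "semantic": 0, "picture": 0}

print(association_strength(pair_x), association_strength(pair_y))
print(association_strength(pair_x) > association_strength(pair_y))
```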
In another embodiment,
step S300 further includes: outputting the document retrieval result data combinations in response to a sorting condition.
Taking combination 1: {document retrieval result data 1-1; document retrieval result data 2-1, 2-2, 2-3}
and combination 2: {document retrieval result data 1-2; document retrieval result data 2-2, 2-3} as examples:
combinations 1 and 2 involve different numbers of pieces of document retrieval result data: combination 1 contains 4 pieces in total and combination 2 contains 3. The sorting condition may be ascending or descending order; in ascending order, combination 2 is placed before combination 1, and in descending order, combination 1 is placed before combination 2.
That is, by means of sorting, this embodiment helps improve the user experience and facilitates later statistics or other processing.
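The sorting described above can be sketched as follows, using the number of pieces of result data in each combination as the sort key. The dictionary layout and the function name `sort_combinations` are hypothetical.

```python
combinations = [
    {"name": "combination 1", "items": ["1-1", "2-1", "2-2", "2-3"]},
    {"name": "combination 2", "items": ["1-2", "2-2", "2-3"]},
]


def sort_combinations(combos, descending=False):
    """Order combinations by how many pieces of result data they contain."""
    return sorted(combos, key=lambda c: len(c["items"]), reverse=descending)


ascending_names = [c["name"] for c in sort_combinations(combinations)]
descending_names = [c["name"] for c in sort_combinations(combinations, descending=True)]
print(ascending_names)
print(descending_names)
```

Other sort keys, such as the cited index or a keyword index, could be substituted for the item count without changing the structure.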
A data set as described in the above embodiments is a collection of data; it may be stored in a database or in any other form, without limitation.
Further, referring to FIG. 2, the present disclosure also provides, in one embodiment, a corresponding client, comprising:
a first receiving unit configured to receive a first document retrieval condition;
a second receiving unit configured to receive a second document retrieval condition;
wherein the first document retrieval condition corresponds to a first document retrieval result data set, the second document retrieval condition corresponds to a second document retrieval result data set, and:
at least one piece of document retrieval result data in the first document retrieval result data set does not belong to the second document retrieval result data set, and at least one piece of document retrieval result data in the second document retrieval result data set does not belong to the first document retrieval result data set;
an output unit configured to output a document retrieval result data combination including at least a first piece of document retrieval result data and a second piece of document retrieval result data;
wherein the first piece of document retrieval result data is from the first document retrieval result data set, the second piece of document retrieval result data is from the second document retrieval result data set, and:
a citation relationship exists between a first document corresponding to the first piece of document retrieval result data and a second document corresponding to the second piece of document retrieval result data.
Similar to the method embodiments, this embodiment discloses a technical solution for the client through corresponding functional units.
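A rough sketch of how the client's functional units might map onto code is given below. The class name, method names, and the text rendering of a combination are hypothetical; the disclosure does not prescribe how the client obtains or displays the combinations.

```python
class Client:
    """Client with two receiving units and one output unit (hypothetical API)."""

    def __init__(self):
        self.first_condition = None
        self.second_condition = None

    def receive_first_condition(self, condition):
        """First receiving unit."""
        self.first_condition = condition

    def receive_second_condition(self, condition):
        """Second receiving unit."""
        self.second_condition = condition

    def output(self, combinations):
        """Output unit: render each (cited, citing list) pair as one line."""
        lines = []
        for cited, citing in combinations:
            lines.append("{" + cited + "; " + ", ".join(citing) + "}")
        return lines


client = Client()
client.receive_first_condition("organization A AND communication")
client.receive_second_condition("LTE")
rendered = client.output([("1-1", ["2-1", "2-2", "2-3"]), ("1-2", ["2-2", "2-3"])])
for line in rendered:
    print(line)
```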
Preferably, with reference to the foregoing embodiments,
the full text of the first document corresponding to the first piece of document retrieval result data and the full text of the second document corresponding to the second piece of document retrieval result data include at least one identical keyword.
Preferably, with reference to the foregoing embodiments,
the full text of the first document corresponding to the first piece of document retrieval result data and the full text of the second document corresponding to the second piece of document retrieval result data include at least one identical semantic concept or at least one approximate semantic concept.
Preferably, with reference to the foregoing embodiments,
the full text of the first document corresponding to the first piece of document retrieval result data and the full text of the second document corresponding to the second piece of document retrieval result data include at least one similar picture.
Preferably, with reference to the foregoing embodiments,
the output unit is further configured to output the document retrieval result data combination in response to a sorting condition.
Similar to the method embodiments, the following embodiment discloses a technical solution for the server through corresponding functional units.
Referring to FIG. 3, the present disclosure provides, in one embodiment, a server comprising:
a first receiving unit configured to receive a first document retrieval condition;
a second receiving unit configured to receive a second document retrieval condition;
wherein the first document retrieval condition corresponds to a first document retrieval result data set, the second document retrieval condition corresponds to a second document retrieval result data set, and:
at least one piece of document retrieval result data in the first document retrieval result data set does not belong to the second document retrieval result data set, and at least one piece of document retrieval result data in the second document retrieval result data set does not belong to the first document retrieval result data set;
a retrieval unit configured to, in response to the first document retrieval condition and the second document retrieval condition, perform retrieval in at least one retrieval data set and obtain the first document retrieval result data set and the second document retrieval result data set, wherein the retrieval data set includes the first document retrieval result data set and the second document retrieval result data set;
an output unit configured to output a document retrieval result data combination including at least a first piece of document retrieval result data and a second piece of document retrieval result data;
wherein the first piece of document retrieval result data is from the first document retrieval result data set, the second piece of document retrieval result data is from the second document retrieval result data set, and:
a citation relationship exists between a first document corresponding to the first piece of document retrieval result data and a second document corresponding to the second piece of document retrieval result data.
Preferably, with reference to the foregoing embodiments,
the full text of the first document corresponding to the first piece of document retrieval result data and the full text of the second document corresponding to the second piece of document retrieval result data include at least one identical keyword.
Preferably, with reference to the foregoing embodiments,
the full text of the first document corresponding to the first piece of document retrieval result data and the full text of the second document corresponding to the second piece of document retrieval result data include at least one identical semantic concept or at least one approximate semantic concept.
Preferably, with reference to the foregoing embodiments,
the full text of the first document corresponding to the first piece of document retrieval result data and the full text of the second document corresponding to the second piece of document retrieval result data include at least one similar picture.
Preferably, with reference to the foregoing embodiments,
the output unit is further configured to output the document retrieval result data combination in response to a sorting condition.
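The server-side units can be sketched in the same spirit. In this minimal sketch a retrieval condition is simplified to a keyword matched against a `full_text` field, and citations are held in an in-memory table; the class name, method names, record layout, and sample records are all hypothetical.

```python
class Server:
    """Server with receiving, retrieval, and output units (hypothetical API)."""

    def __init__(self, retrieval_data_set, citations):
        self.retrieval_data_set = retrieval_data_set  # list of {"id", "full_text"}
        self.citations = citations                    # citing id -> set of cited ids

    def retrieve(self, keyword):
        """Retrieval unit: ids of records whose full text contains the keyword."""
        return [r["id"] for r in self.retrieval_data_set if keyword in r["full_text"]]

    def handle(self, first_condition, second_condition):
        """Receive both conditions, retrieve, and output the combinations."""
        first_set = self.retrieve(first_condition)
        second_set = self.retrieve(second_condition)
        combinations = []
        for cited in first_set:
            citing = [d for d in second_set if cited in self.citations.get(d, set())]
            if citing:
                combinations.append((cited, citing))
        return combinations


server = Server(
    retrieval_data_set=[
        {"id": "1-1", "full_text": "communication handover"},
        {"id": "1-2", "full_text": "communication paging"},
        {"id": "2-1", "full_text": "LTE scheduling"},
        {"id": "2-2", "full_text": "LTE random access"},
    ],
    citations={"2-1": {"1-1"}, "2-2": {"1-1", "1-2"}},
)
result = server.handle("communication", "LTE")
print(result)
```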
Similar to the foregoing embodiments, the present disclosure further provides the following system in one embodiment:
a retrieval system that performs any of the retrieval methods described above.
Similar to the foregoing embodiments, the present disclosure further provides the following system in another embodiment:
a retrieval system comprising any of the clients described above and any of the servers described above.
The steps in the methods of the embodiments of the present disclosure may be reordered, combined, and deleted according to actual needs.
The units in the device of the embodiment of the disclosure can be combined, divided and deleted according to actual needs. It should be noted that, for simplicity of description, the above-mentioned method embodiments are described as a series of acts or combination of acts, but those skilled in the art will recognize that the present invention is not limited by the order of acts, as some steps may occur in other orders or concurrently in accordance with the invention. Furthermore, those skilled in the art should also appreciate that the embodiments described in the specification are preferred embodiments and that the acts, modules, and elements described herein are not necessarily required by the invention.
In the foregoing embodiments, the descriptions of the respective embodiments have respective emphasis, and for parts that are not described in detail in a certain embodiment, reference may be made to related descriptions of other embodiments.
In the several embodiments provided in the present disclosure, it should be understood that the disclosed apparatus may be implemented in other ways. The device embodiments described above are merely illustrative. For example, the division into units is only a logical division, and other divisions are possible in an actual implementation: multiple units or components may be combined or integrated into another system, or some features may be omitted or not executed. In addition, the mutual coupling, direct coupling, or communication connection between units or components may be implemented through interfaces, and the indirect coupling or communication connection between devices or units may be electrical or take another form.
The units described as separate parts may or may not be physically separate, may be located in one place, or may be distributed over a plurality of network elements. Some or all of the units can be selected according to actual needs to achieve the purpose of the solution of the embodiment.
In addition, functional units in the embodiments of the present disclosure may be integrated into one processing unit, or each unit may exist alone physically, or two or more units are integrated into one unit. The integrated unit can be realized in a form of hardware, and can also be realized in a form of a software functional unit.
The integrated unit, if implemented in the form of a software functional unit and sold or used as a stand-alone product, may be stored in a computer-readable storage medium. Based on this understanding, the technical solution of the present disclosure may be embodied in the form of a software product, which is stored in a storage medium and includes several instructions for causing a computer device (which may be a smartphone, a personal digital assistant, a wearable device, a laptop, or a tablet computer) to perform all or part of the steps of the method according to the embodiments of the present disclosure. The aforementioned storage medium includes a USB flash drive, a read-only memory (ROM), a random access memory (RAM), a removable hard disk, a magnetic or optical disk, and various other media capable of storing program code.
As described above, the above embodiments are only used to illustrate the technical solutions of the present disclosure, and not to limit the same; although the present disclosure has been described in detail with reference to the foregoing embodiments, it should be understood by those skilled in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some technical features may be equivalently replaced; and such modifications or substitutions do not depart from the spirit and scope of the corresponding technical solutions of the embodiments of the present disclosure.

Claims (5)

1. A client for obtaining associated retrieval results directly at the time of retrieval, comprising:
a first receiving unit configured to receive a first document retrieval condition;
a second receiving unit configured to receive a second document retrieval condition;
wherein the first document retrieval condition corresponds to a first document retrieval result data set, the second document retrieval condition corresponds to a second document retrieval result data set, and:
at least one piece of document retrieval result data in the first document retrieval result data set does not belong to the second document retrieval result data set, and at least one piece of document retrieval result data in the second document retrieval result data set does not belong to the first document retrieval result data set;
an output unit configured to output a combination of document retrieval result data including at least a first piece of document retrieval result data and a second piece of document retrieval result data;
wherein the first piece of document retrieval result data is from the first document retrieval result data set, the second piece of document retrieval result data is from the second document retrieval result data set, and:
a document reference relationship exists between a first document corresponding to the first piece of document retrieval result data and a second document corresponding to the second piece of document retrieval result data; wherein,
the full text of the first document corresponding to the first piece of document retrieval result data and the full text of the second document corresponding to the second piece of document retrieval result data comprise at least one same keyword; or the full text of the first document and the full text of the second document comprise at least one same semantic concept or at least one approximate semantic concept; or the full text of the first document and the full text of the second document comprise at least one approximate picture; wherein the same keyword includes an identical character or word, a synonym, or a corresponding word in another language;
the output unit is further configured to output the document retrieval result data combination in response to a sorting condition.
2. A server for obtaining associated retrieval results directly at the time of retrieval, comprising:
a first receiving unit configured to receive a first document retrieval condition;
a second receiving unit configured to receive a second document retrieval condition;
wherein the first document retrieval condition corresponds to a first document retrieval result data set, the second document retrieval condition corresponds to a second document retrieval result data set, and:
at least one piece of document retrieval result data in the first document retrieval result data set does not belong to the second document retrieval result data set, and at least one piece of document retrieval result data in the second document retrieval result data set does not belong to the first document retrieval result data set;
a retrieval unit configured to, in response to the first document retrieval condition and the second document retrieval condition, perform retrieval in at least one retrieval data set to obtain the first document retrieval result data set and the second document retrieval result data set; wherein the retrieval data set comprises the first document retrieval result data set and the second document retrieval result data set;
an output unit configured to output a combination of document retrieval result data including at least a first piece of document retrieval result data and a second piece of document retrieval result data;
wherein the first piece of document retrieval result data is from the first document retrieval result data set, the second piece of document retrieval result data is from the second document retrieval result data set, and:
a document reference relationship exists between a first document corresponding to the first piece of document retrieval result data and a second document corresponding to the second piece of document retrieval result data; wherein,
the full text of the first document corresponding to the first piece of document retrieval result data and the full text of the second document corresponding to the second piece of document retrieval result data comprise at least one same keyword; or the full text of the first document and the full text of the second document comprise at least one same semantic concept or at least one approximate semantic concept; or the full text of the first document and the full text of the second document comprise at least one approximate picture; wherein the same keyword includes an identical character or word, a synonym, or a corresponding word in another language;
the output unit is further configured to output the document retrieval result data combination in response to a sorting condition.
3. A document retrieval method for obtaining associated retrieval results directly at the time of retrieval, comprising:
step S100: receiving a first document retrieval condition corresponding to a first document retrieval result data set and a second document retrieval condition corresponding to a second document retrieval result data set, and:
at least one piece of document retrieval result data in the first document retrieval result data set does not belong to the second document retrieval result data set, and at least one piece of document retrieval result data in the second document retrieval result data set does not belong to the first document retrieval result data set;
step S200: in response to the first document retrieval condition and the second document retrieval condition, performing retrieval in at least one retrieval data set to obtain the first document retrieval result data set and the second document retrieval result data set; wherein the retrieval data set comprises the first document retrieval result data set and the second document retrieval result data set;
step S300: outputting a document retrieval result data combination, wherein the document retrieval result data combination comprises at least a first piece of document retrieval result data and a second piece of document retrieval result data;
wherein the first piece of document retrieval result data is from the first document retrieval result data set, the second piece of document retrieval result data is from the second document retrieval result data set, and:
a document reference relationship exists between a first document corresponding to the first piece of document retrieval result data and a second document corresponding to the second piece of document retrieval result data; wherein,
the full text of the first document corresponding to the first piece of document retrieval result data and the full text of the second document corresponding to the second piece of document retrieval result data comprise at least one same keyword; or the full text of the first document and the full text of the second document comprise at least one same semantic concept or at least one approximate semantic concept; or the full text of the first document and the full text of the second document comprise at least one approximate picture; wherein the same keyword includes an identical character or word, a synonym, or a corresponding word in another language;
step S300 further comprises: outputting the document retrieval result data combination in response to a sorting condition.
4. A retrieval system, said system performing the method of claim 3 above.
5. A retrieval system comprising the client of claim 1 and the server of claim 2.
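As a rough illustration of steps S100 to S300 of claim 3, the following Python sketch retrieves two result data sets, checks that neither contains the other, and outputs sorted pairs of related documents. Every name in it (`Doc`, `CANONICAL`, `retrieve`, `mutually_non_contained`, `related`, `combine`, the toy corpus) is hypothetical and not taken from the patent. The sketch assumes that retrieval conditions are plain predicates, that the reference relationship is a citation in either direction, that "wherein" makes the shared keyword an additional requirement, and that only the keyword alternative is checked (semantic concepts and approximate pictures are left out).

```python
from dataclasses import dataclass
from itertools import product

# Hypothetical equivalence table: synonyms and other-language words
# are mapped to one canonical keyword before comparison.
CANONICAL = {"automobile": "car", "voiture": "car"}


@dataclass(frozen=True)
class Doc:
    doc_id: str
    year: int
    keywords: frozenset = frozenset()
    cites: frozenset = frozenset()  # doc_ids cited by this document


def normalize(keyword):
    word = keyword.lower()
    return CANONICAL.get(word, word)


def retrieve(corpus, condition):
    """Step S200: the result data set for one retrieval condition."""
    return {doc for doc in corpus if condition(doc)}


def mutually_non_contained(first, second):
    """Each set holds at least one item that the other lacks."""
    return bool(first - second) and bool(second - first)


def related(a, b):
    """Citation in either direction plus at least one shared keyword (assumed reading)."""
    cited = b.doc_id in a.cites or a.doc_id in b.cites
    shared = {normalize(k) for k in a.keywords} & {normalize(k) for k in b.keywords}
    return cited and bool(shared)


def combine(first, second, sort_key=None):
    """Step S300: related (first, second) pairs, ordered by the sorting condition."""
    pairs = [(a, b) for a, b in product(first, second) if related(a, b)]
    return sorted(pairs, key=sort_key) if sort_key else pairs


corpus = [
    Doc("A", 2015, frozenset({"car", "battery"}), frozenset({"C"})),
    Doc("B", 2017, frozenset({"car"})),
    Doc("C", 2012, frozenset({"Automobile"})),
    Doc("D", 2016, frozenset({"voiture", "sensor"}), frozenset({"A"})),
]
# Step S100: two retrieval conditions, expressed here as predicates.
first = retrieve(corpus, lambda d: d.year >= 2015)
second = retrieve(corpus, lambda d: d.year <= 2016)
assert mutually_non_contained(first, second)
result = combine(first, second, sort_key=lambda p: (p[0].year, p[1].year))
print([(a.doc_id, b.doc_id) for a, b in result])
```

In this sketch the sorting condition is a key function over a pair; a real system might instead sort by relevance score, date, or citation count, which the claims leave open.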
CN201810323759.1A 2018-02-05 2018-04-11 Client, server, retrieval method and system thereof Active CN110209779B (en)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
CN2018101164670 2018-02-05
CN201810116467 2018-02-05
CN201810116470 2018-02-05
CN2018101164702 2018-02-05

Publications (2)

Publication Number Publication Date
CN110209779A (en) 2019-09-06
CN110209779B (en) 2021-11-30

Family

ID=67779039

Family Applications (2)

Application Number Title Priority Date Filing Date
CN201810323759.1A Active CN110209779B (en) 2018-02-05 2018-04-11 Client, server, retrieval method and system thereof
CN201810323375.XA Active CN110309416B (en) 2018-02-05 2018-04-11 Client, server, retrieval method and system thereof

Country Status (1)

Country Link
CN (2) CN110209779B (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102279893A (en) * 2011-09-19 2011-12-14 索意互动(北京)信息技术有限公司 Many-to-many automatic analysis method of document group
CN105956125A (en) * 2016-05-06 2016-09-21 长沙市麓智信息科技有限公司 Patent monitoring system and method
CN106557493A (en) * 2015-09-25 2017-04-05 索意互动(北京)信息技术有限公司 A kind of data retrieval method, device and data retrieval server

Family Cites Families (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2001167087A (en) * 1999-12-14 2001-06-22 Fujitsu Ltd Device and method for retrieving structured document, program recording medium for structured document retrieval and index preparing method for structured document retrieval
JP2007183864A (en) * 2006-01-10 2007-07-19 Fujitsu Ltd File retrieval method and system therefor
CN100573531C (en) * 2008-07-04 2009-12-23 华中科技大学 A kind of document retrieval method based on association analysis
CN103257985A (en) * 2012-05-30 2013-08-21 韩俊 Device and method for simultaneously searching, inserting and displaying multiple cross-domain databases
CN103761307A (en) * 2014-01-22 2014-04-30 华为技术有限公司 Data processing device and data processing method
CN103886063B (en) * 2014-03-18 2017-03-08 国家电网公司 A kind of text searching method and device
CN104346446A (en) * 2014-10-27 2015-02-11 百度在线网络技术(北京)有限公司 Paper associated information recommendation method and device based on mapping knowledge domain
CN105989142A (en) * 2015-02-28 2016-10-05 华为技术有限公司 Data query method and device
CN107180059A (en) * 2016-03-11 2017-09-19 北大方正集团有限公司 Data retrieval method and data retrieval system
CN105938493A (en) * 2016-04-14 2016-09-14 乐视控股(北京)有限公司 Resource search method and apparatus
CN107463566A (en) * 2016-06-02 2017-12-12 索意互动(北京)信息技术有限公司 A kind of document retrieval method and system
CN106445916A (en) * 2016-09-19 2017-02-22 合肥清浊信息科技有限公司 Semantic analysis method for patent retrieval

Also Published As

Publication number Publication date
CN110209779A (en) 2019-09-06
CN110309416A (en) 2019-10-08
CN110309416B (en) 2021-11-30

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant