CN107679055B - Information retrieval method, server and readable storage medium - Google Patents

Publication number
CN107679055B
Authority
CN
China
Prior art keywords
information
retrieved
content
query
retrieval
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201710490375.4A
Other languages
Chinese (zh)
Other versions
CN107679055A (en)
Inventor
韦邕
赵清源
吕梓燊
徐亮
肖京
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Ping An Technology Shenzhen Co Ltd
Original Assignee
Ping An Technology Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ping An Technology Shenzhen Co Ltd
Priority to CN201710490375.4A (CN107679055B)
Priority to PCT/CN2017/105016 (WO2019000697A1)
Publication of CN107679055A
Application granted
Publication of CN107679055B
Legal status: Active

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24 Querying
    • G06F16/245 Query processing
    • G06F16/2455 Query execution
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24 Querying
    • G06F16/245 Query processing
    • G06F16/2452 Query translation
    • G06F16/24528 Standardisation; Simplification
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24 Querying
    • G06F16/248 Presentation of query results

Abstract

The invention discloses an information retrieval method, a server and a readable storage medium. The method comprises the following steps: after content to be retrieved is received, converting it into to-be-retrieved information in a preset information format; parsing the to-be-retrieved information to obtain the content to be retrieved, searching a predetermined database according to the parsed content to obtain a query result, and converting the query result into query information in the same preset information format; and parsing the query information to obtain the query result, then sorting and outputting the query result. Because every step exchanges information in one format, the invention improves the universality of the information retrieval system.

Description

Information retrieval method, server and readable storage medium
Technical Field
The present invention relates to the field of computer technologies, and in particular, to an information retrieval method, a server, and a readable storage medium.
Background
Faced with large volumes of information, retrieval has become an indispensable technology. In existing information retrieval systems, the format in which information is passed between steps such as input, analysis, matching, recall, sorting and output is not uniformly specified; that is, each step may pass information in a different format. Consequently, when steps must be added or removed to adapt the information retrieval system to a different project application scenario, large amounts of code must be rewritten, and the system and its framework may even need to change, so universality is poor.
Disclosure of Invention
The invention mainly aims to provide an information retrieval method, a server and a readable storage medium, aiming at improving the universality of an information retrieval system.
In order to achieve the above object, the present invention provides a retrieval server, which includes a memory, a processor, and an information retrieval system stored in the memory and operable on the processor, wherein the information retrieval system, when executed by the processor, implements the following steps:
A. after receiving the content to be retrieved, converting the content to be retrieved into to-be-retrieved information in a preset information format;
B. parsing the to-be-retrieved information to obtain the content to be retrieved, searching a predetermined database according to the parsed content to obtain a query result, and converting the query result into query information in the preset information format;
C. parsing the query information to obtain the query result, and sorting and outputting the query result.
Preferably, the preset information format is a triple format, and the information in the preset information format includes an identification code of the information, information content data, and weight data corresponding to the information.
Preferably, the step B includes:
parsing the to-be-retrieved information to obtain the content to be retrieved, and searching a predetermined database according to the parsed content to obtain a query result, wherein the query result comprises the positions at which the content to be retrieved occurs in the predetermined database and its occurrence frequency at each position; and converting the query result into query information in the preset information format, wherein the identification code in the query information is a preset identification code of the content to be retrieved, the information content data in the query information is the position information of the content to be retrieved in the predetermined database, and the weight data in the query information is the occurrence frequency of the content to be retrieved at each of those positions;
the step C comprises the following steps:
parsing the query information to obtain the position information of the content to be retrieved in the predetermined database and its occurrence frequency at each position, sorting according to the position information and the occurrence frequency, and outputting the sorted query result.
Preferably, before the step A, the processor is further configured to execute the information retrieval system to implement the following step:
after original retrieval information sent by a user is received, processing the original retrieval information in a preset natural language processing manner, and extracting the content to be retrieved from the original retrieval information.
Preferably, the information in the preset information format includes user identity data; the step C further comprises the following steps:
parsing the query information to obtain the query result and the user identity data;
acquiring a historical retrieval record of the user according to the user identity data;
and matching the query result against the historical retrieval record of the user, and sorting and outputting the query results that match the historical retrieval record.
In addition, in order to achieve the above object, the present invention further provides an information retrieval method applied to a retrieval server, the method including:
A. after receiving the content to be retrieved, converting the content to be retrieved into to-be-retrieved information in a preset information format;
B. parsing the to-be-retrieved information to obtain the content to be retrieved, searching a predetermined database according to the parsed content to obtain a query result, and converting the query result into query information in the preset information format;
C. parsing the query information to obtain the query result, and sorting and outputting the query result.
Preferably, the preset information format is a triple format, and the information in the preset information format includes an identification code of the information, information content data, and weight data corresponding to the information.
Preferably, the step B includes:
parsing the to-be-retrieved information to obtain the content to be retrieved, and searching a predetermined database according to the parsed content to obtain a query result, wherein the query result comprises the positions at which the content to be retrieved occurs in the predetermined database and its occurrence frequency at each position; and converting the query result into query information in the preset information format, wherein the identification code in the query information is a preset identification code of the content to be retrieved, the information content data in the query information is the position information of the content to be retrieved in the predetermined database, and the weight data in the query information is the occurrence frequency of the content to be retrieved at each of those positions;
the step C comprises the following steps:
parsing the query information to obtain the position information of the content to be retrieved in the predetermined database and its occurrence frequency at each position, sorting according to the position information and the occurrence frequency, and outputting the sorted query result.
Preferably, before the step A, the method further comprises:
after original retrieval information sent by a user is received, processing the original retrieval information in a preset natural language processing manner, and extracting the content to be retrieved from the original retrieval information.
Further, to achieve the above object, the present invention also provides a computer-readable storage medium storing an information retrieval system, which is executable by at least one processor to cause the at least one processor to perform the steps of the information retrieval method as described above.
According to the information retrieval method, the server and the readable storage medium, after the content to be retrieved is received, it is converted into to-be-retrieved information in the preset information format; during the database query, the to-be-retrieved information is parsed to obtain the content to be retrieved, the database is searched to obtain a query result, and the query result is converted into query information in the preset information format; and at output time, the query information is parsed to obtain the query result, which is then sorted and output. Because the information passed between all steps of the retrieval uses the same preset information format, steps can be added or removed to adapt the information retrieval system to different project application scenarios without extensive code rewriting and without changing the system or its framework, which greatly improves universality.
Drawings
FIG. 1 is a schematic diagram of an operating environment of an information retrieval system 10 according to a preferred embodiment of the present invention;
FIG. 2 is a functional block diagram of an embodiment of an information retrieval system according to the present invention;
FIG. 3 is a functional block diagram of another embodiment of an information retrieval system according to the present invention;
FIG. 4 is a flowchart illustrating an information retrieval method according to an embodiment of the present invention;
FIG. 5 is a flowchart illustrating an information retrieval method according to another embodiment of the present invention.
The implementation, functional features and advantages of the objects of the present invention will be further explained with reference to the accompanying drawings.
Detailed Description
In order to make the technical problems to be solved, the technical solutions and the advantageous effects of the present invention clearer, the present invention is further described in detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.
The invention provides an information retrieval system. Please refer to fig. 1, which is a schematic diagram illustrating an operating environment of an information retrieval system 10 according to a preferred embodiment of the present invention.
In the present embodiment, the information retrieval system 10 is installed and runs in the retrieval server 1. The retrieval server 1 may include, but is not limited to, a memory 11 and a processor 12. Fig. 1 shows only the retrieval server 1 with components 11 and 12, but it is to be understood that not all of the shown components are required, and that more or fewer components may be implemented instead.
The memory 11 may in some embodiments be an internal storage unit of the retrieval server 1, such as a hard disk or internal memory of the retrieval server 1. In other embodiments, the memory 11 may be an external storage device of the retrieval server 1, such as a plug-in hard disk, a Smart Media Card (SMC), a Secure Digital (SD) card or a flash memory card provided on the retrieval server 1. Further, the memory 11 may include both an internal storage unit and an external storage device of the retrieval server 1. The memory 11 is used for storing application software installed in the retrieval server 1 and various types of data, such as the program code of the information retrieval system 10. The memory 11 may also be used to temporarily store data that has been output or is to be output.
The processor 12 may be, in some embodiments, a Central Processing Unit (CPU), a microprocessor or another data processing chip for executing program code stored in the memory 11 or processing data, for example executing the information retrieval system 10.
Further, in some embodiments, the retrieval server 1 may further include a display, which may be an LED display, a liquid crystal display, a touch-sensitive liquid crystal display, an OLED (Organic Light-Emitting Diode) touch device, or the like. The display is used for displaying information processed in the retrieval server 1 and for displaying a visual user interface, such as an interface in which a user inputs content to be retrieved or an interface that displays the query result after the retrieval is completed. The memory 11, the processor 12 and the display of the retrieval server 1 communicate with each other via a system bus.
Please refer to fig. 2, which is a functional block diagram of an embodiment of the information retrieval system 10 according to the present invention. In this embodiment, the information retrieval system 10 may be divided into one or more modules, and the one or more modules are stored in the memory 11 and executed by one or more processors (in this embodiment, the processor 12) to complete the present invention. For example, in fig. 2, the information retrieval system 10 may be divided into a conversion module 01, a query module 02, and an output module 03. The module referred to in the present invention refers to a series of computer program instruction segments capable of performing specific functions, and is more suitable than a program for describing the execution process of the information retrieval system 10 in the retrieval server 1. The following description will specifically describe the functions of the conversion module 01, the query module 02 and the output module 03.
The conversion module 01 is configured to convert, after receiving the content to be retrieved, the content to be retrieved into information to be retrieved in a preset information format.
The input content of a user is received and analyzed to obtain the content to be retrieved. For example, input content (including but not limited to voice, text and pictures) sent by the user through a browser, an APP client or the like is received, and the content to be retrieved, such as retrieval keywords or phrases, is extracted from it. The content to be retrieved is then converted into to-be-retrieved information in the preset information format, which is output to the next link of the retrieval. The preset information format may be any predetermined format that facilitates transmission and parsing of the information stream, and it defines the types of information carried: the main content of the information, optionally an identification code, and optionally user information. For example, the to-be-retrieved information may include the content to be retrieved (such as a retrieval keyword or phrase); an identification code corresponding to that content, so that the retrieval result for the content can later be identified by the code; and information about the user who entered the content, so that the retrieval result can later be associated with that user.
The query module 02 is configured to parse the to-be-retrieved information to obtain the content to be retrieved, search a predetermined database according to the parsed content to obtain a query result, and convert the query result into query information in the preset information format.
At query time, the to-be-retrieved information in the preset information format is parsed, for example in a predetermined parsing manner, to obtain the content to be retrieved, and a predetermined database is then searched according to the parsed content to obtain a query result. The predetermined database may be a local database or a remote database connected through a network, and is not limited herein. The query result is then converted into query information in the preset information format and output to the next link of the retrieval, for example a link that further processes the query result or a link that directly displays and outputs it. The query information has the same format as the to-be-retrieved information: both use the preset information format.
The output module 03 is configured to parse the query information to obtain the query result, and to sort and output the query result.
After the query information is received, it can be parsed in a predetermined parsing manner to obtain its content, namely the query result; the query result can then be sorted and output so that the user can view the final retrieval result, which completes the retrieval process.
In this embodiment, it is only necessary to unify the format in which information is passed between the steps of the retrieval process, such as retrieval information input, query and output. Each link then has the same input and output format, and processed data can be passed directly to the next link for further processing without any conversion. If information must be processed by different methods, plug-ins with different functions can be developed to implement them and then arranged in a certain order. The approach can therefore be applied to a wide range of project application scenarios without extensive code rewriting: a different project is realized simply by developing and ordering different plug-ins. Retrieval under different conditions does not require restructuring the whole code base, only writing a few plug-ins, so universality is good.
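The plug-in arrangement described above can be sketched as a list of stages that all accept and return the same message type. This is a minimal illustration in Python, not the patented implementation; the names Message, Stage, trim, lowercase and run_pipeline are hypothetical and do not appear in the patent.

```python
from dataclasses import dataclass, replace
from typing import Callable, List


@dataclass(frozen=True)
class Message:
    """Hypothetical uniform format shared by every stage (an assumption)."""
    ident: int
    content: str


# Every plug-in has the same signature, so stages can be added,
# removed or reordered without touching the other stages.
Stage = Callable[[Message], Message]


def trim(msg: Message) -> Message:
    """Example plug-in: strip surrounding whitespace from the content."""
    return replace(msg, content=msg.content.strip())


def lowercase(msg: Message) -> Message:
    """Example plug-in: lower-case the content."""
    return replace(msg, content=msg.content.lower())


def run_pipeline(stages: List[Stage], msg: Message) -> Message:
    """Pass the message through each stage in the configured order."""
    for stage in stages:
        msg = stage(msg)
    return msg


# Two "projects" built from the same plug-ins, differing only in the stage list.
one_stage = run_pipeline([trim], Message(1, "  Safe "))
two_stage = run_pipeline([trim, lowercase], Message(1, "  Safe "))
```

Because each stage consumes and produces a Message, changing a project amounts to editing the list passed to run_pipeline.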
In this embodiment, after the content to be retrieved is received, it is converted into to-be-retrieved information in the preset information format; during the database query, the to-be-retrieved information is parsed to obtain the content to be retrieved, the database is searched to obtain a query result, and the query result is converted into query information in the preset information format; and at output time, the query information is parsed to obtain the query result, which is then sorted and output. Because the information passed between all steps of the retrieval uses the same preset information format, steps can be added or removed to adapt the information retrieval system to different project application scenarios without extensive code rewriting and without changing the system or its framework, which greatly improves universality.
Further, in other embodiments, the preset information format is a triple format, and the information in the preset information format includes an identification code of the information, information content data, and weight data corresponding to the information.
In this embodiment, considering the requirements of practical project applications and of universality, the preset information format used to pass information between links of the retrieval process is a triple (id, info, score), where id is an identification code corresponding to the information, such as a number; info is the main information content and may take various formats; and score is the weight corresponding to the information. Every link, such as retrieval information input, query and output, receives and emits information in this format. Each link may process the contents of info and score as needed, then repackages them into a triple in the same format and passes it to the next link, until the retrieval result is finally output and the retrieval process is complete.
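One possible way to represent the (id, info, score) triple is a small immutable record. The sketch below is an assumption for illustration only: the patent does not prescribe a data type, and the class name Triple and the helper repackage are hypothetical.

```python
from dataclasses import astuple, dataclass, replace
from typing import Any


@dataclass(frozen=True)
class Triple:
    """Hypothetical record for the (id, info, score) format."""
    id: int      # identification code of the information
    info: Any    # main information content; the format may vary by link
    score: Any   # weight data corresponding to the information


def repackage(triple: Triple, new_info: Any, new_score: Any) -> Triple:
    """Model one link: keep the id, update info and score, emit a new triple."""
    return replace(triple, info=new_info, score=new_score)


request = Triple(1, "safe", 0)
answer = repackage(request, "1#4#7", "3#1#2")
```

Keeping id unchanged while info and score are rewritten mirrors the description that each link processes only the contents of info and score before passing the triple on.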
Further, in other embodiments, the query module 02 may further be configured to:
parsing the to-be-retrieved information to obtain the content to be retrieved, and searching a predetermined database according to the parsed content to obtain a query result, wherein the query result comprises the positions at which the content to be retrieved occurs in the predetermined database and its occurrence frequency at each position; and converting the query result into query information in the preset information format, wherein the identification code in the query information is a preset identification code of the content to be retrieved, the information content data in the query information is the position information of the content to be retrieved in the predetermined database, and the weight data in the query information is the occurrence frequency of the content to be retrieved at each of those positions;
the output module 03 may further be configured to:
parsing the query information to obtain the position information of the content to be retrieved in the predetermined database and its occurrence frequency at each position, sorting according to the position information and the occurrence frequency, and outputting the sorted query result.
As an example, suppose it is necessary to retrieve all articles in the database that contain the word "safe". First, the conversion module 01 converts the input into to-be-retrieved information in the preset information format, namely the triple (1, "safe", 0), and passes it to the query module 02. The query module 02 first parses the to-be-retrieved information and determines that the content to be retrieved is "safe". It then finds the numbers of all articles in the database that contain "safe" and, after matching, emits the query information in the preset information format as the triple (1, "1#4#7", "3#1#2"). The second element "1#4#7" indicates that the articles numbered 1, 4 and 7 contain "safe", and the third element "3#1#2" indicates that these three articles contain "safe" three times, once and twice respectively. The output module 03 parses the query information (1, "1#4#7", "3#1#2") to obtain the query result: it first locates the corresponding articles according to "1#4#7", then sorts them according to "3#1#2", and presents article 1, article 7 and article 4 to the user in that order, completing the information retrieval process.
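The "safe" example can be traced end to end with a small sketch. The article texts, the in-memory dictionary standing in for the predetermined database, and the function names convert, query and output are all assumptions made for illustration; only the triple contents and the "#"-separated encoding come from the example above.

```python
from typing import Dict, List, Tuple

Triple = Tuple[int, str, object]

# Hypothetical stand-in for the predetermined database: article number -> text.
ARTICLES: Dict[int, str] = {
    1: "safe harbor, safe code, safe data",
    2: "no match here",
    4: "keep the keys safe",
    7: "a safe bet is a safe choice",
}


def convert(content: str, ident: int = 1) -> Triple:
    """Conversion step: wrap the content to be retrieved in a triple."""
    return (ident, content, 0)


def query(request: Triple, articles: Dict[int, str]) -> Triple:
    """Query step: encode matching article numbers and per-article counts."""
    ident, term, _ = request
    hits = [(num, text.count(term))
            for num, text in sorted(articles.items()) if term in text]
    positions = "#".join(str(num) for num, _ in hits)
    counts = "#".join(str(count) for _, count in hits)
    return (ident, positions, counts)


def output(result: Triple) -> List[int]:
    """Output step: decode the triple and order articles by descending count."""
    _, positions, counts = result
    if not positions:
        return []
    numbers = [int(p) for p in positions.split("#")]
    freqs = [int(c) for c in str(counts).split("#")]
    ranked = sorted(zip(numbers, freqs), key=lambda pair: pair[1], reverse=True)
    return [num for num, _ in ranked]


query_info = query(convert("safe"), ARTICLES)
ordered = output(query_info)
```

With this sample data, query_info is (1, "1#4#7", "3#1#2") and ordered is [1, 7, 4], matching the order in which the articles are presented in the example.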
As shown in fig. 3, another embodiment of the present invention provides an information retrieval system, which further includes, on the basis of the above embodiment:
the processing module 04 is configured to, after receiving original retrieval information sent by a user, process the original retrieval information in a preset natural language processing manner, and extract content to be retrieved from the original retrieval information.
In the embodiment, after original retrieval information sent by a user is received, Chinese natural language processing such as word segmentation, central word extraction, professional vocabulary extraction, matching, recall, knowledge graph, named entity recognition and the like can be performed on the original retrieval information, so that contents to be retrieved in the original retrieval information are effectively extracted, and the retrieval efficiency is improved.
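As a rough stand-in for this preprocessing step, the sketch below tokenizes an English query and drops stop words to isolate the content to be retrieved. It is a deliberate simplification and an assumption: the patent describes Chinese natural language processing (word segmentation, named entity recognition and so on), which this toy does not implement, and the stop-word list and the function name extract_content are hypothetical.

```python
import re
from typing import List

# Hypothetical stop-word list; a real system would use a proper NLP toolkit.
STOP_WORDS = {
    "a", "all", "an", "articles", "containing",
    "find", "me", "please", "that", "the", "word",
}


def extract_content(raw: str) -> List[str]:
    """Split the raw query into lower-case tokens and drop stop words."""
    tokens = re.findall(r"[a-z0-9]+", raw.lower())
    return [token for token in tokens if token not in STOP_WORDS]


keywords = extract_content('Please find all articles containing the word "safe"')
```

The extracted keywords would then be handed to the conversion step as the content to be retrieved.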
Further, in other embodiments, the output module 03 may further be configured to:
parsing the query information to obtain the query result and the user identity data; acquiring a historical retrieval record of the user according to the user identity data; and matching the query result against the historical retrieval record of the user, and sorting and outputting the query results that match the historical retrieval record.
In this embodiment, the triple information may further include user identity data such as an identification number, a job number, a mobile phone number, and the like, and after the query result and the user identity data are obtained through analysis according to the query information, a historical search record of the user, such as a local historical search record of the user or a historical search record on a plurality of associated servers, browsers, and the internet, may be obtained according to the user identity data. And matching the query result obtained by analysis with the historical retrieval record of the user, and sequencing and outputting the query result matched with the historical retrieval record of the user, so that personalized retrieval contents are recommended to the user according to the personal information and the use history of the user, and the user experience is improved.
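The history-based matching can be sketched as follows. The matching rule (a result matches if its title contains any term from the user's history), the decision to output only matching results, the sample data and the names HISTORY and personalize are assumptions for illustration; the patent does not specify how matching is performed.

```python
from typing import Dict, List, Tuple

# Hypothetical store of historical retrieval records keyed by user identity data.
HISTORY: Dict[str, List[str]] = {
    "user-42": ["insurance", "safe driving"],
}


def personalize(results: List[Tuple[str, float]],
                user_id: str,
                history: Dict[str, List[str]]) -> List[str]:
    """Keep results that match the user's past searches, highest score first."""
    past_terms = history.get(user_id, [])
    matched = [(title, score) for title, score in results
               if any(term in title.lower() for term in past_terms)]
    matched.sort(key=lambda item: item[1], reverse=True)
    return [title for title, _ in matched]


results = [
    ("Safe driving tips", 2.0),
    ("Cooking basics", 5.0),
    ("Insurance claims guide", 3.0),
]
recommended = personalize(results, "user-42", HISTORY)
```

A user with no recorded history receives an empty list under this rule; a real system might fall back to the unfiltered ranking instead.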
The invention further provides an information retrieval method.
Referring to fig. 4, fig. 4 is a flowchart illustrating an information retrieval method according to an embodiment of the present invention.
In one embodiment, the information retrieval method includes:
Step S10, after receiving the content to be retrieved, converting the content to be retrieved into to-be-retrieved information in a preset information format;
The input content of a user is received and analyzed to obtain the content to be retrieved. For example, input content (including but not limited to voice, text and pictures) sent by the user through a browser, an APP client or the like is received, and the content to be retrieved, such as retrieval keywords or phrases, is extracted from it. The content to be retrieved is then converted into to-be-retrieved information in the preset information format, which is output to the next link of the retrieval. The preset information format may be any predetermined format that facilitates transmission and parsing of the information stream, and it defines the types of information carried: the main content of the information, optionally an identification code, and optionally user information. For example, the to-be-retrieved information may include the content to be retrieved (such as a retrieval keyword or phrase); an identification code corresponding to that content, so that the retrieval result for the content can later be identified by the code; and information about the user who entered the content, so that the retrieval result can later be associated with that user.
Step S20, parsing the information to be retrieved to obtain the content to be retrieved, searching a predetermined database according to the parsed content to obtain a query result, and converting the query result into query information in the preset information format;
During the query, the information to be retrieved in the preset information format is parsed, for example in a predetermined parsing mode, to obtain the content to be retrieved. A predetermined database is then searched according to the parsed content to obtain a query result. The predetermined database may be a local database or a remote database connected through a network, which is not limited here. Once the query result has been obtained, it is converted into query information in the preset information format, and the converted query information is output to the next link of the retrieval process, for example a link that further processes the query result or a link that directly displays and outputs it. The query information has the same format as the information to be retrieved: both use the preset information format.
Step S30, parsing the query information to obtain the query result, and sorting and outputting the query result.
After the query information is received, the query information in the preset information format can be parsed in a predetermined parsing mode to obtain its content, namely the query result. The query result can then be sorted and output so that the user can view the final retrieval result, which completes the retrieval process.
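Steps S10 to S30 can be illustrated with a minimal Python sketch. The `Message` type, the function names, the in-memory dictionary standing in for the predetermined database, and the choice to rank by occurrence count are all assumptions made for illustration; the description above does not prescribe any of them.

```python
from typing import Dict, List, NamedTuple


class Message(NamedTuple):
    """Hypothetical shared format passed between all steps (an assumption)."""
    ident: int       # identification code
    content: object  # content to be retrieved, or the query result
    user: str        # user information


def step_s10(content: str, user: str, ident: int = 1) -> Message:
    """S10: wrap the content to be retrieved in the shared format."""
    return Message(ident, content, user)


def step_s20(request: Message, database: Dict[int, str]) -> Message:
    """S20: parse the request, search the database, re-wrap the result."""
    keyword = str(request.content)
    hits = {doc_id: text.count(keyword)
            for doc_id, text in database.items() if keyword in text}
    return Message(request.ident, hits, request.user)


def step_s30(query_info: Message) -> List[int]:
    """S30: parse the query information and sort the result for output."""
    hits = dict(query_info.content)
    return sorted(hits, key=lambda doc_id: hits[doc_id], reverse=True)


def retrieve(content: str, user: str, database: Dict[int, str]) -> List[int]:
    """Chain the three steps; each one consumes and emits a Message."""
    return step_s30(step_s20(step_s10(content, user), database))


# Toy data, invented for this example only.
DATABASE = {10: "alpha beta", 11: "beta beta gamma", 12: "gamma"}
print(retrieve("beta", "user-001", DATABASE))  # [11, 10]
```

Because S20 returns the same `Message` type that it receives, a further processing step could in principle be placed between S20 and S30 without changing either of them.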
This embodiment only unifies the format in which information is passed between the steps of the retrieval process, such as retrieval input, query, and output. Because every link takes its input and produces its output in the same format, processed data can be passed directly to the next link for further processing without any conversion. If information needs to be processed in different ways, plug-ins with different functions can be developed to provide the required functions and then arranged in a certain order. The method can therefore be applied to a wide range of project scenarios without extensive code rewriting: a different project only requires developing and ordering different plug-ins. For information retrieval under different conditions, the code as a whole does not need to be restructured; writing a few plug-ins is enough, which gives the method good universality.
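The plug-in idea can be sketched as follows, assuming that every plug-in is simply a function that accepts and returns the same message type. The `Message` alias, the two example plug-ins, and `run_pipeline` are hypothetical names introduced here, not part of the described system.

```python
from functools import reduce
from typing import Callable, List, Tuple

# Assumed shared format for this sketch: (identification code, text content).
Message = Tuple[int, str]
Plugin = Callable[[Message], Message]


def strip_plugin(message: Message) -> Message:
    """Hypothetical plug-in: remove surrounding whitespace from the content."""
    ident, text = message
    return (ident, text.strip())


def lowercase_plugin(message: Message) -> Message:
    """Hypothetical plug-in: lower-case the content."""
    ident, text = message
    return (ident, text.lower())


def run_pipeline(plugins: List[Plugin], message: Message) -> Message:
    """Feed the message through the plug-ins in the given order."""
    return reduce(lambda current, plugin: plugin(current), plugins, message)


print(run_pipeline([strip_plugin, lowercase_plugin], (1, "  Safe  ")))  # (1, 'safe')
print(run_pipeline([lowercase_plugin], (1, "  Safe  ")))  # (1, '  safe  ')
```

Adapting the pipeline to another project then amounts to changing the list passed to `run_pipeline`; the plug-ins themselves stay untouched because their input and output formats are identical.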
In this embodiment, after the content to be retrieved is received, it is converted into information to be retrieved in the preset information format. During the database query, the content to be retrieved is parsed out of the information to be retrieved, a query result is obtained by searching the database, and the query result is converted into query information in the preset information format. When the result is output, the query result is parsed out of the query information, then sorted and output. Because the information flow between all steps of the retrieval can use the same preset information format, steps can be added or removed to adapt the information retrieval system to different project scenarios without extensive code rewriting and without changing the system or its framework, which greatly improves universality.
Further, in other embodiments, the preset information format is a triple format, and the information in the preset information format includes an identification code of the information, information content data, and weight data corresponding to the information.
In this embodiment, in view of the requirements of actual projects and of universality, the preset information format used to pass information between the links of the retrieval process is a triple (id, info, score), where id is an identification code corresponding to the information, for example a code; info is the main information content and can take various formats; and score is the weight corresponding to the information. Every link, such as retrieval input, query, and output, receives and emits information in this format. Each link can process the contents of info and score as needed, then packs them back into an information group in this format and passes it on to the next link, until the retrieval result is finally output and the retrieval process is complete.
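One possible way to represent the (id, info, score) triple in code is an immutable record, sketched below in Python. The `Triple` class and the two example links are assumptions for illustration only; the text does not specify how the triple is stored or which operations a link performs.

```python
from dataclasses import dataclass, replace
from typing import Any


@dataclass(frozen=True)
class Triple:
    """Sketch of the (id, info, score) information group."""
    id: int             # identification code of the information
    info: Any           # main information content, in any format
    score: float = 0.0  # weight corresponding to the information


def normalize_link(triple: Triple) -> Triple:
    """Hypothetical link: cleans up info, leaves id and score unchanged."""
    return replace(triple, info=str(triple.info).strip().lower())


def boost_link(triple: Triple, factor: float = 2.0) -> Triple:
    """Hypothetical link: rescales score, leaves id and info unchanged."""
    return replace(triple, score=triple.score * factor)


request = Triple(id=1, info="  Safe ", score=1.5)
result = boost_link(normalize_link(request))
print(result)  # Triple(id=1, info='safe', score=3.0)
```

Each link here receives a triple and returns a new triple with the same id, which mirrors the idea that a link processes info and score and then repackages them in the same format.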
Further, in other embodiments, the step S20 may further include:
parsing the information to be retrieved to obtain the content to be retrieved, and searching a predetermined database according to the parsed content to obtain a query result, wherein the query result comprises the positions at which the content to be retrieved occurs in the predetermined database and the number of occurrences; and converting the query result into query information in the preset information format, wherein the identification code in the query information is a preset identification code of the content to be retrieved, the information content data in the query information is the position information of the content to be retrieved in the predetermined database, and the weight data in the query information is the number of occurrences of the content to be retrieved at each position in the predetermined database;
the step S30 may further include:
parsing the query information to obtain the position information of the content to be retrieved in the predetermined database and the number of occurrences of the content to be retrieved at each position, sorting according to the position information and the number of occurrences, and outputting the sorted query result.
As an example, suppose that all articles in the database containing the word "safe" need to be retrieved. First, the conversion module 01 converts the input into information to be retrieved in the preset information format, namely the triple (1, "safe", 0), and passes it to the search module 02. The search module 02 first parses the information to be retrieved and determines that the content to be retrieved is "safe". It then finds the numbers of all articles in the database that contain "safe". After matching, the query information that the search module 02 outputs in the preset information format is the information group (1, "1#4#7", "3#1#2"), where the second element "1#4#7" indicates that the articles numbered 1, 4, and 7 contain "safe", and the third element "3#1#2" indicates that these three articles contain "safe" three times, once, and twice, respectively. The output module 03 parses the query information (1, "1#4#7", "3#1#2") to obtain the query result: it first locates the corresponding articles according to "1#4#7", then sorts them according to "3#1#2", and presents the first, seventh, and fourth articles to the user in that order, which completes the information retrieval process.
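The "safe" example can be reproduced with a short Python sketch. The function names `search` and `rank`, the toy article texts, and the use of substring counting are assumptions made for illustration; only the "#"-separated encoding of article numbers and counts and the resulting order are taken from the example above.

```python
from typing import Dict, List, Tuple


def search(request: Tuple[int, str, int],
           database: Dict[int, str]) -> Tuple[int, str, str]:
    """Search module sketch: build the (id, info, score) query information."""
    ident, keyword, _ = request
    hits = [(doc_id, text.count(keyword))
            for doc_id, text in sorted(database.items()) if keyword in text]
    info = "#".join(str(doc_id) for doc_id, _ in hits)   # article numbers
    score = "#".join(str(count) for _, count in hits)    # occurrence counts
    return (ident, info, score)


def rank(query_info: Tuple[int, str, str]) -> List[int]:
    """Output module sketch: decode the triple and sort by occurrence count."""
    _, info, score = query_info
    if not info:  # no article matched
        return []
    doc_ids = [int(part) for part in info.split("#")]
    counts = [int(part) for part in score.split("#")]
    ranked = sorted(zip(doc_ids, counts), key=lambda pair: pair[1], reverse=True)
    return [doc_id for doc_id, _ in ranked]


# Toy articles, invented so that the counts match the example in the text.
ARTICLES = {
    1: "safe safe safe",
    2: "nothing relevant",
    4: "stay safe",
    7: "safe and safe",
}

query_info = search((1, "safe", 0), ARTICLES)
print(query_info)        # (1, '1#4#7', '3#1#2')
print(rank(query_info))  # [1, 7, 4]
```

Sorting in descending order of count is one reading of "sorts them according to 3#1#2"; it yields the first, seventh, and fourth articles, which is the order given in the example.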
As shown in fig. 5, another embodiment of the present invention provides an information retrieval method. On the basis of the above embodiment, before step S10 the method further includes:
Step S40, after receiving original retrieval information sent by the user, processing the original retrieval information in a preset natural language processing mode and extracting the content to be retrieved from the original retrieval information.
In this embodiment, after the original retrieval information sent by the user is received, Chinese natural language processing such as word segmentation, central word extraction, professional vocabulary extraction, matching, recall, knowledge graph processing, and named entity recognition can be applied to it, so that the content to be retrieved is extracted effectively and retrieval efficiency is improved.
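As a rough illustration of step S40, the sketch below extracts keywords from a raw request by tokenizing it and discarding filler words. This is a deliberately simplified English-language stand-in, not the Chinese natural language processing named above: the stop-word list, the regular expression, and the function name `extract_content` are all assumptions, and a real system would use a proper word segmenter and entity recognizer.

```python
import re
from typing import List

# Hypothetical stop-word list, chosen only for this example.
STOP_WORDS = frozenset({
    "a", "an", "the", "all", "of", "with", "that", "which",
    "please", "find", "me", "articles", "containing", "contain", "word",
})


def extract_content(raw_request: str) -> List[str]:
    """Return the tokens of the raw request that are not stop words."""
    tokens = re.findall(r"[a-z0-9]+", raw_request.lower())
    return [token for token in tokens if token not in STOP_WORDS]


print(extract_content("Please find all articles containing the word safe"))  # ['safe']
```

The extracted keywords would then be handed to step S10 as the content to be retrieved.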
Further, in other embodiments, the step S30 may further include:
parsing the query information to obtain the query result and user identity data; acquiring the historical retrieval record of the user according to the user identity data; and matching the query result against the historical retrieval record of the user, then sorting and outputting the query results that match the historical retrieval record.
In this embodiment, the triple information may further include user identity data such as an identification number, an employee number, or a mobile phone number. After the query result and the user identity data are parsed out of the query information, the historical retrieval record of the user can be obtained according to the user identity data, for example the user's local historical retrieval record or historical retrieval records on a plurality of associated servers, browsers, and the internet. The parsed query result is then matched against the historical retrieval record of the user, and the query results that match are sorted and output. In this way, personalized retrieval content is recommended to the user according to the user's personal information and usage history, which improves the user experience.
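The history-based matching can be sketched as below. The matching rule (a result matches when its title contains a term the user searched for before), the `history_store` dictionary, and the function name `personalize` are assumptions for illustration; the text does not define how a query result is matched against the historical retrieval record.

```python
from typing import Dict, List, Tuple


def personalize(query_results: List[Tuple[str, int]],
                user_id: str,
                history_store: Dict[str, List[str]]) -> List[str]:
    """Keep results that match the user's history, sorted by weight."""
    history_terms = {term.lower() for term in history_store.get(user_id, [])}
    matched = [(title, weight) for title, weight in query_results
               if any(term in title.lower() for term in history_terms)]
    matched.sort(key=lambda pair: pair[1], reverse=True)
    return [title for title, _ in matched]


# Toy data, invented for this example only.
HISTORY = {"u42": ["insurance", "loan"]}
RESULTS = [
    ("Safe driving tips", 3),
    ("Safe insurance plans", 1),
    ("Safe loan offers", 2),
]

print(personalize(RESULTS, "u42", HISTORY))  # ['Safe loan offers', 'Safe insurance plans']
```

With this rule a user without any history receives an empty list; a real system would more likely fall back to the unfiltered, sorted results in that case.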
In addition, the present invention further provides a computer-readable storage medium storing an information retrieval system. The information retrieval system is executable by at least one processor to cause the at least one processor to perform the steps of the information retrieval method in the foregoing embodiments. The specific implementation of the steps of the method, such as steps S10, S20, and S30, is as described above and is not repeated here.
It should be noted that, in this document, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising an … …" does not exclude the presence of other like elements in a process, method, article, or apparatus that comprises the element.
Through the above description of the embodiments, those skilled in the art will clearly understand that the method of the above embodiments can be implemented by software plus a necessary general hardware platform, and certainly can also be implemented by hardware, but in many cases, the former is a better embodiment. Based on such understanding, the technical solutions of the present invention may be embodied in the form of a software product, which is stored in a storage medium (such as ROM/RAM, magnetic disk, optical disk) and includes instructions for enabling a terminal device (such as a mobile phone, a computer, a server, an air conditioner, or a network device) to execute the method according to the embodiments of the present invention.
The preferred embodiments of the present invention have been described above with reference to the accompanying drawings, and are not to be construed as limiting the scope of the invention. The above-mentioned serial numbers of the embodiments of the present invention are merely for description and do not represent the merits of the embodiments. Additionally, while a logical order is shown in the flow diagrams, in some cases, the steps shown or described may be performed in an order different than here.
Those skilled in the art can implement the invention with various modifications without departing from its scope and spirit; for example, features of one embodiment can be used in another embodiment to yield a further embodiment. Any modification, equivalent replacement, or improvement made within the technical idea of the present invention shall fall within the scope of protection of the present invention.

Claims (6)

1. A search server comprising a memory, a processor, and an information search system stored on the memory and operable on the processor, the information search system when executed by the processor performing the steps of:
A. after receiving the content to be retrieved, converting the content to be retrieved into information to be retrieved in a preset information format; the preset information format is a triple format, and information in the preset information format comprises an identification code of the information, information content data, and weight data corresponding to the information; the information to be retrieved in the preset information format comprises the content to be retrieved corresponding to the information to be retrieved, an identification code corresponding to the content to be retrieved, and user information corresponding to the content to be retrieved, wherein the content to be retrieved comprises keywords or phrases, the identification code corresponding to the content to be retrieved is used for subsequently identifying the retrieval result corresponding to the content to be retrieved, and the user information is used for subsequently associating the retrieval result corresponding to the content to be retrieved with a user;
B. analyzing and acquiring to-be-retrieved content in the to-be-retrieved information according to the to-be-retrieved information, searching and acquiring a query result in a predetermined database according to the analyzed to-be-retrieved content, and converting the query result into query information in the preset information format;
C. analyzing and acquiring a query result according to the query information, and sequencing and outputting the query result;
the step B comprises the following steps:
analyzing and acquiring to-be-retrieved content in the to-be-retrieved information according to the to-be-retrieved information, searching and acquiring a query result in a predetermined database according to the analyzed to-be-retrieved content, wherein the query result comprises the position and the occurrence frequency of the to-be-retrieved content in the predetermined database, and converting the query result into query information in a preset information format; the identification code in the query information is a preset identification code of the content to be retrieved, the information content data in the query information is position information of the content to be retrieved in a predetermined database, and the weight data in the query information is the occurrence frequency of the position of the content to be retrieved in the predetermined database;
the step C comprises the following steps:
analyzing and acquiring the position information of the content to be retrieved in a predetermined database and the occurrence frequency of the position of the content to be retrieved in the predetermined database according to the query information, sequencing according to the position information and the occurrence frequency of the content to be retrieved in the predetermined database, and outputting the sequenced query result.
2. The search server of claim 1, wherein prior to said step a, said processor is further configured to execute said information search system to perform the steps of:
after original retrieval information sent by a user is received, the original retrieval information is processed in a preset natural language processing mode, and contents to be retrieved in the original retrieval information are extracted.
3. The search server according to claim 1, wherein the information in the preset information format includes user identity data; the step C further comprises the following steps:
analyzing and acquiring a query result and user identity data according to the query information;
acquiring a historical retrieval record of the user according to the user identity data;
and matching the query result with the historical retrieval record of the user, and sequencing and outputting the query result matched with the historical retrieval record of the user.
4. An information retrieval method, applied to a retrieval server, characterized by comprising the following steps:
A. after receiving the content to be retrieved, converting the content to be retrieved into information to be retrieved in a preset information format; the preset information format is a triple format, and information in the preset information format comprises an identification code of the information, information content data, and weight data corresponding to the information; the information to be retrieved in the preset information format comprises the content to be retrieved corresponding to the information to be retrieved, an identification code corresponding to the content to be retrieved, and user information corresponding to the content to be retrieved, wherein the content to be retrieved comprises keywords or phrases, the identification code corresponding to the content to be retrieved is used for subsequently identifying the retrieval result corresponding to the content to be retrieved, and the user information is used for subsequently associating the retrieval result corresponding to the content to be retrieved with a user;
B. analyzing and acquiring to-be-retrieved content in the to-be-retrieved information according to the to-be-retrieved information, searching and acquiring a query result in a predetermined database according to the analyzed to-be-retrieved content, and converting the query result into query information in the preset information format;
C. analyzing and acquiring a query result according to the query information, and sequencing and outputting the query result;
the step B comprises the following steps:
analyzing and acquiring to-be-retrieved content in the to-be-retrieved information according to the to-be-retrieved information, searching and acquiring a query result in a predetermined database according to the analyzed to-be-retrieved content, wherein the query result comprises the position and the occurrence frequency of the to-be-retrieved content in the predetermined database, and converting the query result into query information in a preset information format; the identification code in the query information is a preset identification code of the content to be retrieved, the information content data in the query information is position information of the content to be retrieved in a predetermined database, and the weight data in the query information is the occurrence frequency of the position of the content to be retrieved in the predetermined database;
the step C comprises the following steps:
analyzing and acquiring the position information of the content to be retrieved in a predetermined database and the occurrence frequency of the position of the content to be retrieved in the predetermined database according to the query information, sequencing according to the position information and the occurrence frequency of the content to be retrieved in the predetermined database, and outputting the sequenced query result.
5. The information retrieval method according to claim 4, further comprising, before the step a:
after original retrieval information sent by a user is received, the original retrieval information is processed in a preset natural language processing mode, and contents to be retrieved in the original retrieval information are extracted.
6. A computer-readable storage medium storing an information retrieval system executable by at least one processor to cause the at least one processor to perform the steps of the information retrieval method of any one of claims 4-5.
CN201710490375.4A 2017-06-25 2017-06-25 Information retrieval method, server and readable storage medium Active CN107679055B (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201710490375.4A CN107679055B (en) 2017-06-25 2017-06-25 Information retrieval method, server and readable storage medium
PCT/CN2017/105016 WO2019000697A1 (en) 2017-06-25 2017-09-30 Information search method, system, server, and readable storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710490375.4A CN107679055B (en) 2017-06-25 2017-06-25 Information retrieval method, server and readable storage medium

Publications (2)

Publication Number Publication Date
CN107679055A CN107679055A (en) 2018-02-09
CN107679055B true CN107679055B (en) 2021-04-27

Family

ID=61134005

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710490375.4A Active CN107679055B (en) 2017-06-25 2017-06-25 Information retrieval method, server and readable storage medium

Country Status (2)

Country Link
CN (1) CN107679055B (en)
WO (1) WO2019000697A1 (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108846066B (en) * 2018-06-06 2020-01-24 上海计算机软件技术开发中心 Visual data analysis method and system
CN109325093A (en) * 2018-08-24 2019-02-12 深圳职业技术学院 Bibliography automatic generation method, device and computer-readable storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1933547A (en) * 2006-06-13 2007-03-21 北京影立驰技术有限公司 System for making global search in wired TV one-way set-top box
CN102270232A (en) * 2011-07-21 2011-12-07 华中科技大学 Semantic data query system with optimized storage
CN102693310A (en) * 2012-05-28 2012-09-26 无锡成电科大科技发展有限公司 Resource description framework querying method and system based on relational database
CN103258029A (en) * 2013-05-08 2013-08-21 徐峰蕾 Method and system for retrieving information
CN104794247A (en) * 2015-05-14 2015-07-22 东南大学 Integrated query method for multi-structure database

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102760143A (en) * 2011-04-28 2012-10-31 国际商业机器公司 Method and device for dynamically integrating executing structures in database system
US9406020B2 (en) * 2012-04-02 2016-08-02 Taiger Spain Sl System and method for natural language querying
IN2013CH01237A (en) * 2013-03-21 2015-08-14 Infosys Ltd
CN104866498A (en) * 2014-02-24 2015-08-26 华为技术有限公司 Information processing method and device
CN105069171B (en) * 2015-08-31 2018-07-13 百度在线网络技术(北京)有限公司 Chinese character inquiry method and system

Also Published As

Publication number Publication date
CN107679055A (en) 2018-02-09
WO2019000697A1 (en) 2019-01-03

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant