WO2019000697A1 - Information search method, system, server, and readable storage medium - Google Patents

Information search method, system, server, and readable storage medium Download PDF

Info

Publication number
WO2019000697A1
WO2019000697A1 PCT/CN2017/105016 CN2017105016W WO2019000697A1 WO 2019000697 A1 WO2019000697 A1 WO 2019000697A1 CN 2017105016 W CN2017105016 W CN 2017105016W WO 2019000697 A1 WO2019000697 A1 WO 2019000697A1
Authority
WO
WIPO (PCT)
Prior art keywords
information
retrieved
content
query
query result
Prior art date
Application number
PCT/CN2017/105016
Other languages
French (fr)
Chinese (zh)
Inventor
韦邕
赵清源
吕梓燊
徐亮
肖京
Original Assignee
平安科技(深圳)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 平安科技(深圳)有限公司 filed Critical 平安科技(深圳)有限公司
Publication of WO2019000697A1 publication Critical patent/WO2019000697A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2455Query execution
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2452Query translation
    • G06F16/24528Standardisation; Simplification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/248Presentation of query results

Definitions

  • the present invention relates to the field of computer technologies, and in particular, to an information retrieval method, system, server, and readable storage medium.
  • the main object of the present invention is to provide an information retrieval method, system, server and readable storage medium, aiming at improving the versatility of the information retrieval system.
  • a first aspect of the present application provides a retrieval server, where the retrieval server includes a storage device and a processing device, and the storage device stores an information retrieval system operable on the processing device, the information The following steps are implemented when the retrieval system is executed by the processing device:
  • the second aspect of the present application provides an information retrieval method, which is applied to a retrieval server, and the method includes:
  • the content to be retrieved in the to-be-searched information is obtained according to the information to be retrieved, and the search result is obtained in a predetermined database according to the parsed content to be retrieved, and the query result is obtained.
  • the third aspect of the present application provides an information retrieval system, where the information retrieval system includes:
  • a conversion module configured to convert the content to be retrieved into information to be retrieved in a preset information format after receiving the content to be retrieved;
  • a query module configured to obtain, according to the information to be retrieved, the content to be retrieved in the to-be-searched information, perform a search in a predetermined database according to the parsed content to be retrieved, and obtain the query result, and convert the query result Query information for the preset information format;
  • an output module configured to parse the query result according to the query information, and sort and output the query result.
  • a fourth aspect of the present application provides a computer readable storage medium having stored thereon at least one computer readable instruction executable by a processing device to:
  • the information retrieval method, system, server and readable storage medium convert the content to be retrieved into the information to be retrieved in a preset information format after receiving the content to be retrieved; when the database query is performed, according to the preset information format
  • the information to be retrieved is parsed to obtain the content to be retrieved, and the query is obtained in the database to obtain the query result, and the query result is converted into the query information of the preset information format; when the result is output, the query is obtained by parsing the query information.
  • the result is to sort and output the search results.
  • FIG. 1 is a schematic diagram of an operating environment of a preferred embodiment of an information retrieval system 10 of the present invention
  • FIG. 2 is a schematic diagram of functional modules of an embodiment of an information retrieval system according to the present invention.
  • FIG. 3 is a schematic diagram of functional modules of another embodiment of an information retrieval system according to the present invention.
  • FIG. 4 is a schematic flow chart of an embodiment of an information retrieval method according to the present invention.
  • FIG. 5 is a schematic flow chart of another embodiment of an information retrieval method according to the present invention.
  • FIG. 1 is a schematic diagram of an operating environment of a preferred embodiment of the information retrieval system 10 of the present invention.
  • the information retrieval system 10 is installed and operated in the retrieval server 1.
  • the retrieval server 1 may include, but is not limited to, a storage device 11, a processing device 12.
  • Figure 1 shows only the retrieval server 1 with components 11, 12, but it should be understood that not all illustrated components may be implemented, and more or fewer components may be implemented instead.
  • the storage device 11 may in some embodiments be an internal storage unit of the retrieval server 1, such as a hard disk or memory of the retrieval server 1.
  • the storage device 11 may also be an external storage device of the retrieval server 1 in other embodiments, such as a plug-in hard disk equipped on the retrieval server 1, a smart memory card (SMC), and security. Digital (Secure Digital, SD) card, flash card (Flash Card), etc.
  • the storage device 11 may also include both an internal storage unit of the retrieval server 1 and an external storage device.
  • the storage device 11 is configured to store application software and various types of data installed in the search server 1, such as program codes of the information retrieval system 10, and the like.
  • the storage device 11 can also be used to temporarily store data that has been output or is about to be output.
  • the processing device 12 may be a central processing unit (CPU), a microprocessor or other data processing chip for running program code or processing data stored in the storage device 11, in some embodiments, For example, the information retrieval system 10 and the like are executed.
  • CPU central processing unit
  • microprocessor or other data processing chip for running program code or processing data stored in the storage device 11, in some embodiments, For example, the information retrieval system 10 and the like are executed.
  • the search server 1 may further include a display, which in some embodiments may be an LED display, a liquid crystal display, a touch liquid crystal display, and an OLED (Organic Light-Emitting Diode). ) Toucher, etc.
  • the display is for displaying information processed in the search server 1 and a user interface for displaying visualization, such as an interface for the user to input the content to be retrieved, an interface for displaying the query result after the retrieval is completed, and the like.
  • the storage device 11, the processing device 12 of the retrieval server 1 and the display communicate with each other through a system bus.
  • FIG. 2 is a functional block diagram of an embodiment of the information retrieval system 10 of the present invention.
  • the information retrieval system 10 may be divided into one or more modules, the one or more modules being stored in the storage device 11 and being processed by one or more processing devices (this Embodiments are performed by the processing device 12) to complete the present invention.
  • the information retrieval system 10 can be divided into a conversion module 01, a query module 02, and an output module 03.
  • a module referred to in the present invention refers to a series of computer program instruction segments capable of performing a specific function, and is more suitable than the program to describe the execution process of the information retrieval system 10 in the retrieval server 1. The following description will specifically describe the functions of the conversion module 01, the query module 02, and the output module 03.
  • the conversion module 01 is configured to convert the content to be retrieved into the information to be retrieved in a preset information format after receiving the content to be retrieved.
  • Receiving input content of the user, and parsing the input content to obtain content to be retrieved such as receiving input content (including but not limited to voice, text, picture, etc.) sent by the user through a browser, an APP client, etc., and parsing the input
  • the content to be retrieved in the content is such as a search keyword, a phrase, and the like.
  • the content to be retrieved is converted into the information to be retrieved in a preset information format, and the information to be retrieved in the converted preset information format is output to the next step of the search.
  • the preset information format may be a pre-determined format for facilitating information flow transmission and parsing, and may define a type of information included in the preset information format.
  • the information of the preset information format may include main content of the information. It may also contain an identification code and may even include user information and the like.
  • the information to be retrieved in the preset information format may include the main to-be-searched content corresponding to the information to be retrieved, such as a search keyword, a phrase, and the like, and may further include an identifier corresponding to the main content to be retrieved, so as to be subsequently
  • the identification code identifies the search result corresponding to the main content to be retrieved
  • the information to be retrieved in the preset information format may further include user information corresponding to the main content to be retrieved, that is, input user information of the main content to be retrieved, so as to be followed.
  • the search result corresponding to the main content to be retrieved is associated with the user.
  • the query module 02 is configured to obtain the content to be retrieved in the to-be-searched information according to the to-be-searched information, and perform a search in a predetermined database according to the parsed content to be retrieved, and obtain the query result, and the query result is obtained.
  • the query information is converted into the preset information format.
  • the information to be retrieved in the preset information format is first parsed, and the content to be retrieved in the to-be-searched information may be parsed according to a predetermined parsing manner, and then the content to be retrieved is determined according to the parsed content.
  • the query is performed in the database to obtain the query result.
  • the predetermined database may be a local database or a remote database connected through a network, which is not limited herein.
  • the query result is converted into the query information of the preset information format, and the information to be queried in the converted preset information format is output to the next searched Links, such as the process of further processing the query results or directly displaying the results of the output of the query results.
  • the format of the query information is the same as the format of the information to be retrieved, and all are predetermined information formats.
  • the output module 03 is configured to parse the query result according to the query information, and sort and output the query result.
  • the query information of the preset information format may be parsed according to a predetermined parsing manner, and the content of the query information, that is, the query result, may be obtained, and the query result may be sorted and outputted for the user. View the final search query results and complete the search process.
  • the embodiment After receiving the content to be retrieved, the embodiment converts the content to be retrieved into the information to be retrieved in a preset information format; when performing the database query, the content to be retrieved is parsed according to the information to be retrieved in the preset information format, and is performed in the database.
  • the search result is obtained, and the query result is converted into the query information of the preset information format; when the result is output, the query result is obtained by parsing the query information to perform sorting and outputting of the search result.
  • the preset information format is a triplet format, and the information of the preset information format includes an identification code of the information, information content data, and weight data corresponding to the information.
  • the preset information format for information transfer between each link in the information retrieval process is set to a form of a triplet (id, info, score) in consideration of the actual project application and versatility requirements, wherein id
  • info is the main information content, which can be in multiple formats
  • score is the weight corresponding to the information.
  • Each link such as the input and transmission of information in the search information input, query, output, etc., adopts this format.
  • the two parts of info and score can be processed correspondingly, and then packaged into this.
  • the formatted information group passes the link and enters the next link until the final output of the search query result completes the search process.
  • the foregoing query module 02 can also be used to:
  • the identifier code in the query information is a preset identifier of the content to be retrieved a code
  • the information content data in the query information is location information of the content to be retrieved in a predetermined database
  • the weight data in the query information is that the content to be retrieved is in a predetermined database.
  • the above output module 03 can also be used to:
  • the conversion module 01 converts the input into a preset information format, ie, a triple (1, "flat”
  • the information to be retrieved by An 0
  • the search module 02 the content to be retrieved in the information to be retrieved in the preset information format is first parsed as "Ping”, and then the search module 02, all the article numbers containing "Peace" are found in the database.
  • the query information of the preset information format transmitted by the search module 02 is the information group (1, “1#4#7", “3#1 #2"), where the second part “1#4#7” indicates that the articles numbered 1, 4, and 7 contain “safe”, and "3#1#2” indicates that the three articles contain three, respectively.
  • the output module 03 parses the query result according to the query information of the preset information format (1, “1#4#7”, “3#1#2”), first according to “1#4#” 7" find the corresponding article, and then sort according to "3#1#2", and then present the first article, the seventh article and the fourth article to the user, and complete the process of information retrieval.
  • another embodiment of the present invention provides an information retrieval system. Based on the foregoing embodiments, the method further includes:
  • the processing module 04 is configured to process the original search information by using a preset natural language processing manner after receiving the original search information sent by the user, and extract the content to be retrieved in the original search information.
  • the original search information after receiving the original search information sent by the user, the original search information may be subjected to Chinese natural language processing such as word cutting, central word extraction, professional vocabulary extraction, matching, recall, knowledge map, and named entity recognition.
  • Chinese natural language processing such as word cutting, central word extraction, professional vocabulary extraction, matching, recall, knowledge map, and named entity recognition.
  • the output module 03 may be further configured to:
  • the history of the search records matches the results of the query.
  • the user identity data such as the ID number, the job number, the mobile phone number, and the like may also be included in the transmission of the triplet information.
  • the The user identity data obtains the historical search record of the user, such as the local history retrieval record of the user or the historical retrieval record on multiple associated servers, browsers, and the Internet. Matching the parsed result with the historical search record of the user, and sorting and outputting the query result that matches the historical search record of the user, thereby implementing personalized recommendation according to the user's personal information and usage history. retrieve content to enhance the user experience.
  • the invention further provides an information retrieval method.
  • FIG. 4 is a schematic flowchart of an embodiment of an information retrieval method according to an embodiment of the present invention.
  • the information retrieval method comprises:
  • Step S10 After receiving the content to be retrieved, converting the content to be retrieved into information to be retrieved in a preset information format;
  • Receiving input content of the user, and parsing the input content to obtain content to be retrieved such as receiving input content (including but not limited to voice and text) sent by the user through a browser, an APP client, or the like Words, pictures, etc.), and parsing the content to be retrieved in the input content such as search keywords, phrases, and the like.
  • the content to be retrieved is converted into the information to be retrieved in a preset information format, and the information to be retrieved in the converted preset information format is output to the next step of the search.
  • the preset information format may be a pre-determined format for facilitating information flow transmission and parsing, and may define a type of information included in the preset information format.
  • the information of the preset information format may include main content of the information. It may also contain an identification code and may even include user information and the like.
  • the information to be retrieved in the preset information format may include the main to-be-searched content corresponding to the information to be retrieved, such as a search keyword, a phrase, and the like, and may further include an identifier corresponding to the main content to be retrieved, so as to be subsequently
  • the identification code identifies the search result corresponding to the main content to be retrieved
  • the information to be retrieved in the preset information format may further include user information corresponding to the main content to be retrieved, that is, input user information of the main content to be retrieved, so as to be followed.
  • the search result corresponding to the main content to be retrieved is associated with the user.
  • Step S20 parsing and acquiring the to-be-searched content in the to-be-retrieved information according to the to-be-searched information, performing a search in a predetermined database according to the parsed content to be retrieved, and obtaining the query result, and converting the query result into a Query information of a preset information format;
  • the information to be retrieved in the preset information format is first parsed, and the content to be retrieved in the to-be-searched information may be parsed according to a predetermined parsing manner, and then the content to be retrieved is determined according to the parsed content.
  • the query is performed in the database to obtain the query result.
  • the predetermined database may be a local database or a remote database connected through a network, which is not limited herein.
  • the query result is converted into the query information of the preset information format, and the information to be queried in the converted preset information format is output to the next searched Links, such as the process of further processing the query results or directly displaying the results of the output of the query results.
  • the format of the query information is the same as the format of the information to be retrieved, and all are predetermined information formats.
  • Step S30 Acquire a query result according to the query information, and sort and output the query result.
  • the query information of the preset information format may be parsed according to a predetermined parsing manner, and the content of the query information, that is, the query result, may be obtained, and the query result may be sorted and outputted for the user. View the final search query results and complete the search process.
  • the embodiment After receiving the content to be retrieved, the embodiment converts the content to be retrieved into a preset information format. Retrieving information; performing a database query, parsing the content to be retrieved according to the information to be retrieved in a preset information format, performing a search in the database to obtain a query result, and converting the query result into the query information of the preset information format; When the result is output, the query result is obtained by parsing the query information to sort and output the search result.
  • the preset information format is a triplet format, and the information of the preset information format includes an identification code of the information, information content data, and weight data corresponding to the information.
  • the preset information format for information transfer between each link in the information retrieval process is set to a form of a triplet (id, info, score) in consideration of the actual project application and versatility requirements, wherein id
  • info is the main information content, which can be in multiple formats
  • score is the weight corresponding to the information.
  • Each link such as the input and transmission of information in the search information input, query, output, etc., adopts this format.
  • the two parts of info and score can be processed correspondingly, and then packaged into this.
  • the formatted information group passes the link and enters the next link until the final output of the search query result completes the search process.
  • step S20 may further include:
  • the identifier code in the query information is a preset identifier of the content to be retrieved a code
  • the information content data in the query information is location information of the content to be retrieved in a predetermined database
  • the weight data in the query information is that the content to be retrieved is in a predetermined database.
  • step S30 may further include:
  • the conversion module 01 converts the input into a preset information format, that is, a triplet (1, "Ping", 0) to be retrieved information, and then passes the lookup module 02 for searching.
  • the search module 02 the content to be retrieved in the information to be retrieved in the preset information format is first parsed as “Ping”, and then the search module 02 finds all the article numbers containing “Peace” in the database, and by matching,
  • the query information of the preset information format sent by the search module 02 is an information group (1, "1#4#7", "3#1#2"),
  • the second part "1#4#7” indicates that the articles numbered 1, 4, and 7 contain "safe", while the "3#1#2" indicates that the three articles contain three, one, and two respectively.
  • the output module 03 parses the query result according to the query information of the preset information format (1, "1#4#7", "3#1#2"), first finds the corresponding article according to "1#4#7", and then according to "3#1#2" is sorted, and the user first presents the first article, the seventh article and the fourth article, and completes the process of information retrieval.
  • another embodiment of the present invention provides an information retrieval method.
  • the method further includes:
  • Step S40 After receiving the original search information sent by the user, the original search information is processed by using a preset natural language processing manner, and the content to be retrieved in the original search information is extracted.
  • the original search information after receiving the original search information sent by the user, the original search information may be subjected to Chinese natural language processing such as word cutting, central word extraction, professional vocabulary extraction, matching, recall, knowledge map, and named entity recognition.
  • Chinese natural language processing such as word cutting, central word extraction, professional vocabulary extraction, matching, recall, knowledge map, and named entity recognition.
  • step S30 may further include:
  • the history of the search records matches the results of the query.
  • the user identity data such as the ID number, the job number, the mobile phone number, and the like may also be included in the transmission of the triplet information.
  • the The user identity data obtains the historical search record of the user, such as the local history retrieval record of the user or the historical retrieval record on multiple associated servers, browsers, and the Internet. Matching the parsed result with the historical search record of the user, and sorting and outputting the query result that matches the historical search record of the user, thereby implementing personalized recommendation according to the user's personal information and usage history. retrieve content to enhance the user experience.
  • the present invention also provides a computer readable storage medium storing an information retrieval system, the information retrieval system being executable by at least one processing device to cause the at least one processing device to perform
  • the steps of the information retrieval method in the above embodiment, the specific implementation processes of the steps S10, S20, and S30 of the information retrieval method are as described above, and are not described herein again.
  • the foregoing embodiment method can be implemented by means of software plus a necessary general hardware platform, and can also be implemented by hardware, but in many cases, the former is A better implementation.
  • the technical solution of the present invention which is essential or contributes to the prior art, may be embodied in the form of a software product stored in a storage medium (such as ROM/RAM, disk,
  • the optical disc includes a number of instructions for causing a terminal device (which may be a cell phone, a computer, a server, an air conditioner, or a network device, etc.) to perform the methods described in various embodiments of the present invention.

Abstract

An information search method, system, server, and readable storage medium. The method comprises: upon receiving search content, converting the search content into search information in a predetermined information format (S10); performing, according to the search information, parsing to obtain the search content in the search information, performing, according to the search content obtained by parsing, a query in a predetermined database to obtain a query result, and converting the query result into query information in the predetermined information format (S20); and performing, according to the query information, parsing to obtain a query result, and sorting and outputting the query result (S30). The method improves universality.

Description

信息检索方法、系统、服务器及可读存储介质Information retrieval method, system, server and readable storage medium
优先权申明Priority claim
本申请基于巴黎公约申明享有2017年6月25日递交的申请号为CN 201710490375.4、名称为“信息检索方法、服务器及可读存储介质”中国专利申请的优先权,该中国专利申请的整体内容以参考的方式结合在本申请中。This application is based on the priority of the Chinese Patent Application entitled "Information Retrieval Method, Server and Readable Storage Media", filed on June 25, 2017, with the application number of CN 201710490375.4, which is filed on June 25, 2017. The manner of reference is incorporated in the present application.
技术领域Technical field
本发明涉及计算机技术领域,尤其涉及一种信息检索方法、系统、服务器及可读存储介质。The present invention relates to the field of computer technologies, and in particular, to an information retrieval method, system, server, and readable storage medium.
背景技术Background technique
面对大量信息,检索成为必不可少的关键技术。现有的信息检索系统中,信息输入、解析、匹配、召回、排序、输出等步骤之间信息传递的格式没有进行统一规定,即各个步骤之间传输的信息格式可能各不相同。这样,在需要增加或较少若干步骤以使信息检索系统适用于各种不同的项目应用场景时,需要做大量的代码改写,甚至还需改变信息检索系统及框架,通用性很差。In the face of a large amount of information, retrieval has become an essential key technology. In the existing information retrieval system, the format of information transmission between steps of information input, analysis, matching, recall, sorting, and output is not uniformly defined, that is, the format of information transmitted between steps may be different. In this way, when it is necessary to add or less steps to make the information retrieval system suitable for various project application scenarios, a large amount of code rewriting is required, and even the information retrieval system and framework need to be changed, and the versatility is poor.
发明内容Summary of the invention
本发明的主要目的在于提供一种信息检索方法、系统、服务器及可读存储介质,旨在提高信息检索系统的通用性。The main object of the present invention is to provide an information retrieval method, system, server and readable storage medium, aiming at improving the versatility of the information retrieval system.
为实现上述目的,本申请第一方面提供一种检索服务器,所述检索服务器包括存储设备、处理设备,所述存储设备上存储有可在所述处理设备上运行的信息检索系统,所述信息检索系统被所述处理设备执行时实现如下步骤:In order to achieve the above object, a first aspect of the present application provides a retrieval server, where the retrieval server includes a storage device and a processing device, and the storage device stores an information retrieval system operable on the processing device, the information The following steps are implemented when the retrieval system is executed by the processing device:
A、在收到待检索内容后,将所述待检索内容转化为预设信息格式的待检索信息;A. After receiving the content to be retrieved, converting the to-be-searched content into the information to be retrieved in a preset information format;
B、根据所述待检索信息解析获取所述待检索信息中的待检索内容,根据解析出的待检索内容在预先确定的数据库中进行查找获取查询结果,并将所述查询结果转化为所述预设信息格式的查询信息;B. Acquire, according to the information to be retrieved, the content to be retrieved in the to-be-searched information, perform a search in a predetermined database according to the parsed content to be retrieved, and obtain the query result, and convert the query result into the Query information of a preset information format;
C、根据所述查询信息解析获取查询结果,并对所述查询结果进行排序及输出。C. Acquire a query result according to the query information, and sort and output the query result.
本申请第二方面提供一种信息检索方法,应用于检索服务器,所述方法包括:The second aspect of the present application provides an information retrieval method, which is applied to a retrieval server, and the method includes:
A、在收到待检索内容后,将所述待检索内容转化为预设信息格式的待检索信息;A. After receiving the content to be retrieved, converting the to-be-searched content into the information to be retrieved in a preset information format;
B、根据所述待检索信息解析获取所述待检索信息中的待检索内容,根据解析出的待检索内容在预先确定的数据库中进行查找获取查询结果,并将所 述查询结果转化为所述预设信息格式的查询信息;B. The content to be retrieved in the to-be-searched information is obtained according to the information to be retrieved, and the search result is obtained in a predetermined database according to the parsed content to be retrieved, and the query result is obtained. Translating the query result into query information of the preset information format;
C、根据所述查询信息解析获取查询结果,并对所述查询结果进行排序及输出。C. Acquire a query result according to the query information, and sort and output the query result.
本申请第三方面提供一种信息检索系统,所述信息检索系统包括:The third aspect of the present application provides an information retrieval system, where the information retrieval system includes:
转化模块,用于在收到待检索内容后,将所述待检索内容转化为预设信息格式的待检索信息;a conversion module, configured to convert the content to be retrieved into information to be retrieved in a preset information format after receiving the content to be retrieved;
查询模块,用于根据所述待检索信息解析获取所述待检索信息中的待检索内容,根据解析出的待检索内容在预先确定的数据库中进行查找获取查询结果,并将所述查询结果转化为所述预设信息格式的查询信息;a query module, configured to obtain, according to the information to be retrieved, the content to be retrieved in the to-be-searched information, perform a search in a predetermined database according to the parsed content to be retrieved, and obtain the query result, and convert the query result Query information for the preset information format;
输出模块,用于根据所述查询信息解析获取查询结果,并对所述查询结果进行排序及输出。And an output module, configured to parse the query result according to the query information, and sort and output the query result.
本申请第四方面提供一种计算机可读存储介质,其上存储有至少一个可被处理设备执行以实现以下操作的计算机可读指令:A fourth aspect of the present application provides a computer readable storage medium having stored thereon at least one computer readable instruction executable by a processing device to:
A、在收到待检索内容后,将所述待检索内容转化为预设信息格式的待检索信息;A. After receiving the content to be retrieved, converting the to-be-searched content into the information to be retrieved in a preset information format;
B、根据所述待检索信息解析获取所述待检索信息中的待检索内容,根据解析出的待检索内容在预先确定的数据库中进行查找获取查询结果,并将所述查询结果转化为所述预设信息格式的查询信息;B. Acquire, according to the information to be retrieved, the content to be retrieved in the to-be-searched information, perform a search in a predetermined database according to the parsed content to be retrieved, and obtain the query result, and convert the query result into the Query information of a preset information format;
C、根据所述查询信息解析获取查询结果,并对所述查询结果进行排序及输出。C. Acquire a query result according to the query information, and sort and output the query result.
本发明提出的信息检索方法、系统、服务器及可读存储介质,在收到待检索内容后,将待检索内容转化为预设信息格式的待检索信息;进行数据库查询时,根据预设信息格式的待检索信息解析获取待检索内容,在数据库中进行查找获取查询结果,并将所述查询结果转化为所述预设信息格式的查询信息;在输出结果时,通过解析所述查询信息获取查询结果来进行检索结果的排序及输出。由于能将信息检索的各个步骤之间信息流的传输格式统一为相同的预设信息格式,这样,在需要增加或较少若干步骤以使信息检索系统适用于各种不同的项目应用场景时,不需要做过多代码改写,系统及框架不需要做任何改变,大大提升了通用性。The information retrieval method, system, server and readable storage medium provided by the invention convert the content to be retrieved into the information to be retrieved in a preset information format after receiving the content to be retrieved; when the database query is performed, according to the preset information format The information to be retrieved is parsed to obtain the content to be retrieved, and the query is obtained in the database to obtain the query result, and the query result is converted into the query information of the preset information format; when the result is output, the query is obtained by parsing the query information. The result is to sort and output the search results. Since the transmission format of the information flow between the steps of the information retrieval can be unified into the same preset information format, when more or less steps are needed to make the information retrieval system suitable for various project application scenarios, No need to do too much code rewriting, the system and framework do not need to make any changes, greatly improving the versatility.
附图说明DRAWINGS
图1为本发明信息检索系统10较佳实施例的运行环境示意图;1 is a schematic diagram of an operating environment of a preferred embodiment of an information retrieval system 10 of the present invention;
图2为本发明信息检索系统一实施例的功能模块示意图;2 is a schematic diagram of functional modules of an embodiment of an information retrieval system according to the present invention;
图3为本发明信息检索系统另一实施例的功能模块示意图;3 is a schematic diagram of functional modules of another embodiment of an information retrieval system according to the present invention;
图4为本发明信息检索方法一实施例的流程示意图;4 is a schematic flow chart of an embodiment of an information retrieval method according to the present invention;
图5为本发明信息检索方法另一实施例的流程示意图。FIG. 5 is a schematic flow chart of another embodiment of an information retrieval method according to the present invention.
本发明目的的实现、功能特点及优点将结合实施例,参照附图做进一步 说明。The implementation, functional features and advantages of the object of the present invention will be further described with reference to the accompanying drawings. Description.
具体实施方式Detailed ways
为了使本发明所要解决的技术问题、技术方案及有益效果更加清楚、明白,以下结合附图和实施例,对本发明进行进一步详细说明。应当理解,此处所描述的具体实施例仅仅用以解释本发明,并不用于限定本发明。The present invention will be further described in detail below with reference to the accompanying drawings and embodiments, in order to make the present invention. It is understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.
本发明提供一种信息检索系统。请参阅图1,是本发明信息检索系统10较佳实施例的运行环境示意图。The present invention provides an information retrieval system. Please refer to FIG. 1, which is a schematic diagram of an operating environment of a preferred embodiment of the information retrieval system 10 of the present invention.
在本实施例中,所述的信息检索系统10安装并运行于检索服务器1中。该检索服务器1可包括,但不仅限于,存储设备11、处理设备12。图1仅示出了具有组件11、12的检索服务器1,但是应理解的是,并不要求实施所有示出的组件,可以替代的实施更多或者更少的组件。In the present embodiment, the information retrieval system 10 is installed and operated in the retrieval server 1. The retrieval server 1 may include, but is not limited to, a storage device 11, a processing device 12. Figure 1 shows only the retrieval server 1 with components 11, 12, but it should be understood that not all illustrated components may be implemented, and more or fewer components may be implemented instead.
所述存储设备11在一些实施例中可以是所述检索服务器1的内部存储单元,例如该检索服务器1的硬盘或内存。所述存储设备11在另一些实施例中也可以是所述检索服务器1的外部存储设备,例如所述检索服务器1上配备的插接式硬盘,智能存储卡(Smart Media Card,SMC),安全数字(Secure Digital,SD)卡,闪存卡(Flash Card)等。进一步地,所述存储设备11还可以既包括所述检索服务器1的内部存储单元也包括外部存储设备。所述存储设备11用于存储安装于所述检索服务器1的应用软件及各类数据,例如所述信息检索系统10的程序代码等。所述存储设备11还可以用于暂时地存储已经输出或者将要输出的数据。The storage device 11 may in some embodiments be an internal storage unit of the retrieval server 1, such as a hard disk or memory of the retrieval server 1. The storage device 11 may also be an external storage device of the retrieval server 1 in other embodiments, such as a plug-in hard disk equipped on the retrieval server 1, a smart memory card (SMC), and security. Digital (Secure Digital, SD) card, flash card (Flash Card), etc. Further, the storage device 11 may also include both an internal storage unit of the retrieval server 1 and an external storage device. The storage device 11 is configured to store application software and various types of data installed in the search server 1, such as program codes of the information retrieval system 10, and the like. The storage device 11 can also be used to temporarily store data that has been output or is about to be output.
所述处理设备12在一些实施例中可以是一中央处理器(Central Processing Unit,CPU),微处理器或其他数据处理芯片,用于运行所述存储设备11中存储的程序代码或处理数据,例如执行所述信息检索系统10等。The processing device 12 may be a central processing unit (CPU), a microprocessor or other data processing chip for running program code or processing data stored in the storage device 11, in some embodiments, For example, the information retrieval system 10 and the like are executed.
进一步地,在一些实施例中所述检索服务器1还可以包括显示器,该显示器在一些实施例中可以是LED显示器、液晶显示器、触控式液晶显示器以及OLED(Organic Light-Emitting Diode,有机发光二极管)触摸器等。所述显示器用于显示在所述检索服务器1中处理的信息以及用于显示可视化的用户界面,例如供用户输入待检索内容的界面、显示检索完成后的查询结果的界面等。所述检索服务器1的存储设备11、处理设备12与所述显示器通过系统总线相互通信。Further, in some embodiments, the search server 1 may further include a display, which in some embodiments may be an LED display, a liquid crystal display, a touch liquid crystal display, and an OLED (Organic Light-Emitting Diode). ) Toucher, etc. The display is for displaying information processed in the search server 1 and a user interface for displaying visualization, such as an interface for the user to input the content to be retrieved, an interface for displaying the query result after the retrieval is completed, and the like. The storage device 11, the processing device 12 of the retrieval server 1 and the display communicate with each other through a system bus.
请参阅图2,是本发明信息检索系统10一实施例的功能模块图。在本实施例中,所述的信息检索系统10可以被分割成一个或多个模块,所述一个或者多个模块被存储于所述存储设备11中,并由一个或多个处理设备(本实施例为所述处理设备12)所执行,以完成本发明。例如,在图2中,所述的信息检索系统10可以被分割成转化模块01、查询模块02及输出模块03。本发明所称的模块是指能够完成特定功能的一系列计算机程序指令段,比程序更适合于描述所述信息检索系统10在所述检索服务器1中的执行过程。以下描述将具体介绍所述转化模块01、查询模块02及输出模块03的功能。 Please refer to FIG. 2, which is a functional block diagram of an embodiment of the information retrieval system 10 of the present invention. In this embodiment, the information retrieval system 10 may be divided into one or more modules, the one or more modules being stored in the storage device 11 and being processed by one or more processing devices (this Embodiments are performed by the processing device 12) to complete the present invention. For example, in FIG. 2, the information retrieval system 10 can be divided into a conversion module 01, a query module 02, and an output module 03. A module referred to in the present invention refers to a series of computer program instruction segments capable of performing a specific function, and is more suitable than the program to describe the execution process of the information retrieval system 10 in the retrieval server 1. The following description will specifically describe the functions of the conversion module 01, the query module 02, and the output module 03.
转化模块01,用于在收到待检索内容后,将所述待检索内容转化为预设信息格式的待检索信息。The conversion module 01 is configured to convert the content to be retrieved into the information to be retrieved in a preset information format after receiving the content to be retrieved.
接收用户的输入内容,并对该输入内容进行解析获取待检索内容,如接收用户通过浏览器、APP客户端等发送的输入内容(包括但不限于语音、文字、图片等),并解析该输入内容中的待检索内容如检索关键字、词组等。解析出待检索内容后,将所述待检索内容转化为预设信息格式的待检索信息,并将转化后的预设信息格式的待检索信息输出至检索的下一环节。其中,该预设信息格式可以为预先确定的方便信息流传输及解析的格式,可定义该预设信息格式中包含的信息类型,如该预设信息格式的信息中可包含信息的主要内容,还可包含标识码,甚至还可包括用户信息等。例如,在预设信息格式的待检索信息中,可包含待检索信息中所对应的主要待检索内容如检索关键字、词组等,还可包含该主要待检索内容对应的标识码,以便后续根据该标识码识别出该主要待检索内容对应的检索结果,预设信息格式的待检索信息中还可包括该主要待检索内容对应的用户信息,即输入该主要待检索内容的用户信息,以便后续将该主要待检索内容对应的检索结果与该用户相关联。Receiving input content of the user, and parsing the input content to obtain content to be retrieved, such as receiving input content (including but not limited to voice, text, picture, etc.) sent by the user through a browser, an APP client, etc., and parsing the input The content to be retrieved in the content is such as a search keyword, a phrase, and the like. After parsing the content to be retrieved, the content to be retrieved is converted into the information to be retrieved in a preset information format, and the information to be retrieved in the converted preset information format is output to the next step of the search. The preset information format may be a pre-determined format for facilitating information flow transmission and parsing, and may define a type of information included in the preset information format. For example, the information of the preset information format may include main content of the information. It may also contain an identification code and may even include user information and the like. For example, the information to be retrieved in the preset information format may include the main to-be-searched content corresponding to the information to be retrieved, such as a search keyword, a phrase, and the like, and may further include an identifier corresponding to the main content to be retrieved, so as to be subsequently The identification code identifies the search result corresponding to the main content to be retrieved, and the information to be retrieved in the preset information format may further include user information corresponding to the main content to be retrieved, that is, input user information of the main content to be retrieved, so as to be followed. The search result corresponding to the main content to be retrieved is associated with the user.
查询模块02,用于根据所述待检索信息解析获取所述待检索信息中的待检索内容,根据解析出的待检索内容在预先确定的数据库中进行查找获取查询结果,并将所述查询结果转化为所述预设信息格式的查询信息。The query module 02 is configured to obtain the content to be retrieved in the to-be-searched information according to the to-be-searched information, and perform a search in a predetermined database according to the parsed content to be retrieved, and obtain the query result, and the query result is obtained. The query information is converted into the preset information format.
在进行查询时,先对预设信息格式的待检索信息进行解析,如可按预先确定的解析方式解析获取所述待检索信息中的待检索内容,再根据解析出的待检索内容在预先确定的数据库中进行查找获取查询结果,该预先确定的数据库既可以是本地的数据库,也可以是通过网络连接的远程数据库,在此不做限定。根据待检索内容在数据库中进行查找获取查询结果后,将所述查询结果转化为所述预设信息格式的查询信息,并将转化后的预设信息格式的待查询信息输出至检索的下一环节,例如对查询结果进行进一步处理的环节或直接显示输出查询结果的环节。其中,所述查询信息的格式与所述待检索信息的格式相同,均为预先确定好的信息格式。When the query is performed, the information to be retrieved in the preset information format is first parsed, and the content to be retrieved in the to-be-searched information may be parsed according to a predetermined parsing manner, and then the content to be retrieved is determined according to the parsed content. The query is performed in the database to obtain the query result. The predetermined database may be a local database or a remote database connected through a network, which is not limited herein. After the query result is obtained in the database according to the content to be retrieved, the query result is converted into the query information of the preset information format, and the information to be queried in the converted preset information format is output to the next searched Links, such as the process of further processing the query results or directly displaying the results of the output of the query results. The format of the query information is the same as the format of the information to be retrieved, and all are predetermined information formats.
输出模块03,用于根据所述查询信息解析获取查询结果,并对所述查询结果进行排序及输出。The output module 03 is configured to parse the query result according to the query information, and sort and output the query result.
在接收到查询信息后,可按预先确定的解析方式对预设信息格式的查询信息进行解析,获取所述查询信息的内容即查询结果,即可对所述查询结果进行排序及输出,供用户查看最终的检索查询结果,完成检索的流程。After receiving the query information, the query information of the preset information format may be parsed according to a predetermined parsing manner, and the content of the query information, that is, the query result, may be obtained, and the query result may be sorted and outputted for the user. View the final search query results and complete the search process.
本实施例中仅对信息检索流程中的各个步骤如检索信息输入、查询、输出等之间信息传递的格式进行了统一,确保每一个环节的输入和输出都是一样的,这样处理后的数据可以直接传入下一个环节继续处理而不需要进行任何转变。本实施例中不干涉某个步骤的具体实现方法,如果想使用不同的方法来处理信息,可开发不同功能的插件来实现其需要的功能,最后将插件以一定的顺序排列起来即可,可以广泛通用于各种项目应用场景而不需要做过多代码改写,不同项目仅仅通过不同插件的开发和排序即可实现。即针对不 同情况的信息检索,不需要重构整个代码,只需要编写几个插件即可使用,有着良好的通用性。In this embodiment, only the formats of information transfer, such as search information input, query, output, etc., are unified in each step of the information retrieval process, ensuring that the input and output of each link are the same, so that the processed data is processed. You can proceed directly to the next step and continue processing without any changes. In this embodiment, the specific implementation method does not interfere with a certain step. If you want to use different methods to process information, you can develop plug-ins with different functions to implement the functions you need. Finally, you can arrange the plug-ins in a certain order. It is widely used in various project application scenarios without excessive code rewriting. Different projects can be realized only through the development and sorting of different plug-ins. That is not The same information retrieval, do not need to refactor the entire code, just need to write a few plug-ins to use, with good versatility.
本实施例在收到待检索内容后,将待检索内容转化为预设信息格式的待检索信息;进行数据库查询时,根据预设信息格式的待检索信息解析获取待检索内容,在数据库中进行查找获取查询结果,并将所述查询结果转化为所述预设信息格式的查询信息;在输出结果时,通过解析所述查询信息获取查询结果来进行检索结果的排序及输出。由于能将信息检索的各个步骤之间信息流的传输格式统一为相同的预设信息格式,这样,在需要增加或较少若干步骤以使信息检索系统适用于各种不同的项目应用场景时,不需要做过多代码改写,系统及框架不需要做任何改变,大大提升了通用性。After receiving the content to be retrieved, the embodiment converts the content to be retrieved into the information to be retrieved in a preset information format; when performing the database query, the content to be retrieved is parsed according to the information to be retrieved in the preset information format, and is performed in the database. The search result is obtained, and the query result is converted into the query information of the preset information format; when the result is output, the query result is obtained by parsing the query information to perform sorting and outputting of the search result. Since the transmission format of the information flow between the steps of the information retrieval can be unified into the same preset information format, when more or less steps are needed to make the information retrieval system suitable for various project application scenarios, No need to do too much code rewriting, the system and framework do not need to make any changes, greatly improving the versatility.
进一步地,在其他实施例中,所述预设信息格式为三元组格式,所述预设信息格式的信息包括信息的标识码、信息内容数据和该信息对应的权重数据。Further, in other embodiments, the preset information format is a triplet format, and the information of the preset information format includes an identification code of the information, information content data, and weight data corresponding to the information.
本实施例中,考虑到实际项目应用和通用性的需求,将信息检索流程中的各个环节之间信息传递的预设信息格式设置为三元组(id,info,score)的形式,其中id为信息对应标识码如编码;info为主要信息内容,可以为多种格式;而score则是该信息对应的权重。每一个环节如检索信息输入、查询、输出等中信息的传入和传出均采用这种格式,在每个环节内部可以对info和score两部分内容进行对应的处理,然后再封装成这种格式的信息组传出该环节,进入下一个环节,直至最终输出检索查询结果,完成检索的流程。In this embodiment, the preset information format for information transfer between each link in the information retrieval process is set to a form of a triplet (id, info, score) in consideration of the actual project application and versatility requirements, wherein id The information corresponding to the identification code is encoded; info is the main information content, which can be in multiple formats; and score is the weight corresponding to the information. Each link, such as the input and transmission of information in the search information input, query, output, etc., adopts this format. Within each link, the two parts of info and score can be processed correspondingly, and then packaged into this. The formatted information group passes the link and enters the next link until the final output of the search query result completes the search process.
进一步地,在其他实施例中,上述查询模块02还可以用于:Further, in other embodiments, the foregoing query module 02 can also be used to:
根据所述待检索信息解析获取所述待检索信息中的待检索内容,根据解析出的待检索内容在预先确定的数据库中进行查找获取查询结果,所述查询结果包括所述待检索内容在预先确定的数据库中所处的位置及出现次数,将所述查询结果转化为所述预设信息格式的查询信息;其中,所述查询信息中的标识码为预设的所述待检索内容的标识码,所述查询信息中的信息内容数据为所述待检索内容在预先确定的数据库中所处的位置信息,所述查询信息中的权重数据为所述待检索内容在预先确定的数据库中所处的位置的出现次数;Acquiring and acquiring the to-be-searched content in the to-be-retrieved information according to the to-be-searched information, performing a search in a predetermined database according to the parsed content to be retrieved, and obtaining the query result, where the query result includes the content to be retrieved in advance Determining the location and the number of occurrences in the database, and converting the query result into the query information of the preset information format; wherein the identifier code in the query information is a preset identifier of the content to be retrieved a code, the information content data in the query information is location information of the content to be retrieved in a predetermined database, and the weight data in the query information is that the content to be retrieved is in a predetermined database. The number of occurrences of the location;
上述输出模块03还可以用于:The above output module 03 can also be used to:
根据所述查询信息解析获取所述待检索内容在预先确定的数据库中所处的位置信息及在预先确定的数据库中所处的位置的出现次数,并根据所述待检索内容在预先确定的数据库中所处的位置信息及出现次数进行排序,输出排序后的查询结果。Obtaining, according to the query information, location information of the content to be retrieved in a predetermined database and the number of occurrences of the location in the predetermined database, and according to the content to be retrieved in a predetermined database Sort the position information and the number of occurrences in the middle, and output the sorted query result.
在一个检索实例中,比如需要查找数据库中所有含有“平安”字样的文章。首先,转化模块01将该输入转化为预设信息格式即一个三元组(1,“平 安”,0)的待检索信息,然后传入查找模块02进行查找。在查找模块02中,首先解析出预设信息格式的待检索信息中的待检索内容为“平安”,然后该查找模块02就在数据库中找到所有含有“平安”的文章编号,通过匹配,该查找模块02传出的预设信息格式的查询信息为信息组(1,“1#4#7”,“3#1#2”),其中第二个部分“1#4#7”表示编号为1、4、7的文章含有“平安”,而“3#1#2”则表示三篇文章分别含有三个、一个和两个“平安”。输出模块03根据预设信息格式的查询信息(1,“1#4#7”,“3#1#2”)解析获取查询结果,首先按照“1#4#7”找到相应文章,然后根据“3#1#2”进行排序,依次给用户呈现第一篇文章,第七篇文章和第四篇文章,完成信息检索的流程。In a search instance, for example, you need to find all articles in the database that contain the word "Peace". First, the conversion module 01 converts the input into a preset information format, ie, a triple (1, "flat" The information to be retrieved by An", 0) is then sent to the search module 02 for searching. In the search module 02, the content to be retrieved in the information to be retrieved in the preset information format is first parsed as "Ping", and then the search module 02, all the article numbers containing "Peace" are found in the database. By matching, the query information of the preset information format transmitted by the search module 02 is the information group (1, "1#4#7", "3#1 #2"), where the second part "1#4#7" indicates that the articles numbered 1, 4, and 7 contain "safe", and "3#1#2" indicates that the three articles contain three, respectively. One and two “safe.” The output module 03 parses the query result according to the query information of the preset information format (1, “1#4#7”, “3#1#2”), first according to “1#4#” 7" find the corresponding article, and then sort according to "3#1#2", and then present the first article, the seventh article and the fourth article to the user, and complete the process of information retrieval.
如图3所示,本发明另一实施例提出一种信息检索系统,在上述实施例的基础上,还包括:As shown in FIG. 3, another embodiment of the present invention provides an information retrieval system. Based on the foregoing embodiments, the method further includes:
处理模块04,用于在收到用户发送的原始检索信息后,对所述原始检索信息利用预设的自然语言处理方式进行处理,提取出所述原始检索信息中的待检索内容。The processing module 04 is configured to process the original search information by using a preset natural language processing manner after receiving the original search information sent by the user, and extract the content to be retrieved in the original search information.
本实施例中,在接收到用户发送的原始检索信息后,可对所述原始检索信息进行切词、中心词提取、专业词汇提取、匹配、召回、知识图谱、命名实体识别等中文自然语言处理,以有效地提取出所述原始检索信息中的待检索内容,提高检索效率。In this embodiment, after receiving the original search information sent by the user, the original search information may be subjected to Chinese natural language processing such as word cutting, central word extraction, professional vocabulary extraction, matching, recall, knowledge map, and named entity recognition. In order to effectively extract the content to be retrieved in the original search information, and improve retrieval efficiency.
进一步地,在其他实施例中,上述输出模块03还可以用于:Further, in other embodiments, the output module 03 may be further configured to:
根据所述查询信息解析获取查询结果及用户身份数据;根据所述用户身份数据获取该用户的历史检索记录;将所述查询结果与该用户的历史检索记录进行匹配,并排序及输出与该用户的历史检索记录相匹配的查询结果。Obtaining the query result and the user identity data according to the query information; acquiring the historical search record of the user according to the user identity data; matching the query result with the historical search record of the user, and sorting and outputting the user with the user The history of the search records matches the results of the query.
本实施例中,在三元组信息的传递中还可包含用户身份数据如身份证号、工号、手机号等,在根据所述查询信息解析获取查询结果及用户身份数据后,可根据所述用户身份数据获取该用户的历史检索记录,如该用户在本地的历史检索记录或在多个关联服务器、浏览器、互联网上的历史检索记录。将解析得到的所述查询结果与该用户的历史检索记录进行匹配,并排序及输出与该用户的历史检索记录相匹配的查询结果,从而实现根据用户的个人信息和使用历史为其推荐个性化的检索内容,以提升用户体验。In this embodiment, the user identity data such as the ID number, the job number, the mobile phone number, and the like may also be included in the transmission of the triplet information. After the query result and the user identity data are obtained according to the query information, the The user identity data obtains the historical search record of the user, such as the local history retrieval record of the user or the historical retrieval record on multiple associated servers, browsers, and the Internet. Matching the parsed result with the historical search record of the user, and sorting and outputting the query result that matches the historical search record of the user, thereby implementing personalized recommendation according to the user's personal information and usage history. Retrieve content to enhance the user experience.
本发明进一步提供一种信息检索方法。The invention further provides an information retrieval method.
参照图4,图4为本发明信息检索方法一实施例的流程示意图。Referring to FIG. 4, FIG. 4 is a schematic flowchart of an embodiment of an information retrieval method according to an embodiment of the present invention.
在一实施例中,该信息检索方法包括:In an embodiment, the information retrieval method comprises:
步骤S10,在收到待检索内容后,将所述待检索内容转化为预设信息格式的待检索信息;Step S10: After receiving the content to be retrieved, converting the content to be retrieved into information to be retrieved in a preset information format;
接收用户的输入内容,并对该输入内容进行解析获取待检索内容,如接收用户通过浏览器、APP客户端等发送的输入内容(包括但不限于语音、文 字、图片等),并解析该输入内容中的待检索内容如检索关键字、词组等。解析出待检索内容后,将所述待检索内容转化为预设信息格式的待检索信息,并将转化后的预设信息格式的待检索信息输出至检索的下一环节。其中,该预设信息格式可以为预先确定的方便信息流传输及解析的格式,可定义该预设信息格式中包含的信息类型,如该预设信息格式的信息中可包含信息的主要内容,还可包含标识码,甚至还可包括用户信息等。例如,在预设信息格式的待检索信息中,可包含待检索信息中所对应的主要待检索内容如检索关键字、词组等,还可包含该主要待检索内容对应的标识码,以便后续根据该标识码识别出该主要待检索内容对应的检索结果,预设信息格式的待检索信息中还可包括该主要待检索内容对应的用户信息,即输入该主要待检索内容的用户信息,以便后续将该主要待检索内容对应的检索结果与该用户相关联。Receiving input content of the user, and parsing the input content to obtain content to be retrieved, such as receiving input content (including but not limited to voice and text) sent by the user through a browser, an APP client, or the like Words, pictures, etc.), and parsing the content to be retrieved in the input content such as search keywords, phrases, and the like. After parsing the content to be retrieved, the content to be retrieved is converted into the information to be retrieved in a preset information format, and the information to be retrieved in the converted preset information format is output to the next step of the search. The preset information format may be a pre-determined format for facilitating information flow transmission and parsing, and may define a type of information included in the preset information format. For example, the information of the preset information format may include main content of the information. It may also contain an identification code and may even include user information and the like. For example, the information to be retrieved in the preset information format may include the main to-be-searched content corresponding to the information to be retrieved, such as a search keyword, a phrase, and the like, and may further include an identifier corresponding to the main content to be retrieved, so as to be subsequently The identification code identifies the search result corresponding to the main content to be retrieved, and the information to be retrieved in the preset information format may further include user information corresponding to the main content to be retrieved, that is, input user information of the main content to be retrieved, so as to be followed. The search result corresponding to the main content to be retrieved is associated with the user.
步骤S20,根据所述待检索信息解析获取所述待检索信息中的待检索内容,根据解析出的待检索内容在预先确定的数据库中进行查找获取查询结果,并将所述查询结果转化为所述预设信息格式的查询信息;Step S20, parsing and acquiring the to-be-searched content in the to-be-retrieved information according to the to-be-searched information, performing a search in a predetermined database according to the parsed content to be retrieved, and obtaining the query result, and converting the query result into a Query information of a preset information format;
在进行查询时,先对预设信息格式的待检索信息进行解析,如可按预先确定的解析方式解析获取所述待检索信息中的待检索内容,再根据解析出的待检索内容在预先确定的数据库中进行查找获取查询结果,该预先确定的数据库既可以是本地的数据库,也可以是通过网络连接的远程数据库,在此不做限定。根据待检索内容在数据库中进行查找获取查询结果后,将所述查询结果转化为所述预设信息格式的查询信息,并将转化后的预设信息格式的待查询信息输出至检索的下一环节,例如对查询结果进行进一步处理的环节或直接显示输出查询结果的环节。其中,所述查询信息的格式与所述待检索信息的格式相同,均为预先确定好的信息格式。When the query is performed, the information to be retrieved in the preset information format is first parsed, and the content to be retrieved in the to-be-searched information may be parsed according to a predetermined parsing manner, and then the content to be retrieved is determined according to the parsed content. The query is performed in the database to obtain the query result. The predetermined database may be a local database or a remote database connected through a network, which is not limited herein. After the query result is obtained in the database according to the content to be retrieved, the query result is converted into the query information of the preset information format, and the information to be queried in the converted preset information format is output to the next searched Links, such as the process of further processing the query results or directly displaying the results of the output of the query results. The format of the query information is the same as the format of the information to be retrieved, and all are predetermined information formats.
步骤S30,根据所述查询信息解析获取查询结果,并对所述查询结果进行排序及输出。Step S30: Acquire a query result according to the query information, and sort and output the query result.
在接收到查询信息后,可按预先确定的解析方式对预设信息格式的查询信息进行解析,获取所述查询信息的内容即查询结果,即可对所述查询结果进行排序及输出,供用户查看最终的检索查询结果,完成检索的流程。After receiving the query information, the query information of the preset information format may be parsed according to a predetermined parsing manner, and the content of the query information, that is, the query result, may be obtained, and the query result may be sorted and outputted for the user. View the final search query results and complete the search process.
本实施例中仅对信息检索流程中的各个步骤如检索信息输入、查询、输出等之间信息传递的格式进行了统一,确保每一个环节的输入和输出都是一样的,这样处理后的数据可以直接传入下一个环节继续处理而不需要进行任何转变。本实施例中不干涉某个步骤的具体实现方法,如果想使用不同的方法来处理信息,可开发不同功能的插件来实现其需要的功能,最后将插件以一定的顺序排列起来即可,可以广泛通用于各种项目应用场景而不需要做过多代码改写,不同项目仅仅通过不同插件的开发和排序即可实现。即针对不同情况的信息检索,不需要重构整个代码,只需要编写几个插件即可使用,有着良好的通用性。In this embodiment, only the formats of information transfer, such as search information input, query, output, etc., are unified in each step of the information retrieval process, ensuring that the input and output of each link are the same, so that the processed data is processed. You can proceed directly to the next step and continue processing without any changes. In this embodiment, the specific implementation method does not interfere with a certain step. If you want to use different methods to process information, you can develop plug-ins with different functions to implement the functions you need. Finally, you can arrange the plug-ins in a certain order. It is widely used in various project application scenarios without excessive code rewriting. Different projects can be realized only through the development and sorting of different plug-ins. That is to say, for different situations of information retrieval, there is no need to refactor the entire code, only need to write a few plug-ins to use, and has good versatility.
本实施例在收到待检索内容后,将待检索内容转化为预设信息格式的待 检索信息;进行数据库查询时,根据预设信息格式的待检索信息解析获取待检索内容,在数据库中进行查找获取查询结果,并将所述查询结果转化为所述预设信息格式的查询信息;在输出结果时,通过解析所述查询信息获取查询结果来进行检索结果的排序及输出。由于能将信息检索的各个步骤之间信息流的传输格式统一为相同的预设信息格式,这样,在需要增加或较少若干步骤以使信息检索系统适用于各种不同的项目应用场景时,不需要做过多代码改写,系统及框架不需要做任何改变,大大提升了通用性。After receiving the content to be retrieved, the embodiment converts the content to be retrieved into a preset information format. Retrieving information; performing a database query, parsing the content to be retrieved according to the information to be retrieved in a preset information format, performing a search in the database to obtain a query result, and converting the query result into the query information of the preset information format; When the result is output, the query result is obtained by parsing the query information to sort and output the search result. Since the transmission format of the information flow between the steps of the information retrieval can be unified into the same preset information format, when more or less steps are needed to make the information retrieval system suitable for various project application scenarios, No need to do too much code rewriting, the system and framework do not need to make any changes, greatly improving the versatility.
进一步地,在其他实施例中,所述预设信息格式为三元组格式,所述预设信息格式的信息包括信息的标识码、信息内容数据和该信息对应的权重数据。Further, in other embodiments, the preset information format is a triplet format, and the information of the preset information format includes an identification code of the information, information content data, and weight data corresponding to the information.
本实施例中,考虑到实际项目应用和通用性的需求,将信息检索流程中的各个环节之间信息传递的预设信息格式设置为三元组(id,info,score)的形式,其中id为信息对应标识码如编码;info为主要信息内容,可以为多种格式;而score则是该信息对应的权重。每一个环节如检索信息输入、查询、输出等中信息的传入和传出均采用这种格式,在每个环节内部可以对info和score两部分内容进行对应的处理,然后再封装成这种格式的信息组传出该环节,进入下一个环节,直至最终输出检索查询结果,完成检索的流程。In this embodiment, the preset information format for information transfer between each link in the information retrieval process is set to a form of a triplet (id, info, score) in consideration of the actual project application and versatility requirements, wherein id The information corresponding to the identification code is encoded; info is the main information content, which can be in multiple formats; and score is the weight corresponding to the information. Each link, such as the input and transmission of information in the search information input, query, output, etc., adopts this format. Within each link, the two parts of info and score can be processed correspondingly, and then packaged into this. The formatted information group passes the link and enters the next link until the final output of the search query result completes the search process.
进一步地,在其他实施例中,上述步骤S20还可以包括:Further, in other embodiments, the foregoing step S20 may further include:
根据所述待检索信息解析获取所述待检索信息中的待检索内容,根据解析出的待检索内容在预先确定的数据库中进行查找获取查询结果,所述查询结果包括所述待检索内容在预先确定的数据库中所处的位置及出现次数,将所述查询结果转化为所述预设信息格式的查询信息;其中,所述查询信息中的标识码为预设的所述待检索内容的标识码,所述查询信息中的信息内容数据为所述待检索内容在预先确定的数据库中所处的位置信息,所述查询信息中的权重数据为所述待检索内容在预先确定的数据库中所处的位置的出现次数;Acquiring and acquiring the to-be-searched content in the to-be-retrieved information according to the to-be-searched information, performing a search in a predetermined database according to the parsed content to be retrieved, and obtaining the query result, where the query result includes the content to be retrieved in advance Determining the location and the number of occurrences in the database, and converting the query result into the query information of the preset information format; wherein the identifier code in the query information is a preset identifier of the content to be retrieved a code, the information content data in the query information is location information of the content to be retrieved in a predetermined database, and the weight data in the query information is that the content to be retrieved is in a predetermined database. The number of occurrences of the location;
上述步骤S30还可以包括:The above step S30 may further include:
根据所述查询信息解析获取所述待检索内容在预先确定的数据库中所处的位置信息及在预先确定的数据库中所处的位置的出现次数,并根据所述待检索内容在预先确定的数据库中所处的位置信息及出现次数进行排序,输出排序后的查询结果。Obtaining, according to the query information, location information of the content to be retrieved in a predetermined database and the number of occurrences of the location in the predetermined database, and according to the content to be retrieved in a predetermined database Sort the position information and the number of occurrences in the middle, and output the sorted query result.
在一个检索实例中,比如需要查找数据库中所有含有“平安”字样的文章。首先,转化模块01将该输入转化为预设信息格式即一个三元组(1,“平安”,0)的待检索信息,然后传入查找模块02进行查找。在查找模块02中,首先解析出预设信息格式的待检索信息中的待检索内容为“平安”,然后该查找模块02就在数据库中找到所有含有“平安”的文章编号,通过匹配,该查找模块02传出的预设信息格式的查询信息为信息组(1,“1#4#7”,“3#1#2”), 其中第二个部分“1#4#7”表示编号为1、4、7的文章含有“平安”,而“3#1#2”则表示三篇文章分别含有三个、一个和两个“平安”。输出模块03根据预设信息格式的查询信息(1,“1#4#7”,“3#1#2”)解析获取查询结果,首先按照“1#4#7”找到相应文章,然后根据“3#1#2”进行排序,依次给用户呈现第一篇文章,第七篇文章和第四篇文章,完成信息检索的流程。In a search instance, for example, you need to find all articles in the database that contain the word "Peace". First, the conversion module 01 converts the input into a preset information format, that is, a triplet (1, "Ping", 0) to be retrieved information, and then passes the lookup module 02 for searching. In the search module 02, the content to be retrieved in the information to be retrieved in the preset information format is first parsed as “Ping”, and then the search module 02 finds all the article numbers containing “Peace” in the database, and by matching, The query information of the preset information format sent by the search module 02 is an information group (1, "1#4#7", "3#1#2"), The second part "1#4#7" indicates that the articles numbered 1, 4, and 7 contain "safe", while the "3#1#2" indicates that the three articles contain three, one, and two respectively. Peaceful." The output module 03 parses the query result according to the query information of the preset information format (1, "1#4#7", "3#1#2"), first finds the corresponding article according to "1#4#7", and then according to "3#1#2" is sorted, and the user first presents the first article, the seventh article and the fourth article, and completes the process of information retrieval.
如图5所示,本发明另一实施例提出一种信息检索方法,在上述实施例的基础上,在所述步骤S10之前,还包括:As shown in FIG. 5, another embodiment of the present invention provides an information retrieval method. On the basis of the foregoing embodiment, before the step S10, the method further includes:
步骤S40,在收到用户发送的原始检索信息后,对所述原始检索信息利用预设的自然语言处理方式进行处理,提取出所述原始检索信息中的待检索内容。Step S40: After receiving the original search information sent by the user, the original search information is processed by using a preset natural language processing manner, and the content to be retrieved in the original search information is extracted.
本实施例中,在接收到用户发送的原始检索信息后,可对所述原始检索信息进行切词、中心词提取、专业词汇提取、匹配、召回、知识图谱、命名实体识别等中文自然语言处理,以有效地提取出所述原始检索信息中的待检索内容,提高检索效率。In this embodiment, after receiving the original search information sent by the user, the original search information may be subjected to Chinese natural language processing such as word cutting, central word extraction, professional vocabulary extraction, matching, recall, knowledge map, and named entity recognition. In order to effectively extract the content to be retrieved in the original search information, and improve retrieval efficiency.
进一步地,在其他实施例中,上述步骤S30还可以包括:Further, in other embodiments, the foregoing step S30 may further include:
根据所述查询信息解析获取查询结果及用户身份数据;根据所述用户身份数据获取该用户的历史检索记录;将所述查询结果与该用户的历史检索记录进行匹配,并排序及输出与该用户的历史检索记录相匹配的查询结果。Obtaining the query result and the user identity data according to the query information; acquiring the historical search record of the user according to the user identity data; matching the query result with the historical search record of the user, and sorting and outputting the user with the user The history of the search records matches the results of the query.
本实施例中,在三元组信息的传递中还可包含用户身份数据如身份证号、工号、手机号等,在根据所述查询信息解析获取查询结果及用户身份数据后,可根据所述用户身份数据获取该用户的历史检索记录,如该用户在本地的历史检索记录或在多个关联服务器、浏览器、互联网上的历史检索记录。将解析得到的所述查询结果与该用户的历史检索记录进行匹配,并排序及输出与该用户的历史检索记录相匹配的查询结果,从而实现根据用户的个人信息和使用历史为其推荐个性化的检索内容,以提升用户体验。In this embodiment, the user identity data such as the ID number, the job number, the mobile phone number, and the like may also be included in the transmission of the triplet information. After the query result and the user identity data are obtained according to the query information, the The user identity data obtains the historical search record of the user, such as the local history retrieval record of the user or the historical retrieval record on multiple associated servers, browsers, and the Internet. Matching the parsed result with the historical search record of the user, and sorting and outputting the query result that matches the historical search record of the user, thereby implementing personalized recommendation according to the user's personal information and usage history. Retrieve content to enhance the user experience.
此外,本发明还提供一种计算机可读存储介质,所述计算机可读存储介质存储有信息检索系统,所述信息检索系统可被至少一个处理设备执行,以使所述至少一个处理设备执行如上述实施例中的信息检索方法的步骤,该信息检索方法的步骤S10、S20、S30等具体实施过程如上文所述,在此不再赘述。Moreover, the present invention also provides a computer readable storage medium storing an information retrieval system, the information retrieval system being executable by at least one processing device to cause the at least one processing device to perform The steps of the information retrieval method in the above embodiment, the specific implementation processes of the steps S10, S20, and S30 of the information retrieval method are as described above, and are not described herein again.
需要说明的是,在本文中,术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、物品或者装置不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、物品或者装置所固有的要素。在没有更多限制的情况下,由语句“包括一个……”限定的要素,并不排除在包括该要素的过程、 方法、物品或者装置中还存在另外的相同要素。It is to be understood that the term "comprises", "comprising", or any other variants thereof, is intended to encompass a non-exclusive inclusion, such that a process, method, article, or device comprising a series of elements includes those elements. It also includes other elements that are not explicitly listed, or elements that are inherent to such a process, method, article, or device. In the absence of more restrictions, an element defined by the phrase "including one..." is not excluded from the process of including the element, There are additional identical elements in the method, article or device.
通过以上的实施方式的描述,本领域的技术人员可以清楚地了解到上述实施例方法可借助软件加必需的通用硬件平台的方式来实现,当然也可以通过硬件来实现,但很多情况下前者是更佳的实施方式。基于这样的理解,本发明的技术方案本质上或者说对现有技术做出贡献的部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质(如ROM/RAM、磁碟、光盘)中,包括若干指令用以使得一台终端设备(可以是手机,计算机,服务器,空调器,或者网络设备等)执行本发明各个实施例所述的方法。Through the description of the above embodiments, those skilled in the art can clearly understand that the foregoing embodiment method can be implemented by means of software plus a necessary general hardware platform, and can also be implemented by hardware, but in many cases, the former is A better implementation. Based on such understanding, the technical solution of the present invention, which is essential or contributes to the prior art, may be embodied in the form of a software product stored in a storage medium (such as ROM/RAM, disk, The optical disc includes a number of instructions for causing a terminal device (which may be a cell phone, a computer, a server, an air conditioner, or a network device, etc.) to perform the methods described in various embodiments of the present invention.
以上参照附图说明了本发明的优选实施例,并非因此局限本发明的权利范围。上述本发明实施例序号仅仅为了描述,不代表实施例的优劣。另外,虽然在流程图中示出了逻辑顺序,但是在某些情况下,可以以不同于此处的顺序执行所示出或描述的步骤。The preferred embodiments of the present invention have been described above with reference to the drawings, and are not intended to limit the scope of the invention. The serial numbers of the embodiments of the present invention are merely for the description, and do not represent the advantages and disadvantages of the embodiments. Additionally, although logical sequences are shown in the flowcharts, in some cases the steps shown or described may be performed in a different order than the ones described herein.
本领域技术人员不脱离本发明的范围和实质,可以有多种变型方案实现本发明,比如作为一个实施例的特征可用于另一实施例而得到又一实施例。凡在运用本发明的技术构思之内所作的任何修改、等同替换和改进,均应在本发明的权利范围之内。 A person skilled in the art can implement the invention in various variants without departing from the scope and spirit of the invention. For example, the features of one embodiment can be used in another embodiment to obtain a further embodiment. Any modifications, equivalent substitutions and improvements made within the technical concept of the invention are intended to be included within the scope of the invention.

Claims (20)

  1. 一种检索服务器,其特征在于,所述检索服务器包括存储设备、处理设备,所述存储设备上存储有可在所述处理设备上运行的信息检索系统,所述信息检索系统被所述处理设备执行时实现如下步骤:A retrieval server, comprising: a storage device, a processing device, on which is stored an information retrieval system operable on the processing device, the information retrieval system being the processing device The following steps are implemented during execution:
    A、在收到待检索内容后,将所述待检索内容转化为预设信息格式的待检索信息;A. After receiving the content to be retrieved, converting the to-be-searched content into the information to be retrieved in a preset information format;
    B、根据所述待检索信息解析获取所述待检索信息中的待检索内容,根据解析出的待检索内容在预先确定的数据库中进行查找获取查询结果,并将所述查询结果转化为所述预设信息格式的查询信息;B. Acquire, according to the information to be retrieved, the content to be retrieved in the to-be-searched information, perform a search in a predetermined database according to the parsed content to be retrieved, and obtain the query result, and convert the query result into the Query information of a preset information format;
    C、根据所述查询信息解析获取查询结果,并对所述查询结果进行排序及输出。C. Acquire a query result according to the query information, and sort and output the query result.
  2. 如权利要求1所述的检索服务器,其特征在于,所述预设信息格式为三元组格式,所述预设信息格式的信息包括信息的标识码、信息内容数据和该信息对应的权重数据。The search server according to claim 1, wherein the preset information format is a triplet format, and the information of the preset information format includes an identification code of the information, information content data, and weight data corresponding to the information. .
  3. 如权利要求2所述的检索服务器,其特征在于,所述步骤B包括:The retrieval server of claim 2 wherein said step B comprises:
    根据所述待检索信息解析获取所述待检索信息中的待检索内容,根据解析出的待检索内容在预先确定的数据库中进行查找获取查询结果,所述查询结果包括所述待检索内容在预先确定的数据库中所处的位置及出现次数,将所述查询结果转化为所述预设信息格式的查询信息;其中,所述查询信息中的标识码为预设的所述待检索内容的标识码,所述查询信息中的信息内容数据为所述待检索内容在预先确定的数据库中所处的位置信息,所述查询信息中的权重数据为所述待检索内容在预先确定的数据库中所处的位置的出现次数;Acquiring and acquiring the to-be-searched content in the to-be-retrieved information according to the to-be-searched information, performing a search in a predetermined database according to the parsed content to be retrieved, and obtaining the query result, where the query result includes the content to be retrieved in advance Determining the location and the number of occurrences in the database, and converting the query result into the query information of the preset information format; wherein the identifier code in the query information is a preset identifier of the content to be retrieved a code, the information content data in the query information is location information of the content to be retrieved in a predetermined database, and the weight data in the query information is that the content to be retrieved is in a predetermined database. The number of occurrences of the location;
    所述步骤C包括:The step C includes:
    根据所述查询信息解析获取所述待检索内容在预先确定的数据库中所处的位置信息及在预先确定的数据库中所处的位置的出现次数,并根据所述待检索内容在预先确定的数据库中所处的位置信息及出现次数进行排序,输出排序后的查询结果。Obtaining, according to the query information, location information of the content to be retrieved in a predetermined database and the number of occurrences of the location in the predetermined database, and according to the content to be retrieved in a predetermined database Sort the position information and the number of occurrences in the middle, and output the sorted query result.
  4. 如权利要求1、2或3所述的检索服务器,其特征在于,在所述步骤A之前,所述处理设备还用于执行所述信息检索系统,以实现以下步骤:A retrieval server according to claim 1, 2 or 3, wherein prior to said step A, said processing device is further configured to execute said information retrieval system to implement the following steps:
    在收到用户发送的原始检索信息后,对所述原始检索信息利用预设的自然语言处理方式进行处理,提取出所述原始检索信息中的待检索内容。After receiving the original search information sent by the user, the original search information is processed by using a preset natural language processing manner, and the content to be retrieved in the original search information is extracted.
  5. 如权利要求1、2或3所述的检索服务器,其特征在于,所述预设信息格式的信息包括用户身份数据;所述步骤C还包括: The retrieval server according to claim 1, 2 or 3, wherein the information of the preset information format includes user identity data; and the step C further comprises:
    根据所述查询信息解析获取查询结果及用户身份数据;Obtaining the query result and the user identity data according to the query information;
    根据所述用户身份数据获取该用户的历史检索记录;Obtaining a historical search record of the user according to the user identity data;
    将所述查询结果与该用户的历史检索记录进行匹配,并排序及输出与该用户的历史检索记录相匹配的查询结果。Matching the query result with the user's historical search record, and sorting and outputting the query result that matches the user's historical search record.
  6. 一种信息检索方法,应用于检索服务器,其特征在于,所述方法包括:An information retrieval method is applied to a retrieval server, wherein the method comprises:
    A、在收到待检索内容后,将所述待检索内容转化为预设信息格式的待检索信息;A. After receiving the content to be retrieved, converting the to-be-searched content into the information to be retrieved in a preset information format;
    B、根据所述待检索信息解析获取所述待检索信息中的待检索内容,根据解析出的待检索内容在预先确定的数据库中进行查找获取查询结果,并将所述查询结果转化为所述预设信息格式的查询信息;B. Acquire, according to the information to be retrieved, the content to be retrieved in the to-be-searched information, perform a search in a predetermined database according to the parsed content to be retrieved, and obtain the query result, and convert the query result into the Query information of a preset information format;
    C、根据所述查询信息解析获取查询结果,并对所述查询结果进行排序及输出。C. Acquire a query result according to the query information, and sort and output the query result.
  7. 如权利要求6所述的信息检索方法,其特征在于,所述预设信息格式为三元组格式,所述预设信息格式的信息包括信息的标识码、信息内容数据和该信息对应的权重数据。The information retrieval method according to claim 6, wherein the preset information format is a triplet format, and the information of the preset information format includes an identification code of the information, information content data, and a weight corresponding to the information. data.
  8. 如权利要求7所述的信息检索方法,其特征在于,所述步骤B包括:The information retrieval method according to claim 7, wherein said step B comprises:
    根据所述待检索信息解析获取所述待检索信息中的待检索内容,根据解析出的待检索内容在预先确定的数据库中进行查找获取查询结果,所述查询结果包括所述待检索内容在预先确定的数据库中所处的位置及出现次数,将所述查询结果转化为所述预设信息格式的查询信息;其中,所述查询信息中的标识码为预设的所述待检索内容的标识码,所述查询信息中的信息内容数据为所述待检索内容在预先确定的数据库中所处的位置信息,所述查询信息中的权重数据为所述待检索内容在预先确定的数据库中所处的位置的出现次数;Acquiring and acquiring the to-be-searched content in the to-be-retrieved information according to the to-be-searched information, performing a search in a predetermined database according to the parsed content to be retrieved, and obtaining the query result, where the query result includes the content to be retrieved in advance Determining the location and the number of occurrences in the database, and converting the query result into the query information of the preset information format; wherein the identifier code in the query information is a preset identifier of the content to be retrieved a code, the information content data in the query information is location information of the content to be retrieved in a predetermined database, and the weight data in the query information is that the content to be retrieved is in a predetermined database. The number of occurrences of the location;
    所述步骤C包括:The step C includes:
    根据所述查询信息解析获取所述待检索内容在预先确定的数据库中所处的位置信息及在预先确定的数据库中所处的位置的出现次数,并根据所述待检索内容在预先确定的数据库中所处的位置信息及出现次数进行排序,输出排序后的查询结果。Obtaining, according to the query information, location information of the content to be retrieved in a predetermined database and the number of occurrences of the location in the predetermined database, and according to the content to be retrieved in a predetermined database Sort the position information and the number of occurrences in the middle, and output the sorted query result.
  9. 如权利要求6、7或8所述的信息检索方法,其特征在于,在所述步骤A之前,还包括:The information retrieval method according to claim 6, 7 or 8, wherein before the step A, the method further comprises:
    在收到用户发送的原始检索信息后,对所述原始检索信息利用预设的自然语言处理方式进行处理,提取出所述原始检索信息中的待检索内容。After receiving the original search information sent by the user, the original search information is processed by using a preset natural language processing manner, and the content to be retrieved in the original search information is extracted.
  10. 如权利要求6、7或8所述的信息检索方法,其特征在于,所述预设 信息格式的信息包括用户身份数据;所述步骤C还包括:The information retrieval method according to claim 6, 7 or 8, wherein said preset The information in the information format includes user identity data; the step C further includes:
    根据所述查询信息解析获取查询结果及用户身份数据;Obtaining the query result and the user identity data according to the query information;
    根据所述用户身份数据获取该用户的历史检索记录;Obtaining a historical search record of the user according to the user identity data;
    将所述查询结果与该用户的历史检索记录进行匹配,并排序及输出与该用户的历史检索记录相匹配的查询结果。Matching the query result with the user's historical search record, and sorting and outputting the query result that matches the user's historical search record.
  11. 一种信息检索系统,其特征在于,所述信息检索系统包括:An information retrieval system, characterized in that the information retrieval system comprises:
    转化模块,用于在收到待检索内容后,将所述待检索内容转化为预设信息格式的待检索信息;a conversion module, configured to convert the content to be retrieved into information to be retrieved in a preset information format after receiving the content to be retrieved;
    查询模块,用于根据所述待检索信息解析获取所述待检索信息中的待检索内容,根据解析出的待检索内容在预先确定的数据库中进行查找获取查询结果,并将所述查询结果转化为所述预设信息格式的查询信息;a query module, configured to obtain, according to the information to be retrieved, the content to be retrieved in the to-be-searched information, perform a search in a predetermined database according to the parsed content to be retrieved, and obtain the query result, and convert the query result Query information for the preset information format;
    输出模块,用于根据所述查询信息解析获取查询结果,并对所述查询结果进行排序及输出。And an output module, configured to parse the query result according to the query information, and sort and output the query result.
  12. 如权利要求11所述的信息检索系统,其特征在于,所述预设信息格式为三元组格式,所述预设信息格式的信息包括信息的标识码、信息内容数据和该信息对应的权重数据。The information retrieval system according to claim 11, wherein the preset information format is a triplet format, and the information of the preset information format includes an identification code of the information, information content data, and a weight corresponding to the information. data.
  13. 如权利要求12所述的信息检索系统,其特征在于,所述查询模块还用于:The information retrieval system of claim 12, wherein the query module is further configured to:
    根据所述待检索信息解析获取所述待检索信息中的待检索内容,根据解析出的待检索内容在预先确定的数据库中进行查找获取查询结果,所述查询结果包括所述待检索内容在预先确定的数据库中所处的位置及出现次数,将所述查询结果转化为所述预设信息格式的查询信息;其中,所述查询信息中的标识码为预设的所述待检索内容的标识码,所述查询信息中的信息内容数据为所述待检索内容在预先确定的数据库中所处的位置信息,所述查询信息中的权重数据为所述待检索内容在预先确定的数据库中所处的位置的出现次数;Acquiring and acquiring the to-be-searched content in the to-be-retrieved information according to the to-be-searched information, performing a search in a predetermined database according to the parsed content to be retrieved, and obtaining the query result, where the query result includes the content to be retrieved in advance Determining the location and the number of occurrences in the database, and converting the query result into the query information of the preset information format; wherein the identifier code in the query information is a preset identifier of the content to be retrieved a code, the information content data in the query information is location information of the content to be retrieved in a predetermined database, and the weight data in the query information is that the content to be retrieved is in a predetermined database. The number of occurrences of the location;
    所述输出模块还用于:The output module is further configured to:
    根据所述查询信息解析获取所述待检索内容在预先确定的数据库中所处的位置信息及在预先确定的数据库中所处的位置的出现次数,并根据所述待检索内容在预先确定的数据库中所处的位置信息及出现次数进行排序,输出排序后的查询结果。Obtaining, according to the query information, location information of the content to be retrieved in a predetermined database and the number of occurrences of the location in the predetermined database, and according to the content to be retrieved in a predetermined database Sort the position information and the number of occurrences in the middle, and output the sorted query result.
  14. 如权利要求11、12或13所述的信息检索系统,其特征在于,还包括:The information retrieval system of claim 11, 12 or 13, further comprising:
    处理模块,用于在收到用户发送的原始检索信息后,对所述原始检索信息利用预设的自然语言处理方式进行处理,提取出所述原始检索信息中的待检索内容。 The processing module is configured to process the original search information by using a preset natural language processing manner after receiving the original search information sent by the user, and extract the content to be retrieved in the original search information.
  15. 如权利要求11、12或13所述的信息检索系统,其特征在于,所述预设信息格式的信息包括用户身份数据;所述输出模块还用于:The information retrieval system according to claim 11, 12 or 13, wherein the information of the preset information format includes user identity data; the output module is further configured to:
    根据所述查询信息解析获取查询结果及用户身份数据;Obtaining the query result and the user identity data according to the query information;
    根据所述用户身份数据获取该用户的历史检索记录;Obtaining a historical search record of the user according to the user identity data;
    将所述查询结果与该用户的历史检索记录进行匹配,并排序及输出与该用户的历史检索记录相匹配的查询结果。Matching the query result with the user's historical search record, and sorting and outputting the query result that matches the user's historical search record.
  16. 一种计算机可读存储介质,其上存储有至少一个可被处理设备执行以实现以下操作的计算机可读指令:A computer readable storage medium having stored thereon at least one computer readable instruction executable by a processing device to:
    A、在收到待检索内容后,将所述待检索内容转化为预设信息格式的待检索信息;A. After receiving the content to be retrieved, converting the to-be-searched content into the information to be retrieved in a preset information format;
    B、根据所述待检索信息解析获取所述待检索信息中的待检索内容,根据解析出的待检索内容在预先确定的数据库中进行查找获取查询结果,并将所述查询结果转化为所述预设信息格式的查询信息;B. Acquire, according to the information to be retrieved, the content to be retrieved in the to-be-searched information, perform a search in a predetermined database according to the parsed content to be retrieved, and obtain the query result, and convert the query result into the Query information of a preset information format;
    C、根据所述查询信息解析获取查询结果,并对所述查询结果进行排序及输出。C. Acquire a query result according to the query information, and sort and output the query result.
  17. 如权利要求16所述的计算机可读存储介质,其特征在于,所述预设信息格式为三元组格式,所述预设信息格式的信息包括信息的标识码、信息内容数据和该信息对应的权重数据。The computer readable storage medium according to claim 16, wherein the preset information format is a triplet format, and the information of the preset information format includes an identification code of the information, information content data, and corresponding information. Weight data.
  18. 如权利要求17所述的计算机可读存储介质,其特征在于,所述步骤B包括:The computer readable storage medium of claim 17, wherein said step B comprises:
    根据所述待检索信息解析获取所述待检索信息中的待检索内容,根据解析出的待检索内容在预先确定的数据库中进行查找获取查询结果,所述查询结果包括所述待检索内容在预先确定的数据库中所处的位置及出现次数,将所述查询结果转化为所述预设信息格式的查询信息;其中,所述查询信息中的标识码为预设的所述待检索内容的标识码,所述查询信息中的信息内容数据为所述待检索内容在预先确定的数据库中所处的位置信息,所述查询信息中的权重数据为所述待检索内容在预先确定的数据库中所处的位置的出现次数;Acquiring and acquiring the to-be-searched content in the to-be-retrieved information according to the to-be-searched information, performing a search in a predetermined database according to the parsed content to be retrieved, and obtaining the query result, where the query result includes the content to be retrieved in advance Determining the location and the number of occurrences in the database, and converting the query result into the query information of the preset information format; wherein the identifier code in the query information is a preset identifier of the content to be retrieved a code, the information content data in the query information is location information of the content to be retrieved in a predetermined database, and the weight data in the query information is that the content to be retrieved is in a predetermined database. The number of occurrences of the location;
    所述步骤C包括:The step C includes:
    根据所述查询信息解析获取所述待检索内容在预先确定的数据库中所处的位置信息及在预先确定的数据库中所处的位置的出现次数,并根据所述待检索内容在预先确定的数据库中所处的位置信息及出现次数进行排序,输出排序后的查询结果。Obtaining, according to the query information, location information of the content to be retrieved in a predetermined database and the number of occurrences of the location in the predetermined database, and according to the content to be retrieved in a predetermined database Sort the position information and the number of occurrences in the middle, and output the sorted query result.
  19. 如权利要求16、17或18所述的计算机可读存储介质,其特征在于, 在所述步骤A之前,还包括:A computer readable storage medium as claimed in claim 16, 17 or 18, wherein Before the step A, the method further includes:
    在收到用户发送的原始检索信息后,对所述原始检索信息利用预设的自然语言处理方式进行处理,提取出所述原始检索信息中的待检索内容。After receiving the original search information sent by the user, the original search information is processed by using a preset natural language processing manner, and the content to be retrieved in the original search information is extracted.
  20. 如权利要求16、17或18所述的计算机可读存储介质,其特征在于,所述预设信息格式的信息包括用户身份数据;所述步骤C还包括:The computer readable storage medium according to claim 16, 17 or 18, wherein the information of the preset information format includes user identity data; and the step C further comprises:
    根据所述查询信息解析获取查询结果及用户身份数据;Obtaining the query result and the user identity data according to the query information;
    根据所述用户身份数据获取该用户的历史检索记录;Obtaining a historical search record of the user according to the user identity data;
    将所述查询结果与该用户的历史检索记录进行匹配,并排序及输出与该用户的历史检索记录相匹配的查询结果。 Matching the query result with the user's historical search record, and sorting and outputting the query result that matches the user's historical search record.
PCT/CN2017/105016 2017-06-25 2017-09-30 Information search method, system, server, and readable storage medium WO2019000697A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201710490375.4 2017-06-25
CN201710490375.4A CN107679055B (en) 2017-06-25 2017-06-25 Information retrieval method, server and readable storage medium

Publications (1)

Publication Number Publication Date
WO2019000697A1 true WO2019000697A1 (en) 2019-01-03

Family

ID=61134005

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2017/105016 WO2019000697A1 (en) 2017-06-25 2017-09-30 Information search method, system, server, and readable storage medium

Country Status (2)

Country Link
CN (1) CN107679055B (en)
WO (1) WO2019000697A1 (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108846066B (en) * 2018-06-06 2020-01-24 上海计算机软件技术开发中心 Visual data analysis method and system
CN109325093A (en) * 2018-08-24 2019-02-12 深圳职业技术学院 Bibliography automatic generation method, device and computer-readable storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1933547A (en) * 2006-06-13 2007-03-21 北京影立驰技术有限公司 System for making global search in wired TV one-way set-top box
CN102270232A (en) * 2011-07-21 2011-12-07 华中科技大学 Semantic data query system with optimized storage
CN102693310A (en) * 2012-05-28 2012-09-26 无锡成电科大科技发展有限公司 Resource description framework querying method and system based on relational database
US20120278305A1 (en) * 2011-04-28 2012-11-01 International Business Machines Corporation Dynamic merging of executable structures in a database system
CN104794247A (en) * 2015-05-14 2015-07-22 东南大学 Integrated query method for multi-structure database

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9406020B2 (en) * 2012-04-02 2016-08-02 Taiger Spain Sl System and method for natural language querying
IN2013CH01237A (en) * 2013-03-21 2015-08-14 Infosys Ltd
CN103258029B (en) * 2013-05-08 2016-09-28 徐峰蕾 Information retrieval method and system
CN104866498A (en) * 2014-02-24 2015-08-26 华为技术有限公司 Information processing method and device
CN105069171B (en) * 2015-08-31 2018-07-13 百度在线网络技术(北京)有限公司 Chinese character inquiry method and system

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1933547A (en) * 2006-06-13 2007-03-21 北京影立驰技术有限公司 System for making global search in wired TV one-way set-top box
US20120278305A1 (en) * 2011-04-28 2012-11-01 International Business Machines Corporation Dynamic merging of executable structures in a database system
CN102270232A (en) * 2011-07-21 2011-12-07 华中科技大学 Semantic data query system with optimized storage
CN102693310A (en) * 2012-05-28 2012-09-26 无锡成电科大科技发展有限公司 Resource description framework querying method and system based on relational database
CN104794247A (en) * 2015-05-14 2015-07-22 东南大学 Integrated query method for multi-structure database

Also Published As

Publication number Publication date
CN107679055B (en) 2021-04-27
CN107679055A (en) 2018-02-09

Similar Documents

Publication Publication Date Title
WO2022134759A1 (en) Keyword generation method and apparatus, and electronic device and computer storage medium
CN107451153B (en) Method and device for outputting structured query statement
CN109670163B (en) Information identification method, information recommendation method, template construction method and computing device
WO2019227585A1 (en) Index-based resume data processing method, device, apparatus, and storage medium
US11308937B2 (en) Method and apparatus for identifying key phrase in audio, device and medium
US11392775B2 (en) Semantic recognition method, electronic device, and computer-readable storage medium
WO2015144065A1 (en) Semantic recognition-based help processing method and device
US20140006429A1 (en) Generation of technical description of report from functional description of report
CN108090351B (en) Method and apparatus for processing request message
US9798776B2 (en) Systems and methods for parsing search queries
JP2013541793A (en) Multi-mode search query input method
CN110457346B (en) Data query method, device and computer readable storage medium
WO2022134355A1 (en) Keyword prompt-based search method and apparatus, and electronic device and storage medium
CN108388606B (en) Method for checking base table field names in Sql sentences and computer equipment
CN112231417A (en) Data classification method and device, electronic equipment and storage medium
US20200342037A1 (en) System and method for search discovery
CN106570095B (en) XML data operation method and equipment
WO2019000697A1 (en) Information search method, system, server, and readable storage medium
CN114357480A (en) Data security query method, device and equipment based on SQL (structured query language) blood relationship
WO2019080412A1 (en) Data service method, electronic device and storage medium
WO2019071907A1 (en) Method for identifying help information based on operation page, and application server
CN113010542A (en) Service data processing method and device, computer equipment and storage medium
US10733213B2 (en) Structuring unstructured machine-generated content
CN110069489B (en) Information processing method, device and equipment and computer readable storage medium
CN114416847A (en) Data conversion method, device, server and storage medium

Legal Events

Date Code Title Description
NENP Non-entry into the national phase

Ref country code: DE

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 17915866

Country of ref document: EP

Kind code of ref document: A1

122 Ep: pct application non-entry in european phase

Ref document number: 17915866

Country of ref document: EP

Kind code of ref document: A1