CN114519128A - Method for combining and displaying multiple information sources in search automatic completion - Google Patents
Method for combining and displaying multiple information sources in search automatic completion Download PDFInfo
- Publication number
- CN114519128A CN114519128A CN202011296586.2A CN202011296586A CN114519128A CN 114519128 A CN114519128 A CN 114519128A CN 202011296586 A CN202011296586 A CN 202011296586A CN 114519128 A CN114519128 A CN 114519128A
- Authority
- CN
- China
- Prior art keywords
- search
- user
- record
- name
- value
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/903—Querying
- G06F16/9032—Query formulation
- G06F16/90324—Query formulation using system suggestions
- G06F16/90328—Query formulation using system suggestions using search space presentation or visualization, e.g. category or range presentation and selection
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Mathematical Physics (AREA)
- Computational Linguistics (AREA)
- Data Mining & Analysis (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The invention provides a method for combining and displaying multiple information sources in search automatic completion, which is characterized by comprising the following steps: receiving a search request of a user; searching a prefix tree according to the search terms in the user search request to obtain a candidate completion term result; comparing the similarity between the search word in the user search request and the record name in the affinity relation table, and adding the record name into the candidate completing word result when the similarity reaches a preset threshold value; and sorting and returning the candidate completing word results. The method for combining and displaying multiple information sources in the automatic completion of search enables multiple information sources to be displayed in the candidate frame at the same time. The search requirements of various information in the small red book APP are met, a more convenient search entry is provided for the user, and the use experience of the user is improved.
Description
Technical Field
The invention relates to the field of information search, in particular to a method for automatically completing search terms input by a user.
Background
With the advent of the information age, information and data have been growing explosively. The importance of information search is increasingly prominent, and the requirements on the efficiency and accuracy of retrieval are also increasingly high. The automatic completion of the search terms is an important technology for improving the search efficiency and accuracy. The traditional search term automatic completion only focuses on the information of the search term, and the characteristics of an information source and an information base to be retrieved are not considered during completion, so that the completion technology is suitable for general searches of Google, Baidu and the like. But for a particular kind of information source, for example: aiming at one or more specific information sources such as notes, commodities, stars, enterprise numbers and concerned large V, when a user searches, if the user only completes the search words based on the self semantics or completes the search words based on a completion system trained by the data of the whole network search words, the accuracy of the search results is greatly reduced.
In addition, in information sources related to personal information, such as large V and star, the keyword of the retrieval target often contains an unusual character, for example, the real user name of "shandona" is "shandona", and even personalized special characters and character patterns are also provided, and in these cases, it is difficult to complement a proper associative word by using the existing search word automatic complementing mode.
Finally, the existing search completion system generally cannot solve personalized search completion, for example, a user a wants to retrieve a homepage of a user B, but the user B is not a popular or popular user, and the existing search term completion method generally cannot complete automatic completion at this time.
Disclosure of Invention
In view of the above drawbacks of the prior art, an object of the present invention is to provide a method for combining and displaying multiple information sources in search automatic completion, which is used for improving the accuracy of automatic completion of search terms and improving user experience in a specific information source search scenario.
To achieve the above and other related objects, the present invention provides a method for combining and presenting multiple information sources in search autocompletion, comprising: receiving a search request of a user; searching a prefix tree according to the search terms in the user search request to obtain a candidate completion term result; comparing the similarity between the search word in the user search request and the record name in the affinity relation table, and adding the record name into the candidate completing word result when the similarity reaches a preset threshold value; and sorting and returning the candidate completing word results.
Preferably, in the method for combining and presenting multiple information sources in search autocompletion, the prefix tree is constructed by: recording the searching behavior of the user within a period of time T; acquiring the name and ID value of each record in an information source; and constructing the prefix tree according to the user searching behavior and the name and the ID value recorded in the information source.
Preferably, in the method for combining and presenting multiple information sources in search autocompletion, the nodes of the prefix tree include a search term character string, and further include one or more attributes of a record name, an ID value, a heat degree, a category, and a frequency associated with the search term.
Preferably, in the method of integrating and presenting multiple information sources in search autocompletion, the affinity table includes record names and record IDs that are frequently focused by the user.
Preferably, in the method of integrating and presenting multiple information sources in search autocompletion as described above, the logical structure of the affinity table is in the form of a "key-value" pair, where "key" is the ID of the user and "value" is a list of records that the user often focuses on.
Preferably, in the method of combining and presenting multiple information sources in search autocompletion as described above, the record list includes the record name and the record ID.
The invention also provides a prefix tree for automatically completing the search terms, which is characterized by being constructed by the following method: recording the searching behavior of the user within a period of time T; acquiring the name and ID value of each record in an information source; and constructing the prefix tree according to the user searching behavior and the name and the ID value recorded in the information source.
Preferably, in the prefix tree for automatic completion of search terms, nodes of the prefix tree include search term character strings, and further include one or more attributes of record names, ID values, heat degrees, categories, and frequencies related to the search terms.
The method for combining and displaying multiple information sources in search automatic completion enables multiple information sources to be displayed in the candidate frame at the same time. The search requirements of various information in the small red book APP are met, a more convenient search entry is provided for the user, and the use experience of the user is improved.
Drawings
FIG. 1 is a flow chart of the operation of the automatic completion system for search terms according to the present invention.
Detailed Description
The embodiments of the present invention are described below with reference to specific embodiments, and other advantages and effects of the present invention will be easily understood by those skilled in the art from the disclosure of the present specification. The invention is capable of other and different embodiments and of being practiced or of being carried out in various ways, and its several details are capable of modification in various respects, all without departing from the spirit and scope of the present invention.
Please refer to the attached drawings. It should be noted that the drawings provided in the present embodiment are only for illustrating the basic idea of the present invention, and the components related to the present invention are only shown in the drawings rather than drawn according to the number, shape and size of the components in actual implementation, and the type, quantity and proportion of the components in actual implementation may be changed freely, and the layout of the components may be more complicated.
The invention firstly discloses a prefix tree construction method for searching automatic completion. In the existing prefix tree for searching automatic completion, each node is usually composed of a character string of a search term or a substring thereof, and based on the prefix tree, the automatic completion which can be provided can only be based on the semantics of the search term. In order to provide better search automatic completion for user experience based on different information source characteristics, user retrieval habits and the like, a new prefix tree needs to be respectively constructed for each information source, wherein the information sources can be user (enterprise) information sources, commodity information sources, note information sources and the like. The prefix tree construction process of the present invention is as follows.
Firstly, recording the searching behavior of the user within a period of time T, wherein the recorded information can comprise search terms provided by the user, the use frequency of the search terms, the input time of the search terms, the use sequence of the search terms and the like, and specifically recording which information and the number of the T period, and the increase and the deletion can be carried out according to the actual requirement. In this example, assuming that the T period is set to 2 months, the recorded information is described only with a search word for the sake of simplicity of description.
Then, the name and ID value of each record in the information source are obtained. Where the name of the record refers to the field information in the record that facilitates a person's call. Taking a star information source as an example, the information source records information such as a user name, a homepage address, and the like of the star, and the name recorded in the information source is the user name of the star, such as "shanyang na Nana" in the foregoing. The ID value of a record refers to a value that is easy for a computer to identify and is used to uniquely locate the record, and is typically an identifier that the database assigns to each record and that the user uniquely identifies one record. In this step, in addition to the recorded name and ID value, the recorded heat information and category information may be acquired simultaneously as needed.
And finally, constructing a candidate word prefix tree, wherein each node of the prefix tree not only comprises a search word character string, but also comprises information such as record names, ID values, heat degrees, categories, frequencies and the like related to the search words.
It should be noted that, for a user who uses the retrieval system for the first time, since there is no history data of the search behavior, a default prefix tree may be provided for the user, and after there is data of the search behavior of the user, the default prefix tree is modified and adjusted.
In order to better provide personalized autocompletion for the user, it is preferable to construct an affinity list in addition to the prefix tree described above. Through the historical search behaviors of the user, a group of search objects which are frequently concerned by the user can be mined, and an affinity list can be maintained for the user on the basis of the search objects. The logical structure of the list may be in the form of a "key-value" pair, where a "key" may use the user's ID and a "value" may be a list including record names and record IDs that the user often retrieves, focuses on, such as the user name of the star and the record ID of the star. In practice, it is usually necessary to continuously maintain and update the affinity list according to the search behavior record of the user, and the maintenance and update period depends on the application requirement, in this example, the maintenance is performed by once a day.
Based on the prefix tree and the affinity relationship table, the search term completion method of the present invention is shown in fig. 1. The process begins by receiving a search request submitted by a user, including a search term entered by the user.
S1 shows that the search term input by the user searches the prefix tree to obtain the preliminary search completion term result, and the nodes of the prefix tree already contain the information such as the record name, ID value, heat, category, etc., so the candidate completion term result also contains the information.
S2 represents sorting the candidate results. This step is the preferred step and the ranking criteria can be adjusted according to the business and actual needs. In this example, the ranking criteria is the popularity of the search terms, and in some other applications, the ranking may also be based on user preferences, relevance, and other factors.
S3 shows that the high density relation table is checked and the similarity between the search word inputted by the user and the record name in the "value" list of the affinity relation table is compared. If a certain similarity threshold is reached, the record is also added to the candidate list. Taking the star data source as an example, the similarity between the search word input by the user and the star user name in the list is compared, and if the similarity exceeds a preset threshold value, the record of the star is also added into the candidate list. In this step, further optimization can be performed for the characteristics of the information source: for example, for the note information source or the merchandise information source, the number of notes or the number of merchandise may be further shown in the candidate list.
The above only describes the processing steps for one information source, and if there are multiple information sources, the above steps may be respectively adopted for each information source, and finally, the return results of each information source are integrated and summarized as required and then returned.
In summary, the present invention provides a novel prefix tree, where nodes of the prefix tree can be constructed according to different information source characteristics, and the nodes include the information source characteristics, and based on the novel prefix tree, the search terms can be completed based on the semantics of the search terms input by the user, and also can be completed according to the information source characteristics. Preferably, the invention further provides an affinity relationship list, and the affinity relationship list is combined with the novel prefix tree and the affinity relationship list, so that more accurate and personalized search term completion can be provided for the user, and the user experience is improved.
The foregoing embodiments are merely illustrative of the principles and utilities of the present invention and are not intended to limit the invention. Modifications and variations can be made to the above-described embodiments by those skilled in the art without departing from the spirit and scope of the present invention, for example: when the returned result is displayed to the user, different types of information sources can be distinguished and identified; the above numbering of the steps and the words such as "first" and "second" before the steps are also only for convenience of description, and are not used to limit the order of the steps, and those skilled in the art can perform the steps in parallel, serially, synchronously, asynchronously, or in sequence according to actual situations. Accordingly, it is intended that all equivalent modifications or changes which can be made by those skilled in the art without departing from the spirit and technical spirit of the present invention be covered by the claims of the present invention.
Claims (8)
1. A method for combining and presenting multiple information sources in search autocompletion, comprising:
receiving a search request of a user;
searching a prefix tree according to the search terms in the user search request to obtain a candidate completion term result;
comparing the similarity between the search word in the user search request and the record name in the affinity relation table, and adding the record name into the candidate completing word result when the similarity reaches a preset threshold value;
and sorting and returning the candidate completing word results.
2. The method of integrating and presenting multiple information sources in search autocompletion as recited in claim 1, wherein the prefix tree is constructed by:
recording the searching behavior of the user within a period of time T;
acquiring the name and ID value of each record in an information source;
and constructing the prefix tree according to the user searching behavior and the name and the ID value recorded in the information source.
3. The method of claim 2, wherein the nodes of the prefix tree comprise a search term string and further comprise one or more attributes of record name, ID value, heat, category, frequency associated with the search term.
4. The method of integrating and presenting multiple information sources in search autocompletion as recited in claim 1, wherein the affinity table comprises record names and record IDs that are of frequent interest to the user.
5. The method for integrating and presenting multiple information sources in search autocompletion according to claim 4, wherein the logical structure of the affinity table is in the form of a "key-value" pair, wherein "key" is the ID of the user and "value" is a list of records that the user is interested in frequently.
6. The method of integrating and presenting multiple information sources in search autocompletion as recited in claim 5, wherein the list of records comprises the record name and the record ID.
7. A prefix tree for automatic completion of search terms is characterized by being constructed by the following method:
recording the searching behavior of the user within a period of time T;
acquiring the name and ID value of each record in an information source;
and constructing the prefix tree according to the user searching behavior and the name and the ID value recorded in the information source.
8. The method of claim 7, wherein the nodes of the prefix tree comprise a search term string and further comprise one or more attributes of record name, ID value, heat, category, frequency associated with the search term.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011296586.2A CN114519128A (en) | 2020-11-18 | 2020-11-18 | Method for combining and displaying multiple information sources in search automatic completion |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011296586.2A CN114519128A (en) | 2020-11-18 | 2020-11-18 | Method for combining and displaying multiple information sources in search automatic completion |
Publications (1)
Publication Number | Publication Date |
---|---|
CN114519128A true CN114519128A (en) | 2022-05-20 |
Family
ID=81594573
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202011296586.2A Pending CN114519128A (en) | 2020-11-18 | 2020-11-18 | Method for combining and displaying multiple information sources in search automatic completion |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114519128A (en) |
-
2020
- 2020-11-18 CN CN202011296586.2A patent/CN114519128A/en active Pending
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11789952B2 (en) | Ranking enterprise search results based on relationships between users | |
US20060161545A1 (en) | Method and apparatus for ordering items within datasets | |
US20160078047A1 (en) | Method for obtaining search suggestions from fuzzy score matching and population frequencies | |
US9928296B2 (en) | Search lexicon expansion | |
US8316007B2 (en) | Automatically finding acronyms and synonyms in a corpus | |
KR101231560B1 (en) | Method and system for discovery and modification of data clusters and synonyms | |
US8527506B2 (en) | Media discovery and playlist generation | |
CN103177075B (en) | The detection of Knowledge based engineering entity and disambiguation | |
US20090198693A1 (en) | Method and apparatus for ordering items within datasets | |
CN101495955B (en) | Mobile device retrieval and navigation | |
US7158996B2 (en) | Method, system, and program for managing database operations with respect to a database table | |
US8589411B1 (en) | Enhanced retrieval of source code | |
US7769752B1 (en) | Method and system for updating display of a hierarchy of categories for a document repository | |
US20020073079A1 (en) | Method and apparatus for searching a database and providing relevance feedback | |
US20090094223A1 (en) | System and method for classifying search queries | |
US20040002945A1 (en) | Program for changing search results rank, recording medium for recording such a program, and content search processing method | |
US20100017378A1 (en) | Enhanced use of tags when storing relationship information of enterprise objects | |
JP2007249899A (en) | Retrieval processing program | |
US11308177B2 (en) | System and method for accessing and managing cognitive knowledge | |
US7756798B2 (en) | Extensible mechanism for detecting duplicate search items | |
US11475048B2 (en) | Classifying different query types | |
US11341141B2 (en) | Search system using multiple search streams | |
US20210081415A1 (en) | Query classification alteration based on user input | |
KR100672278B1 (en) | Personalized Search Method Using Bookmark List Of Web Browser And System For Enabling The Method | |
CN112380416A (en) | Method for updating course index, course searching method and device |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |