CN104424342A - Method for keyword matching, and device, server and system of method - Google Patents

Method for keyword matching, and device, server and system of method Download PDF

Info

Publication number
CN104424342A
CN104424342A CN201310413491.8A CN201310413491A CN104424342A CN 104424342 A CN104424342 A CN 104424342A CN 201310413491 A CN201310413491 A CN 201310413491A CN 104424342 A CN104424342 A CN 104424342A
Authority
CN
China
Prior art keywords
keyword
word
query word
phrase
linked database
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201310413491.8A
Other languages
Chinese (zh)
Inventor
叶亚明
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Ctrip Business Co Ltd
Original Assignee
Ctrip Computer Technology Shanghai Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ctrip Computer Technology Shanghai Co Ltd filed Critical Ctrip Computer Technology Shanghai Co Ltd
Priority to CN201310413491.8A priority Critical patent/CN104424342A/en
Publication of CN104424342A publication Critical patent/CN104424342A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation

Abstract

The invention discloses a method for keyword matching and a device, a server and a system of the method. The method comprises the steps of receiving a query word or a query phrase; detecting whether the query word or the query phrase needs to be corrected or not, and correcting the query word or the query phrase if the query word or the query phrase needs to be corrected; searching whether hot words which correspond to the query word or the query phrase in a mapping way exist in a hot word database or not, and taking out the hot words if the hot words exist in the hot word database; searching keywords which are associated with the query word or the query phrase from an associated keyword database, and taking out keywords of which the number is a preset value from the keywords according to the sequence of the association degree from large to small; ranking and outputting all keywords or all keywords and the hot words which are taken out according to the association degree or the association degree and the hot word frequency. The invention also discloses the device, the server and the system which use or correspond to the keyword matching method. According to the method, the device, the server and the system, disclosed by the invention, the searching accuracy is increased, in-time accurate response searching tendency is realized, and the like.

Description

Keyword match method and device, server and system
Technical field
The present invention relates to a kind of keyword match method and device, server and system, particularly relate to a kind of keyword match method and device, server and system of network retrieval.
Background technology
In current internet business model, user has become topmost sales mode according to the demand of oneself in the enterprising line search of product exhibition platform, the self-service mode of selecting and then complete transaction, but, due to indefinite to demand own of user, complete and search condition accurately cannot be provided.Therefore, how, input of incomplete or even wrong fuzzy according to client, provides the prompting of the key word meeting client's real demand, and promoting for the experience of user and business has very important meaning.
Currently needing fuzzy according to client, company or the business department of intelligent decision and key word prompting are made in the input of imperfect even mistake, particularly product category is various, user cannot completely clear and definite demand own and in the uncertain situation of input caused, present Search Hints functional realiey all more elementary, result for retrieval is often similar with the fog-level of user's input, thus cause result for retrieval inaccurate, and cannot be inclined to by accurate assurance user search, thus the change of retrieval tendentiousness can not be reacted timely and accurately, and then cause the inaccurate of result for retrieval further, or even mistake, therefore greatly have impact on the experience of user, then the professional skill of company is had influence on.
Summary of the invention
The technical problem to be solved in the present invention be the result for retrieval of retrieval mode in order to overcome prior art inaccurate, the defects such as retrieval tendentiousness change can not be reacted timely and accurately, a kind of keyword match method and device, server and system are provided, combine by hot word and key word the accuracy that the mode retrieved improves retrieval, and be inclined to by the historical data reaction retrieval that more new database is come promptly and accurately.
The present invention solves above-mentioned technical matters by following technical proposals:
The invention provides a kind of keyword match method, be characterized in, comprise the following steps:
S1, receive a query word or inquiry phrase;
Query word described in the present invention or inquiry phrase are that user or client need inquiry or the word searched or symbol etc., wherein said inquiry phrase is made up of multiple queries word, and the mode of multiple queries word composition phrase can be arbitrary, such as according to a definite sequence, or the mode of set etc.
S2, detect described query word or inquiry phrase the need of error correction, if desired described query word or inquiry phrase are then revised in error correction, otherwise enter step S3;
Wherein said error correction is error correction and the correcting mode of semanteme conventional in the Language Processing mode commonly used in currently available technology or spelling etc., so the present invention is no longer described in detail principle and the error correction procedure of described error correction.
In order to ensure the accuracy that subsequent key word and hot word are searched in the present invention, needing especially to check that whether the query word of acquisition or the semanteme of inquiry phrase or spelling be normal in advance, and carrying out the amendments such as correction to the query word or phrase that there are the problems such as spelling.
S3, search from a hot word database whether exist mapping pair should described query word or inquiry phrase hot word, if exist, take out described hot word, enter step S4 if do not exist;
Described hot word be exactly especially be characterized in certain hour in web search technology in prior art during in be queried or search the very high word of number of times or phrase, so described hot word database is exactly for storing the database that these were queried or searched the very high word of number of times or phrase.
The present invention is also also associated with the hot word during special time to query word and searching of inquiry phrase, and then improves the accuracy of searching.
S4, search with described query word from a keyword linked database or inquire about the keyword that phrase associates, and from described keyword, taking out according to degree of association order from big to small the keyword that quantity is a default value;
Described keyword is equally also the word for the index as commodity or service etc. especially prestored in web search technology in prior art, the wherein said degree of association be characterize keyword or corresponding relation relevant with it word between matching degree, and the setting of the described degree of association and value etc. can set according to actual needs, and concrete setting means can utilize the setting meanss such as the degree of association of existing network search technique, so do not do any restriction to the setting of the degree of association in the present invention, as long as can be characterized by the degree of association between the word making keyword in the present invention and associate with it.
S5, each keyword taken out or each keyword and hot word are sorted according to the degree of association or the degree of association and hot word frequency rate and export.
The present invention obtains by hot word and keyword the keyword and hot contamination that mate most with query word or phrase jointly, and then provides the accuracy of inquiry.And above-mentioned sortord can be any one the existing sortord based on the degree of association and hot word frequency rate.
Preferably, also comprise in step S1: the geographical location information obtaining the source place of described query word or inquiry phrase, and described geographical location information is added described query word or inquiry phrase.
Obtain the geographical location information in described source in the present invention from the source obtaining query word or phrase, and improve the accuracy of retrieval with described geographical location information as the supplementary of searching further.Wherein said geographical location information refers to by the information that can characterize the Composition of contents such as the geographic position in source in this technical field of especially navigator fix in prior art.
Preferably, further comprising the steps of after step s 5:
S6, receiving feedback information, detect in each keyword or each keyword and hot word whether containing taking-up in feedback information one or more, if, record the keyword in described query word or inquiry phrase and feedback information or keyword and hot word, otherwise record each keyword of described query word or inquiry phrase and taking-up or each keyword and hot word.
In the present invention also by the content of the feedback information from user or client judge user whether have selected output for the keyword of pointing out and hot word, and have selected output for the keyword of pointing out and hot word time, the keyword select user or client and hot word and query word or inquiry phrase entirety are recorded as successful case, otherwise as failed case, by query word or inquiry phrase with inquire about each keyword of obtaining and hot word is recorded as a whole.Data Source is provided by this analysis and comparison being recorded as follow-up data.
Preferably, further comprising the steps of after described step S6:
S71, when apart from last keyword linked database update time more than a preset time period time, calculate based on the keyword in the described query word recorded in last keyword linked database update time to current time or inquiry phrase and feedback information and/or described query word or inquiry phrase and each keyword of taking-up and the time point of record and upgrade the degree of association of each keyword in described keyword linked database.
Wherein the computing method of the degree of association of keyword described in the present invention can use any existing calculation of relationship degree algorithm or formula etc.The present invention utilizes the successful inquiring case and failed case that record in certain hour section to adjust the degree of association of each keyword in keyword linked database, thus the accuracy of Optimizing Queries further.Because data sample in this update mode is large, so can the better degree of association accurately revising each keyword.
Preferably, described step S71 is:
When apart from last keyword linked database update time more than a preset time period and current time is in a permitted hours section time, calculate based on the keyword in the described query word recorded in last keyword linked database update time to current time or inquiry phrase and feedback information and/or described query word or inquiry phrase and each keyword of taking-up and the time point of record and upgrade the degree of association of each keyword in described keyword linked database.
Wherein said permitted hours section is that user can need to set arbitrarily according to idle degrees such as systems, because whole updating keyword linked database can spend plenty of time system resource, so this renewal is arranged in special time period in the present invention the situation such as avoid system busy to upgrading and the impact of other application.
Preferably, further comprising the steps of after described step S6:
S72, calculate based on the keyword in the described query word of record or inquiry phrase and feedback information or described query word or inquiry phrase and each keyword of taking-up and upgrade the degree of association of each keyword in described keyword linked database.
The computing method of the degree of association of same described keyword can use any existing calculation of relationship degree algorithm or formula etc., the present invention can also in real time more new keywords linked database each be queried ground keyword the degree of association, the degree of association of more new keywords can be carried out with full out speed in this way, thus make can reacting the Query Result of user in time of whole system.
Preferably, further comprising the steps of after described step S71 or S72 or S6:
S73, from external search engine and/or described keyword linked database, obtain the word that access frequency is more than or equal to a visit frequency threshold value, and in hot word database, record the access frequency of described word and described word.
Not only utilize word that in keyword linked database, access frequency is high to upgrade hot word database in the present invention, can also be obtained by external search engine and upgrade hot word database, thus keep the real-time of hot word, and then improve inquiry accuracy.
External search engine described in the present invention is any one or more in existing search engine, such as Google, Baidu and Yahoo etc.
Preferably, described step S73 is:
From external search engine and/or described keyword linked database, obtain access frequency in the multiple time periods apart from current time different time length be more than or equal to the word of the visit frequency threshold value corresponding respectively to the time period described in each respectively, and in hot word database, record the access frequency of described word and described word.
That is, detect respectively in the present invention in different time length, the word that access frequency is high is also recorded in hot word database, and the access frequency due to hot word increases the express feature of speed, for the time period of each different length is provided with different visit frequency threshold value respectively, thus identify hot word further.
Present invention also offers a kind of keyword match device, be characterized in, described keyword match device comprises:
One receiver module, for receiving a query word or inquiry phrase;
One correction module, for detect described query word or inquiry phrase the need of error correction, if desired error correction and revise described query word or inquiry phrase;
One hot word and search module, for searching and taking out the hot word that mapping pair answers described query word or inquiry phrase from a hot word database;
One keyword retrieval module, for searching with described query word from a keyword linked database or inquiring about the keyword that phrase associates, and takes out according to degree of association order from big to small the keyword that quantity is a default value from described keyword;
One sequence output module, for sorting by each keyword taken out or each keyword and hot word according to the degree of association or the degree of association and hot word frequency rate and export.
Preferably, described geographical location information also for obtaining the geographical location information at the source place of described query word or inquiry phrase, and is added described query word or inquiry phrase by described hot word and search module.
Preferably, described keyword match device also comprises:
One feedback information detection logging modle, for receiving feedback information, detect in each keyword or each keyword and hot word whether containing taking-up in feedback information one or more, if, record the keyword in described query word or inquiry phrase and feedback information or keyword and hot word, otherwise record each keyword of described query word or inquiry phrase and taking-up or each keyword and hot word.
Preferably, described keyword match device also comprises:
One keyword linked database update module, for when apart from last keyword linked database update time more than a preset time period time, calculate based on the keyword in the described query word recorded in last keyword linked database update time to current time or inquiry phrase and feedback information and/or described query word or inquiry phrase and each keyword of taking-up and the time point of record and upgrade the degree of association of each keyword in described keyword linked database.
Preferably, described keyword linked database update module also for when apart from last keyword linked database update time more than a preset time period and current time is in a permitted hours section time, calculate based on the keyword in the described query word recorded in last keyword linked database update time to current time or inquiry phrase and feedback information and/or described query word or inquiry phrase and each keyword of taking-up and the time point of record and upgrade the degree of association of each keyword in described keyword linked database.
Preferably, described keyword match device also comprises:
One real time critical word association database update module, for calculating based on the described query word of record or each keyword of the keyword inquired about in phrase and feedback information or described query word or inquiry phrase and taking-up and upgrade the degree of association of each keyword in described keyword linked database.
Preferably, described keyword match device also comprises:
One hot word database update module, for obtaining the word that access frequency is more than or equal to a visit frequency threshold value from external search engine and/or described keyword linked database, and records the access frequency of described word and described word in hot word database.
Preferably, described hot word database update module is also more than or equal to the word of the visit frequency threshold value corresponding respectively to the time period described in each respectively for obtaining access frequency in the multiple time periods apart from current time different time length from external search engine and/or described keyword linked database, and in hot word database, record the access frequency of described word and described word.
Present invention also offers a kind of retrieval server, be characterized in, described retrieval server uses keyword match method as above.
Preferably, described retrieval server is small-size computer, mainframe computer or Distributed Computer System.
Present invention also offers a kind of keyword match system, be characterized in, described keyword match system comprises a server and some clients; Described server comprises a hot word database, a keyword linked database and a processing unit;
Wherein said processing unit receives a query word or inquiry phrase from described client, and detects described query word or inquire about phrase the need of error correction, and if desired described query word or inquiry phrase are then revised in error correction;
Described processing unit is also searched respectively and is taken out the hot word that mapping pair answers described query word or inquiry phrase from described hot word database, search with described query word from described keyword linked database or inquire about the keyword that phrase associates, and from described keyword, take out according to degree of association order from big to small the keyword that quantity is a default value, then each keyword taken out or each keyword and hot word are sorted according to the degree of association or the degree of association and hot word frequency rate and export described client to.
Preferably, described geographical location information also for obtaining the geographical location information of the client exporting described query word or inquiry phrase, and is added described query word or inquiry phrase by described processing unit.
Preferably, described server also comprises a record cell, described processing unit is from described client receiving feedback information, and detect in feedback information in each keyword or each keyword and hot word whether containing taking-up one or more, if, keyword in query word described in described recording unit records or inquiry phrase and feedback information or keyword and hot word, otherwise each keyword of query word described in described recording unit records or inquiry phrase and taking-up or each keyword and hot word.
Preferably, described processing unit, when once keyword linked database update time is more than a preset time period in distance, calculates based on the keyword in the described query word of described recording unit records in last keyword linked database update time to current time or inquiry phrase and feedback information and/or described query word or inquiry phrase and each keyword of taking-up and the time point of record and upgrades the degree of association of each keyword in described keyword linked database.
Preferably, described processing unit when in distance once keyword linked database update time more than a preset time period and current time is in a permitted hours section time, calculate based on the keyword in the described query word of recording unit records in last keyword linked database update time to current time or inquiry phrase and feedback information and/or described query word or inquiry phrase and each keyword of taking-up and the time point of record and upgrade the degree of association of each keyword in described keyword linked database.
Preferably, described processing unit calculates based on the described query word of recording unit records or each keyword of the keyword inquired about in phrase and feedback information or described query word or inquiry phrase and taking-up and upgrades the degree of association of each keyword in described keyword linked database.
Preferably, described processing unit obtains the word that access frequency is more than or equal to a visit frequency threshold value from external search engine and/or described keyword linked database, and in hot word database, record the access frequency of described word and described word.
Preferably, described processing unit obtains access frequency in the multiple time periods apart from current time different time length and is more than or equal to the word of the visit frequency threshold value corresponding respectively to the time period described in each respectively from external search engine and/or described keyword linked database, and in hot word database, record the access frequency of described word and described word.
Preferably, described client is mobile terminal.Described in certain the present invention, client is not limited only to mobile terminal, the electronic equipment of all right any kind.
For convenience of description, described server is divided into various module according to function and describes respectively by the present invention, so when implementing of the present invention, the function of each module can be realized in same or multiple software and/or hardware.
On the basis meeting this area general knowledge, above-mentioned each optimum condition, can combination in any, obtains the preferred embodiments of the invention.
Positive progressive effect of the present invention is:
Keyword match method of the present invention and device, server and system, the accuracy that the mode retrieved improves retrieval is combined by hot word and key word, and be inclined to by the historical data reaction retrieval that more new database is come promptly and accurately, namely realize the intellectuality retrieved.
Thus improve the intelligent level of Search Hints function, promote Consumer's Experience, especially good support is had to mobile subscriber, have in specific words: user by uncertain input can successfully obtain meet its really the prompting of needs keyword ratio increase, the mean place that the key word of the real selection of user occupies in all prompting sequences shifts to an earlier date, and key word prompting result can carry out continuable dynamic optimization etc. according to the feedback of user.
Accompanying drawing explanation
Fig. 1 is the structural representation of the keyword match system of embodiments of the invention 1.
Fig. 2 is the process flow diagram of the keyword match method of embodiments of the invention 1.
Embodiment
Mode below by embodiment further illustrates the present invention, but does not therefore limit the present invention among described scope of embodiments.
The present invention mainly comprises following two aspects:
First aspect, how to build and trasaction key linked database and hot word database, namely how design key word source is as knowledge base, and the historical keyword receiving local search also can initiatively record the respective queries key word coming from other outside entrances such as search engine simultaneously.
How second aspect, design retrieval mode more accurately, namely introduces the subsidiary condition of positional information dimension as inquiry.The relational degree taxis algorithm of reminder item has been done the design optimization catering to instant focus.
The effect obtained by keyword match of the present invention is that the accuracy of Search Results of prompting has had and significantly improves, and Consumer's Experience also more original Search Results prompting mode is greatly improved.After keyword match of the present invention, the corresponding Search Results of the key word that user can have the probability of more than 90% really to be wanted after uncertain input, the Search Results exporting prompting user selection appears at former positions of Search Results sequence with higher frequency, user feedback mechanisms provides continuable chess game optimization.
Below by following embodiment, the present invention is explained further.
Embodiment 1
As described in Figure 1, the keyword match system of the present embodiment comprises server 1 and a mobile terminal 2, the quantity of wherein said mobile terminal can be arbitrary, and the server 1 of the keyword match system of this enforcement can also carry out data interaction with other electronic equipments.
And described server 1 can be applicable to the hardware device of server for small-size computer, mainframe computer or Distributed Computer System etc.
Server 1 described in the present embodiment comprises hot word database 11, keyword linked database 12, processing unit 13 and a record cell 14.
Described hot word database 11 and keyword linked database 12 have recorded hot word and the information such as keyword and the degree of association thereof respectively.
Described processing unit 13 receives a query word from described mobile terminal 2, and detects described query word the need of error correction, and if desired described query word or inquiry phrase are then revised in error correction.
Described processing unit 13 is also searched respectively and is taken out the hot word that mapping pair answers described query word from described hot word database 11, the keyword associated with described query word is searched from described keyword linked database 12, and from described keyword, take out according to degree of association order from big to small the keyword that quantity is default value N, such as take out 5 keywords, then the keyword of taking-up or keyword and hot word are sorted according to the degree of association or the degree of association and hot word frequency rate and export described mobile terminal 2 to.
And wherein said processing unit 13 can obtain the geographical location information of described mobile terminal 2, and described geographical location information is added described query word.
Described record cell 14 is when described processing unit 13 detects part or all of in each keyword or each keyword and hot word whether containing taking-up from described mobile terminal 2 receiving feedback information, if, described record cell 14 records keyword in described query word and feedback information or keyword and hot word, otherwise described record cell 14 records each keyword or each keyword and hot word that described query word and described processing unit 13 export.
In the present embodiment, processing unit 13 can also upgrade described keyword linked database 12 contents, specifically, described processing unit 13 when distance on once more the time of new keywords linked database 12 exceed preset time period T and current time is in permitted hours section T ' time, keyword in the described query word recorded based on record cell 14 described in last update time to the time period Tt of current time and feedback information and/or the time point of each keyword of described query word and taking-up and the record of above-mentioned data calculate and upgrade the degree of association of each keyword in described keyword linked database 12.
Above-mentioned update mode is in a kind of database whole updating mode, data after renewal can react the variation tendency of user's query word or phrase significantly, but this update time is long, take resource many, so described processing unit 13 content that can also record according to described record cell 14 more new database in real time in another embodiment, although can not the variation tendency of realization response user query word or phrase completely, can make corresponding to the change of user's query word or phrase quickly.
Specifically, the keyword in the described query word that records based on record cell 14 of described processing unit 13 and feedback information or each keyword of described query word and taking-up calculate and upgrade the degree of association of each keyword in described keyword linked database 12.
In addition processing unit 13 described in the present embodiment from external search engine and/or described keyword linked database 12, obtain distance current time different time length multiple time periods in access frequency be more than or equal to the word of the visit frequency threshold value P corresponding respectively to the time period described in each respectively, and in hot word database 11, record the access frequency of described word and described word.
That is, described processing unit 13 detects respectively in different time length, and the word that access frequency is high is also recorded in hot word database 11.
Therefore specifically, as shown in Figure 2, the keyword match system of the present embodiment keyword match method comprise the following steps flow process:
One query word of S1, server 1 mobile terminal receive 2 input, and obtain the geographical location information of mobile terminal 2, and described geographical location information is added described query word.
S2, processing unit 13 detect described query word the need of error correction, and if desired described query word is then revised in error correction, otherwise enters step S3.If need error correction, such as, phonetic " rujia hotel " is treated in " as hotel of family " etc.
Whether S3, processing unit 13 are searched from hot word database 11 exists the hot word that mapping pair answers described query word, if exist, takes out described hot word, enters step S4 if do not exist.
S4, processing unit 13 search the keyword associated with described query word from keyword linked database 12, and from described keyword, take out according to degree of association order from big to small the keyword that quantity is default value N.Such as take out the keyword of quantity 5.
Each keyword taken out or each keyword and hot word sort according to the degree of association or the degree of association and hot word frequency rate and export mobile terminal 2 to by S5, processing unit 13.
S6, processing unit 13 receiving feedback information, detect in each keyword or each keyword and hot word whether containing taking-up in feedback information one or more, if, record cell 14 records keyword in described query word and feedback information or keyword and hot word, otherwise records each keyword of described query word and taking-up or each keyword and hot word.
If such as user to have selected in the result for retrieval of output certain keyword or keyword and hot word, system, by under respective data record, is namely recorded as successful case.Re-enter if user have selected, system also by under respective data record, is namely recorded as unsuccessfully case.
S7, when exceeding preset time period T update time apart from last keyword linked database 12 and current time is in permitted hours section T ', processing unit 13 calculates based on the keyword in the described query word recorded in last keyword linked database 12 update time to the time period Tt of current time and feedback information and/or each keyword of described query word and taking-up and the time point of record and upgrades the degree of association of each keyword in described keyword linked database 12.
S8, processing unit 13 obtain access frequency in the multiple time periods apart from current time different time length and are more than or equal to the word of the visit frequency threshold value P corresponding respectively to the time period described in each respectively from external search engine and/or described keyword linked database 12, and in hot word database 11, record the access frequency of described word and described word.
Such as, from external search engine and/or described keyword linked database 12, pull any word in step S8, whether check the access frequency of word in 1 minute more than 1000 times, if, then add hot word database 11, but also check word frequency whether in 1 hour higher than 10,000 times, if, add hot word database 11, and check further word frequency whether in 1 day higher than 200,000 times, if so, add hot word database 11.In addition those skilled in the art can do arbitrary setting and adjustment to the time point detected, time span and visit frequency threshold value P, and then can optimize the accuracy and the real-time that hot word are gone out to crawl further.
Described step S7 is a kind for the treatment of step of comprehensive renewal as mentioned above, in order to strengthen the reaction velocity to user's inquiry, to be processing unit 13 calculate based on each keyword of the keyword in the described query word of record and feedback information or described query word and taking-up and upgrade the degree of association of each keyword in described keyword linked database 12 described step S7 ' in another embodiment.
Being the set of all kinds of reminder item and the degree of association, is the value in dynamic change, is also the result of each iteration of algorithm
Specifically, in the treatment step of this comprehensive renewal of step S7, first check and whether meet complete update condition, determined by the time, check whether the nearest once full time interval upgraded reaches predetermined value, it is generally 24 hours, then the data in pulling data storehouse in historical data degree of association calculation can be carried out, generate new value, also can carry out corresponding weighed value adjusting according to time attribute simultaneously, and security update is generally set in system between 0 o'clock to the 3 o'clock comparatively idle time period and carries out, substantially all reminder items and the degree of association can be upgraded after completing calculating.
Step S7 ' is incremental update step, incremental update mainly calculates current hotspot data query, the data pulled are mainly from not calculated short term memory increment, data volume is smaller, what calculate is consuming time smaller, but to the response of immediate inquiring focus by very large lifting effect, after completing calculating, also corresponding reminder item and the degree of association can be upgraded
In addition step S7 ' also can judge whether to need to upgrade by the time, but now the time interval shorter, generally in a minute rank.
And its those skilled in the art also it should be noted that step S7, S7 ' and S8 perform always, in order to maintain the ageing of the data in reminder-data storehouse and correctness.
Wherein those skilled in the art it should be noted that step S7, S7 ' and S8 also can use simultaneously, namely in whole Keywords matching flow process, can use step S7, S7 ' and S8.In addition step S7, order between S7 ' and S8 are can be arbitrary, are not limited only to the particular order pointed by the present embodiment.
The change of some evaluation indexes of the result for retrieval after the present embodiment process is as follows: the hit rate of result for retrieval, namely user can find the exact service or commodity wanted and not need the probability of the manual input of second time after first time input from result for retrieval, has brought up to about 93% from original 71%.
The average precedence of result for retrieval, refers to the sequence precedence of result for retrieval item in all result for retrieval items that user selects.According to general theory, user can not like the prompting more than more than 10, and more tends to preceding prompting of sorting.In this tolerance, from average 4.5(, the prompting that user selects means that user has checked all options substantially) drop to 2.1 times (mean user can at First view determination option) left and right.
Known by the description of the embodiment of above keyword match system, those skilled in the art can be well understood to the mode that the application can add required general hardware platform by software and realize.Based on such understanding, the technical scheme of the application can embody with the form of software product the part that prior art contributes in essence in other words, this computer software product can be stored in storage medium, as ROM/RAM(ROM (read-only memory)/random access memory), magnetic disc, CD etc., comprising some instructions in order to make a computer equipment (can be personal computer, server, or the network equipment etc.) perform the method described in some part of each embodiment of the application or embodiment.
The application can be used in numerous general or special purpose computing system environment or configuration.Such as: personal computer, server computer, handheld device or portable set, laptop device, multicomputer system, the system based on microprocessor, set top box, programmable consumer-elcetronics devices, network PC(PC), small-size computer, mainframe computer, the distributed computing environment comprising above any system or equipment etc.
The application can describe in the general context of computer executable instructions, such as program module.Usually, program module comprises the routine, program, object, assembly, data structure etc. that perform particular task or realize particular abstract data type.Also can put into practice the application in a distributed computing environment, in these distributed computing environment, be executed the task by the remote processing devices be connected by communication network.In a distributed computing environment, program module can be arranged in the local and remote computer-readable storage medium comprising memory device.
Although the foregoing describe the specific embodiment of the present invention, it will be understood by those of skill in the art that these only illustrate, protection scope of the present invention is defined by the appended claims.Those skilled in the art, under the prerequisite not deviating from principle of the present invention and essence, can make various changes or modifications to these embodiments, but these change and amendment all falls into protection scope of the present invention.

Claims (27)

1. a keyword match method, is characterized in that, described keyword match method comprises the following steps:
S1, receive a query word or inquiry phrase;
S2, detect described query word or inquiry phrase the need of error correction, if desired described query word or inquiry phrase are then revised in error correction, otherwise enter step S3;
S3, search from a hot word database whether exist mapping pair should described query word or inquiry phrase hot word, if exist, take out described hot word, enter step S4 if do not exist;
S4, search with described query word from a keyword linked database or inquire about the keyword that phrase associates, and from described keyword, taking out according to degree of association order from big to small the keyword that quantity is a default value;
S5, each keyword taken out or each keyword and hot word are sorted according to the degree of association or the degree of association and hot word frequency rate and export.
2. keyword match method as claimed in claim 1, is characterized in that, also comprise in described step S1: the geographical location information obtaining the source place of described query word or inquiry phrase, and described geographical location information is added described query word or inquiry phrase.
3. keyword match method as claimed in claim 1, is characterized in that, further comprising the steps of after step s 5:
S6, receiving feedback information, detect in each keyword or each keyword and hot word whether containing taking-up in feedback information one or more, if, record the keyword in described query word or inquiry phrase and feedback information or keyword and hot word, otherwise record each keyword of described query word or inquiry phrase and taking-up or each keyword and hot word.
4. keyword match method as claimed in claim 3, is characterized in that, further comprising the steps of after described step S6:
S71, when apart from last keyword linked database update time more than a preset time period time, calculate based on the keyword in the described query word recorded in last keyword linked database update time to current time or inquiry phrase and feedback information and/or described query word or inquiry phrase and each keyword of taking-up and the time point of record and upgrade the degree of association of each keyword in described keyword linked database.
5. keyword match method as claimed in claim 4, it is characterized in that, described step S71 is:
When apart from last keyword linked database update time more than a preset time period and current time is in a permitted hours section time, calculate based on the keyword in the described query word recorded in last keyword linked database update time to current time or inquiry phrase and feedback information and/or described query word or inquiry phrase and each keyword of taking-up and the time point of record and upgrade the degree of association of each keyword in described keyword linked database.
6. keyword match method as claimed in claim 3, is characterized in that, further comprising the steps of after described step S6:
S72, calculate based on the keyword in the described query word of record or inquiry phrase and feedback information or described query word or inquiry phrase and each keyword of taking-up and upgrade the degree of association of each keyword in described keyword linked database.
7. the keyword match method according to any one of claim 4-6, is characterized in that, further comprising the steps of after described step S71 or S72 or S6:
S73, from external search engine and/or described keyword linked database, obtain the word that access frequency is more than or equal to a visit frequency threshold value, and in hot word database, record the access frequency of described word and described word.
8. keyword match method as claimed in claim 7, it is characterized in that, described step S73 is:
From external search engine and/or described keyword linked database, obtain access frequency in the multiple time periods apart from current time different time length be more than or equal to the word of the visit frequency threshold value corresponding respectively to the time period described in each respectively, and in hot word database, record the access frequency of described word and described word.
9. a keyword match device, is characterized in that, described keyword match device comprises:
One receiver module, for receiving a query word or inquiry phrase;
One correction module, for detect described query word or inquiry phrase the need of error correction, if desired error correction and revise described query word or inquiry phrase;
One hot word and search module, for searching and taking out the hot word that mapping pair answers described query word or inquiry phrase from a hot word database;
One keyword retrieval module, for searching with described query word from a keyword linked database or inquiring about the keyword that phrase associates, and takes out according to degree of association order from big to small the keyword that quantity is a default value from described keyword;
One sequence output module, for sorting by each keyword taken out or each keyword and hot word according to the degree of association or the degree of association and hot word frequency rate and export.
10. keyword match device as claimed in claim 9, it is characterized in that, described geographical location information also for obtaining the geographical location information at the source place of described query word or inquiry phrase, and is added described query word or inquiry phrase by described hot word and search module.
11. keyword match devices as claimed in claim 9, it is characterized in that, described keyword match device also comprises:
One feedback information detection logging modle, for receiving feedback information, detect in each keyword or each keyword and hot word whether containing taking-up in feedback information one or more, if, record the keyword in described query word or inquiry phrase and feedback information or keyword and hot word, otherwise record each keyword of described query word or inquiry phrase and taking-up or each keyword and hot word.
12. keyword match devices as claimed in claim 11, it is characterized in that, described keyword match device also comprises:
One keyword linked database update module, for when apart from last keyword linked database update time more than a preset time period time, calculate based on the keyword in the described query word recorded in last keyword linked database update time to current time or inquiry phrase and feedback information and/or described query word or inquiry phrase and each keyword of taking-up and the time point of record and upgrade the degree of association of each keyword in described keyword linked database.
13. keyword match devices as claimed in claim 12, it is characterized in that, described keyword linked database update module also for when apart from last keyword linked database update time more than a preset time period and current time is in a permitted hours section time, calculate based on the keyword in the described query word recorded in last keyword linked database update time to current time or inquiry phrase and feedback information and/or described query word or inquiry phrase and each keyword of taking-up and the time point of record and upgrade the degree of association of each keyword in described keyword linked database.
14. keyword match devices as claimed in claim 11, it is characterized in that, described keyword match device also comprises:
One real time critical word association database update module, for calculating based on the described query word of record or each keyword of the keyword inquired about in phrase and feedback information or described query word or inquiry phrase and taking-up and upgrade the degree of association of each keyword in described keyword linked database.
15. keyword match devices as claimed in claim 11, it is characterized in that, described keyword match device also comprises:
One hot word database update module, for obtaining the word that access frequency is more than or equal to a visit frequency threshold value from external search engine and/or described keyword linked database, and records the access frequency of described word and described word in hot word database.
16. keyword match devices as claimed in claim 15, it is characterized in that, described hot word database update module is also more than or equal to the word of the visit frequency threshold value corresponding respectively to the time period described in each respectively for obtaining access frequency in the multiple time periods apart from current time different time length from external search engine and/or described keyword linked database, and in hot word database, record the access frequency of described word and described word.
17. 1 kinds of retrieval servers, is characterized in that, described retrieval server uses the keyword match method according to any one of claim 1-8.
18. retrieval servers as claimed in claim 17, it is characterized in that, described retrieval server is small-size computer, mainframe computer or Distributed Computer System.
19. 1 kinds of keyword match systems, is characterized in that, described keyword match system comprises a server and some clients; Described server comprises a hot word database, a keyword linked database and a processing unit;
Wherein said processing unit receives a query word or inquiry phrase from described client, and detects described query word or inquire about phrase the need of error correction, and if desired described query word or inquiry phrase are then revised in error correction;
Described processing unit is also searched respectively and is taken out the hot word that mapping pair answers described query word or inquiry phrase from described hot word database, search with described query word from described keyword linked database or inquire about the keyword that phrase associates, and from described keyword, take out according to degree of association order from big to small the keyword that quantity is a default value, then each keyword taken out or each keyword and hot word are sorted according to the degree of association or the degree of association and hot word frequency rate and export described client to.
20. keyword match systems as claimed in claim 19, it is characterized in that, described geographical location information also for obtaining the geographical location information of the client exporting described query word or inquiry phrase, and is added described query word or inquiry phrase by described processing unit.
21. keyword match systems as claimed in claim 19, it is characterized in that, described server also comprises a record cell, described processing unit is from described client receiving feedback information, and detect in feedback information in each keyword or each keyword and hot word whether containing taking-up one or more, if, keyword in query word described in described recording unit records or inquiry phrase and feedback information or keyword and hot word, otherwise each keyword of query word described in described recording unit records or inquiry phrase and taking-up or each keyword and hot word.
22. keyword match systems as claimed in claim 21, it is characterized in that, described processing unit, when once keyword linked database update time is more than a preset time period in distance, calculates based on the keyword in the described query word of described recording unit records in last keyword linked database update time to current time or inquiry phrase and feedback information and/or described query word or inquiry phrase and each keyword of taking-up and the time point of record and upgrades the degree of association of each keyword in described keyword linked database.
23. keyword match systems as claimed in claim 22, it is characterized in that, described processing unit when in distance once keyword linked database update time more than a preset time period and current time is in a permitted hours section time, calculate based on the keyword in the described query word of recording unit records in last keyword linked database update time to current time or inquiry phrase and feedback information and/or described query word or inquiry phrase and each keyword of taking-up and the time point of record and upgrade the degree of association of each keyword in described keyword linked database.
24. keyword match systems as claimed in claim 21, it is characterized in that, described processing unit calculates based on the described query word of recording unit records or each keyword of the keyword inquired about in phrase and feedback information or described query word or inquiry phrase and taking-up and upgrades the degree of association of each keyword in described keyword linked database.
25. keyword match systems as claimed in claim 21, it is characterized in that, described processing unit obtains the word that access frequency is more than or equal to a visit frequency threshold value from external search engine and/or described keyword linked database, and in hot word database, record the access frequency of described word and described word.
26. keyword match systems as claimed in claim 25, it is characterized in that, described processing unit obtains access frequency in the multiple time periods apart from current time different time length and is more than or equal to the word of the visit frequency threshold value corresponding respectively to the time period described in each respectively from external search engine and/or described keyword linked database, and in hot word database, record the access frequency of described word and described word.
27. keyword match systems according to any one of claim 19-26, it is characterized in that, described client is mobile terminal.
CN201310413491.8A 2013-09-11 2013-09-11 Method for keyword matching, and device, server and system of method Pending CN104424342A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310413491.8A CN104424342A (en) 2013-09-11 2013-09-11 Method for keyword matching, and device, server and system of method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310413491.8A CN104424342A (en) 2013-09-11 2013-09-11 Method for keyword matching, and device, server and system of method

Publications (1)

Publication Number Publication Date
CN104424342A true CN104424342A (en) 2015-03-18

Family

ID=52973315

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310413491.8A Pending CN104424342A (en) 2013-09-11 2013-09-11 Method for keyword matching, and device, server and system of method

Country Status (1)

Country Link
CN (1) CN104424342A (en)

Cited By (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2016150002A1 (en) * 2015-03-24 2016-09-29 中兴通讯股份有限公司 Method and apparatus for filtering voice and/or character information, and terminal
CN106528616A (en) * 2016-09-30 2017-03-22 厦门快商通科技股份有限公司 Language error correcting method and system for use in human-computer interaction process
CN108280183A (en) * 2018-01-23 2018-07-13 余绍志 A kind of information transmission system based on big data matching and GPS positioning
CN108319631A (en) * 2017-01-16 2018-07-24 网智服务国际股份有限公司 Intelligent control system of knowledge base and feedback method thereof
CN108647355A (en) * 2018-05-16 2018-10-12 平安普惠企业管理有限公司 Methods of exhibiting, device, equipment and the storage medium of test case
CN109241381A (en) * 2017-07-04 2019-01-18 武汉默联股份有限公司 Information matching method and device
CN109299105A (en) * 2018-10-29 2019-02-01 中国地质大学(北京) A kind of retrieval of local area network geologic data and acquisition methods, device
WO2019041195A1 (en) * 2017-08-30 2019-03-07 深圳市云中飞网络科技有限公司 Application resource processing method and related product
CN109471926A (en) * 2018-10-30 2019-03-15 广东原昇信息科技有限公司 Intelligent word making method based on NLP and company information
CN109858473A (en) * 2018-12-28 2019-06-07 天津幸福生命科技有限公司 A kind of adaptive method for correcting error, device, readable medium and electronic equipment
CN110188274A (en) * 2019-05-30 2019-08-30 口口相传(北京)网络技术有限公司 Search for error correction method and device
CN110222252A (en) * 2019-06-14 2019-09-10 宜春宜联科技有限公司 Information retrieval method, device and equipment
CN110490712A (en) * 2019-08-21 2019-11-22 浙江中国轻纺城网络有限公司 A kind of commodity class heading search method, system and storage medium
CN111696545A (en) * 2019-03-15 2020-09-22 北京京东尚科信息技术有限公司 Speech recognition error correction method, device and storage medium
CN112307073A (en) * 2019-08-30 2021-02-02 北京字节跳动网络技术有限公司 Information query method, device, equipment and storage medium
CN113129057A (en) * 2021-04-16 2021-07-16 河南省信息咨询设计研究有限公司 Software cost information processing method and device, computer equipment and storage medium
CN113704233A (en) * 2021-10-29 2021-11-26 飞狐信息技术(天津)有限公司 Keyword detection method and system

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1755671A (en) * 2004-09-30 2006-04-05 北京大学 Automatic error correction method for query words in search engine
CN102360358A (en) * 2011-09-28 2012-02-22 百度在线网络技术(北京)有限公司 Keyword recommendation method and system
CN102387207A (en) * 2011-10-21 2012-03-21 华为技术有限公司 Push method and system based on user feedback information
CN103092877A (en) * 2011-11-04 2013-05-08 百度在线网络技术(北京)有限公司 Method and device for recommending keyword
CN103136224A (en) * 2011-11-24 2013-06-05 百度时代网络技术(北京)有限公司 Recommendation method and device for keywords

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1755671A (en) * 2004-09-30 2006-04-05 北京大学 Automatic error correction method for query words in search engine
CN102360358A (en) * 2011-09-28 2012-02-22 百度在线网络技术(北京)有限公司 Keyword recommendation method and system
CN102387207A (en) * 2011-10-21 2012-03-21 华为技术有限公司 Push method and system based on user feedback information
CN103092877A (en) * 2011-11-04 2013-05-08 百度在线网络技术(北京)有限公司 Method and device for recommending keyword
CN103136224A (en) * 2011-11-24 2013-06-05 百度时代网络技术(北京)有限公司 Recommendation method and device for keywords

Cited By (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2016150002A1 (en) * 2015-03-24 2016-09-29 中兴通讯股份有限公司 Method and apparatus for filtering voice and/or character information, and terminal
CN106528616A (en) * 2016-09-30 2017-03-22 厦门快商通科技股份有限公司 Language error correcting method and system for use in human-computer interaction process
CN106528616B (en) * 2016-09-30 2019-12-17 厦门快商通科技股份有限公司 Language error correction method and system in human-computer interaction process
CN108319631A (en) * 2017-01-16 2018-07-24 网智服务国际股份有限公司 Intelligent control system of knowledge base and feedback method thereof
CN109241381A (en) * 2017-07-04 2019-01-18 武汉默联股份有限公司 Information matching method and device
CN109241381B (en) * 2017-07-04 2022-01-28 武汉默联股份有限公司 Information matching method and device
WO2019041195A1 (en) * 2017-08-30 2019-03-07 深圳市云中飞网络科技有限公司 Application resource processing method and related product
CN108280183A (en) * 2018-01-23 2018-07-13 余绍志 A kind of information transmission system based on big data matching and GPS positioning
CN108647355A (en) * 2018-05-16 2018-10-12 平安普惠企业管理有限公司 Methods of exhibiting, device, equipment and the storage medium of test case
CN109299105A (en) * 2018-10-29 2019-02-01 中国地质大学(北京) A kind of retrieval of local area network geologic data and acquisition methods, device
CN109471926A (en) * 2018-10-30 2019-03-15 广东原昇信息科技有限公司 Intelligent word making method based on NLP and company information
CN109858473A (en) * 2018-12-28 2019-06-07 天津幸福生命科技有限公司 A kind of adaptive method for correcting error, device, readable medium and electronic equipment
CN109858473B (en) * 2018-12-28 2023-03-07 天津幸福生命科技有限公司 Self-adaptive deviation rectifying method and device, readable medium and electronic equipment
CN111696545B (en) * 2019-03-15 2023-11-03 北京汇钧科技有限公司 Speech recognition error correction method, device and storage medium
CN111696545A (en) * 2019-03-15 2020-09-22 北京京东尚科信息技术有限公司 Speech recognition error correction method, device and storage medium
CN110188274A (en) * 2019-05-30 2019-08-30 口口相传(北京)网络技术有限公司 Search for error correction method and device
CN110188274B (en) * 2019-05-30 2021-06-08 口口相传(北京)网络技术有限公司 Search error correction method and device
CN110222252A (en) * 2019-06-14 2019-09-10 宜春宜联科技有限公司 Information retrieval method, device and equipment
CN110490712A (en) * 2019-08-21 2019-11-22 浙江中国轻纺城网络有限公司 A kind of commodity class heading search method, system and storage medium
CN112307073A (en) * 2019-08-30 2021-02-02 北京字节跳动网络技术有限公司 Information query method, device, equipment and storage medium
CN113129057A (en) * 2021-04-16 2021-07-16 河南省信息咨询设计研究有限公司 Software cost information processing method and device, computer equipment and storage medium
CN113704233A (en) * 2021-10-29 2021-11-26 飞狐信息技术(天津)有限公司 Keyword detection method and system
CN113704233B (en) * 2021-10-29 2022-03-01 飞狐信息技术(天津)有限公司 Keyword detection method and system

Similar Documents

Publication Publication Date Title
CN104424342A (en) Method for keyword matching, and device, server and system of method
US10296658B2 (en) Use of context-dependent statistics to suggest next steps while exploring a dataset
US9171078B2 (en) Automatic recommendation of vertical search engines
US8190556B2 (en) Intellegent data search engine
US10380498B1 (en) Platform services to enable one-click execution of the end-to-end sequence of modeling steps
US10380144B2 (en) Business intelligence (BI) query and answering using full text search and keyword semantics
US9747365B2 (en) Query understanding pipeline
KR101793222B1 (en) Updating a search index used to facilitate application searches
CN101273350B (en) Click distance determination
US20140214711A1 (en) Intelligent job recruitment system and method
US20230350909A1 (en) Cloud inference system
US20080104113A1 (en) Uniform resource locator scoring for targeted web crawling
CN105701216A (en) Information pushing method and device
US9002867B1 (en) Modifying ranking data based on document changes
US10789149B2 (en) Duplicate bug report detection using machine learning algorithms and automated feedback incorporation
US20110208715A1 (en) Automatically mining intents of a group of queries
US9031886B2 (en) Pluggable modules in a cascading learning system
US20140114949A1 (en) Knowledge Management System
US9690858B1 (en) Predicting categorized completions of a partial search term
CN111913954A (en) Intelligent data standard catalog generation method and device
CN110737779A (en) Knowledge graph construction method and device, storage medium and electronic equipment
CN108984737B (en) Resume retrieval method and device
US9720984B2 (en) Visualization engine for a knowledge management system
US20220138592A1 (en) Computer prediction of relevant data from multiple disparate sources
CN103365645A (en) Method and equipment for maintaining software system

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C41 Transfer of patent application or patent right or utility model
TA01 Transfer of patent application right

Effective date of registration: 20160203

Address after: 200335 Shanghai city Changning District Admiralty Road No. 968 Building No. 16 10 floor

Applicant after: SHANGHAI XIECHENG BUSINESS CO., LTD.

Address before: 200335 Shanghai City, Changning District Fuquan Road No. 99, Ctrip network technology building

Applicant before: Ctrip computer technology (Shanghai) Co., Ltd.

C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20150318

RJ01 Rejection of invention patent application after publication