CN110502692A - Information retrieval method, device, equipment and storage medium based on search engine - Google Patents

Information retrieval method, device, equipment and storage medium based on search engine Download PDF

Info

Publication number
CN110502692A
CN110502692A CN201910623814.3A CN201910623814A CN110502692A CN 110502692 A CN110502692 A CN 110502692A CN 201910623814 A CN201910623814 A CN 201910623814A CN 110502692 A CN110502692 A CN 110502692A
Authority
CN
China
Prior art keywords
retrieval
information
algorithm
search result
target
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201910623814.3A
Other languages
Chinese (zh)
Other versions
CN110502692B (en
Inventor
吴峻
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Ping An Puhui Enterprise Management Co Ltd
Original Assignee
Ping An Puhui Enterprise Management Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ping An Puhui Enterprise Management Co Ltd filed Critical Ping An Puhui Enterprise Management Co Ltd
Priority to CN201910623814.3A priority Critical patent/CN110502692B/en
Publication of CN110502692A publication Critical patent/CN110502692A/en
Application granted granted Critical
Publication of CN110502692B publication Critical patent/CN110502692B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9538Presentation of query results

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The present invention relates to data processing fields, disclose a kind of information retrieval method based on search engine, comprising the following steps: when receiving information retrieval requests, obtain the corresponding retrieval information of the information retrieval requests and account identification;According to the retrieval information and the account identification, the corresponding target retrieval algorithm of the information retrieval requests is configured;Information retrieval is carried out according to the retrieval information and the target retrieval algorithm, obtains the first search result with the retrieval information matches;First search result is sent to the corresponding terminal of the account identification, is checked so that the terminal corresponds to user.The invention also discloses a kind of information indexing device based on search engine, equipment and storage mediums.According to retrieval information and account identification flexible configuration target retrieval algorithm, and according to target, searching algorithm carries out information retrieval to search engine in the present invention, improves the flexibility and accuracy of information retrieval.

Description

Information retrieval method, device, equipment and storage medium based on search engine
Technical field
The present invention relates to field of information processing, more particularly to the information retrieval method based on search engine, device, equipment and Storage medium.
Background technique
With the fast development of internet, the increase of the network information, in order to realize rapidly information searching, search engine with Birth.
Search engine, which refers to, specially provides a kind of website of retrieval service on internet, the server of these websites passes through net The modes such as network search software or network login, are collected into local for the page info of websites a large amount of on internet Intenet, pass through Working process establishes information database and index data base, so that the various retrievals proposed to user make a response, provides user Required information or associated pointers.The search channel of user mainly includes free word full-text search, keyword retrieval, systematic searching And the retrieval of other specific informations.The data retrieval demand of user individual is unable to satisfy in existing search engine.
Summary of the invention
The main purpose of the present invention is to provide a kind of information retrieval method based on search engine, device, equipment and deposit Storage media, it is intended to solve the technical issues of present search engine can not flexibly, accurately carry out information retrieval.
To achieve the above object, the present invention provides the information retrieval method based on search engine, described to be based on search engine Information retrieval method the following steps are included:
When receiving information retrieval requests, the corresponding retrieval information of the information retrieval requests and account identification are obtained;
According to the retrieval information and the account identification, configures the corresponding target retrieval of the information retrieval requests and calculate Method;
Information retrieval is carried out according to the retrieval information and the target retrieval algorithm, is obtained and the retrieval information matches The first search result;
First search result is sent to the corresponding terminal of the account identification, is looked into so that the terminal corresponds to user It sees.
Optionally, described according to the retrieval information and the account identification, it is corresponding to configure the information retrieval requests The step of target retrieval algorithm, comprising:
The information type for obtaining the retrieval information, inquires preset algorithm recommendation tables, it is corresponding to obtain the information type First searching algorithm;
The history scoring of the corresponding each default searching algorithm of the account identification is obtained, and it is highest default that history scored Searching algorithm is as the second searching algorithm;
When first searching algorithm is with the second searching algorithm difference, first searching algorithm and second are examined The corresponding target retrieval algorithm of information retrieval requests described in the high conduct of priority in rope algorithm.
Optionally, described that first search result is sent to the corresponding terminal of the account identification, for the end After the step of holding corresponding user to check, comprising:
Receive the user behavior data based on first search result that the terminal is sent, wherein user's row It include: browsing time and browsing frequency for data;
Obtain that the user behavior data includes to the browsing time, clear for respectively retrieving entry in first search result Frequency is look at, using browsing time longest or the browsing highest retrieval entry of frequency as really hitting information;
Information retrieval is carried out by the default searching algorithm in addition to the target retrieval algorithm, is obtained and the retrieval information Matched second search result;
According to the true hit information, first search result and second search result, each default inspection is updated The history of rope algorithm scores.
Optionally, described according to the true hit information, first search result and second search result, more The step of history scoring of new each default searching algorithm, comprising:
Sequence of the true hit information in first search result and second search result is obtained, is obtained The target retrieval result of sequence at first;
The scoring of the corresponding searching algorithm of the target retrieval result is adjusted, to complete the update of history scoring.
Optionally, described according to the retrieval information and the account identification, it is corresponding to configure the information retrieval requests After the step of target retrieval algorithm, comprising:
After target retrieval algorithm configuration completion, at least two presetting databases are judged whether there is;
At least two presetting database if it exists then obtains the access frequency of each presetting database, by the access frequency The highest presetting database of rate is as the corresponding target database of information retrieval requests.
Optionally, described that information retrieval is carried out according to the retrieval information and the target retrieval algorithm, obtain with it is described The step of retrieving the first search result of information matches, comprising:
The retrieval information is subjected to word segmentation processing by the segmentation methods in the target retrieval algorithm, obtains the retrieval The corresponding keyword set of information;
The target database is inquired, by the similarity algorithm in the target retrieval algorithm, is obtained and the keyword The retrieval entry of sets match, and arrange each retrieval entry and obtain the first search result.
Optionally, described when receiving information retrieval requests, obtain the corresponding retrieval information of the information retrieval requests And after the step of account identification, comprising:
Obtain the account identification history retrieval record, by the history retrieval record in each history search result with The retrieval information is compared, and judges to examine in the history retrieval record with the presence or absence of the history with the retrieval information matches Hitch fruit
When there is the history search result with the retrieval information matches, the history search result is sent to described The corresponding terminal of account identification is checked so that the terminal corresponds to user;
When there is no the history search result with the retrieval information matches, step is executed: according to the retrieval information With the account identification, the corresponding target retrieval algorithm of the information retrieval requests is configured.
In addition, to achieve the above object, the present invention also provides a kind of information indexing device based on search engine, the bases Include: in the information indexing device of search engine
Request receiving module, for when receiving information retrieval requests, obtaining the corresponding inspection of the information retrieval requests Rope information and account identification;
Algorithm configuration module, for configuring the information retrieval requests according to the retrieval information and the account identification Corresponding target retrieval algorithm;
Information searching module is obtained for carrying out information retrieval according to the retrieval information and the target retrieval algorithm With the first search result of the retrieval information matches;
As a result sending module, for first search result to be sent to the corresponding terminal of the account identification, for The terminal corresponds to user and checks.
In addition, to achieve the above object, the present invention also provides a kind of information searching devices based on search engine;
The information searching device based on search engine includes: memory, processor and is stored on the memory And the computer program that can be run on the processor, in which:
It realizes when the computer program is executed by the processor as described above based on the information retrieval of search engine The step of method.
In addition, to achieve the above object, the present invention also provides computer storage mediums;
Computer program, the realization when computer program is executed by processor are stored in the computer storage medium Such as the step of the above-mentioned information retrieval method based on search engine.
A kind of information retrieval method based on search engine, device, equipment and the storage medium that the embodiment of the present invention proposes, When server receives information retrieval requests, the corresponding retrieval information of the information retrieval requests and account identification are obtained;Root According to the retrieval information and the account identification, the corresponding target retrieval algorithm of the information retrieval requests is configured;According to described It retrieves information and the target retrieval algorithm carries out information retrieval, obtain the first search result with the retrieval information matches; First search result is sent to the corresponding terminal of the account identification, is checked so that the terminal corresponds to user.This hair Bright middle search engine corresponding server is according to retrieval information and account identification flexible configuration target retrieval algorithm, that is, in the present invention Server retrieves record according to the history of real-time retrieval information and the corresponding retrieval account of account identification, is configured flexibly target retrieval Algorithm, and according to target searching algorithm carries out information retrieval, improves the flexibility and accuracy of information retrieval, meets information inspection The individual demand of rope.
Detailed description of the invention
Fig. 1 is the apparatus structure schematic diagram for the hardware running environment that the embodiment of the present invention is related to;
Fig. 2 is that the present invention is based on the flow diagrams of the information retrieval method first embodiment of search engine;
Fig. 3 is that the present invention is based on the functional block diagrams of one embodiment of information indexing device of search engine.
The embodiments will be further described with reference to the accompanying drawings for the realization, the function and the advantages of the object of the present invention.
Specific embodiment
It should be appreciated that the specific embodiments described herein are merely illustrative of the present invention, it is not intended to limit the present invention.
As shown in Figure 1, the server that Fig. 1 is the hardware running environment that the embodiment of the present invention is related to (is called and is based on searching Index the information searching device held up, wherein the information searching device based on search engine can be by individually being drawn based on search The information indexing device held up is constituted, be also possible to be combined by other devices with the information indexing device based on search engine formed) Structural schematic diagram.
Server of the embodiment of the present invention refers to a management resource and provides the computer of service for user, is generally divided into file Server, database server and apps server.The computer or computer system for running the above software are also referred to as Server.For common PC (personal computer) personal computer, server is in stability, safety, property Energy etc. requires higher;As shown in Figure 1, the server may include: processor 1001, such as central processing unit (Central Processing Unit, CPU), network interface 1004, user interface 1003, memory 1005, communication bus 1002, hardware such as chipset, disk system, network etc..Wherein, communication bus 1002 is for realizing the connection between these components Communication.User interface 1003 may include display screen (Display), input unit such as keyboard (Keyboard), optional user Interface 1003 can also include standard wireline interface and wireless interface.Network interface 1004 optionally may include having for standard Line interface, wireless interface (such as Wireless Fidelity WIreless-FIdelity, WIFI interface).Memory 1005 can be high speed with Machine accesses memory (random access memory, RAM), is also possible to stable memory (non-volatile ), such as magnetic disk storage memory.Memory 1005 optionally can also be the storage dress independently of aforementioned processor 1001 It sets.
Optionally, server can also include camera, RF (Radio Frequency, radio frequency) circuit, sensor, sound Frequency circuit, WiFi module;Input unit, than display screen, touch screen;Network interface can be blue in blanking wireless interface in addition to WiFi Tooth, probe etc..It will be understood by those skilled in the art that server architecture shown in Fig. 1 does not constitute the restriction to server, It may include perhaps combining certain components or different component layouts than illustrating more or fewer components.
As shown in Figure 1, the computer software product, which is stored in a storage medium, (storage medium: is called computer storage Medium, computer media, readable medium, readable storage medium storing program for executing, computer readable storage medium are directly medium etc., storage Medium can be non-volatile readable storage medium, such as RAM, magnetic disk, CD) in, including some instructions use is so that an end End equipment (can be mobile phone, computer, server, air conditioner or the network equipment etc.) executes each embodiment institute of the present invention The method stated, as may include operating system, network communication module, use in a kind of memory 1005 of computer storage medium Family interface module and computer program.
In server shown in Fig. 1, network interface 1004 be mainly used for connect background data base, with background data base into Row data communication;User interface 1003 is mainly used for connection client, and (client, is called user terminal or terminal, and the present invention is implemented Example terminal can be also possible to mobile terminal with fixed terminal, e.g., intelligent air condition, intelligent electric lamp, intelligent power with network savvy, Intelligent sound box, autonomous driving vehicle, PC, smart phone, tablet computer, E-book reader, portable computer etc., are wrapped in terminal Containing sensor such as optical sensor, motion sensor and other sensors, details are not described herein), data are carried out with client Communication;And processor 1001 can be used for calling the computer program stored in memory 1005, and it is real to execute the present invention or less Step in the information retrieval method based on search engine of example offer is provided.
The present embodiment provides a kind of information retrieval methods based on search engine, should the information retrieval side based on search engine Method is applied to search engine corresponding server as shown in Figure 1, and the search engine in the application is directed to AIML (Quan Mingwei Artificial Intelligence Markup Language (artificial intelligence markup language), AIML are a kind of creation natures The XML language of lingware agency) Normalization rule JAVA development language obtains, can preferably from different application software into Row docking, realizes efficiently and accurately information retrieval, specifically:
It is described based on search the present invention is based in the first embodiment of the information retrieval method of search engine referring to Fig. 2 The information retrieval method of engine includes:
Step S10 obtains the corresponding retrieval information of the information retrieval requests and account when receiving information retrieval requests Family mark.
When the corresponding server of search engine receives information retrieval requests, server obtains to be carried in information retrieval requests Retrieval information (retrieval information can be understood as user input query information) and account identification (account identification refers to unique knowledge The identification information of other user, for example, retrieval customer accounting code).
It is understood that the triggering form for the information retrieval requests that server receives in the present embodiment does not limit specifically It is fixed, that is, information retrieval requests can be what user actively triggered, for example, user is based on terminal speech or text inputs: " attached Closely which nice restaurant has " information retrieval requests are triggered, information retrieval requests are sent to server by terminal, and server receives To information retrieval requests, " nearby which nice restaurant has " is used as retrieval information by server, and server asks information retrieval Account name in asking is referred to as account identification;Alternatively, information retrieval requests can also be automatic trigger, for example, user is preparatory Be arranged 8 points of daily morning in the terminal broadcasts weather forecast automatically, and terminal is asked in the 8 automatic trigger information retrievals of daily morning It asks, server receives information retrieval requests, and for server by " weather forecast " as retrieval information, server asks information retrieval Account name in asking is referred to as account identification.
Step S20 configures the corresponding target of the information retrieval requests according to the retrieval information and the account identification Searching algorithm.
Server configures target retrieval algorithm according to retrieval information and account identification, specifically, comprising:
Server determines the first searching algorithm according to retrieval information, that is, each default searching algorithm processing of server inquiry The retrieval information efficiency, server is using the highest searching algorithm of efficiency as the first searching algorithm, for example, retrieval information are as follows: " such as What improves writing efficiency ", server is 0.5 second by the retrieval time that canonical searching algorithm is retrieved, and server is according to machine The retrieval time that study searching algorithm is retrieved is 1 second, then server is using canonical searching algorithm as the first searching algorithm.
Server determines the second searching algorithm according to account identification, that is, sets in server previously according to history retrieval record The corresponding searching algorithm of different Account Types is set, the account is obtained in server and identifies corresponding Account Type, then obtaining should The corresponding searching algorithm of Account Type is as the second searching algorithm;For example, account identification is xxx, corresponding Account Type is two Grade Account Type, server obtain the corresponding machine learning searching algorithm of secondary account type, and server retrieves machine learning Algorithm is as the second searching algorithm.
After server determines the first searching algorithm and the second searching algorithm, server judges the first searching algorithm and Whether two searching algorithms are identical, and when the first searching algorithm is identical as the second searching algorithm, server is directly retrieved, When one searching algorithm and the second searching algorithm difference, server obtains corresponding first priority of the first searching algorithm and second Corresponding second priority of searching algorithm, if the first priority is higher than the second priority, server makees the first searching algorithm It is on the contrary for target retrieval algorithm.
It should be added that the priority of searching algorithm, can be it is pre-set, can also be according to application scenarios It is specific to be arranged, it is not construed as limiting in the present embodiment.
Step S30 carries out information retrieval according to the retrieval information and the target retrieval algorithm, obtains and the retrieval First search result of information matches.
After server determines target retrieval algorithm, server according to target searching algorithm processing retrieval information, and obtain First search result obtains for example, according to target the segmentation methods in searching algorithm carry out word segmentation processing to retrieval information to server Retrieve the corresponding keyword set of information;Then, (presetting database refers to pre-set packet to server inquiry presetting database Database containing different information types, presetting database can be through search program (Indexer), be commonly called as " spider " (Spider) program or " robot " (Robot) program, the web database of foundation;Alternatively, presetting database can also be pre- First establish other kinds of database), the server according to target similarity algorithm in searching algorithm will retrieval information and present count It is compared according to the presupposed information for including in library, acquisition and the matched retrieval entry of keyword set, and arranges each retrieval entry Obtain the first search result.
First search result is sent to the corresponding terminal of the account identification, for the terminal pair by step S40 It is checked using family.
For server after obtaining the first search result, it is corresponding that the first search result is sent to account identification by server Terminal, so that terminal corresponds to user and checks search result.Search engine corresponding server is according to retrieval information and account in the present invention Family identifies flexible configuration target retrieval algorithm, that is, server is according to real-time retrieval information and the corresponding inspection of account identification in the present invention The history of rope account retrieves record, is configured flexibly target retrieval algorithm, and according to target searching algorithm carries out information retrieval, improves The flexibility and accuracy of information retrieval, meets the individual demand of information retrieval.
Further, on the basis of first embodiment of the invention, propose that the present invention is based on the inspections of the information of search engine The second embodiment of Suo Fangfa.
The present embodiment is the refinement of step S20 in first embodiment, and a kind of configuration target is specifically illustrated in the present embodiment The scheme of searching algorithm, the information retrieval method based on search engine include:
Step S21 obtains the information type of the retrieval information, inquires preset algorithm recommendation tables, obtain the info class Corresponding first searching algorithm of type.
Server obtains the information type of retrieval information, and information type refers to retrieval information by information attribute, information The information category that appearance or information function etc. are classified, for example, server can be divided by the information content by information is retrieved Are as follows: type of message, data type and knowledge type.Then, server inquires preset algorithm recommendation tables, obtains preset algorithm and recommends Corresponding first searching algorithm of information type in table.
Preset algorithm recommendation tables refer to pre-set information type and searching algorithm mapping table, for example, server according to History retrieval record statistics obtains: the highest algorithm of retrieval information retrieval accuracy rate of data type is canonical matching algorithm, then The retrieval information of data type and canonical matching algorithm are established into mapping relations in preset algorithm recommendation tables;For another example, server Retrieve record statistics according to history to obtain: the highest algorithm of retrieval information retrieval accuracy rate of type of message is machine learning retrieval The retrieval information of type of message and machine learning searching algorithm are then established mapping relations in preset algorithm recommendation tables by algorithm.
Step S22 obtains the history scoring of the corresponding each default searching algorithm of the account identification, and most by history scoring High default searching algorithm is as the second searching algorithm.
Server obtains the history scoring of the corresponding each default searching algorithm of account identification, that is, presets in server There are multiple searching algorithms, e.g., canonical matching algorithm and machine learning searching algorithm etc., it is king xx that server, which obtains account identification, Corresponding: the historical assessment of canonical matching algorithm is 8 points, the scoring of the history of machine learning searching algorithm is 6 points, and server will just Then matching algorithm is as the second searching algorithm.
It is to be understood that identical default searching algorithm might not in the different corresponding history scorings of account identification It is identical, that is, the history scoring of default searching algorithm is obtained according to the history of user retrieval record: for example, setting in server Searching algorithm p (canonical searching algorithm) and default searching algorithm q (machine learning searching algorithm), account mark are preset in being equipped with 1 has 3 history retrieval records:
The search result that searching algorithm p output is preset in first time history retrieval record is ordered as A1 and A2;Default retrieval The search result of algorithm q output is ordered as A2 and A1;It is A1 that user, which really hits information,;Then server is by default searching algorithm p History scoring be updated to 0+1, it is 0 that the scoring of default searching algorithm q, which is remained unchanged,;
The search result that searching algorithm p output is preset in second of history retrieval record is ordered as B1 and B2;Default retrieval The search result of algorithm q output is ordered as B2 and B1;It is B2 that user, which really hits information,;Then server is by default searching algorithm p History scoring to remain unchanged be 1, the history scoring of default searching algorithm q is updated to 0+1;
The search result that searching algorithm p output is preset in third time history retrieval record is ordered as C1 and C2;Default retrieval The search result of algorithm q output is ordered as C2 and C1;It is C1 that user, which really hits information,;Server is by default searching algorithm p's History scoring is updated to 1+1=2, and it is 1 that the history scoring of default searching algorithm q, which is remained unchanged,;Server is retrieved according to history Record determines that the history scoring of default searching algorithm p is 2 points, and presetting the scoring of searching algorithm q history is 1 point.Server will be preset The target retrieval algorithm that searching algorithm p is used as.
After server determines the first searching algorithm and the second searching algorithm, server judges the first searching algorithm and the Whether two searching algorithms are identical, and when the first searching algorithm is identical with the second searching algorithm, server is by determining searching algorithm It is retrieved.
Step S23, when first searching algorithm is with the second searching algorithm difference, by first searching algorithm Target retrieval algorithm corresponding with information retrieval requests described in the high conduct of priority in the second searching algorithm.
When server determines the first searching algorithm and the second searching algorithm difference, server obtains the first searching algorithm Second priority of the first priority and the second searching algorithm, wherein the priority of searching algorithm, can be it is pre-set, It can also be server according to retrieval scene flexible setting;Then, server compares the first priority and the second priority, clothes Business device calculates the corresponding target retrieval of the high conduct information retrieval requests of priority in the first searching algorithm and the second searching algorithm Method.
Server configures target retrieval algorithm according to retrieval information and account identification in the present embodiment, effectively guarantees The flexibility of target retrieval algorithm setting, so that the searching algorithm that server is different according to unused user setting, realizes Index, which is held up, realizes decoupling with searching algorithm.
Further, on the basis of second embodiment of the invention, propose that the present invention is based on the inspections of the information of search engine The 3rd embodiment of Suo Fangfa.
Server carries out the searching algorithm history scoring that step S22 in second embodiment is related to automatic in the present embodiment It updates, the information retrieval method based on search engine includes:
Step S50, receives the user behavior data based on first search result that the terminal is sent, described in acquisition User behavior data include to respectively retrieved in first search result entry browsing time, browsing frequency, when by browsing Between longest or the highest retrieval entry of browsing frequency as really hitting information.
First search result is sent to the corresponding terminal of account identification by server, so that terminal user checks the first retrieval As a result, terminal acquires user behavior data of the user based on the first search result, terminal feeds back the user behavior data of acquisition To server, the user behavior data based on the first search result that server receiving terminal is sent, for example, server is by first The relevant file of search result 10 " how improving writing efficiency " is sent to terminal, and user clicks second once at the terminal Browsing time is 20 seconds, and it is 90 seconds that user clicks the 5th browsing time at the terminal, and terminal acquires user behavior data (user behavior data includes: browsing time and browsing frequency), and user behavior data is sent to server.
Server obtains the browsing time that entry is respectively retrieved in corresponding first search result of user behavior data, browsing frequency Rate, that is, server analyzes user behavior data, and server is by browsing time longest or browses the highest retrieval item of frequency Mesh is as true hit information.For example, the 5th article of server is as true hit information.
Step S60, by addition to the target retrieval algorithm default searching algorithm carry out information retrieval, obtain with it is described Retrieve the second search result of information matches.
After server determines true hit information, default searching algorithm of the server in addition to target retrieval algorithm Information retrieval is carried out, obtains the second search result with retrieval information matches, it is to be appreciated that default searching algorithm can wrap One or more is included, in addition, server, which carries out information retrieval according to searching algorithm, is referred to first embodiment, the present embodiment In do not repeat.
Step S70 is updated according to the true hit information, first search result and second search result The history of each default searching algorithm scores.
Specifically, each default searching algorithm of server update history score update the step of include:
Step a obtains row of the true hit information in first search result and second search result Sequence obtains the target retrieval result of sequence at first;
Step b adjusts the scoring of the corresponding searching algorithm of the target retrieval result, to complete the update of history scoring.
For example, target retrieval algorithm is p, it further include default searching algorithm q and s, service in addition to target retrieval algorithm is p Device is ordered as A1, A2 and A3 according to the first search result that target retrieval algorithm p is obtained;Server is according to default searching algorithm q The second obtained search result is ordered as A2, A3 and A1;Server is arranged according to the second search result that default searching algorithm s is obtained Sequence is A3, A2 and A2;It is A2 that user, which really hits information,;Then server is according to true hit information in the first search result and the Sequence in two search results, for server by the history scoring+1 of default searching algorithm q, server keeps target retrieval algorithm p It is not adjusted with the scoring of the history of default searching algorithm s, to update going through for each searching algorithm of account identification pair in server History retrieval scoring.
Server updates account identification all according to the browsing situation of user during each retrieval in the present embodiment The history of corresponding each default searching algorithm scores, so that more reasonable according to the determining searching algorithm of history scoring.
Further, on the basis of the above embodiment of the present invention, propose that the present invention is based on the inspections of the information of search engine The fourth embodiment of Suo Fangfa.
The present embodiment be in first embodiment after step S30 the step of, illustrate to take in fourth embodiment of the invention The target database of business device configuration retrieval, so that information retrieval is more comprehensive.The information retrieval method based on search engine Include:
Step S80 judges whether there is at least two preset datas after target retrieval algorithm configuration completion Library.
After the completion of server target retrieval algorithm configuration, server judges whether there is at least two preset datas Library, that is, corresponding presetting database is retrieved in the present embodiment can be one, can also be multiple, preset if only existing one Database, then server retrieves presetting database according to target retrieval algorithm.
Step S90, at least two presetting databases, then obtain the access frequency of each presetting database if it exists, by institute The highest presetting database of access frequency is stated as the corresponding target database of information retrieval requests.
At least two presetting database if it exists, then server obtains the access frequency of each presetting database, server Using the highest presetting database of access frequency as the corresponding target database of information retrieval requests, examined with to object library Rope.It should be added that: server can also be or pre- according to the information content in each presetting database in the present embodiment If the more new information of database, determines target database, is not illustrated herein.
In the present embodiment after determining target database, in server execution first embodiment the step of step S40, clothes Business device retrieves target database, comprising:
The retrieval information is carried out word segmentation processing by the segmentation methods in the target retrieval algorithm, obtained by step S41 The corresponding keyword set of the retrieval information.
Segmentation methods in retrieval information according to target searching algorithm are carried out word segmentation processing by server, that is, server determines The redundancy retrieved in information retains key message, obtains the corresponding keyword set of retrieval information.
Step S42 inquires the target database, by the similarity algorithm in the target retrieval algorithm, acquisition and institute The matched retrieval entry of keyword set is stated, and arranges each retrieval entry and obtains the first search result.
Server inquire target database, server obtain target retrieval algorithm in similarity algorithm, server according to Keyword in set of keywords is compared similarity algorithm with the presupposed information in mesh database, and server will be similar Information of the degree higher than 80% as with the matched retrieval entry of keyword set, server arranges each institute according to the height of relative degree It states retrieval entry and obtains the first search result.
In the present embodiment when search engine corresponding server can call the data information of multiple presetting databases, service Device determines target database according to the information of presetting database, it is ensured that information retrieval is more comprehensive.
Further, on the basis of the above embodiment of the present invention, propose that the present invention is based on the inspections of the information of search engine The 5th embodiment of Suo Fangfa.
The present embodiment be in first embodiment step S10 after the step of, the present embodiment get retrieval information and After account identification, server judges whether there is history retrieval record, specifically, the information retrieval based on search engine Method includes:
Step S100 obtains the history retrieval record of the account identification, by each history in history retrieval record Search result is compared with the retrieval information, judges to whether there is and the retrieval information in the history retrieval record The history search result matched.
The history that server obtains account identification retrieves record, wherein history retrieval record account identification corresponds to user's History retrieves information, and each history search result in history retrieval record is compared by server with retrieval information, and judgement is gone through With the presence or absence of the history search result with retrieval information matches in history retrieval record.
Step S110, when there is the history search result with the retrieval information matches, by the history search result It is sent to the corresponding terminal of the account identification, is checked so that the terminal corresponds to user.
When server determines to have the history search result with retrieval information matches, server sends out history search result Terminal corresponding to account identification is sent, is checked so that terminal corresponds to user;Determine to be not present and retrieval information matches in server History search result when, execute step: step S20 in first embodiment: according to the retrieval information and the account identification, Configure the corresponding target retrieval algorithm of the information retrieval requests.In the present embodiment when receiving information retrieval requests after, Server first queried the retrieval record of history, in order to avoid the repeated retrieval of identical retrieval information, improve the retrieval of information Efficiency.
In addition, the embodiment of the present invention also proposes a kind of information indexing device based on search engine, the base referring to Fig. 3 Include: in the information indexing device of search engine
Request receiving module 10, it is corresponding for when receiving information retrieval requests, obtaining the information retrieval requests Retrieve information and account identification;
Algorithm configuration module 20, for configuring the information retrieval and asking according to the retrieval information and the account identification Seek corresponding target retrieval algorithm;
Information searching module 30 is obtained for carrying out information retrieval according to the retrieval information and the target retrieval algorithm To the first search result with the retrieval information matches;
As a result sending module 40, for first search result to be sent to the corresponding terminal of the account identification, with User is corresponded to for the terminal to check.
Optionally, the algorithm configuration module 20, comprising:
Query unit inquires preset algorithm recommendation tables, obtains the letter for obtaining the information type of the retrieval information Cease corresponding first searching algorithm of type;
Acquiring unit, the history for obtaining the corresponding each default searching algorithm of the account identification score, and by history Highest default searching algorithm score as the second searching algorithm;
Determination unit, for when first searching algorithm is with the second searching algorithm difference, described first to be examined The corresponding target retrieval algorithm of information retrieval requests described in the high conduct of priority in rope algorithm and the second searching algorithm.
Optionally, the information indexing device based on search engine, comprising:
Behavioral data obtains module, the user behavior based on first search result sent for receiving the terminal Data, wherein the user behavior data includes: browsing time and browsing frequency;
Information determination module is hit, for obtaining that the user behavior data includes to each in first search result Browsing time, the browsing frequency for retrieving entry, using browsing time longest or the browsing highest retrieval entry of frequency as true life Middle information;
Second retrieval module, for carrying out information retrieval by the default searching algorithm in addition to the target retrieval algorithm, Obtain the second search result with the retrieval information matches;
Score update module, for according to the true hit information, first search result and second retrieval As a result, updating the history scoring of each default searching algorithm.
Optionally, the scoring update module, comprising:
Sort acquiring unit, for obtaining the true hit information in first search result and second retrieval As a result the sequence in obtains the target retrieval result of sequence at first;
Updating unit is adjusted, for adjusting the scoring of the corresponding searching algorithm of the target retrieval result, to complete history The update of scoring.
Optionally, the information indexing device based on search engine, comprising:
Quantity judgment module, for judging whether there is at least two after target retrieval algorithm configuration completion Presetting database;
Database determining module then obtains the visit of each presetting database at least two presetting database if it exists Frequency is asked, using the highest presetting database of the access frequency as the corresponding target database of information retrieval requests.
Optionally, the information searching module 30, comprising:
Word segmentation processing unit, for segmenting the retrieval information by the segmentation methods in the target retrieval algorithm Processing, obtains the corresponding keyword set of the retrieval information;
Match query unit, for inquiring the target database, by the similarity algorithm in the target retrieval algorithm, Acquisition and the matched retrieval entry of the keyword set, and arrange each retrieval entry and obtain the first search result.
Optionally, the information indexing device based on search engine, comprising:
Historical query module, the history for obtaining the account identification retrieve record, will be in history retrieval record Each history search result be compared with the retrieval information, judge to whether there is in history retrieval record and the inspection The history search result of rope information matches;
History output module, for exist and it is described retrieval information matches history search result when, by the history Search result is sent to the corresponding terminal of the account identification, checks so that the terminal corresponds to user;
There is no with it is described retrieval information matches history search result when, execute algorithm configuration module 20 the step of: According to the retrieval information and the account identification, the corresponding target retrieval algorithm of the information retrieval requests is configured.
Wherein, the step of each Implement of Function Module of the information indexing device based on search engine can refer to base of the present invention In each embodiment of the information retrieval method of search engine, details are not described herein again.
In addition, the embodiment of the present invention also proposes a kind of computer storage medium.
Computer program, the realization when computer program is executed by processor are stored in the computer storage medium Operation in information retrieval method provided by the above embodiment based on search engine.
It should be noted that, in this document, relational terms such as first and second and the like are used merely to a reality Body/operation/object is distinguished with another entity/operation/object, without necessarily requiring or implying these entity/operations/ There are any actual relationship or orders between object;The terms "include", "comprise" or its any other variant meaning Covering non-exclusive inclusion, so that the process, method, article or the system that include a series of elements not only include that A little elements, but also including other elements that are not explicitly listed, or further include for this process, method, article or The intrinsic element of system.In the absence of more restrictions, the element limited by sentence "including a ...", is not arranged Except there is also other identical elements in process, method, article or the system for including the element.
For device embodiment, since it is substantially similar to the method embodiment, related so describing fairly simple Place illustrates referring to the part of embodiment of the method.The apparatus embodiments described above are merely exemplary, wherein making It may or may not be physically separated for the unit of separate part description.In can selecting according to the actual needs Some or all of the modules realize the purpose of the present invention program.Those of ordinary skill in the art are not making the creative labor In the case where, it can it understands and implements.
The serial number of the above embodiments of the invention is only for description, does not represent the advantages or disadvantages of the embodiments.
Through the above description of the embodiments, those skilled in the art can be understood that above-described embodiment side Method can be realized by means of software and necessary general hardware platform, naturally it is also possible to by hardware, but in many cases The former is more preferably embodiment.Based on this understanding, technical solution of the present invention substantially in other words does the prior art The part contributed out can be embodied in the form of software products, which is stored in one as described above In storage medium (such as ROM/RAM, magnetic disk, CD), including some instructions are used so that terminal device (it can be mobile phone, Computer, server, air conditioner or network equipment etc.) execute method described in each embodiment of the present invention.
The above is only a preferred embodiment of the present invention, is not intended to limit the scope of the invention, all to utilize this hair Equivalent structure or equivalent flow shift made by bright specification and accompanying drawing content is applied directly or indirectly in other relevant skills Art field, is included within the scope of the present invention.

Claims (10)

1. a kind of information retrieval method based on search engine, which is characterized in that the information retrieval side based on search engine Method the following steps are included:
When receiving information retrieval requests, the corresponding retrieval information of the information retrieval requests and account identification are obtained;
According to the retrieval information and the account identification, the corresponding target retrieval algorithm of the information retrieval requests is configured;
Information retrieval is carried out according to the retrieval information and the target retrieval algorithm, obtains the with the retrieval information matches One search result;
First search result is sent to the corresponding terminal of the account identification, is checked so that the terminal corresponds to user.
2. as described in claim 1 based on the information retrieval method of search engine, which is characterized in that described according to the retrieval Information and the account identification, the step of configuring the information retrieval requests corresponding target retrieval algorithm, comprising:
The information type for obtaining the retrieval information, inquires preset algorithm recommendation tables, obtains the information type corresponding first Searching algorithm;
The history scoring of the corresponding each default searching algorithm of the account identification is obtained, and history is scored highest default retrieval Algorithm is as the second searching algorithm;
When first searching algorithm is with the second searching algorithm difference, first searching algorithm and the second retrieval are calculated The corresponding target retrieval algorithm of information retrieval requests described in the high conduct of priority in method.
3. as claimed in claim 2 based on the information retrieval method of search engine, which is characterized in that described to be examined described first Hitch fruit is sent to the corresponding terminal of the account identification, after corresponding to the step of user checks for the terminal, comprising:
Receive the user behavior data based on first search result that the terminal is sent, wherein the user behavior number According to include: the browsing time and browsing frequency;
Obtain that the user behavior data includes to browsing time, the browsing frequency for respectively retrieving entry in first search result Rate, using browsing time longest or the browsing highest retrieval entry of frequency as true hit information;
Information retrieval is carried out by the default searching algorithm in addition to the target retrieval algorithm, is obtained and the retrieval information matches The second search result;
According to the true hit information, first search result and second search result, updates each default retrieval and calculate The history of method scores.
4. as claimed in claim 3 based on the information retrieval method of search engine, which is characterized in that described according to described true Information, first search result and second search result are hit, the step of the history scoring of each default searching algorithm is updated Suddenly, comprising:
Sequence of the true hit information in first search result and second search result is obtained, sequence is obtained Target retrieval result at first;
The scoring of the corresponding searching algorithm of the target retrieval result is adjusted, to complete the update of history scoring.
5. as described in claim 1 based on the information retrieval method of search engine, which is characterized in that described according to the retrieval Information and the account identification, after the step of configuring the information retrieval requests corresponding target retrieval algorithm, comprising:
After target retrieval algorithm configuration completion, at least two presetting databases are judged whether there is;
At least two presetting database if it exists then obtains the access frequency of each presetting database, most by the access frequency High presetting database is as the corresponding target database of information retrieval requests.
6. as claimed in claim 5 based on the information retrieval method of search engine, which is characterized in that described according to the retrieval Information and the target retrieval algorithm carry out information retrieval, obtain the step with the first search result of the retrieval information matches Suddenly, comprising:
The retrieval information is subjected to word segmentation processing by the segmentation methods in the target retrieval algorithm, obtains the retrieval information Corresponding keyword set;
The target database is inquired, by the similarity algorithm in the target retrieval algorithm, is obtained and the keyword set Matched retrieval entry, and arrange each retrieval entry and obtain the first search result.
7. as described in claim 1 based on the information retrieval method of search engine, which is characterized in that described to receive information When retrieval request, after the step of obtaining the corresponding retrieval information of the information retrieval requests and account identification, comprising:
Obtain the account identification history retrieval record, by the history retrieval record in each history search result with it is described Retrieval information is compared, and judges to tie in the history retrieval record with the presence or absence of the history retrieval with the retrieval information matches Fruit;
When there is the history search result with the retrieval information matches, the history search result is sent to the account Corresponding terminal is identified, is checked so that the terminal corresponds to user;
When there is no the history search result with the retrieval information matches, step is executed: according to the retrieval information and institute Account identification is stated, the corresponding target retrieval algorithm of the information retrieval requests is configured.
8. a kind of information indexing device based on search engine, which is characterized in that the information retrieval dress based on search engine It sets and includes:
Request receiving module, for when receiving information retrieval requests, obtaining the corresponding retrieval letter of the information retrieval requests Breath and account identification;
Algorithm configuration module, for it is corresponding to configure the information retrieval requests according to the retrieval information and the account identification Target retrieval algorithm;
Information searching module, for obtaining and institute according to the retrieval information and target retrieval algorithm progress information retrieval State the first search result of retrieval information matches;
As a result sending module, for first search result to be sent to the corresponding terminal of the account identification, for described Terminal corresponds to user and checks.
9. a kind of information searching device based on search engine, which is characterized in that the information retrieval based on search engine is set It is standby to include: memory, processor and be stored in the computer program that run on the memory and on the processor, In:
When the computer program is executed by the processor realize as described in any one of claims 1 to 7 based on search The step of information retrieval method of engine.
10. a kind of computer storage medium, which is characterized in that be stored with computer program, institute in the computer storage medium State the information based on search engine realized as described in any one of claims 1 to 7 when computer program is executed by processor The step of search method.
CN201910623814.3A 2019-07-10 2019-07-10 Information retrieval method, device, equipment and storage medium based on search engine Active CN110502692B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910623814.3A CN110502692B (en) 2019-07-10 2019-07-10 Information retrieval method, device, equipment and storage medium based on search engine

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910623814.3A CN110502692B (en) 2019-07-10 2019-07-10 Information retrieval method, device, equipment and storage medium based on search engine

Publications (2)

Publication Number Publication Date
CN110502692A true CN110502692A (en) 2019-11-26
CN110502692B CN110502692B (en) 2023-02-03

Family

ID=68585581

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910623814.3A Active CN110502692B (en) 2019-07-10 2019-07-10 Information retrieval method, device, equipment and storage medium based on search engine

Country Status (1)

Country Link
CN (1) CN110502692B (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111291057A (en) * 2020-02-26 2020-06-16 上海云鱼智能科技有限公司 User information indexing method, device, server and storage medium of IM tool
CN111506818A (en) * 2020-04-22 2020-08-07 中国民航信息网络股份有限公司 Flight data processing method and device
CN111813902A (en) * 2020-05-21 2020-10-23 车智互联(北京)科技有限公司 Intelligent response method and system and computing device
CN113779305A (en) * 2021-07-30 2021-12-10 北京达佳互联信息技术有限公司 Information retrieval method and device and electronic equipment

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1858733A (en) * 2005-11-01 2006-11-08 华为技术有限公司 Information searching system and searching method
CN101183364A (en) * 2006-11-24 2008-05-21 腾讯科技(深圳)有限公司 Information searching method, searching engine customer terminal/server and system
CN105141903A (en) * 2015-08-13 2015-12-09 中国科学院自动化研究所 Method for retrieving object in video based on color information
CN105653559A (en) * 2014-11-28 2016-06-08 国际商业机器公司 Method and device for searching in database
CN108280225A (en) * 2018-02-12 2018-07-13 北京吉高软件有限公司 A kind of semantic retrieving method and searching system
CN109271552A (en) * 2018-08-22 2019-01-25 北京达佳互联信息技术有限公司 Pass through the method, apparatus of picture retrieval video, electronic equipment and storage medium
CN109344232A (en) * 2018-11-13 2019-02-15 平安科技(深圳)有限公司 A kind of public feelings information search method and terminal device
CN109933708A (en) * 2019-01-25 2019-06-25 平安科技(深圳)有限公司 Information retrieval method, device, storage medium and computer equipment

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1858733A (en) * 2005-11-01 2006-11-08 华为技术有限公司 Information searching system and searching method
CN101183364A (en) * 2006-11-24 2008-05-21 腾讯科技(深圳)有限公司 Information searching method, searching engine customer terminal/server and system
CN105653559A (en) * 2014-11-28 2016-06-08 国际商业机器公司 Method and device for searching in database
CN105141903A (en) * 2015-08-13 2015-12-09 中国科学院自动化研究所 Method for retrieving object in video based on color information
CN108280225A (en) * 2018-02-12 2018-07-13 北京吉高软件有限公司 A kind of semantic retrieving method and searching system
CN109271552A (en) * 2018-08-22 2019-01-25 北京达佳互联信息技术有限公司 Pass through the method, apparatus of picture retrieval video, electronic equipment and storage medium
CN109344232A (en) * 2018-11-13 2019-02-15 平安科技(深圳)有限公司 A kind of public feelings information search method and terminal device
CN109933708A (en) * 2019-01-25 2019-06-25 平安科技(深圳)有限公司 Information retrieval method, device, storage medium and computer equipment

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111291057A (en) * 2020-02-26 2020-06-16 上海云鱼智能科技有限公司 User information indexing method, device, server and storage medium of IM tool
CN111506818A (en) * 2020-04-22 2020-08-07 中国民航信息网络股份有限公司 Flight data processing method and device
CN111813902A (en) * 2020-05-21 2020-10-23 车智互联(北京)科技有限公司 Intelligent response method and system and computing device
CN111813902B (en) * 2020-05-21 2024-02-23 车智互联(北京)科技有限公司 Intelligent response method, system and computing device
CN113779305A (en) * 2021-07-30 2021-12-10 北京达佳互联信息技术有限公司 Information retrieval method and device and electronic equipment
CN113779305B (en) * 2021-07-30 2024-01-02 北京达佳互联信息技术有限公司 Information retrieval method and device and electronic equipment

Also Published As

Publication number Publication date
CN110502692B (en) 2023-02-03

Similar Documents

Publication Publication Date Title
US11120344B2 (en) Suggesting follow-up queries based on a follow-up recommendation machine learning model
US11914588B1 (en) Determining a user-specific approach for disambiguation based on an interaction recommendation machine learning model
CN110502692A (en) Information retrieval method, device, equipment and storage medium based on search engine
US10885026B2 (en) Translating a natural language request to a domain-specific language request using templates
CN108733713B (en) Data query method and device in data warehouse
CN101636935B (en) Location in search queries
US9495460B2 (en) Merging search results
US10713269B2 (en) Determining a presentation format for search results based on a presentation recommendation machine learning model
CN101796515B (en) Query statistics provider
US11727459B2 (en) Search query-based replacement part interface
US20070288444A1 (en) Web-based customer service interface
JP6346218B2 (en) Search method, apparatus and server for online trading platform
US11170016B2 (en) Navigating hierarchical components based on an expansion recommendation machine learning model
US11315162B2 (en) Internet of things (IoT) configurator
US20110191290A1 (en) Predictive categorization
JP2015528611A (en) Dynamic data acquisition method and system
CN107085600A (en) POI recommends method, device, equipment and computer-readable recording medium
KR101972904B1 (en) Project matching methods and system using chatbot, confidence measurement and project management
CN109753504A (en) Data query method and device
CN110442791A (en) Data push method and system
CN106874402A (en) Searching method and device
CN111581479A (en) One-stop data processing method and device, storage medium and electronic equipment
WO2021129259A1 (en) Method for dynamically and quickly loading module according to usage habits of user
CN111078998B (en) Information retrieval method, device, storage medium and server
TWI684147B (en) Cloud self-service analysis platform and analysis method thereof

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant