CN107807957A - entity library generating method and device - Google Patents

Info

Publication number
CN107807957A
Authority
CN
China
Prior art keywords
entity
demand
user
search
click
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201710916101.7A
Other languages
Chinese (zh)
Inventor
余晓龙
张华泉
王浩
张向征
邬小鹏
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Qihoo Technology Co Ltd
Original Assignee
Beijing Qihoo Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Qihoo Technology Co Ltd
Priority to CN201710916101.7A
Publication of CN107807957A

Classifications

    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00: Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90: Details of database functions independent of the retrieved data types
    • G06F16/95: Retrieval from the web
    • G06F16/953: Querying, e.g. by the use of web search engines
    • G06F16/9535: Search customisation based on user profiles and personalisation
    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00: Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20: Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28: Databases characterised by their database models, e.g. relational or object models
    • G06F16/284: Relational databases
    • G06F16/288: Entity relationship models

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention provides an entity library generation method and device. The method includes: establishing an entity knowledge graph based on vertical search website data; parsing, from the user's search history, the relevant information of the demand entities involved in the user's historical search behavior; and, using the demand entity as the keyword, generating an entity library by combining the entity knowledge graph with the relevant information of the demand entities involved in the user's historical search behavior. With this method, many types of information content can be obtained, and the relevant information of the user's demand entities can be determined accurately by analyzing the user's historical search behavior. An entity library that meets the user's personalized entity demands is built from the user's actual demand entities in combination with the entity knowledge graph, so that when the user performs an entity search, relevant information that meets the user's search demand can be provided quickly and accurately.

Description

Entity library generating method and device
Technical field
The present invention relates to the field of Internet technology, and in particular to an entity library generation method and device.
Background
With the continuous development of Internet technology, more and more people transmit and exchange information over the Internet, so the powerful information bases built on the Internet allow people to obtain all kinds of information. Compared with traditional keyword search, entity search is currently a relatively new form of search.
However, the same entity may be ambiguous, and existing approaches only identify a single demand for a single entity. When a user searches for information, the user's demand cannot be identified accurately, so accurate search results cannot be provided.
Summary of the invention
The present invention provides an entity library generation method and device that overcome the above problem or at least partially solve it.
According to one aspect of the present invention, an entity library generation method is provided, including:
Establishing an entity knowledge graph based on vertical search website data;
Parsing, from the user's search history, the relevant information of the demand entities involved in the user's historical search behavior;
Using the demand entity as the keyword, generating an entity library by combining the entity knowledge graph with the relevant information of the demand entities involved in the user's historical search behavior.
Optionally, parsing the relevant information of the demand entities involved in the user's historical search behavior from the user's search history includes:
Obtaining the user's search logs and/or click logs, performing entity linking and/or topic classification on the user's historical click information based on those logs, and parsing out the relevant information of the demand entities involved in the historical click information.
Optionally, generating the entity library, using the demand entity as the keyword and combining the entity knowledge graph with the relevant information of the demand entities involved in the user's historical search behavior, includes:
Using the demand entity as the keyword, establishing a user demand click model by combining the entity knowledge graph with the demand entities involved in the user's historical search behavior and/or the demand types corresponding to those demand entities, and generating an entity library that includes the user demand click model.
Optionally, establishing the user demand click model and generating the entity library that includes it includes:
Using the demand entity as the keyword, aggregating the demand entities involved in the user's historical search behavior and/or the demand types corresponding to those demand entities to generate an entity demand queue;
Calculating a demand intensity from the user's historical click information, adding the demand intensity to the entity demand queue, and generating the entity library that includes the user demand click model.
Optionally, after the user demand click model is established and the entity library that includes it is generated, the method further includes:
Updating the user demand click model at a preset period.
Optionally, updating the user demand click model at a preset period includes:
Establishing a real-time click feedback model by an online learning method, monitoring changes in the user's entity demand at the preset period, and adjusting the ordering of the entity demand queue in the user demand click model through an online feedback mechanism.
According to another aspect of the present invention, an entity library generation device is also provided, including:
An establishing module, configured to establish an entity knowledge graph based on vertical search website data;
A parsing module, configured to parse, from the user's search history, the relevant information of the demand entities involved in the user's historical search behavior;
An entity library generation module, configured to use the demand entity as the keyword and generate an entity library by combining the entity knowledge graph with the relevant information of the demand entities involved in the user's historical search behavior.
Optionally, the parsing module is further configured to:
Obtain the user's search logs and/or click logs, perform entity linking and/or topic classification on the user's historical click information based on those logs, and parse out the relevant information of the demand entities involved in the historical click information.
Optionally, the entity library generation module is further configured to:
Use the demand entity as the keyword, establish a user demand click model by combining the entity knowledge graph with the demand entities involved in the user's historical search behavior and/or the demand types corresponding to those demand entities, and generate an entity library that includes the user demand click model.
Optionally, the entity library generation module is further configured to:
Use the demand entity as the keyword and aggregate the demand entities involved in the user's historical search behavior and/or the demand types corresponding to those demand entities to generate an entity demand queue; calculate a demand intensity from the user's historical click information, add the demand intensity to the entity demand queue, and generate the entity library that includes the user demand click model.
Optionally, the device further includes:
An update module, configured to update the user demand click model at a preset period.
Optionally, the update module is further configured to:
Establish a real-time click feedback model by an online learning method, monitor changes in the user's entity demand at the preset period, and adjust the ordering of the entity demand queue in the user demand click model through an online feedback mechanism.
According to a further aspect of the present invention, a computer program is also provided, including computer-readable code which, when run on a computing device, causes the computing device to perform any of the entity library generation methods described above.
According to a further aspect of the present invention, a computer-readable medium is also provided, in which the above computer program is stored.
The present invention provides an entity library generation method and device. With this method, an entity knowledge graph is first established from vertical search website data, the relevant information of the demand entities involved is derived by analyzing the user's historical search behavior, and the entity library is then generated by combining that information with the entity knowledge graph. Because the entity knowledge graph is built from information obtained from vertical search websites, many types of information content are available, and analyzing the user's historical search behavior allows the relevant information of the user's demand entities to be determined accurately. The entity library, built from the user's actual demand entities in combination with the entity knowledge graph, meets the user's personalized entity demands, so that when the user performs an entity search, relevant information that meets the user's search demand can be provided quickly and accurately.
The above is only an overview of the technical solution of the present invention. So that the technical means of the invention can be understood more clearly and implemented according to the specification, and so that the above and other objects, features and advantages of the invention are more apparent, specific embodiments of the invention are set out below.
From the following detailed description of specific embodiments of the invention with reference to the accompanying drawings, those skilled in the art will better understand the above and other objects, advantages and features of the invention.
Brief description of the drawings
By reading the following detailed description of the preferred embodiments, various other advantages and benefits will become clear to those of ordinary skill in the art. The drawings are only for the purpose of illustrating the preferred embodiments and are not to be considered limiting of the invention. Throughout the drawings, the same reference numerals denote the same parts. In the drawings:
Fig. 1 is a schematic flow chart of an entity library generation method according to an embodiment of the present invention;
Fig. 2 is a schematic diagram of entity library generation according to an embodiment of the present invention;
Fig. 3 is a schematic diagram of the arrangement of an entity demand queue according to an embodiment of the present invention;
Fig. 4 is a schematic flow chart of a method of entity search based on an entity library according to an embodiment of the present invention;
Fig. 5 is a schematic structural diagram of an entity library generation device according to an embodiment of the present invention;
Fig. 6 is a schematic structural diagram of an entity library generation device according to a preferred embodiment of the present invention;
Fig. 7 is a schematic structural diagram of a device for entity search based on an entity library according to an embodiment of the present invention;
Fig. 8 is a schematic structural diagram of a device for entity search based on an entity library according to a preferred embodiment of the present invention;
Fig. 9 is a block diagram of a computing device for performing the entity library generation method and/or the method of entity search based on an entity library according to an embodiment of the present invention;
Fig. 10 is a schematic diagram of a storage unit for holding or carrying program code that implements the entity library generation method and/or the method of entity search based on an entity library according to an embodiment of the present invention.
Detailed description
Exemplary embodiments of the present disclosure are described in more detail below with reference to the accompanying drawings. Although the drawings show exemplary embodiments of the disclosure, it should be understood that the disclosure may be implemented in various forms and should not be limited by the embodiments set forth here. Rather, these embodiments are provided so that the disclosure will be understood more thoroughly and its scope conveyed fully to those skilled in the art.
Fig. 1 is a schematic flow chart of an entity library generation method according to an embodiment of the present invention. As shown in Fig. 1, the method includes:
Step S102: establish an entity knowledge graph based on vertical search website data;
Step S104: parse, from the user's search history, the relevant information of the demand entities involved in the user's historical search behavior;
Step S106: using the demand entity as the keyword, generate an entity library by combining the entity knowledge graph with the relevant information of the demand entities involved in the user's historical search behavior.
With the entity library generation method provided by the present invention, an entity knowledge graph is first established from vertical search website data, the relevant information of the demand entities involved is derived by analyzing the user's historical search behavior, and the entity library is then generated by combining that information with the entity knowledge graph. Because the entity knowledge graph is built from information obtained from vertical search websites, many types of information content are available, and analyzing the user's historical search behavior allows the relevant information of the user's demand entities to be determined accurately. The entity library, built from the user's actual demand entities in combination with the entity knowledge graph, meets the user's personalized entity demands. When the user performs an entity search, the entity the user is looking for is analyzed on the basis of an understanding of the user's query intent, and the result entities are summarized, organized and presented to the user in a special display format. The user no longer needs to find and summarize knowledge from the search results, which reduces the cost of obtaining information and improves the user experience.
The entity knowledge graph includes various entities and concepts and the associations between them. When establishing the entity knowledge graph, concepts, entities, attributes and relations can be extracted by monitoring and mining resource data such as encyclopedia data, core lexicons, vertical websites or search websites. Building the graph on this resource data achieves temporal fusion of knowledge and fusion of multiple data sources, yielding an entity knowledge graph backed by massive resource data. The entity knowledge graph can be updated continuously: as the resource data changes in real time, the graph is updated in real time as well.
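As an illustration of the multi-source fusion described above, the following is a minimal sketch, not the patented implementation. The `KnowledgeGraph` class, its field names and the sample records are all hypothetical; the patent does not specify a storage format or a conflict-resolution rule, so the "later source overwrites attribute" behavior here is an assumption.

```python
from collections import defaultdict
from dataclasses import dataclass, field


@dataclass
class KnowledgeGraph:
    """Toy entity knowledge graph holding concepts, attributes and relations."""
    concepts: dict = field(default_factory=dict)                          # entity -> concept
    attributes: dict = field(default_factory=lambda: defaultdict(dict))  # entity -> {name: value}
    relations: dict = field(default_factory=lambda: defaultdict(set))    # entity -> {(relation, target)}

    def merge(self, record: dict) -> None:
        """Fuse one extracted record into the graph (assumed rule: later values win)."""
        entity = record["entity"]
        concept = record.get("concept")
        if concept is not None or entity not in self.concepts:
            self.concepts[entity] = concept
        self.attributes[entity].update(record.get("attributes", {}))
        for relation, target in record.get("relations", []):
            self.relations[entity].add((relation, target))


# Hypothetical records, as if extracted from an encyclopedia and a vertical music site.
records = [
    {"entity": "Lee (pop singer)", "concept": "singer",
     "attributes": {"genre": "pop"}, "relations": [("sings", "Song A")]},
    {"entity": "Lee (pop singer)", "attributes": {"label": "Label X"}},
    {"entity": "Lee (tennis player)", "concept": "athlete",
     "attributes": {"sport": "tennis"}},
]

graph = KnowledgeGraph()
for record in records:
    graph.merge(record)
```

Calling `merge` again as new records arrive is one simple way to model the continuous updating mentioned in the text.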
The entity knowledge graph contains massive data, whereas the entities each user needs are different. To determine a user's demand entities, the user's search history can be analyzed. Preferably, step S104 may further include: obtaining the user's search logs and/or click logs, performing entity linking and/or topic classification on the user's historical click information based on those logs, and parsing out the relevant information of the demand entities involved in the user's historical click information.
For example, when a user enters "Lee", the entity sought may be "Lee the singer" or "Lee the athlete". If the user's search logs and click logs show that the user often listens to Lee's songs, the user's demand entity can be determined to be "Lee the singer".
The user's search history is data that reflects the user's search habits. By analyzing the user's search logs and click logs and performing entity linking and/or topic classification on the documents the user clicked, the relevant information of the user's demand entities can be parsed out accurately. Performing entity linking on a clicked document means extracting the entity from the title of the clicked document and linking it to the corresponding entity in the entity library and to the demand corresponding to that entity. The relevant information of the user's demand entities may include the user's demand entities and demand types. When the user clicks "Lee (pop singer)_encyclopedia", the click can be linked to "Lee (pop singer)" in the entity library, and the corresponding demand is the encyclopedia demand. The document "Lee_listen to songs online" is likewise linked to "Lee (pop singer)" in the entity library, and the corresponding demand is the music demand.
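The title-to-entity linking described above can be sketched as follows. This is only a rule-based illustration under stated assumptions: the keyword tables `ENTITY_HINTS` and `DEMAND_HINTS` and the function `link_title` are hypothetical, and the patent does not say how its entity linker or topic classifier is built.

```python
# Hypothetical keyword tables; a production linker would likely be learned from data.
ENTITY_HINTS = {
    "Lee (pop singer)": ("singer", "song", "album"),
    "Lee (tennis player)": ("tennis", "match", "sports"),
}
DEMAND_HINTS = {
    "encyclopedia": ("encyclopedia",),
    "music": ("song", "album", "listen"),
    "picture": ("picture",),
    "news": ("news", "match"),
}


def link_title(title: str):
    """Map a clicked document title to (demand entity, demand type).

    Either element is None when no hint matches the title.
    """
    text = title.lower()
    entity = next((name for name, hints in ENTITY_HINTS.items()
                   if any(hint in text for hint in hints)), None)
    demand = next((name for name, hints in DEMAND_HINTS.items()
                   if any(hint in text for hint in hints)), None)
    return entity, demand


encyclopedia_click = link_title("Lee (pop singer)_encyclopedia")
music_click = link_title("Lee_listen to songs online")
```

Both example titles resolve to "Lee (pop singer)", with the encyclopedia demand and the music demand respectively, mirroring the two cases in the text.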
Referring to step S106 above, the demand entity is used as the keyword and the entity library is generated by combining the entity knowledge graph with the relevant information of the demand entities involved in the user's historical search behavior. Preferably, when generating the entity library, a user demand click model may be established, with the demand entity as the keyword, by combining the entity knowledge graph with information such as the demand entities involved in the user's historical search behavior, the demand types corresponding to those entities, and the user's click positions; an entity library that includes the user demand click model is then generated. When a search request from the user is received, the user demand click model in the entity library can be used directly to quickly determine, and link to, relevant information that meets the user's demand.
Optionally, when establishing the user demand click model, the demand entities involved in the user's historical search behavior (such as search and/or click logs) and/or the demand types corresponding to those demand entities may be aggregated, with the demand entity as the keyword, to generate the entity demand queue.
Fig. 2 is a schematic diagram of generating the entity library after entity linking is performed on the documents clicked by the user, according to the user's search and click logs. In Fig. 2, the entity may be the search term "Lee" entered by the user. The search and click logs involving "Lee" include: "Lee (pop singer)_encyclopedia", with 500 clicks; "Lee_listen to songs online", with 400 clicks; "Lee (tennis player)_encyclopedia", with 300 clicks; "Lee_pictures", with 300 clicks; "Lee_complete songs of Lee_albums", with 100 clicks; and "Lee_sports star_match", with 50 clicks. After the user's search and click logs are obtained, entity linking is performed on the demand entities in them, and the demand entities and their corresponding demand types are aggregated. The demand entities in Fig. 2 include "Lee (pop singer)" and "Lee (tennis player)". During aggregation, "Lee (pop singer)" can be grouped with demand types such as "encyclopedia", "music" and "news", and "Lee (tennis player)" with demand types such as "encyclopedia", "picture", "news", "video" and "microblog". Correspondingly, a demand intensity can be calculated for each demand entity and demand type from the user's search and click counts. As Fig. 2 shows, the demand intensity of "Lee (pop singer)_encyclopedia" is 500; that of "Lee_listen to songs online" is 500; that of "Lee (tennis player)_encyclopedia" is 300; that of "Lee (tennis player)_picture" is 200; and that of "Lee (tennis player)_news" is 50. Fig. 2 only illustrates schematically one way of aggregating the user's demand entities and demand types from search and click logs; in practical applications, other ways of generating the entity demand queue and the user demand click model may be used, which are not described here.
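The aggregation step can be sketched as a sum of click counts per (demand entity, demand type) pair followed by a sort. This is a minimal sketch assuming that demand intensity equals the summed click count, which is consistent with the music figure above (400 + 100 = 500) but is not stated as a formula in the text. It uses a subset of the click counts quoted above; the mapping of "Lee_sports star_match" to the news demand is also an assumption, and `build_demand_queue` is a hypothetical name.

```python
from collections import defaultdict

# (demand entity, demand type, clicks) after entity linking.
linked_clicks = [
    ("Lee (pop singer)", "encyclopedia", 500),
    ("Lee (pop singer)", "music", 400),          # "Lee_listen to songs online"
    ("Lee (tennis player)", "encyclopedia", 300),
    ("Lee (pop singer)", "music", 100),          # "Lee_complete songs of Lee_albums"
    ("Lee (tennis player)", "news", 50),         # assumed mapping of "Lee_sports star_match"
]


def build_demand_queue(clicks):
    """Aggregate clicks per (entity, demand type) and rank by demand intensity."""
    intensity = defaultdict(int)
    for entity, demand_type, count in clicks:
        intensity[(entity, demand_type)] += count
    # sorted() is stable, so pairs with equal intensity keep first-seen order.
    return sorted(intensity.items(), key=lambda item: -item[1])


queue = build_demand_queue(linked_clicks)
```

The resulting queue lists the pop singer's encyclopedia and music demands first (intensity 500 each), then the tennis player's encyclopedia demand (300) and news demand (50).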
Fig. 3 shows the arrangement of the entity demand queue, which may include demand entities and demand types. In Fig. 3, the demand entities may include "Lee (pop singer)" and "Lee (tennis player)", and the corresponding demand types may be "encyclopedia", "music", "picture", "news" or others. Further, the demand intensity of each demand entity and demand type can be calculated from the user's historical click information and added to the entity demand queue. As shown in Fig. 3, calculated from the user's historical click information, the demand intensity of Lee (pop singer) encyclopedia is 500, that of Lee (pop singer) music is 500, that of Lee (tennis player) encyclopedia is 300, that of Lee (tennis player) picture is 200, and that of Lee (tennis player) news is 50. In practical applications, the demand intensity may be calculated from the search and click logs of a single user or from the combined search and click logs of most users, and can be adjusted to suit the situation. The entity demand queue can be ordered by demand intensity to meet the user's demand during entity search. The demand entities and demand types are of course not limited to these and may include other entities and related types; the present invention imposes no limitation.
Further, after the user demand click model is generated, it can be updated at a preset period. The update may be periodic or in real time. Because a user's search behavior may occur at any moment, updating the user demand click model periodically or in real time based on that behavior tracks changes in the user's search demand, so that search results can be provided to the user more effectively.
Preferably, when updating the user demand click model, a real-time click feedback model can be established by an online learning method, changes in the user's entity demand are monitored at the preset period, and the ordering of the entity demand queue in the user demand click model is adjusted through an online feedback mechanism. Suppose news about the tennis player Lee suddenly breaks and user clicks on the demand "Lee (tennis player), news" surge; the online feedback mechanism can then move the demand "Lee (tennis player), news" up in the ordering. Similarly, if the user's clicks on the demand "Lee (pop singer), music" increase within some period, that demand can be moved up in the ordering.
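One simple way to picture the periodic re-ranking is an exponentially decayed update, sketched below. The decay rule, the `decay` value and the function name `update_queue` are assumptions chosen for illustration; the text only says that an online learning method and an online feedback mechanism are used, without giving a formula.

```python
def update_queue(intensity: dict, recent_clicks: dict, decay: float = 0.5) -> list:
    """Blend stored intensities with clicks from the latest period, then re-rank.

    Assumed rule: new = decay * old + clicks observed in the period.
    """
    for key in set(intensity) | set(recent_clicks):
        intensity[key] = decay * intensity.get(key, 0.0) + recent_clicks.get(key, 0)
    return sorted(intensity, key=intensity.get, reverse=True)


# Stored intensities for the tennis player, as in the Fig. 3 walkthrough.
intensity = {
    ("Lee (tennis player)", "encyclopedia"): 300,
    ("Lee (tennis player)", "picture"): 200,
    ("Lee (tennis player)", "news"): 50,
}
# Hypothetical surge of news clicks in the latest period.
recent = {("Lee (tennis player)", "news"): 400}

ranking = update_queue(intensity, recent)
```

After the update the news demand has intensity 0.5 * 50 + 400 = 425 and moves to the head of the queue, ahead of encyclopedia (150) and picture (100).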
In the entity library generation method provided by this embodiment of the present invention, the entity library is generated from the entity knowledge graph and the relevant information of the demand entities involved in the user's historical search behavior, so the user's search intent can be identified quickly and efficiently when the user performs an entity search. The entity library can also be updated promptly as the user's demand changes, to meet the user's search demand in different periods.
Fig. 4 shows a method of entity search based on the entity library according to an embodiment of the present invention. As shown in Fig. 4, the method includes:
Step S402: receive a query from the user and determine the entity word frequency of the query;
Step S404: based on the entity word frequency of the query, determine the high-frequency query with the highest similarity to the query;
Step S406: using that high-frequency query as the entity word, search the entity library for relevant information.
In the entity search method provided by this embodiment, multiple entity search intents in a query from the user can be recognized. The high-frequency query most similar to the entity word in the query is determined, and, with that high-frequency query as the entity word, relevant information is searched for in the entity library, which holds massive information and the user demand click model, so that the user is given search results that match the demand intensity of the search intent.
In this embodiment, the queries initiated by users can be counted in advance to determine how frequently users initiate the same query, and a preset threshold is set against which subsequently received queries are judged. If the entity word frequency of a query is greater than or equal to the preset threshold, the query is determined to be a high-frequency query; the query itself can then be used directly as the entity word to search the entity library for relevant information. If the entity word frequency of the query is less than the preset threshold, the query is determined to be a low-frequency query; in that case, the entity in the low-frequency query is used as an index to find the high-frequency query most similar to the low-frequency query, and that high-frequency query is used as the entity word to search the entity library for relevant information.
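The threshold test above amounts to a two-way routing decision, sketched here. The counts, the threshold of 100 and the names `route_query` and `find_similar_high_freq` are hypothetical; the lookup of a similar high-frequency query is passed in as a callable because the text treats it as a separate step.

```python
from collections import Counter


def route_query(query, query_counts, threshold, find_similar_high_freq):
    """Return the entity word to search the entity library with.

    High-frequency queries are used as-is; low-frequency queries are mapped
    to the most similar high-frequency query by the supplied callable.
    """
    if query_counts[query] >= threshold:   # Counter yields 0 for unseen queries
        return query
    return find_similar_high_freq(query)


# Hypothetical query statistics collected in advance.
query_counts = Counter({"Lee": 1200, "relevant information of Lee": 3})
THRESHOLD = 100  # assumed preset threshold

high = route_query("Lee", query_counts, THRESHOLD, lambda q: "Lee")
low = route_query("relevant information of Lee", query_counts, THRESHOLD, lambda q: "Lee")
```

Both calls return "Lee": the first because the query is itself high-frequency, the second because the low-frequency query is redirected by the stand-in lookup.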
In the scheme provided by this embodiment, a high-frequency query can itself serve as the entity word, so a received user query that is high-frequency can be used directly to search the entity library for relevant information. For a low-frequency query, the entity library may not directly contain an entity corresponding to the query itself. In that case, the entity in the low-frequency query is analyzed first and used as an index to find, by a relevant algorithm, the query most similar to the low-frequency query; that query is then used as the entity word to search the entity library for relevant information. Whether the user's query is a high-frequency or a low-frequency query, relevant information can therefore be looked up in the entity library quickly, and results corresponding to the user's query can be provided accurately.
Preferably, the user demand click model in the entity library includes the entity demand queue. Therefore, when determining the high-frequency query most similar to a low-frequency query, an entity-query inverted index can be built over the high-frequency queries whose entity demand queues have already been calculated. After the entity in the low-frequency query is recognized, the related query list is found through the entity-query inverted index, the similarity between the low-frequency query and each query in the list is calculated by SimRank or deep-learning methods, and the high-frequency query most similar to the low-frequency query is then used as the entity word to search the entity library for relevant information.
For example, if the query received from the user is "Lee", analyzing the frequency of the entity word "Lee" in this query determines that it is a high-frequency query, and the entity library can be searched for related information directly with "Lee". The related entity demand queue has already been computed in the user demand click model in the entity library, so it can be presented to the user directly as the entity search result.
If the query received from the user is "related information about Lee", analyzing the frequency of the entity word "related information about Lee" determines that it is a low-frequency query. If "related information about Lee" were used directly as the entity word, the related information might not be obtained accurately from the entity library. In this case, an entity-query inverted index can be built over the high-frequency queries whose entity demand queues have already been computed.
An inverted index arises from the practical need to look up records by attribute value. Each entry in such an index contains an attribute value and the addresses of all records that have that value. Because the position of a record is determined from the attribute value, rather than the attribute value from the record, it is called an inverted index. The entity-query inverted index takes the high-frequency queries whose entity demand queues have been computed and obtains the entity corresponding to each of them, so that an entity can be linked to its high-frequency queries. When the received query is "related information about Lee", the entity "Lee" can be recognized in the query, the list of related queries is found through the entity-query inverted index, and the similarity between "related information about Lee" and each query in the list is computed by SimRank or deep-learning-related techniques. If the query in the list with the highest similarity to "related information about Lee", or with similarity above a preset threshold, is "Lee", then "Lee" is used as the entity word to search the entity library for related information. In other words, the query "related information about Lee" inherits the entity demand queue of the high-frequency query "Lee".
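The lookup just described can be sketched as follows. This is a minimal sketch, not the patented implementation: entity recognition is reduced to substring matching against an assumed list `known_entities`, and a character-bigram Jaccard score stands in for the SimRank or deep-learning similarity named in the description. All function names are hypothetical.

```python
from collections import defaultdict


def find_entities(query, known_entities):
    """Naive entity recognition (assumption): known entity strings found in the query."""
    return [e for e in known_entities if e in query]


def build_inverted_index(high_freq_queries, known_entities):
    """Map each entity to the high-frequency queries that contain it."""
    index = defaultdict(list)
    for q in high_freq_queries:
        for e in find_entities(q, known_entities):
            index[e].append(q)
    return index


def bigrams(text):
    """Set of lowercase character bigrams; falls back to the whole string if too short."""
    t = text.lower()
    return {t[i:i + 2] for i in range(len(t) - 1)} or {t}


def similarity(a, b):
    """Jaccard overlap of character bigrams (a stand-in for SimRank, not SimRank itself)."""
    x, y = bigrams(a), bigrams(b)
    return len(x & y) / len(x | y)


def resolve_low_frequency(query, index, known_entities, min_sim=0.0):
    """Return the most similar high-frequency query sharing an entity, or None."""
    candidates = []
    for e in find_entities(query, known_entities):
        candidates.extend(index.get(e, []))
    if not candidates:
        return None
    best = max(candidates, key=lambda c: similarity(query, c))
    return best if similarity(query, best) >= min_sim else None


known_entities = ["Lee"]
index = build_inverted_index(["Lee", "Lee movies"], known_entities)
entity_word = resolve_low_frequency("related information about Lee", index, known_entities)
```

In this toy setup the low-frequency query resolves to "Lee", so it would inherit that query's entity demand queue. A production system would replace `similarity` with whatever click-graph or learned model is actually used.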
It should be noted that, in practice, all of the optional implementations above may be combined in any manner to form optional embodiments of the present invention, which are not described one by one here.
Based on the entity library generation method provided by the foregoing embodiments and on the same inventive concept, an embodiment of the present invention further provides an entity library generating device. Fig. 5 is a schematic structural diagram of the entity library generating device according to an embodiment of the present invention. As shown in Fig. 5, the device may include:
an establishing module 510, configured to establish an entity knowledge graph based on vertical-search website data;
a parsing module 520, configured to parse, from the user's search history records, the related information of the demand entities involved in the user's historical search behavior;
an entity library generating module 530, configured to generate the entity library using the demand entity as the keyword, in combination with the entity knowledge graph and the related information of the demand entities involved in the user's historical search behavior.
In a preferred embodiment of the invention, the parsing module 520 is further configured to:
obtain the user's search logs and/or click logs, perform entity linking and/or topic classification on the user's historical click information based on the search logs and/or click logs, and parse out the related information of the demand entities involved in the historical click information.
In a preferred embodiment of the invention, the entity library generating module 530 may be further configured to:
using the demand entity as the keyword, establish a user demand click model in combination with the entity knowledge graph and the demand entities involved in the user's historical search behavior and/or the demand types corresponding to those demand entities, and generate an entity library that includes the user demand click model.
In a preferred embodiment of the invention, the entity library generating module 530 may be further configured to:
using the demand entity as the keyword, aggregate the demand entities involved in the user's historical search behavior and/or the demand types corresponding to those demand entities to generate an entity demand queue; calculate the demand intensity from the user's historical click information, add the demand intensity to the entity demand queue, and generate an entity library that includes the user demand click model.
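As an illustration of the aggregation step, the sketch below groups click records by demand entity and attaches a demand intensity to each demand type. The description does not give a formula for demand intensity; taking it as the share of the entity's clicks that fall on a demand type is purely an assumption here, as are the record layout and the function name.

```python
from collections import Counter, defaultdict


def build_demand_queues(click_records):
    """Aggregate (entity, demand_type) click records into per-entity demand queues.

    Each queue is a list of (demand_type, intensity) pairs, strongest first.
    Intensity is assumed to be the fraction of the entity's clicks on that type.
    """
    counts = defaultdict(Counter)
    for entity, demand_type in click_records:
        counts[entity][demand_type] += 1

    queues = {}
    for entity, type_counts in counts.items():
        total = sum(type_counts.values())
        queue = [(d, n / total) for d, n in type_counts.items()]
        # Sort by descending intensity, breaking ties by demand type name.
        queue.sort(key=lambda item: (-item[1], item[0]))
        queues[entity] = queue
    return queues


# Hypothetical click log keyed by demand entity.
clicks = [("Lee", "movies"), ("Lee", "movies"), ("Lee", "news"), ("Lee", "movies")]
queues = build_demand_queues(clicks)
```

Here `queues["Lee"]` ranks "movies" ahead of "news", which is the kind of ordered demand list that could be returned directly for a high-frequency entity query.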
In a preferred embodiment of the invention, as shown in Fig. 6, the above device may further include:
an update module 540, configured to update the user demand click model at a preset period.
In a preferred embodiment of the invention, the update module 540 may be further configured to:
establish a real-time click feedback model by an online learning method, monitor changes in the user's entity demands at the preset period, and adjust the ordering of the entity demand queues in the user demand click model through an online feedback mechanism.
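One simple way to picture the periodic re-ranking is an exponential moving average that blends the stored demand intensities with the click shares observed in the latest period. The description only names "an online learning method" and "an online feedback mechanism", so the update rule, the `learning_rate` parameter, and the function name below are assumptions chosen for illustration.

```python
def update_queue(queue, recent_clicks, learning_rate=0.5):
    """Blend stored intensities with the latest period's click shares and re-rank.

    queue: list of (demand_type, intensity) pairs.
    recent_clicks: dict mapping demand_type to clicks observed in the latest period.
    """
    total = sum(recent_clicks.values())
    old = dict(queue)
    updated = []
    for demand_type in set(old) | set(recent_clicks):
        previous = old.get(demand_type, 0.0)
        # With no new clicks at all, keep the previous intensity unchanged.
        observed = recent_clicks.get(demand_type, 0) / total if total else previous
        blended = (1 - learning_rate) * previous + learning_rate * observed
        updated.append((demand_type, blended))
    # Strongest demand first; ties broken by demand type name.
    updated.sort(key=lambda item: (-item[1], item[0]))
    return updated


# Hypothetical period in which every recent click went to "news".
new_queue = update_queue([("movies", 0.75), ("news", 0.25)], {"news": 4})
```

After this update "news" moves ahead of "movies", showing how a shift in user interest during one period can reorder the entity demand queue.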
An embodiment of the present invention further provides a computer program comprising computer-readable code which, when run on a computing device, causes the computing device to perform the entity library generation method of any of the above.
An embodiment of the present invention further provides a computer-readable medium in which the above computer program is stored.
Fig. 7 is a schematic structural diagram of a device for performing entity search based on an entity library according to an embodiment of the present invention. As shown in Fig. 7, the device may include:
a receiving module 710, configured to receive a query from a user and determine the entity word frequency of the query;
a determining module 720, configured to determine, based on the entity word frequency of the query, the high-frequency query most similar to the query;
a search module 730, configured to use the high-frequency query as the entity word and search the entity library for related information.
In a preferred embodiment of the invention, as shown in Fig. 8, the determining module 720 may further include:
a first determining unit 721, configured to determine that the query is a high-frequency query if the entity word frequency of the query is higher than a preset threshold;
a second determining unit 722, configured to determine that the query is a low-frequency query if the entity word frequency of the query is lower than the preset threshold, and to use the entity in the low-frequency query as an index to find the high-frequency query most similar to the low-frequency query.
In a preferred embodiment of the invention, the second determining unit 722 may be further configured to:
build an entity-query inverted index over the high-frequency queries whose entity demand queues have been computed; identify the entity in the low-frequency query and find the list of related queries through the entity-query inverted index; compute the similarity between the low-frequency query and each query in the list, and find the high-frequency query most similar to the low-frequency query.
In a preferred embodiment of the invention, as shown in Fig. 8, the search module 730 may further include:
a linking unit 731, configured to perform entity linking on the entity word, linking the entity word to the corresponding demand entity in the entity library and/or to the demand type corresponding to that demand entity.
In a preferred embodiment of the invention, the search module 730 may be further configured to:
generate the entity library in the following manner: establish an entity knowledge graph based on vertical-search website data; parse, from the user's search history records, the related information of the demand entities involved in the user's historical search behavior; and, using the demand entity as the keyword, generate the entity library in combination with the entity knowledge graph and the related information of the demand entities involved in the user's historical search behavior.
An embodiment of the present invention further provides a computer program comprising computer-readable code which, when run on a computing device, causes the computing device to perform the method of entity search based on an entity library of any of the above.
An embodiment of the present invention further provides a computer-readable medium in which the above computer program is stored.
Embodiments of the present invention provide a method and device for generating an entity library. In the entity library generation method provided by the embodiments, an entity knowledge graph is established from information obtained from vertical-search websites, so that many types of information content can be obtained, and the related information of the user's demand entities can be determined accurately by analyzing the user's historical search behavior. Based on the user's actual demand entities and in combination with the entity knowledge graph, an entity library that meets the user's personalized entity demands is built, so that when the user performs an entity search, related information matching the user's search demand can be provided quickly and accurately. Further, embodiments of the present invention also provide a method and device for performing entity search based on the entity library: after the entity word frequency of a query received from the user is determined, the query most similar to the received query is determined by analyzing that frequency, and that query is then used as the entity word to search the entity library for related information, so that entity search results can be provided to the user quickly.
Numerous specific details are set forth in the specification provided here. It is to be understood, however, that embodiments of the present invention may be practiced without these specific details. In some instances, well-known methods, structures and techniques are not shown in detail so as not to obscure the understanding of this description.
Similarly, it should be understood that, in order to streamline the disclosure and aid understanding of one or more of the various inventive aspects, features of the invention are sometimes grouped together in a single embodiment, figure, or description thereof in the above description of exemplary embodiments. However, this method of disclosure should not be interpreted as reflecting an intention that the claimed invention requires more features than are expressly recited in each claim. Rather, as the following claims reflect, inventive aspects lie in less than all features of a single foregoing disclosed embodiment. Thus, the claims following the detailed description are hereby expressly incorporated into the detailed description, with each claim standing on its own as a separate embodiment of the present invention.
Those skilled in the art will appreciate that the modules in the device of an embodiment may be adaptively changed and arranged in one or more devices different from that embodiment. The modules or units or components of the embodiments may be combined into one module or unit or component, and may furthermore be divided into multiple sub-modules or sub-units or sub-components. Except where at least some of such features and/or processes or units are mutually exclusive, all features disclosed in this specification (including the accompanying claims, abstract and drawings) and all processes or units of any method or device so disclosed may be combined in any combination. Unless expressly stated otherwise, each feature disclosed in this specification (including the accompanying claims, abstract and drawings) may be replaced by an alternative feature serving the same, an equivalent, or a similar purpose.
Furthermore, those skilled in the art will appreciate that, although some embodiments described herein include some features that are included in other embodiments but not others, combinations of features of different embodiments are meant to be within the scope of the invention and to form different embodiments. For example, in the claims, any of the claimed embodiments may be used in any combination.
The various component embodiments of the present invention may be implemented in hardware, in software modules running on one or more processors, or in a combination thereof. Those skilled in the art will understand that a microprocessor or a digital signal processor (DSP) may be used in practice to implement some or all of the functions of some or all of the components of the entity library generating device and/or the device for entity search based on an entity library according to embodiments of the present invention. The present invention may also be implemented as a device or apparatus program (for example, a computer program and a computer program product) for performing part or all of the methods described herein. Such a program implementing the present invention may be stored on a computer-readable medium, or may take the form of one or more signals. Such signals may be downloaded from an Internet website, provided on a carrier signal, or provided in any other form.
For example, Fig. 9 shows a block diagram of a computing device that can implement the entity library generation method and/or the method of entity search based on an entity library according to the present invention. The computing device conventionally includes a processor 910 and a computer program product or computer-readable medium in the form of a memory 920. The memory 920 may be an electronic memory such as flash memory, EEPROM (electrically erasable programmable read-only memory), EPROM, a hard disk, or ROM. The memory 920 has a storage space 930 for program code 931 for performing any of the method steps in the above methods. For example, the storage space 930 for program code may store individual program codes 931 for implementing the various steps of the above methods. These program codes may be read from, or written to, one or more computer program products. These computer program products include a program code carrier such as a hard disk, a compact disc (CD), a memory card, or a floppy disk. Such a computer program product is usually a portable or fixed storage unit as shown in Fig. 10. The storage unit may have storage segments, storage space and the like arranged similarly to the memory 920 in the computing device of Fig. 9. The program code may, for example, be compressed in a suitable form. Typically, the storage unit includes computer-readable code 931' for performing the method steps according to the present invention, that is, code that can be read by a processor such as the processor 910, and that, when run by a computing device, causes the computing device to perform the steps of the methods described above.
It should be noted that the above embodiments illustrate rather than limit the invention, and that those skilled in the art can design alternative embodiments without departing from the scope of the appended claims. In the claims, any reference sign placed between parentheses shall not be construed as limiting the claim. The word "comprising" does not exclude the presence of elements or steps not listed in a claim. The word "a" or "an" preceding an element does not exclude the presence of a plurality of such elements. The present invention may be implemented by means of hardware comprising several distinct elements and by means of a suitably programmed computer. In a unit claim enumerating several devices, several of these devices may be embodied by one and the same item of hardware. The use of the words first, second and third does not indicate any order. These words may be interpreted as names.
Thus, those skilled in the art will recognize that, although multiple exemplary embodiments of the present invention have been shown and described in detail herein, many other variations or modifications consistent with the principles of the invention can still be directly determined or derived from the present disclosure without departing from the spirit and scope of the invention. Therefore, the scope of the present invention should be understood and deemed to cover all such other variations or modifications.

Claims (10)

1. An entity library generation method, comprising:
establishing an entity knowledge graph based on vertical-search website data;
parsing, from a user's search history records, related information of demand entities involved in the user's historical search behavior;
using the demand entity as a keyword, generating an entity library in combination with the entity knowledge graph and the related information of the demand entities involved in the user's historical search behavior.
2. The method according to claim 1, wherein parsing, from the user's search history records, the related information of the demand entities involved in the user's historical search behavior comprises:
obtaining the user's search logs and/or click logs, performing entity linking and/or topic classification on the user's historical click information based on the search logs and/or click logs, and parsing out the related information of the demand entities involved in the historical click information.
3. The method according to claim 1 or 2, wherein using the demand entity as the keyword and generating the entity library in combination with the entity knowledge graph and the related information of the demand entities involved in the user's historical search behavior comprises:
using the demand entity as the keyword, establishing a user demand click model in combination with the entity knowledge graph and the demand entities involved in the user's historical search behavior and/or the demand types corresponding to the demand entities, and generating an entity library that includes the user demand click model.
4. The method according to any one of claims 1-3, wherein using the demand entity as the keyword, establishing the user demand click model in combination with the entity knowledge graph and the demand entities involved in the user's historical search behavior and/or the demand types corresponding to the demand entities, and generating the entity library that includes the user demand click model comprises:
using the demand entity as the keyword, aggregating the demand entities involved in the user's historical search behavior and/or the demand types corresponding to the demand entities to generate an entity demand queue;
calculating a demand intensity from the user's historical click information, adding the demand intensity to the entity demand queue, and generating the entity library that includes the user demand click model.
5. The method according to any one of claims 1-4, wherein, after using the demand entity as the keyword, establishing the user demand click model in combination with the entity knowledge graph and the demand entities involved in the user's historical search behavior and/or the demand types corresponding to the demand entities, and generating the entity library that includes the user demand click model, the method further comprises:
updating the user demand click model at a preset period.
6. The method according to any one of claims 1-5, wherein updating the user demand click model at the preset period comprises:
establishing a real-time click feedback model by an online learning method, monitoring changes in the user's entity demands at the preset period, and adjusting the ordering of the entity demand queues in the user demand click model through an online feedback mechanism.
7. An entity library generating device, comprising:
an establishing module, configured to establish an entity knowledge graph based on vertical-search website data;
a parsing module, configured to parse, from a user's search history records, related information of demand entities involved in the user's historical search behavior;
an entity library generating module, configured to generate an entity library using the demand entity as a keyword, in combination with the entity knowledge graph and the related information of the demand entities involved in the user's historical search behavior.
8. The device according to claim 7, wherein the parsing module is further configured to:
obtain the user's search logs and/or click logs, perform entity linking and/or topic classification on the user's historical click information based on the search logs and/or click logs, and parse out the related information of the demand entities involved in the historical click information.
9. A computer program comprising computer-readable code which, when run on a computing device, causes the computing device to perform the entity library generation method according to any one of claims 1 to 6.
10. A computer-readable medium storing the computer program according to claim 9.

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710916101.7A CN107807957A (en) 2017-09-30 2017-09-30 entity library generating method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710916101.7A CN107807957A (en) 2017-09-30 2017-09-30 entity library generating method and device

Publications (1)

Publication Number Publication Date
CN107807957A true CN107807957A (en) 2018-03-16

Family

ID=61592704

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710916101.7A Pending CN107807957A (en) 2017-09-30 2017-09-30 entity library generating method and device

Country Status (1)

Country Link
CN (1) CN107807957A (en)

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140297644A1 (en) * 2013-04-01 2014-10-02 Tencent Technology (Shenzhen) Company Limited Knowledge graph mining method and system
CN103995870A (en) * 2014-05-21 2014-08-20 百度在线网络技术(北京)有限公司 Interactive searching method and device
CN105447005A (en) * 2014-08-08 2016-03-30 百度在线网络技术(北京)有限公司 Object push method and device
CN104598556A (en) * 2015-01-04 2015-05-06 百度在线网络技术(北京)有限公司 Search method and search device
CN105404680A (en) * 2015-11-25 2016-03-16 百度在线网络技术(北京)有限公司 Searching recommendation method and apparatus

Cited By (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109145200A (en) * 2018-07-13 2019-01-04 百度在线网络技术(北京)有限公司 Promote method, apparatus, equipment and the computer storage medium showed
US11164210B2 (en) 2018-07-13 2021-11-02 Baidu Online Network Technology (Beijing) Co., Ltd. Method, device and computer storage medium for promotion displaying
CN109766444A (en) * 2018-12-10 2019-05-17 北京百度网讯科技有限公司 The application database generation method and its device of knowledge mapping
CN109635125A (en) * 2018-12-20 2019-04-16 广东小天才科技有限公司 Vocabulary atlas building method and electronic equipment
CN109635125B (en) * 2018-12-20 2021-01-26 广东小天才科技有限公司 Vocabulary atlas building method and electronic equipment
CN110263180B (en) * 2019-06-13 2021-06-04 北京百度网讯科技有限公司 Intention knowledge graph generation method, intention identification method and device
CN110263180A (en) * 2019-06-13 2019-09-20 北京百度网讯科技有限公司 It is intended to knowledge mapping generation method, intension recognizing method and device
CN110674307A (en) * 2019-08-21 2020-01-10 北京邮电大学 Knowledge deduction method and system for knowledge center network
CN110674313A (en) * 2019-09-20 2020-01-10 四川长虹电器股份有限公司 Method for dynamically updating knowledge graph based on user log
CN110674313B (en) * 2019-09-20 2022-12-13 四川长虹电器股份有限公司 Method for dynamically updating knowledge graph based on user log
CN111091006A (en) * 2019-12-20 2020-05-01 北京百度网讯科技有限公司 Entity intention system establishing method, device, equipment and medium
CN111091006B (en) * 2019-12-20 2023-08-29 北京百度网讯科技有限公司 Method, device, equipment and medium for establishing entity intention system
CN112015919A (en) * 2020-09-15 2020-12-01 重庆广播电视大学重庆工商职业学院 Dialogue management method based on learning auxiliary knowledge graph
CN112507123A (en) * 2020-12-04 2021-03-16 北京搜狗科技发展有限公司 Data processing method and device
WO2022116527A1 (en) * 2020-12-04 2022-06-09 北京搜狗科技发展有限公司 Data processing method and device

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20180316