CN107807957A - entity library generating method and device - Google Patents
entity library generating method and device Download PDFInfo
- Publication number
- CN107807957A CN107807957A CN201710916101.7A CN201710916101A CN107807957A CN 107807957 A CN107807957 A CN 107807957A CN 201710916101 A CN201710916101 A CN 201710916101A CN 107807957 A CN107807957 A CN 107807957A
- Authority
- CN
- China
- Prior art keywords
- entity
- demand
- user
- search
- click
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9535—Search customisation based on user profiles and personalisation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/28—Databases characterised by their database models, e.g. relational or object models
- G06F16/284—Relational databases
- G06F16/288—Entity relationship models
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The invention provides a kind of entity library generating method and device, the above method to include:Entity mobility models collection of illustrative plates is established based on vertical search class website data;The relevant information of demand entity according to involved by the search history of user record parses the historical search behavior of the user;Using demand entity as keyword, the relevant information generation entity storehouse of the demand entity with reference to involved by the historical search behavior of the entity mobility models collection of illustrative plates and the user.According to entity library generating method provided by the invention, various types of information contents can be obtained, and the relevant information of the demand entity of user can be accurately determined by the analysis to user's history search behavior, binding entity knowledge mapping structure meets the entity storehouse of user individual entity demand based on the demand entity actual by user, when user carries out entity search, the relevant information for meeting user's search need can quickly and be accurately provided the user.
Description
Technical field
The present invention relates to Internet technical field, more particularly to a kind of entity library generating method and device.
Background technology
With the continuous development of Internet technology, increasing people enters the transmission of row information with exchanging by internet,
Therefore, it is available for people to obtain various information based on the powerful information bank that internet is set up.At present, relative to biography
The keyword search of system, entity search are a kind of more novel search forms.
But same entity there may be ambiguity, and it is involved at present only the unitary demand of single entities is identified,
When user carries out information search, the demand of user can not be recognized accurately, and then accurately search knot can not be provided the user
Fruit.
The content of the invention
The invention provides a kind of entity library generating method and device to overcome above mentioned problem or solve at least in part
Above mentioned problem.
According to an aspect of the invention, there is provided a kind of entity library generating method, including:
Entity mobility models collection of illustrative plates is established based on vertical search class website data;
Demand entity involved by parsing the historical search behavior of the user is recorded according to the search history of user
Relevant information;
Using demand entity as keyword, with reference to involved by the historical search behavior of the entity mobility models collection of illustrative plates and the user
And demand entity relevant information generation entity storehouse.
Alternatively, it is described according to involved by the search history of user record parses the historical search behavior of the user
The relevant information of demand entity, including:
Search daily record and/or the click logs of the user are obtained, based on the search daily record and/or click logs to institute
The history click information for stating user does entity link and/or subject classification, parses the need involved by the history click information
The relevant information of realistic body.
Alternatively, it is described using demand entity as keyword, with reference to the entity mobility models collection of illustrative plates and the history of the user
The relevant information generation entity storehouse of demand entity involved by search behavior, including:
Using demand entity as keyword, with reference to the entity mobility models collection of illustrative plates and the historical search behavior institute of the user
The demand entity and/or demand type corresponding with the demand entity being related to establish user's request click model, and generation includes institute
State the entity storehouse of user's request click model.
Alternatively, it is described using demand entity as keyword, with reference to going through for the entity mobility models collection of illustrative plates and the user
Demand entity and/or demand type corresponding with the demand entity involved by history search behavior establish user's request and click on mould
Type, generation include the entity storehouse of the user's request click model, including:
Using demand entity as keyword, demand entity involved by historical search behavior to the user and/or with
Demand type corresponding to the demand entity is polymerize, and generates entity demand queue;
Demand intensity is calculated according to the history click information of the user, the demand intensity is needed added to the entity
Ask in queue, generation includes the entity storehouse of the user's request click model.
Alternatively, it is described using demand entity as keyword, with reference to going through for the entity mobility models collection of illustrative plates and the user
Demand entity and/or demand type corresponding with the demand entity involved by history search behavior establish user's request and click on mould
Type, after generation includes the entity storehouse of the user's request click model, in addition to:
The user's request click model is updated with predetermined period.
Alternatively, it is described that the user's request click model is updated with predetermined period, including:
Established by on-line study method and click on feedback model in real time, the entity demand of user is monitored with the predetermined period
Change, the sequence of the entity demand queue in the user's request click model is adjusted by online feedback mechanism.
According to another aspect of the present invention, a kind of generating means in entity storehouse are additionally provided, including:
Module is established, vertical search class website data is configured to and establishes entity mobility models collection of illustrative plates;
Parsing module, it is configured to according to involved by the search history of user record parses the historical search behavior of the user
And demand entity relevant information;
Entity storehouse generation module, be configured to using demand entity as keyword, with reference to the entity mobility models collection of illustrative plates with it is described
The relevant information generation entity storehouse of demand entity involved by the historical search behavior of user.
Alternatively, the parsing module is additionally configured to:
Search daily record and/or the click logs of the user are obtained, based on the search daily record and/or click logs to institute
The history click information for stating user does entity link and/or subject classification, parses the need involved by the history click information
The relevant information of realistic body.
Alternatively, entity storehouse generation module is additionally configured to:
Using demand entity as keyword, with reference to the entity mobility models collection of illustrative plates and the historical search behavior institute of the user
The demand entity and/or demand type corresponding with the demand entity being related to establish user's request click model, and generation includes institute
State the entity storehouse of user's request click model.
Alternatively, entity storehouse generation module is additionally configured to:
Using demand entity as keyword, demand entity involved by historical search behavior to the user and/or with
Demand type corresponding to the demand entity is polymerize, and generates entity demand queue;According to the history click information of the user
Demand intensity is calculated, the demand intensity is added in the entity demand queue, generation includes the user's request and clicked on
The entity storehouse of model.
Alternatively, said apparatus also includes:
Update module, it is configured to update the user's request click model with predetermined period.
Alternatively, the update module is additionally configured to:
Established by on-line study method and click on feedback model in real time, the entity demand of user is monitored with the predetermined period
Change, the sequence of the entity demand queue in the user's request click model is adjusted by online feedback mechanism.
According to a further aspect of the invention, a kind of computer program, including computer-readable code are additionally provided, works as institute
When stating computer-readable code and running on the computing device, cause the entity storehouse described in the computing device any of the above-described
Generation method.
According to a further aspect of the invention, a kind of computer-readable medium is additionally provided, wherein storing the calculating
Machine program.
The invention provides a kind of entity library generating method and device, based on entity library generating method provided by the invention,
Entity mobility models collection of illustrative plates first can be established based on vertical search class website data, and institute is gone out by the historical search behavioural analysis of user
The relevant information for the demand entity being related to, and then combine relevant information and the entity mobility models collection of illustrative plates generation entity storehouse of demand entity.
According to entity library generating method provided by the invention, entity mobility models figure is established by the information obtained to vertical search class website
Spectrum, can obtain various types of information contents, and can accurately determine user by the analysis to user's history search behavior
Demand entity relevant information, binding entity knowledge mapping structure meets user based on the demand entity actual by user
Property entity demand entity storehouse, when user carries out entity search, can quickly and accurately provide the user and meet user and search
The relevant information of rope demand.
Described above is only the general introduction of technical solution of the present invention, in order to better understand the technological means of the present invention,
And can be practiced according to the content of specification, and in order to allow above and other objects of the present invention, feature and advantage can
Become apparent, below especially exemplified by the embodiment of the present invention.
According to the accompanying drawings will be brighter to the detailed description of the specific embodiment of the invention, those skilled in the art
Above-mentioned and other purposes, the advantages and features of the present invention.
Brief description of the drawings
By reading the detailed description of hereafter preferred embodiment, it is various other the advantages of and benefit it is common for this area
Technical staff will be clear understanding.Accompanying drawing is only used for showing the purpose of preferred embodiment, and is not considered as to the present invention
Limitation.And in whole accompanying drawing, identical part is denoted by the same reference numerals.In the accompanying drawings:
Fig. 1 is entity library generating method schematic flow sheet according to embodiments of the present invention;
Fig. 2 is generation schematic diagram in entity storehouse according to embodiments of the present invention;
Fig. 3 is entity demand queue arrangement schematic diagram according to embodiments of the present invention;
Fig. 4 is the method flow schematic diagram according to embodiments of the present invention that entity search is carried out based on entity storehouse;
Fig. 5 is the structural representation of entity storehouse generating means according to embodiments of the present invention;
Fig. 6 is the structural representation of entity storehouse according to the preferred embodiment of the invention generating means;
Fig. 7 is the apparatus structure schematic diagram according to embodiments of the present invention that entity search is carried out based on entity storehouse;
Fig. 8 is the apparatus structure schematic diagram according to the preferred embodiment of the invention that entity search is carried out based on entity storehouse;
Fig. 9 it is according to embodiments of the present invention be used to perform according to the generation method in the entity storehouse of the present invention and/or based on reality
Body storehouse carries out the block diagram representation of the computing device of the method for entity search;
Figure 10 is to be used to keeping or carrying the generation side for realizing the entity storehouse according to the present invention according to embodiments of the present invention
Method and/or based on entity storehouse carry out entity search method program code memory cell schematic diagram.
Embodiment
The exemplary embodiment of the disclosure is more fully described below with reference to accompanying drawings.Although the disclosure is shown in accompanying drawing
Exemplary embodiment, it being understood, however, that may be realized in various forms the disclosure without should be by embodiments set forth here
Limited.On the contrary, these embodiments are provided to facilitate a more thoroughly understanding of the present invention, and can be by the scope of the present disclosure
Completely it is communicated to those skilled in the art.
Fig. 1 is entity library generating method schematic flow sheet according to embodiments of the present invention, as shown in figure 1, according to the present invention
The entity library generating method of embodiment includes:
Step S102, entity mobility models collection of illustrative plates is established based on vertical search class website data;
Step S104, the demand according to involved by the search history of user record parses the historical search behavior of user are real
The relevant information of body;
Step S106, using demand entity as keyword, the historical search behavior institute of binding entity knowledge mapping and user
The relevant information generation entity storehouse for the demand entity being related to.
Based on entity library generating method provided by the invention, entity mobility models first can be established based on vertical search class website data
Collection of illustrative plates, and the relevant information of involved demand entity is gone out by the historical search behavioural analysis of user, and then combine demand
Relevant information and entity mobility models collection of illustrative plates generation the entity storehouse of entity.According to entity library generating method provided by the invention, by right
The information that vertical search class website obtains establishes entity mobility models collection of illustrative plates, can obtain various types of information contents, and by
The analysis of family historical search behavior can accurately determine the relevant information of the demand entity of user, real with the actual demand of user
Binding entity knowledge mapping structure meets the entity storehouse of user individual entity demand based on body, and entity search is carried out in user
When, on the basis of user's query intention is understood, the entity that user wants to look for is analyzed, result entity is concluded and organized
And it is presented to user in a manner of special type shows.User no longer needs oneself to go that knowledge is found and concluded from search result, subtracts
Lack the cost that user obtains information, lift Consumer's Experience.
Entity mobility models collection of illustrative plates is to include the incidence relation between various entities and concept, and entity, concept.It is real establishing
During body knowledge mapping, it can monitor and excavate encyclopaedia data, core word bank, the vertical resource data such as class website or searching class website
Middle extraction concept, entity, attribute and relation, establish entity mobility models collection of illustrative plates based on above-mentioned resource data, realize the sequential of knowledge
Fusion and multi-data source fusion, and then establish the entity mobility models collection of illustrative plates with vast resources data.Entity mobility models collection of illustrative plates is can be with
Constantly update, according to the real-time change of above-mentioned all kinds of resource datas, entity mobility models collection of illustrative plates is also to implement renewal.
Entity mobility models collection of illustrative plates is the knowledge mapping for including mass data.And it is different for the required entity of each user
's.It is determined that user demand entity when, can according to the search history of user record be analyzed.Preferably, above-mentioned steps
S104 can further include:Obtain search daily record and/or the click logs of user, search daily record based on user and/or
Click logs do entity link and/or subject classification to the history click information of user, parse user's history click information institute
The relevant information for the demand entity being related to.
For example, when user inputs " Lee ", the entity to be looked for is probably " singer Lee ", it is also possible to " sportsman Lee
Certain ", at this moment search daily record and click logs of the can based on user judges that user often listens to the song of Lee, at this moment,
The demand entity for being assured that out user is " singer Lee ".
The search history record of user is can to reflect that user searches for the data message of custom, passes through the search day to user
The analysis of will and click logs, the click document to user do entity link and/or subject classification, accurately parse user's
The relevant information of demand entity.Wherein, the document clicked on to user does entity link, that is, the document for filtering out user's click is corresponding
Title in entity, by the entity link into entity storehouse corresponding entity and with the entity corresponding demand.User's
The relevant information of demand entity can include the demand entity and demand type of user.When user clicks on " Lee's (hip hop, rock, rap,pop,
Hand) _ encyclopaedia " when, then it can be linked in entity storehouse " Lee (pop singer) ", corresponding demand is encyclopaedia demand.Document " Lee
Certain _ song online test listening " " Lee (pop singer) " that will be linked in entity storehouse, corresponding demand is music demand.
Above-mentioned steps S106 is referred to, using demand entity as keyword, the history of binding entity knowledge mapping and user are searched
The relevant information generation entity storehouse of demand entity involved by Suo Hangwei.Preferably, can be real with demand when generating entity storehouse
Body as keyword, demand entity involved by the historical search behavior of binding entity knowledge mapping and user and/or with this
The information such as demand type, click location of user corresponding to demand entity establishes user's request click model, and generation includes user
The entity storehouse of demand click model.When receiving the searching request from user, it is possible to the use directly in entity storehouse
Family demand click model quickly judges and linked to the relevant information for meeting user's request.
Alternatively, when establishing user's request click model, the history of user can be searched using demand entity as keyword
Demand entities and/or with the demand entity corresponding demand type of the Suo Hangwei as involved by search and/or click logs are carried out
Polymerization, generate entity demand queue.
Fig. 2, which shows to click on user after document does entity link according to the search and click logs of user, generates entity storehouse
Schematic diagram.In Fig. 2, entity can be that the search term of user's input is " Lee ", be related to user's search and the point of " Lee "
It is 500 to hit daily record to include " Lee (pop singer _ encyclopaedia) " corresponding number of clicks respectively;" Lee _ song online test listening ", point
Number is hit as 400;" Lee _ (tennis player) _ encyclopaedia ", number of clicks 300;" Lee _ picture " number of clicks is 300;
" Lee _ Lee's song complete works _ special edition " number of clicks is 100;" Lee _ sports star _ race " number of clicks is 50;Get
After the search of user and click logs, entity link is done to the demand entity in user's search and click logs respectively, and it is right
The demand entity and demand type corresponding with demand entity are polymerize.Demand entity in Fig. 2 includes " " Lee's (stream
Row singer) " and " Lee (tennis player) ", can will be on " Lee when being polymerize with demand entity and demand type
(pop singer) " is aggregated to together with demand type " encyclopaedia ", " music " and " news " etc., on " Lee's (tennis
Member) " be aggregated to together with demand type " encyclopaedia ", " picture ", " news ", " video " and " microblogging " etc., it is correspondingly, every kind of
Demand entity and demand type can also be searched for according to user and number of clicks calculates corresponding demand intensity.Can from Fig. 2
To find out, the demand intensity of " Lee (pop singer _ encyclopaedia) " is 500;The demand intensity of " Lee _ song online test listening " is
500;The demand intensity of " Lee _ (tennis player) _ encyclopaedia " is 300;The demand of " Lee _ (tennis player) _ picture " is strong
Spend for 200;The demand intensity of " Lee _ (tennis player) _ news " is 50.In Fig. 2 simply schematically illustrate according to
Family is searched for and the mode that is polymerize to user's request entity and demand type of click logs, in actual applications, can be with
Using other modes to generation entity demand queue and user's request click model, here is omitted.
Fig. 3 shows the arrangement mode of entity demand queue, and the queue of entity demand can include demand entity and demand
Type.In figure 3, demand entity can include " Lee (pop singer) ", " Lee (tennis player) ", its corresponding demand
Type can be " encyclopaedia ", " music ", " picture " and " news " or other.Further, can also going through according to user
History click information calculates each demand entity and the demand intensity of demand type, and demand intensity is added into entity demand queue
In.As shown in figure 3, the history click information based on user calculates, the demand intensity of Lee (pop singer) encyclopaedia is 500, Lee
The demand intensity of certain (pop singer) music is 500, and the demand intensity of Lee (tennis player) encyclopaedia is 300, Lee's (tennis
Sportsman) demand intensity of picture is 200, the demand intensity of Lee (tennis player) news is 50.In actual applications, need
Ask intensity calculating can be according to a certain user search and click logs, can also integrate most users search and click on day
Will, it can be adjusted according to different situations.The queue of entity demand can the height of intensity according to demand be ranked up, to meet user
Demand when carrying out entity search.Certain demand entity and demand type are not limited to this, can also include other entities
And correlation type, the present invention do not limit.
Further, after the generation of user's request click model, user's request click can also be updated with predetermined period
Model.User's request click model can be timing renewal or real-time update.Because the search behavior of user is at any time
It may occur, therefore, search behavior timing or real-time update user's request click model based on user can meet user
The change of search need, and then can more efficiently provide the user search result.
Preferably, when updating user's request click model, it can be established by on-line study method and be click on feeding back
Model detects the entity changes in demand of user with predetermined period, is adjusted by online feedback mechanism in user's request click model
The sequence of entity demand queue.Assuming that the news on Lee tennis player has been broken out suddenly, " Lee (tennis player),
The user of this demand of news " clicks on to increase suddenly, then can be incited somebody to action by online feedback mechanism " Lee (tennis player), newly
The sequence up-regulation of this demand of news ".Assuming that user within some period to " Lee (pop singer), music " this demand
Number of clicks increase, then the sequence of this demand can be raised.
In entity library generating method provided in an embodiment of the present invention, pass through the historical search of entity mobility models collection of illustrative plates and user
The relevant information generation entity storehouse of demand entity involved by behavior, can be when user carries out entity search quickly and efficiently
Identify the search intention of user.And entity of embodiment of the present invention storehouse can also be upgraded in time according to the demand of user, with full
The search need of sufficient user's different time sections.
Fig. 4 is the method according to embodiments of the present invention that entity search is carried out based on entity storehouse, as shown in figure 4, according to this
The entity search method based on entity storehouse of inventive embodiments, including:
Step S402, the query from user is received, and determine the entity word frequency of the query;
Step S404, the entity word frequency based on the query are determined and query similarity highest high frequencies query;
Step S406, using above-mentioned high frequency query as entity word, relevant information search is carried out into entity storehouse.
In entity search method provided in an embodiment of the present invention, a variety of realities in the query from user are may recognize that
Body search intention, determine with the entity Word similarity highest high frequency query in query, be entity based on high frequency query
Word carries out the search of relevant information into magnanimity information and with user's request click model entity storehouse, to provide use
Family meets the search result of the demand intensity of its search intention.
In the present embodiment, the statistics for the query that can be initiated in advance user, judge that user sends out for same query
The height of the frequency risen, and a predetermined threshold value is set, and then subsequently received query is judged.If sentence
Disconnected query entity word frequency is greater than or equal to predetermined threshold value, it is determined that the query is high frequency query, now, can be straight
Connect with the query sheets as entity word, and with the entity word, relevant information search is carried out into entity storehouse.If the query's
Entity word frequency is less than predetermined threshold value, it is determined that the query is low frequency query, now, it is possible to in low frequency query
Entity is index search and low frequency query similarity highest high frequency query, and using high frequency query as entity word, to reality
Relevant information search is carried out in body storehouse.
The scheme provided based on the present embodiment, high frequency query can serve as entity word in itself, therefore received come
The search that carries out relevant information is can be directly in entity storehouse from the query of user.For low frequency query, do not have in possible entity storehouse
Having directly includes low frequency query corresponding entities in itself.At this moment, it is possible to the entity in low frequency query is first analyzed, with
Entity in low frequency query is searched for index by related algorithm and low frequency query similarity highest query, is based on
The query is the search that entity word carries out relevant information into entity storehouse.The scheme provided based on the present embodiment, no matter use by oneself
The query at family belongs to high frequency query or low frequency query, can the quickly query-related information into entity storehouse, and then accurately
Provide the user Query Result corresponding with the query of user.
Preferably, include entity demand queue in the user's request click model in entity storehouse, therefore, it is determined that with it is low
, can be real to the high frequency query for having calculated that entity demand queue is established during frequency query similarity highest high frequency query
Body query inverted indexs, after identifying the entity in low frequency query, correlation is found by entity query inverted indexs
Query lists, each query in low frequency query and query row is calculated by simarank, deep learning correlation technique
Similarity, then it is that entity word carries out related letter into entity storehouse to find out to low frequency query similarity highest high frequencies query
The search of breath.
For example, if it is " Lee " to receive the query from user, by real in " Lee " this query
The frequency of " Lee " of pronouns, general term for nouns, numerals and measure words is analyzed, and determines that the query belongs to high frequency query, now can is directly arrived with " Lee "
The search of relevant information in entity storehouse.Related entities demand team has been had calculated that in user's request click model in entity storehouse
Row, at this moment can is directly presented to user as entity search result.
If it is " relevant information of Lee " to receive the query from user, by the entity word in this query
The frequency of " relevant information of Lee " is analyzed, and determines that the query belongs to low frequency query.If directly with the " correlation of Lee
Information " is entity word, possibly accurately can not obtain relevant information directly from entity storehouse.Now, can be will calculate
The high frequency query of entity demand queue establishes entity query inverted indexs.
Inverted index comes to be needed to be recorded to search according to the value of attribute in practical application, each single item in this concordance list
All include the address of a property value and each record with the property value.Due to not being to determine property value by recording, but
The position of record, thus referred to as inverted index (invertedindex) are determined by property value.Entity query inverted indexs are
Entity corresponding with high frequency query will be obtained based on the high frequency query having calculated that, by real corresponding to high frequency query
Body can link to high frequency query.When the query received is the relevant information of Lee " when ", may recognize that in the query
Entity be " Lee ", by query inverted indexs find correlation query lists, pass through simrank or depth study
Correlation technique can is by the similarity of each query in " relevant information of Lee " and query lists, if calculating " Lee
Similarity highest query in the relevant information of certain " and query lists, or similarity are more than the query of predetermined threshold value and are
" Lee ", then it is the search that entity word carries out relevant information into entity storehouse with " Lee ", that is to say, that the query " phases of Lee
Pass information " inherits " Lee " this high frequency query entity demand queue.
It should be noted that in practical application, above-mentioned all optional embodiments can be any group by the way of combining
Close, form the alternative embodiment of the present invention, this is no longer going to repeat them.
Based on the entity library generating method that each embodiment provides above, based on same inventive concept, the embodiment of the present invention is also
Provide a kind of generating means in entity storehouse, Fig. 5 be according to the structural representation of the entity storehouse generating means of the embodiment of the present invention,
As shown in figure 5, the generating means in the entity storehouse of the embodiment of the present invention can include:
Module 510 is established, vertical search class website data is configured to and establishes entity mobility models collection of illustrative plates;
Parsing module 520, it is configured to parse the historical search behavior of the user according to the search history of user record
The relevant information of involved demand entity;
Entity storehouse generation module 530, it is configured to using demand entity as keyword, with reference to the entity mobility models collection of illustrative plates and institute
State the relevant information generation entity storehouse of the demand entity involved by the historical search behavior of user.
In a preferred embodiment of the invention, parsing module 520 is additionally configured to:
Search daily record and/or the click logs of the user are obtained, based on the search daily record and/or click logs to institute
The history click information for stating user does entity link and/or subject classification, parses the need involved by the history click information
The relevant information of realistic body.
In a preferred embodiment of the invention, entity storehouse generation module 530 is also configured as:
Using demand entity as keyword, with reference to the entity mobility models collection of illustrative plates and the historical search behavior institute of the user
The demand entity and/or demand type corresponding with the demand entity being related to establish user's request click model, and generation includes institute
State the entity storehouse of user's request click model.
In a preferred embodiment of the invention, entity storehouse generation module 530 is also configured as:
Using demand entity as keyword, demand entity involved by historical search behavior to the user and/or with
Demand type corresponding to the demand entity is polymerize, and generates entity demand queue;According to the history click information of the user
Demand intensity is calculated, the demand intensity is added in the entity demand queue, generation includes the user's request and clicked on
The entity storehouse of model.
In a preferred embodiment of the invention, as shown in fig. 6, said apparatus can also include:
Update module 540, it is configured to update the user's request click model with predetermined period.
In a preferred embodiment of the invention, update module 540 is also configured as:
Established by on-line study method and click on feedback model in real time, the entity demand of user is monitored with the predetermined period
Change, the sequence of the entity demand queue in the user's request click model is adjusted by online feedback mechanism.
The embodiment of the present invention additionally provides a kind of computer program, including computer-readable code, when the computer can
When reading code is run on the computing device, cause the generation side in the entity storehouse described in the computing device any of the above-described
Method.
The embodiment of the present invention additionally provides a kind of computer-readable medium, wherein storing above-mentioned computer program.
Fig. 7 shows that what is provided according to embodiments of the present invention carries out the apparatus structure signal of entity search based on entity storehouse
Figure, as shown in fig. 7, the device provided in an embodiment of the present invention that entity search is carried out based on entity storehouse can include:
Receiving module 710, it is configured to receive the query from user, and determines the entity word frequency of the query;
Determining module 720, the entity word frequency for being configured to the query are determined and the query similarity highests
High frequency query;
Search module 730, it is configured to using high frequency query as entity word, relevant information search is carried out into entity storehouse.
In a preferred embodiment of the invention, as shown in figure 8, determining module 720 can also include:
First determining unit 721, the entity word frequency for being configured to state query are higher than predetermined threshold value, it is determined that the query is
For high frequency query;
Second determining unit 722, if the entity word frequency for being configured to query is less than the predetermined threshold value, it is determined that should
Query low frequency query, using the entity in low frequency query as index search and the low frequency query similarities highest high frequency
query。
In a preferred embodiment of the invention, the second determining unit 722 is also configured as:
High frequency query for having calculated that entity demand queue establishes entity query inverted indexs;Identify described low
Entity in frequency query, related query lists are found by the query inverted indexs;Calculate the low frequency query with
The similarity of each query in the query lists, find out and the low frequency query similarities highest high frequency
query。
In a preferred embodiment of the invention, as shown in figure 8, search module 730 can also include:
Link unit 731, it is configured to do entity link to the entity word, the entity word is linked into the entity storehouse
In corresponding demand entity and/or demand type corresponding with the demand entity.
In a preferred embodiment of the invention, search module 730 is also configured as:
Entity storehouse is generated in the following manner:Entity mobility models collection of illustrative plates is established based on vertical search class website data;According to user
Search history record parse the relevant information of demand entity involved by the historical search behavior of the user;It is real with demand
Body is as keyword, the phase of the demand entity with reference to involved by the entity mobility models collection of illustrative plates with the historical search behavior of the user
Close information generation entity storehouse.
The embodiment of the present invention additionally provides a kind of computer program, including computer-readable code, when the computer can
When reading code is run on the computing device, cause to carry out in fact based on entity storehouse described in the computing device any of the above-described
The method of body search.
The embodiment of the present invention additionally provides a kind of computer-readable medium, wherein storing above-mentioned computer program.
The embodiments of the invention provide a kind of generation method and device in entity storehouse, the reality provided according to embodiments of the present invention
Body library generating method, entity mobility models collection of illustrative plates is established by the information obtained to vertical search class website, can be obtained various types of
The information content, and the related letter of the demand entity of user can be accurately determined by the analysis to user's history search behavior
Breath, binding entity knowledge mapping structure meets the entity of user individual entity demand based on the demand entity of user's reality
Storehouse, when user carries out entity search, it can quickly and accurately provide the user the relevant information for meeting user's search need.Enter
One step, the embodiment of the present invention additionally provides a kind of method and device that entity search is carried out based on entity storehouse, by being connect
The query from user received, after the entity word frequency for determining query, it is possible to pass through the frequency to entity word in query
The analysis of degree determine with the query similarity highest query, and then carried out using the query as entity word into entity storehouse
The search of relevant information, quickly to provide the user the search result of entity search.
In the specification that this place provides, numerous specific details are set forth.It is to be appreciated, however, that the implementation of the present invention
Example can be put into practice in the case of these no details.In some instances, known method, structure is not been shown in detail
And technology, so as not to obscure the understanding of this description.
Similarly, it will be appreciated that in order to simplify the disclosure and help to understand one or more of each inventive aspect,
Above in the description to the exemplary embodiment of the present invention, each feature of the invention is grouped together into single implementation sometimes
In example, figure or descriptions thereof.However, the method for the disclosure should be construed to reflect following intention:I.e. required guarantor
The application claims of shield features more more than the feature being expressly recited in each claim.It is more precisely, such as following
Claims reflect as, inventive aspect is all features less than single embodiment disclosed above.Therefore,
Thus the claims for following embodiment are expressly incorporated in the embodiment, wherein each claim is in itself
Separate embodiments all as the present invention.
Those skilled in the art, which are appreciated that, to be carried out adaptively to the module in the equipment in embodiment
Change and they are arranged in one or more equipment different from the embodiment.Can be the module or list in embodiment
Member or component be combined into a module or unit or component, and can be divided into addition multiple submodule or subelement or
Sub-component.In addition at least some in such feature and/or process or unit exclude each other, it can use any
Combination is disclosed to all features disclosed in this specification (including adjoint claim, summary and accompanying drawing) and so to appoint
Where all processes or unit of method or equipment are combined.Unless expressly stated otherwise, this specification (including adjoint power
Profit requires, summary and accompanying drawing) disclosed in each feature can be by providing the alternative features of identical, equivalent or similar purpose come generation
Replace.
In addition, it will be appreciated by those of skill in the art that although some embodiments described herein include other embodiments
In included some features rather than further feature, but the combination of the feature of different embodiments means in of the invention
Within the scope of and form different embodiments.For example, in detail in the claims, embodiment claimed it is one of any
Mode it can use in any combination.
The all parts embodiment of the present invention can be realized with hardware, or to be run on one or more processor
Software module realize, or realized with combinations thereof.It will be understood by those of skill in the art that it can use in practice
Microprocessor or digital signal processor (DSP) realize entity storehouse generating means according to embodiments of the present invention and/or base
The some or all functions of some or all parts of the device of entity search are carried out in entity storehouse.It is of the invention acceptable real
Now be for perform method as described herein some or all equipment or program of device (for example, computer journey
Sequence and computer program product).Such program for realizing the present invention can store on a computer-readable medium, or can be with
Form with one or more signal.Such signal can be downloaded from internet website and obtained, or be believed in carrier
There is provided on number, or provided in the form of any other.
It can realize according to the generation method in the entity storehouse of the present invention and/or be carried out based on entity storehouse for example, Fig. 9 is shown
The block diagram of the computing device of the method for entity search.The computing device conventionally comprises processor 910 and in the form of memory 920
Computer program product or computer-readable medium.Memory 920 can be that such as (electric erasable can for flash memory, EEPROM
Program read-only memory), EPROM, hard disk or ROM etc electronic memory.There is memory 920 storage to be used to perform
State the memory space 930 of the program code 931 of any method and step in method.For example, the memory space of store program codes
830 can store each program code 931 for being respectively used to realize the various steps in above method.These program codes can
To read or be written to from one or more computer program product in this one or more computer program product.
These computer program products include the program code carrier of such as hard disk, compact-disc (CD), storage card or floppy disk etc.This
The computer program product of sample is usually portable or static memory cell as shown in Figure 10.The memory cell can have
Memory paragraph, memory space with the similar arrangement of memory 920 in Fig. 9 computing device etc..Program code can be for example with suitable
When form is compressed.Generally, memory cell can for performing the computer of steps of a method in accordance with the invention including being stored with
Reader code 931 ', you can with the program code read by such as 910 etc processor, when these program codes are by calculating
When equipment is run, cause each step in the computing device method described above.
It should be noted that the present invention will be described rather than limits the invention for above-described embodiment, and ability
Field technique personnel can design alternative embodiment without departing from the scope of the appended claims.In the claims,
Any reference symbol between bracket should not be configured to limitations on claims.Word "comprising" does not exclude the presence of not
Element or step listed in the claims.Word "a" or "an" before element does not exclude the presence of multiple such
Element.The present invention can be by means of including the hardware of some different elements and being come by means of properly programmed computer real
It is existing.In if the unit claim of equipment for drying is listed, several in these devices can be by same hardware branch
To embody.The use of word first, second, and third does not indicate that any order.These words can be explained and run after fame
Claim.
So far, although those skilled in the art will appreciate that detailed herein have shown and described multiple showing for the present invention
Example property embodiment, still, still can be direct according to present disclosure without departing from the spirit and scope of the present invention
It is determined that or derive many other variations or modifications for meeting the principle of the invention.Therefore, the scope of the present invention is understood that and recognized
It is set to and covers other all these variations or modifications.
Claims (10)
1. a kind of entity library generating method, including:
Entity mobility models collection of illustrative plates is established based on vertical search class website data;
The correlation of demand entity according to involved by the search history of user record parses the historical search behavior of the user
Information;
Using demand entity as keyword, with reference to involved by the historical search behavior of the entity mobility models collection of illustrative plates and the user
The relevant information generation entity storehouse of demand entity.
2. according to the method for claim 1, wherein, described recorded according to the search history of user parses the user's
The relevant information of demand entity involved by historical search behavior, including:
Search daily record and/or the click logs of the user are obtained, based on the search daily record and/or click logs to the use
The history click information at family does entity link and/or subject classification, and the demand parsed involved by the history click information is real
The relevant information of body.
3. method according to claim 1 or 2, wherein, it is described using demand entity as keyword, know with reference to the entity
Know the relevant information generation entity storehouse of collection of illustrative plates and the demand entity involved by the historical search behavior of the user, including:
Using demand entity as keyword, with reference to involved by the historical search behavior of the entity mobility models collection of illustrative plates and the user
Demand entity and/or demand type corresponding with the demand entity establish user's request click model, generation includes the use
The entity storehouse of family demand click model.
4. according to the method described in claim any one of 1-3, wherein, it is described using demand entity as keyword, with reference to described
Demand entity involved by the historical search behavior of entity mobility models collection of illustrative plates and the user and/or corresponding with the demand entity
Demand type establishes user's request click model, and generation includes the entity storehouse of the user's request click model, including:
Using demand entity as keyword, demand entity involved by historical search behavior to the user and/or needed with this
Demand type is polymerize corresponding to realistic body, generates entity demand queue;
Demand intensity is calculated according to the history click information of the user, the demand intensity is added to the entity demand team
In row, generation includes the entity storehouse of the user's request click model.
5. according to the method described in claim any one of 1-4, wherein, it is described using demand entity as keyword, with reference to described
Demand entity involved by the historical search behavior of entity mobility models collection of illustrative plates and the user and/or corresponding with the demand entity
Demand type establishes user's request click model, after generation includes the entity storehouse of the user's request click model, in addition to:
The user's request click model is updated with predetermined period.
6. according to the method described in claim any one of 1-5, wherein, it is described that the user's request click is updated with predetermined period
Model, including:
Established by on-line study method and click on feedback model in real time, the entity demand that user is monitored with the predetermined period becomes
Change, the sequence of the entity demand queue in the user's request click model is adjusted by online feedback mechanism.
7. a kind of entity storehouse generating means, including:
Module is established, vertical search class website data is configured to and establishes entity mobility models collection of illustrative plates;
Parsing module, it is configured to according to involved by the search history of user record parses the historical search behavior of the user
The relevant information of demand entity;
Entity storehouse generation module, it is configured to using demand entity as keyword, with reference to the entity mobility models collection of illustrative plates and the user
Historical search behavior involved by demand entity relevant information generation entity storehouse.
8. device according to claim 7, wherein, the parsing module is additionally configured to:
Search daily record and/or the click logs of the user are obtained, based on the search daily record and/or click logs to the use
The history click information at family does entity link and/or subject classification, and the demand parsed involved by the history click information is real
The relevant information of body.
9. a kind of computer program, including computer-readable code, when the computer-readable code is run on the computing device
When, cause entity library generating method of the computing device as described in any one of claim 1 to 6.
A kind of 10. computer-readable medium, wherein storing computer program as claimed in claim 9.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710916101.7A CN107807957A (en) | 2017-09-30 | 2017-09-30 | entity library generating method and device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710916101.7A CN107807957A (en) | 2017-09-30 | 2017-09-30 | entity library generating method and device |
Publications (1)
Publication Number | Publication Date |
---|---|
CN107807957A true CN107807957A (en) | 2018-03-16 |
Family
ID=61592704
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710916101.7A Pending CN107807957A (en) | 2017-09-30 | 2017-09-30 | entity library generating method and device |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN107807957A (en) |
Cited By (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109145200A (en) * | 2018-07-13 | 2019-01-04 | 百度在线网络技术(北京)有限公司 | Promote method, apparatus, equipment and the computer storage medium showed |
CN109635125A (en) * | 2018-12-20 | 2019-04-16 | 广东小天才科技有限公司 | Vocabulary atlas building method and electronic equipment |
CN109766444A (en) * | 2018-12-10 | 2019-05-17 | 北京百度网讯科技有限公司 | The application database generation method and its device of knowledge mapping |
CN110263180A (en) * | 2019-06-13 | 2019-09-20 | 北京百度网讯科技有限公司 | It is intended to knowledge mapping generation method, intension recognizing method and device |
CN110674313A (en) * | 2019-09-20 | 2020-01-10 | 四川长虹电器股份有限公司 | Method for dynamically updating knowledge graph based on user log |
CN110674307A (en) * | 2019-08-21 | 2020-01-10 | 北京邮电大学 | Knowledge deduction method and system for knowledge center network |
CN111091006A (en) * | 2019-12-20 | 2020-05-01 | 北京百度网讯科技有限公司 | Entity intention system establishing method, device, equipment and medium |
CN112015919A (en) * | 2020-09-15 | 2020-12-01 | 重庆广播电视大学重庆工商职业学院 | Dialogue management method based on learning auxiliary knowledge graph |
CN112507123A (en) * | 2020-12-04 | 2021-03-16 | 北京搜狗科技发展有限公司 | Data processing method and device |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103995870A (en) * | 2014-05-21 | 2014-08-20 | 百度在线网络技术(北京)有限公司 | Interactive searching method and device |
US20140297644A1 (en) * | 2013-04-01 | 2014-10-02 | Tencent Technology (Shenzhen) Company Limited | Knowledge graph mining method and system |
CN104598556A (en) * | 2015-01-04 | 2015-05-06 | 百度在线网络技术(北京)有限公司 | Search method and search device |
CN105404680A (en) * | 2015-11-25 | 2016-03-16 | 百度在线网络技术(北京)有限公司 | Searching recommendation method and apparatus |
CN105447005A (en) * | 2014-08-08 | 2016-03-30 | 百度在线网络技术(北京)有限公司 | Object push method and device |
-
2017
- 2017-09-30 CN CN201710916101.7A patent/CN107807957A/en active Pending
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20140297644A1 (en) * | 2013-04-01 | 2014-10-02 | Tencent Technology (Shenzhen) Company Limited | Knowledge graph mining method and system |
CN103995870A (en) * | 2014-05-21 | 2014-08-20 | 百度在线网络技术(北京)有限公司 | Interactive searching method and device |
CN105447005A (en) * | 2014-08-08 | 2016-03-30 | 百度在线网络技术(北京)有限公司 | Object push method and device |
CN104598556A (en) * | 2015-01-04 | 2015-05-06 | 百度在线网络技术(北京)有限公司 | Search method and search device |
CN105404680A (en) * | 2015-11-25 | 2016-03-16 | 百度在线网络技术(北京)有限公司 | Searching recommendation method and apparatus |
Cited By (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109145200A (en) * | 2018-07-13 | 2019-01-04 | 百度在线网络技术(北京)有限公司 | Promote method, apparatus, equipment and the computer storage medium showed |
US11164210B2 (en) | 2018-07-13 | 2021-11-02 | Baidu Online Network Technology (Beijing) Co., Ltd. | Method, device and computer storage medium for promotion displaying |
CN109766444A (en) * | 2018-12-10 | 2019-05-17 | 北京百度网讯科技有限公司 | The application database generation method and its device of knowledge mapping |
CN109635125A (en) * | 2018-12-20 | 2019-04-16 | 广东小天才科技有限公司 | Vocabulary atlas building method and electronic equipment |
CN109635125B (en) * | 2018-12-20 | 2021-01-26 | 广东小天才科技有限公司 | Vocabulary atlas building method and electronic equipment |
CN110263180B (en) * | 2019-06-13 | 2021-06-04 | 北京百度网讯科技有限公司 | Intention knowledge graph generation method, intention identification method and device |
CN110263180A (en) * | 2019-06-13 | 2019-09-20 | 北京百度网讯科技有限公司 | It is intended to knowledge mapping generation method, intension recognizing method and device |
CN110674307A (en) * | 2019-08-21 | 2020-01-10 | 北京邮电大学 | Knowledge deduction method and system for knowledge center network |
CN110674313A (en) * | 2019-09-20 | 2020-01-10 | 四川长虹电器股份有限公司 | Method for dynamically updating knowledge graph based on user log |
CN110674313B (en) * | 2019-09-20 | 2022-12-13 | 四川长虹电器股份有限公司 | Method for dynamically updating knowledge graph based on user log |
CN111091006A (en) * | 2019-12-20 | 2020-05-01 | 北京百度网讯科技有限公司 | Entity intention system establishing method, device, equipment and medium |
CN111091006B (en) * | 2019-12-20 | 2023-08-29 | 北京百度网讯科技有限公司 | Method, device, equipment and medium for establishing entity intention system |
CN112015919A (en) * | 2020-09-15 | 2020-12-01 | 重庆广播电视大学重庆工商职业学院 | Dialogue management method based on learning auxiliary knowledge graph |
CN112507123A (en) * | 2020-12-04 | 2021-03-16 | 北京搜狗科技发展有限公司 | Data processing method and device |
WO2022116527A1 (en) * | 2020-12-04 | 2022-06-09 | 北京搜狗科技发展有限公司 | Data processing method and device |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN107807957A (en) | entity library generating method and device | |
JP6515624B2 (en) | Method of identifying lecture video topics and non-transitory computer readable medium | |
CN107679186A (en) | The method and device of entity search is carried out based on entity storehouse | |
AU2014201827B2 (en) | Scoring concept terms using a deep network | |
JP5575902B2 (en) | Information retrieval based on query semantic patterns | |
US9483462B2 (en) | Generating training data for disambiguation | |
CN111797214A (en) | FAQ database-based problem screening method and device, computer equipment and medium | |
CN106557480B (en) | Method and device for realizing query rewriting | |
CN103339623A (en) | Internet search related methods and apparatus | |
US10152478B2 (en) | Apparatus, system and method for string disambiguation and entity ranking | |
EP1587009A2 (en) | Content propagation for enhanced document retrieval | |
US20170154116A1 (en) | Method and system for recommending contents based on social network | |
JP5543020B2 (en) | Research mission identification | |
US11200244B2 (en) | Keyword reporting for mobile applications | |
JP2015191655A (en) | Method and apparatus for generating recommendation page | |
JP2003330948A (en) | Device and method for evaluating web page | |
CN102693271A (en) | Network information recommending method and system | |
US7949646B1 (en) | Method and apparatus for building sales tools by mining data from websites | |
CN105069077A (en) | Search method and device | |
CN108170293A (en) | Input the personalized recommendation method and device of association | |
CN104951484A (en) | Search result processing method and search result processing device | |
Yerva et al. | Entity-based classification of twitter messages | |
CN107908616A (en) | The method and apparatus of anticipation trend word | |
CN110008396B (en) | Object information pushing method, device, equipment and computer readable storage medium | |
Yerva et al. | What have fruits to do with technology? The case of Orange, Blackberry and Apple |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20180316 |