CN105493082A - Person search utilizing entity expansion - Google Patents

Person search utilizing entity expansion

Publication number
CN105493082A
Authority
CN
China
Prior art keywords
search, query, related entities, people, expansion
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201480037264.2A
Other languages
Chinese (zh)
Inventor
J.奥尔蒙特
M.E.戴维斯
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Microsoft Technology Licensing LLC
Original Assignee
Microsoft Technology Licensing LLC
Priority claimed from US13/931,922 (US20150006520A1)
Application filed by Microsoft Technology Licensing LLC
Publication of CN105493082A

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/953 Querying, e.g. by the use of web search engines
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/951 Indexing; Web crawling techniques
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33 Querying
    • G06F16/3331 Query processing
    • G06F16/3332 Query translation
    • G06F16/3338 Query expansion

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

Presented are systems and methods, as well as computer readable media, for responding to a search query for content (or references to content) relating to a person identified in the search query. According to various embodiments, upon receiving a search query from a computer user, related entity data is obtained from at least one related entity source for the identified person. Related entity data comprises at least one of a related entity (or entities) or a category associated with the identified person. An expanded search query is generated according to the search query from the computer user and the related entity data. Search results are obtained according to the expanded search query and a search results presentation is generated and returned to the computer user in response to the search query.

Description

Person search utilizing entity expansion
Background
Locating content on the Internet about a particular person can be challenging. Several factors make "people search" difficult. Most names are not unique: in any given area, there may be several individuals with the same name. In addition, the web presence of any given person may be low, so that search results for that person are dominated by results about more well-known individuals with the same name.
Summary
The following Summary is provided to introduce, in a simplified form, a selection of concepts that are further described below in the Detailed Description. This Summary is not intended to identify key features or essential features of the claimed subject matter, nor is it intended to be used to limit the scope of the claimed subject matter.
According to aspects of the disclosed subject matter, a search query is received from a computer user, the search query identifying a person for whom content (or references to content) is sought. Upon receiving the search query from the computer user, related entity data is obtained from at least one related entity source for the identified person. Related entity data comprises at least one of a related entity (or entities) or a category associated with the identified person. An expanded search query is generated according to the search query from the computer user and the related entity data. Search results are obtained according to the expanded search query, and a search results presentation is generated and returned to the computer user in response to the search query.
According to additional aspects of the disclosed subject matter, a computer-readable medium bearing computer-executable instructions is presented. When the instructions are executed on a computing system comprising at least one processor that executes instructions retrieved from the medium, the computing system is configured to carry out a method for responding to a search query from a user. More specifically, in response to receiving a search query from a computer user, where the search query identifies a person for whom content (or references to content) is sought, related entity data is obtained from at least one related entity source for the identified person. An expanded search query is generated according to the search query from the computer user and the related entity data. Search results are obtained according to the expanded search query, and a search results presentation is generated and returned to the computer user in response to the search query.
According to still further aspects of the disclosed subject matter, a computer system for responding to a search query for content relating to a person is presented. The computer system comprises a processor and a memory, where the processor executes instructions stored in the memory as part of, or in conjunction with, additional components in order to respond to a search query for content relating to a person. These additional components include (by way of illustration and not limitation) a query topic identification component, a related entity retrieval component, an expanded query generator, a search results retrieval component, and a search results presentation generator. In operation, the query topic identification component is configured to determine, from the search query, the identity of the person for whom related content is sought. The related entity retrieval component obtains related entity data corresponding to the identified person from a related entity source. After the related entity data is obtained, the expanded query generator generates an expanded query from the search query for content relating to the identified person and from the related entity data. According to various embodiments, the related entity data comprises at least one of a related entity or a category associated with the person identified by the search query. The search results retrieval component obtains search results from a content store according to the expanded search query. Thereafter, the search results presentation generator generates a search results presentation from the search results, which reference content corresponding to the identified person, and returns the search results presentation to the computer user.
Brief description of the drawings
The foregoing aspects and many of the attendant advantages of the disclosed subject matter will become more readily appreciated as they are better understood by reference to the following description when taken in conjunction with the accompanying drawings, wherein:
Fig. 1 is a block diagram of a networked environment suitable for implementing aspects of the disclosed subject matter;
Fig. 2 is a flow diagram illustrating an exemplary routine for providing improved results, through query expansion, in response to a search query for content about a particular person;
Fig. 3 is a flow diagram illustrating an exemplary routine for generating an expanded search query according to aspects of the disclosed subject matter;
Figs. 4 and 5 illustrate elements of expanded search queries; and
Fig. 6 is a block diagram illustrating exemplary components of a search engine configured to provide improved results in response to a search query from a computer user.
Detailed description
For purposes of clarity, the term "exemplary" as used in this document should be interpreted as serving as an illustration or example of something, and it should not be interpreted as an ideal and/or leading illustration of that thing. An entity corresponds to an abstract or tangible thing, including (by way of illustration and not limitation): a person, a place, a group, a concept, an activity, and the like.
Turning to Fig. 1, Fig. 1 is a block diagram illustrating an exemplary networked environment 100 suitable for implementing aspects of the disclosed subject matter, particularly with regard to providing improved search results to a computer user in response to a search query about a person. The exemplary networked environment 100 includes one or more user computers, such as user computers 102-106, connected to a network 108, such as the Internet, a wide area network (WAN), and the like. User computers include, by way of illustration and not limitation: desktop computers (such as desktop computer 104); laptop computers (such as laptop computer 102); tablet computers (such as tablet computer 106); mobile devices (not shown); game consoles (not shown); personal digital assistants (not shown); and the like. User computers may be configured to connect to the network 108 by way of wired and/or wireless connections. For purposes of illustration only, the exemplary networked environment 100 shows the network 108 as lying between the user computers 102-106 and the search engine 110, and again between the search engine 110 and the network sites 112-116. This depiction should not be interpreted as implying that these are separate networks.
Also connected to the network 108 are various network sites, including network sites 110-116. By way of example and not limitation, the network sites connected to the network 108 include a search engine 110 configured to respond to search queries from computer users, news sources 112 and 114 that host various news articles and content, a social networking site 116, and the like. A computer user, such as computer user 101, may navigate to these and other network sites via a user computer, such as user computer 102, to access content, including news content.
According to aspects of the disclosed subject matter, the search engine 110 is configured to provide search results (typically in the form of references to content available on the network 108) in response to a search query from a computer user. In particular, in response to a search query for information about a particular person received from a computer user, the search engine 110 identifies content relating to the identified person according to information in its content store, generates a search results presentation based on at least some of the identified content, and provides the search results presentation to the computer user.
Fig. 1 also illustratively includes a social networking site 116 and various news sources, including news sites 112-114. As will be readily appreciated, the social networking site 116 is an online site/service that provides a platform on which computer users can establish profiles describing various aspects of themselves and build relationships and social networks with other computer users, groups, and the like. On the social networking site 116, a computer user can establish or indicate various interests, activities, and background information, including with regard to those in his or her social network. Indeed, those skilled in the art will appreciate that a computer user can often indicate a preference for, or interest in, a particular entity on a social networking service hosted by the social networking site 116, whether that entity is a person, place, group, concept, activity, or the like. Although only one social networking site 116 is included in the illustrative network environment 100, this is illustrative only and should not be viewed as limiting upon the disclosed subject matter. In an actual embodiment, there may be any number of social networking sites connected to the network 108.
As is known in the art, the search engine 110 is configured to communicate (directly, or indirectly through service calls and/or web crawlers) with multiple content sources, including the news sites 112 and 114, the social networking site 116, and other sites such as blogs and registries (not shown), to obtain information about the content available at each site. Information about available content may also be pushed to the search engine from various services and/or network sites. This information (typically as references to content) is stored in a content store, from which the search engine can obtain content to respond to search queries from computer users such as computer user 101. The search engine 110 may also obtain information about any given individual from search query logs, web browsing histories, purchase histories, and the like. The information and content obtained from the various sites is typically indexed by keyword and phrase so that it can be identified and accessed quickly. Moreover, in addition to the information stored in the search engine's content store, the search engine 110 may be configured to obtain information from other sites when responding to a search query. For example, according to aspects of the disclosed subject matter, when responding to a search query, the search engine 110 may obtain data from one or more social networking sites, such as social networking site 116, either as relevant information to return to the requesting computer user and/or as information that helps the search engine identify relevant information to return to the requesting computer user.
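By way of illustration only, indexing references to content by keyword can be sketched as a small inverted index. The following Python sketch is an assumption made for explanatory purposes and is not the content store of the search engine 110; the function names `build_index` and `lookup`, the sample documents, and the whitespace tokenization are all hypothetical.

```python
from collections import defaultdict


def build_index(documents):
    """Map each lowercase keyword to the set of content references containing it.

    `documents` is a hypothetical mapping of content reference -> text.
    """
    index = defaultdict(set)
    for reference, text in documents.items():
        for token in text.lower().split():
            index[token].add(reference)
    return index


def lookup(index, query):
    """Return the references that contain every keyword of the query."""
    terms = query.lower().split()
    if not terms:
        return set()
    result = set(index.get(terms[0], set()))
    for term in terms[1:]:
        result &= index.get(term, set())
    return result


# Hypothetical content references and text, for illustration only.
documents = {
    "site-a/article-1": "Bruce Wayne joins Microsoft",
    "site-b/post-7": "Bruce Wayne gardening club",
}
index = build_index(documents)
matches = lookup(index, "Wayne Microsoft")
```

A production index would also handle phrases, stemming, and ranking; the sketch shows only why keyword indexing makes identification fast, since each lookup touches only the sets for the query terms.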
To further illustrate aspects of the disclosed subject matter, reference is now made to Fig. 2. Fig. 2 is a flow diagram of an exemplary routine for providing improved results, through query expansion, in response to a search query for content corresponding to a particular person. Beginning at block 202, the search engine 110 receives a search query from a computer user, such as computer user 101, requesting content corresponding to a particular person.
As will be readily appreciated, a search query is typically (but not exclusively) a text string. For example, a search query for content relating to a person may be "Bruce Wayne". Accordingly, because there may be several individuals with the same name, at block 204 the search engine attempts to uniquely identify the person who is the subject of the search query. According to aspects of the disclosed subject matter, the search engine attempts to uniquely identify the person whose content is requested according to at least general information and specific information relating to the requesting computer user. By way of illustration and not limitation, general information includes: the popularity of search queries corresponding to people with the name identified in the search query; the popularity trends of people with the name identified in the search query; other terms and/or phrases in the search query (such as "Bruce Wayne Seattle" or "Bruce Wayne Microsoft"); images representing the person; and the like. By way of illustration and not limitation, specific information relating to the requesting computer user may include: current location; prior search query history; current and previous workplaces; current and previous educational institutions; social networks; preferences (explicitly and implicitly identified); general graph connectivity and the number of mutual friends between the requesting computer user and potential subjects of the search query; physical distance between the requesting user and a potential subject; locations of friends; previous locations; and the like. Typically, but not exclusively, the search engine 110 may associate, at least internally, a globally unique identifier with the person who is the subject of the search query. Moreover, once the person who is the subject of the search query is identified, the search engine 110 may use the associated globally unique identifier in obtaining or re-ranking search results responsive to the search query.
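The identification of block 204 can be sketched, under stated assumptions, as scoring each candidate with the same name by a mix of general signals (popularity, additional query terms) and requester-specific signals (location, mutual friends). The Python below is a minimal illustration only; the `Candidate` fields, the numeric weights, and the identifiers such as `p-002` are hypothetical and are not taken from the disclosure.

```python
from dataclasses import dataclass


@dataclass
class Candidate:
    person_id: str       # hypothetical globally unique identifier
    name: str
    popularity: float    # general signal, assumed normalized to 0..1
    city: str
    employer: str
    mutual_friends: int  # requester-specific signal


def score(candidate, requester_city, extra_terms):
    """Combine general and requester-specific signals (weights are assumptions)."""
    total = candidate.popularity
    # Extra query terms such as "Seattle" or "Microsoft" act as hints.
    attributes = {candidate.city.lower(), candidate.employer.lower()}
    total += 1.0 * sum(1 for term in extra_terms if term.lower() in attributes)
    # Requester-specific signals: shared location and mutual friends.
    if candidate.city.lower() == requester_city.lower():
        total += 0.5
    total += 0.1 * min(candidate.mutual_friends, 10)
    return total


def identify_person(candidates, requester_city, extra_terms):
    """Return the highest-scoring candidate for the query."""
    return max(candidates, key=lambda c: score(c, requester_city, extra_terms))


candidates = [
    Candidate("p-001", "Bruce Wayne", 0.9, "Gotham", "Wayne Enterprises", 0),
    Candidate("p-002", "Bruce Wayne", 0.1, "Seattle", "Microsoft", 3),
]
best = identify_person(candidates, requester_city="Seattle", extra_terms=["Microsoft"])
```

In this sketch the less popular candidate wins for a Seattle requester who adds "Microsoft" to the query, which mirrors the point that requester-specific information can outweigh raw popularity.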
Of course, the order presented in blocks 202 and 204 should be viewed as illustrative and not limiting upon the disclosed subject matter. Under various conditions, the identity of the person for whom content is sought may be known before the search request is submitted/received. For example, an auto-suggest search recommendation may specify a particular person as one of its suggestions and, typically, the unique identity of the suggested person is known. Alternatively, another service may submit to the search service a search request for a person that uniquely identifies the person, such that the identity of the person does not need to be determined. Accordingly, while blocks 202 and 204 of Fig. 2 disclose one embodiment, this is illustrative of one embodiment and not limiting upon the disclosed subject matter.
With regard to a search request identifying a person for whom content is sought, there may also be times when the person's name is not known but some information is provided that can lead to uniquely identifying the person. For example, a computer user may not know the name of the general manager of the Seattle Seahawks, but submitting the text "general manager of the Seattle Seahawks" is usually sufficient to identify the person for whom content is sought, and the identity of that person can be determined in block 204.
At block 206, after identifying the person who is the subject of the search query, the search engine 110 obtains related entity data corresponding to the identified person. According to aspects of the disclosed subject matter, the related entity data includes entities relating to the identified person. A related entity is an entity that is related to the identified person for some reason. While some reasons may be known, others may be unknown and implied according to statistical similarity. For example, assume that the identified person is an employee of Company A and a member of Workgroup Z. Based on this employment relationship, the related entities of the identified person will typically include "Company A" and "Workgroup Z". Other related entities arising from this same employment relationship may include peer collaborators. Based on the same employment relationship, still other entities may include other (previous) workgroups, past and current co-workers, and the like. Continuing the above example, the identified person may also be an alumnus of a particular university. Thus the university may be a related entity of the identified person, as may the particular college within the university that the identified person attended, the degrees conferred, the identified person's academic achievements, classmates, and the like. Still further, assume that the identified person also has a passion for gardening. The identified person may be a member of a local gardening experts group and, as a result, the group and its senior members may be related entities of the identified person.
According to aspects of the disclosed subject matter, the search engine 110 obtains related entity data from one or more related entity sources. The search engine 110 may itself host or store various information about the identified person, for example in a user profile store (such as the user profile store 628 of Fig. 6), and is therefore one of the related entity sources. For example, the search engine 110 may store user profile information corresponding to a computer user. This user profile information may be based on explicitly identified information (from the identified person) and implicitly identified information (such as information derived from search queries, browsing history, and the like). Social networking sites, such as social networking site 116, represent additional related entity sources. As indicated above, on a social networking site a person, such as the person identified by the search query, can establish relationships and social networks with other entities (including people, organizations, activities, causes, and the like). Of course, there may be various related entity sources, each of which hosts information that can indicate relationships between the identified person and other entities, and the search engine 110 may be configured to obtain related entity data from any number of these related entity sources.
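As a minimal sketch of obtaining related entity data from any number of sources, the Python below merges the entities reported by several sources and records which sources reported each one. The source names, the callable-per-source interface, and the identifier `p-002` are assumptions made for illustration; the disclosure does not specify such an interface.

```python
def collect_related_entities(person_id, sources):
    """Merge related entities from several sources.

    `sources` is a hypothetical mapping of source name -> callable that
    returns the related entities it knows for `person_id`.
    Returns a mapping of entity -> list of source names that reported it.
    """
    merged = {}
    for source_name, fetch in sources.items():
        for entity in fetch(person_id):
            merged.setdefault(entity, []).append(source_name)
    return merged


# Hypothetical sources standing in for a user profile store and a social site.
sources = {
    "profile_store": lambda pid: ["Company A", "Workgroup Z"],
    "social_site": lambda pid: ["Company A", "Local gardening experts"],
}
related = collect_related_entities("p-002", sources)
```

Tracking the reporting sources is a design choice of the sketch: an entity confirmed by several sources could reasonably be treated as a stronger signal than one reported once.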
It should be appreciated that the related entity information hosted by the various related entity sources may include information that the identified person wishes to keep private. To address this, according to aspects of the disclosed subject matter, the search engine identifies the requesting computer user and, if the user is identified, may obtain and use the related entity information that the requesting computer user has been given permission to use. In various embodiments, the computer user is required to authenticate himself or herself in order to access information about the identified person. By way of illustration and not limitation, other requirements may include that the requesting computer user be signed in to one or more services in order to access and/or view content that would otherwise be restricted.
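One way to picture this permission check is as a filter applied to related entity records before they are used for expansion. The sketch below is an assumption for illustration only: the record layout (`entity`, `visibility`, `allowed`), the `public`/`private` labels, and the user identifiers are hypothetical.

```python
def permitted_entities(records, requester_id):
    """Return the related entities the requester may use.

    `records` is a hypothetical list of dicts with keys 'entity',
    'visibility' ('public' or 'private'), and 'allowed' (a set of user ids).
    `requester_id` is None when the requester has not been identified.
    """
    visible = []
    for record in records:
        if record["visibility"] == "public":
            visible.append(record["entity"])
        elif requester_id is not None and requester_id in record["allowed"]:
            visible.append(record["entity"])
    return visible


records = [
    {"entity": "Company A", "visibility": "public", "allowed": set()},
    {"entity": "Batman", "visibility": "private", "allowed": {"u-alfred"}},
]
for_anonymous = permitted_entities(records, None)
for_trusted = permitted_entities(records, "u-alfred")
```

An unidentified requester sees only public entities, while an authenticated requester with permission also sees the private ones.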
As suggested in the above examples, a related entity source may associate one or more categories with an individual (such as the person identified by the search query). Accordingly, the related entity data obtained from a related entity source may also include category data. Category data (both the set of potential relationships defined by each category and the actual relationships with other entities) can be used to advantage in expanding the received search query (as discussed in greater detail below). In the above example, a related entity source may have various categories associated with the identified person, including "employee", "alumnus", and "gardener". Moreover, each related entity source may maintain category information that defines what it means to be associated with a category. This category information typically includes a list of potential (though not necessarily required) relationships that may exist between a first entity belonging to a particular category (such as the identified person) and other entities. The "employee" category may define the set of potential relationships as including "employer", "workgroup", "current manager", "direct report", "colleague", and so on. Accordingly, each entity classified as an "employee" may then have relationships with other entities as defined by the set of potential relationships. Of course, while a category defines a set of potential relationships, an entity of that category is not required to have a relationship with another entity for each potential relationship. Still further, a given entity, such as the entity corresponding to the person identified by the search query, may be associated with multiple categories. In addition to defined categories, categories may also be inferred. For example, an employee may be interested in prior work performed at a previous company, such that an inferred category is "colleague".
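The distinction between a category's potential relationships and an entity's actual relationships can be sketched with two small data classes. This is a hypothetical model written for illustration, not a structure specified by the disclosure; the class and method names are assumptions.

```python
from dataclasses import dataclass, field


@dataclass
class Category:
    name: str
    potential_relations: list  # relationships an entity of this category MAY have


@dataclass
class PersonEntity:
    name: str
    categories: list = field(default_factory=list)
    # (category name, relation) -> list of related entity names
    relations: dict = field(default_factory=dict)

    def relate(self, category, relation, entity):
        """Record an actual relationship allowed by the category."""
        if relation not in category.potential_relations:
            raise ValueError(f"{relation!r} is not defined by {category.name!r}")
        if category not in self.categories:
            self.categories.append(category)
        self.relations.setdefault((category.name, relation), []).append(entity)

    def unfilled(self, category):
        """Potential relationships of the category with no actual counterpart."""
        return [r for r in category.potential_relations
                if (category.name, r) not in self.relations]


EMPLOYEE = Category("employee", ["employer", "workgroup", "current manager",
                                 "direct report", "colleague"])
person = PersonEntity("Bruce Wayne")
person.relate(EMPLOYEE, "employer", "Company A")
person.relate(EMPLOYEE, "workgroup", "Workgroup Z")
missing = person.unfilled(EMPLOYEE)
```

Here the "employee" category defines five potential relationships, of which the person actually has two, which matches the statement that not every potential relationship needs to be present.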
At block 208, a search model to be applied to the expanded search query is identified/determined. The search model includes information for weighting the various elements (terms and phrases) of the expanded search query in order to improve the search results. Applying a search model to the expanded search query recognizes, at least in part, that not all query terms of the expanded search query are equal: some query terms are more important than others in identifying relevant search content for the identified person. Typically, though not exclusively, employment-related query terms or education-related query terms are favored/weighted to provide improved search results when ranking the relevance of the various search results (or, more precisely, of the content referenced by the search results) to a particular user. According to various embodiments, the selection of the search model may be based on information about the requesting computer user. For example, if the requesting computer user is known to have been educated at a university, an education model may be selected. Alternatively, the selection of the search model may be made according to information about the identified person that is available to the search engine 110 or to external sources (including from the related entity data). In yet additional embodiments, the selection of the search model may be made according to information about both the requesting computer user and the person identified by the search query.
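Selecting a search model and using it to weight query terms might look like the following sketch. The model names, the numeric weights, the `alma_mater` profile key, and the choice of "employment" as a default are all assumptions introduced for illustration.

```python
# Hypothetical search models: term kind -> weight (values are assumptions).
SEARCH_MODELS = {
    "employment": {"employment": 2.0, "education": 1.0, "other": 0.5},
    "education": {"employment": 1.0, "education": 2.0, "other": 0.5},
}


def select_model(requester_profile, default="employment"):
    """Pick a model from what is known about the requesting user."""
    if requester_profile.get("alma_mater"):
        return "education"
    return default


def weight_terms(terms, model_name):
    """Attach a weight to each (term, kind) pair according to the model."""
    weights = SEARCH_MODELS[model_name]
    return {term: weights.get(kind, weights["other"]) for term, kind in terms}


model = select_model({"alma_mater": "State University"})
weighted = weight_terms(
    [("Research", "employment"), ("State University", "education"),
     ("gardening", "hobby")],
    model,
)
```

A model keyed on the identified person, or on both the requester and the identified person, would follow the same pattern with a different selection function.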
At block 210, an expanded search query is generated according to the search model determined for the identified person. Generating the expanded search query is discussed in further detail with regard to Fig. 3. More specifically, Fig. 3 is a flow diagram illustrating an exemplary routine 300 for generating an expanded search query according to the related entity data obtained from the related entity sources. At block 302, the filter elements of the received search query and the identified person are included as the original section of the expanded search query. Although this may entail simply copying the received search query into the original section, the original search query need not be simply copied. Requesting computer users often misspell the name of the person sought or one of the identifying filter elements associated with that person. For example, the received search query may be "Bruse Wayn Microsoft", an attempt to find content corresponding to a "Bruse Wayn" who works at "Microsoft". If it can be determined that the name (or one or more filter elements) is misspelled, including the original search query in the expanded search query would be less effective. For this reason, the person is identified in block 204 of routine 200. Corrections to the filter elements may also be made (though this is not explicitly recited in routines 200 and 300).
In addition to including the query terms of the search query in the expanded search query, query terms are derived from the obtained related entity data and included/incorporated in the expanded search query. In particular, at block 304, related entities (relating to the identified person) from the obtained related entity data are included, according to the determined search model, in a related entity section of the expanded search query. At block 306, according to the search model, query terms derived from the category data, comprising both categories (as entities) and category entities (as described below), are included in a category entity section of the expanded search query. Thereafter, at block 308, the expanded search query is returned and routine 300 terminates.
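Blocks 302 through 308 of routine 300 can be sketched as a function that assembles the three sections of an expanded query. The Python below is a minimal illustration under stated assumptions: the dictionary layout, the tagging of each related entity with the category it arises from, and the rule that a zero weight excludes a term are hypothetical choices, not details from the disclosure.

```python
def generate_expanded_query(identified_name, filter_terms, related_entities,
                            categories, model_weights):
    """Assemble the three sections of an expanded query (blocks 302-308).

    related_entities: list of (entity, category name) pairs   [assumed shape]
    categories: mapping of category name -> list of category entities
    model_weights: mapping of category name -> weight from the search model
    """
    expanded = {
        # Block 302: the identified (spelling-corrected) person plus filter elements.
        "original": [identified_name] + list(filter_terms),
        "related_entities": [],
        "category_entities": [],
    }
    # Block 304: related entities admitted and weighted by the search model.
    for entity, kind in related_entities:
        weight = model_weights.get(kind, 0.0)
        if weight > 0:
            expanded["related_entities"].append((entity, weight))
    # Block 306: each category (as an entity) and its category entities.
    for category, entities in categories.items():
        weight = model_weights.get(category, 0.0)
        if weight > 0:
            expanded["category_entities"].append((category, weight))
            expanded["category_entities"].extend((e, weight) for e in entities)
    # Block 308: return the expanded query.
    return expanded


query = generate_expanded_query(
    "Bruce Wayne", ["Microsoft"],
    related_entities=[("Research", "employee"), ("Gardening club", "hobby")],
    categories={"employee": ["Workgroup", "colleague"]},
    model_weights={"employee": 2.0},
)
```

Note that the original section carries the identified name rather than the text as typed, which reflects the point made above about misspelled queries.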
To better illustrate the above-described sections of the expanded search query, reference is made to Fig. 4. Fig. 4 illustrates an exemplary expanded search query 400 corresponding to the above example, i.e., for the person "Bruse Wayn". For this example, assume that the identified person "Bruse Wayn" is associated with only one category, employee. As shown in the expanded search query 400, the original section 402 includes the original search query text 404 "Bruse.Wayn" and alternative names relating to the identified person, in this case "Batman Dark.Knight Matches.Malone Caped.Crusader". Of course, not all computer users will have access rights to all information. In the present example, not everyone may know the alternative names that may uniquely refer to "Bruse Wayn". Where the requesting computer user has full rights, however, such information can be useful in obtaining improved results. The operator 406 "." between the two names of the search query represents an exemplary convention indicating that the two names "Bruse" and "Wayn" should preferably appear next to each other in that order. It is not mandatory that they appear together or that both appear; this is only highly preferred. Of course, this convention (like the other operators in the figure) is illustrative only and should not be viewed as limiting upon the disclosed subject matter. Other syntax conventions include (by way of illustration and not limitation): the operator 408 "inbody:", which indicates to the search engine 110 that it should match a document when any of the words/terms between the parentheses is found in the body of the content; "noalter:", an operator indicating that the spelling of a term should not be altered; and "norelax:", an operator indicating that a term is important and cannot be dropped in matching content. The operator 410 "+" indicates to the search engine a concatenation of other search operators and/or tokens.
The expanded search query 400 also includes a related entity section 412, which comprises related entities of the person identified by the search query, such as text 416 "Research". Further included in the expanded search query is a category entity section 414, which comprises the category entities of the category "employee". As mentioned above, the category entity section 414 includes the category ("employee") and category entities such as text 418 "Workgroup". These entries optionally help produce results based on how the computer user may know the identified person (in this case "Bruse Wayn"). As can be seen, the expanded search query for a particular person takes a search query, such as "Bruse Wayn", and expands it with related entities and category entities to better identify content corresponding to the identified person. Regarding the operator "rankonly:", it allows a document's rank to rise when a matching token/value (such as "Research") is found in the document. It operates such that the specified term is not required to be found in a document for that document to be included in the results, but if the term is found, the document is ranked as more relevant. The operator "word:" matches a document if one or more of the tokens between the parentheses (such as "Workgroup") are found in the document. In a sense, the "word:" operator behaves as a max (or maximum) type of operator: each token between the parentheses is compared with the document and the single maximum token rank is returned. In particular, if more than one token matches, only the value of the highest-ranking matching token is returned.
The "norank:" token (not shown) requires that the specified tokens (identified between the parentheses) be present in a result document, but does not affect the ranking or relevance of that document in the overall results. In combination with the operator "rankonly:", if one or more of the tokens are found, the rank of the document is increased.
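By way of illustration only, the section structure described above might be assembled as in the following Python sketch. The class PersonQuery, the function build_expanded_query, and the exact arrangement of operators within each section are assumptions made for this example; only the operator names ("inbody:", "rankonly:", "word:", "." and "+") are taken from the description, and the precise layout of Fig. 4 is not reproduced here.

```python
from dataclasses import dataclass, field


@dataclass
class PersonQuery:
    """Hypothetical container for the pieces of an expanded query."""
    name_tokens: list[str]                      # e.g. ["Bruse", "Wayn"]
    alternate_names: list[str] = field(default_factory=list)
    related_entities: list[str] = field(default_factory=list)
    category_entities: list[str] = field(default_factory=list)


def build_expanded_query(q: PersonQuery) -> str:
    # Original section: the name joined with the "." adjacency-preference
    # operator, followed by any alternate names, matched in the content body.
    names = [".".join(q.name_tokens)] + q.alternate_names
    sections = ["inbody:(" + " ".join(names) + ")"]
    # Related entity section: optional terms that only influence ranking.
    if q.related_entities:
        sections.append("rankonly:(" + " ".join(q.related_entities) + ")")
    # Category entity section: match on any token between the parentheses.
    if q.category_entities:
        sections.append("word:(" + " ".join(q.category_entities) + ")")
    # "+" concatenates the operator groups.
    return " + ".join(sections)
```

A caller with no related entity data would receive only the original section, which matches the idea that expansion adds to the query without replacing it.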
Although the expanded queries 400 and 500 generally comprise text tokens (such as "Bruse.Wayn"), it is to be appreciated that this is illustrative and should not be regarded as limiting the disclosed subject matter. In alternative embodiments, one or more tokens in the expanded search query may be unique identifiers that identify the person being searched for and/or related entities. For example, the expanded search query 500 includes an operator 510 containing a Facebook numeric identifier ("740049358") and an operator 512 containing a Facebook user identifier ("t-drake"). Of course, any particular identifier source may be used, and Facebook identifiers are illustrative only.
As suggested above, an identified person may be associated with more than one category. Thus, although the expanded search query 400 of Fig. 4 depicts information from a single category, this is for purposes of illustration. Similarly, Fig. 5 shows an exemplary expanded search query 500 corresponding to the example above, namely for the identified person "Bruse Wayn", but in this example including information from two categories, employer and education. As can be seen, the expanded search query 500 comprises an original section 502, a related entity section 504, and a category entity section 506. As seen in the related entity section 504 and the category entity section 506, as more related entities are found for the identified person and more information is obtained for the various categories corresponding to the identified person, the expanded search query becomes more detailed and inclusive, helping the search engine identify content corresponding to the person identified by the search query.
At block 212, search results are obtained according to the expanded search query. Obtaining search results according to a search query (in this case, a search query with terms expanded according to related entities and categories) is well known in the art. According to aspects of the disclosed subject matter, search results are obtained according to the query terms from the received search query and, optionally, according to the query terms derived from the related entity data. In other words, the query terms of the expanded search query that are derived from the related entity data are intended to broaden the scope of content/search results corresponding to the identified person, but these derived query terms are not mandatory terms. In this manner (i.e., with the query terms derived from the related entity data being "optional"), the expanded search query broadens the scope of content relating to the identified person rather than potentially narrowing it, as would happen if those query terms were not optional.
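The difference between mandatory and optional query terms can be illustrated with a minimal Python sketch. The function names, the whitespace tokenization, and the scoring weights (1.0 per mandatory term, 0.5 per optional term) are assumptions chosen for this example and are not part of the disclosed subject matter.

```python
def match_and_score(document_text, required_terms, optional_terms):
    """Return None if the document does not match, else a relevance score."""
    words = set(document_text.lower().split())
    # Mandatory terms: every one must be present, or the document is excluded.
    if not all(t.lower() in words for t in required_terms):
        return None
    score = float(len(required_terms))
    # Optional terms: never exclude a document, only raise its score.
    score += sum(0.5 for t in optional_terms if t.lower() in words)
    return score


def search(documents, required_terms, optional_terms):
    scored = []
    for doc_id, text in documents.items():
        s = match_and_score(text, required_terms, optional_terms)
        if s is not None:
            scored.append((s, doc_id))
    # Highest score first; ties broken by document id for determinism.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [doc_id for _, doc_id in scored]
```

In this sketch, passing a derived term such as "Research" as optional keeps every document that mentions the person and merely ranks the ones mentioning "Research" higher, whereas passing it as a required term would drop documents that omit it.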
At block 214, a search results presentation is generated based, at least in part, on the obtained search results. Typically, one or more search results pages are generated from the obtained search results, with the highest-scoring results presented on the first page. At block 216, after the search results presentation is generated, at least part of the presentation is returned to the requesting computer user in response to the search query. According to various embodiments, the results returned to the requesting computer user are organized according to the various categories of information about the target person. Thereafter, routine 200 terminates.
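One possible way to form such a presentation is sketched below in Python. The tuple layout (score, title, category), the function names, and the default page size are hypothetical choices for this example; the description does not prescribe any particular data structure.

```python
from collections import defaultdict


def paginate(results, page_size=10):
    """Split (score, title, category) results into pages, best first."""
    ordered = sorted(results, key=lambda r: r[0], reverse=True)
    return [ordered[i:i + page_size] for i in range(0, len(ordered), page_size)]


def group_by_category(page):
    """Organize one page of results under their category labels."""
    groups = defaultdict(list)
    for score, title, category in page:
        groups[category].append(title)
    return dict(groups)
```

Sorting before splitting places the highest-scoring results on the first page, and grouping each page afterwards preserves that order within every category.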
Although not shown in routine 200, additional steps may be taken after the results are returned to the computer user. By way of illustration and not limitation, one or more processes on the computer user's device may monitor the computer user's activity with respect to the provided results, such as which references (hyperlinks) the computer user follows, which are avoided, and how much time the computer user spends on particular content compared with other content. By monitoring the computer user's activity and submitting it to the search engine, inferences can be made about particular people and/or entities, and these inferences can be taken into account in subsequent queries. Indeed, some or all of these inferences (both for and against particular results) may be used to form the search models discussed above.
Regarding routines 200 and 300, although these routines are expressed in terms of discrete steps, these steps should be viewed as logical in nature and may or may not correspond to any actual and/or discrete steps of a particular implementation. Nor should the order in which these steps are presented in the various routines be interpreted as the only order in which the steps may be carried out. Moreover, although these routines include various novel features of the disclosed subject matter, other steps (not listed) may also be carried out in the execution of the routines. Further, those skilled in the art will appreciate that the logical steps of these routines may be combined together or may comprise multiple steps. The steps of routines 200 and 300 may be carried out in parallel or in series, or precomputed. Often, but not exclusively, the functionality of the various routines is embodied in software (e.g., applications, system services, libraries, and the like) executed on computer hardware and/or systems as described below with regard to Fig. 6. In various embodiments, all or some of the various routines may also be embodied in hardware modules, including systems on a chip, on a computer system.
Although many novel aspects of the disclosed subject matter are expressed in routines embodied in applications (also referred to as computer programs), apps (small, generally single- or narrow-purpose applications), and/or methods, these aspects may also be embodied as computer-executable instructions stored by computer-readable media, also referred to as computer-readable storage media. As those skilled in the art will recognize, computer-readable media can host computer-executable instructions for later retrieval and execution. When the computer-executable instructions stored on the computer-readable storage devices are executed, they carry out various steps, methods, and/or functionality, including those steps, methods, and routines described above with regard to routines 200 and 300. Examples of computer-readable media include, but are not limited to: optical storage media such as Blu-ray discs, digital video discs (DVDs), compact discs (CDs), optical disc cartridges, and the like; magnetic storage media including hard disk drives, floppy disks, magnetic tape, and the like; memory storage devices such as random access memory (RAM), read-only memory (ROM), memory cards, thumb drives, and the like; cloud storage (i.e., an online storage service); and the like. For purposes of this disclosure, however, computer-readable media expressly excludes carrier waves and propagated signals.
Turning now to Fig. 6, Fig. 6 is a block diagram illustrating exemplary components of a search engine configured to provide improved results in response to a search query from a computer user. As shown in Fig. 6, the search engine 110 includes a processor 602 (or processing unit) and a memory 604 interconnected by way of a system bus 610. As those skilled in the art will appreciate, memory 604 typically (but not always) comprises both volatile memory 606 and non-volatile memory 608. Volatile memory 606 retains or stores information so long as the memory is supplied with power. In contrast, non-volatile memory 608 is capable of storing (or persisting) information even when a power supply is not available. Generally speaking, RAM and CPU cache memory are examples of volatile memory, whereas ROM and memory cards are examples of non-volatile memory.
The processor 602 executes instructions retrieved from the memory 604 in carrying out various functions, particularly in responding to a search query with results improved through query expansion. The processor 602 may comprise any of various commercially available processors, such as single-processor, multi-processor, single-core, and multi-core units. Moreover, those skilled in the art will appreciate that the novel aspects of the disclosed subject matter may be practiced with other computer system configurations, including but not limited to: microcomputers; mainframe computers; personal computers (e.g., desktop computers, laptop computers, tablet computers, etc.); handheld computing devices such as smartphones, personal digital assistants, and the like; microprocessor-based or programmable consumer electronics; game consoles; and the like.
The system bus 610 provides an interface for the various components to communicate with one another. The system bus 610 can be any of several types of bus structures that can interconnect the various components (including both internal and external components). The search engine 110 further includes a network communication component 612 for interconnecting the search engine with other computers (including, but not limited to, user computers such as user computers 102-106 and other network sites including network sites 112-116) as well as other devices on the computer network 108. The network communication component 612 may be configured to communicate with other devices and services on an external network, such as network 108, via a wired connection, a wireless connection, or both.
The search engine 110 also includes a query topic identification component 614, which is configured to identify the subject of a search query, such as the person identified in the search query as described above. Also included in the search engine 110 is a related entity retrieval component 616. The related entity retrieval component 616 obtains related entity data corresponding to the related entities of the identified person (or, more generally, the related entities of the subject of the search query). As previously mentioned, the related entity data comprises related entities, categories associated with the identified person, and category data corresponding to the associated categories. The related entity retrieval component 616 obtains the related entity data from related entity sources such as those described above with regard to Fig. 2. An expanded query generator 618 generates an expanded search query from the search query received from the computer user according to the related entity data obtained by the related entity retrieval component 616.
A search results retrieval component is configured to obtain search results from a content repository 626 according to the expanded search query generated by the expanded query generator 618. A search model component 624 is configured to select a search model (as described above) and apply the search model to the obtained search results. A search results presentation generator 620 generates a search results presentation, typically comprising one or more search results pages, for presentation to the requesting computer user in response to the search query.
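The cooperation of these logical components can be sketched, under stated assumptions, as a chain of Python functions. The in-memory dictionaries standing in for a related entity source and for the content repository 626, all function names, and the simple word-matching logic are hypothetical; the search model component 624 is omitted for brevity.

```python
RELATED_ENTITY_SOURCE = {  # hypothetical stand-in for a related entity source
    "bruse wayn": {"entities": ["Research"],
                   "categories": {"employee": ["Workgroup"]}},
}
CONTENT_REPOSITORY = {  # hypothetical stand-in for content repository 626
    "doc1": "Bruse Wayn joins the Research Workgroup",
    "doc2": "Bruse Wayn attends charity gala",
    "doc3": "Research Workgroup annual report",
}


def identify_subject(query):
    # Query topic identification (614): here, simply the normalized query.
    return query.strip().lower()


def retrieve_related_entities(subject):
    # Related entity retrieval (616).
    return RELATED_ENTITY_SOURCE.get(subject, {"entities": [], "categories": {}})


def generate_expanded_query(query, related):
    # Expanded query generation (618): original terms are required,
    # terms derived from related entity data are optional.
    optional = list(related["entities"])
    for instances in related["categories"].values():
        optional.extend(instances)
    return {"required": query.lower().split(),
            "optional": [t.lower() for t in optional]}


def retrieve_results(expanded):
    # Search results retrieval against the content repository.
    hits = []
    for doc_id, text in CONTENT_REPOSITORY.items():
        words = set(text.lower().split())
        if all(t in words for t in expanded["required"]):
            boost = sum(1 for t in expanded["optional"] if t in words)
            hits.append((boost, doc_id))
    return hits


def generate_presentation(hits):
    # Presentation generation (620): best-scoring documents first.
    return [doc_id for _, doc_id in sorted(hits, key=lambda h: (-h[0], h[1]))]


def respond(query):
    subject = identify_subject(query)
    related = retrieve_related_entities(subject)
    expanded = generate_expanded_query(query, related)
    return generate_presentation(retrieve_results(expanded))
```

Keeping each stage as a separate function mirrors the point that the components are logical and may be combined, split, or distributed in an actual embodiment.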
Those skilled in the art will appreciate that the various components of the search engine 110 of Fig. 6 described above may be implemented as executable software modules within the computer system, as hardware modules (including SoCs, i.e., systems on a chip), or as a combination of the two. Moreover, each of the various components may be implemented as an independent, cooperative process or device, operating in conjunction with one or more computer systems. It should be further appreciated, of course, that the various components described above with regard to the search engine 110 should be viewed as logical components for carrying out the various described functions. As those skilled in the art will appreciate, logical components (or subsystems) may or may not correspond directly, in a one-to-one manner, to actual, discrete components. In an actual embodiment, the various components of each computer system may be combined together or broken up across multiple actual components and/or implemented as cooperative processes on the computer network 108.
In addition to operating on the search engine 110, aspects of the disclosed subject matter may be implemented on other computing devices and/or distributed across multiple computing devices, including the computer user's device. For example, according to various embodiments, at least some content highly relevant to a search request may be hosted on an access-protected website, i.e., the content is available to the computer user when he/she is authenticated and/or maintains an open login state with the website, but is otherwise restricted from others. In response to a search request from the computer user, the search engine (or another service) can obtain related entity data from this access-restricted site indirectly, through the computer user's device; the computer user's device (e.g., on which the computer user maintains a current login state with the website) accesses the related entity data on behalf of the search service. Indeed, in various embodiments, one or more components on the computer user's device obtain data corresponding to other people from access-restricted sites in anticipation of search requests.
Although much of the disclosed subject matter has been described with regard to a computer user playing an active role in obtaining content relating to a particular person, aspects of the disclosed subject matter may be suitably and advantageously applied to the automatic generation of content relating to a person. For example, various search queries (expanded search queries) regarding one or more people can be made so that "up-to-date" content on the Internet regarding the person (or people) is available upon request. One example would be a setting in which a user can be informed when new images/videos/news stories about that user appear on the Internet. Of course, aspects of the disclosed subject matter can be applied to topics or entities other than people. For example, a page may be set up and automatically generated to display the latest news about rock climbing, the Supreme Court, and the like.
While various novel aspects of the disclosed subject matter have been described, it should be appreciated that these aspects are exemplary and should not be construed as limiting. Variations and alterations to the various aspects may be made without departing from the scope of the disclosed subject matter.

Claims (10)

1. A computer-implemented method for responding to a search query, the method comprising:
Receiving a search query, the search query identifying a person for whom related content is sought;
Obtaining related entity data from a related entity source, the related entity data comprising at least one of a category or a related entity associated with the person identified by the search query;
Generating an expanded search query based on the received search query and the related entity data;
Obtaining search results according to the expanded search query;
Generating a search results presentation according to the obtained search results; and
Returning the search results presentation in response to the search query.
2. The computer-implemented method of claim 1:
Wherein generating the expanded search query comprises merging query terms derived from the related entity data with the query terms of the received search query; and
Wherein obtaining search results according to the expanded search query comprises obtaining search results according to the query terms of the received search query and, optionally, according to the query terms derived from the related entity data.
3. The computer-implemented method of claim 2, wherein a category associated with the identified person is included in the expanded search query as a query term derived from the related entity data.
4. The computer-implemented method of claim 3, wherein the related entity data comprises a plurality of categories associated with the identified person.
5. The computer-implemented method of claim 4, wherein the related entity data comprises category data corresponding to each category in the related entity data, and wherein the category data comprises one or more category entities that are merged into the expanded search query as query terms.
6. The computer-implemented method of claim 2, further comprising selecting a search model from a plurality of search models and applying the selected search model to the generation of the expanded search query,
Wherein selecting a search model from the plurality of search models comprises one or more of the following:
Selecting the search model according to a profile corresponding to the requesting computer user; and
Selecting the search model according to a profile corresponding to the identified person.
7. A computer-readable medium bearing computer-executable instructions which, when executed on a computing system comprising at least a processor executing the instructions retrieved from the medium, carry out any of the methods set forth in claims 1-6.
8. A computer system for responding to a search query for content relating to a person, the system comprising a processor and a memory, wherein the processor executes instructions stored in the memory as part of or in conjunction with additional components to respond to a search query for content relating to a person, the additional components comprising:
A query topic identification component configured to determine, from the search query, the identity of the person for whom related content is sought;
A related entity retrieval component for obtaining related entity data corresponding to the identified person from a related entity source;
An expanded query generator that generates an expanded query from the search query for content relating to the identified person and from the related entity data, wherein the related entity data comprises at least one of a related entity or a category associated with the person identified by the search query;
A search results retrieval component configured to obtain search results from a content repository according to the expanded search query, the search results referencing content corresponding to the identified person; and
A search results presentation generator configured to generate a search results presentation according to the search results referencing content corresponding to the identified person and to return the search results presentation to the computer user.
9. The computer system of claim 8, wherein the search results retrieval component obtains the search results corresponding to the identified person according to the expanded search query by obtaining search results according to the query terms of the received search query and, optionally, according to the query terms from the related entity data that are merged into the expanded search query.
10. The computer system of claim 8, further comprising:
A search model component configured to select a search model from a plurality of search models and to provide the search model to the expanded query generator for generating the expanded query from the search query and the related entity data according to the search model,
Wherein the expanded query generator generates the expanded query according to the search model; and
Wherein the search results presentation generator generates the search results presentation according to the search model.
CN201480037264.2A 2013-06-29 2014-06-24 Person search utilizing entity expansion Pending CN105493082A (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US13/931922 2013-06-29
US13/931,922 US20150006520A1 (en) 2013-06-10 2013-06-29 Person Search Utilizing Entity Expansion
PCT/US2014/043750 WO2014209925A1 (en) 2013-06-29 2014-06-24 Person search utilizing entity expansion

Publications (1)

Publication Number Publication Date
CN105493082A true CN105493082A (en) 2016-04-13

Family

ID=51210813

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201480037264.2A Pending CN105493082A (en) 2013-06-29 2014-06-24 Person search utilizing entity expansion

Country Status (4)

Country Link
EP (1) EP3014486A1 (en)
KR (1) KR20160026907A (en)
CN (1) CN105493082A (en)
WO (1) WO2014209925A1 (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107704480A (en) * 2016-08-08 2018-02-16 百度(美国)有限责任公司 Extension and the method and system and computer media for strengthening knowledge graph
CN109791544A (en) * 2016-09-30 2019-05-21 微软技术许可有限责任公司 To analyzing when scheming the inquiry inquired across subgraph
CN110059113A (en) * 2018-01-08 2019-07-26 国际商业机器公司 The problem of knowledge based figure, corrects
CN110291515A (en) * 2017-02-13 2019-09-27 微软技术许可有限责任公司 Distributed index search in computing system
CN112052314A (en) * 2019-06-05 2020-12-08 国际商业机器公司 Method and system for providing suggestions to complete a query
CN113111647A (en) * 2021-04-06 2021-07-13 北京字跳网络技术有限公司 Information processing method and device, terminal and storage medium
US11748506B2 (en) 2017-02-27 2023-09-05 Microsoft Technology Licensing, Llc Access controlled graph query spanning

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR102017853B1 (en) 2016-09-06 2019-09-03 주식회사 카카오 Method and apparatus for searching
CN113297452A (en) * 2020-05-26 2021-08-24 阿里巴巴集团控股有限公司 Multi-level search method, multi-level search device and electronic equipment

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110119243A1 (en) * 2009-10-30 2011-05-19 Evri Inc. Keyword-based search engine results using enhanced query strategies
CN102902806A (en) * 2012-10-17 2013-01-30 深圳市宜搜科技发展有限公司 Method and system for performing inquiry expansion by using search engine
CN102955697A (en) * 2012-11-08 2013-03-06 沈阳建筑大学 Aspect orientation-based component base building method

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107704480A (en) * 2016-08-08 2018-02-16 百度(美国)有限责任公司 Extension and the method and system and computer media for strengthening knowledge graph
CN109791544A (en) * 2016-09-30 2019-05-21 微软技术许可有限责任公司 To analyzing when scheming the inquiry inquired across subgraph
CN110291515A (en) * 2017-02-13 2019-09-27 微软技术许可有限责任公司 Distributed index search in computing system
CN110291515B (en) * 2017-02-13 2023-08-15 微软技术许可有限责任公司 Distributed index searching in computing systems
US11748506B2 (en) 2017-02-27 2023-09-05 Microsoft Technology Licensing, Llc Access controlled graph query spanning
CN110059113A (en) * 2018-01-08 2019-07-26 国际商业机器公司 The problem of knowledge based figure, corrects
CN110059113B (en) * 2018-01-08 2023-07-21 国际商业机器公司 Method, system and computer readable storage medium for processing queries
CN112052314A (en) * 2019-06-05 2020-12-08 国际商业机器公司 Method and system for providing suggestions to complete a query
CN113111647A (en) * 2021-04-06 2021-07-13 北京字跳网络技术有限公司 Information processing method and device, terminal and storage medium

Also Published As

Publication number Publication date
WO2014209925A1 (en) 2014-12-31
KR20160026907A (en) 2016-03-09
EP3014486A1 (en) 2016-05-04

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20160413