CN105224555A - A kind of methods, devices and systems of search - Google Patents

A kind of methods, devices and systems of search Download PDF

Info

Publication number
CN105224555A
CN105224555A CN201410261086.3A CN201410261086A CN105224555A CN 105224555 A CN105224555 A CN 105224555A CN 201410261086 A CN201410261086 A CN 201410261086A CN 105224555 A CN105224555 A CN 105224555A
Authority
CN
China
Prior art keywords
user
word string
query word
information
query
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201410261086.3A
Other languages
Chinese (zh)
Other versions
CN105224555B (en
Inventor
张友书
张阔
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Sogou Technology Development Co Ltd
Original Assignee
Beijing Sogou Technology Development Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Sogou Technology Development Co Ltd filed Critical Beijing Sogou Technology Development Co Ltd
Priority to CN201410261086.3A priority Critical patent/CN105224555B/en
Publication of CN105224555A publication Critical patent/CN105224555A/en
Application granted granted Critical
Publication of CN105224555B publication Critical patent/CN105224555B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Abstract

Embodiments provide a kind of methods, devices and systems of search, described method comprises: when receiving the first query word string that first user is submitted to, search for described first query word string, obtains the network information of coupling; Search second user with described first user with same or similar query intention; Wherein, described second user has community information; Judge whether described first query word string meets the privacy conditions preset; When described first query word string meets default privacy conditions, confidential treatment is carried out to the community information of described second user; The described network information and the community information of the second user carried out after confidential treatment are synthesized the first Search Results.The embodiment of the present invention improves the security of community information, avoids first user and repeats to carry out loaded down with trivial details artificial filter to the network information of magnanimity, decrease expending of first user time and efforts, substantially increase the efficiency of acquisition of information, quality and capacity.

Description

A kind of methods, devices and systems of search
Technical field
The present invention relates to the technical field of search, particularly relate to a kind of method of search, a kind of device of search and a kind of system of search.
Background technology
Along with developing rapidly of network, the network information sharply increases.User, in order to find the required network information in the network information of magnanimity, uses search engine to search for usually.
Search engine refers to automatically gather information from the Internet, after certain arrangement, is supplied to the system that user carries out inquiring about.Network information vastness is multifarious, and has no order, and all network informations are as the island one by one on vast sea, web page interlinkage is bridge crisscross between these islands, and search engine, then for user draws an open-and-shut information map, consult at any time for user.
But, the contradiction that the speed of network information growth and people obtain between information needed ability is more and more outstanding, the excessive network information makes user will carry out loaded down with trivial details artificial filter when search network information, at substantial time and efforts, and the search efficiency of the network information is very low.
Summary of the invention
Embodiment of the present invention technical matters to be solved is to provide a kind of method of search, in order to expending of less user time and energy, improves the search efficiency of the network information.
Accordingly, the embodiment of the present invention additionally provides a kind of device of search and a kind of system of search, in order to ensure the implementation and application of said method.
In order to solve the problem, the embodiment of the invention discloses a kind of method of search, comprising:
When receiving the first query word string that first user is submitted to, searching for described first query word string, obtaining the network information of coupling;
Search second user with described first user with same or similar query intention; Wherein, described second user has community information;
Judge whether described first query word string meets the privacy conditions preset; When described first query word string meets default privacy conditions, confidential treatment is carried out to the community information of described second user;
The described network information and the community information of the second user carried out after confidential treatment are synthesized the first Search Results.
The embodiment of the invention also discloses a kind of device of search, comprising:
Network information search module, for when receiving the first query word string that first user is submitted to, searches for described first query word string, obtains the network information of coupling;
User searches module, for searching second user with described first user with same or similar query intention; Wherein, described second user has community information;
Privacy conditions judge module, for judging whether described first query word string meets the privacy conditions preset;
Confidential treatment module, for when described first query word string meets default privacy conditions, carries out confidential treatment to the community information of described second user;
First Search Results synthesis module, for synthesizing the first Search Results by the described network information and the community information of the second user carried out after confidential treatment.
The embodiment of the invention also discloses a kind of system of search, described system comprises server and the first client, and first user is at described first client logs;
Wherein, described server comprises:
Network information search module, for when receiving the first query word string that first user is submitted to, searches for described first query word string, obtains the network information of coupling;
User searches module, for searching second user with described first user with same or similar query intention; Wherein, described second user has community information;
Privacy conditions judge module, for judging whether described first query word string meets the privacy conditions preset;
Confidential treatment module, for when described first query word string meets default privacy conditions, carries out confidential treatment to the community information of described second user;
First Search Results synthesis module, for synthesizing the first Search Results by the described network information and the community information of the second user carried out after confidential treatment;
First Search Results returns module, for described first Search Results is returned first user;
Described first client comprises:
First query word string submits module to, for submitting the first query word string to described server;
First Search Results receiver module, for receiving the first Search Results that described server returns;
First Search Results display module, for showing described first Search Results.
Compared with prior art, the embodiment of the present invention comprises following advantage:
The the first query word string submitted to first user in the embodiment of the present invention is searched for, obtain the network information of coupling, and search second user with first user with same or similar query intention, confidential treatment is carried out when judging the satisfied privacy conditions preset, and the community information of the network information and the second user is synthesized Search Results, when making to relate to the privacy requirements such as privacy at first user, screen in the community good friend of user by analyzing search daily record, the second user of same requirements is had with user, make first user can carry out interaction with the second user screened with regard to identical demand based on community information, and the community information of the second user is maintained secrecy, then first user directly can obtain the information that the second user formerly arranged, the information that the information of second user's manual sorting returns than machinery is more effective, and improve the security of community information, avoid first user to repeat to carry out loaded down with trivial details artificial filter to the network information of magnanimity, decrease expending of first user time and efforts, decrease the system resources consumption of subscriber equipment and website, decrease taking of the network bandwidth, substantially increase the efficiency of acquisition of information, quality and capacity.
The embodiment of the present invention is when the community information of the second user is triggered by first user, set up first user to be connected with the communication of the second user, first user and the second user is made to carry out communication under community information anonymity or disclosed situation, substantially increase the dirigibility of communication modes, also substantially increase the security of communication.
Accompanying drawing explanation
Fig. 1 is the flow chart of steps of the embodiment of the method 1 of a kind of search of the present invention;
Fig. 2 is the flow chart of steps of the embodiment of the method 2 of a kind of search of the present invention;
Fig. 3 is the structured flowchart of the device embodiment 1 of a kind of search of the present invention;
Fig. 4 is the structured flowchart of the device embodiment 2 of a kind of search of the present invention;
Fig. 5 is the structured flowchart of the system embodiment 1 of a kind of search of the present invention;
Fig. 6 is the structured flowchart of the system embodiment 2 of a kind of search of the present invention.
Embodiment
For enabling above-mentioned purpose of the present invention, feature and advantage become apparent more, and below in conjunction with the drawings and specific embodiments, the present invention is further detailed explanation.
With reference to Fig. 1, show the flow chart of steps of the embodiment of the method 1 of a kind of search of the present invention, specifically can comprise the steps:
Step 101, when receiving the first query word string that first user is submitted to, searches for described first query word string, obtains the network information of coupling;
The application embodiment of the present invention, first user can at the first client logs, and first user can submit the first query word string to by the first user end to server, the network information of request search and this first query word String matching.
In the embodiment of the present invention, when receiving the first query word string that first user is submitted to, then according to this first query word string Rapid Detection network information in index database, the covariance mapping of the network information and inquiry can be carried out, the result that will export sorted.
Be described for search engine, the search routine of search engine is divided into two parts, and one is front end user request process, and two is that rear end makes data procedures.
One, front end user request process:
1. receive request: receive the query word string that user inputs at search engine;
2. query word analysis: word segmentation processing is carried out to query word string;
3. retrieve: according to word segmentation result, from the inverted index made in advance, search the network information of the candidate relevant to word segmentation result;
4. sort: for the network information of candidate, sort according to content relevance, the dimension such as ageing;
5. represent: by the webpage after sequence at search engine webpage representation out.
Two, rear end makes data procedures:
1. webpage capture: adopt crawler technology, by the linking relationship between webpage, captures the network information of internet and preserves.
2. compilation of index: analyze the network information capturing preservation, such as, carry out word segmentation processing to web page title and page text, makes inverted index, for front end user request process according to word segmentation result.
Step 102, searches second user with described first user with same or similar query intention; Wherein, described second user can have community information;
Each searching request that user sends may imply potential query intention behind, in the invention process, can find out first user query intention behind according to query word string, then for different search intentions, coupling meets the second user of the query intention of first user.
In specific implementation, community's friend relation can be had between described first user and described second user, then can associate social account in the embodiment of the present invention, such as immediate communication tool user, all types of websites (as forum, mhkc, portal website etc.) registered user etc., associate community's friend relation that social account can obtain first user, in the good friend user of first user, search coupling second user.
It should be noted that, community's friend relation can comprise one or more levels friend relation, such as, the user of one-level friend relation can be the good friend user of active user, secondary good friend user can be each self-corresponding good friend user of good friend user of active user etc., and the embodiment of the present invention is not limited this.
Certainly, can have non-community friend relation between described first user and described second user, namely the second user can be strange user for first user, then can search the second user of coupling in the embodiment of the present invention in global scope.
Wherein, described second user can have community information, community can be that some social groups or social organization are gathered in the collectively owned business that is mutually related in life formed in some fields, such as forum, microblogging, mhkc, portal website, instant communicating system etc., namely community information can comprise user's head portrait, user's name, user ID, address etc.
In one preferred embodiment of the invention, step 102 can comprise following sub-step:
Sub-step S11, obtains the first query intention information of described first user and the second query intention information of described second user respectively;
First query intention information can be the information of mark first user query intention, and the second query intention information can be the information of mark second user query intention.
In a kind of preferred exemplary of the embodiment of the present invention, described first query intention information can comprise first eigenvector, and described second query intention information can comprise second feature vector; Wherein, first eigenvector can be the vector information of mark first user query intention, and second feature vector can be the vector information of mark second user query intention; Described first eigenvector can be determined according to described first query word string, and described second feature vector can be determined according to described second query word string, wherein, and the query word string that described second query word string formerly can be submitted to for described second user.
In this example, by analyzing query word string, Search Results and search daily record, the feature of the query intention representing query word string can be searched, calculates eigenwert, thus query word string is expressed as proper vector.
The proper vector that the query intention of query word string is relevant can be divided into three major types, the first kind can be the proper vector of query word string itself, Equations of The Second Kind can be the proper vector with point word association of query word string, 3rd class can be the proper vector associated with the network information of query word String matching, and these proper vectors may be used to the query intention representing query word string.
Then in specific implementation, described first eigenvector can comprise following at least one: the first query word string, with the proper vector of point word association of the first query word string, the proper vector that associates with the network information of the first query word String matching;
Described second feature vector can comprise following at least one: the second query word string, with the proper vector of point word association of the second query word string, the proper vector that associates with the network information of the second query word String matching.
In a kind of preferred exemplary of the invention process, the proper vector of the described point word association with the first query word string can comprise following at least one: the importance degree of the part of speech of participle of the synonym string of the first query word string, the participle of the first query word string, the first query word string, the synonym of the participle of the first query word string, the participle of the first query word string;
The described proper vector associated with the network information of the first query word String matching can comprise following at least one:
With the title of the network information of the first query word String matching, with the banner of the network information of the first query word String matching, with the history click information of the network information of the first query word String matching, other query word strings of associating with the first query word string;
The proper vector of the described point word association with the second query word string can comprise following at least one:
The importance degree of the part of speech of participle of the synonym string of the second query word string, the participle of the second query word string, the second query word string, the synonym of the participle of the second query word string, the participle of the second query word string;
The described proper vector associated with the network information of the second query word String matching can comprise following at least one:
With the title of the network information of the second query word String matching, with the banner of the network information of the second query word String matching, with the history click information of the network information of the second query word String matching, other query word strings of associating with the second query word string.
The example of first/second feature vector can be as follows:
1, query word string itself;
Such as, the query word string " Haidian women and children " itself of user's submission.
2, the synonym string of query word string;
In this example, the synonym string of query word string can be found in the synonym dictionary made in advance.Such as, " Haidian healthcare hospital for women & children " and " Haidian women and children " is synonym, " new the semi-gods and the semi-devils " and " the good version of the semi-gods and the semi-devils clock Chinese " is synonym (this kind of synonym can along with actual change, be always synonym with the semi-gods and the semi-devils of up-to-date an edition).
3, the participle term of query word string;
In this example, participle can be carried out to query word, obtain the term after participle.Such as, have two [Haidian women and children, file] the term after query word string " Haidian women and children file " participle.
4, the part of speech of the participle term of query word string;
In this example, part of speech analysis can be carried out to participle term, obtain the part of speech of participle term.Such as, the part of speech that participle term [Haidian women and children, file] is corresponding is [noun, verb].
5, the synonym of the participle term of query word string;
In this example, the synonym of participle term can be searched in the synonym dictionary made in advance.Such as, the synonym of participle term [Haidian women and children, file] is [Haidian healthcare hospital for women & children files].
6, the importance degree of the participle term of query word string;
In this example, by statistics search daily record, TF (TermFrequency, word frequency) and the IDF (InverseDocumentFrequency, anti-document frequency) of each participle term can be obtained.TF-IDF is a kind of statistical method, in order to assess the significance level of a words for a copy of it file in a file set or a corpus.The importance of words to be directly proportional increase along with the number of times that it occurs hereof, the decline but the frequency that can occur in corpus along with it is inversely proportional to simultaneously.The importance degree of each participle term then can be represented in this example by TF-IDF.Such as, in participle term [Haidian women and children, official website], the TF-IDF value of " Haidian women and children " is higher than the TF-IDF value of " official website ", then " Haidian women and children " are higher than " official website " importance degree, comprise more quantity of information.
7, with the title of the network information of query word String matching;
In this example, the title of the network information can refer to corresponding with query word string, the title of front N (N is positive integer, such as 10) the bar Search Results that search engine returns, and may be used for the relevant text of locating query word string and keyword.Such as, search " Taobao ", first three title of the Search Results returned an is respectively " Taobao-wash in a pan! I likes ", " at will strolling-Taobao " and " Taobao ".
8, with the banner of the network information of query word String matching;
In this example, banner can be the information that can represent a well-determined webpage, such as Uniform Resource Identifier (UniformResourceIdentifier, URI), Uniform Resource Identifier specifically can comprise URL(uniform resource locator) (UniformResourceLocator again, or uniform resource name (UniformResourceName, URN) etc. URL).The URL of M (M is positive integer, such as 10) the bar network information before being specifically as follows Search Results, may be used for the relevant network address of locating query word string and website.Such as, search " Taobao ", first three URL of Search Results is respectively http://www.***.com/ ", " http://guang.***.com/ " and " http://shuo.***.com/ ".
9, with the history click information of the network information of query word String matching;
In this example, history click information can be the user of this query word string of search, the statistics of the click situation in Search Results.Which network information is weighed more important, more relevant to query word string by user behavior.Such as, user search " Taobao " 10000 times, the click of first three URL is for shown in table 1.
Table 1, history click information table
First three URL of Search Results Number of clicks Ratio
http://www.***.com/ 8000 80%
http://guang.***.com/ 1000 10%
http://shuo.***.com/ 1000 10%
Can be shown by table 1, the URL of the Article 1 network information is more relevant to query word string.
10, other query word strings associated with query word string;
In this example, can search for and submit to the user of this query word string also to search for which other query word string, may be used for some concepts representing that query word string is relevant.Such as, the user of search " 18 is large ", has also searched for " two Conferences ", " 18 spirit of party " etc.
Certainly, just exemplarily, when implementing the embodiment of the present invention, can arrange other first/second feature vectors according to actual conditions, the embodiment of the present invention is not limited this above-mentioned first/second feature vector.In addition, except above-mentioned first/second feature vector, those skilled in the art can also adopt other first/second feature vector according to actual needs, and the embodiment of the present invention is not also limited this.
Sub-step S12, calculates the similarity of described first query intention information and described second query intention information;
In specific implementation, according to the similarity of query intention, query word string can be carried out cluster.
In a kind of preferred exemplary of the embodiment of the present invention, sub-step S12 can comprise following sub-step further:
Sub-step S121, calculates the similarity between described first eigenvector and described second feature vector.
In this example, for the proper vector determined by query word string, clustering algorithm (such as hierarchical clustering algorithm/kmeans algorithm etc.) can be used to calculate similarity, then according to similarity, query word string is carried out category division.
Such as, first eigenvector corresponding to " Haidian women and children file flow process " the first query word string " Haidian women and children file " in table 2 and the second query word string and second feature vector, identical part has:
1, the participle term of the query word string participle term that has two importance degrees high is identical, is respectively " Haidian women and children " and " filing ";
2, with the click logs of the network information of query word String matching, the 1st article of history click information is identical with the 2nd article of history click information;
3, comprise " Haidian women and children file " in other query word strings associated with query word string in " Haidian women and children file flow process ".
Table 2, proper vector contrast table
In the cluster process using clustering algorithm, can quantize these same sections and calculate the similarity of first eigenvector and second feature vector.
Sub-step S13, when described similarity is greater than default similarity threshold, judges that described first user and described second user have same or analogous query intention.
In specific implementation, when similarity exceedes default similarity threshold, then the first query word string and the second query word string can gather is a class, and namely first user and the second user have same or analogous query intention.
First eigenvector is more similar with second feature vector, and the first query word string and the second query word string are more likely that to be gathered in cluster process be a class, and first user is more similar with the query intention of the second user, even identical.
Such as, it is a class that the first query word string " Haidian women and children file " and the second query word string " Haidian women and children file flow process " can gather, and it is a class that the first query word string " loan application " and the second query word string " apply for loan flow process " can gather.
In specific implementation, can after user inquire about, preserve the corresponding relation of user and query intention, query word string/proper vector and query intention thereof, follow-uply search second user with first user with same or similar query intention to facilitate.
Such as, this corresponding relation can be preserved according to form as shown in table 3.
Table 3, user-query intention, query word string/proper vector-query intention corresponding lists
When searching second user with first user with same or similar query intention, according to user-query intention, the query word string/proper vector-query intention corresponding lists of preserving, with the first eigenvector of first user, calculate same or analogous second user with first user query intention.
Concrete calculation procedure is as follows:
1, the first eigenvector A of first user is determined;
2, the proper vector A1 in A and user-query intention, query word string/proper vector-query intention corresponding lists is adopted, A2 ... An (n is positive integer) calculates similarity, the query intention i that the feature phase vector Ai (i is positive integer) finding similarity the highest is corresponding;
3, according to the query intention i that the 2nd step obtains, in user-query intention, query word string/proper vector-query intention corresponding lists, second user of query intention i is found.
Such as, in user-query intention shown in table 3, query word string/proper vector-query intention corresponding lists, for search " file Haidian women and children " first user, the query word string that the second feature vector finding similarity the highest is corresponding is " Haidian women and children file ", corresponding query intention is query intention 1, and the second user of query intention 1 correspondence has user 1, user 2 and user 3.
Step 103, judges whether described first query word string meets the privacy conditions preset;
In specific implementation, user may browse the network multimedia information relating to privacy, or be unsuitable for the network information of carrying out wide-scale distribution in a network, such as the network information of medical class of curing the disease, now needs to maintain secrecy to user identity.
In the embodiment of the present invention, judge whether the first query word string meets the privacy conditions preset, and mainly contains two dimensions.
One of them dimension can be analyze the query intention of user, then in one preferred embodiment of the invention, step 103 can comprise following sub-step:
Sub-step S21, searches the synonym string of described first query word string;
Sub-step S22, carries out word segmentation processing to the synonym string of described first query word string and described first query word string, obtains one or more inquiry participle;
Sub-step S23, according to described inquiry participle and the co-occurrence number of times of secret word preset and/or spacing distance, the secret weight corresponding to described one or more inquiry participle configuration;
Sub-step S24, according to the described inquiry participle after configure weights, obtains the secret weight of the secret weight of described first query word string and the synonym string of described first query word string respectively;
Sub-step S25, is set to the secret weight of target by the mean value of the secret weight of the synonym string of the secret weight of described first query word string and described first query word string;
Sub-step S26, when described target right to keep confidential is great when default weight threshold, judges that described first query word string meets default privacy conditions.
In embodiments of the present invention, participle can be carried out by the first query word string of inputting user and synonym string thereof, each inquiry participle after cutting be calculated and the correlation degree of the topic such as privacy/sensitivity.When the content of inquiring about participle relates to privacy/sensitive subjects, obtain the secret weight of inquiry participle computed in advance, then to each inquiry participle summation, calculate the secret weight of whole first query word string and synonym string thereof, and adopt the average of secret weight as the secret weight of final target.When the secret weight of target exceedes certain weight threshold, then can be judged to meet the privacy conditions preset.
In specific implementation, secret dictionary can be made in advance, record in this secret dictionary inquiry participle and with the co-occurrence number of times of secret word preset and/or spacing distance.
Below the manufacturing process of secret dictionary is illustrated.
Such as, the first query word string that user submits to is " Haidian women and children file ", in this example, using " privacy " as the secret word preset.
First, word segmentation processing can be carried out to the title of the network information and/or text, count the number of times (i.e. co-occurrence number of times) that " privacy " and " Haidian women and children " appears at same title and/or text jointly, and the distance at interval between " privacy " and " Haidian women and children ", the distance at this interval can identify with number of words.
In specific implementation, can after user inquires about, the inquiry participle counted, co-occurrence number of times, spacing distance (can identify spacing distance with average interval number of words) are made secret dictionary, as shown in table 4, follow-uply judge that whether the first query word string meets the privacy conditions preset to facilitate, namely in this secret dictionary, find the first at least part of query word string, then can think that the first query word string meets default privacy conditions.
Table 4, secret dictionary
Inquiry participle With the co-occurrence number of times of secret word With the spacing distance of secret word
Haidian women and children 500 10
The semi-gods and the semi-devils 1 100
It should be noted that, co-occurrence number of times is larger, more likely meets privacy conditions, and its secret weight can be larger.Spacing distance is less, more likely meets privacy conditions, and its secret weight can be larger.
Such as, the co-occurrence number of times of " Haidian women and children " and " privacy ", be far longer than " the semi-gods and the semi-devils " and " privacy " co-occurrence number of times, the spacing distance of " Haidian women and children " and " privacy ", be far smaller than the spacing distance of " the semi-gods and the semi-devils " and " privacy ", then can think that " Haidian women and children " more meet privacy conditions than " the semi-gods and the semi-devils ".
In other embodiments, directly can also set up weighted value in secret dictionary, dynamic-configuration respectively inquires about secret weight corresponding to participle, searches the secret weight of corresponding inquiry participle according to secret weight.
Wherein another dimension can be analyze the history exchange way of user, then in one preferred embodiment of the invention, step 103 can comprise following sub-step:
Sub-step S31, searches the ratio for other user anonymity communications of described same or similar query intention in the whole network;
Sub-step S32, when described ratio is greater than default proportion threshold value, judges that described first query word string meets default privacy conditions.
In specific implementation, can analyze in same or similar query intention, the application embodiment of the present invention selects the ratio of the anonymous user exchanged, and ratio is more high more likely meets the privacy conditions preset.
Below two dimensions of the whether satisfied privacy conditions preset of above-mentioned judgement first query word string are illustrated.
Such as, the first query word string that user submits to is " Haidian women and children file ":
1, the synonym string of the first query word string is searched.Such as, the synonym string of " Haidian women and children file " is " Haidian women and children file attack strategy ";
2, participle is carried out to the first query word string and synonym string thereof.Such as, the word segmentation result of " Haidian women and children file " is " Haidian women and children ", " filing "; The word segmentation result of " Haidian women and children file attack strategy " is " Haidian women and children ", " filing ", " attack strategy ".
3, according to the secret weight dictionary that makes in advance, calculate the secret weight of each participle in word segmentation result, then summation calculating mean value are as the secret weight of current queries.
4, find other users searching for same queries word string and synonym string in history, select the ratio of Anonymous communication.Such as, " Haidian women and children file " exchanges ratio with the anonymity of " Haidian women and children file attack strategy " 70%.
5, judge according to word segmentation result, synonym string and historic user behavior.Such as, for " Haidian women and children file ", " Haidian women and children " are in secret dictionary, and secret weight is very high, historic user Anonymous communication ratio 70%, and the two combination can judge that " Haidian women and children file " is for meeting the privacy conditions preset.
In like manner, for the first query word strings such as " the semi-gods and the semi-devils ", default privacy conditions can not met by analysis and distinguishing.
Step 104, when described first query word string meets default privacy conditions, carries out confidential treatment to the community information of described second user;
When meeting the privacy conditions preset, can show that the search behavior privacy of user is comparatively strong, generally not wanting to show oneself identity, then need to carry out confidential treatment.
In one preferred embodiment of the invention, step 104 can comprise following sub-step:
Sub-step S41, carries out anonymity process to the community information of described second user;
Sub-step S42, carries out the communications portal object of communication with the community information of the second user after anonymity process structure and described second user.
Anonymity, relative to the behavior of tool true identity, is a kind of behavior of not signing or taking an alias.Such as, community information is adopted unified acquiescence head portrait and default name etc.
In the embodiment of the present invention, construct a communications portal object, first user can be made can to carry out communication by this communications portal object and the second user.
Step 105, synthesizes the first Search Results by the described network information and the community information of the second user carried out after confidential treatment.
In the embodiment of the present invention, can using the community information of the network information and the second user as final Search Results.
In one preferred embodiment of the invention, step 105 can comprise following sub-step:
Sub-step S51, calculates described first user and closely spends with described associating of second user;
In the embodiment of the present invention, affect the factor that first user and the second user-association spend closely and can comprise three parts, Part I is the similarity of query intention, and Part II is the familiarity of first user and the second user, and Part III is the familiarity of the second user to query intention.
In a kind of preferred exemplary of the embodiment of the present invention, sub-step S51 can comprise following sub-step further:
Sub-step S511, to the similarity of described first query intention information and described second query intention information, and/or, the related information between described first user and described second user, and/or, the weight that described second user is corresponding to the historical operation information record configuration of described second query intention;
Sub-step S512, to the similarity of the described first query intention information after configure weights and described second query intention information, and/or, related information between described first user and described second user, and/or, the historical operation information of described second user to described second query intention carries out read group total, obtains described first user and closely spends with described associating of second user.
In this example, can pass through historical data and search log analysis, the similarity of described second query intention information, and/or, related information between described first user and described second user, and/or, described second user to the numerical value of each factor in the historical operation information of described second query intention, then according to the actual requirements with experience configure weights, such as importance degree is higher, its weight then can be larger, finally by various factors weighted calculation, obtains final association and closely spend.
In actual applications, the similarity of the first query intention information and the second query intention information can calculate in a step 102.Query word string is more similar, and query intention is then more similar.
Such as, first user search " Haidian women and children file ", second user A searched for " Haidian women and children file flow process ", second user B searched for " Haidian women and children ", so the second user A than the query intention of the second user B closer to first user, then first user and the second user A associate closely spend than the second user B associate closely spend larger.
In a kind of preferred exemplary of the embodiment of the present invention, the related information between described first user and described second user can comprise following at least one:
Quantity, the dwelling places of the average contact number of times in preset time period, the average contact duration in preset time period, common good friend;
In this example, related information can identify the familiarity of first user and the second user, the second user more often contacted, and its familiarity is higher, then degree is then higher closely in association.
The historical operation information of described second user to described second query intention can comprise following at least one:
The network information that searching times corresponding to described second query intention, described second query intention are corresponding browse search continuous days corresponding to duration, described second query intention.
In this example, historical operation information can identify the level of understanding of the second user to this query intention, second user more, more familiar to this query intention spended time, and it is understood higher, then degree is then higher closely in association.
For the searching times that the second query intention is corresponding, can find in user-query intention as shown in table 3, query word string/proper vector-query intention corresponding lists, such as, sequence for the searching times of query intention 1 correspondence can be user 2> user 3> user 1.
For the history number of clicks of the network information of mating with described second query intention, the number of clicks of the second user to the second query word string can be obtained from search daily record, number of clicks is more, then can illustrate that webpage quantity, the content browsed are more, higher to the familiarity of the second query intention.
Browse duration for the network information corresponding to the second query intention, from search daily record, statistics can obtain the time quantum that the second user browses the second query word string related web page, the browsing time is longer, then higher to the familiarity of the second query intention.
For the search continuous days that the second query intention is corresponding, from search daily record, statistics the continuous days that the second user inquires about same query intention can be obtained.Number of days is more, the duration is longer, then can illustrate that the second user is more familiar to the second query intention.Such as, the second user A continues a search in month " Japan's tourism ", and the second user B continues search in three days " Japan's tourism ", then can think that the second user A is more familiar to " Japan's tourism " this query intention than the second user B.
Such as first user search " Haidian women and children file flow process ", second user with same or similar query intention has three, is respectively the second user A, the second user B, the second user C, and the factor of impact association degree is closely as shown in table 5.
Table 5, association spend contrast table closely
Wherein, the second user A compares with the second user C, frequent the same with first user contact, but this query intention more familiar.Second user C compares with the second user B, contacts frequently with first user, more familiar to this query intention.
Sub-step S52, the community information of the second user after closely spending confidential treatment according to described association sorts;
In this example, can sort from high to low according to the close degree of association, i.e. order sequence; When then, also can sort from low to high according to the close degree of association in this example, i.e. Bit-reversed, the embodiment of the present invention is not limited this.
Such as, the association shown in table 5 is spent closely: 155>135>117.2, and the clooating sequence that can obtain the second user is: the second user A> second user C> second user B.
Sub-step S53, synthesizes the first Search Results by the community information of the second user after the described network information and sequence.
In one preferred embodiment of the invention, step 105 can comprise following sub-step:
Sub-step S61, calculates described first user and closely spends with described associating of second user;
In a kind of preferred exemplary of the embodiment of the present invention, sub-step S61 can comprise following sub-step further:
Sub-step S611, to the similarity of described first query intention information and described second query intention information, and/or, the related information between described first user and described second user, and/or, the weight that described second user is corresponding to the historical operation information record configuration of described second query intention;
Sub-step S612, to the similarity of the described first query intention information after configure weights and described second query intention information, and/or, related information between described first user and described second user, and/or, the historical operation information of described second user to described second query intention carries out read group total, obtains described first user and closely spends with described associating of second user.
In a kind of preferred exemplary of the embodiment of the present invention, the related information between described first user and described second user can comprise following at least one:
Quantity, the dwelling places of the average contact number of times in preset time period, the average contact duration in preset time period, common good friend;
The historical operation information of described second user to described second query intention can comprise following at least one:
The network information that searching times corresponding to described second query intention, described second query intention are corresponding browse search continuous days corresponding to duration, described second query intention.
It should be noted that, due to the application basic simlarity of sub-step S61 and sub-step S51, so description is fairly simple, relevant part illustrates see the part of sub-step S51, and the embodiment of the present invention is not described in detail at this.
Sub-step S62, spends closely by described association in the community information being configured in the second user carried out after confidential treatment;
In embodiments of the present invention, under score value corresponding to (the i.e. association degree closely) degree of knowing well of the second user and/or the level of understanding is attached to community information, carry out reference with regard to this query selection when the second user exchanged for first user.
Sub-step S63, synthesizes the first Search Results by the described network information with the community information being configured with the second user associating degree closely.
The the first query word string submitted to first user in the embodiment of the present invention is searched for, obtain the network information of coupling, and search second user with first user with same or similar query intention, confidential treatment is carried out when judging the satisfied privacy conditions preset, and the community information of the network information and the second user is synthesized Search Results, when making to relate to the privacy requirements such as privacy at first user, screen in the community good friend of user by analyzing search daily record, the second user of same requirements is had with user, make first user can carry out interaction with the second user screened with regard to identical demand based on community information, and the community information of the second user is maintained secrecy, then first user directly can obtain the information that the second user formerly arranged, the information that the information of second user's manual sorting returns than machinery is more effective, and improve the security of community information, avoid first user to repeat to carry out loaded down with trivial details artificial filter to the network information of magnanimity, decrease expending of first user time and efforts, decrease the system resources consumption of subscriber equipment and website, decrease taking of the network bandwidth, substantially increase the efficiency of acquisition of information, quality and capacity.
With reference to Fig. 2, show the flow chart of steps of the embodiment of the method 2 of a kind of search of the present invention, specifically can comprise the steps:
Step 201, when receiving the first query word string that first user is submitted to by mobile client, searches for described first query word string, obtains the radio network information of coupling;
Step 202, searches second user with described first user with same or similar query intention; Wherein, described second user has community information;
Step 203, judges whether described first query word string meets the privacy conditions preset;
Step 204, when described first query word string meets default privacy conditions, carries out confidential treatment to the community information of described second user;
Step 205, synthesizes the first Search Results by the described network information and the community information of the second user carried out after confidential treatment, and is back to mobile client;
Step 206, when described first query word string does not meet the privacy conditions preset, synthesizes the second Search Results by the community information of the described network information and described second user;
In embodiments of the present invention, first query word string does not meet the privacy conditions preset, then can show that the search behavior privacy of user is poor, can directly show the community information of the second user, such as, directly show the community information such as personalized head portrait, the pet name of user.
In one preferred embodiment of the invention, step 206 can comprise as described in the calculating of step 105 in embodiment 1 first user with as described in the second user associate the process closely spent, closely spend by described association and the community information of the second user is sorted, the community information of the second user after the described network information and sequence is synthesized the second Search Results.
Step 207, when the community information of described second user is triggered by described first user, sets up described first user and is connected with the communication of described second user.
The application embodiment of the present invention, first user can at the first client logs, and the second user can at the second client logs, and the instant communication software of the correspondence directly called in mobile client is linked up with the second corresponding user.
In specific implementation, the first client and the second client can be dependent mobile applications, such as mobile browser, and also can be independently application program, such as immediate communication tool, the embodiment of the present invention be restricted this.
In embodiments of the present invention, the first Search Results or the second Search Results can be returned to the first client, first client shows this first Search Results or the second Search Results, wherein, the community information of the second user can formerly be configured to communications portal object, first user clicks the modes such as community information by mouse can trigger this communications portal object, initiates the request carrying out communication with the second user.When receiving this request, the communication mechanism of self can be applied, the such as communication mechanism of immediate communication tool itself, also can by other communication mechanism of open interface interchange, the such as communication mechanism of the community website such as microblogging, forum, set up first user and the second user, the communication namely between the first client with the second client is connected.
In one preferred embodiment of the invention, first user can have community information, then step 207 can comprise following sub-step:
Sub-step S81, sends the communication request of carrying out communication with first user to the second user;
Sub-step S82, when receiving for the anonymous instruction of the community information of described first user or open instruction, carries out anonymity process or open process to the community information of described first user;
Sub-step S83, when receiving for the anonymous instruction of the community information of described second user or open instruction, carries out anonymity process or open process to the community information of described second user;
Sub-step S84, adopts the community information of described second user after the community information of the described first user carried out after anonymous process or open process and, anonymous process or open process, sets up described first user and be connected with the communication of described second user.
The application embodiment of the present invention, can send the communication request of carrying out communication with first user to the second client, and the second user can select to carry out communication or refusal communication.
After selection communication, second user can send the anonymous instruction of the community information of the second user or open instruction by the second user end to server, instruction server carries out anonymity process or open process to the community information of the second user, in addition, first user also can send the anonymous instruction of the community information of first user or open instruction by the first user end to server, and instruction server carries out anonymity process or open process to the community information of first user.
On the page of Search Results, after showing the community information of the second relevant user, the communication process of first user and the second user, according to the difference of secret situation, first user and second is with carrying out anonymity per family or disclosed mode carries out communication.
Below the communication of first user and the second user is illustrated.
1, first user clicks the head portrait of the second user chosen, and request exchanges with the second user, then system sends communication request to the second user;
2, whether system is ready open community information to first user prompting;
3, first user is selected open or anonymous;
4, after the second user receives request:
If what 4.1 first users were selected is open community information, then the second user is when receiving communication request, can see the community information of first user;
If what 4.2 first users were selected is anonymous way, then the second user is when receiving communication request, the community information of first user can not be seen, can only see and such as give tacit consent to the anonymous information such as head portrait and default name, second user only knows has a user (can be good friend, also can be stranger) to ask to exchange with oneself.
In a kind of preferred exemplary of the embodiment of the present invention, can enclose the first query word string in the communication request sent to the second user, the first user making the second user understand oneself is want according to which particular problem to exchange.
Further, when first user selects anonymous way, can also according to the monthly average contact number of times between good friend, the monthly average contact duration between good friend, common good friend's number, whether judge the familiarity of first user and the second user in factors such as same cities, and score value corresponding for degree of knowing well is attached in communication request and carries out reference for the second user, second user can select carefully to exchange with first user, deal with several or do not exchange, and the embodiment of the present invention is not limited this.
5, the second user selects open or anonymous identity to exchange with first user.
Especially, when first user selects anonymous way to send communication request, the second user can exchange with first user with open or anonymous form with query selection according to the degree of knowing well in communication request.In communication process, first user and the second user all select anonymity if any one side or both sides, then can be transformed to open state at any time, thus protect the security with regard to privacy concern demand between user.
6, according to the selection situation to community information of user's first user, the second user, four kinds of communication modes may be had:
6.1, the open community information of first user---the open community information of the second user;
6.2, open community information---the second user anonymity of first user;
6.3, first user is anonymous---the open community information of the second user;
6.4, first user is anonymous---the second user anonymity.
The embodiment of the present invention is when the community information of the second user is triggered by first user, set up first user to be connected with the communication of the second user, first user and the second user is made to carry out communication under community information anonymity or disclosed situation, substantially increase the dirigibility of communication modes, also substantially increase the security of communication.
With reference to Fig. 3, show the structured flowchart of the device embodiment 1 of a kind of search of the present invention, specifically can comprise as lower module:
Network information search module 301, for when receiving the first query word string that first user is submitted to, searches for described first query word string, obtains the network information of coupling;
User searches module 302, for searching second user with described first user with same or similar query intention; Wherein, described second user can have community information;
Privacy conditions judge module 303, for judging whether described first query word string meets the privacy conditions preset;
Confidential treatment module 304, for when described first query word string meets default privacy conditions, carries out confidential treatment to the community information of described second user;
First Search Results synthesis module 305, for synthesizing the first Search Results by the described network information and the community information of the second user carried out after confidential treatment.
In one preferred embodiment of the invention, described user searches module 302 and can comprise following submodule:
Query intention acquisition of information submodule, for the second query intention information of the first query intention information and described second user that obtain described first user respectively;
Query intention information Similarity Measure submodule, for calculating the similarity of described first query intention information and described second query intention information;
First judges submodule, for when described similarity is greater than default similarity threshold, judges that described first user and described second user have same or analogous query intention.
In one preferred embodiment of the invention, described first query intention information can comprise first eigenvector, and described first eigenvector can be determined according to described first query word string;
Described second query intention information can comprise second feature vector, and described second feature vector can be determined according to described second query word string;
Wherein, described second query word string is the query word string that described second user formerly submits to.
In one preferred embodiment of the invention, described query intention information Similarity Measure submodule can comprise following submodule:
Proper vector Similarity Measure submodule, for calculating the similarity between described first eigenvector and described second feature vector.
In a kind of preferred exemplary of the embodiment of the present invention, described first eigenvector can comprise following at least one:
First query word string, with the proper vector of point word association of the first query word string, the proper vector that associates with the network information of the first query word String matching;
Described second feature vector can comprise following at least one:
Second query word string, with the proper vector of point word association of the second query word string, the proper vector that associates with the network information of the second query word String matching.
In a kind of preferred exemplary of the embodiment of the present invention, the proper vector of the described point word association with the first query word string can comprise following at least one:
The importance degree of the part of speech of participle of the synonym string of the first query word string, the participle of the first query word string, the first query word string, the synonym of the participle of the first query word string, the participle of the first query word string;
The described proper vector associated with the network information of the first query word String matching can comprise following at least one:
With the title of the network information of the first query word String matching, with the banner of the network information of the first query word String matching, with the history click information of the network information of the first query word String matching, other query word strings of associating with the first query word string;
The proper vector of the described point word association with the second query word string can comprise following at least one:
The importance degree of the part of speech of participle of the synonym string of the second query word string, the participle of the second query word string, the second query word string, the synonym of the participle of the second query word string, the participle of the second query word string;
The described proper vector associated with the network information of the second query word String matching comprises following at least one:
With the title of the network information of the second query word String matching, with the banner of the network information of the second query word String matching, with the history click information of the network information of the second query word String matching, other query word strings of associating with the second query word string.
In a kind of preferred embodiment of the present invention, described privacy conditions judge module 303 comprises following submodule:
Synonym string searches submodule, for searching the synonym string of described first query word string;
Word segmentation processing submodule, for carrying out word segmentation processing to the synonym string of described first query word string and described first query word string, obtains one or more inquiry participle;
Right to keep confidential reshuffles submodule, for according to described inquiry participle and the co-occurrence number of times of secret word preset and/or spacing distance, and the secret weight corresponding to described one or more inquiry participle configuration;
Read group total implementation sub-module, for according to the described inquiry participle after configure weights, obtains the secret weight of the secret weight of described first query word string and the synonym string of described first query word string respectively;
Target right to keep confidential is reseted and is put submodule, and the mean value for the secret weight of the synonym string by the secret weight of described first query word string and described first query word string is set to the secret weight of target;
Second judges submodule, for great when default weight threshold at described target right to keep confidential, judges that described first query word string meets default privacy conditions.
In one preferred embodiment of the invention, described privacy conditions judge module 303 can comprise following submodule:
Ratio searches submodule, for searching the ratio for other user anonymity communications of described same or similar query intention in the whole network;
3rd judges submodule, for when described ratio is greater than default proportion threshold value, judges that described first query word string meets default privacy conditions.
In one preferred embodiment of the invention, institute's confidential treatment module 304 can comprise following submodule:
Anonymous process submodule, for carrying out anonymity process to the community information of described second user;
Communications portal object formation submodule, for carrying out the communications portal object of communication with the community information of the second user after anonymity process structure and described second user.
In one preferred embodiment of the invention, described first Search Results synthesis module 305 can comprise following submodule:
Association spends calculating sub module closely, closely spends with described associating of second user for calculating described first user;
Community information sorting sub-module, the community information for the second user after closely spending confidential treatment according to described association sorts;
First synthon module, for synthesizing the first Search Results by the community information of the second user after the described network information and sequence.
In one preferred embodiment of the invention, described first Search Results synthesis module 305 can comprise following submodule:
Association spends calculating sub module closely, calculates described first user and closely spends with described associating of second user;
Association is degree configuration submodule closely, for described association is spent closely be configured in the second user carried out after confidential treatment community information in;
Second synthon module, for synthesizing the first Search Results by the described network information with the community information being configured with the second user associating degree closely.
In one preferred embodiment of the invention, described association is closely spent calculating sub module and can be comprised following submodule:
Weight configuration submodule, for the similarity to described first query intention information and described second query intention information, and/or, related information between described first user and described second user, and/or, the weight that described second user is corresponding to the historical operation information record configuration of described second query intention;
Second read group total submodule, for the similarity to the described first query intention information after configure weights and described second query intention information, and/or, related information between described first user and described second user, and/or, the historical operation information of described second user to described second query intention carries out read group total, obtains described first user and closely spends with described associating of second user.
In the one of the embodiment of the present invention is preferred, the related information between described first user and described second user can comprise following at least one:
Quantity, the dwelling places of the average contact number of times in preset time period, the average contact duration in preset time period, common good friend;
The historical operation information of described second user to described second query intention can comprise following at least one:
The network information that searching times corresponding to described second query intention, described second query intention are corresponding browse search continuous days corresponding to duration, described second query intention.
In one preferred embodiment of the invention, between described first user and described second user, can friend relation be had, or, non-friend relation can be had between described first user and described second user.
With reference to Fig. 4, show the structured flowchart of the device embodiment 2 of a kind of search of the present invention, specifically can comprise as lower module:
Network information search module 401, for when receiving the first query word string that first user is submitted to, searches for described first query word string, obtains the network information of coupling;
User searches module 402, for searching second user with described first user with same or similar query intention; Wherein, described second user can have community information;
Privacy conditions judge module 403, for judging whether the first query word string meets the privacy conditions preset;
Confidential treatment module 404, for when described first query word string meets default privacy conditions, carries out confidential treatment to the community information of described second user;
First Search Results synthesis module 405, for synthesizing the first Search Results by the described network information and the community information of the second user carried out after confidential treatment;
Second Search Results synthesis module 406, for when described first query word string does not meet the privacy conditions preset, synthesizes the second Search Results by the community information of the described network information and described second user.
Communication link block 407, for when the community information of described second user is triggered by described first user, sets up described first user and is connected with the communication of described second user.
In one preferred embodiment of the invention, described first user can have community information, and described communication link block 407 can comprise following submodule:
Communication request sends submodule, for sending the communication request of carrying out communication with first user to the second user;
First process submodule, for when receiving for the anonymous instruction of the community information of described first user or open instruction, carrying out anonymity process to the community information of described first user or openly to process;
Second process submodule, for when receiving for the anonymous instruction of the community information of described second user or open instruction, carrying out anonymity process to the community information of described second user or openly to process;
Set up submodule, for adopt the community information of the described first user carried out after anonymous process or open process with, the community information of described second user after anonymous process or openly process, sets up described first user and is connected with the communication of described second user.
In a preferred embodiment of the present invention, described second Search Results synthesis module 406 comprises following submodule:
Association spends calculating sub module closely, calculates described first user and closely spends with described associating of second user;
Community information sorting sub-module, sorts to the community information of described second user for closely spending according to described association;
3rd synthon module, for synthesizing the second Search Results by the community information of the second user after the described network information and sequence.
In one preferred embodiment of the invention, described association is closely spent calculating sub module and is comprised following submodule:
Weight configuration submodule, for the similarity to described first query intention information and described second query intention information, and/or, related information between described first user and described second user, and/or, the weight that described second user is corresponding to the historical operation information record configuration of described second query intention;
Second read group total submodule, for the similarity to the described first query intention information after configure weights and described second query intention information, and/or, related information between described first user and described second user, and/or, the historical operation information of described second user to described second query intention carries out read group total, obtains described first user and closely spends with described associating of second user.
In a kind of preferred exemplary of the embodiment of the present invention, the related information between described first user and described second user can comprise following at least one:
Quantity, the dwelling places of the average contact number of times in preset time period, the average contact duration in preset time period, common good friend;
The historical operation information of described second user to described second query intention can comprise following at least one:
The network information that searching times corresponding to described second query intention, described second query intention are corresponding browse search continuous days corresponding to duration, described second query intention.
With reference to Fig. 5, show the structured flowchart of the system embodiment 1 of a kind of search of the present invention, described system can comprise server 510 and the first client 520, and first user can log in described first client 520;
Described server 510 can comprise as lower module:
Network information search module 511, for when receiving the first query word string that first user is submitted to, searches for described first query word string, obtains the network information of coupling;
User searches module 512, for searching second user with described first user with same or similar query intention; Wherein, described second user can have community information;
Privacy conditions judge module 513, for judging whether the first query word string meets the privacy conditions preset;
Confidential treatment module 514, for when described first query word string meets default privacy conditions, carries out confidential treatment to the community information of described second user;
First Search Results synthesis module 515, for synthesizing the first Search Results by the described network information and the community information of the second user carried out after confidential treatment;
First Search Results returns module 516, for described first Search Results is returned first user;
Described first client 520 can comprise as lower module:
First query word string submits module 521 to, for submitting the first query word string to described server;
First Search Results receiver module 522, for receiving the first Search Results that described server 510 returns;
First Search Results display module 523, for showing described first Search Results.
In one preferred embodiment of the invention, described user searches module 512 and can comprise following submodule:
Query intention acquisition of information submodule, for the second query intention information of the first query intention information and described second user that obtain described first user respectively;
Query intention information Similarity Measure submodule, for calculating the similarity of described first query intention information and described second query intention information;
First judges submodule, for when described similarity is greater than default similarity threshold, judges that described first user and described second user have same or analogous query intention.
In one preferred embodiment of the invention, described first query intention information can comprise first eigenvector, and described first eigenvector can be determined according to described first query word string;
Described second query intention information can comprise second feature vector, and described second feature vector can be determined according to described second query word string;
Wherein, described second query word string is the query word string that described second user formerly submits to.
In one preferred embodiment of the invention, described query intention information Similarity Measure submodule can comprise following submodule:
Proper vector Similarity Measure submodule, for calculating the similarity between described first eigenvector and described second feature vector.
In a kind of preferred exemplary of the embodiment of the present invention, described first eigenvector comprises following at least one:
First query word string, with the proper vector of point word association of the first query word string, the proper vector that associates with the network information of the first query word String matching;
Described second feature vector can comprise following at least one:
Second query word string, with the proper vector of point word association of the second query word string, the proper vector that associates with the network information of the second query word String matching.
In a kind of preferred exemplary of the embodiment of the present invention, the proper vector of the described point word association with the first query word string can comprise following at least one:
The importance degree of the part of speech of participle of the synonym string of the first query word string, the participle of the first query word string, the first query word string, the synonym of the participle of the first query word string, the participle of the first query word string;
The described proper vector associated with the network information of the first query word String matching can comprise following at least one:
With the title of the network information of the first query word String matching, with the banner of the network information of the first query word String matching, with the history click information of the network information of the first query word String matching, other query word strings of associating with the first query word string;
The proper vector of the described point word association with the second query word string can comprise following at least one:
The importance degree of the part of speech of participle of the synonym string of the second query word string, the participle of the second query word string, the second query word string, the synonym of the participle of the second query word string, the participle of the second query word string;
The described proper vector associated with the network information of the second query word String matching comprises following at least one:
With the title of the network information of the second query word String matching, with the banner of the network information of the second query word String matching, with the history click information of the network information of the second query word String matching, other query word strings of associating with the second query word string.
In one preferred embodiment of the invention, described privacy conditions judge module 513 comprises following submodule:
Synonym string searches submodule, for searching the synonym string of described first query word string;
Word segmentation processing submodule, for carrying out word segmentation processing to the synonym string of described first query word string and described first query word string, obtains one or more inquiry participle;
Right to keep confidential reshuffles submodule, for according to described inquiry participle and the co-occurrence number of times of secret word preset and/or spacing distance, and the secret weight corresponding to described one or more inquiry participle configuration;
Read group total implementation sub-module, for according to the described inquiry participle after configure weights, obtains the secret weight of the secret weight of described first query word string and the synonym string of described first query word string respectively;
Target right to keep confidential is reseted and is put submodule, and the mean value for the secret weight of the synonym string by the secret weight of described first query word string and described first query word string is set to the secret weight of target;
Second judges submodule, for great when default weight threshold at described target right to keep confidential, judges that described first query word string meets default privacy conditions.
In one preferred embodiment of the invention, described privacy conditions judge module 513 comprises following submodule:
Ratio searches submodule, for searching the ratio for other user anonymity communications of described same or similar query intention in the whole network;
3rd judges submodule, for when described ratio is greater than default proportion threshold value, judges that described first query word string meets default privacy conditions.
In one preferred embodiment of the invention, institute's confidential treatment module 514 can comprise following submodule:
Anonymous process submodule, for carrying out anonymity process to the community information of described second user;
Communications portal object formation submodule, for carrying out the communications portal object of communication with the community information of the second user after anonymity process structure and described second user.
In one preferred embodiment of the invention, the first Search Results synthesis module 515 comprises following submodule:
Association spends calculating sub module closely, closely spends with associating of the second user for calculating first user;
Community information sorting sub-module, the community information for the second user after closely spending confidential treatment according to described association sorts;
First synthon module, for synthesizing the first Search Results by the community information of the second user after the described network information and sequence.
In one preferred embodiment of the invention, the first Search Results synthesis module 305 comprises following submodule:
Association spends calculating sub module closely, calculates described first user and closely spends with described associating of second user;
Association is degree configuration submodule closely, for described association is spent closely be configured in the second user carried out after confidential treatment community information in;
Second synthon module, for synthesizing the first Search Results by the described network information with the community information being configured with the second user associating degree closely.
In one preferred embodiment of the invention, described association is closely spent calculating sub module and is comprised following submodule:
Weight configuration submodule, for the similarity to described first query intention information and described second query intention information, and/or, related information between described first user and described second user, and/or, the weight that described second user is corresponding to the historical operation information record configuration of described second query intention;
Second read group total submodule, for the similarity to the described first query intention information after configure weights and described second query intention information, and/or, related information between described first user and described second user, and/or, the historical operation information of described second user to described second query intention carries out read group total, obtains described first user and closely spends with described associating of second user.
In the one of the embodiment of the present invention is preferred, the related information between described first user and described second user can comprise following at least one:
Quantity, the dwelling places of the average contact number of times in preset time period, the average contact duration in preset time period, common good friend;
The historical operation information of described second user to described second query intention can comprise following at least one:
The network information that searching times corresponding to described second query intention, described second query intention are corresponding browse search continuous days corresponding to duration, described second query intention.
In one preferred embodiment of the invention, between described first user and described second user, can friend relation be had, or, non-friend relation can be had between described first user and described second user.
With reference to Fig. 6, show the structured flowchart of the system embodiment 2 of a kind of search of the present invention, described system can comprise server 610, first client 620 and the second client 630;
Described server 610 can comprise as lower module:
Network information search module 611, for when receiving the first query word string that first user is submitted to, searches for described first query word string, obtains the network information of coupling;
User searches module 612, for searching second user with described first user with same or similar query intention; Wherein, described second user can have community information;
Privacy conditions judge module 613, for judging whether the first query word string meets the privacy conditions preset;
Confidential treatment module 614, for when described first query word string meets default privacy conditions, carries out confidential treatment to the community information of described second user;
First Search Results synthesis module 615, for synthesizing the first Search Results by the described network information and the community information of the second user carried out after confidential treatment;
First Search Results returns module 616, for described first Search Results is returned first user;
Second Search Results synthesis module 617, for when described first query word string does not meet the privacy conditions preset, synthesizes the second Search Results by the community information of the described network information and described second user;
Second Search Results returns module 618, for described second Search Results is returned first user;
Communication link block 619, for when the community information of described second user is triggered by described first user, sets up described first user and is connected with the communication of described second user;
Described first client 620 can comprise as lower module:
First query word string submits module 621 to, for submitting the first query word string to described server 610;
First Search Results receiver module 622, for receiving the first Search Results that described server 610 returns;
First Search Results display module 623, for showing described first Search Results;
Second Search Results receiver module 624, for receiving the second Search Results that described server 610 returns;
Second Search Results display module 625, for showing described second Search Results;
First communication module 626, for carrying out communication with described second user;
Second client 630 of telling can comprise as lower module:
Second communication module 631, for carrying out communication with described first user.
In one preferred embodiment of the invention, described first user can have community information, and described communication link block 619 can comprise following submodule:
Communication request sends submodule, for sending the communication request of carrying out communication with first user to the second user;
First process submodule, for when receiving for the anonymous instruction of the community information of described first user or open instruction, carrying out anonymity process to the community information of described first user or openly to process;
Second process submodule, for when receiving for the anonymous instruction of the community information of described second user or open instruction, carrying out anonymity process to the community information of described second user or openly to process;
Set up submodule, for adopt the community information of the described first user carried out after anonymous process or open process with, adopt the community information of described second user carried out after anonymous process or open process, set up described first user and be connected with the communication of described second user;
Described first communication module 626 can comprise following submodule:
First refers to sending module, for sending the anonymous instruction of the community information of first user or open instruction to described server;
Described second communication module 631 can comprise following submodule:
Communication request receive submodule, for receive described server send, carry out the communication request of communication with first user;
Second refers to send submodule, for sending the anonymous instruction of the community information of the second user or open instruction to described server.
In one preferred embodiment of the invention, the second Search Results synthesis module 617 comprises following submodule:
Association spends calculating sub module closely, closely spends with described associating of second user for calculating described first user;
Community information sorting sub-module, sorts to the community information of the second user for closely spending according to association;
3rd synthon module, for synthesizing the second Search Results by the community information of the second user after the described network information and sequence.
In one preferred embodiment of the invention, described association is closely spent calculating sub module and can be comprised following submodule:
Weight configuration submodule, for the similarity to described first query intention information and described second query intention information, and/or, related information between described first user and described second user, and/or, the weight that described second user is corresponding to the historical operation information record configuration of described second query intention;
Second read group total submodule, for the similarity to the described first query intention information after configure weights and described second query intention information, and/or, related information between described first user and described second user, and/or, the historical operation information of described second user to described second query intention carries out read group total, obtains described first user and closely spends with described associating of second user.
In a kind of preferred exemplary of the embodiment of the present invention, the related information between described first user and described second user can comprise following at least one:
Quantity, the dwelling places of the average contact number of times in preset time period, the average contact duration in preset time period, common good friend;
The historical operation information of described second user to described second query intention can comprise following at least one:
The network information that searching times corresponding to described second query intention, described second query intention are corresponding browse search continuous days corresponding to duration, described second query intention.
For device, system embodiment, due to itself and embodiment of the method basic simlarity, so description is fairly simple, relevant part illustrates see the part of embodiment of the method.
Above to the system of the method for a kind of search provided by the present invention, a kind of device of search and a kind of search, be described in detail, apply specific case herein to set forth principle of the present invention and embodiment, the explanation of above embodiment just understands method of the present invention and core concept thereof for helping; Meanwhile, for one of ordinary skill in the art, according to thought of the present invention, all will change in specific embodiments and applications, in sum, this description should not be construed as limitation of the present invention.

Claims (17)

1. a method for search, is characterized in that, comprising:
When receiving the first query word string that first user is submitted to, searching for described first query word string, obtaining the network information of coupling;
Search second user with described first user with same or similar query intention; Wherein, described second user has community information;
Judge whether described first query word string meets the privacy conditions preset; When described first query word string meets default privacy conditions, confidential treatment is carried out to the community information of described second user;
The described network information and the community information of the second user carried out after confidential treatment are synthesized the first Search Results.
2. method according to claim 1, is characterized in that, also comprises:
When the community information of described second user is triggered by described first user, set up described first user and be connected with the communication of described second user.
3. method according to claim 2, is characterized in that, described first user has community information, and the described step setting up the communication link of described first user and described second user comprises:
The communication request of carrying out communication with described first user is sent to described second user;
When receiving for the anonymous instruction of the community information of described first user or open instruction, anonymity process or open process are carried out to the community information of described first user;
When receiving for the anonymous instruction of the community information of described second user or open instruction, anonymity process or open process are carried out to the community information of described second user;
Adopt the community information of described second user after the community information of the described first user carried out after anonymous process or open process and, anonymous process or open process, set up described first user and be connected with the communication of described second user.
4. the method according to claim 1 or 2 or 3, is characterized in that, described in search second user with described first user with same or similar query intention step comprise:
Obtain the first query intention information of described first user and the second query intention information of described second user respectively;
Calculate the similarity of described first query intention information and described second query intention information;
When described similarity is greater than default similarity threshold, judge that described first user and described second user have same or analogous query intention.
5. method according to claim 4, is characterized in that, described first query intention information comprises first eigenvector, and described first eigenvector is determined according to described first query word string;
Described second query intention information comprises second feature vector, and described second feature vector is determined according to described second query word string; Wherein, described second query word string is the query word string that described second user formerly submits to.
6. method according to claim 5, is characterized in that, described first eigenvector comprises following at least one:
First query word string, with the proper vector of point word association of the first query word string, the proper vector that associates with the network information of the first query word String matching;
Described second feature vector comprises following at least one:
Second query word string, with the proper vector of point word association of the second query word string, the proper vector that associates with the network information of the second query word String matching.
7. the method according to claim 1 or 2 or 3, is characterized in that, the described step judging that whether described first query word string meets the privacy conditions preset comprises:
Search the synonym string of described first query word string;
Word segmentation processing is carried out to the synonym string of described first query word string and described first query word string, obtains one or more inquiry participle;
According to described inquiry participle and the co-occurrence number of times of secret word preset and/or spacing distance, the secret weight corresponding to described one or more inquiry participle configuration;
The secret weight of the secret weight of described first query word string and the synonym string of described first query word string is obtained respectively according to the described inquiry participle after configure weights;
The mean value of the secret weight of the synonym string of the secret weight of described first query word string and described first query word string is set to the secret weight of target;
When described target right to keep confidential is great when default weight threshold, judge that described first query word string meets default privacy conditions.
8. the method according to claim 1 or 2 or 3, is characterized in that, the described step judging that whether described first query word string meets the privacy conditions preset comprises:
Search the ratio for other user anonymity communications of described same or similar query intention in the whole network;
When described ratio is greater than default proportion threshold value, judge that described first query word string meets default privacy conditions.
9. the method according to claim 1 or 2 or 3, is characterized in that, the step that the described community information to described second user carries out confidential treatment comprises:
Anonymity process is carried out to the community information of described second user;
The communications portal object of communication is carried out with the community information of the second user after anonymity process structure and described second user.
10. the method according to claim 1 or 2 or 3, is characterized in that, described step of the described network information and the community information of the second user that carries out after confidential treatment being synthesized the first Search Results comprises:
Calculate described first user closely to spend with described associating of second user;
Community information corresponding to the second user after closely spending confidential treatment according to described association sorts;
Community information corresponding for the second user after the described network information and sequence is synthesized the first Search Results.
11. methods according to claim 1 or 2 or 3 or 5 or 6, is characterized in that having community's friend relation between described first user and described second user.
The device of 12. 1 kinds of search, is characterized in that, comprising:
Network information search module, for when receiving the first query word string that first user is submitted to, searches for described first query word string, obtains the network information of coupling;
User searches module, for searching second user with described first user with same or similar query intention; Wherein, described second user has community information;
Privacy conditions judge module, for judging whether described first query word string meets the privacy conditions preset;
Confidential treatment module, for when described first query word string meets default privacy conditions, carries out confidential treatment to the community information of described second user;
First Search Results synthesis module, for synthesizing the first Search Results by the described network information and the community information of the second user carried out after confidential treatment.
13. devices according to claim 12, is characterized in that, also comprise:
Communication link block, for when the community information of described second user is triggered by described first user, sets up described first user and is connected with the communication of described second user.
14. devices according to claim 13, is characterized in that, described first user has community information, and described communication link block comprises:
Communication request sends submodule, for sending the communication request of carrying out communication with first user to the second user;
First process submodule, for when receiving for the anonymous instruction of the community information of described first user or open instruction, carrying out anonymity process to the community information of described first user or openly to process;
Second process submodule, for when receiving for the anonymous instruction of the community information of described second user or open instruction, carrying out anonymity process to the community information of described second user or openly to process;
Set up submodule, for adopt the community information of the described first user carried out after anonymous process or open process with, the community information of described second user after anonymous process or openly process, sets up described first user and is connected with the communication of described second user.
The system of 15. 1 kinds of search, it is characterized in that, described system comprises server and the first client, and first user is at described first client logs;
Wherein, described server comprises:
Network information search module, for when receiving the first query word string that first user is submitted to, searches for described first query word string, obtains the network information of coupling;
User searches module, for searching second user with described first user with same or similar query intention; Wherein, described second user has community information;
Privacy conditions judge module, for judging whether described first query word string meets the privacy conditions preset;
Confidential treatment module, for when described first query word string meets default privacy conditions, carries out confidential treatment to the community information of described second user;
First Search Results synthesis module, for synthesizing the first Search Results by the described network information and the community information of the second user carried out after confidential treatment;
First Search Results returns module, for described first Search Results is returned first user;
Described first client comprises:
First query word string submits module to, for submitting the first query word string to described server;
First Search Results receiver module, for receiving the first Search Results that described server returns;
First Search Results display module, for showing described first Search Results.
16. systems according to claim 15, is characterized in that, described system also comprises the second client, and the second user is at described second client logs;
Described server also comprises:
Communication link block, for when the community information of described second user is triggered by described first user, sets up described first user and is connected with the communication of described second user;
Described first client also comprises:
First communication module, for carrying out communication with described second user;
Described second client comprises:
Second communication module, for carrying out communication with described first user.
17. systems according to claim 16, is characterized in that, described first user has community information, and described communication link block comprises:
Communication request sends submodule, for sending the communication request of carrying out communication with first user to the second user;
First process submodule, for when receiving for the anonymous instruction of the community information of described first user or open instruction, carrying out anonymity process to the community information of described first user or openly to process;
Second process submodule, for when receiving for the anonymous instruction of the community information of described second user or open instruction, carrying out anonymity process to the community information of described second user or openly to process;
Set up submodule, for adopt the community information of the described first user carried out after anonymous process or open process with, the community information of described second user after anonymous process or openly process, sets up described first user and is connected with the communication of described second user;
Described first communication module comprises:
First instruction sends submodule, for sending the anonymous instruction of the community information of first user or open instruction to described server;
Described second communication module comprises:
Communication request receive submodule, for receive described server send, carry out the communication request of communication with first user;
Second refers to send submodule, for sending the anonymous instruction of the community information of the second user or open instruction to described server.
CN201410261086.3A 2014-06-12 2014-06-12 Searching method, device and system Active CN105224555B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410261086.3A CN105224555B (en) 2014-06-12 2014-06-12 Searching method, device and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410261086.3A CN105224555B (en) 2014-06-12 2014-06-12 Searching method, device and system

Publications (2)

Publication Number Publication Date
CN105224555A true CN105224555A (en) 2016-01-06
CN105224555B CN105224555B (en) 2019-12-10

Family

ID=54993528

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410261086.3A Active CN105224555B (en) 2014-06-12 2014-06-12 Searching method, device and system

Country Status (1)

Country Link
CN (1) CN105224555B (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106980651A (en) * 2017-03-02 2017-07-25 中电海康集团有限公司 A kind of knowledge based collection of illustrative plates crawls seed list update method and device
CN107862067A (en) * 2017-11-17 2018-03-30 中国银行股份有限公司 A kind of screening technique and device of bank loan data query
CN109543077A (en) * 2018-10-16 2019-03-29 清华大学 Community search method
CN111666417A (en) * 2020-04-13 2020-09-15 百度在线网络技术(北京)有限公司 Method and device for generating synonyms, electronic equipment and readable storage medium
CN115225471A (en) * 2022-07-15 2022-10-21 中国工商银行股份有限公司 Log analysis method and device

Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1741012A (en) * 2004-08-23 2006-03-01 富士施乐株式会社 Test search apparatus and method
CN101136869A (en) * 2006-08-30 2008-03-05 高鹏 Method for generating search intention based contacts group of instant communication system
CN101206674A (en) * 2007-12-25 2008-06-25 北京科文书业信息技术有限公司 Enhancement type related search system and method using commercial articles as medium
CN102394762A (en) * 2011-11-01 2012-03-28 陈晓亮 Many-people-involved on-line communication system method
CN103034672A (en) * 2011-09-29 2013-04-10 云壤(北京)信息技术有限公司 Social search system and social search method
CN103109291A (en) * 2010-08-16 2013-05-15 费斯布克公司 People directory with social privacy and contact association features
CN103116587A (en) * 2011-11-17 2013-05-22 阿里巴巴集团控股有限公司 Excavating method and data searching method and device for keywords capable of defaulting
CN103379024A (en) * 2012-04-26 2013-10-30 腾讯科技(深圳)有限公司 Method for issuing microblog information and server
CN103412876A (en) * 2013-07-12 2013-11-27 宇龙计算机通信科技(深圳)有限公司 Network platform and method for looking for people or items through network platform
CN103425662A (en) * 2012-05-16 2013-12-04 腾讯科技(深圳)有限公司 Information search method and device in network community

Patent Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1741012A (en) * 2004-08-23 2006-03-01 富士施乐株式会社 Test search apparatus and method
CN101136869A (en) * 2006-08-30 2008-03-05 高鹏 Method for generating search intention based contacts group of instant communication system
CN101206674A (en) * 2007-12-25 2008-06-25 北京科文书业信息技术有限公司 Enhancement type related search system and method using commercial articles as medium
CN103109291A (en) * 2010-08-16 2013-05-15 费斯布克公司 People directory with social privacy and contact association features
CN103034672A (en) * 2011-09-29 2013-04-10 云壤(北京)信息技术有限公司 Social search system and social search method
CN102394762A (en) * 2011-11-01 2012-03-28 陈晓亮 Many-people-involved on-line communication system method
CN103116587A (en) * 2011-11-17 2013-05-22 阿里巴巴集团控股有限公司 Excavating method and data searching method and device for keywords capable of defaulting
CN103379024A (en) * 2012-04-26 2013-10-30 腾讯科技(深圳)有限公司 Method for issuing microblog information and server
CN103425662A (en) * 2012-05-16 2013-12-04 腾讯科技(深圳)有限公司 Information search method and device in network community
CN103412876A (en) * 2013-07-12 2013-11-27 宇龙计算机通信科技(深圳)有限公司 Network platform and method for looking for people or items through network platform

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106980651A (en) * 2017-03-02 2017-07-25 中电海康集团有限公司 A kind of knowledge based collection of illustrative plates crawls seed list update method and device
CN107862067A (en) * 2017-11-17 2018-03-30 中国银行股份有限公司 A kind of screening technique and device of bank loan data query
CN109543077A (en) * 2018-10-16 2019-03-29 清华大学 Community search method
CN111666417A (en) * 2020-04-13 2020-09-15 百度在线网络技术(北京)有限公司 Method and device for generating synonyms, electronic equipment and readable storage medium
CN111666417B (en) * 2020-04-13 2023-06-23 百度在线网络技术(北京)有限公司 Method, device, electronic equipment and readable storage medium for generating synonyms
CN115225471A (en) * 2022-07-15 2022-10-21 中国工商银行股份有限公司 Log analysis method and device

Also Published As

Publication number Publication date
CN105224555B (en) 2019-12-10

Similar Documents

Publication Publication Date Title
CN101641694B (en) Federated search implemented across multiple search engines
US9324113B2 (en) Presenting social network connections on a search engine results page
USRE48437E1 (en) Collecting and scoring online references
US8903800B2 (en) System and method for indexing food providers and use of the index in search engines
US7756867B2 (en) Ranking documents
US8095545B2 (en) System and methodology for a multi-site search engine
US20130054569A1 (en) Vertical Search-Based Query Method, System and Apparatus
CN101496003A (en) Compatibility scoring of users in a social network
WO2005111787A2 (en) A method for indexing and searching geocoded pages of a web site
CN103390000B (en) A kind of web search method and web page search system
CN102402619A (en) Search method and device
US20100293448A1 (en) Centralized website local content customization
WO2012040692A2 (en) Presenting social search results
CN105224555A (en) A kind of methods, devices and systems of search
WO2016078533A1 (en) Search method, apparatus, and device and non-volatile computer storage medium
US20110173192A1 (en) Search method, system and device
CN105159898A (en) Searching method and searching device
US9973950B2 (en) Technique for data traffic analysis
Juárez et al. Toward a privacy agent for information retrieval
CN101788981A (en) Deep web mobile search method, server and system
US8630992B1 (en) URL rank variability determination
TWI483129B (en) Retrieval method and device
US20130226900A1 (en) Method and system for non-ephemeral search
CN105159899B (en) Searching method and device
Mounika et al. Advanced Graph Analytics Algorithms On Genre Based Recommending System

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant