CN105824951A - Retrieval method and retrieval device - Google Patents

Retrieval method and retrieval device Download PDF

Info

Publication number
CN105824951A
CN105824951A CN201610170303.7A CN201610170303A CN105824951A CN 105824951 A CN105824951 A CN 105824951A CN 201610170303 A CN201610170303 A CN 201610170303A CN 105824951 A CN105824951 A CN 105824951A
Authority
CN
China
Prior art keywords
information
author
retrieval
social network
network sites
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201610170303.7A
Other languages
Chinese (zh)
Other versions
CN105824951B (en
Inventor
郝运峰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Baidu Netcom Science and Technology Co Ltd
Original Assignee
Beijing Baidu Netcom Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Baidu Netcom Science and Technology Co Ltd filed Critical Beijing Baidu Netcom Science and Technology Co Ltd
Priority to CN201610170303.7A priority Critical patent/CN105824951B/en
Publication of CN105824951A publication Critical patent/CN105824951A/en
Application granted granted Critical
Publication of CN105824951B publication Critical patent/CN105824951B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Systems or methods specially adapted for specific business sectors, e.g. utilities or tourism
    • G06Q50/01Social networking

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • General Health & Medical Sciences (AREA)
  • Economics (AREA)
  • Primary Health Care (AREA)
  • Strategic Management (AREA)
  • Tourism & Hospitality (AREA)
  • Human Resources & Organizations (AREA)
  • General Business, Economics & Management (AREA)
  • Marketing (AREA)
  • Health & Medical Sciences (AREA)
  • Computing Systems (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • Information Transfer Between Computers (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a retrieval method and a retrieval device. A specific embodiment of the method comprises the following steps of receiving a retrieval request from a user, wherein the retrieval request comprises retrieval keywords; performing retrieval operation on at least one reserved social networking site according to the retrieval keywords to generate a retrieval information set; scoring each piece of retrieval information in the retrieval information set according to website information of the social networking site, corresponding to the retrieval information and author information about contents of the social networking site, corresponding to the retrieval information; sorting all pieces of retrieval information according to scores to generate a set of sorted retrieval information as a retrieval result. According to the embodiment, the retrieval result has higher pertinence.

Description

Search method and device
Technical field
The application relates to field of computer technology, is specifically related to Internet technical field, particularly relates to search method and device.
Background technology
Search engine ranking refer to search engine send one can be in the online program finding new web page and capturing file, this program is commonly called web crawlers.Web crawlers is known webpage beginning from data base, accesses these webpages and capture file just as the browser of normal users.After processing search word, search engine collator is started working, and finds out all webpages comprising search word from index data base, and calculates before which webpage should come according to rank algorithm, then returns " search " page by certain form.So search engine just need only can complete and return the desired Search Results of user within one or two second.
At present, Search Results has substantial amounts of social network sites original content, and existing search engine rank algorithm mainly carries out ranking with content relevance, website rank, the factor such as ageing to the webpage comprising search word, do not consider author's factor of original content, thus, there is social network sites related data under-utilized so that retrieval lacking of property of result.
Summary of the invention
The purpose of the application is to propose search method and the device of a kind of improvement, solves the technical problem that background section above is mentioned.
First aspect, this application provides a kind of search method, and described method includes: receiving the retrieval request of user, wherein, described retrieval request includes search key;According to described search key, at least one predetermined social network sites is carried out search operaqtion, generate retrieval information aggregate;To in described retrieval information aggregate each retrieval information, according to the author information of the content of the site information of social network sites corresponding to this retrieval information social network sites corresponding with this retrieval information, this retrieval information is marked;According to scoring, each bar retrieval information is ranked up, generates the set of ranked retrieval information as retrieval result.
In certain embodiments, described site information includes the website rank of described website.
In certain embodiments, described to each retrieval information in described retrieval information aggregate, the author information of the content of the social network sites that site information according to social network sites corresponding to this retrieval information is corresponding with this retrieval information, before marking this retrieval information, described method also includes: obtain the author information of the content of the site information of social network sites corresponding to the described retrieval information social network sites corresponding with described retrieval information.
In certain embodiments, the author information of the content of the social network sites that the site information of the social network sites that described acquisition described retrieval information is corresponding is corresponding with described retrieval information, including: the author information of the content of the site information of social network sites corresponding to the described retrieval information social network sites corresponding with described retrieval information is captured by web crawlers technology.
In certain embodiments, described method also includes: receive the author information of site information, content information and/or the content of at least one predetermined social network sites active push described.
In certain embodiments, described author information include following at least one: author's essential information and author's behavioural information;Wherein, described author's essential information include following at least one: whether authors' name, author pass through official's certification of social network sites the corresponding grade of social network sites, author in the concerns quantity of corresponding social network sites and author;Described author's behavioural information include following at least one: the touching quantity of the content that the forwardings quantity of content that the replys quantity of content that the issuing time of content that author issues on corresponding social network sites, author issue on corresponding social network sites, author issue on corresponding social network sites, author issue on corresponding social network sites and author on the social network sites of correspondence the content of issue represent quantity.
Second aspect, this application provides a kind of retrieval device, and described device includes: receive unit, is configured to receive the retrieval request of user, and wherein, described retrieval request includes search key;Retrieval unit, is configured to, according to described search key, at least one predetermined social network sites is carried out search operaqtion, generates retrieval information aggregate;Scoring unit, is configured to each retrieval information in described retrieval information aggregate, according to the author information of the content of the site information of social network sites corresponding to this retrieval information social network sites corresponding with this retrieval information, marks this retrieval information;Sequencing unit, is configured to, according to scoring, be ranked up each bar retrieval information, generates the set of ranked retrieval information as retrieval result.
In certain embodiments, described site information includes the website rank of described website.
In certain embodiments, described device also includes: acquiring unit, is configured to obtain the author information of the content of the site information of social network sites corresponding to the described retrieval information social network sites corresponding with described retrieval information.
In certain embodiments, described acquiring unit is configured to further: captured the author information of the content of the site information of social network sites corresponding to the described retrieval information social network sites corresponding with described retrieval information by web crawlers technology.
In certain embodiments, described device also includes: receive unit, is configured to receive the author information of site information, content information and/or the content of at least one predetermined social network sites active push described.
In certain embodiments, described author information include following at least one: author's essential information and author's behavioural information;Wherein, described author's essential information include following at least one: whether authors' name, author pass through official's certification of social network sites the corresponding grade of social network sites, author in the concerns quantity of corresponding social network sites and author;Described author's behavioural information include following at least one: the touching quantity of the content that the forwardings quantity of content that the replys quantity of content that the issuing time of content that author issues on corresponding social network sites, author issue on corresponding social network sites, author issue on corresponding social network sites, author issue on corresponding social network sites and author on the social network sites of correspondence the content of issue represent quantity.
The search method of the application offer and device, by the search key in utilizing user search to ask, predetermined social network sites is carried out retrieval and generate retrieval information aggregate, then according to the author information that the social network sites that every retrieval information in retrieval information aggregate is corresponding is corresponding with this retrieval information, this retrieval information is marked, finally according to scoring, each retrieval information is ranked up, and using the retrieval information aggregate after sequence as retrieval result, thus effectively make use of the author information of social network sites so that retrieval result has more specific aim.
Accompanying drawing explanation
By reading the detailed description being made non-limiting example made with reference to the following drawings, other features, purpose and advantage will become more apparent upon:
Fig. 1 is that the application can apply to exemplary system architecture figure therein;
Fig. 2 is the flow chart of an embodiment of the search method according to the application;
Fig. 3 is the schematic diagram of an application scenarios of the search method according to the application;
Fig. 4 is the flow chart of another embodiment of the search method according to the application;
Fig. 5 is the structural representation of an embodiment of the retrieval device according to the application;
Fig. 6 is adapted for the structural representation of the computer system of the server for realizing the embodiment of the present application.
Detailed description of the invention
With embodiment, the application is described in further detail below in conjunction with the accompanying drawings.It is understood that specific embodiment described herein is used only for explaining related invention, rather than the restriction to this invention.It also should be noted that, for the ease of describing, accompanying drawing illustrate only the part relevant to about invention.
It should be noted that in the case of not conflicting, the embodiment in the application and the feature in embodiment can be mutually combined.Describe the application below with reference to the accompanying drawings and in conjunction with the embodiments in detail.
Fig. 1 shows search method or the exemplary system architecture 100 of embodiment of retrieval device that can apply the application.
As it is shown in figure 1, system architecture 100 can include terminal unit 101,102,103, network 104 and server 105.Network 104 is in order to provide the medium of communication link between terminal unit 101,102,103 and server 105.Network 104 can include various connection type, the most wired, wireless communication link or fiber optic cables etc..
User can use terminal unit 101,102,103 mutual with server 105 by network 104, to receive or to send message etc..The application of various telecommunication customer end, such as web browser applications, searching class application, social platform software etc. can be installed on terminal unit 101,102,103.
Terminal unit 101,102,103 can be to have display screen and support retrieval and the various electronic equipments of web page browsing, include but not limited to smart mobile phone, panel computer, E-book reader, MP3 player (MovingPictureExpertsGroupAudioLayerIII, dynamic image expert's compression standard audio frequency aspect 3), MP4 (MovingPictureExpertsGroupAudioLayerIV, dynamic image expert's compression standard audio frequency aspect 4) player, pocket computer on knee and desk computer etc..
Server 105 can be to provide the server of various service, such as, the retrieval request generated on terminal unit 101,102,103 provides the backstage retrieval server with search engine functionality supported.The data such as the retrieval request received can be analyzed waiting and process by backstage retrieval server, and retrieval result (such as webpage data) is fed back to terminal unit.
It should be noted that the search method that the embodiment of the present application is provided typically is performed by server 105, correspondingly, retrieval device is generally positioned in server 105.
It should be understood that the number of terminal unit, network and the server in Fig. 1 is only schematically.According to realizing needs, can have any number of terminal unit, network and server.
With continued reference to Fig. 2, it is shown that according to the flow process 200 of an embodiment of the search method of the application.Described search method, comprises the following steps:
Step 201, receives the retrieval request of user.
In the present embodiment, search method runs on electronic equipment thereon (the such as server shown in Fig. 1, especially there is the server of search engine functionality) its terminal carrying out retrieving can be utilized by wired connection mode or radio connection to receive the retrieval request of user from user, wherein, above-mentioned retrieval request includes search key.The request content of above-mentioned retrieval request includes but not limited to word, picture and voice.As example, when the retrieval request of above-mentioned user is picture, above-mentioned electronic equipment can call OCR (OpticalCharacterRecognition, optical character recognition) software interface carries out Text region to the picture in retrieval request, and acquirement comprises the recognition result of at least one search key;When the retrieval request of above-mentioned user is voice, above-mentioned electronic equipment can by speech recognition software (such as, Viavoice) interface carries out Text region to the voice in retrieval request, and obtains the recognition result comprising at least one search key.
Step 202, carries out search operaqtion according to search key at least one predetermined social network sites, generates retrieval information aggregate.
In the present embodiment, search method runs on the content that can prestore a plurality of predetermined social network sites on electronic equipment thereon, these contents can be carried out search operaqtion, in order to present on a web browser as retrieval information.
In the present embodiment, above-mentioned electronic equipment, based on the search key in the retrieval request of the user received in step 201, carries out search operaqtion at least one predetermined social network sites, generate retrieval information aggregate, wherein, above-mentioned retrieval information can be info web, it is also possible to is snapshots of web pages.
In the present embodiment, above-mentioned predetermined social network sites can be the website set manually;It can also be the website of above-mentioned electronic equipment acquiescence;Can also is that when website meets predetermined condition, the website of above-mentioned electronic equipment sets itself, such as, when the always amount of posting of website is more than 1,000,000, this website can be set as social network sites by above-mentioned electronic equipment;When total customer volume of website is more than 500,000, this website can also be set as social network sites by above-mentioned electronic equipment;When total visit capacity of website is more than 5,000,000, this website can also be set as social network sites by above-mentioned electronic equipment.
In the present embodiment, search key word can be mated the most one by one by above-mentioned electronic equipment with the retrieval information from predetermined social network sites, and determines whether this retrieval information can be put in retrieval information aggregate according to the number of the key word included by the retrieval information of every social network sites.Such as, if the retrieval information of certain social network sites includes at least one search key, then this retrieval information can be put in retrieval information aggregate.
Step 203, to each retrieval information in retrieval information aggregate, according to the author information of the content of the site information of social network sites corresponding to this retrieval information social network sites corresponding with this retrieval information, marks to this retrieval information.
In the present embodiment, this retrieval information for each retrieval information in the retrieval information aggregate generated in step 202, is marked by above-mentioned electronic equipment according to the author information of the content of the site information of social network sites corresponding to this retrieval information social network sites corresponding with this retrieval information.
In some optional implementations of the present embodiment, site information can include the website rank (PR value, PageRank value) of this website.It should be noted that PR value can also be the webpage rank of the webpage that the website that above-mentioned site information is corresponding is comprised.PR value is used to show webpage or a standard of website grade, and rank is 0 to 10 respectively.Such as, PR value be 1 website show that this website is hardly important, and the website that PR value is 7 to 10 shows that this website is critically important.
In some optional implementations of the present embodiment, author information can include following at least one: author's essential information and author's behavioural information.Wherein, author's essential information can include following at least one: whether authors' name (author ID), author pass through official's certification of website the corresponding grade of social network sites, author in the concern quantity (vermicelli quantity) of corresponding social network sites and author;Author's behavioural information can include following at least one: the forwarding quantity of the content that the replys quantity of content that the issuing time of content that author issues on corresponding social network sites, author issue on corresponding social network sites, author issue on corresponding social network sites, the touching quantity of the content that author issues on corresponding social network sites and author on corresponding social network sites the content of issue represent quantity.
In the present embodiment, according to the concern quantity of the author of the content of the PR value of retrieval social network sites corresponding to the information social network sites corresponding with this retrieval information, this retrieval information can be marked.
Below equation can be utilized to calculate the mark of retrieval information.
K = R f a n s ( E f a n s Max f a n s ) * R 1 * K 1
Wherein, K is the mark of retrieval information, RfansFor the concern quantity of the author tune weight coefficient in the ranking of social network sites, EfansFor the concern quantity of author, MaxfansFor retrieval information derives from the highest concern quantity of the author of same social network sites, R with above-mentioned author1For author concern quantity retrieval information rank in tune weight coefficient, K1PR value for this social network sites.Wherein, tune weight coefficient can be the coefficient of the above-mentioned electronic equipment importance for weighing a parameter set in advance.As example, work as RfansIt is 0.8, K1It is 6, EfansIt is 1000, MaxfansIt is 10000, R1When being 2, the mark of this retrieval information is 0.96.
In the present embodiment, it is also possible to according to the reply quantity of the content that the author of the content of the PR value of retrieval social network sites corresponding to the information social network sites corresponding with this retrieval information issues on this social network sites, this retrieval information is marked.
Below equation can be utilized to calculate the mark of retrieval information.
K = R r e p l y Σ i = 1 n ( T i o l d T i n o w N i r e p l y ) * K 1 * R 2
Wherein, K is the mark of retrieval information, RreplyFor author the reply quantity of the content of issue tune weight coefficient in the ranking of social network sites, T on corresponding social network sitesioldTime during i-th content, T is issued for authorinowFor current time, NireplyIt is the reply quantity of i-th content, K1For the PR value of this social network sites, R2For content reply quantity retrieval information rank in tune weight coefficient, wherein, i Yu n is natural number.As example, work as RreplyIt is 1.2,It is 0.999, N1replyIt is 1000,It is 0.998, N2replyIt is 500, K1It is 8, R2When being 0.9, the mark of this retrieval information is 12942.72.
In the present embodiment, can also be according to the author of the content of the PR value of retrieval social network sites corresponding to the information social network sites corresponding with this retrieval information at the user gradation of this social network sites, author is in the concern quantity (vermicelli quantity) of this social network sites, whether author is by official's certification of this social network sites, the issuing time of the content that author issues on this social network sites, the reply quantity of the content that author issues on this social network sites, the content that the touching quantity of the content that the forwarding quantity of the content that author issues on this social network sites and author issue on this social network sites and author issue on this social network sites represent quantity, this retrieval information is marked.Now, the rank score of mark and the author's grade of retrieval information rank score, the rank score of author's history liveness in social network sites and author's historical influence power in the above-mentioned electronic equipment with search engine functionality in social network sites is relevant.
It is possible, firstly, to utilize the rank score in corresponding social network sites of the author's grade in below equation calculating retrieval information.
K 2 = R g r a d e ( E g r a d e Max g r a d e ) + R f a n s ( E f a n s Max f a n s ) + V
Wherein, K2For author's grade rank score in social network sites, RgradeFor the user gradation of the author tune weight coefficient in the ranking of social network sites, EgradeFor the user gradation of author, MaxgradeFor retrieval information derives from the highest user gradation of the author of same social network sites, R with above-mentioned authorfansFor the concern quantity of the author tune weight coefficient in the ranking of social network sites, EfansFor the concern quantity of author, MaxfansFor deriving from the highest concern quantity of the author of same social network sites in retrieval information with above-mentioned author, V is author by the official's certification of this social network sites tune weight coefficient in the ranking of social network sites.
It is then possible to utilize below equation to calculate the rank score of the history liveness in social network sites of the author in retrieval information.
K 3 = R r e p l y Σ i = 1 n ( T i o l d T i n o w N i r e p l y ) + R s h a r e Σ i = 1 n ( T i o l d T i n o w N i s h a r e )
Wherein, K3For the rank score of author's history liveness in social network sites, RreplyFor author the reply quantity of the content of issue tune weight coefficient in the ranking of social network sites, T on corresponding social network sitesioldTime during i-th content, T is issued for authorinowFor current time, NireplyIt is the reply quantity of i-th content, RshareFor author the forwarding quantity of the content of issue tune weight coefficient in the ranking of social network sites, N on corresponding social network sitesishareBeing the forwarding quantity of i-th content, wherein, i Yu n is natural number.
Then, it is possible to use below equation calculates the rank score of the historical influence power in the above-mentioned electronic equipment with search engine functionality of the author in retrieval information.There is the server of search engine functionality
K 4 = Σ i = 1 n ( N i c l i c k N i s h o w * T i o l d T i n o w )
Wherein, K4For the rank score of author's historical influence power in the above-mentioned electronic equipment with search engine functionality, NiclickIt is i-th content click volume in above-mentioned electronic equipment, NishowIt is i-th content amount of representing in above-mentioned electronic equipment, TioldTime during i-th content, T is issued for authorinowFor current time, wherein, i Yu n is natural number.
Finally, it is possible to use below equation calculates the mark of retrieval information.
K=R1*K1*K2+R2*K1*K3+R3*K4
Wherein, K is the mark of retrieval information, R1For author's grade ranking in social network sites tune weight coefficient in retrieval information rank, R2For author's history liveness in social network sites ranking retrieval information rank in tune weight coefficient, R3For author's historical influence power in the above-mentioned electronic equipment with search engine functionality ranking retrieval information rank in tune weight coefficient, K1For the PR value of this social network sites, K2For author's grade rank score in social network sites, K3For the rank score of author's history liveness in social network sites, K4Rank score for author's historical influence power in the above-mentioned electronic equipment with search engine functionality.
Step 204, according to scoring, is ranked up each bar retrieval information, generates the set of ranked retrieval information as retrieval result.
In the present embodiment, above-mentioned electronic equipment is according to the mark of the retrieval information obtained in step 203, each bar retrieval information is ranked up by the order descending according to above-mentioned mark, and using the set of at least one retrieval information comprising above-mentioned each bar retrieval information after sequence as retrieval result.
The method that above-described embodiment of the application provides, by at least one predetermined social network sites being carried out search operaqtion according to the search key in the retrieval request receiving user, this retrieval information is marked by the author information corresponding with retrieval information further according to the social network sites information that each retrieval information in the retrieval information aggregate that search operaqtion obtains is corresponding, according to appraisal result, retrieval information is ranked up, obtains retrieval information aggregate after sorted as retrieval result.The method effectively make use of the author information of social network sites so that retrieval result has more specific aim.
It it is a schematic diagram of the application scenarios of the search method according to the present embodiment with continued reference to Fig. 3, Fig. 3.In the application scenarios of Fig. 3, user first passes through terminal unit (client) and initiates a retrieval request " The Old Town of Lijiang ";Afterwards, at least one predetermined social network sites is retrieved by above-mentioned electronic equipment according to the search key " The Old Town of Lijiang ", " Lijing " and " ancient city " in retrieval request " The Old Town of Lijiang ", and generation comprises at least one " The Old Town of Lijiang ", the retrieval information 301 of " Lijing " or " ancient city ", retrieval information 302 and retrieval information 303, and put it in retrieval information aggregate;Then, this retrieval information is marked by above-mentioned electronic equipment according to the author information of the site information of A blog of retrieval information 301 correspondence in the retrieval information aggregate A author corresponding with this retrieval information, mark is 6.8, this retrieval information is marked by the author information of the B author that the site information of the B blog according to retrieval information 302 correspondence is corresponding with this retrieval information, mark is 5.3, this retrieval information is marked by the author information of the C author that the site information of the C blog according to retrieval information 303 correspondence is corresponding with this retrieval information, and mark is 4.7;Finally, it is ranked up by above-mentioned electronic equipment according to the scoring of retrieval information 301, retrieval information 302 and retrieval information 303, and the retrieval result of generation is as shown in Figure 3.
Each retrieval information in retrieval information aggregate is ranked up by the method that above-described embodiment of the application provides by the author information of the site information of social network sites corresponding to the retrieval information content corresponding with retrieval information so that retrieval result has more specific aim.
With further reference to Fig. 4, it illustrates the flow process 400 of another embodiment of search method.The flow process 400 of this search method, comprises the following steps:
Step 401, receives the retrieval request of user.
In the present embodiment, search method runs on electronic equipment thereon (the such as server shown in Fig. 1) and its terminal carrying out retrieving can be utilized by wired connection mode or radio connection to receive the retrieval request of user from user, wherein, above-mentioned retrieval request includes search key.
Step 402, carries out search operaqtion according to search key at least one predetermined social network sites, generates retrieval information aggregate.
In the present embodiment, above-mentioned electronic equipment, based on the search key in the retrieval request of the user received in step 401, carries out search operaqtion at least one predetermined social network sites, generates retrieval information aggregate.Wherein, above-mentioned predetermined social network sites can be the website set manually;It can also be the website of above-mentioned electronic equipment acquiescence;Can also is that when website meets predetermined condition, the website of above-mentioned electronic equipment sets itself.
Step 403, the author information of the content of the social network sites that the site information of the social network sites that acquisition retrieval information is corresponding is corresponding with retrieval information.
In the present embodiment, each in the retrieval information aggregate generated in above-mentioned electronic equipment obtaining step 402 retrieves the author information that the site information of social network sites corresponding to information retrieves the content of social network sites corresponding to information with each.
In some optional implementations of the present embodiment, above-mentioned electronic equipment can capture the author information of the content of the site information of social network sites corresponding to retrieval information and social network sites corresponding to the information of retrieval by web crawlers technology, wherein, web crawlers is otherwise known as webpage Aranea, network robot or webpage follower, it is a kind of according to certain rule, automatically captures program or the script of web message.
In some optional implementations of the present embodiment, above-mentioned electronic equipment can also passively receive the author information of site information, content information and/or the content of at least one predetermined social network sites active push.
Step 404, to each retrieval information in retrieval information aggregate, according to the author information of the content of the site information of social network sites corresponding to this retrieval information social network sites corresponding with this retrieval information, marks to this retrieval information.
In the present embodiment, this retrieval information, for the author information of the content of the site information of social network sites corresponding to the retrieval information got in step 403 social network sites corresponding with retrieval information, is marked by above-mentioned electronic equipment.
In the present embodiment, according to the concern quantity of the author of the content of the PR value of retrieval social network sites corresponding to the information social network sites corresponding with this retrieval information, this retrieval information can be marked.
In the present embodiment, it is also possible to according to the reply quantity of the content that the author of the content of the PR value of retrieval social network sites corresponding to the information social network sites corresponding with this retrieval information issues on this social network sites, this retrieval information is marked.
In the present embodiment, can also be according to the author of the content of the PR value of retrieval social network sites corresponding to the information social network sites corresponding with this retrieval information at the user gradation of this social network sites, author is in the concern quantity (vermicelli quantity) of this social network sites, whether author is by official's certification of this social network sites, the issuing time of the content that author issues on this social network sites, the reply quantity of the content that author issues on this social network sites, the content that the touching quantity of the content that the forwarding quantity of the content that author issues on this social network sites and author issue on this social network sites and author issue on this social network sites represent quantity, this retrieval information is marked.
Step 405, according to scoring, is ranked up each bar retrieval information, generates the set of ranked retrieval information as retrieval result.
In the present embodiment, above-mentioned electronic equipment is according to the mark of the retrieval information obtained in step 404, each bar retrieval information is ranked up by the order descending according to above-mentioned mark, and using the set of at least one retrieval information comprising above-mentioned each bar retrieval information after sequence as retrieval result.
Figure 4, it is seen that compared with the embodiment that Fig. 2 is corresponding, the flow process 400 of the search method in the present embodiment highlights the step obtaining site information and author information.Thus, the scheme that the present embodiment describes can introduce the related data of more site information and author information, thus realizes more comprehensively retrieving choosing and more effectively retrieving result of information.
With further reference to Fig. 5, as to the realization of method shown in above-mentioned each figure, this application provides a kind of embodiment retrieving device, this device embodiment is corresponding with the embodiment of the method shown in Fig. 2, and this device specifically can apply in various electronic equipment.
As it is shown in figure 5, the retrieval device 500 described in the present embodiment includes: receive unit 501, retrieval unit 502, scoring unit 503 and sequencing unit 504.Wherein, receiving unit 501 and be configured to receive the retrieval request of user, wherein, retrieval request includes search key;Retrieval unit 502 is configured to, according to search key, at least one predetermined social network sites is carried out search operaqtion, generates retrieval information aggregate;Scoring unit 503 is configured to each retrieval information in retrieval information aggregate, according to the author information of the content of the site information of social network sites corresponding to this retrieval information social network sites corresponding with this retrieval information, marks this retrieval information;And sequencing unit 504 is configured to according to scoring, each bar retrieval information is ranked up, generates the set of ranked retrieval information as retrieval result.
In the present embodiment, the reception unit 501 of retrieval device 500 can utilize its terminal carrying out retrieving to receive the retrieval request of user by wired connection mode or radio connection from user, wherein, includes search key in above-mentioned retrieval request.
In the present embodiment, retrieval device 500 can prestore the content of a plurality of predetermined social network sites, these contents can be carried out search operaqtion, in order to present on a web browser as retrieval information.Thus, the retrieval unit 502 of retrieval device 500 can carry out search operaqtion based on the search key that reception unit 501 obtains at least one predetermined social network sites, generates retrieval information aggregate.Wherein, above-mentioned predetermined social network sites can be the website set manually;It can also be the website of above-mentioned electronic equipment acquiescence;Can also is that when website meets predetermined condition, the website of above-mentioned electronic equipment sets itself.
In the present embodiment, this retrieval information can be marked for each retrieval information in the retrieval information aggregate generated in retrieval unit 502 by the scoring unit 503 of retrieval device 500 according to the author information of the content of the site information of social network sites corresponding to this retrieval information social network sites corresponding with this retrieval information.
In the present embodiment, above-mentioned sequencing unit 504 can be according to the mark of the retrieval information obtained in above-mentioned scoring unit 503, each bar retrieval information is ranked up by the order descending according to above-mentioned mark, and using the set of at least one retrieval information comprising above-mentioned each bar retrieval information after sequence as retrieval result.
In some optional implementations of the present embodiment, above-mentioned site information can include the website rank (PR value, PageRank value) of this website.It should be noted that PR value can also be the webpage rank of the webpage that the website that above-mentioned site information is corresponding is comprised.PR value is used to show webpage or a standard of website grade, and rank is 0 to 10 respectively.
In some optional implementations of the present embodiment, above-mentioned retrieval device 500 also includes: acquiring unit (not shown), for obtaining the author information of the content of the site information of social network sites corresponding to each retrieval information in the above-mentioned retrieval information aggregate social network sites corresponding with each retrieval information.
In some optional implementations of the present embodiment, above-mentioned acquiring unit can capture the author information of the content of the site information of social network sites corresponding to retrieval information and social network sites corresponding to the information of retrieval by web crawlers technology, wherein, web crawlers is otherwise known as webpage Aranea, network robot or webpage follower, it is a kind of according to certain rule, automatically captures program or the script of web message.
In some optional implementations of the present embodiment, above-mentioned retrieval device 500 also includes: receive unit (not shown), for receiving the author information of site information, content information and/or the content of at least one predetermined social network sites active push.
In some optional implementations of the present embodiment, above-mentioned author information include following at least one: author's essential information and author's behavioural information;Wherein, above-mentioned author's essential information include following at least one: whether authors' name, author pass through official's certification of social network sites the corresponding grade of social network sites, author in the concerns quantity of corresponding social network sites and author;Above-mentioned author's behavioural information include following at least one: the touching quantity of the content that the forwardings quantity of content that the replys quantity of content that the issuing time of content that author issues on corresponding social network sites, author issue on corresponding social network sites, author issue on corresponding social network sites, author issue on corresponding social network sites and author on the social network sites of correspondence the content of issue represent quantity.
Below with reference to Fig. 6, it illustrates the structural representation of the computer system 600 being suitable to the server for realizing the embodiment of the present application.
As shown in Figure 6, computer system 600 includes CPU (CPU) 601, and it can be loaded into the program random access storage device (RAM) 603 and perform various suitable action and process according to the program being stored in read only memory (ROM) 602 or from storage part 608.In RAM603, also storage has system 600 to operate required various programs and data.CPU601, ROM602 and RAM603 are connected with each other by bus 604.Input/output (I/O) interface 605 is also connected to bus 604.
It is connected to I/O interface 605: include the importation 606 of keyboard, mouse etc. with lower component;Output part 607 including such as cathode ray tube (CRT), liquid crystal display (LCD) etc. and speaker etc.;Storage part 608 including hard disk etc.;And include the communications portion 609 of the NIC of such as LAN card, modem etc..Communications portion 609 performs communication process via the network of such as the Internet.Driver 610 is connected to I/O interface 605 also according to needs.Detachable media 611, such as disk, CD, magneto-optic disk, semiconductor memory etc., be arranged in driver 610 as required, in order to the computer program read from it is mounted into storage part 608 as required.
Especially, according to embodiment of the disclosure, the process described above with reference to flow chart may be implemented as computer software programs.Such as, embodiment of the disclosure and include a kind of computer program, it includes the computer program being tangibly embodied on machine readable media, and described computer program comprises the program code for performing the method shown in flow chart.In such embodiments, this computer program can be downloaded and installed from network by communications portion 609, and/or is mounted from detachable media 611.When this computer program is performed by CPU (CPU) 601, perform the above-mentioned functions limited in the present processes.
Flow chart in accompanying drawing and block diagram, it is illustrated that according to system, architectural framework in the cards, function and the operation of method and computer program product of the various embodiment of the application.In this, each square frame in flow chart or block diagram can represent a module, program segment or a part for code, and a part for described module, program segment or code comprises the executable instruction of one or more logic function for realizing regulation.It should also be noted that at some as in the realization replaced, the function marked in square frame can also occur to be different from the order marked in accompanying drawing.Such as, two square frames succeedingly represented can essentially perform substantially in parallel, and they can also perform sometimes in the opposite order, and this is depending on involved function.It will also be noted that, the combination of the square frame in each square frame in block diagram and/or flow chart and block diagram and/or flow chart, can realize by the special hardware based system of the function or operation that perform regulation, or can realize with the combination of specialized hardware with computer instruction.
It is described in the embodiment of the present application involved unit to realize by the way of software, it is also possible to realize by the way of hardware.Described unit can also be arranged within a processor, for example, it is possible to be described as: a kind of processor includes receiving unit, retrieval unit, scoring unit and sequencing unit.Wherein, the title of these unit is not intended that the restriction to this unit itself under certain conditions, such as, receives unit and is also described as " receiving the unit of the retrieval request of user ".
As on the other hand, present invention also provides a kind of nonvolatile computer storage media, this nonvolatile computer storage media can be the nonvolatile computer storage media described in above-described embodiment included in device;Can also be individualism, be unkitted the nonvolatile computer storage media allocating in terminal.Above-mentioned nonvolatile computer storage media storage has one or more program, when one or more program is performed by an equipment so that described equipment: receiving the retrieval request of user, wherein, described retrieval request includes search key;According to described search key, at least one predetermined social network sites is carried out search operaqtion, generate retrieval information aggregate;To in described retrieval information aggregate each retrieval information, according to the author information of the content of the site information of social network sites corresponding to this retrieval information social network sites corresponding with this retrieval information, this retrieval information is marked;According to scoring, each bar retrieval information is ranked up, generates the set of ranked retrieval information as retrieval result.
Above description is only the preferred embodiment of the application and the explanation to institute's application technology principle.Skilled artisan would appreciate that, invention scope involved in the application, it is not limited to the technical scheme of the particular combination of above-mentioned technical characteristic, also should contain in the case of without departing from described inventive concept, other technical scheme being carried out combination in any by above-mentioned technical characteristic or its equivalent feature and being formed simultaneously.Such as features described above and (but not limited to) disclosed herein have the technical characteristic of similar functions and replace mutually and the technical scheme that formed.

Claims (12)

1. a search method, it is characterised in that described method includes:
Receiving the retrieval request of user, wherein, described retrieval request includes search key;
According to described search key, at least one predetermined social network sites is carried out search operaqtion, generate retrieval information aggregate;
To in described retrieval information aggregate each retrieval information, according to the author information of the content of the site information of social network sites corresponding to this retrieval information social network sites corresponding with this retrieval information, this retrieval information is marked;
According to scoring, each bar retrieval information is ranked up, generates the set of ranked retrieval information as retrieval result.
Method the most according to claim 1, it is characterised in that described site information includes the website rank of described website.
3. according to the method one of claim 1-2 Suo Shu, it is characterized in that, described to each retrieval information in described retrieval information aggregate, the author information of the content of the social network sites that site information according to social network sites corresponding to this retrieval information is corresponding with this retrieval information, before marking this retrieval information, described method also includes:
Obtain the author information of the content of the site information of social network sites corresponding to the described retrieval information social network sites corresponding with described retrieval information.
Method the most according to claim 3, it is characterised in that the author information of the content of the social network sites that the site information of the social network sites that described acquisition described retrieval information is corresponding is corresponding with described retrieval information, including:
The author information of the content of the site information of social network sites corresponding to the described retrieval information social network sites corresponding with described retrieval information is captured by web crawlers technology.
Method the most according to claim 3, it is characterised in that described method also includes: receive the author information of site information, content information and/or the content of at least one predetermined social network sites active push described.
Method the most according to claim 5, it is characterised in that described author information include following at least one: author's essential information and author's behavioural information;Wherein, described author's essential information include following at least one: whether authors' name, author pass through official's certification of social network sites the corresponding grade of social network sites, author in the concerns quantity of corresponding social network sites and author;Described author's behavioural information include following at least one: the touching quantity of the content that the forwardings quantity of content that the replys quantity of content that the issuing time of content that author issues on corresponding social network sites, author issue on corresponding social network sites, author issue on corresponding social network sites, author issue on corresponding social network sites and author on the social network sites of correspondence the content of issue represent quantity.
7. a retrieval device, it is characterised in that described device includes:
Receiving unit, be configured to receive the retrieval request of user, wherein, described retrieval request includes search key;
Retrieval unit, is configured to, according to described search key, at least one predetermined social network sites is carried out search operaqtion, generates retrieval information aggregate;
Scoring unit, is configured to each retrieval information in described retrieval information aggregate, according to the author information of the content of the site information of social network sites corresponding to this retrieval information social network sites corresponding with this retrieval information, marks this retrieval information;
Sequencing unit, is configured to, according to scoring, be ranked up each bar retrieval information, generates the set of ranked retrieval information as retrieval result.
Device the most according to claim 7, it is characterised in that described site information includes the website rank of described website.
9. according to the device one of claim 7-8 Suo Shu, it is characterised in that described device also includes:
Acquiring unit, is configured to obtain the author information of the content of the site information of social network sites corresponding to the described retrieval information social network sites corresponding with described retrieval information.
Device the most according to claim 9, it is characterised in that described acquiring unit is configured to further:
The author information of the content of the site information of social network sites corresponding to the described retrieval information social network sites corresponding with described retrieval information is captured by web crawlers technology.
11. devices according to claim 9, it is characterised in that described device also includes:
Receive unit, be configured to receive the author information of site information, content information and/or the content of at least one predetermined social network sites active push described.
12. devices according to claim 11, it is characterised in that described author information include following at least one: author's essential information and author's behavioural information;Wherein, described author's essential information include following at least one: whether authors' name, author pass through official's certification of social network sites the corresponding grade of social network sites, author in the concerns quantity of corresponding social network sites and author;Described author's behavioural information include following at least one: the touching quantity of the content that the forwardings quantity of content that the replys quantity of content that the issuing time of content that author issues on corresponding social network sites, author issue on corresponding social network sites, author issue on corresponding social network sites, author issue on corresponding social network sites and author on the social network sites of correspondence the content of issue represent quantity.
CN201610170303.7A 2016-03-23 2016-03-23 Search method and device Active CN105824951B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610170303.7A CN105824951B (en) 2016-03-23 2016-03-23 Search method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610170303.7A CN105824951B (en) 2016-03-23 2016-03-23 Search method and device

Publications (2)

Publication Number Publication Date
CN105824951A true CN105824951A (en) 2016-08-03
CN105824951B CN105824951B (en) 2019-10-11

Family

ID=56524074

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610170303.7A Active CN105824951B (en) 2016-03-23 2016-03-23 Search method and device

Country Status (1)

Country Link
CN (1) CN105824951B (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107122414A (en) * 2017-03-31 2017-09-01 广东神马搜索科技有限公司 Search result recommends method, equipment, search engine and electronic equipment
CN113468425A (en) * 2021-06-30 2021-10-01 北京百度网讯科技有限公司 Knowledge content distribution method and device, electronic equipment and storage medium
CN114372190A (en) * 2022-03-22 2022-04-19 湖南大学 Internet mass data retrieval method and retrieval system

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102426610A (en) * 2012-01-13 2012-04-25 中国科学院计算技术研究所 Microblog rank searching method and microblog searching engine
CN102737090A (en) * 2012-03-21 2012-10-17 袁行远 Webpage searching result ordering method and device
CN103246670A (en) * 2012-02-09 2013-08-14 深圳市腾讯计算机系统有限公司 Microblog sorting, searching, display method and system
CN103455615A (en) * 2013-09-10 2013-12-18 中国地质大学(武汉) Method for sequencing filtering and retrieving WeChat accounts
CN103823906A (en) * 2014-03-19 2014-05-28 北京邮电大学 Multi-dimension searching sequencing optimization algorithm and tool based on microblog data
WO2014102734A1 (en) * 2012-12-27 2014-07-03 Ramana Ch Venkata Systems and methods for collecting, sorting and posting information on a social media profile

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102426610A (en) * 2012-01-13 2012-04-25 中国科学院计算技术研究所 Microblog rank searching method and microblog searching engine
CN103246670A (en) * 2012-02-09 2013-08-14 深圳市腾讯计算机系统有限公司 Microblog sorting, searching, display method and system
CN102737090A (en) * 2012-03-21 2012-10-17 袁行远 Webpage searching result ordering method and device
WO2014102734A1 (en) * 2012-12-27 2014-07-03 Ramana Ch Venkata Systems and methods for collecting, sorting and posting information on a social media profile
CN103455615A (en) * 2013-09-10 2013-12-18 中国地质大学(武汉) Method for sequencing filtering and retrieving WeChat accounts
CN103823906A (en) * 2014-03-19 2014-05-28 北京邮电大学 Multi-dimension searching sequencing optimization algorithm and tool based on microblog data

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107122414A (en) * 2017-03-31 2017-09-01 广东神马搜索科技有限公司 Search result recommends method, equipment, search engine and electronic equipment
CN113468425A (en) * 2021-06-30 2021-10-01 北京百度网讯科技有限公司 Knowledge content distribution method and device, electronic equipment and storage medium
CN114372190A (en) * 2022-03-22 2022-04-19 湖南大学 Internet mass data retrieval method and retrieval system
CN114372190B (en) * 2022-03-22 2022-05-17 湖南大学 Internet mass data retrieval method and retrieval system

Also Published As

Publication number Publication date
CN105824951B (en) 2019-10-11

Similar Documents

Publication Publication Date Title
CN108804450B (en) Information pushing method and device
CN107105031A (en) Information-pushing method and device
CN101000623A (en) Method for image identification search by mobile phone photographing and device using the method
US20120297296A1 (en) Contract authoring system and method
CN103605502B (en) Form page display method and server
CN105677931A (en) Information search method and device
CN103020226A (en) Method and device for acquiring search result
CN104239331A (en) Method and device for ranking comment search engines
CN102339311B (en) Method and equipment for searching webpage content on user equipment on basis of query classification
CN104919457A (en) Method and apparatus for enriching social media to improve personalized user experience
CN106774975A (en) Input method and device
CN107958078A (en) Information generating method and device
CN105488205A (en) Page generation method and page generation apparatus
CN105786207A (en) Information input method and device
CN112632139A (en) Information pushing method and device based on PMIS system, computer equipment and medium
CN103870553A (en) Input resource pushing method and system
CN107169077A (en) Method and apparatus for pushed information
EP2423837B1 (en) Method and system for viewing web page and computer program product thereof
CN112016290A (en) Automatic document typesetting method, device, equipment and storage medium
KR20080019392A (en) Document auto editing system and the method thereof
CN103544150A (en) Method and system for providing recommendation information for mobile terminal browser
CN105824951A (en) Retrieval method and retrieval device
CN107656910A (en) Method and apparatus for generating list
CN107729573A (en) Information-pushing method and device
CN110351672B (en) Information pushing method and device and electronic equipment

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant