CN104484414A - Processing method and device of favourite information - Google Patents

Processing method and device of favourite information

Info

Publication number: CN104484414A
Authority: CN (China)
Prior art keywords: information, search, searching, object search, collection information
Legal status: Granted; Active (as listed by Google Patents, not a legal conclusion)
Application number: CN201410784236.9A
Other languages: Chinese (zh)
Other versions: CN104484414B (en)
Inventor: 罗吉喜
Current Assignee: Beijing Qihoo Technology Co Ltd (as listed by Google Patents; may be inaccurate)
Original Assignee: Beijing Qihoo Technology Co Ltd; Qizhi Software Beijing Co Ltd
Application filed by: Beijing Qihoo Technology Co Ltd; Qizhi Software Beijing Co Ltd
Priority: CN201410784236.9A
Publications: CN104484414A (application); CN104484414B (granted)

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/955 Retrieval from the web using information identifiers, e.g. uniform resource locators [URL]
    • G06F16/9562 Bookmark management
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/953 Querying, e.g. by the use of web search engines
    • G06F16/9535 Search customisation based on user profiles and personalisation

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Transfer Between Computers (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

An embodiment of the invention provides a processing method and device for favourite information. The method comprises: when a first search request based on a first user identifier is received, returning a first search engine result page obtained by searching with a first search object of the first search request; when one or more items of first favourite information returned from the first search engine result page are received, establishing a first association among the first user identifier, the first search object, and the one or more items of first favourite information; when a second search request based on the first user identifier is received, searching with the first search object in the second search request to obtain a second search engine result page; according to the first association, embedding the one or more items of first favourite information associated with the first user identifier and the first search object into the second search engine result page; and returning the second search engine result page. The embodiment greatly improves ease of operation and privacy.

Description

Processing method and device for favourite information
Technical field
The present invention relates to the technical field of data processing, and in particular to a processing method for favourite information and a processing device for favourite information.
Background
With the rapid development of network technology, and especially with the arrival of the mobile Internet era, the amount of information on the network has increased sharply, and it includes a large number of web pages.
Users generally browse web pages with a browser, and browsers generally provide a favourites function. Favourites let a user record web pages that the user likes or frequently visits while online. The favourite information is placed in a folder that can be opened whenever the user wants to find a page.
Some browsers now provide network storage of favourite information: by logging in to an account in the same browser on different terminals, a user can load the favourite information previously saved under that account.
This way of saving favourites requires the same browser to be installed on every terminal, which is cumbersome, and all the favourite information of the account is displayed after login, so privacy is poor.
In addition, some websites provide dedicated online favourites: the website allocates a web page to the user, and the user can save favourite information on that page.
With this way of saving favourites, no specific browser needs to be installed, but any other user who loads the page can obtain the favourite information that the user saved, so privacy is poor.
Summary of the invention
In view of the above problems, the present invention is proposed to provide a processing method for favourite information, and a corresponding processing device for favourite information, that overcome the above problems or at least partially solve them.
According to one aspect of the present invention, a processing method for favourite information is provided, comprising:
when a first search request based on a first user identifier is received, returning a first search result page obtained by searching with a first search object of the first search request;
when one or more items of first favourite information returned from the first search result page are received, establishing a first association among the first user identifier, the first search object, and the one or more items of first favourite information;
when a second search request based on the first user identifier is received, searching with the first search object in the second search request to obtain a second search result page;
according to the first association, embedding the one or more items of first favourite information associated with the first user identifier and the first search object into the second search result page; and
returning the second search result page.
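The five steps above can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the names `FavouriteStore`, `save_favourites`, `lookup`, `build_result_page`, and the `search_fn` callback are hypothetical, a favourite is assumed to be a (title, URL) pair, and an in-memory dictionary stands in for whatever storage a real server would use.

```python
from collections import defaultdict
from typing import Callable, Dict, List, Tuple

Favourite = Tuple[str, str]  # (title, url); hypothetical shape


class FavouriteStore:
    """In-memory stand-in for the first association:
    (user identifier, search object) -> favourite information."""

    def __init__(self) -> None:
        self._assoc: Dict[Tuple[str, str], List[Favourite]] = defaultdict(list)

    def save_favourites(self, user_id: str, search_object: str,
                        favourites: List[Favourite]) -> None:
        # Establish (or extend) the association for this user and search object.
        bucket = self._assoc[(user_id, search_object)]
        for fav in favourites:
            if fav not in bucket:
                bucket.append(fav)

    def lookup(self, user_id: str, search_object: str) -> List[Favourite]:
        # Only the exact (user, search object) pair retrieves the favourites.
        return list(self._assoc.get((user_id, search_object), []))


def build_result_page(store: FavouriteStore, user_id: str, search_object: str,
                      search_fn: Callable[[str], List[Favourite]]) -> dict:
    """Run the search, then embed favourites associated with the same key."""
    return {
        "query": search_object,
        "results": search_fn(search_object),
        "favourites": store.lookup(user_id, search_object),
    }
```

In this sketch a different user, or the same user with a different search object, receives an empty `favourites` list, which mirrors the idea of using the search object as the entry point to the favourite information.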
Optionally, the method further comprises:
in the first association, adding first tag information to the one or more items of first favourite information.
Optionally, before the step of returning the second search result page, the method further comprises:
when the first user identifier has tag subscription information, looking up one or more matching second associations, where a second association is an association among a second user identifier, a second search object, and one or more items of second favourite information; the one or more items of second favourite information have second tag information; and the tag subscription information matches the second tag information and/or the first search object matches the second search object;
extracting the one or more items of second favourite information from the one or more second associations; and
embedding the one or more items of second favourite information into the second search result page.
Optionally, the step of extracting the one or more items of second favourite information from the one or more second associations comprises:
comparing the second favourite information in the one or more second associations; and
extracting one or more identical items of second favourite information.
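The lookup and comparison steps above might look like the following sketch. Several points are assumptions made for illustration: the dictionary keys (`search_object`, `tag`, `favourites`) and both function names are hypothetical, a single tag is attached to each association for brevity, the "and/or" matching condition is implemented in its stricter "and" form, and "identical" is read as "present in every matched association".

```python
from typing import Dict, List, Set, Tuple

Favourite = Tuple[str, str]  # (title, url); hypothetical shape


def match_second_associations(associations: List[Dict],
                              subscribed_tags: Set[str],
                              search_object: str) -> List[Dict]:
    """Keep associations whose tag is subscribed and whose search object matches.

    The source allows "and/or"; this sketch uses the stricter "and" variant.
    """
    return [a for a in associations
            if a["tag"] in subscribed_tags and a["search_object"] == search_object]


def common_favourites(associations: List[Dict]) -> List[Favourite]:
    """Return favourites present in every matched association, in first-seen order."""
    if not associations:
        return []
    shared = set(associations[0]["favourites"])
    for assoc in associations[1:]:
        shared &= set(assoc["favourites"])
    return [f for f in associations[0]["favourites"] if f in shared]
```

A looser reading, for example keeping any favourite that appears in at least two associations, would only change `common_favourites`.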
Optionally, the method further comprises:
when a processing request for the one or more items of first favourite information is received, configuring feature URL information for the one or more items of first favourite information.
Optionally, the method further comprises:
when a load request sent based on the feature URL information is received, returning the one or more items of first favourite information.
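One way to picture the feature URL mechanism is the sketch below. The source does not say how the feature URL is generated, so the random token, the `BASE_URL` value, and the function names `configure_feature_url` and `load_by_feature_url` are all assumptions made for illustration.

```python
import secrets
from typing import Dict, List, Tuple

Favourite = Tuple[str, str]  # (title, url); hypothetical shape

BASE_URL = "https://example.com/f/"  # hypothetical address prefix
_feature_urls: Dict[str, List[Favourite]] = {}


def configure_feature_url(favourites: List[Favourite]) -> str:
    """Assign a hard-to-guess URL to a set of favourites (assumed scheme)."""
    url = BASE_URL + secrets.token_urlsafe(16)
    _feature_urls[url] = list(favourites)
    return url


def load_by_feature_url(url: str) -> List[Favourite]:
    """Return the favourites configured for this feature URL, if any."""
    return list(_feature_urls.get(url, []))
```

Anyone who holds the URL can load the favourites in this sketch, so the length of the random token is what limits accidental discovery.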
Optionally, the first favourite information includes URL information and a title, and the second favourite information includes URL information and a title.
Optionally, the step of embedding, according to the first association, the one or more items of first favourite information associated with the first user identifier and the first search object into the second search result page comprises:
looking up, in the preset first association, the one or more items of first favourite information associated with the first user identifier and the first search object; and
embedding the one or more items of first favourite information into the second search result page.
Optionally, the step of returning the first search result page obtained by searching with the first search object of the first search request comprises:
extracting the first search object from the first search request;
when the first search object is text information, searching a database for web pages that match the text information, the web pages having summary information;
generating the first search result page from the summary information of the web pages; and
returning the first search result page.
Optionally, the step of returning the first search result page obtained by searching with the first search object of the first search request comprises:
extracting the first search object from the first search request;
when the first search object is image information, identifying in a database web page image information that is similar or identical to the image information;
generating the first search result page from the web page image information; and
returning the first search result page.
Optionally, the step of returning the first search result page obtained by searching with the first search object of the first search request comprises:
extracting the first search object from the first search request;
when the first search object is audio data, recognising the feature text information corresponding to the audio data;
searching a database for web pages that match the feature text information, the web pages having second summary information;
generating the first search result page from the summary information of the web pages; and
returning the first search result page.
Optionally, the step of searching with the first search object in the second search request to obtain the second search result page comprises:
extracting the first search object from the second search request;
when the first search object is text information, searching a database for web pages that match the text information, the web pages having summary information; and
generating the second search result page from the summary information of the web pages.
Optionally, the step of searching with the first search object in the second search request to obtain the second search result page comprises:
extracting the first search object from the second search request;
when the first search object is image information, identifying in a database web page image information that is similar or identical to the image information; and
generating the second search result page from the web page image information.
Optionally, the step of searching with the first search object in the second search request to obtain the second search result page comprises:
extracting the first search object from the second search request;
when the first search object is audio data, recognising the feature text information corresponding to the audio data;
searching a database for web pages that match the feature text information, the web pages having second summary information; and
generating the second search result page from the summary information of the web pages.
According to another aspect of the present invention, a processing device for favourite information is provided, comprising:
a first search module, adapted to, when a first search request based on a first user identifier is received, return a first search result page obtained by searching with a first search object of the first search request;
an establishing module, adapted to, when one or more items of first favourite information returned from the first search result page are received, establish a first association among the first user identifier, the first search object, and the one or more items of first favourite information;
a second search module, adapted to, when a second search request based on the first user identifier is received, search with the first search object in the second search request to obtain a second search result page;
a first embedding module, adapted to embed, according to the first association, the one or more items of first favourite information associated with the first user identifier and the first search object into the second search result page; and
a first returning module, adapted to return the second search result page.
Optionally, the device further comprises:
an adding module, adapted to add, in the first association, first tag information to the one or more items of first favourite information.
Optionally, the device further comprises:
a lookup module, adapted to, when the first user identifier has tag subscription information, look up one or more matching second associations, where a second association is an association among a second user identifier, a second search object, and one or more items of second favourite information; the one or more items of second favourite information have second tag information; and the tag subscription information matches the second tag information and/or the first search object matches the second search object;
an extraction module, adapted to extract the one or more items of second favourite information from the one or more second associations; and
a second embedding module, adapted to embed the one or more items of second favourite information into the second search result page.
Optionally, the extraction module is further adapted to:
compare the second favourite information in the one or more second associations; and
extract one or more identical items of second favourite information.
Optionally, the device further comprises:
a configuration module, adapted to, when a processing request for the one or more items of first favourite information is received, configure feature URL information for the one or more items of first favourite information.
Optionally, the device further comprises:
a second returning module, adapted to, when a load request sent based on the feature URL information is received, return the one or more items of first favourite information.
Optionally, the first favourite information includes URL information and a title, and the second favourite information includes URL information and a title.
Optionally, the first embedding module is further adapted to:
look up, in the preset first association, the one or more items of first favourite information associated with the first user identifier and the first search object; and
embed the one or more items of first favourite information into the second search result page.
Optionally, the first search module is further adapted to:
extract the first search object from the first search request;
when the first search object is text information, search a database for web pages that match the text information, the web pages having summary information;
generate the first search result page from the summary information of the web pages; and
return the first search result page.
Optionally, the first search module is further adapted to:
extract the first search object from the first search request;
when the first search object is image information, identify in a database web page image information that is similar or identical to the image information;
generate the first search result page from the web page image information; and
return the first search result page.
Optionally, the first search module is further adapted to:
extract the first search object from the first search request;
when the first search object is audio data, recognise the feature text information corresponding to the audio data;
search a database for web pages that match the feature text information, the web pages having second summary information;
generate the first search result page from the summary information of the web pages; and
return the first search result page.
Optionally, the second search module is further adapted to:
extract the first search object from the second search request;
when the first search object is text information, search a database for web pages that match the text information, the web pages having summary information; and
generate the second search result page from the summary information of the web pages.
Optionally, the second search module is further adapted to:
extract the first search object from the second search request;
when the first search object is image information, identify in a database web page image information that is similar or identical to the image information; and
generate the second search result page from the web page image information.
Optionally, the second search module is further adapted to:
extract the first search object from the second search request;
when the first search object is audio data, recognise the feature text information corresponding to the audio data;
search a database for web pages that match the feature text information, the web pages having second summary information; and
generate the second search result page from the summary information of the web pages.
For the first search request, the embodiment of the present invention establishes a first association among the first user identifier, the first search object, and one or more items of first favourite information; for the second search request, it returns the one or more items of first favourite information according to this first association. On the one hand, the favourite information is displayed on a page, so no specific browser needs to be installed, which makes operation simpler. On the other hand, the first search object serves as the entry point for displaying the favourite information, so the favourite information is not loaded directly by logging in to an account or loading a particular web page, which greatly improves privacy.
The embodiment of the present invention uses text information, image information, audio information, and the like as search objects. Text information is easy to enter, which keeps operation simple. Image information and audio information are more complex, which reduces the probability of the same search object being entered, as can happen with text information, increases the complexity of the search object, and further improves privacy.
In the association, the embodiment of the present invention adds tag information to one or more items of favourite information, so that through matching tag subscription information and a matching search object a user can directly obtain information that other users have previously organised. Because manually organised information is often more useful than the information a search engine returns mechanically, the user does not have to repeat tedious manual filtering of massive amounts of web page information. This reduces the user's time and effort, reduces the system resource consumption of the user equipment and the website, reduces network bandwidth usage, and greatly improves the efficiency, quality, and capacity of information acquisition.
The embodiment of the present invention configures feature URL information for the favourite information, so that loading the feature URL information retrieves the favourite information and a user can directly obtain information that other users have previously organised. Because manually organised information is often more useful than the information a search engine returns mechanically, the user does not have to repeat tedious manual filtering of massive amounts of web page information. This reduces the user's time and effort, reduces the system resource consumption of the user equipment and the website, reduces network bandwidth usage, and greatly improves the efficiency, quality, and capacity of information acquisition.
The above is only an overview of the technical solution of the present invention. So that the technical means of the present invention can be understood more clearly and implemented according to the content of the specification, and so that the above and other objects, features, and advantages of the present invention become more apparent, specific embodiments of the present invention are set out below.
Brief description of the drawings
Various other advantages and benefits will become clear to those of ordinary skill in the art from reading the following detailed description of the preferred embodiments. The drawings serve only to illustrate the preferred embodiments and are not to be considered limiting of the present invention. Throughout the drawings, the same reference symbols denote the same components. In the drawings:
Fig. 1 shows a flow chart of the steps of embodiment 1 of a processing method for favourite information according to an embodiment of the present invention;
Fig. 2 shows an example diagram of adding favourite information according to an embodiment of the present invention;
Fig. 3 shows an example diagram of displaying favourite information according to an embodiment of the present invention;
Fig. 4 shows a flow chart of the steps of embodiment 2 of a processing method for favourite information according to an embodiment of the present invention;
Fig. 5 shows an example diagram of adding tag subscription information according to an embodiment of the present invention;
Fig. 6 shows a structural block diagram of embodiment 1 of a processing device for favourite information according to an embodiment of the present invention; and
Fig. 7 shows a structural block diagram of embodiment 2 of a processing device for favourite information according to an embodiment of the present invention.
Detailed description of the embodiments
Exemplary embodiments of the present disclosure are described in more detail below with reference to the accompanying drawings. Although the drawings show exemplary embodiments of the present disclosure, it should be understood that the disclosure can be implemented in various forms and should not be limited by the embodiments set forth here. Rather, these embodiments are provided so that the disclosure can be understood more thoroughly and its scope fully conveyed to those skilled in the art.
Referring to Fig. 1, a flow chart of the steps of embodiment 1 of a processing method for favourite information according to an embodiment of the present invention is shown. The method may specifically include the following steps:
Step 101: when a first search request based on a first user identifier is received, return a first search result page obtained by searching with a first search object of the first search request.
In a specific implementation, the user can access the server (such as a search engine) from any electronic device. The electronic device may be a mobile device, such as a mobile phone, a PDA (Personal Digital Assistant), a laptop computer, or a palmtop computer, or a fixed device, such as a personal computer or a smart television.
These electronic devices may run operating systems including Android, iOS, Windows Phone, or Windows, and can usually run a browser or an application with a built-in mini browser.
The first search request may refer to an instruction, sent by the user, to search for information related to a certain search object.
For example, the user may initiate the first search request by entering a search object on the web page of a search engine, or by entering a search object in a search plug-in of a browser (a plug-in that adds a search function to the browser by interacting with the browser, the search engine, and so on). When the user clicks the search control on the search engine web page, this is equivalent to receiving an instruction to initiate a first search request to the search engine; likewise, when the user enters a search object in the search plug-in and clicks the confirm button or presses the Enter key, this is also equivalent to receiving an instruction to initiate a first search request to the search engine.
The first search request may include the first user identifier and the first search object.
The first user identifier may be information that uniquely identifies a user, for example a user ID (short for IDentity, an identification number) or other information bound to the user ID, such as an e-mail address or a telephone number.
The first search object may include text information, picture information, audio information, and the like; the embodiment of the present invention does not limit this.
In practical applications, the browser, or an application with a built-in mini browser, may send the first search request, with its request header information, to the server hosting the search engine over HTTP (Hypertext Transfer Protocol). After receiving the request, the server queues it for processing and finally returns a response to the browser or the application with a built-in mini browser.
In the embodiment of the present invention, when the first search object submitted by the user is received, relevant information can be quickly retrieved from a database according to the first search object, the relevance between the information and the query can be evaluated, and the results to be output can be sorted and returned to the browser or the application with a built-in mini browser.
In one embodiment of the present invention, step 101 may include the following sub-steps:
Sub-step S11: extract the first search object from the first search request;
Sub-step S12: when the first search object is text information, search a database for web pages that match the text information, the web pages having summary information;
Sub-step S13: generate the first search result page from the summary information of the web pages;
Sub-step S14: return the first search result page.
In a specific implementation, if the first search object is text information, relevant web pages can be searched for by means such as an inverted index.
Taking a search engine as an example, the search flow of a search engine is divided into two parts: the front-end user request process and the back-end data preparation process.
I. Front-end user request process:
1. Receive the request: receive the text information that the user enters in the search engine;
2. Query analysis: perform word segmentation on the text information;
3. Retrieval: according to the word segmentation result, look up candidate web pages related to the segmented words in the inverted index built in advance;
4. Ranking: rank the candidate web pages along dimensions such as content relevance and timeliness;
5. Presentation: display the summary information of the ranked web pages on the search result page.
II. Back-end data preparation process:
1. Web page crawling: use crawler technology to follow the links between web pages, fetch web pages from the Internet, and store them.
2. Index building: analyse the stored web pages, for example by performing word segmentation on the page titles and page text, and build the inverted index from the word segmentation results for use by the front-end user request process.
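The index-building and retrieval steps above can be illustrated with a small inverted index. This is a simplified sketch: the function names are hypothetical, a lowercase alphanumeric split stands in for real word segmentation (which for Chinese text would need a dedicated segmenter), and ranking counts matched query terms only, rather than the relevance and timeliness dimensions the text mentions.

```python
import re
from collections import defaultdict
from typing import Dict, List, Set


def tokenize(text: str) -> List[str]:
    # Stand-in for word segmentation: lowercase alphanumeric runs.
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(pages: Dict[str, str]) -> Dict[str, Set[str]]:
    """Back end: map each term to the set of page URLs that contain it."""
    index: Dict[str, Set[str]] = defaultdict(set)
    for url, text in pages.items():
        for term in tokenize(text):
            index[term].add(url)
    return index


def search(index: Dict[str, Set[str]], query: str) -> List[str]:
    """Front end: retrieve candidate pages and rank by number of matched terms."""
    scores: Dict[str, int] = defaultdict(int)
    for term in set(tokenize(query)):
        for url in index.get(term, ()):
            scores[url] += 1
    # Highest score first; ties broken by URL so the order is deterministic.
    return sorted(scores, key=lambda u: (-scores[u], u))
```

A result page would then be assembled from the stored summary of each returned URL.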
In a kind of embodiment of the present invention, step 101 can comprise following sub-step:
Sub-step S21, extracts the first object search in described first searching request;
Sub-step S22, when described first object search is image information, identifies the Web page image information similar or identical with described image information in a database;
Sub-step S23, adopts described Web page image information to generate the first result of page searching;
Sub-step S24, returns described first result of page searching.
In a specific implementation, if the first search object is image information, similar or identical web page image information may be found by means such as image similarity.
In the embodiments of the present invention, feature information may be extracted from the image information and from the web page image information in order to compute their similarity.
The feature information may include at least one of shape feature information and color feature information. Shape feature information refers to information characterizing the shape of an image, and color feature information refers to information characterizing the color of an image.
Shape feature information is represented mainly in two ways: region features, which concern the entire shape region of the image, and contour features, which concern the outer boundary of the object.
Typical methods for extracting shape feature information include the boundary feature method (the outer boundary of the image), the geometric parameter method (geometric parameterization of the image), the shape invariant moment method (finding moment-invariant features of the image), and the Fourier shape descriptor method (Fourier transform).
Color feature information is described by the color features of an image or image region, and is a global feature.
Typical methods for extracting color feature information include color histograms, color sets, and color moments.
Of course, the above feature information is only an example. When implementing the embodiments of the present invention, other feature information may be set according to actual conditions, and the embodiments of the present invention are not limited in this respect.
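The color histogram approach mentioned above can be sketched as follows. This is a minimal illustration and not the method of the embodiment: the function names, the representation of an image as a list of RGB tuples, the choice of 4 bins per channel, and the use of histogram intersection as the similarity measure are all assumptions made for the example.

```python
from collections import Counter

def color_histogram(pixels, bins=4):
    """Quantize each RGB channel into `bins` levels and return a normalized histogram.

    `pixels` is a non-empty list of (r, g, b) tuples with values in 0..255.
    """
    step = 256 // bins
    counts = Counter((r // step, g // step, b // step) for r, g, b in pixels)
    total = sum(counts.values())
    return {bucket: n / total for bucket, n in counts.items()}

def histogram_intersection(h1, h2):
    """Similarity in [0, 1]; 1.0 means the two color distributions coincide."""
    return sum(min(weight, h2.get(bucket, 0.0)) for bucket, weight in h1.items())

# Hypothetical query image and two candidate web page images.
query = [(250, 10, 10)] * 3 + [(10, 10, 250)]
similar_page_image = [(240, 5, 5)] * 3 + [(5, 5, 240)]
different_page_image = [(10, 250, 10)] * 4

sim_same = histogram_intersection(color_histogram(query), color_histogram(similar_page_image))
sim_diff = histogram_intersection(color_histogram(query), color_histogram(different_page_image))
```

A search engine could rank candidate web page images by such a score; in practice a shape feature would likely be combined with the color feature, as the text suggests.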
In one embodiment of the present invention, step 101 may include the following sub-steps:
Sub-step S31, extracting the first search object from the first search request;
Sub-step S32, when the first search object is audio data, recognizing the feature text information corresponding to the audio data;
Sub-step S33, searching a database for web pages that match the feature text information, where the web pages have second summary information;
Sub-step S34, generating the first search results page from the summary information of the web pages;
Sub-step S35, returning the first search results page.
In a specific implementation, if the first search object is audio data, the text information corresponding to the audio data may be recognized, and relevant web pages may then be searched by means such as an inverted index.
In practical applications, the electronic device may capture the audio data uttered by the user through a sound capture device such as a microphone, or may directly upload audio data that has already been captured. Automatic speech recognition (ASR) then converts the lexical content of the human speech (i.e., the speech data) into computer-readable input (i.e., text information).
At present, speech recognition technology is usually implemented by a speech recognition system. Mainstream large-vocabulary speech recognition systems mostly adopt statistical pattern recognition. A typical speech recognition system based on statistical pattern recognition consists of the following basic modules:
1. Signal processing and feature extraction module. The main task of this module is to extract features from the audio data for processing by the acoustic model. It generally also includes some signal processing techniques to reduce, as far as possible, the influence of environmental noise, the channel, the speaker, and similar factors on the features.
2. Acoustic model. Speech recognition systems mostly model acoustics with first-order hidden Markov models (HMM).
3. Pronunciation dictionary. The pronunciation dictionary contains the vocabulary that the speech recognition system can handle, together with its pronunciations. In effect, it provides the mapping between the acoustic model and the language model.
4. Language model. The language model models the language targeted by the speech recognition system. In theory, any language model, including regular languages and context-free grammars, can be used, but current systems generally still adopt statistics-based N-gram models and their variants.
5. Decoder. The decoder is one of the core components of a speech recognition system. Its task is, for an input signal, to find the word string that outputs this signal with the maximum probability, according to the acoustic model, the language model, and the dictionary. The relationship among the above modules can be understood more clearly from a mathematical perspective.
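The relationship among the modules above is commonly written as follows in statistical speech recognition. The source does not give the formula, so the notation is an assumption introduced for illustration: $X$ denotes the feature sequence produced by the signal processing and feature extraction module, and $W$ denotes a candidate word string.

```latex
\hat{W} = \arg\max_{W} P(W \mid X) = \arg\max_{W} P(X \mid W)\, P(W)
```

The second equality follows from Bayes' rule, dropping $P(X)$ because it does not depend on $W$. Under this reading, $P(X \mid W)$ is supplied by the acoustic model through the pronunciation dictionary, $P(W)$ is supplied by the language model, and the decoder carries out the search for the maximizing word string.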
Of course, the above search objects and search methods are only examples. When implementing the embodiments of the present invention, other search objects and search methods may be set according to actual conditions, and the embodiments of the present invention are not limited in this respect. In addition to the above, those skilled in the art may also adopt other search objects and search methods according to actual needs, and the embodiments of the present invention are not limited in this respect either.
The embodiments of the present invention use text information, image information, audio information, and the like as search objects. Text information is convenient to enter, which ensures simplicity. Image information and audio information are more complex, which reduces the probability that the same information is entered, increases the complexity of the search object, and further improves privacy.
Step 102, when one or more pieces of first collection information returned via the first search results page are received, establishing a first association relation among the first user identifier, the first search object, and the one or more pieces of first collection information;
Under the HTTP protocol, the browser, or an application with an embedded mini-browser, may receive a document of the HTML (Hypertext Markup Language) type from the server hosting the search engine.
The browser or the application may parse the HTML document and generate a tree of objects, namely the DOM (Document Object Model). Each object is a node in the DOM, and these objects may represent web page resources such as text and images. The browser or the application may begin displaying the HTML document and obtain the addresses of the web page resources embedded in it; the browser then sends further requests to the server to fetch these resources, and the first search results page is displayed in the HTML document of the browser or the application.
In a specific implementation, a control for entering first collection information may be provided on the first search results page, and the user may enter the first collection information through this control.
The first collection information may include URL information and a title.
For example, as shown in Fig. 2, if the user has entered the first search object 201 "learning materials", the first search results page may provide the control 501 shown in Fig. 5, or may provide the control 202 and the control 203 shown in Fig. 2. The control 202 may be used to enter URL information, and the control 203 may be used to enter a title. For instance, entering "library.ABC.com" in the control 202 and "library" in the control 203 generates one piece of first collection information; entering "english.ABC.com" in the control 202 and "English materials" in the control 203 generates another; entering "chinese.ABC.com" in the control 202 and "Chinese language data" in the control 203 generates another; and so on.
In a specific implementation, the search engine may establish the first association relation among the first user identifier, the first search object, and the one or more pieces of first collection information, and store it in a database, thereby confirming that the collection information has been generated.
Because the one or more pieces of collection information belong to the same first search object, the first association relation can be figuratively called a collection box, and the first search object can serve as the key that opens this collection box.
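The "collection box" idea can be sketched as a mapping keyed by the pair (user identifier, search object). This is only an illustrative sketch: the class name `CollectionStore`, its method names, and the in-memory dictionary standing in for the database are assumptions, not part of the described embodiment.

```python
from collections import defaultdict

class CollectionStore:
    """Hypothetical in-memory stand-in for the database of association relations."""

    def __init__(self):
        # (user_id, search_object) -> list of collected items ("collection box")
        self._boxes = defaultdict(list)

    def add(self, user_id, search_object, url, title):
        """Associate one piece of collection information with the user and search object."""
        self._boxes[(user_id, search_object)].append({"url": url, "title": title})

    def lookup(self, user_id, search_object):
        """Open the box with the search object as the key; empty list if none exists."""
        return list(self._boxes.get((user_id, search_object), []))

store = CollectionStore()
store.add("user-1", "learning materials", "library.ABC.com", "library")
store.add("user-1", "learning materials", "english.ABC.com", "English materials")
box = store.lookup("user-1", "learning materials")
```

Keying on both values means a different user, or the same user with a different search object, gets an empty box, which is the privacy property the text relies on.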
Step 103, when a second search request based on the first user identifier is received, searching with the first search object in the second search request to obtain a second search results page;
In a specific implementation, the second search request may refer to an instruction, issued by the user, to search for information related to a certain search object.
For example, the user may initiate the second search request by entering a search object on the web page of a search engine, or by entering a search object in a search plug-in of the browser (a plug-in that adds a search function to the browser by interacting with the browser, the search engine, and so on). When the user clicks the search control on the search engine's web page, this is equivalent to receiving an instruction to initiate the second search request based on the search engine; likewise, when the user enters a search object in the search plug-in and clicks the confirm button or presses the Enter key, this is also equivalent to receiving such an instruction.
The second search request may include the first user identifier and the first search object;
In practical applications, the browser, or an application with an embedded mini-browser, may initiate the search request by sending request header information to the server hosting the search engine over HTTP (Hypertext Transfer Protocol). After receiving the request, the server queues it for processing and finally returns a response to the browser or the application.
In the embodiments of the present invention, the user may search again for the same first search object. When the search engine receives the first search object submitted by the user, it may quickly retrieve the relevant information from the database according to this first search object, evaluate the relevance between the information and the query, and rank the results to be output.
In one embodiment of the present invention, step 103 may include the following sub-steps:
Sub-step S41, extracting the first search object from the second search request;
Sub-step S42, when the first search object is text information, searching a database for web pages that match the text information, where the web pages have summary information;
Sub-step S43, generating the second search results page from the summary information of the web pages.
In a specific implementation, if the first search object is text information, relevant web pages may be searched by means such as an inverted index.
Taking a search engine as an example, its search flow is divided into two parts: the front-end user request process and the back-end data preparation process.
I. Front-end user request process:
1. Receive the request: receive the text information entered by the user in the search engine;
2. Query analysis: perform word segmentation on the text information;
3. Retrieval: according to the segmentation results, look up candidate web pages related to the segmentation results in the pre-built inverted index;
4. Ranking: rank the candidate web pages along dimensions such as content relevance and timeliness;
5. Presentation: display the summary information of the ranked web pages on the search results page.
II. Back-end data preparation process:
1. Web crawling: use crawler technology to fetch web pages from the Internet by following the links between pages, and save them.
2. Index building: analyze the crawled and saved web pages, for example by performing word segmentation on page titles and page text, and build an inverted index from the segmentation results for use by the front-end user request process.
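The two processes above can be sketched together in a few lines. This is a toy illustration under stated assumptions: whitespace splitting stands in for real word segmentation, the number of matching query terms stands in for content relevance, and the page identifiers, field names, and function names are all hypothetical.

```python
from collections import defaultdict

def tokenize(text):
    """Stand-in for word segmentation: lowercase and split on whitespace."""
    return text.lower().split()

def build_inverted_index(pages):
    """Back end: map each term in a page's title and body to the pages containing it."""
    index = defaultdict(set)
    for page_id, page in pages.items():
        for term in tokenize(page["title"] + " " + page["body"]):
            index[term].add(page_id)
    return index

def search(index, pages, query):
    """Front end: segment the query, retrieve candidates, rank, return summaries."""
    scores = defaultdict(int)
    for term in tokenize(query):
        for page_id in index.get(term, ()):
            scores[page_id] += 1  # crude relevance: count of matching query terms
    ranked = sorted(scores, key=lambda pid: (-scores[pid], pid))
    return [pages[pid]["summary"] for pid in ranked]

# Hypothetical crawled pages.
pages = {
    "p1": {"title": "English learning materials", "body": "grammar and vocabulary",
           "summary": "English materials"},
    "p2": {"title": "Cycling forum", "body": "routes and gear",
           "summary": "Cycling forum"},
    "p3": {"title": "Learning to cook", "body": "recipes",
           "summary": "Cooking basics"},
}
index = build_inverted_index(pages)
results = search(index, pages, "learning materials")
```

Page `p1` matches both query terms and `p3` matches one, so `p1` is ranked first; a production engine would also weigh timeliness and other dimensions, as the text notes.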
In one embodiment of the present invention, step 103 may include the following sub-steps:
Sub-step S51, extracting the first search object from the second search request;
Sub-step S52, when the first search object is image information, identifying in a database web page image information that is similar or identical to the image information;
Sub-step S53, generating the second search results page from the web page image information.
In a specific implementation, if the first search object is image information, similar or identical web page image information may be found by means such as image similarity.
In the embodiments of the present invention, feature information may be extracted from the image information and from the web page image information in order to compute their similarity.
The feature information may include at least one of shape feature information and color feature information. Shape feature information refers to information characterizing the shape of an image, and color feature information refers to information characterizing the color of an image.
Shape feature information is represented mainly in two ways: region features, which concern the entire shape region of the image, and contour features, which concern the outer boundary of the object.
Typical methods for extracting shape feature information include the boundary feature method (the outer boundary of the image), the geometric parameter method (geometric parameterization of the image), the shape invariant moment method (finding moment-invariant features of the image), and the Fourier shape descriptor method (Fourier transform).
Color feature information is described by the color features of an image or image region, and is a global feature.
Typical methods for extracting color feature information include color histograms, color sets, and color moments.
Of course, the above feature information is only an example. When implementing the embodiments of the present invention, other feature information may be set according to actual conditions, and the embodiments of the present invention are not limited in this respect.
In one embodiment of the present invention, step 103 may include the following sub-steps:
Sub-step S61, extracting the first search object from the second search request;
Sub-step S62, when the first search object is audio data, recognizing the feature text information corresponding to the audio data;
Sub-step S63, searching a database for web pages that match the feature text information, where the web pages have second summary information;
Sub-step S64, generating the second search results page from the summary information of the web pages.
In a specific implementation, if the first search object is audio data, the text information corresponding to the audio data may be recognized, and relevant web pages may then be searched by means such as an inverted index.
In practical applications, the electronic device may capture the audio data uttered by the user through a sound capture device such as a microphone, or may directly upload audio data that has already been captured. Automatic speech recognition (ASR) then converts the lexical content of the human speech (i.e., the speech data) into computer-readable input (i.e., text information).
At present, speech recognition technology is usually implemented by a speech recognition system. Mainstream large-vocabulary speech recognition systems mostly adopt statistical pattern recognition. A typical speech recognition system based on statistical pattern recognition consists of the following basic modules:
1. Signal processing and feature extraction module. The main task of this module is to extract features from the audio data for processing by the acoustic model. It generally also includes some signal processing techniques to reduce, as far as possible, the influence of environmental noise, the channel, the speaker, and similar factors on the features.
2. Acoustic model. Speech recognition systems mostly model acoustics with first-order hidden Markov models (HMM).
3. Pronunciation dictionary. The pronunciation dictionary contains the vocabulary that the speech recognition system can handle, together with its pronunciations. In effect, it provides the mapping between the acoustic model and the language model.
4. Language model. The language model models the language targeted by the speech recognition system. In theory, any language model, including regular languages and context-free grammars, can be used, but current systems generally still adopt statistics-based N-gram models and their variants.
5. Decoder. The decoder is one of the core components of a speech recognition system. Its task is, for an input signal, to find the word string that outputs this signal with the maximum probability, according to the acoustic model, the language model, and the dictionary. The relationship among the above modules can be understood more clearly from a mathematical perspective.
Of course, the above search objects and search methods are only examples. When implementing the embodiments of the present invention, other search objects and search methods may be set according to actual conditions, and the embodiments of the present invention are not limited in this respect. In addition to the above, those skilled in the art may also adopt other search objects and search methods according to actual needs, and the embodiments of the present invention are not limited in this respect either.
Step 104, according to the first association relation, embedding the one or more pieces of first collection information associated with the first user identifier and the first search object into the second search results page;
In the embodiments of the present invention, if the user has previously collected first collection information for the first search object, then when the user later searches for this first search object, the search engine may retrieve the previously collected first collection information and embed it in the second search results page.
In one embodiment of the present invention, step 104 may include the following sub-steps:
Sub-step S71, looking up, in the preset first association relation, the one or more pieces of first collection information associated with the first user identifier and the first search object;
Sub-step S72, embedding the one or more pieces of first collection information into the second search results page.
With the embodiments of the present invention, the user may collect one or more pieces of first collection information under the first search object in advance, and the search engine establishes the first association relation among the first user identifier, the first search object, and the one or more pieces of first collection information.
Through this first association relation, the search engine can then find the one or more pieces of first collection information associated with the first user identifier and the first search object.
The search engine may embed the one or more pieces of first collection information in the second search results page and return it to the browser or to the application with an embedded mini-browser.
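Sub-step S72 can be sketched as a server-side function that prepends the collected items to the results markup. This is a minimal sketch under assumptions: the function name `embed_collections`, the `collection-box` class name, the `http://` prefix added to the stored URL information, and the choice to place the block above the results are all hypothetical and not taken from the source.

```python
from html import escape

def embed_collections(results_html, collections):
    """Return the results page markup with the user's collected links embedded on top.

    `collections` is a list of {"url": ..., "title": ...} dicts, as looked up
    from the association relation in sub-step S71.
    """
    if not collections:
        return results_html  # nothing collected for this search object
    items = "".join(
        '<li><a href="http://{url}" target="_blank">{title}</a></li>'.format(
            url=escape(item["url"], quote=True),
            title=escape(item["title"]),
        )
        for item in collections
    )
    return '<ul class="collection-box">' + items + "</ul>" + results_html

page = embed_collections(
    "<ol><li>ordinary result</li></ol>",
    [{"url": "library.ABC.com", "title": "library"}],
)
```

Escaping the stored URL and title before interpolation matters here because both values were typed in by the user.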
Step 105, returning the second search results page.
Under the HTTP protocol, the browser, or an application with an embedded mini-browser, may receive a document of the HTML (Hypertext Markup Language) type from the server hosting the search engine.
The browser or the application may parse the HTML document and generate a tree of objects, namely the DOM (Document Object Model). Each object is a node in the DOM, and these objects may represent web page resources such as text and images. The browser or the application may begin displaying the HTML document and obtain the addresses of the web page resources embedded in it; the browser then sends further requests to the server to fetch these resources, and the second search results page is displayed in the HTML document of the browser or the application.
For example, as shown in Fig. 3, if the user has entered the first search object 301 "learning materials", the second search results page may provide the control 303 and the control 304. The control 303 may be used to load a title, and the control 304 may be used to load URL information. For instance, "library.ABC.com" is loaded in the control 304 and "library" in the control 303; "english.ABC.com" is loaded in the control 304 and "English materials" in the control 303; "chinese.ABC.com" is loaded in the control 304 and "Chinese language data" in the control 303; and so on.
It should be noted that, once the first collection information previously collected by the user has been loaded in the browser or in the application with an embedded mini-browser, the user may continue to add collection information through the control 305 shown in Fig. 3; the embodiments of the present invention are not limited in this respect.
When the user clicks a piece of first collection information (such as the URL information), the corresponding page may be loaded in a new window.
The embodiments of the present invention use text information, image information, audio information, and the like as search objects. Text information is convenient to enter, which ensures simplicity. Image information and audio information are more complex, which reduces the probability that the same information is entered, increases the complexity of the search object, and further improves privacy.
Referring to Fig. 4, a flow chart of the steps of embodiment 2 of a method for processing collection information according to an embodiment of the present invention is shown, which may specifically include the following steps:
Step 401, when a first search request based on a first user identifier is received, returning a first search results page obtained by searching with the first search object of the first search request;
Step 402, when one or more pieces of first collection information returned via the first search results page are received, establishing a first association relation among the first user identifier, the first search object, and the one or more pieces of first collection information;
Step 403, when a second search request based on the first user identifier is received, searching with the first search object in the second search request to obtain a second search results page;
Step 404, according to the first association relation, embedding the one or more pieces of first collection information associated with the first user identifier and the first search object into the second search results page;
Step 405, in the first association relation, adding first tag information to the one or more pieces of first collection information.
In practical applications, as the first collection information in a collection box (i.e., a first association relation) gradually accumulates, first tag information may be attached to each collection box (i.e., each first association relation).
For example, the first tag information "university data" may be configured for the collection information "library.ABC.com", "library", "english.ABC.com", "English materials", "chinese.ABC.com", and "Chinese language data".
In one case, a control may be provided on the first search results page, for example the control 302 shown in Fig. 3, through which the user may manually add the first tag information.
In another case, the search engine may add the first tag information automatically, with the user's authorization.
Specifically, the search engine may add the first tag information after analyzing the web page corresponding to the URL information with natural language processing (NLP) techniques. Natural language processing is roughly divided into two aspects. One is shallow analysis, such as word segmentation and part-of-speech tagging, which usually only needs to process a local range of the web page corresponding to the URL information. The other is deep processing of the language, which requires a global analysis of the web page corresponding to the URL information, usually at three levels: syntax, semantics, and pragmatics.
Step 406, when the first user identifier has tag subscription information, looking up one or more matching second association relations;
With the embodiments of the present invention, the user may submit first tag subscription information in order to subscribe to tag information of interest. For example, the user may click the control 502 shown in Fig. 5 and submit first tag subscription information such as "fantasy novels", "cycling enthusiasts", "campus jokes", or "university data".
In a search scenario, the current user can obtain the required information more conveniently. For example, if the current user is a food enthusiast who has subscribed to food-related tag subscription information, then when this user searches for a place or a cuisine, more food information can be found; if the current user is a travel enthusiast who has subscribed to travel-related tag subscription information, then when this user searches for a place, more local travel information can be found.
In a specific implementation, the second association relation may be an association relation among a second user identifier, a second search object, and one or more pieces of second collection information, where the one or more pieces of second collection information have second tag information; the tag subscription information may match the second tag information, and/or the first search object may match the second search object;
In an optional example of the embodiments of the present invention, the third collection information may include URL information and a title.
In the embodiments of the present invention, whether the first tag subscription information matches the second tag information, and whether the second search object matches the third search object, is judged according to a preset matching rule.
The preset matching rule may be a natural language processing analysis rule, a regular expression rule, or a combination of the two.
Natural language processing analysis rules are roughly divided into two aspects. One is shallow analysis, such as word segmentation and part-of-speech tagging, which usually only needs to process a local range of a sentence. The other is deep processing of the language, which requires a global analysis of the sentence, usually at three levels: syntax, semantics, and pragmatics.
A regular expression rule generally expresses a matching rule with characters that have special meanings. For example, the character "^" matches the beginning of the input or of a line, so "^a" matches "an A" but does not match "An a"; the character "$" matches the end of the input or of a line, so "a$" matches "An a" but does not match "an A"; the character "*" matches the preceding character zero or more times, so "ba*" matches "b", "ba", "baa", "baaa", and so on.
In general, natural language processing analysis rules are mainly used to handle synonyms, and regular expression rules are mainly used to handle long-tail words. In addition, custom matching rules may also be defined.
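The regular expression examples given above can be checked directly with Python's standard `re` module. The helper name `matches` is introduced only for this illustration, and matching is case-sensitive by default, which is what makes "^a" reject "An a".

```python
import re

def matches(pattern, text):
    """True if `pattern` is found anywhere in `text` (anchors restrict where)."""
    return re.search(pattern, text) is not None

# (pattern, text, expected) triples taken from the examples in the text.
examples = [
    (r"^a", "an A", True),   # input begins with lowercase "a"
    (r"^a", "An a", False),  # input begins with uppercase "A"
    (r"a$", "An a", True),   # input ends with lowercase "a"
    (r"a$", "an A", False),  # input ends with uppercase "A"
]
anchor_results = [matches(p, t) == expected for p, t, expected in examples]

# "ba*" means "b" followed by zero or more "a" characters.
star = re.compile(r"ba*")
star_results = [star.fullmatch(s) is not None for s in ["b", "ba", "baa", "baaa"]]
```

`fullmatch` is used for the `ba*` check so that the whole string must fit the pattern, rather than just a prefix of it.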
By setting matching rules, the second tag information and the second search object that match the tag subscription information and the first search object can be determined accurately. Moreover, when the tag subscription information or the second search object deviates slightly, for example when the second search object contains a wrong character or is missing a character, the keyword the user actually intends can still be determined according to the natural language processing analysis rules.
For example, suppose other users have previously configured the second tag information "university data" for second collection information, and this second collection information corresponds to the second search object "learning materials". If the current user has subscribed to the tag information "university data" and searches for the first search object "learning materials", the second collection information previously collected by those other users can be obtained, such as "library.ABC.com", "library", "english.ABC.com", "English materials", "chinese.ABC.com", and "Chinese language data".
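The lookup in this example can be sketched as a filter over other users' association relations. This is a simplified sketch: exact string equality stands in for the natural language and regular expression matching rules, both conditions are required (the text also allows either one alone), and the record layout and the name `find_matching_collections` are assumptions made for the example.

```python
def find_matching_collections(relations, subscribed_tags, search_object):
    """Collect items from relations whose tag is subscribed and whose search object matches."""
    matched = []
    for relation in relations:
        tag_ok = relation["tag"] in subscribed_tags
        object_ok = relation["search_object"] == search_object
        if tag_ok and object_ok:
            matched.extend(relation["collections"])
    return matched

# Hypothetical second association relations stored for other users.
second_relations = [
    {"user_id": "user-2", "search_object": "learning materials", "tag": "university data",
     "collections": ["library.ABC.com", "english.ABC.com"]},
    {"user_id": "user-3", "search_object": "learning materials", "tag": "campus jokes",
     "collections": ["jokes.ABC.com"]},
]
shared = find_matching_collections(second_relations, {"university data"}, "learning materials")
```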
In the embodiments of the present invention, tag information is added to one or more pieces of collection information in the association relation, so that a user can, through matching tag subscription information and a matching search object, directly obtain information previously organized by other users. Because manually organized information is often more useful than the information mechanically returned by a search engine, this spares users from repeatedly performing tedious manual filtering of massive amounts of web page information, reduces the time and effort users spend, reduces the system resource consumption of user devices and websites, reduces network bandwidth usage, and greatly improves the efficiency, quality, and quantity of information acquisition.
Step 407, extracting the one or more pieces of second collection information from the one or more second association relations;
In the embodiments of the present invention, the second collection information may be extracted in order to share it with other users.
In one embodiment of the present invention, step 407 may include the following sub-steps:
Sub-step S81, comparing the second collection information in the one or more second association relations;
Sub-step S82, extracting one or more identical pieces of second collection information.
In a specific implementation, if there are many pieces of second collection information, the identical ones may be extracted and shared with the current user.
Further, the embodiments of the present invention may also extract the one or more pieces of second collection information whose frequency of occurrence is higher than a preset threshold, or whose frequency of occurrence is the highest, and share them with the current user; the embodiments of the present invention are not limited in this respect.
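The frequency-based extraction can be sketched with a counter over the matched relations. This is an illustrative sketch only: representing each relation as a list of URL strings, counting an item at most once per relation, and the function name `frequent_collections` are assumptions not stated in the source.

```python
from collections import Counter

def frequent_collections(relations, threshold):
    """Return items that appear in more than `threshold` relations, sorted by name."""
    counts = Counter()
    for items in relations:
        counts.update(set(items))  # count each item once per relation
    return sorted(item for item, n in counts.items() if n > threshold)

# Hypothetical second collection information from three matching relations.
matched_relations = [
    ["library.ABC.com", "english.ABC.com"],
    ["library.ABC.com", "chinese.ABC.com"],
    ["library.ABC.com", "english.ABC.com", "english.ABC.com"],
]
popular = frequent_collections(matched_relations, threshold=1)
```

With a threshold of 1, only items collected in at least two relations are shared; raising the threshold to one less than the number of relations yields the items common to all of them, which corresponds to sub-step S82.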
Step 408, embedding the one or more pieces of second collection information into the second search results page.
The one or more pieces of second collection information are embedded in the second search results page, which is returned to the browser or to the application with an embedded mini-browser and then displayed.
Step 409, returning the second search results page.
Step 410, when a processing request for the one or more pieces of first collection information is received, configuring feature URL information for the one or more pieces of first collection information.
Step 411, when a load request sent based on the feature URL information is received, returning the one or more pieces of first collection information.
In the embodiments of the present invention, the first collection information collected by the current user may be shared with other users.
For example, a user who is new to cycling may ask experienced cyclists to recommend several cycling forums, in order to obtain more information faster.
Specifically, the current user may send a processing request to the search engine through the browser or an application with an embedded mini-browser, asking it to configure feature URL information for the first collection information (such as the cycling forums) so that it can be shared with other users.
The search engine may configure feature URL information for this first collection information (such as the cycling forums) and return it to the browser or the application.
After obtaining the feature URL information, the current user may distribute it to other users through channels such as e-mail, instant messaging tools, forums, and microblogs.
By loading the feature URL information, other users can obtain the first collection information (such as the cycling forums) collected by the current user.
In the embodiments of the present invention, feature URL information is configured for the collection information, and loading this feature URL information retrieves the collection information, so that information previously organized by other users can be obtained directly. Because manually organized information is often more useful than the information mechanically returned by a search engine, this spares users from repeatedly performing tedious manual filtering of massive amounts of web page information, reduces the time and effort users spend, reduces the system resource consumption of user devices and websites, reduces network bandwidth usage, and greatly improves the efficiency, quality, and quantity of information acquisition.
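Steps 410 and 411 can be sketched as issuing a hard-to-guess link and resolving it later. The source does not say how the feature URL information is formed, so everything here is an assumption for illustration: the `ShareRegistry` class, the `https://search.example/share/` base address, and the use of a random token from Python's `secrets` module.

```python
import secrets

class ShareRegistry:
    """Hypothetical registry mapping feature URLs to shared collection information."""

    BASE = "https://search.example/share/"  # placeholder address, not from the source

    def __init__(self):
        self._shared = {}

    def configure(self, collections):
        """Step 410: issue a feature URL for the given collection information."""
        token = secrets.token_urlsafe(16)  # URL-safe random token, contains no "/"
        self._shared[token] = list(collections)
        return self.BASE + token

    def load(self, feature_url):
        """Step 411: return the collection information for a feature URL, or None."""
        token = feature_url.rsplit("/", 1)[-1]
        return self._shared.get(token)

registry = ShareRegistry()
forums = [{"url": "forum.ABC.com", "title": "cycling forum"}]
feature_url = registry.configure(forums)
loaded = registry.load(feature_url)
```

A random token keeps the shared box from being enumerated by guessing, while still letting the owner pass the link along by e-mail or instant messaging as described above.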
For embodiment of the method, in order to simple description, therefore it is all expressed as a series of combination of actions, but those skilled in the art should know, the embodiment of the present invention is not by the restriction of described sequence of movement, because according to the embodiment of the present invention, some step can adopt other orders or carry out simultaneously.Secondly, those skilled in the art also should know, the embodiment described in instructions all belongs to preferred embodiment, and involved action might not be that the embodiment of the present invention is necessary.
Referring to Fig. 6, a structural block diagram of embodiment 1 of a processing apparatus for collection information according to an embodiment of the present invention is shown, which may specifically include the following modules:
A first search module 601, adapted to, upon receiving a first search request based on a first user identifier, return a first search results page obtained by searching with the first search object of the first search request;
An establishing module 602, adapted to, upon receiving one or more pieces of first collection information returned through the first search results page, establish a first association relation among the first user identifier, the first search object, and the one or more pieces of first collection information;
A second search module 603, adapted to, upon receiving a second search request based on the first user identifier, search with the first search object in the second search request to obtain a second search results page;
A first merge module 604, adapted to embed, according to the first association relation, the one or more pieces of first collection information associated with the first user identifier and the first search object into the second search results page;
A first returning module 605, adapted to return the second search results page.
In a specific implementation, the first collection information may include website information and a title, and the second collection information may include website information and a title.
In an embodiment of the present invention, the first merge module 604 may further be adapted to:
Look up, in the preset first association relation, the one or more pieces of first collection information associated with the first user identifier and the first search object;
Embed the one or more pieces of first collection information into the second search results page.
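As a rough illustration of the establishing and merge modules, the sketch below stores the association relation as a mapping keyed by user identifier and search object, then embeds the stored entries into a later results page. The names `FavoriteStore` and `embed_favorites` are hypothetical, and placing the collected entries first while dropping duplicate URLs is an assumption; the specification only states that the entries are embedded in the second search results page.

```python
from collections import defaultdict


class FavoriteStore:
    """Hypothetical store for the association (user identifier, search object) -> entries."""

    def __init__(self):
        self._assoc = defaultdict(list)

    def add_favorite(self, user_id, query, title, url):
        """Record one piece of collection information (title plus website information)."""
        entry = {"title": title, "url": url}
        if entry not in self._assoc[(user_id, query)]:
            self._assoc[(user_id, query)].append(entry)

    def favorites_for(self, user_id, query):
        """Look up entries associated with this user identifier and search object."""
        return list(self._assoc.get((user_id, query), []))


def embed_favorites(results, favorites):
    """Assumed layout: collected entries first, then the remaining results without duplicates."""
    fav_urls = {f["url"] for f in favorites}
    marked = [dict(f, favorite=True) for f in favorites]
    rest = [dict(r, favorite=False) for r in results if r["url"] not in fav_urls]
    return marked + rest


store = FavoriteStore()
store.add_favorite("user-1", "cycling", "Cyclist forum", "https://forum.example/cycling")
second_results = [
    {"title": "Cycling news", "url": "https://news.example/cycling"},
    {"title": "Cyclist forum", "url": "https://forum.example/cycling"},
]
page = embed_favorites(second_results, store.favorites_for("user-1", "cycling"))
```

Keying on the pair of user identifier and search object means the same query from a different user yields no embedded entries, which matches the per-user association described above.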
In an embodiment of the present invention, the first search module 601 may further be adapted to:
Extract the first search object from the first search request;
When the first search object is text information, search a database for webpages matching the text information, the webpages having summary information;
Generate the first search results page using the summary information of the webpages;
Return the first search results page.
In an embodiment of the present invention, the first search module 601 may further be adapted to:
Extract the first search object from the first search request;
When the first search object is image information, identify in a database webpage image information that is similar or identical to the image information;
Generate the first search results page using the webpage image information;
Return the first search results page.
In an embodiment of the present invention, the first search module 601 may further be adapted to:
Extract the first search object from the first search request;
When the first search object is voice data, recognize the feature text information corresponding to the voice data;
Search a database for webpages matching the feature text information, the webpages having second summary information;
Generate the first search results page using the summary information of the webpages;
Return the first search results page.
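The three variants above differ only in how the search object is handled before a results page is generated. A minimal sketch of that dispatch is given below; the `SearchRequest` type, the substring matching in `search_text`, and the injected `recognize_speech` and `find_similar_images` callables are all assumptions standing in for components the specification does not detail.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class SearchRequest:
    user_id: str
    kind: str        # assumed values: "text", "image" or "audio"
    payload: object  # the search object itself


def search_text(text: str, index: Dict[str, str]) -> List[dict]:
    """Naive stand-in for text matching: keep pages whose summary contains the text."""
    needle = text.lower()
    return [
        {"url": url, "summary": summary}
        for url, summary in index.items()
        if needle in summary.lower()
    ]


def build_results_page(
    request: SearchRequest,
    index: Dict[str, str],
    recognize_speech: Callable[[object], str],
    find_similar_images: Callable[[object], List[dict]],
) -> List[dict]:
    """Dispatch on the type of the search object and build a results page."""
    if request.kind == "text":
        return search_text(request.payload, index)
    if request.kind == "audio":
        # Voice data is first converted to feature text, then searched as text.
        return search_text(recognize_speech(request.payload), index)
    if request.kind == "image":
        return find_similar_images(request.payload)
    raise ValueError(f"unsupported search object kind: {request.kind}")


index = {
    "https://forum.example/cycling": "A forum for cycling beginners",
    "https://news.example/cooking": "Daily cooking tips",
}
fake_speech = lambda audio: "cycling"  # placeholder recognizer
fake_images = lambda image: [{"url": "https://img.example/bike.png", "summary": ""}]

text_page = build_results_page(SearchRequest("user-1", "text", "Cycling"), index, fake_speech, fake_images)
audio_page = build_results_page(SearchRequest("user-1", "audio", b"..."), index, fake_speech, fake_images)
image_page = build_results_page(SearchRequest("user-1", "image", b"..."), index, fake_speech, fake_images)
```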
In an embodiment of the present invention, the second search module 603 may further be adapted to:
Extract the first search object from the second search request;
When the first search object is text information, search a database for webpages matching the text information, the webpages having summary information;
Generate the second search results page using the summary information of the webpages.
In an embodiment of the present invention, the second search module 603 may further be adapted to:
Extract the first search object from the second search request;
When the first search object is image information, identify in a database webpage image information that is similar or identical to the image information;
Generate the second search results page using the webpage image information.
In an embodiment of the present invention, the second search module 603 may further be adapted to:
Extract the first search object from the second search request;
When the first search object is voice data, recognize the feature text information corresponding to the voice data;
Search a database for webpages matching the feature text information, the webpages having second summary information;
Generate the second search results page using the summary information of the webpages.
Referring to Fig. 7, a structural block diagram of embodiment 2 of a processing apparatus for collection information according to an embodiment of the present invention is shown, which may specifically include the following modules:
A first search module 701, adapted to, upon receiving a first search request based on a first user identifier, return a first search results page obtained by searching with the first search object of the first search request;
An establishing module 702, adapted to, upon receiving one or more pieces of first collection information returned through the first search results page, establish a first association relation among the first user identifier, the first search object, and the one or more pieces of first collection information;
A second search module 703, adapted to, upon receiving a second search request based on the first user identifier, search with the first search object in the second search request to obtain a second search results page;
A first merge module 704, adapted to embed, according to the first association relation, the one or more pieces of first collection information associated with the first user identifier and the first search object into the second search results page;
An adding module 705, adapted to add first label information to the one or more pieces of first collection information in the first association relation;
A lookup module 706, adapted to, when the first user identifier has tag subscription information, look up one or more matching second association relations; a second association relation is an association relation among a second user identifier, a second search object, and one or more pieces of second collection information, and the one or more pieces of second collection information have second label information; the tag subscription information matches the second label information and/or the first search object matches the second search object;
An extraction module 707, adapted to extract the one or more pieces of second collection information from the one or more second association relations;
A second merge module 708, adapted to embed the one or more pieces of second collection information into the second search results page;
A first returning module 709, adapted to return the second search results page;
A configuration module 710, adapted to, upon receiving a processing request for the one or more pieces of first collection information, configure feature website information for the one or more pieces of first collection information;
A second returning module 711, adapted to, upon receiving a load request sent based on the feature website information, return the one or more pieces of first collection information.
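The matching condition of the lookup module (tag subscription information matches the second label information and/or the first search object matches the second search object) can be sketched as below. The record layout, the function name `find_matching_associations`, exact string equality for matching, attaching labels to the whole record rather than to each entry, and skipping the user's own records are simplifying assumptions made for illustration.

```python
def find_matching_associations(associations, user_id, subscribed_tags, query):
    """Return other users' association records that match by label or by search object."""
    if not subscribed_tags:
        # The lookup only applies when the user has tag subscription information.
        return []
    matches = []
    for assoc in associations:
        if assoc["user_id"] == user_id:
            continue  # assumption: second association relations belong to other users
        tag_match = bool(set(subscribed_tags) & set(assoc["tags"]))
        query_match = assoc["query"] == query
        if tag_match or query_match:  # inclusive "and/or"
            matches.append(assoc)
    return matches


associations = [
    {"user_id": "user-2", "query": "cycling", "tags": ["sports"],
     "favorites": [{"title": "Cyclist forum", "url": "https://forum.example/cycling"}]},
    {"user_id": "user-3", "query": "road bikes", "tags": ["cycling-gear"],
     "favorites": [{"title": "Bike shop", "url": "https://shop.example/bikes"}]},
    {"user_id": "user-4", "query": "cooking", "tags": ["food"],
     "favorites": [{"title": "Recipes", "url": "https://food.example/recipes"}]},
]
matched = find_matching_associations(associations, "user-1", {"cycling-gear"}, "cycling")
```

Here the record of `user-2` matches through the search object and the record of `user-3` matches through the subscribed label, while the unrelated record is ignored.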
In an embodiment of the present invention, the extraction module 707 may further be adapted to:
Compare the second collection information in the one or more second association relations;
Extract one or more pieces of identical second collection information.
Because the apparatus embodiments are basically similar to the method embodiments, they are described relatively briefly; for relevant details, refer to the description of the method embodiments.
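One possible reading of the comparison step is that entries collected in more than one second association relation are the "identical" ones worth extracting. The sketch below follows that reading; comparing entries by URL, the `min_count` threshold of 2, and the function name `extract_common_favorites` are assumptions rather than details given in the specification.

```python
from collections import Counter


def extract_common_favorites(associations, min_count=2):
    """Return entries whose URL appears in at least min_count association records."""
    counts = Counter()
    first_seen = {}
    for assoc in associations:
        # Count each URL at most once per record.
        for url in {f["url"] for f in assoc["favorites"]}:
            counts[url] += 1
        for f in assoc["favorites"]:
            first_seen.setdefault(f["url"], f)
    # Dict insertion order keeps the result deterministic.
    return [entry for url, entry in first_seen.items() if counts[url] >= min_count]


forum = {"title": "Cyclist forum", "url": "https://forum.example/cycling"}
shop = {"title": "Bike shop", "url": "https://shop.example/bikes"}
second_associations = [
    {"user_id": "user-2", "favorites": [forum, shop]},
    {"user_id": "user-3", "favorites": [forum]},
]
common = extract_common_favorites(second_associations)
```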
The algorithms and displays provided herein are not inherently related to any particular computer, virtual system, or other apparatus. Various general-purpose systems may also be used with the teachings herein. From the description above, the structure required to construct such systems is apparent. Moreover, the present invention is not directed to any particular programming language. It should be understood that various programming languages may be used to implement the content of the invention described herein, and that the above description of a specific language is made in order to disclose the best mode of the invention.
Numerous specific details are described in the specification provided herein. However, it is understood that embodiments of the invention may be practiced without these specific details. In some instances, well-known methods, structures, and techniques have not been shown in detail so as not to obscure the understanding of this specification.
Similarly, it should be understood that, in order to streamline the disclosure and aid in understanding one or more of the various inventive aspects, in the above description of exemplary embodiments of the invention, various features of the invention are sometimes grouped together in a single embodiment, figure, or description thereof. However, the disclosed method should not be interpreted as reflecting an intention that the claimed invention requires more features than are expressly recited in each claim. Rather, as the following claims reflect, inventive aspects lie in less than all features of a single foregoing disclosed embodiment. Thus, the claims following the detailed description are hereby expressly incorporated into this detailed description, with each claim standing on its own as a separate embodiment of the invention.
Those skilled in the art will appreciate that the modules in the device of an embodiment can be adaptively changed and arranged in one or more devices different from that embodiment. The modules or units or components in the embodiments may be combined into one module or unit or component, and furthermore they may be divided into a plurality of sub-modules or sub-units or sub-components. Except where at least some of such features and/or processes or units are mutually exclusive, all features disclosed in this specification (including the accompanying claims, abstract, and drawings), and all processes or units of any method or device so disclosed, may be combined in any combination. Unless expressly stated otherwise, each feature disclosed in this specification (including the accompanying claims, abstract, and drawings) may be replaced by an alternative feature serving the same, equivalent, or similar purpose.
Furthermore, those skilled in the art will understand that although some embodiments described herein include some features included in other embodiments but not other features, combinations of features of different embodiments are meant to be within the scope of the invention and to form different embodiments. For example, in the following claims, any of the claimed embodiments can be used in any combination.
The various component embodiments of the present invention may be implemented in hardware, or in software modules running on one or more processors, or in a combination thereof. Those skilled in the art should understand that a microprocessor or a digital signal processor (DSP) may be used in practice to implement some or all of the functions of some or all of the components of the processing device for collection information according to embodiments of the present invention. The present invention may also be implemented as a device or apparatus program (for example, a computer program and a computer program product) for performing part or all of the methods described herein. Such a program implementing the present invention may be stored on a computer-readable medium or may take the form of one or more signals. Such signals may be downloaded from an Internet website, provided on a carrier signal, or provided in any other form.
It should be noted that the above embodiments illustrate rather than limit the invention, and that those skilled in the art may design alternative embodiments without departing from the scope of the appended claims. In the claims, any reference signs placed between parentheses shall not be construed as limiting the claim. The word "comprising" does not exclude the presence of elements or steps not listed in a claim. The word "a" or "an" preceding an element does not exclude the presence of a plurality of such elements. The invention may be implemented by means of hardware comprising several distinct elements and by means of a suitably programmed computer. In a unit claim enumerating several means, several of these means may be embodied by one and the same item of hardware. The use of the words first, second, third, and so on does not indicate any order; these words may be interpreted as names.
The embodiment of the invention discloses A1, a processing method for collection information, comprising:
Upon receiving a first search request based on a first user identifier, returning a first search results page obtained by searching with the first search object of the first search request;
Upon receiving one or more pieces of first collection information returned through the first search results page, establishing a first association relation among the first user identifier, the first search object, and the one or more pieces of first collection information;
Upon receiving a second search request based on the first user identifier, searching with the first search object in the second search request to obtain a second search results page;
According to the first association relation, embedding the one or more pieces of first collection information associated with the first user identifier and the first search object into the second search results page;
Returning the second search results page.
A2. The method as described in A1, further comprising:
In the first association relation, adding first label information to the one or more pieces of first collection information.
A3. The method as described in A1 or A2, wherein before the step of returning the second search results page, the method further comprises:
When the first user identifier has tag subscription information, looking up one or more matching second association relations; a second association relation is an association relation among a second user identifier, a second search object, and one or more pieces of second collection information, and the one or more pieces of second collection information have second label information; the tag subscription information matches the second label information and/or the first search object matches the second search object;
Extracting the one or more pieces of second collection information from the one or more second association relations;
Embedding the one or more pieces of second collection information into the second search results page.
A4. The method as described in A3, wherein the step of extracting the one or more pieces of second collection information from the one or more second association relations comprises:
Comparing the second collection information in the one or more second association relations;
Extracting one or more pieces of identical second collection information.
A5. The method as described in A1, further comprising:
Upon receiving a processing request for the one or more pieces of first collection information, configuring feature website information for the one or more pieces of first collection information.
A6. The method as described in A5, further comprising:
Upon receiving a load request sent based on the feature website information, returning the one or more pieces of first collection information.
A7. The method as described in A1 or A2 or A4 or A5 or A6, wherein the first collection information comprises website information and a title, and the second collection information comprises website information and a title.
A8. The method as described in A1 or A2 or A4 or A5 or A6, wherein the step of embedding, according to the first association relation, the one or more pieces of first collection information associated with the first user identifier and the first search object into the second search results page comprises:
Looking up, in the preset first association relation, the one or more pieces of first collection information associated with the first user identifier and the first search object;
Embedding the one or more pieces of first collection information into the second search results page.
A9. The method as described in A1 or A2 or A4 or A5 or A6, wherein the step of returning a first search results page obtained by searching with the first search object of the first search request comprises:
Extracting the first search object from the first search request;
When the first search object is text information, searching a database for webpages matching the text information, the webpages having summary information;
Generating the first search results page using the summary information of the webpages;
Returning the first search results page.
A10. The method as described in A1 or A2 or A4 or A5 or A6, wherein the step of returning a first search results page obtained by searching with the first search object of the first search request comprises:
Extracting the first search object from the first search request;
When the first search object is image information, identifying in a database webpage image information that is similar or identical to the image information;
Generating the first search results page using the webpage image information;
Returning the first search results page.
A11. The method as described in A1 or A2 or A4 or A5 or A6, wherein the step of returning a first search results page obtained by searching with the first search object of the first search request comprises:
Extracting the first search object from the first search request;
When the first search object is voice data, recognizing the feature text information corresponding to the voice data;
Searching a database for webpages matching the feature text information, the webpages having second summary information;
Generating the first search results page using the summary information of the webpages;
Returning the first search results page.
A12. The method as described in A1 or A2 or A4 or A5 or A6, wherein the step of searching with the first search object in the second search request to obtain a second search results page comprises:
Extracting the first search object from the second search request;
When the first search object is text information, searching a database for webpages matching the text information, the webpages having summary information;
Generating the second search results page using the summary information of the webpages.
A13. The method as described in A1 or A2 or A4 or A5 or A6, wherein the step of searching with the first search object in the second search request to obtain a second search results page comprises:
Extracting the first search object from the second search request;
When the first search object is image information, identifying in a database webpage image information that is similar or identical to the image information;
Generating the second search results page using the webpage image information.
A14. The method as described in A1 or A2 or A4 or A5 or A6, wherein the step of searching with the first search object in the second search request to obtain a second search results page comprises:
Extracting the first search object from the second search request;
When the first search object is voice data, recognizing the feature text information corresponding to the voice data;
Searching a database for webpages matching the feature text information, the webpages having second summary information;
Generating the second search results page using the summary information of the webpages.
The embodiment of the invention also discloses B15, a processing apparatus for collection information, comprising:
A first search module, adapted to, upon receiving a first search request based on a first user identifier, return a first search results page obtained by searching with the first search object of the first search request;
An establishing module, adapted to, upon receiving one or more pieces of first collection information returned through the first search results page, establish a first association relation among the first user identifier, the first search object, and the one or more pieces of first collection information;
A second search module, adapted to, upon receiving a second search request based on the first user identifier, search with the first search object in the second search request to obtain a second search results page;
A first merge module, adapted to embed, according to the first association relation, the one or more pieces of first collection information associated with the first user identifier and the first search object into the second search results page;
A first returning module, adapted to return the second search results page.
B16. The apparatus as described in B15, further comprising:
An adding module, adapted to add first label information to the one or more pieces of first collection information in the first association relation.
B17. The apparatus as described in B15 or B16, further comprising:
A lookup module, adapted to, when the first user identifier has tag subscription information, look up one or more matching second association relations; a second association relation is an association relation among a second user identifier, a second search object, and one or more pieces of second collection information, and the one or more pieces of second collection information have second label information; the tag subscription information matches the second label information and/or the first search object matches the second search object;
An extraction module, adapted to extract the one or more pieces of second collection information from the one or more second association relations;
A second merge module, adapted to embed the one or more pieces of second collection information into the second search results page.
B18. The apparatus as described in B17, wherein the extraction module is further adapted to:
Compare the second collection information in the one or more second association relations;
Extract one or more pieces of identical second collection information.
B19. The apparatus as described in B15, further comprising:
A configuration module, adapted to, upon receiving a processing request for the one or more pieces of first collection information, configure feature website information for the one or more pieces of first collection information.
B20. The apparatus as described in B19, further comprising:
A second returning module, adapted to, upon receiving a load request sent based on the feature website information, return the one or more pieces of first collection information.
B21. The apparatus as described in B15 or B16 or B18 or B19 or B20, wherein the first collection information comprises website information and a title, and the second collection information comprises website information and a title.
B22. The apparatus as described in B15 or B16 or B18 or B19 or B20, wherein the first merge module is further adapted to:
Look up, in the preset first association relation, the one or more pieces of first collection information associated with the first user identifier and the first search object;
Embed the one or more pieces of first collection information into the second search results page.
B23. The apparatus as described in B15 or B16 or B18 or B19 or B20, wherein the first search module is further adapted to:
Extract the first search object from the first search request;
When the first search object is text information, search a database for webpages matching the text information, the webpages having summary information;
Generate the first search results page using the summary information of the webpages;
Return the first search results page.
B24. The apparatus as described in B15 or B16 or B18 or B19 or B20, wherein the first search module is further adapted to:
Extract the first search object from the first search request;
When the first search object is image information, identify in a database webpage image information that is similar or identical to the image information;
Generate the first search results page using the webpage image information;
Return the first search results page.
B25. The apparatus as described in B15 or B16 or B18 or B19 or B20, wherein the first search module is further adapted to:
Extract the first search object from the first search request;
When the first search object is voice data, recognize the feature text information corresponding to the voice data;
Search a database for webpages matching the feature text information, the webpages having second summary information;
Generate the first search results page using the summary information of the webpages;
Return the first search results page.
B26. The apparatus as described in B15 or B16 or B18 or B19 or B20, wherein the second search module is further adapted to:
Extract the first search object from the second search request;
When the first search object is text information, search a database for webpages matching the text information, the webpages having summary information;
Generate the second search results page using the summary information of the webpages.
B27. The apparatus as described in B15 or B16 or B18 or B19 or B20, wherein the second search module is further adapted to:
Extract the first search object from the second search request;
When the first search object is image information, identify in a database webpage image information that is similar or identical to the image information;
Generate the second search results page using the webpage image information.
B28. The apparatus as described in B15 or B16 or B18 or B19 or B20, wherein the second search module is further adapted to:
Extract the first search object from the second search request;
When the first search object is voice data, recognize the feature text information corresponding to the voice data;
Search a database for webpages matching the feature text information, the webpages having second summary information;
Generate the second search results page using the summary information of the webpages.

Claims (10)

1. A processing method for collection information, comprising:
Upon receiving a first search request based on a first user identifier, returning a first search results page obtained by searching with the first search object of the first search request;
Upon receiving one or more pieces of first collection information returned through the first search results page, establishing a first association relation among the first user identifier, the first search object, and the one or more pieces of first collection information;
Upon receiving a second search request based on the first user identifier, searching with the first search object in the second search request to obtain a second search results page;
According to the first association relation, embedding the one or more pieces of first collection information associated with the first user identifier and the first search object into the second search results page;
Returning the second search results page.
2. the method for claim 1, is characterized in that, also comprises:
In described first incidence relation, the first label information is increased to described one or more first collection information.
3. method as claimed in claim 1 or 2, is characterized in that, described return the step of described second result of page searching before, described method also comprises:
When described first user mark has tag subscriptions information, search one or more second incidence relations of coupling; Described second incidence relation is the incidence relation of the second user ID, the second object search and one or more second collection information, and described one or more second collection information has the second label information; Described tag subscriptions information is mated with described second label information and/or described first object search mates with described second object search;
Described one or more second collection information is extracted from described one or more second incidence relation;
By in the second result of page searching described in described one or more second collection information insertion.
4. method as claimed in claim 3, it is characterized in that, the described step extracting described one or more second collection information from described one or more second incidence relation comprises:
The second collection information in described one or more second incidence relation is contrasted;
Extract one or more the second identical collection information.
5. the method for claim 1, is characterized in that, also comprises:
When receiving the process request for described one or many first collection information, to described one or more first collection information configuration feature website information.
6. method as claimed in claim 5, is characterized in that, also comprise:
When receiving the load request sent based on described feature website information, return described one or more first collection information.
7. the method as described in claim 1 or 2 or 4 or 5 or 6, it is characterized in that, described first collection packets of information draws together website information and title, and described second collection packets of information draws together website information and title.
8. The method of claim 1, 2, 4, 5 or 6, wherein the step of inserting, according to the first association relationship, the one or more pieces of first collection information associated with the first user identifier and the first search object into the second search results page comprises:
Searching a preset first association relationship for the one or more pieces of first collection information associated with the first user identifier and the first search object;
Inserting the one or more pieces of first collection information into the second search results page.
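The two steps of claim 8 amount to a keyed lookup followed by a merge. In the sketch below the first association relationship is assumed to be a dictionary keyed by `(user_id, search_object)`; the `collected` flag and the removal of duplicate URLs from the ordinary results are illustrative choices that the claim does not require.

```python
def lookup_first_collections(first_associations, user_id, search_object):
    """Find the collections saved by this user for this search object."""
    return list(first_associations.get((user_id, search_object), []))


def build_second_results_page(search_results, collections):
    """Insert collected items into the page, ahead of the remaining results."""
    collected_urls = {url for url, _ in collections}
    page = [{"url": u, "title": t, "collected": True} for u, t in collections]
    for url, title in search_results:
        if url not in collected_urls:  # avoid listing a collected page twice
            page.append({"url": url, "title": title, "collected": False})
    return page
```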
9. The method of claim 1, 2, 4, 5 or 6, wherein the step of returning the first search results page obtained by searching with the first search object of the first search request comprises:
Extracting the first search object from the first search request;
When the first search object is text information, searching a database for web pages that match the text information, the web pages having summary information;
Generating the first search results page from the summary information of the web pages;
Returning the first search results page.
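The steps of claim 9 can be sketched with an in-memory list standing in for the database. The request layout (a dictionary with a `q` field), the page records, and the all-words substring matching rule are assumptions for illustration only; a real search engine would use an index and a ranking function.

```python
def extract_search_object(request):
    """Pull the search object out of the request; None if it is not text."""
    query = request.get("q")
    return query.strip() if isinstance(query, str) else None


def search_pages(database, text):
    """Return pages whose title or summary contains every query word."""
    words = text.lower().split()
    matches = []
    for page in database:
        haystack = (page["title"] + " " + page["summary"]).lower()
        if words and all(word in haystack for word in words):
            matches.append(page)
    return matches


def first_results_page(request, database):
    """Build the first search results page from the matching pages' summaries."""
    text = extract_search_object(request)
    if not text:
        return []
    return [
        {"url": p["url"], "title": p["title"], "summary": p["summary"]}
        for p in search_pages(database, text)
    ]
```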
10. A device for processing collection information, comprising:
A first search module, adapted to, upon receiving a first search request based on a first user identifier, return a first search results page obtained by searching with a first search object of the first search request;
An establishing module, adapted to, upon receiving one or more pieces of first collection information returned via the first search results page, establish a first association relationship among the first user identifier, the first search object, and the one or more pieces of first collection information;
A second search module, adapted to, upon receiving a second search request based on the first user identifier, search with the first search object in the second search request to obtain a second search results page;
A first merge module, adapted to insert, according to the first association relationship, the one or more pieces of first collection information associated with the first user identifier and the first search object into the second search results page;
A first return module, adapted to return the second search results page.
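The way the modules of claim 10 cooperate can be shown with a compact end-to-end sketch. It assumes an in-memory index keyed by search object and folds the second search, merge, and return modules into one method for brevity; the class name, method names, and data layout are hypothetical and not taken from the patent.

```python
class CollectionDevice:
    """Toy device: search, save collections, then search again."""

    def __init__(self, index):
        self._index = index        # search object -> list of (url, title)
        self._associations = {}    # (user_id, search object) -> saved items

    def first_search(self, user_id, search_object):
        """First search module: return the first search results page."""
        return list(self._index.get(search_object, []))

    def establish(self, user_id, search_object, collections):
        """Establishing module: record the first association relationship."""
        saved = self._associations.setdefault((user_id, search_object), [])
        for item in collections:
            if item not in saved:
                saved.append(item)

    def second_search(self, user_id, search_object):
        """Second search, merge, and return modules combined."""
        page = list(self._index.get(search_object, []))
        saved = self._associations.get((user_id, search_object), [])
        # Saved items come first; the rest of the page follows unchanged.
        return saved + [result for result in page if result not in saved]
```

Because associations are keyed by user identifier as well as search object, a second user issuing the same query sees the ordinary page with no inserted items.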
CN201410784236.9A 2014-12-16 2014-12-16 Method and apparatus for processing collection information Active CN104484414B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410784236.9A CN104484414B (en) 2014-12-16 2014-12-16 Method and apparatus for processing collection information

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410784236.9A CN104484414B (en) 2014-12-16 2014-12-16 Method and apparatus for processing collection information

Publications (2)

Publication Number Publication Date
CN104484414A (en) 2015-04-01
CN104484414B (en) 2018-12-28

Family

ID=52758955

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410784236.9A Active CN104484414B (en) Method and apparatus for processing collection information 2014-12-16 2014-12-16

Country Status (1)

Country Link
CN (1) CN104484414B (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107636767A (en) * 2015-04-14 2018-01-26 曼朵计量公司 Parameter probability context-free grammar for food intake dose
CN107666431A (en) * 2016-07-29 2018-02-06 腾讯科技(深圳)有限公司 Bookmark communication message acquisition methods and device

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102722481A (en) * 2011-03-29 2012-10-10 阿里巴巴集团控股有限公司 Processing method and searching method for user favorite data
CN103064851A (en) * 2011-10-20 2013-04-24 阿里巴巴集团控股有限公司 Information search method and information search device of website content
CN103186666A (en) * 2013-03-01 2013-07-03 北京百度网讯科技有限公司 Method, device and equipment for searching based on favorites
CN103246746A (en) * 2013-05-23 2013-08-14 百度在线网络技术(北京)有限公司 Method, device and system for searching information

Also Published As

Publication number Publication date
CN104484414B (en) 2018-12-28

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
TR01 Transfer of patent right

Effective date of registration: 20220718

Address after: Room 801, 8th floor, No. 104, floors 1-19, building 2, yard 6, Jiuxianqiao Road, Chaoyang District, Beijing 100015

Patentee after: BEIJING QIHOO TECHNOLOGY Co.,Ltd.

Address before: 100088 room 112, block D, 28 new street, new street, Xicheng District, Beijing (Desheng Park)

Patentee before: BEIJING QIHOO TECHNOLOGY Co.,Ltd.

Patentee before: Qizhi software (Beijing) Co.,Ltd.