CN104484414B - A kind for the treatment of method and apparatus of collection information - Google Patents

A kind for the treatment of method and apparatus of collection information Download PDF

Info

Publication number
CN104484414B
CN104484414B CN201410784236.9A CN201410784236A CN104484414B CN 104484414 B CN104484414 B CN 104484414B CN 201410784236 A CN201410784236 A CN 201410784236A CN 104484414 B CN104484414 B CN 104484414B
Authority
CN
China
Prior art keywords
information
searching
search
object search
page
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201410784236.9A
Other languages
Chinese (zh)
Other versions
CN104484414A (en
Inventor
罗吉喜
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Qihoo Technology Co Ltd
Original Assignee
Beijing Qihoo Technology Co Ltd
Qizhi Software Beijing Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Qihoo Technology Co Ltd, Qizhi Software Beijing Co Ltd filed Critical Beijing Qihoo Technology Co Ltd
Priority to CN201410784236.9A priority Critical patent/CN104484414B/en
Publication of CN104484414A publication Critical patent/CN104484414A/en
Application granted granted Critical
Publication of CN104484414B publication Critical patent/CN104484414B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Transfer Between Computers (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The embodiment of the invention provides a kind for the treatment of method and apparatus of collection information, the described method includes: returning when receiving the first searching request based on the first user identifier and scanning for the first result of page searching obtained using the first object search of the first searching request;When receiving one or more the first collection information returned by the first result of page searching, the first incidence relation of the first user identifier, the first object search and one or more first collection information is established;It when receiving the second searching request based on the first user identifier, is scanned for using the first object search in the second searching request, obtains the second results page of search;According to the first incidence relation, will be embedded in the second result of page searching with the associated one or more first collection information of the first user identifier and the first object search;Return to the second result of page searching.The embodiment of the present invention improves the simplicity of operation, substantially increases privacy.

Description

A kind for the treatment of method and apparatus of collection information
Technical field
The present invention relates to technical field of data processing, more particularly to the processing method and a kind of receipts of a kind of collection information The processing unit of hiding folder information.
Background technique
With the fast development of the network technology, especially into mobile internet era, the network information is sharply increased, In include a large amount of webpage.
User generally uses browser to browse webpage, and browser generally provides favorite function, and collection is on being User record oneself is facilitated to like when net, common webpage.Collection information is put into a file, think when Time, which can be opened, to be found.
Present certain browsers provide the function of network storage collection information, and user is in the same clear of different terminals Look at login account in device, so that it may load the collection information that the account is formerly collected.
This mode for collecting collection, needs to install same browser in different terminals, cumbersome, also, steps on All collection information of the account can be shown after record account, and privacy is very low.
In addition, certain websites specially provide network profile, which is that user distributes a webpage, and user can be Collection information is collected in the webpage.
This mode for collecting collection, although without installing specific browser, as long as other users load should Webpage, can obtain the collection information of user collection, and privacy is very low.
Summary of the invention
In view of the above problems, it proposes on the present invention overcomes the above problem or at least be partially solved in order to provide one kind State the processing method and a kind of corresponding processing unit of collection information of a kind of collection information of problem.
According to one aspect of the present invention, a kind of processing method of collection information is provided, comprising:
When receiving the first searching request based on the first user identifier, the using first searching request is returned One object search scans for the first result of page searching obtained;
When receiving one or more the first collection information returned by first result of page searching, establish First incidence relation of first user identifier, first object search and one or more first collection information;
When receiving the second searching request based on the first user identifier, using first in second searching request Object search scans for, and obtains the second results page of search;
It, will be with first user identifier and first object search associated one according to first incidence relation Or multiple first collection information are embedded in second result of page searching;
Return to second result of page searching.
Optionally, the method also includes:
In first incidence relation, the first label information is increased to one or more of first collection information.
Optionally, before the return second result of page searching the step of, the method also includes:
When first user identifier has tag subscriptions information, searches matched one or more second associations and close System;Second incidence relation is the pass of second user mark, the second object search and one or more second collection information Connection relationship, one or more of second collection information have the second label information;The tag subscriptions information and described the The matching of two label informations and/or first object search are matched with second object search;
One or more of second collection information are extracted from one or more of second incidence relations;
One or more of second collection information are embedded in second result of page searching.
Optionally, described that one or more of second collections are extracted from one or more of second incidence relations The step of information includes:
The second collection information in one or more of second incidence relations is compared;
Extract one or more identical second collection information.
Optionally, the method also includes:
When receiving for one or more first collection information processing request, to one or more of the One collection information configuration feature website information.
Optionally, the method also includes:
When receiving the load request based on feature website information transmission, returns to one or more of first and receive Hiding folder information.
Optionally, the first collection information includes website information and title, and the second collection information includes net Location information and title.
Optionally, described according to first incidence relation, it will be with first user identifier and first search pair As the step that associated one or more first collection information are embedded in second result of page searching includes:
It is searched in preset first incidence relation associated with first user identifier and first object search One or more first collection information;
One or more of first collection information are embedded in second result of page searching.
Optionally, described return is searched using first that the first object search of first searching request scans for obtaining The step of rope results page includes:
Extract the first object search in first searching request;
When first object search is text information, search for and the matched net of the text information in the database Page;The webpage has summary info;
First result of page searching is generated using the summary info of the webpage;
Return to first result of page searching.
Optionally, described return is searched using first that the first object search of first searching request scans for obtaining The step of rope results page includes:
Extract the first object search in first searching request;
When first object search is image information, identify in the database similar or identical with described image information Web page image information;
First result of page searching is generated using the Web page image information;
Return to first result of page searching.
Optionally, described return is searched using first that the first object search of first searching request scans for obtaining The step of rope results page includes:
Extract the object search in first searching request;
When first object search is audio data, the corresponding feature text information of the audio data is identified;
Search and the matched webpage of feature text information in the database;The webpage has the second summary info;
First result of page searching is generated using the summary info of the webpage;
Return to first result of page searching.
Optionally, first object search using in second searching request scans for, and obtains search second The step of results page includes:
Extract the first object search in second searching request;
When first object search is text information, search for and the matched net of the text information in the database Page;The webpage has summary info;
Second result of page searching is generated using the summary info of the webpage.
Optionally, first object search using in second searching request scans for, and obtains search second The step of results page includes:
Extract the first object search in second searching request;
When first object search is image information, identify in the database similar or identical with described image information Web page image information;
First result of page searching is generated using the Web page image information.
Optionally, first object search using in second searching request scans for, and obtains search second The step of results page includes:
Extract the first object search in second searching request;
When first object search is audio data, the corresponding feature text information of the audio data is identified;
Search and the matched webpage of feature text information in the database;The webpage has the second summary info;
Second result of page searching is generated using the summary info of the webpage.
According to another aspect of the present invention, a kind of processing unit of collection information is provided, comprising:
First search module, suitable for returning and using institute when receiving the first searching request based on the first user identifier The first object search for stating the first searching request scans for the first result of page searching obtained;
Module is established, suitable for receiving the collection of one or more first returned by first result of page searching When pressing from both sides information, first user identifier, first object search and one or more first collection information are established First incidence relation;
Second search module, suitable for when receiving the second searching request based on the first user identifier, using described The first object search in two searching requests scans for, and obtains the second results page of search;
First insertion module, is suitable for according to first incidence relation, will be with first user identifier and described first The associated one or more first collection information of object search are embedded in second result of page searching;
First return module is adapted to return to second result of page searching.
Optionally, the method also includes:
Increase module, be suitable in first incidence relation, one or more of first collection information are increased First label information.
Optionally, the method also includes:
Searching module is suitable for searching matched one or more when first user identifier has tag subscriptions information A second incidence relation;Second incidence relation is that second user mark, the second object search and one or more second are received The incidence relation of hiding folder information, one or more of second collection information have the second label information;The tag subscriptions Information is matched with second label information and/or first object search is matched with second object search;
Extraction module, suitable for extracting one or more of second collections from one or more of second incidence relations Press from both sides information;
Second insertion module, is suitable for one or more of second collection information being embedded in second search results pages In face.
Optionally, the extraction module is further adapted for:
The second collection information in one or more of second incidence relations is compared;
Extract one or more identical second collection information.
Optionally, the method also includes:
Configuration module, suitable for when receiving for one or more first collection information processing request, to institute State one or more first collection information configuration feature website informations.
Optionally, the method also includes:
Second return module, suitable for returning to institute when receiving the load request based on feature website information transmission State one or more first collection information.
Optionally, the first collection information includes website information and title, and the second collection information includes net Location information and title.
Optionally, the first insertion module is further adapted for:
It is searched in preset first incidence relation associated with first user identifier and first object search One or more first collection information;
One or more of first collection information are embedded in second result of page searching.
Optionally, first search module is further adapted for:
Extract the first object search in first searching request;
When first object search is text information, search for and the matched net of the text information in the database Page;The webpage has summary info;
First result of page searching is generated using the summary info of the webpage;
Return to first result of page searching.
Optionally, first search module is further adapted for:
Extract the first object search in first searching request;
When first object search is image information, identify in the database similar or identical with described image information Web page image information;
First result of page searching is generated using the Web page image information;
Return to first result of page searching.
Optionally, first search module is further adapted for:
Extract the object search in first searching request;
When first object search is audio data, the corresponding feature text information of the audio data is identified;
Search and the matched webpage of feature text information in the database;The webpage has the second summary info;
First result of page searching is generated using the summary info of the webpage;
Return to first result of page searching.
Optionally, second search module is further adapted for:
Extract the first object search in second searching request;
When first object search is text information, search for and the matched net of the text information in the database Page;The webpage has summary info;
Second result of page searching is generated using the summary info of the webpage.
Optionally, second search module is further adapted for:
Extract the first object search in second searching request;
When first object search is image information, identify in the database similar or identical with described image information Web page image information;
First result of page searching is generated using the Web page image information.
Optionally, second search module is further adapted for:
Extract the first object search in second searching request;
When first object search is audio data, the corresponding feature text information of the audio data is identified;
Search and the matched webpage of feature text information in the database;The webpage has the second summary info;
Second result of page searching is generated using the summary info of the webpage.
The embodiment of the present invention is being directed to the first searching request, establish the first user identifier, the first object search and one or First incidence relation of more first collection information, for the second searching request, according to first incidence relation return one or More first collection information, on the one hand, collection information is shown based on the page, the specific browser of installation is avoided, mentions The high simplicity of operation;On the other hand, using the first object search as the entrance for showing collection information, login account is avoided Number, load some webpage and be loaded directly into collection information, substantially increase privacy.
The embodiment of the present invention is based on text information, image information, audio-frequency information etc. and is used as object search, and text information can be with Facilitate input, ensure that simplicity, image information, audio-frequency information due to complexity it is higher, it is possible to reduce input same text The probability of information improves the complexity of object search, further improves privacy.
The embodiment of the present invention increases label information in incidence relation, to one or more collection information, supports user By matched tag subscriptions information, and, matched object search directly obtains the information that other users formerly arranged, Since the information of manual sorting is often more more efficient than the information that search engine machinery returns, avoids user and repeat to magnanimity Webpage information carries out cumbersome artificial filter, reduces the consuming of user time and energy, decreases user equipment and website System resources consumption, decrease the occupancy of network bandwidth, substantially increase the efficiency, quality and capacity of acquisition of information.
The embodiment of the present invention is collection information configuration feature website information, and load this feature website information can then obtain The collection information directly obtains the information that other users formerly arranged, since the information of manual sorting is often drawn than search The information for holding up mechanical return is more efficient, avoids user and repeats to carry out cumbersome artificial filter to the webpage information of magnanimity, subtracts The consuming for having lacked user time and energy decreases the system resources consumption of user equipment and website, decreases Netowrk tape Wide occupancy substantially increases the efficiency, quality and capacity of acquisition of information.
The above description is only an overview of the technical scheme of the present invention, in order to better understand the technical means of the present invention, And it can be implemented in accordance with the contents of the specification, and in order to allow above and other objects of the present invention, feature and advantage can It is clearer and more comprehensible, the followings are specific embodiments of the present invention.
Detailed description of the invention
By reading the following detailed description of the preferred embodiment, various other advantages and benefits are common for this field Technical staff will become clear.The drawings are only for the purpose of illustrating a preferred embodiment, and is not considered as to the present invention Limitation.And throughout the drawings, the same reference numbers will be used to refer to the same parts.In the accompanying drawings:
Fig. 1 shows a kind of step of the processing method embodiment 1 of collection information according to an embodiment of the invention Rapid flow chart;
Fig. 2 shows a kind of exemplary diagrams for adding collection information according to an embodiment of the invention;
Fig. 3 shows a kind of exemplary diagram for showing collection information according to an embodiment of the invention;
Fig. 4 shows a kind of step of the processing method embodiment 2 of collection information according to an embodiment of the invention Rapid flow chart;
Fig. 5 shows a kind of exemplary diagram for adding tag subscriptions information according to an embodiment of the invention;
Fig. 6 shows a kind of structure of the processing device embodiment 1 of collection information according to an embodiment of the invention Block diagram;And
Fig. 7 shows a kind of structure of the processing device embodiment 2 of collection information according to an embodiment of the invention Block diagram.
Specific embodiment
Exemplary embodiments of the present disclosure are described in more detail below with reference to accompanying drawings.Although showing the disclosure in attached drawing Exemplary embodiment, it being understood, however, that may be realized in various forms the disclosure without should be by embodiments set forth here It is limited.On the contrary, these embodiments are provided to facilitate a more thoroughly understanding of the present invention, and can be by the scope of the present disclosure It is fully disclosed to those skilled in the art.
Referring to Fig.1, a kind of processing method embodiment 1 of collection information according to an embodiment of the invention is shown Flow chart of steps can specifically include following steps:
Step 101, it when receiving the first searching request based on the first user identifier, returns using first search First object search of request scans for the first result of page searching obtained;
In the concrete realization, user can access server (such as search engine) from any electronic equipment, the electronics Equipment can specifically include mobile device, such as (Personal Digital Assistant, individual digital help by mobile phone, PDA Reason), laptop computer, palm PC etc., also may include fixed equipment, such as personal computer, smart television etc..
These electronic equipments can support to include Android (Android), IOS, WindowsPhone or windows etc. Operating system can usually run the application program of browser or built-in miniature browser.
First searching request can refer to the search and the instruction of some object search relevant information that user issues.
For example, user can initiate the first searching request by inputting some object search in the webpage of search engine, Or (plug-ins, can be by interacting, in browser with browser, search engine etc. in the search plug-in unit of browser Middle increase function of search) etc. input some object search and initiate first searching request etc..When user is in search-engine web page When clicking search control, it is equivalent to receive the instruction for initiating the first searching request based on search engine;Equally, when searching When inputting some object search in rope plug-in unit and clicking confirming button or press enter key, also corresponds to receive and initiate to be based on searching Index the instruction for the first searching request held up.
It wherein, may include the first user identifier and the first object search in first searching request;
First user identifier can be the information that can represent the user that one uniquely determines, for example, User ID (abbreviation of IDentity, identity number), the other information with User ID binding, such as mailbox, telephone number.
First object search may include text information, pictorial information, audio-frequency information etc., the embodiment of the present invention to this not It limits.
In practical applications, request header information can be passed through HTTP by the application program of browser or built-in miniature browser Server where (Hypertext transfer protocol, hypertext transfer protocol) from agreement to search engine initiates the One searching request.The server such as receives after the request to be processed, last the answering to browser or built-in miniature browser With program returning response.
It, then can be according to first search when receiving the first object search of user's submission in the embodiment of the present invention The object relevant information of Rapid Detection in the database carries out the covariance mapping of information and inquiry, to the result that will be exported It is ranked up and returns to the application program of browser or built-in miniature browser.
In an alternative embodiment of the invention, step 101 may include following sub-step:
Sub-step S11 extracts the first object search in first searching request;
Sub-step S12 is searched for and the text information in the database when first object search is text information Matched webpage;The webpage has summary info;
Sub-step S13 generates the first result of page searching using the summary info of the webpage;
Sub-step S14 returns to first result of page searching.
In the concrete realization, if the first object search is text information, phase can be searched for based on modes such as inverted indexs The webpage of pass.
It is illustrated by taking search engine as an example, the search routine of search engine is divided into two parts, first is that front end user is asked Process is sought, second is that rear end makes data procedures.
One, front end user request process:
1. receiving request: receiving the text information that user inputs in search engine;
2. query word is analyzed: carrying out word segmentation processing to text information;
3. retrieval: according to word segmentation result, from the inverted index of pre-production, searching candidate's relevant to word segmentation result Webpage;
4. sequence: for candidate webpage, being ranked up according to dimensions such as content relevance, timeliness;
5. showing: the summary info of the webpage after sequence is shown in result of page searching.
Two, rear end makes data procedures:
1. webpage capture: grabbing the webpage of internet and preservation by the linking relationship between webpage using crawler technology.
2. compilation of index: analyzing the webpage for having grabbed preservation, such as divide web page title and page text Word processing makes inverted index according to word segmentation result, uses for front end user request process.
In an alternative embodiment of the invention, step 101 may include following sub-step:
Sub-step S21 extracts the first object search in first searching request;
Sub-step S22 is identified and described image information in the database when first object search is image information Similar or identical Web page image information;
Sub-step S23 generates the first result of page searching using the Web page image information;
Sub-step S24 returns to first result of page searching.
In the concrete realization, it if the first object search is pictorial information, can be searched by modes such as picture similarities Similar or identical Web page image information.
In embodiments of the present invention, the characteristic information that can be extracted in image information and Web page image information carries out similarity Calculating.
Wherein, characteristic information may include at least one of shape feature information and color characteristic information;Shape feature Information can refer to that the information of characterization image style characteristic, color characteristic information can refer to the information of characterization image color characteristics.
There are two main classes for the representation method of shape feature information, and one kind is provincial characteristics, mainly for the entire of image Shape area;Another kind of is contour feature, is directed to the outer boundary of object.
The typical method for extracting shape feature information includes boundary characteristic value method (outer boundary of image), geometry parameter method (Fourier becomes for (image geometry parameterized treatment), shape invariance moments method (looking for Image Moment Invariants feature), Fourier's shape description method Change method) etc..
Color characteristic information can be through the color characteristic of image or image-region and describe, it has globality.
The typical method for extracting color characteristic information includes color histogram, color set, color moment etc..
Certainly, features described above information is intended only as example, in implementing the embodiments of the present invention, can set according to the actual situation Other characteristic informations are set, the embodiments of the present invention are not limited thereto.
In an alternative embodiment of the invention, step 101 may include following sub-step:
Sub-step S31 extracts the object search in first searching request;
Sub-step S32 identifies the corresponding feature of the audio data when first object search is audio data Text information;
Sub-step S33 is searched for and the matched webpage of feature text information in the database;The webpage has second Summary info;
Sub-step S34 generates the first result of page searching using the summary info of the webpage;
Sub-step S35 returns to first result of page searching.
In the concrete realization, if the first object search is audio data, the corresponding text of the audio data can be identified This information, then relevant webpage is searched for based on modes such as inverted indexs.
In practical applications, electronic equipment can acquire the audio data that user issues by sound card equipments such as microphones, Alternatively, directly uploading the audio data acquired by electronic equipment, and pass through speech recognition technology (Automatic Speech Recognition, ASR) by the vocabulary content (i.e. voice data) in the voice of the mankind be converted to it is computer-readable input (i.e. Text information).
Currently, speech recognition technology is usually realized by speech recognition system.The large vocabulary speech recognition system of mainstream is more Using statistical-simulation spectrometry technology.Typically based on the speech recognition system of statistical pattern recognition method by following basic mould Block is constituted:
1, signal processing and characteristic extracting module;The main task of the module is that feature is extracted from audio data, for sound Learn model treatment.Meanwhile it generally also includes some signal processing technologies, to reduce ambient noise as far as possible, channel, speak The factors such as people are influenced caused by feature.
2, acoustic model;Speech recognition system is mostly used to be modeled based on single order Hidden Markov Model.
3, pronunciation dictionary;Pronunciation dictionary includes the word finder and its pronunciation that speech recognition system can be handled.Pronunciation dictionary Actually provide the mapping of acoustic model and language model.
4, language model;The language model language targeted to speech recognition system models.Theoretically, including canonical Language, the various language models including context-free grammar all can serve as language model, but various systems are generally adopted at present Or N-gram and its variant based on statistics.
5, decoder;Decoder is one of core of speech recognition system, and task is the signal to input, according to sound It learns, language model and dictionary, searching can export the word string of the signal with maximum probability.It can more clearly from mathematical angle Understand the relationship between above-mentioned module.
Certainly, above-mentioned object search and way of search are intended only as example, in implementing the embodiments of the present invention, can basis Other object searches and way of search is arranged in actual conditions, and the embodiments of the present invention are not limited thereto.In addition, being searched in addition to above-mentioned Outside rope object and way of search, those skilled in the art can also use other object searches and searcher according to actual needs Formula, the embodiment of the present invention are also without restriction to this.
The embodiment of the present invention is based on text information, image information, audio-frequency information etc. and is used as object search, and text information can be with Facilitate input, ensure that simplicity, image information, audio-frequency information due to complexity it is higher, it is possible to reduce input same text The probability of information improves the complexity of object search, further improves privacy.
Step 102, when receive by first result of page searching return first collection of one or more believe When breath, the first of first user identifier, first object search and one or more first collection information is established Incidence relation;
Under http protocol, the application program of browser or built-in miniature browser can be from the service where search engine Device receives the document of HTML (Hypertext Markup Language, hypertext markup language) type.
The application program of browser or built-in miniature browser can parse html document, generate the object of tree, That is DOM (Document Object Model, document dbject model), each object is a node on DOM, and these are right As the web page resources such as text, picture can be represented.The application program of browser or built-in miniature browser can start to show this Html document, and the address of wherein embedded web page resources is obtained, then browser initiates request to server to obtain this again A little web page resources, and the first search results pages are shown in the html document of the application program in browser or built-in miniature browser Face.
In the concrete realization, the control of the first collection information of input, user can be provided in the first result of page searching The first collection information can be inputted by the control.
Wherein, the first collection information include can be with website information and title.
For example, as shown in Fig. 2, if user has input the first object search 201 " learning materials ", in the first search result In the page, control 501 as shown in Figure 5 can be provided, alternatively, control 202 and control 203 as shown in Figure 2 can be provided, it should Control 202 can be used for inputting website information, which can be used for inputting title, can such as input in control 202 " library.ABC.com ", " library " is inputted in control 203, then a first collection information can be generated;In control " english.ABC.com " is inputted in 202, inputs " English materials " in control 203, then the first collection letter can be generated Breath;" chinese.ABC.com " is inputted in control 202, inputs " Chinese language data " in control 203, then can be generated one the One collection information etc..
In the concrete realization, search engine can establish the first user identifier, the first object search and one or more first First incidence relation of collection information, storage generate collection information in the database, with confirmation.
Since the one or more collection information belongs to same first object search, pictograph, it can be by first Incidence relation is referred to as to collect box, which can be the key of this collection box of opening.
Step 103, when receiving the second searching request based on the first user identifier, using second searching request In the first object search scan for, obtain search the second results page;
In the concrete realization, the second searching request can refer to the search and some object search relevant information that user issues Instruction.
For example, user can initiate the second searching request by inputting some object search in the webpage of search engine, Or (plug-ins, can be by interacting, in browser with browser, search engine etc. in the search plug-in unit of browser Middle increase function of search) etc. input some object search and initiate second searching request etc..When user is in search-engine web page When clicking search control, it is equivalent to receive the instruction for initiating the second searching request based on search engine;Equally, when searching When inputting some object search in rope plug-in unit and clicking confirming button or press enter key, also corresponds to receive and initiate to be based on searching Index the instruction for the second searching request held up.
It wherein, may include the first user identifier and the first object search in second searching request;
Request header information can be passed through HTTP by the application program of browser or built-in miniature browser in practical applications Server where (Hypertext transfer protocol, hypertext transfer protocol) from agreement to search engine initiates the One searching request.The server such as receives after the request to be processed, last the answering to browser or built-in miniature browser With program returning response.
In inventive embodiments, user can just identical first object search be scanned for, when search engine receives use When the first object search that family is submitted, then can according to the first object search relevant information of Rapid Detection in the database, The covariance mapping for carrying out information and inquiry, is ranked up the result that will be exported.
In an alternative embodiment of the invention, step 103 may include following sub-step:
Sub-step S41 extracts the first object search in second searching request;
Sub-step S42 is searched for and the text information in the database when first object search is text information Matched webpage;The webpage has summary info;
Sub-step S43 generates the second result of page searching using the summary info of the webpage.
In the concrete realization, if the first object search is text information, phase can be searched for based on modes such as inverted indexs The webpage of pass.
It is illustrated by taking search engine as an example, the search routine of search engine is divided into two parts, first is that front end user is asked Process is sought, second is that rear end makes data procedures.
One, front end user request process:
1. receiving request: receiving the text information that user inputs in search engine;
2. query word is analyzed: carrying out word segmentation processing to text information;
3. retrieval: according to word segmentation result, from the inverted index of pre-production, searching candidate's relevant to word segmentation result Webpage;
4. sequence: for candidate webpage, being ranked up according to dimensions such as content relevance, timeliness;
5. showing: the summary info of the webpage after sequence is shown in result of page searching.
Two, rear end makes data procedures:
1. webpage capture: grabbing the webpage of internet and preservation by the linking relationship between webpage using crawler technology.
2. compilation of index: analyzing the webpage for having grabbed preservation, such as divide web page title and page text Word processing makes inverted index according to word segmentation result, uses for front end user request process.
In an alternative embodiment of the invention, step 103 may include following sub-step:
Sub-step S51 extracts the first object search in second searching request;
Sub-step S52 is identified and described image information in the database when first object search is image information Similar or identical Web page image information;
Sub-step S53 generates the first result of page searching using the Web page image information.
In the concrete realization, it if the first object search is pictorial information, can be searched by modes such as picture similarities Similar or identical Web page image information.
In embodiments of the present invention, the characteristic information that can be extracted in image information and Web page image information carries out similarity Calculating.
Wherein, characteristic information may include at least one of shape feature information and color characteristic information;Shape feature Information can refer to that the information of characterization image style characteristic, color characteristic information can refer to the information of characterization image color characteristics.
There are two main classes for the representation method of shape feature information, and one kind is provincial characteristics, mainly for the entire of image Shape area;Another kind of is contour feature, is directed to the outer boundary of object.
The typical method for extracting shape feature information includes boundary characteristic value method (outer boundary of image), geometry parameter method (Fourier becomes for (image geometry parameterized treatment), shape invariance moments method (looking for Image Moment Invariants feature), Fourier's shape description method Change method) etc..
Color characteristic information can be through the color characteristic of image or image-region and describe, it has globality.
The typical method for extracting color characteristic information includes color histogram, color set, color moment etc..
Certainly, features described above information is intended only as example, in implementing the embodiments of the present invention, can set according to the actual situation Other characteristic informations are set, the embodiments of the present invention are not limited thereto.
In an alternative embodiment of the invention, step 103 may include following sub-step:
Sub-step S61 extracts the first object search in second searching request;
Sub-step S62 identifies the corresponding feature of the audio data when first object search is audio data Text information;
Sub-step S63 is searched for and the matched webpage of feature text information in the database;The webpage has second Summary info;
Sub-step S64 generates the second result of page searching using the summary info of the webpage.
In the concrete realization, if the first object search is audio data, the corresponding text of the audio data can be identified This information, then relevant webpage is searched for based on modes such as inverted indexs.
In practical applications, electronic equipment can acquire the audio data that user issues by sound card equipments such as microphones, Alternatively, directly uploading the audio data acquired by electronic equipment, and pass through speech recognition technology (Automatic Speech Recognition, ASR) by the vocabulary content (i.e. voice data) in the voice of the mankind be converted to it is computer-readable input (i.e. Text information).
Currently, speech recognition technology is usually realized by speech recognition system.The large vocabulary speech recognition system of mainstream is more Using statistical-simulation spectrometry technology.Typically based on the speech recognition system of statistical pattern recognition method by following basic mould Block is constituted:
1, signal processing and characteristic extracting module;The main task of the module is that feature is extracted from audio data, for sound Learn model treatment.Meanwhile it generally also includes some signal processing technologies, to reduce ambient noise as far as possible, channel, speak The factors such as people are influenced caused by feature.
2, acoustic model;Speech recognition system is mostly used to be modeled based on single order Hidden Markov Model.
3, pronunciation dictionary;Pronunciation dictionary includes the word finder and its pronunciation that speech recognition system can be handled.Pronunciation dictionary Actually provide the mapping of acoustic model and language model.
4, language model;The language model language targeted to speech recognition system models.Theoretically, including canonical Language, the various language models including context-free grammar all can serve as language model, but various systems are generally adopted at present Or N-gram and its variant based on statistics.
5, decoder;Decoder is one of core of speech recognition system, and task is the signal to input, according to sound It learns, language model and dictionary, searching can export the word string of the signal with maximum probability.It can more clearly from mathematical angle Understand the relationship between above-mentioned module.
Certainly, above-mentioned object search and way of search are intended only as example, in implementing the embodiments of the present invention, can basis Other object searches and way of search is arranged in actual conditions, and the embodiments of the present invention are not limited thereto.In addition, being searched in addition to above-mentioned Outside rope object and way of search, those skilled in the art can also use other object searches and searcher according to actual needs Formula, the embodiment of the present invention are also without restriction to this.
Step 104, it according to first incidence relation, will be closed with first user identifier and first object search One or more the first collection information of connection is embedded in second result of page searching;
In the embodiment of the present invention, if user has formerly had the first collection information, the use with regard to the collection of the first object search Family is in rear search first object search, search engine the first collection information of available first collection, and insertion second is searched In rope results page.
In an alternative embodiment of the invention, step 104 may include following sub-step:
Sub-step S71 is searched and first user identifier and first search in preset first incidence relation The associated one or more first collection information of object;
One or more of first collection information are embedded in second result of page searching by sub-step S72.
Using the embodiment of the present invention, user can first pass through the first object search in advance and collect one or more first collections Information, search engine establish the first of the first user identifier, the first object search and one or more first collection information Incidence relation.
Then search engine can be found out and the first user identifier and the first search pair by first incidence relation As associated one or more first collection information.
One or more first collection information can be embedded in the second result of page searching and be returned to by search engine The application program of browser or built-in miniature browser.
Step 105, second result of page searching is returned.
Under http protocol, the application program of browser or built-in miniature browser can be from the service where search engine Device receives the document of HTML (Hypertext Markup Language, hypertext markup language) type.
The application program of browser or built-in miniature browser can parse html document, generate the object of tree, That is DOM (Document Object Model, document dbject model), each object is a node on DOM, and these are right As the web page resources such as text, picture can be represented.The application program of browser or built-in miniature browser can start to show this Html document, and the address of wherein embedded web page resources is obtained, then browser initiates request to server to obtain this again A little web page resources, and the second search results pages are shown in the html document of the application program in browser or built-in miniature browser Face.
For example, as shown in figure 3, can be searched for second if user has input the first object search 301 " learning materials " In results page, control 303 and control 304 are provided, which can be used for loading title, which can be used for adding Website information is carried, " library.ABC.com " can be such as loaded in control 304, loads " library " in control 303;? " english.ABC.com " is loaded in control 304, loads " English materials " in control 303;It is loaded in control 304 " chinese.ABC.com ", " Chinese language data " etc. is loaded in control 303.
The first collection that user formerly collects is loaded in browser or built-in miniature browser it should be noted that working as Information is pressed from both sides, user can continue through control 305 as shown in Figure 3 and continue to add collection information, and the embodiment of the present invention is to this It is without restriction.
When the user clicks when the first collection information (such as website information), then corresponding page can be loaded in new window Face.
The embodiment of the present invention is based on text information, image information, audio-frequency information etc. and is used as object search, and text information can be with Facilitate input, ensure that simplicity, image information, audio-frequency information due to complexity it is higher, it is possible to reduce input same text The probability of information improves the complexity of object search, further improves privacy.
Referring to Fig. 4, a kind of processing method embodiment 2 of collection information according to an embodiment of the invention is shown Flow chart of steps can specifically include following steps:
Step 401, it when receiving the first searching request based on the first user identifier, returns using first search First object search of request scans for the first result of page searching obtained;
Step 402, when receive by first result of page searching return first collection of one or more believe When breath, the first of first user identifier, first object search and one or more first collection information is established Incidence relation;
Step 403, when receiving the second searching request based on the first user identifier, using second searching request In the first object search scan for, obtain search the second results page;
Step 404, it according to first incidence relation, will be closed with first user identifier and first object search One or more the first collection information of connection is embedded in second result of page searching;
Step 405, in first incidence relation, the first mark is increased to one or more of first collection information Sign information.
In practical applications, collection the first collection information collected in box (i.e. the first incidence relation) is slowly more and more Later, the first label information can be stamped to each collection box (i.e. the first incidence relation).
For example, for " library.ABC.com ", " library ", " english.ABC.com ", " English materials ", " chinese.ABC.com ", " Chinese language data " these collection information can configure " university's data " this first label letter Breath.
In oneainstance, a control can be provided in the first result of page searching, for example, control shown in Fig. 3 302, user can add the first label information by the control manually.
In another scenario, can be in the case where user authorize, search engine adds the first label information automatically.
Specifically, search engine can use natural language processing technique (Natural Language Processing, NLP) the first label information is added after the corresponding webpage of analysis website information.Wherein, natural language processing is lodged Two levels are roughly divided into, one is superficial layer analyzing, is such as segmented, part-of-speech tagging, usually only need to be to the corresponding webpage of website information Subrange be analyzed and processed;Another level is that the processing of deep layer is carried out to language, is needed corresponding to website information Webpage carries out global analysis, and in analysis, usually to syntax, semanteme and pragmatic, these three levels are analyzed.
Step 406, when first user identifier has tag subscriptions information, matched one or more second are searched Incidence relation;
Using the embodiment of the present invention, user can submit the first tag subscriptions information, to subscribe to interested label information. For example, control 502 as shown in Figure 5 can be clicked, " fantasy novel ", " ride fan ", " campus joke ", " university are submitted Data " etc. the first tag subscriptions information.
Scene based on search, active user can quickly and conveniently obtain required information.For example, active user is Diet fan, subscribes to the tag subscriptions information of diet class, and user search may search for more somewhere or when some cuisine More cuisines information;Active user is a travel enthusiasts, subscribes to the tag subscriptions information of tourism, which searches for some ground Fang Shi may search for local more travel informations.
In the concrete realization, second incidence relation can for second user mark, the second object search and one or The incidence relation of multiple second collection information, one or more of second collection information are fallen with the second label information; The tag subscriptions information, which can state the matching of the second label information and/or first object search, can state the second object search Matching;
In an alternative example of an embodiment of the present invention, the third collection information may include website information and name Claim.
In embodiments of the present invention, judge the first tag subscriptions information and the second label information, the second object search with It when whether third object search matches, is judged according to preset matching rule.
The preset matching rule is natural language processing analysis rule, alternatively, and regular expression rule, alternatively, It is also the combination of the two.
Wherein, natural language processing analysis rule is roughly divided into two levels, and one is superficial layer analyzing, such as segments, part of speech Mark usually need to only be analyzed and processed the subrange of sentence;Another level is that the processing of deep layer is carried out to language, is needed Global analysis is carried out to sentence, usually these three levels are analyzed to syntax, semanteme and pragmatic in analysis.
Regular expression rule indicates matching rule generally by some characters with specific meanings, for example, Character " ^ " matches an input or the beginning of a line, such as " ^a " matching " an A ", and mismatches " An a ";Character " $ " matching one A input or the ending of a line, such as " a " matching " An a ", and mismatch " an A ";Character " * " match front metacharacter 0 time or Repeatedly, such as " ba* " will match " b ", " ba ", " baa " and " baaa " etc..
Under normal conditions, natural language processing analysis rule is mainly used to solve synonymous word problem, regular expression rule Then it is mainly used to handle long-tail word.In addition, also customized some matching rules.
By the setting of matching rule, second to match with tag subscriptions information, the first object search is accurately determined Label information, the second object search, moreover, when tag subscriptions information, the second object search have a little bias, for example, second searches There is a wrong word in rope object or lost a word, at this moment, according to natural language processing analysis rule, still determines to use The keyword that family actually wants to.
For example, if formerly other users are the second collection information configuration " university's data " this second label information, The second collection information correspond to " learning materials " this second object search, then active user have subscribed " university's data " this A label information, and, when searching for " learning materials " this first object search, then it can obtain the of first other users collection Two collection information, such as " library.ABC.com ", " library ", " english.ABC.com ", " English materials ", " chinese.ABC.com ", " Chinese language data " etc..
The embodiment of the present invention increases label information in incidence relation, to one or more collection information, supports user By matched tag subscriptions information, and, matched object search directly obtains the information that other users formerly arranged, Since the information of manual sorting is often more more efficient than the information that search engine machinery returns, avoids user and repeat to magnanimity Webpage information carries out cumbersome artificial filter, reduces the consuming of user time and energy, decreases user equipment and website System resources consumption, decrease the occupancy of network bandwidth, substantially increase the efficiency, quality and capacity of acquisition of information.
Step 407, one or more of second collection letters are extracted from one or more of second incidence relations Breath;
In embodiments of the present invention, the second collection information can be extracted to be shared with other users.
In an alternative embodiment of the invention, step 407 may include following sub-step:
Sub-step S81 compares the second collection information in one or more of second incidence relations;
Sub-step S82 extracts one or more identical second collection information.
In the concrete realization, if the second collection information is relatively more, identical second collection information point can be extracted Enjoy active user.
Further, the embodiment of the present invention can also extract one or more frequency of occurrences most higher than preset threshold or the frequency High the second one or more collection information share active user, and the embodiments of the present invention are not limited thereto.
Step 408, one or more of second collection information are embedded in second result of page searching.
One or more second collection information will be stated to be embedded in the second result of page searching, return to browser or built-in The application program of minibrowser, and then be shown.
Step 409, second result of page searching is returned.
Step 410, when receiving the processing request for one or more first collection information, to one Or multiple first collection information configuration feature website informations.
Step 411, it when receiving the load request based on feature website information transmission, returns one or more A first collection information.
In embodiments of the present invention, the first collection information that active user is collected, can be shared with other users.
For example, user is also when riding new hand, can inquire to the bicyclist of old qualification, request recommendation is several to be ridden Passerby forum, to obtain faster, more information.
Specifically, active user can be sent out by the application program of browser or built-in miniature browser to search engine Processing request out is requested to the first collection information (such as bicyclist forum) configuration feature website information, to share other users.
Search engine can be the first collection information (such as bicyclist forum) the configuration feature website information, and return to The application program of browser or built-in miniature browser.
Active user obtains feature website information, then can pass through the approach such as mail, immediate communication tool, forum, microblogging Distribute them to other users.
Other users can obtain the first collection information of active user's collection (such as by loading feature website information Bicyclist forum).
The embodiment of the present invention is collection information configuration feature website information, and load this feature website information can then obtain The collection information directly obtains the information that other users formerly arranged, since the information of manual sorting is often drawn than search The information for holding up mechanical return is more efficient, avoids user and repeats to carry out cumbersome artificial filter to the webpage information of magnanimity, subtracts The consuming for having lacked user time and energy decreases the system resources consumption of user equipment and website, decreases Netowrk tape Wide occupancy substantially increases the efficiency, quality and capacity of acquisition of information.
For embodiment of the method, for simple description, therefore, it is stated as a series of action combinations, but this field Technical staff should be aware of, and embodiment of that present invention are not limited by the describe sequence of actions, because implementing according to the present invention Example, some steps may be performed in other sequences or simultaneously.Secondly, those skilled in the art should also know that, specification Described in embodiment belong to preferred embodiment, the actions involved are not necessarily necessary for embodiments of the present invention.
Referring to Fig. 6, a kind of processing device embodiment 1 of collection information according to an embodiment of the invention is shown Structural block diagram can specifically include following module:
First search module 601, suitable for returning and using when receiving the first searching request based on the first user identifier First object search of first searching request scans for the first result of page searching obtained;
Module 602 is established, suitable for receiving the one or more first returned by first result of page searching When collection information, establishes first user identifier, first object search and one or more first collections and believe First incidence relation of breath;
Second search module 603, suitable for when receiving the second searching request based on the first user identifier, using described The first object search in second searching request scans for, and obtains the second results page of search;
First insertion module 604, is suitable for according to first incidence relation, will be with first user identifier and described the The associated one or more first collection information of one object search are embedded in second result of page searching;
First return module 605 is adapted to return to second result of page searching.
In the concrete realization, the first collection information may include website information and title, second collection Information may include website information and title.
In an alternative embodiment of the invention, the first insertion module 604 can be adapted to:
It is searched in preset first incidence relation associated with first user identifier and first object search One or more first collection information;
One or more of first collection information are embedded in second result of page searching.
In an alternative embodiment of the invention, first search module 601 can be adapted to:
Extract the first object search in first searching request;
When first object search is text information, search for and the matched net of the text information in the database Page;The webpage has summary info;
First result of page searching is generated using the summary info of the webpage;
Return to first result of page searching.
In an alternative embodiment of the invention, first search module 601 can be adapted to:
Extract the first object search in first searching request;
When first object search is image information, identify in the database similar or identical with described image information Web page image information;
First result of page searching is generated using the Web page image information;
Return to first result of page searching.
In an alternative embodiment of the invention, first search module 601 can be adapted to:
Extract the object search in first searching request;
When first object search is audio data, the corresponding feature text information of the audio data is identified;
Search and the matched webpage of feature text information in the database;The webpage has the second summary info;
First result of page searching is generated using the summary info of the webpage;
Return to first result of page searching.
In an alternative embodiment of the invention, second search module 603 can be adapted to:
Extract the first object search in second searching request;
When first object search is text information, search for and the matched net of the text information in the database Page;The webpage has summary info;
Second result of page searching is generated using the summary info of the webpage.
In an alternative embodiment of the invention, second search module 603 can be adapted to:
Extract the first object search in second searching request;
When first object search is image information, identify in the database similar or identical with described image information Web page image information;
First result of page searching is generated using the Web page image information.
In an alternative embodiment of the invention, second search module 603 can be adapted to:
Extract the first object search in second searching request;
When first object search is audio data, the corresponding feature text information of the audio data is identified;
Search and the matched webpage of feature text information in the database;The webpage has the second summary info;
Second result of page searching is generated using the summary info of the webpage.
Referring to Fig. 7, a kind of processing device embodiment 2 of collection information according to an embodiment of the invention is shown Structural block diagram can specifically include following module:
First search module 701, suitable for returning and using when receiving the first searching request based on the first user identifier First object search of first searching request scans for the first result of page searching obtained;
Module 702 is established, suitable for receiving the one or more first returned by first result of page searching When collection information, establishes first user identifier, first object search and one or more first collections and believe First incidence relation of breath;
Second search module 703, suitable for when receiving the second searching request based on the first user identifier, using described The first object search in second searching request scans for, and obtains the second results page of search;
First insertion module 704, is suitable for according to first incidence relation, will be with first user identifier and described the The associated one or more first collection information of one object search are embedded in second result of page searching;
Increase module 705, be suitable in first incidence relation, one or more of first collection information are increased Add the first label information.
Searching module 706, be suitable for first user identifier have tag subscriptions information when, search it is matched one or Multiple second incidence relations;Second incidence relation is second user mark, the second object search and one or more second The incidence relation of collection information, one or more of second collection information have the second label information;The label is ordered Read that information is matched with second label information and/or first object search is matched with second object search;
Extraction module 707, suitable for extracting one or more of second from one or more of second incidence relations Collection information;
Second insertion module 708 is suitable for tying one or more of second collection information insertion second search In the fruit page.
First return module 709 is adapted to return to second result of page searching.
Configuration module 710 is right suitable for when receiving for one or more first collection information processing request One or more of first collection information configuration feature website informations.
Second return module 711, suitable for returning when receiving the load request based on feature website information transmission One or more of first collection information.
In an alternative embodiment of the invention, the extraction module 707 can be adapted to:
The second collection information in one or more of second incidence relations is compared;
Extract one or more identical second collection information.
For device embodiment, since it is basically similar to the method embodiment, related so being described relatively simple Place illustrates referring to the part of embodiment of the method.
Algorithm and display are not inherently related to any particular computer, virtual system, or other device provided herein. Various general-purpose systems can also be used together with teachings based herein.As described above, it constructs required by this kind of system Structure be obvious.In addition, the present invention is also not directed to any particular programming language.It should be understood that can use various Programming language realizes summary of the invention described herein, and the description done above to language-specific is to disclose this hair Bright preferred forms.
In the instructions provided here, numerous specific details are set forth.It is to be appreciated, however, that implementation of the invention Example can be practiced without these specific details.In some instances, well known method, structure is not been shown in detail And technology, so as not to obscure the understanding of this specification.
Similarly, it should be understood that in order to simplify the disclosure and help to understand one or more of the various inventive aspects, Above in the description of exemplary embodiment of the present invention, each feature of the invention is grouped together into single implementation sometimes In example, figure or descriptions thereof.However, the disclosed method should not be interpreted as reflecting the following intention: i.e. required to protect Shield the present invention claims features more more than feature expressly recited in each claim.More precisely, as following Claims reflect as, inventive aspect is all features less than single embodiment disclosed above.Therefore, Thus the claims for following specific embodiment are expressly incorporated in the specific embodiment, wherein each claim itself All as a separate embodiment of the present invention.
Those skilled in the art will understand that can be carried out adaptively to the module in the equipment in embodiment Change and they are arranged in one or more devices different from this embodiment.It can be the module or list in embodiment Member or component are combined into a module or unit or component, and furthermore they can be divided into multiple submodule or subelement or Sub-component.Other than such feature and/or at least some of process or unit exclude each other, it can use any Combination is to all features disclosed in this specification (including adjoint claim, abstract and attached drawing) and so disclosed All process or units of what method or apparatus are combined.Unless expressly stated otherwise, this specification is (including adjoint power Benefit require, abstract and attached drawing) disclosed in each feature can carry out generation with an alternative feature that provides the same, equivalent, or similar purpose It replaces.
In addition, it will be appreciated by those of skill in the art that although some embodiments described herein include other embodiments In included certain features rather than other feature, but the combination of the feature of different embodiments mean it is of the invention Within the scope of and form different embodiments.For example, in the following claims, embodiment claimed is appointed Meaning one of can in any combination mode come using.
Various component embodiments of the invention can be implemented in hardware, or to run on one or more processors Software module realize, or be implemented in a combination thereof.It will be understood by those of skill in the art that can be used in practice In the processing equipment of microprocessor or digital signal processor (DSP) to realize collection information according to an embodiment of the present invention Some or all components some or all functions.The present invention is also implemented as executing side as described herein Some or all device or device programs (for example, computer program and computer program product) of method.It is such It realizes that program of the invention can store on a computer-readable medium, or can have the shape of one or more signal Formula.Such signal can be downloaded from an internet website to obtain, and perhaps be provided on the carrier signal or with any other shape Formula provides.
It should be noted that the above-mentioned embodiments illustrate rather than limit the invention, and ability Field technique personnel can be designed alternative embodiment without departing from the scope of the appended claims.In the claims, Any reference symbol between parentheses should not be configured to limitations on claims.Word "comprising" does not exclude the presence of not Element or step listed in the claims.Word "a" or "an" located in front of the element does not exclude the presence of multiple such Element.The present invention can be by means of including the hardware of several different elements and being come by means of properly programmed computer real It is existing.In the unit claims listing several devices, several in these devices can be through the same hardware branch To embody.The use of word first, second, and third does not indicate any sequence.These words can be explained and be run after fame Claim.
The embodiment of the invention discloses A1, a kind of processing method of collection information, comprising:
When receiving the first searching request based on the first user identifier, the using first searching request is returned One object search scans for the first result of page searching obtained;
When receiving one or more the first collection information returned by first result of page searching, establish First incidence relation of first user identifier, first object search and one or more first collection information;
When receiving the second searching request based on the first user identifier, using first in second searching request Object search scans for, and obtains the second results page of search;
It, will be with first user identifier and first object search associated one according to first incidence relation Or multiple first collection information are embedded in second result of page searching;
Return to second result of page searching.
A2, method as described in a1, further includes:
In first incidence relation, the first label information is increased to one or more of first collection information.
A3, method as described in a1 or a2, the return second result of page searching the step of before, the side Method further include:
When first user identifier has tag subscriptions information, searches matched one or more second associations and close System;Second incidence relation is the pass of second user mark, the second object search and one or more second collection information Connection relationship, one or more of second collection information have the second label information;The tag subscriptions information and described the The matching of two label informations and/or first object search are matched with second object search;
One or more of second collection information are extracted from one or more of second incidence relations;
One or more of second collection information are embedded in second result of page searching.
A4, the method as described in A3, it is described extracted from one or more of second incidence relations it is one or more The step of a second collection information includes:
The second collection information in one or more of second incidence relations is compared;
Extract one or more identical second collection information.
A5, method as described in a1, further includes:
When receiving for one or more first collection information processing request, to one or more of the One collection information configuration feature website information.
A6, method as described in a5, further includes:
When receiving the load request based on feature website information transmission, returns to one or more of first and receive Hiding folder information.
A7, the method as described in A1 or A2 or A4 or A5 or A6, the first collection information includes website information and name Claim, the second collection information includes website information and title.
A8, the method as described in A1 or A2 or A4 or A5 or A6, it is described according to first incidence relation, it will be with described One user identifier and associated one or more first collection information insertion the second search knots of first object search Step in the fruit page includes:
It is searched in preset first incidence relation associated with first user identifier and first object search One or more first collection information;
One or more of first collection information are embedded in second result of page searching.
A9, the method as described in A1 or A2 or A4 or A5 or A6, first returned using first searching request Object search scan for obtain the first result of page searching the step of include:
Extract the first object search in first searching request;
When first object search is text information, search for and the matched net of the text information in the database Page;The webpage has summary info;
First result of page searching is generated using the summary info of the webpage;
Return to first result of page searching.
A10, the method as described in A1 or A2 or A4 or A5 or A6, first returned using first searching request Object search scan for obtain the first result of page searching the step of include:
Extract the first object search in first searching request;
When first object search is image information, identify in the database similar or identical with described image information Web page image information;
First result of page searching is generated using the Web page image information;
Return to first result of page searching.
A11, the method as described in A1 or A2 or A4 or A5 or A6, first returned using first searching request Object search scan for obtain the first result of page searching the step of include:
Extract the object search in first searching request;
When first object search is audio data, the corresponding feature text information of the audio data is identified;
Search and the matched webpage of feature text information in the database;The webpage has the second summary info;
First result of page searching is generated using the summary info of the webpage;
Return to first result of page searching.
A12, the method as described in A1 or A2 or A4 or A5 or A6, first using in second searching request are searched Rope object scans for, and obtains the step of searching for the second results page and includes:
Extract the first object search in second searching request;
When first object search is text information, search for and the matched net of the text information in the database Page;The webpage has summary info;
Second result of page searching is generated using the summary info of the webpage.
A13, the method as described in A1 or A2 or A4 or A5 or A6, first using in second searching request are searched Rope object scans for, and obtains the step of searching for the second results page and includes:
Extract the first object search in second searching request;
When first object search is image information, identify in the database similar or identical with described image information Web page image information;
First result of page searching is generated using the Web page image information.
A14, the method as described in A1 or A2 or A4 or A5 or A6, first using in second searching request are searched Rope object scans for, and obtains the step of searching for the second results page and includes:
Extract the first object search in second searching request;
When first object search is audio data, the corresponding feature text information of the audio data is identified;
Search and the matched webpage of feature text information in the database;The webpage has the second summary info;
Second result of page searching is generated using the summary info of the webpage.
The embodiment of the invention also discloses B15, a kind of processing unit of collection information, comprising:
First search module, suitable for returning and using institute when receiving the first searching request based on the first user identifier The first object search for stating the first searching request scans for the first result of page searching obtained;
Module is established, suitable for receiving the collection of one or more first returned by first result of page searching When pressing from both sides information, first user identifier, first object search and one or more first collection information are established First incidence relation;
Second search module, suitable for when receiving the second searching request based on the first user identifier, using described The first object search in two searching requests scans for, and obtains the second results page of search;
First insertion module, is suitable for according to first incidence relation, will be with first user identifier and described first The associated one or more first collection information of object search are embedded in second result of page searching;
First return module is adapted to return to second result of page searching.
B16, the device as described in B15, further includes:
Increase module, be suitable in first incidence relation, one or more of first collection information are increased First label information.
B17, the device as described in B15 or B16, further includes:
Searching module is suitable for searching matched one or more when first user identifier has tag subscriptions information A second incidence relation;Second incidence relation is that second user mark, the second object search and one or more second are received The incidence relation of hiding folder information, one or more of second collection information have the second label information;The tag subscriptions Information is matched with second label information and/or first object search is matched with second object search;
Extraction module, suitable for extracting one or more of second collections from one or more of second incidence relations Press from both sides information;
Second insertion module, is suitable for one or more of second collection information being embedded in second search results pages In face.
B18, the device as described in B17, the extraction module are further adapted for:
The second collection information in one or more of second incidence relations is compared;
Extract one or more identical second collection information.
B19, the device as described in B15, further includes:
Configuration module, suitable for when receiving for one or more first collection information processing request, to institute State one or more first collection information configuration feature website informations.
B20, the device as described in B19, further includes:
Second return module, suitable for returning to institute when receiving the load request based on feature website information transmission State one or more first collection information.
B21, the device as described in B15 or B16 or B18 or B19 or B20, the first collection information include network address letter Breath and title, the second collection information includes website information and title.
B22, the device as described in B15 or B16 or B18 or B19 or B20, the first insertion module are further adapted for:
It is searched in preset first incidence relation associated with first user identifier and first object search One or more first collection information;
One or more of first collection information are embedded in second result of page searching.
B23, the device as described in B15 or B16 or B18 or B19 or B20, first search module are further adapted for:
Extract the first object search in first searching request;
When first object search is text information, search for and the matched net of the text information in the database Page;The webpage has summary info;
First result of page searching is generated using the summary info of the webpage;
Return to first result of page searching.
B24, the device as described in B15 or B16 or B18 or B19 or B20, first search module are further adapted for:
Extract the first object search in first searching request;
When first object search is image information, identify in the database similar or identical with described image information Web page image information;
First result of page searching is generated using the Web page image information;
Return to first result of page searching.
B25, the device as described in B15 or B16 or B18 or B19 or B20, first search module are further adapted for:
Extract the object search in first searching request;
When first object search is audio data, the corresponding feature text information of the audio data is identified;
Search and the matched webpage of feature text information in the database;The webpage has the second summary info;
First result of page searching is generated using the summary info of the webpage;
Return to first result of page searching.
B26, the device as described in B15 or B16 or B18 or B19 or B20, second search module are further adapted for:
Extract the first object search in second searching request;
When first object search is text information, search for and the matched net of the text information in the database Page;The webpage has summary info;
Second result of page searching is generated using the summary info of the webpage.
B27, the device as described in B15 or B16 or B18 or B19 or B20, first search module are further adapted for:
Extract the first object search in second searching request;
When first object search is image information, identify in the database similar or identical with described image information Web page image information;
First result of page searching is generated using the Web page image information.
B28, the device as described in B15 or B16 or B18 or B19 or B20, first search module are further adapted for:
Extract the first object search in second searching request;
When first object search is audio data, the corresponding feature text information of the audio data is identified;
Search and the matched webpage of feature text information in the database;The webpage has the second summary info;
Second result of page searching is generated using the summary info of the webpage.

Claims (24)

1. a kind of processing method of collection information, comprising:
When receiving the first searching request based on the first user identifier, returns and searched using the first of first searching request Rope object scans for the first result of page searching obtained, includes the first user identifier and first in first searching request Object search;
When receiving one or more the first collection information returned by first result of page searching, described in foundation First incidence relation of the first user identifier, first object search and one or more of first collection information;
When receiving the second searching request based on the first user identifier, using the first search in second searching request Object scans for, and obtains the second results page of search;It include that the first user identifier and first is searched in second searching request Rope object;
According to first incidence relation, lookup simultaneously will be with first user identifier and first object search associated one A or multiple first collection information are embedded in second result of page searching;
Return to second result of page searching;
When receiving the processing request for one or more of first collection information, to one or more of first Collection information configuration feature website information;
When receiving the load request based on feature website information transmission, one or more of first collections are returned Information.
2. the method as described in claim 1, which is characterized in that further include:
In first incidence relation, the first label information is increased to one or more of first collection information.
3. method according to claim 1 or 2, which is characterized in that in the step for returning to second result of page searching Before rapid, the method also includes:
When first user identifier has tag subscriptions information, matched one or more second incidence relations are searched;Institute Stating the second incidence relation is that second user identifies, the second object search is associated with one or more second collection information System, one or more of second collection information have the second label information;The tag subscriptions information and second mark Label information matches and/or first object search are matched with second object search;
One or more of second collection information are extracted from one or more of second incidence relations;
One or more of second collection information are embedded in second result of page searching.
4. method as claimed in claim 3, which is characterized in that described to be extracted from one or more of second incidence relations The step of one or more of second collection information includes:
The second collection information in one or more of second incidence relations is compared;
Extract one or more identical second collection information.
5. method as claimed in claim 4, which is characterized in that the first collection information includes website information and title, The second collection information includes website information and title.
6. the method as described in claims 1 or 2 or 4, which is characterized in that it is described according to first incidence relation, it will be with institute It states the first user identifier and the associated one or more first collection information insertions described second of first object search is searched Step in rope results page includes:
It is searched and first user identifier and first object search associated one in preset first incidence relation Or multiple first collection information;
One or more of first collection information are embedded in second result of page searching.
7. the method as described in claims 1 or 2 or 4, which is characterized in that the returned using first searching request One object search scan for obtain the first result of page searching the step of include:
Extract the first object search in first searching request;
When first object search is text information, search for and the matched webpage of the text information in the database;Institute Webpage is stated with summary info;
First result of page searching is generated using the summary info of the webpage;
Return to first result of page searching.
8. the method as described in claims 1 or 2 or 4, which is characterized in that the returned using first searching request One object search scan for obtain the first result of page searching the step of include:
Extract the first object search in first searching request;
When first object search is image information, identify and the similar or identical net of described image information in the database Page image information;
First result of page searching is generated using the Web page image information;
Return to first result of page searching.
9. the method as described in claims 1 or 2 or 4, which is characterized in that the returned using first searching request One object search scan for obtain the first result of page searching the step of include:
Extract the object search in first searching request;
When first object search is audio data, the corresponding feature text information of the audio data is identified;
Search and the matched webpage of feature text information in the database;The webpage has the second summary info;
First result of page searching is generated using the summary info of the webpage;
Return to first result of page searching.
10. the method as described in claims 1 or 2 or 4, which is characterized in that using in second searching request One object search scans for, and obtains the step of searching for the second results page and includes:
Extract the first object search in second searching request;
When first object search is text information, search for and the matched webpage of the text information in the database;Institute Webpage is stated with summary info;
Second result of page searching is generated using the summary info of the webpage.
11. the method as described in claims 1 or 2 or 4, which is characterized in that using in second searching request One object search scans for, and obtains the step of searching for the second results page and includes:
Extract the first object search in second searching request;
When first object search is image information, identify and the similar or identical net of described image information in the database Page image information;
First result of page searching is generated using the Web page image information.
12. the method as described in claims 1 or 2 or 4, which is characterized in that using in second searching request One object search scans for, and obtains the step of searching for the second results page and includes:
Extract the first object search in second searching request;
When first object search is audio data, the corresponding feature text information of the audio data is identified;
Search and the matched webpage of feature text information in the database;The webpage has the second summary info;
Second result of page searching is generated using the summary info of the webpage.
13. a kind of processing unit of collection information, comprising:
First search module is returned using described the suitable for when receiving the first searching request based on the first user identifier First object search of one searching request scans for the first result of page searching obtained, includes in first searching request First user identifier and the first object search;
Module is established, suitable for receiving the first collection of the one or more letter returned by first result of page searching When breath, establish first user identifier, first object search and one or more of first collection information the One incidence relation;
Second search module, suitable for being searched using described second when receiving the second searching request based on the first user identifier The first object search in rope request scans for, and obtains the second results page of search;It include the in second searching request One user identifier and the first object search;
First insertion module, is suitable for according to first incidence relation, searches and will be with first user identifier and described the The associated one or more first collection information of one object search are embedded in second result of page searching;
First return module is adapted to return to second result of page searching;
Configuration module, suitable for receive for one or more of first collection information processing request when, to described One or more first collection information configuration feature website informations;
Second return module, suitable for returning to described one when receiving the load request based on feature website information transmission A or multiple first collection information.
14. device as claimed in claim 13, which is characterized in that further include:
Increase module, be suitable in first incidence relation, first is increased to one or more of first collection information Label information.
15. device according to claim 13 or 14, which is characterized in that further include:
Searching module is suitable for searching matched one or more the when first user identifier has tag subscriptions information Two incidence relations;Second incidence relation is second user mark, the second object search and one or more second collections The incidence relation of information, one or more of second collection information have the second label information;The tag subscriptions information It is matched with second label information and/or first object search is matched with second object search;
Extraction module, suitable for extracting one or more of second collection letters from one or more of second incidence relations Breath;
Second insertion module, is suitable for one or more of second collection information being embedded in second result of page searching In.
16. device as claimed in claim 15, which is characterized in that the extraction module is further adapted for:
The second collection information in one or more of second incidence relations is compared;
Extract one or more identical second collection information.
17. device as claimed in claim 15, which is characterized in that the first collection information includes website information and name Claim, the second collection information includes website information and title.
18. the device as described in claim 13 or 14 or 16, which is characterized in that the first insertion module is further adapted for:
It is searched and first user identifier and first object search associated one in preset first incidence relation Or multiple first collection information;
One or more of first collection information are embedded in second result of page searching.
19. the device as described in claim 13 or 14 or 16, which is characterized in that first search module is further adapted for:
Extract the first object search in first searching request;
When first object search is text information, search for and the matched webpage of the text information in the database;Institute Webpage is stated with summary info;
First result of page searching is generated using the summary info of the webpage;
Return to first result of page searching.
20. the device as described in claim 13 or 14 or 16, which is characterized in that first search module is further adapted for:
Extract the first object search in first searching request;
When first object search is image information, identify and the similar or identical net of described image information in the database Page image information;
First result of page searching is generated using the Web page image information;
Return to first result of page searching.
21. the device as described in claim 13 or 14 or 16, which is characterized in that first search module is further adapted for:
Extract the object search in first searching request;
When first object search is audio data, the corresponding feature text information of the audio data is identified;
Search and the matched webpage of feature text information in the database;The webpage has the second summary info;
First result of page searching is generated using the summary info of the webpage;
Return to first result of page searching.
22. the device as described in claim 13 or 14 or 16, which is characterized in that second search module is further adapted for:
Extract the first object search in second searching request;
When first object search is text information, search for and the matched webpage of the text information in the database;Institute Webpage is stated with summary info;
Second result of page searching is generated using the summary info of the webpage.
23. the device as described in claim 13 or 14 or 16, which is characterized in that second search module is further adapted for:
Extract the first object search in second searching request;
When first object search is image information, identify and the similar or identical net of described image information in the database Page image information;
First result of page searching is generated using the Web page image information.
24. the device as described in claim 13 or 14 or 16, which is characterized in that second search module is further adapted for:
Extract the first object search in second searching request;
When first object search is audio data, the corresponding feature text information of the audio data is identified;
Search and the matched webpage of feature text information in the database;The webpage has the second summary info;
Second result of page searching is generated using the summary info of the webpage.
CN201410784236.9A 2014-12-16 2014-12-16 A kind for the treatment of method and apparatus of collection information Active CN104484414B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410784236.9A CN104484414B (en) 2014-12-16 2014-12-16 A kind for the treatment of method and apparatus of collection information

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410784236.9A CN104484414B (en) 2014-12-16 2014-12-16 A kind for the treatment of method and apparatus of collection information

Publications (2)

Publication Number Publication Date
CN104484414A CN104484414A (en) 2015-04-01
CN104484414B true CN104484414B (en) 2018-12-28

Family

ID=52758955

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410784236.9A Active CN104484414B (en) 2014-12-16 2014-12-16 A kind for the treatment of method and apparatus of collection information

Country Status (1)

Country Link
CN (1) CN104484414B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DK3283993T3 (en) * 2015-04-14 2022-02-14 Mandometer Ab A PROBABILITY-, CONTEXT-FREE GRAMMAR FOR FOOD INGESTION
CN107666431B (en) * 2016-07-29 2021-01-15 腾讯科技(深圳)有限公司 Bookmark communication message acquisition method and device

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102722481A (en) * 2011-03-29 2012-10-10 阿里巴巴集团控股有限公司 Processing method and searching method for user favorite data
CN103064851A (en) * 2011-10-20 2013-04-24 阿里巴巴集团控股有限公司 Information search method and information search device of website content
CN103186666A (en) * 2013-03-01 2013-07-03 北京百度网讯科技有限公司 Method, device and equipment for searching based on favorites
CN103246746A (en) * 2013-05-23 2013-08-14 百度在线网络技术(北京)有限公司 Method, device and system for searching information

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102722481A (en) * 2011-03-29 2012-10-10 阿里巴巴集团控股有限公司 Processing method and searching method for user favorite data
CN103064851A (en) * 2011-10-20 2013-04-24 阿里巴巴集团控股有限公司 Information search method and information search device of website content
CN103186666A (en) * 2013-03-01 2013-07-03 北京百度网讯科技有限公司 Method, device and equipment for searching based on favorites
CN103246746A (en) * 2013-05-23 2013-08-14 百度在线网络技术(北京)有限公司 Method, device and system for searching information

Also Published As

Publication number Publication date
CN104484414A (en) 2015-04-01

Similar Documents

Publication Publication Date Title
CN106776503B (en) Text semantic similarity determination method and device
US20150019586A1 (en) System and method for sharing tagged multimedia content elements
US8577882B2 (en) Method and system for searching multilingual documents
CN108334533A (en) keyword extracting method and device, storage medium and electronic device
CN113590850A (en) Multimedia data searching method, device, equipment and storage medium
CN108334489B (en) Text core word recognition method and device
CN104090929A (en) Recommendation method and device of personalized picture
CN109614504A (en) A kind of management system and method for internet electronic book
CN106960030A (en) Pushed information method and device based on artificial intelligence
CN103116635B (en) Field-oriented method and system for collecting invisible web resources
US20140040232A1 (en) System and method for tagging multimedia content elements
WO2009061420A1 (en) Object recognition and database population
CN106383875A (en) Artificial intelligence-based man-machine interaction method and device
CN103440243A (en) Teaching resource recommendation method and device thereof
CN113806588B (en) Method and device for searching video
US10372746B2 (en) System and method for searching applications using multimedia content elements
CN107766234A (en) A kind of assessment method, the apparatus and system of the webpage health degree based on mobile device
CN103226601B (en) A kind of method and apparatus of picture searching
CN113038153A (en) Financial live broadcast violation detection method, device and equipment and readable storage medium
US11176209B2 (en) Dynamically augmenting query to search for content not previously known to the user
US9454568B2 (en) Method, apparatus and computer storage medium for acquiring hot content
CN105159898B (en) A kind of method and apparatus of search
CN104484414B (en) A kind for the treatment of method and apparatus of collection information
KR20200013843A (en) System and method for providing product manual based on chatbot
US20130230248A1 (en) Ensuring validity of the bookmark reference in a collaborative bookmarking system

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
TR01 Transfer of patent right
TR01 Transfer of patent right

Effective date of registration: 20220718

Address after: Room 801, 8th floor, No. 104, floors 1-19, building 2, yard 6, Jiuxianqiao Road, Chaoyang District, Beijing 100015

Patentee after: BEIJING QIHOO TECHNOLOGY Co.,Ltd.

Address before: 100088 room 112, block D, 28 new street, new street, Xicheng District, Beijing (Desheng Park)

Patentee before: BEIJING QIHOO TECHNOLOGY Co.,Ltd.

Patentee before: Qizhi software (Beijing) Co.,Ltd.