CN103246726B - Method, device and system for searching network information - Google Patents
Method, device and system for searching network information Download PDFInfo
- Publication number
- CN103246726B CN103246726B CN201310169964.4A CN201310169964A CN103246726B CN 103246726 B CN103246726 B CN 103246726B CN 201310169964 A CN201310169964 A CN 201310169964A CN 103246726 B CN103246726 B CN 103246726B
- Authority
- CN
- China
- Prior art keywords
- triggering
- item
- data
- search
- data source
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Abstract
The invention discloses a method, a device and a system for searching network information. The method includes: using a preset mapping rule to match a trigger item corresponding to key search data when key search data from a request end is received; using the matched trigger item to search a trigger file and acquire a data source where a search result corresponding to key search data is; and acquiring the search result from the acquired data source, and returning the search result to the request end; wherein the trigger item is extracted from key search data used in a network, the trigger file is generated by the trigger item and related data source position information, and the search result is generated by collecting and integrating network information containing the trigger item in advance.
Description
Technical field
The present invention relates to Internet technical field, more particularly to a kind of searching method of the network information, device and system.
Background technology
With the popularization of Internet technology, internet has been one of main source that current user obtains information.Internet
In be stored with the network data of magnanimity, user can pass through search engine and the required network information is obtained from internet.
Prior art provide information search scheme in, user can search engine provide entrance in input inquiry
Word, search engine captures in a network information according to the query word, and Search Results are back to into user by webpage.
However, at least there is following defect in the information search scheme that prior art is provided:
Existing scheme depends on search engine real-time crawl in a network when Search Results are obtained, but search is drawn
The ability for holding up this real-time grasping manipulation is extremely limited, and the information content for grabbing in real time every time is less, content is also incomplete, user
Needs click on the peer link in the webpage for returning, and search operation is performed repeatedly, and longitudinal accession page layer by layer searches searching for needs
Hitch fruit.
For example, if user accesses a video, search results pages only occur associated video, lack the details letter of correlation
Breath, user is if necessary to inquire about, in addition it is also necessary to further to access other webpages or carry out further other operations etc., from
And cause that search time is long, searching results accuracy is poor, and due to needing to process a large amount of access requests, cause to search plain engine
Data grabber pressure is also larger, data providing data processing load is heavier.
The content of the invention
In view of the above problems, it is proposed that the present invention so as to provide one kind overcome the problems referred to above or at least in part solve on
State searching method, device and the system of the network information of problem.
According to one aspect of the present invention, a kind of searching method of the network information is embodiments provided, including:
It is crucial using the matching of default mapping ruler and the search when receiving from the search critical data of request end
The corresponding triggering item of data, the triggering item is that the search critical data used in network is carried out extracting what is obtained;
Using the triggering item querying triggering file for matching, the number that the corresponding Search Results of search critical data are located is known
According to source, the triggering file is generated by triggering item and associated data source location information;
Search Results are obtained from the data source known, the Search Results are back to into request end, the Search Results are by pre-
First the network information including triggering item is collected and is integrated and is generated.
Wherein, it is above-mentioned to be included using default mapping ruler matching triggering item corresponding with the search critical data:Utilize
Default natural language processing analysis rule matches triggering item corresponding with search critical data;And/or, using default canonical
Expression formula rule match triggering item corresponding with search critical data.
Wherein, above-mentioned triggering item is to carry out extracting including of obtaining to the search critical data used in network:According to searching
The usage frequency and/or attention rate grade of rope critical data extracts triggering item from search critical data, wherein, the crucial number of search
According to usage frequency and/or attention rate higher grade, at least part of data in the search critical data be chosen for trigger item
Probability it is bigger.
Wherein, mentioned above searching results are collected by the network information in advance to including triggering item and are integrated and generate bag
Include:
Captured in a network using web crawlers, collection includes triggering the network information of item, removes the net collected
Identical data in network information, and many item datas of identical meanings are merged into by an item data using normalization mode;And/or,
The data-interface provided from partner obtains the network information for including triggering item, removes the identical number in the network information for getting
According to, and many item datas of identical meanings are merged into by an item data using normalization mode.
Wherein, above-mentioned triggering file is by including that triggering item and associated data source location information are generated:For each
Triggering item configures one or more type attributes;By each triggering item under affiliated each type attribute with corresponding data source
The association of positional information, generates triggering file.
Wherein, it is above-mentioned to be included using default mapping ruler matching triggering item corresponding with the search critical data:Utilize
The type attribute of default mapping ruler matching triggering item corresponding with the search critical data and the triggering item;
It is above-mentioned to utilize the triggering item querying triggering file for matching, know that the corresponding Search Results of search critical data are located
Data source include:Using the type attribute querying triggering file of the triggering item and the triggering item for matching, know that search is crucial
One or more data sources that the corresponding Search Results of data are located.
Wherein, data source location information for data source uniform resource position mark URL, and/or, data source location information by
MD5 value of the triggering item under affiliated type attribute is generated.
Wherein, it is above-mentioned to be included using default mapping ruler matching triggering item corresponding with the search critical data:Utilize
The type attribute of default mapping ruler matching triggering item corresponding with the search critical data and the triggering item;
Above-mentioned to obtain Search Results from the data source known, the Search Results are back to into request end includes:
The corresponding Search Results of triggering item for matching are obtained from the data source known, and according to the triggering item for matching
Type attribute the display state of each data division in the Search Results that get is set and shows grade, by Search Results with
And the display state of each data division and displaying grade are back to request end in Search Results.
Wherein, the type attribute of the triggering item that above-mentioned basis is matched arranges each data portion in the Search Results for getting
The display state and displaying grade for dividing includes:
The display state of the corresponding data division of type attribute of the triggering item for matching is set to show, shows grade
It is set to the first estate;The display state of the corresponding data division of type attribute of the triggering item not matched is set to hide
Or pack up, show that grade is set to the second grade;Wherein, the first estate is higher than the second grade.
Wherein, above-mentioned to obtain Search Results from the data source known, the Search Results are back to into request end includes:When
When there are no corresponding Search Results at least one data source known, the crawl in real time from data source server includes touching
The network information of item is sent out, is recorded the network information as the corresponding Search Results of corresponding triggering item in data source, and should
Search Results are back to request end.
Wherein, said method also includes:It is crucial according to the search when receiving from the search critical data of request end
Data carry out in real time in a network the crawl of info web, obtain capturing result;Supplement of the result as Search Results will be captured
Information, is back to request end after merging with Search Results.
According to a further aspect in the invention, a kind of searcher of the network information is embodiments provided, including:
Communication interface, is suitable to receive the search critical data from request end, and, the Search Results for getting are returned
To request end;
Adaptation, is suitable to match triggering item corresponding with the search critical data, the triggering using default mapping ruler
Item is that the search critical data used in network is carried out extracting what is obtained;
Trigger, is suitable to using the triggering item querying triggering file for matching, and knows the corresponding search of search critical data
As a result the data source being located, the triggering file is generated by triggering item and associated data source location information;
Getter, is suitable to obtain Search Results from the data source known, the Search Results are by advance to including triggering item
The network information be collected and integrate and generate.
Wherein, adaptation, is suitable to corresponding with search critical data using the matching of default natural language processing analysis rule
Triggering item, and/or, using default regular expression rule match and the search corresponding triggering item of critical data;
Wherein, above-mentioned triggering item is crucial from search according to the usage frequency and/or attention rate grade of search critical data
What extracting data was obtained, the usage frequency and/or attention rate grade of above-mentioned search critical data is higher, the search critical data
In at least part of data be chosen for trigger item probability it is bigger.
Wherein, each triggering item is configured with one or more type attributes, and triggering file triggers item affiliated by by each
Each type attribute under with corresponding data source location information associate and generate,
Adaptation, is suitable to using default mapping ruler matching triggering item corresponding with the search critical data and the triggering
The type attribute of item;
Trigger, is suitable to the type attribute querying triggering file using the triggering item and the triggering item for matching, and knows and searches
One or more data sources that the corresponding Search Results of rope critical data are located.
Wherein, data source location information for data source uniform resource position mark URL, and/or, data source location information by
MD5 value of the triggering item under affiliated type attribute is generated.
Wherein, each triggering item is configured with one or more type attributes, and triggering file triggers item affiliated by by each
Each type attribute under with corresponding data source location information associate and generate,
Adaptation, is suitable to using default mapping ruler matching triggering item corresponding with the search critical data and the triggering
The type attribute of item;
Trigger, is suitable to from the data source known obtain the corresponding Search Results of triggering item for matching, and according to
The type attribute of the triggering item allotted arranges the display state of each data division in the Search Results for getting and shows grade;
Communication interface, is suitable to that Search Results are back to into request end according to the display state and displaying grade of Search Results.
Wherein, trigger, is further adapted for the display state of the corresponding data division of type attribute of the triggering item that will be matched
It is set to show, shows that grade is set to the first estate;By the corresponding data division of type attribute of the triggering item not matched
Display state be set to hide or pack up, show that grade is set to the second grade;Wherein, the first estate is higher than the second grade.
Wherein, getter, is suitable to when there are no corresponding Search Results at least one data source known, from data
Crawl in real time in origin server includes triggering the network information of item, using the network information as the corresponding search of corresponding triggering item
As a result record in data source, and indicate that the Search Results are back to request end by communication interface.
According to another aspect of the invention, a kind of search system of the network information is embodiments provided, including:Such as
The searcher and cache database of the above-mentioned network information.
Cache database, is suitable to storage and is collected by the network information in advance to including triggering item and is integrated and generate
Search Results.
The searcher of the network information, is suitable to obtain Search Results from cache database.
Wherein, said system also includes crawl server, is suitable to when receiving from the search critical data of request end,
The crawl of info web is carried out in real time in the data source server of storage corresponding web page information according to the search critical data,
Obtain capturing result, the crawl result is respectively sent to into the searcher and cache database of the network information;
The searcher of the network information, is suitable to crawl result as the side information of Search Results, closes with Search Results
And after be back to request end;
Cache database, is suitable to merge to be stored in by crawl result accordingly trigger in the corresponding Search Results of item.
Wherein, said system also includes Data Collection integrated service device, is suitable to be grabbed in a network using web crawlers
Take, collection includes triggering the network information of item, removes the identical data in the network information collected, and using normalization mode
Many item datas of identical meanings are merged into into an item data;And/or, the data-interface provided from partner is obtained to be included triggering item
The network information, remove the identical data in the network information that gets, and using normalization mode by the multinomial of identical meanings
Data merge into an item data.
Wherein, above-mentioned cache database is realized by data snapshot memory access.
According to another aspect of the invention, a kind of searching method of the network information is embodiments provided, including:
When receiving from the medicine search critical data of request end, using the matching of default mapping ruler and the medicine
The corresponding triggering item of search critical data, the triggering item is to carry out extraction to the medicine search critical data used in network to obtain
's;
Using the triggering item querying triggering file for matching, know that the corresponding Search Results of medicine search critical data are located
Medical Data source, the triggering file is generated by triggering item and associated Medical Data source location information;
Search Results are obtained from the Medical Data source known, the Search Results request end is back to into, the Search Results
It is collected by the network information in advance to including triggering item and is integrated and generates.
According to another aspect of the invention, a kind of searcher of the network information is embodiments provided, including:
Communication interface, is suitable to receive the medicine search critical data from request end, and, by the Search Results for getting
It is back to request end;
Adaptation, is suitable to using default mapping ruler matching triggering item corresponding with the medicine search critical data, should
Triggering item is that the medicine search critical data used in network is carried out extracting what is obtained;
Trigger, is suitable to using the triggering item querying triggering file for matching, and knows that medicine search critical data is corresponding
The Medical Data source that Search Results are located, the triggering file is generated by triggering item and associated Medical Data source location information
's;
Getter, is suitable to obtain Search Results from the Medical Data source known, the Search Results are by advance to including tactile
The network information for sending out item is collected and integrates and generate.
From the above mentioned, search rate is high, the data that demand degree is high are used as triggering item by choosing in advance for the embodiment of the present invention,
And valuable information in network is collected and is integrated, obtain the detailed Search Results for including triggering item, then performing
During information search, Search Results corresponding with the triggering item that request end matches can be returned directly to request end.
Because the advance Search Results integrated can include and trigger the various detailed information that item is associated, request end passes through
The Search Results can get the information for needing search, so as to simplify search operation, shorten search time, improve and search
The accuracy of hitch fruit, also, the quantity of access request is sent due to significantly reducing request end, this programme is greatly reduced to be searched
Index holds up the pressure for capturing vertical data in a network, alleviates the burden of data providing.
Described above is only the general introduction of technical solution of the present invention, in order to better understand the technological means of the present invention,
And can be practiced according to the content of specification, and in order to allow the above and other objects of the present invention, feature and advantage can
Become apparent, below especially exemplified by the specific embodiment of the present invention.
Description of the drawings
By the detailed description for reading hereafter preferred embodiment, various other advantages and benefit is common for this area
Technical staff will be clear from understanding.Accompanying drawing is only used for illustrating the purpose of preferred embodiment, and is not considered as to the present invention
Restriction.And in whole accompanying drawing, it is denoted by the same reference numerals identical part.In the accompanying drawings:
Fig. 1 shows the search system structural representation of the network information according to an embodiment of the invention;
Fig. 2 shows a kind of structural representation of the searcher of network information according to an embodiment of the invention;With
And
Fig. 3 shows the result of page searching screenshotss signal returned to client according to an embodiment of the invention
Figure;
Fig. 4 shows that another result of page searching screenshots returned to client according to an embodiment of the invention show
It is intended to.
Specific embodiment
The exemplary embodiment of the disclosure is more fully described below with reference to accompanying drawings.Although showing the disclosure in accompanying drawing
Exemplary embodiment, it being understood, however, that may be realized in various forms the disclosure and should not be by embodiments set forth here
Limited.On the contrary, there is provided these embodiments are able to be best understood from the disclosure, and can be by the scope of the present disclosure
Complete conveys to those skilled in the art.
Existing Search Results represent in the page, only can show the link comprising query word and some simple informations, user
If clicking certain link, need to jump to third-party website acquisition or check information etc., this processing mode is brought
Search time is long, searching results accuracy is poor and network lateral pressure is larger problem etc..For these problems, this programme
Data are collected and are integrated in advance, from third party's (data providing) valuable information is got in advance, directly thrown in
To on the result of page searching of the system server, request end is back to.This programme can apply such as search for Medical Data,
In polytype data search scenes such as education information, digital data, automobile or consumer industry data, below by each
Embodiment is described in detail to this programme.
A kind of search system 100 of network information that one embodiment of the invention is provided, referring to Fig. 1, including the network information
Searcher 110, cache database 120, Data Collection integrated service device 130, crawl server 140.Illustrate in Fig. 1
Client and data origin server.Illustrate separately below.
Cache database 120 is suitable to store and is collected by the network information in advance to including triggering item and is integrated and generate
Search Results.
Exemplary, cache database 120 can be realized by data snapshot memory access.I.e. the present embodiment adopts data snapshot
Mechanism come store uncorrected data or HTML (HyperText Markup Language, the HTML) data of webpage with
And XML (ExtensibleMarkupLanguage, extensible markup language) data structure information etc., carried out using data snapshot
The mode of storage has the advantages that access speed is fast, is easy to show.
The searcher 110 of the network information is suitable to obtain Search Results from cache database 120.The search dress of the network information
Putting 110 includes communication interface 114, adaptation 113, trigger 112 and getter 111.The tool of the searcher 110 of the network information
Body structure and the method for operation are illustrated in other embodiments of the invention.
Wherein, said system 100 also includes Data Collection integrated service device 130.The Data Collection integrated service device 130 can
To get Search Results by following at least one modes:
Mode one, Data Collection integrated service device 130 are suitable to be captured in a network using web crawlers, and collection includes
The network information of triggering item, removes the identical data in the network information collected, and using normalization mode by identical meanings
Many item datas merge into an item data.Under this mode, Data Collection integrated service device 130 will can be collected from not
Unified form is converted to the network information of website or webpage, is easy to storage and follow-up process.
Specifically, web crawlers is in triggering item list according to storing and triggering item (such as trigger word) corresponding URL, to number
According to web data corresponding with URL is captured in origin server, web data can be analyzed and taken pictures after crawl, being formed should
The corresponding data snapshot of webpage.The corresponding trigger words of the URL are included in the data snapshot, using the data snapshot as the trigger word
Corresponding Search Results, associated storage is in cache database 120 together with the search terms.
The searcher 110 of Data Collection integrated service device 130 or the network information can be according to knowing in the present embodiment
Each triggering item, the corresponding type attribute of triggering item and data source (such as data snapshot memory access) positional information are associated, raw
Into triggering file, and the triggering file is stored in the searcher 110 of the network information, so that the searcher of the network information
110 obtain data message automatically according to this triggering file in cache database 120.Key-value pair can be adopted under a kind of mode
(key-value) form, using data source location information as key, using key value is navigated to, from the corresponding numbers of value
The corresponding solid data of Search Results is obtained out according to source.
Mode two, Data Collection integrated service device 130 obtain the net for including triggering item from the data-interface that partner provides
Network information, removes the identical data in the network information that gets, and using normalization mode by many item datas of identical meanings
Merge into an item data.Under this mode, Data Collection integrated service device 130 can obtain the XML data knot of partner's offer
Structure information, merges, duplicate removal and normalized etc. according to the data structure information to the network information for getting.
When using normalization mode, for example, for the data with multiple titles, a such as item data has formal name
Title, the pet name, English name and other multiple common names, this multiple title substantially have identical implication, then will be many by this
The data comprising triggering item that individual title is collected respectively merge into an item data, using the data after merging as the triggering item
Search Results.
Data Collection integrated service device 130 can in advance choose triggering item (such as trigger word), Data Collection integrated service device 130
In advance the search critical data used in network can be collected and stored to database, when selection operation is performed, from the number
Search critical data is extracted according to storehouse.Triggering item is that the search used request end in network (such as the user of client-side) is closed
Key data carries out extracting what is obtained, and a kind of extracting mode can be:According to the usage frequency and/or attention rate of search critical data
Grade extracts triggering item from search critical data, when the usage frequency and/or attention rate grade of search critical data are higher, is somebody's turn to do
The probability that at least part of data in search critical data are chosen for triggering item is bigger.Above-mentioned usage frequency can be by net
The access times of critical data are searched in network to carry out statistics and obtains, and above-mentioned attention rate grade can pass through request end feedback etc.
Level evaluation information is obtained.Data Collection integrated service device 130 can (and the triggering item be corresponding by the triggering item for selecting
URL) store into triggering item list.
Different Search Results, or a triggering item tool occurs in different scenes in view of identical triggering item
There are multiple implications, the triggering item correspondence volume Search Results are also different under different implications, in order to improve the accuracy of Search Results,
Data Collection integrated service device 130 can arrange one or more type attributes for triggering item, collect and integrate triggering item respectively and exist
Search Results under each type attribute, so that Search Results have higher precision, disclosure satisfy that special scenes or spy
Determine the search need under implication.
Wherein, said system 100 also includes that crawl server 140 is suitable to receiving the search key number from request end
According to when, according to the search critical data storage corresponding web page information data source server in carry out info web in real time
Crawl, obtains capturing result, and the crawl result is respectively sent to into the searcher and cache database of the network information.
The present embodiment can adopt the mechanism of the parallel search of searcher 110 of crawl server 140 and the network information.Often
After the searcher 110 of the network information gets search critical data, while the search critical data is distributed to into crawl clothes
Business device 140, by the crawl server 140 directly access outside data source server, obtain crawl result.Meanwhile, network
The searcher 110 of information obtains Search Results from cache database 120.The searcher 110 pairs of the network information is from caching
The Search Results obtained in database 120 and the crawl result obtained in crawl server 140 are merged.That is the network information
Searcher 110 be suitable to will crawl result as the side information of Search Results, request is back to after merging with Search Results
End.
Choose whether as needed using the crawl result of crawl server 140 as pre- in cache database 120
The supplement of the Search Results first integrated, when needed, crawl server 140 sends the crawl result for grabbing to data cached
Storehouse 120, cache database 120 will capture result merging and be stored in the corresponding Search Results of corresponding triggering item.
From the above mentioned, the embodiment of the present invention is by integrating in advance third-party information, and carries out classification analysis to triggering item,
By the information of triggering item corresponding subdivision under respective type attribute, there is provided to request end such that it is able to improve data search
Accuracy, shortens search time, and can reduce the pressure of partner's data, services, alleviates the vertical data of web crawlers
Crawl pressure, allow request end (such as user) can directly return result of page searching in get oneself required for letter
Breath, realizes rapider, accurate, polynary data search, meets user's request.
Another embodiment of the invention provides a kind of searcher of the network information, referring to Fig. 2, including communication interface
114th, adaptation 113, trigger 112 and getter 111.
Communication interface 114 is suitable to receive the search critical data from request end, and, the Search Results for getting are returned
It is back to request end.Search Results are returned to request end by communication interface 114 in the form of a web page.Referring to Fig. 3, it is shown that triggering item is
When " coronary heart disease ", the result of page searching screenshotss schematic diagram that communication interface is returned to client.The search knot shown in the webpage
Fruit is provided with three display boxes, includes in a display box " general introduction ", " cause of disease ", " symptom ", " diet ", " in advance of coronary heart disease
It is anti-", " treatment ", " inspections ", " diagnosing examination " and " complication " much information, wherein " general introduction " is partly set to dispaly state,
Other items are collapsed state;Another display box is the information related to " coronary heart disease _ look for hospital ";It is in another display box
The information related to " coronary heart disease _ look for expert ".
Referring to Fig. 4, the screenshotss exemplary plot of another search result web page provided for the present embodiment, it is provided with the webpage
Two display boxes, show in more detail the letter being associated with " Chinese People's Liberation Army General Hospital " and section office in hospital and expert
Breath.
Adaptation 113 is suitable to match triggering item corresponding with the search critical data using default mapping ruler, and this is touched
It is that the search critical data used in network is carried out extracting what is obtained to send out item.Above-mentioned default mapping ruler includes but does not limit to
In natural language processing analysis (the Natural Language that can indicate that triggering item and search critical data corresponding relation
Processing, NLP) regular and/or regular expression rule.Specifically, adaptation 113 is suitable to using default natural language
Treatment Analysis rule match triggering item corresponding with search critical data, and/or, using default regular expression rule match
Triggering item corresponding with search critical data.
Wherein, above-mentioned triggering item is crucial from search according to the usage frequency and/or attention rate grade of search critical data
What extracting data was obtained, the usage frequency and/or attention rate grade of above-mentioned search critical data is higher, the search critical data
In at least part of data be chosen for trigger item probability it is bigger.
Trigger 112 is suitable to using the triggering item querying triggering file that matches, knows search critical data is corresponding and search
The data source that hitch fruit is located, the triggering file is generated by triggering item and associated data source location information.Can be by
Triggering file is stored in trigger 112, it is also possible to which triggering file is stored in cache database, and trigger 112 is needing
The triggering file is extracted from cache database using triggering in file.
Triggering item and associated data source location information can only be included in triggering file, or, each triggering item is matched somebody with somebody
One or more type attributes are equipped with, triggering file is by several with corresponding under affiliated each type attribute by each triggering item
Associate according to source location information and generate, in this case, triggering file includes triggering item, triggers the type attribute of item and associated
Data source location information.The mapping ruler that then adaptation 113 is used can indicate that triggering item corresponding with search critical data
With the type attribute of the triggering item, adaptation 113 is matched corresponding with the search critical data tactile using default mapping ruler
Item and the type attribute of the triggering item are sent out, and trigger 112 is looked into using the type attribute of the triggering item and the triggering item that match
Triggering file is ask, one or more data sources that the corresponding Search Results of search critical data are located are known.
Data source location information is the information for being capable of unique identification's data source in systems, and such as data source location information is
The URL (Uniform Resource Locator, URL) of data source, and/or, data source location information is by touching
Send out MD5 value of the item under affiliated type attribute to generate, such as the type attribute to triggering item and triggering item carries out MD5 computings, will transport
Result is calculated as data source location information.
Getter 111 is suitable to obtain Search Results from the data source known, the Search Results are by advance to including triggering
The network information of item is collected and integrates and generate.For example, getter 111 from the cache database known, (deposit by data snapshot
Take device) middle acquisition Search Results.
Further, the type attribute of item is triggered by Intelligent Recognition in the present embodiment, to touching in result of page searching
The block of item different type attribute is sent out, the combination for the operation such as freely packing up, hide or showing can be passed through, realize Search Results
The page flexibly represents.
Under this scene, adaptation 113 is suitable to corresponding with the search critical data using the matching of default mapping ruler
The type attribute of triggering item and the triggering item;
Trigger 112 is suitable to from the data source known obtain the corresponding Search Results of triggering item for matching, and according to
The type attribute of the triggering item for matching arranges the display state of each data division in the Search Results for getting and displaying etc.
Level;Display state can include showing, hide or pack up, and show that grade can include that the first estate, the second grade etc. are multiple
Rank, the different displaying priority of different displaying grade correspondences, for example, and when the first estate is higher than the second grade, the first estate
Search Results displaying priority higher than the second grade Search Results displaying priority, by the Search Results of the first estate
It is arranged on top or other positions for being most easily concerned about of search result web page.
Communication interface 114 is suitable to that Search Results are back to into request according to the display state and displaying grade of Search Results
End.For example, communication interface according to display state and can show hierarchical arrangement Search Results in the search result web page for returning
Position, and the display box at Search Results place is set to show, hiding or pack up.
Wherein, in the present embodiment when there are no corresponding Search Results in data source, getter 111 is suitable to from data come
Crawl in real time in source server includes triggering the network information of item, using the network information as the corresponding search knot of corresponding triggering item
Fruit record indicates that the Search Results are back to request end by communication interface 114 in data source.
Further, the result page that search can be directed in the present embodiment provides believable checking, for example, can lead in advance
Cross to web site name, domain name, log-on message, address, legal person, website record information, ICP (Web content service), and manufacturer
The checking of the information such as the qualification in national authority certification authority, judges Search Results whether secure and trusted, and in Search Results net
Verification mark is provided in page, the whether safe and reliable markup information of Search Results is shown, so that user can be more believable
Search Results are selected in environment.Also, the present embodiment can with send set into the search result web page of request end
User feedback interface is put, as shown in " complaint " button in Fig. 3 and Fig. 4, is disappeared from the feedback of user by feedback interface reception
Breath, such as receive user report malice or unreal information message, the present embodiment can with reference to the feedback message of user to searching
The security of hitch fruit is judged and is marked.
From the above mentioned, search rate is high, the data that demand degree is high are used as triggering item by choosing in advance for the embodiment of the present invention,
And valuable information in network is collected and is integrated, obtain the detailed Search Results for including triggering item, then performing
During information search, Search Results corresponding with the triggering item that request end matches can be returned directly to request end.
Because the advance Search Results integrated can include and trigger the various detailed information that item is associated, request end passes through
The Search Results can get the information for needing search, so as to simplify search operation, shorten search time, improve and search
The accuracy of hitch fruit, also, the quantity of access request is sent due to significantly reducing request end, this programme is greatly reduced to be searched
Index holds up the pressure for capturing vertical data in a network, alleviates the burden of data providing.
Another embodiment of the invention provides a kind of searching method of the network information, referring to Fig. 5, comprises the steps:
Step S500:When receiving from the search critical data of request end, using default mapping ruler matching with
The corresponding triggering item of the search critical data, the triggering item is to carry out extraction to the search critical data used in network to obtain
's;
Step S502:Using the triggering item querying triggering file for matching, the corresponding search knot of search critical data is known
The data source (such as data snapshot memory access) that fruit is located, the triggering file is by triggering item and associated data source location information
Generate, the data source location information for data source URL, and/or, the data source location information is by triggering item in affiliated type
MD5 values under attribute are generated.
Step S504:Search Results are obtained from the data source known, the Search Results request end is back to into, the search
As a result it is collected by the network information in advance to including triggering item and integrates and generate.
Wherein, above-mentioned steps S500 include:Matched using default natural language processing analysis rule crucial with search
The corresponding triggering item of data;And/or, using the triggering corresponding with search critical data of default regular expression rule match
.And, triggering item is extracted from search critical data according to the usage frequency and/or attention rate grade of search critical data,
Wherein, the usage frequency and/or attention rate higher grade for searching for critical data, at least part of data in the search critical data
The probability for being chosen for triggering item is bigger.
Wherein, the generation of Search Results includes in above-mentioned steps S504:Captured in a network using web crawlers, received
Collection includes triggering the network information of item, the identical data in the network information collected of removal, and using normalization mode by phase
An item data is merged into many item datas of implication;And/or, the data-interface provided from partner obtains the net for including triggering item
Network information, removes the identical data in the network information that gets, and using normalization mode by many item datas of identical meanings
Merge into an item data.
Wherein, the generation of file is triggered in above-mentioned steps S502 to be included:One or more types are configured for each triggering item
Attribute;By each association of the triggering item under affiliated each type attribute with corresponding data source location information, triggering is generated
File.
Wherein, when the present embodiment is a triggering item configuration polytype attribute, above-mentioned steps S500 also include:Utilize
The type attribute of default mapping ruler matching triggering item corresponding with the search critical data and the triggering item;
Above-mentioned steps S502 also include:Using the type attribute querying triggering text of the triggering item and the triggering item for matching
Part, knows one or more data sources that the corresponding Search Results of search critical data are located.
Above-mentioned steps S504 also include:The corresponding Search Results of triggering item for matching are obtained from the data source known,
And arranged according to the type attribute of the triggering item for matching each data division in the Search Results that get display state and
Show grade, the display state of each data division in Search Results and Search Results and displaying grade are back to into request end.
Wherein, in step S504, by the display state of the corresponding data division of type attribute of the triggering item for matching
It is set to show, shows that grade is set to the first estate;By the corresponding data division of type attribute of the triggering item not matched
Display state be set to hide or pack up, show that grade is set to the second grade;Wherein, the first estate is higher than the second grade.
Wherein, above-mentioned steps S504 also include:When there are no corresponding Search Results at least one data source known
When, the crawl in real time from data source server includes triggering the network information of item, using the network information as corresponding triggering item
Corresponding Search Results are recorded in data source, and the Search Results are back to into request end.
Further, the present embodiment also provides a kind for the treatment of mechanism of parallel search, is performing above-mentioned steps S500 extremely
While S504 carries out data search, said method also includes:When receiving from the search critical data of request end, according to
The search critical data carries out in real time in a network the crawl of info web, obtains capturing result;Result will be captured as search
As a result side information, is back to request end after merging with Search Results.
The specific works mode of each step may refer to the device and system enforcement of the present invention in the inventive method embodiment
Example, will not be described here.
From the above mentioned, the embodiment of the present invention due to the advance Search Results integrated can include it is various with what triggering item was associated
Detailed information, request end can get the information for needing search by the Search Results, so as to simplify search operation, contracting
Short search time, the accuracy of Search Results is improve, also, the number of access request is sent due to significantly reducing request end
Amount, this programme greatly reduces the pressure that search engine captures in a network vertical data, alleviates the burden of data providing.
For the scene of Medical Data search, another embodiment of the invention additionally provides a kind of searcher of the network information
Method, including following process:
When receiving from the medicine search critical data of request end, using the matching of default mapping ruler and the medicine
The corresponding triggering item of search critical data, the triggering item is to carry out extraction to the medicine search critical data used in network to obtain
's;
Using the triggering item querying triggering file for matching, know that the corresponding Search Results of medicine search critical data are located
Medical Data source, the triggering file is generated by triggering item and associated Medical Data source location information;
Search Results are obtained from the Medical Data source known, the Search Results request end is back to into, the Search Results
It is collected by the network information in advance to including triggering item and is integrated and generates.The Search Results can pass through Search Results net
The mode of page is sent to request end, and the example of search result web page may refer to Fig. 3 and Fig. 4.
Above-mentioned medicine search critical data is the data for including that hospital, doctor, medicine, medicine equipment etc. are related to medicine,
Above-mentioned Medical Data source is the database of Medical Data of being stored with, and such as database can be by the data of the Medical Data that is stored with
Snapshot memory access.
For the scene of Medical Data search, another embodiment of the invention additionally provides a kind of search dress of network information
Put, including:
Communication interface is suitable to receive the medicine search critical data from request end, and, by the Search Results for getting
It is back to request end;Adaptation is suitable to using default mapping ruler matching triggering corresponding with the medicine search critical data
, the triggering item is that the medicine search critical data used in network is carried out extracting what is obtained;Trigger is suitable to using matching
The triggering item querying triggering file for going out, knows the Medical Data source that the corresponding Search Results of medicine search critical data are located, should
Triggering file is generated by triggering item and associated Medical Data source location information;Getter is suitable to from the medicine known
Data source obtains Search Results, and the Search Results are collected by the network information in advance to including triggering item and are integrated and give birth to
Into.
Above-mentioned medicine search critical data is the data for including that hospital, doctor, medicine, medicine equipment etc. are related to medicine,
Above-mentioned Medical Data source is the database of Medical Data of being stored with, and such as database can be by the data of the Medical Data that is stored with
Snapshot memory access.
Another embodiment of the invention provides a kind of searching method of the network information:Wherein:It is described to be reflected using default
Penetrating rule match triggering item corresponding with the search critical data includes:
Using default natural language processing analysis rule matching triggering item corresponding with search critical data,
And/or,
Using default regular expression rule match triggering item corresponding with search critical data.
Another embodiment of the invention provides a kind of searching method of the network information:Wherein, the triggering item is to net
Search critical data used in network carries out extracting including of obtaining:
Triggering item is extracted from search critical data according to the usage frequency and/or attention rate grade of search critical data,
Wherein, the usage frequency and/or attention rate higher grade for searching for critical data, at least part of data in the search critical data
The probability for being chosen for triggering item is bigger.
Another embodiment of the invention provides a kind of searching method of the network information, wherein, it is described from the number known
Search Results are obtained according to source, the Search Results are back to into request end includes:
When there are no corresponding Search Results at least one data source known, from data source server in real time
Crawl includes the network information of the triggering item, is counting the network information as the corresponding Search Results record of corresponding triggering item
According to source, and the Search Results are back to into request end.
Another embodiment of the invention provides a kind of searching method of the network information, wherein, methods described also includes:
When receiving from the search critical data of request end, carried out in real time in a network according to the search critical data
The crawl of info web, obtains capturing result;
Using the crawl result as the side information of the Search Results, it is back to after merging with the Search Results and asks
Ask end.
Another embodiment of the invention provides a kind of searcher of the network information, wherein,
The adaptation, is suitable to corresponding with search critical data using the matching of default natural language processing analysis rule
Triggering item, and/or, using default regular expression rule match triggering item corresponding with search critical data;
Wherein, the triggering item is crucial from search according to the usage frequency and/or attention rate grade of search critical data
What extracting data was obtained, the usage frequency and/or attention rate grade of the search critical data is higher, the search critical data
In at least part of data be chosen for trigger item probability it is bigger.
Another embodiment of the invention provides a kind of searcher of the network information, wherein, described each triggering item is matched somebody with somebody
Be equipped with one or more type attributes, the triggering file by by each triggering item under affiliated each type attribute with it is corresponding
Data source location information association and generate,
The adaptation, is suitable to using default mapping ruler matching triggering item corresponding with the search critical data and is somebody's turn to do
The type attribute of triggering item;
The trigger, is suitable to the type attribute querying triggering file using the triggering item and the triggering item for matching, and obtains
Know one or more data sources that the corresponding Search Results of the search critical data are located.
Another embodiment of the invention provides a kind of searcher of the network information, wherein, the data source location letter
The uniform resource position mark URL for data source is ceased, and/or, the data source location information is by triggering item in affiliated type attribute
Under MD5 values generate.
Another embodiment of the invention provides a kind of searcher of the network information, wherein, described each triggering item is matched somebody with somebody
Be equipped with one or more type attributes, the triggering file by by each triggering item under affiliated each type attribute with it is corresponding
Data source location information association and generate,
The adaptation, is suitable to using default mapping ruler matching triggering item corresponding with the search critical data and is somebody's turn to do
The type attribute of triggering item;
The trigger, is suitable to from the data source known obtain the corresponding Search Results of triggering item for matching, and root
Display state and displaying according to each data division in the Search Results that the type attribute setting of the triggering item for matching gets
Grade;
The communication interface, is suitable to that Search Results are back to into request according to the display state and displaying grade of Search Results
End.
Another embodiment of the invention provides a kind of searcher of the network information, wherein, the trigger is further adapted for
The display state of the corresponding data division of type attribute of the triggering item for matching is set to show, shows that grade is set to the
One grade;By do not match triggering item the corresponding data division of type attribute display state be set to hide or pack up,
Show that grade is set to the second grade;Wherein, described the first estate is higher than second grade.
Another embodiment of the invention provides a kind of searcher of the network information, wherein,
The getter, is suitable to when there are no corresponding Search Results at least one data source known, from data
Crawl in real time in origin server includes the network information of the triggering item, and the network information is corresponding as corresponding triggering item
Search Results are recorded in data source, and indicate that the Search Results are back to request end by the communication interface.
Another embodiment of the invention provides a kind of search system of the network information, including:
Including the searcher and cache database of the above-mentioned network information,
The cache database, is suitable to store by advance to being collected and integrating including the network information for triggering item
And the Search Results for generating;
The searcher of the network information, is suitable to obtain Search Results from the cache database.
Another embodiment of the invention provides a kind of search system of the network information, wherein, the system also includes grabbing
Take server,
The crawl server, is suitable to when receiving from the search critical data of request end, crucial according to the search
Data carry out in real time the crawl of info web in the data source server of storage corresponding web page information, obtain capturing result,
The crawl result is respectively sent to into the searcher and cache database of the network information;
The searcher of the network information, is suitable to the result that captures as the side information of the Search Results,
Request end is back to after merging with the Search Results;
The cache database, is suitable to merge to be stored in by the crawl result accordingly trigger the corresponding Search Results of item
In.
Another embodiment of the invention provides a kind of search system of the network information, wherein, the system also includes grabbing
Server is taken, wherein, the system also includes Data Collection integrated service device, is suitable to be grabbed in a network using web crawlers
Take, collection includes triggering the network information of item, removes the identical data in the network information collected, and using normalization mode
Many item datas of identical meanings are merged into into an item data;And/or, the data-interface provided from partner is obtained to be included triggering item
The network information, remove the identical data in the network information that gets, and using normalization mode by the multinomial of identical meanings
Data merge into an item data.
Another embodiment of the invention provides a kind of search system of the network information, wherein, the system also includes grabbing
Server is taken, wherein, the cache database is realized by data snapshot memory access.
However, this programme is not limited to apply the scene in Medical Data search, it is also possible to apply this programme and searching
Suo Jiaoyu information, digital data, automobile, consumer industry data or other any search fields, or weather, train
Ticket, plane ticket, stock, fund, shopping information, purchase by group, the search technique field such as film, music, novel, question and answer.
Provided herein algorithm and display be not inherently related to any certain computer, virtual system or miscellaneous equipment.
Various general-purpose systems can also be used together based on teaching in this.As described above, construct required by this kind of system
Structure be obvious.Additionally, the present invention is also not for any certain programmed language.It is understood that, it is possible to use it is various
Programming language realizes the content of invention described herein, and the description done to language-specific above is to disclose this
Bright preferred forms.
In specification mentioned herein, a large amount of details are illustrated.It is to be appreciated, however, that the enforcement of the present invention
Example can be put into practice in the case of without these details.In some instances, known method, structure is not been shown in detail
And technology, so as not to obscure the understanding of this description.
Similarly, it will be appreciated that in order to simplify the disclosure and help understand one or more in each inventive aspect, exist
Above in the description of the exemplary embodiment of the present invention, each feature of the present invention is grouped together into single enforcement sometimes
In example, figure or descriptions thereof.However, the method for the disclosure should be construed to reflect following intention:I.e. required guarantor
The more features of feature that the application claims ratio of shield is expressly recited in each claim.More precisely, such as following
Claims reflect as, inventive aspect is all features less than single embodiment disclosed above.Therefore,
Thus the claims for following specific embodiment are expressly incorporated in the specific embodiment, wherein each claim itself
All as the separate embodiments of the present invention.
Those skilled in the art are appreciated that can be carried out adaptively to the module in the equipment in embodiment
Change and they are arranged in one or more equipment different from the embodiment.Can be the module or list in embodiment
Unit or component are combined into a module or unit or component, and can be divided in addition multiple submodule or subelement or
Sub-component.In addition at least some in such feature and/or process or unit is excluded each other, can adopt any
Combine to all features disclosed in this specification (including adjoint claim, summary and accompanying drawing) and so disclosed
Where all processes or unit of method or equipment are combined.Unless expressly stated otherwise, this specification is (including adjoint power
Profit is required, summary and accompanying drawing) disclosed in each feature can it is identical by offers, be equal to or the alternative features of similar purpose carry out generation
Replace.
Although additionally, it will be appreciated by those of skill in the art that some embodiments described herein include other embodiments
In included some features rather than further feature, but the combination of the feature of different embodiments means in of the invention
Within the scope of and form different embodiments.For example, in the following claims, embodiment required for protection appoint
One of meaning can in any combination mode using.
The present invention all parts embodiment can be realized with hardware, or with one or more processor operation
Software module realize, or with combinations thereof realization.It will be understood by those of skill in the art that can use in practice
Microprocessor or digital signal processor (DSP) are come in the searcher for realizing the network information according to embodiments of the present invention
The some or all functions of some or all parts.The present invention is also implemented as performing method as described herein
Some or all equipment or program of device (for example, computer program and computer program).Such reality
The program of the existing present invention can be stored on a computer-readable medium, or can have the form of one or more signal.
Such signal can be downloaded from internet website and obtained, or be provided on carrier signal, or in any other form
There is provided.
It should be noted that above-described embodiment the present invention will be described rather than limits the invention, and ability
Field technique personnel can design without departing from the scope of the appended claims alternative embodiment.In the claims,
Any reference symbol between bracket should not be configured to limitations on claims.Word "comprising" is not excluded the presence of not
Element listed in the claims or step.Word "a" or "an" before element does not exclude the presence of multiple such
Element.The present invention can come real by means of the hardware for including some different elements and by means of properly programmed computer
It is existing.If in the unit claim for listing equipment for drying, several in these devices can be by same hardware branch
To embody.The use of word first, second, and third does not indicate that any order.These words can be explained and be run after fame
Claim.
Claims (23)
1. a kind of searching method of the network information, including:
When receiving from the search critical data of request end, using the matching of default mapping ruler and the search critical data
Corresponding triggering item, the triggering item is that the search critical data used in network is carried out extracting what is obtained;
Using the triggering item querying triggering file for matching, according to the data source location information associated in triggering file is known
The data source that the corresponding Search Results of search critical data are located, the triggering file is by triggering item and associated data source
What positional information was generated, then only include triggering item and associated data source location information in the triggering file, or, each
Triggering item is configured with one or more type attributes, and the triggering file triggers item in affiliated each type attribute by by each
Under with corresponding data source location information associate and generates, then it is described triggering file include trigger item, trigger item type belong to
Property and associated data source location information;Wherein, the data source location information be in the search system of the network information only
The information in one property mark data source;
Search Results are obtained from the data source known, the Search Results are back to into request end, the Search Results are by advance
The network information including the triggering item is collected and is integrated and generated, and corresponding data source is stored in after generation
In.
2. method according to claim 1, wherein, the Search Results are by advance to including the network letter of the triggering item
Breath be collected and integrate and generate including:
Captured in a network using web crawlers, collection includes triggering the network information of item, removed the network letter collected
Identical data in breath, and many item datas of identical meanings are merged into by an item data using normalization mode;And/or
The data-interface provided from partner obtains the network information for including triggering item, removes the phase in the network information for getting
Same data, and many item datas of identical meanings are merged into by an item data using normalization mode.
3. method according to claim 1, wherein,
It is described to be included using default mapping ruler matching triggering item corresponding with the search critical data:
Using default mapping ruler matching triggering item corresponding with the search critical data and the type attribute of the triggering item;
It is described to utilize the triggering item querying triggering file for matching, know that the corresponding Search Results of the search critical data are located
Data source include:
Using the type attribute querying triggering file of the triggering item and the triggering item for matching, the search critical data pair is known
One or more data sources that the Search Results answered are located.
4. method according to claim 1, wherein, the data source location information for data source URL
URL, and/or, the data source location information is generated by MD5 values of the item under affiliated type attribute is triggered.
5. method according to claim 1, wherein,
It is described to be included using default mapping ruler matching triggering item corresponding with the search critical data:
Using default mapping ruler matching triggering item corresponding with the search critical data and the type attribute of the triggering item;
Described to obtain Search Results from the data source known, the Search Results are back to into request end includes:
The corresponding Search Results of triggering item for matching are obtained from the data source known, and according to the class of the triggering item for matching
Type attribute arranges the display state of each data division in the Search Results that get and shows grade, by the Search Results with
And the display state of each data division and displaying grade are back to request end in Search Results.
6. method according to claim 5, wherein, the type attribute of the triggering item that the basis is matched is arranged and got
Search Results in each data division display state and show grade include:
The display state of the corresponding data division of type attribute of the triggering item for matching is set to show, shows that grade is arranged
For the first estate;
The display state of the corresponding data division of type attribute of the triggering item not matched is set to hide or pack up, show
Grade is set to the second grade;
Wherein, described the first estate is higher than second grade.
7. method according to claim 1, wherein, it is described using the matching of default mapping ruler and the search critical data
Corresponding triggering item includes:
Using default natural language processing analysis rule matching triggering item corresponding with search critical data,
And/or,
Using default regular expression rule match triggering item corresponding with search critical data.
8. method according to claim 1, wherein, the triggering item is that the search critical data used in network is carried out
What extraction was obtained includes:
Triggering item is extracted from search critical data according to the usage frequency and/or attention rate grade of search critical data, wherein,
The usage frequency and/or attention rate grade of search critical data is higher, and at least part of data in the search critical data are selected
The probability for being taken as triggering item is bigger.
9. method according to claim 1, wherein, it is described to obtain Search Results from the data source known, this is searched for
As a result being back to request end includes:
When there are no corresponding Search Results at least one data source known, capture in real time from data source server
Including the network information of the triggering item, record the network information as the corresponding Search Results of corresponding triggering item in data source
In, and the Search Results are back to into request end.
10. method according to claim 1, wherein, methods described also includes:
When receiving from the search critical data of request end, webpage is carried out in real time in a network according to the search critical data
The crawl of information, obtains capturing result;
Using the crawl result as the side information of the Search Results, request is back to after merging with the Search Results
End.
A kind of 11. searchers of the network information, including:
Communication interface, is suitable to receive the search critical data from request end, and, the Search Results for getting are back to please
Ask end;
Adaptation, is suitable to match triggering item corresponding with the search critical data, the triggering item using default mapping ruler
It is that the search critical data used in network is carried out extracting what is obtained;
Trigger, is suitable to using the triggering item querying triggering file for matching, according to the data source location associated in triggering file
Information knows the data source that the corresponding Search Results of the search critical data are located, and the triggering file is by triggering item and phase
What the data source location information of association was generated, then only include triggering item and associated data source location letter in the triggering file
Breath, or, each triggering item is configured with one or more type attributes, and the triggering file triggers item affiliated by by each
Associate with corresponding data source location information under each type attribute and generate, then the triggering file includes triggering item, touches
Send out the type attribute and associated data source location information of item;Wherein, the data source location information is in the network information
The information of unique identification's data source in search system;
Getter, is suitable to obtain Search Results from the data source known, the Search Results are by advance to including the triggering
The network information of item is collected and integrates and generate, and is stored in after generation in corresponding data source.
12. devices according to claim 11, wherein,
The adaptation, is suitable to using default natural language processing analysis rule matching triggering corresponding with search critical data
, and/or, using default regular expression rule match triggering item corresponding with search critical data;
Wherein, it is described triggering item be according to search critical data usage frequency and/or attention rate grade from search critical data
Middle to extract what is obtained, the usage frequency and/or attention rate grade of the search critical data is higher, in the search critical data
The probability that at least part of data are chosen for triggering item is bigger.
13. devices according to claim 11, wherein,
The adaptation, is suitable to using default mapping ruler matching triggering item corresponding with the search critical data and the triggering
The type attribute of item;
The trigger, is suitable to the type attribute querying triggering file using the triggering item and the triggering item for matching, and knows institute
State one or more data sources that the corresponding Search Results of search critical data are located.
14. devices according to claim 11, wherein, the data source location information is positioned for the unified resource of data source
Symbol URL, and/or, the data source location information is generated by MD5 values of the item under affiliated type attribute is triggered.
15. devices according to claim 11, wherein,
The adaptation, is suitable to using default mapping ruler matching triggering item corresponding with the search critical data and the triggering
The type attribute of item;
The trigger, is suitable to from the data source known obtain the corresponding Search Results of triggering item for matching, and according to
The type attribute of the triggering item allotted arranges the display state of each data division in the Search Results for getting and shows grade;
The communication interface, is suitable to that Search Results are back to into request end according to the display state and displaying grade of Search Results.
16. devices according to claim 15, wherein, the trigger is further adapted for the type for triggering item that will be matched
The display state of the corresponding data division of attribute is set to show, shows that grade is set to the first estate;By touching for not matching
The display state for sending out the corresponding data division of type attribute of item is set to hide or pack up, show that grade is set to second etc.
Level;Wherein, described the first estate is higher than second grade.
17. devices according to claim 11, wherein,
The getter, is suitable to when there are no corresponding Search Results at least one data source known, from data source
Crawl in real time in server includes the network information of the triggering item, using the network information as the corresponding search of corresponding triggering item
As a result record in data source, and indicate that the Search Results are back to request end by the communication interface.
A kind of 18. search systems of the network information, including:
The searcher and cache database of the network information as described in above-mentioned any one of claim 11 to 17,
The cache database, is suitable to storage and is collected by the network information in advance to including the triggering item and is integrated and give birth to
Into Search Results;
The searcher of the network information, is suitable to obtain Search Results from the cache database.
19. systems according to claim 18, wherein, the system also includes crawl server,
The crawl server, is suitable to when receiving from the search critical data of request end, according to the search critical data
Carry out the crawl of info web in real time in the data source server of storage corresponding web page information, obtain capturing result, by this
Crawl result is respectively sent to the searcher and cache database of the network information;
The searcher of the network information, is suitable to the result that captures as the side information of the Search Results, with institute
State after Search Results merge and be back to request end;
The cache database, is suitable to merge to be stored in by the crawl result accordingly trigger in the corresponding Search Results of item.
20. systems according to claim 18, wherein, the system also includes Data Collection integrated service device, is suitable to profit
Captured in a network with web crawlers, collection includes triggering the network information of item, in removing the network information collected
Identical data, and many item datas of identical meanings are merged into by an item data using normalization mode;And/or, carry from partner
For data-interface obtain the network information for including triggering item, the identical data in the network information that gets of removal, and adopt
Many item datas of identical meanings are merged into an item data by normalization mode.
21. systems according to claim 18, wherein, the cache database is realized by data snapshot memory access.
A kind of 22. searching methods of the network information, including:
When receiving from the medicine search critical data of request end, using the matching of default mapping ruler and the medicine search
The corresponding triggering item of critical data, the triggering item is to carry out extraction to the medicine search critical data used in network to obtain
's;
Using the triggering item querying triggering file for matching, known according to the Medical Data source location information associated in triggering file
The medicine searches for the Medical Data source that the corresponding Search Results of critical data are located, and the triggering file is by triggering item and phase
What the Medical Data source location information of association was generated, then only include triggering item and associated Medical Data in the triggering file
Source location information, or, each triggering item is configured with one or more type attributes, and the triggering file triggers item by by each
Associate with corresponding Medical Data source location information under affiliated each type attribute and generate, then wrap in the triggering file
Include triggering item, the type attribute of triggering item and associated Medical Data source location information;Wherein, the Medical Data source position
Information is the information in unique identification's Medical Data source in the search system of the network information;
Obtain Search Results from the Medical Data source known, the Search Results be back to into request end, the Search Results by
The network information including the triggering item is collected in advance and is integrated and is generated, and corresponding doctor is stored in after generation
In medicine data source.
A kind of 23. searchers of the network information, including:
Communication interface, is suitable to receive the medicine search critical data from request end, and, the Search Results for getting are returned
To request end;
Adaptation, is suitable to using default mapping ruler matching triggering item corresponding with the medicine search critical data, described to touch
It is that the medicine search critical data used in network is carried out extracting what is obtained to send out item;
Trigger, is suitable to using the triggering item querying triggering file for matching, according to the Medical Data source associated in triggering file
Positional information knows the Medical Data source that the corresponding Search Results of the medicine search critical data are located, and the triggering file is
Generated by triggering item and associated Medical Data source location information, then only include triggering item and correlation in the triggering file
The Medical Data source location information of connection, or, each triggering item is configured with one or more type attributes, the triggering file by
Each triggering item is associated under affiliated each type attribute with corresponding Medical Data source location information and is generated, then it is described
Triggering file includes triggering item, the type attribute of triggering item and associated Medical Data source location information;Wherein, the doctor
Medicine data source location information is the information in unique identification's Medical Data source in the search system of the network information;
Getter, is suitable to obtain Search Results from the Medical Data source known, the Search Results are by advance to including described
The network information of triggering item is collected and integrates and generate, and is stored in after generation in corresponding Medical Data source.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310169964.4A CN103246726B (en) | 2013-05-09 | 2013-05-09 | Method, device and system for searching network information |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310169964.4A CN103246726B (en) | 2013-05-09 | 2013-05-09 | Method, device and system for searching network information |
Publications (2)
Publication Number | Publication Date |
---|---|
CN103246726A CN103246726A (en) | 2013-08-14 |
CN103246726B true CN103246726B (en) | 2017-04-12 |
Family
ID=48926246
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201310169964.4A Active CN103246726B (en) | 2013-05-09 | 2013-05-09 | Method, device and system for searching network information |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN103246726B (en) |
Families Citing this family (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106708856A (en) * | 2015-11-13 | 2017-05-24 | 百度在线网络技术(北京)有限公司 | Information retrieval method and apparatus |
US10362060B2 (en) * | 2015-12-30 | 2019-07-23 | International Business Machines Corporation | Curtailing search engines from obtaining and controlling information |
CN105930528B (en) * | 2016-06-03 | 2020-09-08 | 腾讯科技(深圳)有限公司 | Webpage caching method and server |
CN106202260B (en) * | 2016-06-29 | 2021-07-27 | 百度在线网络技术(北京)有限公司 | Search method and device and search engine |
CN108519984B (en) * | 2018-02-07 | 2022-11-04 | 平安科技(深圳)有限公司 | Weather data processing method, server and computer readable storage medium |
CN110765275B (en) * | 2019-10-14 | 2023-02-07 | 深圳平安医疗健康科技服务有限公司 | Search method, search device, computer equipment and storage medium |
CN112214505A (en) * | 2020-10-21 | 2021-01-12 | 北京金堤征信服务有限公司 | Data synchronization method and device, computer readable storage medium and electronic equipment |
CN112807697A (en) * | 2021-01-28 | 2021-05-18 | 北京达佳互联信息技术有限公司 | List generation method and device, electronic equipment and storage medium |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1940922A (en) * | 2005-09-30 | 2007-04-04 | 腾讯科技(深圳)有限公司 | Method and system for improving information search speed |
CN102663088A (en) * | 2012-03-31 | 2012-09-12 | 百度在线网络技术(北京)有限公司 | Method and equipment for providing search results |
CN102930054A (en) * | 2012-11-19 | 2013-02-13 | 北京奇虎科技有限公司 | Data search method and data search system |
CN103034663A (en) * | 2011-09-29 | 2013-04-10 | 阿里巴巴集团控股有限公司 | Information searching method and equipment |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1609848A (en) * | 2003-10-23 | 2005-04-27 | 肖宁 | Predefined keywords electronic file searching method |
US20080319960A1 (en) * | 2007-06-25 | 2008-12-25 | Yuan-Jung Chang | Information searching method, information searching system and inputting device thereof |
CN102831253B (en) * | 2012-09-25 | 2015-01-21 | 北京科东电力控制系统有限责任公司 | Distributed full-text retrieval system |
-
2013
- 2013-05-09 CN CN201310169964.4A patent/CN103246726B/en active Active
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1940922A (en) * | 2005-09-30 | 2007-04-04 | 腾讯科技(深圳)有限公司 | Method and system for improving information search speed |
CN103034663A (en) * | 2011-09-29 | 2013-04-10 | 阿里巴巴集团控股有限公司 | Information searching method and equipment |
CN102663088A (en) * | 2012-03-31 | 2012-09-12 | 百度在线网络技术(北京)有限公司 | Method and equipment for providing search results |
CN102930054A (en) * | 2012-11-19 | 2013-02-13 | 北京奇虎科技有限公司 | Data search method and data search system |
Also Published As
Publication number | Publication date |
---|---|
CN103246726A (en) | 2013-08-14 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN103246726B (en) | Method, device and system for searching network information | |
Raghavan | Digital forensic research: current state of the art | |
CN103890709B (en) | Key value database based on caching maps and replicates | |
Lehmberg et al. | The mannheim search join engine | |
KR101775883B1 (en) | Method and system for processing information of a stream of information | |
CN102831214B (en) | time series search engine | |
CN104050223B (en) | Pivot face for text mining and search | |
US20140344274A1 (en) | Information structuring system | |
CN107145496A (en) | The method for being matched image with content item based on keyword | |
KR20120129982A (en) | Marker search system for augmented reality service | |
US9344507B2 (en) | Method of processing web access information and server implementing same | |
CN103617213B (en) | Method and system for identifying newspage attributive characters | |
CN102663060B (en) | Method and device for identifying tampered webpage | |
CN110352427A (en) | System and method for collecting data associated with the fraudulent content in networked environment | |
Vijiyarani et al. | Research issues in web mining | |
CN104067273A (en) | Grouping search results into a profile page | |
CN112765366A (en) | APT (android Package) organization portrait construction method based on knowledge map | |
CN105095175A (en) | Method and device for obtaining truncated web title | |
US8700624B1 (en) | Collaborative search apps platform for web search | |
Arshad et al. | A multilayered semantic framework for integrated forensic acquisition on social media | |
CN104317867A (en) | System for carrying out entity clustering on web pictures returned by search engine | |
Bissyandé et al. | Orion: A software project search engine with integrated diverse software artifacts | |
KR20160009850A (en) | Method of Disease Information Analysis System | |
Alonso et al. | Clustering of search results using temporal attributes | |
CN106980658A (en) | Video labeling method and device |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
TA01 | Transfer of patent application right |
Effective date of registration: 20170309 Address after: Room 2309, building 20, building 12, No. 93 Jianguo Road, Beijing, Chaoyang District, China Applicant after: Beijing Fu Tong Tong Technology Co., Ltd. Address before: 100088 Beijing city Xicheng District xinjiekouwai Street 28, block D room 112 (Desheng Park) Applicant before: Beijing Qihu Technology Co., Ltd. Applicant before: Qizhi Software (Beijing) Co., Ltd. |
|
TA01 | Transfer of patent application right | ||
GR01 | Patent grant | ||
GR01 | Patent grant |