CN103246726B - Method, device and system for searching network information - Google Patents

Method, device and system for searching network information Download PDF

Info

Publication number
CN103246726B
CN103246726B CN201310169964.4A CN201310169964A CN103246726B CN 103246726 B CN103246726 B CN 103246726B CN 201310169964 A CN201310169964 A CN 201310169964A CN 103246726 B CN103246726 B CN 103246726B
Authority
CN
China
Prior art keywords
triggering
item
data
search
data source
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201310169964.4A
Other languages
Chinese (zh)
Other versions
CN103246726A (en
Inventor
李天华
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Fu Tong Tong Technology Co., Ltd.
Original Assignee
Beijing Fu Tong Tong Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Fu Tong Tong Technology Co Ltd filed Critical Beijing Fu Tong Tong Technology Co Ltd
Priority to CN201310169964.4A priority Critical patent/CN103246726B/en
Publication of CN103246726A publication Critical patent/CN103246726A/en
Application granted granted Critical
Publication of CN103246726B publication Critical patent/CN103246726B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Abstract

The invention discloses a method, a device and a system for searching network information. The method includes: using a preset mapping rule to match a trigger item corresponding to key search data when key search data from a request end is received; using the matched trigger item to search a trigger file and acquire a data source where a search result corresponding to key search data is; and acquiring the search result from the acquired data source, and returning the search result to the request end; wherein the trigger item is extracted from key search data used in a network, the trigger file is generated by the trigger item and related data source position information, and the search result is generated by collecting and integrating network information containing the trigger item in advance.

Description

A kind of searching method of the network information, device and system
Technical field
The present invention relates to Internet technical field, more particularly to a kind of searching method of the network information, device and system.
Background technology
With the popularization of Internet technology, internet has been one of main source that current user obtains information.Internet In be stored with the network data of magnanimity, user can pass through search engine and the required network information is obtained from internet.
Prior art provide information search scheme in, user can search engine provide entrance in input inquiry Word, search engine captures in a network information according to the query word, and Search Results are back to into user by webpage.
However, at least there is following defect in the information search scheme that prior art is provided:
Existing scheme depends on search engine real-time crawl in a network when Search Results are obtained, but search is drawn The ability for holding up this real-time grasping manipulation is extremely limited, and the information content for grabbing in real time every time is less, content is also incomplete, user Needs click on the peer link in the webpage for returning, and search operation is performed repeatedly, and longitudinal accession page layer by layer searches searching for needs Hitch fruit.
For example, if user accesses a video, search results pages only occur associated video, lack the details letter of correlation Breath, user is if necessary to inquire about, in addition it is also necessary to further to access other webpages or carry out further other operations etc., from And cause that search time is long, searching results accuracy is poor, and due to needing to process a large amount of access requests, cause to search plain engine Data grabber pressure is also larger, data providing data processing load is heavier.
The content of the invention
In view of the above problems, it is proposed that the present invention so as to provide one kind overcome the problems referred to above or at least in part solve on State searching method, device and the system of the network information of problem.
According to one aspect of the present invention, a kind of searching method of the network information is embodiments provided, including:
It is crucial using the matching of default mapping ruler and the search when receiving from the search critical data of request end The corresponding triggering item of data, the triggering item is that the search critical data used in network is carried out extracting what is obtained;
Using the triggering item querying triggering file for matching, the number that the corresponding Search Results of search critical data are located is known According to source, the triggering file is generated by triggering item and associated data source location information;
Search Results are obtained from the data source known, the Search Results are back to into request end, the Search Results are by pre- First the network information including triggering item is collected and is integrated and is generated.
Wherein, it is above-mentioned to be included using default mapping ruler matching triggering item corresponding with the search critical data:Utilize Default natural language processing analysis rule matches triggering item corresponding with search critical data;And/or, using default canonical Expression formula rule match triggering item corresponding with search critical data.
Wherein, above-mentioned triggering item is to carry out extracting including of obtaining to the search critical data used in network:According to searching The usage frequency and/or attention rate grade of rope critical data extracts triggering item from search critical data, wherein, the crucial number of search According to usage frequency and/or attention rate higher grade, at least part of data in the search critical data be chosen for trigger item Probability it is bigger.
Wherein, mentioned above searching results are collected by the network information in advance to including triggering item and are integrated and generate bag Include:
Captured in a network using web crawlers, collection includes triggering the network information of item, removes the net collected Identical data in network information, and many item datas of identical meanings are merged into by an item data using normalization mode;And/or, The data-interface provided from partner obtains the network information for including triggering item, removes the identical number in the network information for getting According to, and many item datas of identical meanings are merged into by an item data using normalization mode.
Wherein, above-mentioned triggering file is by including that triggering item and associated data source location information are generated:For each Triggering item configures one or more type attributes;By each triggering item under affiliated each type attribute with corresponding data source The association of positional information, generates triggering file.
Wherein, it is above-mentioned to be included using default mapping ruler matching triggering item corresponding with the search critical data:Utilize The type attribute of default mapping ruler matching triggering item corresponding with the search critical data and the triggering item;
It is above-mentioned to utilize the triggering item querying triggering file for matching, know that the corresponding Search Results of search critical data are located Data source include:Using the type attribute querying triggering file of the triggering item and the triggering item for matching, know that search is crucial One or more data sources that the corresponding Search Results of data are located.
Wherein, data source location information for data source uniform resource position mark URL, and/or, data source location information by MD5 value of the triggering item under affiliated type attribute is generated.
Wherein, it is above-mentioned to be included using default mapping ruler matching triggering item corresponding with the search critical data:Utilize The type attribute of default mapping ruler matching triggering item corresponding with the search critical data and the triggering item;
Above-mentioned to obtain Search Results from the data source known, the Search Results are back to into request end includes:
The corresponding Search Results of triggering item for matching are obtained from the data source known, and according to the triggering item for matching Type attribute the display state of each data division in the Search Results that get is set and shows grade, by Search Results with And the display state of each data division and displaying grade are back to request end in Search Results.
Wherein, the type attribute of the triggering item that above-mentioned basis is matched arranges each data portion in the Search Results for getting The display state and displaying grade for dividing includes:
The display state of the corresponding data division of type attribute of the triggering item for matching is set to show, shows grade It is set to the first estate;The display state of the corresponding data division of type attribute of the triggering item not matched is set to hide Or pack up, show that grade is set to the second grade;Wherein, the first estate is higher than the second grade.
Wherein, above-mentioned to obtain Search Results from the data source known, the Search Results are back to into request end includes:When When there are no corresponding Search Results at least one data source known, the crawl in real time from data source server includes touching The network information of item is sent out, is recorded the network information as the corresponding Search Results of corresponding triggering item in data source, and should Search Results are back to request end.
Wherein, said method also includes:It is crucial according to the search when receiving from the search critical data of request end Data carry out in real time in a network the crawl of info web, obtain capturing result;Supplement of the result as Search Results will be captured Information, is back to request end after merging with Search Results.
According to a further aspect in the invention, a kind of searcher of the network information is embodiments provided, including:
Communication interface, is suitable to receive the search critical data from request end, and, the Search Results for getting are returned To request end;
Adaptation, is suitable to match triggering item corresponding with the search critical data, the triggering using default mapping ruler Item is that the search critical data used in network is carried out extracting what is obtained;
Trigger, is suitable to using the triggering item querying triggering file for matching, and knows the corresponding search of search critical data As a result the data source being located, the triggering file is generated by triggering item and associated data source location information;
Getter, is suitable to obtain Search Results from the data source known, the Search Results are by advance to including triggering item The network information be collected and integrate and generate.
Wherein, adaptation, is suitable to corresponding with search critical data using the matching of default natural language processing analysis rule Triggering item, and/or, using default regular expression rule match and the search corresponding triggering item of critical data;
Wherein, above-mentioned triggering item is crucial from search according to the usage frequency and/or attention rate grade of search critical data What extracting data was obtained, the usage frequency and/or attention rate grade of above-mentioned search critical data is higher, the search critical data In at least part of data be chosen for trigger item probability it is bigger.
Wherein, each triggering item is configured with one or more type attributes, and triggering file triggers item affiliated by by each Each type attribute under with corresponding data source location information associate and generate,
Adaptation, is suitable to using default mapping ruler matching triggering item corresponding with the search critical data and the triggering The type attribute of item;
Trigger, is suitable to the type attribute querying triggering file using the triggering item and the triggering item for matching, and knows and searches One or more data sources that the corresponding Search Results of rope critical data are located.
Wherein, data source location information for data source uniform resource position mark URL, and/or, data source location information by MD5 value of the triggering item under affiliated type attribute is generated.
Wherein, each triggering item is configured with one or more type attributes, and triggering file triggers item affiliated by by each Each type attribute under with corresponding data source location information associate and generate,
Adaptation, is suitable to using default mapping ruler matching triggering item corresponding with the search critical data and the triggering The type attribute of item;
Trigger, is suitable to from the data source known obtain the corresponding Search Results of triggering item for matching, and according to The type attribute of the triggering item allotted arranges the display state of each data division in the Search Results for getting and shows grade;
Communication interface, is suitable to that Search Results are back to into request end according to the display state and displaying grade of Search Results.
Wherein, trigger, is further adapted for the display state of the corresponding data division of type attribute of the triggering item that will be matched It is set to show, shows that grade is set to the first estate;By the corresponding data division of type attribute of the triggering item not matched Display state be set to hide or pack up, show that grade is set to the second grade;Wherein, the first estate is higher than the second grade.
Wherein, getter, is suitable to when there are no corresponding Search Results at least one data source known, from data Crawl in real time in origin server includes triggering the network information of item, using the network information as the corresponding search of corresponding triggering item As a result record in data source, and indicate that the Search Results are back to request end by communication interface.
According to another aspect of the invention, a kind of search system of the network information is embodiments provided, including:Such as The searcher and cache database of the above-mentioned network information.
Cache database, is suitable to storage and is collected by the network information in advance to including triggering item and is integrated and generate Search Results.
The searcher of the network information, is suitable to obtain Search Results from cache database.
Wherein, said system also includes crawl server, is suitable to when receiving from the search critical data of request end, The crawl of info web is carried out in real time in the data source server of storage corresponding web page information according to the search critical data, Obtain capturing result, the crawl result is respectively sent to into the searcher and cache database of the network information;
The searcher of the network information, is suitable to crawl result as the side information of Search Results, closes with Search Results And after be back to request end;
Cache database, is suitable to merge to be stored in by crawl result accordingly trigger in the corresponding Search Results of item.
Wherein, said system also includes Data Collection integrated service device, is suitable to be grabbed in a network using web crawlers Take, collection includes triggering the network information of item, removes the identical data in the network information collected, and using normalization mode Many item datas of identical meanings are merged into into an item data;And/or, the data-interface provided from partner is obtained to be included triggering item The network information, remove the identical data in the network information that gets, and using normalization mode by the multinomial of identical meanings Data merge into an item data.
Wherein, above-mentioned cache database is realized by data snapshot memory access.
According to another aspect of the invention, a kind of searching method of the network information is embodiments provided, including:
When receiving from the medicine search critical data of request end, using the matching of default mapping ruler and the medicine The corresponding triggering item of search critical data, the triggering item is to carry out extraction to the medicine search critical data used in network to obtain 's;
Using the triggering item querying triggering file for matching, know that the corresponding Search Results of medicine search critical data are located Medical Data source, the triggering file is generated by triggering item and associated Medical Data source location information;
Search Results are obtained from the Medical Data source known, the Search Results request end is back to into, the Search Results It is collected by the network information in advance to including triggering item and is integrated and generates.
According to another aspect of the invention, a kind of searcher of the network information is embodiments provided, including:
Communication interface, is suitable to receive the medicine search critical data from request end, and, by the Search Results for getting It is back to request end;
Adaptation, is suitable to using default mapping ruler matching triggering item corresponding with the medicine search critical data, should Triggering item is that the medicine search critical data used in network is carried out extracting what is obtained;
Trigger, is suitable to using the triggering item querying triggering file for matching, and knows that medicine search critical data is corresponding The Medical Data source that Search Results are located, the triggering file is generated by triggering item and associated Medical Data source location information 's;
Getter, is suitable to obtain Search Results from the Medical Data source known, the Search Results are by advance to including tactile The network information for sending out item is collected and integrates and generate.
From the above mentioned, search rate is high, the data that demand degree is high are used as triggering item by choosing in advance for the embodiment of the present invention, And valuable information in network is collected and is integrated, obtain the detailed Search Results for including triggering item, then performing During information search, Search Results corresponding with the triggering item that request end matches can be returned directly to request end.
Because the advance Search Results integrated can include and trigger the various detailed information that item is associated, request end passes through The Search Results can get the information for needing search, so as to simplify search operation, shorten search time, improve and search The accuracy of hitch fruit, also, the quantity of access request is sent due to significantly reducing request end, this programme is greatly reduced to be searched Index holds up the pressure for capturing vertical data in a network, alleviates the burden of data providing.
Described above is only the general introduction of technical solution of the present invention, in order to better understand the technological means of the present invention, And can be practiced according to the content of specification, and in order to allow the above and other objects of the present invention, feature and advantage can Become apparent, below especially exemplified by the specific embodiment of the present invention.
Description of the drawings
By the detailed description for reading hereafter preferred embodiment, various other advantages and benefit is common for this area Technical staff will be clear from understanding.Accompanying drawing is only used for illustrating the purpose of preferred embodiment, and is not considered as to the present invention Restriction.And in whole accompanying drawing, it is denoted by the same reference numerals identical part.In the accompanying drawings:
Fig. 1 shows the search system structural representation of the network information according to an embodiment of the invention;
Fig. 2 shows a kind of structural representation of the searcher of network information according to an embodiment of the invention;With And
Fig. 3 shows the result of page searching screenshotss signal returned to client according to an embodiment of the invention Figure;
Fig. 4 shows that another result of page searching screenshots returned to client according to an embodiment of the invention show It is intended to.
Specific embodiment
The exemplary embodiment of the disclosure is more fully described below with reference to accompanying drawings.Although showing the disclosure in accompanying drawing Exemplary embodiment, it being understood, however, that may be realized in various forms the disclosure and should not be by embodiments set forth here Limited.On the contrary, there is provided these embodiments are able to be best understood from the disclosure, and can be by the scope of the present disclosure Complete conveys to those skilled in the art.
Existing Search Results represent in the page, only can show the link comprising query word and some simple informations, user If clicking certain link, need to jump to third-party website acquisition or check information etc., this processing mode is brought Search time is long, searching results accuracy is poor and network lateral pressure is larger problem etc..For these problems, this programme Data are collected and are integrated in advance, from third party's (data providing) valuable information is got in advance, directly thrown in To on the result of page searching of the system server, request end is back to.This programme can apply such as search for Medical Data, In polytype data search scenes such as education information, digital data, automobile or consumer industry data, below by each Embodiment is described in detail to this programme.
A kind of search system 100 of network information that one embodiment of the invention is provided, referring to Fig. 1, including the network information Searcher 110, cache database 120, Data Collection integrated service device 130, crawl server 140.Illustrate in Fig. 1 Client and data origin server.Illustrate separately below.
Cache database 120 is suitable to store and is collected by the network information in advance to including triggering item and is integrated and generate Search Results.
Exemplary, cache database 120 can be realized by data snapshot memory access.I.e. the present embodiment adopts data snapshot Mechanism come store uncorrected data or HTML (HyperText Markup Language, the HTML) data of webpage with And XML (ExtensibleMarkupLanguage, extensible markup language) data structure information etc., carried out using data snapshot The mode of storage has the advantages that access speed is fast, is easy to show.
The searcher 110 of the network information is suitable to obtain Search Results from cache database 120.The search dress of the network information Putting 110 includes communication interface 114, adaptation 113, trigger 112 and getter 111.The tool of the searcher 110 of the network information Body structure and the method for operation are illustrated in other embodiments of the invention.
Wherein, said system 100 also includes Data Collection integrated service device 130.The Data Collection integrated service device 130 can To get Search Results by following at least one modes:
Mode one, Data Collection integrated service device 130 are suitable to be captured in a network using web crawlers, and collection includes The network information of triggering item, removes the identical data in the network information collected, and using normalization mode by identical meanings Many item datas merge into an item data.Under this mode, Data Collection integrated service device 130 will can be collected from not Unified form is converted to the network information of website or webpage, is easy to storage and follow-up process.
Specifically, web crawlers is in triggering item list according to storing and triggering item (such as trigger word) corresponding URL, to number According to web data corresponding with URL is captured in origin server, web data can be analyzed and taken pictures after crawl, being formed should The corresponding data snapshot of webpage.The corresponding trigger words of the URL are included in the data snapshot, using the data snapshot as the trigger word Corresponding Search Results, associated storage is in cache database 120 together with the search terms.
The searcher 110 of Data Collection integrated service device 130 or the network information can be according to knowing in the present embodiment Each triggering item, the corresponding type attribute of triggering item and data source (such as data snapshot memory access) positional information are associated, raw Into triggering file, and the triggering file is stored in the searcher 110 of the network information, so that the searcher of the network information 110 obtain data message automatically according to this triggering file in cache database 120.Key-value pair can be adopted under a kind of mode (key-value) form, using data source location information as key, using key value is navigated to, from the corresponding numbers of value The corresponding solid data of Search Results is obtained out according to source.
Mode two, Data Collection integrated service device 130 obtain the net for including triggering item from the data-interface that partner provides Network information, removes the identical data in the network information that gets, and using normalization mode by many item datas of identical meanings Merge into an item data.Under this mode, Data Collection integrated service device 130 can obtain the XML data knot of partner's offer Structure information, merges, duplicate removal and normalized etc. according to the data structure information to the network information for getting.
When using normalization mode, for example, for the data with multiple titles, a such as item data has formal name Title, the pet name, English name and other multiple common names, this multiple title substantially have identical implication, then will be many by this The data comprising triggering item that individual title is collected respectively merge into an item data, using the data after merging as the triggering item Search Results.
Data Collection integrated service device 130 can in advance choose triggering item (such as trigger word), Data Collection integrated service device 130 In advance the search critical data used in network can be collected and stored to database, when selection operation is performed, from the number Search critical data is extracted according to storehouse.Triggering item is that the search used request end in network (such as the user of client-side) is closed Key data carries out extracting what is obtained, and a kind of extracting mode can be:According to the usage frequency and/or attention rate of search critical data Grade extracts triggering item from search critical data, when the usage frequency and/or attention rate grade of search critical data are higher, is somebody's turn to do The probability that at least part of data in search critical data are chosen for triggering item is bigger.Above-mentioned usage frequency can be by net The access times of critical data are searched in network to carry out statistics and obtains, and above-mentioned attention rate grade can pass through request end feedback etc. Level evaluation information is obtained.Data Collection integrated service device 130 can (and the triggering item be corresponding by the triggering item for selecting URL) store into triggering item list.
Different Search Results, or a triggering item tool occurs in different scenes in view of identical triggering item There are multiple implications, the triggering item correspondence volume Search Results are also different under different implications, in order to improve the accuracy of Search Results, Data Collection integrated service device 130 can arrange one or more type attributes for triggering item, collect and integrate triggering item respectively and exist Search Results under each type attribute, so that Search Results have higher precision, disclosure satisfy that special scenes or spy Determine the search need under implication.
Wherein, said system 100 also includes that crawl server 140 is suitable to receiving the search key number from request end According to when, according to the search critical data storage corresponding web page information data source server in carry out info web in real time Crawl, obtains capturing result, and the crawl result is respectively sent to into the searcher and cache database of the network information.
The present embodiment can adopt the mechanism of the parallel search of searcher 110 of crawl server 140 and the network information.Often After the searcher 110 of the network information gets search critical data, while the search critical data is distributed to into crawl clothes Business device 140, by the crawl server 140 directly access outside data source server, obtain crawl result.Meanwhile, network The searcher 110 of information obtains Search Results from cache database 120.The searcher 110 pairs of the network information is from caching The Search Results obtained in database 120 and the crawl result obtained in crawl server 140 are merged.That is the network information Searcher 110 be suitable to will crawl result as the side information of Search Results, request is back to after merging with Search Results End.
Choose whether as needed using the crawl result of crawl server 140 as pre- in cache database 120 The supplement of the Search Results first integrated, when needed, crawl server 140 sends the crawl result for grabbing to data cached Storehouse 120, cache database 120 will capture result merging and be stored in the corresponding Search Results of corresponding triggering item.
From the above mentioned, the embodiment of the present invention is by integrating in advance third-party information, and carries out classification analysis to triggering item, By the information of triggering item corresponding subdivision under respective type attribute, there is provided to request end such that it is able to improve data search Accuracy, shortens search time, and can reduce the pressure of partner's data, services, alleviates the vertical data of web crawlers Crawl pressure, allow request end (such as user) can directly return result of page searching in get oneself required for letter Breath, realizes rapider, accurate, polynary data search, meets user's request.
Another embodiment of the invention provides a kind of searcher of the network information, referring to Fig. 2, including communication interface 114th, adaptation 113, trigger 112 and getter 111.
Communication interface 114 is suitable to receive the search critical data from request end, and, the Search Results for getting are returned It is back to request end.Search Results are returned to request end by communication interface 114 in the form of a web page.Referring to Fig. 3, it is shown that triggering item is When " coronary heart disease ", the result of page searching screenshotss schematic diagram that communication interface is returned to client.The search knot shown in the webpage Fruit is provided with three display boxes, includes in a display box " general introduction ", " cause of disease ", " symptom ", " diet ", " in advance of coronary heart disease It is anti-", " treatment ", " inspections ", " diagnosing examination " and " complication " much information, wherein " general introduction " is partly set to dispaly state, Other items are collapsed state;Another display box is the information related to " coronary heart disease _ look for hospital ";It is in another display box The information related to " coronary heart disease _ look for expert ".
Referring to Fig. 4, the screenshotss exemplary plot of another search result web page provided for the present embodiment, it is provided with the webpage Two display boxes, show in more detail the letter being associated with " Chinese People's Liberation Army General Hospital " and section office in hospital and expert Breath.
Adaptation 113 is suitable to match triggering item corresponding with the search critical data using default mapping ruler, and this is touched It is that the search critical data used in network is carried out extracting what is obtained to send out item.Above-mentioned default mapping ruler includes but does not limit to In natural language processing analysis (the Natural Language that can indicate that triggering item and search critical data corresponding relation Processing, NLP) regular and/or regular expression rule.Specifically, adaptation 113 is suitable to using default natural language Treatment Analysis rule match triggering item corresponding with search critical data, and/or, using default regular expression rule match Triggering item corresponding with search critical data.
Wherein, above-mentioned triggering item is crucial from search according to the usage frequency and/or attention rate grade of search critical data What extracting data was obtained, the usage frequency and/or attention rate grade of above-mentioned search critical data is higher, the search critical data In at least part of data be chosen for trigger item probability it is bigger.
Trigger 112 is suitable to using the triggering item querying triggering file that matches, knows search critical data is corresponding and search The data source that hitch fruit is located, the triggering file is generated by triggering item and associated data source location information.Can be by Triggering file is stored in trigger 112, it is also possible to which triggering file is stored in cache database, and trigger 112 is needing The triggering file is extracted from cache database using triggering in file.
Triggering item and associated data source location information can only be included in triggering file, or, each triggering item is matched somebody with somebody One or more type attributes are equipped with, triggering file is by several with corresponding under affiliated each type attribute by each triggering item Associate according to source location information and generate, in this case, triggering file includes triggering item, triggers the type attribute of item and associated Data source location information.The mapping ruler that then adaptation 113 is used can indicate that triggering item corresponding with search critical data With the type attribute of the triggering item, adaptation 113 is matched corresponding with the search critical data tactile using default mapping ruler Item and the type attribute of the triggering item are sent out, and trigger 112 is looked into using the type attribute of the triggering item and the triggering item that match Triggering file is ask, one or more data sources that the corresponding Search Results of search critical data are located are known.
Data source location information is the information for being capable of unique identification's data source in systems, and such as data source location information is The URL (Uniform Resource Locator, URL) of data source, and/or, data source location information is by touching Send out MD5 value of the item under affiliated type attribute to generate, such as the type attribute to triggering item and triggering item carries out MD5 computings, will transport Result is calculated as data source location information.
Getter 111 is suitable to obtain Search Results from the data source known, the Search Results are by advance to including triggering The network information of item is collected and integrates and generate.For example, getter 111 from the cache database known, (deposit by data snapshot Take device) middle acquisition Search Results.
Further, the type attribute of item is triggered by Intelligent Recognition in the present embodiment, to touching in result of page searching The block of item different type attribute is sent out, the combination for the operation such as freely packing up, hide or showing can be passed through, realize Search Results The page flexibly represents.
Under this scene, adaptation 113 is suitable to corresponding with the search critical data using the matching of default mapping ruler The type attribute of triggering item and the triggering item;
Trigger 112 is suitable to from the data source known obtain the corresponding Search Results of triggering item for matching, and according to The type attribute of the triggering item for matching arranges the display state of each data division in the Search Results for getting and displaying etc. Level;Display state can include showing, hide or pack up, and show that grade can include that the first estate, the second grade etc. are multiple Rank, the different displaying priority of different displaying grade correspondences, for example, and when the first estate is higher than the second grade, the first estate Search Results displaying priority higher than the second grade Search Results displaying priority, by the Search Results of the first estate It is arranged on top or other positions for being most easily concerned about of search result web page.
Communication interface 114 is suitable to that Search Results are back to into request according to the display state and displaying grade of Search Results End.For example, communication interface according to display state and can show hierarchical arrangement Search Results in the search result web page for returning Position, and the display box at Search Results place is set to show, hiding or pack up.
Wherein, in the present embodiment when there are no corresponding Search Results in data source, getter 111 is suitable to from data come Crawl in real time in source server includes triggering the network information of item, using the network information as the corresponding search knot of corresponding triggering item Fruit record indicates that the Search Results are back to request end by communication interface 114 in data source.
Further, the result page that search can be directed in the present embodiment provides believable checking, for example, can lead in advance Cross to web site name, domain name, log-on message, address, legal person, website record information, ICP (Web content service), and manufacturer The checking of the information such as the qualification in national authority certification authority, judges Search Results whether secure and trusted, and in Search Results net Verification mark is provided in page, the whether safe and reliable markup information of Search Results is shown, so that user can be more believable Search Results are selected in environment.Also, the present embodiment can with send set into the search result web page of request end User feedback interface is put, as shown in " complaint " button in Fig. 3 and Fig. 4, is disappeared from the feedback of user by feedback interface reception Breath, such as receive user report malice or unreal information message, the present embodiment can with reference to the feedback message of user to searching The security of hitch fruit is judged and is marked.
From the above mentioned, search rate is high, the data that demand degree is high are used as triggering item by choosing in advance for the embodiment of the present invention, And valuable information in network is collected and is integrated, obtain the detailed Search Results for including triggering item, then performing During information search, Search Results corresponding with the triggering item that request end matches can be returned directly to request end.
Because the advance Search Results integrated can include and trigger the various detailed information that item is associated, request end passes through The Search Results can get the information for needing search, so as to simplify search operation, shorten search time, improve and search The accuracy of hitch fruit, also, the quantity of access request is sent due to significantly reducing request end, this programme is greatly reduced to be searched Index holds up the pressure for capturing vertical data in a network, alleviates the burden of data providing.
Another embodiment of the invention provides a kind of searching method of the network information, referring to Fig. 5, comprises the steps:
Step S500:When receiving from the search critical data of request end, using default mapping ruler matching with The corresponding triggering item of the search critical data, the triggering item is to carry out extraction to the search critical data used in network to obtain 's;
Step S502:Using the triggering item querying triggering file for matching, the corresponding search knot of search critical data is known The data source (such as data snapshot memory access) that fruit is located, the triggering file is by triggering item and associated data source location information Generate, the data source location information for data source URL, and/or, the data source location information is by triggering item in affiliated type MD5 values under attribute are generated.
Step S504:Search Results are obtained from the data source known, the Search Results request end is back to into, the search As a result it is collected by the network information in advance to including triggering item and integrates and generate.
Wherein, above-mentioned steps S500 include:Matched using default natural language processing analysis rule crucial with search The corresponding triggering item of data;And/or, using the triggering corresponding with search critical data of default regular expression rule match .And, triggering item is extracted from search critical data according to the usage frequency and/or attention rate grade of search critical data, Wherein, the usage frequency and/or attention rate higher grade for searching for critical data, at least part of data in the search critical data The probability for being chosen for triggering item is bigger.
Wherein, the generation of Search Results includes in above-mentioned steps S504:Captured in a network using web crawlers, received Collection includes triggering the network information of item, the identical data in the network information collected of removal, and using normalization mode by phase An item data is merged into many item datas of implication;And/or, the data-interface provided from partner obtains the net for including triggering item Network information, removes the identical data in the network information that gets, and using normalization mode by many item datas of identical meanings Merge into an item data.
Wherein, the generation of file is triggered in above-mentioned steps S502 to be included:One or more types are configured for each triggering item Attribute;By each association of the triggering item under affiliated each type attribute with corresponding data source location information, triggering is generated File.
Wherein, when the present embodiment is a triggering item configuration polytype attribute, above-mentioned steps S500 also include:Utilize The type attribute of default mapping ruler matching triggering item corresponding with the search critical data and the triggering item;
Above-mentioned steps S502 also include:Using the type attribute querying triggering text of the triggering item and the triggering item for matching Part, knows one or more data sources that the corresponding Search Results of search critical data are located.
Above-mentioned steps S504 also include:The corresponding Search Results of triggering item for matching are obtained from the data source known, And arranged according to the type attribute of the triggering item for matching each data division in the Search Results that get display state and Show grade, the display state of each data division in Search Results and Search Results and displaying grade are back to into request end.
Wherein, in step S504, by the display state of the corresponding data division of type attribute of the triggering item for matching It is set to show, shows that grade is set to the first estate;By the corresponding data division of type attribute of the triggering item not matched Display state be set to hide or pack up, show that grade is set to the second grade;Wherein, the first estate is higher than the second grade.
Wherein, above-mentioned steps S504 also include:When there are no corresponding Search Results at least one data source known When, the crawl in real time from data source server includes triggering the network information of item, using the network information as corresponding triggering item Corresponding Search Results are recorded in data source, and the Search Results are back to into request end.
Further, the present embodiment also provides a kind for the treatment of mechanism of parallel search, is performing above-mentioned steps S500 extremely While S504 carries out data search, said method also includes:When receiving from the search critical data of request end, according to The search critical data carries out in real time in a network the crawl of info web, obtains capturing result;Result will be captured as search As a result side information, is back to request end after merging with Search Results.
The specific works mode of each step may refer to the device and system enforcement of the present invention in the inventive method embodiment Example, will not be described here.
From the above mentioned, the embodiment of the present invention due to the advance Search Results integrated can include it is various with what triggering item was associated Detailed information, request end can get the information for needing search by the Search Results, so as to simplify search operation, contracting Short search time, the accuracy of Search Results is improve, also, the number of access request is sent due to significantly reducing request end Amount, this programme greatly reduces the pressure that search engine captures in a network vertical data, alleviates the burden of data providing.
For the scene of Medical Data search, another embodiment of the invention additionally provides a kind of searcher of the network information Method, including following process:
When receiving from the medicine search critical data of request end, using the matching of default mapping ruler and the medicine The corresponding triggering item of search critical data, the triggering item is to carry out extraction to the medicine search critical data used in network to obtain 's;
Using the triggering item querying triggering file for matching, know that the corresponding Search Results of medicine search critical data are located Medical Data source, the triggering file is generated by triggering item and associated Medical Data source location information;
Search Results are obtained from the Medical Data source known, the Search Results request end is back to into, the Search Results It is collected by the network information in advance to including triggering item and is integrated and generates.The Search Results can pass through Search Results net The mode of page is sent to request end, and the example of search result web page may refer to Fig. 3 and Fig. 4.
Above-mentioned medicine search critical data is the data for including that hospital, doctor, medicine, medicine equipment etc. are related to medicine, Above-mentioned Medical Data source is the database of Medical Data of being stored with, and such as database can be by the data of the Medical Data that is stored with Snapshot memory access.
For the scene of Medical Data search, another embodiment of the invention additionally provides a kind of search dress of network information Put, including:
Communication interface is suitable to receive the medicine search critical data from request end, and, by the Search Results for getting It is back to request end;Adaptation is suitable to using default mapping ruler matching triggering corresponding with the medicine search critical data , the triggering item is that the medicine search critical data used in network is carried out extracting what is obtained;Trigger is suitable to using matching The triggering item querying triggering file for going out, knows the Medical Data source that the corresponding Search Results of medicine search critical data are located, should Triggering file is generated by triggering item and associated Medical Data source location information;Getter is suitable to from the medicine known Data source obtains Search Results, and the Search Results are collected by the network information in advance to including triggering item and are integrated and give birth to Into.
Above-mentioned medicine search critical data is the data for including that hospital, doctor, medicine, medicine equipment etc. are related to medicine, Above-mentioned Medical Data source is the database of Medical Data of being stored with, and such as database can be by the data of the Medical Data that is stored with Snapshot memory access.
Another embodiment of the invention provides a kind of searching method of the network information:Wherein:It is described to be reflected using default Penetrating rule match triggering item corresponding with the search critical data includes:
Using default natural language processing analysis rule matching triggering item corresponding with search critical data,
And/or,
Using default regular expression rule match triggering item corresponding with search critical data.
Another embodiment of the invention provides a kind of searching method of the network information:Wherein, the triggering item is to net Search critical data used in network carries out extracting including of obtaining:
Triggering item is extracted from search critical data according to the usage frequency and/or attention rate grade of search critical data, Wherein, the usage frequency and/or attention rate higher grade for searching for critical data, at least part of data in the search critical data The probability for being chosen for triggering item is bigger.
Another embodiment of the invention provides a kind of searching method of the network information, wherein, it is described from the number known Search Results are obtained according to source, the Search Results are back to into request end includes:
When there are no corresponding Search Results at least one data source known, from data source server in real time Crawl includes the network information of the triggering item, is counting the network information as the corresponding Search Results record of corresponding triggering item According to source, and the Search Results are back to into request end.
Another embodiment of the invention provides a kind of searching method of the network information, wherein, methods described also includes:
When receiving from the search critical data of request end, carried out in real time in a network according to the search critical data The crawl of info web, obtains capturing result;
Using the crawl result as the side information of the Search Results, it is back to after merging with the Search Results and asks Ask end.
Another embodiment of the invention provides a kind of searcher of the network information, wherein,
The adaptation, is suitable to corresponding with search critical data using the matching of default natural language processing analysis rule Triggering item, and/or, using default regular expression rule match triggering item corresponding with search critical data;
Wherein, the triggering item is crucial from search according to the usage frequency and/or attention rate grade of search critical data What extracting data was obtained, the usage frequency and/or attention rate grade of the search critical data is higher, the search critical data In at least part of data be chosen for trigger item probability it is bigger.
Another embodiment of the invention provides a kind of searcher of the network information, wherein, described each triggering item is matched somebody with somebody Be equipped with one or more type attributes, the triggering file by by each triggering item under affiliated each type attribute with it is corresponding Data source location information association and generate,
The adaptation, is suitable to using default mapping ruler matching triggering item corresponding with the search critical data and is somebody's turn to do The type attribute of triggering item;
The trigger, is suitable to the type attribute querying triggering file using the triggering item and the triggering item for matching, and obtains Know one or more data sources that the corresponding Search Results of the search critical data are located.
Another embodiment of the invention provides a kind of searcher of the network information, wherein, the data source location letter The uniform resource position mark URL for data source is ceased, and/or, the data source location information is by triggering item in affiliated type attribute Under MD5 values generate.
Another embodiment of the invention provides a kind of searcher of the network information, wherein, described each triggering item is matched somebody with somebody Be equipped with one or more type attributes, the triggering file by by each triggering item under affiliated each type attribute with it is corresponding Data source location information association and generate,
The adaptation, is suitable to using default mapping ruler matching triggering item corresponding with the search critical data and is somebody's turn to do The type attribute of triggering item;
The trigger, is suitable to from the data source known obtain the corresponding Search Results of triggering item for matching, and root Display state and displaying according to each data division in the Search Results that the type attribute setting of the triggering item for matching gets Grade;
The communication interface, is suitable to that Search Results are back to into request according to the display state and displaying grade of Search Results End.
Another embodiment of the invention provides a kind of searcher of the network information, wherein, the trigger is further adapted for The display state of the corresponding data division of type attribute of the triggering item for matching is set to show, shows that grade is set to the One grade;By do not match triggering item the corresponding data division of type attribute display state be set to hide or pack up, Show that grade is set to the second grade;Wherein, described the first estate is higher than second grade.
Another embodiment of the invention provides a kind of searcher of the network information, wherein,
The getter, is suitable to when there are no corresponding Search Results at least one data source known, from data Crawl in real time in origin server includes the network information of the triggering item, and the network information is corresponding as corresponding triggering item Search Results are recorded in data source, and indicate that the Search Results are back to request end by the communication interface.
Another embodiment of the invention provides a kind of search system of the network information, including:
Including the searcher and cache database of the above-mentioned network information,
The cache database, is suitable to store by advance to being collected and integrating including the network information for triggering item And the Search Results for generating;
The searcher of the network information, is suitable to obtain Search Results from the cache database.
Another embodiment of the invention provides a kind of search system of the network information, wherein, the system also includes grabbing Take server,
The crawl server, is suitable to when receiving from the search critical data of request end, crucial according to the search Data carry out in real time the crawl of info web in the data source server of storage corresponding web page information, obtain capturing result, The crawl result is respectively sent to into the searcher and cache database of the network information;
The searcher of the network information, is suitable to the result that captures as the side information of the Search Results, Request end is back to after merging with the Search Results;
The cache database, is suitable to merge to be stored in by the crawl result accordingly trigger the corresponding Search Results of item In.
Another embodiment of the invention provides a kind of search system of the network information, wherein, the system also includes grabbing Server is taken, wherein, the system also includes Data Collection integrated service device, is suitable to be grabbed in a network using web crawlers Take, collection includes triggering the network information of item, removes the identical data in the network information collected, and using normalization mode Many item datas of identical meanings are merged into into an item data;And/or, the data-interface provided from partner is obtained to be included triggering item The network information, remove the identical data in the network information that gets, and using normalization mode by the multinomial of identical meanings Data merge into an item data.
Another embodiment of the invention provides a kind of search system of the network information, wherein, the system also includes grabbing Server is taken, wherein, the cache database is realized by data snapshot memory access.
However, this programme is not limited to apply the scene in Medical Data search, it is also possible to apply this programme and searching Suo Jiaoyu information, digital data, automobile, consumer industry data or other any search fields, or weather, train Ticket, plane ticket, stock, fund, shopping information, purchase by group, the search technique field such as film, music, novel, question and answer.
Provided herein algorithm and display be not inherently related to any certain computer, virtual system or miscellaneous equipment. Various general-purpose systems can also be used together based on teaching in this.As described above, construct required by this kind of system Structure be obvious.Additionally, the present invention is also not for any certain programmed language.It is understood that, it is possible to use it is various Programming language realizes the content of invention described herein, and the description done to language-specific above is to disclose this Bright preferred forms.
In specification mentioned herein, a large amount of details are illustrated.It is to be appreciated, however, that the enforcement of the present invention Example can be put into practice in the case of without these details.In some instances, known method, structure is not been shown in detail And technology, so as not to obscure the understanding of this description.
Similarly, it will be appreciated that in order to simplify the disclosure and help understand one or more in each inventive aspect, exist Above in the description of the exemplary embodiment of the present invention, each feature of the present invention is grouped together into single enforcement sometimes In example, figure or descriptions thereof.However, the method for the disclosure should be construed to reflect following intention:I.e. required guarantor The more features of feature that the application claims ratio of shield is expressly recited in each claim.More precisely, such as following Claims reflect as, inventive aspect is all features less than single embodiment disclosed above.Therefore, Thus the claims for following specific embodiment are expressly incorporated in the specific embodiment, wherein each claim itself All as the separate embodiments of the present invention.
Those skilled in the art are appreciated that can be carried out adaptively to the module in the equipment in embodiment Change and they are arranged in one or more equipment different from the embodiment.Can be the module or list in embodiment Unit or component are combined into a module or unit or component, and can be divided in addition multiple submodule or subelement or Sub-component.In addition at least some in such feature and/or process or unit is excluded each other, can adopt any Combine to all features disclosed in this specification (including adjoint claim, summary and accompanying drawing) and so disclosed Where all processes or unit of method or equipment are combined.Unless expressly stated otherwise, this specification is (including adjoint power Profit is required, summary and accompanying drawing) disclosed in each feature can it is identical by offers, be equal to or the alternative features of similar purpose carry out generation Replace.
Although additionally, it will be appreciated by those of skill in the art that some embodiments described herein include other embodiments In included some features rather than further feature, but the combination of the feature of different embodiments means in of the invention Within the scope of and form different embodiments.For example, in the following claims, embodiment required for protection appoint One of meaning can in any combination mode using.
The present invention all parts embodiment can be realized with hardware, or with one or more processor operation Software module realize, or with combinations thereof realization.It will be understood by those of skill in the art that can use in practice Microprocessor or digital signal processor (DSP) are come in the searcher for realizing the network information according to embodiments of the present invention The some or all functions of some or all parts.The present invention is also implemented as performing method as described herein Some or all equipment or program of device (for example, computer program and computer program).Such reality The program of the existing present invention can be stored on a computer-readable medium, or can have the form of one or more signal. Such signal can be downloaded from internet website and obtained, or be provided on carrier signal, or in any other form There is provided.
It should be noted that above-described embodiment the present invention will be described rather than limits the invention, and ability Field technique personnel can design without departing from the scope of the appended claims alternative embodiment.In the claims, Any reference symbol between bracket should not be configured to limitations on claims.Word "comprising" is not excluded the presence of not Element listed in the claims or step.Word "a" or "an" before element does not exclude the presence of multiple such Element.The present invention can come real by means of the hardware for including some different elements and by means of properly programmed computer It is existing.If in the unit claim for listing equipment for drying, several in these devices can be by same hardware branch To embody.The use of word first, second, and third does not indicate that any order.These words can be explained and be run after fame Claim.

Claims (23)

1. a kind of searching method of the network information, including:
When receiving from the search critical data of request end, using the matching of default mapping ruler and the search critical data Corresponding triggering item, the triggering item is that the search critical data used in network is carried out extracting what is obtained;
Using the triggering item querying triggering file for matching, according to the data source location information associated in triggering file is known The data source that the corresponding Search Results of search critical data are located, the triggering file is by triggering item and associated data source What positional information was generated, then only include triggering item and associated data source location information in the triggering file, or, each Triggering item is configured with one or more type attributes, and the triggering file triggers item in affiliated each type attribute by by each Under with corresponding data source location information associate and generates, then it is described triggering file include trigger item, trigger item type belong to Property and associated data source location information;Wherein, the data source location information be in the search system of the network information only The information in one property mark data source;
Search Results are obtained from the data source known, the Search Results are back to into request end, the Search Results are by advance The network information including the triggering item is collected and is integrated and generated, and corresponding data source is stored in after generation In.
2. method according to claim 1, wherein, the Search Results are by advance to including the network letter of the triggering item Breath be collected and integrate and generate including:
Captured in a network using web crawlers, collection includes triggering the network information of item, removed the network letter collected Identical data in breath, and many item datas of identical meanings are merged into by an item data using normalization mode;And/or
The data-interface provided from partner obtains the network information for including triggering item, removes the phase in the network information for getting Same data, and many item datas of identical meanings are merged into by an item data using normalization mode.
3. method according to claim 1, wherein,
It is described to be included using default mapping ruler matching triggering item corresponding with the search critical data:
Using default mapping ruler matching triggering item corresponding with the search critical data and the type attribute of the triggering item;
It is described to utilize the triggering item querying triggering file for matching, know that the corresponding Search Results of the search critical data are located Data source include:
Using the type attribute querying triggering file of the triggering item and the triggering item for matching, the search critical data pair is known One or more data sources that the Search Results answered are located.
4. method according to claim 1, wherein, the data source location information for data source URL URL, and/or, the data source location information is generated by MD5 values of the item under affiliated type attribute is triggered.
5. method according to claim 1, wherein,
It is described to be included using default mapping ruler matching triggering item corresponding with the search critical data:
Using default mapping ruler matching triggering item corresponding with the search critical data and the type attribute of the triggering item;
Described to obtain Search Results from the data source known, the Search Results are back to into request end includes:
The corresponding Search Results of triggering item for matching are obtained from the data source known, and according to the class of the triggering item for matching Type attribute arranges the display state of each data division in the Search Results that get and shows grade, by the Search Results with And the display state of each data division and displaying grade are back to request end in Search Results.
6. method according to claim 5, wherein, the type attribute of the triggering item that the basis is matched is arranged and got Search Results in each data division display state and show grade include:
The display state of the corresponding data division of type attribute of the triggering item for matching is set to show, shows that grade is arranged For the first estate;
The display state of the corresponding data division of type attribute of the triggering item not matched is set to hide or pack up, show Grade is set to the second grade;
Wherein, described the first estate is higher than second grade.
7. method according to claim 1, wherein, it is described using the matching of default mapping ruler and the search critical data Corresponding triggering item includes:
Using default natural language processing analysis rule matching triggering item corresponding with search critical data,
And/or,
Using default regular expression rule match triggering item corresponding with search critical data.
8. method according to claim 1, wherein, the triggering item is that the search critical data used in network is carried out What extraction was obtained includes:
Triggering item is extracted from search critical data according to the usage frequency and/or attention rate grade of search critical data, wherein, The usage frequency and/or attention rate grade of search critical data is higher, and at least part of data in the search critical data are selected The probability for being taken as triggering item is bigger.
9. method according to claim 1, wherein, it is described to obtain Search Results from the data source known, this is searched for As a result being back to request end includes:
When there are no corresponding Search Results at least one data source known, capture in real time from data source server Including the network information of the triggering item, record the network information as the corresponding Search Results of corresponding triggering item in data source In, and the Search Results are back to into request end.
10. method according to claim 1, wherein, methods described also includes:
When receiving from the search critical data of request end, webpage is carried out in real time in a network according to the search critical data The crawl of information, obtains capturing result;
Using the crawl result as the side information of the Search Results, request is back to after merging with the Search Results End.
A kind of 11. searchers of the network information, including:
Communication interface, is suitable to receive the search critical data from request end, and, the Search Results for getting are back to please Ask end;
Adaptation, is suitable to match triggering item corresponding with the search critical data, the triggering item using default mapping ruler It is that the search critical data used in network is carried out extracting what is obtained;
Trigger, is suitable to using the triggering item querying triggering file for matching, according to the data source location associated in triggering file Information knows the data source that the corresponding Search Results of the search critical data are located, and the triggering file is by triggering item and phase What the data source location information of association was generated, then only include triggering item and associated data source location letter in the triggering file Breath, or, each triggering item is configured with one or more type attributes, and the triggering file triggers item affiliated by by each Associate with corresponding data source location information under each type attribute and generate, then the triggering file includes triggering item, touches Send out the type attribute and associated data source location information of item;Wherein, the data source location information is in the network information The information of unique identification's data source in search system;
Getter, is suitable to obtain Search Results from the data source known, the Search Results are by advance to including the triggering The network information of item is collected and integrates and generate, and is stored in after generation in corresponding data source.
12. devices according to claim 11, wherein,
The adaptation, is suitable to using default natural language processing analysis rule matching triggering corresponding with search critical data , and/or, using default regular expression rule match triggering item corresponding with search critical data;
Wherein, it is described triggering item be according to search critical data usage frequency and/or attention rate grade from search critical data Middle to extract what is obtained, the usage frequency and/or attention rate grade of the search critical data is higher, in the search critical data The probability that at least part of data are chosen for triggering item is bigger.
13. devices according to claim 11, wherein,
The adaptation, is suitable to using default mapping ruler matching triggering item corresponding with the search critical data and the triggering The type attribute of item;
The trigger, is suitable to the type attribute querying triggering file using the triggering item and the triggering item for matching, and knows institute State one or more data sources that the corresponding Search Results of search critical data are located.
14. devices according to claim 11, wherein, the data source location information is positioned for the unified resource of data source Symbol URL, and/or, the data source location information is generated by MD5 values of the item under affiliated type attribute is triggered.
15. devices according to claim 11, wherein,
The adaptation, is suitable to using default mapping ruler matching triggering item corresponding with the search critical data and the triggering The type attribute of item;
The trigger, is suitable to from the data source known obtain the corresponding Search Results of triggering item for matching, and according to The type attribute of the triggering item allotted arranges the display state of each data division in the Search Results for getting and shows grade;
The communication interface, is suitable to that Search Results are back to into request end according to the display state and displaying grade of Search Results.
16. devices according to claim 15, wherein, the trigger is further adapted for the type for triggering item that will be matched The display state of the corresponding data division of attribute is set to show, shows that grade is set to the first estate;By touching for not matching The display state for sending out the corresponding data division of type attribute of item is set to hide or pack up, show that grade is set to second etc. Level;Wherein, described the first estate is higher than second grade.
17. devices according to claim 11, wherein,
The getter, is suitable to when there are no corresponding Search Results at least one data source known, from data source Crawl in real time in server includes the network information of the triggering item, using the network information as the corresponding search of corresponding triggering item As a result record in data source, and indicate that the Search Results are back to request end by the communication interface.
A kind of 18. search systems of the network information, including:
The searcher and cache database of the network information as described in above-mentioned any one of claim 11 to 17,
The cache database, is suitable to storage and is collected by the network information in advance to including the triggering item and is integrated and give birth to Into Search Results;
The searcher of the network information, is suitable to obtain Search Results from the cache database.
19. systems according to claim 18, wherein, the system also includes crawl server,
The crawl server, is suitable to when receiving from the search critical data of request end, according to the search critical data Carry out the crawl of info web in real time in the data source server of storage corresponding web page information, obtain capturing result, by this Crawl result is respectively sent to the searcher and cache database of the network information;
The searcher of the network information, is suitable to the result that captures as the side information of the Search Results, with institute State after Search Results merge and be back to request end;
The cache database, is suitable to merge to be stored in by the crawl result accordingly trigger in the corresponding Search Results of item.
20. systems according to claim 18, wherein, the system also includes Data Collection integrated service device, is suitable to profit Captured in a network with web crawlers, collection includes triggering the network information of item, in removing the network information collected Identical data, and many item datas of identical meanings are merged into by an item data using normalization mode;And/or, carry from partner For data-interface obtain the network information for including triggering item, the identical data in the network information that gets of removal, and adopt Many item datas of identical meanings are merged into an item data by normalization mode.
21. systems according to claim 18, wherein, the cache database is realized by data snapshot memory access.
A kind of 22. searching methods of the network information, including:
When receiving from the medicine search critical data of request end, using the matching of default mapping ruler and the medicine search The corresponding triggering item of critical data, the triggering item is to carry out extraction to the medicine search critical data used in network to obtain 's;
Using the triggering item querying triggering file for matching, known according to the Medical Data source location information associated in triggering file The medicine searches for the Medical Data source that the corresponding Search Results of critical data are located, and the triggering file is by triggering item and phase What the Medical Data source location information of association was generated, then only include triggering item and associated Medical Data in the triggering file Source location information, or, each triggering item is configured with one or more type attributes, and the triggering file triggers item by by each Associate with corresponding Medical Data source location information under affiliated each type attribute and generate, then wrap in the triggering file Include triggering item, the type attribute of triggering item and associated Medical Data source location information;Wherein, the Medical Data source position Information is the information in unique identification's Medical Data source in the search system of the network information;
Obtain Search Results from the Medical Data source known, the Search Results be back to into request end, the Search Results by The network information including the triggering item is collected in advance and is integrated and is generated, and corresponding doctor is stored in after generation In medicine data source.
A kind of 23. searchers of the network information, including:
Communication interface, is suitable to receive the medicine search critical data from request end, and, the Search Results for getting are returned To request end;
Adaptation, is suitable to using default mapping ruler matching triggering item corresponding with the medicine search critical data, described to touch It is that the medicine search critical data used in network is carried out extracting what is obtained to send out item;
Trigger, is suitable to using the triggering item querying triggering file for matching, according to the Medical Data source associated in triggering file Positional information knows the Medical Data source that the corresponding Search Results of the medicine search critical data are located, and the triggering file is Generated by triggering item and associated Medical Data source location information, then only include triggering item and correlation in the triggering file The Medical Data source location information of connection, or, each triggering item is configured with one or more type attributes, the triggering file by Each triggering item is associated under affiliated each type attribute with corresponding Medical Data source location information and is generated, then it is described Triggering file includes triggering item, the type attribute of triggering item and associated Medical Data source location information;Wherein, the doctor Medicine data source location information is the information in unique identification's Medical Data source in the search system of the network information;
Getter, is suitable to obtain Search Results from the Medical Data source known, the Search Results are by advance to including described The network information of triggering item is collected and integrates and generate, and is stored in after generation in corresponding Medical Data source.
CN201310169964.4A 2013-05-09 2013-05-09 Method, device and system for searching network information Active CN103246726B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310169964.4A CN103246726B (en) 2013-05-09 2013-05-09 Method, device and system for searching network information

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310169964.4A CN103246726B (en) 2013-05-09 2013-05-09 Method, device and system for searching network information

Publications (2)

Publication Number Publication Date
CN103246726A CN103246726A (en) 2013-08-14
CN103246726B true CN103246726B (en) 2017-04-12

Family

ID=48926246

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310169964.4A Active CN103246726B (en) 2013-05-09 2013-05-09 Method, device and system for searching network information

Country Status (1)

Country Link
CN (1) CN103246726B (en)

Families Citing this family (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106708856A (en) * 2015-11-13 2017-05-24 百度在线网络技术(北京)有限公司 Information retrieval method and apparatus
US10362060B2 (en) * 2015-12-30 2019-07-23 International Business Machines Corporation Curtailing search engines from obtaining and controlling information
CN105930528B (en) * 2016-06-03 2020-09-08 腾讯科技(深圳)有限公司 Webpage caching method and server
CN106202260B (en) * 2016-06-29 2021-07-27 百度在线网络技术(北京)有限公司 Search method and device and search engine
CN108519984B (en) * 2018-02-07 2022-11-04 平安科技(深圳)有限公司 Weather data processing method, server and computer readable storage medium
CN110765275B (en) * 2019-10-14 2023-02-07 深圳平安医疗健康科技服务有限公司 Search method, search device, computer equipment and storage medium
CN112214505A (en) * 2020-10-21 2021-01-12 北京金堤征信服务有限公司 Data synchronization method and device, computer readable storage medium and electronic equipment
CN112807697A (en) * 2021-01-28 2021-05-18 北京达佳互联信息技术有限公司 List generation method and device, electronic equipment and storage medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1940922A (en) * 2005-09-30 2007-04-04 腾讯科技(深圳)有限公司 Method and system for improving information search speed
CN102663088A (en) * 2012-03-31 2012-09-12 百度在线网络技术(北京)有限公司 Method and equipment for providing search results
CN102930054A (en) * 2012-11-19 2013-02-13 北京奇虎科技有限公司 Data search method and data search system
CN103034663A (en) * 2011-09-29 2013-04-10 阿里巴巴集团控股有限公司 Information searching method and equipment

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1609848A (en) * 2003-10-23 2005-04-27 肖宁 Predefined keywords electronic file searching method
US20080319960A1 (en) * 2007-06-25 2008-12-25 Yuan-Jung Chang Information searching method, information searching system and inputting device thereof
CN102831253B (en) * 2012-09-25 2015-01-21 北京科东电力控制系统有限责任公司 Distributed full-text retrieval system

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1940922A (en) * 2005-09-30 2007-04-04 腾讯科技(深圳)有限公司 Method and system for improving information search speed
CN103034663A (en) * 2011-09-29 2013-04-10 阿里巴巴集团控股有限公司 Information searching method and equipment
CN102663088A (en) * 2012-03-31 2012-09-12 百度在线网络技术(北京)有限公司 Method and equipment for providing search results
CN102930054A (en) * 2012-11-19 2013-02-13 北京奇虎科技有限公司 Data search method and data search system

Also Published As

Publication number Publication date
CN103246726A (en) 2013-08-14

Similar Documents

Publication Publication Date Title
CN103246726B (en) Method, device and system for searching network information
Raghavan Digital forensic research: current state of the art
CN103890709B (en) Key value database based on caching maps and replicates
Lehmberg et al. The mannheim search join engine
KR101775883B1 (en) Method and system for processing information of a stream of information
CN102831214B (en) time series search engine
CN104050223B (en) Pivot face for text mining and search
US20140344274A1 (en) Information structuring system
CN107145496A (en) The method for being matched image with content item based on keyword
KR20120129982A (en) Marker search system for augmented reality service
US9344507B2 (en) Method of processing web access information and server implementing same
CN103617213B (en) Method and system for identifying newspage attributive characters
CN102663060B (en) Method and device for identifying tampered webpage
CN110352427A (en) System and method for collecting data associated with the fraudulent content in networked environment
Vijiyarani et al. Research issues in web mining
CN104067273A (en) Grouping search results into a profile page
CN112765366A (en) APT (android Package) organization portrait construction method based on knowledge map
CN105095175A (en) Method and device for obtaining truncated web title
US8700624B1 (en) Collaborative search apps platform for web search
Arshad et al. A multilayered semantic framework for integrated forensic acquisition on social media
CN104317867A (en) System for carrying out entity clustering on web pictures returned by search engine
Bissyandé et al. Orion: A software project search engine with integrated diverse software artifacts
KR20160009850A (en) Method of Disease Information Analysis System
Alonso et al. Clustering of search results using temporal attributes
CN106980658A (en) Video labeling method and device

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right

Effective date of registration: 20170309

Address after: Room 2309, building 20, building 12, No. 93 Jianguo Road, Beijing, Chaoyang District, China

Applicant after: Beijing Fu Tong Tong Technology Co., Ltd.

Address before: 100088 Beijing city Xicheng District xinjiekouwai Street 28, block D room 112 (Desheng Park)

Applicant before: Beijing Qihu Technology Co., Ltd.

Applicant before: Qizhi Software (Beijing) Co., Ltd.

TA01 Transfer of patent application right
GR01 Patent grant
GR01 Patent grant