CN1325076A - Comprehensive network Chinese information searcher - Google Patents

Comprehensive network Chinese information searcher Download PDF

Info

Publication number
CN1325076A
CN1325076A CN 00115797 CN00115797A CN1325076A CN 1325076 A CN1325076 A CN 1325076A CN 00115797 CN00115797 CN 00115797 CN 00115797 A CN00115797 A CN 00115797A CN 1325076 A CN1325076 A CN 1325076A
Authority
CN
China
Prior art keywords
engine
degree
correlation
feedback
user
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN 00115797
Other languages
Chinese (zh)
Inventor
林宏
鲍劲松
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
WANWEI INFORMATION TECHN CO Ltd SHANGHAI
Original Assignee
WANWEI INFORMATION TECHN CO Ltd SHANGHAI
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by WANWEI INFORMATION TECHN CO Ltd SHANGHAI filed Critical WANWEI INFORMATION TECHN CO Ltd SHANGHAI
Priority to CN 00115797 priority Critical patent/CN1325076A/en
Publication of CN1325076A publication Critical patent/CN1325076A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The present invention relates to a network Chinese information comprehensive searcher, its system includes: input end, noise filtering, interpretation end and automatic regulating adaptive treamtent for the engine of the partner, according to the established network analog mathematical modal analyzing and searching column information, and according to user's request editing and providing for user to use. Said analog mathematical model analysis adopts Wideway Search engine to make unified mathematical model analysis and subsumption for feedback pages of all engines, and unify degree of correlation for different feedback results. Suppose the condition is: original engine weight KL, page self correlation degree C, page feedback time T, one engine feedback result number N, using bell-shaped pulse mathermatical formula to make analysis and obtain unified degree of correlation.

Description

Network Chinese informix searcher
The present invention relates to the disposal system of computer network information, particularly a kind of network Chinese informix searcher.
The current Internet page whole world has 1,000,000,000 approximately, relates to the various aspects of daily life.Once people were also for the information on the Internet was few and worried, nowadays but were trapped in knowledge explosion, too many information makes people condition at a loss as to what to do.From being born of YAHOO search engine, to the appearance of increasing search engine today, demonstrate the attention of people invariably to information searching, how the information world in vastness finds own desired information, becomes the problem that people have to think.
The METAENGINE that occurred on INTERNET in recent years is that the database with each search engine puts together, for the user provides cover ratio wider, more accurate search, because a tame engine is difficult to all pages in the limit world, technically, all impossible in the operation.
Along with popularizing of Chinese INTERNET, Chinese number of netizens just doubled every the several months, and the quantity of information of Chinese is also increasing.Same information searching problem pendulum is in face of people, provide the engine of Chinese search mainly to be at present, Chinese YAHOO, SOHU Chinese, Sina's Chinese, day net, several families such as move about unhurriedly, the Chinese information that they include is limited, classification, arrangement to Chinese simultaneously do not have careful processing, and the information that people often find not is that they are desired.
So far go back the METAENGINE of neither one Chinese on the INTERNET, if the database of each Chinese engine is put together, the result to search chooses carefully simultaneously, allows Chinese also can enjoy the benefit that advanced inquiry brings so.
Purpose of the present invention is exactly to satisfy the demand of user to the query and search of Chinese information for the problem that solves prior art.Adopt the SERVLET technology, use the network environment of the up-to-date present domestic complexity of technical finesse.
System architecture of the present invention comprises a kind of network Chinese informix searcher, system architecture mainly comprises input end, noise filtering, explanation is held and the automatic adjustment adaptation of the other side's engine is handled, the information that searches according to set network analog mathematics model analysis, and ask according to the user, hand over the user to use after the layout, it is characterized in that described network analog mathematics model analysis is to adopt Wideway Search engine that the page of all engine feedback is made unified mathematics model analysis to sort out, and variant feedback result unified the degree of correlation, this degree of correlation analysis condition is: the weight KL of original engine, the degree of correlation C of the page itself, the time T of page feedback, the number of results N of an engine feedback must unify the degree of correlation behind bell pulse mathematics formula analysis.
Fig. 1 is a system architecture synoptic diagram of the present invention
Further specify embodiments of the invention below in conjunction with accompanying drawing
The user proposes retrieval request such as descriptor, keyword etc. at input end, enter page intellectual analysis engine, this engine has been included the most authoritative search engine of present Chinese such as Chinese YAHOO, Chinese EXITE, Sina's Chinese, Sohu's Chinese, Beijing University's sky net, has been moved about unhurriedly, flyings Chinese, Chinese network allusion quotation, Omron, searcher, Netease's Chinese.This search engine is also supported English, includes present world-technology forefront, five tame search engine: YAHOO, ALAVESTA, NORTHERNLIGHT, DIRECTHIT, GOOGLE that data is the most complete.Because the Chinese present network bandwidth is limited, the analysis of the page often will be depended on the result of original engine.In order from finite information, to obtain more information, this engine adopts intelligent inference to analyze the page, with the request of user input through noise filtering, to information classify automatically handle after, automatically find out relevant speech by explaining end from the page that feeds back, count in the degree of correlation, engine adjustment to the other side adapts to processing automatically then, user's request meanwhile enters parallel page request engine and searches for, and the information that searches returned feedback page intellectual analysis engine, the information that searches according to set network analog mathematics model analysis, and ask to hand over the user to use after the layout according to the user.
The network analog mathematics model analysis adopts the WIDEWAYSEARCH engine that the page of all engine feedback is made unified mathematics model analysis to sort out, and the result of different engine feedback is had the unified degree of correlation.
This degree of correlation analysis condition:
1, the weight KL of original engine: this is basic weight.Because there is difference in Chinese engine masses, so with the engine classification, for good engine, the weight of the record that feeds back to will be higher than the result of other engine feedback.
2, the degree of correlation C of the page itself: the result to every feedback makes intellectual analysis, and the degree of correlation of judged result is promptly done the full-text search analysis to brief introduction.
3, the time T of page feedback: because the degree of correlation and server and user's distance dependent, meet user's needs especially such as certain bar record of YAHOO, still will be linked to this page may change the user 10 minutes, and the validity of this record will fall under suspicion.Therefore monitor the feedback result of every record, offer the spended time of this record of user's reference links.And do not make the user do useless trial.
4, the number of results N of an engine feedback: it will be for referencial use by the final degree of correlation, and we notice that the number of results of feedback is relevant with the database size of this engine in fact, and the cover ratio of this engine obviously can judge that the authority of this record exerts an influence to the user.
Behind bell pulse mathematics formula analysis, must unify the degree of correlation according to above degree of correlation analysis condition.
Advantage of the present invention is to adopt many classification to process, in the network environment of present domestic complexity The user can classify according to the engine of the degree of correlation, time, domain name, selection, allows the user The SERVLET technique construction is adopted in easier convenient navigation in information, and very big stretching arranged Special load has been done from suitable to service routine simultaneously to deal with the request of a large number of users in the space Should process, can offer the many and meticulous information of user, and can greatly reduce that the user waits for Time.

Claims (1)

1, a kind of network Chinese informix searcher, system architecture mainly comprises input end, noise filtering, explanation is held and the automatic adjustment adaptation of the other side's engine is handled, the information that searches according to set network analog mathematics model analysis, and ask according to the user, hand over the user to use after the layout, it is characterized in that described network analog mathematics model analysis is to adopt the WidewaySearch engine that the page of all engine feedback is made unified mathematics model analysis to sort out, and variant feedback result unified the degree of correlation, this degree of correlation analysis condition is: the weight KL of original engine, the degree of correlation C of the page itself, the time T of page feedback, the number of results N of an engine feedback must unify the degree of correlation behind bell pulse mathematics formula analysis.
CN 00115797 2000-05-23 2000-05-23 Comprehensive network Chinese information searcher Pending CN1325076A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN 00115797 CN1325076A (en) 2000-05-23 2000-05-23 Comprehensive network Chinese information searcher

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN 00115797 CN1325076A (en) 2000-05-23 2000-05-23 Comprehensive network Chinese information searcher

Publications (1)

Publication Number Publication Date
CN1325076A true CN1325076A (en) 2001-12-05

Family

ID=4585239

Family Applications (1)

Application Number Title Priority Date Filing Date
CN 00115797 Pending CN1325076A (en) 2000-05-23 2000-05-23 Comprehensive network Chinese information searcher

Country Status (1)

Country Link
CN (1) CN1325076A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN100371932C (en) * 2004-03-23 2008-02-27 南京大学 Expandable and customizable theme centralized universile-web net reptile setup method
CN101454782A (en) * 2006-03-29 2009-06-10 甲骨文国际公司 Contextual search of a collaborative environment
CN1648902B (en) * 2004-01-26 2010-12-08 微软公司 System and method for a unified and blended search
CN103970816A (en) * 2013-01-24 2014-08-06 国际商业机器公司 Simulating Accesses For Archived Content

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1648902B (en) * 2004-01-26 2010-12-08 微软公司 System and method for a unified and blended search
CN100371932C (en) * 2004-03-23 2008-02-27 南京大学 Expandable and customizable theme centralized universile-web net reptile setup method
CN101454782A (en) * 2006-03-29 2009-06-10 甲骨文国际公司 Contextual search of a collaborative environment
CN101454782B (en) * 2006-03-29 2014-01-29 甲骨文国际公司 Contextual search of a collaborative environment
CN103970816A (en) * 2013-01-24 2014-08-06 国际商业机器公司 Simulating Accesses For Archived Content
CN103970816B (en) * 2013-01-24 2017-04-05 国际商业机器公司 The method and system of mark content to be issued

Similar Documents

Publication Publication Date Title
Wu et al. Query selection techniques for efficient crawling of structured web sources
Tanudjaja et al. Persona: A contextualized and personalized web search
JP5114380B2 (en) Reranking and enhancing the relevance of search results
CN100440224C (en) Automatization processing method of rating of merit of search engine
US20020091661A1 (en) Method and apparatus for automatic construction of faceted terminological feedback for document retrieval
CN1389811A (en) Intelligent search method of search engine
CN101908071A (en) Method and device thereof for improving search efficiency of search engine
WO2008109485A1 (en) Personalized shopping recommendation based on search units
WO2010065345A1 (en) System and methods for automatic clustering of ranked and categorized search objects
KR20040029895A (en) Search system
US20070271228A1 (en) Documentary search procedure in a distributed system
Sharma et al. The anatomy of web crawlers
KR20030069640A (en) System and method for geting information on hierarchical and conceptual clustering
Shekhar et al. An architectural framework of a crawler for retrieving highly relevant web documents by filtering replicated web collections
Jin et al. Tise: A temporal search engine for web contents
CN1325076A (en) Comprehensive network Chinese information searcher
US20030018617A1 (en) Information retrieval using enhanced document vectors
Yan et al. An improved PageRank method based on genetic algorithm for web search
Jadidoleslamy Introduction to metasearch engines and result merging strategies: a survey
US7490082B2 (en) System and method for searching internet domains
Satokar et al. Web search result personalization using web mining
Sugiyama et al. A method of improving feature vector for web pages reflecting the contents of their out-linked pages
Yu et al. The design and realization of open-source search engine based on Nutch
US20100076964A1 (en) Instance-Class-Attribute Matching Web Page Ranking
Pardakhe et al. Enhancement of web search engine results using keyword frequency based ranking

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication