CN102609473B - Method and system for website accessing - Google Patents

Method and system for website accessing Download PDF

Info

Publication number
CN102609473B
CN102609473B CN201210016303.3A CN201210016303A CN102609473B CN 102609473 B CN102609473 B CN 102609473B CN 201210016303 A CN201210016303 A CN 201210016303A CN 102609473 B CN102609473 B CN 102609473B
Authority
CN
China
Prior art keywords
domain name
user
website
data base
index data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201210016303.3A
Other languages
Chinese (zh)
Other versions
CN102609473A (en
Inventor
钟进发
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CN201210016303.3A priority Critical patent/CN102609473B/en
Publication of CN102609473A publication Critical patent/CN102609473A/en
Application granted granted Critical
Publication of CN102609473B publication Critical patent/CN102609473B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention disclosed a method and a system for website accessing. The method includes steps of A, building an index database in advance, a plurality of domain name bodies and at least one piece of information about an integral domain name corresponding to each domain name body; B receiving the body of a specific domain name corresponding to a specific website input by a user and sending the body of the specific domain name to the index database through a webpage server; C, using the body of the specific domain name as a key word to search for the integral domain name information corresponding to the body of the specific domain name and returning the searching results to the user. The invention further constructs the website accessing system. By the method, the user can access the specific website without inputting a suffix of the domain name, website accessing time is saved, problems caused by misspelling of the suffix are avoided, and accordingly user experience is enhanced.

Description

A kind of Website access method and system
Technical field
The present invention relates to Internet technical field, relate in particular to a kind of Website access method and system.
Background technology
Domain name on internet, is equivalent to the house number of website, has relation one to one with website, inputs domain name and just can be connected to specific website on browser.Domain name is comprised of the main body of domain name and the suffix of domain name, and domain name suffix refers to a rearmost thing of domain name, and as " domain name A.COM ", " domain name A " before round dot is the main body of domain name, and " .COM " is below exactly suffix.The domain name of different suffix has different implications, the industrial characteristic of ordinary representation domain name or regional attribute, and as " .com " represents commercial company, " .gov " represents government organs, " .cn " and " .us " represents respectively China and U.S.A domain name.The whole domain name that must input website when user uses browser access some websites conventionally, comprises domain name main body and suffix.The defeated suffix of user needs to strike keyboard several times more, can reduce the efficiency of user's access websites; Particularly, for the Chinese suffix of be about to releasing, after the defeated English domain name main body of user, need to switch to Chinese character coding input method, Chinese suffix, such as ". company " and ". network ", need strike more keyboard when typewriting, and obviously can affect user's experience.In addition, user sometimes can forget suffix name and cannot access specific website; Or be strayed into another incoherent website because of the suffix of input error, and wasted the valuable time, for incoherent website, increased visit capacity simultaneously.
Summary of the invention
The technical problem to be solved in the present invention is, thereby could access websites make the defect that website visiting efficiency is lower, error rate is high for the above-mentioned Fully-Qualified Domain Name of need to inputting of prior art, and the Website access method that a kind of efficiency is high, error rate is low is provided.
The technical solution adopted for the present invention to solve the technical problems is: construct a kind of Website access method, it is characterized in that, comprising:
A. set up in advance index data base, described index data base stores a plurality of domain name main bodys and at least one information about Fully-Qualified Domain Name corresponding with each domain name main body, Fully-Qualified Domain Name is comprised of domain name main body and domain name suffix, and according to Fully-Qualified Domain Name language or the address of corresponding website the information of Fully-Qualified Domain Name is carried out to classification and ordination;
B. receive the domain name main body of the corresponding certain domain name of specific website of user's input, and the domain name main body of described certain domain name is sent to index data base by web page server;
The information of the corresponding Fully-Qualified Domain Name of domain name main body that the domain name main body of described certain domain name of C. take in index data base is certain domain name described in keyword query, and Query Result is returned to user, described Query Result comprises the information of the unique preferred Fully-Qualified Domain Name being screened from corresponding classification and ordination according to user profile by system, the domain name main body of the corresponding domain name of information of this preferred Fully-Qualified Domain Name is identical with the domain name main body of user's input, during sequence, the information of described unique preferred Fully-Qualified Domain Name is ranked the first.
In Website access method of the present invention, described Website access method also comprises:
D. index data base is upgraded.
In Website access method of the present invention, described steps A comprises:
A1. generate new keyword as domain name main body;
A2. generated domain name main body is mated with all domain name suffix, be combined into a plurality of complete domain names;
A3. described a plurality of complete domain names are filtered, to delete non-existent domain name;
A4. the corresponding website of domain name after filtering is analyzed, and the domain name after to described filtration sorts according to analysis result, to generate full ranking results;
A5. deposit described full ranking results in index data base.
In Website access method of the present invention, between described steps A 4 and steps A 5, also comprise:
A6. described full ranking results is carried out to classification and ordination according to language or the address of the corresponding website of domain name after filtering, to generate classification and ordination result;
Described steps A 5 also comprises:
Deposit described classification and ordination result in index data base.
In Website access method of the present invention, described step C comprises:
C1. the main body of described certain domain name of take in index data base is inquired about as keyword, and judges whether to inquire the information of the Fully-Qualified Domain Name corresponding with the main body of described certain domain name, if so, performs step C2; If not, perform step C3;
C2. according to user profile, full ranking results or classification and ordination result are screened, and the ranking results after screening is returned to user as Query Result;
C3. the main body of described certain domain name is mated with all domain name suffix, be combined into a plurality of complete domain names;
C4. described a plurality of complete domain names are filtered, to delete non-existent domain name, and the quantity of the domain name after judgement filtration, if the quantity of the domain name after filtering is 0, perform step C5; If the quantity of the domain name after filtering is 1, perform step C6; If the quantity of the domain name after filtering is greater than 1, perform step C7;
C5. to user feedback domain name, do not exist;
C6. this domain name after filtering to user feedback;
C7. the corresponding website of domain name after filtering is analyzed, and according to filtering out related web site in analysis result and the corresponding website of the domain name of user profile from described filtration;
C8. judge the quantity of described related web site, if the quantity of related web site is 0, perform step C9; If the quantity of related web site is 1, perform step C10; If the quantity of related web site is greater than 1, perform step C11;
C9. the corresponding website of domain name after filtering is analyzed, and according to analysis result, the domain name after to described filtration sorts, and to generate full ranking results, and full ranking results is returned to user;
C10. the domain name of this unique related web site is returned to user;
C11. related web site is analyzed, and generated relevance ranking result according to analysis result, and relevance ranking result is returned to user as Query Result.
In Website access method of the present invention, in step C4, if the quantity of the domain name after filtering is 0, also perform step C12; If the quantity of the domain name after filtering is 1, also perform step C13; If the quantity of the domain name after filtering is greater than 1, also perform step C14;
C12. do not exist the information of Fully-Qualified Domain Name to be stored to index data base the main body of described certain domain name;
C13. the corresponding unique complete domain name of the main body of described certain domain name is stored to index data base;
C14. the corresponding website of domain name after filtering is analyzed, and the domain name after to described filtration sorts according to analysis result, to generate full ranking results;
C15. described full ranking results is carried out to classification and ordination according to language or the address of the corresponding website of domain name after filtering, to generate classification and ordination result;
C16. deposit described full ranking results and classification and ordination result in index data base.
In Website access method of the present invention, in described step D, according at least one in following, index data base is upgraded:
According to registered user's active feedback, upgrade index data base;
According to a plurality of users' history access record, upgrade index data base;
The new website of periodic search is to upgrade index data base.
In Website access method of the present invention, the step of upgrading index data base according to user's active feedback comprises:
D1. receive priority domain name or blacklist domain name that the first registered user feeds back, and be committed to index data base;
D2. index data base is inputted the first registered user priority domain name or blacklist domain name are distributed at least one other registered user, to investigate;
D3. at least one other registered user investigates described priority domain name or the corresponding website of blacklist domain name, and investigation result is returned to index data base;
D4. index data base is according to investigation result, and whether the feedback of evaluating described the first registered user is accurate;
If D5. the first registered user's feedback conforms to investigation result, according to the first registered user's feedback, upgrade index data base; Otherwise, do not upgrade index data base.
In Website access method of the present invention, following analysis is carried out in the corresponding website of domain name after filtering: the language of renewal frequency, hour of log-on, web site contents amount, employing, the corresponding area of Website server IP, linking relationship and domain name suffix information.
In Website access method of the present invention, described user profile comprises non-registered users information or information of registered users, and non-registered users information comprises: IP address, the history access record relevant to IP address, the setting of classification and ordination result type; Information of registered users comprises: station address, user preference setting, IP address, user's history access record.
In Website access method of the present invention, described user profile is arranged as according to weight order from high to low: user preference setting, the setting of classification and ordination result type, user's history access record, history access record, station address, the IP address relevant to IP address.
In Website access method of the present invention, described step B also comprises: receive the user that quotes of user's input, and the weight of information of quoting user is between the setting of classification and ordination result type and user's history access record;
Step C2 also comprises: according to quoted user's information, full ranking results or classification and ordination result are screened.
The present invention also constructs a kind of website visiting system, comprising:
Index data base is set up module, for setting up in advance index data base, described index data base stores a plurality of domain name main bodys and at least one information about Fully-Qualified Domain Name corresponding with each domain name main body, Fully-Qualified Domain Name is comprised of domain name main body and domain name suffix, and according to Fully-Qualified Domain Name language or the address of corresponding website the information of Fully-Qualified Domain Name is carried out to classification and ordination;
User's load module, for receiving the domain name main body of the corresponding certain domain name of specific website of user's input, and is sent to index data base by the domain name main body of described certain domain name by web page server;
Enquiry module, for take the information of the corresponding Fully-Qualified Domain Name of domain name main body that the domain name main body of described certain domain name is certain domain name described in keyword query at index data base, and Query Result is returned to user, described Query Result comprises the information of the unique preferred Fully-Qualified Domain Name being screened from corresponding classification and ordination according to user profile by system, the domain name main body of the corresponding domain name of information of this preferred Fully-Qualified Domain Name is identical with the domain name main body of user's input, during sequence, the information of described unique preferred Fully-Qualified Domain Name is ranked the first.
In website visiting system of the present invention, described website visiting system also comprises:
Update module, for upgrading index data base.
In website visiting system of the present invention, described index data base is set up module and is comprised:
Keyword generation unit, for generating new keyword as domain name main body;
The first assembled unit, for generated domain name main body is mated with all domain name suffix, is combined into a plurality of complete domain names;
Filter element, for described a plurality of complete domain names are filtered, to delete non-existent domain name;
The first full sequencing unit, for the corresponding website of domain name after filtering is analyzed, and the domain name after to described filtration sorts according to analysis result, to generate full ranking results;
The first storage unit, for depositing described full ranking results in index data base.
In website visiting system of the present invention, described index data base is set up module and is also comprised:
The first classification and ordination unit, for described full ranking results is carried out to classification and ordination according to language or the address of the corresponding website of domain name after filtering, to generate classification and ordination result;
The second storage unit, for depositing described classification and ordination result in index data base.
In website visiting system of the present invention, described enquiry module comprises:
Inquiry and judging unit, inquire about as keyword for take the main body of described certain domain name at index data base, and judge whether to inquire the information of the complete domain name corresponding with the main body of described certain domain name;
First returns to unit, for when inquiring the information of the complete domain name corresponding with the main body of described certain domain name, according to user profile, full ranking results or classification and ordination result are screened, and the ranking results after screening is returned to user as Query Result;
The second assembled unit, for when not inquiring the information of the complete domain name corresponding with the main body of described certain domain name, mates the main body of described certain domain name with all domain name suffix, be combined into a plurality of complete domain names;
Filter and judging unit, for described a plurality of complete domain names are filtered, to delete non-existent domain name, and the quantity of the domain name after judgement filtration;
Second returns to unit, for the quantity of the domain name after filtration, is 0 o'clock, to user feedback domain name, does not exist;
The 3rd returns to unit, for the quantity of the domain name after filtration, is 1 o'clock, this domain name after filtering to user feedback;
Screening unit, is greater than at 1 o'clock for the quantity of the domain name after filtration, the corresponding website of domain name after filtering analyzed, and according to filtering out related web site in analysis result and the corresponding website of the domain name of user profile from described filtration;
Quantity judging unit, for judging the quantity of described related web site;
The 4th returns to unit, for the quantity at related web site, is 0 o'clock, the corresponding website of domain name after filtering is analyzed, and according to analysis result, the domain name after to described filtration sorts, and to generate full ranking results, and full ranking results is returned to user;
The 5th returns to unit, for the quantity at related web site, is 1 o'clock, and the domain name of this unique related web site is returned to user;
The 6th returns to unit, for the quantity at related web site, is greater than at 1 o'clock, related web site is analyzed, and generated relevance ranking result according to analysis result, and relevance ranking result is returned to user as Query Result.
In website visiting system of the present invention, described enquiry module also comprises:
The 3rd storage unit, is 0 o'clock for the quantity of the domain name after filtration, does not exist the information of Fully-Qualified Domain Name to be stored to index data base the main body of described certain domain name;
The 4th storage unit, is 1 o'clock for the quantity of the domain name after filtration, and the corresponding unique complete domain name of the main body of described certain domain name is stored to index data base;
The second full sequencing unit, is greater than at 1 o'clock for the quantity of the domain name after filtration, the corresponding website of domain name after filtering is analyzed, and the domain name after to described filtration sorts according to analysis result, to generate full ranking results;
The second classification and ordination unit, for described full ranking results is carried out to classification and ordination according to language or the address of the corresponding website of domain name after filtering, to generate classification and ordination result;
The 5th storage unit, for depositing described full ranking results and classification and ordination result in index data base.
In website visiting system of the present invention, described update module is for upgrading index data base according to following at least one:
According to registered user's active feedback, upgrade index data base;
According to a plurality of users' history access record, upgrade index data base;
The new website of periodic search is to upgrade index data base.
In website visiting system of the present invention, the described first full sequencing unit carries out following analysis to the corresponding website of domain name after filtering: the language of renewal frequency, hour of log-on, web site contents amount, employing, the corresponding area of Website server IP, linking relationship and domain name suffix information.
In website visiting system of the present invention, described user profile comprises non-registered users information or information of registered users, and non-registered users information comprises: IP address, the history access record relevant to IP address, the setting of classification and ordination result type; Information of registered users comprises: station address, user preference setting, IP address, user's history access record.
Implement technical scheme of the present invention, due to pre-stored in index data base, there are a plurality of domain name main bodys and at least one information about Fully-Qualified Domain Name corresponding with each domain name main body, when user need to access certain specific website, only need to input the domain name main body of this specific website, index data base will be take the information of the corresponding Fully-Qualified Domain Name of main body that the main body of this certain domain name is this certain domain name of keyword query automatically, and Query Result is returned to user, therefore, user may have access to this specific website without the suffix of input domain name, saved the time of user's access websites, improved the efficiency of user's access websites, and avoided the trouble brought due to the misspelling of suffix, thereby improved user's experience.
Accompanying drawing explanation
Below in conjunction with drawings and Examples, the invention will be further described, in accompanying drawing:
Fig. 1 is the process flow diagram of Website access method embodiment mono-of the present invention;
Fig. 2 is the process flow diagram of step S10 preferred embodiment in Fig. 1;
Fig. 3 is the process flow diagram of step S30 preferred embodiment in Fig. 1;
Fig. 4 is the display interface figure of Query Result preferred embodiment;
Fig. 5 is the logical diagram of website visiting system embodiment one of the present invention;
Fig. 6 is the logical diagram of website visiting system embodiment two of the present invention;
Fig. 7 is the logical diagram that in Fig. 5, index data base is set up module preferred embodiment;
Fig. 8 is the logical diagram of enquiry module preferred embodiment in Fig. 5.
Embodiment
As shown in Figure 1, in the process flow diagram of Website access method embodiment mono-of the present invention, this Website access method comprises:
S10. set up in advance index data base, described index data base stores a plurality of domain name main bodys and at least one information about Fully-Qualified Domain Name corresponding with each domain name main body, should be noted that, when there is corresponding Fully-Qualified Domain Name in this domain name main body, " about the information of Fully-Qualified Domain Name " is the Fully-Qualified Domain Name of existing correspondence, when the Fully-Qualified Domain Name corresponding with this domain name main body do not exist, for " about the information of Fully-Qualified Domain Name " is " domain name does not exist " information;
S20. receive the main body of the corresponding certain domain name of specific website of user's input, and the main body of described certain domain name is sent to index data base by web page server;
The information of the corresponding Fully-Qualified Domain Name of main body that the main body of described certain domain name of S30. take in index data base is certain domain name described in keyword query, and Query Result is returned to user.
Implement this technical scheme, due to pre-stored in index data base, there are a plurality of domain name main bodys and at least one information about Fully-Qualified Domain Name corresponding with each domain name main body, when user need to access certain specific website, only need to input the domain name main body of this specific website, index data base will be take the information of the corresponding Fully-Qualified Domain Name of main body that the main body of this certain domain name is certain domain name described in keyword query automatically, and Query Result is returned to user, therefore, user may have access to this specific website without the suffix of input domain name, saved the time of user's access websites, and avoided the trouble brought due to the misspelling of suffix, thereby improved user's experience.
Fig. 2 is the process flow diagram of step S10 preferred embodiment in Fig. 1, and in the preferred embodiment, step S10 further comprises:
S101. generate new keyword as domain name main body;
S102. generated domain name main body is mated with all domain name suffix, be combined into a plurality of complete domain names;
S103. described a plurality of complete domain names are filtered, to delete non-existent domain name;
S104. the corresponding website of domain name after filtering is analyzed, and the domain name after to described filtration sorts according to analysis result, to generate full ranking results;
S105. described full ranking results is carried out to classification and ordination according to language and/or the area of the corresponding website of domain name after filtering, to generate classification and ordination result;
S106. deposit described full ranking results, described classification and ordination result in index data base.
Illustrate the concrete steps of step S10 below: first, in step S101, index data base generates new keyword as domain name main body, is sent to suffix match server.Particularly, the keyword that index data base generates meets certain condition, includes but not limited to following domain name main body:
The domain name main body of the well-known website that 1. flow is large, rate of people logging in is higher, the official website etc. that comprises website that visit capacity ranks first, star personality's personal website, esbablished corporation or mechanism, website domain name such as front ten myriabits of screening whole world row, can obtain by third party's statistics, also can realize by existing web page analysis technology;
2. pure digi-tal, such as all 8 domain name main bodys that form with interior arabic numeral;
3. adopt the method for exhaustion to include all character combinations in certain limit, character is containing 26 Latin alphabets from a to z and "-", all combinations of the character string forming such as 1 to 8 character;
4. character serially adds arabic numeral, such as adding numeral after character string and total length is no more than all combinations of 8;
5. arabic numeral padding string, such as padding string after numeral and total length are no more than all combinations of 8;
6. the combination in any of character and arabic numeral, such as the Latin alphabet and "-" and arabic numeral combination in any (total length is no more than 6);
7. adopt the mode of looking up the dictionary to enumerate out the phonetic of all conventional words, word, phrase and abbreviation thereof, and conventional English phrase (comprising word, phrase and abbreviation thereof); Such as total length is no more than the Chinese pinyin of 16;
8. the combination of arabic numeral and phonetic or phrase, such as adding all combinations (total length is no more than 12, and numeral is no more than 4) that add numeral after all combinations, Chinese pinyin or the English phrase of Chinese pinyin or English phrase after numeral;
9. the combination of Chinese pinyin and English phrase, adds English phrase (total length is no more than 12) such as adding after Chinese pinyin to add after English phrase, English phrase after adding Chinese pinyin, English phrase after Chinese pinyin, Chinese pinyin;
10. the domain name being comprised of the language outside Chinese and english, such as total length is no more than the French word of 16.
1. the condition that it should be noted that collects the domain name of well-known website, guarantees that the website domain name that most of users often access can find at index data base.Condition 2.~10. conclude the domain name feature summed up most of websites on current internet.Domain name is comprised of characters such as numeral, the Latin alphabet and "-" substantially at present, by above 1.~10. the domain name of listed conditional capture can cover the domain name of most websites on internet, foundation is usingd the main body of these domain names as the index data base of keyword, and object is that the most Query Results that guarantee user can return in real time.If when domain name naming rule changes in the future, above-mentioned condition also can correspondingly be upgraded.
In step S102, the domain name main body generating, at suffix match server, mate with all domain name suffix, to form a plurality of complete domain names, for example, for convenience of description, suppose that all domain name suffix in internet comprise " .CN ", " .PT ", " .COM ", " .ORG ", " .NET ", the keyword that index data base passes to suffix match server is " abc ", through domain name suffix match, form five complete domain names, i.e. " abc.CN ", " abc.PT ", " abc.COM ", " abc.ORG " and " abc.NET ".In step S103, described a plurality of complete domain names are filtered, to delete non-existent domain name.In this step, can use prior art to identify non-existent domain name, such as the search spider by search engine, directly access the corresponding website of domain name abc.COM and see if there is content and return, if return without content, define domain name abc.COM and do not exist.In step S104, website corresponding to domain name after filtering described in suffix match server access, the corresponding website of domain name after filtering is analyzed, and extract website Useful Information and content, it should be noted that in the specific implementation, the information of extraction website and content and filtration domain name can be carried out simultaneously, if website corresponding to certain domain name returned without content, may be defined to this domain name and do not exist; If meaningful the returning in the corresponding website of certain domain name, can extract information and the content of website.Then, analyze relatively from the information of each website extraction and the foundation of content conduct sequence, generate full ranking results, should be noted that, generating full ranking results can be completed by prior art automatically by server, the Web Spider Web Spider program that existing search engine generally adopts just can realize the function of the content of extracting from each website, according to predetermined ordering rule, web site contents is analyzed to sequence and also belongs to prior art, is not described in detail here.For example, if detect " abc.COM ", do not exist, deleted, then access respectively " abc.CN ", " abc.PT ", " abc.ORG " and " abc.NET ", from remaining these four websites extraction Useful Informations and content, send analysis sequencing unit to.Also should be noted that, in this step, during the corresponding website of domain name after analysis and filter, can make a concrete analysis of the following information of this website: the information such as the language of renewal frequency, hour of log-on, web site contents amount, employing, the corresponding area of Website server IP, linking relationship and domain name suffix information.Also can use for reference the super link analysis technology of similar search engine, the backward chaining number of analyzing different web sites judges importance and the concerned degree of website.No matter adopt that methods analyst, final object is that such as the higher website of more comprehensive, the concerned degree of content, appropriate level is also higher to these website gradings.On the contrary, the less and most of hyperlink of content are not all pointed to the website of webpage in station, and appropriate level is lower (said is here comparatively speaking, in practical application, can do special processing to the larger navigation type website of visit capacity) also.Then according to rank, to website, sort from high to low, generate full ranking results.The rank of supposing above-mentioned 4 websites is respectively: " abc.CN "=2.1, " abc.PT "=1.9, " abc.ORG "=1.8, " abc.NET "=9.5.The full ranking results just sorting by rank is " abc.NET, abc.CN, abc.PT, abc.ORG ".In step S105, also can classification and ordination by described full ranking results, to generate classification and ordination result, segmentation according to the region that comprises language that website adopts, the IP address of Website server is corresponding, the implication of domain name suffix etc.Preferably, analyze sequencing unit and segment according to the language of website and/or area, be divided into towards language-specific user's website with towards particular locality user's website.For example, to reaching a conclusion after the content analysis of 4 of above-mentioned full ranking results websites: " abc.ORG " and " abc.CN " is Chinese website, " abc.PT " and " abc.NET " is Portuguese website, deducibility thus " abc.ORG " and " abc.CN " are main towards understanding Chinese user, " abc.PT " and " abc.NET " is main towards the user who understands Portuguese, therefore, according to the difference of language, can be subdivided into 2 classification and ordination results, be classification and ordination result (1) " abc.CN, abc.ORG " and classification and ordination result (2) " abc.NET, abc.PT ".In addition, in practical application, also need to segment ranking results according to region corresponding to the IP address of Website server, the language that while pressing region segmentation ranking results, reference site adopts simultaneously, region corresponding to server ip address of supposing website " abc.ORG " is Singapore, the region corresponding to server ip address of website " abc.CN " is China, and region corresponding to the server ip address of website " abc.PT " and " abc.NET " is Portugal, according to the difference in region, can segment out again three classification and ordination results so, wherein, classification and ordination result (3) comprises " abc.CN, abc.ORG ", classification and ordination result (4) comprises " abc.ORG, abc.CN ", classification and ordination result (5) comprises " abc.NET, abc.PT ".Classification and ordination result (3) is applicable to Chinese user, and classification and ordination result (4) is applicable to Singapore user, and classification and ordination result (5) is applicable to Portugal user.In a upper example, the classification and ordination result (3) that is applicable to Chinese user has also comprised the website " abc.ORG " of Singapore, Main Basis is that Chinese user be take Chinese as main, website " abc.ORG " is also Chinese website, possible some Chinese user needs access " abc.ORG ", so allow " abc.ORG " to appear in the classification and ordination result that is applicable to Chinese user.Classification and ordination result (4) order is different with the order of full ranking results, and this is that the Chinese user of Singapore may more get used to accessing the website of Singapore this locality because " abc.ORG " and " abc.CN " rank is more or less the same.In preference, the implication of domain name suffix is not as classification and ordination result Main Basis, but it also has impact to classification and ordination result, suppose website " abc.CN " in have Chinese edition and English edition, Chinese is suitable with English content, cannot distinguish and take which kind of language as main, by analyzing suffix, the implication of " .CN " is CHINESE REGION, deducibility " abc.CN " be take Chinese as main, in Chinese classification and ordination result, will preferentially obtain sequence, the position after not participating in sequence or come in English classification and ordination result.Language as for how identifying website employing, has several different methods: the language meta label of the character code type of analyzing web page content, anacom file, analysis html source file etc., are not described in detail prior art here.Special circumstances, a plurality of countries and regions, multilingual user may often access certain global well-known website, analyze sequencing unit and allow this class website to appear in a plurality of classification and ordination results simultaneously.Because global well-known Websites quantity is few, end user's work mode is screened also and can in the short time, be completed.Above-mentioned classification is only a specific embodiment of the present invention, is used for explaining claim, the scope being not intended to limit the present invention.In practical application, also can segment more classification and ordination result.Be finally that above-mentioned full ranking results and classification and ordination result are deposited in the concordance list that the keyword of index data base is corresponding, for example, be deposited in the concordance list of above-mentioned keyword " abc ".After storage, index data base continues as next keyword and creates concordance list, and so circulation is carried out, until meet above-mentioned condition 1.~index table creation of all keywords is 10. complete.
In particular cases a kind of, above-mentioned, set up in index data base process, after filtering, find that all domain names corresponding with certain keyword do not exist, can skip the program of analyzing sequence, still can deposit the information of described keyword and " domain name is non-existent " in index data base.
Current search engine is to find other webpage by the chained address of webpage, its limitation is search less than just setting up soon or without the website of external linkage, and the wherein page of throwing the net of the just website of finding by chained address, what easily occur finding by chained address repeatedly is all the different pages of same website.What the present invention paid close attention to is unique domain name of website, rather than the concrete page of website, and the search strategy that obviously routine search engine adopts is not suitable for the present invention.The present invention sets up index data base and has adopted the method for exhaustion, can search firm establishment soon or without the website of external linkage.Certainly, adopt the method for exhaustion can relate to huge operand.For example above-mentioned condition 1.~keyword that 10. comprises may reach hundreds billion of domain names, because most of domain names are non-existent, data according to statistics, website on internet is greatly about 100,000,000 left and right at present, so only need the domain name analysis sequence to 100,000,000 left and right, greatly reduce the difficulty that creates concordance list, the computer cluster forming with many computing machines domain name analysis sequence to 100,000,000 left and right within the limited time is feasible.
Fig. 3 is the process flow diagram of the preferred embodiment of step S30 in Fig. 1, and S30 is further comprising the steps for this step:
S301. the main body of described certain domain name of take in index data base is inquired about as keyword, and judges whether to inquire the information of the Fully-Qualified Domain Name corresponding with the main body of described certain domain name, if so, performs step S302; If not, perform step S303;
S302. according to user profile, full ranking results or classification and ordination result are screened, and the ranking results after screening is returned to user as Query Result;
S303. the main body of described certain domain name is mated with all domain name suffix, be combined into a plurality of complete domain names;
S304. described a plurality of complete domain names are filtered, to delete non-existent domain name, and the quantity of the domain name after judgement filtration, if the quantity of the domain name after filtering is 0, perform step S305 and step S306; If the quantity of the domain name after filtering is 1, perform step S307 and step S308; If the quantity of the domain name after filtering is greater than 1, perform step S309 and step S314;
S305. to user feedback domain name, do not exist;
S306. do not exist the information of Fully-Qualified Domain Name to be stored to index data base the main body of described certain domain name;
S307. this domain name after filtering to user feedback;
S308. the corresponding unique complete domain name of the main body of described certain domain name is stored to index data base;
S309. the corresponding website of domain name after filtering is analyzed, and according to filtering out related web site in analysis result and the corresponding website of the domain name of user profile from described filtration;
S310. judge the quantity of described related web site, if the quantity of related web site is 0, perform step S311; If the quantity of related web site is 1, perform step S312; If the quantity of related web site is greater than 1, perform step S313;
S311. the corresponding website of domain name after filtering is analyzed, and according to analysis result, the domain name after to described filtration sorts, and to generate full ranking results, and full ranking results is returned to user;
S312. the domain name of this unique related web site is returned to user;
S313. related web site is analyzed, and generated relevance ranking result according to analysis result, and relevance ranking result is returned to user as Query Result;
S314. the corresponding website of domain name after filtering is analyzed, and the domain name after to described filtration sorts according to analysis result, to generate full ranking results;
S315. described full ranking results is carried out to classification and ordination according to language and/or the area of the corresponding website of domain name after filtering, to generate classification and ordination result;
S316. deposit described full ranking results and classification and ordination result in index data base.
Implement this technical scheme, only can avoid some uncorrelated website because domain name main body is identical with well-known website just can obtain very high visit capacity, thereby weaken the realistic meaning of cybersquatting, standard the order of domain name registration.On the other hand, for well-known website, do not need again in order to protect self rights and interests and the identical website of a plurality of domain name main bodys of registration in advance yet, thereby reduced expense.
Illustrate the concrete steps of step S30 below: when user will access some websites, need be in the input frame input domain name main body at subscriber equipment interface.The subscriber equipment here can be anyly can by mouse, keyboard, touch-screen, telepilot or voice-operated device, carry out the electronic product of man-machine interaction with user, includes but not limited to computing machine, smart mobile phone, PAD etc.When web page server obtains described domain name main body (can pass through page technology, such as JSP, PHP or ASP etc., obtains the domain name main body that user inputs), and using described domain name main body as keyword, be sent to index data base and inquire about.If index data base has existed described concordance list corresponding to keyword,, according to the classification and ordination result of user profile screening concordance list or full ranking results, the ranking results after screening returns to user as Query Result.The preferential selection classification and ordination result corresponding with user profile during screening, if just consider full ranking results without corresponding classification and ordination result.Wherein, user profile comprises non-registered users information or information of registered users.Particularly, non-registered users information at least comprises following any one: IP address, the history access record relevant to IP address, the setting of classification and ordination result type.Information of registered users at least comprises following any one: station address, user preference setting, IP address, user's history access record.Preferably, index data base gives respectively different weights to above multinomial user profile, and the weight of every essential information is arranged as follows from high to low: user's preference arranges the history access record > that > user the is set history access record > station address > IP address relevant to IP address of > classification and ordination result type.The principle of screening and sequencing result is: preferentially meet the user profile that weight is higher, ignoring the user profile higher with weight has the information of conflicting.The weight that user preference arranges is the highest in user basic information, top-priority foundation while being screening and sequencing result.
Illustrate respectively the impact of user profile on ranking results below:
For non-registered users, suppose that the affiliated region, IP address of certain non-registered users is China's Mainland, return to the classification and ordination result that is applicable to China's Mainland user; If the region under user's IP address is Britain, return to the classification and ordination result that is applicable to Britain user.User's history access record is also important reference information, for example, if the history access record of certain fixed ip address shows that most of website of user's access is Chinese website, returns to the classification and ordination result that is applicable to Chinese user; If the history access record of certain fixed ip address shows Chinese website and the English website of this user's access and approximately respectively accounts for half, the Chinese website filtering out from full ranking results and the domain name of English website, and the domain name after screening is returned to user as Query Result.It is the setting of classification and ordination result type that non-registered users also has a most important information, and when user wishes to obtain other classification and ordination result, index data base allows user that the type of ranking results is freely set.For example, the region under IP address is the website that certain user of China wants to access India, and this user can change the type of Query Result into " India user ", returns to the classification and ordination result that is applicable to India user.The function that arranges of classification and ordination result type also allows user to obtain the Query Result that comprises multiple classification and ordination result, for example, user P wishes to access the website of Portuguese, but user P is just to Portugal, the website of Brazil and mo is interested, user P can change the type of Query Result into " Portuguese=Portugal user+Brazilian user+mo user ", index data base can filter out and be applicable to Portugal from full ranking results, the domain name of Brazil and mo user's Portuguese website, and the domain name after screening is returned to user P as Query Result.Logical relation between the formula that the example of described user P the is mentioned multinomial condition that just user arranges for convenience of description and the metaphor done, here "+" be equivalent to " or ", "=" is equivalent to " comprising ", the Portuguese on "=" left side is precondition, requires only to return the domain name of above-mentioned three regional Portuguese websites.
For registered user, also can return to different Query Results according to the different IP address of registered user and history access record, implementation method and non-registered users are similar.Different, registered user's essential information has increased " station address " and " preference setting ".Station address is inconsistent with the region under IP address sometimes, such as, the registered address of registered user A is China, when user A is used internet to Britain, the region under its IP address just becomes Britain.For this class user, if without relevant historical Visitor Logs, index data base screens according to the higher station address of weight the Query Result that this returns, and returns to the classification and ordination result that is applicable to Chinese user; If there is relevant history access record, index data base can screen the Query Result that this returns according to the relevant historical Visitor Logs of user A.For example, the keyword of user A input is " abc ", the history access record of user A is: once access " abc.edu.cn ", twice access " abc.net ", 35 access " abc.org ", obviously, the history access record of " abc.org " is maximum, and index data base can the classification and ordination result under " abc.org " return to user A (Query Result need ranked first abc.org position) as Query Result; If user A is the history access record relevant to domain name main body " abc " not, his history access record shows repeatedly to access in the recent period take the domain name that " .edu.cn " be suffix, therefrom deducibility user A may be student or teacher, also may Shi Dui the interested people in website of educational institution, index data base can return to user A as Query Result using " abc.edu.cn " and affiliated classification and ordination result thereof, and (prerequisite is that domain name abc.edu.cn exists, if there is not this domain name, index data base can be ignored the essential information that this history access record provides automatically).Registered user also has an essential information user preference setting, it is the most important essential information of registered user, include but not limited to that registered user is made as priority or blacklist to certain domain name, also comprise " setting of classification and ordination result type " function identical with non-registered users.That is to say, registered user's " preference setting " than " setting of classification and ordination result type " of non-registered users many function of priority or blacklist is set.For example, the keyword of user A input is " abc ", if " abc.net " is made as to priority in this user's data, by " abc.net ", the position in Query Result is adjusted to first; If " abc.net " is made as to blacklist in user's data, " abc.net " rejected from Query Result.
In addition, also it should be noted that the mode of obtaining user profile includes, but are not limited to: according to user in the IP address being recorded by network-side or subscriber equipment end during browsing page or extract in the cookies information of subscriber equipment and user's history access record etc.; Registered user's preference setting and the address that materials for registration is filled in can obtain in this user's data, and when registered user is when the subscriber equipment login user account number, index data base just can be identified this user's identity, thereby acquisition is to essential information that should user.
For being easier to understand index data base, the person skilled of this area returns to different Query Results according to the essential information of different user, now comprehensively give an example: suppose to have six user A1, A2, A3, A4, A5 and A6, they are " abc " in input frame input domain name main body.Index data base has found the concordance list that comprises keyword " abc ", this concordance list totally five ranking results, be respectively full ranking results p1, be applicable to Chinese user classification and ordination result p2, be applicable to Britain user classification and ordination result p3, be applicable to Chinese user's classification and ordination result p4 and be applicable to English user's classification and ordination result p5.The domain name that these five ranking results comprise is respectively p1 (abc.net, abc.com, abc.cn, abc.org, abc.uk, abc.gov), p2 (abc.net, abc.cn, abc.com), p3 (abc.uk, abc.org, abc.gov), p4 (abc.net, abc.com, abc.cn), p5 (abc.org, abc.uk, abc.gov).The information of supposing user A1 is " IP address affiliated area is Britain ", index data base selects the classification and ordination result p3 that is applicable to Britain user to return to user A1 as Query Result according to " IP address affiliated area is Britain ", and what user A1 obtained returns results as " abc.uk, abc.org, abc.gov ", the information of supposing user A2 is " address that materials for registration is filled in and IP address affiliated area are all China ", and index data base selects the classification and ordination result p2 that is applicable to Chinese user to return to user A2 as Query Result, the information of supposing user A3 is " the materials for registration address of filling in is China, IP address affiliated area is Britain, website overwhelming majority of history access record is Chinese website ", and the classification and ordination result p4 that index data base selection is applicable to Chinese user returns to user A3 as Query Result, the information of supposing user A4 is for " the materials for registration address of filling in is China, IP address affiliated area is Britain, the website overwhelming majority of history access record is Chinese website, history access record shows repeatedly access websites abc.org ", relatively in the essential information of user A4, " history access record shows repeatedly access websites abc.org " presses close to keyword " abc " most, deducibility user A4 is likely and thinks access websites abc.org thus, index data base is selected to include the classification and ordination result p3 of " abc.org " or P5 and as Query Result, is returned to user A3 (Query Result should guarantee that the sequence of abc.org is first, if select p3, after abc.uk and abc.org sequence need being exchanged, again ranking results is returned), the information of supposing user A5 is " address that materials for registration is filled in and IP address affiliated area are all Japan ", and index data base is not applicable to the classification and ordination result of Japanese user, selects full ranking results p1 to return to user A5 as Query Result, the information of supposing user A6 is for " IP address affiliated area and the materials for registration address of filling in be all Chinese, user is made as abc.net blacklist and abc.com is made as to priority, first index data base selects classification and ordination result p2 and the p4 that includes priority website " abc.com ", secondly with reference to " IP address affiliated area and materials for registration fill in address be all Chinese ", from p2 and p4, select the classification and ordination result p2 that is applicable to Chinese user, again according to priority setting and the blacklist of user A6, abc.com is adjusted to first in the position of ranking results, abc.net is rejected from ranking results, what final user A6 obtained returns results as " abc.com, abc.cn ".
When user's access websites, if the keyword of user's input is longer, when index data base is not found the concordance list of associated dns name, described keyword will be sent to suffix match server, mate with all domain name suffix, form a plurality of complete domain names, then the plurality of complete domain name is filtered, delete non-existent domain name, and access website corresponding to domain name after described filtration, from website, extract Useful Information and content.Above flow process with set up index data base, no longer repetition of explanation.Because the keyword of user input exceed set up index data base condition 1.~10. included scope, can infer, the nearly following characteristics of keyword of user's input: character or numeral are longer, the domain name main body that does not belong to well-known website, do not belong to conventional phonetic or word etc., in reality, take this class keywords as the possibility of domain name main body less, the probability that the domain name main body that occurs a plurality of websites is identical with the keyword of user's input is very little especially, can draw an inference: the domain name main body can not find at index data base, the probability of website that occurs a plurality of same domain name main bodys is minimum.So in most of the cases, if the domain name main body of neither one website is identical with keyword after filtering, or only have the domain name main body of a website identical with keyword, both of these case can directly return to user by Query Result, need not be through analysis sequencing unit below.With this simultaneously, suffix match server can generate the concordance list of the keyword that comprises user's input, deposit in index data base and use for inquiry next time, when this user is next time or when other users input identical keyword, index data base just can be directly synchronously returns to user from the concordance list information extraction of described keyword and (when domain name exists, unique domain name is returned to user; When domain name does not exist, the non-existent information of domain name is fed back to user).If during the domain name quantity >1 of the effective website after filtering, draw Liang Tiao branch.
Article one, the flow process of branch is for returning to user by Query Result.Particularly, according to user profile, website corresponding to the domain name from described filtration filters out related web site.According to the method for user profile screening website, above, describe in detail, no longer repeat.Here according to the quantity of related web site, divide again two kinds of processing modes, the first situation, when described related web site only has 1, the domain name of this unique related web site is returned to user as Query Result, for example user profile is " often accessing Chinese website ", when filtering out related web site and only having a Chinese website, the domain name of this Chinese website is returned to user.The second situation, when the quantity of related web site is when being not equal to 1, i.e. quantity=0 of related web site or >1.If related web site quantity is 0, analyze information and the content of the corresponding website of domain name after more described filtration, generate full ranking results, and full ranking results is returned to user as Query Result; If the quantity >1 of related web site, information and the content of the more described related web site of analysis, generate relevance ranking result, and relevance ranking result returned to user as Query Result.
Should be noted that, the method that the method for analyzing web site adopts when setting up index data base when search index database is different: in the stage of building database, the time of analyzing web site is more sufficient, and the website of same domain name main body is more, so need to extract from each website webpage as much as possible to obtain more comprehensive information, can also assess with reference to external linkage quantity the pouplarity of website if desired, this is conventional analysis sort method, can obtain more objective sort by, but cannot meet user and wish that the short time obtains the requirement of Query Result.And the time marquis of response user inquiry, must express-analysis compare the rank size of website, and proposing here a kind ofly according to website homepage, to assess website other Simple calculation method of level, computing formula is as follows:
Website rank V=D 1/3* (D/10+1.5Ln+Lw 1/3) * (Ly/Lz) 4/ 1000
Wherein, D represents the size of website homepage content, and unit is Kb (when D>3000Kb, calculating by 3000Kb); Ln represents the link number (if there are a plurality of links to point to identical address, calculating by 1 link) of other page of sensing self website that website homepage comprises; Lw represents the link number of the sensing external website that website homepage comprises; Ly represents effective link number (if Ln is 0, Ly is also 0, and corresponding website rank is also 0) of other page of sensing self website that website homepage comprises; Lz representative is from the link number of random other page of sensing self website selected of website homepage (wherein Lz≤Ln, if Ln is 0, the value of Ly/Lz is 0).For example, the size of supposing certain website homepage content is 1000Kb, the link number that points to self other page of website is 310, the link number that points to external website is 27, analyze sequencing unit from the random links of selecting 5 these other pages of website of sensing of homepage, it is effective links that discovery only has 4 links, and webpage that one of them link is pointed to does not exist, by above data substitution formula:
V=1000 1/3*(1000/10+1.5*310+27 1/3)*(4/5) 4/1000
=10*(100+465+3)*0.4096/1000=2.326528,
The rank that draws thus this website is about 2.3.Why the time marquis of response user inquiry, adopt Simple calculation method to grade to website, is for the consideration of shortening the processing time on the one hand; Another reason is the inference drawing above: the domain name main body can not find at index data base, occurs that the probability of website of a plurality of same domain name main bodys is minimum, so can adopt Simple calculation method to a few website grading.
Now illustrate the implementation method of article one branch, the keyword of supposing user A input is " abc ", totally three of effective domain name after filtration, be respectively " abc.CN ", " abc.PT " and " abc.ORG ", by Simple calculation method, give website grading corresponding to above-mentioned 3 domain names as follows: " abc.CN "=2.1, " abc.PT "=1..9, " abc.ORG "=1.8, if three domain names above are further screened according to user A essential information, find the domain name not meeting, the related web site quantity namely filtering out is 0, by rank from high to low time ordered pair three domain names after filtering sort, generate full ranking results " abc.CN, abc.PT, abc.ORG ", and full ranking results is returned to user as Query Result, if to finding to only have " abc.CN " to meet after 3 domain name screenings above, related web site quantity is 1, " abc.CN " is returned to user according to user A essential information, if three domain name rear discoveries of screening " abc.CN " and " abc.PT " are above met according to user A essential information, be that related web site quantity is 2, by rank from high to low time these 2 related web sites of ordered pair sort, generate relevance ranking result " abc.CN, abc.PT ", and relevance ranking result is returned to user as Query Result.
When user obtains Query Result, pressing " ENTER " key or other key/button is website corresponding to domain name ranked first in addressable described Query Result; User also can select other domain name in described Query Result by mouse or keyboard/key/button.For example, the domain name main body of supposing user's input is " domain name A ", Query Result is " domain name A.com.cn, domain name A.net, domain name A.CN, domain name A.com, domain name A.tw, domain name A.hk ", directly presses the network address " www. domain name A.com.cn " that " ENTER " key ranked first in just can access queries result after this user input " domain name A ".Query Result can present in a variety of forms in subscriber equipment, a preferred version is with the form of dynamic drop-down menu, to appear at the display window of subscriber equipment, as shown in Figure 4, user can click any network address in drop-down menu by keyboard up and down arrow keys or mouse, in default setting, user directly presses " ENTER " key access is in drop-down menu, to ranked first the website ranked first in Query Result namely.Dynamically drop-down menu displaying Query Result can be used accomplished in many ways, for example, utilize JavaScript to obtain by XMLHTTP the index data that index data base arranges out, and dynamically generate an IFRAME show (note, JavaScrip is a kind of client script language of OO regime type; XMLHTTP is one group of application programming interface collection of functions, can be called by JavaScript, by HTTP transceiving data between browser and index data base, can be dynamically at the display window of subscriber equipment, upgrades Query Result; IFRAME is the document in document namely, or the unsteady framework of picture, can realize the effect of drop-down menu as Figure 4 shows).
The second branch of drawing during the domain name quantity >1 of the effective website after filtration generates full ranking results and classification and ordination result, be finally that the full ranking results generating and classification and ordination result are joined and using the domain name main body of described user's input as the concordance list of keyword, and deposit in index data base and use for inquiry next time.Second branch carries out step S314 in above-mentioned response user query script to step S316, this process is with identical to step S106 with the step S104 setting up above in index data base, object is to supplement keyword and the concordance list of index data base, is no longer repeated in this description here.Second branch can carry out with article one branch simultaneously, also can move the free time after the complete user's inquiry of article one branch process.While processing second branch, can to website, grade by conventional web analytics method, also can adopt article one branch Simple calculation method used.
In a preferred embodiment, step S20 also comprises: receive the user that quotes of user's input, and the weight of information of quoting user is between the setting of classification and ordination result type and user's history access record.Correspondingly, step C2 also comprises: according to quoted user's information, full ranking results or classification and ordination result are screened.Implement this technical scheme, the essential information that can allow user to quote other users is used as the foundation of sequence, such as, the preference setting that allows user A to quote registered user B, or quote the full detail of registered user B, the information of quoting is also as a part for the information of user A, and index data base just can provide ranking results more accurately according to these information.
In a further advantageous embodiment, while inquiring about for the domain name main body that guarantees to input according to user in index data base, more approach user's request, inquire more accurately the information of corresponding Fully-Qualified Domain Name, this Website access method also comprises: index data base is upgraded.For example, according at least one in following, index data base is upgraded: (1) upgrades index data base according to registered user's active feedback; (2) according to a plurality of users' history access record, upgrade index data base; (3) the new website of periodic search is to upgrade index data base.Illustrate this three kinds of situations below:
(1) according to registered user's active feedback, upgrade index data base.Specific as follows: to receive priority domain name or blacklist domain name that the first registered user (for example registered user A) feeds back, and be committed to index data base; The priority domain name that index data base is inputted the first registered user or blacklist domain name are distributed at least one other registered user (such as registered user B, C, D, E etc.), to investigate; After registered user B, C, D, E etc. receive an assignment, priority domain name or website corresponding to blacklist domain name that investigation user A submits to, and investigation result is returned to index data base.Then, index data base is according to the investigation result of registered user B, C, D, E etc., by the feedback of certain rules evaluation registered user A whether accurate (such as basis, the minority is subordinate to the majority, or comprehensively the weight of registered user B, C, D, E etc. adds and subtracts to do evaluation more accurately mutually).If the feedback of evaluation result confirm registration user A conforms to investigation result, increase the weighted value of registered user A or give suitable award, upgrade the concordance list of index data base simultaneously; Otherwise do not upgrade index data base, and reduce the weighted value of registered user A or give suitable punishment.In addition, whether the increase and decrease of the weighted value of registered user B, C, D, E etc. is also accurately associated with their investigation result.The weighted value that it should be noted that registered user is for evaluating registered user's value and an index of resolution characteristic, representing with VIP.For example, the VIP value of stipulating firm registered user is 1, and a month VIP of every mistake increases by 0.1; User accurately submits a correct priority domain name or blacklist domain name to, and its VIP increases by 0.2, otherwise reduces 0.2; Registered user is a domain name of investigation accurately, and its VIP increases by 0.1, otherwise reduces 0.1.
With an object lesson, illustrate below: for example, suppose that registered user A submits to index data base as blacklist domain name " domain name 100.com ", and to enclose reason be " viruliferous website ".Index data base will be investigated the information of " domain name 100.com " to registered user's issue of other VIP value >=1, suppose that 4 people such as registered user B, C, D, E have initiatively applied for this task.At the appointed time, registered user B, C, D, E return to index data base by investigation result.Index data base can receive investigation result in many ways, such as can be to select the form of button: open when submitting the page of investigation result to when registered user B, C, D, E are used browser, the radio button that can click in the page by mouse is selected " being viruliferous website " or " not being viruliferous website ", and index data base just can obtain investigation result according to their click.Then whether accurate by adding up all investigation results if evaluating the feedback of registered user A for index data base.The investigation result of supposing registered user B, C, D tri-people is " being viruliferous website ", the investigation result of registered user E is " not being viruliferous website ", according to the principle that the minority is subordinate to the majority, index data base judges " domain name 100.com is viruliferous website " thus, the investigation result of the feedback of registered user A and registered user B, C, D is correct, and the investigation result of registered user E is wrong.Then, index data base can be rejected " domain name 100.com " from ranking results.Index data base is after revision, and except those are made as " domain name 100.com " open air of using of priority, when other users input " domain name 100 " again, Query Result there will not be " domain name 100.com ".Domestic consumer can only pass through the domain name of input tape suffix, could access " domain name 100.com ".In the time of index data base revision ranking results, add 0.2 to the VIP value of registered user A, the VIP value of registered user B, C, D increases respectively 0.1.Registered user E is because investigation result is incorrect, and VIP value reduces 0.1.
Index data base also can judge investigation result in conjunction with the VIP value that participates in the user of investigation.For example, suppose that registered user A submits to index data base " domain name 100.com " as priority domain name, and enclose reason and be: " domain name 100.com " is abundanter than ranked first position " domain name 100.ORG " content, have more users access, suggestion " domain name 100.com " comes " domain name 100.ORG " above.Registered user B, C, D, E participate in investigation.The investigation result of returning is: registered user B, C agree with the proposal of registered user A, and registered user D, E oppose.The VIP value of supposing registered user B, C, D, E is respectively 1.5,1.3,3.5 and 2.6, and the VIP value of registered user B, C adds up to 1.5+1.3=2.8, and the VIP value sum of registered user D, E is 6.1.Index data base is greater than favoring party's VIP value according to the VIP value of opposition side, thereby concludes that the feedback of registered user A is wrong, and index data base need not be revised.The VIP value of user A is reduced to 0.2, the VIP value of registered user D, E increases respectively 0.1 simultaneously, and registered user B, C are because investigation result is incorrect, and VIP value reduces respectively 0.1.
(2) according to a plurality of users' history access record, upgrade index data base.Specific as follows: to obtain respectively the access times of a plurality of users to the different web sites of same domain name main body; The rate of people logging in of above-mentioned each website of data statistics that index data base provides according to Visitor Logs; Analyze sequencing unit according to the classification and ordination result in statistics revision concordance list or full ranking results.Wherein, website visiting rate can be calculated with following formula:
Website visiting rate V=(P1+P2+P3+ ... + Pn)/N
Wherein, P1, P2, P3 ... Pn is respectively the probability that N user accesses the website of adding up.Account form is Pn=Fn/Sn, here, Fn by N user access the number of times of statistics website, Sn by N user access domain name main body with the total degree of identical all websites, statistics website.
For example, suppose that index data base need to revise the classification and ordination result that keyword is " being applicable to Chinese user " of " abc ", original classification and ordination result is " abc.NET ", " abc.CN ", " abc.ORG ".Index data base is a plurality of users of random selection from Chinese user, for convenience of description, are decided to be 3, are respectively user Y1, Y2 and Y3, obtain them and in one month, access in the past the number of times of above-mentioned website.Visitor Logs demonstration, the number of times that user Y1 accesses above-mentioned 3 websites is followed successively by: 4 times, 20 times and 1 time; The number of times that user Y2 accesses above-mentioned 3 websites is followed successively by: 6 times, 92 times and 2 times; The number of times that user Y3 accesses above-mentioned 3 websites is followed successively by: 45 times, 2 times and 3 times.The rate of people logging in of above-mentioned each website of data statistics then providing according to Visitor Logs.First add up " abc.NET ", user Y1 access " abc.NET " 4 times, in access classification and ordination result, the number of times of all websites is: 4+20+1=25, the probability of user Y1 access " abc.NET " is: 4/25=0.16; The probability that in like manner, can calculate user Y2 access " abc NET " is: 6/100=0.06; The probability of user Y3 access " abc.NET " is: 45/50=0.9, substitution formula can calculate rate of people logging in V=(P1+P2+P3)/3=(the 0.16+0.06+0.9)/3=0.373 of website " abc.NET ".Similarly, can calculate rate of people logging in=(0.8+0.92+0.04)/3=0.587 of website " abc.CN ".The rate of people logging in of website " abc.ORG "=(0.04+0.02+0.06)/3=0.04.Statistics shows that the rate of people logging in of " abc.CN " is the highest, surpasses the website " abc.NET " ranked first.By rule, before the domain name that rate of people logging in is high should come, so will be " abc.CN, abc.NET, abc.ORG " by ranking results revision.Can revise in the same way other classification and ordination results and full ranking results.
(3) the new website of periodic search is to upgrade index data base.On internet, have website open-minded every day, has every day website to close, and only relying on the scheme of first two revision index data base is obviously the information that can not obtain in time above-mentioned website.For this reason, index data base can upgrade concordance list by the new website of periodic search.Search for new website and can adopt the method for exhaustion identical with setting up before index data base, namely search for all keywords in setting range.The keyword of searching for during revision index data base comprises: when (a) domain name naming rule changes, and the keyword that rebaptism rule allows; (b) existing keyword in index data base.The keyword of above-mentioned condition (a) is that index data base does not have, for example, while supposing to set up index data base, do not allow with Chinese as domain name main body, so index data base does not have this class domain name, domain name naming rule had change afterwards, allow Chinese as domain name main body, this just need to supplement the concordance list of this class keywords.Method can adopt the flow process identical with setting up before index data base, no longer repeats.Above-mentioned condition (b) is for revising the concordance list of existing keyword, respectively the concordance list regular update to all keywords in index data base.
Revise step and set up before index data base similar, for example, comprise: (1) by certain keyword of index data base respectively with all suffix match, form a plurality of Fully-Qualified Domain Names, then filter, delete non-existent domain name, and access website corresponding to domain name after described filtration, the corresponding website of domain name after filtering is analyzed, therefrom extract simple information and content, domain name according to described simple information and content after to described filtration is classified, and forms classification domain name; (2) with reference to described keyword original full ranking results, the domain name after to described filtration sorts, and generates new full ranking results; (3) with reference to the original classification and ordination result of described keyword, described classification domain name is sorted, generate new classification and ordination result; (4) new full ranking results and new classification and ordination result replaced to original ranking results and deposit the concordance list of described keyword in.
In above-mentioned steps (1), with set up index data base difference and be: simple information and content are extracted in website corresponding to domain name that only need be from filtering, such as capturing website homepage by spider, information and the content of extraction are mainly used to websites collection.According to language and/or area segment website, with to set up before sorting technique that index data base adopts identical.Described classification domain name is not to each domain name sequence, only to the domain name classification after filtering.
In above-mentioned steps (2), domain name with reference to described keyword original full ranking results after to described filtration sorts, particularly, the domain name that original full ranking results exists comes above and by the relative precedence sequence of original full ranking results, and the non-existent domain name of original full ranking results is after namely new domain name comes.For example, the concordance list that the keyword that index data base need be revised is " abc ", original full ranking results is " abc.net ", " abc.cn ", " abc.pt ", " abc.org ", " abc.com " " abc.cc ", after one month, index data base by the above-mentioned step (1) of ordering to take " abc " filtering as domain name main body, delete non-existent domain name, the domain name after filtration " abc.cc ", " abc.so ", " abc.tv ", " abc.cn ", " abc.pt ", " abc.org ".Domain name after filtration " abc.so ", " abc.tv " do not appear in original full ranking results, after should coming; Domain name after filtration " abc.cc ", " abc.cn ", " abc.pt ", " abc.org " appear in original full ranking results simultaneously, before should arranging.Then by the relative precedence sequence of original full ranking results, form new full ranking results " abc.cn ", " abc.pt ", " abc.org ", " abc.cc ", " abc.so ", " abc.tv ", should be noted that, during revision concordance list, new domain name is put behind, the sequencing between new domain name can be any.
In above-mentioned steps (3), with reference to the original classification and ordination result of described keyword, described classification domain name is sorted, identical with the principle of above-mentioned steps (2).For example, suppose that keyword " aaa " classification domain name has two groups: towards Chinese user's classification domain name " aaa.com ", " aaa.net ", " aaa.cn " with towards English user's classification domain name " aaa.org ", " aaa.edu "; Original classification and ordination result has two groups: towards Chinese user's classification domain name " aaa.com ", " aaa.cn " with towards French user's classification domain name " aaa.org ", " aaa.net ".First the classification domain name towards Chinese user is sorted, in accordance with regulations, towards Chinese user's classification domain name " aaa.com ", " aaa.cn ", appear in the classification and ordination result at Chinese family, before should arranging simultaneously; And " aaa.net " only appears in French user's classification and ordination result, do not appear in the classification and ordination result at Chinese family, after should arranging in accordance with regulations, form new classification and ordination result " aaa.com ", " aaa.cn ", " aaa.net " towards Chinese user; Then the classification domain name towards English user is sorted, original concordance list does not have the classification and ordination result towards English user, so classification domain name " aaa.org ", " aaa.edu " towards English user are equivalent to new domain name, sequencing between new domain name can be any in accordance with regulations, classification and ordination result " aaa.org ", " aaa.edu " towards English user that finally must make new advances.
In above-mentioned steps (4), the ranking results of using step (2) and step (3) acquisition replaces original ranking results, after the concordance list revision of described keyword, continue the concordance list of the next keyword of revision index data base, until the concordance list update all of all keywords.
The present invention also takes into account the demand that user freely selects, and with identical by existing technology, allows the Fully-Qualified Domain Name of user's input tape suffix, then presses " ENTER " key or other key/button and directly accesses the accurate network address that Fully-Qualified Domain Name is corresponding.
Fig. 5 is the logical diagram of website visiting system embodiment one of the present invention, and this website visiting system comprises: index data base is set up module 10, user's load module 20 and enquiry module 30.Wherein, index data base is set up module 10 for setting up in advance index data base, and described index data base stores a plurality of domain name main bodys and at least one information about Fully-Qualified Domain Name corresponding with each domain name main body; User's load module 20 is for receiving the main body of the corresponding certain domain name of specific website of user input, and the main body of described certain domain name is sent to index data base by web page server; Enquiry module 30 is for take the information of the corresponding Fully-Qualified Domain Name of main body that the main body of described certain domain name is certain domain name described in keyword query at index data base, and Query Result is returned to user.
Fig. 6 is the logical diagram of website visiting system embodiment two of the present invention, and this website visiting system comprises: index data base is set up module 10, user's load module 20, enquiry module 30 and update module 40.Wherein, it is identical that index data base is set up the logical organization that module 10, user's load module 20 and enquiry module 30 set up module 10, user's load module 20 and enquiry module 30 with index data base in the embodiment mono-shown in Fig. 5, at this, do not repeat, update module 40 is below only described, this update module 40 is for upgrading index data base.Preferably, update module 40 is for upgrading index data base according to following at least one: according to registered user's active feedback, upgrade index data base; According to a plurality of users' history access record, upgrade index data base; The new website of periodic search is to upgrade index data base.
Fig. 7 is the building-block of logic that in above-described embodiment, index data base is set up module 10, and this index data base is set up module 10 and can further be comprised: keyword generation unit 101, the first assembled unit 102, the full sequencing unit 104 of filter element 103, first, the first storage unit 105, the first classification and ordination unit 106 and the second storage unit 107.Wherein, keyword generation unit 101 is for generating new keyword as domain name main body; The first assembled unit 102, for generated domain name main body is mated with all domain name suffix, is combined into a plurality of complete domain names; Filter element 103 is for described a plurality of complete domain names are filtered, to delete non-existent domain name; The first full sequencing unit 104 is for analyzing the corresponding website of domain name after filtering, and the domain name after to described filtration sorts according to analysis result, to generate full ranking results, preferably, following analysis is carried out in the corresponding website of domain name after 104 pairs of filtrations of the first full sequencing unit: the language of renewal frequency, hour of log-on, web site contents amount, employing, the corresponding area of Website server IP, linking relationship and domain name suffix information; The first storage unit 105 is for depositing described full ranking results in index data base.The first classification and ordination unit 106 is for described full ranking results is carried out to classification and ordination according to language and/or the area of the corresponding website of domain name after filtering, to generate classification and ordination result; The second storage unit 107 is for depositing described classification and ordination result in index data base.
Fig. 8 is the building-block of logic of enquiry module 30 in above-described embodiment, and described enquiry module 30 comprises: inquiry and judging unit 301, first return to unit 302, the second assembled unit 303, filtration and judging unit 304, second and return to unit 305, the 3rd storage unit the 306, the 3rd and return to unit 307, the 4th and return to unit 308, screening unit 309, quantity judging unit 310, the 4th and return to unit 311, the 5th and return to unit 312, the 6th and return to the full sequencing unit 314 in unit 313, second, the second classification and ordination unit 315, the 5th storage unit 316.Wherein, inquiry and judging unit 301 are inquired about as keyword for take the main body of described certain domain name at index data base, and judge whether to inquire the information of the complete domain name corresponding with the main body of described certain domain name; First returns to unit 302 for when inquiring the information of the complete domain name corresponding with the main body of described certain domain name, according to user profile, full ranking results or classification and ordination result are screened, and the ranking results after screening is returned to user as Query Result, preferably, described user profile comprises non-registered users information or information of registered users, and non-registered users information comprises: IP address, the history access record relevant to IP address, the setting of classification and ordination result type; Information of registered users comprises: station address, user preference setting, IP address, user's history access record, and being arranged as according to weight order from high to low: user preference setting, the setting of classification and ordination result type, user's history access record, history access record, station address, the IP address relevant to IP address; The second assembled unit 303, for when not inquiring the information of the complete domain name corresponding with the main body of described certain domain name, mates the main body of described certain domain name with all domain name suffix, be combined into a plurality of complete domain names; Filter and judging unit 304 for described a plurality of complete domain names are filtered, to delete non-existent domain name, and the quantity of the domain name of judgement after filtering; Second to return to unit 305 be 0 o'clock for the quantity of the domain name after filtration, to user feedback domain name, do not exist; The 3rd storage unit 306 is 0 o'clock for the quantity of the domain name after filtration, does not exist the information of Fully-Qualified Domain Name to be stored to index data base the main body of described certain domain name; The 3rd to return to unit 307 be 1 o'clock for the quantity of the domain name after filtration, this domain name after filtering to user feedback; The 4th storage unit 308 is 1 o'clock for the quantity of the domain name after filtration, and the corresponding unique complete domain name of the main body of described certain domain name is stored to index data base; Screening unit 309 is greater than at 1 o'clock for the quantity of the domain name after filtration, the corresponding website of domain name after filtering analyzed, and according to filtering out related web site in analysis result and the corresponding website of the domain name of user profile from described filtration; Quantity judging unit 310 is for judging the quantity of described related web site; The 4th to return to unit 311 be 0 o'clock for the quantity at related web site, the corresponding website of domain name after filtering is analyzed, and according to analysis result, the domain name after to described filtration sorts, and to generate full ranking results, and full ranking results is returned to user; The 5th to return to unit 312 be 1 o'clock for the quantity at related web site, and the domain name of this unique related web site is returned to user; The 6th returns to unit 313 is greater than at 1 o'clock for the quantity at related web site, related web site is analyzed, and generated relevance ranking result according to analysis result, and relevance ranking result is returned to user as Query Result.The second full sequencing unit 314 is greater than at 1 o'clock for the quantity of the domain name after filtration, the corresponding website of domain name after filtering is analyzed, and the domain name after to described filtration sorts according to analysis result, to generate full ranking results; The second classification and ordination unit 315 is for described full ranking results is carried out to classification and ordination according to language and/or the area of the corresponding website of domain name after filtering, to generate classification and ordination result; The 5th storage unit 316 is for depositing described full ranking results and classification and ordination result in index data base.
The foregoing is only the preferred embodiments of the present invention, be not limited to the present invention, for a person skilled in the art, the present invention can have various modifications and variations.Within the spirit and principles in the present invention all, any modification of doing, be equal to replacement, improvement etc., within all should being included in claim scope of the present invention.

Claims (20)

1. a Website access method, is characterized in that, comprising:
A. set up in advance index data base, described index data base stores a plurality of domain name main bodys and at least one information about Fully-Qualified Domain Name corresponding with each domain name main body, Fully-Qualified Domain Name is comprised of domain name main body and domain name suffix, and according to Fully-Qualified Domain Name language or the address of corresponding website the information of Fully-Qualified Domain Name is carried out to classification and ordination;
B. receive the domain name main body of the corresponding certain domain name of specific website of user's input, and the domain name main body of described certain domain name is sent to index data base by web page server;
The information of the corresponding Fully-Qualified Domain Name of domain name main body that the domain name main body of described certain domain name of C. take in index data base is certain domain name described in keyword query, and Query Result is returned to user, described Query Result comprises the information of the unique preferred Fully-Qualified Domain Name being screened from corresponding classification and ordination according to user profile by system, the domain name main body of the corresponding domain name of information of this preferred Fully-Qualified Domain Name is identical with the domain name main body of user's input, during sequence, the information of described unique preferred Fully-Qualified Domain Name is ranked the first.
2. Website access method according to claim 1, is characterized in that, described Website access method also comprises:
D. index data base is upgraded.
3. Website access method according to claim 1, is characterized in that, described steps A comprises:
A1. generate new keyword as domain name main body;
A2. generated domain name main body is mated with all domain name suffix, be combined into a plurality of complete domain names;
A3. described a plurality of complete domain names are filtered, to delete non-existent domain name;
A4. the corresponding website of domain name after filtering is analyzed, and the domain name after to described filtration sorts according to analysis result, to generate full ranking results;
A5. deposit described full ranking results in index data base.
4. Website access method according to claim 3, is characterized in that, between described steps A 4 and steps A 5, also comprises:
A6. described full ranking results is divided according to language or the address of the corresponding website of domain name after filtering
Class sequence, to generate classification and ordination result;
Described steps A 5 also comprises:
Deposit described classification and ordination result in index data base.
5. Website access method according to claim 4, is characterized in that, described step C comprises:
C1. the main body of described certain domain name of take in index data base is inquired about as keyword, and judges whether to inquire the information of the Fully-Qualified Domain Name corresponding with the main body of described certain domain name, if so, performs step C2;
If not, perform step C3;
C2. according to user profile, full ranking results or classification and ordination result are screened, and the ranking results after screening is returned to user as Query Result;
C3. the main body of described certain domain name is mated with all domain name suffix, be combined into a plurality of complete domain names;
C4. described a plurality of complete domain names are filtered, to delete non-existent domain name, and the quantity of the domain name after judgement filtration, if the quantity of the domain name after filtering is 0, perform step C5; If the quantity of the domain name after filtering is 1, perform step C6; If the quantity of the domain name after filtering is greater than 1, perform step C7;
C5. to user feedback domain name, do not exist;
C6. this domain name after filtering to user feedback;
C7. the corresponding website of domain name after filtering is analyzed, and according to filtering out related web site in analysis result and the corresponding website of the domain name of user profile from described filtration;
C8. judge the quantity of described related web site, if the quantity of related web site is 0, perform step C9; If the quantity of related web site is 1, perform step C10; If the quantity of related web site is greater than 1, perform step C11;
C9. the corresponding website of domain name after filtering is analyzed, and according to analysis result, the domain name after to described filtration sorts, and to generate full ranking results, and full ranking results is returned to user;
C10. the domain name of this unique related web site is returned to user;
C11. related web site is analyzed, and generated relevance ranking result according to analysis result, and relevance ranking result is returned to user as Query Result.
6. Website access method according to claim 5, is characterized in that, in step C4, if the quantity of the domain name after filtering is 0, also performs step C12; If the quantity of the domain name after filtering is 1, also perform step C13; If the quantity of the domain name after filtering is greater than 1, also perform step C14;
C12. do not exist the information of Fully-Qualified Domain Name to be stored to index data base the main body of described certain domain name;
C13. the corresponding unique complete domain name of the main body of described certain domain name is stored to index data base;
C14. the corresponding website of domain name after filtering is analyzed, and the domain name after to described filtration sorts according to analysis result, to generate full ranking results;
C15. described full ranking results is carried out to classification and ordination according to language or the address of the corresponding website of domain name after filtering, to generate classification and ordination result;
C16. deposit described full ranking results and classification and ordination result in index data base.
7. Website access method according to claim 2, is characterized in that, in described step D, according at least one in following, index data base is upgraded:
According to registered user's active feedback, upgrade index data base;
According to a plurality of users' history access record, upgrade index data base;
The new website of periodic search is to upgrade index data base.
8. Website access method according to claim 7, is characterized in that, the step of upgrading index data base according to user's active feedback comprises:
D1. receive priority domain name or blacklist domain name that the first registered user feeds back, and be committed to index data base;
D2. index data base is inputted the first registered user priority domain name or blacklist domain name are distributed at least one other registered user, to investigate;
D3. at least one other registered user investigates described priority domain name or the corresponding website of blacklist domain name, and investigation result is returned to index data base;
D4. index data base is according to investigation result, and whether the feedback of evaluating described the first registered user is accurate;
If D5. the first registered user's feedback conforms to investigation result, according to the first registered user's feedback, upgrade index data base; Otherwise, do not upgrade index data base.
9. Website access method according to claim 3, it is characterized in that, following analysis is carried out in the corresponding website of domain name after filtering: the language of renewal frequency, hour of log-on, web site contents amount, employing, the corresponding area of Website server IP, linking relationship and domain name suffix information.
10. Website access method according to claim 5, it is characterized in that, described user profile comprises non-registered users information or information of registered users, and non-registered users information comprises: IP address, the history access record relevant to IP address, the setting of classification and ordination result type; Information of registered users comprises: station address, user preference setting, IP address, user's history access record.
11. Website access methods according to claim 10, it is characterized in that, described user profile is arranged as according to weight order from high to low: user preference setting, the setting of classification and ordination result type, user's history access record, history access record, station address, the IP address relevant to IP address.
12. Website access methods according to claim 11, is characterized in that, described step B also comprises: receive the user that quotes of user's input, and the weight of information of quoting user is between the setting of classification and ordination result type and user's history access record;
Step C2 also comprises: according to quoted user's information, full ranking results or classification and ordination result are screened.
13. 1 kinds of website visiting systems, is characterized in that, comprising:
Index data base is set up module, for setting up in advance index data base, described index data base stores a plurality of domain name main bodys and at least one information about Fully-Qualified Domain Name corresponding with each domain name main body, Fully-Qualified Domain Name is comprised of domain name main body and domain name suffix, and according to Fully-Qualified Domain Name language or the address of corresponding website the information of Fully-Qualified Domain Name is carried out to classification and ordination;
User's load module, for receiving the domain name main body of the corresponding certain domain name of specific website of user's input, and is sent to index data base by the domain name main body of described certain domain name by web page server;
Enquiry module, for take the information of the corresponding Fully-Qualified Domain Name of domain name main body that the domain name main body of described certain domain name is certain domain name described in keyword query at index data base, and Query Result is returned to user, described Query Result comprises the information of the unique preferred Fully-Qualified Domain Name being screened from corresponding classification and ordination according to user profile by system, the domain name main body of the corresponding domain name of information of this preferred Fully-Qualified Domain Name is identical with the domain name main body of user's input, during sequence, the information of described unique preferred Fully-Qualified Domain Name is ranked the first.
14. website visiting systems according to claim 13, is characterized in that, described website visiting system also comprises:
Update module, for upgrading index data base.
15. website visiting systems according to claim 13, is characterized in that, described index data base is set up module and comprised:
Keyword generation unit, for generating new keyword as domain name main body;
The first assembled unit, for generated domain name main body is mated with all domain name suffix, is combined into a plurality of complete domain names;
Filter element, for described a plurality of complete domain names are filtered, to delete non-existent domain name;
The first full sequencing unit, for the corresponding website of domain name after filtering is analyzed, and the domain name after to described filtration sorts according to analysis result, to generate full ranking results;
The first storage unit, for depositing described full ranking results in index data base.
16. website visiting systems according to claim 15, is characterized in that, described index data base is set up module and also comprised:
The first classification and ordination unit, for described full ranking results is carried out to classification and ordination according to language or the address of the corresponding website of domain name after filtering, to generate classification and ordination result;
The second storage unit, for depositing described classification and ordination result in index data base.
17. website visiting systems according to claim 16, it is characterized in that, described enquiry module comprises: inquiry and judging unit, for take the main body of described certain domain name at index data base, inquire about as keyword, and judge whether to inquire the information of the complete domain name corresponding with the main body of described certain domain name;
First returns to unit, for when inquiring the information of the complete domain name corresponding with the main body of described certain domain name, according to user profile, full ranking results or classification and ordination result are screened, and the ranking results after screening is returned to user as Query Result;
The second assembled unit, for when not inquiring the information of the complete domain name corresponding with the main body of described certain domain name, mates the main body of described certain domain name with all domain name suffix, be combined into a plurality of complete domain names;
Filter and judging unit, for described a plurality of complete domain names are filtered, to delete non-existent domain name, and the quantity of the domain name after judgement filtration;
Second returns to unit, for the quantity of the domain name after filtration, is 0 o'clock, to user feedback domain name, does not exist; The 3rd returns to unit, for the quantity of the domain name after filtration, is 1 o'clock, this domain name after filtering to user feedback;
Screening unit, is greater than at 1 o'clock for the quantity of the domain name after filtration, the corresponding website of domain name after filtering analyzed, and according to filtering out related web site in analysis result and the corresponding website of the domain name of user profile from described filtration;
Quantity judging unit, for judging the quantity of described related web site;
The 4th returns to unit, for the quantity at related web site, is 0 o'clock, the corresponding website of domain name after filtering is analyzed, and according to analysis result, the domain name after to described filtration sorts, and to generate full ranking results, and full ranking results is returned to user;
The 5th returns to unit, for the quantity at related web site, is 1 o'clock, and the domain name of this unique related web site is returned to user;
The 6th returns to unit, for the quantity at related web site, is greater than at 1 o'clock, related web site is analyzed, and generated relevance ranking result according to analysis result, and relevance ranking result is returned to user as Query Result.
18. website visiting systems according to claim 17, is characterized in that, described enquiry module also comprises:
The 3rd storage unit, is 0 o'clock for the quantity of the domain name after filtration, does not exist the information of Fully-Qualified Domain Name to be stored to index data base the main body of described certain domain name;
The 4th storage unit, is 1 o'clock for the quantity of the domain name after filtration, and the corresponding unique complete domain name of the main body of described certain domain name is stored to index data base;
The second full sequencing unit, is greater than at 1 o'clock for the quantity of the domain name after filtration, the corresponding website of domain name after filtering is analyzed, and the domain name after to described filtration sorts according to analysis result, to generate full ranking results;
The second classification and ordination unit, for described full ranking results is carried out to classification and ordination according to language or the address of the corresponding website of domain name after filtering, to generate classification and ordination result;
The 5th storage unit, for depositing described full ranking results and classification and ordination result in index data base.
19. website visiting systems according to claim 14, is characterized in that, described update module is used for
According at least one in following, upgrade index data base:
According to registered user's active feedback, upgrade index data base;
According to a plurality of users' history access record, upgrade index data base;
The new website of periodic search is to upgrade index data base.
20. website visiting systems according to claim 17, it is characterized in that, described user profile comprises non-registered users information or information of registered users, and non-registered users information comprises: IP address, the history access record relevant to IP address, the setting of classification and ordination result type; Information of registered users comprises: station address, user preference setting, IP address, user's history access record.
CN201210016303.3A 2012-01-17 2012-01-17 Method and system for website accessing Active CN102609473B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210016303.3A CN102609473B (en) 2012-01-17 2012-01-17 Method and system for website accessing

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201210016303.3A CN102609473B (en) 2012-01-17 2012-01-17 Method and system for website accessing

Publications (2)

Publication Number Publication Date
CN102609473A CN102609473A (en) 2012-07-25
CN102609473B true CN102609473B (en) 2014-11-12

Family

ID=46526845

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210016303.3A Active CN102609473B (en) 2012-01-17 2012-01-17 Method and system for website accessing

Country Status (1)

Country Link
CN (1) CN102609473B (en)

Families Citing this family (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105045878A (en) * 2015-07-21 2015-11-11 北京信景软件有限公司 Website building system and method for simultaneously operating a plurality of websites in one website space
CN105721624B (en) * 2016-01-22 2019-06-21 中国互联网络信息中心 A kind of novel authoritative domain name resolution service method and apparatus
CN108763404A (en) * 2018-05-22 2018-11-06 深圳市茁壮网络股份有限公司 A kind of access address fault-tolerance approach and fault tolerance facility
CN109492088A (en) * 2018-09-19 2019-03-19 平安科技(深圳)有限公司 Search result optimization sequencing method, device and computer readable storage medium
CN109376187A (en) * 2018-12-17 2019-02-22 北京京东金融科技控股有限公司 A kind of querying method and device based on block chain
CN109788082B (en) * 2019-01-23 2021-09-28 深圳互联先锋科技有限公司 Method and system for efficient domain name detection
CN109951448A (en) * 2019-01-31 2019-06-28 中国互联网络信息中心 Domain name authentic authentication method and device based on block chain
CN109784761A (en) * 2019-01-31 2019-05-21 中国互联网络信息中心 Domain name ranking method, device, electronic equipment and storage medium based on block chain
CN109905388B (en) * 2019-02-20 2021-12-07 中国互联网络信息中心 Domain name credit processing method and system based on block chain
CN110704716B (en) * 2019-10-31 2022-04-22 中国科学院计算机网络信息中心 Cultural relic identification and service method based on Chinese domain name
CN111198771A (en) * 2019-11-29 2020-05-26 云深互联(北京)科技有限公司 Method, device, equipment and storage medium for realizing platform general service
CN112632159B (en) * 2020-12-01 2021-09-28 腾讯科技(深圳)有限公司 Database access control method and device, electronic equipment and storage medium
CN112905643B (en) * 2021-03-11 2022-12-16 广西电力职业技术学院 Method and system for automatically retrieving from automobile fault case library
CN113312926B (en) * 2021-06-07 2024-10-29 浙江贰贰网络有限公司 Domain name meaning translation method
CN113449160A (en) * 2021-06-30 2021-09-28 平安科技(深圳)有限公司 Intelligent data screening method, device, equipment and medium

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1365239A (en) * 2001-01-11 2002-08-21 英华达股份有限公司 Method for inputting tracing and intelligent matching web site on radio application protocol browser
CN1941726A (en) * 2006-07-18 2007-04-04 魏新成 Method for improving input display to domain address
CN101539949A (en) * 2008-11-13 2009-09-23 北京搜狗科技发展有限公司 URL completion prompting method and device

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1365239A (en) * 2001-01-11 2002-08-21 英华达股份有限公司 Method for inputting tracing and intelligent matching web site on radio application protocol browser
CN1941726A (en) * 2006-07-18 2007-04-04 魏新成 Method for improving input display to domain address
CN101539949A (en) * 2008-11-13 2009-09-23 北京搜狗科技发展有限公司 URL completion prompting method and device

Also Published As

Publication number Publication date
CN102609473A (en) 2012-07-25

Similar Documents

Publication Publication Date Title
CN102609473B (en) Method and system for website accessing
US8935197B2 (en) Systems and methods for facilitating open source intelligence gathering
CN102073699B (en) For improving the method for Search Results, device and equipment based on user behavior
CN102073725B (en) Method for searching structured data and search engine system for implementing same
US20130046771A1 (en) Systems and methods for facilitating the gathering of open source intelligence
CN101373485A (en) Method and apparatus for providing web page access entrance
CN104915413A (en) Health monitoring method and health monitoring system
CN1487442A (en) Method and system for practicing automatic completion in pages
CN110134845A (en) Project public sentiment monitoring method, device, computer equipment and storage medium
CN105718533A (en) Information pushing method and device
CN101382954A (en) Method and system for providing web site collection name
CN102930058A (en) Method and device for realizing search in address field of browser
CN104866582A (en) Method and apparatus for displaying page information
JP4962980B2 (en) Search result classification apparatus and method using click log
CN109934631A (en) Question and answer information processing method, device and computer equipment
CN103106234A (en) Searching method and device of webpage content
CN105468627A (en) Method and system for shielding and filtering web page contents
JP5982968B2 (en) Electronic book display device, collection information display program, and collection information display method
CN103718179A (en) Information processing apparatus, information processing method, information processing program, and storage medium having information processing program stored therein
CN105808636B (en) Hypertext link pushing system based on APP information data
JP6433270B2 (en) Content search result providing system and content search result providing method
CN114281327A (en) Method for realizing automatic sequencing of page components through weight algorithm
CN103136316A (en) Website navigation system and method
US20170061008A1 (en) System and method for conducting a search
CN105354225A (en) Network search result recommendation method and electronic device

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant