CN110866170A - Importance evaluation method, search method and system for Tor darknet service based on site quality - Google Patents

Importance evaluation method, search method and system for Tor darknet service based on site quality Download PDF

Info

Publication number
CN110866170A
CN110866170A CN201910992292.4A CN201910992292A CN110866170A CN 110866170 A CN110866170 A CN 110866170A CN 201910992292 A CN201910992292 A CN 201910992292A CN 110866170 A CN110866170 A CN 110866170A
Authority
CN
China
Prior art keywords
evaluation
website
value
page
dark
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910992292.4A
Other languages
Chinese (zh)
Inventor
王学宾
时金桥
王大魁
尹泽林
赵璨
高悦
陈牧谦
王美琪
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Institute of Information Engineering of CAS
Original Assignee
Institute of Information Engineering of CAS
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Institute of Information Engineering of CAS filed Critical Institute of Information Engineering of CAS
Priority to CN201910992292.4A priority Critical patent/CN110866170A/en
Publication of CN110866170A publication Critical patent/CN110866170A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/958Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

The invention discloses an importance evaluation method, a searching method and a searching system of a Tor darknet service based on site quality. The method comprises the following steps: 1) constructing a Tor dark website point quality evaluation index system, and determining evaluation indexes and corresponding weights; 2) acquiring webpage information of a target Tor dark website point, and determining an evaluation value of aesthetic evaluation, an evaluation value of usability evaluation, an evaluation value of multimedia support evaluation and an evaluation value of content richness evaluation of the target Tor dark website point according to the acquired information; 3) determining the evaluation value of the website reputation evaluation of the target Tor dark website according to whether a user feedback page exists in the target Tor dark website, whether the website is recorded by Tor2web service and whether the dark website address has readability; 4) and determining the importance of the target Tor dark website point according to each evaluation value and corresponding weight of the target Tor dark website point. The invention can accurately evaluate the quality of the station.

Description

Importance evaluation method, search method and system for Tor darknet service based on site quality
Technical Field
The invention relates to an importance evaluation method, a searching method and a searching system of a Tor darknet service based on site quality, and belongs to the technical field of computer networks.
Background
The whole internet can be divided into a Surface network (Surface web) and a deep network (deep web) according to the distribution status. The surface network refers to a web which can be indexed by a traditional web search engine and is formed by taking a static web page which can be reached by hyperlink as a main part. Deep networks, also known as Deep Net, Invisible web or Hidden web, refer to a collection of resources stored in a network database that are not accessible via hyperlinks and that need to be accessed via dynamic web technology. Common deep web content includes dynamic web pages, web pages that require login, unlinked web pages, web pages with limited access, web pages in non-HTML/text formats, and the like. The darknet (Dark web) is particularly a web service in an anonymous network hiding service, and belongs to a part of a deep network.
Although the darknet was designed at the beginning to protect the identity of the service operator from being traced by others, the illegal contents of the darknet later constitute a new security threat in anonymous networks. Various countries have taken a series of measures against the darknet. In 2013, in 10 months, the FBI utilizes the site bug end to remove the Silk road1.0 site which is specially sold with illegal goods and services in the Tor network, and catches the site operator. In 11 months 2014, the FBI combines multiple departments such as the american immigration and customs enforcement agency (ICE), the national security institute (HSI), the european police association (Europol), the european inspection and management association (Eurojust), and the like, and terminates the Silk Road 2.0, more than 400 contraband products such as drugs, guns, and the like, and Tor-darknet service sites for employment of businesses such as the murder and the killer, and the like, by using the code of the Operation public agency (operations enterprises). With the increasing rampant of illegal activities in the hidden network, importance evaluation work aiming at hidden services becomes more and more necessary, and is an increasingly important research direction in the field of anonymous networks.
Currently, research aiming at the Tor dark network focuses on anonymous cracking and network measurement of protocol level, and an importance evaluation method of Tor dark network service based on site quality does not appear at present.
Disclosure of Invention
The invention aims to provide a method, a method and a system for evaluating the importance of a Tor darknet service based on site quality. The number of sites in the Tor darknet is tens of thousands, the number of pages is tens of millions, and although the difference is many orders of magnitude relative to the surface network, the redundancy is not poor, and the low quality is even the garbage data. In addition, the importance degree of the pages and the sites is different, the important pages or the information published by the sites are more important, the content quality is higher, and the reading crowd is wider. Therefore, the evaluation of the importance of the Tor darknet service is also an important aspect for constructing a Tor darknet search engine, and has great influence on the relevance ranking aspect in the retrieval; when the search service is performed, after the importance evaluation result of the dark web service is obtained, the dark web search results can be ranked according to the importance evaluation value. Similarly, establishing an importance evaluation method of web pages and sites suitable for the internet in a surface network is an important research subject in the field of information retrieval, and is an important ranking index after the relevance ranking of the retrieval ranking relay query.
The relevant attributes of the web page or the site itself may also constitute criteria for measuring the importance of the site. The quality of the web pages is different from that of the website, generally, the high-quality web pages or the website generally have rich content information, the representation of the web pages is known in the field, the web pages conform to the human aesthetics and contain a plurality of multimedia information and the like, the low-quality web pages generally have single content information, the known degree is not enough, the human aesthetics is not followed, the information expression form is single, and the web pages are full of advertisement information and the like. Therefore, the quality of a web page or site can be measured from its own relevant attributes. The existing commercial search engine uses a site quality evaluation method which is mainly specified by a developer, and in an evaluation method in academia, the specification is mainly analogized based on a software quality evaluation method.
The method evaluates the importance of the Tor darknet by using the existing evaluation method based on the site quality in the analogy surface layer network, and the evaluation result can be used as the index of the relevance retrieval ranking to be applied to a Tor darknet search engine system.
The technical scheme of the invention is as follows:
a method for evaluating the importance of a Tor darknet service based on site quality comprises the following steps:
1) constructing a Tor dark website point quality evaluation index system, and determining evaluation indexes and corresponding weights; the evaluation indexes comprise aesthetic evaluation, usability evaluation, multimedia support evaluation, content richness evaluation and website reputation evaluation of the Tor darknet site;
2) acquiring webpage information of a target Tor dark website point, and determining an evaluation value of aesthetic evaluation, an evaluation value of usability evaluation, an evaluation value of multimedia support evaluation and an evaluation value of content richness evaluation of the target Tor dark website point according to the acquired webpage information;
3) determining the evaluation value of the website reputation evaluation of the target Tor dark website according to whether a user feedback page exists in the target Tor dark website, whether the website is recorded by Tor2web service and whether the dark website address has readability;
4) and determining the importance of the target Tor-dark website point according to the evaluation value of aesthetic evaluation, the evaluation value of usability evaluation, the evaluation value of multimedia support evaluation, the evaluation value of content richness evaluation, the evaluation value of website reputation evaluation and corresponding weights.
Further, the method for judging whether the hidden network service address has readability comprises the following steps: and judging whether the first bits of the dark net service address are matched with words of an English dictionary or not, and if the matched words exist, judging that the dark net service address has readability.
Further, the method for calculating the evaluation value of the aesthetic evaluation comprises the following steps: first, determining characteristics and weights thereof constituting aesthetic effects of the website, wherein the characteristics comprise: pictures, page presentations and resolution, color and underlining emphasis; determining the characteristic value of the picture characteristic according to whether the website page has a determined picture size attribute, whether the page displays only one large picture, whether each picture in the page has an image tag and whether each image tag in the page has a link attribute; determining a characteristic value of page display and resolution characteristic according to the resolution supported by the page and the table size; determining a characteristic value of the color characteristic according to the number of the color types and the RGB range in the page and whether color blindness is considered; determining a feature value of an underline emphasized feature according to whether all of the underline links in the page are hyperlinks; and then, calculating to obtain the evaluation value of the aesthetic evaluation according to the characteristic value of each characteristic and the corresponding weight.
Further, the method for calculating the evaluation value of the usability evaluation includes: firstly, determining features for calculating usability evaluation, wherein the features comprise consistency features, navigation bar features and annotation features of a page; then, designing according to whether a CSS file is adopted in a website page to determine a consistency characteristic value of the page, determining a navigation bar characteristic value of the page according to whether a frame is adopted in the website page, whether a link returning to a home page exists in each sub-page and whether a menu bar exists in the home page, and determining an annotation characteristic value of the page according to whether the link and the table in the website page both contain a label and whether META description exists in each page; and then, calculating to obtain an evaluation value of the usability evaluation according to the characteristic value of each characteristic and the corresponding weight.
Further, the method for calculating the evaluation value of the multimedia support evaluation comprises the following steps: firstly, determining characteristics for calculating multimedia support evaluation, including a plug-in characteristic, a multimedia attribute characteristic, a page multimedia quantity characteristic and a preview characteristic of a page; determining a plug-in characteristic value, a multimedia attribute characteristic value, a page multimedia quantity characteristic value and a preview characteristic value according to the website page information; and then, calculating to obtain an evaluation value of the multimedia support evaluation according to the characteristic value of each characteristic and the corresponding weight.
Further, the method for calculating the evaluation value of the content richness evaluation includes: firstly, determining characteristics for calculating content richness evaluation, including bulletin board characteristics, navigation board characteristics, search box characteristics and automatic refreshing characteristics of a page; then determining a bulletin board characteristic value, a navigation board characteristic value, a search box characteristic value and an automatic refreshing characteristic value according to the website page information; and then, calculating to obtain an evaluation value of the content richness evaluation according to the characteristic value of each characteristic and the corresponding weight.
An importance evaluation system of Tor dark web service based on site quality is characterized by comprising a Tor dark web site quality evaluation index system, an evaluation index value calculation module and an importance evaluation module; wherein,
a Tor dark website point quality evaluation index system used for determining evaluation indexes and corresponding weights; the evaluation indexes comprise aesthetic evaluation, usability evaluation, multimedia support evaluation, content richness evaluation and website reputation evaluation of the Tor darknet site;
the evaluation index value calculation module is used for acquiring webpage information of the target Tor dark website point and determining an evaluation value of aesthetic evaluation, an evaluation value of usability evaluation, an evaluation value of multimedia support evaluation and an evaluation value of content richness evaluation of the target Tor dark website point according to the acquired webpage information; determining the evaluation value of the website reputation evaluation of the target Tor dark website according to whether a user feedback page exists in the target Tor dark website, whether the website is recorded by Tor2web service and whether the dark website address has readability;
and the importance evaluation module is used for determining the importance of the target Tor-dark website point according to the evaluation value of the aesthetic evaluation, the evaluation value of the usability evaluation, the evaluation value of the multimedia support evaluation, the evaluation value of the content richness evaluation, the evaluation value of the website reputation evaluation and the corresponding weight.
Further, the evaluation index value calculation module judges whether the first bits of the dark web service address are matched with words of an English dictionary or not, and if the first bits of the dark web service address are matched with the words of the English dictionary, the dark web service address is judged to have readability.
Furthermore, the weight of aesthetic evaluation and the weight of website reputation evaluation are higher than the weight of usability evaluation, and the weight of usability evaluation is higher than the weight of multimedia support evaluation and content richness evaluation.
The Tor dark web service searching method based on the site quality is characterized in that the searching results are ranked according to the importance evaluation value of the dark web service obtained by the evaluation method.
Compared with the prior art, the invention has the following positive effects:
1. based on the existing quality evaluation framework of the surface layer network site, aiming at the improvement of the Tor darknet service in the aspect of website reputation evaluation, two Tor darknet service characteristics of Tor2web and domain name readability are introduced, so that the Tor darknet service is better suitable for Tor darknet scenes.
2. Simple experiments are carried out on the evaluation method in a mode of selecting representative sites and carrying out manual judgment, and the experiments show that the quality of the sites can be correctly utilized to score the importance of the Tor darknet sites.
3. The search results of the darknet services are ranked according to the importance of the darknet services, so that a searcher can preferentially see the most important darknet sites, and the searcher can conveniently obtain required information in time.
Drawings
FIG. 1 is a flow chart of the method of the present invention.
Detailed Description
The technical solution of the present invention is further described in detail below with reference to the accompanying drawings.
The method flow of the invention is shown in fig. 1, and the relevant evaluation indexes in the existing quality evaluation standard of the surface network website are investigated and analyzed, the evaluation indexes used in the quality evaluation standard of the surface network website are summarized, and simultaneously, 5 dimensions and respective weights influencing the quality evaluation indexes of the Tor dark website are determined by combining the unique characteristics of the Tor dark website, as shown in table 1.
Table 1 shows the dimensions and weights of the quality evaluation of the Tor dark site points
Serial number Dimension name Weight of
1 Aesthetic assessment 0.3
2 Ease of use assessment 0.2
3 Multimedia support assessment 0.1
4 Content richness assessment 0.1
5 Website reputation assessment 0.3
The scores from these five dimensions are weighted and summed for the final Tor dark site point quality score. The score of each dimension is xiThe weights are respectively given to wiThe final score E may be expressed as:
Figure BDA0002238641870000041
each dimension will be described in detail below.
The aesthetic evaluation is a part with higher weight in the website quality evaluation and is composed of four aspects of features and sub-features under the features. The characteristics of the four aspects are formed according to related components and attributes forming the aesthetic effect of the website, and the four aspects comprise pictures, page display and resolution, colors and underlines for emphasizing four parts. The reason for choosing the aesthetic assessment as one of the five assessment dimensions is that the aesthetic assessment focuses on the visual design of the website, and a good visual design is very attractive to website visitors. At the same time we tend to consider that the more important a darknet website is, the better its visual design is. The specific features and sub-features, evaluation methods and feature weights are shown in table 2. The specific calculation method is that the average value of the sub-features under each feature is calculated firstly, then the average value is multiplied by the feature weight and accumulated, and the final score of the aesthetic evaluation dimension can be obtained. Scores for the other four dimensions are calculated as above.
TABLE 2 characteristics and sub-characteristics for aesthetic evaluation, evaluation methods and characteristic weights
Figure BDA0002238641870000051
Usability assessment is also the higher weight part of website quality assessment. Similar to the aesthetic assessment, the ease of use assessment is also composed of the relevant feature and sub-features under that feature. The related characteristics comprise consistency, navigation bars and comments. The reason for selecting the usability assessment is that the user-friendly page design will have a positive impact on the user experience, and a good website design should take into account the usability of the website for the user. The specific features and sub-features, evaluation methods and feature weights of interest are shown in table 3.
TABLE 3 characteristics and sub-characteristics for ease of use assessment, assessment method and characteristic weights
Figure BDA0002238641870000061
The multimedia support evaluation is composed of four characteristics including four aspects of plug-in, multimedia attribute, page multimedia quantity and preview. The reason for selecting the multimedia support evaluation is that multimedia plays an irreplaceable role in webpage display, and websites containing multimedia are often more expressive in information display. Specific characteristics, evaluation methods and characteristic weights are shown in table 4.
Table 4 shows the characteristics, evaluation methods and characteristic weights of multimedia support evaluation
Figure BDA0002238641870000062
The content richness evaluation is composed of four characteristics including a bulletin board, an information navigation board, a search box and whether to close the automatic refreshing. The reason for selecting the content richness evaluation is that the content is the primary standard for displaying the webpage quality, and the website with rich content and content display brings higher reading value to the user. Specific characteristics, evaluation methods and characteristic weights are shown in table 5.
Table 5 shows the characteristics, evaluation methods and characteristic weights of the evaluation of content richness
Figure BDA0002238641870000063
Figure BDA0002238641870000071
The reputation evaluation of the website mainly comprises three characteristics, including three aspects of user feedback page, Tor2web listing and domain name readability. It should be noted that the reputation of the Tor intranet site is different from the reputation of the overlay network site due to the anonymity of the Tor network itself. The design of the latter two features combines the top-level network reputation evaluation feature with the characteristic of the Tor darknet itself. The reason for adopting the reputation evaluation of the website is that the reputation represents the authority of the website, and a website with good reputation is more easily favored by users. Specific characteristics, evaluation methods and characteristic weights are shown in table 6.
Table 6 shows the evaluation method and the feature weight table
Figure BDA0002238641870000072
And multiplying and accumulating the results after evaluation according to the five dimensions respectively according to the weights shown in the table 1 to obtain the total score of the website evaluation. The closer the total score obtained is to 1, the more important the site is; the closer to 0, the less important the station is.
For the experiment of the site quality evaluation part, the invention will take 5 representative Tor darknet sites and perform score calculation according to the evaluation criteria described above. The Tor dark site swatch selected and its brief introduction are as follows:
1. the hidden net Chinese forum website http:// xkow4 × 7cusncz. onion/, is the more active Chinese forum website in the current Tor hidden net, the online time is as long as two years, about 100 visits are made daily, and more than 30 posts are updated daily.
A Deepdotweb darknet portal site is a famous news information portal site in the Tor darknet, anonymous privacy-related reports are reported every day, posts are updated, the online time is up to two years, the number of accesses per day is estimated to be tens of thousands of times, and the website contains functions of multimedia, searching and the like, and is a relatively perfect darknet portal site.
3. The hidden web hidden wiki, the website http:// wikit:// ta4qgz4.onion, is an important navigation type website in the Tor hidden web, the mirror image sites are numerous, most users in the hidden web visit the entrance of the Tor hidden web, the daily visit amount is estimated to be tens of thousands of people, the page style is similar to that of Wikipedia, the website comprises the functions of navigation, search and the like, and the website is a relatively perfect hidden web portal site.
4. Spanish personal blog, website http:// vwycl:. 33xykha. onion/, is a person-maintained Tor darknet blog, employing wordpress website templates, only one blog, and the daily access is expected to be at most ten digits.
5. The resource sharing site is a website which only provides a resource uploading interface, and is extremely crude, and the daily access is expected to be tens of digits at most.
For the above sites, score calculations were performed in 5 dimensions, respectively, and then a composite score was calculated, as shown in table 7.
Table 7 shows the website quality assessment scores
Figure BDA0002238641870000081
The experimental result shows that the deep web hidden portal website is relatively perfect and obtains the highest score because the deep web hidden portal website contains pages such as multimedia, a search box, user feedback and the like, the deep web hidden wiki has no deep web website in the website content display mode and has the score immediately after the deep web hidden portal website, the deep web Chinese forum website has the third rank because the audience is smaller than the first two, and the personal blog and the simple and crude resource sharing website are ranked at the end. The importance evaluation value of the dark net is used as an important weight value to influence the sorting of the search results of the dark net, the weight value can be 1, the weight value can also be adjusted according to the actual search results, and the importance evaluation value of the dark net plays a guiding role in the sorting of the search results. If the search result of the darknet service comprises the 5 darknet websites, the 5 darknet websites are ranked according to the importance evaluation scores in the search result, so that the searcher can preferentially see the more important search result.
In addition, the system of the importance evaluation index of the darknet service has objectivity, the final scores and the ranking of the 5 darknet services are matched with the importance degree of the 5 types of websites which is determined according to the common general knowledge in the field, and the importance evaluation method of the invention is proved to be capable of correctly utilizing the quality of the website to evaluate the importance of the Tor darknet website, so that the method has objectivity and practical significance.
Although specific details of the invention, algorithms and figures are disclosed for illustrative purposes, these are intended to aid in the understanding of the contents of the invention and the implementation in accordance therewith, as will be appreciated by those skilled in the art: various substitutions, changes and modifications are possible without departing from the spirit and scope of the present invention and the appended claims. The invention should not be limited to the preferred embodiments and drawings disclosed herein, but rather should be defined only by the scope of the appended claims.

Claims (10)

1. A method for evaluating the importance of a Tor darknet service based on site quality comprises the following steps:
1) constructing a Tor dark website point quality evaluation index system, and determining evaluation indexes and corresponding weights; the evaluation indexes comprise aesthetic evaluation, usability evaluation, multimedia support evaluation, content richness evaluation and website reputation evaluation of the Tor darknet site;
2) acquiring webpage information of a target Tor dark website point, and determining an evaluation value of aesthetic evaluation, an evaluation value of usability evaluation, an evaluation value of multimedia support evaluation and an evaluation value of content richness evaluation of the target Tor dark website point according to the acquired webpage information;
3) determining the evaluation value of the website reputation evaluation of the target Tor dark website according to whether a user feedback page exists in the target Tor dark website, whether the website is recorded by Tor2web service and whether the dark website address has readability;
4) and determining the importance of the target Tor-dark website point according to the evaluation value of aesthetic evaluation, the evaluation value of usability evaluation, the evaluation value of multimedia support evaluation, the evaluation value of content richness evaluation, the evaluation value of website reputation evaluation and corresponding weights.
2. The method of claim 1, wherein the determining whether the darknet service address is readable comprises: and judging whether the first bits of the dark net service address are matched with words of an English dictionary or not, and if the matched words exist, judging that the dark net service address has readability.
3. The method of claim 1, wherein the evaluation value of the aesthetic measure is calculated by: first, determining characteristics and weights thereof constituting aesthetic effects of the website, wherein the characteristics comprise: pictures, page presentations and resolution, color and underlining emphasis; determining the characteristic value of the picture characteristic according to whether the website page has a determined picture size attribute, whether the page displays only one large picture, whether each picture in the page has an image tag and whether each image tag in the page has a link attribute; determining a characteristic value of page display and resolution characteristic according to the resolution supported by the page and the table size; determining a characteristic value of the color characteristic according to the number of the color types and the RGB range in the page and whether color blindness is considered; determining a feature value of an underline emphasized feature according to whether all of the underline links in the page are hyperlinks; and then, calculating to obtain the evaluation value of the aesthetic evaluation according to the characteristic value of each characteristic and the corresponding weight.
4. The method of claim 1, wherein the evaluation value for the ease of use assessment is calculated by: firstly, determining features for calculating usability evaluation, wherein the features comprise consistency features, navigation bar features and annotation features of a page; then, designing according to whether a CSS file is adopted in a website page to determine a consistency characteristic value of the page, determining a navigation bar characteristic value of the page according to whether a frame is adopted in the website page, whether a link returning to a home page exists in each sub-page and whether a menu bar exists in the home page, and determining an annotation characteristic value of the page according to whether the link and the table in the website page both contain a label and whether META description exists in each page; and then, calculating to obtain an evaluation value of the usability evaluation according to the characteristic value of each characteristic and the corresponding weight.
5. The method of claim 1, wherein the evaluation value of the multimedia support evaluation is calculated by: firstly, determining characteristics for calculating multimedia support evaluation, including a plug-in characteristic, a multimedia attribute characteristic, a page multimedia quantity characteristic and a preview characteristic of a page; determining a plug-in characteristic value, a multimedia attribute characteristic value, a page multimedia quantity characteristic value and a preview characteristic value according to the website page information; and then, calculating to obtain an evaluation value of the multimedia support evaluation according to the characteristic value of each characteristic and the corresponding weight.
6. The method of claim 1, wherein the evaluation value of the content richness assessment is calculated by: firstly, determining characteristics for calculating content richness evaluation, including bulletin board characteristics, navigation board characteristics, search box characteristics and automatic refreshing characteristics of a page; then determining a bulletin board characteristic value, a navigation board characteristic value, a search box characteristic value and an automatic refreshing characteristic value according to the website page information; and then, calculating to obtain an evaluation value of the content richness evaluation according to the characteristic value of each characteristic and the corresponding weight.
7. An importance evaluation system of Tor dark web service based on site quality is characterized by comprising a Tor dark web site quality evaluation index system, an evaluation index value calculation module and an importance evaluation module; wherein,
a Tor dark website point quality evaluation index system used for determining evaluation indexes and corresponding weights; the evaluation indexes comprise aesthetic evaluation, usability evaluation, multimedia support evaluation, content richness evaluation and website reputation evaluation of the Tor darknet site;
the evaluation index value calculation module is used for acquiring webpage information of the target Tor dark website point and determining an evaluation value of aesthetic evaluation, an evaluation value of usability evaluation, an evaluation value of multimedia support evaluation and an evaluation value of content richness evaluation of the target Tor dark website point according to the acquired webpage information; determining the evaluation value of the website reputation evaluation of the target Tor dark website according to whether a user feedback page exists in the target Tor dark website, whether the website is recorded by Tor2web service and whether the dark website address has readability;
and the importance evaluation module is used for determining the importance of the target Tor-dark website point according to the evaluation value of the aesthetic evaluation, the evaluation value of the usability evaluation, the evaluation value of the multimedia support evaluation, the evaluation value of the content richness evaluation, the evaluation value of the website reputation evaluation and the corresponding weight.
8. The system of claim 7, wherein the evaluation index value calculation module determines that the dark web service address has readability by determining whether the first several digits of the dark web service address match words of an english dictionary, and if there are matching words.
9. The system of claim 7, wherein the aesthetic measure and the reputation measure are each weighted higher than the ease of use measure, which is weighted higher than the multimedia support measure and the content enrichment measure.
10. A method for searching a Tor dark network service based on site quality is characterized in that importance evaluation values of the dark network service obtained by the method of any one of claims 1 to 6 are ranked to obtain a search result.
CN201910992292.4A 2019-10-18 2019-10-18 Importance evaluation method, search method and system for Tor darknet service based on site quality Pending CN110866170A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910992292.4A CN110866170A (en) 2019-10-18 2019-10-18 Importance evaluation method, search method and system for Tor darknet service based on site quality

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910992292.4A CN110866170A (en) 2019-10-18 2019-10-18 Importance evaluation method, search method and system for Tor darknet service based on site quality

Publications (1)

Publication Number Publication Date
CN110866170A true CN110866170A (en) 2020-03-06

Family

ID=69652826

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910992292.4A Pending CN110866170A (en) 2019-10-18 2019-10-18 Importance evaluation method, search method and system for Tor darknet service based on site quality

Country Status (1)

Country Link
CN (1) CN110866170A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111966946A (en) * 2020-09-10 2020-11-20 北京百度网讯科技有限公司 Method, device, equipment and storage medium for identifying authority value of page
CN113568779A (en) * 2021-06-25 2021-10-29 杭州雅观科技有限公司 Community data backup system based on routing equipment

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103729374B (en) * 2012-10-15 2017-04-19 北京搜狗信息服务有限公司 Information search method and search engine
CN107341183A (en) * 2017-05-31 2017-11-10 中国科学院信息工程研究所 A kind of Website classification method based on darknet website comprehensive characteristics
CN108810025A (en) * 2018-07-19 2018-11-13 平安科技(深圳)有限公司 A kind of security assessment method of darknet, server and computer-readable medium
US20180351916A1 (en) * 2014-04-11 2018-12-06 Nant Holdings Ip, Llc Fabric-based anonymity management, systems and methods

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103729374B (en) * 2012-10-15 2017-04-19 北京搜狗信息服务有限公司 Information search method and search engine
US20180351916A1 (en) * 2014-04-11 2018-12-06 Nant Holdings Ip, Llc Fabric-based anonymity management, systems and methods
CN107341183A (en) * 2017-05-31 2017-11-10 中国科学院信息工程研究所 A kind of Website classification method based on darknet website comprehensive characteristics
CN108810025A (en) * 2018-07-19 2018-11-13 平安科技(深圳)有限公司 A kind of security assessment method of darknet, server and computer-readable medium

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
KANG LI: "Out-of-band Discovery and Evaluation for Tor Hidden Services", 《SAC2016》 *
刘培朋: "匿名网络中隐藏服务的发现与追踪", 《中国博士学位论文全文数据库》 *
李抗: "Tor隐藏服务的发现与测量", 《万方数据知识服务平台》 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111966946A (en) * 2020-09-10 2020-11-20 北京百度网讯科技有限公司 Method, device, equipment and storage medium for identifying authority value of page
CN113568779A (en) * 2021-06-25 2021-10-29 杭州雅观科技有限公司 Community data backup system based on routing equipment

Similar Documents

Publication Publication Date Title
US10825047B2 (en) Apparatus and method of selection and placement of targeted messages into a search engine result page
Thelwall A history of webometrics
US8321278B2 (en) Targeted advertisements based on user profiles and page profile
Bennett et al. Inferring and using location metadata to personalize web search
US8099406B2 (en) Method for human editing of information in search results
US20140189480A1 (en) Dynamic aggregation and display of contextually relevant content
US20120095834A1 (en) Systems and methods for using a behavior history of a user to augment content of a webpage
US20120296918A1 (en) Credibility Information in Returned Web Results
KR20110085995A (en) Providing search results
US8515986B2 (en) Query pattern generation for answers coverage expansion
KR101566616B1 (en) Advertisement decision supporting system using big data-processing and method thereof
US20120066233A1 (en) System and methods for mapping user reviewed and rated websites to specific user activities
Kalogeropoulos et al. ‘I saw the news on Facebook’: brand attribution when accessing news from distributed environments
Ting et al. What does hotel website content say about a property—an evaluation of upscale hotels in Taiwan and China
EP2339526A1 (en) System and method for monitoring visits to a target site
US20130066800A1 (en) Method of aggregating consumer reviews
US20170186035A1 (en) Method of and server for selection of a targeted message for placement into a search engine result page in response to a user search request
EP2933734A1 (en) Method and system for the structural analysis of websites
KR100987058B1 (en) Method and system for providing advertising service using the keywords of internet contents and program recording medium
US11651039B1 (en) System, method, and user interface for a search engine based on multi-document summarization
AU2016346740A1 (en) Internet content providing server and computer readable recording medium embodying same method
US8380732B2 (en) Systematic process for creating large numbers of relevant, contextual marginal comments based on existing discussions of quotations and links
CN110866170A (en) Importance evaluation method, search method and system for Tor darknet service based on site quality
Sohail Search Engine Optimization Methods & Search Engine Indexing for CMS Applications
CN109165264B (en) Webpage analysis method and device based on diversified thermodynamic diagrams

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20200306

WD01 Invention patent application deemed withdrawn after publication