KR20100031198A - Apparatus and method of creating white-list of search-engine - Google Patents
Apparatus and method of creating white-list of search-engine Download PDFInfo
- Publication number
- KR20100031198A KR20100031198A KR1020080090174A KR20080090174A KR20100031198A KR 20100031198 A KR20100031198 A KR 20100031198A KR 1020080090174 A KR1020080090174 A KR 1020080090174A KR 20080090174 A KR20080090174 A KR 20080090174A KR 20100031198 A KR20100031198 A KR 20100031198A
- Authority
- KR
- South Korea
- Prior art keywords
- information
- domain
- search
- tree structure
- search results
- Prior art date
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/31—Indexing; Data structures therefor; Storage structures
- G06F16/316—Indexing structures
- G06F16/322—Trees
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/332—Query formulation
- G06F16/3325—Reformulation based on results of preceding query
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/951—Indexing; Web crawling techniques
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Databases & Information Systems (AREA)
- Data Mining & Analysis (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Computational Linguistics (AREA)
Abstract
The present invention relates to an apparatus and method for generating a white list of a search engine, and more particularly, comprising: a domain information collecting unit for recommending domains for each subject on a plurality of subjects from a plurality of clients through the Internet; And generating a tree structure using a plurality of topics and one or more domain information belonging to each topic, wherein the nodes of the tree structure may include one domain information and recommendation information for the corresponding domain, and include the domain information. Is a sub-node of the subject to which it belongs, and between the sub-nodes belonging to the same topic, the tree structure generation unit to determine the layer according to the recommendation information; characterized in that, according to the present invention the search engine is a high search It is possible to maintain the quality of the results, and netizens are provided with incentives to author high quality content.
Description
The present invention relates to a method for constructing and updating information based on criteria and basis used by a search engine to generate search results of quality content.
Since the advent of the so-called Web 2.0 era, much of the content generated on the Internet has been authored directly by netizens, including a mix of end-user experience and expert knowledge. If such high-quality content can be selected and returned as a search result, there can be no doubt that it will be a quantum leap in comparison with the prior art, which simply returns random web pages and contents related to the search word.
In particular, the number of web pages present on the Internet is already exploding to the extent that it is difficult for search engines to index, and indexing without obscuring any web pages is not worth the information or even low-value web pages. By including them as targets, the results may be contrary to the improvement of the quality of the search results, and the concept of returning to the search results in consideration of sites having "quality content" in the first place can lead to a revolutionary paradigm shift. will be.
However, without adopting the rhetoric of Web 2.0, the adoption of this methodology has provided incentives for netizens working on different sites to author quality content, and how high quality a site can be on a particular topic. It is necessary to prepare a method or a measure to determine whether the content is held.
The present invention has been made to solve the above problems of the prior art, and to provide an apparatus and method for generating a white list of a search engine for encouraging the improvement of the search result quality of the search engine and the active authoring of content by netizens. do.
In order to achieve the above object, the apparatus for generating a white list of a search engine of the present invention comprises: a domain information collecting unit for recommending domains for each subject on a plurality of subjects from a plurality of clients through the Internet; And
A tree structure is created by using a plurality of topics and one or more domain information belonging to each topic, and the nodes of the tree structure may include one domain information and recommendation information about the domain. And a tree structure generation unit that becomes a subordinate node of the subject to which it belongs, and determines the level according to the recommendation information among the subordinate nodes belonging to the same subject.
Meanwhile, in order to achieve the above object, the method of generating a whitelist of a search engine of the present invention collects information on correlations between respective topics and domains by receiving domain recommendations from a plurality of clients for each subject on a plurality of topics. Making;
The tree structure is created by using the collected information, and the nodes of the tree structure may include one domain information and recommendation information for the corresponding domain. Generating a tree structure between the lower nodes belonging to the same subject to determine the level according to the recommendation information; And
And receiving a query from a client, obtaining a search result by referring to a tree structure, and returning the search result.
According to the present invention as described above there is an effect that the domains included in the white list is motivated to have a higher quality content.
In particular, each netizen used in the domain can be actively engaged in the domain by giving the search engine the opportunity to easily expose their authoring activities to the unspecified number of users. In addition to acting as an incentive to do so, it can directly induce reconsideration as a speaker in the entire Internet space, leading to active netizen participation appropriate for the Web 2.0 era.
In addition, due to this synergy, domains have higher quality contents, and by using a white list including these domains, a search engine can maintain high quality search results.
Hereinafter, a configuration of an apparatus for generating a white list of a search engine according to the present invention will be described with reference to the accompanying drawings.
FIG. 1 is a reference diagram exemplarily illustrating a structure of a white list, and FIG. 2 is a functional block diagram showing a white list generating apparatus of a search engine according to the present invention.
The dictionary meaning of a whitelist is "a list of preferred items in the whole" and is used as the opposite concept of a blacklist, which usually means a list of unfavorable or rejected items.
Meanwhile, in the present invention, the white list is data of a tree structure in which each node has a specific domain address as a value, and the domain of the node located at the top of the tree structure is evaluated to include valuable information related to a specific search word or a specific topic. Say that.
FIG. 1 shows an example of a white list according to the present invention. Since the root node has no value, the root node may be represented by four list structures when the root node is removed. It will be collectively referred to as a tree structure.
On the other hand, in Figure 1, the root node is connected to four sub-nodes, each link from the left has a value of "news", "shopping", "blog", "hobby", respectively, by the "news" link Linked subnodes have a value of "next", subnodes linked by a "shopping" link have a value of "auction", subnodes linked by a link called "blog" have a value of "tistory", link "hobby" It can be seen that the child node connected by has no value.
In this case, the values "news", "shopping", "blog", and "hobby" that a link has means a subject to which a child node belongs, and "next", "auction", and "tistory" which each node has, respectively. Refers to the domain name. In the example of FIG. 1, a letter is shown for clarity, but the domain means a domain name registered in the International Network Information Center (InterNIC) or the Korea Internet Information Center (KRNIC). In the example, "www.daum.net" , "www.auction.co.kr" and "www.tistory.com".
On the other hand, "hobby" is divided into four sub-topics, "game," "picture", "car", and "programming", respectively, that is, subjects within a white list may have a hierarchical structure. The subnode linked by the third link in "Car" has a value of "baby dream (www.bobaedream.com)", while the "baby dream" node is again a child of "www.encar.com". As a node, the "enka" node is connected to a subnode called "Naver car" (car.naver.com).
The higher the fidelity is in the subject to which the node belongs to the domain belonging to the node above the white list. In the example of FIG. 1, in the subject “hobbies-cars”, the “baby dream” node is the highest node, the “enka” node is the next node, and the leaf “Naver car” is the lowest node. This means that in the "Hobby-Car" theme, the "baby dream" domain (www.bobaedream.com) is rated as having the best quality information about "Hobby-Car".
In other words, it is more likely that "baby dream" web pages will contain better quality information than the web pages of the leaf "car.naver.com".
Meanwhile, according to FIG. 2, the
First, as shown in FIG. 3, the domain information collecting unit receives recommendation from the netizens about which domains have the best content about the respective subjects. Netizens participate in the domain
Meanwhile, the tree
In this case, each node of the tree structure may include one domain information and recommendation information (information indicating how many recommendations have been received) for the corresponding domain. Meanwhile, as in the root node of FIG. 1, there may be a node having no subject information or a subject such as "car" or "hobby" that is not domain information.
On the other hand, a node including domain information and recommendation information about the domain becomes a sub node of a subject to which each belongs. In the example of FIG. 1, the topics are stored in links instead of being stored in nodes, but whether the topics are stored in links or in separate nodes, these topics each include nodes with subdomains whose domain belongs to them. .
In addition, each node in the generated white list is located in the white list according to recommendation information about domain information that it includes, that is, how many netizens recommend the corresponding domain. Preferably, as shown in the example of FIG. 1, a domain that has been highly recommended is determined as an upper node, and a domain that has not been recommended as a lower node is determined.
Meanwhile, the
That is, the
Meanwhile, as illustrated in FIG. 3, the tree
The updating of the tree structure may be based on the frequency of netizens clicking on a link and moving to the web page. Alternatively, the tree structure may be separately evaluated by the netizens and reflected.
On the other hand, using the white list to expose the domains belonging to the higher priority nodes as search results may be an incentive for each domain to secure quality information in addition to improving the quality of the search results.
Furthermore, the more active the netizens working in the domain, the more frequently the articles, articles, photos, videos, etc., written by the user are exposed to the unspecified number through the search engine, thus acting as an incentive.
As such, in order to encourage netizens who are active in each domain to create more active content, the
In general, web sites where content is created by members are generally operated as a member system, and each member uses an ID to identify himself / herself. The identification information is preferably an ID used in each domain. As the evaluation information, the activity amount for the ID can be used most easily, and other netizens' evaluation of separately exposed search results, or the frequency of the author's contents exposed as search results or the search results are accessed. In some cases, it may be used. In the example of FIG. 4, it can be confirmed that the ID of the author and the evaluation information of the corresponding creator are displayed for each search result through a star shape.
Hereinafter, a method of generating a white list of a search engine according to the present invention will be described with reference to FIG. 5. 5 is a flowchart illustrating a time series of a search method using a white list according to the present invention.
First, as shown in FIG. 3, domain topics are received from netizens for each subject for a plurality of subjects, and information on correlation between the subjects and domains is collected (S110).
Subsequently, a white list of a tree structure is generated using the collected information. In this case, the node of the white list may include one domain information and recommendation information for the corresponding domain. Meanwhile, the node including the domain information belongs to the subject to which it belongs (ie, becomes a subnode), and the layer is determined between nodes belonging to the same subject according to the recommendation information (S120).
Subsequently, when a query is received, a search result is obtained by referring to a white list and then returned (S130). As shown in FIG. 4, identification information for identifying an author for each search result and evaluation of the corresponding author are provided. Information can also be displayed together. Furthermore, the evaluation information of the authors may be updated by analyzing netizens' evaluation of the search results, the frequency of the contents created by the corresponding authors in the search results, or the frequency accessed by the search results.
Meanwhile, the whitelist is dynamically updated through the feedback on the search result (S140). As a result, the quality of the search results can be maintained at a high level.
Although the present invention has been described in detail with reference to several embodiments, the present invention is not limited to these embodiments and is not construed, but may be freely modified and implemented within the scope of the technical idea described in the claims.
1 is a reference diagram exemplarily illustrating a structure of a white list.
2 is a functional block diagram showing an apparatus for generating a white list of a search engine according to the present invention;
3 is a screen example illustrating a web page where a domain information collecting unit receives a domain recommendation for each subject by netizens.
4 is an exemplary screen illustrating an example of displaying identification information for identifying an author and convenience information on the corresponding author in a search result.
5 is a flowchart illustrating a time series of a search method using a white list according to the present invention.
<Explanation of symbols for the main parts of the drawings>
110: domain information collecting unit 120: tree structure generation unit
130: search processing unit
Claims (12)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020080090174A KR100987284B1 (en) | 2008-09-12 | 2008-09-12 | Apparatus and method of creating white-list of search-engine |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020080090174A KR100987284B1 (en) | 2008-09-12 | 2008-09-12 | Apparatus and method of creating white-list of search-engine |
Publications (2)
Publication Number | Publication Date |
---|---|
KR20100031198A true KR20100031198A (en) | 2010-03-22 |
KR100987284B1 KR100987284B1 (en) | 2010-10-12 |
Family
ID=42180917
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020080090174A KR100987284B1 (en) | 2008-09-12 | 2008-09-12 | Apparatus and method of creating white-list of search-engine |
Country Status (1)
Country | Link |
---|---|
KR (1) | KR100987284B1 (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR101448177B1 (en) * | 2013-04-17 | 2014-10-07 | 주식회사 다음커뮤니케이션 | A method for providing search result and server thereof |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR102001813B1 (en) | 2018-12-10 | 2019-07-18 | 한국남동발전 주식회사 | Apparatus and method for detecting abnormal behavior of nonstandard protocol payload using deep neural network algorithm |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR100403714B1 (en) * | 2000-06-10 | 2003-11-01 | 씨씨알 주식회사 | System and method for facilitating internet search by providing web document layout image and web site structure |
KR100509276B1 (en) * | 2001-08-20 | 2005-08-22 | 엔에이치엔(주) | Method for searching web page on popularity of visiting web pages and apparatus thereof |
-
2008
- 2008-09-12 KR KR1020080090174A patent/KR100987284B1/en active IP Right Grant
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR101448177B1 (en) * | 2013-04-17 | 2014-10-07 | 주식회사 다음커뮤니케이션 | A method for providing search result and server thereof |
Also Published As
Publication number | Publication date |
---|---|
KR100987284B1 (en) | 2010-10-12 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Wallstam et al. | Evaluating the social impacts of events: In search of unified indicators for effective policymaking | |
Mariné-Roig | Measuring destination image through travel reviews in search engines | |
Dickison et al. | Multilayer social networks | |
Alcázar et al. | The effect of user-generated content on tourist behavior: the mediating role of destination image | |
Marine-Roig et al. | Perceived image specialisation in multiscalar tourism destinations | |
Goo et al. | Smart tourism technologies’ ambidexterity: balancing tourist’s worries and novelty seeking for travel satisfaction | |
Coghlan | Volunteer tourism as an emerging trend or an expansion of ecotourism? A look at potential clients' perceptions of volunteer tourism organisations | |
Ćulić et al. | Rookie tourism destinations—the effects of attractiveness factors on destination image and revisit intention with the satisfaction mediation effect | |
Villa et al. | Echo chamber detection and analysis: A topology-and content-based approach in the COVID-19 scenario | |
Kuosa | Different approaches of pattern management and strategic intelligence | |
Al-Taie et al. | Python for graph and network analysis | |
Lee et al. | Korean DMZ tourists' perceived similarity and shared beliefs in predicting place attachment and support for tourism development | |
JP2011513802A (en) | A recommendation system using social behavior analysis and vocabulary classification | |
Tang et al. | I should not recommend it to you even if you will like it: the ethics of recommender systems | |
Traag | Algorithms and dynamical models for communities and reputation in social networks | |
Sheth et al. | Semantic filtering for social data | |
Tao et al. | How collaborators make sense of tasks together: A comparative analysis of collaborative sensemaking behavior in collaborative information‐seeking tasks | |
Xia et al. | Spread of tweets in climate discussions: A case study of the 2019 Nobel Peace Prize announcement | |
Edwin et al. | The effects of tourism information quality in shaping tourists’ visiting interest | |
Kang et al. | How do cultural classes emerge from assimilation and distinction? An extension of the Cucker-Smale flocking model | |
Montoro-Pons et al. | Analyzing online search patterns of music festival tourists | |
KR100987284B1 (en) | Apparatus and method of creating white-list of search-engine | |
Cheng et al. | Identifying and recommending user-interested attributes with values | |
Calero Valdez | Human and algorithmic contributions to misinformation online-Identifying the culprit | |
Narayan et al. | Constructing transnational and virtual ethnic identities: A study of the discourse and networks of ethnic student organisations in the USA and UK |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
A201 | Request for examination | ||
E902 | Notification of reason for refusal | ||
E701 | Decision to grant or registration of patent right | ||
GRNT | Written decision to grant | ||
FPAY | Annual fee payment |
Payment date: 20131004 Year of fee payment: 4 |
|
FPAY | Annual fee payment |
Payment date: 20141006 Year of fee payment: 5 |
|
FPAY | Annual fee payment |
Payment date: 20150921 Year of fee payment: 6 |
|
FPAY | Annual fee payment |
Payment date: 20160928 Year of fee payment: 7 |
|
FPAY | Annual fee payment |
Payment date: 20190904 Year of fee payment: 10 |