KR101153534B1 - Method and system for automatically tagging web data and local data - Google Patents

Info

Publication number
KR101153534B1
Authority
KR
South Korea
Prior art keywords
tagging
list
search
data
program
Prior art date
Application number
KR1020050109311A
Other languages
Korean (ko)
Other versions
KR20070051569A (en)
Inventor
장준기
Original Assignee
엔에이치엔(주)
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 엔에이치엔(주)
Priority to KR1020050109311A
Publication of KR20070051569A
Application granted
Publication of KR101153534B1

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Library & Information Science (AREA)
  • Computational Linguistics (AREA)

Abstract

The present invention relates to a method and a system for automatically tagging data. In an information retrieval program that supports keyword search of web data and local data, a tagging list that supports a tagging search method is provided to the user in a partial region of the program. According to the present invention, the information retrieval program run on a user terminal for a search task supports the tagging search method and the keyword search method at the same time, so the user can search flexibly with whichever method the user prefers.

Keyword search, tagging search, search engine, information search program

Description

Automatic tagging method and data tagging system for web data and local data {METHOD AND SYSTEM FOR AUTOMATICALLY TAGGING WEB DATA AND LOCAL DATA}

FIG. 1 is a diagram schematically illustrating a network connection of an automatic data tagging system according to the present invention.

FIG. 2 is a block diagram showing an automatic data tagging system according to an embodiment of the present invention.

FIG. 3 is a diagram illustrating, as an example of automatic tagging and tagging list creation according to the present invention, a process in which a search engine creates an index table.

FIG. 4 is a diagram illustrating, as an example of automatic tagging and tagging list creation according to the present invention, a process in which the list creating means creates a tagging table.

FIG. 5 is a flowchart illustrating a method of automatically tagging data according to an embodiment of the present invention.

FIG. 6 is an internal block diagram of a general-purpose computer device that may be employed to perform the automatic data tagging method according to the present invention.

<Explanation of symbols for main parts of the drawings>

200: automatic data tagging system
210: list creating means
220: list extracting means
230: list control means
240: search control means
140: web database
145: local database
150: list database

The present invention relates to a method and a system for automatically tagging data, in which a tagging list that supports a tagging search method is provided to the user in a partial region of an information retrieval program that supports keyword search of web data and local data.

To manage large amounts of data systematically and to provide accurate, effective search services in response to a user's search request, searching by entering a search word has become common. Searching by search word is generally understood as searching with a search engine: predetermined search words and their corresponding search lists are registered in a database in advance, and the search list that matches the search word entered by the user is retrieved from the database and provided to the user.

In a search by search word, a predetermined advertiser registers the search list order in the database in advance. The advantages are that no user intervention is needed to create the search list order and that all data registered in the database can be searched.

On the other hand, to find the desired information with a search word, the user must remember a search word that leads to the relevant search list. In addition, because the results are limited to the search list order, it is not easy to grasp the content of the information as a whole.

Besides searching by search word, there is searching by tagging (TOP DOWN). In a tagging search, the user directly associates tags with each piece of data, and the information provided as a search result is expressed according to the frequency of those tags.

In other words, a tagging search builds the list provided as a search result from tags that the user registered manually, and it lets the user easily grasp the content of the information as a whole by varying how each tag is displayed in the list.

However, in a tagging search it is cumbersome for the user to register tags for each piece of data directly, and in practice only information that has an associated tag can be searched or accessed.

A search method that combines the two approaches would remove the premise that the user must remember a keyword in advance, even when searching all of the data, and would make the data as a whole easy to grasp, maximizing the ease of searching for the user.

Therefore, there is an urgent need for a new data retrieval method that makes searching more efficient by supporting the keyword search method and the tagging search method at the same time.

The present invention has been made to solve the above problems. An object of the present invention is to provide an automatic data tagging method and an automatic data tagging system in which the information search program run on a user terminal for a search task supports the tagging search method and the keyword search method at the same time, so that a search can be performed flexibly with the user's preferred search method.

Another object of the present invention is to provide an automatic data tagging method and an automatic data tagging system that organically integrate the existing keyword search method and tagging search method, so that the user can roughly grasp the overall content of the retrieved data without any manual tagging and can search with ease.

Another object of the present invention is to provide an automatic data tagging method and an automatic data tagging system capable of automatically performing a tagging process for data.

Another object of the present invention is to provide an automatic data tagging method and an automatic data tagging system that, unlike the existing keyword search method, enable an accurate search without requiring the user to remember the exact keyword to be searched.

To achieve the above objects, an automatic data tagging method according to the present invention includes: maintaining a list database that records one or more tagging lists; extracting a tagging list from the list database in conjunction with the running of an information search program on a user terminal; including the extracted tagging list in a partial region of the information retrieval program; identifying a keyword input into a search field of the information retrieval program or a selection input for the tagging list included in the information retrieval program; and providing, to the information retrieval program, a search result retrieved by a predetermined search engine in response to the identified keyword input or selection input. The tagging list is created from web data held by a predetermined web server or from local data held in the user terminal.

In addition, as a technical configuration of the present invention for achieving the above objects, an automatic data tagging system includes: a list database that records one or more tagging lists; list extracting means for extracting a tagging list from the list database in conjunction with the running of an information search program on a user terminal; list control means for including the extracted tagging list in a partial region of the information retrieval program; and search control means for identifying a keyword input into a search field of the information retrieval program or a selection input for the tagging list included in the information retrieval program, and for providing, to the information search program, a search result retrieved by a predetermined search engine in response to the identified keyword input or selection input.

Hereinafter, an automatic data tagging method and an automatic data tagging system will be described with reference to the accompanying drawings.

"Tagging", as used throughout this specification, may refer to a search method in which one or more index names are associated with each item of web data held by a web server to be searched, or of local data held by a user terminal, and some or all of those index names are presented to the user who wants to search. The user can thereby briefly grasp the overall content of the web data and local data and search more conveniently.

Here, the web data may refer to data held in a web server, and the index name may be a web index name representing web data. The local data may refer to data held by the user terminal, and the index name may be a local index name representing local data.

In a tagging search, index names are associated with each item of data in advance, the associated index names are listed in a tagging list and presented to the user, and the data associated with the index name that the user selects is then provided to the user. Unlike a keyword search, a tagging search lets the user who is given the tagging list grasp the details of the data to be searched from the index names in the list.

In particular, in the present embodiment, the information retrieval program run on the user terminal for a search task supports the tagging search method and the keyword search method at the same time, so the user can search freely with the preferred method. The information retrieval program may be any program that supports retrieval of web data and local data, for example a data search program (web browser) such as Internet Explorer or Navigator.

FIG. 1 is a diagram schematically illustrating a network connection of an automatic data tagging system according to the present invention.

The automatic data tagging system 100 of the present invention may be located either inside or outside the user terminal 110. In the present embodiment, for convenience of description, it is assumed to be included in the user terminal 110.

The automatic data tagging system 100 creates a tagging list in consideration of the index tables created by the search engines 130 and 135. An index table can record the index names (web index names, local index names) extracted from the web data and local data to be searched, the number of files of web data and local data that contain each index name, the total frequency with which each index name appears in the web data and local data, and the like.

That is, the automatic data tagging system 100 determines the notation method for each index name (web index name, local index name) in consideration of the number of files and the total frequency, and creates a tagging list in which each index name is displayed according to the determined notation method. For example, the automatic data tagging system 100 displays a local index name in the tagging list more prominently the more local data files contain it, or the more frequently it appears across all local data, so that the user can easily grasp the overall content of the search result. Here, grasping the overall content may mean that the user 120 understands the search result in general terms, such as which keywords it contains and which keywords are representative of it.

In particular, the automatic data tagging system 100 of the present invention supports, within a single information search program, both a keyword search method that searches web data and local data according to a keyword input by the user 120 and a tagging search method that provides a tagging list according to the search position of the user 120 in the information search program. Accordingly, the user 120 can, according to preference, search for data either by selecting an index name from the tagging list provided by the tagging search method or by entering a keyword with the keyword search method.

In addition, the automatic data tagging system 100 may receive from the user 120 a service area to be searched by the search engines 130 and 135, and may search for data related to that service area. Here, the service area may be a blog, mail, scrap, bookmark, or the like associated with the user 120. Using the result (tag list) obtained by searching only the specific area designated by the user 120, the automatic data tagging system 100 may provide the user 120 with a predetermined connection service.

For example, suppose the user 120 designates, as the service area, a 'personal blog' of the user 120 built on the web. When the keyword 'photo' or the category 'news article' is then received from the user 120, the automatic data tagging system 100 may search, among the web data maintained in the web database 140 for the designated 'personal blog', for the web data related to the keyword 'photo' or to the category 'news article'. In addition, the automatic data tagging system 100 may provide a community service that connects the results common to the 'personal blogs', that is, one in which a plurality of users 120 create a common tag list.

The search engines 130 and 135 may be roughly classified into a web search engine 130 supporting a search operation on web data and a local search engine 135 supporting a search operation on local data.

First, for each web index name extracted from the web data, the web search engine 130 calculates the number of web data files from which the web index name was extracted and the total frequency obtained by summing the frequency of the index name within each web data file, and creates an index table from these values.

In addition, the web search engine 130 performs search operations on the web data: it retrieves from the web database 140 the web data that corresponds to a keyword input made in the information search program of the user terminal 110, or to a selection input for a web index name in the tagging list.

In addition, the web search engine 130 may provide web services to a user 120 who has a predetermined contractual relationship for the provision of those services. Examples of the web services provided by the web search engine 130 include a blog service, a webmail service, a scrap service, and a bookmark service. The web data generated in the course of providing each web service may be stored and maintained independently for each user 120 in the web database 140.

For example, when the web search engine 130 receives uploaded document data (web data) from a user 120 of the blog service, it can store the received document data in a predetermined storage area of the web database 140 that is individually assigned to that user 120. The web search engine 130 may likewise store, in the allocated storage area, the various web data generated in association with the user 120, separately for each service such as the blog service.

Similarly, for each local index name extracted from the local data, the local search engine 135 calculates the number of local data files from which the local index name was extracted and the total frequency obtained by summing the frequency of the index name within each local data file, and creates an index table. The local search engine 135 also performs search operations on the local data: it retrieves from the local database 145 the local data that corresponds to a keyword input made in the information search program of the user terminal 110, or to a selection input for a local index name in the tagging list.

When the information retrieval program is run, the user terminal 110 looks up a tagging list in the list database 150 and includes the retrieved tagging list in a partial area of the information retrieval program, exposing it to the user 120.

In particular, when the user 120 selects a specific index name in the tagging list, the user terminal 110 transmits a predetermined selection signal for that selection to the automatic data tagging system 100, and the web data and local data that the automatic data tagging system 100 retrieves in response to the selection signal may be displayed on the screen in a form that the user 120 can view.

In addition, the user terminal 110 may have a function for designating the location of the web documents to be searched. With this function, the web data (web documents) to be searched can be limited to a specific service area by an active decision of the user 120.

For example, when the user terminal 110 designates a webmail service as the search target, the automatic data tagging system 100 can identify the storage area in the web database 140 that is individually allocated to the user 120 of the user terminal 110, and search the web data (mail, messages, etc.) of the webmail service held in that storage area. The automatic data tagging system 100 may then provide the user 120 with a tagging list created by tagging only the retrieved web data.

In addition, the user terminal 110 accepts a keyword or category input in the information retrieval program so that the local data held by the user terminal 110 can be searched. In response to the input keyword or category, the automatic data tagging system 100 retrieves local data from the local database 145 and exposes the retrieved local data to the user 120 through the information retrieval program.

The user 120 may be an internet user who has a user terminal 110 and who searches web data and local data by freely choosing the keyword search method or the tagging search method, that is, by entering a keyword (or category) or selecting from the tagging list in the information search program of the user terminal 110.

The automatic data tagging system 100 includes a tagging list for the tagging search method in an information search program that supports the keyword search method, and thereby supports either the keyword search method or the tagging search method according to the user's choice.

In particular, the automatic data tagging system 100 generates and provides to the user 120 a tagging list in which index names that appear in many data files, or that appear frequently within the data, are displayed prominently. As a result, the user 120 can roughly grasp the content of the search result from the index names in the tagging list. Hereinafter, a detailed configuration of the automatic data tagging system 200 of the present invention will be described with reference to FIG. 2.

FIG. 2 is a block diagram showing an automatic data tagging system according to an embodiment of the present invention.

The automatic data tagging system 200 of the present invention includes a list creating means 210, a list extracting means 220, a list control means 230 and a search control means 240.

In addition, the automatic data tagging system 200 of the present invention may further include a web database 140 for storing web data held in a web server, a local database 145 for storing local data held in the user terminal 110, and a list database 150 for storing tagging lists created from that data.

The web database 140 may store web data that is open to the general public. It may also allocate an independent storage area to each individual user 120 and store, in the allocated storage area, the web data generated by that user 120 in a specific web service.

The local database 145 may store local data held in the user terminal 110 in correspondence with a predetermined keyword or category, and extract specific local data in response to a search request according to a keyword or category input.

The list database 150 records one or more tagging lists. When the information retrieval program is run on the user terminal 110 for a search task, a predetermined tagging list retrieved by the automatic data tagging system 200 can be included in a partial area of the information retrieval program. The list database 150 may be configured inside or outside the user terminal 110, in correspondence with the search engines 130 and 135.

In particular, the list database 150 may record each tagging list in association with predetermined category information, so that the automatic data tagging system 200 can selectively extract the tagging list that corresponds to the search category position of the running information retrieval program. For example, when the search category location (or the location of the searched directory) of the information search program is 'newspaper', the automatic data tagging system 200 extracts the tagging list whose category information corresponds to 'newspaper' and includes it in the information search program.

The tagging lists in the list database 150 may be created by the list creating means 210. The list creating means 210 creates a tagging list by determining the notation method for each index name in consideration of the index table created in advance by the search engines 130 and 135, more specifically the 'number of files' or 'total frequency' recorded in the index table for that index name.

The search engines 130 and 135 that create the index table extract index names from each item of collected web data and local data, calculate how often each extracted index name appears in each individual item of data, and record the results in the index table.

Here, an index name may be defined as a meaningful combination of characters contained in web data or local data. The search engines 130 and 135 may extract, as index names, one or more character combinations whose meaning can be identified in the collected web data or local data, according to a predetermined condition.

Thereafter, the search engines 130 and 135 obtain, for each index name, the number of data items from which it was extracted and its frequency within each item, and record this information in the index table keyed by the extracted index name. Accordingly, the index table records the index name, the name (file name) of each data item from which it was extracted, and the number of times the index name appears in that data item.

In addition, for each index name, the search engines 130 and 135 calculate the number of data files from which the index name was extracted and the total frequency obtained by summing its frequency within each data file. That is, the search engines 130 and 135 rearrange the information in the index table by index name, count the total number of web data or local data items associated with one index name, and count the total number of times that index name appears in the web data or local data.
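The counting described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the implementation of the search engines 130 and 135: the names `IndexEntry` and `build_index_table`, the whitespace tokenizer, and the fixed `vocabulary` of candidate index names are all hypothetical, and 'number of files' and 'total frequency' are treated as two separate counts.

```python
from dataclasses import dataclass, field


@dataclass
class IndexEntry:
    """One row of a hypothetical index table, keyed by index name."""
    index_name: str
    # file name -> number of times the index name occurs in that file
    per_file: dict = field(default_factory=dict)

    @property
    def file_count(self) -> int:
        # Number of files that contain the index name.
        return len(self.per_file)

    @property
    def total_frequency(self) -> int:
        # Sum of the per-file frequencies of the index name.
        return sum(self.per_file.values())


def build_index_table(documents: dict, vocabulary: list) -> dict:
    """Build {index_name: IndexEntry} from {file_name: text}."""
    table = {}
    for file_name, text in documents.items():
        words = text.lower().split()  # naive tokenizer (assumption)
        for name in vocabulary:
            count = words.count(name)
            if count:
                entry = table.setdefault(name, IndexEntry(name))
                entry.per_file[file_name] = count
    return table
```

For two files "a.doc" ("document about a document") and "b.doc" ("museum document"), the entry for 'document' would record two files and a total frequency of three.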

Thereafter, the list creating means 210 may assign a rank to each index name in consideration of the number of files and the total frequency in the index table, and create a tagging table in which the index names are arranged according to the assigned rank. That is, the list creating means 210 determines the position of each index name in the tagging list according to its rank (arrangement order) in the tagging table. Ranks are assigned in descending order: an index name with a larger number of files receives a higher rank, and among index names with the same number of files, the one with the higher total frequency receives the higher rank.
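The rank assignment described above amounts to a two-key descending sort. The sketch below assumes each index name arrives as a hypothetical `(index_name, file_count, total_frequency)` tuple; the function name `rank_index_names` is likewise an assumption made for illustration.

```python
def rank_index_names(entries):
    """Rank index names: more files first, ties broken by total frequency.

    entries: iterable of (index_name, file_count, total_frequency).
    Returns a list of (rank, index_name), with rank 1 as the highest.
    """
    ordered = sorted(entries, key=lambda e: (-e[1], -e[2]))
    return [
        (rank, name)
        for rank, (name, _files, _freq) in enumerate(ordered, start=1)
    ]
```

With the entries `("photo", 2, 9)`, `("document", 3, 4)` and `("museum", 2, 5)`, 'document' ranks first because it appears in the most files, and 'photo' precedes 'museum' because the two tie on file count and 'photo' has the higher total frequency.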

In addition, the list creating means 210 selects at least one index name to be included in the tagging list by referring to the rank of each index name for web data or local data in the tagging table.

In this case, the tagging list may cover only one of web data and local data or, in some embodiments, both web data and local data.

The list creating means 210 may, for example, select the index names within the top n ranks, where n is set according to the screen size (View Size) of the user terminal 110 that exposes the tagging list to the user 120. As another example, the operator of the present system can flexibly adjust the value of n so that the index names that the operator decides to expose to the user 120 are included in the tagging list.

In addition, when both web data and local data are included in the tagging list, the list creating means 210 may select the index names within the top n ranks from the web data and the local data combined.
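A sketch of the combined top-n selection follows. It assumes that web and local index names are ranked together by the same rule (number of files, then total frequency) and that each selected name keeps a label for its source; the function name `select_top_n` and the tuple layout are hypothetical.

```python
def select_top_n(web_entries, local_entries, n):
    """Pick the n highest-ranked index names across web and local data.

    Each entry is (index_name, file_count, total_frequency).
    Returns a list of (index_name, source), where source is "web" or "local".
    """
    combined = [(name, files, freq, "web") for name, files, freq in web_entries]
    combined += [(name, files, freq, "local") for name, files, freq in local_entries]
    # Same ordering rule as the tagging table: files first, then frequency.
    combined.sort(key=lambda e: (-e[1], -e[2]))
    return [(name, source) for name, _files, _freq, source in combined[:n]]
```

The value of `n` would be supplied by the caller, for example derived from the view size of the terminal or set by the operator.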

In addition, the list creating means 210 determines a notation method for each index name and creates the tagging list so that each index name is displayed according to its notation method. That is, the list creating means 210 determines the notation method for each index name in consideration of the calculated number of files and total frequency, in other words its position in the tagging table.

Here, the notation method concerns the display size or display density of each index name in the tagging list. The higher the assigned rank, the larger the list creating means 210 displays the index name in the tagging list, and the higher the assigned rank, the darker and more conspicuous the index name appears in the tagging list.

That is, the list creating means 210 displays index names that have a larger number of files and a higher total frequency more prominently (larger and darker) in the tagging list, so that the user 120 can clearly identify them.
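One way to turn a rank into a display size and density is a linear mapping, sketched below. The linear scale, the point sizes, the 0 to 200 gray range, and the function name `notation_for_rank` are all assumptions chosen for illustration; the specification only states that a higher rank is shown larger and darker.

```python
def notation_for_rank(rank, total, min_size=10, max_size=24):
    """Map a rank (1 = highest) to (font_size, gray_level).

    gray_level 0 is black (darkest); larger values are lighter.
    """
    if total <= 1:
        weight = 1.0  # a single index name gets full emphasis
    else:
        weight = (total - rank) / (total - 1)  # 1.0 for rank 1, 0.0 for last
    font_size = round(min_size + weight * (max_size - min_size))
    gray_level = round(200 * (1 - weight))
    return font_size, gray_level
```

With three index names and the default sizes, rank 1 maps to `(24, 0)`, rank 2 to `(17, 100)`, and rank 3 to `(10, 200)`.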

In addition, the list creating means 210 may create the tagging list by arranging the index names in a predetermined character order so that each index name in the tagging list is easily recognized by the user 120. For example, when the index names selected for the tagging list are written in Hangul, they can be arranged in the character order 'ㄱ, ㄴ, ㄷ, ...', and when they are written in the alphabet, in the character order 'a, b, c, ...'. As another embodiment, the index names may be arranged from the top of the tagging list in descending order of rank.
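The character-order arrangement can be approximated with a plain code-point sort, as sketched below. This relies on the fact that precomposed Hangul syllables are laid out in Unicode in 'ㄱ, ㄴ, ㄷ, ...' order by initial consonant; case-insensitive comparison for alphabetic names is an added assumption, and a production system would more likely use locale-aware collation. The function name `arrange_by_character_order` is hypothetical.

```python
def arrange_by_character_order(index_names):
    """Sort index names by character order, ignoring letter case.

    Precomposed Hangul syllables sort by code point in initial-consonant
    order, so the same call covers both writing systems in this sketch.
    """
    return sorted(index_names, key=str.casefold)
```

For example, `["다리", "가방", "나무"]` is arranged as `["가방", "나무", "다리"]`, and `["museum", "Document", "photo"]` as `["Document", "museum", "photo"]`.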

The created tagging list is recorded and maintained in a predetermined storage area of the list database 150.

In addition, when the web data that the user 120 wants tagged is limited to the web data of a specific service area, the list creating means 210 may have the web search engine 130 search the predetermined storage space of the web database 140 that corresponds to that service area, and create a tagging list for the specific area from the retrieved web data. For example, when the user 120 requests tagging of the web data related to a 'personal blog', the web search engine 130 may select and search only the web data stored independently in association with the 'personal blog', and the list creating means 210 may create a tagging list for the 'personal blog' from the retrieved web data.
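Restricting tagging to one service area can be sketched as a lookup into per-user, per-service storage followed by counting. The nested-dictionary layout `storage[user_id][service_area][file_name]`, the whitespace tokenizer, and the function name `file_counts_for_service` are assumptions for illustration only; the web database 140 is not described at this level of detail.

```python
from collections import Counter


def file_counts_for_service(storage, user_id, service_area):
    """Count, per word, how many files in one service area contain it.

    storage: {user_id: {service_area: {file_name: text}}} (assumed layout).
    Only the designated service area of the designated user is scanned.
    """
    documents = storage.get(user_id, {}).get(service_area, {})
    counts = Counter()
    for text in documents.values():
        # set() so that each file contributes at most once per word
        counts.update(set(text.lower().split()))
    return counts
```

Data stored under other service areas, such as the user's mail, never enters the counts, so the resulting tagging list reflects the designated area alone.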

When a search request is received from the user 120 while tagging lists are maintained in the list database 150, the list extracting means 220 extracts a tagging list from the list database 150 in conjunction with the running of the information search program on the user terminal 110 for the search task. As described above, the list extracting means 220 identifies the search category position of the running information search program and selectively extracts the tagging list whose category information corresponds to the identified position. Here, the search category location may mean the directory location being searched by the information search program.
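The category-based extraction can be pictured as a keyed lookup. In the sketch below the list database is modeled as a plain dictionary from category information to a tagging list; the `"default"` fallback entry and the function name `extract_tagging_list` are assumptions that the specification does not mention.

```python
def extract_tagging_list(list_db, search_category):
    """Return the tagging list recorded for the current search category.

    list_db: {category: [index_name, ...]} (assumed layout).
    Falls back to a hypothetical "default" entry, then to an empty list.
    """
    return list_db.get(search_category, list_db.get("default", []))
```

When the information search program is positioned at 'newspaper', the call `extract_tagging_list(list_db, "newspaper")` returns the list stored under that category.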

The list control means 230 includes the extracted tagging list in a partial region of the information retrieval program. That is, the list control means 230 may expose the tagging list, through the information search program of the user terminal 110, to the user 120 who generated the search request.

In addition, when the user 120 selects one index name from the exposed tagging list, the automatic data tagging system 200 provides web data and local data to the user 120 as a search result. That is, the tagging list that the list control means 230 includes in a partial region of the information retrieval program is exposed to the user 120 through the user terminal 110 running the program. The information retrieval program running on the user terminal 110 thus contains both a search field that supports the keyword search method and a tagging list that supports the tagging search method (see FIG. 3).

The search control means 240 provides the search results retrieved by the search engines 130 and 135 to the user 120 through the information search program. To this end, when the user 120 enters a keyword in the search field (the keyword input method) or selects an index name in the tagging list, the search engines 130 and 135 search the web data in the web database 140 or the local data in the local database 145.
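The two input paths handled by the search control means can be sketched as a small dispatcher. The event dictionary with a `"type"` field, the callables `web_search` and `local_search` standing in for the search engines 130 and 135, and the function name `handle_search_input` are all hypothetical.

```python
def handle_search_input(event, web_search, local_search):
    """Route a keyword input or a tag selection to both search engines.

    event: {"type": "keyword", "text": ...} or
           {"type": "tag_selected", "index_name": ...} (assumed shape).
    web_search / local_search: callables taking a query string.
    """
    if event["type"] == "keyword":
        query = event["text"]
    elif event["type"] == "tag_selected":
        query = event["index_name"]
    else:
        raise ValueError(f"unsupported input type: {event['type']}")
    # Either path ends in the same search over web data and local data.
    return {"web": web_search(query), "local": local_search(query)}
```

Because both branches resolve to a single query string, the same search field and tagging list can coexist in one program without separate search back ends.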

According to an embodiment, when the user 120 selects "museum" in the information retrieval program, the documents "temporary document.doc", "my document.doc", and "multi document.doc" may first be shown for that entry of the tagging list, and specific data may then be retrieved according to the selection of the user 120.

Therefore, according to the present invention, the tagging search method and the keyword search method are supported simultaneously in the information search program driven by the user terminal 110 for a search operation, so that the user can freely choose the preferred search method when performing a search.

As described above, the automatic data tagging system 200 of the present invention includes a tagging list in the information retrieval program driven by the user terminal 110 for a retrieval operation, so that the tagging search method is supported alongside the keyword search method based on keyword input. Accordingly, the automatic data tagging system 200 can offer both the fast search processing that is the advantage of the keyword search method and the relatively accurate search processing, without requiring the user to remember the exact keyword for the search target, that is the advantage of the tagging search method.

FIGS. 3 and 4 are diagrams illustrating an example of automatic tagging and tagging list creation according to the present invention. FIG. 3 shows the process by which the search engines 130 and 135 create an index table, and FIG. 4 shows the process by which the list creating means 210 creates a tagging table.

FIG. 3 illustrates how the automatic data tagging system 200 according to the present invention performs search processing, using both the keyword search method and the tagging search method, on web data held by a web server or local data held by the user terminal 110. In FIG. 3, the search engines 130 and 135 create an index table in association with an arbitrary index name 'document'.

That is, the search engines 130 and 135 selectively retrieve, from the web database 140 or the local database 145, the data corresponding to 'document' as the data to be searched (the tagging targets): 'temporary document.DOC', 'my document.DOC', and 'multi document.DOC'.

The search engines 130 and 135 extract index names from each of the retrieved data 'temporary document.DOC', 'my document.DOC', and 'multi document.DOC', calculate the frequency with which each index name appears in the corresponding data, and use this information to create an index table. In the index table of FIG. 3, for example, the index name 'museum' is extracted from the data 'temporary document.DOC', and the table records that 'museum' has a frequency of '10' in that data.

Thereafter, as shown in FIG. 4, the list generating unit 210 sums, for each extracted index name, the number of files containing that index name and its total frequency across those files, and creates a tagging table from the summed information. For example, suppose that in the index table of FIG. 3 the index name 'museum' appears 10 times in the local data 'temporary document.DOC', 50 times in the local data 'my document.DOC', and 100 times in the local data 'multi document.DOC'. In this case, the list generating means 210 records, in the tagging table of FIG. 4, the number of files for the index name 'museum' as 3 and the total frequency as 160 (10 + 50 + 100).
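
The aggregation from index table to tagging table described above can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the row layout `(index name, file name, frequency)`, the function name `build_tagging_table`, and the file names are hypothetical; only the frequencies 10, 50, and 100 come from the 'museum' example.

```python
from collections import defaultdict

# Hypothetical index table rows: (index name, file name, frequency in that file).
# The three 'museum' frequencies mirror the example; the file names are stand-ins.
INDEX_TABLE = [
    ("museum", "file_1.doc", 10),
    ("museum", "file_2.doc", 50),
    ("museum", "file_3.doc", 100),
    ("history", "file_1.doc", 7),
]


def build_tagging_table(index_rows):
    """Sum, per index name, the number of distinct files and the total frequency."""
    files = defaultdict(set)
    totals = defaultdict(int)
    for index_name, file_name, frequency in index_rows:
        files[index_name].add(file_name)
        totals[index_name] += frequency
    return {
        name: {"file_count": len(files[name]), "total_frequency": totals[name]}
        for name in files
    }


tagging_table = build_tagging_table(INDEX_TABLE)
print(tagging_table["museum"])  # {'file_count': 3, 'total_frequency': 160}
```

Counting distinct file names with a set keeps the file count correct even if the same file were to contribute more than one row for an index name.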

In addition, the list generating means 210 sorts the index names in consideration of the number of files and the total frequency in the tagging table, creates a tagging list in which the index names are arranged in the sorted order, and records the list in the list database 150.

In this case, the list generating means 210 may select the number of index names to be included in the tagging list in consideration of the View Size. In (i) of FIG. 4, the View Size is 5, and the index names ranked in the top 5 of the tagging table are selected. In (ii) of FIG. 4, the View Size is 10, and the index names ranked in the top 10 of the tagging table are selected.
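
The sorting and View Size selection can be sketched as below. The description only says that the number of files and the total frequency are "considered"; the specific ordering used here (more files first, ties broken by higher total frequency), the function name `select_index_names`, and the sample values are assumptions made for illustration.

```python
# Hypothetical tagging table: index name -> (number of files, total frequency).
TAGGING_TABLE = {
    "museum": (3, 160),
    "history": (2, 90),
    "art": (3, 120),
    "ticket": (1, 15),
}


def select_index_names(tagging_table, view_size):
    """Rank index names and keep only the top `view_size` of them."""
    ranked = sorted(
        tagging_table,
        # Assumed ordering: more files first, then higher total frequency.
        key=lambda name: tagging_table[name],
        reverse=True,
    )
    return ranked[:view_size]


print(select_index_names(TAGGING_TABLE, 3))  # ['museum', 'art', 'history']
```

When the View Size exceeds the number of available index names, the slice simply returns all of them.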

In addition, the list generating unit 210 takes into account the sort order of the index names in the tagging table, that is, the number of files and the total frequency, so that index names with a relatively high sort order are displayed larger and darker in the tagging list. Both (i) and (ii) of FIG. 4 illustrate a tagging list in which the index name 'museum', which has the highest sort order, is marked relatively larger and darker than the other index names.

In addition, the list creating means 210 may arrange the index names included in the tagging list according to character order. For example, as shown in (i) of FIG. 4, the list creating means 210 may create a tagging list arranged in the order "Abandonment, Reasoning, Accepted".
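
Arranging the selected index names in character order, while keeping each name's rank available for display, might look like the following sketch. The `(index name, rank)` pair layout and the function name are hypothetical, and plain case-insensitive string ordering is assumed; a locale-aware collation could be substituted if the index names require it.

```python
# Hypothetical (index name, rank) pairs already selected for the tagging list.
SELECTED = [("museum", 1), ("art", 2), ("history", 3)]


def arrange_by_character_order(entries):
    """Order entries by index name; the rank travels with each entry for display."""
    # str.casefold gives a case-insensitive code-point ordering (an assumption;
    # the description only requires "a predetermined character arrangement order").
    return sorted(entries, key=lambda entry: entry[0].casefold())


print(arrange_by_character_order(SELECTED))
# [('art', 2), ('history', 3), ('museum', 1)]
```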

Thereafter, the automatic data tagging system 200 provides the created tagging list to the user 120 through the information search program of the user terminal 110.

Accordingly, the user 120 provided with the tagging list can grasp the general content of the web data and local data to be searched through the tagging list. In addition, the automatic data tagging system 200 of the present invention displays prominently, in the tagging list, the index names that appear most frequently in the web data and local data to be searched, which naturally guides the user 120 toward selecting those index names.

Hereinafter, the workflow of the automatic data tagging system 200 according to an embodiment of the present invention will be described in detail.

FIG. 5 is a flowchart illustrating a method of automatically tagging data according to an embodiment of the present invention.

The automatic data tagging method of the present invention may be performed by the automatic data tagging system 200 described above.

First, the automatic data tagging system 200 maintains a list database 150 that records one or more tagging lists (S510). In this step (S510), the list creating means 210 creates a tagging list using the index names extracted from the web data and local data collected by the search engines 130 and 135 for the creation of the index table, together with the frequency with which each index name appears in the individual web data and local data, and records the created tagging list in the list database 150.

In particular, in this step (S510) the automatic data tagging system 200 determines a notation method for each index name, that is, the notation size or notation density of each index name, in consideration of the calculated number of files and total frequency, and creates a tagging list in which each index name is displayed according to the determined notation method. In this case, as the number of files containing an index name and the total frequency of that index name increase, the automatic data tagging system 200 increases the notation size of the index name proportionally and darkens its notation density proportionally. Accordingly, when the tagging list is later exposed to the user 120, index names with a large number of files and a high total frequency appear more prominently than the other index names, which can draw the user 120 toward selecting them and provide the user 120 with more accurate search results.
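
One way to turn the number of files and the total frequency into a notation size and notation density is sketched below. The weighting (the average of the two measures, each normalised by its maximum), the point-size range, the opacity range, and the function name `assign_notation` are all assumptions chosen for illustration; the description states only that size and density grow in proportion to these measures. The sketch assumes a non-empty table with positive counts.

```python
def assign_notation(tagging_table, min_pt=10.0, max_pt=30.0, min_opacity=0.3):
    """Map each index name to a font size (pt) and an opacity in [min_opacity, 1]."""
    max_files = max(files for files, _ in tagging_table.values())
    max_freq = max(freq for _, freq in tagging_table.values())
    notation = {}
    for name, (files, freq) in tagging_table.items():
        # Assumed weighting: average of the two normalised measures, in (0, 1].
        weight = (files / max_files + freq / max_freq) / 2
        notation[name] = {
            "size_pt": round(min_pt + (max_pt - min_pt) * weight, 1),
            "opacity": round(min_opacity + (1.0 - min_opacity) * weight, 2),
        }
    return notation


# Hypothetical tagging table: index name -> (number of files, total frequency).
TABLE = {"museum": (3, 160), "ticket": (1, 40)}
notation = assign_notation(TABLE)
print(notation["museum"])  # {'size_pt': 30.0, 'opacity': 1.0}
```

The index name with the largest file count and total frequency receives the maximum size and full opacity, and every other name is scaled down relative to it.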

In addition, in step S510 the automatic data tagging system 200 may limit the number of index names included in the tagging list in consideration of the View Size, which depends on the size of the tagging screen (the screen of the user terminal 110). For example, as shown in (i) of FIG. 4, when the View Size is 5, the five index names with the largest number of files and total frequency may be selected in order.

In addition, in step S510 the automatic data tagging system 200 may arrange the index names included in the tagging list in a predetermined character order, so that the user 120 can easily locate the desired index name among the multiple index names in the tagging list.

In addition, the data automatic tagging system 200 extracts a tagging list from the list database 150 in association with driving the information retrieval program in the user terminal 110 (S520). This step (S520) is a process of identifying a search category location of a driven information search program, for example, a web browser, and selectively extracting a tagging list of category information corresponding to the identified search category location. For this purpose, each tagging list is recorded on the list database 150 in association with predetermined category information.
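
Step S520 amounts to a lookup keyed by category information. A minimal sketch follows, assuming the list database can be modelled as a dictionary from a directory-style category location to a tagging list; the paths, the index names, and the function name `extract_tagging_list` are hypothetical.

```python
# Hypothetical list database: category information -> tagging list.
LIST_DATABASE = {
    "C:/documents": ["art", "history", "museum"],
    "C:/music": ["album", "concert"],
}


def extract_tagging_list(list_database, search_category_location):
    """Return the tagging list recorded for the given location, or an empty list."""
    return list_database.get(search_category_location, [])


print(extract_tagging_list(LIST_DATABASE, "C:/documents"))
# ['art', 'history', 'museum']
```

Returning an empty list for an unknown location is a design choice of this sketch; the description does not say what happens when no tagging list matches.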

Next, the automatic data tagging system 200 includes the extracted tagging list in a partial region of the information retrieval program (S530). This step (S530) exposes the extracted tagging list to the user 120 by providing it to the user terminal 110 that has launched the information retrieval program. Because the tagging list is included in the information retrieval program in this way, the program supports the tagging search method as well as the keyword search method.

Next, the automatic data tagging system 200 confirms a keyword input into the search field of the information retrieval program or a selection input for the tagging list of the information retrieval program (S540). This step (S540) flexibly supports the keyword search method or the tagging search method according to the user's preferred search method. For example, for a user 120 who prefers the tagging search method, the automatic data tagging system 200 may confirm that an index name has been selected from the index names of the exposed tagging list. For a user 120 who prefers the keyword search method, the automatic data tagging system 200 may confirm that a keyword has been input into the search field provided in the information search program.

Subsequently, the data automatic tagging system 200 provides a search result to the information retrieval program in response to the confirmed keyword input or selection input (S550). This step (S550) identifies the input keyword, or the index name selected by the user 120 from the tagging list, extracts the web data and local data corresponding to the keyword or the identified index name from the list database 150, and provides them to the user 120.

To this end, the web database 140 or the local database 145 records each index name (or keyword) in correspondence with the web data and local data associated with it. The data automatic tagging system 200 may search the list database 150 directly for the web data and local data corresponding to the input keyword or the selected index name, or may look up the file locations of the web data and local data in the list database 150, access the confirmed file locations, and provide the relevant web data and local data to the user 120.
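
Steps S540 and S550 can be sketched as a single dispatch: both kinds of input are reduced to one search term, which is then looked up in the stored correspondences. The event dictionaries, the two in-memory indexes standing in for the databases, the URL and paths, and the function names are all hypothetical.

```python
# Hypothetical stand-ins for the recorded correspondences: term -> file locations.
WEB_INDEX = {"museum": ["http://example.com/museum-guide.html"]}
LOCAL_INDEX = {
    "museum": ["C:/documents/file_1.doc"],
    "history": ["C:/documents/file_2.doc"],
}


def confirm_input(event):
    """S540: reduce a keyword input or a selection input to a single search term."""
    if event["kind"] == "keyword":
        return event["text"].strip()
    if event["kind"] == "selection":
        return event["index_name"]
    raise ValueError(f"unsupported input kind: {event['kind']}")


def provide_search_results(event, web_index, local_index):
    """S550: look up file locations for the confirmed term in both indexes."""
    term = confirm_input(event)
    return {"web": web_index.get(term, []), "local": local_index.get(term, [])}


typed = {"kind": "keyword", "text": " museum "}
clicked = {"kind": "selection", "index_name": "museum"}
print(provide_search_results(typed, WEB_INDEX, LOCAL_INDEX)
      == provide_search_results(clicked, WEB_INDEX, LOCAL_INDEX))  # True
```

Routing both input kinds through the same lookup reflects the point made in the text that the keyword search method and the tagging search method lead to the same kind of search result.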

Therefore, according to the present invention, the tagging search method and the keyword search method can be supported simultaneously in the information search program of a user terminal driven for a search job, so that a search can be performed flexibly with the user's preferred search method.

In addition, according to the present invention, the tagging process for data is performed automatically, which spares the user 120 the conventional effort of tagging each piece of data by hand. Moreover, the tagging list created by the automatic tagging process makes it easy for the user 120 to grasp the approximate content of the data to be retrieved.

In addition, as another embodiment of the present invention, when a predetermined storage environment change for data occurs, the data automatic tagging system 200 may automatically update the previously created tagging list.

That is, when any of the data held in association with the user 120 is created, changed, or deleted, the data automatic tagging system 200 may update the previously created tagging list. Therefore, even when the user 120 changes the local data held by the user 120 at any time, the tagging list is flexibly updated according to the changed storage environment, so that an optimal search environment and optimal search results can be provided.
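
The update on a storage environment change can be sketched as follows: a per-file index is adjusted for a creation, change, or deletion event, and the tagging table is recomputed from it. The event names, the per-file dictionary layout, the file names, and both function names are assumptions for illustration; the description does not specify how the change is detected or whether the update is incremental.

```python
def apply_change(index_by_file, event, file_name, frequencies=None):
    """Update the per-file index in place for a 'created', 'changed', or 'deleted' event."""
    if event in ("created", "changed"):
        index_by_file[file_name] = dict(frequencies or {})
    elif event == "deleted":
        index_by_file.pop(file_name, None)
    else:
        raise ValueError(f"unknown event: {event}")


def rebuild_tagging_table(index_by_file):
    """Recompute (number of files, total frequency) for every index name."""
    table = {}
    for frequencies in index_by_file.values():
        for name, freq in frequencies.items():
            files, total = table.get(name, (0, 0))
            table[name] = (files + 1, total + freq)
    return table


# Hypothetical per-file index: file name -> {index name: frequency}.
index_by_file = {
    "file_1.doc": {"museum": 10},
    "file_2.doc": {"museum": 50},
    "file_3.doc": {"museum": 100},
}
apply_change(index_by_file, "deleted", "file_3.doc")
print(rebuild_tagging_table(index_by_file))  # {'museum': (2, 60)}
```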

As another embodiment of the automatic data tagging method of the present invention, the following describes performing automatic tagging on data generated in connection with a service provided to the user 120.

The automatic data tagging system 200 may receive a service area related to the service from the user 120 as a range restriction on the web data to be searched. For example, if a blog service is designated as the service area by the user 120, the automatic data tagging system 200 may automatically search for web data in the storage area of the web database 140 allocated to the designated service area 'blog service'.

Thereafter, the automatic data tagging system 200 automatically performs a tagging process for the web data searched in relation to the service area 'blog service', and may create a tagging list through a process similar to the above-described tagging process.

In addition, the automatic data tagging system 200 may identify a plurality of tagging lists created for different users 120 in the same service area, select from each identified tagging list the top N index names with the highest number of files and total frequency, and compare them with each other. When the comparison identifies users whose tagging lists share the same N index names, the automatic data tagging system 200 may connect the identified users to one another.

As an example, suppose the automatic data tagging system 200 identifies a user A and a user B who have each performed a search operation on a service area providing a blog service, and compares the tagging lists created for the two users. If N is set to 3, the automatic data tagging system 200 selects from each tagging list the index names ranked in the top three by number of files and total frequency.

Under these conditions, assume that the automatic data tagging system 200 identifies the three index names 'movie', 'Star Wars', and 'lightsaber' as the top three by number of files and total frequency in user A's tagging list, and identifies the three index names 'Star Wars', 'lightsaber', and 'movie', in that order, in user B's tagging list.

In this case, the three identified index names are the same, so the automatic data tagging system 200 may determine that user A and user B share a similar interest within the blog service and may connect the two users. The connection may be made, for example, by exchanging contact information between the two parties or by inviting them to join a predetermined community.
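
The top-N comparison between two users can be sketched as below. The file counts and total frequencies are invented for the example, the ranking rule (more files first, then higher total frequency) is an assumption, and the function names are hypothetical. The comparison ignores order, matching the example in which user A and user B have the same three index names in different positions.

```python
def top_n_index_names(tagging_table, n):
    """Return the n index names ranked highest by (number of files, total frequency)."""
    ranked = sorted(tagging_table, key=lambda name: tagging_table[name], reverse=True)
    return ranked[:n]


def share_interests(table_a, table_b, n=3):
    """True when both users' top-n index names are the same, regardless of order."""
    return set(top_n_index_names(table_a, n)) == set(top_n_index_names(table_b, n))


# Hypothetical tagging tables: index name -> (number of files, total frequency).
USER_A = {"movie": (9, 300), "Star Wars": (7, 250), "lightsaber": (5, 120), "popcorn": (1, 4)}
USER_B = {"Star Wars": (8, 280), "lightsaber": (6, 200), "movie": (4, 90), "travel": (2, 30)}

print(top_n_index_names(USER_A, 3))     # ['movie', 'Star Wars', 'lightsaber']
print(top_n_index_names(USER_B, 3))     # ['Star Wars', 'lightsaber', 'movie']
print(share_interests(USER_A, USER_B))  # True
```

A looser criterion, such as requiring only a minimum overlap between the two top-N sets, would be a straightforward variation, but the description's example uses an exact match of all N names.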

Accordingly, according to the present invention, it is possible to construct a virtual network that shares interests between individuals by analyzing a plurality of tagging lists and interconnecting a plurality of users (individuals) having similar interests identified through them.

Embodiments of the present invention include computer readable media including program instructions for performing various computer implemented operations. The computer readable medium may include program instructions, data files, data structures, or the like, alone or in combination. The media may be those specially designed and constructed for the purposes of the present invention, or they may be of the kind well-known and available to those having skill in the computer software arts. Examples of computer-readable recording media include magnetic media such as hard disks, floppy disks, and magnetic tape; optical recording media such as CD-ROMs and DVDs; magneto-optical media such as floptical disks; and hardware devices specifically configured to store and execute program instructions, such as ROM, RAM, and flash memory. The medium may also be a transmission medium, such as an optical or metal wire or a waveguide, that carries a carrier wave transmitting a signal specifying program instructions, data structures, or the like. Examples of program instructions include not only machine code generated by a compiler, but also high-level language code that can be executed by a computer using an interpreter or the like.

FIG. 6 is an internal block diagram of a general purpose computer device that may be employed to perform the method for automatic data tagging in accordance with the present invention.

Computer device 600 includes one or more processors 610 coupled with a main memory device including random access memory (RAM) 620 and read only memory (ROM) 630. The processor 610 may also be called a central processing unit (CPU). As is well known in the art, the ROM 630 transfers data and instructions to the CPU unidirectionally, and the RAM 620 is typically used to transfer data and instructions bidirectionally. RAM 620 and ROM 630 may include any suitable form of computer readable media. Mass storage 640 is bidirectionally coupled to processor 610 to provide additional data storage capability, and may be any of the computer readable recording media described above. The mass storage device 640 is used to store programs, data, and the like, and is a secondary memory device, such as a hard disk, which is generally slower than the main memory device. Certain mass storage devices such as a CD-ROM 660 may also be used. The processor 610 is connected to one or more input/output interfaces 650, such as a video monitor, trackball, mouse, keyboard, microphone, touchscreen display, card reader, magnetic or paper tape reader, voice or handwriting reader, joystick, or other known computer input/output device. Finally, the processor 610 may be connected to a wired or wireless communication network through the network interface 670. Through this network connection, the procedure of the method described above can be performed. The apparatus and tools described above are well known to those skilled in the computer hardware and software arts.

The hardware device described above may be configured to operate as one or more software modules to perform the operations of the present invention.

Although specific embodiments of the present invention have been described so far, various modifications are possible without departing from the scope of the present invention. Therefore, the scope of the present invention should not be limited to the described embodiments, but should be determined by the claims and their equivalents.

As can be seen from the above description, according to the present invention, it is possible to provide an automatic data tagging method and an automatic data tagging system in which the information retrieval program of a user terminal driven for a retrieval operation supports the tagging search method and the keyword search method simultaneously, so that a search can be performed flexibly with the user's preferred search method.

In addition, according to the present invention, by organically integrating the existing keyword search method and the tagging search method, it is possible to provide an automatic data tagging method and an automatic data tagging system that let the user easily grasp an overview of the data to be retrieved without requiring manual tagging by the user, thereby supporting easy search operations.

In addition, according to the present invention, it is possible to provide a data automatic tagging method and a data automatic tagging system capable of automatically performing a tagging process for data.

Further, according to the present invention, it is possible to provide an automatic data tagging method and an automatic data tagging system that, unlike the existing keyword search method, enable accurate search operations without requiring the user to remember the exact keyword to be searched.

Claims (10)

1. (Deleted)

2. An automatic data tagging method, comprising: maintaining a list database that records one or more tagging lists; extracting a tagging list from the list database in association with driving an information retrieval program in a user terminal; including the extracted tagging list in a partial region of the information retrieval program; confirming a keyword input into a search field of the information retrieval program or a selection input for the tagging list included in the information retrieval program; and providing the information retrieval program with search results found by a search engine in response to the confirmed keyword input or selection input, wherein each of the tagging lists is recorded in the list database in association with category information, and wherein extracting a tagging list from the list database comprises identifying a search category location of the driven information retrieval program and selectively extracting, from the list database, a tagging list whose category information corresponds to the identified search category location.

3. An automatic data tagging method, comprising: maintaining a list database that records one or more tagging lists; extracting a tagging list from the list database in association with driving an information retrieval program in a user terminal; including the extracted tagging list in a partial region of the information retrieval program; confirming a keyword input into a search field of the information retrieval program or a selection input for the tagging list included in the information retrieval program; and providing the information retrieval program with search results found by a search engine in response to the confirmed keyword input or selection input, wherein maintaining the list database comprises: maintaining a web database that records a web index name and web data from which the web index name is extracted; and maintaining a local database that records a local index name and local data from which the local index name is extracted, and wherein providing the search results to the information retrieval program comprises: when the index name of the tagging list for which the selection input is made is a web index name, searching, by the search engine, the web database for the file location of the corresponding web data and providing the web data at the found file location to the information retrieval program as the search result; and when the index name of the tagging list for which the selection input is made is a local index name, searching the local database for the file location of the corresponding local data and providing the found local data to the information retrieval program as the search result.

4. An automatic data tagging method, comprising: maintaining a list database that records one or more tagging lists; extracting a tagging list from the list database in association with driving an information retrieval program in a user terminal; including the extracted tagging list in a partial region of the information retrieval program; confirming a keyword input into a search field of the information retrieval program or a selection input for the tagging list included in the information retrieval program; and providing the information retrieval program with search results found by a search engine in response to the confirmed keyword input or selection input, wherein maintaining the list database comprises: determining a notation method regarding a notation size or notation density for each index name included in the tagging list; and recording, in the list database, a tagging list in which the index names are marked according to the determined notation method, and wherein determining the notation method comprises: assigning a rank to each index name according to the number of files and the total frequency associated with the index name; and selecting index names within a set rank and designating a notation size or notation density for each of the selected index names.

5. The method of claim 4, wherein designating the notation size or notation density of each of the selected index names comprises: increasing the notation size of the corresponding index name in proportion to how high its assigned rank is; or darkening the notation density of the corresponding index name in proportion to how high its assigned rank is.

6. The method of claim 4, wherein the tagging list is created by arranging the index names in character order.

7. A computer-readable recording medium having recorded thereon a program for executing the method of any one of claims 2 to 6.

8. (Deleted)

9. An automatic data tagging system, comprising: a list database that records one or more tagging lists; list extracting means for extracting a tagging list from the list database in association with driving an information retrieval program in a user terminal; list control means for including the extracted tagging list in a partial region of the information retrieval program; search control means for confirming a keyword input into a search field of the information retrieval program or a selection input for the tagging list included in the information retrieval program, and for providing the information retrieval program with search results found by a search engine in response to the confirmed keyword input or selection input; and list creation means for creating a tagging list for web data held by a web server or local data held by the user terminal and recording the tagging list in the list database, wherein the list creation means determines a notation method for each index name included in the tagging list and records, in the list database, a tagging list in which the index names are marked according to the determined notation method.

10. An automatic data tagging system, comprising: a list database that records one or more tagging lists; list extracting means for extracting a tagging list from the list database in association with driving an information retrieval program in a user terminal; list control means for including the extracted tagging list in a partial region of the information retrieval program; and search control means for confirming a keyword input into a search field of the information retrieval program or a selection input for the tagging list included in the information retrieval program, and for providing the information retrieval program with search results found by a search engine in response to the confirmed keyword input or selection input, wherein the list database records each of the tagging lists in association with category information, and wherein the list extracting means identifies a search category location of the driven information retrieval program and selectively extracts, from the list database, a tagging list whose category information corresponds to the identified search category location.
KR1020050109311A 2005-11-15 2005-11-15 Method and system for automatically tagging web data and local data KR101153534B1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
KR1020050109311A KR101153534B1 (en) 2005-11-15 2005-11-15 Method and system for automatically tagging web data and local data

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
KR1020050109311A KR101153534B1 (en) 2005-11-15 2005-11-15 Method and system for automatically tagging web data and local data

Publications (2)

Publication Number Publication Date
KR20070051569A KR20070051569A (en) 2007-05-18
KR101153534B1 true KR101153534B1 (en) 2012-06-11

Family

ID=38274692

Family Applications (1)

Application Number Title Priority Date Filing Date
KR1020050109311A KR101153534B1 (en) 2005-11-15 2005-11-15 Method and system for automatically tagging web data and local data

Country Status (1)

Country Link
KR (1) KR101153534B1 (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100947367B1 (en) * 2008-03-17 2010-03-15 경기대학교 산학협력단 Method, device for tagging of data and computer readable record-medium on which program for executing method thereof
US9514123B2 (en) 2014-08-21 2016-12-06 Dropbox, Inc. Multi-user search system with methodology for instant indexing
US9384226B1 (en) 2015-01-30 2016-07-05 Dropbox, Inc. Personal content item searching system and method
US9183303B1 (en) 2015-01-30 2015-11-10 Dropbox, Inc. Personal content item searching system and method

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR19990055219A (en) * 1997-12-27 1999-07-15 윤덕용 HTML (TM) document storage and retrieval system
KR100487858B1 (en) * 2000-10-04 2005-05-27 (주)넥스트아이앤시 Customized intelligence information providing system and method thereof, and A saving device readable by computer

Also Published As

Publication number Publication date
KR20070051569A (en) 2007-05-18

Legal Events

Date Code Title Description
A201 Request for examination
E902 Notification of reason for refusal
E701 Decision to grant or registration of patent right
GRNT Written decision to grant
FPAY Annual fee payment

Payment date: 20160329

Year of fee payment: 5

FPAY Annual fee payment

Payment date: 20170328

Year of fee payment: 6