KR20060135173A

KR20060135173A - File management system

Info

Publication number: KR20060135173A
Application number: KR1020050054899A
Authority: KR
Inventors: 김영철
Original assignee: 동아시테크주식회사
Priority date: 2005-06-24
Filing date: 2005-06-24
Publication date: 2006-12-29

Abstract

A file management system is provided that lets each client share the files of other clients by searching those files and classifying them by subject. The system extracts files from clients connected over a network, classifies the extracted files by subject according to a classification rule, and makes them searchable and shareable by every client. A file search unit (201) is connected to at least one client (100) through the network, searches the files of each client, and indexes each found file according to at least one field. A database (203) stores the found files according to the index setting of each field. In response to a search request from a client, the file search unit provides a file list matching the request; when the client selects a desired file from the list, the file search unit retrieves that file from the database and provides it to the client. An index database (202) stores a unique ID for each found file according to the index setting.

Description

File management system

FIG. 1 is a block diagram of an example of a file management system according to the present invention;

FIG. 2 is a diagram conceptually explaining a method of indexing files that contain names;

FIG. 3 is a block diagram of another embodiment of the present invention;

FIG. 4 is a diagram showing an example of the main interface screen of the manager program executed on a client;

FIG. 5 is a diagram showing an example of a result screen for a thesaurus search;

FIG. 6 is a diagram showing an example of the results of an ontology search and the corresponding interface screen;

FIGS. 7 and 8 are diagrams showing examples of the interface screens for extended search and cluster search, respectively;

FIG. 9 is a diagram showing an example of an interface screen that displays the distribution of words contained in a file the searcher selected from among the search results; and

FIG. 10 shows an example of an interface screen on which each client connected to the file management system builds its own knowledge map using the installed manager program.

* Description of reference numerals for the main parts of the drawings *

100: client 200: file management system

201: file search unit 202: index database

203: database 204: search robot

The present invention relates to a file management system and, more particularly, to a file management system that lets networked clients search and use one another's files, so that the clients can share and search those files by field.

The many clients (for example, personal computers, laptops, and PDAs) found in universities and in national and corporate research institutes store, as files, the research results of the researchers who operate them. These files are kept in a particular folder (for example, the "My document" folder) under the researcher's control, and files generated by programs used in the electronic and mechanical fields are often stored in one of the folders belonging to the generating program. Likewise, in companies that deal mainly with paperwork, the files created on each employee's terminal are stored in a folder chosen by that employee. This way of managing files works well enough for each individual, but it makes it difficult to share the files an individual creates with others. A company or research institute can of course set up a file server and share each individual's files through it, but then only the files that an individual has registered on the file server can be shared. Researchers in particular often work together on a common research topic, and one researcher can obtain material (files) from another only by asking for it. The researcher making the request does not know what material the other researcher holds.

Accordingly, an object of the present invention is to provide a file management system in which each individual's client can search the files of other clients and the files found are classified by subject, so that individuals can share their files. Another object of the present invention is to provide a file management system that extracts files from each networked client, classifies the extracted files according to predetermined classification criteria, and lets each client search and share them.

According to the present invention, the above object is achieved by a file management system comprising: a file search unit that is network-connected to at least one client, searches the files provided on each client, and indexes each found file according to at least one field; and a file database that stores the found files according to the per-field index setting, wherein, in response to a search request from the client, the file search unit provides the client with a file list corresponding to the search request and, when the client selects a desired file from the file list, retrieves the corresponding file from the file database and provides it.

Preferably, the system further comprises an index database holding a unique identifier for each found file according to the index setting.

Preferably, when a search request is issued by the client, the file search unit refers to the index database to provide the client with a file list corresponding to the search request and, when the client selects a desired file from the file list, retrieves the corresponding file from the file database and provides it.

Preferably, the file search unit indexes each file based on the file's extension and the words contained in its file name.

Preferably, the file search unit extracts words from the file name through morphological analysis and indexes the file based on the extracted words.

Preferably, a manager program for accessing the classified files by field is installed on the client, and the files held in the file database are searched by field through the manager program.

Preferably, the file search unit includes: a file collection module that collects files from each client; a file search unit that indexes the files collected by the file collection module and, in response to a search request from the client, generates a file list of the matching files among the indexed files; and a search module that communicates with the manager program installed on each client, passes a search request made through the manager program to the file search unit, receives the file list corresponding to the search request from the file search unit, and provides it to the manager program.

Preferably, the system further comprises a search robot that periodically visits websites and extracts document files from them.

Preferably, the file database holds, by field, both the files stored on the clients and the files extracted from the websites.

According to the present invention, the above object is also achieved by a file search system comprising: at least one client; and a file server that is network-connected to the client and indexes and stores the files found on each client, wherein the client includes a file search module that performs keyword searches on other clients connected through the network and registers the files found in a preset tree-shaped directory, and wherein the files found by the file search module are registered on the file server.

Preferably, the file search module is installed on each client, and the tree-shaped directory structure can be set differently for each client.

Preferably, the file server further comprises a search robot that collects files from websites reached through the network.

Preferably, the file server includes: an index database holding index information for the files provided by each client and the files collected through the search robot; and a file database holding the files corresponding to the index information.

According to the present invention, the above object is also achieved by a file management system comprising: a file search unit that searches the files held on each of at least one client and classifies each found file by applying predetermined index criteria; an index database holding a unique identifier for each found file according to the index criteria; and a file database that stores each indexed and classified file, wherein, when a search request is issued by the client, the file search unit refers to the index database to provide the client with a file list corresponding to the search request and, when the client selects a desired file from the file list, retrieves the corresponding file from the file database and provides it.

Hereinafter, the present invention is described in detail with reference to the drawings.

FIG. 1 shows a block diagram of an example of a file management system according to the present invention.

The illustrated file management system includes a file search unit (201), an index database (202), a database (DB) (203), and a search robot (204). The file search unit (201) collects files stored on the storage medium (for example, a hard disk drive) of each client (for example, reference numeral "100"; omitted hereinafter). The file search unit (201) may collect files by either of the following methods.

1) The file search unit queries and collects the files of each client at a fixed interval (for example, every few minutes to every few hours).

2) A manager program for file management is installed on each client, and files are collected using the information on newly created files obtained from the installed manager program.

When the file search unit (201) collects files by method 1), it accesses each client at fixed times, searches its files, and collects those that have been newly created or modified (or deleted). The file search unit (201) then classifies and indexes the files collected from each client and provides the result to the index database (202). The collected files are classified by reference to the words contained in the file name (or contents) and to the file extension. The file search unit (201) may also extract words from the file name through morphological analysis and classify the file by the extracted words. For example, if the file is named "유비쿼터스란 무엇인가.pdf" ("What is ubiquitous.pdf"), morphological analysis of the file name extracts the noun "유비쿼터스" ("ubiquitous"), and the file is classified according to that noun. If the files are collected in a research institute, this file can be regarded as belonging to the ubiquitous technology field. When the file search unit (201) collects files by method 2), it receives information on newly created or changed files from the manager program installed on each client and collects the files accordingly. In this case, the file search unit (201) does not need to scan the files stored on each client's storage medium at fixed intervals.
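The file-name classification described above can be sketched roughly as follows. This is a minimal illustration only: the keyword table `FIELD_KEYWORDS`, the function names, and the prefix match that stands in for morphological analysis are all assumptions made for the example and are not taken from the specification.

```python
import re
from pathlib import PurePath

# Hypothetical keyword-to-field table; the specification does not list one.
FIELD_KEYWORDS = {
    "유비쿼터스": "ubiquitous",
    "ubiquitous": "ubiquitous",
}


def extract_terms(file_name: str) -> tuple[list[str], str]:
    """Return (candidate words from the file stem, lower-case extension)."""
    path = PurePath(file_name)
    # Stand-in for morphological analysis: split the stem on non-word characters.
    words = [w.lower() for w in re.split(r"[\W_]+", path.stem) if w]
    return words, path.suffix.lstrip(".").lower()


def classify(file_name: str) -> set[str]:
    """Return the fields whose keyword starts one of the words in the file name."""
    words, _extension = extract_terms(file_name)
    fields = set()
    for word in words:
        for keyword, field_name in FIELD_KEYWORDS.items():
            # A prefix match crudely approximates stripping a particle such as "란".
            if word.startswith(keyword):
                fields.add(field_name)
    return fields
```

A real implementation would replace the prefix match with a proper morphological analyzer; the sketch only shows where the extracted noun and the extension enter the classification step.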

The index database (202) stores index information for each file collected by the file search unit (201). Instead of the full information on each file, the index information contains a unique identifier that represents the file. This is described with reference to FIG. 2.

FIG. 2 is a diagram conceptually explaining a method of indexing files that contain names. As illustrated, when files containing names are to be classified by name, names with the surnames Kim (김), Kang (강), Gam (감), and Geum (금) can be classified under "ㄱ", and names with the surnames Na (나) and Noh (노) under "ㄴ". The surname Kim is classified and indexed under the "ㄱ" field, so when the file search unit (201) searches for a name with the surname Kim, it can find it immediately by searching only the "ㄱ" field. The same approach applies to surnames falling under ㄴ through ㅋ. With this kind of classification by indexing, the file search unit (201) sorts files containing personal names into the fields ㄱ through ㅋ and stores them in the index database (202), then looks up the relevant file through the index database (202) in response to a search request from a client, which improves the search speed of the file search unit (201). Each file classified under ㄱ through ㅋ has a unique identifier; the figure shows A1 to A3 as examples. A unique identifier (for example, A1 to A3) is a representative value corresponding to a file stored in the database (DB) (203), and the file search unit (201) uses it to access that file directly. Accordingly, when a search request for a name with the surname Kim is received from a client, the file search unit (201) searches the ㄱ field stored in the index database (202), finds the index information (for example, A1) of the file containing that name, accesses the file corresponding to the index information (A1) in the database (DB) (203), and provides it to the client. FIG. 2 uses personal names because they are easy to understand and explain, but files can also be indexed by various other criteria, such as technical terms of a particular field, file author, creation date, update date, file extension, and whether the file has been modified. FIG. 2 shows only one of these criteria, and the present invention is not limited to it.
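The lookup path of FIG. 2 (initial-consonant field, then unique identifier, then file) can be illustrated with a short sketch. The class and function names and the in-memory dictionaries are hypothetical stand-ins for the index database (202) and the database (203); only the Unicode arithmetic for Hangul syllables is standard.

```python
CHOSEONG = "ㄱㄲㄴㄷㄸㄹㅁㅂㅃㅅㅆㅇㅈㅉㅊㅋㅌㅍㅎ"  # the 19 initial consonants


def initial_consonant(name: str) -> str:
    """Return the initial consonant of the first Hangul syllable in name."""
    offset = ord(name[0]) - 0xAC00  # U+AC00 is the first precomposed syllable
    if not 0 <= offset < 11172:
        raise ValueError("name must start with a Hangul syllable")
    return CHOSEONG[offset // 588]  # 588 = 21 vowels * 28 final consonants


class IndexDatabase:
    """Hypothetical stand-in for the index database (202)."""

    def __init__(self):
        self._fields = {}  # field such as "ㄱ" -> {name: unique identifier}

    def add(self, name, file_id):
        self._fields.setdefault(initial_consonant(name), {})[name] = file_id

    def lookup(self, name):
        # Only the bucket for the name's initial consonant is searched.
        return self._fields.get(initial_consonant(name), {}).get(name)


# Hypothetical stand-in for the database (203): unique identifier -> stored file.
file_database = {"A1": "report-by-kim.doc", "A2": "report-by-na.doc"}

index = IndexDatabase()
index.add("김영철", "A1")
index.add("나영희", "A2")
```

A request for "김영철" then resolves as `file_database[index.lookup("김영철")]`, touching only the "ㄱ" field of the index before fetching the file by its identifier.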

The indexing method described with reference to FIG. 2 uses names contained in file names, but indexing can also be performed by technical field. For example, if one of a research institute's research subjects is the "mobile phone", files or file names containing technical terms such as "RF" (Radio Frequency) and "DSP" (Digital Signal Processor), both of which are mounted in a mobile phone, and "folder" can be matched to the search word "mobile phone" and provided. Here, a technical term may come from the file name, or it may be obtained from the text contained in the file. This is described in detail later.

FIG. 3 shows a block diagram of another embodiment of the present invention.

The illustrated file management system is similar to that of FIG. 1, except that the file search unit (201) and the index database (206) are embedded in each client. Each client is therefore connected to the file server (205) through the network, and the files that the clients search and share among themselves are provided to and stored on the file server (205). To this end, each client has a manager program installed that can search the files held by other clients and present the results to the client's user, and that can provide the files held by the client to the file server (205). The manager program lets each client's user build a search tree and obtain the desired search results through it. This is described with reference to FIGS. 4 and 5.

FIG. 4 shows an example of the main interface screen of the manager program executed on a client.

The illustrated manager program provides integrated search, thesaurus search, terminology search, FAQ search, ontology search, My search, and management menus. The menus are summarized below.

1) Integrated search is a menu that searches both the web and the files held by each client. It finds every document containing the search word that the searcher entered in the manager program and displays the list to the user. The results can be sorted by accuracy, registration date, source, and file format.

2) Thesaurus search is a menu that provides the searcher with results containing words associated with the search word. This is described with reference to FIG. 5. As shown in FIG. 5, when the searcher enters the search word "복막" (peritoneum) in the manager program and requests a file search, the results cover words containing "복막" as well as words associated with it (for example, abdomen, retroperitoneal space, serosa, abdominal membrane, and laparoscope). This search method is useful when the searcher does not know the exact file name or has only a general idea of the file being sought. To support it, although not shown in FIGS. 1 and 3, it is preferable that information on the associations between such words (for example, "peritoneum" and "serosa") be provided inside the file search unit (201), or that a database providing this association information to the file search unit (201) be further included.
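One way to picture the thesaurus search is as query expansion against a word-association table. The sketch below assumes a small hypothetical table `THESAURUS` with English terms and matches against file names only; the specification does not define the table's format or the matching rule.

```python
# Hypothetical association table; the specification only says such data is kept
# inside the file search unit or in a separate database.
THESAURUS = {
    "peritoneum": ["abdomen", "retroperitoneal space", "serosa", "laparoscope"],
}


def expand_query(term, thesaurus=THESAURUS):
    """Return the search word followed by its associated words."""
    return [term] + [t for t in thesaurus.get(term, []) if t != term]


def thesaurus_search(term, file_names):
    """Return the file names that contain the search word or an associated word."""
    terms = expand_query(term.lower())
    return [name for name in file_names if any(t in name.lower() for t in terms)]
```

With this table, a search for "peritoneum" also returns a file named after "abdomen", which is the behavior the thesaurus menu is meant to provide when the searcher does not know the exact file name.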

3) Terminology search displays dictionary entries for the search word entered through the manager program, together with glossary explanations of related vocabulary. FAQ (Frequently Asked Questions) search is a function that searches for internet documents or files in which questions about the search word have been asked and answered. To view FAQ documents from the internet, the manager program must search the internet documents or files collected by the search robot (204). The index information for those documents and files is stored in the index database (202), and the files and documents themselves are stored in the database (DB) (203), so the manager program obtains them from the database (DB) (203) through the file server (205).

4) Ontology search finds words that are semantically related to the search word the searcher entered through the manager program and provides the results to the searcher. For example, if the search word is "immunology", which relates to medicine, the file search unit (201) searches for files on medicine, immunology, and closely related subjects such as "serology".

The ontology search may be implemented by the file search unit (201) shown in FIGS. 1 and 3, by a separate search module connected to the file search unit (201), or by a database holding information on words related to the search word. Ontology search is described below with reference to FIG. 6.

FIG. 6 shows an example of the results of an ontology search and the corresponding interface screen. When the searcher selects "Ontology search" from the main menu and searches for the word "단백질" (protein), the screen displays the file containing the search word ("C-반응성 단백질", C-reactive protein) from among the files held by the clients, together with the relationships between the result file and similar files (for example, "Activation", "reactant", and "sub_class_of" in the figure), and lists them in a tree structure so that the searcher can find the desired item more easily.

FIGS. 7 and 8 show examples of the interface screens for extended search and cluster search, respectively.

FIG. 7 shows the extended search interface screen that is displayed when the searcher presses an extended search key (not shown) provided beside one of the files found for a search word. As illustrated, extended search extracts the words that make up the file the searcher selected and lets the searcher run a new search, thesaurus search, terminology search, or ontology search on the extracted words. In the figure, the words making up the selected file include "direct", "ubiquitous", "introduction", "existence", "Latin", "computing", and "online", and the searcher can select one of them and search again on it. If the searcher wants to know the meaning of one of these words, the searcher can select the "Terminology" tab to find it.

FIG. 8 shows the interface screen for clustering search. As illustrated, clustering search groups and displays categories that are semantically similar to the search results. In the figure, when the search word is "internet", the searcher is given files such as "유비쿼터스.doc" ("Ubiquitous.doc"), which is related to the internet, and "License.txt", which concerns an internet license, even though neither contains the word "internet".

FIG. 9 shows an example of an interface screen that displays the distribution of words contained in a file the searcher selected from among the search results.

As illustrated, the searcher can select one of the files found for a search word and determine the technology type of that file from the distribution of the words it contains. To this end, the file search unit (201) may extract the words of each collected file and calculate their distribution frequency by referring to a preset technology classification table. The technology classification table is a table of classification criteria, applied to collections of words such as an electronic dictionary, covering the internet, wireless communication, health, and other concepts commonly shared within a research institute or company. The search methods illustrated in FIGS. 7 to 9 may be implemented by installing program modules in the file search unit (201) that support thesaurus search, ontology search, terminology search, integrated search, and so on, or the file search unit (201) may implement them in conjunction with the file database (203), which stores the files collected from each client, and the index database (202) for the file database (203).
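The word-distribution step can be sketched as a frequency count against a classification table. The table contents and function names below are assumptions for illustration; the specification names the categories (internet, wireless communication, health) but not the vocabulary assigned to each.

```python
import re
from collections import Counter

# Hypothetical technology classification table: field -> vocabulary.
CLASSIFICATION_TABLE = {
    "internet": {"web", "browser", "online"},
    "wireless communication": {"rf", "antenna", "dsp"},
    "health": {"protein", "serum", "immunity"},
}


def field_distribution(text):
    """Count how many words of the text fall under each field of the table."""
    counts = Counter()
    for word in re.findall(r"\w+", text.lower()):
        for field_name, vocabulary in CLASSIFICATION_TABLE.items():
            if word in vocabulary:
                counts[field_name] += 1
    return counts


def dominant_field(text):
    """Return the field with the most matching words, or None if nothing matches."""
    counts = field_distribution(text)
    return counts.most_common(1)[0][0] if counts else None
```

The counts returned by `field_distribution` correspond to what the FIG. 9 screen would plot, and `dominant_field` shows one simple way to turn the distribution into a single technology type.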

As shown in FIG. 10, the manager program installed on each client lets the searcher build directories (②) for each area of interest or technical field. In the figure, the top-level directory is "Knowledge map", and below it are directories in the order "Health care", "Organization", "Technology", "Food and ingredients", and so on through "Chemicals and drugs". The searcher can create and delete directories freely, and each directory can either search for and register files according to a search expression set by the searcher, or have files registered that the searcher selects by hand. This directory structure makes it easy for searchers to find the files they want and lets them search selectively among only those files.
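A knowledge map of this kind can be modeled as a tree of directories, each optionally carrying a saved search expression. The sketch below is a hypothetical data structure, with a plain substring match standing in for the search expression; none of the names come from the specification.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Directory:
    """One node of a per-client knowledge map (hypothetical structure)."""
    name: str
    query: Optional[str] = None  # saved search expression, if any
    files: list = field(default_factory=list)
    children: list = field(default_factory=list)

    def add_child(self, name, query=None):
        child = Directory(name, query)
        self.children.append(child)
        return child

    def refresh(self, available_files):
        """Register files matching the saved query, here and in all subdirectories."""
        if self.query:
            for file_name in available_files:
                if self.query.lower() in file_name.lower() and file_name not in self.files:
                    self.files.append(file_name)
        for child in self.children:
            child.refresh(available_files)


root = Directory("Knowledge map")
health = root.add_child("Health care", query="protein")  # filled by a search expression
technology = root.add_child("Technology")
technology.files.append("License.txt")  # registered by hand
root.refresh(["C-reactive protein.pdf", "notes.txt"])
```

After `refresh`, the "Health care" directory holds the file matched by its saved expression while "Technology" keeps only the manually registered file, mirroring the two registration paths described above.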

As described above, the present invention enables file sharing among individuals by letting each individual's client search the files of other clients and by classifying the retrieved files by subject. In addition to searches that match a single search term, the present invention also searches for information related to the search term and provides it to the searcher, and can thereby supply the varied information the searcher needs.

Claims

A file management system comprising:

a file search unit that is network-connected to at least one client, searches the files held by the clients, and indexes the searched files according to at least one field; and

a file database that stores the searched files according to the index setting of each field,

wherein the file search unit, in response to a search request from the client, provides the client with a file list corresponding to the search request and, when the client selects a desired file from the file list, retrieves the corresponding file from the file database and provides it.

The file management system of claim 1,

further comprising an index database holding a unique identifier for each searched file according to the index setting.

The file management system of claim 2,

wherein the file search unit,

when a search request is made by the client, provides the client with a file list corresponding to the search request by referring to the index database and, when the client selects a desired file from the file list, retrieves the corresponding file from the file database and provides it.

The file management system of any one of claims 1 to 3,

wherein the file search unit

indexes each file based on the extension of the file and the words included in the file name.

The file management system of claim 4,

wherein the file search unit

extracts the words through morpheme analysis of the file name and indexes the file based on the extracted words.

The file management system of claim 1,

wherein the client

has installed on it a manager program for accessing the files classified by field, and

the files held in the file database are searched by field through the manager program.

The file management system of claim 6,

wherein the file search unit comprises:

a file collection module that collects files from each client;

a file retrieval unit that indexes the files collected by the file collection module and, in response to a search request from the client, generates a file list of the corresponding files among the indexed files; and

a search module that communicates with the manager program installed in each client, passes a search request made through the manager program to the file search unit, receives a file list corresponding to the search request from the file search unit, and provides it to the manager program.

The file management system of claim 1,

further comprising a search robot that periodically visits a website and extracts document files from the website.

The file management system of claim 8,

wherein the file database

holds, by field, the files stored in the clients and the files extracted from the website.

A file search system comprising:

at least one client; and

a file server that is network-connected to the client and indexes and stores the files retrieved from each client,

wherein the client comprises

a file search module that searches other clients connected through the network using a search term and registers the retrieved files in a predetermined tree-shaped directory, and the files retrieved by the file search module are registered in the file server.

The file search system of claim 10,

wherein the file search module

is installed in each of the clients and allows the tree-shaped directory structure to be set differently for each client.

The file search system of claim 10,

wherein the file server

comprises a search robot that collects files from websites connected through the network.

The file search system of claim 12,

wherein the file server comprises:

an index database containing index information for the files provided from the clients and the files collected by the search robot; and

a file database holding the files corresponding to the index information.

A file management system comprising:

a file search unit that searches the files held in each of at least one client and classifies the searched files by applying a provided index criterion to each of them;

an index database holding a unique identifier for each searched file according to the index criterion; and

a file database storing each classified and indexed file,

wherein, when a search request is made by the client, the file search unit provides the client with a file list corresponding to the search request by referring to the index database and, when the client selects a desired file from the file list, retrieves the corresponding file from the file database and provides it.