WO2011074729A1 - Ontology-based instance identification system and method - Google Patents

Ontology-based instance identification system and method

Info

Publication number
WO2011074729A1
Authority
WO
WIPO (PCT)
Prior art keywords
instance
ontology
identified
identification
registered
Prior art date
Application number
PCT/KR2009/007593
Other languages
French (fr)
Korean (ko)
Inventor
김평
정한민
이승우
이미경
김재한
류범종
Original Assignee
한국과학기술정보연구원
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 한국과학기술정보연구원
Publication of WO2011074729A1

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00 Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/36 Creation of semantic tools, e.g. ontology or thesauri
    • G06F16/367 Ontology

Definitions

  • The present invention relates to an ontology-based instance identification system and method, and more particularly to the process of converting data of a relational database management system (RDBMS) into instances of a Web Ontology Language (OWL) ontology.
  • RDBMS relational database management system
  • OWL web ontology language
  • The present invention relates to an ontology-based instance identification system and method that assign unique identifiers (URIs) to data of a relational database management system so that relationships between instances can be expressed.
  • URIs unique identifiers
  • Semantic web technology, which addresses the above problems, uses ontologies to express the meanings of words and the associations between them, extends those semantic relations through inference, and exposes them through various services, thereby contributing to both information accessibility and user satisfaction.
  • When converting data of a relational database management system (RDBMS) into instances of a Web Ontology Language (OWL) ontology, the present invention generates ontology instances that carry identifiers. Its object is to provide an ontology-based instance identification system and method that improve the accessibility of information by finding accurate information or through semantic inference.
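  • The system comprises an ontology register that registers an ontology, consisting of classes, attributes, and relationship information between classes, in the ontology DB of the database unit; and a class-instance connector that, after the ontology has been registered, selects and connects the class to which an instance is assigned when an instance to be identified is entered.
  • It further comprises a reference and additional information obtainer that obtains the information necessary for instance identification, and a control unit that uses this information to identify the entered identification target instance among the instances of the ontology registered in the ontology DB.
  • It further comprises an instance identifier granter that assigns the identifier of the existing instance when the control unit identifies the target instance as the same as an existing instance, and assigns a unique identifier when the target instance is not identified.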
  • In the method, the ontology register of the ontology-based instance identification system registers an ontology consisting of classes, attributes, and relationship information between classes in the ontology DB of the database unit.
  • Once the ontology is registered, the class-instance connector receives an instance to be identified and selects and connects the class to be assigned to it; the reference and additional information obtainer acquires the information necessary to identify that instance; the control unit identifies the entered instance among the instances of the registered ontology; and, if the instance is not identified among them, the instance identifier granter assigns a unique identifier.
  • When converting data of a relational database management system (RDBMS) into instances of a Web Ontology Language (OWL) ontology, the system and method generate ontology instances that carry identifiers, which improves the accessibility of information by making accurate information easier to find and by enabling semantic reasoning.
  • FIG. 1 is a diagram illustrating an OntoURI system for automatically generating RDBMS data as an ontology instance given a URI by using a mapping rule of a DB schema and an ontology schema and an identification rule defined to identify an instance.
  • FIG. 2 is a diagram illustrating a data conversion process using an OntoURI system.
  • FIG. 3 is a diagram illustrating an ontology-based instance identification system according to the present invention.
  • FIG. 4 is a diagram illustrating interdependence between instances in an ontology-based instance identification system according to the present invention.
  • FIG. 5 is a flowchart illustrating an instance identification method by an ontology-based instance identification system according to the present invention.
  • Before the invention itself is described, the OntoURI system is briefly described. OntoURI is the higher-level system to which the invention belongs; it automatically generates URI-assigned ontology instances from RDBMS data.
  • As shown in FIG. 1, the OntoURI system comprises the following units:
    • a rule editing unit 110, which provides a user interface for creating DBMS-ontology mapping rules and transmits the generated rules to the server;
    • a server management unit 120, which configures the server environment for linking external services and resources;
    • a monitoring unit 130, which determines whether the DBMS data has changed;
    • a statistics unit 140, which provides statistical information on the work performed and the instances created;
    • a rule management unit 150, which manages the rules generated by the rule editing unit 110;
    • an instance identification unit 160, which performs instance identification by applying the instance identification rules;
    • a scheduling unit 170, which registers and runs schedules for data conversion jobs;
    • an instance generation unit 180, which creates instances using the mapping rules and the DB monitoring results;
    • a DB monitoring unit 190, which tracks changes in the DBMS to be converted;
    • an information model management unit 200, which stores and manages the ontology schema and instances.
  • The data conversion process performed by the OntoURI system consists of seven steps:
    • Step 1: through the server environment setup, configure the environment for linking the inference engine, the search engine, and other external services, and configure the JDBC environment for DBMS interworking.
    • Step 2: register the ontology schema and specify the URI prefix and format used to represent ontology instances.
    • Step 3: register the connection information of the target DB.
    • Step 4: create mapping rules with the mapping rule editor and register them with the URI server.
    • Step 5: create the authority data and identification rules required for instance identification.
    • Step 6: perform the data conversion job through the scheduler.
    • Step 7: deliver the final URI-assigned instances to the inference engine and the search engine through a web service.
  • In converting DB data into ontology instances, the present invention normalizes the data using authority data and identification rules and assigns and manages unique identifiers for the normalized data. The description therefore focuses on the above-described instance identification unit 600 and on steps 4 and 5 of the data conversion process.
  • The ontology-based instance identification system includes an ontology register 300, a class-instance connector 400, a control unit 500, a reference and additional information obtainer 600, an instance identifier granter 700, and a database unit 800.
  • the ontology register 300 registers an ontology consisting of classes, attributes, and relationship information between classes in the ontology DB 810 of the database unit 800.
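  • As a specific example, the meta information of a paper is related to the paper's publisher, author, and subject. For the paper class, the ontology register 300 therefore registers the publisher, author, and subject information, together with related information, that is referenced when assigning a unique identifier to a single paper.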
  • the class-instance connector 400 selects and connects a class to which an instance is assigned when an instance to be identified is input.
  • the reference and additional information obtainer 600 acquires information necessary for instance identification when the instance to be identified and the class of the instance are determined.
  • The control unit 500 uses the information obtained by the reference and additional information obtainer 600 to determine whether the input identification target instance exists among the ontologies registered in the ontology DB 810.
  • If the control unit 500 identifies the target instance as existing in the ontology DB 810, the instance identifier granter 700 gives it the same identifier as the instance identified in the ontology DB 810. If the target instance is not identified because it does not exist in the ontology DB 810, the granter assigns a unique identifier to the input identification target instance.
  • The authority data referenced by the reference and additional information obtainer 600 is used to obtain representative values of names, organizations, and publishers for the values that actually appear in the academic information metadata, as described in the table below.
  • For example, when the actual value of the instance to be identified is the full institution name "International Business Machines Corporation", the reference and additional information obtainer 600 obtains the corresponding representative value.
  • Using the information obtained by the reference and additional information obtainer 600, the control unit 500 judges the actual value "International Business Machines Corporation" and the representative value "IBM" to be the same value and identifies the entered instance accordingly.
  • Likewise, the control unit 500 recognizes the actual value "MS" as an abbreviation of "MicroSoft", a specific institution, and identifies the instance to be identified accordingly.
  • When a representative value corresponding to the actual value exists, the actual value is treated as an instance with the same identifier as the one already assigned; when no representative value corresponds to the actual value, the actual value itself is identified as an instance.
  • The rule is applied against the existing instances to which identifiers have been assigned, and if the input is determined to be a new instance, a new identifier is given to that ontology instance.
  • An academic paper ontology is composed of classes including paper, institution, topic, author, source, and location information.
  • Instances are identified in order, starting from instances of classes with low interdependence between classes, as shown in the figure.
  • FIG. 4 is a diagram illustrating interdependence between classes in an ontology-based instance identification system according to the present invention.
  • Only authority data is used for geographical name identification.
  • a unique identifier assigned to the city is selected by referring to the city name and the country name.
  • The present invention focuses on author name identification. Various features are used to identify an author's name, such as the papers written by the author, the author's institution, the author's e-mail address, the subjects of the author's papers, and the author's co-author information.
  • In an identification rule, the class is the class to be identified.
  • The features are the attributes used for identification.
  • The classification describes how the features are applied to the instance.
  • The match is either an exact value match or the number of values matched.
  • The order is the relation order.
  • The source is the attribute or source from which values can be obtained.
  • The weight is the importance of each attribute.
  • Within the classification, the application order is the instance identification order, which gives priority to instances of classes with low dependence. The reference value defines a candidate group to which the same URI can be assigned when the values of the specified features are the same.
  • Also within the classification, the threshold is the criterion for giving an instance the same URI as an existing instance: the same URI is given when the final weight accumulated over the instance's attributes exceeds the threshold value.
  • Within the match, an exact match is the case where values match exactly, and the number is the number of values that match exactly.
  • Within the order, single is the case where only one attribute value exists, and multiple is the case where two or more attribute values exist.
  • When an administrator defines rules for identifying instances on the OntoURI server and assigns a weight to each rule, OntoURI applies the defined rules during instance identification and compares the sum of the final weights with the threshold. The same URI or a different URI is allocated depending on whether the threshold is exceeded.
  • The instance identification rules and the weight of each rule may be chosen in consideration of the attribute values of the corresponding domain and the importance of each attribute.
  • the URI mapping information given through the identifier of the existing DB and OntoURI is used.
  • the ontology register 300 of the ontology-based instance identification system registers an ontology, which consists of class, attribute, and relationship information between classes, in the ontology DB 810 of the database unit 800 (S100).
  • the class-instance connector 400 selects and connects a class to be assigned to the instance (S200).
  • the reference and additional information obtainer 600 performs a step of obtaining information necessary for instance identification (S300).
  • The control unit 500 performs the step of identifying the input identification target instance among the instances of the registered ontology (S400).
  • If the target instance is not identified among the instances of the registered ontology, the instance identifier granter 700 performs the step of assigning a unique identifier (S500). If the target instance is identified, the granter assigns the same identifier as that of the identified instance.
  • The present invention relates to an ontology-based instance identification system and method. When an instance to be identified is entered during the conversion of relational database management system (RDBMS) data into instances of a Web Ontology Language (OWL) ontology, the system checks whether that instance exists among the already registered ontology instances. If it is identified, it is assigned the same identifier as the identified instance; if not, it is assigned a new unique identifier. This makes it easier to find correct information and improves the accessibility of information through semantic reasoning.
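The weighted identification rules and threshold described in the list above can be illustrated with a minimal Python sketch. It is not the OntoURI implementation: the `Rule` class, the function names, the attribute names, the weights, and the sample records are all hypothetical and chosen only for illustration. Each rule contributes its weight when enough values match exactly, and an existing URI is reused only when the summed weight exceeds the threshold.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rule:
    """Hypothetical identification rule: one attribute, its weight, and the
    number of exactly matching values required for the rule to apply."""
    attribute: str
    weight: float
    min_matches: int = 1


def _values(record, attribute):
    """Return the attribute's values as a set (single or multiple values)."""
    value = record.get(attribute)
    if value is None:
        return set()
    if isinstance(value, (list, tuple, set, frozenset)):
        return set(value)
    return {value}


def final_weight(candidate, existing, rules):
    """Sum the weights of all rules whose attribute values match exactly."""
    total = 0.0
    for rule in rules:
        common = _values(candidate, rule.attribute) & _values(existing, rule.attribute)
        if len(common) >= rule.min_matches:
            total += rule.weight
    return total


def resolve_uri(candidate, existing_instances, rules, threshold):
    """Return the URI of the best existing instance whose final weight
    exceeds the threshold, or None if a new URI should be assigned."""
    best_uri, best_score = None, threshold
    for uri, existing in existing_instances.items():
        score = final_weight(candidate, existing, rules)
        if score > best_score:
            best_uri, best_score = uri, score
    return best_uri
```

In this sketch a return value of `None` signals that no existing instance exceeded the threshold, so the caller would assign a new identifier. Choosing the highest-scoring instance when several exceed the threshold is a design assumption of the sketch, not something the source specifies.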

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Animal Behavior & Ethology (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

Disclosed is an ontology-based instance identification system and method. The present invention advantageously improves data access, through either information refinement or semantic inference, by assigning a uniform resource identifier (URI) to relational database management system (RDBMS) data in the process of converting the RDBMS data into instances of a Web Ontology Language (OWL) ontology.

Description

Ontology-based Instance Identification System and Its Method
The present invention relates to an ontology-based instance identification system and method. More particularly, it relates to an ontology-based instance identification system and method that, in the process of converting data of a relational database management system (RDBMS) into instances of a Web Ontology Language (OWL) ontology, assign a unique identifier (URI: Uniform Resource Identifier) to the RDBMS data so that relationships between instances can be expressed.
Today, as the types and volume of information grow exponentially, the time researchers need to acquire and analyze the information they require is steadily increasing. Existing information services such as keyword search, however, fail to meet the demand for vertical search over specific domains or for more accurate search through semantic processing, and user satisfaction is gradually declining as a result.
Semantic web technology, which addresses these problems, uses ontologies to express the meanings of words and the associations between them, extends those semantic associations through inference, and exposes them through various services, thereby contributing to both information accessibility and user satisfaction.
However, given the current level of practical maturity of semantic web technology and the difficulty of expressing and managing knowledge as ontologies, it is not easy for semantic services to replace existing information services.
Because much existing data is stored and managed in RDBMSs, providing semantic services over that data requires a process of converting DBMS data into ontology instances.
In the prior art for converting RDBMS data into ontologies, research has been conducted on automatically generating an ontology from an RDBMS schema, or on converting data into ontology instances using mapping information between the DB schema and the ontology schema.
However, when existing RDBMS data has not been assigned an identifier (URI: Uniform Resource Identifier) or must be newly identified, it is difficult to convert it directly into ontology instances.
Ontology instances are assigned URIs, and relationships between instances are expressed through those URIs, so URIs must be newly assigned during data conversion. Nevertheless, the prior art did not address how to newly assign and manage URIs through instance identification during data conversion.
Accordingly, to solve the above problems, an object of the present invention is to provide an ontology-based instance identification system and method that, when converting data of a relational database management system (RDBMS) into instances of a Web Ontology Language (OWL) ontology, generate ontology instances that carry identifiers, so that the accessibility of information is improved by finding accurate information or through semantic inference.
To achieve this object, the present invention provides an ontology-based instance identification system comprising: an ontology register that registers an ontology, consisting of classes, attributes, and relationship information between classes, in the ontology DB of a database unit; a class-instance connector that, after the ontology has been registered by the ontology register, selects and connects the class to which an instance is assigned when an instance to be identified is input; a reference and additional information obtainer that obtains the information necessary for instance identification once the instance to be identified and its class have been determined; a control unit that uses the obtained information to identify the input identification target instance among the instances of the ontology registered in the ontology DB; and an instance identifier granter that assigns the identifier of the existing instance when the control unit identifies the target instance as the same as an existing instance, and assigns a unique identifier when the target instance is not identified.
The present invention also provides an ontology-based instance identification method comprising the steps of: registering, by the ontology register of the ontology-based instance identification system, an ontology consisting of classes, attributes, and relationship information between classes in the ontology DB of the database unit; once the ontology is registered, receiving, by the class-instance connector, an instance to be identified and selecting and connecting the class to be assigned to that instance; obtaining, by the reference and additional information obtainer, the information necessary to identify the instance; identifying, by the control unit, the input identification target instance among the instances of the registered ontology; and assigning, by the instance identifier granter, a unique identifier if the target instance is not identified among the instances of the registered ontology.
When converting data of a relational database management system (RDBMS) into instances of a Web Ontology Language (OWL) ontology, the ontology-based instance identification system and method according to the present invention generate ontology instances that carry identifiers, which improves the accessibility of information by making accurate information easier to find and by enabling semantic inference.
FIG. 1 is a diagram illustrating the OntoURI system, which automatically generates URI-assigned ontology instances from RDBMS data using mapping rules between the DB schema and the ontology schema and identification rules defined to identify instances.
FIG. 2 is a diagram illustrating the data conversion process performed by the OntoURI system.
FIG. 3 is a diagram illustrating an ontology-based instance identification system according to the present invention.
FIG. 4 is a diagram illustrating interdependence between instances in the ontology-based instance identification system according to the present invention.
FIG. 5 is a flowchart illustrating an instance identification method performed by the ontology-based instance identification system according to the present invention.
The terms and words used in this specification and the claims are not to be construed as limited to their ordinary or dictionary meanings. On the principle that an inventor may appropriately define the concept of a term in order to describe the invention in the best way, they should be interpreted with meanings and concepts consistent with the technical idea of the present invention.
Hereinafter, an ontology-based instance identification system and method according to the present invention will be described in detail with reference to the accompanying drawings.
First, before the ontology-based instance identification system and method according to the present invention are described, the OntoURI system is briefly described. OntoURI is the higher-level system to which the present invention belongs; it automatically generates URI-assigned ontology instances from RDBMS data.
FIG. 1 is a diagram illustrating the OntoURI system, which automatically generates URI-assigned ontology instances from RDBMS data using mapping rules between the DB schema and the ontology schema and identification rules defined to identify instances.
As shown in FIG. 1, the OntoURI system comprises: a rule editing unit 110, which provides a user interface for creating DBMS-ontology mapping rules and transmits the generated rules to the server; a server management unit 120, which configures the server environment for linking external services and resources; a monitoring unit 130, which determines whether the DBMS data has changed; a statistics unit 140, which provides statistical information on the work performed and the instances created; a rule management unit 150, which manages the rules generated by the rule editing unit 110; an instance identification unit 160, which performs instance identification by applying the instance identification rules; a scheduling unit 170, which registers and runs schedules for data conversion jobs; an instance generation unit 180, which creates instances using the mapping rules and the DB monitoring results; a DB monitoring unit 190, which tracks changes in the DBMS to be converted; and an information model management unit 200, which stores and manages the ontology schema and instances.
The data conversion process performed by the OntoURI system having these components is briefly described with reference to FIG. 2.
FIG. 2 is a diagram illustrating the data conversion process performed by the OntoURI system.
As shown in FIG. 2, the data conversion process performed by the OntoURI system consists broadly of seven steps.
Step 1 configures, through the server environment setup, the environment for linking the inference engine, the search engine, and other external services, as well as the JDBC environment for DBMS interworking. Step 2 registers the ontology schema and specifies the URI prefix and format used to represent ontology instances. Step 3 registers the connection information of the target DB. Step 4 creates mapping rules with the mapping rule editor and registers them with the URI server. Step 5 creates the authority data and identification rules required for instance identification. Step 6 performs the data conversion job through the scheduler. Step 7 delivers the final URI-assigned instances to the inference engine and the search engine through a web service.
Within the OntoURI system described above, the present invention normalizes data using authority data and identification rules while converting DB data into ontology instances, and assigns and manages unique identifiers for the normalized data. The description therefore focuses on the above-described instance identification unit 600 and on steps 4 and 5 of the data conversion process.
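Normalization with authority data can be sketched as a lookup from an actual value to its representative value. The sketch below is only an illustration under assumptions: the `AUTHORITY_DATA` table, the function names, and the case-insensitive matching are hypothetical and are not taken from the OntoURI system. The sample entries follow the document's own IBM and MS examples.

```python
# Hypothetical authority data: representative value -> known actual values.
AUTHORITY_DATA = {
    "IBM": ["International Business Machines Corporation", "IBM"],
    "MicroSoft": ["MS", "MicroSoft"],
}


def build_variant_index(authority):
    """Invert the authority data into a lookup keyed by the actual value.

    Keys are case-folded so matching ignores letter case (an assumption
    made for this sketch).
    """
    index = {}
    for representative, variants in authority.items():
        for variant in variants:
            index[variant.casefold()] = representative
    return index


def normalize(actual_value, index):
    """Return the representative value for an actual value.

    If the authority data has no entry, the trimmed actual value is
    returned unchanged so it can be identified as an instance itself.
    """
    cleaned = actual_value.strip()
    return index.get(cleaned.casefold(), cleaned)
```

With this index, both the full institution name and its abbreviation resolve to one representative value, which is what allows them to share a single identifier later in the process.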
도 3은 본 발명에 따른 온톨로지 기반 인스턴스 식별 시스템을 도시한 도면이다.3 is a diagram illustrating an ontology-based instance identification system according to the present invention.
도 3에 도시된 바와 같이 본 발명에 따른 온톨로지 기반 인스턴스 식별 시스템은 온톨로지 등록기(300), 클래스-인스턴스 연결기(400), 제어부(500), 참조 및 추가정보 획득기(600), 인스턴스 식별자 부여기(700) 및 데이터 베이스부(800)로 구성된다.As shown in FIG. 3, the ontology-based instance identification system according to the present invention includes an ontology register 300, a class-instance connector 400, a controller 500, a reference and additional information obtainer 600, and an instance identifier adder. 700 and the database unit 800.
온톨로지 등록기(300)는 클래스와 속성, 클래스간의 관계정보로 이루어진 온톨로지를 데이터 베이스부(800)의 온톨로지DB(810)에 저장시킴으로써 등록한다.The ontology register 300 registers an ontology consisting of classes, attributes, and relationship information between classes in the ontology DB 810 of the database unit 800.
As a specific example, for the paper class, the meta information of a paper is related to the paper's publisher, author, and subject. The ontology register (300) therefore registers the publisher, author, and subject information, together with the related information, that is referenced in order to assign a unique identifier to a paper.
After the ontology has been registered by the ontology register (300), the class-instance connector (400) selects and links the class to which an instance is assigned when an instance to be identified is input.
Once the instance to be identified and its class have been determined, the reference and additional information obtainer (600) obtains the information needed to identify the instance.
Using the information obtained by the reference and additional information obtainer (600), the control unit (500) determines whether the input instance to be identified already exists among the instances of the ontology registered in the ontology DB (810).
When the control unit (500) identifies the instance to be identified as already present in the ontology DB (810), the instance identifier assigning unit (700) assigns it the same identifier as the instance identified in the ontology DB (810). When the instance is not present in the ontology DB (810) and is therefore not identified, the unit assigns a unique identifier to the input instance.
For the ontology-based instance identification system configured as above, the following describes in more detail how instance identification is performed when academic paper DB data is converted into ontology instances.
The authority data referenced by the reference and additional information obtainer (600) is used, as shown in the table below, to obtain the representative value of a place name, institution name, or publisher from the names that actually appear in academic information metadata.
Table 1
Representative value | Actual value | Type | Class
IBM | International Business Machines Corporation | full name | institution
MicroSoft | MS | abbreviation | institution
ACADEMIC PRESS, INC. | Academic Press | expression | publication
To describe the authority data concretely using the table above: suppose the reference and additional information obtainer (600) obtains authority data stating that the actual value "International Business Machines Corporation", the full-name form of an institution name found in the input instance to be identified, is represented by the representative value "IBM". Using this information, the control unit (500) judges that the actual value "International Business Machines Corporation" and the representative value "IBM" are the same value, and performs identification on the input instance.
Likewise, the control unit (500) recognizes the actual value "MS" as an abbreviation of the institution "MicroSoft" and identifies the instance to be identified accordingly.
If a representative value corresponding to the actual value exists, the actual value is treated as an instance carrying the same identifier that was already assigned to the representative value. If no such representative value exists, the actual value is compared, by applying the instance identification rules, with existing instances that already have identifiers; if it is judged to be a new instance, it receives a new identifier and is converted into an ontology instance.
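The authority lookup described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the OntoURI implementation: the dictionary holds the three rows of Table 1, while the names `AUTHORITY`, `normalize`, `assign_identifier`, and the `inst-N` identifier format are hypothetical. When no authority entry exists, the sketch simply mints a new identifier and omits the rule-based comparison with existing instances.

```python
from itertools import count

# Authority data from Table 1: actual value -> representative value.
AUTHORITY = {
    "International Business Machines Corporation": "IBM",
    "MS": "MicroSoft",
    "Academic Press": "ACADEMIC PRESS, INC.",
}

_next_id = count(1)
registry = {}  # representative value -> identifier (hypothetical "inst-N" format)


def normalize(actual):
    """Return the representative value for an actual value, or the value itself."""
    return AUTHORITY.get(actual, actual)


def assign_identifier(actual):
    """Reuse the identifier of the representative value, or mint a new one.

    Assumption: the rule-based comparison that the text applies when no
    authority entry exists is left out of this sketch.
    """
    representative = normalize(actual)
    if representative not in registry:
        registry[representative] = f"inst-{next(_next_id)}"
    return registry[representative]
```

With this sketch, `assign_identifier("IBM")` and `assign_identifier("International Business Machines Corporation")` return the same identifier, while `assign_identifier("MS")` returns a different one.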
The academic paper ontology consists of classes covering papers, institutions, subjects, authors, sources, and regional information. As shown in FIG. 4, instances are identified in order, starting with instances of the classes that have the lowest interdependence with other classes.
FIG. 4 is a diagram illustrating the interdependence between classes in the ontology-based instance identification system according to the present invention.
In FIG. 4, dependencies between classes are drawn as solid lines, and information used for identification is drawn as dotted lines.
Only authority data is used to identify place names. When a particular city is identified, the city name and country name are consulted to select the unique identifier assigned to that city.
That is, when a unique identifier has been assigned to the representative value "서울" (Seoul), the actual value "SEOUL" is given, on the basis of the authority data, the unique identifier assigned to "서울".
The present invention focuses on author name identification, which uses a variety of features such as the papers the author has written, the author's affiliated institution, the author's e-mail address, the subjects of the author's papers, and the author's co-author information.
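One way to derive such an identification order is a topological sort over the class dependencies. The text does not say how the order is computed, and FIG. 4 is not reproduced here, so the dependency map below is purely hypothetical; only the class names (paper, institution, subject, person, source, location) follow the description.

```python
from graphlib import TopologicalSorter  # standard library, Python 3.9+

# Hypothetical dependencies: class -> classes that must be identified first.
# The real edges are those drawn as solid lines in FIG. 4.
DEPENDS_ON = {
    "location": set(),
    "subject": set(),
    "source": set(),
    "institution": {"location"},
    "person": {"institution"},
    "paper": {"person", "institution", "subject", "source"},
}


def identification_order(depends_on):
    """Return the classes with every class placed after the classes it depends on."""
    return list(TopologicalSorter(depends_on).static_order())


order = identification_order(DEPENDS_ON)
```

Under these assumed edges, `order` places `location` before `institution`, `institution` before `person`, and `paper` last.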
Table 2
Class | Feature | Division | Match | Cardinality | Source | Weight
person | | order | | | | 1
person | name | pivot | exact | single | OntoURI |
person | hasInstiution | feature | exact | multiple | OntoURI | 0.7
person | hasCoauthor | feature | exact | number | OntoURI | 0.3
person | | threshold | | | | 0.6
[Table 2] above shows part of the instance identification rules used to identify author names. The fields of an instance identification rule are as follows.
Class is the class to be identified; feature is the attribute used for identification; division describes how the feature is applied to an instance; match indicates whether values match or how many values matched; cardinality is the cardinality of the relation; source is where the attribute's value can be obtained; and weight is the importance of each attribute.
Within the division field, the application order (order) is the instance identification order, under which instances of classes with low dependence are identified first; the reference value (pivot) is the criterion for grouping instances into a candidate set that may be assigned the same URI when the value of the designated feature is identical; attribute (feature) marks an attribute of the instance; and the threshold (threshold) is the criterion under which the same URI as an existing instance is assigned when the final weight exceeds the threshold.
Within the match field, exact means that the values match exactly, and number is the count of values that match exactly.
Within the cardinality field, single means that only one attribute value exists, and multiple means that two or more attribute values exist.
When an administrator defines the rules for identifying instances on the OntoURI server and specifies a weight for each rule, OntoURI applies the rules defined for the instance during instance identification and then assigns either the same URI or a different URI, depending on whether the sum of the final weights exceeds the specified threshold.
The instance identification rules and the weight of each rule should be chosen in light of the attribute values of the domain in question and the importance of each attribute.
In addition, when information linked to an identifier in the existing DB is converted, the URI mapped to that DB identifier is used, based on the mapping between existing DB identifiers and the URIs assigned through OntoURI.
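The weighted rule evaluation can be sketched as below, using the weights (0.7 and 0.3) and the threshold (0.6) from Table 2. This is an illustrative sketch, not the OntoURI implementation. The `Person` type and function names are hypothetical, and the sketch assumes that a feature contributes its full weight as soon as the two instances share at least one exactly matching value; the text does not specify how the number of matches scales a weight.

```python
from dataclasses import dataclass, field


@dataclass
class Person:
    """Hypothetical author record; attribute names are illustrative only."""
    name: str
    institutions: set = field(default_factory=set)
    coauthors: set = field(default_factory=set)


# Weights and threshold taken from Table 2.
WEIGHTS = {"institutions": 0.7, "coauthors": 0.3}
THRESHOLD = 0.6


def match_score(candidate, existing):
    """Sum the weights of the features on which the two instances agree."""
    if candidate.name != existing.name:
        # The pivot (name) differs, so the pair is not even a candidate.
        return 0.0
    score = 0.0
    for attr, weight in WEIGHTS.items():
        # Assumption: one shared exact value is enough to earn the weight.
        if getattr(candidate, attr) & getattr(existing, attr):
            score += weight
    return score


def is_same_person(candidate, existing):
    """True when the summed weight exceeds the threshold (same URI is reused)."""
    return match_score(candidate, existing) > THRESHOLD
```

With these values, two records with the same name and a shared institution score 0.7 and are treated as the same author, whereas a shared co-author alone scores 0.3 and is not enough.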
The instance identification method performed by the ontology-based instance identification system having the components described above is explained below.
The ontology register (300) of the ontology-based instance identification system registers an ontology, consisting of classes, attributes, and relationship information between classes, in the ontology DB (810) of the database unit (800) (S100).
After the ontology has been registered by the ontology register (300), when an instance to be identified is input, the class-instance connector (400) selects and links the class to be assigned to the instance (S200).
When the class associated with the instance to be identified has been selected in step S200, the reference and additional information obtainer (600) obtains the information needed to identify the instance (S300).
When the class associated with the instance to be identified has been selected and the information needed for identification has been obtained in steps S200 and S300, the control unit (500) identifies the input instance among the instances of the registered ontology (S400).
If the control unit (500) does not identify the input instance in step S400, the instance identifier assigning unit (700) assigns it a unique identifier (S500). If the input instance is identified among the instances of the registered ontology, the unit assigns it the same identifier as the identified instance (S600).
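Steps S400 to S600 amount to a lookup followed by a branch. The sketch below illustrates that control flow only. The function name, the `registered` structure, the `is_same` predicate, and the `urn:example:` identifier format are all assumptions made for illustration, and `uuid.uuid4()` stands in for whatever scheme the system uses to generate unique identifiers.

```python
import uuid


def identify_instance(instance, cls, registered, is_same):
    """Return (identifier, is_new) for an instance of class `cls`.

    registered: dict mapping class name -> list of (instance, identifier).
    is_same:    predicate deciding whether two instances denote the same entity.
    """
    for existing, identifier in registered.get(cls, []):
        if is_same(instance, existing):       # S400: instance identified
            return identifier, False          # S600: reuse the same identifier
    # S500: not identified, so mint a unique identifier (hypothetical format).
    identifier = f"urn:example:{cls}:{uuid.uuid4()}"
    registered.setdefault(cls, []).append((instance, identifier))
    return identifier, True
```

The predicate keeps the flow independent of how matching is done, so an authority lookup or a weighted rule check could be plugged in as `is_same`.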
Although the present invention has been described in detail only with respect to the specific examples given in the embodiments, it will be apparent to those skilled in the art that various changes and modifications are possible within the technical scope of the present invention, and such changes and modifications naturally fall within the appended claims.
The present invention relates to an ontology-based instance identification system and method. In the process of converting data of a relational database management system (RDBMS) into instances of an OWL (Web Ontology Language) ontology, when an instance to be identified is input, the system determines whether that instance already exists among the instances of the registered ontology. If the instance is identified, it is given the same identifier as the identified instance; if not, it is given a new unique identifier. This makes it possible to find accurate information and to improve the accessibility of information through semantic inference.

Claims (8)

  1. An ontology-based instance identification system comprising: an ontology register (300) that registers an ontology, consisting of classes, attributes, and relationship information between classes, in the ontology DB (810) of a database unit (800);
    a class-instance connector (400) that, after the ontology has been registered by the ontology register (300), selects and links the class to which an instance is assigned when an instance to be identified is input;
    a reference and additional information obtainer (600) that obtains the information needed to identify the instance once the instance to be identified and the class of the instance have been determined;
    a control unit (500) that, using the information obtained by the reference and additional information obtainer (600), identifies the input instance to be identified among the instances of the ontology registered in the ontology DB (810); and
    an instance identifier assigning unit (700) that assigns the same identifier as the instance identified in the ontology DB (810) when the control unit (500) has identified the instance to be identified, and assigns a unique identifier when the instance has not been identified.
  2. The system of claim 1,
    wherein the reference and additional information obtainer (600)
    obtains authority data consisting of a representative value of an instance registered in a class of the ontology and an actual value whose expression differs from the representative value but whose meaning is the same.
  3. The system of claim 1,
    wherein the control unit (500) identifies the instances of the ontology registered in the ontology DB (810) in order, starting from the instances with the lowest interdependence.
  4. The system of claim 1,
    wherein the instance identifier assigning unit (700)
    applies weighted instance identification rules to identify the instance and assigns a unique identifier depending on whether the sum of the final weights exceeds a specified threshold.
  5. An ontology-based instance identification method comprising: (a) registering, by an ontology register (300) of an ontology-based instance identification system, an ontology consisting of classes, attributes, and relationship information between classes in the ontology DB (810) of a database unit (800);
    (b) once the ontology has been registered in step (a), receiving, by a class-instance connector (400), an input of an instance to be identified, and selecting and linking the class to be assigned to the instance;
    (c) obtaining, by a reference and additional information obtainer (600), the information needed to identify the instance to be identified;
    (d) after step (c), identifying, by a control unit (500), the input instance to be identified among the instances of the registered ontology; and
    (e) assigning, by an instance identifier assigning unit (700), a unique identifier when the instance to be identified is not identified among the instances of the ontology registered in step (d).
  6. The method of claim 5,
    wherein step (e) further comprises
    (e') assigning the same identifier as the identified instance when the instance to be identified is identified among the instances of the registered ontology.
  7. The method of claim 5,
    wherein, in step (d),
    the control unit (500)
    identifies the instances of the ontology registered in the ontology DB (810) in order, starting from the instances with the lowest interdependence.
  8. The method of claim 5,
    wherein, in step (e),
    the instance identifier assigning unit (700)
    applies weighted instance identification rules to identify the instance and assigns a unique identifier depending on whether the sum of the final weights exceeds a specified threshold.
PCT/KR2009/007593 2009-12-18 2009-12-18 Ontology-based instance identification system and method WO2011074729A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
KR1020090126605A KR101081870B1 (en) 2009-12-18 2009-12-18 Method and system for instance identification using ontology
KR10-2009-0126605 2009-12-18

Publications (1)

Publication Number Publication Date
WO2011074729A1 true WO2011074729A1 (en) 2011-06-23

Family

ID=44167476

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/KR2009/007593 WO2011074729A1 (en) 2009-12-18 2009-12-18 Ontology-based instance identification system and method

Country Status (2)

Country Link
KR (1) KR101081870B1 (en)
WO (1) WO2011074729A1 (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR102343285B1 (en) * 2015-06-30 2021-12-28 한국전자통신연구원 Personalized service providing method and apparatus thereof
KR102142957B1 (en) * 2018-08-24 2020-08-12 주식회사 리스트 Machine-lead conversation service method and system

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20080078147A (en) * 2007-02-22 2008-08-27 한국전자통신연구원 The method and apparatus for extracting instance for constructing ontology
KR20090025054A (en) * 2007-09-05 2009-03-10 한국과학기술정보연구원 System and method for nistance registering based on history
KR20090100425A (en) * 2007-01-17 2009-09-23 인터내셔널 비지네스 머신즈 코포레이션 Querying data and an associated ontology in a database management system
KR20090112157A (en) * 2008-04-23 2009-10-28 재단법인서울대학교산학협력재단 Ranking processing method for semantic web resources

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20090100425A (en) * 2007-01-17 2009-09-23 인터내셔널 비지네스 머신즈 코포레이션 Querying data and an associated ontology in a database management system
KR20080078147A (en) * 2007-02-22 2008-08-27 한국전자통신연구원 The method and apparatus for extracting instance for constructing ontology
KR20090025054A (en) * 2007-09-05 2009-03-10 한국과학기술정보연구원 System and method for nistance registering based on history
KR20090112157A (en) * 2008-04-23 2009-10-28 재단법인서울대학교산학협력재단 Ranking processing method for semantic web resources

Also Published As

Publication number Publication date
KR20110069987A (en) 2011-06-24
KR101081870B1 (en) 2011-11-09

Similar Documents

Publication Publication Date Title
US5953716A (en) Querying heterogeneous data sources distributed over a network using context interchange
Lambrix et al. SAMBO—a system for aligning and merging biomedical ontologies
WO2018084682A1 (en) Method, apparatus and computer program for mapping medical data
Jeffery et al. Research information management: the CERIF approach
US20070143285A1 (en) System and method for matching schemas to ontologies
EP1018086A1 (en) Search system and method based on multiple ontologies
Alani et al. Managing reference: Ensuring referential integrity of ontologies for the semantic web
CN111143074B (en) Method and device for distributing translation files
CN112307052B (en) Data management method, service system, terminal and storage medium
WO2017131325A1 (en) System and method for verifying and correcting knowledge base
Craig et al. DREAM Principles and FAIR Metrics from the PORTAL-DOORS Project for the Semantic Web
WO2011074729A1 (en) Ontology-based instance identification system and method
WO2013008978A1 (en) Object identification result searching system and method
Afonin Ontology models for access control systems
Hsu et al. Comparing the use of research resource identifiers and natural language processing for citation of databases, software, and other digital artifacts
WO2013008979A1 (en) Entity-identifying device and method using multiple ontologies
Athenodorou et al. Open science in lattice gauge theory community
US20090077021A1 (en) System for Managing SameAs Relationships Between Ontology Instances and Method for the Same
Lambrix et al. A framework for aligning ontologies
US20010051976A1 (en) Facilitator having a distributed configuration, a dual cell apparatus used for the same, and an integrated cell apparatus used for the same
CN111404878B (en) Method and system for converting ipv4 address into ipv6 address
CN106227898A (en) A kind of Big Dipper moves wearable device data and administers platform and data administering method thereof
WO2018169168A1 (en) Electronic document management system and method for providing user-specific translation dictionaries
EP3511843A1 (en) Method and system for integrating data received from one or more data sources
WO2024043744A1 (en) Device and method for supporting annotation generation

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 09852338

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 09852338

Country of ref document: EP

Kind code of ref document: A1