WO2011155736A3 - Method for dynamically generating additional terms for each meaning of every natural language expression; dictionary manager, documentation generator, term annotator, search system, and device for building document information system based on the method - Google Patents

Method for dynamically generating additional terms for each meaning of every natural language expression; dictionary manager, documentation generator, term annotator, search system, and device for building document information system based on the method Download PDF

Info

Publication number
WO2011155736A3
WO2011155736A3 PCT/KR2011/004113 KR2011004113W WO2011155736A3 WO 2011155736 A3 WO2011155736 A3 WO 2011155736A3 KR 2011004113 W KR2011004113 W KR 2011004113W WO 2011155736 A3 WO2011155736 A3 WO 2011155736A3
Authority
WO
WIPO (PCT)
Prior art keywords
meaning
term
expressions
task
natural language
Prior art date
Application number
PCT/KR2011/004113
Other languages
French (fr)
Korean (ko)
Other versions
WO2011155736A9 (en
WO2011155736A2 (en
Inventor
박동민
Original Assignee
Park Dong Min
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Park Dong Min filed Critical Park Dong Min
Publication of WO2011155736A2 publication Critical patent/WO2011155736A2/en
Publication of WO2011155736A3 publication Critical patent/WO2011155736A3/en
Publication of WO2011155736A9 publication Critical patent/WO2011155736A9/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/31Indexing; Data structures therefor; Storage structures
    • G06F16/313Selection or weighting of terms for indexing

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Software Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Machine Translation (AREA)

Abstract

The present invention relates to changing an information system comprising natural language expressions to an information system based on unit expressions of meaning, which is accompanied by functional changes for an information search system, term dictionary, documentation generator, and term converter. The accuracy of current search systems is very low. This is because natural language represents many meanings using few words. Due to the problem of expressions becoming longer and more difficult to recollect as the number of terms increases, people use a small number of terms in a repetitive manner. When unit expressions of meaning having 1 term corresponding to 1 meaning are introduced, the accuracy of a search system can approach 100%. The present invention presents a method for easily generating unit expressions of meaning, and a method for efficiently applying the generated unit expressions of meaning to documents from around the world. The method for creating unit expressions of meaning is a technique of breaking down each natural language term into the number of respective meanings thereof. Because this is a matter of a simple breakdown of terms, anyone can generate expressions. The task of applying generated terms to documents from around the world is formidable. For this task, according to the present invention, instead of changing each word that is repetitively used, alignment is performed for each word, and certain aligned word groups are simultaneously processed. Even if one word has been used several hundred billion times in documents throughout the world, there is no need to perform term conversions several hundred billion times. If the word in question has several meanings, the task of conversion can be performed simply by way of several sorting commands. Even if the repetitive use of terms does not impose a large load on term conversion, because the number of unit expressions of meaning itself is enormous, term conversion is not simple. The task of processing close to 10 billion unit expressions of meaning is daunting. A method for solving this difficulty is to equally distribute the task to a number of users. The greatest factor contributing to the ambiguity of natural language is the presence of innumerable proper nouns. These encroach on the domains of nouns, adjectives, verbs, and all other parts of speech, causing semantic confusion. While not limited to people's names, when considering proper nouns only in that context, there are over 10 billion terms in this category since the global population exceeds 6 billion. The present invention persentss a configuration in which this prodigious task is equally allotted to a countless number of users. Users having needs may perform tasks to fulfill their requirements and benefit from their work. If the users feel that term conversion is required, the users may perform term generation and term conversion tasks so that a state that is always satisfactory for users can be maintained. The present invention provides: 1) a unit expression of meaning dictionary manager that can easily generate unit expressions of meaning; and 2) a search annotator which is a means for categorizing words and converting (annotating) words belonging to a word group into unit expressions of meaning. The annotator operates as part of a search system. The alignment and search of words uses existing search system functions. Also the present invention provides 3) a unit expression of meaning converter (annotator) performing a function similar to the search annotator. The task of making a global information system based on unit expressions of meaning is an enormous endeavor. However, the problem of natural language being unclear in meaning presents a large obstacle for development in many fields. The present invention provides a basis for achieving considerable advances in the semantic web field, search system field, language translation field, and artificial intelligence field, by providing clear language thereto.
PCT/KR2011/004113 2010-06-07 2011-06-06 Method for dynamically generating additional terms for each meaning of every natural language expression; dictionary manager, document generator, term annotator, search system, and device for building a document information system based on the method WO2011155736A2 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
KR10-2010-0053571 2010-06-07
KR1020100053571A KR20110133909A (en) 2010-06-07 2010-06-07 Semantic dictionary manager, semantic text editor, semantic term annotator, semantic search engine and semantic information system builder based on the method defining semantic term instantly to identify the exact meanings of each word

Publications (3)

Publication Number Publication Date
WO2011155736A2 WO2011155736A2 (en) 2011-12-15
WO2011155736A3 true WO2011155736A3 (en) 2012-04-19
WO2011155736A9 WO2011155736A9 (en) 2012-06-21

Family

ID=45098509

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/KR2011/004113 WO2011155736A2 (en) 2010-06-07 2011-06-06 Method for dynamically generating additional terms for each meaning of every natural language expression; dictionary manager, document generator, term annotator, search system, and device for building a document information system based on the method

Country Status (2)

Country Link
KR (1) KR20110133909A (en)
WO (1) WO2011155736A2 (en)

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140188456A1 (en) * 2012-12-27 2014-07-03 Abbyy Development Llc Dictionary Markup System and Method
KR101602342B1 (en) * 2014-07-10 2016-03-11 네이버 주식회사 Method and system for providing information conforming to the intention of natural language query
KR101661198B1 (en) * 2014-07-10 2016-10-04 네이버 주식회사 Method and system for searching by using natural language query
US9946762B2 (en) 2014-09-16 2018-04-17 International Business Machines Corporation Building a domain knowledge and term identity using crowd sourcing
CN110059167A (en) * 2019-01-23 2019-07-26 艾肯特公司 Natural expression processing method, response method, equipment and the system of natural intelligence
CN111008594B (en) * 2019-12-04 2024-04-19 科大讯飞股份有限公司 Error-correction question review method, related device and readable storage medium
CN117709375B (en) * 2024-02-01 2024-05-24 成都帆点创想科技有限公司 Text translation method and device

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR19980029011A (en) * 1996-10-25 1998-07-15 구자홍 Vocabulary automatic classification device and method
KR20020036059A (en) * 2000-11-07 2002-05-16 옥철영 Method for disambiguating word-sense based on semantic informations extracted from definitions in dictionary
KR20070059881A (en) * 2005-12-06 2007-06-12 한국전자통신연구원 Method and apparatus for constructing database of semantic co-occurrence pattern

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR19980029011A (en) * 1996-10-25 1998-07-15 구자홍 Vocabulary automatic classification device and method
KR20020036059A (en) * 2000-11-07 2002-05-16 옥철영 Method for disambiguating word-sense based on semantic informations extracted from definitions in dictionary
KR20070059881A (en) * 2005-12-06 2007-06-12 한국전자통신연구원 Method and apparatus for constructing database of semantic co-occurrence pattern

Also Published As

Publication number Publication date
WO2011155736A9 (en) 2012-06-21
WO2011155736A2 (en) 2011-12-15
KR20110133909A (en) 2011-12-14

Similar Documents

Publication Publication Date Title
Weitz et al. " Do you trust me?" Increasing user-trust by integrating virtual agents in explainable AI interaction design
WO2011155736A3 (en) Method for dynamically generating additional terms for each meaning of every natural language expression; dictionary manager, documentation generator, term annotator, search system, and device for building document information system based on the method
Nguyen et al. NEU-chatbot: Chatbot for admission of National Economics University
Vertanen et al. The imagination of crowds: conversational AAC language modeling using crowdsourcing and large data sources
Chopra et al. Natural language processing
Swampillai et al. Extracting relations within and across sentences
JP6526470B2 (en) Pre-construction method of vocabulary semantic patterns for text analysis and response system
Speck et al. Named Entity Recognition using FOX.
Horsmann et al. Fast or Accurate?-A Comparative Evaluation of PoS Tagging Models.
Kishimoto et al. A knowledge-augmented neural network model for implicit discourse relation classification
Cangelosi et al. Speech and language in humanoid robots
Shaikh et al. Composing conversational negation
Chang et al. Illinoissl: A JAVA library for structured prediction
Xu et al. ChicHealth@ MEDIQA 2021: Exploring the limits of pre-trained seq2seq models for medical summarization
KR20060066588A (en) Natural language to ontology-based knowledge transformation system and method
Nakajima et al. Automatic extraction of future references from news using morphosemantic patterns with application to future trend prediction
Ríssola et al. Predicting Relevant Conversation Turns for Improved Retrieval in Multi-Turn Conversational Search.
Pakray et al. A statistics-based semantic textual entailment system
Radhika et al. Semantic role extraction and general concept understanding in malayalam using Paninian grammar
Sawant et al. AI Model to Generate SQL Queries from Natural Language Instructions through Voice
Biju Vocabulary-constrained Question Generation with Rare Word Masking and Dual Attention
Liu et al. Chinese event recognition via ensemble model
Lee et al. A two-step approach for efficient domain selection in multi-domain dialog systems
Das et al. Hybrid sandhi-splitter for malayalam using unicode
Altman Taming the dragon: Effective use of dragon naturally speaking speech recognition software as an avenue to universal access

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 11792647

Country of ref document: EP

Kind code of ref document: A2

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 11792647

Country of ref document: EP

Kind code of ref document: A2