WO2011155736A3 - Method for dynamically generating additional terms for each meaning of every natural language expression; dictionary manager, documentation generator, term annotator, search system, and device for building document information system based on the method - Google Patents
- Publication number
- WO2011155736A3 (application PCT/KR2011/004113)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- meaning
- term
- expressions
- task
- natural language
Classifications
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/31—Indexing; Data structures therefor; Storage structures
- G06F16/313—Selection or weighting of terms for indexing
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Software Systems (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Machine Translation (AREA)
Abstract
The present invention relates to converting an information system built on natural language expressions into an information system built on unit expressions of meaning, together with the corresponding functional changes to the information search system, term dictionary, documentation generator, and term converter. The accuracy of current search systems is very low because natural language represents many meanings using few words: as the number of terms increases, expressions become longer and harder to recall, so people reuse a small number of terms. When unit expressions of meaning are introduced, with one term corresponding to one meaning, the accuracy of a search system can approach 100%.
The present invention presents a method for easily generating unit expressions of meaning, and a method for efficiently applying the generated unit expressions of meaning to documents from around the world. Unit expressions of meaning are created by breaking each natural language term down into as many terms as it has meanings. Because this is a simple breakdown of terms, anyone can generate the expressions.
The task of applying the generated terms to documents from around the world is formidable. For this task, according to the present invention, instead of converting each repeated word individually, the occurrences of each word are aligned (sorted), and each aligned word group is processed at once. Even if one word has been used several hundred billion times in documents throughout the world, there is no need to perform term conversion several hundred billion times. If the word in question has several meanings, the conversion can be performed with just several sorting commands. Although the repeated use of terms does not impose a large load on term conversion, term conversion is still not simple, because the number of unit expressions of meaning is itself enormous.
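The breakdown of a term into one term per meaning can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the method claimed in the application: the `word#n` naming scheme, the `UnitExpression` structure, the `generate_unit_expressions` function, and the sample glosses for "bank" are all hypothetical and chosen only for the example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class UnitExpression:
    """One term that stands for exactly one meaning (hypothetical structure)."""
    term: str     # e.g. "bank#1"; the "#n" suffix is an assumed naming scheme
    word: str     # the original natural language word
    meaning: str  # a short gloss for this single meaning


def generate_unit_expressions(word, meanings):
    """Break one word into as many terms as it has meanings."""
    return [
        UnitExpression(term=f"{word}#{i}", word=word, meaning=gloss)
        for i, gloss in enumerate(meanings, start=1)
    ]


# Illustrative glosses only; a real dictionary would supply these.
bank_terms = generate_unit_expressions(
    "bank",
    ["financial institution", "land alongside a river"],
)
for expr in bank_terms:
    print(expr.term, "=", expr.meaning)
```

Each generated term maps to a single gloss, which is the one-term-per-meaning property the abstract relies on.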
The task of processing close to 10 billion unit expressions of meaning is daunting. The proposed solution is to distribute the task evenly among many users. The greatest factor contributing to the ambiguity of natural language is the presence of innumerable proper nouns, which encroach on the domains of nouns, adjectives, verbs, and all other parts of speech and cause semantic confusion. Proper nouns are not limited to people's names, but even considering names alone, there are over 10 billion terms in this category, since the global population exceeds 6 billion. The present invention presents a configuration in which this prodigious task is allotted evenly among a very large number of users. Users perform tasks to meet their own needs and benefit from their own work: whenever users feel that term conversion is required, they can perform term generation and term conversion themselves, so that the system always remains in a state that is satisfactory to them.
The present invention provides: 1) a unit expression of meaning dictionary manager that can easily generate unit expressions of meaning; 2) a search annotator, which is a means for categorizing words and converting (annotating) the words belonging to a word group into unit expressions of meaning; and 3) a unit expression of meaning converter (annotator) that performs a function similar to the search annotator. The search annotator operates as part of a search system, and the alignment and search of words use existing search system functions.
Building a global information system based on unit expressions of meaning is an enormous endeavor. However, the unclear meaning of natural language presents a large obstacle to development in many fields.
By providing a clear language, the present invention provides a basis for considerable advances in the semantic web, search system, language translation, and artificial intelligence fields.
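The group-wise conversion performed by the search annotator can be illustrated with a toy example. The sketch below is an assumption-laden illustration rather than the claimed implementation: a small in-memory inverted index stands in for the "existing search system functions", the selection of a word group by co-occurring context words is an assumed criterion, and the corpus, the `annotate_group` function, and the `bank#1` / `bank#2` terms are hypothetical.

```python
from collections import defaultdict

# Toy corpus; document ids and texts are made up for illustration.
documents = {
    1: "she opened an account at the bank",
    2: "they fished from the river bank",
    3: "the bank approved the loan",
    4: "the river bank flooded in spring",
}


def build_index(docs):
    """Inverted index: word -> set of ids of documents containing it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for word in text.split():
            index[word].add(doc_id)
    return index


def annotate_group(docs, index, word, context_words, unit_term):
    """Convert `word` to `unit_term` in every document that also contains
    any of `context_words`; one call handles the whole word group."""
    context_docs = set().union(*(index.get(c, set()) for c in context_words))
    group = index[word] & context_docs
    for doc_id in group:
        docs[doc_id] = " ".join(
            unit_term if token == word else token
            for token in docs[doc_id].split()
        )
    # Keep the index in step with the converted documents.
    index[word] -= group
    index[unit_term] |= group
    return sorted(group)


index = build_index(documents)
river_group = annotate_group(documents, index, "bank", {"river"}, "bank#2")
finance_group = annotate_group(
    documents, index, "bank", {"account", "loan"}, "bank#1"
)
print(river_group, finance_group)
print(documents)
```

In this sketch, two commands convert every occurrence of the word, regardless of how many documents contain it, which mirrors the point that the number of conversion commands scales with the number of meanings rather than the number of occurrences.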
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR10-2010-0053571 | 2010-06-07 | ||
KR1020100053571A KR20110133909A (en) | 2010-06-07 | 2010-06-07 | Semantic dictionary manager, semantic text editor, semantic term annotator, semantic search engine and semantic information system builder based on the method defining semantic term instantly to identify the exact meanings of each word |
Publications (3)
Publication Number | Publication Date |
---|---|
WO2011155736A2 (en) | 2011-12-15 |
WO2011155736A3 (en) | 2012-04-19 |
WO2011155736A9 (en) | 2012-06-21 |
Family
ID=45098509
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/KR2011/004113 WO2011155736A2 (en) | 2010-06-07 | 2011-06-06 | Method for dynamically generating additional terms for each meaning of every natural language expression; dictionary manager, document generator, term annotator, search system, and device for building a document information system based on the method |
Country Status (2)
Country | Link |
---|---|
KR (1) | KR20110133909A (en) |
WO (1) | WO2011155736A2 (en) |
Families Citing this family (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20140188456A1 (en) * | 2012-12-27 | 2014-07-03 | Abbyy Development Llc | Dictionary Markup System and Method |
KR101602342B1 (en) * | 2014-07-10 | 2016-03-11 | 네이버 주식회사 | Method and system for providing information conforming to the intention of natural language query |
KR101661198B1 (en) * | 2014-07-10 | 2016-10-04 | 네이버 주식회사 | Method and system for searching by using natural language query |
US9946762B2 (en) | 2014-09-16 | 2018-04-17 | International Business Machines Corporation | Building a domain knowledge and term identity using crowd sourcing |
CN110059167A (en) * | 2019-01-23 | 2019-07-26 | 艾肯特公司 | Natural expression processing method, response method, equipment and the system of natural intelligence |
CN111008594B (en) * | 2019-12-04 | 2024-04-19 | 科大讯飞股份有限公司 | Error-correction question review method, related device and readable storage medium |
CN117709375B (en) * | 2024-02-01 | 2024-05-24 | 成都帆点创想科技有限公司 | Text translation method and device |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR19980029011A (en) * | 1996-10-25 | 1998-07-15 | 구자홍 | Vocabulary automatic classification device and method |
KR20020036059A (en) * | 2000-11-07 | 2002-05-16 | 옥철영 | Method for disambiguating word-sense based on semantic informations extracted from definitions in dictionary |
KR20070059881A (en) * | 2005-12-06 | 2007-06-12 | 한국전자통신연구원 | Method and apparatus for constructing database of semantic co-occurrence pattern |
2010
- 2010-06-07: KR application KR1020100053571A (patent KR20110133909A), not active, Application Discontinuation
2011
- 2011-06-06: WO application PCT/KR2011/004113 (patent WO2011155736A2), active, Application Filing
Also Published As
Publication number | Publication date |
---|---|
WO2011155736A9 (en) | 2012-06-21 |
WO2011155736A2 (en) | 2011-12-15 |
KR20110133909A (en) | 2011-12-14 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
Weitz et al. | "Do you trust me?" Increasing user-trust by integrating virtual agents in explainable AI interaction design | |
WO2011155736A3 (en) | Method for dynamically generating additional terms for each meaning of every natural language expression; dictionary manager, documentation generator, term annotator, search system, and device for building document information system based on the method | |
Nguyen et al. | NEU-chatbot: Chatbot for admission of National Economics University | |
Vertanen et al. | The imagination of crowds: conversational AAC language modeling using crowdsourcing and large data sources | |
Chopra et al. | Natural language processing | |
Swampillai et al. | Extracting relations within and across sentences | |
JP6526470B2 (en) | Pre-construction method of vocabulary semantic patterns for text analysis and response system | |
Speck et al. | Named Entity Recognition using FOX. | |
Horsmann et al. | Fast or Accurate?-A Comparative Evaluation of PoS Tagging Models. | |
Kishimoto et al. | A knowledge-augmented neural network model for implicit discourse relation classification | |
Cangelosi et al. | Speech and language in humanoid robots | |
Shaikh et al. | Composing conversational negation | |
Chang et al. | Illinoissl: A JAVA library for structured prediction | |
Xu et al. | ChicHealth@ MEDIQA 2021: Exploring the limits of pre-trained seq2seq models for medical summarization | |
KR20060066588A (en) | Natural language to ontology-based knowledge transformation system and method | |
Nakajima et al. | Automatic extraction of future references from news using morphosemantic patterns with application to future trend prediction | |
Ríssola et al. | Predicting Relevant Conversation Turns for Improved Retrieval in Multi-Turn Conversational Search. | |
Pakray et al. | A statistics-based semantic textual entailment system | |
Radhika et al. | Semantic role extraction and general concept understanding in malayalam using Paninian grammar | |
Sawant et al. | AI Model to Generate SQL Queries from Natural Language Instructions through Voice | |
Biju | Vocabulary-constrained Question Generation with Rare Word Masking and Dual Attention | |
Liu et al. | Chinese event recognition via ensemble model | |
Lee et al. | A two-step approach for efficient domain selection in multi-domain dialog systems | |
Das et al. | Hybrid sandhi-splitter for malayalam using unicode | |
Altman | Taming the dragon: Effective use of dragon naturally speaking speech recognition software as an avenue to universal access | |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
| 121 | EP: the EPO has been informed by WIPO that EP was designated in this application | Ref document number: 11792647; Country of ref document: EP; Kind code of ref document: A2 |
| NENP | Non-entry into the national phase | Ref country code: DE |
| 122 | EP: PCT application non-entry in European phase | Ref document number: 11792647; Country of ref document: EP; Kind code of ref document: A2 |