KR920005023A - Morphological Analysis of Hangul Sentences - Google Patents

Morphological Analysis of Hangul Sentences Download PDF

Info

Publication number
KR920005023A
KR920005023A KR1019900013727A KR900013727A KR920005023A KR 920005023 A KR920005023 A KR 920005023A KR 1019900013727 A KR1019900013727 A KR 1019900013727A KR 900013727 A KR900013727 A KR 900013727A KR 920005023 A KR920005023 A KR 920005023A
Authority
KR
South Korea
Prior art keywords
node
word
morpheme
generated
routine
Prior art date
Application number
KR1019900013727A
Other languages
Korean (ko)
Inventor
천성희
Original Assignee
이헌조
주식회사 금성사
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 이헌조, 주식회사 금성사 filed Critical 이헌조
Priority to KR1019900013727A priority Critical patent/KR920005023A/en
Publication of KR920005023A publication Critical patent/KR920005023A/en

Links

Landscapes

  • Machine Translation (AREA)
  • Document Processing Apparatus (AREA)

Abstract

내용 없음No content

Description

한글문장의 형태소 분석방식Morphological Analysis of Hangul Sentences

본 내용은 요부공개 건이므로 전문내용을 수록하지 않았음As this is a public information case, the full text was not included.

제2도는 본 발명에 따른 형태소 분석 설명도.2 is an explanatory diagram of morphological analysis according to the present invention.

제3도는 본 발명에 따른 한글문장의 형태소 분석 제어흐름도.3 is a morphological analysis control flow diagram of Hangul sentence according to the present invention.

Claims (2)

영역분할된 좌표에 따라 문자인식을 한 후 인식한 문자의 형태소를 분석하는 방식에 있어서, 형태소 단위의 한글사전정보와 임의의 형태소의 좌/우측접속정보표 및 접속정보행렬정보를 구비한 후 한 어절을 추출하여 상기 한글사전정보에 포함되는 최단일치 형태소를 루트노드(root node)로 생성하며, 루트노드가 생성되지 않으면 오인식 어절로 판정하고, 생성된 루트노드가 좌측공백과 접속가능하면 우측노드 생성로틴을, 접속불가이면 좌측노드 생성루틴을 수행하며, 상기 우측노드 생성루틴은 우측 형태소를 우측노드로 생성하고, 우측노드가 생성되지 않으면 상기 좌측노드 생성로틴을 수행하며, 우측노드가 생성되면 어절의 끝인지를 판별하여 끝일때 우측공백과 접속가능이면 분석을 마치고, 접속불가이면 상기 좌측노드 생성루틴수행을 하며, 어절의 끝이 아닐 때 상위 노드와 접속가능이면 다음 우측노드 생성루틴을, 접속불가이면 상기 좌측노드 생성루틴을 수행하고, 상기 좌측노드 생성루틴은 좌측형태소를 좌측노드로 생성하며, 좌측노드와 생성되지 않으면 오인식 어절로 판정하고, 좌측 노드가 생성되면 어절의 끝일지를 판별하여 끝일때 우측공백과 접속가능이면 궤환하고, 좌측노드가 어절의 끝이 아닐때 상위 좌측노드가 존재하지 않으면 상기 우측노드 생성루틴으로 궤환하며, 존재하면 현 좌측노드와 상위 좌측노드의 접속 가능을 판별하여 접속불가 및 접속가능이고 어절의 끝이 아니면 상기 우측노드 생성루틴으로 궤환하고, 접속가능일 때 어절의 끝이고 우측공백과 접속불가이면 상기 좌측 노드 생성루틴으로 궤환하며, 우측공백과 접속가능이면 분석을 마치도록 하는 것을 특징으로 하는 한글문장의 형태소 분석방식.In the method of analyzing the morpheme of the recognized character after character recognition according to the area-divided coordinates, after having the Hangul dictionary information in the morpheme unit and the left / right access information table and the access information matrix information of any morpheme, Extract the word and generate the shortest morpheme included in the Hangul dictionary information as a root node. If the root node is not generated, it is determined as a misrecognition word. If the generated root node is accessible with the left space, the right node If the generated rotin is not accessible, the left node generating routine is performed. The right node generating routine generates the right morpheme as the right node. If the right node is not generated, the left node generating routine is performed. If the end of the word is determined to determine whether the end of the connection can be connected to the right blank, if the connection is impossible, the left node generation routine is performed, If it is not the end of the clause, if it is connectable with an upper node, the next right node generation routine is executed. If it is not accessible, the left node generation routine is performed. The left node generation routine generates a left stem as a left node and is not generated with the left node. If not, it is determined as a misunderstanding word, and if the left node is generated, it is determined whether it is the end of the word, and if it is possible to connect with the right space at the end, it is fed back. It returns to the routine, and if it exists, it determines whether the current left node and the upper left node can be connected and cannot be connected or connected. If the connection is not possible, it returns to the left node generation routine, and if the connection is possible with the right space, the analysis is completed. Stemming way of Hangul sentence to. 제1항에 있어서, 한글사전은 헤쉬(Hash)평션에 의해 목차 테이블과 데이타블록으로 하고, 헤쉬링크(Link) 리스트에 의해 단어의 삽입, 삭제 및 검색하도록 하는 것을 특징으로 하는 한글문장의 형태소 분석방식.10. The morpheme analysis of Korean sentences according to claim 1, wherein the Hangul dictionary is a table of contents and a data block by a hash function, and a word is inserted, deleted, and searched by a hash list. system. ※ 참고사항 : 최초출원 내용에 의하여 공개하는 것임.※ Note: The disclosure is based on the initial application.
KR1019900013727A 1990-08-31 1990-08-31 Morphological Analysis of Hangul Sentences KR920005023A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
KR1019900013727A KR920005023A (en) 1990-08-31 1990-08-31 Morphological Analysis of Hangul Sentences

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
KR1019900013727A KR920005023A (en) 1990-08-31 1990-08-31 Morphological Analysis of Hangul Sentences

Publications (1)

Publication Number Publication Date
KR920005023A true KR920005023A (en) 1992-03-28

Family

ID=67542732

Family Applications (1)

Application Number Title Priority Date Filing Date
KR1019900013727A KR920005023A (en) 1990-08-31 1990-08-31 Morphological Analysis of Hangul Sentences

Country Status (1)

Country Link
KR (1) KR920005023A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR19980066877A (en) * 1997-01-29 1998-10-15 김광호 Morphological interpretation based on types of unregistered words
KR20030035248A (en) * 2001-10-30 2003-05-09 주식회사 아이버스 Method for searching by tree-structured words and computer readable medium having stored thereon computer executable instruction for performing the method

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR19980066877A (en) * 1997-01-29 1998-10-15 김광호 Morphological interpretation based on types of unregistered words
KR20030035248A (en) * 2001-10-30 2003-05-09 주식회사 아이버스 Method for searching by tree-structured words and computer readable medium having stored thereon computer executable instruction for performing the method

Similar Documents

Publication Publication Date Title
US5640575A (en) Method and apparatus of translation based on patterns
US8027966B2 (en) Method and system for searching a multi-lingual database
JP2783558B2 (en) Summary generation method and summary generation device
JPH0510703B2 (en)
JPH0535783A (en) Translating device for document with typographical information
KR920005023A (en) Morphological Analysis of Hangul Sentences
JPH0344343B2 (en)
Kempen et al. Author environments: Fifth generation text processors
Al-tarawneh et al. Novel approach for Arabic spell-checker: based on radix search tree
JPS6395573A (en) Method for processing unknown word in analysis of japanese sentence morpheme
JP2817497B2 (en) Dictionary editing device
JPH0350668A (en) Character processor
JPH0346865B2 (en)
Sureka et al. An efficiency dependency parser using hybrid approach for tamil language
JPS58127230A (en) Kanji (chinese character)-kana (japanese syllabary) converter
Rasha et al. Novel approach for arabic spell-checker: Based on radix search tree
JPH0724057B2 (en) Electronics
JPH03130873A (en) Text base retrieving system
JPS62226270A (en) Sentence preparing device
Grishman Session 10: THE LEXICON
JPH0340060A (en) Kana (japanese syllabary) to kanji (chinese character) converter
JPH02184961A (en) Dictionary retrieve method
De Luca et al. Annotating Arabic Words with English Wordnet Synsets: An Arabic Wordnet Interface
JPH0332108B2 (en)
JPH02110771A (en) Electronic translation device

Legal Events

Date Code Title Description
WITN Application deemed withdrawn, e.g. because no request for examination was filed or no examination fee was paid