KR950000543B1

KR950000543B1 - Korean-character code generator

Info

Publication number: KR950000543B1
Application number: KR1019920012078A
Authority: KR
Inventors: 정희성
Original assignee: 정희성
Priority date: 1992-07-07
Filing date: 1992-07-07
Publication date: 1995-01-24
Also published as: KR940002731A

Abstract

This generator employs Hangul code system which is adaptable to many functional characteristics such as information exchange, character display, information processing. Hangul character recognizer (1) recognizes input code as Hangul code and outputs it. Hangul character phase structure detector (2) detects phase structure referencing Hangul consonant/vowel code table (4). Hangul character geometric code generator (3) finally outputs character code.

Description

Hangul Character Code Generator

제 1 도는 본 발명의 장치 구성도.1 is a device configuration diagram of the present invention.

제 2 도는 본 발명의 한글자모코드 테이블.2 is a Hangul alphabet code table of the present invention.

제 3 도는 한글자모코드의 내부정보 구성도.3 is a diagram illustrating the internal information of a Hangul alphabet code.

제 4 도는 본 발명장치에서의 데이터 흐름도.4 is a data flow diagram in the apparatus of the present invention.

제 5 도는 본 발명장치를 DIS 10646에 응용한 구성도.5 is a block diagram of the present invention applied to the DIS 10646.

제 6 도는 제 5 도에서 DIS 10616의 BMP에의 영역 할당도.6 is an area allocation diagram of the DIS 10616 in BMP in FIG.

본 발명은 한글문자정보를 컴퓨터의 내부표현의 코드계열로 생성하고 한글문자코드생성장치에 관한 것으로 좀더 상세하게는 한글문자의 구조특성 분석에 따른 한글코드계의 구성과 그 코드계를 이용하여 입력된 한글자모를 한글문자로 인식하고 위상구조검출기 기하코드생성기를 거쳐 문자형태의 정보형식코드 표현을 출력토록 함으로써 고신뢰성, 범용성, 호환성을 가진 한글문자코드를 생성하는 장치에 관한 것이다.The present invention relates to an apparatus for generating Korean character information as a code sequence of an internal expression of a computer and to a device for generating Korean character codes. More particularly, the structure of the Korean character system based on the structural characteristics analysis of Korean characters and input using the code system The present invention relates to an apparatus for generating Korean character codes with high reliability, generality and compatibility by recognizing Korean characters as Hangul characters and outputting the information type code representations in character form through a geometry code generator.

한글문자정보를 컴퓨터 코드로 표현하는 방식에는 속칭 N바이트형, 조합형, 완성형등이 있으나 이들 방식은 출현당시의 기능수요만을 목표로 개발되었기 때문에 문자코드의 용도확대에 따른 범용성과 호환성에 문제점이 있었다.Korean character information can be represented by computer code as N byte type, combination type, complete type, etc., but these methods were developed only for functional demands at the time of appearance. .

즉 N바이트형은 대형컴퓨터에서 한글표시기능을, 조합형은 퍼스널컴퓨터에서 한글표시기능의 강화, 표시 문자의 고품질화를 기술 목표로, 또 완성형은 컴퓨터 통신기술과 이용의 확대에 따른 기술수요의 기능을 중시하였기 때문에 각 방식을 문자코드의 다양한 기술적 수요에 적용함에는 제한을 받는 문제점을 내포하고 있었다.In other words, the N-byte type aims to display Korean characters on large computers, and the combination type aims to enhance Korean characters on personal computers and improve the quality of display characters. Because of the importance, the application of each method to the various technical demands of the character code was limited.

본 발명은 한글문자코드의 용도별 기능특성, 곧 정보교환용, 문자표시용, 정보처리용등의 목적 특성에 균질적으로 적용될 수 있는 고효율의 한글코드체계와 코드생성장치를 제공하는데 그 목적이 있다.An object of the present invention is to provide a high-efficiency Korean code system and code generation device that can be applied homogeneously to the functional characteristics of Korean character codes, that is, for information exchange, character display, and information processing. .

제 1 도는 본 발명의 장치구성도이다.1 is a device configuration diagram of the present invention.

본 장치는 도시된 바와 같이 한글문자인식기 1, 한글문자위상구조검출기 2, 한글문자기하코드생성기 3, 한글자모코드테이블 4로 구성된 원칩하드웨어이다. 한글문자인식기 1은 입력되는 한글자모코드를 한글문자코드로 인식하여 출력하는 기능동작을 하며 한글문자인식기 1의 출력은 한글문자위상구조검출기 2의 입력이 된다. 한글문자위상구조검출기 2는 입력된 문자코드로부터 한글자모코드테이블 4를 참조하여 문자의 위상구조를 검출하는 기능으로 동작되며 그 결과는 한글문자기하코드생성기 3의 입력으로 된다.This device is a one-chip hardware consisting of a Hangul character recognizer 1, Hangul character phase structure detector 2, Hangul character geometric code generator 3, Hangul alphabet code table 4 as shown. The Hangul Character Recognizer 1 recognizes and outputs the Hangul character code as the Hangul Character Code. The output of the Hangul Character Recognizer 1 becomes the input of the Hangul Character Phase Structure Detector 2. The Hangul character phase structure detector 2 operates by detecting the phase structure of the character by referring to the Hangul alphabet code table 4 from the input character code, and the result is the input of the Hangul character geometry code generator 3.

한글문자기하코드생성기 3은 한글문자위상구조검출기 2로부터의 출력을 입력으로 하고 한글자모코드테이블 4의 정보를 참조하여 소거의 문자코드를 최종적으로 출력하는 것이다.The Hangul character geometry code generator 3 inputs the output from the Hangul character phase structure detector 2 and finally outputs the character code of erasing by referring to the information of the Hangul alphabet code table 4.

제 2 도는 본 발명의 한글자모 코드테이블 4의 코드구성 및 자모정보의 표현방법과 이용체계를 도시하고 있는데 코드테이블의 크기는 256바이트 셀(byte cell)로서 구성하고 셀은 다시 그룹화하여 8개의 그룹공간을 구성한다.2 shows a code structure of the Hangul Jamo code table 4 and a method of expressing and using the Jamo information of the present invention. The size of the code table is configured as 256 byte cells and the cells are grouped again to form 8 groups. Construct space.

이 8개의 그룹공간은 한글문자의 위상구조 유형과 대응하여 각 공간그룹의 각 셀에는 문자의 위상구조형에 대응하는 한글문자합성자모가 할당되어 있다.These eight group spaces correspond to the topological structure type of the Hangul characters, and each cell of each spatial group is assigned a Hangul character synthesis letter corresponding to the topological structure type of the character.

제 1 도에서의 한글문자 인식기 1은 한글자모(자음, 모음)를 연속입력하여 한글 음절에 맞는 문자를 구성가능한 자모계열인가 아닌가를 검출하여 문자구성이 가능한 입력계열(input string)이면 한글문자 구성이 가능하다는 출력신호 1을 생성하고, 그 입력계열 중 문자합성 가능자모를 제 1 도의 한글문자위상구조검출기 2에 보내는 기능을 가지고 있다. 한글자모 입력에서 한글문자로 구성이 가능한 문자의 형태(pattern)는 다음과 같다.The Hangul Character Recognizer 1 in FIG. 1 detects whether a character is suitable for Hangul syllables by continuously inputting a Hangul Jamo (consonant, vowel), and if it is an input string that can be composed of Hangul characters. It is possible to generate an output signal 1 indicating that this is possible, and to send the character synthesis possible alphabet among the input sequences to the Hangul character phase structure detector 2 of FIG. The patterns of the characters that can be composed of Hangul characters in the Hangul alphabet input are as follows.

① 제 1 형(자음＋수직모음 : 가, 예등)① Type 1 (consonants + vertical vowels: a, yes, etc.)

② 제 2 형(자음＋수평모음 : 고, 요등)② Type 2 (consonants + horizontal vowels: high, yo, etc.)

③ 제 3 형(자음＋수직수평 2중모음 : 의, 와등)③ Type 3 (consonants + vertical horizontal double vowels: right, back, etc.)

④ 제 4 형(자음＋수직수평＋자음 : 각, 낙등)④ Type 4 (consonants + vertical horizontal + consonants: angles, falls, etc.)

⑤ 제 5 형(자음＋수평모음＋자음 : 곡, 녹등)⑤ Type 5 (consonants + horizontal vowels + consonants: music, green, etc.)

⑥ 제 6 형(자음＋수직수평 2중모음＋자음 : 등)⑥ Type 6 (consonants + vertical horizontal double vowels + consonants: etc.)

한글문자인식기 1은 위와 같은 문자패턴을 입력된 자모로부터 인식하는 장치로써 그 구성을 밀리(mealy)형 머쉰의 전이도(automaton)로 나타내면 다음과 같다.The Hangul Character Recognizer 1 is a device that recognizes the above character pattern from the input letter, and its configuration is expressed as an automaton of a mealy type machine as follows.

또, 위의 오포마톤을 상태 테이블로 나타내며 다음과 같다.In addition, the opomaton above is represented by a state table as follows.

한글문자인식기 1의 초기상태는 항상 S₀에 있고, 이 상태에서 자음(ㄱ, ㄴ, ㄷ, ㄹ, ㅁ, ㅂ, ㅅ, ㅇ, ㅈ, ㅊ, ㅋ, ㅌ, ㅍ, ㅎ, ㄲ, ㄸ, ㅃ, ㅆ)이 입력되면 한글 음절에 해당하는 문자를 만들어 내기 위하여 다음 입력을 기다리는 상태 S₁으로 옴기고 문자구성이 불가능하다는 출력신호를 0으로 한다.The initial state of the Hangul Character Recognizer 1 is always at S ₀ , and in this state, the consonants , school boy, ㅆ) this is when to produce the letter corresponding to the Hangul syllable Contributions ohms to state S _1, and then wait for the output signal that the character configuration is not possible to zero input.

상태 S₁에서 다음입력을 기다려 입력이 모음이면, 입력버퍼에 입력자모를 저장한채로 다음 상태 S₂로 옮기며, 입력이 자음이라면 S₁으로 되돌아 간다.If the input is a vowel after waiting for the next input in state S ₁ , it moves to the next state S ₂ with the input letter stored in the input buffer and returns to S ₁ if the input is a consonant.

상태 S₂에서는 입력이 자음이면 문자인식 검출신호를 /0으로 한 채 상태 S₃로 옮겨, 다음 입력을 기다리며 입력이 모음이라면 이중모음일 가능성이 크므로 S₂상태에 그대로 머물고 다음 입력을 기다린다. S₃의 상태에서는 입력이 자음이면 문자인식검출신호의 출력을 /0으로 한 채 다음 상태 S₄로 옮긴다. 입력이 모음이라면 S₁에서 S₂까지의 입력자모열로 한글문자의 하나(자음+모음)가 합성이 가능하므로 문자인식검출신호를 /1로하여 입력버퍼 가운데에서 S₀과 S₁의 입력을 제 1 도의 위상구조검출기 2로 보낸 후, 나머지 입력자모, 즉 S₂와 S₃에서의 입력자모를 입력버퍼에 남긴 채로 상태를 S₄로 옮긴다. 상태 S₂에서는 입력자모가 자음이면 S₀에서 S₃까지의 입력자모로서 한글 한 문자 구성이 가능하므로 문자인식검출신호를 /1로 하고, S₀에서 S₃까지의 입력계열을 위상구조검출기로 보낸후, 다음 상태를 S₁로 옮긴다. 이때 입력버퍼에는 S₄에서의 입력 즉 자음하나가 들어있게 된다. 또 S₄에서의 입력이 모음일 경우 S₀에서 S₄까지의 입력계열 가운데에서 S₃에서 S₂까지의 입력계열이 한글문자 한 개로 구성 가능해지므로 S₀에서 S₂까지의 입력계열을 문자인식검출신호 /1과 함께 위상구조검출기 2로 보내고 다음 상태를 S₂로 옮기게 된다. 이때 입력버퍼에는 S₃와 S₄에서의 입력 즉 자음, 모음의 2개가 남아있게 된다.State S ₂ in the input consonant when character recognition replaces the detection signal in one state S ₃ to / 0, if it is input while waiting for the next input bar staying intact S ₂ state is larger likely to be diphthong waits for the next input. In the state of S ₃ , if the input is a consonant, the character recognition signal output is shifted to the next state S ₄ with the output of / 0. If the input is a vowel, one of Korean characters (consonant + vowel) can be synthesized in the input string from S ₁ to S ₂ , so the character recognition signal is set to / 1 to input S ₀ and S ₁ from the input buffer After sending to the phase structure detector 2 of FIG. 1, the state is transferred to S ₄ while leaving the remaining input letters, i.e., the input letters at S ₂ and S _3, in the input buffer. In state S ₂ , if the input letter is consonant, Korean characters can be composed as the input letter from S ₀ to S ₃ , so the character recognition signal is set to / 1, and the input sequence from S ₀ to S _{3 is} converted to After sending, transfer the next state to S ₁ . At this time, the input buffer contains an input from S ₄ , that is, one consonant. In S ₄ the input is a collection one case because the input series in the middle input line of the S ₀ to S ₄ to S ₂ in S ₃ enables the open-circuit Hangul character configuration recognize the input series of the S ₀ to S ₂ characters in Along with the detection signal / 1, it is sent to phase detector 2 and the next state is transferred to S ₂ The input buffer it is possible with 2 of the type that is consonant, vowel in S ₃ and S ₄ remain.

위의 상태 전이도의 상태 테이블에서 입력 Del은 Deliminater(끊기)를 나타내며, 쉼표등의 입력키 값을 가리킨다. 상태 S₂에서 입력이 Del이면 S₀에서 S₁까지 자음과 모음이 입력되고, S₂에서 스페이스이므로 자음과 모음으로 한글 한 문자가 구성가능하게 되어 문자인식 검출신호는 /1이 되고, S₀과 S₁의 입력계열을 위상구조검출기 2로 보내진다. S₃에서 Del이면 자음+모음+자음의 문자구성이, S₄에서 Del이면 자음+모음+자음+자음의 문자구성이 자동적으로 구성되어 위상구조검출기 2로 그 상태까지의 입력계열이 넘겨지게 되는 것이다.In the state table of the state transition diagram above, the input Del represents a Deliminater and indicates an input key value such as a comma. If the input is Del while S ₂ is input to the vowels and consonants in the S ₀ to S _1, because it is in the S ₂ space, making it possible that the Hangul consonants and vowels constructed and the character recognition detection signal / 1, S ₀ The input sequence of and S ₁ is sent to the phase structure detector 2. If S ₃ is Del, consonants + vowels + consonants are composed of letters, and if S ₄ is Del, consonants + vowels + consonants are composed of letters automatically, and the phase sequence detector 2 passes the input sequence up to that state. will be.

구체적인 예를들어 한글문자인식기 1의 구성과 기능에 대해서 말하자면, 먼저 초기상태 S₀에서 '1'이 입력되면 'ㄴ'은 자음이므로 자음 한자로서는 음절구성이 불가능하므로 문자인식검출신호를 0으로 출력하고, 다음 입력대기상태를 S₁으로 옮긴다. S₁에서 'ㅏ'가 입력되면 'ㅏ'는 모음이므로 S₀에서의 'ㄴ'과 S₁에서의 'ㅏ'를 합성하면 '나'로써 한글음절 구성이 가능하다. 그러나 그 상태까지로는 '나'인지 '날'까지의 입력이 있을 것인지를 결정할 수 없어 입력먼저 보기(look ahead)를 해야하므로 문자인식검출기신호는 0으로 한채, 다음 입력을 보기 위하여 다음 상태를 S₀로 옮긴 후 다음 입력을 기다린다.As a specific example, the configuration and function of the Hangul character recognizer 1, when '1' is input from the initial state S ₀ , 'b' is a consonant, so it is impossible to compose a syllable with a consonant Hanja. and it moves the next input standby state to S _1. If 'ㅏ' is input in S ₁ , 'ㅏ' is a vowel, so if you combine 'n' in S ₀ and 'ㅏ' in S ₁ , you can construct a Korean syllable as 'I'. However, until that state, we cannot determine whether there is an input to 'I' or 'day', so we need to look ahead, so the character recognition signal is set to 0 and the next state is displayed to see the next input. Move to S ₀ and wait for the next input.

S₂에서의 입력이 자음 'ㄹ'이라면 S₀에서 S₂까지의 입력계열로 '날'를 인식, 한글문자로 생성할 수 있으나 '날'인지 '나라'를 위한 'ㄴㅏㄹ'인지는 아직 결정할 수 없는 상태(nonderterministic state) 이어서 문자인식검출신호를 1로 할 수 없고, 다시 다음 상태로 가서 입력을 기다려야 한다. 다음 상태는 S₃로서 S₃에서 자음 'ㄱ'이 들어오면 여기 상태에서도 '날기'의 것인지 또는 '낡'를 위한 자음인지를 분간할 수 없어 다시 다음 입력을 확인하기 위하여 다음 상태인 S₄로 옮기여 다름 입력을 기다리게 된다. 만일 S₃에서 모음 'ㅏ'가 입력되면 지금까지의 입력계열 'ㄴ ㅏ ㄹ ㅏ'중에서 'ㄴ ㅏ'가 '나'로 문자구성이 가능하므로 문자인식검출신호를 1로 하고, S₀과 S₁의 입력자모 'ㄴ''ㅏ'를 위상구조검출기 2로 보내고, S₂와 S₃에서의 입력 'ㄹ'과 'ㅏ'는 입력버퍼에 저장한 채로 S₂로 다음 입력상태를 옮긴다.If the input from S ₂ is the consonant 'ㄹ', the input sequence from S ₀ to S ₂ can recognize 'day' and generate it as Hangul characters, but it is still unknown whether it is 'day' or 'ㄴ ㅏㄹ' for 'country'. Nonderterministic state Subsequently, the character recognition signal cannot be set to 1, and it must go to the next state again and wait for input. The following states as S ₃ comes in two at S ₃ consonants 'b' can not discern whether this consonant to whether the 'Fly' or 'beat' in the state back to the S ₄ following conditions in order to determine the next input It will move and wait for the next input. If the vowel 'ㅏ' is input in S ₃ , the character recognition signal is set to 1 and S ₀ and S because 'b ㅏ' can be composed of 'B' in the input sequence 'B ㅏ ㄹ ㅏ'. 'send to the phase detector structure 2, at the input of S ₂ and S _3' ₁ input alphabet "b""of the trestle d" and "trestle" is to move the next type state while stored in the input buffer to S _2.

만일 '맑고'를 입력시키려고 하면 'ㅁ ㅏ ㄹ ㄱ ㄱ ㅎ'가 입력계열로 S₀→S₁→S₂→S₃→S₄로 상태가 전이되며, S₄에서 ㄱ이 입력되면 S₀에서 S₄까지의 입력계열 'ㅁ ㅏ ㄹ ㄱ'와 문자인식검출신호"1"를 위상구조검출기 2로 넘긱 되고 문자화가 아직 못되는 'ㄱ'를 입력버퍼에 남긴채로 S₁으로 전이하여 다음 입력을 기다리게 된다.If the attempt to enter a "clear""Wh trestle d b b heh 'input sequence _{_{_{S 0 → S 1 → S 2}}} → S 3 → and state transitions to S ₄ in, when in S ₄ b is input from the S ₀ The input sequence 'ㅁ ㅏ ㄹ ㄱ' up to S ₄ and the character recognition signal "1" are transferred to the phase structure detector 2, and the next input is transferred to S ₁ without leaving the 'buffer' in the input buffer. I will wait.

물론 S₂, S₃, S₄에서 스페이스 또는 쉼표의 같은 끊기 키를 입력시키면 그때그때 실시간으로 '나''날''맑'이 한글문자 생성단위로 인식되어 한글문자위상구조 검출기 2로 보내져 정해진 절차에 따라 처리된다. 물론 입력코드는 조합형 원성형을 그대로 이용할 수 있으므로 입력코드의 혼환성이 유지된다.Of course, if you enter the same break key of space or comma in S ₂ , S ₃ , and S ₄ , then, in real time, `` me '' and `` sunny '' will be recognized as the Hangul character generation unit and sent to Hangul character phase structure detector 2. It is processed according to the procedure. Of course, the input code can use the combination type as it is, the compatibility of the input code is maintained.

또 한글문자위상구조검출기 2는 상기 한글문자인식기 1로부터의 출력을 입력으로 하여 한글자모코드테이블 4를 참조하여 한글문자의 위상구조를 추출한다.Also, the Hangul character phase structure detector 2 extracts the topological structure of the Hangul characters by referring to the Hangul alphabet code table 4 using the output from the Hangul character recognizer 1 as an input.

여기에서 구조란 어느 집합내의 개개의 요소간에 존재하는 관계를 말하면 한글문자에서의 구조에는 순서구조, 위상구조, 대수구조가 있고 한글문자의 위상구조는 완전한 한글문자의 자형에 공통적인 도형요소를 추상화한 개념으로서 예를들어 (하류)=(한)은 위상적으로 같은 것이나 다만 일차원 배열과 이차원 배열의 기하구조가 다를 뿐이다. 한글자모를 합성하여 생성할 수 있는 문자의 수는 11172자인데 이들 자모집합에서 공통의 구조를 찾으면 6가지 유형으로 분류할 수 있으므로 이를 분류된 문자의 의미로 표현하면 정보 위상구조가 추출되는 것이다.Here, the structure refers to the relationship existing between individual elements in a set. In the structure of Hangul characters, there are order structure, topology structure, and algebraic structure. As a concept, for example, (downstream) = (a) is topologically the same, except that the geometry of one-dimensional and two-dimensional arrays is different. The number of characters that can be generated by synthesizing the Hangul alphabet is 11172 characters. If a common structure is found in these sets, the information topology can be extracted by expressing them in the meaning of the classified characters.

2차원 표기의 문자에서 위상구조를 추출하는 경우 한 문자에 포함된 모음의 종류와 종성자음의 유무가 위상구조를 결정하는 의미정보가 된다. 즉 모음이 종류를 수평모음(ㅗ, ㅛ, ㅜ, ㅠ, ㅡ........등), 수직 모음(ㅏ, ㅑ, ㅓ, ㅕ,....등)으로 분류되며 중성자음이 있는 문자는 다시 각 자모의 기하데이터가 달라지므로 이들 자모를 다시 대소로 분류하면 위상구조의 유형은 6개가 되는데 대표적인 유형을 밝히면 제 1 형은 (가), 제 2 형은 (고), 제 3 형의 (과), 제 4 형은 (한), 제 5 형은 (글), 제 6 형은 (퀘ㄹ)이 된다 여기에서 괄호안의 문자는 유형의 예지문자에 불과하다.When extracting a topological structure from a letter of two-dimensional notation, the type of vowel included in one letter and the presence or absence of the final consonant become semantic information for determining the topological structure. That is, the vowels are classified into horizontal vowels (ㅗ, ㅛ, TT, ㅠ, ㅡ ........) and vertical vowels (ㅏ, ㅑ, ㅓ, ㅕ, ...., etc.) and neutron consonants Since the geometric data of each letter is different again, the letters are classified into small and large again, and the types of topologies are 6 types.The representative types are the first type (A), the second type (I), and the second type. Type 3, type 4 is (Han), type 5 is (Writ), and type 6 is (Qué). Here, the letters in parentheses are merely type predictive characters.

이 위상정보를 추출하여 출력하는 기능을 한글문자 위상구조검출기 2가 행하는 것인데 그 기능동작은 다음과 같다.The Korean character phase structure detector 2 performs a function of extracting and outputting the phase information. The function operation is as follows.

한글문자인식기 1에서의 출력코드는 한글문자로서 합성이 가능한가 어떤가의 결정에 따라 한 물질에 해당하는 문자경계정보 뿐이기 때문에 코드의 실제값은 한글자모인식기 1의 입력자모코드값 즉 한글문자를 풀어 쓰기했을 때와 같게 된다. 따라서 한글문자인식기 1의 출력코드를 한글문자의 표시방법, 즉 2차원적으로 모아썼을 때는 문자형에 따라 글자안에서의 한글자모의 모양이 정해지므로, 우선 그 대상문자가 어떤 위상구조에 속하고 있는가를 한글문자위상구조검출기 2가 검출해 내게 되는데 위상구조의 검출에 쓰이는 정보는 한문자에 포함된 모음의 종류 (수직, 수평, 이중) 종성자음의 유무에 따라 그 값을 구한다.The output code of Hangul Character Recognizer 1 is only the character boundary information corresponding to a substance, depending on whether it can be synthesized as Hangul characters. It will be the same as when writing. Therefore, when the output code of the Hangul Character Recognizer 1 is collected in two dimensions, that is, the shape of the Hangul alphabet in the character is determined according to the character type, first, which phase structure the target character belongs to? The character phase structure detector 2 detects the information. The information used to detect the phase structure is obtained according to the type of vowels (vertical, horizontal, and double) included in one character.

예를들면 (가)이면 모음은 (ㅏ)로 수직모음이므로 제 1 영역의 위상구조에 속하게 되므로 제 1 영역을 나타내는 비트열(000)를 셋트하여 출력하는 것이다.For example, if (a), the vowel is a vertical vowel (ㅏ) and thus belongs to the phase structure of the first region, so that a bit string (000) representing the first region is set and output.

또 한글문자기하코드생성기 3은 상기의 한글문자위상구조검출기 2의 출력정보를 입력으로 하고 한글자모코드테이블 4를 이용하게 최종문자코드를 출력하게 된다.The Hangul character geometry code generator 3 inputs the output information of the Hangul character phase structure detector 2 and outputs the final character code using the Hangul alphabet code table 4.

한글기하코드 생성기 3의 구성은 다음과 같다. 상기 한글문자위상 구조검출기 2에서 처리되어 출력되는 코드정보는 한글문자 형태별(제 1 형에서 제 6 형까지 중 하나)에 따라 문자를 구성하는 자모의 코드 1바이트 정보가운데에서 제 2 도의 b₅b₆b₄의 자리비트가 문자의 위상 코드로 각각 셋트된 후, 나머지 b₀b₁b₂b₃b₄의 비트자리에는 문자인식기에 입력될 때의 코드정보를 그대로 간직하고 있다. 이와 같은 자모코드로 구성된 입력데이타는 한글기하코드 생성기 3으로 입력된다.The structure of the Korean geometric code generator 3 is as follows. The code information processed and output by the Hangul character phase structure detector 2 is b ₅ b of FIG. 2 in the one-byte information of the letter of the Jamo constituting the character according to the Hangul character type (one of type 1 to type 6). _{After 6} b ₄ digit bits are set to the character phase code, the remaining bit information of b ₀ b ₁ b ₂ b ₃ b ₄ retains the code information when it is input to the character recognizer. The input data consisting of the alphabet codes is input to the Korean geometric code generator 3.

이 한글기하코드 생성기 3에서는 한글자모의 각각 코드 1바이트 중, b₀b₁b₂b₃b₄를 제 2 도의 한글자모코드 테이블에 정의된 각 그룹별의 각 자모코드에 대응하여 일치하도록 비트연산처리를 행한다. 비트연산은 각 비트간의 논리 연산자 또는 마스크 비트를 이용한다.In the Hangul geometric code generator 3, bits ₀ b ₁ b ₂ b ₃ b ₄ of the one letter of each Hangul alphabet correspond to each alphabet code of each group defined in the Hangul alphabet code table of FIG. Arithmetic processing is performed. Bit operations use logical operators or mask bits between each bit.

이와 같은 연산처리를 도형적으로 설명하면 다음과 같다.This operation is described graphically as follows.

b₅b₆b₇의 비트정보는 위상구조검출기에서 문자의 형태에 따라 그 값이 정해지고 b₀에서 b₄까지의 5비트 정보는 자모코드테이블에 정의된 각 그룹별(군사형태별) 자모의 코드 즉 제 2 도의 코드테이블 가운데인 비트 값 중, b₀에서 b₇에까지의 값이 셋트되도록 비트연산처리를 행한다. 구체적 예를들면 '나'의 위상구조검출기의 코드정보는 '나'가 제 1 형의 문자이므로 ????000(ㄴ), ????000(ㅏ)로 처리되어 출력된 후, 기하코드 생성기에 입력되어 ?????(ㄴ)과 ?????(ㅏ)의 비트 정보는 문자코드테이블 제 1 그룹의 'ㄴ'과 'ㅏ'에 정의된대로 01000(ㄴ)과 11000(ㅏ)로 바트연산처리후 각각 셋트되어 전체적으로 01000(기하데이터) 000(위상데이터), 11000(기하데아터), 000(위상데이터)로 0100000011000000(나)의 최종 출력을 얻게 되며, 이 최종출력이 한글 폰터 테이블의 주소가 된다. 그후 폰트생성장치(font generator)에 의해 디스플레이 또는 프린터에 찍히게 되는 것이다.The bit information of b ₅ b ₆ b ₇ is determined according to the type of characters in the phase structure detector, and the 5-bit information from b ₀ to b ₄ is the letter of each group (military type) defined in the alphabet code table. A bit operation is performed so that a value from b ₀ to b ₇ is set among the code, i.e., bit values in the code table of FIG. For example, the code information of the phase structure detector of 'I' is treated as ???? 000 (b), ???? 000 (ㅏ) because 'I' is the first type of character, and then the geometry The bit information of ???? (b) and ????? (ㅏ) input to the code generator is 01000 (b) and 11000 as defined in 'b' and 'ㅏ' of the first group of character code tables. (I) After each Baht operation is set, the final output is 0000000 (geometric data) 000 (phase data), 11000 (geometric data), 000 (phase data) and 0100000011000000 (b). This is the address of this Hangul fonter table. It is then stamped on a display or printer by a font generator.

제 2 도는 본 발명에서 한글문자위상구조검출기 2 및 한글문자기하코드 생성기 3의 기능수행에 핵심이 되는 한글자모코드테이블 4의 상세한 내용을 나타내고 있는데 테이블 구성과 한글문자정보의 코드표현에 대한 설계원리는 다음과 같다.2 shows the details of the Korean alphabet code table 4, which is the core of the function of the Hangul character phase structure detector 2 and the Hangul character geometric code generator 3 in the present invention, and the design principle of the table structure and the code expression of the Hangul character information. Is as follows.

이 테이블의 내부구성에 있어 물리적인 코드영역의 크기는 2⁸=258바이트 셀 메모리 공간(byte cell memory space)과 번지공간(address space)으로 구성한다. 이 번지공간은 실제의 코드값에 대응하며 여기에는 다시 구역공간과 지역공간을 나타내는 정보가 들어있어 위상구조공간과 기하구조공간에 각각 대응케 된다.In the internal structure of this table, the physical code area is composed of 2 ⁸ = 258 byte cell memory space and address space. This address space corresponds to the actual code value, which in turn contains information representing the area space and the area space, corresponding to the topology space and the geometry space, respectively.

제 2 도에서 세로로 1열과 2열에는 제 1 형, 3열과 4열에는 제 2 형, 5열과 6열에는 제 3 형, 7열과 8열에는 제 4 형, 9열과 10열에는 제 5 형과, 11열, 12열에는 제 6 형이 각각 정의되어 있고 13, 14열에는 제4, 6형의 종성자음이, 15, 16열에는 제 5 형의 종성자음을 각각 정의하여 할당하고 상기의 그룹화 결과의 영역을 각각 제1, 2, 3, 4, 5, 6, 7, 8그룹이라 명명한다.In Fig. 2, vertically in columns 1 and 2, type 1, type 2 in columns 3 and 4, type 3 in columns 5 and 6, type 4 in columns 7 and 8, and type 5 in columns 9 and 10 And 6, type 6 are defined in columns 11 and 12, and 4 and 6 type consonants are defined in rows 13 and 14, and 5 type type consonants are defined and assigned in columns 15 and 16, respectively. The areas of the grouping result are named first, second, third, fourth, fifth, six, seven, eight groups, respectively.

각 그룹내의 자모는 표시기능의 제어정보를 가지고 있다. 즉 제 1 그룹에서 제 3 그룹의 할당자모중 자음, 제 4 그룹에서 제 6 그룹의 할당자모 모두를 스페이싱을 필요로 하지 않는 자모(non-spacing character)로 지정한다.The letter in each group has control information of the display function. That is, all consonants of the third group's allocators in the first group and allocators of the sixth group in the fourth group are designated as non-spacing characters.

제 3 도는 제 1 도의 한글문자위상검출기 2의 한글자모코드테이블 4에서의 한 자모에 해당하는 출력코드 형식과 내부정보의 구성을 나타낸다. b0에서 b4비트까지는 각 자모 분별을 위한 수치정보(한글문자기하코드)이며 b5에서 b7까지의 비트정보는 위상구조정보(한글문자위상구조코드)이고 b0에서 b7까지의 자모식별코드에 의하여 한글문자가 식별된다.FIG. 3 shows an output code format corresponding to one letter in the Hangul alphabet code table 4 of the Hangul character phase detector 2 of FIG. Bits b0 to b4 are numerical information (Hangul character geometry code) for each Jamo classification, and bit information from b5 to b7 is phase structure information (Hangul character phase structure code), and Korean characters are identified by the letter codes from b0 to b7. Is identified.

한글문자기하코드생성기 3의 작동기능을 살펴보면 상기 한글문자위상구조검출기 2에서의 출력코드는 입력문자가 어떤 문자의 위상구조에 속하는 가를 결정하는 정보를 포함하고 있으므로 다음 단계는 그 위상구조안에서 각자모가 어떤 기하데이터를 갖는가를 검출하고 합당한 코드를 생성하는 기능동작을 하여야 할것인바, 이미 한글문자위상구조검출기 2에서의 출력코드는 한글자모코드테이블 4의 위상그룹에 각각 연결되어 있는 상황이므로 한글문자위상구조검출기 2의 출력코드중 b0에서 b4까지의 자모분별데이터를 한글자모코드테이블 4의 물리적 테이터와 일치하도록 매핑함수(mapping function)를 사용, 소정의 출력코드를 얻게 되는데 이 매핑함수는 간단한 비트연산(bit operation)으로 산출되므로 최종출력을 얻게되는 것이다.Looking at the operation function of the Hangul character geometry code generator 3, the output code of the Hangul character phase structure detector 2 contains information for determining which character's topological structure belongs to the input character. It should detect the geometric data and generate a proper code. Since the output code of Hangul Character Phase Structure Detector 2 is connected to the phase group of Hangul Code Table 4, the Hangul Character Phase Among the output codes of the structure detector 2, the mapping function from b0 to b4 is mapped to the physical data of the Korean alphabet code table 4 to obtain a predetermined output code. This mapping function is a simple bit operation. It is calculated as (bit operation), so you get the final output.

제 4 도는 제 1 도에 도시한 장치구성의 실제 기능동작을 나타내는 데이터 흐름도이다. 제 1 도의 장치에(ㅎㅜㄴ, ㅁㅣㄴ, ㅁㅣㄴ, ㅈㅓㅇ, ㅇㅡㅁ)에 해당하는 자모의 입력코드가 들어 왔다고 할 때 한글문자인식기 1은 그 기능동작에 따라 (ㅎㅜㄴ, ㅁㅣㄴ, ㅁㅣㄴ, ㅈㅓㅇ, ㅇㅡㅁ)에 해당하는 문자경계를 가진 정보의 코드를 기능 결과로 출력하게 되는데 4문자중(ㅎㅜㄴ)자를 예로 설명한다. 제 1 도의 한글문자위상 구조검출기 2에 (ㅎㅜㄴ)자가 입려되면 제 3 도의 데이터 흐름도에서와 같이 (ㅎㅜㄴ)자는 종성자음이 있고 그것의 모음이 수평모음이므로 (ㅎㅜㄴ)의 상위 3비트에는 (100)의 비트패턴을 셋트하고 종성자음의 상위 3비트에는 (111)의 비트패턴을 셋트한 후, 출력한다.FIG. 4 is a data flow diagram showing the actual functional operation of the apparatus configuration shown in FIG. When the input code of Jamo corresponding to the device of Figure 1 (ㅎㅜㄴ, ㅁ ㅣ ㄴ, ㅁ ㅣ ㄴ, ㅈ ㅓㅇ, ㅇ ㅡ ㅁ) has been entered, the Hangul Character Recognizer 1 has the function operation (ㅎㅜㄴ, ㅁ ㅣ B, ㅁ ㅣ ㄴ, ㅈ ㅓㅇ, ㅇ ㅡ ㅁ) will output a code of information with a text boundary as a result of the function. If (ㅎㅎㄴ) is applied to the Hangul Character Phase Structure Detector 2 of FIG. 1, as in the data flow diagram of FIG. 3, the (ㅎㅎㄴ) has a consonant consonant and its vowels are horizontal vowels. The bit pattern of (111) is set in the upper 3 bits of the final consonant, and then output.

제 1 도의 한글문자기하코드생성기 3은 이 비트패턴을 입력으로 해서 한글자모코드테이블 4의 코드테이블 대응 위상영역으로 각각 링크(link)한후, 나머비 자모식별코드의 5비트를 한글자모코드테이블 4의 자모표현코도로 변환하는 매핑(mapping) 함수를 이용하여 소정의 최종 문자출력코드를 출력한다. 입력코드를 완성형의 KSC 5636으로 곰을 경우 (ㅎㅜㄴ)은 (01001001 10101001 01000111)의 비트패턴을 가진 3바이트 길이의 한 문자코드로 생성된다. 전체의 예시는 (ㅎㅜㄴ)=(01111001 11001001 00100111)=(01001001 10101001 01000111)이며 비트순위는 바이트의 제일 오른쪽이 상위비트이다.The Hangul character geometric code generator 3 of FIG. 1 inputs this bit pattern as an input to the code table corresponding phase region of the Hangul alphabet code table 4, and then converts the 5 bits of the surplus non-self identification code into the Hangul alphabet code table 4 A predetermined final character output code is output by using a mapping function that converts to a Jamoexpression of. If the input code is enclosed in the complete KSC 5636, the character code is generated as a three-byte long character code with a bit pattern of (01001001 10101001 01000111). An example of the whole is (ㅎㅎㄴ) = (01111001 11001001 00100111) = (01001001 10101001 01000111) and the bit order is the rightmost bit of the byte.

본 발명장치는 한글입력코드를 7단위 한글 날자 부호(KS C5636)를 사용하는 것을 전제로 하고 있다. 따라서 명세서 제 1 도의 한글문자 인식기 1로의 한글 자모 입력코드는 키보드의 키인(Key in)에 의해 발생되는데 이때 발생되는 코드가 KS C5636에서 정의된 7단위 한글 날자부호이다.The present invention is based on the premise that the Korean input code uses the seven-character Hangul date code (KS C5636). Therefore, the Hangul alphabet input code to the Hangul character recognizer 1 of FIG. 1 is generated by a key in of the keyboard, and the generated code is a 7 unit Hangul date code defined in KS C5636.

이 코드데이타가 한글문자식기 1로 입력되는데 한글문자인식기 1은 자모 입력계열에서 한 문자로 조합 가능한 자모계열만을 인식, 검출해내는 기능과 구성을 이루고 있어 한글문자인식기의 출력코드는 문자로서 인식된 자모개수 만의 코드정보로 입력당시의 코드 즉 KS C 5636 코드값을 그대로 유지하고 있다. 연속 입출력된 자모입력계열리 한글문자인식기의 처리를 거치면 문자경계를 가진 문자입력계열정보를 지나게 되고, 이와 같은 문자계열이 한글문자위상검출기로 입력되면 7단위 입력코드가 명세서 제 2 도의 한글자모 코드 테이블에 정의한 것과 같은 8단위 부호체계로 바뀌게 된다. 이와 같은 코드변환은 KS C5636코드가 7단위 코드라할지라도 비이트 단위이므로 MSB의 정보를 이용하지 않을 뿐 8단위 부호화 길이는 다르지 않다. 다만 b b b의 비트정보를 제 2 도의 비트값으로 같게 비트연산처리를 행하는 곳이 문자위상검출기의 기능이므로 제 2 도의 한글자모코드 테이블 4를 문자위상검출기 2에서 쓰기 시작한다. 한글문자위상검출기 2부터의 출력코드는 1바이트 중의 b 즉 MSB까지의 정보를 전부 이용하는 효과를 가지며, 이 8단위 부호가 한글기하코드 생성기 3으로 입력되어 나머지 5비트의 정보를 일치시켜 출력하게 되는데 이 최종코드는 한글문자표시를 위한 폰트테이블의 어드레스(address)와 일치한다.This code data is input to the Hangul Character Recognizer 1, and the Hangul Character Recognizer 1 recognizes and detects only the Jamo sequences that can be combined into one character in the Jamo input sequence.The output code of the Hangul Character Recognizer is recognized as a character. The code information of the number of letters only keeps the code at the time of input, that is, the KS C 5636 code value. When the input / output of continuous input / output of Jamo input system is processed by Hangul character recognizer, it passes through the character input sequence information having a character boundary. When such a character sequence is input to the Hangul character phase detector, the 7 unit input code is the Hangul Jamo code of FIG. You will be replaced with the 8-unit code system defined in the table. In this code conversion, even if the KS C5636 code is a 7-unit code, it does not use information of the MSB, but the 8-bit coding length is not different. However, since the bit operation of the bit information of b b b as the bit value of FIG. 2 is performed by the character phase detector, the Hangul Jamo code table 4 of FIG. 2 is started by the character phase detector 2. The output code from Hangul character phase detector 2 has the effect of using all the information up to b, i.e., MSB, in one byte, and this 8-unit code is input to the Hangul geometry code generator 3 to match the remaining 5 bits of information. This final code matches the address of the font table for displaying Korean characters.

위의 과정을 도형적으로 표시하면 다음과 같다.The above process is represented graphically as follows.

따라서 제 2 도의 한글자모코드 테이블은 한글문자위상검출기 2의 처리과정과 한글기하코드 생성기 3의 처리과정에서 공통으로 사용된다. 구체적인 예를들어 각 구성요소 간의 처리연결과정을 설명하면 다음과 같다.Therefore, the Hangul Jamocode Table of FIG. 2 is commonly used in the process of Hangul character phase detector 2 and the process of Hangul geometric code generator 3. As a specific example, the process connection process between each component will be described as follows.

'ㄴ ㅏ ㄹ ㅏ(00100000, 01000110, 10010010, 01000110 : KS C 5636의 코드 값)'이 한글문자인식기로 입력되면 소정의 처리를 거쳐 '나라(0010000001000110, 1001001001000110)'로 두 음절의 문자로 인식되어 각각 2바이트 씩의 경계를 가긴 코드정보로 변환한다. 이 코드에는 어떻게 모아서 표시하라는 표시정보가 포함되어 있지 않으므로 표시정보를 부가하기 위하여 첫 번째 처리과정인 위상검출기로 입력된다. 위상검출기의 처리과정이 끝나면 제 1 형으로 '나라(0010000001000000, 10010000001000000)'으로 코드변환과정을 거쳐 출력된다. 이 코드정보에는 제 1 형이라는 위상정보만을 가지고 있을 뿐 'ㄴ'이나 'ㅏ' 가 어떤 기하데이터를 가져야 한다는 정보를 가지고 있지 않으므로 한글기하코드 생성기의 처리를 거쳐야 한다. 한글기하코드생성 처리과정을 거치면 '나라(0100000011001000, 1010000011001000)'로 코드변화되어 출력되는데 이 코드가 따라 미련되어 있는 한글자모폰트의 어드레스가 되어 그 어드레스 속에 저장되어 있는 한글자모폰트가 디스플레이되거나 프린트되는 것이다.If 'B ㅏ ㄹ ㅏ (00100000, 01000110, 10010010, 01000110: Code value of KS C 5636)' is inputted into Hangul character recognizer, it is recognized as 'Nara (0010000001000110, 1001001001000110)' as two characters The code information is converted into code information having a boundary of 2 bytes each. Since the code does not include display information for collecting and displaying the code, it is input to the phase detector, which is the first process to add the display information. After the process of the phase detector is finished, the code is converted to 'country (0010000001000000, 10010000001000000)' as the first type and then output. This code information has only the topological information of type 1 and does not have any information that 'b' or 'ㅏ' should have any geometric data. Therefore, it must be processed by the Korean geometric code generator. When the Hangeul geometric code generation process is performed, the code is converted into 'Nara (0100000011001000, 1010000011001000)', and this code is followed by the address of the Hangul alphabet font, and the Hangul alphabet font stored in the address is displayed or printed. will be.

제 5 도는 본 발명의 정보교환용 코드로서의 일 실시예를 나타낸다. 제 5 도의 구성은 제 1 도에 도시된 본 발명장치 5와 그 출력코드 C1, DIS 10646의 국제공동문자판(BMP : Basic Multilingnal Plan)과 BMP의 출력코드 C2로 되어있다.5 shows one embodiment as an information exchange code of the present invention. The configuration of FIG. 5 is composed of the apparatus 5 of the present invention shown in FIG. 1, its output code C1, the basic multilingnal plan (BMP: Basic Multilingnal Plan) of DIS 10646, and the output code C2 of BMP.

제 5 도의 BMP에 본 발명의 한글자모토드테이블의 메모리공간, 번지공간을 그대로 매핑하여 이용하면 코드표현과 이용도에서 가장 효율이 높게된다.If the memory space and address space of the Hangul alphabet table of the present invention are mapped to the BMP of FIG. 5 as it is, the efficiency is high in code expression and usage.

제 6 도는 DIS 10646 BMP에 본 발명의 코드테이블을 배치한 경우이다.6 shows a case where the code table of the present invention is arranged in the DIS 10646 BMP.

DIS 10646의 BMP는 논리적, 물리적 공간크기가 256×256바이트로 한자모딩 2바이트로 표현된다. 따라서 본 발명의 한글자모코드테이블을 BMP의 한행에 배치했을 경우, 1바이트 크기의 자모표현이 2바이트의 표현으로 되나 1바이트의 한글자모를 BMP의 크기에 상관없이 그대로 1바이트의 출력코드로 표현하는 제어법은 다음과 같다.The BMP of DIS 10646 is represented by two-byte Chinese character modalities with a logical and physical space size of 256 × 256 bytes. Therefore, when the Hangul Jamo code table of the present invention is arranged on one line of BMP, the one-byte Jamo representation is represented by two bytes, but one-byte Hangul alphabet is represented as one-byte output code regardless of BMP size. The control method is as follows.

정보교환용 코드방식을 제 6 도와 같이 BMP의 A영역중 1행을 할당받아 한글자모의 집합으로 정의했을 때, 사용테이블의 지정을 첫 바이트의 코드로 초기치를 설정한 후 위상구조 3비트를 1행내의 위상영역으로 링크(link)하는 제어코드로 사용할 수 하고 이어서 나머지 5비트도 한글자모영역과 대응하는 두 번째 바이트의 번지공간을 그대로 이용하여 출력코드로 사용하면 실제의 코드영역은 2바이트 표현이나 한글자모당 1바이트로 통신이 가능해지게 되는 것이다.As in the 6th method of information exchange code method, when 1 row of A area of BMP is allocated and defined as a set of Korean alphabets, the initial value of the designation table is set to the code of the first byte. It can be used as a control code to link to the phase area in a row, and then the remaining 5 bits can be used as the output code using the address space of the second byte corresponding to the Hangul alphabet area as it is. However, communication will be possible with one byte per Hangul alphabet.

이상에서 보는 바와 같이 본 발명의 한글문자생성기의 구성원리는 한글문자구조의 구조특성과 잘 조화를 이루며 문자의 코드표현력, 정보교환코드로서의 본 발명의 특징은 다음과 같다.As described above, the members of the Hangul character generator of the present invention harmonize well with the structural characteristics of the Hangul character structure, and the characteristics of the present invention as code representation power and information exchange code of a character are as follows.

문자정보 교환의 통신환경에는 7비트계가 기본이며, 그 확장으로 8비트계가 있으므로, 7, 8비트계에서 업 다운(up-down) 호환성을 유지하는 문자코드체계가 가장 효율적이고 바람직하게 되는데 본 발명장치의 코드 표현은 7, 8비트계에서 호환적 이용이 가능하다. 또 조합형, 완성형 그 밖의 입력코드 또는 입력장치에서의 특징 즉 2벌식, 3벌식 자판형식 또는 출력코드 등에 제한을 두지 않는다.In the communication environment of the character information exchange, the 7-bit system is the basic, and since the 8-bit system is extended, the character code system that maintains up-down compatibility in the 7, 8-bit system is most efficient and desirable. The code representation of the device is compatible with the 7 and 8 bit systems. In addition, no limitation is imposed on the combination type, completion type or other input code or characteristics of the input device such as double type, triple type keyboard type or output code.

한편 문자코드확장방식에는 ISO-2022의 국제규격와 현재 개발도상에 있는 DIS 10646이 있는데 이 두방식의 차이는 코드영역에 대한 세계각국의 문자 할당영역과 방법에 있다.On the other hand, the character code extension method includes the international standard of ISO-2022 and DIS 10646, which is currently in development. The difference between the two methods lies in the character assignment area and method of each country in the world for the code area.

ISO-2022의 문자코드 영역의 할당은 256×256의 코드영역에 각 나라문자의 종류와 수의 특징에 맞도록 싱글바이트계는 97(7비트), 96(8비트)로 나누고 멀티바이트용으로 94×94, 96×96(2바이트계)가 있으나, 한글 한자 영역으로는 ISO-2022의 D영역에 94×94가 할당되어 있다.The allocation of the character code area of ISO-2022 is divided into 97 (7 bits) and 96 (8 bits) in the single byte system in accordance with the characteristics of each country character type and number in the 256 × 256 code area. There are 94x94 and 96x96 (2 byte systems), but 94x94 is assigned to the D area of ISO-2022 as the Hangul kanji area.

본 발명의 코드영역의 크기는 256바이트이므로 ISO-2022코드의 C영역중 96×3의 영역을 잡아 본 발명의 한글코드테이블을 그대로 옮기면 한글코드테이블의 번지는 절대번지가 되고 그것에 대응하는 각 코드 영역의 상대번지는 바로 ISO-2022의 규격에 따른 교환용 코드가 된다. 물론 싱글방트계인 본 발명코드계는 ISO-2022의 2바이트계로 한자모당 2바이트로 늘어나는 계산이 되나 실제로는 1바이트계 그대로 전송할 수 있다.Since the size of the code area of the present invention is 256 bytes, if the area of the Korean code table of the present invention is taken as 96 × 3 in the C area of the ISO-2022 code, the address of the Hangul code table becomes an absolute address, and each code corresponding thereto is The relative address of the area is the replacement code according to the standard of ISO-2022. Of course, the present invention code system, which is a single-band system, is calculated to increase to 2 bytes per kanji by the 2-byte system of ISO-2022.

반면 DIS 10646을 대응했을 경우는 DIS의 BMP의 256×256중 A영역의 한행을 할당받아 본 발명방식의 코드를 그대로 매핑시키면 1행의 크기로 할당이 가능하며 따라서 ISO-2022의 방식도바든 실재 코드테이블의 제어에 있어 효율적 방식이다.On the other hand, when DIS 10646 is applied, if one line of area A is allocated among 256 × 256 of BMP of DIS and the code of the present invention is mapped as it is, the size of one line can be allocated. Therefore, the ISO-2022 method is also present. It is an efficient way to control code tables.

상기와 같이 본 발명은 장치기능, 코드표현의 변경없이 ISO-2022 및 DIS 10646방식에 적용할 수 있는 것이다.As described above, the present invention can be applied to ISO-2022 and DIS 10646 without changing device functions and code expressions.

본 발명장치는 선행기술에 비하여 출력코드의 크기, 코드표현력, 코드생성에서 이용하는 정보표현, 장치구성의 기능동작에 특징이 있는 것으로 성행기술에서는 2바이트 고정길이로 한글문자를 코드표현하며 출력코드는 문자간의 식별정보만으로 구성되어 있음에 비하여 본 발명방식에서는 한글문자코드 표현은 2바이트 혹은 3바이트의 가변길이이나 한글문자는 자＋모, 자＋모, ＋자 등의 두 유형이 구성원리이기 때문에 문자구성요소와 코드구성요소의 구조적 대응성을 유지하므로 코드응용 기술에서 처리의 효율이 높다.Compared to the prior art, the apparatus of the present invention is characterized by the size of the output code, code expression power, information expression used in code generation, and functional operation of the device configuration. In the present invention, the Hangul character code expression is composed of two bytes or three bytes of variable length, but the Hangul characters are characters + character, character + character, character + character. Since the structural correspondence between the character element and the code element is maintained, the processing efficiency is high in the code application technology.

또 조합형의 코드생성계에서는 조합형 코드를 문자조합의 내부표현으로 사용하나 본 방식에 비해 구조적 체계 기술이 아니며 장치구성의 기능분리성, 확장성에서 볼 때도 본 방식이 뛰어나다.Combination type code generation system uses combinatorial code as internal representation of character combination, but it is not structural system technology compared to this method, and it is excellent in terms of function separation and extensibility of device configuration.

그 결과 본 발명에 의한 코드체계는 기존의 코드체계와의 호환성, 문자표시기능의 편의성, 국제규격의 정보통신시스템에의 적용성, 새로운 문자코드기법에 따른 처리기술 대응성을 보증할 수 있게 된다.As a result, the code system according to the present invention can guarantee compatibility with existing code systems, convenience of character display function, applicability to international standard information communication system, and correspondence of processing technology according to new character code technique. .

Claims

Hangul Character Recognizer that inputs Hangul characters and recognizes them as Hangul character codes by inputting Hangul characters and outputs the information type code representation in character form. , The Korean character phase structure detector 2 which extracts the phase structure of the character by referring to the Korean alphabet code table 4 from the output code representation, the Korean alphabet code table 4 which simultaneously represents the phase information and the geometric information, and the desired character code Hangul character code generator, characterized in that consisting of the geometric character generator 3 to output.

The Hangul Jamo code table 4, which simultaneously represents the geometric information of the topological information, has a spacing and a non-spacing information representation as the identification, discrimination information, and display control functions of the Hangul alphabet. Hangul character code generator.