JPH036657A - Word dictionary registering device - Google Patents

Word dictionary registering device

Info

Publication number
JPH036657A
JPH036657A JP1140619A JP14061989A JPH036657A JP H036657 A JPH036657 A JP H036657A JP 1140619 A JP1140619 A JP 1140619A JP 14061989 A JP14061989 A JP 14061989A JP H036657 A JPH036657 A JP H036657A
Authority
JP
Japan
Prior art keywords
word
speech
dictionary
kanji
kana
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP1140619A
Other languages
Japanese (ja)
Inventor
Takahiro Mizutani
水谷 貴広
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NEC Solution Innovators Ltd
Original Assignee
NEC Software Chubu Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by NEC Software Chubu Ltd filed Critical NEC Software Chubu Ltd
Priority to JP1140619A priority Critical patent/JPH036657A/en
Publication of JPH036657A publication Critical patent/JPH036657A/en
Pending legal-status Critical Current

Links

Abstract

PURPOSE:To attain the batch conversion of the stem and the inflectional ending of a word for each paragraph by registering the proper types of parts of speech to the words registered by users. CONSTITUTION:An input control part 11 inputs the KANJI (Chinese character) description and the KANA (Japanese syllabary) reading of a word to be registered via a character input device 2. A dictionary retrieving part 12 reads the corresponding KANA reading and type of the part of speech out of a word dictionary 3 for the KANJI description shown by a part-of-speech analyzing part 13. The part 13 obtains the stem and the type of the part of speed for the KANJI description and the KANA reading to be registered based on the KANJI description and the KANA reading of the input word and the information obtained via the part 12. A part-of-speech register part 14 registers the KANJI description, the KANA reading, and the type of the part of speech of the produced word into the word dictionary 3. Thus it is possible to convert en bloc the stems and the inflectional endings of those words having the proper parts of speech and given previously for each paragraph.

Description

【発明の詳細な説明】 〔産業上の利用分野〕 本発明は、単語の仮名読みを漢字に変換する仮名漢字変
換装置において使用される単語辞書に対して、単語の登
録を行う単語辞書登録装置に関する。
[Detailed Description of the Invention] [Field of Industrial Application] The present invention is a word dictionary registration device that registers words in a word dictionary used in a kana-kanji conversion device that converts kana readings of words into kanji. Regarding.

〔従来の技術〕[Conventional technology]

従来、単語辞書への利用者忙よる単語の登録は、単語辞
書登録装置が利用者より単語の漢字表記と仮名読みを受
は取り、受は取ったままの漢字表記と仮名読み、および
単語の品詞種別として名詞を固定的に割り当てることK
より行なっている。
Conventionally, when registering a word in a word dictionary, the word dictionary registration device receives the kanji transcription and kana reading of the word from the user, and then receives the original kanji transcription, kana reading, and the word's original reading. Fixedly assigning a noun as a part of speech typeK
I'm doing more.

〔発明が解決しようとする課題〕[Problem to be solved by the invention]

上述した従来の単語辞書登録装置では、単語の品詞種別
として名詞が登録されることから、動詞や形容詞などの
名詞以外の品詞種別を持つ単語を登録しようとした場合
に単語が本来持つ品詞種別で登録されず、このために登
録された単語を仮名漢字変換装置において変換を行うと
、品詞種別として名詞が登録されているために動詞や形
容詞として扱えず、予め提供された適切な品詞を持つ単
語のように単語の語幹と活用語尾を文節単位に−括して
変換できない欠点がある。
In the conventional word dictionary registration device described above, nouns are registered as the part-of-speech type of a word, so when an attempt is made to register a word with a part-of-speech type other than a noun, such as a verb or adjective, the word's original part-of-speech type is not registered. When a word that is not registered and is converted using a kana-kanji conversion device is converted into a word that cannot be treated as a verb or adjective because a noun is registered as the part of speech type, the word with the appropriate part of speech provided in advance It has the disadvantage that the stem and conjugated ending of a word cannot be converted in units of clauses, as in the case of ``-''.

〔課題を解決するための手段〕[Means to solve the problem]

本発明による単語辞書登録装置は、登録する単語の漢字
表記と仮名読みを入力する入力制御部と、単語辞書から
単語の情報を読み取る辞書検索部と、入力制御部を用い
て入力された単語の漢字表記と仮名読み、および辞書検
索部を用いて得た情報をもとに単語の品詞別を分析する
品詞分析部と、品詞分析部により求められた単語の品詞
種別と漢字表記と仮名読みを単語辞書へ登録する辞書登
録部を有している。
The word dictionary registration device according to the present invention includes an input control unit for inputting the kanji notation and kana pronunciation of the word to be registered, a dictionary search unit for reading word information from the word dictionary, and a word dictionary registration device for inputting words using the input control unit. A part-of-speech analysis section that analyzes the part of speech of a word based on information obtained using the kanji notation, kana reading, and dictionary search section, and a part-of-speech type, kanji notation, and kana reading of the word determined by the part-of-speech analysis section. It has a dictionary registration section for registering words in a dictionary.

〔実施例〕〔Example〕

次に1本発明について図面を参照して詳細に説明する。 Next, one embodiment of the present invention will be explained in detail with reference to the drawings.

第1図は、本発明の一実施例の構成を示すブロック図で
ある。
FIG. 1 is a block diagram showing the configuration of an embodiment of the present invention.

本実施例の単語辞書登録装置1は、入力制御部11と、
辞書検索部12と、品詞分析部13と、辞書登録部14
から構成され、文字入力装置2と、単語辞書3とが接続
される。
The word dictionary registration device 1 of this embodiment includes an input control section 11,
Dictionary search unit 12, part of speech analysis unit 13, and dictionary registration unit 14
A character input device 2 and a word dictionary 3 are connected.

入力制御部11では文字入力装置2を用いて登録する単
語の漢字表記と仮名読みの入力を行う。
The input control unit 11 uses the character input device 2 to input the kanji notation and kana reading of the word to be registered.

辞書検索部12では品詞分析部13より示される漢字表
記に該当する仮名読みと品詞種別を単語辞書3から読み
取る。品詞分析部13では入力制御部11を用いて入力
された単語の漢字表記と仮名読み、および辞書検索部1
2を用いて得られる情報から登録する漢字表記と仮名読
みの語幹と品詞種別を割り出す。辞書登録部14は品詞
分析部13で作成された単語の漢字表記と仮名読みと品
詞種別を単語辞書3へ登録する。
The dictionary search unit 12 reads the kana reading and part of speech type corresponding to the kanji notation indicated by the part of speech analysis unit 13 from the word dictionary 3. The part-of-speech analysis unit 13 uses the input control unit 11 to determine the kanji notation and kana reading of the input word, and the dictionary search unit 1
2. Determine the stem and part of speech of the kanji notation and kana reading to be registered from the information obtained using 2. The dictionary registration unit 14 registers the kanji notation, kana reading, and part of speech type of the word created by the part of speech analysis unit 13 in the word dictionary 3.

第2図は、単語辞書登録装置1の動作手力頁を示すフロ
ーチャートである。
FIG. 2 is a flowchart showing the operation manual page of the word dictionary registration device 1.

第2図を用いて単語辞書登録装置1の動作を説明する。The operation of the word dictionary registration device 1 will be explained using FIG.

初めに、入力制御部11が文字入力装置2を用いて、登
録する単語の漢字表記と仮名読みを入力させる(STE
P2−01)。次に、入力された漢字表記と仮名読みか
ら品詞分析部13が品詞種別を分析する(STEP2−
11〜5TEP2−42)。
First, the input control unit 11 uses the character input device 2 to input the kanji notation and kana reading of the word to be registered (STE
P2-01). Next, the part-of-speech analysis unit 13 analyzes the part-of-speech type based on the input kanji notation and kana reading (STEP 2-
11-5TEP2-42).

品詞分析部13では、まず、4L語が名詞か否かを判断
するために漢字表記に送り仮名が含まれているかを確認
する(STEP2−11)。漢字表記に送り仮名が含ま
れていなければ品詞種別を名詞と判断しく5TEP2−
21)、漢字表記に送り仮名が含まれていれば品詞種別
は名詞以外と判断する。
The part-of-speech analysis unit 13 first checks whether the 4L word is a noun or not, and whether or not the kanji notation includes okigana (STEP 2-11). If the kanji notation does not include okukana, the part of speech type should be determined as a noun. 5TEP2-
21), if the kanji notation includes okurikana, the part of speech type is determined to be other than noun.

例えば、漢字表記が「出願」、仮名読みが「しゅつがん
」であれば、漢字表記の末尾は仮名でないので送り仮名
がないと判断し、品詞種別は名詞とする。よって、登録
する単語の漢字表記は「出願」、仮名読みは「しゆつが
ん」、品詞種別は名詞となる。また、漢字表記が「読込
む」であれば、末尾が仮名の「む」であるので送り仮名
があると判断し、品詞種別は名詞以外と見なして分析を
進める。
For example, if the kanji notation is "application" and the kana reading is "shutsugan", the last part of the kanji notation is not a kana, so it is determined that there is no okigana, and the part of speech type is set to noun. Therefore, the kanji notation of the word to be registered is "application," the kana reading is "shiyutsugan," and the part of speech is noun. Furthermore, if the kanji notation is ``read'', the ending is the kana ``mu'', so it is determined that there is an okrigana, and the part of speech type is considered to be other than a noun, and the analysis proceeds.

品詞種別を名詞以外と判断した賜金、登録する単語が複
合語であれば単語辞書3が既に有している単語の品詞種
別を参照し割り出す。登録する単語が複合語でないなら
ば、送り仮名に含まれる語用語尾から品詞種別を割り出
す。
If the word to be registered is a compound word whose part-of-speech type is determined to be other than a noun, it is determined by referring to the part-of-speech type of the word already held in the word dictionary 3. If the word to be registered is not a compound word, the part of speech type is determined from the word ending included in the okurikana.

登録しようとする単語が既に単語辞書3が有する単語を
語末に持つ複合語であるかを確認する、すなわち、登録
しようとする漢字表記と仮名読みの末尾と一致する漢字
表記と仮名読みを持つ単語を辞書検索部12を用いて単
語辞書3から検索し、検索できたならば複合語であり、
検索できなければ複合語で4まないと判断する(STE
P2−12)。
Check whether the word you are trying to register is a compound word that has a word already included in the word dictionary 3 at the end of the word, that is, a word whose kanji notation and kana reading match the ending of the kanji notation and kana reading you are trying to register. is searched from the word dictionary 3 using the dictionary search unit 12, and if it can be searched, it is a compound word,
If it cannot be searched, it is determined that it is not a compound word (STE
P2-12).

複合語であれば、検索した単語の品詞種別を登録する品
詞種別とじ(STEP2−31)、検索した単語の漢字
表記と仮名読みをもとに登録しようとする漢字表記と仮
名読みから活用語尾を取り除き語幹のみにする(STE
P2−32)。
If it is a compound word, the part of speech type is registered (STEP 2-31) to register the part of speech type of the searched word, and the conjugated ending is determined based on the kanji notation and kana reading of the searched word. Remove and leave only the stem (STE)
P2-32).

複合語でなければ、予め記憶している各品詞種別に接続
する一般的な活用語尾の中から漢字表記の送り仮名に含
まれるものを抽出し、抽出した活用語尾が接続しつる品
詞種別を登録する品詞別と判断しく5TEP2−41)
、抽出した活用語尾をもとに登録しようとする漢字表記
と仮名読みから活用語尾を取り除き語幹のみにする(S
TEP2−42)。
If it is not a compound word, extract those included in the kanji-written okurikana from common conjugated endings that connect to each pre-memorized part of speech type, and register the part of speech type that the extracted conjugated endings connect to. 5TEP2-41)
, remove the conjugated endings from the kanji notation and kana reading to be registered based on the extracted conjugated endings, leaving only the stem (S
TEP2-42).

例えば、登録しようとする単語の漢字表記[読込む」、
仮名読みが「よみこむ」の場合、単語辞書3から単語の
末尾に相当する「込む」(辞書には「込む」の語幹であ
る、漢字表1ピが「込」、仮名読みが「こ」、品詞種別
が動詞で登録されている。「込」は動詞なので「込む」
を活用し「読込む」の末尾と一致する)が検索できたな
らば、「読込む」は複合語と見なし、登録する単語の漢
字表記は「読込」、仮名読みは「よみこ」、品詞種別は
動詞となる。
For example, the kanji notation [read] of the word you are trying to register,
If the kana reading is ``yomikomu'', the word dictionary 3 shows ``kumu'' which corresponds to the end of the word (the dictionary contains the root of ``kumu'', the kanji table 1 is ``kumi'', and the kana reading is ``ko''). , the part-of-speech type is registered as a verb. "Kumu" is a verb, so "Kumu"
If you are able to search for ``read'' (which matches the ending of ``read''), then ``read'' is considered a compound word, and the kanji notation of the word to be registered is ``read'', the kana reading is ``yomiko'', and the part of speech is The type is verb.

最後に、品詞分析部13により得られた漢字表記と仮名
読みの語幹と品詞種別を辞1f種別を辞書登録部14を
用いて単語辞書3へ登録する(STEP2−02)。
Finally, the kanji notation, the stem of the kana reading, and the part of speech type obtained by the part-of-speech analysis unit 13 are registered into the word dictionary 3 using the dictionary registration unit 14 (STEP 2-02).

〔発明の効果〕〔Effect of the invention〕

以上説明したように本発明は、利用者により登録される
単語に対して適切な品詞種別が登録されることにより、
登録された単語を仮名漢字変換装置において変換を行う
と、予め提供されている適切な品詞種別を持つ語と同様
に単語の語幹と活用語尾を文節単位に一括して変換でき
、変換の効率を向上させる効果がある。
As explained above, the present invention enables the user to register appropriate part-of-speech types for words registered by the user.
When registered words are converted using a kana-kanji conversion device, the stems and conjugated endings of the words can be converted at once for each clause, similar to the words with the appropriate part of speech type provided in advance, increasing the efficiency of conversion. It has the effect of improving

【図面の簡単な説明】[Brief explanation of the drawing]

第1図は本発明の一実施例の構成を示すブロック図、第
2図は単語辞書登録装置1の動作手順を示すフローチャ
ートである。 1・・・・・・単語辞書登録装置、2・・・・・・文字
入力装置、3・・・・・・単語辞書、11・−・・・・
入力制御部、12・・・・・・辞書検索部、13・・・
・・・品詞分析部、14・・・・・・辞書登録部。
FIG. 1 is a block diagram showing the configuration of an embodiment of the present invention, and FIG. 2 is a flowchart showing the operating procedure of the word dictionary registration device 1. As shown in FIG. 1...Word dictionary registration device, 2...Character input device, 3...Word dictionary, 11...-...
Input control unit, 12... Dictionary search unit, 13...
...Part of speech analysis department, 14...Dictionary registration department.

Claims (1)

【特許請求の範囲】[Claims] 単語辞書へ登録する単語の漢字表記と仮名読みを入力す
る入力制御部と、前記単語辞書から単語の情報を読み取
る辞書検索部と、前記入力された単語の漢字表記と仮名
読み、および前記得られた単語の情報をもとに単語の品
詞種別を分析する品詞分析部と、該品詞分析部により求
められた単語の品詞種別と漢字表記と仮名読みを前記単
語辞書へ登録する辞書登録部を有し、登録する単語の品
詞種別として漢字表記と仮名読みから適切な品詞種別を
分析し割り当てることを特徴とする単語辞書登録装置。
an input control unit that inputs the kanji notation and kana reading of the word to be registered in the word dictionary; a dictionary search unit that reads word information from the word dictionary; the kanji notation and kana reading of the input word; a part-of-speech analysis unit that analyzes the part-of-speech type of a word based on information about the word, and a dictionary registration unit that registers the part-of-speech type, kanji notation, and kana reading of the word determined by the part-of-speech analysis unit to the word dictionary. A word dictionary registration device that analyzes and assigns an appropriate part of speech type of a word to be registered from kanji notation and kana reading.
JP1140619A 1989-06-02 1989-06-02 Word dictionary registering device Pending JPH036657A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP1140619A JPH036657A (en) 1989-06-02 1989-06-02 Word dictionary registering device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP1140619A JPH036657A (en) 1989-06-02 1989-06-02 Word dictionary registering device

Publications (1)

Publication Number Publication Date
JPH036657A true JPH036657A (en) 1991-01-14

Family

ID=15272921

Family Applications (1)

Application Number Title Priority Date Filing Date
JP1140619A Pending JPH036657A (en) 1989-06-02 1989-06-02 Word dictionary registering device

Country Status (1)

Country Link
JP (1) JPH036657A (en)

Similar Documents

Publication Publication Date Title
JPH0433069B2 (en)
JPH036657A (en) Word dictionary registering device
JPH03116375A (en) Information retriever
JP2621999B2 (en) Document processing device
JPS613268A (en) Kana and kanji conversion processor
JP2715419B2 (en) Translation equipment
JPH04372047A (en) Kana/kanji converter
JPS59103136A (en) Kana (japanese syllabary)/kanji (chinese character) processor
JPH0668070A (en) Compound word dictionary registering device
JPS595335A (en) Japanese language input device
JPH0468466A (en) Kana / kanji converting device
JPH03116265A (en) Kana/kanji converter
JPS63136264A (en) Mechanical translating device
JPH02110771A (en) Electronic translation device
JPH08171568A (en) Multilingual input method
JPS60225972A (en) Switching device of clause inputting level
JPH0816910B2 (en) Language analyzer
JPS63133228A (en) Information extracting device
JPS58127230A (en) Kanji (chinese character)-kana (japanese syllabary) converter
JPH0727526B2 (en) Kana-Kanji converter
JPH02140869A (en) Sentence structure analyzing method
JPH0385671A (en) Document preparation supporting device
JPH03240876A (en) Machine translation device
JPH0394367A (en) Japanese input system
JPH08241315A (en) Word registering mechanism for document processor