JPH06161995A - Method and device for shaping name data - Google Patents

Method and device for shaping name data

Info

Publication number
JPH06161995A
JPH06161995A JP4318876A JP31887692A JPH06161995A JP H06161995 A JPH06161995 A JP H06161995A JP 4318876 A JP4318876 A JP 4318876A JP 31887692 A JP31887692 A JP 31887692A JP H06161995 A JPH06161995 A JP H06161995A
Authority
JP
Japan
Prior art keywords
name
surname
data
characters
character
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP4318876A
Other languages
Japanese (ja)
Inventor
Shingo Yudasaka
新吾 湯田坂
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Seiko Epson Corp
Original Assignee
Seiko Epson Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Seiko Epson Corp filed Critical Seiko Epson Corp
Priority to JP4318876A priority Critical patent/JPH06161995A/en
Publication of JPH06161995A publication Critical patent/JPH06161995A/en
Pending legal-status Critical Current

Links

Landscapes

  • Machine Translation (AREA)
  • Document Processing Apparatus (AREA)

Abstract

PURPOSE:To accurately perform last name punctuation by obtaining the number of characters of a full name from full name data, obtaining the number of words and word length by retrieving a last name dictionary, and punctuating the full name data between a last name and a first/middle name by a last name punctuation means. CONSTITUTION:The data of the full name, for example, (Shinichi Kobayakawa) is inputted to an original data buffer 10, and the data are taken out by a full name data acquisition means 20. After that, last name dictionary retrieval 30 is performed as comparing the last name dictionary with the full name data, and a result in which the full name is segmented into (ko/hayakawa/shinichi) can be obtained. At this time, the number 3 of words is obtained by a word number acquisition 50, and the word length 1:2:2 is obtained by a word length acquisition means 40. In such a case, the number 5 of full name characters can be obtained immediately from the full name data acquisition 20 by a full name character number acquisition means 60. The calculation 70 of the number of last name characters can be performed based on the data in the word number acquisition 50, the word length acquisition 40, and the full name character number acquisition 60, and the last name punctuation 80 is performed by the obtained number of last name characters, then, it is outputted to a data buffer 90 after shaping.

Description

【発明の詳細な説明】Detailed Description of the Invention

【0001】[0001]

【産業上の利用分野】本発明は、姓と名を区別していな
い連続した氏名データを、姓と名に区切った文字列に自
動的に変換する氏名データ整形方法および装置に関す
る。
BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a name data shaping method and apparatus for automatically converting continuous name data without distinction between first name and last name into a character string delimited by first name and last name.

【0002】[0002]

【従来の技術】従来は、特開平4−33053号公報に
示すように、氏名データの文字数から統計的に見て、例
えば4文字の名前であれば姓文字数を2文字にして姓:
名を2:2に決定するか、あるいは全く別の方法とし
て、姓名辞書から一致する姓名を探し出して姓名区切り
を行っていた。前者の方法は正確性に欠けていたし、後
者の方法では辞書容量が膨大なものになってしまうとい
う欠点があった。
2. Description of the Related Art Conventionally, as shown in Japanese Unexamined Patent Publication No. 4-33053, statistically viewed from the number of characters of name data, for example, in the case of a four-character name, the surname is set to two characters and the surname:
The first name was determined to be 2: 2, or as a completely different method, a matching first and last name was searched for from the first and last name dictionary, and the first and last names were separated. The former method lacked accuracy, and the latter method had the disadvantage of enormous dictionary capacity.

【0003】[0003]

【発明が解決しようとする課題】本発明の目的は、必要
最小限の姓名単語を格納した姓名辞書を使って姓名区切
りを行い、辞書に格納されていない姓名の場合でも、辞
書を検索して得られた情報を基に、従来よりも正確な姓
名区切りを行う、氏名データ整形方法および装置を提供
することである。
SUMMARY OF THE INVENTION An object of the present invention is to perform a surname surname separation using a surname and surname dictionary storing a minimum necessary surname and surname word, and search the dictionary even for surname and surname not stored in the dictionary. It is an object of the present invention to provide a name data shaping method and device for performing more accurate surname separation based on the obtained information.

【0004】[0004]

【課題を解決するための手段】本発明は、姓と名が連続
した文字列からなる氏名データと、姓名文字列と単漢字
を格納した姓名辞書とを備え、氏名データに一致する単
語を姓名辞書から検索し、検索で一致した単語の長さと
単語の数、および氏名データの文字数から、姓文字数を
算出して、氏名データを姓と名に区切ることを特徴とす
る。
Means for Solving the Problem The present invention comprises full name data consisting of a character string in which a family name and a given name are continuous, and a family name dictionary storing a family name character string and single kanji. It is characterized by searching the dictionary and calculating the number of surname characters from the length and number of words matched in the search and the number of characters of the name data, and dividing the name data into surname and surname.

【0005】さらに、本発明は、姓と名が連続した文字
列からなる氏名データを格納した氏名データ格納部と、
姓名文字列と単漢字を格納した姓名辞書を具備し、氏名
データ格納部から氏名データを取得する氏名データ取得
手段と、氏名データに一致する単語を姓名辞書から検索
する姓名辞書検索手段と、前記検索で一致した単語の長
さを取得する単語長取得手段と、前記検索で一致した単
語の数を取得する単語数取得手段と、氏名データの文字
数を取得する氏名文字数取得手段と、単語長、単語数、
氏名文字数から姓文字数を算出する姓文字数算出手段
と、氏名データを姓と名に区切る姓名区切り手段を持つ
ことを特徴とする。
Further, according to the present invention, a name data storage unit storing name data consisting of a character string in which a family name and a first name are consecutive,
A first and last name dictionary storing a first and last name character string and single kanji, a first and last name data acquisition means for obtaining first and last name data from a first and last name data storage part, a first and last name dictionary search means for searching a word matching the first and last name data, A word length acquisition unit that acquires the length of the matched word in the search, a word number acquisition unit that acquires the number of matched words in the search, a name character number acquisition unit that acquires the number of characters of the name data, and a word length, Number of words,
It is characterized by having a surname character number calculation means for calculating the surname character number from the full name character number and a surname and surname delimiter means for delimiting the name data into surname and surname.

【0006】[0006]

【作用】氏名データから氏名文字数を得、姓名辞書を検
索して単語数と単語長を得る。
[Function] The number of characters of the name is obtained from the name data, the number of words and the length of the word are obtained by searching the surname dictionary.

【0007】上記3つのデータ長から、姓文字数を算出
する。そして、姓名区切り手段によって、氏名データを
姓と名に区切る。この位置に空白1字を挿入すれば、氏
名データ、例えば「山田太郎」は、「山田 太郎」とい
う姓と名が区切られた形式に整形される。
The number of surname characters is calculated from the above three data lengths. Then, the surname and surname delimiter separates the name data into surname and surname. If a blank character is inserted at this position, the name data, for example, "Taro Yamada" will be formatted into a form in which the surname and first name "Taro Yamada" are separated.

【0008】[0008]

【実施例】図1は、本実施例の構成ブロック図である。
1が氏名データ整形装置である。元データバッファ10
に一例として、「小早川進一」という姓名のデータを入
力する。次に、元データバッファにあるデータを氏名デ
ータ取得手段20によって取り出す。姓名辞書と氏名デ
ータを比較しながら姓名辞書検索30を行い、「小/早
川/進一」と分離された結果を得る。ここで単語数取得
50において単語数3が取得され、単語長取得40にお
いて単語長1:2:2が取得される。ここで、上記氏名
データ取得20から氏名文字数取得60によって氏名文
字数5が直ちに得られる。単語数取得50、単語長取得
40、氏名文字数取得60の3種類のデータを基にし
て、姓文字数の算出70が行なわれる。
DESCRIPTION OF THE PREFERRED EMBODIMENTS FIG. 1 is a block diagram of the structure of this embodiment.
1 is a name data shaping device. Original data buffer 10
As an example, input the data of the first and last name "Shinichi Kobayakawa". Next, the name data acquisition means 20 extracts the data in the original data buffer. The surname and surname dictionary is searched while comparing the surname and surname dictionary with the name data to obtain a result separated from "small / Hayakawa / Shinichi". Here, the word number acquisition 50 acquires the word number 3, and the word length acquisition 40 acquires the word length 1: 2: 2. Here, the name character number 5 is immediately obtained from the name data acquisition 20 by the name character number acquisition 60. The number of surname characters is calculated 70 based on the three types of data: word number acquisition 50, word length acquisition 40, and name character number acquisition 60.

【0009】姓文字数算出70のステップは図3と図4
と図5に詳述される。姓文字数算出70で得られた姓文
字数により、姓名区切り80が行なわれる。姓名区切り
が行なわれたデータを整形後データバッファ90に出力
する。
The steps of calculating the number of surname characters 70 are shown in FIGS.
And detailed in FIG. The surname and surname delimiter 80 is performed based on the surname and character number obtained in the surname character number calculation 70. The data delimited by the family name is output to the data buffer 90 after shaping.

【0010】図2において、元データ欄(あ)行は、
「山田一郎」という場合の例である。姓名辞書検索の結
果、「山田/一郎」に分離され、整形後のデータとし
て、「山田」と「一郎」が得られる。この例題の場合
は、姓名辞書の検索結果だけで姓と名を区切ることがで
きる。また、単語数が2、単語長が2:2、氏名文字数
が4ということがわかる。
In FIG. 2, the original data column (A) line is
This is an example of the case of "Ichiro Yamada". As a result of the first and last name dictionary search, "Yamada / Ichiro" is separated, and "Yamada" and "Ichiro" are obtained as the data after shaping. In this example, the surname and given name can be separated only by the search result of the surname dictionary. Also, it can be seen that the number of words is 2, the word length is 2: 2, and the number of name characters is 4.

【0011】元データ欄(い)行は、「武浩」という場
合の例である。姓名辞書検索の結果では「武浩」として
分離されないが、本発明により、単語数が1、単語長が
2、氏名文字数が2ということがわかり、これらの数字
データを基にして後述する図3以降の本発明の説明によ
り整形され、整形後データとして「武」と「浩」が得ら
れる。
The original data column (I) line is an example of the case of "Takehiro". Although it is not separated as "Takehiro" in the result of the first and last name dictionary search, the present invention reveals that the number of words is 1, the word length is 2, and the number of name characters is 2. Based on these numerical data, FIG. According to the description of the present invention described above, “Take” and “Hiro” are obtained as the data after shaping.

【0012】元データ欄(う)行は、「鈴木浩之助」と
いう場合の例である。姓名辞書検索の結果では「鈴木/
浩之/助」と分離されてしまうが、本発明により、単語
数が3、単語長が2:2:1、氏名文字数が5というこ
とがわかり、これらの数字データを基にして後述する図
3以降の本発明の説明により整形され、整形後データと
して「鈴木」と「浩之助」が得られる。
The original data column (u) line is an example in the case of "Konosuke Suzuki". The result of the first and last name dictionary search is "Suzuki /
However, according to the present invention, it is found that the number of words is 3, the word length is 2: 2: 1, and the number of name characters is 5. Therefore, FIG. By the following description of the present invention, shaping is performed, and "Suzuki" and "Konosuke" are obtained as the shaped data.

【0013】元データ欄(え)行は、「東西山二郎」と
いう場合の例である。姓名辞書検索の結果では「東/西
山/二郎」と分離されてしまうが、本発明により、単語
数が3、単語長が1:2:2、氏名文字数が5というこ
とがわかり、これらの数字データを基にして後述する図
3以降の本発明の説明により整形され、整形後データと
して「東西山」と「二郎」が得られる。
The original data column (e) line is an example in the case of "Jiro Higashinishiyama". In the result of the first and last name dictionary search, it is separated from "Higashi / Nishiyama / Jiro", but according to the present invention, it is found that the number of words is 3, the word length is 1: 2: 2, and the number of name characters is 5. The data is shaped according to the description of the present invention from FIG. 3 onward based on the data, and “Higashi Nishiyama” and “Jiro” are obtained as the shaped data.

【0014】元データ欄(お)行は、「南北三太郎」と
いう場合の例である。姓名辞書検索の結果では「南/北
/三太郎」と分離されてしまうが、本発明により、単語
数が3、単語長が1:1:3、氏名文字数が5というこ
とがわかり、これらの数字データを基にして後述する図
3以降の本発明の説明により整形され、整形後データと
して「南北」と「三太郎」が得られる。
The original data column (O) line is an example in the case of "Santaro Namboku". In the result of the search of the first and last name dictionary, it is separated from “Minami / North / Santaro”, but according to the present invention, it is found that the number of words is 3, the word length is 1: 1: 3, and the number of name characters is 5. Based on the data, the data is shaped by the description of the present invention, which will be given later with reference to FIG. 3, so that “north-south” and “Santaro” are obtained as the shaped data.

【0015】元データ欄(か)行は、「上中下四郎」と
いう場合の例である。姓名辞書検索の結果では「上/中
/下/四郎」と分離されてしまうが、本発明により、単
語数が4、単語長が1:1:1:2、氏名文字数が5と
いうことがわかり、これらの数字データを基にして後述
する図3以降の本発明の説明により整形され、整形後デ
ータとして「鈴木」と「浩之助」が得られる。
The original data column (or) line is an example in the case of "Shiro Kaminakashita". In the result of the search of the surname and surname dictionary, "upper / middle / lower / shiro" is separated, but according to the present invention, it is found that the number of words is 4, the word length is 1: 1: 1: 2, and the number of name characters is 5. Based on these numerical data, shaping is performed by the description of the present invention after FIG. 3 described later, and “Suzuki” and “Honosuke” are obtained as the shaped data.

【0016】図3は、本発明によるフローチャートであ
る。図1の姓文字数算出70を詳述したものである。1
00から128はステップの番号である。図3の左半分
のフローチャートを図4を参照しながら説明する。氏名
データから姓名辞書を検索して、単語長と単語数を得る
(100、101、102、103、104)。また、
氏名データから氏名文字数を得る(107)。
FIG. 3 is a flowchart according to the present invention. 2 is a detailed description of the surname character number calculation 70 of FIG. 1
00 to 128 are step numbers. The flowchart of the left half of FIG. 3 will be described with reference to FIG. The surname dictionary is searched from the name data to obtain the word length and the number of words (100, 101, 102, 103, 104). Also,
The number of name characters is obtained from the name data (107).

【0017】図4のステップ105で単語の数が左欄の
略図に示す通り2つの場合は、右欄の略図に示す通り1
番目の単語が姓、2番目の単語が名であるとして、姓文
字数=1番目の単語長とし、図3のフローチャートで説
明すると、単語数=2がyesと判断105されて、1
番目の単語長を姓文字数に代入する106。図3の10
5において、単語数=2がnoの場合は氏名文字数が取
得される107。氏名文字数が2未満の場合は、姓と名
の文字数の合計が1文字ということは実際にはありえな
いので108、処理を終了する。108においてnoの
場合は、文字数=2、文字数=3、文字数=4、文字数
が6以上の場合に分けて判断していく。
If the number of words is two as shown in the schematic diagram in the left column in step 105 of FIG. 4, 1 as shown in the schematic diagram in the right column.
Assuming that the second word is the surname and the second word is the first name, the number of surname characters is the first word length. Explaining with the flowchart of FIG.
The 106th word length is substituted 106 for the number of surname characters. 10 of FIG.
In 5, when the number of words = 2 is no, the number of name characters is acquired 107. If the number of characters of the first name is less than 2, the total number of characters of the surname and the first name cannot be 1 character in practice, so the process is terminated. In the case of no in 108, the number of characters = 2, the number of characters = 3, the number of characters = 4, and the number of characters is 6 or more.

【0018】氏名文字数が2文字の場合、図4のステッ
プ109の左欄の略図と右欄の略図に示す通りである。
文字数=2においては、姓文字数に1を代入する10
9、110。
When the number of name characters is two, it is as shown in the schematic diagram in the left column and the schematic diagram in the right column of step 109 in FIG.
When the number of characters = 2, substitute 1 for the number of surname characters 10
9,110.

【0019】氏名文字数が3文字の場合は、図4のステ
ップ111の左欄の略図に示す通り、3文字一体の場合
と、3文字が別々の場合が考えられるが、両者とも統計
的にみて右欄の略図に示す通り2:1の分けかたにな
る。図3のステップ111でYESの場合に姓文字数に
2を代入する113。
When the number of characters of the name is three, as shown in the schematic diagram in the left column of step 111 in FIG. 4, there may be a case where the three characters are integrated or a case where the three characters are different. As shown in the schematic in the right column, the division is 2: 1. If YES in step 111 of FIG. 3, 2 is substituted for the number of surname characters 113.

【0020】氏名文字数が4文字の場合は、図4のステ
ップ112の左欄の略図に示す通り、4文字一体の場合
と、2文字の前後に1文字ずつ分離した場合と、4文字
全部分離した場合と、2文字が後半にある場合と、前半
にある場合の5種類の場合すべてが右欄の略図に示す通
り2:2の分けかたになる。図3のステップ112、1
13に示す通りである。氏名文字数が6文字以上の場合
は、実際の例が僅少であることと、分けかたのパターン
が多すぎて、処理時間がかかりすぎて処理能力が犠牲と
なるので処理を行なわないことにした(図3ステップ1
14)。氏名文字数が5文字の場合を図3の右半分のフ
ローチャートで図5を参照しながら説明する。
When the number of characters of the name is four, as shown in the schematic diagram in the left column of step 112 in FIG. 4, the case of four characters integrated, the case of separating one character before and after two characters, and the case of separating all four characters All of the five cases, that is, the case where the two characters are in the latter half and the case where the two characters are in the first half, are divided into 2: 2 as shown in the schematic diagram in the right column. Steps 112 and 1 in FIG.
As shown in 13. If the number of characters in the name is 6 or more, there are few actual examples, and there are too many patterns of division, and it takes too much processing time and sacrifices processing capacity. (Figure 3 Step 1
14). The case where the number of name characters is 5 will be described with reference to FIG. 5 in the flowchart on the right half of FIG.

【0021】図5のステップ115の左欄に示す通り、
3文字ブロックがきて次に1文字が2回連続した場合、
右欄の略図に示す通り3:2の分けかたになる。図3に
おいて1番目の単語長=3の場合ステップ115は、姓
文字数に3を代入する116。 図5のステップ117
の左欄に示す通り2文字ブロックが前後にきた場合と、
2文字ブロックがはじめに2つ連続してきた場合と、2
文字ブロックがはじめに1つきて次に1文字が3つ連続
した場合は、すべて右欄の略図に示す通り2:3の分け
かたになる。図3のステップ117、118に示す通り
である。
As shown in the left column of step 115 in FIG.
If a three-character block comes and then one character continues twice,
As shown in the schematic in the right column, it is divided into 3: 2. If the first word length = 3 in FIG. 3, step 115 substitutes 116 for the number of surname characters. Step 117 of FIG.
As shown in the left column of, when two character blocks come before and after,
When two 2-character blocks first appear consecutively,
When a character block is preceded by 1 and then 3 characters are consecutive, all are divided into 2: 3 as shown in the schematic diagram in the right column. This is as shown in steps 117 and 118 of FIG.

【0022】図5のステップ120の左欄に示す通り、
3文字ブロックの前後に1文字ずつ分かれた場合は、処
理を行なわない。図3のステップ120で2番目の単語
長=3がYESの場合に示す通りである。
As shown in the left column of step 120 of FIG.
If one character is divided before and after the three-character block, no processing is performed. This is as shown in the case where the second word length = 3 is YES in step 120 of FIG.

【0023】図5のステップ121の左欄に示す通り、
1文字の後に2文字ブロックが2回連続した場合と、1
文字の後に2文字ブロックが1回きた場合は、すべて右
欄の略図に示す通り3:2の分けかたになる。図3のス
テップ121、122に示す通りである。
As shown in the left column of step 121 in FIG.
If a two-character block continues twice after one character,
When a two-character block comes once after a character, it is divided into 3: 2 as shown in the schematic diagram in the right column. This is as shown in steps 121 and 122 of FIG.

【0024】図5のステップ123の左欄に示す通り、
1文字が2回連続した後に2文字ブロックがきた場合
と、1文字が2回連続した後に3文字ブロックがきた場
合は、すべて右欄の略図に示す通り2:3の分けかたに
なる。図3のステップ123、124、125に示す通
りである。
As shown in the left column of step 123 in FIG.
When a two-character block comes after one character continues twice and when a three-character block comes after one character continues twice, they are all divided into 2: 3 as shown in the schematic diagram in the right column. This is as shown in steps 123, 124, and 125 of FIG.

【0025】図5のステップ126の左欄に示す通り、
1文字が3回連続した後に2文字ブロックがきた場合
は、右欄の略図に示す通り3:2の分けかたになる。図
3のステップ126、127に示す通りである。
As shown in the left column of step 126 in FIG.
When a two-character block comes after one character continues three times, it is divided into 3: 2 as shown in the schematic diagram in the right column. This is as shown in steps 126 and 127 of FIG.

【0026】図5のステップ126=NOの左欄に示す
通り、1文字が5回連続した場合、処理を終了する。図
3のステップ126で、4番目の単語長=2がNOの場
合に示す通りである。
As shown in the left column of step 126 = NO in FIG. 5, when one character continues five times, the process ends. This is as shown when the fourth word length = 2 is NO in step 126 of FIG.

【0027】以上の処理で、姓名辞書の検索結果が完全
でなかった場合でも、かなり正確な姓文字数を得ること
ができる。姓文字数が確定した場合は、姓文字数の後に
空白を1つ挿入して姓名区切りを行う(128)。
By the above processing, a fairly accurate number of surname characters can be obtained even if the search result of the surname dictionary is not complete. When the number of surname characters is fixed, one blank is inserted after the number of surname characters to separate the surname (128).

【0028】図6は、本実施例をパーソナルコンピュー
タで実現した場合のハードウェア構成図である。パーソ
ナルコンピュータ200は、制御手段として作動するC
PU201と、プログラムを格納したり、氏名データの
バッファやワークとして使用するRAM202と、BI
OSなどのシステムプログラムが格納されているROM
203とから構成される。また、このパーソナルコンピ
ュータ200には、データなどを表示するCRT204
と、ユーザーからの指令などが入力されるキーボード2
05と、データなどが格納される磁気ディスク206と
が備えられている。
FIG. 6 is a hardware configuration diagram when this embodiment is realized by a personal computer. The personal computer 200 has a C functioning as a control means.
PU201, RAM202 which stores a program, and is used as a buffer and a work of name data, BI
ROM that stores system programs such as OS
And 203. The personal computer 200 also has a CRT 204 for displaying data and the like.
And a keyboard 2 for inputting commands from the user
05 and a magnetic disk 206 for storing data and the like.

【0029】[0029]

【発明の効果】本発明を用いれば、従来よりも姓名辞書
のサイズをコンパクトにでき、辞書検索時間も短縮でき
る。さらに、従来よりも正確に、かつ、多様な氏名デー
タに対して姓名区切りを行うことができる。名刺管理装
置に用いられているデータベースのアプリケーションソ
フトウェアプログラムで扱われる氏名データのうち、姓
と名を区別していない連続した氏名データをコンピュー
タシステムで自動的に区切り、氏名データの整形を容易
に実現する。姓と名を分離することで氏名データの表示
や印刷がきれいで見易くなる。
According to the present invention, the family name dictionary can be made smaller in size and the dictionary search time can be shortened as compared with the conventional case. Furthermore, it is possible to perform the surname delimiter more accurately than ever before for various name data. Of the name data handled by the application software program of the database used in the business card management device, the computer system automatically separates continuous name data that does not distinguish between first name and last name, making it easy to shape the name data. To do. By separating the family name and first name, the display and printing of the name data is beautiful and easy to see.

【図面の簡単な説明】[Brief description of drawings]

【図1】本発明の構成ブロック図である。FIG. 1 is a configuration block diagram of the present invention.

【図2】本発明の具体的説明図である。FIG. 2 is a specific explanatory diagram of the present invention.

【図3】本発明の具体的説明図である。FIG. 3 is a specific explanatory diagram of the present invention.

【図4】本発明の具体的説明図である。FIG. 4 is a specific explanatory diagram of the present invention.

【図5】本発明の具体的説明図である。FIG. 5 is a specific explanatory diagram of the present invention.

【図6】従来のハードウェア構成図である。FIG. 6 is a conventional hardware configuration diagram.

【符号の説明】[Explanation of symbols]

1:氏名データ整形装置 10:元データバッファ 20:氏名データ取得手段 30:姓名辞書検索手段 40:単語長取得手段 50:単語数取得手段 60:氏名文字数取得手段 70:姓文字数算出手段 80:姓名区切り手段 90:整形後データバッファ 1: Name data shaping device 10: Original data buffer 20: Name data acquisition means 30: First and last name dictionary search means 40: Word length acquisition means 50: Word number acquisition means 60: First and last name character number acquisition means 70: First and last name character number calculation means 80: First and last name Separator 90: Data buffer after shaping

Claims (2)

【特許請求の範囲】[Claims] 【請求項1】姓と名が連続した文字列からなる氏名デー
タと、姓名文字列と単漢字を格納した姓名辞書とを備
え、前記氏名データに一致する単語を前記姓名辞書から
検索し、検索して一致した単語の長さと単語の数と氏名
データの文字数とから、姓文字数を算出して氏名データ
を姓と名に区切ることを特徴とする氏名データ整形方
法。
1. A full name data consisting of a character string in which a surname and a first name are consecutive, and a surname and first name dictionary storing a first and last name character string and a single kanji character are searched, and a word that matches the first and last name data is searched from the surname and first name dictionary and searched. A name data shaping method, characterized in that the number of surname characters is calculated from the length of the matched words, the number of words, and the number of characters of the name data to divide the name data into surname and surname.
【請求項2】姓と名が連続した文字列からなる氏名デー
タを格納した氏名データ格納部と、 姓名文字列と単漢字を格納した姓名辞書を具備し、 前記氏名データ格納部から前記氏名データを取得する氏
名データ取得手段と、 前記氏名データに一致する単語を前記姓名辞書から検索
する姓名辞書検索手段と、 前記検索で一致した単語の長さを取得する単語長取得手
段と、 前記検索で一致した単語の数を取得する単語数取得手段
と、 前記氏名データの文字数を取得する氏名文字数取得手段
と、 単語長、単語数、氏名文字数から姓文字数を算出する姓
文字数算出手段と、 前記氏名データを姓と名に区切る姓名区切り手段とを備
えたことを特徴とする氏名データ整形装置。
2. A name data storage unit for storing name data consisting of character strings in which surnames and first names are consecutive, and a surname and first name dictionary storing character names and single kanji characters, and the name data storage unit stores the name data. A full name data acquisition unit for obtaining a name, a full name dictionary search unit for searching for a word matching the name data from the surname dictionary, a word length acquisition unit for acquiring the length of the word matched in the search, and A word number acquisition means for acquiring the number of matched words, a name character number acquisition means for acquiring the number of characters of the name data, a surname character number calculation means for calculating the surname character number from the word length, the number of words, and the name character number, and the name A name data shaping device comprising a surname and first name separating means for separating data into first and last names.
JP4318876A 1992-11-27 1992-11-27 Method and device for shaping name data Pending JPH06161995A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP4318876A JPH06161995A (en) 1992-11-27 1992-11-27 Method and device for shaping name data

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP4318876A JPH06161995A (en) 1992-11-27 1992-11-27 Method and device for shaping name data

Publications (1)

Publication Number Publication Date
JPH06161995A true JPH06161995A (en) 1994-06-10

Family

ID=18103956

Family Applications (1)

Application Number Title Priority Date Filing Date
JP4318876A Pending JPH06161995A (en) 1992-11-27 1992-11-27 Method and device for shaping name data

Country Status (1)

Country Link
JP (1) JPH06161995A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2010113678A (en) * 2008-11-10 2010-05-20 Advanced Media Inc Full name analysis method, full name analysis device, voice recognition device, and full name frequency data generation method
JP2011096245A (en) * 2009-09-30 2011-05-12 Kanagawa Univ Kanji compound word dividing method and kanji compound word dividing device

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2010113678A (en) * 2008-11-10 2010-05-20 Advanced Media Inc Full name analysis method, full name analysis device, voice recognition device, and full name frequency data generation method
JP2011096245A (en) * 2009-09-30 2011-05-12 Kanagawa Univ Kanji compound word dividing method and kanji compound word dividing device

Similar Documents

Publication Publication Date Title
US5544049A (en) Method for performing a search of a plurality of documents for similarity to a plurality of query words
JP3691844B2 (en) Document processing method
US5523945A (en) Related information presentation method in document processing system
JP2832988B2 (en) Data retrieval system
JPH02271468A (en) Data processing method
JPH04281565A (en) Document retrieving device
JP2693914B2 (en) Search system
US7039646B2 (en) Method and system for compressing varying-length columns during index high key generation
JPH06161995A (en) Method and device for shaping name data
JP2817103B2 (en) Data search device and data search method
JP2595934B2 (en) Kana-Kanji conversion processor
JP3983000B2 (en) Compound word segmentation device and Japanese dictionary creation device
JPH056398A (en) Document register and document retrieving device
JPH04340163A (en) Keyword retrieval system
JP2835335B2 (en) Data search device and data search method
JP2792147B2 (en) Character processing method and device
JP3343941B2 (en) Example sentence search system
JPH06325091A (en) Similarity evaluation type data base retrieval device
JPH10334105A (en) Relative word display device and medium where program for relative word display is recorded
JP3436109B2 (en) Related search formula search device and computer-readable recording medium storing related search formula search program
JPH03150668A (en) Input character string normalization system for retrieval system
JPH03209564A (en) Literature data registering method
JPH0353378A (en) Name retrieving system for retrieval of family name of same-pronunciation/different-character and different-character/same-pronunciation
JPH05165889A (en) Document retrieval device
JPH03118661A (en) Word retrieving device