JPH06161995A - Method and device for shaping name data

Info

Publication number: JPH06161995A
Application number: JP4318876A
Authority: JP
Inventors: Shingo Yudasaka; 新吾湯田坂
Original assignee: Seiko Epson Corp
Current assignee: Seiko Epson Corp
Priority date: 1992-11-27
Filing date: 1992-11-27
Publication date: 1994-06-10

Abstract

PURPOSE: To delimit a full name accurately between the surname and the given name by obtaining the number of characters of the full name from the full-name data, obtaining the number of words and the word lengths by searching a name dictionary, and dividing the full-name data between the surname and the given name with a name delimiting means.

CONSTITUTION: Full-name data, for example 小早川進一 (Shinichi Kobayakawa), is input to an original data buffer 10 and taken out by a full-name data acquisition means 20. A name dictionary search 30 then compares the name dictionary with the full-name data and yields the segmentation 小/早川/進一 (ko/hayakawa/shinichi). A word count of 3 is obtained by a word count acquisition means 50, and word lengths of 1:2:2 by a word length acquisition means 40. The full-name character count of 5 is obtained directly from the data of the full-name data acquisition means 20 by a full-name character count acquisition means 60. A surname character count calculation 70 is performed on the basis of the word count, the word lengths, and the full-name character count. The name delimiting 80 is carried out with the resulting surname character count, and the result is output to a shaped data buffer 90.

Description

Detailed Description of the Invention

[0001]

[Field of Industrial Application] The present invention relates to a name data shaping method and device that automatically convert contiguous full-name data, in which the surname and the given name are not distinguished, into a character string delimited into surname and given name.

[0002]

[Prior Art] Conventionally, as shown in Japanese Unexamined Patent Publication No. 4-33053, the split has been decided statistically from the number of characters in the full-name data. For a four-character name, for example, the surname is taken to be two characters, giving a surname-to-given-name ratio of 2:2. Alternatively, in an entirely different approach, a matching surname and given name are looked up in a name dictionary and the name is delimited accordingly. The former method lacks accuracy, and the latter has the drawback that the dictionary becomes enormous.

[0003]

[Problem to Be Solved by the Invention] An object of the present invention is to provide a name data shaping method and device that delimit surname and given name using a name dictionary storing only the minimum necessary surname and given-name words, and that, even for names not stored in the dictionary, delimit them more accurately than before on the basis of the information obtained by searching the dictionary.

[0004]

[Means for Solving the Problem] The present invention comprises full-name data consisting of a character string in which the surname and the given name are contiguous, and a name dictionary storing surname and given-name character strings and single kanji. It is characterized in that words matching the full-name data are searched for in the name dictionary, the surname character count is calculated from the lengths of the matched words, the number of matched words, and the number of characters in the full-name data, and the full-name data is divided into surname and given name.

[0005]

Further, the present invention comprises a full-name data storage unit storing full-name data consisting of a character string in which the surname and the given name are contiguous, and a name dictionary storing surname and given-name character strings and single kanji. It is characterized by having: a full-name data acquisition means for acquiring the full-name data from the full-name data storage unit; a name dictionary search means for searching the name dictionary for words matching the full-name data; a word length acquisition means for acquiring the lengths of the words matched in the search; a word count acquisition means for acquiring the number of words matched in the search; a full-name character count acquisition means for acquiring the number of characters in the full-name data; a surname character count calculation means for calculating the surname character count from the word lengths, the word count, and the full-name character count; and a name delimiting means for dividing the full-name data into surname and given name.

[0006]

[Operation] The full-name character count is obtained from the full-name data, and the word count and the word lengths are obtained by searching the name dictionary.

[0007]

The surname character count is calculated from these three values. The name delimiting means then divides the full-name data into surname and given name at that position. If a single blank is inserted there, full-name data such as 山田太郎 (Taro Yamada) is shaped into the form 山田 太郎, in which the surname and the given name are separated.
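The blank insertion described above can be sketched in a few lines of Python. This is an illustration only: the function name `insert_separator` and the `sep` parameter are hypothetical, and the surname character count is assumed to have been calculated already.

```python
def insert_separator(full_name: str, surname_len: int, sep: str = " ") -> str:
    """Insert `sep` after the first `surname_len` characters of `full_name`.

    `surname_len` is the surname character count; how it is calculated is
    outside the scope of this sketch.
    """
    # The split point must leave at least one character on each side.
    if not 0 < surname_len < len(full_name):
        raise ValueError("surname length must fall strictly inside the name")
    return full_name[:surname_len] + sep + full_name[surname_len:]
```

For example, `insert_separator("山田太郎", 2)` returns `山田 太郎`.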

[0008]

[Embodiment] FIG. 1 is a block diagram of the configuration of this embodiment. Reference numeral 1 denotes the name data shaping device. As an example, the full-name data 小早川進一 (Shinichi Kobayakawa) is input to the original data buffer 10. Next, the data in the original data buffer is taken out by the full-name data acquisition means 20. A name dictionary search 30 is performed while comparing the name dictionary with the full-name data, yielding the segmented result 小/早川/進一. Here, the word count acquisition 50 obtains a word count of 3, and the word length acquisition 40 obtains word lengths of 1:2:2. The full-name character count acquisition 60 immediately obtains a full-name character count of 5 from the full-name data acquisition 20. The surname character count calculation 70 is performed on the basis of these three kinds of data: the word count from 50, the word lengths from 40, and the full-name character count from 60.
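As a rough illustration of the three values that feed the surname character count calculation 70, the following Python sketch derives them from a segmentation written in the slash notation used in the text. The function name `search_statistics` is hypothetical, and the dictionary search itself is assumed to have been done already; the sketch does not model how the name dictionary is matched.

```python
from typing import List, Tuple


def search_statistics(segmented: str, delimiter: str = "/") -> Tuple[int, List[int], int]:
    """Return (word count, word lengths, full-name character count).

    `segmented` is a dictionary search result in slash notation,
    for example "小/早川/進一".
    """
    words = segmented.split(delimiter)
    word_lengths = [len(word) for word in words]
    # The full-name character count is the sum of the word lengths,
    # because the words cover the whole name.
    return len(words), word_lengths, sum(word_lengths)
```

With the example of this paragraph, `search_statistics("小/早川/進一")` returns `(3, [1, 2, 2], 5)`, matching the word count 3, the word lengths 1:2:2, and the full-name character count 5.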

[0009]

The steps of the surname character count calculation 70 are detailed in FIGS. 3, 4, and 5. The name delimiting 80 is performed using the surname character count obtained by the calculation 70. The delimited data is output to the shaped data buffer 90.

[0010]

In FIG. 2, row (a) of the original data column is the example 山田一郎 (Ichiro Yamada). The name dictionary search segments it into 山田/一郎, and 山田 and 一郎 are obtained as the shaped data. In this example, the surname and given name can be delimited from the dictionary search result alone. It can also be seen that the word count is 2, the word lengths are 2:2, and the full-name character count is 4.

[0011]

Row (i) of the original data column is the example 武浩 (Takehiro). The name dictionary search returns 武浩 without splitting it, but the present invention establishes that the word count is 1, the word length is 2, and the full-name character count is 2. Based on these values the data is shaped as described for FIG. 3 onward, and 武 and 浩 are obtained as the shaped data.

[0012]

Row (u) of the original data column is the example 鈴木浩之助 (Konosuke Suzuki). The name dictionary search segments it as 鈴木/浩之/助, but the present invention establishes that the word count is 3, the word lengths are 2:2:1, and the full-name character count is 5. Based on these values the data is shaped as described for FIG. 3 onward, and 鈴木 and 浩之助 are obtained as the shaped data.

[0013]

Row (e) of the original data column is the example 東西山二郎 (Jiro Higashinishiyama). The name dictionary search segments it as 東/西山/二郎, but the present invention establishes that the word count is 3, the word lengths are 1:2:2, and the full-name character count is 5. Based on these values the data is shaped as described for FIG. 3 onward, and 東西山 and 二郎 are obtained as the shaped data.

[0014]

Row (o) of the original data column is the example 南北三太郎 (Santaro Namboku). The name dictionary search segments it as 南/北/三太郎, but the present invention establishes that the word count is 3, the word lengths are 1:1:3, and the full-name character count is 5. Based on these values the data is shaped as described for FIG. 3 onward, and 南北 and 三太郎 are obtained as the shaped data.

[0015]

Row (ka) of the original data column is the example 上中下四郎 (Shiro Kaminakashita). The name dictionary search segments it as 上/中/下/四郎, but the present invention establishes that the word count is 4, the word lengths are 1:1:1:2, and the full-name character count is 5. Based on these values the data is shaped as described for FIG. 3 onward, and 上中下 and 四郎 are obtained as the shaped data.

[0016]

FIG. 3 is a flowchart according to the present invention. It details the surname character count calculation 70 of FIG. 1. Numerals 100 to 128 are step numbers. The left half of the flowchart of FIG. 3 is described with reference to FIG. 4. The name dictionary is searched using the full-name data to obtain the word lengths and the word count (100, 101, 102, 103, 104). The full-name character count is also obtained from the full-name data (107).

[0017]

In step 105 of FIG. 4, when there are two words, as shown in the sketch in the left column, the first word is taken to be the surname and the second the given name, as shown in the sketch in the right column, so the surname character count equals the length of the first word. In terms of the flowchart of FIG. 3, when the test "word count = 2" is judged YES (105), the length of the first word is assigned to the surname character count (106). When "word count = 2" is NO at 105 in FIG. 3, the full-name character count is obtained (107). When the full-name character count is less than 2, processing ends (108), because a surname and given name totalling one character cannot occur in practice. When the result at 108 is NO, the judgment proceeds by case: character count = 2, character count = 3, character count = 4, and character count of 6 or more.

[0018]

When the full-name character count is two, the split is as shown in the left-column and right-column sketches of step 109 in FIG. 4. When the character count = 2, 1 is assigned to the surname character count (109, 110).

[0019]

When the full-name character count is three, as shown in the left-column sketch of step 111 in FIG. 4, the three characters may form a single block or be three separate characters. In both cases the statistically likely split is 2:1, as shown in the right-column sketch. When step 111 of FIG. 3 is YES, 2 is assigned to the surname character count (113).

[0020]

When the full-name character count is four, as shown in the left-column sketch of step 112 in FIG. 4, there are five cases: all four characters form one block; a two-character block has one separate character before and after it; all four characters are separate; the two-character block is in the latter half; or the two-character block is in the first half. All five are split 2:2, as shown in the right-column sketch. This corresponds to steps 112 and 113 of FIG. 3. Names of six or more characters are not processed (step 114 of FIG. 3), because actual examples are very rare and the split patterns are so numerous that processing would take too long and sacrifice throughput. The case of a five-character name is described using the right half of the flowchart of FIG. 3, with reference to FIG. 5.
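The left half of the flowchart (steps 105 to 114) can be summarized in a short Python sketch. It is a reading of the text, not the patented implementation: the function name `surname_length_left_half` is hypothetical, `None` is used here to mean "no split is made", and the five-character case is deliberately left out because it belongs to the right half of FIG. 3.

```python
from typing import List, Optional


def surname_length_left_half(word_lengths: List[int], name_chars: int) -> Optional[int]:
    """Surname character count for names that are not five characters long.

    `word_lengths` are the lengths of the words matched by the dictionary
    search; `name_chars` is the full-name character count.
    Returns None when processing ends without a split.
    """
    if len(word_lengths) == 2:   # step 105: exactly two words
        return word_lengths[0]   # step 106: the first word is the surname
    if name_chars < 2:           # step 108: a one-character name cannot be split
        return None
    if name_chars == 2:          # steps 109, 110: split 1:1
        return 1
    if name_chars in (3, 4):     # steps 111, 112, 113: split 2:1 or 2:2
        return 2
    if name_chars >= 6:          # step 114: not processed
        return None
    # Only name_chars == 5 reaches this point.
    raise NotImplementedError("five-character names follow the right half of FIG. 3")
```

For row (i) of FIG. 2, `surname_length_left_half([2], 2)` gives 1, so 武浩 is split into 武 and 浩. For row (a), `surname_length_left_half([2, 2], 4)` gives 2 directly from the two-word shortcut.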

[0021]

As shown in the left column of step 115 in FIG. 5, when a three-character block comes first and is followed by two single characters, the split is 3:2, as shown in the right-column sketch. In FIG. 3, when the first word length = 3 (step 115), 3 is assigned to the surname character count (116). As shown in the left column of step 117 in FIG. 5, when two-character blocks come at the front and the back, when two two-character blocks come consecutively at the start, or when one two-character block comes first and is followed by three single characters, the split is 2:3 in every case, as shown in the right-column sketch. This corresponds to steps 117 and 118 of FIG. 3.

[0022]

As shown in the left column of step 120 in FIG. 5, when a three-character block has one separate character before and after it, no processing is performed. This corresponds to the case where "second word length = 3" is YES at step 120 of FIG. 3.

[0023]

As shown in the left column of step 121 in FIG. 5, when one character is followed by two consecutive two-character blocks, or when one character is followed by a single two-character block, the split is 3:2 in every case, as shown in the right-column sketch. This corresponds to steps 121 and 122 of FIG. 3.

[0024]

As shown in the left column of step 123 in FIG. 5, when two consecutive single characters are followed by a two-character block, or when two consecutive single characters are followed by a three-character block, the split is 2:3 in every case, as shown in the right-column sketch. This corresponds to steps 123, 124, and 125 of FIG. 3.

[0025]

As shown in the left column of step 126 in FIG. 5, when three consecutive single characters are followed by a two-character block, the split is 3:2, as shown in the right-column sketch. This corresponds to steps 126 and 127 of FIG. 3.

[0026]

As shown in the left column for step 126 = NO in FIG. 5, when five single characters come in succession, processing ends. This corresponds to the case where "fourth word length = 2" is NO at step 126 of FIG. 3.
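The five-character rules of steps 115 to 126 can be collected into one Python sketch. It is an interpretation under stated assumptions: the function name `surname_length_five_chars` is hypothetical, a word that does not exist is treated as having length 0, the two-word case is assumed to have been handled earlier at step 105, and where the text lists several step numbers together (123 to 125) the assignment of individual tests to steps is not spelled out in the text and is left grouped.

```python
from typing import List, Optional


def surname_length_five_chars(word_lengths: List[int]) -> Optional[int]:
    """Surname character count for a five-character name.

    `word_lengths` are the lengths of the words matched by the dictionary
    search, in order. Returns None when no split is made.
    """
    def length(i: int) -> int:
        # Length of the i-th word (1-based); 0 if there is no such word.
        return word_lengths[i - 1] if i <= len(word_lengths) else 0

    if length(1) == 3:        # steps 115, 116: 3:1:1 -> split 3:2
        return 3
    if length(1) == 2:        # steps 117, 118: 2:1:2, 2:2:1, 2:1:1:1 -> split 2:3
        return 2
    if length(2) == 3:        # step 120: 1:3:1 -> not processed
        return None
    if length(2) == 2:        # steps 121, 122: 1:2:2, 1:2:1:1 -> split 3:2
        return 3
    if length(3) in (2, 3):   # steps 123 to 125: 1:1:2:1, 1:1:3 -> split 2:3
        return 2
    if length(4) == 2:        # steps 126, 127: 1:1:1:2 -> split 3:2
        return 3
    return None               # step 126 = NO: 1:1:1:1:1 -> processing ends
```

Applied to the five-character rows of FIG. 2, the sketch gives 2 for 鈴木/浩之/助, 3 for 東/西山/二郎, 2 for 南/北/三太郎, and 3 for 上/中/下/四郎.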

[0027]

With the above processing, a fairly accurate surname character count can be obtained even when the name dictionary search result is incomplete. Once the surname character count is determined, one blank is inserted after that many characters to delimit surname and given name (128).

[0028]

FIG. 6 is a hardware configuration diagram for the case where this embodiment is implemented on a personal computer. The personal computer 200 consists of a CPU 201 that operates as the control means, a RAM 202 that stores programs and is used as a buffer and work area for full-name data, and a ROM 203 that stores system programs such as the BIOS. The personal computer 200 is also provided with a CRT 204 that displays data and the like, a keyboard 205 through which user commands and the like are entered, and a magnetic disk 206 on which data and the like are stored.

[0029]

[Effects of the Invention] With the present invention, the name dictionary can be made more compact than before, and dictionary search time can also be shortened. In addition, surname and given-name delimiting can be performed more accurately than before and on a wider variety of full-name data. Among the full-name data handled by database application software used in business card management devices, contiguous full-name data that does not distinguish surname from given name is delimited automatically by the computer system, so shaping of the full-name data is easily achieved. Separating the surname and the given name makes the displayed and printed full-name data clean and easy to read.

[Brief description of drawings]

[FIG. 1] A configuration block diagram of the present invention.

[FIG. 2] A specific explanatory diagram of the present invention.

[FIG. 3] A specific explanatory diagram of the present invention.

[FIG. 4] A specific explanatory diagram of the present invention.

[FIG. 5] A specific explanatory diagram of the present invention.

[FIG. 6] A conventional hardware configuration diagram.

[Explanation of symbols]

1: name data shaping device
10: original data buffer
20: full-name data acquisition means
30: name dictionary search means
40: word length acquisition means
50: word count acquisition means
60: full-name character count acquisition means
70: surname character count calculation means
80: name delimiting means
90: shaped data buffer

Claims

[Claims]

1. A name data shaping method, characterized by comprising full-name data consisting of a character string in which a surname and a given name are contiguous, and a name dictionary storing surname and given-name character strings and single kanji, wherein words matching the full-name data are searched for in the name dictionary, a surname character count is calculated from the lengths of the words matched in the search, the number of matched words, and the number of characters in the full-name data, and the full-name data is divided into the surname and the given name.

2. A name data shaping device, characterized by comprising a full-name data storage unit storing full-name data consisting of a character string in which a surname and a given name are contiguous, and a name dictionary storing surname and given-name character strings and single kanji, and by having: a full-name data acquisition means for acquiring the full-name data from the full-name data storage unit; a name dictionary search means for searching the name dictionary for words matching the full-name data; a word length acquisition means for acquiring the lengths of the words matched in the search; a word count acquisition means for acquiring the number of words matched in the search; a full-name character count acquisition means for acquiring the number of characters in the full-name data; a surname character count calculation means for calculating the surname character count from the word lengths, the word count, and the full-name character count; and a name delimiting means for dividing the full-name data into the surname and the given name.