JP2004309754A - Service system for supporting character identification

Info

Publication number: JP2004309754A
Application number: JP2003102691A
Authority: JP
Inventors: Mayumi Miyashita (宮下 真由美); Masayuki Ozawa (小澤 正行); Koji Okamoto (岡本 幸慈); Takayuki Uemura (植村 隆之)
Original assignee: Hitachi Government and Public Sector System Engineering Ltd
Current assignee: Hitachi Social Information Services Ltd
Priority date: 2003-04-07
Filing date: 2003-04-07
Publication date: 2004-11-04

Abstract

PROBLEM TO BE SOLVED: To reduce workload by separating out part of the external character (gaiji) integration work required when municipalities merge and outsourcing that part.

SOLUTION: A service system for supporting character identification is provided with: a black-and-white bitmap character set file 13, extracted from each system and then unified in format; an identifying character set file 15 holding the characters of the highest-priority system and a character set file 16 to be identified holding the characters of the lower-priority systems, both produced by sorting file 13; an identification target extraction list file 18, created by extracting identification targets according to a criterion under which a predetermined number of characters is output in descending order of identification rate; and an excluded identification character set file 20, obtained after a user checks the characters in the identification target extraction list and the unnecessary characters are removed.

COPYRIGHT: (C)2005, JPO&NCIPI

Description

【０００１】
【発明の属する技術分野】
本発明は、市町村合併時の外字統合作業の部分を切り出して作業委託することで、作業負担の軽減を図る文字同定支援サービスシステムに関する。
【０００２】
【従来の技術】
従来、例えば、Ａ，Ｂ，Ｃの３市が合併した場合に、外字を統一する方法としては、Ａ，Ｂ，Ｃの３市がそれぞれ登録している外字を抽出して、目で見ながら最も優先する文字パターンを一つ決定して、代表フォントとすることを手作業で行う必要があった。この場合、外字は、通常、人手でビットパターンを作成していた。従って、複数の自治体の間では、ビットパターンが微妙に異なっているのが通常である。
【０００３】
【発明が解決しようとする課題】
このように、市町村合併などでシステムの統合化が発生した場合に、各システムで使用していた外字を統一することになる。そのときに、外字の二重登録を防ぐために、各システムが持つ外字同士で同定可能か否かをチェックし、同定可能な文字については一つの文字だけ新システムに登録することとする。
しかしながら、前述のように、手作業で統合作業を行うと極めて作業量が多いため、作業精度がよくなく、かつトータルコストが非常にかかることになる。
【０００４】
本発明の目的は、このような従来の課題を解決し、市町村合併などでシステムの統合化を行う場合に、外字の二重登録を防止するため、各システムが持つ外字同士で同定可能か否かをチェックする際に、この文字同定作業にかかるコストを削減することが可能な文字同定支援サービスシステムを提供することにある。
【０００５】
なお、同定とは、２つの文字を比較して、これらの２つの文字が同じであると認めることを言う。
また、同定率とは、２つの文字を比較して、これらの２つの文字が同じである確率を言う。
【０００６】
【課題を解決するための手段】
本発明の文字同定支援サービスシステムは、移行用の文字が各システムから抽出された後、フォーマット統一された白黒ビットマップ文字集合ファイルと、該白黒ビットマップ文字集合ファイルに対して、優先順位の最も高いシステムの文字とそれ以下の優先順位のシステムの文字とに振り分けられた後の前者のシステムの同定する文字集合ファイルと、後者のシステムの同定とみなす文字集合ファイルと、同定率の高い順に予め定められた数の文字を出力し、出力された文字の中に一つも含まれないシステムが存在した場合、出力した文字の中で同定率の低い方を含まれないシステムの中で同定率の高い文字と置き換えるという基準で、同定対象を抽出して作成された同定対象抽出一覧ファイルと、ユーザにより該同定対象抽出一覧の文字の同定確認を行い、同定と認定された同定対象抽出一覧画面と、該同定済み同定対象抽出一覧画面に対して、同定とみなす文字集合のうち、同定と認定された文字を排除した文字集合ファイルとを具備したことを特徴としている。
【０００７】
【発明の実施の形態】
以下、本発明の実施の形態を、図面により詳細に説明する。
（文字同定支援サービスの流れ）
図１は、本発明の実施形態を示す文字同定支援サービスシステムの動作概要図である。
図１に示すように、ユーザ側（市町村側）と外部委託側とで作業を分担して行う。ユーザ側では、まず既存外字ファイル１１を抽出し、外部委託側に渡す。
外部委託側では、まず▲１▼フォーマットを統一する。例えば、各システムの文字フォントを白黒ビットマップファイルに統一する。次に、白黒ビットマップファイルの文字集合ファイル１３から▲２▼同定文字振り分け作業に移る。これは、優先順位の最も高いシステムの文字（同定する文字集合）１５とそれ以下の優先順位のシステムの文字（同定とみなす文字集合）１６とに振り分ける（１４）。
【０００８】
次に、▲３▼同定対象を抽出し（１７）、同定対象抽出一覧ファイル１８を作成する。
ここで、同定対象の抽出基準は、同定する文字集合の一文字ずつを同定される文字集合の全文字と比較し、その同定率を算出する流通しているプログラムを利用して同定率の高い順に予め定められた文字数ずつ出力し（比較文字として表示）、その中に一つも含まれない市の文字が存在した場合には（例えば、Ａ市が基準で、Ｂ市の文字のみが出力され、Ｃ市が出力されないとき等）、抽出された文字から同定率の低い文字と抽出対象に含まれない市の文字（例えば、前記の場合のＣ市）の中で同定率の高い文字と置き換える。このようにすれば、比較のための出力をＢ市、Ｃ市・・の全てにわたって表示することができる。このようにして作成した同定対象抽出一覧ファイル１８をユーザ側に渡す。
この処理を指定した出力文字数分だけ、同定対象抽出一覧ファイル１８に格納する。なお、同定率とは、２つの文字を比較して、これらの文字が同じである確率のこと、を言う。
【０００９】
ユーザ側では、▲４▼同定確認を行う（１９）。すなわち、同定対象抽出一覧ファイル１８から読み出した文字を画面に表示し、人間の目で同定の確認を行い、同定と認定する文字にマークをつける。なお、同定確認の検査は画面上で行われる。
ユーザ側では、同定とみなす文字集合がなくなるまで同じ処理を繰り返した後、マークを付けた文字を削除後、図１０のフォーマットに変換し、移行後外字ファイル２３に格納する。
【００１０】
（ユーザ側の同定確認作業の概要）
図２および図３は、実際の同定作業の説明図であって、Ａ市，Ｂ市，Ｃ市間で外字の統合を行う方法を示す。
図２は、１回目の同定作業（Ａ市を基準に、Ｂ市、Ｃ市と同定を行う）の説明図であって、左側に移行決定文字を全て表示するとともに、右側に移行しない文字対象を表示する。Ａ市移行分とＢ市、Ｃ市の文字で比較して、最適な文字を選択して表示する。同定とみなすＢ市、Ｃ市の文字から同様の文字で削除を希望するものをチェックする。
ここでは、移行決定文字の『挙』に対して、Ｂ市とＣ市のこれに近似した文字として、『擧』ないし『攀』が表示されており、チェックの結果、Ｂ市の『擧』に○（つまり、削除希望）が付けられる。同じようにして、『宝』に対してＢ市の『寳』とＣ市の『寶』が表示され、比較された結果、削除希望がないことになる（○は付されない）。
【００１１】
図３は、２回目の同定作業（Ｂ市を基準にＣ市と同定を行う）の説明図である。
２つの市町村合併に伴うシステム統合の場合には、１回だけの同定作業で済むが、３つの市町村合併に伴うシステム統合の場合には、ＡとＢ，Ｃとの同定作業、および、ＢとＣの同定作業の２回が必要となる。
左側には、Ｂ市分１６００文字（１回目作業で削除されなかったもの）を順次表示し、右側には、左側の文字と類似するパターンのＣ市の文字を順次表示する。Ｂ市移行分とＣ市の文字とで比較し、最適な文字を選択して表示する。そして、選択されなかったＣ市の文字から同様の文字で削除希望するものをチェックする。ここでは、移行決定文字の『実』に対して、Ｃ市の『實』が表示され、同じく『将』に対して、Ｃ市の『將』が表示される。『將』には削除希望がなく、○が付されない。
【００１２】
（画面遷移の詳細説明）
図４は、本発明における文字同定画面遷移のフロー図であり、図５〜図９は図４における各画面の拡大図である。
図５に示すように、文字同定システムメインメニューの画面を表示すると、▲１▼フォーマット統一処理、▲２▼同定文字振り分け処理、▲３▼対象出力文字数選択処理、▲４▼同定文字排除処理、▲５▼終了、のメニューボタンが表示される。希望のボタンを押下して選択することで、希望の処理画面が表示される。
【００１３】
まず、図４の画面上でフォーマット統一処理を選択すると、図６のフォーマット統一画面に遷移される。
フォーマット統一画面は、既存外字ファイルとそのシステム名称の入力領域があり、参照、登録、戻るのボタン領域が配置されている。参照ボタンを操作すると窓が表示される。そして、既存外字ファイル入力領域に、▲１▼既存外字ファイル名称を一つずつ入力する。このように、▲１▼には２つの機能が含まれている。▲２▼既存外字ファイルの入力前のシステム名称を入力する。登録するときには、登録ボタンを選択することにより既存外字ファイル名とシステム名称が登録される。また、前の画面に戻るときには戻るボタンを選択する。
【００１４】
次に、図４の画面上で同定文字振分処理を選択すると、図７の同定文字振分画面に遷移される。
同定文字振分画面は、同定する優先順位の最も高いシステム名称の入力領域があり、登録および戻るのボタンが配置される。同定処理に際しては、優先順位の最も高いシステム名称を選択する。前述のように、Ａ市、Ｂ市、Ｃ市の合併の場合には、Ａ市のシステム名称を選択する。
【００１５】
次に、図４の画面上で対象出力文字数選択処理を選択すると、図８の対象出力文字数選択画面に遷移される。
対象出力文字数選択画面の入力領域で、▲１▼対象出力文字数を選択する。
対象結果出力先選択画面には、登録と戻るのボタンが配置される。対象出力文字数が選択された後、登録ボタンを選択することにより選択された数が登録される。また、前の画面に戻る場合には、戻るボタンを選択する。
【００１６】
次に、図４の画面上で同定文字排除処理を選択すると、図９の同定文字排除画面に遷移される。
同定文字排除画面は、排除処理後の文字を同定対象抽出一覧ファイル１８に格納するために、取込ボタンが設けられる。さらに、操作を前に戻す場合には、戻るのボタン領域を選択する。▲１▼取込ボタンを押下することにより、同定対象抽出一覧ファイル１８に出力される。
【００１７】
図１０は、既存外字ファイルと変換後の外字ファイルのフォーマット図である。
一般的に使用されている文字の字形は、文字コードと関連付けてファイルに格納される。従って、外字がファイルから読み出されるときには、関連付けられた文字コードとともに読み出される。ファイルの種類としては、例えば、ｂｍｐ（ビットマップ）ファイル、ＴＴＦファイル、ＴＴＥファイル、ｂｄｆファイル、ホスト用文字パターンファイルなどがある。
【００１８】
図１１は、図８の対象結果出力先選択画面において、対象出力文字数が入力されたときに表示される同定対象抽出一覧画面の図である。図１１において、入力された数の文字数が移行決定文字として表示される。
移行決定文字の『島』（コードはＡ市の５９Ａ１）に対して、移行しない文字対象として『嶋』（Ｂ市６９Ａ１），『嶌』（Ｂ市６ＣＡ１）が表示される。ユーザは、Ａ市移行分とＢ市の文字を比較して、選択する。すなわち、文字下方の余白欄に、移行決定文字と同じであると判断した場合には削除希望○を付ける。同じように、『亀』（Ａ市５９Ａ２）に対して、『龜』（Ｂ市６９ＡＡ）、『龕』（Ｃ市６ＣＡ４）を比較して、同じであると認定した場合には、削除希望○を付ける。『村』に対しても同じ処理を行う。
【００１９】
図１２は、図１１において別の文字を表示した同定対象抽出一覧画面の図である。
この場合にも、移行決定文字の『京』（Ａ市５９Ａ１）に対して、移行しない文字対象として『亰』（Ｂ市６９Ａ１）が表示される。ユーザは文字下方の余白欄に、移行決定文字と同じであると判断した場合には削除希望であるチェックを付ける。同じように、『舗』（Ａ市５９Ａ２）に対して、移行しない文字対象として『舗』（Ｂ市６９ＡＡ），『鋪』（Ｃ市６ＣＡ３）が表示される。この場合は、チェックは付けられていない。同じように、『梁』（Ａ市５９Ａ１）に対して、移行しない文字対象として『渠』（Ｃ市６９Ａ１），『檗』（Ｃ市６ＣＡ１）が表示される。この場合も、チェックは付けられていない。
このように、画面上で移行決定文字に対する移行しない文字対象を比較し、同じと認定した場合には下欄の余白に削除希望を示すチェック済みの記号が入力される。
【００２０】
図１３は、同定する文字集合、同定される文字集合ファイルのフォーマット例を示す図である。
図１における同定する文字集合ファイル１５および同定される文字集合ファイル１６は、いずれも図１３に示すようなファイルのフォーマットで格納される。
システム判別コードには、Ａ，Ｂ，ＣなどのＡ市、Ｂ市、Ｃ市に属する意味のコードが格納され、システム内文字コードには、図１１および図１２に示す文字の下に付加された文字コードが格納され、ドットパターン格納ファイル名には、ビットマップのパターン配列がそのまま格納される。
【００２１】
【発明の効果】
以上説明したように、本発明によれば、市町村合併時の外字統合作業の部分を切り離して、外部委託に作業を行わせることで、作業負荷の軽減が図れる。
【図面の簡単な説明】
【図１】本発明の実施形態を示す文字同定支援サービスシステムの処理フロー図である。
【図２】図１における１回目の同定作業の説明図である。
【図３】図１における２回目の同定作業の説明図である。
【図４】本発明の実施形態を示す文字同定画面フロー図である。
【図５】文字同定システムメインメニューの画面図である。
【図６】図４におけるフォーマット統一画面図である。
【図７】図４における同定文字振分画面図である。
【図８】対象出力文字数選択画面図である。
【図９】同定文字排除画面図である。
【図１０】図１における既存外字ファイルおよび移行後外字ファイルの説明図である。
【図１１】図８において、同定結果出力先として対象出力文字数入力を入力した場合の同定対象抽出一覧画面の図である。
【図１２】図１１と同じく、同定対象抽出一覧画面において、移行決定文字と移行しない文字対象とを比較する同定対象抽出一覧画面の図である。
【図１３】図１における同定する文字集合、排除済み同定文字集合ファイルのフォーマット例を示す図である。
【符号の説明】
１１…既存外字ファイル、１２…フォーマット統一処理、
１３…白黒ＢＭＰ文字集合ファイル、１５…同定する文字集合ファイル、
１６…同定される文字集合ファイル、１７…同定文字排除処理、
１８…同定対象抽出一覧ファイル、１９…同定確認、
２０…排除済み同定字集合ファイル。
[0001]
[Technical field of the invention]
The present invention relates to a character identification support service system that reduces workload by separating out part of the external character (gaiji) integration work required when municipalities merge and outsourcing that part.
[0002]
[Prior art]
Conventionally, when three cities A, B, and C merge, for example, unifying their external characters has required extracting the external characters registered by each of the three cities, comparing them by eye, and manually deciding on the single highest-priority character pattern to serve as the representative font. External characters are usually created by hand as bit patterns, so the bit patterns for the same character usually differ slightly from one local government to another.
[0003]
[Problems to be solved by the invention]
When systems are integrated because of a municipal merger or the like, the external characters used in each system must be unified. To prevent double registration, the external characters of the systems are checked against one another to determine whether they can be identified with each other, and for characters that can be identified only one is registered in the new system.
However, as noted above, performing this integration manually involves an extremely large amount of work, so accuracy suffers and the total cost is very high.
[0004]
An object of the present invention is to solve this conventional problem by providing a character identification support service system that reduces the cost of the character identification work performed when systems are integrated in a municipal merger or the like, namely the work of checking whether the external characters of the systems can be identified with one another in order to prevent double registration.
[0005]
Note that identification means comparing two characters and recognizing them as the same character.
The identification rate is the probability that two characters being compared are the same.
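The description does not say how the identification rate is computed; it relies on an existing program for that. As a purely illustrative stand-in, the sketch below treats the rate as the fraction of pixels on which two equally sized black-and-white glyph bitmaps agree. The function name `identification_rate`, the pixel-agreement measure, and the 3x3 sample glyphs are all assumptions made for this example and do not come from the source.

```python
def identification_rate(a, b):
    """Hypothetical rate: share of pixels on which two 1-bit glyphs agree.

    `a` and `b` are lists of rows, each row a list of 0/1 pixels.
    This is an assumed stand-in, not the program the patent refers to.
    """
    if len(a) != len(b) or any(len(ra) != len(rb) for ra, rb in zip(a, b)):
        raise ValueError("glyphs must have the same dimensions")
    total = sum(len(row) for row in a)
    if total == 0:
        raise ValueError("glyphs must not be empty")
    same = sum(pa == pb for ra, rb in zip(a, b) for pa, pb in zip(ra, rb))
    return same / total


# Two made-up 3x3 glyphs that differ in exactly one pixel.
glyph_a = [[0, 1, 0], [1, 1, 1], [0, 1, 0]]
glyph_b = [[0, 1, 0], [1, 1, 1], [0, 1, 1]]
rate = identification_rate(glyph_a, glyph_b)
```

With these sample glyphs, eight of the nine pixels agree, so the sketch yields a rate of 8/9. A real similarity measure for hand-drawn bit patterns would likely need to tolerate small shifts and stroke-width differences, which this sketch does not attempt.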
[0006]
[Means for Solving the Problems]
The character identification support service system of the present invention is characterized by comprising: a black-and-white bitmap character set file whose format has been unified after the characters to be migrated are extracted from each system; an identifying character set file holding the characters of the highest-priority system, and a character set file regarded as identification targets holding the characters of the lower-priority systems, both obtained by sorting the black-and-white bitmap character set file; an identification target extraction list file created by extracting identification targets according to the criterion that a predetermined number of characters is output in descending order of identification rate and, if some system has no character among those output, an output character with a low identification rate is replaced with a character with a high identification rate from the unrepresented system; an identification target extraction list screen on which the user confirms the identification of the characters in the list and marks those judged identical; and a character set file obtained by excluding, from the character set regarded as identification targets, the characters judged identical on that screen.
[0007]
[Embodiments of the invention]
Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings.
(Flow of character identification support service)
FIG. 1 is an operational overview of a character identification support service system according to an embodiment of the present invention.
As shown in FIG. 1, the work is divided between the user side (the municipalities) and the outsourcing side. The user side first extracts the existing external character files 11 and passes them to the outsourcing side.
The outsourcing side first (1) unifies the format; for example, the character fonts of each system are converted into black-and-white bitmap files. Next, working from the resulting black-and-white bitmap character set file 13, it proceeds to (2) identification character sorting (14), which sorts the characters into those of the highest-priority system (the identifying character set) 15 and those of the lower-priority systems (the character set regarded as identification targets) 16.
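The sorting step (2) can be sketched as a simple partition of the unified character set by system. The `Glyph` type, the function name `sort_by_priority`, and the sample codes used here are hypothetical names chosen for illustration; the source only states that characters are split between the highest-priority system and the rest.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Glyph:
    system: str  # which municipality's system the character came from
    code: str    # in-system character code, written as a hex string


def sort_by_priority(glyphs, top_system):
    """Split glyphs into (identifying set 15, set 16 to be identified)."""
    identifying = [g for g in glyphs if g.system == top_system]
    to_be_identified = [g for g in glyphs if g.system != top_system]
    return identifying, to_be_identified


# Sample data: one character each from cities A, B, and C.
glyphs = [Glyph("A", "59A1"), Glyph("B", "69A1"), Glyph("C", "6CA1")]
identifying, to_be_identified = sort_by_priority(glyphs, "A")
```

Here city A is treated as the highest-priority system, so its character lands in the identifying set and the characters of cities B and C land in the set to be identified.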
[0008]
Next, (3) identification targets are extracted (17), and an identification target extraction list file 18 is created.
The extraction criterion is as follows. Each character of the identifying character set is compared with every character of the character set to be identified, using a commercially available program that calculates the identification rate, and a predetermined number of characters is output in descending order of identification rate (these are displayed as comparison characters). If a city has no character among those output (for example, when city A is the reference and only characters of city B are output, with none from city C), an output character with a low identification rate is replaced with a character with a high identification rate from the unrepresented city (city C in this example). In this way the comparison output covers all of city B, city C, and so on. The identification target extraction list file 18 created in this way is passed to the user side.
The results of this processing are stored in the identification target extraction list file 18 for the specified number of output characters. As noted above, the identification rate is the probability that two characters being compared are the same.
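The extraction criterion can be sketched as a top-N selection followed by a replacement pass that guarantees every system is represented. This is one possible reading, not the patent's implementation: the function name `extract_targets`, the tuple layout, and the rates are invented for the example. Two details are assumptions the text leaves open: the sketch brings in the single highest-rate character of each unrepresented system, and it only evicts a character whose system still has another representative in the output.

```python
from collections import Counter


def extract_targets(candidates, n):
    """Pick up to n candidates by rate, then make sure each system appears.

    `candidates` is a list of (system, code, rate) tuples for one
    reference character. All names and the tie-breaking are assumptions.
    """
    ranked = sorted(candidates, key=lambda c: c[2], reverse=True)
    chosen = ranked[:n]
    all_systems = {c[0] for c in ranked}
    missing_systems = sorted(all_systems - {c[0] for c in chosen})
    for system in missing_systems:
        # Highest-rate character of the system that is not yet shown.
        best = next(c for c in ranked if c[0] == system)
        counts = Counter(c[0] for c in chosen)
        # Evict the lowest-rate entry whose system stays represented.
        for victim in reversed(chosen):
            if counts[victim[0]] > 1:
                chosen.remove(victim)
                chosen.append(best)
                break
    return sorted(chosen, key=lambda c: c[2], reverse=True)


# Made-up rates for candidates compared against one city A character.
candidates = [
    ("B", "69A1", 0.95),
    ("B", "6CA1", 0.90),
    ("B", "69AA", 0.85),
    ("C", "6CA4", 0.60),
    ("C", "6CA3", 0.40),
]
targets = extract_targets(candidates, 3)
```

With three output slots, the plain top three would all come from city B. The replacement pass drops the lowest-rate city B entry (69AA) and brings in city C's best candidate (6CA4), so both cities appear in the comparison output. If the number of slots is smaller than the number of systems, the sketch simply leaves some systems unrepresented.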
[0009]
The user side then performs (4) identification confirmation (19): the characters read from the identification target extraction list file 18 are displayed on screen, a person checks them by eye, and the characters judged identical are marked. This check is performed on screen.
The user side repeats the same processing until no characters remain in the character set regarded as identification targets, then deletes the marked characters, converts the result into the format shown in FIG. 10, and stores it in the post-migration external character file 23.
[0010]
(Overview of identification confirmation work on the user side)
FIGS. 2 and 3 are explanatory diagrams of the actual identification work, showing how external characters are integrated among city A, city B, and city C.
FIG. 2 illustrates the first identification pass (cities B and C are identified against city A as the reference). All of the migration-determined characters are displayed on the left, and the characters not to be migrated are displayed on the right. The characters migrating from city A are compared with the characters of cities B and C, and the best matches are selected and displayed. Among the characters of cities B and C, which are regarded as identification targets, those that are the same as a migration-determined character and should be deleted are checked.
Here, for the migration-determined character "挙", the similar characters "擧" and "攀" from cities B and C are displayed, and as a result of the check a ○ (that is, a deletion request) is placed on city B's "擧". Likewise, for "宝", city B's "寳" and city C's "寶" are displayed, and the comparison results in no deletion request (no ○ is placed).
[0011]
FIG. 3 illustrates the second identification pass (city C is identified against city B as the reference).
When two municipalities merge, a single identification pass suffices. When three municipalities merge, two passes are needed: one identifying B and C against A, and one identifying C against B.
The left side shows, in sequence, the 1,600 characters of city B (those not deleted in the first pass), and the right side shows, in sequence, the city C characters whose patterns resemble the character on the left. The characters migrating from city B are compared with the characters of city C, and the best matches are selected and displayed. Then, among the city C characters that were not selected, those that are the same as a migrating character and should be deleted are checked. Here, city C's "實" is displayed for the migration-determined character "実", and likewise city C's "將" is displayed for "将". No deletion is requested for "將", so no ○ is placed on it.
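The pass structure described above (A against B and C, then B against C) can be sketched as a loop over the systems in priority order, where each pass removes from the lower-priority systems any character the user judged the same as a surviving reference character. The function name `run_passes` and the `judged_same` set, which stands in for the user's on-screen decisions, are hypothetical. The sample data loosely follows FIGS. 2 and 3; the decision to mark "實" is assumed for illustration, since the text only states that "將" is not marked.

```python
def run_passes(systems, judged_same):
    """Run the identification passes and return what gets migrated.

    `systems` is a list of (name, characters) in priority order.
    `judged_same` is a set of (reference_char, candidate_char) pairs the
    user marked as identical; it is a stand-in for the screen workflow.
    """
    order = [name for name, _ in systems]
    remaining = {name: list(chars) for name, chars in systems}
    passes = 0
    for i, base in enumerate(order[:-1]):
        passes += 1
        for later in order[i + 1:]:
            remaining[later] = [
                c for c in remaining[later]
                if not any((b, c) in judged_same for b in remaining[base])
            ]
    migrated = [(name, c) for name in order for c in remaining[name]]
    return migrated, passes


systems = [
    ("A", ["挙", "宝"]),
    ("B", ["擧", "寳", "実"]),
    ("C", ["攀", "寶", "實"]),
]
judged_same = {("挙", "擧"), ("実", "實")}
migrated, passes = run_passes(systems, judged_same)
```

For three systems the loop runs two passes, matching the text. In this sample, city B's "擧" is dropped in the first pass and city C's "實" in the second, while "寳", "寶", and "攀" survive because no deletion was requested for them.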
[0012]
(Detailed explanation of screen transition)
FIG. 4 is a flow diagram of the screen transitions for character identification in the present invention, and FIGS. 5 to 9 are enlarged views of the individual screens in FIG. 4.
As shown in FIG. 5, the main menu screen of the character identification system displays menu buttons for (1) format unification processing, (2) identification character sorting processing, (3) target output character number selection processing, (4) identified character exclusion processing, and (5) exit. Pressing the desired button displays the corresponding processing screen.
[0013]
First, when format unification processing is selected on the screen of FIG. 4, the display moves to the format unification screen of FIG. 6.
The format unification screen has input areas for an existing external character file and its system name, along with Browse, Register, and Back buttons. Operating the Browse button opens a window. Processing (1) thus includes two functions: (1) entering the names of the existing external character files one at a time in the existing external character file input area, and (2) entering the name of the system to which each existing external character file belongs. Selecting the Register button registers the existing external character file name and the system name. Selecting the Back button returns to the previous screen.
[0014]
Next, when identification character sorting processing is selected on the screen of FIG. 4, the display moves to the identification character sorting screen of FIG. 7.
The identification character sorting screen has an input area for the name of the highest-priority system, along with Register and Back buttons. For the identification processing, the name of the highest-priority system is selected. In the merger of city A, city B, and city C described above, city A's system name is selected.
[0015]
Next, when target output character number selection processing is selected on the screen of FIG. 4, the display moves to the target output character number selection screen of FIG. 8.
In the input area of the target output character number selection screen, (1) the number of target output characters is selected.
Register and Back buttons are arranged on the target result output destination selection screen. After the number of target output characters has been selected, selecting the Register button registers that number. Selecting the Back button returns to the previous screen.
[0016]
Next, when identified character exclusion processing is selected on the screen of FIG. 4, the display moves to the identified character exclusion screen of FIG. 9.
The identified character exclusion screen provides an Import button for storing the characters remaining after exclusion in the identification target extraction list file 18. To go back a step, the Back button is selected. (1) Pressing the Import button outputs the data to the identification target extraction list file 18.
[0017]
FIG. 10 is a format diagram of the existing external character file and the converted external character file.
Glyphs of commonly used characters are stored in a file in association with a character code. Therefore, when an external character is read from a file, it is read together with the associated character code. Examples of the file type include a bmp (bitmap) file, a TTF file, a TTE file, a bdf file, and a host character pattern file.
[0018]
FIG. 11 shows the identification target extraction list screen that is displayed when the number of target output characters is entered on the target result output destination selection screen of FIG. 8. In FIG. 11, the entered number of characters is displayed as migration-determined characters.
For the migration-determined character "島" (code 59A1 in city A), "嶋" (city B, 69A1) and "嶌" (city B, 6CA1) are displayed as characters not to be migrated. The user compares the character migrating from city A with the city B characters and makes a selection: if a character is judged to be the same as the migration-determined character, a deletion-request ○ is placed in the blank field below it. Likewise, "亀" (city A, 59A2) is compared with "龜" (city B, 69AA) and "龕" (city C, 6CA4), and a deletion-request ○ is placed on any character judged to be the same. The same processing is performed for "村".
[0019]
FIG. 12 shows the identification target extraction list screen of FIG. 11 with different characters displayed.
Here too, for the migration-determined character "京" (city A, 59A1), "亰" (city B, 69A1) is displayed as a character not to be migrated. If the user judges it to be the same as the migration-determined character, the user places a check indicating a deletion request in the blank field below the character. Likewise, for "舗" (city A, 59A2), "舗" (city B, 69AA) and "鋪" (city C, 6CA3) are displayed as characters not to be migrated. In this case no check is placed. Similarly, for "梁" (city A, 59A1), "渠" (city C, 69A1) and "檗" (city C, 6CA1) are displayed as characters not to be migrated. Again, no check is placed.
In this way, the characters not to be migrated are compared on screen with each migration-determined character, and when one is judged to be the same, a check mark indicating a deletion request is entered in the blank field below it.
[0020]
FIG. 13 shows an example format of the identifying character set file and the character set file to be identified.
The identifying character set file 15 and the character set file 16 to be identified in FIG. 1 are both stored in the file format shown in FIG. 13.
The system discrimination code field stores a code such as A, B, or C indicating that the character belongs to city A, city B, or city C. The in-system character code field stores the character code shown beneath each character in FIGS. 11 and 12. The dot pattern storage file name field stores the bitmap pattern array as is.
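One way to picture a record in this format is as a small immutable structure with the three fields named above. The class name `CharacterRecord`, the field names, the `key` helper, and the 3x3 dot pattern are assumptions made for illustration; the source does not specify field widths or an on-disk encoding.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class CharacterRecord:
    system_code: str                      # e.g. "A", "B", or "C"
    char_code: int                        # in-system character code
    dot_pattern: Tuple[Tuple[int, ...], ...]  # rows of 0/1 pixels

    def key(self) -> str:
        """Hypothetical label combining system and code, e.g. 'A:59A1'."""
        return f"{self.system_code}:{self.char_code:04X}"


# A made-up 3x3 pattern; real external characters use larger bitmaps.
record = CharacterRecord(
    system_code="A",
    char_code=0x59A1,
    dot_pattern=((0, 1, 0), (1, 1, 1), (0, 1, 0)),
)
label = record.key()
```

Keeping the system code alongside the in-system character code matters because, as the examples for FIGS. 11 and 12 show, different cities can assign the same code value to different characters.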
[0021]
[Effect of the invention]
As described above, according to the present invention, the workload can be reduced by separating out part of the external character integration work required when municipalities merge and having an outsourced party perform it.
[Brief description of the drawings]
FIG. 1 is a processing flow diagram of a character identification support service system according to an embodiment of the present invention.
FIG. 2 is an explanatory diagram of the first identification pass in FIG. 1.
FIG. 3 is an explanatory diagram of the second identification pass in FIG. 1.
FIG. 4 is a character identification screen flow diagram according to an embodiment of the present invention.
FIG. 5 is a diagram of the main menu screen of the character identification system.
FIG. 6 is a diagram of the format unification screen in FIG. 4.
FIG. 7 is a diagram of the identification character sorting screen in FIG. 4.
FIG. 8 is a diagram of the target output character number selection screen.
FIG. 9 is a diagram of the identified character exclusion screen.
FIG. 10 is an explanatory diagram of the existing external character file and the post-migration external character file in FIG. 1.
FIG. 11 is a diagram of the identification target extraction list screen displayed when the number of target output characters is entered in FIG. 8.
FIG. 12 is, like FIG. 11, a diagram of the identification target extraction list screen, on which migration-determined characters are compared with characters not to be migrated.
FIG. 13 is a diagram showing an example format of the identifying character set file and the excluded identification character set file in FIG. 1.
[Explanation of symbols]
11: existing external character file, 12: format unification processing,
13: black-and-white BMP character set file, 15: identifying character set file,
16: character set file to be identified, 17: identified character exclusion processing,
18: identification target extraction list file, 19: identification confirmation,
20: excluded identification character set file.

Claims

1. A character identification support service system for unifying the external characters used in a plurality of systems and migrating them to a new system, characterized by comprising:
a black-and-white bitmap character set file whose format has been unified after extraction from each system;
an identifying character set file holding the characters of the highest-priority system, and a character set file regarded as identification targets holding the characters of the other systems, both resulting from sorting the black-and-white bitmap character set file into the characters of the highest-priority system and the characters of the lower-priority systems;
an identification target extraction list screen created by extracting identification targets such that a predetermined number of characters is output from the character set files in descending order of identification rate and, if some system has no character among those output, an output character with a low identification rate is replaced with a character with a high identification rate from the unrepresented system; and
a character set file obtained, once the user has confirmed character identification on the identification target extraction list screen and the identification confirmation processing is complete, by excluding the characters judged identical from the character set regarded as identification targets.

2. The character identification support service system according to claim 1, wherein the output obtained by excluding the characters judged identical on the identification target extraction list screen is stored in an excluded character set file.

3. The character identification support service system according to claim 1 or 2, wherein the identification target extraction list screen is produced by comparing each character of the identifying character set with every character of the character set to be identified, using a commercially available program that calculates the identification rate; displaying a predetermined number of characters as comparison characters in descending order of identification rate; and, if a city has no character among them, replacing an extracted character with a low identification rate with a character with a high identification rate from the city not included among the extracted characters, so that characters of all the systems being integrated are displayed against each identifying character.