JP4111941B2

JP4111941B2 - Apparatus, method and program for converting lexical phrases into data

Info

Publication number: JP4111941B2
Application number: JP2004274422A
Authority: JP
Inventors: 達彦相川; 一朗中田
Original assignee: Bank of Tokyo Mitsubishi UFJ Trust Co
Current assignee: MUFG Bank Ltd
Priority date: 2004-09-22
Filing date: 2004-09-22
Publication date: 2008-07-02
Anticipated expiration: 2024-09-22
Also published as: JP2006092065A

Description

The present invention relates to an apparatus, a method, and a program for converting a lexical phrase containing a plurality of words into other data containing a single piece of information specified by those words.

For example, a bank or similar institution may need to run customer information (CIF) analysis by computer on the payment instruction messages it receives (CIF stands for Customer Information File). Specifically, a combination of words such as "GETRONICS", "FOODS", "CO., LTD", and "AKASAKA" contained in the message phrase "GETRONICS FOODS CO., LTD 1-2-34 AKASAKA" must be converted into a customer code (that is, a CIF code) such as "1234567".

Conventionally, this conversion has been performed as follows. A customer code table, which pairs each combination of words identifying a customer with the corresponding customer code, is stored in a storage device in advance. The words to be converted are then extracted from the input data and compared, as character strings, with the word combinations in the customer code table; when they match, the words are converted into the customer code (see, for example, Patent Document 1).

However, because such string comparison proceeds character by character (one byte at a time), the search time (seek time) it requires prevents high-speed processing, particularly when a large volume of data is involved.

As one approach to this problem, a patent application filed before the present application proposed loading the dictionaries used to convert message phrases into CIF codes into main memory and symbolizing them with memory addresses as the symbols (see Patent Document 2). Specifically, three dictionaries are symbolized in main memory with memory addresses as the symbols: a basic word dictionary, registered in advance in a secondary storage device, that associates each of a plurality of words with its attribute information; a core name dictionary, registered in advance in the secondary storage device, that associates a pair of two words registered in the basic word dictionary, the word of that pair selected as a key, and a symbol specified by the pair; and a full name dictionary, registered in advance in the secondary storage device, that associates that symbol, another word not included in the pair, and the CIF code specified by the symbol and that other word. The symbolized basic word dictionary is consulted to obtain the addresses corresponding to the words contained in the phrase to be converted. A pair matching two of those addresses (one of them serving as the key) is found in the symbolized core name dictionary, an entry matching one further address is then found in the symbolized full name dictionary, and the phrase is thereby converted into the target CIF code.

As described above, this approach assigns the keys (symbols) that link the dictionaries directly to physical memory addresses. Because a computer allocates memory addresses dynamically, the key relationships among the dictionaries must be regenerated from the newly obtained memory addresses every time the program is loaded into memory, which imposes a heavy load and a long processing time on program initialization.

As an approach to this problem, patent applications filed before the present application proposed assigning in advance a logical index that represents each word, storing each word in a dictionary in association with its logical index, and expanding that dictionary into memory (see Patent Documents 3 and 4).
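The logical-index idea can be sketched roughly as follows. This is a minimal illustration, not the scheme of the cited applications: the index values are hypothetical, the input is assumed to be already split into words, and `BASIC_WORDS`, `NAME_DICTIONARY`, `to_logical_indexes`, and `lookup_cif` are names invented for this sketch. Because each word maps to a fixed integer chosen ahead of time, lookups compare integers rather than strings, and nothing has to be regenerated from memory addresses when the program is reloaded.

```python
# Hypothetical basic word dictionary: word -> pre-assigned logical index.
# The index values are illustrative only.
BASIC_WORDS = {"GETRONICS": 101, "FOODS": 102, "CO., LTD": 107, "AKASAKA": 301}

# Hypothetical name dictionary: set of logical indexes -> CIF code.
NAME_DICTIONARY = {frozenset({101, 102, 107, 301}): "1234567"}


def to_logical_indexes(words):
    """Convert words to logical indexes, skipping unregistered words."""
    return frozenset(BASIC_WORDS[w] for w in words if w in BASIC_WORDS)


def lookup_cif(words):
    """Return the CIF code whose index set equals the input's, or None."""
    return NAME_DICTIONARY.get(to_logical_indexes(words))


# The phrase from the background example, assumed to be pre-split into words.
words = ["GETRONICS", "FOODS", "CO., LTD", "1-2-34", "AKASAKA"]
print(lookup_cif(words))  # prints 1234567
```

The street number "1-2-34" is not in the hypothetical dictionary, so it is simply skipped during conversion.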

A method is also known in which the words in a dictionary are indexed and each index is associated with a storage location on disk in order to shorten access time (see, for example, Patent Document 5).
Patent Document 1: JP 2002-56005 A
Patent Document 2: Japanese Patent Application No. 2003-115287
Patent Document 3: Japanese Patent Application No. 2004-89101
Patent Document 4: Japanese Patent Application No. 2004-89121
Patent Document 5: JP H07-36900 A

In Patent Documents 3 and 4, a single combination of words must uniquely determine a CIF code, so the registered word combinations contain no duplicates and are registered such that no word combination contains another. In other words, Patent Documents 3 and 4 presuppose that the inclusion relationship described below does not arise among the word combinations corresponding to CIF codes. In practice, however, as indicated by reference numeral 1000 in FIG. 9, an inclusion relationship can arise between logical index sets registered in association with CIF codes and the like. In the example at reference numeral 1000, the name corresponding to CIF code "6950000" is "NAKAMORI ELECTRIC CO., LTD MARUNOUCHI", with logical index set (105, 106, 107, 300), whereas the name corresponding to CIF code "6760000" is "NAKAMORI CO., LTD MARUNOUCHI", with logical index set (105, 107, 300). The two names share the words "NAKAMORI", "CO., LTD", and "MARUNOUCHI", and the two logical index sets share 105, 107, and 300, so an inclusion relationship exists between them. An inclusion relationship is thus a state that arises among a plurality of logical index sets registered in a dictionary, such as those for customer names, in which one registered logical index set contains another.

As described above, when the significant part of a customer name or the like is represented as a logical index set, one set may contain another across different CIF codes. In the example at reference numeral 1000 in FIG. 9, CIF code "6950000" is represented by logical index set (105, 106, 107, 300), which contains the logical index set (105, 107, 300) of CIF code "6760000". In this case, even if the smaller logical index set (CIF code "6760000" in this example) is fully resolved from the input string, the inclusion relationship in the dictionary makes it impossible to determine whether that is the correct result or whether the input actually corresponds to the larger logical index set. To explain in more detail with reference to FIG. 9, suppose the input is "NAKAMORI AND CO., LTD MARUNOUCHI", which contains "AND", a common word for which no logical index has been established. The input is split into words and converted into logical indexes using dictionary 1002 (see the "Dictionary application" column at reference numeral 1004). Because "AND" is not registered in the dictionary, it is not converted. The logical index set used for CIF analysis is therefore (105, 107, 300) (see the "CIF analysis" column at reference numeral 1004). When this set is analyzed against the correspondence table of CIF codes and logical index sets at reference numeral 1000, the result is "6950000" or "6760000" and cannot be determined, as shown in the "Analysis result" column at reference numeral 1004. With the techniques of Patent Documents 3 and 4, such a case must therefore be treated as an error because the target CIF code cannot be identified. Suppressing the error return would instead allow inaccurate processing.
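The ambiguity in the FIG. 9 example can be reproduced with a short sketch. Two assumptions are made here and are not stated in the source: the individual word-to-index assignments are inferred from the order of the words and of the index sets, and the prior-art matcher is modeled as treating every registered set that contains all recognized input indexes as a candidate. `candidate_cif_codes` is a name invented for this illustration.

```python
# Word -> logical index, inferred (assumption) from the FIG. 9 example.
BASIC_WORDS = {"NAKAMORI": 105, "ELECTRIC": 106, "CO., LTD": 107, "MARUNOUCHI": 300}

# CIF code -> registered logical index set (reference numeral 1000).
CIF_TABLE = {
    "6950000": frozenset({105, 106, 107, 300}),
    "6760000": frozenset({105, 107, 300}),
}


def candidate_cif_codes(words):
    """Return every CIF code whose registered set contains all recognized indexes."""
    # Unregistered words such as "AND" are silently dropped.
    indexes = frozenset(BASIC_WORDS[w] for w in words if w in BASIC_WORDS)
    return sorted(cif for cif, registered in CIF_TABLE.items() if indexes <= registered)


# "AND" is dropped, leaving (105, 107, 300), which fits both registered sets.
print(candidate_cif_codes(["NAKAMORI", "AND", "CO., LTD", "MARUNOUCHI"]))
```

Two candidates come back for this input, which is the indeterminate result that forces the error handling described above.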

An object of the present invention is to enable a lexical phrase to be converted into data correctly even when an inclusion relationship exists, and thereby to improve the accuracy of the analysis processing.

According to one aspect of the present invention, the above object is achieved by an apparatus for converting a lexical phrase containing a plurality of words into other data containing a single piece of information specified by those words. The apparatus comprises: a basic word dictionary in which each of a plurality of words is registered in advance in storage means in association with a logical index representing that word; a name dictionary, registered in advance in storage means, in which each set of logical indexes corresponding to a set of words formed by combining words registered in the basic word dictionary is associated with the single piece of information specified by that set of words, and in which, when sets of logical indexes are in an inclusion relationship, the logical index of an unnecessary word is additionally associated with the single piece of information of the subsumed set of logical indexes; and a processing engine that converts the lexical phrase into the other data containing the specified piece of information. Here, an unnecessary word is a word that is not included in the set of words corresponding to the subsumed set of logical indexes, and is therefore unnecessary in the sense that it is not used to specify the piece of information directly, but that assists in finally specifying that piece of information. The processing engine expands the words registered in the basic word dictionary into memory in association with the logical indexes representing them; expands the sets of logical indexes and the unnecessary-word logical indexes registered in the name dictionary into the memory in association with the piece of information corresponding to each set; splits the lexical phrase to be converted into words; obtains the logical index corresponding to each split word by referring to the basic word dictionary expanded in the memory; selects, from among the expanded sets of logical indexes including the unnecessary-word logical indexes, the set that matches the set of logical indexes corresponding to the split words; and uses the single piece of information specified by the selected set to convert the lexical phrase into the other data containing that piece of information.
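A minimal sketch of this first aspect, under stated assumptions, might look as follows. The logical index 900 for "AND" is hypothetical, the word-to-index assignments are inferred from the FIG. 9 example, and the matching rule (an entry matches when the input covers all of its registered indexes and any leftover input indexes are among its unnecessary-word indexes) is one possible reading of the selection step, not a definitive implementation. `NameEntry` and `convert` are names invented for this illustration.

```python
from typing import NamedTuple

# Word -> logical index; 900 for "AND" is a hypothetical value.
BASIC_WORDS = {"NAKAMORI": 105, "ELECTRIC": 106, "CO., LTD": 107,
               "MARUNOUCHI": 300, "AND": 900}


class NameEntry(NamedTuple):
    cif_code: str
    indexes: frozenset                    # logical index set of the registered name
    unnecessary: frozenset = frozenset()  # unnecessary-word indexes, if any


# Name dictionary: the subsumed set (6760000) carries the unnecessary word "AND".
NAME_DICTIONARY = [
    NameEntry("6950000", frozenset({105, 106, 107, 300})),
    NameEntry("6760000", frozenset({105, 107, 300}), frozenset({900})),
]


def convert(words):
    """Return the unique matching CIF code, or None if there is no unique match."""
    indexes = frozenset(BASIC_WORDS[w] for w in words if w in BASIC_WORDS)
    matches = [
        entry.cif_code
        for entry in NAME_DICTIONARY
        # All registered indexes must appear in the input, and whatever is
        # left over must consist only of that entry's unnecessary words.
        if entry.indexes <= indexes and (indexes - entry.indexes) <= entry.unnecessary
    ]
    return matches[0] if len(matches) == 1 else None


print(convert(["NAKAMORI", "AND", "CO., LTD", "MARUNOUCHI"]))
```

With "AND" registered as an unnecessary word for "6760000", the input that was indeterminate in the FIG. 9 example now resolves to a single CIF code, while an input containing "ELECTRIC" still resolves to "6950000".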

In one embodiment of the apparatus, the apparatus preferably further comprises input means for entering information including data and display means for displaying information including data. When the processing engine refers to the basic word dictionary expanded in the memory to obtain the logical indexes corresponding to the split words and finds a word for which no logical index can be obtained, it identifies that word and displays it on the display means. In response to information, entered through the input means, indicating that the word displayed on the display means is an unnecessary word, the processing engine writes the word to the basic word dictionary in association with a logical index for it, and writes the logical index of that word to the name dictionary in association with the single piece of information specified by the set of words that remains when the unnecessary word is removed from the words contained in the lexical phrase to be converted.
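The registration flow in this embodiment can be sketched as below. The sketch assumes that the operator's confirmation has already been received (display and input handling are omitted), that a new logical index is allocated as the current maximum plus one (a hypothetical allocation rule), and that the word-to-index assignments follow the FIG. 9 example. `find_unregistered_words` and `register_unnecessary_word` are names invented for this illustration.

```python
# Word -> logical index, inferred (assumption) from the FIG. 9 example.
BASIC_WORDS = {"NAKAMORI": 105, "ELECTRIC": 106, "CO., LTD": 107, "MARUNOUCHI": 300}

# CIF code -> registered index set plus any unnecessary-word indexes.
NAME_DICTIONARY = {
    "6950000": {"indexes": frozenset({105, 106, 107, 300}), "unnecessary": set()},
    "6760000": {"indexes": frozenset({105, 107, 300}), "unnecessary": set()},
}


def find_unregistered_words(words):
    """Return the words that have no logical index (these would be displayed)."""
    return [w for w in words if w not in BASIC_WORDS]


def register_unnecessary_word(word, words):
    """Register `word` after the operator has confirmed it is an unnecessary word.

    Returns (cif_code, new_index). Raises ValueError if the remaining words
    do not identify a registered CIF code.
    """
    # Index set of the phrase with the unnecessary word removed.
    remaining = frozenset(BASIC_WORDS[w] for w in words
                          if w != word and w in BASIC_WORDS)
    for cif_code, entry in NAME_DICTIONARY.items():
        if entry["indexes"] == remaining:
            new_index = max(BASIC_WORDS.values()) + 1  # hypothetical allocation
            BASIC_WORDS[word] = new_index              # write to basic word dictionary
            entry["unnecessary"].add(new_index)        # write to name dictionary
            return cif_code, new_index
    raise ValueError("remaining words do not identify a registered CIF code")
```

For the phrase "NAKAMORI AND CO., LTD MARUNOUCHI", `find_unregistered_words` would report "AND", and calling `register_unnecessary_word("AND", words)` would attach its new index to the entry for "6760000".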

In another embodiment of the apparatus, before selecting, from among the expanded sets of logical indexes including the unnecessary-word logical indexes, the set that matches the set of logical indexes corresponding to the split words, the processing engine preferably first selects, from among the expanded sets of logical indexes without the unnecessary-word logical indexes, a set that matches some combination of a plurality of the logical indexes corresponding to the split words and that consists solely of the logical indexes in that combination, and uses the single piece of information specified by the selected set to convert the lexical phrase into the other data containing that piece of information.

According to another aspect of the present invention, the above object is also achieved by an apparatus for converting a lexical phrase containing a plurality of words into other data containing a single piece of information specified by those words. The apparatus comprises: a basic word dictionary in which each of a plurality of words is registered in advance in storage means in association with a logical index representing that word; a name dictionary, registered in advance in storage means, in which each set of logical indexes corresponding to a set of words formed by combining words registered in the basic word dictionary is associated with the single piece of information specified by that set of words; an unnecessary word dictionary, registered in advance in storage means, in which, when sets of logical indexes are in an inclusion relationship, the logical index of an unnecessary word is associated with the single piece of information of the subsumed set of logical indexes; and a processing engine that converts the lexical phrase into the other data containing the specified piece of information. Here, an unnecessary word is a word that is not included in the set of words corresponding to the subsumed set of logical indexes, and is therefore unnecessary in the sense that it is not used to specify the piece of information directly, but that assists in finally specifying that piece of information. The processing engine expands the words registered in the basic word dictionary into memory in association with the logical indexes representing them; expands the sets of logical indexes registered in the name dictionary into the memory in association with the piece of information corresponding to each set; expands the unnecessary-word logical indexes registered in the unnecessary word dictionary into the memory in association with their pieces of information; splits the lexical phrase to be converted into words; obtains the logical index corresponding to each split word by referring to the basic word dictionary expanded in the memory; selects, from among the expanded sets of logical indexes, a set that matches some combination of a plurality of the logical indexes corresponding to the split words, and provisionally obtains the single piece of information specified by the selected set; and, when an inclusion relationship exists, if the unnecessary-word logical index associated with the provisionally obtained piece of information in the expanded unnecessary word dictionary matches the logical index, among those corresponding to the split words, that was not used in provisionally obtaining that piece of information, confirms the provisionally obtained piece of information as the piece of information to be specified and uses it to convert the lexical phrase into the other data containing that piece of information.
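The two-stage flow of this second aspect, with a separate unnecessary word dictionary, can be sketched as follows. The index 900 for "AND" is hypothetical, the word-to-index assignments are inferred from the FIG. 9 example, and treating an input with no unused indexes as immediately confirmed is an assumption of this sketch. `UNNECESSARY_WORDS` and `convert` are names invented for this illustration.

```python
# Word -> logical index; 900 for "AND" is a hypothetical value.
BASIC_WORDS = {"NAKAMORI": 105, "ELECTRIC": 106, "CO., LTD": 107,
               "MARUNOUCHI": 300, "AND": 900}

# Name dictionary: CIF code -> registered logical index set.
NAME_DICTIONARY = {
    "6950000": frozenset({105, 106, 107, 300}),
    "6760000": frozenset({105, 107, 300}),
}

# Separate unnecessary word dictionary: CIF code -> unnecessary-word indexes.
UNNECESSARY_WORDS = {"6760000": frozenset({900})}


def convert(words):
    """Provisionally pick a CIF code, then confirm it via the unnecessary word dictionary."""
    indexes = frozenset(BASIC_WORDS[w] for w in words if w in BASIC_WORDS)
    for cif_code, registered in NAME_DICTIONARY.items():
        if not registered <= indexes:
            continue  # some registered index is missing from the input
        # cif_code is now the provisionally obtained piece of information.
        unused = indexes - registered
        # Confirm when nothing is left over (assumption), or when every unused
        # index is an unnecessary word registered for this CIF code.
        if not unused or unused <= UNNECESSARY_WORDS.get(cif_code, frozenset()):
            return cif_code
    return None


print(convert(["NAKAMORI", "AND", "CO., LTD", "MARUNOUCHI"]))
```

Keeping the unnecessary words in their own dictionary leaves the name dictionary unchanged from the earlier logical-index scheme, and the confirmation step only consults the extra dictionary for the provisionally selected CIF code.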

In one embodiment of this apparatus, the apparatus preferably further comprises input means for entering information including data and display means for displaying information including data. When the processing engine refers to the basic word dictionary expanded in the memory to obtain the logical indexes corresponding to the split words and finds a word for which no logical index can be obtained, it identifies that word and displays it on the display means. In response to information, entered through the input means, indicating that the word displayed on the display means is an unnecessary word, the processing engine writes the word to the basic word dictionary in association with a logical index for it, and writes the logical index of that word to the unnecessary word dictionary in association with the single piece of information specified by the set of words that remains when the unnecessary word is removed from the words contained in the lexical phrase to be converted.

In another embodiment of this apparatus, before selecting, from among the expanded sets of logical indexes, a set that matches some combination of a plurality of the logical indexes corresponding to the split words and provisionally obtaining the single piece of information specified by the selected set, the processing engine preferably first selects, from among the expanded sets of logical indexes, a set that matches some combination of a plurality of the logical indexes corresponding to the split words and that consists solely of the logical indexes in that combination, and uses the single piece of information specified by the selected set to convert the lexical phrase into the other data containing that piece of information.

Furthermore, according to one aspect of the present invention, the above object is achieved by a method performed on a computer that converts a lexical phrase containing a plurality of words into other data containing a single piece of information specified by those words. The computer is provided with: a basic word dictionary in which each of a plurality of words is registered in advance in storage means in association with a logical index representing that word; and a name dictionary, registered in advance in storage means, in which each set of logical indexes corresponding to a set of words formed by combining words registered in the basic word dictionary is associated with the single piece of information specified by that set of words, and in which, when sets of logical indexes are in an inclusion relationship, the logical index of an unnecessary word is additionally associated with the single piece of information of the subsumed set of logical indexes. Here, an unnecessary word is a word that is not included in the set of words corresponding to the subsumed set of logical indexes, and is therefore unnecessary in the sense that it is not used to specify the piece of information directly, but that assists in finally specifying that piece of information. The method causes the computer to execute the steps of: expanding the words registered in the basic word dictionary into memory in association with the logical indexes representing them, to generate a memory-expanded basic word dictionary; expanding the sets of logical indexes and the unnecessary-word logical indexes registered in the name dictionary into the memory in association with the piece of information corresponding to each set, to generate a memory-expanded name dictionary; splitting the lexical phrase to be converted into words; obtaining the logical index corresponding to each split word by referring to the memory-expanded basic word dictionary; and, when an inclusion relationship exists, selecting, from among the expanded sets of logical indexes including the unnecessary-word logical indexes, the set that matches the set of logical indexes corresponding to the split words, and using the single piece of information specified by the selected set to convert the lexical phrase into the other data containing that piece of information.

In one embodiment of the method, the computer is preferably further caused to execute the steps of: when a word for which no logical index can be obtained is found while referring to the memory-expanded basic word dictionary to obtain the logical indexes corresponding to the split words, identifying that word and displaying it on display means; and, in response to information, entered through input means, indicating that the word displayed on the display means is an unnecessary word, writing the word to the basic word dictionary in association with a logical index for it, and writing the logical index of that word to the name dictionary in association with the single piece of information specified by the set of words that remains when the unnecessary word is removed from the words contained in the lexical phrase to be converted.

In another embodiment of the method, before executing the step of selecting, from among the expanded sets of logical indexes including the unnecessary-word logical indexes, the set that matches the set of logical indexes corresponding to the split words, the computer is preferably further caused to execute a step of selecting, from among the expanded sets of logical indexes without the unnecessary-word logical indexes, a set that matches some combination of a plurality of the logical indexes corresponding to the split words and that consists solely of the logical indexes in that combination, and using the single piece of information specified by the selected set to convert the lexical phrase into the other data containing that piece of information.

According to another aspect of the present invention, the above object is achieved by a method performed on a computer that converts a lexical phrase containing a plurality of words into other data containing a single piece of information specified by those words. The computer is provided with: a basic word dictionary in which each of a plurality of words is registered in advance in storage means in association with a logical index representing that word; a name dictionary, registered in advance in storage means, in which each set of logical indexes corresponding to a set of words formed by combining words registered in the basic word dictionary is associated with the single piece of information specified by that set of words; and an unnecessary word dictionary, registered in advance in storage means, in which, when sets of logical indexes are in an inclusion relationship, the logical index corresponding to an unnecessary word is associated with the single piece of information of the subsumed set of logical indexes. Here, an unnecessary word is a word that is not included in the subsumed set of words, and is therefore unnecessary in the sense that it is not used to specify the piece of information directly, but that assists in finally specifying that piece of information. The method causes the computer to execute the steps of: expanding the words registered in the basic word dictionary into memory in association with the logical indexes representing them, to generate a memory-expanded basic word dictionary; expanding the sets of logical indexes registered in the name dictionary into the memory in association with the piece of information corresponding to each set, to generate a memory-expanded name dictionary; expanding the unnecessary-word logical indexes registered in the unnecessary word dictionary into the memory in association with their pieces of information, to generate a memory-expanded unnecessary word dictionary; splitting the lexical phrase to be converted into words; obtaining the logical index corresponding to each split word by referring to the memory-expanded basic word dictionary; selecting, from among the expanded sets of logical indexes, a set that matches some combination of a plurality of the logical indexes corresponding to the split words, and provisionally obtaining the single piece of information specified by the selected set; and, when an inclusion relationship exists, if the unnecessary-word logical index associated with the provisionally obtained piece of information in the memory-expanded unnecessary word dictionary matches the logical index, among those corresponding to the split words, that was not used in provisionally obtaining that piece of information, confirming the provisionally obtained piece of information as the piece of information to be specified and using it to convert the lexical phrase into the other data containing that piece of information.

According to one aspect of the present invention, it is preferable to further cause the computer to execute: a step of, when the memory-expanded basic word dictionary is referred to in order to acquire and convert the logical indexes corresponding to the decomposed words and there is a word for which no logical index can be acquired, identifying that word and displaying it on display means; and a step of, in response to information entered through input means indicating that the displayed word is an unnecessary word, writing the word to the basic word dictionary in association with a corresponding logical index, and writing to the unnecessary word dictionary the logical index of the word designated as an unnecessary word, in association with the one piece of information specified by the set of words that remains when the unnecessary word is removed from the words contained in the phrase to be converted.
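
The registration step described above can be sketched as follows. This is a minimal illustration, not the patented implementation: the function name `register_unnecessary_word`, the use of plain Python dictionaries for the three dictionaries, and the caller-supplied `new_index` are all assumptions made for the example, and the display and operator-input steps are omitted. It also assumes every other word in the phrase is already registered.

```python
from typing import Dict, FrozenSet, List, Optional, Set


def register_unnecessary_word(
    word: str,
    phrase_words: List[str],
    new_index: int,
    basic_words: Dict[str, int],
    name_dict: Dict[str, FrozenSet[int]],
    unnecessary: Dict[str, Set[int]],
) -> Optional[str]:
    """Record `word` as an unnecessary word; return the CIF code it was attached to."""
    # Write the new word and its logical index to the basic word dictionary.
    basic_words[word] = new_index
    # Logical indexes of the phrase with the unnecessary word removed
    # (assumes all remaining words are already registered).
    remaining = frozenset(basic_words[w] for w in phrase_words if w != word)
    # Find the one piece of information specified by the remaining words.
    for cif_code, index_set in name_dict.items():
        if index_set == remaining:
            unnecessary.setdefault(cif_code, set()).add(new_index)
            return cif_code
    return None


# Example values taken from the embodiment; "AND" is not yet registered.
basic_words = {"NAKAMORI": 105, "CO.,LTD": 107, "MARUNOUCHI": 300}
name_dict = {"6760000": frozenset({105, 107, 300})}
unnecessary: Dict[str, Set[int]] = {}
phrase = ["NAKAMORI", "AND", "CO.,LTD", "MARUNOUCHI"]
attached_to = register_unnecessary_word(
    "AND", phrase, 301, basic_words, name_dict, unnecessary
)
```

In this sketch the new word ends up in both the basic word dictionary and the unnecessary word dictionary, keyed by the CIF code of the remaining words.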

According to another aspect of the method of the present invention, it is preferable to further cause the computer to execute an additional step before the step of selecting, from the memory-expanded sets of logical indexes, a set that matches any set composed of a plurality of the logical indexes corresponding to the decomposed words and tentatively acquiring the one piece of information specified by the selected set. The additional step selects, from the memory-expanded sets of logical indexes, a set that matches such a set of the logical indexes corresponding to the decomposed words and that consists only of the logical indexes of that set, and uses the one piece of information specified by the selected set to convert the phrase containing the plurality of words into other data containing the specified information.
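
The ordering described above, an exact comparison first and tentative acquisition with confirmation second, might look like the sketch below. The function name `specify_information` and the dictionary layouts are assumptions for illustration. The sketch also interprets "matches" in the confirmation step to mean that every unused input index must be a registered unnecessary word for the tentatively acquired code, which is one possible reading of the text.

```python
from typing import Dict, FrozenSet, Iterable, Optional

# Example values taken from the embodiment.
NAME_DICT: Dict[str, FrozenSet[int]] = {
    "6950000": frozenset({105, 106, 107, 300}),
    "6760000": frozenset({105, 107, 300}),
}
UNNECESSARY: Dict[str, FrozenSet[int]] = {"6760000": frozenset({301, 108})}


def specify_information(
    input_indexes: Iterable[int],
    name_dict: Dict[str, FrozenSet[int]],
    unnecessary: Dict[str, FrozenSet[int]],
) -> Optional[str]:
    target = frozenset(input_indexes)
    # Earlier step: a registered set made up of exactly the input's indexes.
    for code, index_set in name_dict.items():
        if index_set == target:
            return code
    # Tentative acquisition: a registered set formed from some of the indexes.
    for code, index_set in name_dict.items():
        if index_set < target:
            unused = target - index_set
            # Confirmation: the unused indexes must be unnecessary words
            # registered for the tentatively acquired code.
            if unused <= unnecessary.get(code, frozenset()):
                return code
    return None


exact = specify_information([105, 106, 107, 300], NAME_DICT, UNNECESSARY)
confirmed = specify_information([105, 301, 107, 300], NAME_DICT, UNNECESSARY)
rejected = specify_information([105, 107, 300, 999], NAME_DICT, UNNECESSARY)
```

Without the earlier exact step, the first input would fail in this sketch: it contains the smaller set plus "#106", and "#106" is not an unnecessary word for "6760000".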

According to one aspect of the present invention, the above problem is solved by a program for a computer that converts a phrase containing a plurality of words into other data containing one piece of information specified by those words. The computer comprises: a basic word dictionary in which each of a plurality of words is registered in advance in storage means in association with a logical index representing that word; and a name dictionary in which each set of logical indexes corresponding to a set of words, formed by combining words registered in the basic word dictionary, is associated with the one piece of information specified by that set of words, and in which, when sets of logical indexes are in an inclusion relationship, the logical index corresponding to an unnecessary word for the included set is registered in advance in storage means in association with the one piece of information. An unnecessary word is a word that is not contained in the set of words corresponding to the included set of logical indexes, and is thus unnecessary in that it is not used when directly specifying the one piece of information, but that assists in finally specifying that information. The program causes the computer to execute the steps of: expanding the words registered in the basic word dictionary into memory in association with the logical indexes representing them, thereby generating a memory-expanded basic word dictionary; expanding the sets of logical indexes registered in the name dictionary, together with the logical indexes of the unnecessary words, into the memory in association with the one piece of information corresponding to each set, thereby generating a memory-expanded name dictionary; decomposing the phrase to be converted into words; referring to the memory-expanded basic word dictionary to acquire the logical index corresponding to each decomposed word and converting the word into it; and, when an inclusion relationship exists, selecting from the memory-expanded sets of logical indexes, with the logical indexes of the unnecessary words included, a set that matches the set of logical indexes corresponding to the decomposed words, and using the one piece of information specified by the selected set to convert the phrase containing the plurality of words into other data containing the specified information.

According to one aspect of the program of the present invention, it is preferable to further cause the computer to execute: a step of, when the memory-expanded basic word dictionary is referred to in order to acquire and convert the logical indexes corresponding to the decomposed words and there is a word for which no logical index can be acquired, identifying that word and displaying it on display means; and a step of, in response to information entered through input means indicating that the displayed word is an unnecessary word, writing the word to the basic word dictionary in association with a corresponding logical index, and writing to the name dictionary the logical index of the word designated as an unnecessary word, in association with the one piece of information specified by the set of words that remains when the unnecessary word is removed from the words contained in the phrase to be converted.

According to another aspect of the present invention, the above problem is solved by a program for a computer that converts a phrase containing a plurality of words into other data containing one piece of information specified by those words. The computer comprises: a basic word dictionary in which each of a plurality of words is registered in advance in storage means in association with a logical index representing that word; a name dictionary in which each set of logical indexes corresponding to a set of words, formed by combining words registered in the basic word dictionary, is registered in advance in storage means in association with the one piece of information specified by that set of words; and an unnecessary word dictionary in which, when sets of logical indexes are in an inclusion relationship, the logical index corresponding to an unnecessary word for the included set is registered in advance in storage means in association with the one piece of information. An unnecessary word is a word that is not contained in the set of words corresponding to the included set of logical indexes, and is thus unnecessary in that it is not used when directly specifying the one piece of information, but that assists in finally specifying that information. The program causes the computer to execute the steps of: expanding the words registered in the basic word dictionary into memory in association with the logical indexes representing them, thereby generating a memory-expanded basic word dictionary; expanding the sets of logical indexes registered in the name dictionary into the memory in association with the one piece of information corresponding to each set, thereby generating a memory-expanded name dictionary; expanding the logical indexes corresponding to the unnecessary words registered in the unnecessary word dictionary into the memory, each in association with the one piece of information to which it corresponds, thereby generating a memory-expanded unnecessary word dictionary; decomposing the phrase to be converted into words; referring to the memory-expanded basic word dictionary to acquire the logical index corresponding to each decomposed word and converting the word into it; selecting, from the memory-expanded sets of logical indexes, a set that matches any set composed of a plurality of the logical indexes corresponding to the decomposed words, and tentatively acquiring the one piece of information specified by the selected set; and, when an inclusion relationship exists, if the logical index of an unnecessary word that the memory-expanded unnecessary word dictionary associates with the tentatively acquired information matches a logical index, among those corresponding to the decomposed words, that was not used in tentatively acquiring the information, confirming the tentatively acquired information as the one piece of information to be specified and using it to convert the phrase containing the plurality of words into other data containing the specified information.

According to one aspect of the program of the present invention, it is preferable to cause the computer to execute: a step of, when the memory-expanded basic word dictionary is referred to in order to acquire and convert the logical indexes corresponding to the decomposed words and there is a word for which no logical index can be acquired, identifying that word and displaying it on display means; and a step of, in response to information entered through input means indicating that the displayed word is an unnecessary word, writing the word to the basic word dictionary in association with a corresponding logical index, and writing to the unnecessary word dictionary the logical index of the word designated as an unnecessary word, in association with the one piece of information specified by the set of words that remains when the unnecessary word is removed from the words contained in the phrase to be converted.

According to another aspect of the program of the present invention, it is preferable to further cause the computer to execute an additional step before the step of selecting, from the memory-expanded sets of logical indexes, a set that matches any set composed of a plurality of the logical indexes corresponding to the decomposed words and tentatively acquiring the one piece of information specified by the selected set. The additional step selects, from the memory-expanded sets of logical indexes, a set that matches such a set of the logical indexes corresponding to the decomposed words and that consists only of the logical indexes of that set, and uses the one piece of information specified by the selected set to convert the phrase containing the plurality of words into other data containing the specified information.

With the configuration described above, the present invention uses unnecessary words when converting a phrase into data, even where an inclusion relationship exists between sets of logical indexes or between sets of words. For the included set of logical indexes or words, an unnecessary word is a word that is not contained in the included set of words, and is thus unnecessary in that it is not used when directly specifying the desired piece of information, but that assists in finally specifying that information. When the set of logical indexes of the words in the input phrase is matched against the logical index sets of the name dictionary, the matching takes the logical indexes of unnecessary words into account, so the phrase can be converted accurately into the desired piece of information. In other words, determining whether an unnecessary word is present improves the analysis accuracy for the included logical index set.

Preferred embodiments of the present invention will be described below with reference to the drawings. Throughout the drawings, the same or similar reference numerals denote the same or similar components.
FIG. 1 shows the basic configuration of an apparatus for converting a phrase into data according to a preferred embodiment, in which the present invention is applied to customer information (CIF) analysis performed by computer on settlement instruction messages and the like received by a bank. In FIG. 1, reference numeral 10 denotes a processing engine composed of any data processing device such as a mainframe computer, a personal computer, or a microprocessor; 12 denotes a main memory; 14 denotes a basic word dictionary; 16 denotes a name dictionary; and 18 denotes an unnecessary word dictionary. The apparatus may also include an input device 20, such as a keyboard of the kind normally connected to a mainframe or personal computer, and a display device 22, such as an ordinary liquid crystal display. The basic word dictionary 14, the name dictionary 16, and the unnecessary word dictionary 18 are stored on a hard disk such as a magnetic disk (not shown), but are not limited to this and may be stored in any other type of storage device. The data processing device functioning as the processing engine 10, the main memory 12, the hard disk storing the basic word dictionary 14, the name dictionary 16, and the unnecessary word dictionary 18, the input device 20, and the display device 22 are connected to one another by an ordinary data bus or the like (not shown).

FIG. 2 shows the words registered in advance in the basic word dictionary 14 (hereinafter also called "basic words") expanded into the main memory 12. Input data handled in CIF analysis at a bank or the like normally consists of alphabetic and numeric characters, so the words registered in the basic word dictionary 14 also consist of alphabetic and numeric characters. In this specification, a word or basic word includes common nouns, proper nouns, and abbreviations, and also any set of symbols that carries some meaning. As shown in FIG. 2, an example of the basic word dictionary 14 has the following fields: word, part of speech, name attribute, code attribute, and a numeric logical index corresponding to each registered word. The logical index is preferably a number expressed in binary, in order to reduce the number of CPU processing units. The basic word dictionary of the present invention need only include a word field for registering basic words and a logical index field for storing the numeric logical index corresponding to each word; the other fields are not limited to those listed above. When a basic word is registered in the word field of the basic word dictionary 14, a number (a fixed value) corresponding to that word is assigned as its logical index and is stored, that is registered, in the logical index field in association with the basic word. When a phrase to be converted contains a basic word that is not yet registered, the new basic word and its logical index can be registered in the basic word dictionary 14; when a registered basic word is no longer used, the word and its logical index can be deleted.
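
As a rough illustration of the minimum structure described above (a word field and a logical index field, with registration and deletion), consider the sketch below. The class name `BasicWordDictionary` and its methods are hypothetical, and the optional fields (part of speech, name attribute, code attribute) are left out.

```python
from typing import Dict, Optional


class BasicWordDictionary:
    """Maps each basic word to a fixed numeric logical index."""

    def __init__(self) -> None:
        self._index_by_word: Dict[str, int] = {}

    def register(self, word: str, logical_index: int) -> None:
        # Each logical index represents exactly one word, so reject reuse.
        if logical_index in self._index_by_word.values():
            raise ValueError(f"logical index #{logical_index} is already in use")
        self._index_by_word[word] = logical_index

    def delete(self, word: str) -> None:
        # Remove a basic word that is no longer used, with its logical index.
        self._index_by_word.pop(word, None)

    def lookup(self, word: str) -> Optional[int]:
        # Return the logical index, or None if the word is not registered.
        return self._index_by_word.get(word)


dictionary = BasicWordDictionary()
dictionary.register("AKASAKA", 100)
dictionary.register("NAKAMORI", 105)
```

The index stays fixed for as long as the word is registered, regardless of where the entry happens to live in memory.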

Before starting the conversion process, the processing engine 10 expands the basic words registered in the basic word dictionary 14 into the main memory 12, as shown in FIG. 2. Because memory addresses are allocated dynamically, the same word is not stored at the same memory address every time the program is loaded; the address generally differs. In this example, for the basic word "AKASAKA" in the word field, address 50 in the main memory 12 is allocated dynamically when the program is loaded at a given time, as indicated by reference numeral 100, and the information on "AKASAKA" is stored at that location. Note that, in contrast to the memory address, the basic word "AKASAKA" is stored in the main memory 12 in association with the fixed logical index "#100". For high-speed processing it is preferable to expand the entire basic word dictionary 14 into the main memory 12. If the basic words contained in the phrases to be converted are known in advance, however, only the basic words that will be used may be expanded. Depending on the application, and at the cost of slower conversion, only part of the basic words registered in the basic word dictionary 14 may be expanded, with unexpanded basic words expanded additionally when the conversion process needs them.
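
The partial expansion option can be pictured with the following sketch, which preloads a chosen subset and expands other basic words only when a lookup needs them. The class name `PartiallyExpandedDictionary`, the `late_loads` counter, and the use of an ordinary dictionary to stand in for the on-disk basic word dictionary are assumptions made for illustration.

```python
from typing import Dict, Iterable, Optional


class PartiallyExpandedDictionary:
    """Keeps only some basic words in memory and expands the rest on demand."""

    def __init__(self, stored: Dict[str, int], preload: Iterable[str] = ()) -> None:
        self._stored = stored  # stands in for the dictionary on disk
        self._in_memory = {w: stored[w] for w in preload if w in stored}
        self.late_loads = 0  # number of words expanded after start-up

    def lookup(self, word: str) -> Optional[int]:
        if word not in self._in_memory and word in self._stored:
            # Additional expansion: slower than a lookup that is already in memory.
            self._in_memory[word] = self._stored[word]
            self.late_loads += 1
        return self._in_memory.get(word)


stored_words = {"AKASAKA": 100, "NAKAMORI": 105}
words = PartiallyExpandedDictionary(stored_words, preload=["AKASAKA"])
first = words.lookup("AKASAKA")    # already in memory
second = words.lookup("NAKAMORI")  # expanded on demand
```

Expanding everything up front corresponds to passing every stored word as `preload`, in which case no late loads occur.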

FIG. 3 shows the logical index sets registered in advance in the name dictionary 16 and the unnecessary word dictionary 18 expanded into the main memory 12. As in FIG. 2, FIG. 3 also shows the basic words registered in advance in the basic word dictionary 14 expanded into the main memory 12.

As shown in FIG. 3, each entry of the name dictionary 16 consists of a logical index set, a name, the one piece of information specified by the combination of words shown in the name (the CIF code in this banking example), and an inclusion code that marks where an inclusion relationship occurs. For input data that would have been analyzed unconditionally before the inclusion relationship arose, the inclusion code makes explicit the state "an inclusion relationship exists, so the CIF code must not be specified", which helps reduce misanalysis. The inclusion code also makes clear, when an inclusion relationship exists, that there are multiple logical index sets in that relationship. The name field of the name dictionary 16 holds, registered in advance, those basic words from the basic word dictionary 14 that belong to a combination of basic words specifying a CIF code. As an example, suppose the word combination "NAKAMORI ELECTRIC CO., LTD MARUNOUCHI" specifies the CIF code "6950000" and "NAKAMORI CO., LTD MARUNOUCHI" specifies "6760000". The first row of the name dictionary 16 then holds "NAKAMORI ELECTRIC CO., LTD MARUNOUCHI" and the second row holds "NAKAMORI CO., LTD MARUNOUCHI", each registered as a character string. The logical index set field holds the set of logical indexes corresponding to these combined basic words. For example, the first row holds "#105, #106, #107, #300" for the name "NAKAMORI ELECTRIC CO., LTD MARUNOUCHI", and the second row holds "#105, #107, #300" for the name "NAKAMORI CO., LTD MARUNOUCHI", each registered as a set of logical indexes from the basic word dictionary 14. The CIF code field holds the unique number corresponding to each name. For example, the first row holds the CIF code "6950000" for "NAKAMORI ELECTRIC CO., LTD MARUNOUCHI", and the second row holds the CIF code "6760000" for "NAKAMORI CO., LTD MARUNOUCHI".

In this example, an inclusion relationship exists between the CIF codes "6950000" and "6760000": the logical index set of the former contains the logical index set of the latter, "#105, #107, #300". That is, the latter set is included in the former. The inclusion code number "#4000" is therefore assigned to this pair, and "#4000" is registered in the inclusion code field of both rows, as shown in FIG. 3. The inclusion code field for the CIF code "6950000" additionally holds the logical index that is not common to both sets. In this example, "#106" (the word "ELECTRIC") is registered in the inclusion code field.
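
Detecting such an inclusion relationship amounts to a proper-subset test between registered logical index sets. The sketch below is one way this could be done; the function name `find_inclusions`, the record layout, and the numbering of inclusion codes from 4000 upward are assumptions for illustration, since the text does not say how inclusion codes are allocated.

```python
from typing import Dict, FrozenSet, List

# Example rows taken from FIG. 3 as described in the text.
NAME_DICT: Dict[str, FrozenSet[int]] = {
    "6950000": frozenset({105, 106, 107, 300}),
    "6760000": frozenset({105, 107, 300}),
}


def find_inclusions(
    name_dict: Dict[str, FrozenSet[int]], first_code: int = 4000
) -> List[dict]:
    """List every pair of rows in which one index set properly contains another."""
    records = []
    code = first_code
    for including_cif, larger in name_dict.items():
        for included_cif, smaller in name_dict.items():
            if smaller < larger:  # proper subset: `smaller` is included
                records.append(
                    {
                        "inclusion_code": code,
                        "including_cif": including_cif,
                        "included_cif": included_cif,
                        # Indexes not common to both sets (here, "ELECTRIC").
                        "extra_indexes": larger - smaller,
                    }
                )
                code += 1
    return records


inclusions = find_inclusions(NAME_DICT)
```

For the two example rows this yields a single record whose extra index is 106, matching the entry described for the CIF code "6950000".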

The name dictionary 16 can be updated: when a phrase to be converted contains a combination of basic words that is not registered, the combination containing the new basic word can be registered, and a registered combination that is no longer used can be deleted.

As shown in FIG. 3, each entry of the unnecessary word dictionary 18 consists of a CIF code and the logical indexes of unnecessary words. An unnecessary word is not registered in a logical index set of the name dictionary 16, so it is unnecessary in that it is not used when directly specifying the CIF code, but it assists in finally specifying the CIF code. An unnecessary word is usually a single word but may consist of several words. In the example shown in FIG. 3, "6760000" is registered in the CIF code field, and "AND" and "OR" are unnecessary words for "6760000", so the logical indexes "#301" and "#108" corresponding to "AND" and "OR" respectively (see the basic word dictionary 14 shown in FIG. 2) are registered in the unnecessary word logical index field. There may be only one unnecessary word, but because several words can be unnecessary words as in this case, the unnecessary word field of the unnecessary word dictionary 18 is structured so that the logical indexes of multiple words can be registered.

The unnecessary word dictionary 18 can be changed when an inclusion relationship newly arises or disappears, or when an unnecessary word newly arises or disappears.
Next, the operation of the apparatus shown in FIG. 1 will be described with reference to the flowchart of FIG. 4.

In step 100, the processing engine 10 performs initial processing. As described with reference to FIG. 2, it expands the words registered in the basic word dictionary 14 into the main memory 12 (the expanded state is shown as 14a in FIG. 3). It then expands the logical index sets registered in the name dictionary 16 into the main memory 12 in association with their CIF codes and inclusion codes, as indicated by reference numeral 16a in FIG. 3, and expands the CIF codes registered in the unnecessary word dictionary 18 into the main memory 12 in association with the logical indexes of the unnecessary words, as indicated by reference numeral 18a in FIG. 3.

The processing engine 10 next performs input data analysis 20, which is described with reference to the input data example shown in FIG. 5. Suppose that in step 102 the input data 30, "NAKAMORI AND CO., LTD MARUNOUCHI", is entered, as shown at 1 in FIG. 5. The input is then split into words using spaces as delimiters, as shown at 2 in FIG. 5. In step 104, the processing engine 10 refers to the basic word dictionary 14a expanded in the main memory 12, as shown at 3 in FIG. 5, and acquires the logical indexes as indicated by reference numeral 60. It then converts the input data into logical indexes, as shown at 4 in FIG. 5.
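
Steps 102 and 104 can be sketched as a split on spaces followed by a dictionary lookup per word. The function name `to_logical_indexes` is hypothetical, and the sketch assumes the company suffix arrives as the single token "CO.,LTD" with no internal space, so that splitting on spaces keeps it as one word. Words with no logical index are collected separately so that a later check for unconverted words is possible.

```python
from typing import Dict, List, Tuple

# Logical indexes as given in the example; the single-token spelling
# "CO.,LTD" is an assumption of this sketch.
BASIC_WORDS: Dict[str, int] = {
    "NAKAMORI": 105,
    "ELECTRIC": 106,
    "CO.,LTD": 107,
    "OR": 108,
    "MARUNOUCHI": 300,
    "AND": 301,
}


def to_logical_indexes(
    phrase: str, basic_words: Dict[str, int]
) -> Tuple[List[int], List[str]]:
    """Split a phrase on spaces and convert each word to its logical index."""
    indexes: List[int] = []
    unconverted: List[str] = []
    for word in phrase.split():
        index = basic_words.get(word)
        if index is None:
            unconverted.append(word)  # no logical index could be acquired
        else:
            indexes.append(index)
    return indexes, unconverted


indexes, unconverted = to_logical_indexes(
    "NAKAMORI AND CO.,LTD MARUNOUCHI", BASIC_WORDS
)
```

For the example input the result corresponds to "#105, #301, #107, #300" with no unconverted words.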

The processing engine 10 then performs matching processing 22.
In step 106, the processing engine 10 determines whether there is any word that could not be converted into a logical index because no logical index could be acquired for it. In the example shown in FIG. 5 there is no unconverted word, so processing proceeds to step 108.

In step 108, the processing engine 10 determines whether the converted set of logical indexes completely matches a logical index set of the name dictionary 16a (FIG. 3) expanded in the main memory 12. A complete match means that the logical indexes on both sides all agree, with none missing and none left over. Assume here that "NAKAMORI AND CO., LTD MARUNOUCHI" has not been given a CIF code and is not registered. Because the set of logical indexes corresponding to the input data, "#105, #301, #107, #300" (4 in FIG. 5), is not in the name dictionary 16a (FIG. 3), the processing engine 10 determines that there is no complete match, and processing proceeds to step 110.
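
A complete match as defined above can be expressed as set equality. In the sketch below the function name `complete_match` and the dictionary layout are assumptions, and the comparison is treated as independent of word order, which the text does not state explicitly for this step.

```python
from typing import Dict, FrozenSet, Iterable, Optional

# Example rows of the expanded name dictionary, as described for FIG. 3.
NAME_DICT: Dict[str, FrozenSet[int]] = {
    "6950000": frozenset({105, 106, 107, 300}),
    "6760000": frozenset({105, 107, 300}),
}


def complete_match(
    input_indexes: Iterable[int], name_dict: Dict[str, FrozenSet[int]]
) -> Optional[str]:
    """Return the CIF code whose index set equals the input set, if any."""
    target = frozenset(input_indexes)
    for cif_code, index_set in name_dict.items():
        if index_set == target:  # no index missing and none left over
            return cif_code
    return None


no_match = complete_match([105, 301, 107, 300], NAME_DICT)
full_match = complete_match([105, 106, 107, 300], NAME_DICT)
```

The first call mirrors the example input and finds no complete match, so a process built this way would go on to the partial matching step.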

In step 110, the processing engine 10 determines whether the set of converted logical indexes partially matches the logical index sets of the name dictionary 16a and the unnecessary-word logical indexes of the unnecessary word dictionary 18a (FIG. 3) expanded in the main memory 12. Here, partial matching means selecting, from among the logical index sets extended with the logical indexes of unnecessary words, the set that agrees with, that is, contains, the largest number of the logical indexes converted in step 104. Specifically, because the set of logical indexes converted in step 104 contains “#105”, “#107”, and “#300”, the first and second rows of the name dictionary 16a become candidates. The processing engine 10 then checks, using the CIF code as a key, whether logical indexes of unnecessary words are registered in the unnecessary word dictionary 18a. CIF code “6950000” has no unnecessary words registered in the unnecessary word dictionary 18a, whereas “6760000” has “#301” registered. Accordingly, the processing engine 10 does not select the first row of the name dictionary 16a; it selects the logical index set “#105, #107, #300” of the second row and determines whether that set, extended with the unnecessary-word logical index “#301” or “#108” from the unnecessary word dictionary 18a, matches the set of logical indexes “#105, #301, #107, #300” (4 in FIG. 5) corresponding to the input data. At this point the dictionary side is treated as having two logical index sets, “#105, #107, #300, #301” and “#105, #107, #300, #108”, and each is compared with the set of logical indexes on the input data side. In other words, the dictionary side yields as many logical index sets as there are registered unnecessary words, and all of them are compared. In the example of FIG. 5, the dictionary-side logical index set “#105, #107, #300, #301” matches the input-side set “#105, #301, #107, #300”, so the processing engine 10 selects and acquires the CIF code “6760000” of the second row of the name dictionary 16a (see reference numeral 32 in FIG. 1).
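The comparison described for step 110 can be sketched as follows. The sketch follows the worked example literally: each name dictionary row is extended with one registered unnecessary word at a time and the result is compared with the input set. The names `partial_match`, `NAME_DICTIONARY`, and `UNNECESSARY_WORDS` are hypothetical, and treating the index lists as unordered sets is an assumption.

```python
# Hypothetical copies of name dictionary 16a and unnecessary word dictionary 18a.
NAME_DICTIONARY = {
    "6950000": frozenset({105, 106, 107, 300}),
    "6760000": frozenset({105, 107, 300}),
}
UNNECESSARY_WORDS = {
    "6760000": [301, 108],  # logical indexes of "AND" and "OR"
}


def partial_match(indexes, name_dictionary, unnecessary_words):
    """Return the CIF code of a row that matches once one unnecessary word is added."""
    target = frozenset(indexes)
    for cif_code, index_set in name_dictionary.items():
        # A row with no registered unnecessary words is never selected here.
        for extra in unnecessary_words.get(cif_code, ()):
            # One dictionary-side set per registered unnecessary word.
            if index_set | {extra} == target:
                return cif_code
    return None
```

For the input of FIG. 5, `partial_match([105, 301, 107, 300], NAME_DICTIONARY, UNNECESSARY_WORDS)` returns `"6760000"`, because only the second row extended with `301` equals the input set.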

Therefore, even when names have an inclusion relationship, partial matching that takes unnecessary words into account acquires the correct CIF code, and the input data can be converted into the correct CIF code.

In step 112, the processing engine 10 converts the input data “NAKAMORI AND CO., LTD MARUNOUCHI” into the acquired CIF code “6760000”.

Next, the process performed when the input data of FIG. 5 is input while “AND” is not registered as an unnecessary word is described below with reference to FIGS. 4 and 6.
The initial processing in step 100 shown in FIG. 4 is as described above, and its description is omitted.

The processing engine 10 performs input data analysis 20. Suppose that, in step 102, the input data “NAKAMORI AND CO., LTD MARUNOUCHI” is input, as shown at 1 in FIG. 6. As with the input data of FIG. 5, it is then broken down into words using spaces as delimiters (see 2 in FIG. 6). In step 104, as shown at 3 in FIG. 6, the processing engine 10 refers to the basic word dictionary 14a expanded in the main memory 12 and acquires logical indexes, as indicated by reference numeral 62. At this point “AND” is not registered in the basic word dictionary 14, so no logical index can be acquired for it. Then, as shown at 4 in FIG. 6, the input data is converted into logical indexes. Because no logical index was acquired for “AND”, it is not converted and is left as it is.

The processing engine 10 then proceeds to the matching process 22.
In step 106, the processing engine 10 determines whether any word could not be converted into a logical index because no logical index could be acquired for it. Because “AND” has not been converted, the processing engine 10 proceeds to step 120. In step 120, the display device 22 displays, for example, the set of logical indexes and the unconverted word shown at 4 in FIG. 6, namely “#105 <AND> #107 #300”, together with the input data “NAKAMORI AND CO., LTD MARUNOUCHI”. Suppose the operator can determine, from the displayed information and from information related to the input data (for example, the sender or destination and the point of transmission), that the input data refers to “NAKAMORI CO., LTD MARUNOUCHI” with CIF code “6760000”, and judges that “AND” has the character of an unnecessary word. When the operator then enters, for example, “AND” from the input device 20, the processing engine 10 registers “AND” in the basic word dictionary 14 in association with a unique logical index, as indicated by reference numeral 64 in FIG. 2. Next, when the operator enters an instruction from the input device 20 to register “AND” in the unnecessary word dictionary 18 as an unnecessary word, the processing engine 10 registers the logical index “#301” of “AND” in association with the CIF code “6760000”, as indicated by reference numeral 66 in FIG. 3. Consequently, when the input data “NAKAMORI AND CO., LTD MARUNOUCHI” is input after this registration in the unnecessary word dictionary 18, it is correctly converted into the CIF code “6760000” by the process described above with reference to FIG. 5, even though an inclusion relationship exists.

Now suppose that “OR” in the input data “NAKAMORI OR CO., LTD MARUNOUCHI” has the character of an unnecessary word, just as “AND” does. When the data “NAKAMORI OR CO., LTD MARUNOUCHI” is input for the first time, it is processed in the same way as “NAKAMORI AND CO., LTD MARUNOUCHI”: “OR” is registered in the basic word dictionary 14 in association with the unique logical index “#108”, as indicated by reference numeral 68 in FIG. 2, and the logical index “#108” is registered in association with the CIF code “6760000”, as indicated by reference numeral 66 in FIG. 3. The unnecessary word dictionary 18 thus has a structure in which a plurality of unnecessary words can be registered in association with a CIF code. When “NAKAMORI OR CO., LTD MARUNOUCHI” is input after the unnecessary word “OR” has been registered, the process for identifying the correct CIF code is the same as for the input data “NAKAMORI AND CO., LTD MARUNOUCHI” described above.

The entries for unnecessary words in the basic word dictionary 14 and the unnecessary word dictionary 18 can also be registered in advance by the operator, who anticipates likely unnecessary words. The unique logical index assigned to an unnecessary word may be determined automatically by the processing engine 10 or selected by the operator; any method may be used as long as a unique logical index is assigned.
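The two registrations that follow step 120 can be sketched as a single helper. This is an illustration only: the function name `register_unnecessary_word`, the dictionary shapes, and the automatic rule “largest index in use plus one” are assumptions, since the text requires only that the assigned logical index be unique.

```python
def register_unnecessary_word(word, cif_code, basic_words, unnecessary_words, index=None):
    """Register word in the basic word dictionary and as an unnecessary word for cif_code.

    index may be chosen by the operator; when it is None a unique index is
    chosen automatically (hypothetical rule: largest index in use plus one).
    """
    if word in basic_words:
        index = basic_words[word]                 # already registered: reuse its index
    else:
        if index is None:
            index = max(basic_words.values(), default=0) + 1
        elif index in basic_words.values():
            raise ValueError(f"logical index #{index} is already in use")
        basic_words[word] = index                 # basic word dictionary 14
    registered = unnecessary_words.setdefault(cif_code, [])
    if index not in registered:
        registered.append(index)                  # unnecessary word dictionary 18
    return index
```

Registering “AND” with index 301 and then “OR” with index 108 for CIF code “6760000” leaves both indexes under that one CIF code, which is the multi-word structure the unnecessary word dictionary 18 is described as having.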

Next, the process performed when “NAKAMORI CO., LTD MARUNOUCHI” is input as input data is described with reference to FIGS. 4 and 7. Step 100 of FIG. 4 is the same as in the preceding example, and its description is omitted. In step 102 of FIG. 4, the input data is broken down into words, as shown at 1 and 2 in FIG. 7. Then, in step 104 of FIG. 4, the logical indexes corresponding to the decomposed words are acquired, as shown at 3 in FIG. 7 and indicated by reference numeral 70, and the set of words is converted into the logical indexes “#105, #107, #300”, as shown at 4 in FIG. 7. In the determination of step 106 of FIG. 4 there are no unconverted words, so the process proceeds to step 108.

Then, in step 108, it is determined whether the set of converted logical indexes “#105, #107, #300” completely matches a logical index set of the name dictionary 16a (FIG. 3) expanded in the main memory 12. Because the converted set “#105, #107, #300” agrees in every element, with nothing extra and nothing missing, with the logical index set “#105, #107, #300” in the second row of the name dictionary 16a, the process performed by the processing engine 10 proceeds to step 122. In step 122, the processing engine 10 selects the CIF code “6760000” corresponding to the logical index set “#105, #107, #300” in the second row of the name dictionary 16a and converts the input data “NAKAMORI CO., LTD MARUNOUCHI” into the CIF code “6760000”. Thus “NAKAMORI CO., LTD MARUNOUCHI”, which has an inclusion relationship with “NAKAMORI ELECTRIC CO., LTD MARUNOUCHI”, can also be converted into the correct CIF code.

When “NAKAMORI ELECTRIC CO., LTD MARUNOUCHI” is input as data, the same processing as for the input data “NAKAMORI CO., LTD MARUNOUCHI” produces, in step 108, a complete match with the logical index set “#105, #106, #107, #300” in the first row of the name dictionary 16a of FIG. 3, and the data is converted into the CIF code “6950000” in step 122. Thus input data that is in an inclusion relationship and is on the including side can also be converted into the correct CIF code.

If partial matching does not succeed in step 110 of FIG. 4, the input is treated as unanalyzable, as indicated by reference numeral 124: a message to that effect is displayed on the display device 22, the analysis is stopped, and the analysis is left to the operator. This avoids inaccurate analysis.
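The branches of FIG. 4 described so far can be put together in one sketch. It is a simplified illustration under assumptions: the function name `convert`, the status strings, the space-free token `CO.,LTD`, and the use of unordered sets are hypothetical, and the operator interaction of step 120 is reduced to returning the unconverted words.

```python
BASIC_WORDS = {"NAKAMORI": 105, "ELECTRIC": 106, "CO.,LTD": 107,
               "OR": 108, "MARUNOUCHI": 300, "AND": 301}
NAME_DICTIONARY = {"6950000": frozenset({105, 106, 107, 300}),
                   "6760000": frozenset({105, 107, 300})}
UNNECESSARY_WORDS = {"6760000": [301, 108]}


def convert(text):
    """Return (status, value) for one input phrase, following FIG. 4."""
    words = text.split()                                    # step 102
    missing = [w for w in words if w not in BASIC_WORDS]    # step 104
    if missing:                                             # step 106 -> step 120
        return ("unconverted", missing)
    target = frozenset(BASIC_WORDS[w] for w in words)
    for cif_code, index_set in NAME_DICTIONARY.items():     # step 108
        if index_set == target:
            return ("complete", cif_code)                   # step 122
    for cif_code, index_set in NAME_DICTIONARY.items():     # step 110
        for extra in UNNECESSARY_WORDS.get(cif_code, ()):
            if index_set | {extra} == target:
                return ("partial", cif_code)                # step 112
    return ("unanalyzable", None)                           # reference numeral 124
```

Returning an explicit “unanalyzable” status instead of the nearest candidate reflects the stated aim of avoiding inaccurate analysis.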

Next, a modification of the structure of the name dictionary 16 and the unnecessary word dictionary 18 is described with reference to FIG. 8.
Instead of providing the unnecessary word dictionary 18 separately, a main column 80 and an unnecessary word column 82 are provided in the logical index set column of the name dictionary 16′, as shown in FIG. 8. The logical index set that corresponds one-to-one to the CIF code is registered in the main column 80, and the logical indexes corresponding to unnecessary words are registered in the unnecessary word column 82. When there are several unnecessary words, the unnecessary word column 82 can hold the logical index of each of them. When there are no unnecessary words, the unnecessary word column 82 is empty.

Only the differences from the operation of the embodiment shown in FIG. 3 are described below. Assume that the name dictionary 16′ has been expanded in the main memory 12. The complete matching in step 108 of FIG. 4 compares the set of logical indexes obtained by converting the input data only with the logical index sets in the main column 80a of the name dictionary 16′a expanded in the main memory 12. The method of complete matching is the same as in the embodiment of FIG. 3.

The partial matching in step 110 of FIG. 4 compares the set of logical indexes obtained by converting the input data with the logical index set in the main column 80a extended with the unnecessary words in the unnecessary word column 82a. The method of partial matching is the same as in the embodiment of FIG. 3.

Logical indexes of unnecessary words are registered in the unnecessary word column 82 in the same way as unnecessary words are registered in the unnecessary word dictionary 18. The names registered in the name column of the name dictionary 16′ may include the unnecessary words, as shown in FIG. 8, or may omit them.
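The single-dictionary layout of FIG. 8 can be sketched with one record per row. The type name `NameRow`, the function name `lookup`, and the row contents (carried over from the FIG. 3 example, since the contents of FIG. 8 are not reproduced here) are assumptions.

```python
from typing import NamedTuple


class NameRow(NamedTuple):
    cif_code: str
    main: frozenset          # main column 80: index set tied one-to-one to the CIF code
    unnecessary: frozenset   # unnecessary word column 82: may be empty


# Hypothetical rows of name dictionary 16' reusing the FIG. 3 values.
ROWS = [
    NameRow("6950000", frozenset({105, 106, 107, 300}), frozenset()),
    NameRow("6760000", frozenset({105, 107, 300}), frozenset({301, 108})),
]


def lookup(indexes, rows):
    """Complete matching on the main column only, then partial matching."""
    target = frozenset(indexes)
    for row in rows:                       # step 108: main column 80 only
        if row.main == target:
            return (row.cif_code, "complete")
    for row in rows:                       # step 110: main column plus one unnecessary word
        for extra in row.unnecessary:
            if row.main | {extra} == target:
                return (row.cif_code, "partial")
    return None
```

Keeping the unnecessary words on the same row removes the separate lookup by CIF code, while the two matching steps stay the same as in the two-dictionary form.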

Next, one modified embodiment of the present invention is described. First, an example of dictionary registration such as that shown in FIG. 10 is described.
FIG. 10 shows the state in which the logical index sets registered in advance in the name dictionary 16″ and the unnecessary word dictionary 18″ have been expanded in the main memory 12. FIG. 10 also shows the state in which the basic words registered in advance in a basic word dictionary 14 like that of FIG. 2 have been expanded in the main memory 12. Suppose that, among the basic words registered in the basic word dictionary 14, the combination “NAKAMORI ELECTRIC CO., LTD MARUNOUCHI” specifies the CIF code “6950000”, “NAKAMORI CO., LTD MARUNOUCHI” specifies “6760000”, and “NAKAYAMA PRINTING CORP” specifies “8984000”. Accordingly, in the name column of the name dictionary 16″, “NAKAMORI ELECTRIC CO., LTD MARUNOUCHI” is registered as a character string in the first row, “NAKAMORI CO., LTD MARUNOUCHI” in the second row, and “NAKAYAMA PRINTING CORP” in the third row. The logical index set column holds the sets of logical indexes corresponding to these combinations of basic words. For example, “#105, #106, #107, #300” is registered in the first row of the name dictionary 16″ for the name “NAKAMORI ELECTRIC CO., LTD MARUNOUCHI”, “#105, #107, #300” in the second row for the name “NAKAMORI CO., LTD MARUNOUCHI”, and “#401, #210, #220” in the third row for the name “NAKAYAMA PRINTING CORP”, each as a set of logical indexes of the basic word dictionary 14. The CIF code column holds the unique number corresponding to each name. For example, the CIF code “6950000” corresponding to “NAKAMORI ELECTRIC CO., LTD MARUNOUCHI” is registered in the first row of the name dictionary 16″, the CIF code “6760000” corresponding to “NAKAMORI CO., LTD MARUNOUCHI” in the second row, and the CIF code “8984000” corresponding to “NAKAYAMA PRINTING CORP” in the third row.

In this example, an inclusion relationship exists between the CIF codes “6950000” and “6760000”: the set of logical indexes of the former contains the set of logical indexes of the latter, “#105, #107, #300”. In other words, the latter set of logical indexes is included in the former. “#4000” is therefore assigned to this pair as an inclusion code number, and “#4000” is registered in the inclusion code column of both rows, as shown in FIG. 10. The inclusion code column for CIF code “6950000” additionally holds the logical index that is not part of the common portion; in this example, “#106” (the word “ELECTRIC”) is registered in the inclusion code column.
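The inclusion relationship can be detected with a proper-subset test on the logical index sets. The sketch below is illustrative only: the function name `find_inclusions` is hypothetical, and the text does not say how inclusion codes such as “#4000” are allocated, so the sketch reports only the pairs and the non-common logical indexes.

```python
# Hypothetical copy of the logical index set column of name dictionary 16''.
NAME_DICTIONARY = {
    "6950000": frozenset({105, 106, 107, 300}),
    "6760000": frozenset({105, 107, 300}),
    "8984000": frozenset({401, 210, 220}),
}


def find_inclusions(name_dictionary):
    """List (including CIF, included CIF, non-common indexes) for every inclusion."""
    found = []
    for outer_cif, outer in name_dictionary.items():
        for inner_cif, inner in name_dictionary.items():
            if inner < outer:   # inner is a proper subset of outer
                found.append((outer_cif, inner_cif, sorted(outer - inner)))
    return found
```

For the three rows of the example this yields one entry, pairing “6950000” with “6760000” and reporting `106` (“ELECTRIC”) as the logical index outside the common portion.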

Next, the process performed when the data of FIG. 5 is input with the dictionary registration described above (FIG. 10) is described below with reference to the flowchart of FIG. 11 and FIG. 5.
The initial processing in step 100 shown in FIG. 11 and the input data analysis 20 are as described above, and their description is omitted. The processing engine 10 then proceeds to the matching process 22′.

In step 106, the processing engine 10 determines whether any word could not be converted into a logical index because no logical index could be acquired for it. In the example shown in FIG. 5 there are no unconverted words, so the process proceeds to step 108.

In step 108′, the processing engine 10 determines whether the set of converted logical indexes completely matches a logical index set of the name dictionary 16″b (FIG. 10) expanded in the main memory 12. In the example shown in FIG. 5 there is no complete match, so the process proceeds to step 110.

In step 110′, the processing engine 10 determines whether the set of converted logical indexes partially matches the logical index sets of the name dictionary 16″b and the unnecessary-word logical index sets of the unnecessary word dictionary 18″b (FIG. 10) expanded in the main memory 12. In the example shown in FIG. 5, partial matching selects and acquires the CIF code “6760000” in the second row of the name dictionary 16″b (see reference numeral 32 in FIG. 1). The process proceeds to step 116.

In step 116, the processing engine 10 determines whether an inclusion code is registered in the row of the name dictionary 16″b for the CIF code “6760000” acquired in step 110. In the example shown in FIG. 5, the inclusion code “4000” is present in the row for CIF code “6760000” registered in the name dictionary 16″b, so the process proceeds to step 114.

In step 114, the processing engine 10 outputs an inclusion relationship list. Specifically, it searches the name dictionary 16″b using the inclusion code “4000” acquired in step 116 as a key and displays the names found on the display device 22. In the example of FIG. 5, the names “NAKAMORI ELECTRIC CO., LTD MARUNOUCHI” and “NAKAMORI CO., LTD MARUNOUCHI” registered in the name dictionary 16″b are displayed. This makes explicit that an inclusion relationship exists for the name in question and which names cause it, so that they can be recognized as candidates for correction in the name dictionary. The process then proceeds to step 118.
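Steps 116 and 114 can be sketched as a lookup of the inclusion code followed by a search on that code. The row layout, the function name `inclusion_list`, the space-free spelling `CO.,LTD`, and the empty inclusion code on the third row are assumptions made for the illustration.

```python
# Hypothetical rows of name dictionary 16''b with the inclusion code column of FIG. 10.
ROWS = [
    {"name": "NAKAMORI ELECTRIC CO.,LTD MARUNOUCHI", "cif": "6950000", "inclusion_code": 4000},
    {"name": "NAKAMORI CO.,LTD MARUNOUCHI", "cif": "6760000", "inclusion_code": 4000},
    {"name": "NAKAYAMA PRINTING CORP", "cif": "8984000", "inclusion_code": None},
]


def inclusion_list(cif_code, rows):
    """Return the names sharing the inclusion code of cif_code (empty if it has none)."""
    # Step 116: is an inclusion code registered on the row of the acquired CIF code?
    code = next((r["inclusion_code"] for r in rows if r["cif"] == cif_code), None)
    if code is None:
        return []
    # Step 114: search the name dictionary with the inclusion code as the key.
    return [r["name"] for r in rows if r["inclusion_code"] == code]
```

Calling `inclusion_list("6760000", ROWS)` returns both NAKAMORI names, the list that the display device 22 would show as correction candidates.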

In step 118, the processing engine 10 makes a final matching determination. Specifically, it determines whether the CIF was identified by the complete matching of step 108′ or by the partial matching of step 110′. If the CIF was identified by the complete matching of step 108′, the processing engine 10 proceeds to step 112. If the CIF was identified by the partial matching of step 110′, the processing engine 10 determines whether the set of converted logical indexes partially matches the logical index sets of the name dictionary 16″b and the unnecessary-word logical index sets of the unnecessary word dictionary 18″b (FIG. 10) expanded in the main memory 12; if the match succeeds, the processing engine 10 proceeds to step 112, and if it fails, the processing engine 10 proceeds to step 126.

In the example of FIG. 5, the CIF “6760000” was identified by partial matching, so the process proceeds to step 112 and the CIF “6760000” is identified.

FIG. 1 shows the basic configuration of an apparatus for converting lexical phrases into data according to a preferred embodiment, in which the present invention is applied to CIF analysis processing that uses a computer on settlement instruction messages and the like received by a bank or similar institution.
FIG. 2 shows the state in which the basic words registered in advance in the basic word dictionary 14 have been expanded in the main memory 12.
FIG. 3 shows the state in which the logical index sets registered in advance in the name dictionary 16 and the unnecessary word dictionary 18 have been expanded in the main memory 12.
FIG. 4 is a flowchart showing the operation of the apparatus shown in FIG. 1.
FIG. 5 shows an example of conversion into logical indexes of input data that has an inclusion relationship and contains an unnecessary word.
FIG. 6 shows an example of conversion into logical indexes of input data that has an inclusion relationship and contains no unnecessary word.
FIG. 7 shows an example of conversion into logical indexes of other input data that has an inclusion relationship and contains no unnecessary word.
FIG. 8 shows a modification of the structure of the name dictionary 16 and the unnecessary word dictionary 18 shown in FIG. 3.
FIG. 9 is a diagram for explaining a problem in processing input data that has an inclusion relationship.
FIG. 10 is a diagram for explaining an example of dictionary registration in one modified embodiment of the present invention.
FIG. 11 is a flowchart of one modified embodiment of the present invention.

Explanation of symbols

10 Processing engine
12 Main memory
14 Basic word dictionary
16 Name dictionary
18 Unnecessary word dictionary
20 Input device
22 Display device

Claims

An apparatus for converting a lexical phrase including a plurality of words into other data including one piece of information specified by the plurality of words, the apparatus comprising:
a basic word dictionary in which each of a plurality of words is registered in advance in storage means in association with a logical index representing that word;
a name dictionary in which a set of logical indexes corresponding to a set of words obtained by combining words registered in the basic word dictionary is registered in advance in storage means in association with one piece of information specified by the set of words, and in which, when sets of logical indexes are in an inclusion relationship, a logical index corresponding to an unnecessary word is registered in advance in storage means in association with the one piece of information, the unnecessary word being a word that is not contained in the set of words corresponding to the included set of logical indexes and is therefore unnecessary in that it is not used to specify the one piece of information directly, but that assists in finally specifying the one piece of information; and
a processing engine for converting the lexical phrase including the plurality of words into the other data including the specified one piece of information,
wherein the processing engine:
expands the words registered in the basic word dictionary in memory in association with the logical indexes representing the respective words;
expands the sets of logical indexes and the logical indexes of unnecessary words registered in the name dictionary in the memory in association with the one piece of information specified for each set of logical indexes;
breaks the lexical phrase to be converted down into words;
refers to the basic word dictionary expanded in the memory, acquires the logical indexes corresponding to the decomposed words, and converts the words into those logical indexes; and
selects, from the sets of logical indexes expanded in the memory and extended with the logical indexes of unnecessary words, a set of logical indexes that matches the set of logical indexes corresponding to the decomposed words, and converts the lexical phrase including the plurality of words into the other data including the specified one piece of information by using the one piece of information specified by the selected set of logical indexes.

The apparatus according to claim 1, further comprising input means for inputting information including data, and display means for displaying information including data,
wherein the processing engine further:
when, in referring to the basic word dictionary expanded in the memory and acquiring and converting the logical indexes corresponding to the decomposed words, there is a word for which no logical index can be acquired, displays information specifying that word on the display means; and
in response to information input by the input means indicating that the word displayed on the display means is an unnecessary word, writes the word into the basic word dictionary in association with a corresponding logical index, and writes the logical index of the word designated as the unnecessary word into the name dictionary in association with the one piece of information specified from the group of words of the lexical phrase to be converted excluding the word designated as the unnecessary word.

The processing engine further selects a logical index set that matches the logical index set corresponding to the decomposed word from the memory expanded logical index set including a logical index of an unnecessary word. Any one of a plurality of logical indexes among the logical indexes corresponding to the decomposed words from the set of the memory-expanded logical indexes that do not include the logical indexes of unnecessary words. A set of logical indexes that is identical to the set of logical indexes and is composed only of the logical indexes of the set of any of the logical indexes, and is specified by the selected set of logical indexes By using one piece of information, the lexical phrase including the plurality of words is identified. The apparatus of claim 1, wherein converting to another data including One information.

An apparatus for converting a lexical phrase including a plurality of words into other data including one piece of information specified by the plurality of words, comprising:
A basic word dictionary in which each of a plurality of words is pre-registered in storage means in association with a logical index representing that word;
A name dictionary in which a set of logical indexes, corresponding to a set of words obtained by combining words registered in the basic word dictionary, is pre-registered in storage means in association with the one piece of information specified by that set of words;
An unnecessary word dictionary in which, when sets of logical indexes are in an inclusion relationship, the logical index of an unnecessary word is pre-registered in storage means, for the included set of logical indexes, in association with the one piece of information, the unnecessary word being a word that is not included in the set of words corresponding to the included set of logical indexes, and is therefore unnecessary in the sense that it is not used when directly specifying the one piece of information, but that assists in finally specifying the one piece of information; and
A processing engine for converting the lexical phrase including the plurality of words into the other data including the specified one piece of information,
Wherein the processing engine:
Expands the words registered in the basic word dictionary in a memory in association with the logical index representing each word;
Expands the sets of logical indexes registered in the name dictionary in the memory in association with the specified one piece of information corresponding to each set of logical indexes;
Expands the logical indexes of the unnecessary words registered in the unnecessary word dictionary in the memory in association with the one piece of information;
Decomposes the lexical phrase to be converted into words;
Converts each decomposed word into its corresponding logical index by referring to the basic word dictionary expanded in the memory;
Selects, from among the sets of logical indexes expanded in the memory, a set of logical indexes that matches any set of logical indexes formed from the logical indexes corresponding to the decomposed words, and temporarily acquires the one piece of information specified by the selected set of logical indexes; and
When there is an inclusion relationship, if, in the unnecessary word dictionary expanded in the memory, the logical index of the unnecessary word corresponding to the temporarily acquired one piece of information matches a logical index that corresponds to one of the decomposed words and was not used when temporarily acquiring the one piece of information, determines the temporarily acquired one piece of information to be the one piece of information to be specified, and, by using that one piece of information, converts the lexical phrase including the plurality of words into the other data including the specified one piece of information.
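The two-stage lookup recited above (temporary acquisition from the name dictionary, then confirmation against the unnecessary word dictionary) can be illustrated with a minimal Python sketch. This is not the claimed implementation. The dictionary contents, the second customer code "7654321", the word "TOKYO", the function names, and the policy of trying larger index sets first are all assumptions made for illustration; the claim itself does not say what happens when no unnecessary word confirms the match, so the sketch simply returns None in that case.

```python
from itertools import combinations

# Hypothetical sample data. Only the company words and the code "1234567"
# echo the example in the description; everything else is invented.
BASIC_WORDS = {"GETRONICS": 1, "FOODS": 2, "CO.,LTD": 3, "AKASAKA": 4, "TOKYO": 5}
NAME_DICT = {
    frozenset({1, 2, 3}): "1234567",     # included set
    frozenset({1, 2, 3, 4}): "7654321",  # including set
}
UNNECESSARY_DICT = {"1234567": {5}}  # information -> indexes of its unnecessary words


def to_indexes(words, basic=BASIC_WORDS):
    """Convert decomposed words to logical indexes, skipping unknown words."""
    return {basic[w] for w in words if w in basic}


def is_included(key, name_dict):
    """True if some other registered set strictly contains this one."""
    return any(key < other for other in name_dict)


def convert(words, name_dict=NAME_DICT, unnecessary=UNNECESSARY_DICT):
    indexes = to_indexes(words)
    for size in range(len(indexes), 1, -1):  # assumption: largest sets first
        for combo in combinations(sorted(indexes), size):
            key = frozenset(combo)
            info = name_dict.get(key)  # temporary acquisition
            if info is None:
                continue
            if not is_included(key, name_dict):
                return info  # assumption: no inclusion relationship, accept as is
            unused = indexes - key  # indexes not used for the temporary match
            if unnecessary.get(info, set()) & unused:
                return info  # an unused index is a registered unnecessary word
    return None
```

With this sample data, the words GETRONICS, FOODS, CO.,LTD and TOKYO resolve to "1234567" because TOKYO is registered as an unnecessary word for that code, while the same company words followed by AKASAKA resolve to the including set's code.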

An input means for inputting information including data, and a display means for displaying information including data, are further provided;
The processing engine further:
Referring to the basic word dictionary expanded in the memory, when converting each decomposed word into its corresponding logical index, if there is a word for which no logical index can be acquired, displays that word on the display means; and
In response to information, input by the input means, indicating that the word displayed on the display means is an unnecessary word, writes the word into the basic word dictionary in association with a corresponding logical index, and writes into the unnecessary word dictionary the logical index of the word designated as the unnecessary word, in association with the one piece of information specified from the set of words that remains when that word is excluded from the words included in the lexical phrase to be converted. The apparatus according to claim 4.
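The registration flow in this claim (show an unknown word, let the operator mark it as unnecessary, then write to both dictionaries) might look like the following sketch. It is illustrative only: the `confirm` callback stands in for the display and input means, `info` is assumed to have already been specified from the remaining words, and assigning the next integer as the new logical index is an assumption, since the claim does not say how indexes are allocated.

```python
def find_unknown(words, basic):
    """Return the words that have no logical index in the basic word dictionary."""
    return [w for w in words if w not in basic]


def register_unnecessary(word, info, basic, unnecessary):
    """Write the word to the basic word dictionary and link its index to info."""
    index = basic.get(word)
    if index is None:
        # Assumption: allocate the next unused integer as the logical index.
        index = max(basic.values(), default=0) + 1
        basic[word] = index
    unnecessary.setdefault(info, set()).add(index)
    return index


def learn(words, info, basic, unnecessary, confirm):
    """Register every unknown word that the operator confirms as unnecessary.

    confirm(word) is a hypothetical stand-in for the display and input means.
    """
    learned = []
    for word in find_unknown(words, basic):
        if confirm(word):
            register_unnecessary(word, info, basic, unnecessary)
            learned.append(word)
    return learned
```

A caller would pass, for example, `confirm=lambda word: input(f"{word}: unnecessary? [y/n] ") == "y"` to ask an operator interactively.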

Before selecting, from among the sets of logical indexes expanded in the memory, a set of logical indexes that matches any set composed of a plurality of the logical indexes corresponding to the decomposed words and temporarily acquiring the one piece of information specified by the selected set of logical indexes, the processing engine further selects, from among the sets of logical indexes expanded in the memory, a set of logical indexes that matches any set composed of a plurality of the logical indexes corresponding to the decomposed words and that consists only of the logical indexes of that set, and, by using the one piece of information specified by the selected set of logical indexes, converts the lexical phrase including the plurality of words into other data including the specified one piece of information. The apparatus according to claim 4 or 5.

A method for a computer that converts a lexical phrase including a plurality of words into other data including one piece of information specified by the plurality of words, the computer being provided with a basic word dictionary in which each word is pre-registered in storage means in association with a logical index representing that word, and a name dictionary in which a set of logical indexes corresponding to a set of words obtained by combining words registered in the basic word dictionary is pre-registered in storage means in association with the one piece of information specified by that set of words and in which, when sets of logical indexes are in an inclusion relationship, the logical index of an unnecessary word is pre-registered, for the included set of logical indexes, in association with the one piece of information, the unnecessary word being a word that is not included in the set of words corresponding to the included set of logical indexes, and is therefore unnecessary in the sense that it is not used when directly specifying the one piece of information, but that assists in finally specifying the one piece of information, the method causing the computer to execute the steps of:
Expanding the words registered in the basic word dictionary in a memory in association with the logical index representing each word, thereby generating a memory-expanded basic word dictionary;
Expanding the sets of logical indexes and the logical indexes of unnecessary words registered in the name dictionary in the memory in association with the specified one piece of information corresponding to each set of logical indexes, thereby generating a memory-expanded name dictionary;
Decomposing the lexical phrase to be converted into words;
Converting each decomposed word into its corresponding logical index by referring to the memory-expanded basic word dictionary; and
When there is an inclusion relationship, selecting, from among the memory-expanded sets of logical indexes that include a logical index of an unnecessary word, a set of logical indexes that matches the set of logical indexes corresponding to the decomposed words, and, by using the one piece of information specified by the selected set of logical indexes, converting the lexical phrase including the plurality of words into the other data including the specified one piece of information.
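In this variant the unnecessary word's logical index is stored in the name dictionary itself, so the lookup reduces to matching the full set of input indexes against the memory-expanded sets. The sketch below is one possible reading, not the claimed implementation. The entry layout, the sample codes, the word "TOKYO", and whitespace splitting as the decomposition step are assumptions; the description's example phrase would need a more capable decomposer, and "CO.,LTD" is written without a space here only so that splitting keeps it as one word.

```python
BASIC_WORDS = {"GETRONICS": 1, "FOODS": 2, "CO.,LTD": 3, "AKASAKA": 4, "TOKYO": 5}

# Hypothetical name dictionary entries:
# (indexes of the name words, index of an unnecessary word or None, information)
NAME_ENTRIES = [
    (frozenset({1, 2, 3}), 5, "1234567"),        # included set plus unnecessary word
    (frozenset({1, 2, 3, 4}), None, "7654321"),  # including set
]


def expand_name_dictionary(entries):
    """Build the in-memory table keyed by the set including the unnecessary index."""
    table = {}
    for name_indexes, unnecessary_index, info in entries:
        if unnecessary_index is None:
            key = name_indexes
        else:
            key = name_indexes | {unnecessary_index}
        table[key] = info
    return table


def convert(phrase, basic, table):
    """Decompose the phrase, map words to indexes, and match the whole set."""
    words = phrase.split()  # assumption: whitespace decomposition
    indexes = frozenset(basic[w] for w in words if w in basic)
    return table.get(indexes)
```

Because the key already contains the unnecessary word's index, a phrase that names only the included set's words finds no match, which keeps it from being confused with the including set.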

The computer is further caused to execute the steps of:
Displaying on the display means, when converting each decomposed word into its corresponding logical index by referring to the basic word dictionary expanded in the memory, any word for which no logical index can be acquired; and
In response to information, input by the input means, indicating that the word displayed on the display means is an unnecessary word, writing the word into the basic word dictionary in association with a corresponding logical index, and writing into the name dictionary the logical index of the word designated as the unnecessary word, in association with the one piece of information specified from the set of words that remains when that word is excluded from the words included in the lexical phrase to be converted. The method according to claim 7.

The computer is further caused to execute, before executing the step of selecting, from among the memory-expanded sets of logical indexes that include a logical index of an unnecessary word, the set of logical indexes that matches the set of logical indexes corresponding to the decomposed words, the step of:
Selecting, from among the memory-expanded sets of logical indexes that do not include a logical index of an unnecessary word, a set of logical indexes that matches any set composed of a plurality of the logical indexes corresponding to the decomposed words and that consists only of the logical indexes of that set, and, by using the one piece of information specified by the selected set of logical indexes, converting the lexical phrase including the plurality of words into other data including the specified one piece of information. The method according to claim 7 or 8.

A method for a computer that converts a lexical phrase including a plurality of words into other data including one piece of information specified by the plurality of words, the computer being provided with a basic word dictionary in which each word is pre-registered in storage means in association with a logical index representing that word, a name dictionary in which a set of logical indexes corresponding to a set of words obtained by combining words registered in the basic word dictionary is pre-registered in storage means in association with the one piece of information specified by that set of words, and an unnecessary word dictionary in which, when sets of logical indexes are in an inclusion relationship, the logical index corresponding to an unnecessary word is pre-registered in storage means, for the included set of logical indexes, in association with the one piece of information, the unnecessary word being a word that is not included in the set of words corresponding to the included set of logical indexes, and is therefore unnecessary in the sense that it is not used when directly specifying the one piece of information, but that assists in finally specifying the one piece of information, the method causing the computer to execute the steps of:
Expanding the words registered in the basic word dictionary in a memory in association with the logical index representing each word, thereby generating a memory-expanded basic word dictionary;
Expanding the sets of logical indexes registered in the name dictionary in the memory in association with the specified one piece of information corresponding to each set of logical indexes, thereby generating a memory-expanded name dictionary;
Expanding the logical indexes of the unnecessary words registered in the unnecessary word dictionary in the memory in association with the one piece of information, thereby generating a memory-expanded unnecessary word dictionary;
Decomposing the lexical phrase to be converted into words;
Converting each decomposed word into its corresponding logical index by referring to the memory-expanded basic word dictionary;
Selecting, from among the memory-expanded sets of logical indexes, a set of logical indexes that matches any set of logical indexes formed from the logical indexes corresponding to the decomposed words, and temporarily acquiring the one piece of information specified by the selected set of logical indexes; and
When there is an inclusion relationship, if, in the memory-expanded unnecessary word dictionary, the logical index of the unnecessary word corresponding to the temporarily acquired one piece of information matches a logical index that corresponds to one of the decomposed words and was not used when temporarily acquiring the one piece of information, determining the temporarily acquired one piece of information to be the one piece of information to be specified, and, by using that one piece of information, converting the lexical phrase including the plurality of words into the other data including the specified one piece of information.

The computer is further caused to execute the steps of:
Displaying on the display means, when converting each decomposed word into its corresponding logical index by referring to the basic word dictionary expanded in the memory, any word for which no logical index can be acquired; and
In response to information, input by the input means, indicating that the word displayed on the display means is an unnecessary word, writing the word into the basic word dictionary in association with a corresponding logical index, and writing into the unnecessary word dictionary the logical index of the word designated as the unnecessary word, in association with the one piece of information specified from the set of words that remains when that word is excluded from the words included in the lexical phrase to be converted. The method according to claim 10.

The computer is further caused to execute, before executing the step of selecting, from among the memory-expanded sets of logical indexes, the set of logical indexes that matches any set of logical indexes formed from the logical indexes corresponding to the decomposed words and temporarily acquiring the one piece of information specified by the selected set of logical indexes, the step of:
Selecting, from among the memory-expanded sets of logical indexes, a set of logical indexes that matches any set of logical indexes formed from the logical indexes corresponding to the decomposed words and that consists only of the logical indexes of that set, and, by using the one piece of information specified by the selected set of logical indexes, converting the lexical phrase including the plurality of words into other data including the specified one piece of information. The method according to claim 10 or 11.

A program for a computer that converts a lexical phrase including a plurality of words into other data including one piece of information specified by the plurality of words, the computer being provided with a basic word dictionary in which each word is pre-registered in storage means in association with a logical index representing that word, and a name dictionary in which a set of logical indexes corresponding to a set of words obtained by combining words registered in the basic word dictionary is pre-registered in storage means in association with the one piece of information specified by that set of words and in which, when sets of logical indexes are in an inclusion relationship, the logical index of an unnecessary word is pre-registered, for the included set of logical indexes, in association with the one piece of information, the unnecessary word being a word that is not included in the set of words corresponding to the included set of logical indexes, and is therefore unnecessary in the sense that it is not used when directly specifying the one piece of information, but that assists in finally specifying the one piece of information, the program causing the computer to execute the steps of:
Expanding the words registered in the basic word dictionary in a memory in association with the logical index representing each word, thereby generating a memory-expanded basic word dictionary;
Expanding the sets of logical indexes and the logical indexes of unnecessary words registered in the name dictionary in the memory in association with the specified one piece of information corresponding to each set of logical indexes, thereby generating a memory-expanded name dictionary;
Decomposing the lexical phrase to be converted into words;
Converting each decomposed word into its corresponding logical index by referring to the memory-expanded basic word dictionary; and
When there is an inclusion relationship, selecting, from among the memory-expanded sets of logical indexes that include a logical index of an unnecessary word, a set of logical indexes that matches the set of logical indexes corresponding to the decomposed words, and, by using the one piece of information specified by the selected set of logical indexes, converting the lexical phrase including the plurality of words into the other data including the specified one piece of information.

The computer is further caused to execute the steps of:
Displaying on the display means, when converting each decomposed word into its corresponding logical index by referring to the basic word dictionary expanded in the memory, any word for which no logical index can be acquired; and
In response to information, input by the input means, indicating that the word displayed on the display means is an unnecessary word, writing the word into the basic word dictionary in association with a corresponding logical index, and writing into the name dictionary the logical index of the word designated as the unnecessary word, in association with the one piece of information specified from the set of words that remains when that word is excluded from the words included in the lexical phrase to be converted. The program according to claim 13.

The computer is further caused to execute, before executing the step of selecting, from among the memory-expanded sets of logical indexes that include a logical index of an unnecessary word, the set of logical indexes that matches the set of logical indexes corresponding to the decomposed words, the step of:
Selecting, from among the memory-expanded sets of logical indexes that do not include a logical index of an unnecessary word, a set of logical indexes that matches any set composed of a plurality of the logical indexes corresponding to the decomposed words and that consists only of the logical indexes of that set, and, by using the one piece of information specified by the selected set of logical indexes, converting the lexical phrase including the plurality of words into other data including the specified one piece of information. The program according to claim 13 or 14.

A program for a computer that converts a lexical phrase including a plurality of words into other data including one piece of information specified by the plurality of words, the computer being provided with a basic word dictionary in which each word is pre-registered in storage means in association with a logical index representing that word, a name dictionary in which a set of logical indexes corresponding to a set of words obtained by combining words registered in the basic word dictionary is pre-registered in storage means in association with the one piece of information specified by that set of words, and an unnecessary word dictionary in which, when sets of logical indexes are in an inclusion relationship, the logical index corresponding to an unnecessary word is pre-registered in storage means, for the included set of logical indexes, in association with the one piece of information, the unnecessary word being a word that is not included in the set of words corresponding to the included set of logical indexes, and is therefore unnecessary in the sense that it is not used when directly specifying the one piece of information, but that assists in finally specifying the one piece of information, the program causing the computer to execute the steps of:
Expanding the words registered in the basic word dictionary in a memory in association with the logical index representing each word, thereby generating a memory-expanded basic word dictionary;
Expanding the sets of logical indexes registered in the name dictionary in the memory in association with the specified one piece of information corresponding to each set of logical indexes, thereby generating a memory-expanded name dictionary;
Expanding the logical index corresponding to each unnecessary word registered in the unnecessary word dictionary in the memory in association with the one piece of information corresponding to that logical index, thereby generating a memory-expanded unnecessary word dictionary;
Decomposing the lexical phrase to be converted into words;
Converting each decomposed word into its corresponding logical index by referring to the memory-expanded basic word dictionary;
Selecting, from among the memory-expanded sets of logical indexes, a set of logical indexes that matches any set of logical indexes formed from the logical indexes corresponding to the decomposed words, and temporarily acquiring the one piece of information specified by the selected set of logical indexes; and
When there is an inclusion relationship, if, in the memory-expanded unnecessary word dictionary, the logical index of the unnecessary word corresponding to the temporarily acquired one piece of information matches a logical index that corresponds to one of the decomposed words and was not used when temporarily acquiring the one piece of information, determining the temporarily acquired one piece of information to be the one piece of information to be specified, and, by using that one piece of information, converting the lexical phrase including the plurality of words into the other data including the specified one piece of information.

The computer is further caused to execute the steps of:
Displaying on the display means, when converting each decomposed word into its corresponding logical index by referring to the basic word dictionary expanded in the memory, any word for which no logical index can be acquired; and
In response to information, input by the input means, indicating that the word displayed on the display means is an unnecessary word, writing the word into the basic word dictionary in association with a corresponding logical index, and writing into the unnecessary word dictionary the logical index of the word designated as the unnecessary word, in association with the one piece of information specified from the set of words that remains when that word is excluded from the words included in the lexical phrase to be converted. The program according to claim 16.

The computer is further caused to execute, before executing the step of selecting, from among the memory-expanded sets of logical indexes, the set of logical indexes that matches any set of logical indexes formed from the logical indexes corresponding to the decomposed words and temporarily acquiring the one piece of information specified by the selected set of logical indexes, the step of:
Selecting, from among the memory-expanded sets of logical indexes, a set of logical indexes that matches any set composed of a plurality of the logical indexes corresponding to the decomposed words and that consists only of the logical indexes of that set, and, by using the one piece of information specified by the selected set of logical indexes, converting the lexical phrase including the plurality of words into other data including the specified one piece of information. The program according to claim 16 or 17.