JPS6049932B2

JPS6049932B2 - Japanese information processing method

Info

Publication number: JPS6049932B2
Application number: JP52130656A
Authority: JP
Inventors: 正裕遠山; 護菅原; 英一増田; 博宮部; 秀雄今田
Original assignee: Fujitsu Ltd; Nippon Telegraph and Telephone Corp
Current assignee: Fujitsu Ltd; Nippon Telegraph and Telephone Corp
Priority date: 1977-10-31
Filing date: 1977-10-31
Publication date: 1985-11-06
Also published as: JPS5464446A

Description

【発明の詳細な説明】本発明は日本語情報処理方式、さらに詳しく言えば長
音が長音記号とア行文字と混同して使用さＪれるような
表音文字列を取扱う日本語情報処理方式に関する。DETAILED DESCRIPTION OF THE INVENTION The present invention relates to a Japanese language information processing system, and more specifically, to a Japanese language information processing system that handles phonetic character strings in which long sounds are used confusingly with long sounds and A-line characters. .

情報化社会の進展に伴い、多種多様の検索システムが
出現している。With the progress of the information society, a wide variety of search systems have appeared.

特に最近はコンピュータの市民化と相挨つて、一般の人
が検索情報を登録したり検索したりする検索システムが
出現しつつある。ただしこの場合一般の人が直接オペレ
ーションすることを意味しない。この代表的システムと
してコンピュータ化された電話番号案内システムがある
。Particularly recently, as computers have become more popular, search systems are emerging that allow ordinary people to register and search for search information. However, in this case, it does not mean that the general public will operate it directly. A typical example of this system is a computerized telephone directory assistance system.

すなわち、該システムの被検索情報は電話所有者からの
登録情報であり、検索情報は電話番号問合せ者からの申
し出情報である。これらの被検索および検索のキー情報
として、かな読み名義が使用される。この場合、日本語
の表音文字化ルールの曖味に問題がある。すなわち表意
文字を表音文字化したとき、新旧かなづかい、同音異表
記等のため同一の名義に対して複数個の異る表現が慣用
されている。この一つに長音の表音文字化の問題がある
。長音の表記方法の原則は１現代かなづかいョにおいて
定められている。１現代かなづかいョは現代語音に基づ
いて現代語をかなで書き表わす楊合の準則を示したもの
で、主として口語文に適用する。That is, the information to be searched in this system is registered information from the telephone owner, and the search information is the offer information from the telephone number inquirer. The kana-yomi name is used as key information for these search targets and searches. In this case, the problem lies in the ambiguity of the Japanese phoneticization rules. That is, when ideograms are converted into phonetic characters, multiple different expressions are commonly used for the same name due to old and new kana usage, homophones, etc. One of these is the problem of converting long sounds into phonetic characters. The principles of how to write long sounds are established in 1 modern kanazukaiyo. 1.Modern Kanazukayo is a set of rules for expressing modern words in kana based on modern speech sounds, and is mainly applied to colloquial sentences.

この１現代かなづかいョによる長音はそれぞれのかなで
書わすことになつている。ただし０オ．列の長音だけは
１ウョで書くことを本則としている。例えば次の通りて
ある。お母さん；おりアさん兄さん；ニイさん夕方；ユウガタ姉さん；ネエさんお父さん；おトウさん（角川国語辞典改訂１０１版より引用）しかしな
がら、上記本則に従わないものや発音と表記文字が一致
しないものがある。The long sounds in this first modern kana are written in their own kana. However, 0 o. The basic rule is to write only the long sounds in a row in one wo. For example, it is as follows. Mother: Oria-san Brother: Nii-san Evening: Yugata Older sister: Nee-san Father: Oto-san (Quoted from Kadokawa Japanese Dictionary Revised 101st Edition) However, there are cases where the above rules are not followed or the pronunciation and written characters do not match. be.

例えば次の通りである。大きい；オオきい遠い；トオいＡＡ；メイレイ（ただし発音は工列長音メロＰ］コエレエ）口答：コウトウ（ただし発音はオ列長音コオトオ）
従つて長音を表音文字化するには、慣用的に長・音記号
で表記するものと１アョ行文字で表記するものとが使用
されている。For example: big; big distance; tooi AA; mei-rei (pronounced as ko-retsu long sound mero-P] koele-e) Oral response: koutou (however, pronounced as ko-retsu long sound ko-o-to-o)
Therefore, in order to convert long sounds into phonetic characters, it is customary to write them using the long sound symbol and to write them using the 1-ayo character.

例えば１空気ョは０クーキョとか０クウキョとかに表記
されるまた、。For example, 1 air is written as 0 kuukyo or 0 kuukyo.

エョ列および１オョ列の長音を表音文字化するには、長
音記号で表記するもの以外に２種類の１アョ行文字で表
記することも行なわれている。例えば１命令ョは１メー
レー／メエレエョあるいは１メイレイョとなり、また、
１口答ョはコートーョ、１コオトオョあるいは１コウト
ウョとなる。この様に、長音の表記方法は例外も多く発
音と異なる表記も採られているため、正確に表記するこ
とは難しく、従つて慣用的に各種表記方法が許）されて
いる。In order to convert the long sounds in the Eyo and 1-O rows into phonetic characters, in addition to using the long sound symbol, two types of 1-O row characters are also used. For example, 1 orderyo becomes 1 mere/meereeyo or 1 meireyo, and
One answer is kotoyo, 1 kootooyo, or 1 koutoyo. In this way, there are many exceptions to how long sounds are written, and some spellings differ from the pronunciation, so it is difficult to write them accurately, so various ways of writing are customarily allowed.

従つて、電話番号案内シスデムにおいて、被検索情報と
して登録されている電話所有者からのかな読み名義（登
録情報）と電話番号問合せ者からの申し出を表音文字化
したかな読み名義（申し出：情報）とが一致しないこと
が多々ある。Therefore, in the telephone number guidance system, the kana reading name (registered information) from the phone owner registered as searched information and the kana reading name (offer: information) that is converted into phonetic characters from the offer from the phone number inquirer. ) often do not match.

この時には、これ等の情報をそのまま被検索キー情報お
よび検索キー情報としたのでは、両者が一致せず検索解
が得られないことになる。この様な不都合をなくすため
には被検索情報と〔検索情報とを一致させる様な規則を
設ければよいわけであるが人名、屋号等登録情報が加入
者個人にかかわる問題であるため強制的に一義に統一す
るということは難しい。At this time, if these pieces of information are used as the searched key information and the search key information, the two will not match and a search solution will not be obtained. In order to eliminate this kind of inconvenience, it would be possible to establish a rule that matches the searched information with the searched information, but since registered information such as people's names and business names are related to individual subscribers, it is not mandatory. It is difficult to unify them in a single sense.

一般には、このような場合の対策として、（１）被検索
情報を、同一加入者に対してその名義の各種類のかな読
みに対応する複数個を用意することにより、あるいは（
２）検索する時に、検索キーを、その名義の各種類のか
な読みに対応する複数個作成することにより、一致解（
検索解）を得る方法を取つている。Generally, as a countermeasure for such cases, (1) prepare multiple pieces of searched information for the same subscriber corresponding to each type of kana reading of the name;
2) When searching, by creating multiple search keys corresponding to each type of kana reading of the name, matching solutions (
We are working on a method to obtain a search solution.

例えば、加入者名義１佐藤クリーニングョに対しては、
１サトウクリーニング２サトオクリーニング３サトーク
リーニング４サトウクリイニング５サトオクリイニング
６サトークリイニングの６通りのかな読みがあるが、上
記（１）の被検索情報を複数個用意する場合は上記１〜
６の６通りのかな読み名義をもつ被検索情報を登録する
ことになり、また上記（２）の検索情報を複数個作成す
る場合には、１〜６の６種類の検索キーを作成すること
になる。For example, for subscriber name 1 Sato Cleanyo,
There are six kana readings: 1 Sato Cleaning 2 Sato Cleaning 3 Sato Cleaning 4 Sato Cleaning 5 Sato Cleaning 6 Sato Cleaning, but if you prepare multiple pieces of searched information in (1) above, use 1 to 1 above.
You will need to register searched information with six different kana pronunciations (6), and if you create multiple pieces of search information (2) above, you will need to create six types of search keys (1 to 6). become.

しかし、上記（１）の場合は被検索情報を収容するファ
イルの容量が増加する。However, in the case of (1) above, the capacity of the file accommodating the searched information increases.

また上記（２）の場合は検索操作に手数がかかりあるい
は検索に長時間を要することとなる、すなわち、オペレ
ータが多数の検索キーを作成し投入しなければならない
のでその操作手数がかかり、あるいは自動的に検索キー
を作成して検索するにしても検索時間が作成した検索キ
ーの個数倍必要となる。すなわち、従来方式はファイル
容量が増すとか検索に手数を要す一るとかの欠点がある
。本発明は上記の欠点を除去し、被検索情報を収容する
ファイルの容量を増加させずまた検索時に検索キー件数
を増すことなく短時間に検索を終了させることが可能な
、上記の長音を有する被検索情報の検索を実行させるた
めの日本語情報処理方式を提供することを目的とするも
のである。In the case of (2) above, the search operation is troublesome or takes a long time. In other words, the operator has to create and input a large number of search keys, which takes time and effort, or Even if you create a search key and perform a search, the search time will be twice as long as the number of created search keys. That is, the conventional method has drawbacks such as an increase in file capacity and a time-consuming search. The present invention eliminates the above drawbacks and has the above-mentioned long sound, which makes it possible to complete the search in a short time without increasing the capacity of the file containing the searched information or increasing the number of search keys during the search. The purpose of this invention is to provide a Japanese information processing method for executing a search for searched information.

この目的は、本発明によれば、長音が長音記号と１アョ
行文字と混合して使用されるような表音文字列を取り扱
う日本語情報処理システムにおいて、表音文字列中より
長音記号を検出する手段と、該長音記号を該長音記号の
前の文字の母音の種別により定められた１アョ行文字に
変換する長音変換手段と、変換された文字列をキー情報
としてデータを記憶する手段と、与えられた検索文字５
列を上記変換手段により検索用キー情報としこのキー情
報により検索を行なう装置とを具備する日本語情報処理
方式によつて達せられる。次に本発明の実施例を図面に
ついて説明する。According to the present invention, in a Japanese information processing system that handles phonetic character strings in which long sounds are used in combination with long sound symbols and 1-ayo characters, the long sound symbols are selected from among the phonetic character strings. a means for detecting the long sound symbol, a long sound conversion means for converting the long sound symbol into a one-ayo line character determined by the type of vowel of the character before the long sound symbol, and a means for storing data using the converted character string as key information. and the given search character 5
This is achieved by a Japanese information processing method that includes a device that uses the column as search key information by the conversion means and performs a search using this key information. Next, embodiments of the present invention will be described with reference to the drawings.

第１図は本発明を電話番号案内システムに実施一した例
のブロック図であつて、図において、１は申し出情報を
入力するためのかな文字列入力装置、２は入力装置１か
ら入力するかな文字列中の長音を矯正して一定パターン
のキー情報とする長音矯正装置、３は多数の被検索情報
を記憶しているファイル記憶装置、４は長音矯正装置２
から与えられるキー情報でファイル記憶装置３から被検
索情報を検索する被検索制御装置、また５は検索結果を
出力する装置てある。なお、長音矯正装置２における２
１はシーケンス制御装置、２２は入力情報レジスタ、２
３は文字分類装置、２４は前母音抽出装置、２５は長母
音変換装置、２６は母音変換装置、２７は出力情報レジ
スタ、２８は長音変換表、２９は母音変換表である。FIG. 1 is a block diagram of an example in which the present invention is implemented in a telephone directory assistance system. In the figure, 1 is a character string input device for inputting offer information, and 2 is a character string input device for inputting offer information. A long sound correction device corrects long sounds in a character string to produce a certain pattern of key information; 3 is a file storage device that stores a large amount of information to be searched; 4 is a long sound correction device 2
Reference numeral 5 denotes a control device to be searched for searching information to be searched from the file storage device 3 using key information given from the key information given from the key information given from the key information given from the key information given from the key information given from the search target control device 5. In addition, 2 in the long sound correction device 2
1 is a sequence control device, 22 is an input information register, 2
3 is a character classification device, 24 is a front vowel extraction device, 25 is a long vowel conversion device, 26 is a vowel conversion device, 27 is an output information register, 28 is a long vowel conversion table, and 29 is a vowel conversion table.

第２図は第１図の実施例のファイル記憶装置３に収容さ
れるファイルの一例の一部分を示すものであつて、Ｋは
キー部で被検索キー情報が収容され、Ｄはデータ部であ
つて被検索情報が収容されている。FIG. 2 shows a part of an example of a file stored in the file storage device 3 of the embodiment shown in FIG. The information to be searched is stored.

上記キー情報は加入者名義（人名、商号・・・等）から
本発明によソー定の規準でその長音を矯正したかなコー
ドとされ、ファイルは上記かなコードの５暗正順で作成
されている。The above key information is converted into a kana code based on the subscriber's name (person's name, trade name, etc.) with long sounds corrected according to the standards set by the present invention, and the file is created in the 5-dark and positive order of the above-mentioned kana code. There is.

加入者名義からかなコード化されたキー情報は次の規準
に従つて作成されている。The key information encoded in kana from the subscriber's name is created according to the following criteria.

（１）長音記号１−ョはその前の文字の母音の種別から
、第３図ｂに示す母音変換表に従つて変換して設定する
。(1) The long sound symbol 1-yo is set by converting the vowel type of the preceding character according to the vowel conversion table shown in FIG. 3b.

第３図ｂにおいてＩは変換前の母音の種別を、■は変換
後の母音を示すものであつて、例えば、１力Ｊｒケョ等
の母音種別１アＪｒ−Ｃ．ョを有する文字に長音記号１
−ョが付された場合はすなわち１カー／ケーョはそれぞ
れ１カア／ケイョに変換される。例えば、１サンケー設
備はサンケイセツビに、２三幸ガーデンはサンコウガア
デンに、３サンコー商工はサンコウシヨウコウに変換さ
れているものとする。In Fig. 3b, I indicates the type of vowel before conversion, and ■ indicates the vowel after conversion. long symbol 1 for characters with
If -yo is added, that is, 1 car/keyo is converted to 1 car/keyo. For example, it is assumed that 1 Sankei Equipment has been converted to Sankei Setsubi, 2 Sanko Garden has been converted to Sankoga Aden, and 3 Sanko Shoko has been converted to Sankou Shiyoukou.

（２）１エョ列および１オョ列の文字に続く１エョおよ
び１オョがある場合、上記１エョおよび０オョはそれぞ
れ１イョおよび１ウョに変換されて設定される。(2) If there is a 1 yo and 1 yo following the 1 yo string and 1 yo string, the 1 yo and 0 yo are converted and set as 1 yo and 1 yo, respectively.

例えばサンコオ電気はサンコウデンキとなる。第２図に
示すファイルの被検索キー情報は上記規準に従つて作成
されたかなコードが収容されている。For example, Sanko Denki becomes Sanko Denki. The key information to be searched in the file shown in FIG. 2 contains a kana code created in accordance with the above criteria.

次に第１図の実施例の動作について説明する。Next, the operation of the embodiment shown in FIG. 1 will be explained.

いま、問合せ者より問合せ情報１三光商エョが口頭（電
話）で番号案内扱者へ伝えられたとする。扱者は上記問
合せ情報を聞いて必ずしも規則によらず慣用に従つて従
えば１サンコオシヨウコーョと表音文字化して、入力装
置１に入力する。Now, assume that the inquirer verbally (over the telephone) conveys the inquiry information 1, ``Sankosho'', to the directory assistance operator. The operator listens to the above-mentioned inquiry information, converts it into phonetic characters, and inputs it into the input device 1, not necessarily according to rules but according to customary practice.

そうすると入力があつたことを入力装置１から長音矯正
装置２へ通知し、該入力情報は入力情報レジスタ２２に
転送蓄積される。該レジスタ２２はシーケンス制御装置
２１に起動される毎に、該入力情報の先頭より順次に１
文字ずつ抽出し、文字分類装置２３へ渡す。Then, the input device 1 notifies the long sound correction device 2 that there has been an input, and the input information is transferred and stored in the input information register 22. Each time the register 22 is activated by the sequence control device 21, the register 22 sequentially registers 1 from the beginning of the input information.
Each character is extracted and sent to the character classification device 23.

この場合、まづ第１文字１サョを抽出して文字分類装置
２３へ渡す。文字分類装置２３は抽出した文字が第１文
字の場合は該文字をそのまま出力情報レジスタ２７へ渡
して蓄積する。In this case, first, the first character is extracted and passed to the character classification device 23. If the extracted character is the first character, the character classification device 23 passes the extracted character as is to the output information register 27 for storage.

この場合、５サョが出力情報レジスタ２７に蓄積される
。第２の文字以降の場合は、文字分類装置２３において
抽出された文字を１１エョまたは１オョか２長音記号０
−ョか、３その他のいずれかに分類する。In this case, 5 days are accumulated in the output information register 27. In the case of the second and subsequent characters, the characters extracted by the character classification device 23 are
- Classify as either 3 or 3.

第２，第３の文字は１ンョ１コョであり、上記３の場合
である。The second and third characters are 1-cho 1-ko, which is the case in 3 above.

この場合は、第１文字の場合と同様な処理が繰返され、
出力情報レジスタ２７は受信した文字を先頭より順に１
文字ずつ蓄積し、ここに１サンコョが蓄積される。なお
、第４文字は１オョであり、上記１の場合である。In this case, the same process as for the first character is repeated,
The output information register 27 stores the received characters as 1 in order from the beginning.
It accumulates letters one by one, and one sancho is accumulated here. Note that the fourth character is 1 o, which is the case of 1 above.

上記１の場合は、抽出された文字が１エョまたは１オョ
であつて、該文字は前母音抽出装置２４へ渡される。そ
うすると前母音抽装置２４は、受信した文字の前文字を
入力情報レジスタ２２から抽出し、この抽出した前文字
（この場合第３文字の１コＪ）を長音変換表２８を使用
して公知の方法により文字変換を行なう。すなわち、長
音変換表２８は第３図ａに示す変換表を含み、抽出され
た前文字（１コＪ）によりその入力側１をサーチし、一
致しているデータの変換側■の文字０オョを抽出し、こ
れを前母音抽出装置２４へ返す。この場合、前母音抽出
装置２４にて受信した文字−と、長音変換表２８て変換
された文字とが一致した場合、すなわち、いずれも１エ
ョまたは１オョである場合は該文字を母音変換装置２６
へ送り、一致しない場合は、前母音抽出装置２４で受信
した文字１エョまたは１オョを出力情報レジスタ２７に
渡す。従つてこの場合５オョが母音変換装置２６に渡さ
れる。なお例えば長音ではない５コエョが抽出されたと
仮定すれば、前母音抽出装置２４で受信した０エョと長
音変換表２８で変換された出力の１オョとは一致しない
ので、前者の文ｊ字１エョがそのまま出力情報レジスタ
２７に渡され蓄積されることとなり、長音変換は行われ
ない。母音変換装置２６は受信した文字１エョまたは１
オョを、母音変換表２９を使用して公知の方法クにより
文字変換を行なう。In case 1 above, the extracted character is 1 yo or 1 yo, and the character is passed to the front vowel extraction device 24. Then, the front vowel extraction device 24 extracts the front character of the received character from the input information register 22, and converts the extracted front character (in this case, the third character 1 J) into a known long vowel conversion table 28. Perform character conversion using this method. That is, the long sound conversion table 28 includes the conversion table shown in FIG. is extracted and returned to the front vowel extraction device 24. In this case, if the character - received by the front vowel extraction device 24 and the character converted by the long sound conversion table 28 match, that is, if they are both 1 eo or 1 yo, the character is transferred to the vowel conversion device. 26
If they do not match, the character 1 yo or 1 yo received by the front vowel extraction device 24 is passed to the output information register 27 . Therefore, in this case, 5 yo is passed to the vowel conversion device 26. For example, if it is assumed that 5 ko yo, which is not a long sound, is extracted, the 0 yo received by the front vowel extraction device 24 and the 1 oh of the output converted by the long sound conversion table 28 do not match, so the former character j character 1 Eo is passed as is to the output information register 27 and stored, and no long sound conversion is performed. The vowel conversion device 26 converts the received character 1e or 1
The vowel conversion table 29 is used to perform character conversion using a known method.

すなわち、母音変換表２９は第３図ｂに示す変換表を含
み、文字１エョまたは１オョによりその入力側１をサー
チし、一致しているデータの変換側■の文字１イョまた
は０ウョを母音変換装置２６に返す。母音変換装置２６
はこの返された文字１イョまたは１ウョを出力情報レジ
スタ２７に渡す。この場合、母音変換装置２６は１オョ
を受信するので１ウョを出力情報レジスタ２７に渡すこ
ととなり、出力情報レジスタ２７には１サンコウョが蓄
積される。上記過程において、入力情報レジスタ２２か
ら抽出される文字力げエョで、かつこの前の文字の母音
が１エョである場合、あるいは入力情報レジフスタ２２
から抽出される文字が１オョでありかつこの前の文字の
母音力げオョである場合は上記の処理が繰返し行なわれ
る。That is, the vowel conversion table 29 includes the conversion table shown in FIG. It is returned to the vowel conversion device 26. Vowel conversion device 26
passes this returned character 1yo or 1yo to the output information register 27. In this case, since the vowel conversion device 26 receives 1 oyo, it passes the 1 oyo to the output information register 27, and the output information register 27 accumulates 1 oyo. In the above process, if the character extracted from the input information register 22 is ``Yo'' and the vowel of the previous character is 1 ``Eo'', or the input information register 22
If the character extracted from is 1 oyo and the previous character has a strong vowel, the above process is repeated.

また、入力情報レジスタ２２から抽出される文字が１エ
ョでかつこの前の文字の母音が１エョでない場合あるい
は入力情・報レジスタ２２から抽出される文字が１オョ
であり、かつこの前の文字の母音が１オョでない場合も
、このような場合について上述した処理が繰返し行なわ
れる。上述の処理が進行し、出力情報レジスタ２７に゛
０サンコウシヨウコョが蓄積されたとする。次に入力情
報レジスタ２２より長音記号１−ョが文字分類装置２３
へ渡される。これは前記２の場合であつて、文字分類装
置２３において分類され、長音記号１−ョが長母音変換
装置２５に渡される。長母音変換装置２５は受信した長
音記号１−ョの前文字を入力情報レジスタ２２から抽出
し、この抽出した文字を長音変換表２８に送り、この長
音変換表２８に含まれる変換表〔第３図ａ〕に従つて、
前記前母音抽出装置２４から行なつた２と同様の文字変
換を行なう。Also, if the character extracted from the input information register 22 is 1 yo and the vowel of the previous character is not 1 yo, or the character extracted from the input information/information register 22 is 1 yo and the previous character Even if the number of vowels is not 1yo, the process described above for such a case is repeated. Assume that the above-described processing progresses and that "0" is accumulated in the output information register 27. Next, the long sound symbol 1-yo is input to the character classification device 22 from the input information register 22.
passed to. This is case 2 above, which is classified by the character classification device 23 and the long vowel symbol 1-yo is passed to the long vowel conversion device 25. The long vowel conversion device 25 extracts the first character of the received long vowel symbol 1-yo from the input information register 22, sends the extracted character to the long vowel conversion table 28, and converts it to the conversion table [third] included in this long vowel conversion table 28. According to figure a],
Character conversion similar to step 2 performed by the front vowel extraction device 24 is performed.

この変換された文字がゝアョ７イョ１ウョの場合には、
これは出力情報レジスタ２７に渡され、１エョ１オョの
場合には、この変換された文字は前母音抽出装置２４に
渡される。この場合、長音変換装置２５は入力情報レジ
スタ２２から抽出した長音記号１−ョの前文字１コョを
第３図ａのように設定されている変換表より、入力側１
（７）ｒコ．に対応する変換側■の０オョの変換文字を
得てこれを前母音抽出装置２４に渡す。If this converted character is ゝ 7 yo 1 yo,
This is passed to the output information register 27, and in the case of 1 yo 1 yo, this converted character is passed to the front vowel extractor 24. In this case, the long sound conversion device 25 converts the first character of the long sound symbol 1-jo extracted from the input information register 22 into the input side 1 from the conversion table set as shown in FIG.
(7) rco. The converted character of 0 on the conversion side (■) corresponding to is obtained and passed to the front vowel extraction device 24.

前母音抽出装置２４は前記２の場合と全く同様に動作し
、この渡された１オョの前文字１コョを再び長音変換表
２８に送り変換文字として１オョを得、長母音変換装置
２５から既に送られた文字１オョと照合し、一致してい
るので、これを母音変換装置２６へ送り、既に説明した
と同一の動作により、これを母音変換表２９によつて１
ウョに変換して出力情報レジスタ２７に渡す。The front vowel extraction device 24 operates in exactly the same manner as in case 2 above, and sends the first character of the passed 1 o to the long vowel conversion table 28 again to obtain 1 o as a converted character, and extracts it from the long vowel converter 25. It is compared with the already sent character 1o, and since it matches, it is sent to the vowel conversion device 26, and in the same operation as already explained, it is converted into 1 by the vowel conversion table 29.
The output information is converted into a file and passed to the output information register 27.

この場合、これによつて１サンコウシヨウコウョが出力
情報レジスタ２７に蓄積されたことになる。In this case, this means that 1 count has been accumulated in the output information register 27.

上記において入力情報レジスタ２２に蓄積されている情
報がすべて抽出されると、シーケンス制御装置２１は出
力情報レジスタ２７に蓄積されている内容を検索制御装
置４へ通知する。When all the information stored in the input information register 22 is extracted in the above, the sequence control device 21 notifies the search control device 4 of the contents stored in the output information register 27.

検索制御装置４は公置の手法でファイル記憶装置３を、
通知された情報σサンコウシヨウコウョ）を検索キー情
報として検索し、これと一致した被検索情報を抽出する
。The search control device 4 uses a public method to access the file storage device 3,
The notified information σ) is searched as search key information, and searched information that matches this is extracted.

この場合、検索キー情報１サンコウシヨウコウョにより
、第２図のように設定されているファイル記憶装置３か
ら、これと一致した被検索キー情報を求め、この結果被
検索情報として１サンコー商工、新宿区．３６４−２４
６８・・・ョと三光商工、中野区．３８１−１３５７・
―とを抽出する。出力装置５は検索制御装置４によつて
抽出された被検索情報を出力する。In this case, search key information that matches this is obtained from the file storage device 3 set as shown in Fig. 2 using the search key information 1. Shinjuku ward. 364-24
68... and Sanko Shoko, Nakano Ward. 381-1357・
- Extract. The output device 5 outputs the searched information extracted by the search control device 4.

これによつて問合せ者から申し出のあつた１三光商エョ
が検索できたことになる。上述の検索操作において、番
号案内扱者は問合せ者より申し出のあつた情報より１サ
ンコオシヨウコーョ表音文字化した例を示したが、問合
せ者からの情報の聞き取りにより、発音は同一であるが
異る表音文字列例えば１サンコーシヨーコーョＪサンコ
ウシヨオコウョ・・・等の何れか一つを入力させれば、
上記説明から容易に判明するように長音矯正装置２によ
つて一義的に上記１サンコウシヨウコウョに矯正され、
上記と同様に、同一の被検索情報が検索される。As a result, it was possible to search for the 1st Sankoshou that was requested by the inquirer. In the above search operation, the directory assistance operator gave an example of converting the information provided by the inquirer into phonetic characters, but after listening to the information from the inquirer, the pronunciation was the same. If you input one of the different phonetic character strings, such as 1 sanko shi yo ko cho J san ko shi yo ko yo... etc.,
As is easily clear from the above explanation, the long sound is uniquely corrected to the above-mentioned 1.
Similar to the above, the same searched information is searched.

なお、上記実施例においては０工．列および０オ，列の
文字に続く１エョおよび１オョをそれぞれ１イョおよび
１ウョに変換したが、これと逆に１エョ列および１オョ
列の文字にそれぞれ続く１イョおよび１ウョをそれぞれ
１エョおよび１オョに変換しても同効てある。In addition, in the above example, 0 engineering. 1 yo and 1 yo following characters in column and 0 o, column were converted to 1 yo and 1 yo, respectively, but conversely, 1 yo and 1 yo following characters in 1 yo column and 1 yo column, respectively, were converted to 1 yo and 1 yo, respectively. The same effect is obtained even if it is converted to 1 o and 1 o.

また、本実施例においては１エョ列Ｊオョ列の文字にそ
れぞれ続く１エョおよび１オョをそれぞれ７イョおよび
１ウョに変換することを行なつたが、情報を手動的に入
力する際、１エョ列文字に続く１エョは１イョとしてま
た１オョ列文字に続く０オョは１ウョとする、すなわち
、例えば１エエョは１エイョに、１オオョは０オウョと
することにすれば、長音矯正装置２において上記の１エ
ョ→１イョ→，ｒオョ→１ウョの変換を行なわず、長音
記号に対する変換のみでも十分な効果が得られる。In addition, in this embodiment, 1 yo and 1 yo following the characters in the 1 yo column and J yo column, respectively, were converted to 7 yo and 1 yo, respectively, but when inputting information manually, 1 1 eyō following an eyō character is treated as 1 yo, and 0 ō following a 1 yo character is treated as 1 yo.For example, if 1 eyō is set as 1 eyō, and 1 ōyo is 0 ōyo, the long sound correction is corrected. In the device 2, sufficient effects can be obtained by only converting the long sound symbol without performing the above-mentioned conversion of 1 yo → 1 yo →, r yo → 1 yo.

上記実施例においては、検索キー情報の作成について説
明したが、被検索キー情報の作成に対しても本発明を適
用することが可能なことは言うまでもない。In the above embodiment, the creation of search key information has been described, but it goes without saying that the present invention can also be applied to the creation of searched key information.

以上説明したように、本発明によれば、聞き取り情報ま
たは表意文字情報を表音文字（かな文字）化したとき、
長音が長音記号とア行文字とが混同して使用されるよう
な表音文字列を取扱う日本語情報処理システムにおいて
、ファイル容量を増すことなく、あるいは人手による操
作を増すことなく、簡単にかつ確実に長音の表音文字化
の曖味さを矯正することが可能な効果がある。As explained above, according to the present invention, when listening information or ideographic information is converted into phonetic characters (kana characters),
In a Japanese information processing system that handles phonetic character strings in which long sounds are mixed with long sounds and A-line characters, it is easy to This has the effect of reliably correcting ambiguity in phonetic transcription of long sounds.

本発明を例えば電話番号案内システムに適用すれは問合
せ者からの申し出情報を入力情報として表音文字化する
とき長音の表記が長音記号であつても５アョ行文字であ
つても１個のキー情報の作成と１個の検索操作によソー
義的に同一の検索キー情報が得られ、これにより正しい
検索結果を得ることが可能な効果がある。For example, when the present invention is applied to a telephone directory assistance system, when converting offer information from an inquirer into phonograms as input information, one key is required regardless of whether the long sound is expressed as a long sound symbol or as a 5-line character. By creating information and performing one search operation, the same search key information can be obtained, which has the effect of making it possible to obtain correct search results.

[Brief explanation of the drawing]

〔第１図は本発明の一実施例のブロック図、第２図は
本発明において使用するファイル記憶装置の内容の一部
を示す図、第３図ａは第１図の実施例の長音変換表２８
に含まれる変換の内容の説明図、ｂは同じく母音変換表
２９に含まれる変換の・内容の説明図である。１・・・入力装置、２・・・長音矯正装置、３・・・フ
ァイル記憶装置、４・・・検索制御装置、５・・・出力
装置、２１・・・シーケンス制御装置、２２・・・入力
情報レジスタ、２３・・・文字分類装置、２４・・・前
母音抽出装）置、２５・・・長母音変換装置、２６・・
・母音変換装置、２７・・・出力情報レジスタ、２８・
・・長音変換表、２９・・・母音変換表、Ｋ・・・被検
索キー情報部、Ｄ・・・被検索情報部。[Figure 1 is a block diagram of an embodiment of the present invention, Figure 2 is a diagram showing part of the contents of a file storage device used in the present invention, and Figure 3a is a long note conversion of the embodiment of Figure 1. Table 28
b is an explanatory diagram of the contents of the conversions included in the vowel conversion table 29. DESCRIPTION OF SYMBOLS 1... Input device, 2... Long sound correction device, 3... File storage device, 4... Search control device, 5... Output device, 21... Sequence control device, 22... Input information register, 23...Character classification device, 24...Front vowel extraction device) device, 25...Long vowel conversion device, 26...
・Vowel converter, 27... Output information register, 28・
...Long sound conversion table, 29...Vowel conversion table, K...Searched key information section, D...Searched information section.

Claims

[Claims] 1. In a Japanese information processing system that handles phonetic character strings in which long sounds are used in combination with long sound symbols and "a" line characters, long sound symbols are detected from phonetic character strings. means, a long sound conversion means for converting the long sound symbol into an "A" line character predetermined according to the vowel type of the character before the long sound symbol, and means for storing data using the converted character string as key information. and a device for converting a given search character string into search key information using the conversion means and performing a data search using this key information. 2. A means for detecting a long sound symbol from a phonetic character string in a Japanese information processing system that handles a phonetic character string in which a long sound is used confusingly with a long sound symbol and an "A" line character, and a means for detecting the long sound symbol from a phonetic character string. a long sound conversion means for converting the character into a predetermined "a" line character according to the vowel type of the character before the long sound symbol, and further the character "E" and the character "E" following the character in the "E" row in the phonetic character string Means for detecting the characters "o" and "u" following the characters in the "i" and "o" strings, vowel conversion means for converting the detected characters into predetermined vowel characters, respectively; and a device for storing data using a given character string as key information, and a device for using a given search character string as search key information by the converting means and performing a data search using this key information. Word information processing method. 3 The vowel converting means converts the character "e" following the character of the "e" string detected from the phonetic character string into "i" and "e".
3. The Japanese language information processing method according to claim 2, wherein the character "o" following a character in the "o" string is converted to "u". 4. The vowel converting means converts the character "i" following the "e" string detected from the phonetic character string into "e" and the character "u" following the "o" string into "e". 3. The Japanese language information processing method according to claim 2, wherein the Japanese language information processing method is converted into "E".