JP2011210001A - Information processing apparatus, conversion candidate extraction method, and program

Info

Publication number: JP2011210001A
Application number: JP2010076945A
Authority: JP
Inventors: Kazuya Kudo; Miki Yoshitake; Tatsuya Iyoda
Original assignee: Yahoo Japan Corp
Current assignee: Yahoo Japan Corp
Priority date: 2010-03-30
Filing date: 2010-03-30
Publication date: 2011-10-20
Anticipated expiration: 2030-03-30
Also published as: JP5094902B2

Abstract

PROBLEM TO BE SOLVED: To output conversion candidates for input in the Roman alphabet without increasing the scale of an index.
SOLUTION: An information processing apparatus 100, which obtains an input Roman alphabetic string and extracts conversion candidates corresponding to the Roman alphabetic string, includes: a Roman alphabet correspondence index storage part 113 for storing an index, which consists of a KANA character string of one or more KANA characters and a Roman alphabetic string of one or more characters that follows the KANA character string and does not satisfy a KANA conversion rule, together with conversion candidates corresponding to the index; a KANA conversion part 107 for performing KANA conversion on an input series of Roman alphabetic characters from the front of the string based on the KANA conversion rule; and an input candidate index extraction part 103 for retrieving an index from the Roman alphabet correspondence index storage part 113 based on the character string converted by the KANA conversion part 107, and for extracting the conversion candidates corresponding to the retrieved index from the Roman alphabet correspondence index storage part 113.

Description

The present invention relates to an information processing apparatus that acquires an input Roman character string and retrieves information for extracting conversion candidates corresponding to that Roman character string.

Conventionally, when a document is created on a client terminal such as a personal computer, an input support method has been used in which, once the user enters the first few characters of the reading of a word, matching candidate words are displayed to assist the user's input.
For example, an input support method is known that creates an index associating character strings composed of kana characters with converted character strings, which are the post-conversion candidates for those strings, and that analyzes a kana character string entered by the user (for example, Patent Document 1). With such a method, the user's input can be supported by analyzing the kana character string entered by the user and displaying candidate character strings such as kanji.

Patent Document 1: JP 2000-215198 A

However, the conventional input support method described above has the following problem. When a character string composed of Roman characters is entered without switching to the kana-kanji conversion mode, for example because of a malfunction of the client terminal or a user operation error, candidate character strings cannot be displayed.
One conceivable countermeasure is to create an index that associates input candidates with character strings composed of Roman characters. However, because there are multiple standards for the rules that convert Roman characters into kana, an index covering all of those standards would contain an enormous amount of data.

The present invention has been made in view of this situation, and its object is to output conversion candidates for input composed of Roman characters without enlarging the index.

To solve the problem described above, the present invention has the following features. A first feature of the present invention is an information processing apparatus (information processing apparatus 100) that acquires an input Roman character string (Roman character string 153) and extracts conversion candidates (kana-kanji conversion candidates 170 or Roman character conversion candidates 171) corresponding to the Roman character string, the apparatus comprising: storage means (Roman character correspondence index storage unit 113) for storing an index (Roman character correspondence index 161), which consists of a kana character string (kana character string 163) of one or more kana characters and a Roman character string of one or more characters (non-convertible character string 165) that follows the kana character string and does not satisfy the kana conversion rules, together with conversion candidates (Roman character conversion candidates 171) corresponding to the index; kana conversion means (kana conversion unit 107) for converting the input series of Roman characters into kana from the front of the string based on the kana conversion rules; and extraction means (input candidate index extraction unit 103) for searching the storage means for the index based on the character string produced by the kana conversion means and extracting, from the storage means, the conversion candidates corresponding to the retrieved index.

Such an information processing apparatus includes the storage means, the kana conversion means, and the extraction means. The kana conversion means converts the input series of Roman characters into kana based on the kana conversion rules. That is, the kana conversion means converts into kana Roman character strings entered under the differing input rules of multiple standards.

The storage means stores an index consisting of a kana character string of one or more kana characters and a Roman character string of one or more characters that follows the kana character string and does not satisfy the kana conversion rules, together with the conversion candidates corresponding to the index.
That is, the storage means stores the character string produced by the kana conversion means as an index and stores the conversion candidates corresponding to that index.

Consequently, the storage means can store an index that associates input candidates with character strings composed of Roman characters without having to hold the enormous amount of data that covering every Roman character input standard would require.

The extraction means searches the storage means for the index based on the character string produced by the kana conversion means and extracts, from the storage means, the conversion candidates corresponding to the retrieved index.

With this index, the information processing apparatus can output conversion candidates even when the Roman character string was entered under a different input rule. The information processing apparatus can therefore output conversion candidates for input composed of Roman characters without enlarging the index.

A second feature of the present invention relates to the first feature and further comprises: Roman character conversion means (Roman character conversion unit 121) for converting a pre-registered character string (query 151) into a Roman character string based on Roman character conversion rules; and index creation means (index creation unit 125) for creating the index by dividing the Roman character string produced by the Roman character conversion means at an arbitrary position from its beginning and converting the Roman character string up to the division position based on the kana conversion rules, wherein the storage means stores the pre-registered character string as a conversion candidate in association with the index created by the index creation means.
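
The index creation described in the second feature can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the embodiment's implementation: the rule table `ROMAJI_TO_KANA`, the handling of a lone "n", the function names, and the premise that the Roman spellings of each registered string are already available are all hypothetical.

```python
from collections import defaultdict

# Hypothetical miniature kana conversion rules, just enough for the example below.
ROMAJI_TO_KANA = {"u": "う", "yu": "ゆ", "bi": "び", "ku": "く", "kyo": "きょ"}
MAX_KEY = max(len(key) for key in ROMAJI_TO_KANA)


def convert_from_front(romaji):
    """Convert from the front; return (kana string, non-convertible tail)."""
    kana, pos = [], 0
    while pos < len(romaji):
        for size in range(min(MAX_KEY, len(romaji) - pos), 0, -1):
            chunk = romaji[pos:pos + size]
            if chunk in ROMAJI_TO_KANA:
                kana.append(ROMAJI_TO_KANA[chunk])
                pos += size
                break
        else:
            # Assumption: a lone "n" becomes "ん" unless a vowel or "y" follows.
            if romaji[pos] == "n" and romaji[pos + 1:pos + 2] not in tuple("aiueoy"):
                kana.append("ん")
                pos += 1
            else:
                break  # the rest of the prefix cannot be converted
    return "".join(kana), romaji[pos:]


def build_romaji_index(registered):
    """Map 'kana + non-convertible tail' keys to the registered strings.

    registered: {registered string: [Roman spellings]} (spellings assumed given).
    """
    index = defaultdict(list)
    for candidate, spellings in registered.items():
        for romaji in spellings:
            # Cut the Roman string at every position from its beginning.
            for cut in range(1, len(romaji) + 1):
                kana, tail = convert_from_front(romaji[:cut])
                # Keep only keys made of one or more kana plus a leftover tail.
                if kana and tail and candidate not in index[kana + tail]:
                    index[kana + tail].append(candidate)
    return dict(index)


index = build_romaji_index({"郵便": ["yuubin"], "郵便局": ["yuubinkyoku"]})
print(index["ゆうb"])  # ['郵便', '郵便局']
```

Prefixes that convert entirely to kana (for example "yuubi") produce no key here, because under the first feature the index consists of a kana character string followed by a non-convertible Roman character string.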

A third feature of the present invention relates to the second feature, wherein the Roman character conversion means converts the input character string (query 151) into Roman character strings (for example, Roman character string candidates 157a, 157b, and 157n) based on each of a plurality of Roman character conversion rules that differ from one another.

A fourth feature of the present invention relates to the first to third features, wherein the storage means stores an input query as a conversion candidate.

A fifth feature of the present invention is a conversion candidate extraction method by which a computer acquires an input Roman character string and extracts conversion candidates corresponding to the Roman character string, the method comprising: a step in which the computer converts the input series of Roman characters into kana from the front of the string based on kana conversion rules (step S305); and a step in which the computer, based on the converted character string, searches for an index in storage means that stores the index, which consists of a kana character string of one or more kana characters and a Roman character string of one or more characters that follows the kana character string and does not satisfy the kana conversion rules, together with conversion candidates corresponding to the index, and extracts from the storage means the conversion candidates corresponding to the retrieved index (step S311).

A sixth feature of the present invention is a program for acquiring an input Roman character string and extracting conversion candidates corresponding to the Roman character string, the program causing a computer to execute: a step of storing an index, which consists of a kana character string of one or more kana characters and a Roman character string of one or more characters that follows the kana character string and does not satisfy the kana conversion rules, together with conversion candidates corresponding to the index; a step of converting the input series of Roman characters into kana from the front of the string based on the kana conversion rules; and a step of searching the storage means for the index based on the character string produced by the kana conversion means and extracting, from the storage means, the conversion candidates corresponding to the retrieved index.

According to the present invention, conversion candidates can be output for input composed of Roman characters without enlarging the index.

FIG. 1 is an overall schematic configuration diagram of the information processing system 1 according to an embodiment of the present invention.
FIG. 2 is a functional block diagram of the information processing apparatus 100 according to the embodiment of the present invention.
FIG. 3 is a diagram showing an example of information stored in the information processing apparatus 100 according to the embodiment of the present invention.
FIG. 4 is a conceptual diagram for explaining the operation of the information processing apparatus 100 according to the embodiment of the present invention.
FIG. 5 is a diagram showing an example of information stored in the information processing apparatus 100 according to the embodiment of the present invention.
FIG. 6 is a conceptual diagram for explaining the operation of the information processing apparatus 100 according to the embodiment of the present invention.
FIG. 7 is a conceptual diagram for explaining the operation of the information processing apparatus 100 according to the embodiment of the present invention.
FIG. 8 is a sequence diagram showing the conversion candidate extraction and search processing operation of the information processing apparatus 100 according to the embodiment of the present invention.
FIG. 9 is a flowchart showing details of the conversion candidate extraction processing operation of the information processing apparatus 100 according to the embodiment of the present invention.
FIG. 10 is a flowchart showing details of the index creation processing operation of the information processing apparatus 100 according to the embodiment of the present invention.

Next, embodiments of the present invention will be described with reference to the drawings.

In the following description of the drawings, the same or similar parts are denoted by the same or similar reference numerals. The drawings are schematic, and the ratios of dimensions and the like differ from the actual ones.

Accordingly, specific dimensions and the like should be determined in light of the following description, and the drawings also include portions whose dimensional relationships and ratios differ from one drawing to another.

The following embodiment describes (1) the overall configuration of the information processing system 1, (2) the detailed configuration of the information processing apparatus 100, (3) the operation of the information processing system 1, (5) actions and effects, and (6) other embodiments.

(1) Overall Configuration of Information Processing System 1
FIG. 1 is an overall schematic configuration diagram of the information processing system 1 according to this embodiment.

The information processing system 1 shown in FIG. 1 includes a user terminal 10, a user terminal 30, and the information processing apparatus 100. In this embodiment, the user terminal 10, the user terminal 30, and the information processing apparatus 100 are connected via a communication network 50 (such as the Internet) and can exchange data with one another.

In this embodiment, the user terminal 10 and the user terminal 30 have the same functions, except that the user terminal 10 is a mobile communication terminal device and the user terminal 30 is a so-called PC. The following description therefore focuses on the user terminal 10, and detailed description of the user terminal 30 is omitted as appropriate.

In the information processing system 1, the query 151, the Roman character conversion candidates 171, and search pages are exchanged between the user terminals 10 and 30 and the information processing apparatus 100.

The information processing apparatus 100 acquires, as the query 151, the Roman character string 153 input from the user terminal 10, extracts the kana-kanji conversion candidates 170 or the Roman character conversion candidates 171 corresponding to the Roman character string 153, and transmits them to the user terminal 10. The following embodiment uses a concrete example in which, based on a query 151 consisting of the Roman character string 153 "yuubin", the candidates "郵便番号" (postal code), "郵便局" (post office), "郵便" (mail), and "郵便料金" (postage) are extracted as the Roman character conversion candidates 171 corresponding to the Roman character string 153.

The user terminal 10 is a device that acquires search pages from the information processing apparatus 100 via the communication network 50. The user terminal 10 is, for example, a PC (personal computer), a mobile communication terminal device (a mobile phone or smartphone), or a car navigation system used by the user, and is connected to the information processing apparatus 100 via the communication network 50 by wireless LAN, short-range wireless communication (such as Bluetooth (registered trademark) or infrared communication), or wired communication (such as a USB (Universal Serial Bus) cable connection). In addition to the kinds of hardware resources that the information processing apparatus 100 has, the user terminal 10 includes an input device such as key buttons or a touch panel, an output device such as a liquid crystal display, and a general-purpose browser. The browser has functions for requesting, acquiring, and displaying page data written in a language such as HTML (HyperText Markup Language) and for transmitting form data, in accordance with HTTP (Hypertext Transfer Protocol), a standard Internet protocol, or the like.

The user terminal 10 displays at least a query input field 260 (see FIG. 2) in which the user enters characters, numbers, and the like as the query 151, a search button 270 (see FIG. 2) for executing a search based on the query in the query input field 260, and a conversion candidate field 280 that displays the Roman character conversion candidates 171 to the user. As the user types into the query input field 260, the user terminal 10 successively transmits the query 151 being entered to the information processing apparatus 100. The user terminal 10 may also transmit the query 151 to the information processing apparatus 100 when the user operates the search button 270. The processing by which the user terminal 10 successively transmits the query entered in the query input field 260 to the information processing apparatus 100 can be implemented, for example, by a program such as JavaScript embedded in a web page.

The user terminal 10 also successively acquires the Roman character conversion candidates 171 from the information processing apparatus 100 and displays them to the user in the conversion candidate field 280. That is, for the query 151 being entered, the user terminal 10 successively acquires the Roman character conversion candidates 171 from the information processing apparatus 100 and displays them to the user.

(2) Detailed Configuration of Information Processing Apparatus 100
FIG. 2 is a functional block diagram of the information processing apparatus 100 according to this embodiment.

The information processing apparatus 100 is configured from one or more computers having hardware resources such as a control device (for example, a CPU), storage devices (for example, memory and an HDD), and a communication device (for example, a communication I/F that communicates with the communication network 50). Each of the blocks of the information processing apparatus 100 described below is realized by executing a computer program using these hardware resources. These blocks and means need not reside on a single computer and may be distributed as necessary.

The information processing apparatus 100 includes a query acquisition unit 101, an input candidate index extraction unit 103, an input candidate index storage unit 105, a kana conversion unit 107, a converted character string determination unit 109, a Roman character correspondence index extraction unit 111, a Roman character correspondence index storage unit 113, an input candidate transmission unit 115, a query log acquisition unit 117, a query log storage unit 119, a Roman character conversion unit 121, a kana conversion dictionary storage unit 123, and an index creation unit 125. These functional units are implemented by a so-called computer, through the cooperation of a CPU (Central Processing Unit) as the arithmetic/control device, RAM (Random Access Memory) and ROM (Read Only Memory) as storage media, a communication interface, and the like. Each functional unit is described below.

(2.1) Query acquisition unit 101
The query acquisition unit 101 acquires the query 151 from the user terminal 10 via the communication network 50. Specifically, the query acquisition unit 101 successively acquires the query 151 being entered from the user terminal 10 via the communication network 50. For example, the query acquisition unit 101 successively acquires the query 151 consisting of "yuubin" from the user terminal 10 as it is typed.

(2.2) Input candidate index extraction unit 103
FIG. 3 is a diagram showing an example of information stored in the input candidate index storage unit 105 of the information processing apparatus 100.
The input candidate index extraction unit 103 searches the input candidate index storage unit 105 based on the query 151 acquired by the query acquisition unit 101 and extracts the corresponding kana-kanji conversion candidates 170. That is, the query 151 corresponds to a kana-kanji correspondence index 160, and the corresponding kana-kanji conversion candidates 170 are extracted by searching for a kana-kanji correspondence index 160 that substantially matches the query 151.

In this embodiment, the query 151 consisting of "yuubin" is not stored as a kana-kanji correspondence index 160. The input candidate index extraction unit 103 therefore cannot retrieve a kana-kanji correspondence index 160 from the input candidate index storage unit 105 and cannot extract kana-kanji conversion candidates 170.

The input candidate index extraction unit 103 also searches the input candidate index storage unit 105 based on the converted character string 169, which the kana conversion unit 107 produces by converting the query 151 according to the kana conversion rules, and extracts the corresponding kana-kanji conversion candidates 170.

As shown in FIG. 3, for example, when the query acquired by the query acquisition unit 101 or the converted character string 169 is "ゆうびん", the input candidate index extraction unit 103 searches the input candidate index storage unit 105 for the kana-kanji correspondence index 160 matching "ゆうびん" and extracts "郵便番号", "郵便局", "郵便", and "郵便料金" as the kana-kanji conversion candidates 170 corresponding to it.

(2.3) Input candidate index storage unit 105
As shown in FIG. 3, the input candidate index storage unit 105 stores kana-kanji correspondence indexes 160, each a character string that contains one or more kana or kanji characters and no Roman characters, and kana-kanji conversion candidates 170, which are the conversion candidates corresponding to the kana-kanji correspondence indexes 160.

Specifically, when an input query 151 is a character string that contains kana or kanji and no Roman characters, the input candidate index storage unit 105 stores the input query 151 as a kana-kanji correspondence index 160. The input candidate index storage unit 105 also stores the kana-kanji conversion candidates 170 in association with the kana-kanji correspondence index 160.

The kana-kanji correspondence index 160 is an index used to look up kana-kanji conversion candidates 170 from the query 151, and consists of hiragana, katakana, kanji, numerals, or other characters, or a combination of these. The kana-kanji conversion candidates 170 are the conversion candidates for the query 151 that are displayed in the conversion candidate field 280 of the user terminal 10; they consist of hiragana, katakana, kanji, numerals, Roman characters, or other characters, or a combination of these.

For example, the input candidate index storage unit 105 stores "ゆう", "ゆうび", ..., "郵", "ゆう便", and so on as kana-kanji correspondence indexes 160, and stores the corresponding "郵便番号", "郵便局", "郵便", "郵便料金", and so on as kana-kanji conversion candidates 170.
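
The stored association and the lookup against it can be pictured with a small in-memory mapping. This is only a sketch using the example strings above: the names `KANA_KANJI_INDEX` and `lookup_kana_kanji` are hypothetical, an exact-match dictionary stands in for the "substantially matches" search, and nothing here reflects how the storage unit is actually implemented.

```python
# Example candidates taken from the description; the data layout is an assumption.
CANDIDATES = ["郵便番号", "郵便局", "郵便", "郵便料金"]

# Each kana-kanji correspondence index (no Roman characters) maps to its candidates.
KANA_KANJI_INDEX = {
    key: CANDIDATES for key in ("ゆう", "ゆうび", "ゆうびん", "郵", "ゆう便")
}


def lookup_kana_kanji(text):
    """Return the candidates stored for an exactly matching index, else an empty list."""
    return list(KANA_KANJI_INDEX.get(text, []))


print(lookup_kana_kanji("ゆうびん"))  # ['郵便番号', '郵便局', '郵便', '郵便料金']
print(lookup_kana_kanji("yuubin"))    # [] because Roman input is not stored in this index
```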

(2.4) Kana conversion unit 107
FIG. 4 is a conceptual diagram for explaining the operation of the kana conversion unit 107 of the information processing apparatus 100.

As shown in FIG. 4, when the input query 151 consists of a series of Roman characters (Roman character string 153), the kana conversion unit 107 converts the query 151 into kana from the front of the Roman character string 153 based on the kana conversion rules. Specifically, the kana conversion unit 107 converts the query 151, from the front of the Roman character string 153, into the converted character string 169. Depending on the input Roman character string 153, the converted character string 169 consists either of only a kana character string 163 of one or more kana characters (FIG. 4(a)), or of a kana character string 163 of one or more kana characters followed by a non-convertible character string 165, which is a Roman character string of one or more characters that does not satisfy the kana conversion rules (FIG. 4(b)).

For example, as shown in FIG. 4(a), when the query 151 is "yuubin", the kana conversion unit 107 converts "yuubin" into the converted character string 169 "ゆうびん". As shown in FIG. 4(b), when the query 151 is "yuub", the kana conversion unit 107 converts "yuub" into the converted character string 169 "ゆうｂ".
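
The front-to-back conversion in these two examples can be sketched as follows. This is only an illustration under stated assumptions: the table `ROMAJI_TO_KANA`, the function names, the longest-match strategy, and the rule for a lone "n" are hypothetical and are not taken from the embodiment, and the non-convertible tail is kept in the ASCII form in which it was typed.

```python
# Hypothetical, deliberately tiny rule table; real kana conversion rules are far larger.
ROMAJI_TO_KANA = {
    "a": "あ", "i": "い", "u": "う", "e": "え", "o": "お",
    "yu": "ゆ", "bi": "び", "ku": "く", "kyo": "きょ",
}
MAX_KEY = max(len(key) for key in ROMAJI_TO_KANA)


def match_at(text, pos):
    """Return (kana, consumed length) for the longest rule matching at pos, or None."""
    for size in range(min(MAX_KEY, len(text) - pos), 0, -1):
        chunk = text[pos:pos + size]
        if chunk in ROMAJI_TO_KANA:
            return ROMAJI_TO_KANA[chunk], size
    # Assumption: a lone "n" becomes "ん" at the end of the input or before
    # a consonant other than "y".
    if text[pos] == "n" and text[pos + 1:pos + 2] not in ("a", "i", "u", "e", "o", "y"):
        return "ん", 1
    return None


def convert_from_front(romaji):
    """Convert from the front; return (kana string, non-convertible tail)."""
    kana = []
    pos = 0
    while pos < len(romaji):
        step = match_at(romaji, pos)
        if step is None:
            break  # everything from here on stays as Roman characters
        piece, size = step
        kana.append(piece)
        pos += size
    return "".join(kana), romaji[pos:]


print(convert_from_front("yuubin"))  # ('ゆうびん', ''), kana only as in FIG. 4(a)
print(convert_from_front("yuub"))    # ('ゆう', 'b'), kana plus a tail as in FIG. 4(b)
```

Returning the kana part and the leftover tail separately, rather than one concatenated string, is a design choice of this sketch that makes the later kana-only versus kana-plus-tail decision a simple emptiness check.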

（２．５）変換文字列判定部１０９ (2.5) Converted character string determination unit 109
変換文字列判定部１０９は、変換文字列１６９が、仮名文字列１６３と、該仮名文字列１６３の後ろに続く変換不可文字列１６５とによって構成されるか、又は１文字以上の仮名からなる仮名文字列１６３のみによって構成されるかを判定する。 The converted character string determination unit 109 determines whether the converted character string 169 consists of a kana character string 163 followed by a non-convertible character string 165, or of only a kana character string 163 of one or more kana characters.

具体的には、変換文字列判定部１０９は、変換文字列１６９が、仮名文字列１６３のみによって構成されると判定した場合に、仮名漢字変換候補１７０を入力候補インデックス記憶部１０５から抽出することを判定する。 Specifically, when the converted character string determination unit 109 determines that the converted character string 169 consists of only the kana character string 163, it decides that the kana-kanji conversion candidate 170 is to be extracted from the input candidate index storage unit 105.

また、変換文字列判定部１０９は、変換文字列１６９が、仮名文字列１６３と、該仮名文字列１６３の後ろに続く変換不可文字列１６５とによって構成されると判定した場合にローマ字変換候補１７１をローマ字対応インデックス記憶部１１３から抽出することを判定する。 Further, when the converted character string determination unit 109 determines that the converted character string 169 consists of the kana character string 163 followed by the non-convertible character string 165, it decides that the Romaji conversion candidate 171 is to be extracted from the Romaji correspondence index storage unit 113.
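The determination described above amounts to checking whether a converted character string ends in leftover Roman letters. A minimal Python sketch, assuming the kana part is hiragana and the non-convertible part is lowercase ASCII; the function name `select_store` and the returned labels are hypothetical and only stand in for the two storage units:

```python
import re

# One or more hiragana (or the long-vowel mark), then zero or more
# leftover lowercase Roman letters.
CONVERTED = re.compile(r"([ぁ-ゖー]+)([a-z]*)")


def select_store(converted):
    """Return which (hypothetically named) store should be searched."""
    match = CONVERTED.fullmatch(converted)
    if match is None:
        return None  # no leading kana at all: neither case applies
    kana, residue = match.groups()
    if residue:
        return "romaji_index_113"      # kana followed by non-convertible letters
    return "kana_kanji_index_105"      # kana only
```

With this sketch, “ゆうびん” is routed to the kana-kanji index and “ゆうb” to the Romaji correspondence index.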

（２．６）ローマ字対応インデックス抽出部１１１ (2.6) Romaji correspondence index extraction unit 111
図５は、情報処理装置１００のローマ字対応インデックス記憶部１１３に記憶される情報の一例を示す図である。 FIG. 5 is a diagram illustrating an example of information stored in the Romaji correspondence index storage unit 113 of the information processing apparatus 100.

図５に示すように、ローマ字対応インデックス抽出部１１１は、変換文字列１６９が、１文字以上の仮名からなる仮名文字列と、該仮名文字列の後ろに続くとともに仮名変換規則を満たさない１文字以上のローマ字列とによって構成される場合にローマ字変換候補１７１をローマ字対応インデックス記憶部１１３から抽出する。 As illustrated in FIG. 5, the Romaji correspondence index extraction unit 111 extracts the Romaji conversion candidate 171 from the Romaji correspondence index storage unit 113 when the converted character string 169 consists of a kana character string of one or more kana characters followed by a Roman character string of one or more characters that does not satisfy the kana conversion rules.

具体的には、ローマ字対応インデックス抽出部１１１は、仮名変換部１０７に変換された文字列に基づいて、ローマ字対応インデックス記憶部１１３からローマ字対応インデックス１６１を検索し、検索されたローマ字対応インデックス１６１に対応するローマ字変換候補１７１をローマ字対応インデックス記憶部１１３から抽出する。 Specifically, the Romaji correspondence index extraction unit 111 searches the Romaji correspondence index storage unit 113 for a Romaji correspondence index 161 based on the character string converted by the kana conversion unit 107, and extracts the Romaji conversion candidate 171 corresponding to the retrieved Romaji correspondence index 161 from the Romaji correspondence index storage unit 113.
例えば、ローマ字対応インデックス抽出部１１１は、仮名変換部１０７で変換された変換文字列１６９が、「ゆうｂ」であった場合、ローマ字対応インデックス記憶部１１３から「ゆうｂ」に該当するローマ字対応インデックス１６１を検索し、検索された「ゆうｂ」に対応するローマ字変換候補１７１として、「郵便番号」、「郵便局」、「郵便」、「郵便料金」をローマ字対応インデックス記憶部１１３から抽出する。 For example, when the converted character string 169 produced by the kana conversion unit 107 is “ゆうb”, the Romaji correspondence index extraction unit 111 searches the Romaji correspondence index storage unit 113 for the Romaji correspondence index 161 matching “ゆうb”, and extracts “郵便番号” (postal code), “郵便局” (post office), “郵便” (mail), and “郵便料金” (postage) from the Romaji correspondence index storage unit 113 as the Romaji conversion candidates 171 corresponding to “ゆうb”.

（２．７）ローマ字対応インデックス記憶部１１３ (2.7) Romaji correspondence index storage unit 113
図５に示すように、ローマ字対応インデックス記憶部１１３は、１文字以上の仮名からなる仮名文字列１６３と、該仮名文字列１６３の後ろに続くとともに仮名変換規則を満たさない１文字以上のローマ字列である変換不可文字列１６５とからなるインデックスであるローマ字対応インデックス１６１と、ローマ字対応インデックス１６１に対応する変換候補であるローマ字変換候補１７１とを記憶する。 As illustrated in FIG. 5, the Romaji correspondence index storage unit 113 stores a Romaji correspondence index 161, which is an index consisting of a kana character string 163 of one or more kana characters followed by a non-convertible character string 165 (a Roman character string of one or more characters that does not satisfy the kana conversion rules), together with a Romaji conversion candidate 171, which is the conversion candidate corresponding to the Romaji correspondence index 161.

具体的には、ローマ字対応インデックス記憶部１１３は、仮名変換部１０７に変換された変換文字列１６９が、１文字以上の仮名からなる仮名文字列１６３と、該仮名文字列１６３の後ろに続くとともに仮名変換規則を満たさない１文字以上の文字列である変換不可文字列１６５とによって構成される場合に、変換文字列１６９をローマ字対応インデックス１６１として記憶する。また、ローマ字対応インデックス記憶部１１３は、予め登録された文字列からなるクエリ１５１をローマ字変換候補１７１として、ローマ字対応インデックス１６１に対応させて記憶する。 Specifically, in the Roman alphabet corresponding index storage unit 113, the converted character string 169 converted by the kana conversion unit 107 follows the kana character string 163 composed of one or more kana characters and the kana character string 163. In the case of being composed of a non-convertible character string 165 that is a character string of one or more characters that does not satisfy the kana conversion rule, the converted character string 169 is stored as the Roman character correspondence index 161. In addition, the Roman alphabet corresponding index storage unit 113 stores a query 151 including a character string registered in advance as a Roman alphabet conversion candidate 171 corresponding to the Roman alphabet corresponding index 161.

なお、ローマ字対応インデックス１６１とは、クエリ１５１に基づいて、ローマ字変換候補１７１を引き当てるためのインデックスであり、１文字以上の仮名からなる仮名文字列１６３と、該仮名文字列１６３の後ろに続くとともに仮名変換規則を満たさない１文字以上の文字列である変換不可文字列１６５とによって構成される。また、ローマ字変換候補１７１とは、ユーザ端末１０の変換候補欄２８０に表示されるクエリ１５１の変換候補である。すなわち、ローマ字変換候補１７１は、平仮名、カタカナ、漢字、数字、ローマ字等の文字列、又はこれらの組み合わせにより構成される。 The Roman character correspondence index 161 is an index for assigning a Roman character conversion candidate 171 based on the query 151. The kana character string 163 composed of one or more kana characters and the kana character string 163 follow the kana character string 163. And a non-convertible character string 165 that is a character string of one or more characters that does not satisfy the kana conversion rule. The Roman character conversion candidate 171 is a conversion candidate of the query 151 displayed in the conversion candidate column 280 of the user terminal 10. That is, the Roman alphabet conversion candidate 171 is configured by a hiragana character, katakana, kanji, numerals, a character string such as a Roman character, or a combination thereof.

また、変換不可文字列１６５とは、いかなる仮名変換規則に基づいても、仮名に変換できないローマ字列である。具体的には、変換不可文字列１６５とは、仮名変換規則に基づいて母音（a、i、u、e、o）が不足しているローマ字列である。 The non-convertible character string 165 is a Roman character string that cannot be converted into a kana based on any kana conversion rules. Specifically, the non-convertible character string 165 is a Roman character string that lacks vowels (a, i, u, e, o) based on the kana conversion rules.

例えば、ローマ字対応インデックス記憶部１１３は、「ゆうｂ」、「ゆうびｎ」、「ゆうびｘ」等をローマ字対応インデックス１６１として、対応する「郵便番号」、「郵便局」、「郵便」、「郵便料金」等をローマ字変換候補１７１として記憶する。 For example, the Romaji correspondence index storage unit 113 stores “ゆうb”, “ゆうびn”, “ゆうびx”, and so on as Romaji correspondence indexes 161, and stores the corresponding “郵便番号” (postal code), “郵便局” (post office), “郵便” (mail), “郵便料金” (postage), and so on as Romaji conversion candidates 171.
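The stored data in this example can be pictured as a simple key-value mapping. The Python sketch below is an assumption about representation only (the source does not say how the storage unit is implemented); `ROMAJI_INDEX` and `lookup` are hypothetical names.

```python
# Candidates taken from the example in the text.
CANDIDATES = ["郵便番号", "郵便局", "郵便", "郵便料金"]

# Romaji correspondence index 161 -> Romaji conversion candidates 171.
# Each key is a kana string followed by leftover Roman letters.
ROMAJI_INDEX = {
    "ゆうb": CANDIDATES,
    "ゆうびn": CANDIDATES,
    "ゆうびx": CANDIDATES,
}


def lookup(converted):
    """Return the candidates for a converted string, or an empty list."""
    return ROMAJI_INDEX.get(converted, [])
```

Keys such as “ゆうびn” and “ゆうびx” differ only in the trailing letters left over by different input conventions, yet they lead to the same candidates.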

（２．８）入力候補送信部１１５ (2.8) Input candidate transmission unit 115
入力候補送信部１１５は、ユーザ端末１０からの入力中の一連のローマ字列１５３について、仮名漢字変換候補１７０又はローマ字変換候補１７１をユーザ端末１０に逐次送信する。 The input candidate transmission unit 115 sequentially transmits the kana-kanji conversion candidate 170 or the Romaji conversion candidate 171 to the user terminal 10 for the series of Roman character strings 153 being input from the user terminal 10.

具体的には、入力候補送信部１１５は、ローマ字列１５３が、１文字以上の仮名からなる仮名文字列１６３のみによって構成される場合、仮名漢字変換候補１７０をユーザ端末１０に送信する。また、入力候補送信部１１５は、ローマ字列１５３が、１文字以上の仮名からなる仮名文字列１６３と、該仮名文字列１６３の後ろに続くとともに仮名変換規則を満たさない１文字以上のローマ字列である変換不可文字列１６５とによって構成される場合、ローマ字変換候補１７１をユーザ端末１０に送信する。 Specifically, when the Roman character string 153, once converted, consists of only a kana character string 163 of one or more kana characters, the input candidate transmission unit 115 transmits the kana-kanji conversion candidate 170 to the user terminal 10. When the Roman character string 153, once converted, consists of a kana character string 163 of one or more kana characters followed by a non-convertible character string 165 (a Roman character string of one or more characters that does not satisfy the kana conversion rules), the input candidate transmission unit 115 transmits the Romaji conversion candidate 171 to the user terminal 10.

（２．９）クエリログ取得部１１７ (2.9) Query log acquisition unit 117
クエリログ取得部１１７は、クエリログ記憶部１１９から、ユーザにより入力されたクエリ１５１を取得する。 The query log acquisition unit 117 acquires the query 151 input by the user from the query log storage unit 119.

（２．１０）クエリログ記憶部１１９ (2.10) Query log storage unit 119
クエリログ記憶部１１９は、ユーザにより入力されたクエリ１５１を記憶する。具体的には、クエリログ記憶部１１９は、クエリ取得部１０１によって取得されたクエリ１５１、クエリ１５１に基づいた検索結果からユーザがユーザ端末１０において選択したサイトのＵＲＬ、クエリ１５１が入力された時間情報等を記憶する。 The query log storage unit 119 stores the query 151 input by the user. Specifically, the query log storage unit 119 stores the query 151 acquired by the query acquisition unit 101, the URL of the site that the user selected on the user terminal 10 from the search results for the query 151, time information indicating when the query 151 was input, and the like.

（２．１１）ローマ字変換部１２１ (2.11) Romaji conversion unit 121
図６は、情報処理装置１００のローマ字変換部１２１の動作を説明するための概念図である。 FIG. 6 is a conceptual diagram for explaining the operation of the Romaji conversion unit 121 of the information processing apparatus 100.

図６に示すように、ローマ字変換部１２１は、予め登録された文字列であるクエリ１５１をローマ字変換規則に基づいて、ローマ字列であるローマ字列候補１５７に変換する。 As illustrated in FIG. 6, the Roman character conversion unit 121 converts a query 151 that is a pre-registered character string into a Roman character string candidate 157 that is a Roman character string based on a Roman character conversion rule.

ローマ字変換部１２１は、変換規則が異なる複数のローマ字変換規則それぞれに基づいて、入力されたクエリ１５１をローマ字列候補１５７に変換する。 The Romaji conversion unit 121 converts the input query 151 into a Romaji string candidate 157 based on each of a plurality of Romaji conversion rules having different conversion rules.

具体的には、ローマ字変換部１２１は、上記のローマ字変換規則として、例えば、「ＪＩＳ X ４０６３：２０００」（仮名漢字変換システムのための英字キー入力から仮名への変換方式）、訓令式、ヘボン式、「Ｍｉｃｒｏｓｏｆｔ（登録商標）ＩＭＥ」による入力から仮名への変換方式、「ＡＴＯＫ（登録商標）」による入力から仮名への変換方式に基づいて、クエリ１５１を複数のローマ字列候補１５７に変換する。 Specifically, the Romaji conversion unit 121 converts the query 151 into a plurality of Roman character string candidates 157 based on Romaji conversion rules such as “JIS X 4063:2000” (a method of converting alphabetic key input into kana for kana-kanji conversion systems), the Kunrei system, the Hepburn system, the input-to-kana conversion method of “Microsoft (registered trademark) IME”, and the input-to-kana conversion method of “ATOK (registered trademark)”.

また、ローマ字変換部１２１は、クエリログ取得部１１７により取得されたクエリ１５１をローマ字列候補１５７に変換する。なお、クエリ１５１に漢字が含まれる場合、ローマ字変換部１２１は、クエリ１５１に基づいて、仮名変換辞書記憶部１２３から対応する仮名文字列１６３を抽出する。 The Roman character conversion unit 121 converts the query 151 acquired by the query log acquisition unit 117 into a Roman character string candidate 157. When the query 151 includes kanji, the Roman character conversion unit 121 extracts the corresponding kana character string 163 from the kana conversion dictionary storage unit 123 based on the query 151.

例えば、図６（ａ）に示すように、クエリログ取得部１１７により取得されたクエリ１５１が、「郵便番号」であった場合、ローマ字変換部１２１は、「郵便番号」に対応する仮名文字列１６３である「ゆうびんばんごう」を仮名変換辞書記憶部１２３に記憶された辞書に基づいて変換する。次に、ローマ字変換部１２１は、「ゆうびんばんごう」を、ローマ字列候補１５７ａ、ローマ字列候補１５７ｂ、・・・ローマ字列候補１５７ｎ等である「yuubinbangou」、「yuwubinbangou」、「yuwubinbangowu」「yuubinnbanngou」、「yuubixnbaxngou」等に変換する。 For example, as illustrated in FIG. 6(a), when the query 151 acquired by the query log acquisition unit 117 is “郵便番号” (postal code), the Romaji conversion unit 121 first obtains the corresponding kana character string 163, “ゆうびんばんごう”, based on the dictionary stored in the kana conversion dictionary storage unit 123. Next, the Romaji conversion unit 121 converts “ゆうびんばんごう” into Roman character string candidates 157a, 157b, ..., 157n, such as “yuubinbangou”, “yuwubinbangou”, “yuwubinbangowu”, “yuubinnbanngou”, and “yuubixnbaxngou”.

また、図６（ｂ）に示すように、ローマ字変換部１２１は、クエリログ取得部１１７により取得されたクエリ１５１が、仮名文字列１６３からなる（例えば、「ゆうびんばんごう」である）場合、仮名変換辞書記憶部１２３を用いることなくクエリ１５１である「ゆうびんばんごう」を、上記の例と同様にローマ字列候補１５７に変換する。 Also, as illustrated in FIG. 6(b), when the query 151 acquired by the query log acquisition unit 117 consists of a kana character string 163 (for example, “ゆうびんばんごう”), the Romaji conversion unit 121 converts the query 151 “ゆうびんばんごう” into Roman character string candidates 157 in the same way as in the above example, without using the kana conversion dictionary storage unit 123.
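Generating one Roman character string candidate per spelling convention can be sketched as below. This is an illustrative assumption, not the conversion rules of JIS X 4063, Kunrei, Hepburn, or any IME: `BASE`, `RULE_SETS`, and `romaji_candidates` are hypothetical names, each rule set is modeled as a few overrides on a base table, and the input is assumed to be already in kana (the dictionary lookup for kanji is omitted).

```python
# Base kana -> romaji spellings for the characters in the example.
BASE = {"ゆ": "yu", "う": "u", "び": "bi", "ん": "n", "ば": "ba", "ご": "go"}

# Each hypothetical rule set overrides a few spellings of the base table.
RULE_SETS = [
    {},            # base spellings only
    {"う": "wu"},   # "wu" for う
    {"ん": "nn"},   # doubled "n" for ん
    {"ん": "xn"},   # "xn" for ん
]


def romaji_candidates(kana):
    """Return one Roman character string candidate per rule set."""
    candidates = []
    for overrides in RULE_SETS:
        table = {**BASE, **overrides}
        spelled = "".join(table[ch] for ch in kana)
        if spelled not in candidates:  # drop duplicates, keep order
            candidates.append(spelled)
    return candidates


result = romaji_candidates("ゆうびんばんごう")
```

Here `result` contains four of the spellings listed in the example; mixed spellings such as “yuwubinbangou” would need rule sets that combine conventions.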

（２．１２）仮名変換辞書記憶部１２３ (2.12) Kana conversion dictionary storage unit 123
仮名変換辞書記憶部１２３は、仮名文字列以外の文字が含まれている漢字混合文字列と、漢字混合文字列の読み仮名からなる仮名文字列とを記憶する。具体的には、仮名変換辞書記憶部１２３は、漢字、カタカナ、数字等を漢字混合文字列として記憶する。 The kana conversion dictionary storage unit 123 stores kanji-mixed character strings, which contain characters other than kana, together with the kana character strings that give their readings. Specifically, the kana conversion dictionary storage unit 123 stores kanji, katakana, numerals, and the like as kanji-mixed character strings.

（２．１３）インデックス作成部１２５ (2.13) Index creation unit 125
図７は、情報処理装置１００のローマ字変換部１２１の動作を説明するための概念図である。 FIG. 7 is a conceptual diagram for explaining the operation of the Romaji conversion unit 121 of the information processing apparatus 100.

図７に示すように、インデックス作成部１２５は、ローマ字変換部１２１により変換された一連のローマ字列であるローマ字列候補１５７を先頭から任意の位置（例えば、２文字や３文字ずつ区切った位置）で区切り、その区切った位置１２６までのローマ字列候補１５７を仮名変換規則に基づいて変換することで、仮名漢字対応インデックス１６０又はローマ字対応インデックス１６１を作成する。 As illustrated in FIG. 7, the index creation unit 125 cuts the Roman character string candidate 157, which is the series of Roman characters produced by the Romaji conversion unit 121, at an arbitrary position counted from the beginning (for example, at every two or three characters), and converts the part of the Roman character string candidate 157 up to the cut position 126 based on the kana conversion rules, thereby creating a kana-kanji correspondence index 160 or a Romaji correspondence index 161.

具体的には、図７（ａ）に示すように、インデックス作成部１２５は、ローマ字列候補１５７の区切った位置１２６に応じて、１文字以上の仮名からなる仮名文字列１６３のみによって構成される変換文字列１６９に変換する場合がある。また、図７（ｂ）に示すように、インデックス作成部１２５は、ローマ字列候補１５７の区切った位置１２６に応じて、１文字以上の仮名からなる仮名文字列１６３と、該仮名文字列１６３の後ろに続くとともに仮名変換規則を満たさない１文字以上のローマ字列である変換不可文字列１６５とによって構成される変換文字列１６９に変換する場合がある。 Specifically, as shown in FIG. 7A, the index creation unit 125 is configured only by a kana character string 163 composed of one or more kana characters according to the position 126 where the Roman character string candidates 157 are separated. There is a case where the converted character string 169 is converted. Also, as shown in FIG. 7B, the index creating unit 125 determines the kana character string 163 composed of one or more kana characters and the kana character string 163 according to the position 126 where the Roman character string candidates 157 are separated. There is a case where the character string is converted into a conversion character string 169 that is followed by a non-convertible character string 165 that is one or more Roman characters that do not satisfy the kana conversion rule.

インデックス作成部１２５は、変換文字列１６９が、仮名文字列１６３のみによって構成される場合に変換文字列１６９を仮名漢字対応インデックス１６０として、入力候補インデックス記憶部１０５に記憶する。また、インデックス作成部１２５は、ローマ字変換部１２１及びインデックス作成部１２５により変換文字列１６９に変換された元のクエリ１５１を仮名漢字対応インデックス１６０に対応する仮名漢字変換候補１７０として入力候補インデックス記憶部１０５に記憶する。 When the converted character string 169 consists of only the kana character string 163, the index creation unit 125 stores the converted character string 169 in the input candidate index storage unit 105 as a kana-kanji correspondence index 160. The index creation unit 125 also stores the original query 151, from which the Romaji conversion unit 121 and the index creation unit 125 derived the converted character string 169, in the input candidate index storage unit 105 as the kana-kanji conversion candidate 170 corresponding to the kana-kanji correspondence index 160.

インデックス作成部１２５は、変換文字列１６９が、仮名文字列１６３と、該仮名文字列１６３の後ろに続く変換不可文字列１６５とによって構成される場合に変換文字列１６９をローマ字対応インデックス１６１として、ローマ字対応インデックス記憶部１１３に記憶する。また、インデックス作成部１２５は、ローマ字変換部１２１及びインデックス作成部１２５により変換文字列１６９に変換された元のクエリ１５１をローマ字対応インデックス１６１に対応するローマ字変換候補１７１としてローマ字対応インデックス記憶部１１３に記憶する。 When the converted character string 169 consists of the kana character string 163 followed by the non-convertible character string 165, the index creation unit 125 stores the converted character string 169 in the Romaji correspondence index storage unit 113 as a Romaji correspondence index 161. The index creation unit 125 also stores the original query 151, from which the Romaji conversion unit 121 and the index creation unit 125 derived the converted character string 169, in the Romaji correspondence index storage unit 113 as the Romaji conversion candidate 171 corresponding to the Romaji correspondence index 161.
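The index creation just described can be sketched in Python as follows. The sketch makes several assumptions that are not in the source: it cuts at every character position rather than every two or three characters, it uses a tiny hypothetical rule table in which ん is typed as “nn”, and it keeps the two stores as in-memory dictionaries (`kana_index`, `romaji_index`).

```python
from collections import defaultdict

# Hypothetical minimal kana conversion rules; "nn" is assumed to give ん.
KANA_RULES = {"yu": "ゆ", "u": "う", "bi": "び", "nn": "ん", "ba": "ば", "go": "ご"}


def to_kana(romaji):
    """Greedy front-to-back conversion: (kana, non-convertible rest)."""
    kana, i = "", 0
    while i < len(romaji):
        for length in (2, 1):
            chunk = romaji[i:i + length]
            if chunk in KANA_RULES:
                kana += KANA_RULES[chunk]
                i += len(chunk)
                break
        else:
            break  # nothing matches here, so the rest stays as Roman letters
    return kana, romaji[i:]


def build_indexes(query, romaji_candidates):
    """Register `query` under every prefix-derived index of each candidate."""
    kana_index = defaultdict(set)    # kana-kanji correspondence index 160
    romaji_index = defaultdict(set)  # Romaji correspondence index 161
    for candidate in romaji_candidates:
        for cut in range(1, len(candidate) + 1):
            kana, rest = to_kana(candidate[:cut])
            if not kana:
                continue  # an index needs at least one kana character
            if rest:
                romaji_index[kana + rest].add(query)
            else:
                kana_index[kana].add(query)
    return kana_index, romaji_index


kana_index, romaji_index = build_indexes("郵便番号", ["yuubinnbanngou"])
```

Because every prefix is first converted to kana, spelling variants that differ only in already-converted characters collapse into the same index key, which is how the index stays small.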

（３）情報処理システム１の動作 (3) Operation of information processing system 1
情報処理システム１の動作について、（３．１）全体概略動作、（３．２）情報処理装置１００の変換候補抽出処理動作、（３．３）情報処理装置１００のインデックス作成処理動作の順に説明する。 The operation of the information processing system 1 will be described in the following order: (3.1) overall schematic operation, (3.2) conversion candidate extraction processing operation of the information processing apparatus 100, and (3.3) index creation processing operation of the information processing apparatus 100.

（３．１）全体概略動作 (3.1) Overall schematic operation
図８は、実施形態に係る情報処理システム１の変換候補抽出及び検索処理動作を示すシーケンス図である。情報処理システム１は、ユーザ端末１０によって入力されたクエリ１５１に基づいて、情報処理装置１００が、検索ページを作成し、ユーザ端末１０に表示する。 FIG. 8 is a sequence diagram showing the conversion candidate extraction and search processing operations of the information processing system 1 according to the embodiment. In the information processing system 1, the information processing apparatus 100 creates a search page based on the query 151 input at the user terminal 10, and the page is displayed on the user terminal 10.

図８に示すように、ステップＳ１００において、ユーザ端末１０は、ユーザからローマ字列１５３からなるクエリ１５１を受け入れる。 As shown in FIG. 8, in step S100, the user terminal 10 accepts a query 151 consisting of a Roman character string 153 from the user.
ステップＳ２００において、ユーザ端末１０は、通信ネットワーク５０を介して、ユーザが入力途中のクエリ１５１について、逐次、情報処理装置１００のクエリ取得部１０１へ送信する。 In step S200, the user terminal 10 sequentially transmits the query 151 that the user is in the middle of typing to the query acquisition unit 101 of the information processing apparatus 100 via the communication network 50.

ステップＳ３００において、情報処理装置１００は、入力されたローマ字列をクエリ１５１として取得して、該ローマ字列に対応する仮名漢字変換候補１７０又はローマ字変換候補１７１を抽出してユーザ端末１０に送信する。 In step S 300, the information processing apparatus 100 acquires the input Roman character string as the query 151, extracts the kana-kanji conversion candidate 170 or the Roman character conversion candidate 171 corresponding to the Roman character string, and transmits it to the user terminal 10.

ステップＳ４００において、ユーザ端末１０は、仮名漢字変換候補１７０又はローマ字変換候補１７１を一覧表示して、ユーザの変換候補を選択させる。 In step S400, the user terminal 10 displays a list of the kana-kanji conversion candidates 170 or the Romaji conversion candidates 171 and lets the user select a conversion candidate.

ステップＳ５００において、ユーザ端末１０は、検索実行要求を情報処理装置１００へ送信する。 In step S 500, the user terminal 10 transmits a search execution request to the information processing apparatus 100.

ステップＳ６００において、情報処理装置１００は、ステップＳ４００で選択された文字列をクエリとして検索を行い、通信ネットワーク５０を介して、検索ページをユーザ端末１０に送信する。 In step S 600, the information processing apparatus 100 performs a search using the character string selected in step S 400 as a query, and transmits a search page to the user terminal 10 via the communication network 50.

ステップＳ７００において、ユーザ端末１０は、検索ページを表示する。 In step S700, the user terminal 10 displays a search page.

（３．２）情報処理装置１００の変換候補抽出処理動作 (3.2) Conversion candidate extraction processing operation of information processing apparatus 100
図９は、実施形態に係る情報処理装置１００の変換候補抽出処理動作を示すフローチャートである。具体的には、図９は、ステップＳ３００における情報処理装置１００が、ローマ字変換候補１７１を抽出する処理の詳細動作を示す。 FIG. 9 is a flowchart showing the conversion candidate extraction processing operation of the information processing apparatus 100 according to the embodiment. Specifically, FIG. 9 shows the detailed operation of the process in step S300 in which the information processing apparatus 100 extracts the Romaji conversion candidate 171.

ステップＳ３０１において、クエリ取得部１０１は、通信ネットワーク５０を介して、ユーザ端末１０から入力中のクエリ１５１を逐次取得する。 In step S 301, the query acquisition unit 101 sequentially acquires the query 151 being input from the user terminal 10 via the communication network 50.

ステップＳ３０３において、入力候補インデックス抽出部１０３は、クエリ取得部１０１で取得したクエリ１５１に基づいて、入力候補インデックス記憶部１０５から検索を行い、対応する仮名漢字変換候補１７０が有るか否かを判定する。 In step S303, the input candidate index extraction unit 103 searches the input candidate index storage unit 105 based on the query 151 acquired by the query acquisition unit 101, and determines whether a corresponding kana-kanji conversion candidate 170 exists.

具体的には、入力候補インデックス抽出部１０３は、クエリ１５１に対応する仮名漢字変換候補１７０が有る場合、仮名漢字変換候補１７０を抽出し、処理をステップＳ３１３へ進める。入力候補インデックス抽出部１０３は、クエリ１５１に対応する仮名漢字変換候補１７０が無い場合、処理をステップＳ３０５へ進める。 Specifically, if there is a kana-kanji conversion candidate 170 corresponding to the query 151, the input candidate index extraction unit 103 extracts the kana-kanji conversion candidate 170 and advances the process to step S313. If there is no kana-kanji conversion candidate 170 corresponding to the query 151, the input candidate index extraction unit 103 advances the process to step S305.

ステップＳ３０５において、仮名変換部１０７は、入力されたクエリ１５１を仮名変換規則に基づいた変換文字列１６９に変換する。 In step S305, the kana conversion unit 107 converts the input query 151 into a conversion character string 169 based on the kana conversion rule.

ステップＳ３０７において、変換文字列判定部１０９は、変換文字列１６９が、１文字以上の仮名からなる仮名文字列１６３と、該仮名文字列１６３の後ろに続くとともに仮名変換規則を満たさない１文字以上のローマ字列である変換不可文字列１６５とによって構成されるか、又は１文字以上の仮名からなる仮名文字列１６３のみによって構成されるかを判定する。 In step S307, the converted character string determination unit 109 determines that the converted character string 169 follows the kana character string 163 including one or more kana characters and the kana character string 163 and does not satisfy the kana conversion rules. It is determined whether it is composed of a non-convertible character string 165 that is a Roman character string or only a kana character string 163 consisting of one or more kana characters.

具体的には、変換文字列判定部１０９は、変換文字列１６９が、１文字以上の仮名からなる仮名文字列１６３のみによって構成される場合、処理をステップＳ３０９へ進める。また、変換文字列判定部１０９は、変換文字列１６９が、１文字以上の仮名からなる仮名文字列１６３と、該仮名文字列１６３の後ろに続くとともに仮名変換規則を満たさない１文字以上のローマ字列とによって構成される場合、処理をステップＳ３１１へ進める。 Specifically, when the converted character string 169 consists of only a kana character string 163 of one or more kana characters, the converted character string determination unit 109 advances the process to step S309. When the converted character string 169 consists of a kana character string 163 of one or more kana characters followed by a Roman character string of one or more characters that does not satisfy the kana conversion rules, the converted character string determination unit 109 advances the process to step S311.

ステップＳ３０９において、入力候補インデックス抽出部１０３は、変換文字列１６９に対応する仮名漢字変換候補１７０を入力候補インデックス記憶部１０５から抽出する。 In step S 309, the input candidate index extraction unit 103 extracts the kana-kanji conversion candidate 170 corresponding to the converted character string 169 from the input candidate index storage unit 105.

ステップＳ３１１において、ローマ字対応インデックス抽出部１１１は、変換文字列１６９に対応するローマ字変換候補１７１をローマ字対応インデックス記憶部１１３から抽出する。 In step S 311, the Romaji correspondence index extraction unit 111 extracts the Romaji conversion candidate 171 corresponding to the conversion character string 169 from the Romaji correspondence index storage unit 113.

ステップＳ３１３において、入力候補送信部１１５は、ユーザ端末１０からの入力中の一連のローマ字列１５３について、変換候補をユーザ端末１０に逐次送信する。 In step S313, the input candidate transmission unit 115 sequentially transmits conversion candidates to the user terminal 10 for the series of Roman character strings 153 being input from the user terminal 10.
これにより、情報処理装置１００は、変換候補の抽出処理動作を終了する。 The information processing apparatus 100 thereby ends the conversion candidate extraction processing operation.
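Steps S303 to S311 can be read as a short lookup routine. The Python sketch below follows that order under stated assumptions: the two stores are plain dictionaries with made-up sample contents, `to_kana` uses a tiny hypothetical rule table, and `extract_candidates` is an illustrative name rather than anything defined in the source.

```python
# Hypothetical sample data standing in for storage units 105 and 113.
KANA_RULES = {"yu": "ゆ", "u": "う", "bi": "び", "nn": "ん"}
KANA_KANJI_INDEX = {
    "ゆう": ["郵便番号", "郵便局"],
    "ゆうびん": ["郵便番号", "郵便局", "郵便", "郵便料金"],
}
ROMAJI_INDEX = {"ゆうb": ["郵便番号", "郵便局", "郵便", "郵便料金"]}


def to_kana(romaji):
    """Greedy front-to-back conversion: (kana, non-convertible rest)."""
    kana, i = "", 0
    while i < len(romaji):
        for length in (2, 1):
            chunk = romaji[i:i + length]
            if chunk in KANA_RULES:
                kana += KANA_RULES[chunk]
                i += len(chunk)
                break
        else:
            break
    return kana, romaji[i:]


def extract_candidates(query):
    # S303: try the raw query against the kana-kanji index first.
    if query in KANA_KANJI_INDEX:
        return KANA_KANJI_INDEX[query]
    # S305: convert the Roman character string from the front.
    kana, rest = to_kana(query)
    # S307: kana only, or kana followed by non-convertible letters?
    if not rest:
        return KANA_KANJI_INDEX.get(kana, [])   # S309
    return ROMAJI_INDEX.get(kana + rest, [])    # S311
```

The returned list is what would be sent to the user terminal in step S313; an empty list means no candidate was found for the current input.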

（３．３）情報処理装置１００のインデックス作成処理動作 (3.3) Index creation processing operation of information processing apparatus 100
図１０は、実施形態に係る情報処理装置１００のインデックス作成処理動作を示すフローチャートである。具体的には、図１０は、ステップＳ３００における情報処理装置１００が、ローマ字変換候補１７１を作成するインデックス作成処理の詳細動作を示す。 FIG. 10 is a flowchart showing the index creation processing operation of the information processing apparatus 100 according to the embodiment. Specifically, FIG. 10 shows the detailed operation of the index creation process by which the information processing apparatus 100 creates the Romaji conversion candidates 171 used in step S300.
情報処理装置１００は、ローマ字列１５３からなるクエリ１５１に基づいて、ローマ字変換候補１７１を抽出するために、ローマ字対応インデックス１６１と、ローマ字変換候補１７１とを記憶するデータベースを以下の処理によりローマ字対応インデックス記憶部１１３に記憶する。 So that the Romaji conversion candidate 171 can be extracted based on a query 151 consisting of a Roman character string 153, the information processing apparatus 100 stores a database holding the Romaji correspondence indexes 161 and the Romaji conversion candidates 171 in the Romaji correspondence index storage unit 113 through the following process.

ステップＳ８０１において、クエリログ取得部１１７は、クエリログ記憶部１１９から、ユーザにより入力されたクエリ１５１を取得する。 In step S801, the query log acquisition unit 117 acquires the query 151 input by the user from the query log storage unit 119.

ステップＳ８０３において、ローマ字変換部１２１は、クエリ１５１をローマ字変換規則に基づいて、ローマ字列であるローマ字列候補１５７に変換する。 In step S803, the Romaji conversion unit 121 converts the query 151 into a Romaji string candidate 157, which is a Romaji string, based on a Romaji conversion rule.

ステップＳ８０５において、インデックス作成部１２５は、ローマ字変換部１２１により変換されたローマ字列候補１５７を先頭から任意の位置で区切り、区切った位置までのローマ字列候補１５７を仮名変換規則に基づいた変換文字列１６９に変換することで、仮名漢字対応インデックス１６０又はローマ字対応インデックス１６１を作成する。 In step S805, the index creation unit 125 delimits the Roman character string candidate 157 converted by the Roman character conversion unit 121 at an arbitrary position from the top, and converts the Roman character string candidates 157 up to the delimited position based on the kana conversion rules. By converting to 169, the kana-kanji correspondence index 160 or the Roman alphabet correspondence index 161 is created.

ステップＳ８０７において、インデックス作成部１２５は、仮名漢字対応インデックス１６０又はローマ字対応インデックス１６１を作成し、入力候補インデックス記憶部１０５又はローマ字対応インデックス記憶部１１３にそれぞれ記憶する。 In step S807, the index creating unit 125 creates the kana-kanji corresponding index 160 or the Roman character-corresponding index 161 and stores them in the input candidate index storage unit 105 or the Roman character-corresponding index storage unit 113, respectively.

また、インデックス作成部１２５は、クエリ１５１を仮名漢字対応インデックス１６０に対応する仮名漢字変換候補１７０として、入力候補インデックス記憶部１０５に記憶する。同様にして、インデックス作成部１２５は、クエリ１５１をローマ字対応インデックス１６１に対応するローマ字変換候補１７１として、ローマ字対応インデックス記憶部１１３に記憶する。 Further, the index creation unit 125 stores the query 151 in the input candidate index storage unit 105 as the kana-kanji conversion candidate 170 corresponding to the kana-kanji correspondence index 160. Similarly, the index creating unit 125 stores the query 151 in the Roman alphabet corresponding index storage unit 113 as the Roman alphabet conversion candidate 171 corresponding to the Roman alphabet corresponding index 161.

これにより、情報処理装置１００は、インデックス作成処理動作を終了する。 Thereby, the information processing apparatus 100 ends the index creation processing operation.

（５）作用・効果 (5) Operation and effects
以上説明したように、本実施形態によれば、情報処理装置１００は、ローマ字対応インデックス記憶部１１３と、仮名変換部１０７と、入力候補インデックス抽出部１０３とを備える。 As described above, according to the present embodiment, the information processing apparatus 100 includes the Romaji correspondence index storage unit 113, the kana conversion unit 107, and the input candidate index extraction unit 103.

仮名変換部１０７は、入力された一連のローマ字列１５３を仮名変換規則に基づいた変換文字列１６９に変換する。すなわち、仮名変換部１０７は、複数のローマ字変換の規格に基づいて、異なる入力規則によって入力されたローマ字列１５３が入力された場合でも、変換文字列１６９に変換する。 The kana conversion unit 107 converts the input series of Roman character strings 153 into a converted character string 169 based on the kana conversion rules. That is, the kana conversion unit 107 converts to a converted character string 169 even when a Roman character string 153 input according to different input rules is input based on a plurality of Roman conversion standards.

ローマ字対応インデックス記憶部１１３は、１文字以上の仮名からなる仮名文字列１６３と、仮名文字列１６３の後ろに続くとともに仮名変換規則を満たさない１文字以上のローマ字列である変換不可文字列１６５とからなるローマ字対応インデックス１６１と、ローマ字対応インデックス１６１に対応するローマ字変換候補１７１とを記憶する。 The Roman character correspondence index storage unit 113 includes a kana character string 163 composed of one or more kana characters, a non-convertible character string 165 that follows the kana character string 163 and is one or more roman character strings that do not satisfy the kana conversion rules, Are stored, and a Roman character conversion candidate 171 corresponding to the Roman character correspondence index 161 is stored.

すなわち、ローマ字入力に対応した変換候補のインデックスを、仮名漢字を含んだローマ字対応インデックス１６１として作成しているため、様々な入力規則のあるローマ字入力を吸収したインデックスを作成していると言える。 That is, since the conversion candidate index corresponding to the Romaji input is created as the Romaji corresponding index 161 including the kana / kanji, it can be said that the index is created by absorbing the Romaji input having various input rules.

このため、ローマ字対応インデックス記憶部１１３は、ローマ字入力の全ての規格を網羅するためにデータ量を膨大にすることなく、ローマ字からなる文字列に入力候補を対応付けたインデックスを記憶できる。これにより、複数のローマ字変換の規格に基づいて、異なる入力規則によってローマ字列１５３が入力された場合でも、該ローマ字列１５３に対応した変換候補を出力できる。従って、データ量の膨大化を抑制してローマ字入力に対応したインデックスを作成することで、情報処理装置１００にかかる負荷を低減できる。 For this reason, the Roman alphabet corresponding index storage unit 113 can store an index in which input candidates are associated with character strings composed of Roman characters without enlarging the amount of data in order to cover all the standards of Roman alphabet input. Thereby, even if the Roman character string 153 is input according to different input rules based on a plurality of Roman character conversion standards, conversion candidates corresponding to the Roman character string 153 can be output. Therefore, it is possible to reduce the load on the information processing apparatus 100 by creating an index corresponding to Roman character input while suppressing an increase in the amount of data.

また、入力候補インデックス抽出部１０３は、仮名変換部１０７に変換された変換文字列１６９に基づいて、ローマ字対応インデックス記憶部１１３からローマ字対応インデックス１６１を検索し、検索された該ローマ字対応インデックス１６１に対応するローマ字変換候補１７１をローマ字対応インデックス記憶部１１３から抽出する。 Further, the input candidate index extraction unit 103 searches the Roman character correspondence index 161 from the Roman character correspondence index storage unit 113 based on the converted character string 169 converted by the kana conversion unit 107, and the searched Roman character correspondence index 161 is searched. The corresponding Roman character conversion candidate 171 is extracted from the Roman character correspondence index storage unit 113.

すなわち、情報処理装置１００は、入力されたローマ字列１５３を変換文字列１６９に変換し、変換文字列１６９に対応するローマ字変換候補１７１を抽出できる。 That is, the information processing apparatus 100 can convert the input Roman character string 153 into the converted character string 169 and extract the Roman character conversion candidate 171 corresponding to the converted character string 169.

従って、情報処理装置１００は、情報処理装置１００にかかる負荷を軽減しつつ、ローマ字からなる入力に対する変換候補を表示できる。 Therefore, the information processing apparatus 100 can display conversion candidates for input composed of Roman characters while reducing the load on the information processing apparatus 100.

実施形態では、情報処理装置１００は、入力された文字列であるクエリ１５１をローマ字変換規則に基づいて、ローマ字列に変換するローマ字変換部１２１と、ローマ字変換部１２１により変換されたローマ字列を先頭から任意の位置で区切り、区切った位置までのローマ字列を仮名変換規則に基づいた変換文字列１６９に変換することで、ローマ字対応インデックス１６１を作成するインデックス作成部１２５とを更に備える。また、ローマ字対応インデックス記憶部１１３は、クエリ１５１をローマ字変換候補１７１として、ローマ字対応インデックス１６１に対応させて記憶する。 In the embodiment, the information processing apparatus 100 converts the query 151, which is an input character string, into a Roman character string based on a Roman character conversion rule, and a Roman character string converted by the Roman character conversion unit 121 at the top. Is further provided with an index creation unit 125 that creates a Roman character-corresponding index 161 by converting a Roman character string from an arbitrary position into a converted character string 169 based on a kana conversion rule. Further, the Roman alphabet corresponding index storage unit 113 stores the query 151 as the Roman alphabet conversion candidate 171 in association with the Roman alphabet corresponding index 161.

For this reason, words to be used as input candidates need not be registered in the Roman character correspondence index storage unit 113 in advance; the Roman character correspondence index 161 and the Roman character conversion candidates 171 can be updated from the queries 151 that are input as users use the information processing apparatus 100. In other words, the Roman character correspondence index storage unit 113 can be updated from user behavior data.

In the embodiment, the Roman character conversion unit 121 converts the query 151, based on a plurality of Roman character conversion rules that differ from one another, into Roman character string candidates 157a, 157b, ..., 157n, each following a different conversion rule.

For this reason, the Roman character correspondence index storage unit 113 can store Roman character correspondence indexes 161 that cover a plurality of differing Roman character conversion rules, without words to be used as input candidates being registered in advance. Accordingly, even when a Roman character string 153 is input according to any of several input conventions based on different standards, the information processing apparatus 100 can extract the Roman character conversion candidates 171 corresponding to that Roman character string 153.
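As a rough illustration of generating Roman character string candidates under several conversion rules, consider the sketch below. The text does not name the standards involved, so the use of Hepburn-style and Kunrei-style spellings is an assumption, as are the table contents, the function name `romaji_candidates`, and the choice to start from a kana reading of the query.

```python
# Two hypothetical rule sets (assumption): Hepburn-style and Kunrei-style
# spellings for a handful of kana. Real tables would be far larger.
HEPBURN = {"す": "su", "し": "shi", "ち": "chi", "つ": "tsu", "ふ": "fu", "か": "ka"}
KUNREI = {"す": "su", "し": "si", "ち": "ti", "つ": "tu", "ふ": "hu", "か": "ka"}
RULE_SETS = [HEPBURN, KUNREI]


def romaji_candidates(reading):
    """Return the distinct Roman strings for a kana reading, one per rule set."""
    candidates = []
    for rules in RULE_SETS:
        try:
            romaji = "".join(rules[ch] for ch in reading)
        except KeyError:
            # This rule set has no entry for some character; skip it.
            continue
        if romaji not in candidates:
            candidates.append(romaji)
    return candidates
```

Each candidate returned here would then be cut into prefixes and indexed separately, so that a user typing either `sus` or `sush` can reach the same stored query.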

In the embodiment, the Roman character correspondence index storage unit 113 stores the input query 151 as a Roman character conversion candidate 171. For this reason, information that is input successively as users use the information processing apparatus 100 can be extracted as Roman character conversion candidates 171.

(6) Other Embodiments
In the embodiment described above, when the converted character string 169 consists only of the kana character string 163, the kana-kanji conversion candidate 170 is stored in the input candidate index storage unit 105. When the converted character string 169 consists of the kana character string 163 and the non-convertible character string 165 that follows it, the Roman character conversion candidate 171 is stored in the Roman character correspondence index storage unit 113. The present invention is not limited to this arrangement: the input candidate index storage unit 105 and the Roman character correspondence index storage unit 113 may be held in a single storage unit. In that case, the input candidate index extraction unit 103 and the Roman character correspondence index extraction unit 111 may function as a single extraction unit. Furthermore, the kana-kanji conversion candidates 170 or Roman character conversion candidates 171 corresponding to the Roman character string 153 may be extracted simply as conversion candidates, without distinguishing between the two kinds.
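The single-storage variation could be sketched as below. This is a hypothetical illustration only: the class name `UnifiedCandidateStore` and its methods are invented for the example, and the store is keyed directly by the converted character string, whether or not it ends in an unconvertible Roman tail.

```python
from collections import defaultdict


class UnifiedCandidateStore:
    """One store and one extraction path for both kinds of index (assumption)."""

    def __init__(self):
        self._entries = defaultdict(list)

    def register(self, index_key, candidate):
        """Associate a candidate with an index key, ignoring duplicates."""
        if candidate not in self._entries[index_key]:
            self._entries[index_key].append(candidate)

    def extract(self, converted):
        """Return candidates for a converted string without distinguishing kinds."""
        return list(self._entries.get(converted, []))


store = UnifiedCandidateStore()
store.register("さく", "桜")   # kana-only key
store.register("さk", "桜")    # kana followed by a Roman tail
store.register("さk", "酒")
```

Because both key shapes live in the same mapping, the caller issues a single lookup with the converted character string and receives a plain list of conversion candidates.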

As described above, the present invention naturally includes various embodiments that are not described here. Accordingly, the technical scope of the present invention is defined only by the invention-specifying matters of the claims that are reasonable in light of the above description.

For example, the configuration diagrams, flowcharts, and conceptual diagrams are merely illustrative, and the presence or absence of each element, its order, and its specific content may be changed as appropriate. As one example, manual designation of blocks is not essential. Moreover, in the present invention, elements such as the respective means are not limited to the arithmetic control unit of a computer and may be implemented by electronic circuits based on wired logic or the like.

DESCRIPTION OF SYMBOLS
1 Information processing system
10 User terminal
30 User terminal
50 Communication network
100 Information processing apparatus
101 Query acquisition unit
103 Input candidate index extraction unit
105 Input candidate index storage unit
107 Kana conversion unit
109 Converted character string determination unit
111 Roman character correspondence index extraction unit
113 Roman character correspondence index storage unit
115 Input candidate transmission unit
117 Query log acquisition unit
119 Query log storage unit
121 Roman character conversion unit
123 Kana conversion dictionary storage unit
125 Index creation unit
126 Position
151 Query
153 Roman character string
157, 157a, 157b, 157n Roman character string candidates
160 Kana-kanji correspondence index
161 Roman character correspondence index
163 Kana character string
165 Non-convertible character string
169 Converted character string
170 Kana-kanji conversion candidate
171 Roman character conversion candidate
260 Query input field
270 Search button
280 Conversion candidate field

Claims

An information processing apparatus that acquires an input Roman character string and extracts conversion candidates corresponding to the Roman character string, the apparatus comprising:
storage means for storing an index consisting of a kana character string of one or more kana characters and a Roman character string of one or more characters that follows the kana character string and does not satisfy a kana conversion rule, together with conversion candidates corresponding to the index;
kana conversion means for converting the input series of Roman characters into kana from the beginning of the Roman character string based on the kana conversion rule; and
extraction means for searching the storage means for the index based on the character string converted by the kana conversion means, and extracting the conversion candidates corresponding to the retrieved index from the storage means.

The information processing apparatus according to claim 1, further comprising:
Roman character conversion means for converting a character string registered in advance into a Roman character string based on a Roman character conversion rule; and
index creating means for creating the index by dividing the Roman character string converted by the Roman character conversion means at an arbitrary position from the beginning and converting the Roman character string up to the division position based on the kana conversion rule,
wherein the storage means stores the character string registered in advance as a conversion candidate in association with the index created by the index creating means.

The information processing apparatus according to claim 2, wherein the Roman character conversion means converts the input character string into Roman character strings based on a plurality of Roman character conversion rules that differ from one another.

The information processing apparatus according to claim 1, wherein the storage means stores an input query as a conversion candidate.

A conversion candidate extraction method by which a computer acquires an input Roman character string and extracts conversion candidates corresponding to the Roman character string, the method comprising:
converting, by the computer, the input series of Roman characters into kana from the beginning of the Roman character string based on a kana conversion rule; and
searching, based on the converted character string, for an index in storage means that stores the index, which consists of a kana character string of one or more kana characters and a Roman character string of one or more characters that follows the kana character string and does not satisfy the kana conversion rule, together with conversion candidates corresponding to the index, and extracting the conversion candidates corresponding to the retrieved index from the storage means.

A program for acquiring an input Roman character string and extracting conversion candidates corresponding to the Roman character string, the program causing a computer to execute:
converting the input series of Roman characters into kana from the beginning of the Roman character string based on a kana conversion rule; and
searching, based on the converted character string, for an index in storage means that stores the index, which consists of a kana character string of one or more kana characters and a Roman character string of one or more characters that follows the kana character string and does not satisfy the kana conversion rule, together with conversion candidates corresponding to the index, and extracting the conversion candidates corresponding to the retrieved index from the storage means.