JP2011186994A

JP2011186994A - Character input device and character input method

Info

Publication number: JP2011186994A
Application number: JP2010054277A
Authority: JP
Inventors: Hitoshi Ikeda; 仁池田
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2010-03-11
Filing date: 2010-03-11
Publication date: 2011-09-22

Abstract

<P>PROBLEM TO BE SOLVED: To improve the operability of a character input regarding a character input device and a character input method. <P>SOLUTION: An image processing unit generates mouth-shaped image data by inputting mouth-shaped images. A mouth-shaped image collation database registers vowel information including vowels and syllabic nasals and mouth-shaped image data corresponding to each vowel information. A word conversion dictionary database registers word candidates associated with the vowel information. A control unit performs character input control. The control unit also converts mouth-shaped image data which are generated from mouth shape changes which are image-inputted into vowel information, and retrieves word candidates associated with converted vowel information. <P>COPYRIGHT: (C)2011,JPO&INPIT

Description

本発明は、文字入力を行う文字入力装置および文字入力方法に関する。 The present invention relates to a character input device and a character input method for inputting characters.

携帯電話機の普及に伴い、携帯電話機を使って文字を入力して文章を作成する機会が増えており、文字入力の操作性、効率性が求められている。
携帯電話機のような入力キーの少ない機器では、１つのキーを複数回押下することによって、文字を変化させて入力することが一般的に行われている。また、携帯電話機に搭載されているマイクを使って音声を入力し、音声認識により文字を入力するといったことも行われている。 With the widespread use of mobile phones, opportunities to input text using mobile phones and create texts are increasing, and operability and efficiency of character input are required.
In a device having few input keys such as a cellular phone, it is generally performed by changing a character by pressing one key a plurality of times. In addition, voices are input using a microphone mounted on a mobile phone, and characters are input by voice recognition.

上記のような１つのキーを複数回押して文字入力をする場合、キーの数を減らせるので機器を小型化できるが、同一キーを複数回押さないと、希望する文字を入力できないので、操作回数が増大し、ユーザ操作が煩雑になる。 When inputting characters by pressing one key multiple times as described above, the number of keys can be reduced to reduce the size of the device. However, if the same key is not pressed multiple times, the desired character cannot be input. Increases and the user operation becomes complicated.

例えば、「０〜９」のテンキーしかない携帯電話機で、「こ」という文字を入力したい場合、テンキーの「２」を５回押下したりするなど、操作が面倒である。
一方、マイクを使って、音声認識により文字を入力する場合は、キー操作の煩雑さは解消できる。しかし、周りの雑音による認識不良の発生、または音声を発するために、入力しようとする文字が他人に聞こえてしまうなどの不都合があった。 For example, in a mobile phone having only a numeric keypad of “0 to 9”, when it is desired to input the character “ko”, the operation is troublesome, such as pressing “2” of the numeric keypad five times.
On the other hand, when inputting characters by voice recognition using a microphone, the complexity of key operations can be eliminated. However, there have been inconveniences such as the occurrence of poor recognition due to surrounding noise, or the fact that a character to be input can be heard by another person in order to make a voice.

これに対し、近年になって、携帯電話機に搭載されているカメラを用いて、口（口唇）の動きで文字を入力する機器が開発されている。これは、ユーザの口元の画像をカメラで取り込み、口の形状認識により文字入力を行うものである。口の形状認識により文字入力を行う従来技術としては、例えば、特許文献１〜３が提案されている。 On the other hand, in recent years, devices have been developed that input characters by the movement of the mouth (lip) using a camera mounted on a mobile phone. In this method, an image of a user's mouth is captured by a camera, and character input is performed by mouth shape recognition. For example, Patent Documents 1 to 3 have been proposed as conventional techniques for performing character input by mouth shape recognition.

特開２００５−３０９９５２号公報JP 2005-309952 A 特開２００５−１０８０７９号公報JP 2005-108079 A 特開２００２−２６９５４４号公報JP 2002-269544 A

口の形状認識により文字入力を行う場合、口の動きから判別できる入力文字は、母音情報になるので、子音情報は、他手段を用いて別途入力することになる。上記の特許文献１、２では、携帯電話機に搭載されているカメラで口形状の画像を入力して、入力文字の母音情報を口の形状で画像解析し、子音情報は、キー操作で入力を行っている。 When character input is performed by mouth shape recognition, the input character that can be discriminated from the movement of the mouth becomes vowel information, so consonant information is input separately using other means. In Patent Documents 1 and 2 described above, a mouth shape image is input by a camera mounted on a mobile phone, vowel information of an input character is analyzed by mouth shape, and consonant information is input by key operation. Is going.

しかし、特許文献１、２では、１文字ずつ母音に合わせて子音を入力しているので、１文字入力する度に、口の動きとキーの入力操作とを連動する必要があり、操作タイミングを逸しやすい。このため、正確かつスムーズに文字を入力することが容易ではなく、操作性が良好であるものとはいえない。 However, in Patent Documents 1 and 2, a consonant is input to the vowel one character at a time, so it is necessary to link the movement of the mouth and the key input operation every time one character is input. Easy to miss. For this reason, it is not easy to input characters accurately and smoothly, and it cannot be said that operability is good.

また、特許文献３では、キー操作が難しい人において、眼や顔の動きにより文字の指定を可能としているが、１文字を入力する毎に口と眼や顔の動きとを連動させる必要があるので、この場合も文字入力の操作が煩雑である。 In Japanese Patent Application Laid-Open No. 2004-260260, characters can be designated by eye and face movements for people who have difficulty in key operations. However, it is necessary to link the mouth and eye and face movements every time a character is input. Therefore, the character input operation is also complicated in this case.

本発明はこのような点に鑑みてなされたものであり、口の形状認識による文字入力において操作性の向上を図った文字入力装置を提供することを目的とする。
また、本発明の他の目的は、口の形状認識による文字入力において操作性の向上を図った文字入力方法を提供することである。 The present invention has been made in view of these points, and an object of the present invention is to provide a character input device that improves operability in character input by mouth shape recognition.
Another object of the present invention is to provide a character input method that improves operability in character input by mouth shape recognition.

上記課題を解決するために、文字入力装置が提供される。この文字入力装置は、口の形状の画像を入力して口形状画像データを生成する画像処理部と、母音および撥音を含む母音情報と、個々の前記母音情報に対応する前記口形状画像データとを登録する口形状画像照合データベースと、前記母音情報に関連する単語候補を登録する単語変換辞書データベースと、文字入力制御を行う制御部とを備える。また、前記制御部は、前記口形状画像データを前記母音情報に変換し、変換後の前記母音情報に関連する前記単語候補を検索する。 In order to solve the above problems, a character input device is provided. The character input device includes an image processing unit that inputs a mouth shape image to generate mouth shape image data, vowel information including vowels and sound repellent, and mouth shape image data corresponding to each of the vowel information. A mouth shape image matching database for registering words, a word conversion dictionary database for registering word candidates related to the vowel information, and a control unit for performing character input control. In addition, the control unit converts the mouth shape image data into the vowel information, and searches for the word candidates related to the converted vowel information.

操作性の向上を図ることが可能になる。 It becomes possible to improve operability.

文字入力装置の構成例を示す図である。It is a figure which shows the structural example of a character input device. 携帯電話機の外観構成を示す図である。It is a figure which shows the external appearance structure of a mobile telephone. 携帯電話機の機能ブロックを示す図である。It is a figure which shows the functional block of a mobile telephone. 文字入力画面の表示例を示す図である。It is a figure which shows the example of a display of a character input screen. 「あ」の口の画像を取り込んだ状態の画面表示例を示す図である。It is a figure which shows the example of a screen display of the state which captured the image of the mouth of "A". 単語候補の画面表示例を示す図である。It is a figure which shows the example of a screen display of a word candidate. 入力文字を選択する際の画面表示を示す図である。It is a figure which shows the screen display at the time of selecting an input character. 入力文字を確定した場合の画面表示を示す図である。It is a figure which shows the screen display at the time of confirming an input character. 口形状画像照合ＤＢの登録データを示す図である。It is a figure which shows the registration data of mouth shape image collation DB. 単語変換辞書ＤＢの登録データを示す図である。It is a figure which shows the registration data of word conversion dictionary DB. 単語候補の画面表示例を示す図である。It is a figure which shows the example of a screen display of a word candidate. 入力文字を選択する場合の画面表示を示す図である。It is a figure which shows the screen display in the case of selecting an input character. 入力文字を確定した場合の画面表示を示す図である。It is a figure which shows the screen display at the time of confirming an input character. 文字入力画面の表示例を示す図である。It is a figure which shows the example of a display of a character input screen. 日本語文字入力動作のフローチャートを示す図である。It is a figure which shows the flowchart of a Japanese character input operation. 文字入力画面の表示例を示す図である。It is a figure which shows the example of a display of a character input screen. アルファベット変換辞書ＤＢの登録データを示す図である。It is a figure which shows the registration data of alphabet conversion dictionary DB. アルファベット文字候補の画面表示例を示す図である。It is a figure which shows the example of a screen display of an alphabet character candidate. 入力文字を選択する際の画面表示を示す図である。It is a figure which shows the screen display at the time of selecting an input character. 入力文字を確定する画面表示を示す図である。It is a figure which shows the screen display which fixes an input character. アルファベットの全角／半角の切替を示す図である。It is a figure which shows switching of the full-width / half-width of the alphabet. 入力文字を確定する画面表示を示す図である。It is a figure which shows the screen display which fixes an input character. アルファベット文字入力動作のフローチャートを示す図である。It is a figure which shows the flowchart of an alphabet character input operation | movement. 携帯電話機の外観構成を示す図である。It is a figure which shows the external appearance structure of a mobile telephone.

以下、本発明の実施の形態を図面を参照して説明する。図１は文字入力装置の構成例を示す図である。文字入力装置１は、ユーザインタフェース部１０、画像処理部２０、口形状画像照合データベース３１、単語変換辞書データベース３２ａおよび制御部４０を備え、例えば、携帯電話機などの文字入力が行われる通信機器などに該当する。 Hereinafter, embodiments of the present invention will be described with reference to the drawings. FIG. 1 is a diagram illustrating a configuration example of a character input device. The character input device 1 includes a user interface unit 10, an image processing unit 20, a mouth shape image collation database 31, a word conversion dictionary database 32a, and a control unit 40. For example, the character input device 1 is used in a communication device such as a mobile phone where character input is performed. Applicable.

ユーザインタフェース部１０は、キー操作および画面表示を行う。画像処理部２０は、音声を発する際の口の形状の画像を入力して口形状画像データを生成する。口形状画像照合データベース３１は、母音および撥音を含む母音情報と、個々の母音情報に対応する口形状画像データとを登録する。単語変換辞書データベース３２ａは、母音情報に関連する単語候補を登録する。制御部４０は、文字入力制御を行う。 The user interface unit 10 performs key operations and screen display. The image processing unit 20 inputs an image of the mouth shape when sound is generated, and generates mouth shape image data. The mouth shape image collation database 31 registers vowel information including vowels and repellent sounds and mouth shape image data corresponding to individual vowel information. The word conversion dictionary database 32a registers word candidates related to vowel information. The control unit 40 performs character input control.

ここで、制御部４０は、画像入力された口形状変化から生成される１つまたは複数の口形状画像データを、口形状画像照合データベース３１に登録されている母音情報と照合し、一括して母音情報に変換する。 Here, the control unit 40 collates one or a plurality of mouth shape image data generated from the mouth shape change inputted as an image with vowel information registered in the mouth shape image collation database 31, and collectively Convert to vowel information.

例えば、画像入力された口形状の変化が１つであれば、１つの口形状画像データが生成される。そして、１つの該当口形状画像データを口形状画像照合データベース３１に登録されている母音情報と照合して、対応する１つの母音情報に変換する。 For example, if there is only one change in the mouth shape input as an image, one mouth shape image data is generated. Then, the corresponding mouth shape image data is collated with the vowel information registered in the mouth shape image collation database 31 and converted into one corresponding vowel information.

または、画像入力された口形状の変化として、例えば、互いに異なる３つの変化があったとすれば、３つの口形状画像データが生成される。そして、３つの該当口形状画像データを口形状画像照合データベース３１に登録されている母音情報とそれぞれ照合して、対応する３つの母音情報にそれぞれ変換する。 Alternatively, for example, if there are three different changes in the mouth shape input as an image, three mouth shape image data are generated. Then, the three corresponding mouth shape image data are collated with the vowel information registered in the mouth shape image collation database 31, respectively, and converted into the corresponding three vowel information.

口形状画像データを母音情報に変換した後は、変換後の母音情報に関連する単語候補について、単語変換辞書データベース３２ａを利用して検索して、画面に表示する。
このとき、該当単語候補があればユーザによって確定されるが、変換後の母音情報を含む単語候補の中に該当単語候補がない場合には、ユーザは子音情報を入力し、制御部４０は、母音情報と入力された子音情報とから、関連する単語候補を検索する。なお、詳細動作については具体例を挙げて後述する。 After the mouth shape image data is converted into vowel information, word candidates related to the converted vowel information are searched using the word conversion dictionary database 32a and displayed on the screen.
At this time, if there is a corresponding word candidate, it is determined by the user, but if there is no corresponding word candidate among the word candidates including the converted vowel information, the user inputs consonant information, A related word candidate is searched from the vowel information and the input consonant information. The detailed operation will be described later with a specific example.

次に文字入力装置１を携帯電話機に適用した場合の構成および動作について詳しく説明する。図２は携帯電話機の外観構成を示す図であり、図３は携帯電話機の機能ブロックを示す図である。携帯電話機１ａは、ユーザインタフェース部１０、画像処理部２０、メモリ部３０および制御部４０を備える。 Next, the configuration and operation when the character input device 1 is applied to a mobile phone will be described in detail. FIG. 2 is a diagram showing an external configuration of the mobile phone, and FIG. 3 is a diagram showing functional blocks of the mobile phone. The cellular phone 1a includes a user interface unit 10, an image processing unit 20, a memory unit 30, and a control unit 40.

ユーザインタフェース部１０は、画面表示部（ＬＣＤ：Liquid Crystal Display（液晶ディスプレイ））１１と入力操作部１２を含み、入力操作部１２は、テンキー１２ａ、選択キー１２ｂおよび確定キー１２ｃを含む。画像処理部２０は、カメラ２１と画像データ生成部２２を含む。 The user interface unit 10 includes a screen display unit (LCD: Liquid Crystal Display) 11 and an input operation unit 12, and the input operation unit 12 includes a numeric keypad 12a, a selection key 12b, and a confirmation key 12c. The image processing unit 20 includes a camera 21 and an image data generation unit 22.

メモリ部３０は、フラッシュメモリ等に該当し、口形状画像照合データベース（ＤＢ）３１、単語変換辞書ＤＢ３２ａおよびアルファベット変換辞書ＤＢ３２ｂを含む。制御部４０は、ＣＰＵ（Central Processing Unit）４１、ＲＡＭ（Random Access Memory）４２およびＲＯＭ（Read Only Memory）４３を含む。 The memory unit 30 corresponds to a flash memory or the like, and includes a mouth shape image collation database (DB) 31, a word conversion dictionary DB 32a, and an alphabet conversion dictionary DB 32b. The control unit 40 includes a CPU (Central Processing Unit) 41, a RAM (Random Access Memory) 42, and a ROM (Read Only Memory) 43.

カメラ２１は、画像を入力する。口形状認識による文字入力を行う場合は、ユーザの口領域の画像を入力する。画像データ生成部２２は、入力画像をデータ化する。口領域の画像が入力された場合には、口形状画像をデータ化して口形状画像データを生成する。 The camera 21 inputs an image. When inputting characters by mouth shape recognition, an image of the user's mouth area is input. The image data generation unit 22 converts the input image into data. When an image of the mouth area is input, the mouth shape image is converted into data to generate mouth shape image data.

口形状画像照合ＤＢ３１は、母音（「あ」、「い」、「う」、「え」、「お」）および撥音（「ん」）を含む母音情報と、個々の母音情報に対応する口形状画像データとを登録する。 The mouth shape image collation DB 31 stores vowel information including vowels (“A”, “I”, “U”, “E”, “O”) and sound repellent (“N”), and mouths corresponding to individual vowel information. Register shape image data.

単語変換辞書ＤＢ３２ａは、母音情報に関連する単語候補を登録する。アルファベット変換辞書ＤＢ３２ｂは、母音情報に関連するアルファベット候補を登録する。なお、単語変換辞書ＤＢ３２ａは、日本語（仮名）文字入力モード時に使用され、アルファベット変換辞書ＤＢ３２ｂは、アルファベット入力モード時に使用される。 The word conversion dictionary DB 32a registers word candidates related to vowel information. The alphabet conversion dictionary DB 32b registers alphabet candidates related to vowel information. The word conversion dictionary DB 32a is used in the Japanese (kana) character input mode, and the alphabet conversion dictionary DB 32b is used in the alphabet input mode.

制御部４０は、生成された口形状画像データから、口形状画像照合ＤＢ３１を用いて、単語単位に母音および「ん」の形状を認識する。そして、得られた母音および「ん」の母音情報にもとづき、単語変換辞書ＤＢ３２ａから該当母音情報に関連する単語候補を抽出する。 From the generated mouth shape image data, the control unit 40 recognizes the vowel and “n” shape in units of words using the mouth shape image matching DB 31. Then, based on the obtained vowels and vowel information of “n”, word candidates related to the corresponding vowel information are extracted from the word conversion dictionary DB 32a.

画面表示部１１は、抽出された単語候補を表示する。ユーザは、選択キー１２ｂを用いて、表示された単語候補の中から入力したい単語を選ぶ。選択キー１２ｂは、ユーザ操作にもとづいて、複数の単語候補の中から所望の単語を選択するためのキーである。また、確定キー１２ｃは、ユーザ操作によって、選択された単語を確定し、入力したい単語を決定するためのキーである。 The screen display unit 11 displays the extracted word candidates. The user uses the selection key 12b to select a word to be input from the displayed word candidates. The selection key 12b is a key for selecting a desired word from a plurality of word candidates based on a user operation. The confirmation key 12c is a key for confirming a selected word and determining a word to be input by a user operation.

なお、表示された単語候補の中に、入力したい単語がない場合は、ユーザは、入力したい単語の子音情報の１文字目を、テンキー１２ａを用いて入力する。テンキー１２ａは、ユーザ操作にもとづいて、子音情報を入力するためのキーである。 If there is no word to be input among the displayed word candidates, the user inputs the first character of the consonant information of the word to be input using the numeric keypad 12a. The numeric keypad 12a is a key for inputting consonant information based on a user operation.

制御部４０では、単語変換辞書ＤＢ３２ａから、先に抽出した単語候補に対して、１文字目の子音情報を加味して再び検索し、その結果の単語候補を抽出し、画面表示部１１は抽出された単語候補を表示する。この状態でも入力したい単語が表示されていない場合は、さらに２文字目の子音情報を入力することで、さらに単語候補を検索していく。 The control unit 40 searches the word candidate dictionary previously extracted from the word conversion dictionary DB 32a by adding the first character consonant information, extracts the word candidate as a result, and the screen display unit 11 is extracted. Display the word candidates. If the word to be input is not displayed even in this state, the word candidate is further searched by inputting the second character consonant information.

次に具体的な例を挙げて携帯電話機１ａの文字入力動作について詳しく説明する。最初に日本語文字入力の一例として、「愛（あい）」という単語を入力する場合について説明する。 Next, the character input operation of the mobile phone 1a will be described in detail with a specific example. First, as an example of Japanese character input, a case where the word “love” is input will be described.

図４は文字入力画面の表示例を示す図である。メール本文を作成する際の画面１１−１上には、確定した入力文字を表示する確定入力文字表示部１１ａ（メール本文に該当）、ユーザにキー操作をガイダンスするためのキー操作ガイダンス部１１ｂおよび入力文字候補を表示する候補文字表示部１１ｃが表示される。 FIG. 4 is a diagram showing a display example of the character input screen. On the screen 11-1 when creating the mail text, a confirmed input character display section 11a (corresponding to the mail text) for displaying the confirmed input characters, a key operation guidance section 11b for guiding the user to perform key operations, and A candidate character display portion 11c for displaying input character candidates is displayed.

まず、ユーザは、携帯電話機１ａを持ちカメラ２１に向かって、「あ」の口を動かして口形状画像を入力する。図５は「あ」の口の画像を取り込んだ状態の画面表示例を示す図である。口形状画像照合ＤＢ３１から、入力された口形状画像データに該当する母音が照合されて、「ａ」の母音が入力されたことを認識する。 First, the user holds the mobile phone 1a and moves the mouth of “A” toward the camera 21 to input a mouth shape image. FIG. 5 is a diagram illustrating a screen display example in a state in which an image of the mouth “A” is captured. The vowel corresponding to the inputted mouth shape image data is collated from the mouth shape image collation DB 31 to recognize that the vowel “a” is inputted.

画面１１−２において、候補文字表示部１１ｃには、「あ」の口の画像から認識される文字として、ひらがな大文字の「あ」、カタカナ大文字の「ア」、ひらがな小文字の「あ」、カタカナ小文字の「ア」が表示される。また、確定前の入力文字を表示する確定前入力文字表示部１１ｄには、「あ」が表示される。 In the screen 11-2, the candidate character display unit 11c displays hiragana uppercase “a”, katakana uppercase “a”, hiragana lowercase “a”, katakana as characters recognized from the “a” mouth image. A lowercase “A” is displayed. In addition, “a” is displayed in the input character display portion 11d before confirmation that displays the input character before confirmation.

次にユーザは「い」の口を動かす。図６は単語候補の画面表示例を示す図である。ユーザが「い」の口を動かして、カメラ２１に向かって口の動きを止める。すると、口形状画像照合ＤＢ３１から、入力された口形状画像データに該当する母音が照合されて、「ｉ」の母音が入力されたことを認識する。そして、単語変換辞書ＤＢ３２ａから、「あい」に対する関連する単語候補が抽出される。 Next, the user moves the mouth of “I”. FIG. 6 is a diagram illustrating a screen display example of word candidates. The user moves the mouth of “I” and stops the mouth movement toward the camera 21. Then, the vowel corresponding to the inputted mouth shape image data is collated from the mouth shape image matching DB 31 to recognize that the vowel “i” is inputted. Then, word candidates related to “ai” are extracted from the word conversion dictionary DB 32a.

画面１１−３において、抽出された単語が候補文字表示部１１ｃに表示される。図６の場合は６個の単語が抽出されて表示されている。また、確定前入力文字表示部１１ｄには、「あい」が表示される。 On the screen 11-3, the extracted word is displayed on the candidate character display unit 11c. In the case of FIG. 6, six words are extracted and displayed. Further, “ai” is displayed in the input character display portion 11d before confirmation.

図７は入力文字を選択する場合の画面表示を示す図である。ここでは「愛」という文字が画面１１−４の候補文字表示部１１ｃに表示されているので、画面１１−４において、選択キー１２ｂを下方に２回押下して「愛」という文字を選択する。 FIG. 7 is a diagram showing a screen display when an input character is selected. Here, since the character “love” is displayed on the candidate character display portion 11c of the screen 11-4, the character “love” is selected by pressing down the selection key 12b twice on the screen 11-4. .

図８は入力文字を確定した場合の画面表示を示す図である。図７のようにして入力文字を選択した後、確定キー１２ｃを１回押下することで、画面１１−５における確定入力文字表示部１１ａのメール本文内に「愛」の文字が展開される。 FIG. 8 is a diagram showing a screen display when an input character is confirmed. After selecting the input character as shown in FIG. 7, the character “love” is expanded in the mail text of the confirmed input character display portion 11a on the screen 11-5 by pressing the confirm key 12c once.

次に口形状画像照合ＤＢ３１と単語変換辞書ＤＢ３２ａについて説明する。図９は口形状画像照合ＤＢ３１の登録データを示す図である。口形状画像照合ＤＢ３１は、ａ（あ）、ｉ（い）、ｕ（う）、ｅ（え）、ｏ（お）の母音およびｎ（ん）を含む母音情報と、各母音情報に対応する口形状画像データとが登録される。 Next, the mouth shape image collation DB 31 and the word conversion dictionary DB 32a will be described. FIG. 9 is a diagram showing registration data in the mouth shape image collation DB 31. Mouth shape image matching DB 31 corresponds to vowel information including vowels of a (a), i (i), u (u), e (e), o (o) and n (n), and each vowel information. Mouth shape image data is registered.

また、文字を発音するときの口の形状の画像データである口形状画像データは、システム標準データとユーザ登録データに分けられる。システム標準データは、口形状画像照合ＤＢ３１にあらかじめ登録設定されているシステム標準の口形状画像データである。ユーザ登録データは、ユーザ個人が自身の口形状をカメラで入力して登録する口形状画像データである。 Mouth shape image data, which is mouth shape image data when a character is pronounced, is divided into system standard data and user registration data. The system standard data is system standard mouth shape image data registered and set in advance in the mouth shape image matching DB 31. The user registration data is mouth shape image data registered by an individual user who inputs his mouth shape with a camera.

このように、携帯電話機１ａを利用する複数のユーザを想定し、システム標準の他に、ユーザ自身が口形状画像データを登録できるようにして、複数ユーザの各々の口形状画像データを母音情報に対応させて保存する構成とした。これにより、母音情報と口形状画像データとの照合の精度を向上させることが可能になる。 In this way, assuming a plurality of users using the mobile phone 1a, in addition to the system standard, the user himself / herself can register mouth shape image data, and each mouth shape image data of the plurality of users is used as vowel information. It was set as the structure preserve | saved correspondingly. Thereby, it is possible to improve the accuracy of collation between the vowel information and the mouth shape image data.

ここで、カメラ２１を通じて画像入力して生成した口形状画像データから、「あ」〜「お」の母音または「ん」の母音情報に変換する際には、入力された口形状画像データと一致する母音情報を口形状画像照合ＤＢ３１から検出する。そして、口形状画像データを母音情報に変換して、該当母音文字を候補文字表示部１１ｃに表示する。 Here, when the mouth shape image data generated by inputting an image through the camera 21 is converted into the vowel information of “A” to “O” or “n”, it matches the input mouth shape image data. Vowel information to be detected is detected from the mouth shape image matching DB 31. Then, the mouth shape image data is converted into vowel information, and the corresponding vowel character is displayed on the candidate character display unit 11c.

図１０は単語変換辞書ＤＢ３２ａの登録データを示す図である。母音および「ん」の母音情報から単語候補（日本語）を表示するための登録データ例を示している。登録項目としては、“母音情報”、“読み”、“キー”、“表示”がある。なお、“キー”の欄に示される数字をテンキー１２ａで押下することで“表示”に記載されている該当文字が表示される。 FIG. 10 is a diagram showing registration data in the word conversion dictionary DB 32a. The example of the registration data for displaying a word candidate (Japanese) from the vowel information of a vowel and "n" is shown. Registered items include “vowel information”, “reading”, “key”, and “display”. It should be noted that the corresponding characters described in “Display” are displayed by pressing the number shown in the “Key” column with the numeric keypad 12a.

次に「下位（かい）」という単語を入力する場合について説明する。ユーザは、携帯電話機１ａを持ちカメラ２１に向かって、「か」、「い」の口を動かして、口形状画像を入力する。「下位」という単語の母音は、「あい」であるので、「か」の口形状は母音「あ」に変換され、「い」の口形状は母音「い」に変換される。画面の操作については、上述の図６まで同じ画面操作となる。 Next, a case where the word “lower” is input will be described. The user holds the mobile phone 1a, moves the mouths of “ka” and “i” toward the camera 21, and inputs a mouth shape image. Since the vowel of the word “lower” is “ai”, the mouth shape of “ka” is converted to the vowel “a”, and the mouth shape of “i” is converted to the vowel “i”. The screen operation is the same screen operation up to the above-described FIG.

図６の候補文字表示部１１ｃには、入力したい「下位」という文字が表示されていないので、ユーザは、「下位」の１文字目の子音である「か」の入力を行うため、テンキー１２ａの「２」のキーを１回押下する。 Since the character “lower” to be input is not displayed in the candidate character display portion 11c of FIG. 6, the user inputs “ka”, which is the first consonant of “lower”. Press the “2” key once.

図１１は単語候補の画面表示例を示す図である。制御部４０は、単語変換辞書ＤＢ３２ａから、母音「あい」と１文字目の子音「か」に関連する単語を検索し、画面１１−６において、検索した単語候補を候補文字表示部１１ｃに表示する。図の場合、６個の単語候補が表示されている。 FIG. 11 is a diagram illustrating a screen display example of word candidates. The control unit 40 searches the word conversion dictionary DB 32a for words related to the vowel “ai” and the first character consonant “ka”, and displays the searched word candidates on the candidate character display unit 11c on the screen 11-6. . In the case of the figure, six word candidates are displayed.

図１２は入力文字を選択する場合の画面表示を示す図である。入力したい「下位」という文字が候補文字表示部１１ｃに存在しているため、画面１１−７において、選択キー１２ｂを下方に４回押下して選択する。 FIG. 12 is a diagram showing a screen display when an input character is selected. Since the character “lower” to be input exists in the candidate character display portion 11c, the selection key 12b is pressed four times downward on the screen 11-7 for selection.

図１３は入力文字を確定した場合の画面表示を示す図である。図１２のようにして入力文字を選択した後、確定キー１２ｃを１回押下することで、画面１１−８における確定入力文字表示部１１ａのメール本文内に「下位」の文字が展開される。 FIG. 13 is a diagram showing a screen display when an input character is confirmed. After selecting an input character as shown in FIG. 12, by pressing the enter key 12c once, a “lower” character is expanded in the mail text of the confirmed input character display portion 11a on the screen 11-8.

なお、図１１の単語候補が表示される状態のときに、入力したい単語が表示されない場合は、さらに２文字目の子音情報をテンキー１２ａで入力して単語変換辞書ＤＢ３２ａを検索し、候補文字表示部１１ｃに入力したい文字を表示する。 If the word candidate to be input is not displayed when the word candidate of FIG. 11 is displayed, the second character consonant information is input with the numeric keypad 12a to search the word conversion dictionary DB 32a, and the candidate character display unit A character to be input is displayed in 11c.

次に画面表示の変形例について説明する。図１４は文字入力画面の表示例を示す図である。図５のように、口の動きを入力している段階において、画面１１−２ａの空きスペースに、カメラ２１で画像入力した口形状画像ｄ１を表示する。このような表示を行うことで、ユーザは、自分の口の動きを確認しながら操作をすることができるため、母音および「ん」の認識率を向上させることが可能になる。 Next, a modified example of the screen display will be described. FIG. 14 is a diagram showing a display example of the character input screen. As shown in FIG. 5, the mouth shape image d <b> 1 input by the camera 21 is displayed in the empty space on the screen 11-2 a when the mouth movement is input. By performing such display, the user can perform an operation while confirming the movement of his / her mouth, so that the recognition rate of vowels and “n” can be improved.

次に文字入力動作についてフローチャートを用いて説明する。図１５は日本語文字入力動作のフローチャートを示す図である。
〔Ｓ１〕ユーザは、カメラ２１を用いて、口の動きを１文字ずつ画像入力する。 Next, a character input operation will be described using a flowchart. FIG. 15 is a flowchart of the Japanese character input operation.
[S1] Using the camera 21, the user inputs an image of mouth movement character by character.

〔Ｓ２〕入力された口の動きから口形状画像照合ＤＢ３１を用い、口形状画像データを母音情報に変換する。
〔Ｓ３〕口形状が一定時間の間に動いているか否か（変化しているか否か）を判断する。一定時間の間、口の動きがない場合はステップＳ５へいき、一定時間の間に口が動く場合はステップＳ４へいく。 [S2] The mouth shape image data is converted into vowel information using the mouth shape image collation DB 31 from the input mouth movement.
[S3] It is determined whether or not the mouth shape has moved during a certain time (whether or not it has changed). If there is no movement of the mouth for a certain time, the process goes to step S5, and if the mouth moves for a certain time, the process goes to step S4.

〔Ｓ４〕カメラ２１から次の１文字を入力する。ステップＳ２へ戻る。
〔Ｓ５〕口の動きから解析したｍ個の母音情報から、単語変換辞書ＤＢ３２ａを用いて、母音または「ん」に関連する単語候補ｎ個を候補文字表示部１１ｃに表示する。 [S4] The next character is input from the camera 21. Return to step S2.
[S5] From the m vowel information analyzed from the mouth movement, n word candidates related to the vowel or “n” are displayed on the candidate character display unit 11c using the word conversion dictionary DB 32a.

〔Ｓ６〕キー入力待ち状態とする。
〔Ｓ７〕入力されたキーを認識する。選択キー１２ｂの場合はステップＳ８ａへいき、テンキー１２ａの場合はステップＳ８ｂへいき、確定キー１２ｃの場合はステップＳ８ｃへいく。また、文字入力を終了する場合は終了とする。 [S6] Wait for key input.
[S7] The input key is recognized. If it is the selection key 12b, go to step S8a, if it is the numeric keypad 12a, go to step S8b, and if it is the enter key 12c, go to step S8c. If the character input is to be terminated, the process is terminated.

〔Ｓ８ａ〕選択キー１２ｂが入力された場合は、該当の方向にカーソルを移動し、ステップＳ６へ戻り、キー入力待ち状態となる。
〔Ｓ８ｂ〕テンキー１２ａが入力された場合は、該当単語に対して、入力された子音情報を加味して、単語変換辞書ＤＢ３２ａから関連する単語候補検索し、検索結果を候補文字表示部１１ｃに表示する。ステップＳ６へ戻って、キー入力待ち状態となる。 [S8a] When the selection key 12b is input, the cursor is moved in the corresponding direction, and the process returns to step S6 to enter a key input waiting state.
[S8b] When the numeric keypad 12a is input, the related word candidate is searched from the word conversion dictionary DB 32a by adding the input consonant information to the corresponding word, and the search result is displayed on the candidate character display unit 11c. To do. Returning to step S6, a key input waiting state is entered.

〔Ｓ８ｃ〕確定キー１２ｃが入力された場合は、確定文字を確定入力文字表示部１１ａに表示する。
〔Ｓ８ｃ−１〕単語変換辞書ＤＢ３２ａは、確定される単語の頻度の高い順に画面の上位位置に表示されるように登録順を更新する。 [S8c] When the confirmation key 12c is input, the confirmation character is displayed on the confirmation input character display portion 11a.
[S8c-1] The word conversion dictionary DB 32a updates the registration order so that the word conversion dictionary DB 32a is displayed at a higher position on the screen in the descending order of the frequency of words to be determined.

以上説明したように、文字入力装置１を適用した携帯電話機１ａは、日本語入力において、文字または単語レベルで口の動きの画像を取り込み、該当単語の母音および「ん」を解析する。そして、単語変換辞書ＤＢ３２ａから、口の動きで入力された母音および「ん」の母音情報に関連する単語候補を検索してディスプレイに表示する。 As described above, the mobile phone 1a to which the character input device 1 is applied captures an image of mouth movement at the character or word level in Japanese input, and analyzes the vowel and “n” of the corresponding word. Then, word candidates related to the vowel input by the movement of the mouth and the vowel information of “n” are searched from the word conversion dictionary DB 32a and displayed on the display.

ディスプレイに表示した母音情報だけの単語候補の中に、入力したい単語が含まれていれば、選択キー１２ｂと確定キー１２ｃにより、入力文字を選択・確定する。また、ディスプレイに表示した単語候補の中に入力したい単語が含まれていない場合は、該当する子音を例えば、「０〜９」のテンキー１２ａで入力し、候補が出るまで子音を順次追加入力して検索を行う。 If a word to be input is included in the word candidates of only the vowel information displayed on the display, the input character is selected and confirmed by the selection key 12b and the confirmation key 12c. If the word candidate displayed on the display does not include the word to be input, for example, the corresponding consonant is input with the numeric keypad 12a of “0-9”, and additional consonants are sequentially input until the candidate appears. Search.

このように、母音情報だけを単語単位に先に入力して、まず母音情報だけで単語変換辞書ＤＢ３２ａを検索し、該当文字があれば確定し、なければ子音情報をキー操作で入力することになる。 In this way, only the vowel information is input in units of words, and the word conversion dictionary DB 32a is first searched only with the vowel information. Become.

このような構成により、１文字ずつ母音に合わせて子音を入力するといった操作が不要となる。したがって、１文字入力する度に、口形状の画像入力とキーの入力操作とを連動する必要がないため、操作が容易となり、正確かつスムーズに文字を入力することが可能になる。 With such a configuration, an operation of inputting consonants in accordance with the vowels character by character becomes unnecessary. Accordingly, since it is not necessary to link the mouth-shaped image input and the key input operation every time a character is input, the operation is facilitated, and the characters can be input accurately and smoothly.

次にアルファベットの文字入力について説明する。アルファベット入力の一例として、「ｇ」を入力する場合について説明する。
図１６は文字入力画面の表示例を示す図である。画面１１−１ａにおいて、キー操作ガイダンス部１１ｂに示される文字切替キー１１ｂ−１を押下することで、文字入力モードをアルファベット入力モードに切り替える。 Next, alphabetic character input will be described. As an example of alphabet input, a case where “g” is input will be described.
FIG. 16 is a diagram showing a display example of a character input screen. On the screen 11-1a, the character input mode is switched to the alphabet input mode by pressing the character switch key 11b-1 shown in the key operation guidance unit 11b.

ここでは、「ｇ」という文字を入力するため、ユーザは、口の形状を「じー」として、カメラ２１から口の動きを入力し、上述の口形状画像照合ＤＢ３１から、該当口形状画像データに対応する母音情報「ｉ」を得る。 Here, in order to input the letter “g”, the user inputs the movement of the mouth from the camera 21 with the mouth shape as “ji”, and the corresponding mouth shape image data from the mouth shape image matching DB 31 described above. The vowel information “i” corresponding to is obtained.

次に「ｉ」に該当するアルファベット情報を得るため、アルファベット変換辞書ＤＢ３２ｂを用いて、候補となるアルファベットを抽出して、候補文字表示部１１ｃに表示する。 Next, in order to obtain alphabet information corresponding to “i”, a candidate alphabet is extracted using the alphabet conversion dictionary DB 32b and displayed on the candidate character display unit 11c.

図１７はアルファベット変換辞書ＤＢ３２ｂの登録データを示す図である。母音情報からアルファベット候補を表示するための登録データ例を示している。登録項目としては、“母音情報”、“読み”、“キー”、“表示”がある。なお、“表示”には、全小（全角小文字）、全大（全角大文字）、半小（半角小文字）および半大（半角大文字）がある。 FIG. 17 is a diagram showing registration data in the alphabet conversion dictionary DB 32b. An example of registered data for displaying alphabet candidates from vowel information is shown. Registered items include “vowel information”, “reading”, “key”, and “display”. Note that “display” includes full-size (full-width lowercase letters), full-size (full-size uppercase letters), half-small (half-size lowercase letters), and half-large (half-size uppercase letters).

図１８はアルファベット文字候補の画面表示例を示す図である。画面１１−２ａにおいて、母音「ｉ」に対応するアルファベットは「ｂ」、「ｅ」、「ｇ」、「ｐ」、「ｔ」、「ｚ」と複数候補が表示されている。 FIG. 18 is a diagram showing a screen display example of alphabet character candidates. On the screen 11-2a, the alphabet corresponding to the vowel “i” is displayed as “b”, “e”, “g”, “p”, “t”, “z” and a plurality of candidates.

図１９は入力文字を選択する際の画面表示を示す図である。画面１１−３ａにおいて、入力したい「ｇ」に該当するテンキー１２ａの「４」を押下することで、候補文字表示部１１ｃに「ｇ」のみを表示する。または、画面１１−２ａの状態で選択キー１２ｂを３回押下してカーソルを下方に移動して「ｇ」を選択する。 FIG. 19 is a diagram showing a screen display when an input character is selected. By pressing “4” of the numeric keypad 12a corresponding to “g” to be input on the screen 11-3a, only “g” is displayed on the candidate character display portion 11c. Alternatively, in the state of the screen 11-2a, the selection key 12b is pressed three times to move the cursor downward and select “g”.

図２０は入力文字を確定する画面表示を示す図である。図１９のようにして入力文字を選択した後、確定キー１２ｃを押下することで、画面１１−４ａにおいて、確定入力文字表示部１１ａのメール本文内に「ｇ」の文字が展開される。 FIG. 20 is a diagram showing a screen display for confirming an input character. After selecting the input character as shown in FIG. 19, the character “g” is expanded in the mail text of the confirmed input character display portion 11a on the screen 11-4a by pressing the confirm key 12c.

図２１はアルファベットの全角／半角の切替を示す図である。図１９の状態で、全半大小ボタン１１ｂ−２を押下することで、アルファベット変換辞書ＤＢ３２ｂにある“表示”の文字を切替える。これにより、アルファベットの「全角／半角」および「大文字／小文字」の切替えを可能とする。画面１１−５ａでは、全半大小ボタン１１ｂ−２を１度押下した状態の例であり、「ｇ」の文字が「Ｇ」となっていることを表している。 FIG. 21 is a diagram illustrating switching between full-width / half-width alphabets. In the state of FIG. 19, the character of “display” in the alphabet conversion dictionary DB 32 b is switched by pressing the full half size button 11 b-2. As a result, the alphabet can be switched between “full-width / half-width” and “uppercase / lowercase”. The screen 11-5a is an example of a state where the full half size button 11b-2 is pressed once, and indicates that the letter “g” is “G”.

図２２は入力文字を確定する画面表示を示す図である。図２１のようにして入力文字を選択した後、確定キー１２ｃを押下することで、画面１１−６ａにおいて、確定入力文字表示部１１ａのメール本文内に「Ｇ」の文字が展開される。 FIG. 22 is a diagram showing a screen display for confirming an input character. By selecting the input character as shown in FIG. 21 and then pressing the confirmation key 12c, the character “G” is expanded in the mail text of the confirmed input character display portion 11a on the screen 11-6a.

次に文字入力動作についてフローチャートを用いて説明する。図２３はアルファベット文字入力動作のフローチャートを示す図である。
〔Ｓ１１〕ユーザは、カメラ２１を用いて、口の動きを１文字ずつ画像入力する。 Next, a character input operation will be described using a flowchart. FIG. 23 is a flowchart of the alphabet character input operation.
[S11] The user uses the camera 21 to input an image of mouth movement character by character.

〔Ｓ１２〕入力された口の動きから口形状画像照合ＤＢ３１を用い、口形状画像データを母音情報に変換する。
〔Ｓ１３〕口形状が一定時間の間に動いているか否かを判断する。一定時間の間、口が動かない場合はステップＳ１５へいき、一定時間の間に口が動く場合はステップＳ１４へいく。 [S12] The mouth shape image data is converted into vowel information from the input mouth movement using the mouth shape image matching DB 31.
[S13] It is determined whether or not the mouth shape moves during a certain time. If the mouth does not move for a certain time, the process goes to step S15. If the mouth moves for a certain time, the process goes to step S14.

〔Ｓ１４〕カメラ２１から次の１文字を入力する。ステップＳ１２へ戻る。
〔Ｓ１５〕口の動きから解析したｍ個の母音情報から、アルファベット変換辞書ＤＢ３２ｂを用いて、関連するアルファベット候補ｎ個を候補文字表示部１１ｃに表示する。 [S14] The next character is input from the camera 21. Return to step S12.
[S15] From the m vowel information analyzed from the movement of the mouth, n related alphabet candidates are displayed on the candidate character display unit 11c using the alphabet conversion dictionary DB 32b.

〔Ｓ１６〕キー入力待ち状態とする。
〔Ｓ１７〕入力されたキーを認識する。選択キー１２ｂの場合はステップＳ１８ａへいき、テンキー１２ａの場合はステップＳ１８ｂへいき、確定キー１２ｃの場合はステップＳ１８ｃへいく。また、文字入力を終了する場合は終了とする。 [S16] It is in a key input waiting state.
[S17] The input key is recognized. In the case of the selection key 12b, the process proceeds to step S18a, in the case of the numeric keypad 12a, the process proceeds to step S18b, and in the case of the confirmation key 12c, the process proceeds to step S18c. If the character input is to be terminated, the process is terminated.

〔Ｓ１８ａ〕選択キー１２ｂが入力された場合は、該当の方向にカーソルを移動し、ステップＳ１６へ戻り、キー入力待ち状態となる。
〔Ｓ１８ｂ〕テンキー１２ａが使用された場合は、キー情報（アルファベット変換辞書ＤＢ３２ｂの“キー”に該当）にもとづき、アルファベットの候補を特定して入力して、候補文字表示部１１ｃに表示する。ステップＳ１６へ戻り、キー入力待ち状態となる。 [S18a] When the selection key 12b is input, the cursor is moved in the corresponding direction, and the process returns to step S16 to enter a key input waiting state.
[S18b] When the numeric keypad 12a is used, based on the key information (corresponding to the “key” in the alphabet conversion dictionary DB 32b), alphabet candidates are specified and input and displayed on the candidate character display unit 11c. The process returns to step S16 and waits for key input.

〔Ｓ１８ｃ〕確定キー１２ｃが入力された場合は、確定文字を確定入力文字表示部１１ａに表示する。
〔Ｓ１８ｃ−１〕アルファベット変換辞書ＤＢ３２ｂは、確定されるアルファベットの頻度の高い順に画面の上位位置に表示されるように登録順を更新する。 [S18c] When the confirmation key 12c is input, the confirmation character is displayed on the confirmation input character display portion 11a.
[S18c-1] The alphabet conversion dictionary DB 32b updates the registration order so that the alphabetical conversion dictionary DB 32b is displayed at a higher position on the screen in the descending order of the alphabet frequency to be determined.

以上説明したように、文字入力装置１を適用した携帯電話機１ａは、アルファベット入力モードを有し、１つのアルファベット毎に口の動きの画像を取り込み、該当アルファベットの母音情報を解析する。そして、アルファベット変換辞書ＤＢ３２ｂから口の動きで入力された母音情報に関連するアルファベット１文字をディスプレイに表示する。 As described above, the mobile phone 1a to which the character input device 1 is applied has an alphabet input mode, takes an image of mouth movement for each alphabet, and analyzes the vowel information of the corresponding alphabet. Then, one alphabetic character related to vowel information input by mouth movement from the alphabet conversion dictionary DB 32b is displayed on the display.

ディスプレイに表示したアルファベットの候補の中に入力したいアルファベットが含まれていれば、選択キー１２ｂと確定キー１２ｃにより、入力文字を確定する（またはテンキー１２ａでアルファベットを直接特定する）。 If the alphabet to be input is included in the alphabet candidates displayed on the display, the input character is confirmed by the selection key 12b and the confirmation key 12c (or the alphabet is directly specified by the numeric keypad 12a).

ディスプレイに表示した候補の中に、入力したいアルファベットが含まれていない場合は、該当するアルファベットを例えば、「０〜９」のテンキー１２ａで入力することで、アルファベットの候補を特定して入力する。このように、日本語入力だけでなく、アルファベット入力も行うことが可能である。 When the alphabet to be input is not included in the candidates displayed on the display, the alphabet candidate is specified and input by inputting the corresponding alphabet with, for example, the numeric keypad 12a of “0-9”. Thus, not only Japanese input but also alphabet input can be performed.

図２４は携帯電話機の外観構成を示す図である。上記の携帯電話機１ａでは、入力操作をテンキー等のキーで行ったが、タッチパネル１１０のような入力手段を有する携帯電話機１ｂを使用してもよい。基本構成は携帯電話機１ａと同じなので説明は省略する。 FIG. 24 is a diagram showing an external configuration of a mobile phone. In the mobile phone 1a, the input operation is performed with a key such as a numeric keypad. However, a mobile phone 1b having an input unit such as the touch panel 110 may be used. Since the basic configuration is the same as that of the mobile phone 1a, the description is omitted.

以上説明したように、文字入力装置１では、個々の母音情報に対応する口形状画像データを登録する口形状画像照合ＤＢ３１と、母音情報に関連する単語候補を登録する単語変換辞書ＤＢ３２ａとを備えて、口形状画像データを母音情報に変換し、変換後の母音情報に関連する単語候補を検索する構成とした。 As described above, the character input device 1 includes the mouth shape image collation DB 31 for registering mouth shape image data corresponding to individual vowel information, and the word conversion dictionary DB 32a for registering word candidates related to vowel information. Thus, the mouth shape image data is converted into vowel information, and word candidates related to the converted vowel information are searched.

従来の口形状認識による文字入力では、１文字単位で、口形状認識による母音入力とキー操作等による子音入力とを連動して文字を入力していたため、操作性が悪く、文字入力の効率性が低かった。 In conventional character input by mouth shape recognition, characters are input in units of character in conjunction with vowel input by mouth shape recognition and consonant input by key operation etc., so operability is poor and character input efficiency Was low.

これに対し、文字入力装置１では、最初に口形状認識によって口形状画像データを母音情報に一括して変換し、母音入力だけを例えば、単語単位に先に入力しておき、単語変換辞書ＤＢ３２ａを用いて、入力した母音情報に関連する単語候補を検索する。そして、所望の単語が存在しない場合には、子音情報をキー操作で入力して、単語候補を再検索して、所望の単語を入力する構成とした。 On the other hand, in the character input device 1, the mouth shape image data is first converted into vowel information by mouth shape recognition, and only the vowel input is input first, for example, in units of words, and the word conversion dictionary DB 32a. Is used to search for word candidates related to the input vowel information. When the desired word does not exist, the consonant information is input by key operation, the word candidate is searched again, and the desired word is input.

このように、母音情報だけでは所望の単語が表示されない場合に、子音情報を順次入力して検索精度を高めていくとした構成を有するので、従来のような１文字単位で口の動きとキー操作を行って文字を入力するといった煩わしさがなくなり、従来の文字入力と比べて、操作性を格段に向上させることが可能になる。 As described above, when the desired word is not displayed only by the vowel information, the convolution information is sequentially input to improve the search accuracy. The trouble of inputting characters by performing operations is eliminated, and the operability can be significantly improved compared to conventional character input.

また、「０〜９」のテンキーしかないような携帯端末機器においても、文字入力装置１の口形状認識文字入力機能によって、人混みでも必要最小限のキー操作により、アルファベットも含めた操作性の容易な文字入力が可能になる。 Moreover, even in a portable terminal device having only a numeric keypad of “0-9”, the mouth shape recognition character input function of the character input device 1 facilitates operability including alphabets by a minimum key operation even in crowds. Character input becomes possible.

なお、上記の文字入力装置１の処理機能は、コンピュータによって実現することができる。その場合、文字入力装置１が有すべき機能の処理内容を記述したプログラム（文字入力制御プログラム）が提供される。そのプログラムをコンピュータで実行することにより、上記処理機能がコンピュータ上で実現される。処理内容を記述したプログラムは、コンピュータで読み取り可能な記録媒体に記録しておくことができる。 The processing functions of the character input device 1 can be realized by a computer. In that case, a program (character input control program) describing the processing contents of the functions that the character input device 1 should have is provided. By executing the program on a computer, the above processing functions are realized on the computer. The program describing the processing contents can be recorded on a computer-readable recording medium.

コンピュータは、ＣＰＵによって装置全体が制御される。ＣＰＵには、バスを介してＲＡＭ、ハードディスクドライブ（ＨＤＤ）、通信インタフェース、グラフィック処理装置、および入出力インタフェースが接続される。 The entire computer is controlled by a CPU. A RAM, a hard disk drive (HDD), a communication interface, a graphic processing device, and an input / output interface are connected to the CPU via a bus.

ＲＡＭには、ＣＰＵに実行させるＯＳ（Operating System）のプログラムや、文字入力制御を行うためのプログラムの少なくとも一部が一時的に格納される。また、ＲＡＭには、ＣＰＵによる処理に必要な各種データが格納される。ＨＤＤメッセージには、ＯＳやアプリケーションプログラムが格納される。 The RAM temporarily stores at least a part of an OS (Operating System) program to be executed by the CPU and a program for performing character input control. The RAM stores various data necessary for processing by the CPU. The HDD message stores the OS and application programs.

通信インタフェースは、ネットワークに接続されている。通信インタフェースは、ネットワークを介して、他のコンピュータとの間でデータの送受信を行う。グラフィック処理装置は、モニタが接続されている。グラフィック処理装置は、ＣＰＵからの命令にしたがって画像をモニタの画面に表示させる。 The communication interface is connected to the network. The communication interface transmits / receives data to / from other computers via a network. The graphic processing apparatus is connected to a monitor. The graphic processing device displays an image on a monitor screen in accordance with a command from the CPU.

入出力インタフェースには、キーボードとマウスとが接続されている。入出力インタフェースは、キーボードやマウスから送られてくる信号を、バスを介してＣＰＵに送信する。また、入出力インタフェースは、外部記憶媒体への情報の書き込みおよび外部記憶媒体への情報の読出しが可能な外部記憶媒体インタフェースと接続可能になっている。 A keyboard and a mouse are connected to the input / output interface. The input / output interface transmits signals sent from the keyboard and mouse to the CPU via the bus. The input / output interface can be connected to an external storage medium interface capable of writing information to the external storage medium and reading information from the external storage medium.

文字入力装置１は、各機能の処理内容を記述した文字入力制御プログラムをコンピュータで実行することにより実現することができる。すなわち、図１のユーザインタフェース部１０、画像処理部２０、口形状画像照合データベース３１、単語変換辞書データベース３２ａおよび制御部４０に対応する処理内容をプログラムとして記述する。ここで、記述したプログラムは、コンピュータで読み取り可能な記録媒体に記録しておくことができる。 The character input device 1 can be realized by executing a character input control program describing the processing contents of each function by a computer. That is, processing contents corresponding to the user interface unit 10, the image processing unit 20, the mouth shape image collation database 31, the word conversion dictionary database 32a, and the control unit 40 of FIG. 1 are described as a program. Here, the described program can be recorded on a computer-readable recording medium.

コンピュータで読み取り可能な記録媒体としては、磁気記憶装置、光ディスク、光磁気記録媒体、半導体メモリなどがある。磁気記憶装置には、ハードディスク装置、フレキシブルディスク（ＦＤ）、磁気テープなどがある。光ディスクには、ＤＶＤ、ＤＶＤ−ＲＡＭ、ＣＤ−ＲＯＭ／ＲＷなどがある。光磁気記録媒体には、ＭＯ（Magneto-Optical disc）などがある。 Examples of the computer-readable recording medium include a magnetic storage device, an optical disk, a magneto-optical recording medium, and a semiconductor memory. Magnetic storage devices include hard disk devices, flexible disks (FD), and magnetic tapes. Optical discs include DVD, DVD-RAM, CD-ROM / RW, and the like. Magneto-optical recording media include MO (Magneto-Optical disc).

プログラムを流通させる場合には、例えば、そのプログラムが記録されたＤＶＤ、ＣＤ−ＲＯＭなどの可搬型記録媒体が販売される。また、プログラムをサーバコンピュータの記憶装置に格納しておき、ネットワークを介して、サーバコンピュータから他のコンピュータにそのプログラムを転送することもできる。 When distributing the program, for example, a portable recording medium such as a DVD or a CD-ROM in which the program is recorded is sold. It is also possible to store the program in a storage device of a server computer and transfer the program from the server computer to another computer via a network.

また、上記の処理機能の少なくとも一部を、ＤＳＰ（Digital Signal Processor）、ＡＳＩＣ（Application Specific Integrated Circuit）、ＰＬＤ（Programmable Logic Device）などの電子回路で実現することもできる。 In addition, at least a part of the above processing functions can be realized by an electronic circuit such as a DSP (Digital Signal Processor), an ASIC (Application Specific Integrated Circuit), or a PLD (Programmable Logic Device).

プログラムを実行するコンピュータは、例えば、外部記憶媒体に記録されたプログラムまたはサーバプログラムから転送されたプログラムを、自己の記憶装置に格納する。そして、コンピュータは自己の記憶装置からプログラムを読み取り、プログラムにしたがった処理を実行する。なお、コンピュータは、外部記憶媒体から直接プログラムを読み取り、そのプログラムにしたがった処理を実行することもできる。また、コンピュータは、サーバコンピュータからプログラムが転送されるごとに、逐次受け取ったプログラムにしたがった処理を実行することもできる。 The computer that executes the program stores, for example, the program recorded in the external storage medium or the program transferred from the server program in its own storage device. Then, the computer reads the program from its own storage device and executes processing according to the program. The computer can also read a program directly from an external storage medium and execute processing according to the program. Further, each time the program is transferred from the server computer, the computer can also execute processing according to the sequentially received program.

以上、実施の形態を例示したが、実施の形態で示した各部の構成は同様の機能を有する他のものに置換することができる。また、他の任意の構成物や工程が付加されてもよい。なお、上記では、文字入力装置１を携帯電話機に適用した例を示したが、携帯電話機に限らず、カメラが搭載されて文字入力を行う装置全般に対して適用可能である。 As mentioned above, although embodiment was illustrated, the structure of each part shown by embodiment can be substituted by the other thing which has the same function. Moreover, other arbitrary structures and processes may be added. In addition, although the example which applied the character input device 1 to the mobile telephone was shown above, it is applicable not only to a mobile telephone but to the whole apparatus which mounts a camera and performs character input.

１文字入力装置
１０ユーザインタフェース部
２０画像処理部
３１口形状画像照合データベース
３２ａ単語変換辞書データベース
４０制御部 DESCRIPTION OF SYMBOLS 1 Character input device 10 User interface part 20 Image processing part 31 Mouth shape image collation database 32a Word conversion dictionary database 40 Control part

Claims

An image processing unit for inputting mouth shape images and generating mouth shape image data;
A mouth shape image collation database for registering vowel information including vowels and repellent sounds, and the mouth shape image data corresponding to each vowel information;
A word conversion dictionary database for registering word candidates related to the vowel information;
A control unit for performing character input control,
The control unit converts the mouth shape image data into the vowel information, and searches for the word candidates related to the converted vowel information.
A character input device characterized by that.

The said control part searches the said word candidate from the said vowel information and the input consonant information, when there is no applicable word candidate in the said word candidate containing the said vowel information. The character input device according to 1.

An alphabet conversion dictionary database for registering alphabet candidates related to the vowel information;
When the alphabet character input mode is set, the control unit converts the mouth shape image data into the vowel information, and searches for the alphabet candidates related to the converted vowel information.
The character input device according to claim 1.

The character input device according to claim 1, wherein the control unit displays a mouth shape at the time of image input on a screen.

In the character input method,
Mouth shape image data is generated by inputting the mouth shape image,
Register vowel information including vowels and repellent sounds and the mouth shape image data corresponding to each vowel information in a database,
Register word candidates related to the vowel information in the database;
The mouth shape image data is converted into the vowel information, and the word candidates related to the converted vowel information are searched.
Character input method characterized by this.

In a character input control program for causing a computer to execute character input,
In the computer,
Mouth shape image data is generated by inputting the mouth shape image,
Register vowel information including vowels and repellent sounds and the mouth shape image data corresponding to each vowel information in a database,
Register word candidates related to the vowel information in the database;
The mouth shape image data is converted into the vowel information, and the word candidates related to the converted vowel information are searched.
A character input control program for executing a process.