JP4978982B2

JP4978982B2 - Portable information terminal, character input support program and method

Info

Publication number: JP4978982B2
Application number: JP2005296853A
Authority: JP
Inventors: 紘明栗山
Original assignee: Sony Mobile Communications AB
Priority date: 2005-10-11
Filing date: 2005-10-11
Publication date: 2012-07-18
Anticipated expiration: 2025-10-11
Also published as: JP2007108881A

Description

The present invention relates to a portable information terminal capable of at least voice calls and character input, such as a mobile phone terminal, and to a character input support program and method for that terminal.

Portable information terminals such as mobile phone terminals have long been required to have smaller and lighter housings. In such terminals it is therefore difficult to allot much space to the operation devices of the user interface.

On the other hand, in recent years, functions such as character input and editing, and connecting to the Internet to display web pages, have become essential in these portable information terminals, just as in general personal computers.

In particular, for mobile phone terminals, recent usage surveys show that on average the number of e-mail communications exceeds the number of voice calls. This indicates that character input and editing has become one of the important functions of a mobile phone terminal.

In addition, recent mobile phone terminals have come to include a so-called predictive conversion function as standard, as a function for reducing the user's burden during character input.

Japanese Unexamined Patent Application Publication No. 2004-96502 (Patent Document 1) discloses a technique for performing speech recognition on recorded data and converting it into document data. If the technique described in that publication were used as a character input aid for a mobile phone terminal, for example, so that the text to be entered is recorded by voice and the recorded data is converted into document data, the conventional text input operation of pressing the numeric keypad and the like would presumably become unnecessary.

Patent Document 1: Japanese Unexamined Patent Application Publication No. 2004-96502

In character input using the predictive conversion function described above, the predictive conversion candidates shown on the display screen are drawn from a prediction candidate dictionary database prepared in advance, or from a learning dictionary database built from past input and the like.

Consequently, when an e-mail is composed on a mobile phone terminal, for example, suppose that at some point the predictive conversion candidates retrieved from the dictionary database and displayed for an input character “A” are “B”, “C”, “D”, and so on. Whoever the recipient of the e-mail is, whether a family member, a friend, or a business acquaintance, the candidates retrieved from the dictionary database for the input character “A” will still be “B”, “C”, “D”, and so on.

Therefore, if the predictive conversion candidates shown on the display screen do not suit the recipient of the e-mail, the user must, in order to compose text appropriate to that recipient, either perform an operation to display a different set of candidates or type the desired characters and words by hand.

One conceivable alternative is to provide several prediction candidate dictionary databases as user dictionaries, register the desired characters and words in each, and switch between them, for example per e-mail recipient. However, registering multiple user dictionaries and switching dictionaries according to the recipient is very laborious. Moreover, even if multiple user dictionaries have been registered, they must be updated continually to remain easy to use, and that updating also takes considerable time and effort, so the approach can hardly be called practical.

The present invention has been proposed in view of these circumstances. Its object is to provide a portable information terminal, a character input support program, and a method that reduce the user's workload during character input, allow the candidates retrieved from the dictionary database to be optimized, and thereby realize character input support that is easier for the user to use.

The portable information terminal of the present invention solves the above problems by having: a storage unit that stores conversation voice data; a word extraction unit that extracts words in the conversation from the conversation voice data by speech recognition; a database unit that registers the extracted words in a database together with at least one of the time of day of the conversation, the other party to the conversation, and the frequency with which each word appears in the conversation; a candidate search unit that, when the user enters text for an e-mail without having specified a recipient, as processing related to the conversation content, refers to at least one of the time of day, the other party, and the frequency in the database on the basis of the words used in the e-mail text being entered, estimates the recipient of the e-mail, and searches for predictive conversion candidates suited to the estimated recipient; and a presentation unit that presents the retrieved predictive conversion candidates to the user.

The character input support program of the present invention solves the above problems by causing the computer of a portable information terminal to function as: a storage unit that stores conversation voice data; a word extraction unit that extracts words in the conversation from the conversation voice data by speech recognition; a database unit that registers the extracted words in a database together with at least one of the time of day of the conversation, the other party to the conversation, and the frequency with which each word appears in the conversation; a candidate search unit that, when the user enters text for an e-mail without having specified a recipient, as processing related to the conversation content, refers to at least one of the time of day, the other party, and the frequency in the database on the basis of the words used in the e-mail text being entered, estimates the recipient of the e-mail, and searches for predictive conversion candidates suited to the estimated recipient; and a presentation control unit that causes a presentation unit to present the retrieved predictive conversion candidates.

The character input support method of the present invention solves the above problems by having: a step of storing conversation voice data; a step of extracting words in the conversation from the conversation voice data by speech recognition; a step of registering the extracted words in a database together with at least one of the time of day of the conversation, the other party to the conversation, and the frequency with which each word appears in the conversation; a step of, when the user enters text for an e-mail without having specified a recipient, as processing related to the conversation content, referring to at least one of the time of day, the other party, and the frequency in the database on the basis of the words used in the e-mail text being entered, estimating the recipient of the e-mail, and searching for predictive conversion candidates suited to the estimated recipient; and a step of presenting the retrieved predictive conversion candidates to the user.
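The recipient estimation and candidate search described in this summary might be sketched as follows. This is only an illustration under stated assumptions: the table rows, the scoring rule (summing the registered frequencies of the words already present in the draft), plain prefix matching, and the names `ROWS`, `estimate_partner`, and `candidates_for` are all introduced here and do not come from the specification.

```python
from collections import defaultdict

# Hypothetical rows of a call-related database: (word, call partner, frequency).
ROWS = [
    ("travel", "friend", 5),
    ("Italy", "friend", 4),
    ("cooking", "friend", 2),
    ("meeting", "boss", 6),
    ("product E", "boss", 3),
    ("travel", "boss", 1),
]


def estimate_partner(draft_words, rows):
    """Score each partner by the summed frequency of the draft words
    registered for that partner, and return the best-scoring partner."""
    scores = defaultdict(int)
    for word, partner, freq in rows:
        if word in draft_words:
            scores[partner] += freq
    if not scores:
        return None  # no registered word appears in the draft
    return max(scores, key=scores.get)


def candidates_for(prefix, partner, rows):
    """Return the words registered for `partner` that start with `prefix`,
    most frequent first (assumed ranking rule)."""
    hits = [(freq, word) for word, p, freq in rows
            if p == partner and word.lower().startswith(prefix.lower())]
    return [word for freq, word in sorted(hits, key=lambda h: -h[0])]


partner = estimate_partner({"travel", "Italy"}, ROWS)
suggestions = candidates_for("", partner, ROWS)
```

With a draft that already contains “travel” and “Italy”, this sketch picks the partner for whom those words carry the most weight and ranks that partner's words ahead of anything else.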

That is, according to the present invention, words extracted from conversation voice data are registered in a database together with at least one of the time of day of the conversation, the other party to the conversation, and the frequency with which each word appears in the conversation. When text for an e-mail is entered without a recipient having been specified, as processing related to the conversation content, the recipient of the e-mail is estimated from the words used in the e-mail text being entered, by referring to at least one of the time of day, the other party, and the frequency in the database, and predictive conversion candidates suited to the estimated recipient are searched for. Prediction candidate words that match the conversation content can therefore be presented.

According to the present invention, words extracted from conversation voice data are registered in a database together with at least one of the time of day of the conversation, the other party to the conversation, and the frequency with which each word appears in the conversation. When text for an e-mail is entered without a recipient having been specified, as processing related to the conversation content, the recipient is estimated from the words used in the e-mail text being entered, by referring to at least one of the time of day, the other party, and the frequency in the database, and predictive conversion candidates suited to the estimated recipient are searched for. The most suitable prediction candidate words for the conversation content can therefore be presented. This reduces the user's workload during character input and realizes character input support that is easier for the user to use.

An embodiment of the portable information terminal, character input support program, and method of the present invention is described below with reference to the drawings.

In this embodiment, a mobile phone terminal capable of at least voice calls over a mobile phone line is given as an example of the portable information terminal of the present invention. What is described here is, of course, only an example, and the present invention is not limited to it.

[Overview]
In the mobile phone terminal of this embodiment, while the user is on a call, the voice data of both the outgoing speech and the received speech is stored in a built-in mass storage device. During idle time, for example when the terminal is in standby and has battery charge to spare, speech recognition is performed on the stored call audio, and the words that occur often in the recognized conversation are extracted. The terminal then creates a database (hereinafter, the call-related database) that associates each word with information such as its frequency of appearance, its frequency distribution by time of day, other related words that occurred close to it in time, the degree of temporal proximity of those related words, and the call partner.

Then, when the user enters characters, the call-related database is used together with the ordinary predictive conversion dictionary database as a source of predictive conversion candidates, and candidate words are presented to the user.

As a result, when the user enters characters to compose an e-mail, for example, the mobile phone terminal of this embodiment can present predictive conversion candidates optimized for the recipient of that e-mail, so the user can compose the desired text with fewer input operations.
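Using the call-related database together with the ordinary dictionary could look roughly like the sketch below. It assumes that call-related matches are listed ahead of ordinary dictionary matches and that duplicates are dropped; the function name `merge_candidates`, its parameters, and the candidate limit are hypothetical and chosen only for illustration.

```python
def merge_candidates(prefix, call_db_words, dictionary_words, limit=5):
    """List call-related matches first, then ordinary dictionary matches,
    skipping any word that has already been listed."""
    merged = []
    for source in (call_db_words, dictionary_words):
        for word in source:
            if word.startswith(prefix) and word not in merged:
                merged.append(word)
    return merged[:limit]


# Words from the call-related database come before ordinary dictionary words.
shown = merge_candidates("tr", ["travel", "train"], ["tree", "travel", "trip"])
```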

Items from the terminal's phone book or address book (for example the call partner's name, address, date of birth, and hobbies) can also be added to the call-related database. The call-related database can also be built by incorporating previously entered text (for example, words in sent and received e-mails). Furthermore, when predictive conversion candidates are retrieved, the user's current situation may be inferred from the current date, the day of the week, the schedule, the currently set mode such as silent mode, position information detected by GPS (Global Positioning System), information on the currently connected base station, and so on, and candidates suited to that inference may be presented.

[Use Cases]
As a use case of the mobile phone terminal of this embodiment, FIG. 1 shows an example in which user 1 uses the mobile phone terminal to talk (on a call) with friend 2 about plans for a trip to Italy.

In the use case shown in FIG. 1, the mobile phone terminal of this embodiment stores the conversation voice data of both the outgoing and the received speech during the call about the Italy trip. Later, during idle time, for example when the terminal is in standby and has battery charge to spare, it performs speech recognition on the stored call audio and extracts the words that occur often in the conversation. In a conversation about planning a trip to Italy such as that of FIG. 1, travel-related words such as “travel”, “Italy”, and “cooking” appear frequently. The terminal therefore extracts these frequent words as important keywords of the conversation and registers them in the call-related database in association with their frequency, time of day, call partner, and so on.
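The extraction and registration of frequent words described above can be illustrated with a small sketch. It assumes that speech recognition has already produced a list of words, treats a word as frequent when it occurs at least `min_count` times in one call, and uses arbitrary hour boundaries for morning, daytime, and night; none of these thresholds or names (`time_of_day`, `register_call`) come from the specification.

```python
from collections import Counter


def time_of_day(hour):
    """Map an hour (0-23) to a coarse time of day; boundaries are assumptions."""
    if 5 <= hour < 11:
        return "morning"
    if 11 <= hour < 18:
        return "daytime"
    return "night"


def register_call(db, recognized_words, partner, hour, min_count=2):
    """Register words that occur at least `min_count` times in one call,
    keyed by (word, partner), with total and per-time-of-day counts."""
    counts = Counter(recognized_words)
    period = time_of_day(hour)
    for word, n in counts.items():
        if n >= min_count:
            entry = db.setdefault((word, partner),
                                  {"frequency": 0, "periods": Counter()})
            entry["frequency"] += n
            entry["periods"][period] += n
    return db


# Hypothetical recognition result for an evening call with a friend.
words = ["travel", "Italy", "travel", "cooking", "Italy", "travel", "plane"]
db = register_call({}, words, "friend", hour=20)
```

Words that appear only once in the call (“cooking” and “plane” in this made-up input) are not registered, which mirrors the idea of keeping only the keywords that dominate the conversation.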

After that, when user 1 composes an e-mail about the Italy travel plan with friend 2 as the recipient, for example, the mobile phone terminal of this embodiment refers to the call-related database together with the ordinary predictive conversion dictionary database. As a result, words such as “travel”, “Italy”, and “cooking” are presented preferentially, one after another, on display screen 3 of the terminal as the predictive conversion candidates best suited to the e-mail that user 1 is writing to friend 2. User 1 can therefore compose an e-mail appropriate to the recipient, friend 2, quickly and easily with little effort.

The mobile phone terminal of this embodiment can use the call-related database not only when composing e-mail as in the use case above, but also for many other kinds of character input, for example in schedule management, in a notepad, or when entering text while browsing web pages.

As another use case of the mobile phone terminal of this embodiment, FIG. 2 shows an example in which user 4 uses the mobile phone terminal to talk (on a call) with, for example, company supervisor 5 about a meeting schedule.

In the use case of FIG. 2, the mobile phone terminal of this embodiment stores the voice data of both the outgoing and the received speech during the call about the meeting schedule. Later, during idle time, for example when the terminal is in standby and has battery charge to spare, it performs speech recognition on the stored call audio and extracts the words that occur often in the conversation. In a conversation about a meeting schedule such as that of FIG. 2, meeting-related words such as “meeting room F”, “meeting”, and “product E” appear frequently. The terminal therefore extracts these frequent words as important keywords of the conversation and registers them in the call-related database in association with their frequency, time of day, call partner, and so on.

After that, when user 4 enters characters to register the meeting using the schedule book function of the mobile phone terminal, for example, the terminal refers to the call-related database together with the ordinary predictive conversion dictionary database. Thus, when user 4 enters a date and time in the schedule book, for example, words concerning the place and content of the meeting, such as “meeting room F”, “meeting”, and “product E”, are presented preferentially, one after another, on display screen 6 of the terminal as the predictive conversion candidates best suited to that schedule entry. User 4 can therefore enter the meeting schedule quickly and easily with little effort.

FIG. 3 shows an example of the call-related database as registered in, for example, the use case of FIG. 2.

In the example of FIG. 3, the call-related database registers, for example, the following items: “word reading kana” (labeled simply “word” in FIG. 3), “word kanji” (labeled “kanji”), “word frequency” (labeled “frequency”), “call partner”, “related words”, and “outgoing (received) voice data number”. “Word reading kana” holds the kana reading of a word obtained from the call audio by speech recognition. “Word kanji” holds the kanji corresponding to that reading. As the example of FIG. 3 shows, the call-related database can thus also handle homonyms. “Word frequency” holds a frequency level “a”, “b”, “c”, and so on, indicating how often the word appeared in calls; in this embodiment level “a” is the highest, followed by “b” and “c” in descending order. “Word frequency” also records, with a level of “high”, “medium”, or “low”, how often the word appeared in each time of day: “morning”, “daytime”, or “night”.

“Call partner” holds the call partner who used the word during a call, registered in association with, for example, the corresponding name in the phone book or address book of the mobile phone terminal. “Related words” holds other words that appeared in connection with the word during the call (for example, other words close to it in time). “Outgoing (received) voice data number” holds the file number of the outgoing voice data and the file number of the received voice data for that call. The mass storage device of the mobile phone terminal manages voice data by these file numbers.
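One way to picture a row of the database of FIG. 3 is as a small record type. The sketch below is an assumption made for illustration: the class name `CallRelatedEntry`, the field names, and all sample values (including the homonym pair and the file numbers) are hypothetical, and FIG. 3 itself is not reproduced here.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class CallRelatedEntry:
    reading: str                         # kana reading ("word" in FIG. 3)
    kanji: List[str]                     # written forms; several for homonyms
    frequency_level: str                 # "a" is highest, then "b", "c", ...
    time_of_day_levels: Dict[str, str]   # e.g. {"morning": "high", ...}
    call_partner: str                    # name as in the phone/address book
    related_words: List[str] = field(default_factory=list)
    outgoing_file_no: int = 0            # file number of outgoing voice data
    received_file_no: int = 0            # file number of received voice data


def by_frequency(entries):
    """Sort entries so that frequency level "a" comes first."""
    return sorted(entries, key=lambda e: e.frequency_level)


# Hypothetical sample rows; not taken from FIG. 3.
entries = [
    CallRelatedEntry("うちあわせ", ["打ち合わせ"], "b",
                     {"morning": "high", "daytime": "medium", "night": "low"},
                     "boss", ["製品E"], 12, 13),
    CallRelatedEntry("かいぎ", ["会議", "懐疑"], "a",
                     {"morning": "high", "daytime": "high", "night": "low"},
                     "boss", ["F会議室"], 12, 13),
]
ranked = by_frequency(entries)
```

Keeping the written forms as a list is one simple way to represent the homonym handling mentioned above; a real implementation could equally store one row per written form.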

[Hardware Configuration]
FIG. 4 shows a schematic internal configuration of the mobile phone terminal of this embodiment, which can realize what has been described above.

In FIG. 4, the data line is a transmission line for carrying various kinds of data such as voice data, e-mail data, and image data.

The control line is a transmission line for carrying various kinds of control information, such as control data from the control unit 30, which is constituted by a CPU (central processing unit).

The antenna 32 is connected to the communication circuit 31 and is used to transmit and receive radio signals for calls, e-mail communication, Internet connection, and the like.

The communication circuit 31 performs frequency conversion, modulation, demodulation, and the like on transmitted and received signals.

Call voice data received by the antenna 32 and the communication circuit 31 (received voice data) is sent to the voice processing unit 42. All other received data is sent to the control unit 30, processed appropriately, and then sent from the control unit 30 to the other units as needed. Received data other than received voice is packet communication data, for example e-mail data, moving or still image data, music and similar data, HTML (Hyper Text Markup Language) data, and program code data.

When received voice data is supplied from the antenna 32 and the communication circuit 31, the voice processing unit 42 decodes it, converts the decoded received voice data into an analog voice signal, and sends that analog received voice signal to the speaker 40 or to an earphone jack (not shown).

In this embodiment, the received voice data decoded by the voice processing unit 42 is stored in the mass storage device 45 in association with information that can identify the call partner, such as a telephone number, and with the date and time of the call. The partner identification information (telephone number and the like) and the call date and time information may instead be stored in the memory unit 35.

The speaker 40 amplifies the analog received voice signal supplied from the voice processing unit 42 with an internal amplifier, converts it into audible sound, and outputs it. The received voice is thus output from the speaker 40.

The microphone 41 converts the user's speech into an analog voice signal, amplifies it with an internal amplifier, and sends the resulting analog outgoing voice signal to the voice processing unit 42.

When an analog outgoing voice signal is supplied from the microphone 41, the voice processing unit 42 converts it into digital outgoing voice data, encodes that data, and sends the encoded outgoing voice data to the communication circuit 31. The outgoing voice data is thus transmitted from the antenna 32.

In this embodiment, the outgoing voice data digitized by the voice processing unit 42 is also stored in the mass storage device 45 in association with the partner identification information (telephone number and the like) and the call date and time information. In this case too, the partner identification information and the call date and time information may instead be stored in the memory unit 35.

The mass storage device 45 consists of a hard disk drive (HDD) or a large-capacity nonvolatile semiconductor memory such as flash memory, and stores the received voice data and outgoing voice data associated with the partner identification information (telephone number and the like) and the call date and time information described above. The outgoing and received voice data stored in the mass storage device 45 is read out and sent to the control unit 30 when the processing load on the control unit 30 is light, for example when the mobile phone terminal is in standby, and the battery has charge to spare.

At that time, the control unit 30 performs speech recognition on the voice data, extracts the words contained in the recognized call audio, builds a database that associates the extracted words with their frequency, time of day, call partner, and so on, and saves this call-related database in, for example, nonvolatile memory in the memory unit 35. The call-related database may instead be saved in the mass storage device 45. In addition, when the free space in the mass storage device 45 runs low, the control unit 30 deletes from it the call voice data for which speech recognition and the related processing have been completed.
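The conditions under which recognition runs and recordings are deleted can be summarized in a short sketch. The battery threshold, the free-space threshold, the state names, and the functions `can_run_recognition` and `files_to_delete` are assumptions made for illustration; the description gives no concrete values.

```python
def can_run_recognition(state, battery_percent, min_battery=50):
    """Run speech recognition only in standby with battery to spare.
    The 50 percent threshold is an assumed value."""
    return state == "standby" and battery_percent >= min_battery


def files_to_delete(recordings, free_bytes, low_water_bytes):
    """When free space is below the threshold, return the file numbers of
    recordings whose recognition has finished; otherwise delete nothing."""
    if free_bytes >= low_water_bytes:
        return []
    return [r["file_no"] for r in recordings if r["recognized"]]


# Hypothetical recordings held in the mass storage device.
recordings = [
    {"file_no": 12, "recognized": True},
    {"file_no": 13, "recognized": True},
    {"file_no": 14, "recognized": False},
]
to_delete = files_to_delete(recordings, free_bytes=10_000, low_water_bytes=50_000)
```

Recordings that have not yet been recognized are kept even when space is short, since their words have not yet been registered in the call-related database.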

The display unit 33 includes a display device such as a liquid crystal display and a display drive circuit, and displays images, characters, and the like on the display screen.

The operation unit 34 consists of controls such as a numeric keypad (keyboard), a power button, call and end-call buttons, and a jog dial, together with an operation signal generator that generates an operation signal when one of those controls is operated. By operating the operation unit 34, the user makes and receives calls, composes e-mail text, sends and receives e-mail, connects to the Internet, and so on.

The image processing unit 43, under the control of the control unit 30, decompresses and decodes compression-encoded image data read from the built-in memory, and sends the decoded image data to the display unit 33 via a data line. The image processing unit 43 also compression-encodes still image and moving image data captured by a camera unit (not shown), and, under the control of the control unit 30, sends the encoded image data to the built-in memory for storage.

The memory unit 35 includes a ROM (Read Only Memory) and a RAM (Random Access Memory). The ROM stores the OS (Operating System), control program code with which the control unit 30 controls each unit, various initial setting values, font data, dictionary data, sound data for ringtones, key operation sounds, and alarm sounds, program code for applications that create and edit e-mail, program code for applications that perform various kinds of processing on images and audio, the character input support program code according to the present embodiment, program code for the various other applications installed in the mobile phone terminal, identification information (ID) of the mobile phone terminal, and the like. In the present embodiment, the character input support program according to the present invention includes program code for recording call voice data, program code for voice recognition and word extraction, program code for creating the call-related database, program code for presenting candidate words with reference to the call-related database, and the like. The ROM includes rewritable ROM such as NAND-type flash memory or EEPROM (Electrically Erasable Programmable Read-Only Memory), and can also store e-mail data, telephone book and e-mail address book data, the call-related database according to the present embodiment, dictionary data of candidate words for ordinary predictive conversion, learning data from character input, still image and moving image data, sound data for key operation sounds, alarm sounds, and the like, and various user setting values. The RAM stores data as needed, serving as a work area when the control unit 30 performs various kinds of data processing.

The control unit 30 comprises a CPU. Based on the OS and the various programs stored in the memory unit 35, it performs the control and computation needed to realize the terminal's functions, including control of the mobile phone terminal itself, the call voice recording processing according to the present embodiment, the voice recognition and word extraction processing, the creation and updating of the call-related database, and the character input processing that refers to the call-related database. The voice recognition processing may instead be handled by the voice processing unit 42.

In addition, although not shown in FIG. 4, the mobile phone terminal of the present invention also has the components found in a typical mobile phone terminal, such as a browser function, an external memory interface, an external cable connector, an infrared communication function, a short-range wireless communication function, an electronic wallet function, a camera function, an LED (light emitting diode) light emitting function, a GPS function, a vibrator function, a battery, and a power control function.

[Data flow]
FIG. 5 shows the basic data flow among the main components of FIG. 4 when the character input support processing according to the present invention is executed. FIG. 5 illustrates the case in which the call-related database is stored in the large-capacity storage device 45.

In FIG. 5, during a call, the control unit 30 first executes the call voice data recording function 15 of the character input support processing of the present invention. It stores the uttered voice data, input via the microphone 41 and digitized by the voice processing unit 42, in the uttered voice data storage area 19 in the large-capacity storage device 45, and stores the received voice data, received by the communication circuit 31 and decoded by the voice processing unit 42, in the received voice data storage area 20 in the large-capacity storage device 45.

Next, when no call is in progress (for example, in the standby state), the CPU usage by other processing is low, and the remaining battery level is sufficient, the control unit 30 executes the voice recognition and word extraction function 16 of the character input support processing of the present invention. It reads the uttered voice data and the received voice data from the uttered voice data storage area 19 and the received voice data storage area 20 of the large-capacity storage device 45, and performs voice recognition and word extraction on them. It then builds a database of the extracted words together with their frequency, time of day, call partner, and other related words, sorts the database according to the user's settings, and registers it in the call-related database storage area 21 of the large-capacity storage device 45. Possible sorting methods under the user's settings include, for example, giving priority to frequency, giving priority to the morning hours, giving priority to a specific call partner, and using only the words of a designated conversation. The call-related database may instead be stored in the memory unit 35, as in the example of FIG. 4 described above.
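The four sorting methods named above can be sketched in Python as follows. This is only an illustration under stated assumptions: the `Entry` fields, the mode names, the definition of "morning" as 05:00 to 11:59, and the use of frequency as a tie-breaker are hypothetical and are not specified in the description.

```python
from dataclasses import dataclass


@dataclass
class Entry:
    word: str
    frequency: int        # how often the word appeared
    hour: int             # hour of day (0-23) at which it appeared
    partner: str          # call partner
    conversation_id: int  # which recorded conversation it came from


def sort_entries(entries, mode, partner=None, conversation_id=None):
    """Order database entries according to a user-selected mode (hypothetical names)."""
    if mode == "frequency":
        return sorted(entries, key=lambda e: -e.frequency)
    if mode == "morning":
        # Morning words first (assumed 05:00-11:59), then by frequency.
        return sorted(entries, key=lambda e: (not 5 <= e.hour < 12, -e.frequency))
    if mode == "partner":
        # Words used with the chosen partner first, then by frequency.
        return sorted(entries, key=lambda e: (e.partner != partner, -e.frequency))
    if mode == "conversation":
        # Only words from the designated conversation.
        ranked = sorted(entries, key=lambda e: -e.frequency)
        return [e for e in ranked if e.conversation_id == conversation_id]
    raise ValueError(f"unknown mode: {mode}")
```

Keeping the mode as a parameter of a single function mirrors the description, in which one database is reordered according to whichever setting the user has chosen.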

Thereafter, when the user enters text via the keyboard of the operation unit 34, for example to compose an e-mail, edit a schedule book, or write in a memo pad, the control unit 30 executes the candidate word presentation function 17 of the character input support processing of the present invention and presents prediction candidate words to the user while referring to the call-related database in the call-related database storage area 21 of the large-capacity storage device 45.

[State transition]
FIG. 6 shows a state transition diagram of the character input support processing according to the present invention.

In FIG. 6, whenever the mobile phone terminal of the present embodiment is in the call state S2, it transitions to the call recording state S3, performs the call recording processing, and sequentially stores the uttered voice data and the received voice data in the large-capacity storage device 45.

When the mobile phone terminal of the present embodiment is in the standby state S1, the CPU usage of the control unit 30 is low, and the remaining battery level is sufficient, it transitions to the word extraction state S4. There it performs voice recognition and word extraction on the voice data stored in the large-capacity storage device 45, builds a database of the words together with their appearance frequency, time of day, other related words, and the like, and thereby creates or updates the call-related database.

When the user begins entering characters and the mobile phone terminal of the present embodiment enters the character input state S5, it transitions to the candidate presentation state S6 and presents on the display the predictive conversion candidates obtained by referring to the call-related database.
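The transitions among states S1 to S6 can be summarized as a small state machine. The following Python sketch is illustrative only: the event names and the numeric thresholds for "low CPU usage" and "sufficient battery" are assumptions, since the description gives no concrete values, and the returns to standby follow the flowcharts of FIGS. 7 to 9.

```python
from enum import Enum


class State(Enum):
    STANDBY = "S1"
    CALL = "S2"
    CALL_RECORDING = "S3"
    WORD_EXTRACTION = "S4"
    CHARACTER_INPUT = "S5"
    CANDIDATE_PRESENTATION = "S6"


def next_state(state, event, cpu_load=0.0, battery=1.0,
               cpu_limit=0.3, battery_min=0.5):
    """Return the next state; thresholds and event names are hypothetical."""
    if state is State.STANDBY:
        if event == "call_started":
            return State.CALL
        if event == "input_started":
            return State.CHARACTER_INPUT
        if event == "idle" and cpu_load < cpu_limit and battery > battery_min:
            return State.WORD_EXTRACTION
    elif state is State.CALL:
        return State.CALL_RECORDING  # a call is always recorded
    elif state is State.CALL_RECORDING and event == "call_ended":
        return State.STANDBY
    elif state is State.WORD_EXTRACTION and event == "extraction_done":
        return State.STANDBY
    elif state is State.CHARACTER_INPUT:
        return State.CANDIDATE_PRESENTATION
    elif state is State.CANDIDATE_PRESENTATION and event == "input_ended":
        return State.STANDBY
    return state  # no transition for this event
```

For example, `next_state(State.STANDBY, "idle", cpu_load=0.1, battery=0.2)` stays in standby, because the assumed battery threshold is not met.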

[Flowchart of call recording processing]
FIG. 7 shows a flowchart of the call voice data recording program, which is the part of the character input support program according to the present invention that the control unit 30 executes to record calls.

In FIG. 7, when an outgoing or incoming call operation is performed and the call starts in step S10, the control unit 30, in step S11, sequentially records in the large-capacity storage device 45 the uttered voice data that is input through the microphone 41 and digitized, that is, the voice data sent to the call partner.

At the same time, in step S12, the control unit 30 sequentially records in the large-capacity storage device 45 the received voice data received through the communication circuit 31, that is, the voice data sent from the call partner.

Thereafter, when the call ends in step S13, the control unit 30 returns the mobile phone terminal to the standby state.

[Flowchart of voice recognition, word extraction, and call-related database creation and update processing]
FIG. 8 shows a flowchart of the part of the character input support program according to the present invention that the control unit 30 executes for voice recognition, word extraction, and creation and updating of the call-related database.

In FIG. 8, in step S20, the control unit 30 reads the uttered voice data and the received voice data recorded in the large-capacity storage device 45, performs voice recognition on them using voice recognition technology, and then extracts words.

Next, in step S21, the control unit 30 determines, for each extracted word, its appearance frequency, the time of day at which it appeared, and the other related words that precede and follow it. In step S22, it builds a database of the main words together with this information.

Thereafter, in step S23, the control unit 30 sorts the call-related database according to the mode set by the user and returns to the standby mode.
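Steps S21 to S23 can be sketched in Python as follows, assuming that voice recognition (step S20) has already turned each conversation into a list of words. The input layout, the `build_database` name, the treatment of "related words" as the immediately adjacent words, and the default sort by frequency are all assumptions for illustration; the selection of "main" words is omitted.

```python
from collections import Counter, defaultdict


def build_database(conversations):
    """Build call-related database entries from recognized conversations.

    Each conversation is a dict with 'partner', 'hour' and 'words'
    (a hypothetical layout; 'words' is the recognizer's output).
    """
    freq = Counter()
    hours = defaultdict(set)
    partners = defaultdict(set)
    related = defaultdict(Counter)
    for conv in conversations:
        words = conv["words"]
        for i, word in enumerate(words):
            freq[word] += 1                      # S21: appearance frequency
            hours[word].add(conv["hour"])        # S21: time of day
            partners[word].add(conv["partner"])
            # S21: words directly before and after count as related words.
            for neighbor in words[max(0, i - 1):i] + words[i + 1:i + 2]:
                related[word][neighbor] += 1
    database = [
        {
            "word": word,
            "frequency": count,
            "hours": sorted(hours[word]),
            "partners": sorted(partners[word]),
            "related": [r for r, _ in related[word].most_common()],
        }
        for word, count in freq.items()
    ]  # S22: one entry per word
    database.sort(key=lambda e: -e["frequency"])  # S23: frequency-priority mode
    return database
```

Given two conversations with a friend that both mention "Rome", the entry for "Rome" would come first with frequency 2 and would list the words adjacent to it as related words.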

[Flowchart of candidate word presentation processing]
FIG. 9 shows a flowchart of the part of the character input support program according to the present invention with which the control unit 30 presents prediction candidate words during character input by referring to the call-related database.

In FIG. 9, when the user enters characters, the control unit 30, in step S30, searches the call-related database using the characters currently being entered, or characters and words already entered, as the search key.

Next, in step S31, the control unit 30 presents the words hit by the search on the display as prediction candidates, in accordance with the mode set by the user.

Next, in step S32, the control unit 30 determines whether the user has selected one of the presented prediction candidates. If not, it returns to step S30 to search for and present new prediction candidates.

If, on the other hand, it determines in step S32 that the user has selected one of the presented prediction candidates, the control unit 30, in step S33, presents the words related to the selected word as the next predictive conversion candidates. That is, once a candidate word is selected, other words that were associated with it in past calls are presented one after another.

Next, in step S34, the control unit 30 determines whether the user has given an instruction to end character input. If not, it returns to step S32; if so, it returns to the standby mode.
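The two lookups in this flow, the search in steps S30 and S31 and the related-word follow-up in step S33, can be sketched in Python as follows. The entry layout (dicts with `word` and `related` keys) and the simple prefix match on the written form are assumptions; an actual Japanese input method would match on readings, which is not modeled here.

```python
def search_candidates(database, typed):
    """Steps S30/S31: words that start with the characters typed so far.

    `database` is assumed to be already sorted in the user's chosen mode,
    so the returned order is the presentation order.
    """
    return [entry["word"] for entry in database if entry["word"].startswith(typed)]


def related_candidates(database, selected):
    """Step S33: words associated with the selected word in past calls."""
    for entry in database:
        if entry["word"] == selected:
            return list(entry["related"])
    return []  # the selected word is not in the call-related database
```

In use, the terminal would call `search_candidates` on each keystroke until the user picks a word, then call `related_candidates` on that word to offer the next candidates.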

[Summary]
As described above, the mobile phone terminal according to the embodiment of the present invention builds a database of the words extracted from call voice in association with their appearance frequency, time, call partner, and other words, and presents predictive conversion candidates during character input by referring to that call-related database. This widens the range of data that the predictive conversion function can draw on and realizes high-performance character input support.

Further, because the mobile phone terminal of the present embodiment presents predictive conversion candidates by referring to the database of words extracted from call voice, it can present, for example, prediction candidates well suited to composing an e-mail related to a conversation held with a past call partner, or prediction candidates well suited to the application in use, such as a schedule book or memo pad. This reduces the user's character input burden and makes it possible to enter text faster, to write text tailored to the recipient, and to write text that ties in with the content of a conversation. In particular, when composing an e-mail to someone the user has spoken with before, the user can use words and expressions suited to that person, and discrepancies between the past conversation and the content of the e-mail can be reduced.

Further, because the mobile phone terminal of the present embodiment allows the call-related database to be sorted according to the user's settings, the user can set the mode in which candidates are searched for and presented from the call-related database to any desired mode, for example a mode that gives priority to frequency, a mode that gives priority to the morning hours, a mode in which the prioritized call partner is changed, or a mode restricted to the words of a designated conversation.

The above description of the embodiment is one example of the present invention. The present invention is therefore not limited to the embodiment described above, and various modifications can of course be made according to the design and the like without departing from the technical idea of the present invention.

For example, the portable information terminal of the present invention is not limited to a mobile phone terminal and can also be applied to various portable information terminals, such as a PDA equipped with a communication function.

The embodiment above presents predictive conversion candidates, but when composing an e-mail in a flexible format such as HTML, it is also possible to cut out the voice data around a keyword from the voice data stored in the large-capacity storage device 45 and attach it to the e-mail.

Further, because the present embodiment records the call voice data in the large-capacity storage device 45, it is also possible, during character input, to read from the large-capacity storage device 45 the voice of a word presented as a predictive conversion candidate, the voice data of that word and its surroundings, or voice data related to that word, and output it from the speaker 40.

As a further example, when no destination has been specified while an e-mail is being composed, it is possible to estimate the destination of the e-mail by referring to the call-related database on the basis of the words used in the text entered so far, and to retrieve from the call-related database the predictive conversion candidates that suit the estimated destination.
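One way to picture this destination estimation is the following Python sketch. The scoring rule (summing the frequency of each matching word per call partner) is an assumption chosen for illustration, as are the function names and the entry layout; the description only states that the destination is estimated from the words used in the e-mail by referring to the call-related database.

```python
from collections import Counter


def estimate_destination(database, mail_words):
    """Guess the e-mail destination from the words typed so far.

    Each entry is assumed to carry 'word', 'frequency' and 'partners'.
    Returns None when no typed word appears in the database.
    """
    scores = Counter()
    for entry in database:
        if entry["word"] in mail_words:
            for partner in entry["partners"]:
                scores[partner] += entry["frequency"]  # hypothetical scoring rule
    if not scores:
        return None
    return scores.most_common(1)[0][0]


def candidates_for(database, partner, typed):
    """Candidates limited to words used in calls with the estimated partner."""
    return [
        entry["word"]
        for entry in database
        if partner in entry["partners"] and entry["word"].startswith(typed)
    ]
```

Once a destination has been estimated, candidate lookup simply filters the same database by that partner, so the remaining input flow is unchanged.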

In addition, although the embodiment above uses Japanese as an example, the present invention can of course also be applied to languages other than Japanese.

FIG. 1 is a diagram showing, as a use case of the embodiment of the present invention, an example of predictive conversion candidate presentation after the user has talked with a friend on the mobile phone terminal about plans for a trip to Italy.
FIG. 2 is a diagram showing, as a use case of the embodiment of the present invention, an example of predictive conversion candidate presentation after the user has talked with a supervisor at work on the mobile phone terminal about a meeting schedule.
FIG. 3 is a diagram showing an example of the call-related database registered in the use case of FIG. 2.
FIG. 4 is a block diagram showing the schematic internal configuration of the mobile phone terminal of the embodiment of the present invention.
FIG. 5 is a basic data flow diagram of the main components of the mobile phone terminal of the present embodiment when the character input support function according to the present invention is executed.
FIG. 6 is a state transition diagram of the character input support processing according to the present invention.
FIG. 7 is a flowchart of the call voice data recording program executed by the control unit of the mobile phone terminal of the present embodiment.
FIG. 8 is a flowchart of the program for voice recognition, word extraction, and creation and updating of the call-related database executed by the control unit of the mobile phone terminal of the present embodiment.
FIG. 9 is a flowchart of the program with which the control unit of the mobile phone terminal of the present embodiment presents prediction candidate words during character input by referring to the call-related database.

Explanation of symbols

1, 4: user; 2: friend; 5: supervisor; 3, 6: display screen; 15: call voice data recording function; 16: voice recognition and word extraction function; 17: candidate word presentation function; 19: uttered voice data storage area; 20: received voice data storage area; 21: call-related database storage area; 30: control unit; 31: communication circuit; 32: communication antenna; 33: display unit; 34: operation unit; 35: memory unit; 40: speaker; 41: microphone; 42: voice processing unit; 43: image processing unit; 45: large-capacity storage device

Claims

A portable information terminal comprising:
a storage unit that stores conversation voice data;
a word extraction unit that extracts words in the conversation from the conversation voice data by voice recognition of the conversation voice data;
a database unit that registers, as a database, each word extracted by the word extraction unit together with at least one of the time of day of the conversation, the other party of the conversation, and the frequency of appearance of the word in the conversation;
a candidate search unit that, when a user inputs characters of an e-mail without specifying a destination as processing related to the conversation content, estimates the destination of the e-mail by referring to at least one of the time of day, the other party, and the frequency in the database on the basis of the words used in the e-mail text being input, and searches for predictive conversion candidates that match the estimated destination; and
a presentation unit that presents to the user the predictive conversion candidates found by the candidate search unit.

The portable information terminal according to claim 1, wherein the word extraction unit determines whether or not to execute voice recognition and word extraction of the conversation voice data based on whether or not other processing is performed and a remaining battery level.

The portable information terminal according to claim 1, wherein the candidate search unit determines a search mode when searching for a predictive conversion candidate from the database according to a user setting.

The portable information terminal according to claim 1, further comprising a communication unit that communicates at least through a mobile phone line,
wherein the storage unit stores conversation voice data from calls made through the mobile phone line.

The portable information terminal according to claim 4, wherein:
the storage unit stores the conversation voice data from calls made through the mobile phone line separately as uttered voice and received voice;
the word extraction unit performs voice recognition on the uttered voice and the received voice individually and extracts words from each; and
the database unit builds a database of the words extracted individually from the uttered voice and the received voice.

A character input support program that causes a computer of a portable information terminal to function as:
a storage unit that stores conversation voice data;
a word extraction unit that extracts words in the conversation from the conversation voice data by voice recognition of the conversation voice data;
a database unit that registers, as a database, each word extracted by the word extraction unit together with at least one of the time of day of the conversation, the other party of the conversation, and the frequency of appearance of the word in the conversation;
a candidate search unit that, when a user inputs characters of an e-mail without specifying a destination as processing related to the conversation content, estimates the destination of the e-mail by referring to at least one of the time of day, the other party, and the frequency in the database on the basis of the words used in the e-mail text being input, and searches for predictive conversion candidates that match the estimated destination; and
a presentation control unit that causes a presentation unit to present the predictive conversion candidates found by the candidate search unit.

A character input support method comprising:
a step of storing conversation voice data in a storage unit;
a step in which a word extraction unit extracts words in the conversation from the conversation voice data by voice recognition of the conversation voice data;
a step in which a database unit registers, as a database, each word extracted by the word extraction unit together with at least one of the time of day of the conversation, the other party of the conversation, and the frequency of appearance of the word in the conversation;
a step in which, when a user inputs characters of an e-mail without specifying a destination as processing related to the conversation content, a candidate search unit estimates the destination of the e-mail by referring to at least one of the time of day, the other party, and the frequency in the database on the basis of the words used in the e-mail text being input, and searches for predictive conversion candidates that match the estimated destination; and
a step in which a presentation unit presents to the user the predictive conversion candidates found by the candidate search unit.