JP5287735B2

JP5287735B2 - Information processing device

Info

Publication number: JP5287735B2
Application number: JP2010003031A
Authority: JP
Inventors: 千加志杉浦; 岳彦井阪
Original assignee: Fujitsu Mobile Communications Ltd
Current assignee: Fujitsu Mobile Communications Ltd
Priority date: 2010-01-08
Filing date: 2010-01-08
Publication date: 2013-09-11
Anticipated expiration: 2030-01-08
Also published as: JP2011141819A

Description

本発明は、複数言語間の音声翻訳を行うことができる情報処理装置に関する。 The present invention relates to an information processing apparatus capable of performing speech translation between a plurality of languages.

世界のグローバル化が進む中で、母国語が異なる人々とのコミュニケーションの機会が増えている。例えば海外旅行中のショッピングやレストランでの会話、またビジネスシーンでの会話や電話会議などである。音声翻訳技術は異なる言語間でのコミュニケーションを円滑化する技術であり、最近では多くの翻訳装置が製品化されている。 As the world becomes more globalized, opportunities for communication with people with different native languages are increasing. For example, it may be shopping or restaurant conversation during overseas travel, business conversation or telephone conference. Speech translation technology is a technology that facilitates communication between different languages, and recently, many translation devices have been commercialized.

例えば特許文献１によると、円滑なコミュニケーションを実現することができる音声翻訳装置が提案されている。この音声翻訳装置は、音声認識により得られた翻訳対象言語の文字列を表示部のエリアＡに表示し、翻訳言語の文字列をエリアＢに表示するとともに、翻訳対象言語の文字列または翻訳言語の文字列から文書検索のためのキーワードを抽出し、検索されたキーワード用いて検索された文書の言語が翻訳対象言語の場合、翻訳言語に翻訳し、検索された文書の言語が翻訳言語の場合、翻訳対象言語に翻訳し、検索された文書をエリアＣに表示して、当該文書を翻訳した文書をエリアＤに表示するものである。 For example, according to Patent Document 1, a speech translation apparatus capable of realizing smooth communication has been proposed. This speech translation apparatus displays a character string of a translation target language obtained by speech recognition in area A of the display unit, displays a translation language character string in area B, and translates the character string or translation language of the translation target language. If a keyword for document search is extracted from the character string of, and the language of the document searched using the searched keyword is the translation target language, it is translated into the translation language, and the language of the searched document is the translation language The document translated into the language to be translated is displayed in the area C, and the document obtained by translating the document is displayed in the area D.

特開２００９−２０５５７９号公報JP 2009-205579 A

他言語間での会話で音声翻訳を利用するには、双方向の音声翻訳が必要となる。例えば日本語と英語の双方向の音声翻訳を例に挙げると、日本語音声認識処理モジュール、日英機械翻訳処理モジュール、英語音声合成処理モジュール、ならびに英語音声認識処理モジュール、英日機械翻訳処理モジュール、日本語音声合成処理モジュールの大きく分けて６つの処理モジュールが必要となる。そして、「音声認識」、「機械翻訳」、「音声合成」の処理技術においては、言語情報や波形情報を事前に学習した辞書データを利用する手法が一般的であり、各処理を実行するには、事前に辞書データをストレージなどから読み込むなどの初期化処理が必要となる。 To use speech translation in conversations between other languages, bidirectional speech translation is required. Take bi-directional speech translation between Japanese and English as an example. Japanese speech recognition processing module, Japanese-English machine translation processing module, English speech synthesis processing module, English speech recognition processing module, English-Japanese machine translation processing module Japanese speech synthesis processing modules are roughly divided into six processing modules. In the processing technology of “speech recognition”, “machine translation”, and “speech synthesis”, it is common to use dictionary data in which linguistic information and waveform information are learned in advance. Requires an initialization process such as reading dictionary data from storage or the like in advance.

ＰＣ（ＰｅｒｓｏｎａｌＣｏｍｐｕｔｅｒ）上でこのような翻訳技術を利用する場合など、ストレージとのデータ転送速度が速く、かつＲＡＭ（ＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ）サイズが大きい環境であれば、音声翻訳アプリ起動時に全ての処理モジュールの初期化を行っても処理時間がそれほど大きくならず、辞書データによるＲＡＭの占有も特に問題とはならない。 If such a translation technology is used on a PC (Personal Computer), if the data transfer speed with the storage is high and the RAM (Random Access Memory) size is large, all processing is performed when the speech translation application is started. Even if the module is initialized, the processing time does not increase so much, and the occupation of the RAM by the dictionary data is not particularly problematic.

しかしながら、例えば携帯端末のように、一般的に低スペックマシンにおいて音声翻訳アプリを動作させる場合には、ＲＡＭサイズが小さいため、音声翻訳アプリ起動時に全てのモジュールを初期化することができない状況が発生する。また、仮に全てのモジュールの初期化が可能なだけのＲＡＭサイズが確保できたとしても、音声着呼やメール受信などの他のプロセスが動作する可能性があるため、ＲＡＭに適当な空き領域を確保しておく必要がある。 However, when a speech translation application is generally operated on a low-spec machine such as a portable terminal, for example, a situation occurs in which all modules cannot be initialized when the speech translation application is activated because the RAM size is small. To do. Even if a RAM size sufficient to initialize all modules can be secured, other processes such as incoming voice calls and mail reception may operate. It is necessary to secure it.

一方で、音声翻訳アプリ起動時に初期化を行うのではなく、処理を動作させるときのみ初期化を行い、処理が終了したら辞書データをＲＡＭから解放する方法も考えられる。しかしながらこの方法では、音声翻訳アプリで音声翻訳を実行する都度、初期化処理が必要となってしまう。特に携帯端末ではデータ転送速度がＰＣなどに比べると遅いことが多く、また演算能力も低いため、音声翻訳を実行するためにかなりの時間を要することとなる。 On the other hand, there is also a method in which initialization is not performed when the speech translation application is activated, but initialization is performed only when the process is operated, and the dictionary data is released from the RAM when the process is completed. However, this method requires an initialization process every time voice translation is executed by the voice translation application. In particular, a portable terminal often has a slower data transfer rate than a PC or the like, and has a low calculation capability, so that a considerable amount of time is required to execute speech translation.

また、音声翻訳アプリを実現するために、ＲＡＭサイズと演算を多く要する処理を、通信網を介したサーバで実現する方法もある。しかしこの方法は通信可能な環境でないと音声翻訳を実現することができず、例えば音声翻訳が最も活躍すべき海外においては、通信できない環境も多く、通信ができるとしても割高な通信料が発生することが多い。 There is also a method for realizing processing that requires a lot of RAM size and computation in a server via a communication network in order to realize a speech translation application. However, this method cannot achieve speech translation unless the environment is communicable. For example, in overseas countries where speech translation is most active, there are many environments where communication is not possible, and even if communication is possible, a high communication fee is incurred. There are many cases.

これらのような理由により、従来、スタンドアロンで音声翻訳機能を実行する際に処理時間を短縮させることが困難であった。 For these reasons, conventionally, it has been difficult to shorten the processing time when executing the speech translation function in a stand-alone manner.

本発明は、上記課題を鑑みてなされてものであり、スタンドアロンでの音声翻訳機能を備えた情報処理装置であって、使用可能なＲＡＭサイズに制約がある場合においても、モジュールの初期化タイミングを制御することで、音声翻訳処理時間が最小限に抑えられた情報処理装置を提供することを目的とする。 The present invention has been made in view of the above problems, and is an information processing apparatus having a stand-alone speech translation function, and the module initialization timing can be set even when the usable RAM size is limited. It is an object of the present invention to provide an information processing apparatus in which the speech translation processing time is minimized by controlling.

上記課題を解決するために、本発明に係る情報処理装置は、翻訳処理に関する複数のモジュールを有する情報処理装置であって、翻訳処理が実行される際、翻訳処理に使用できるＲＡＭの空き領域を算出する第１の算出手段と、前記空き領域および前記複数のモジュールの各々が処理を行う際にＲＡＭの記憶領域を占有する最大サイズと前記複数のモジュールの各々の初期化に必要なＲＡＭの記憶領域と前記複数のモジュールの各々の初期化に要する時間とに基づいて前記複数のモジュールから初期化可能なモジュールを選択する選択手段と、前記選択手段により選択されたモジュールを翻訳処理が実行される前に初期化する第１の初期化手段と、翻訳処理の実行に伴って前記いずれかのモジュールが処理を行う際、このモジュールが前記第１の初期化手段により初期化されていなかった場合、このモジュールの処理開始時にこのモジュール初期化するとともに翻訳処理終了時に解放する第２の初期化手段と、を備えることを特徴とする。 In order to solve the above-described problem, an information processing apparatus according to the present invention is an information processing apparatus having a plurality of modules related to translation processing. When the translation processing is executed, a free RAM area that can be used for translation processing is allocated. First calculation means for calculating, a maximum size that occupies a storage area of the RAM when each of the free area and the plurality of modules performs processing, and storage of the RAM necessary for initialization of each of the plurality of modules Selection means for selecting a module that can be initialized from the plurality of modules based on the area and the time required for initialization of each of the plurality of modules, and translation processing is performed on the module selected by the selection means A first initialization unit that initializes before, and when any of the modules performs a process in accordance with the execution of the translation process, If that has not been initialized by the initialization means, characterized in that it comprises a second initialization means for releasing during translation processing ends as well as the processing start module initialization when this module.

本発明に係る情報処理装置によると、スタンドアロンでの音声翻訳機能を備えるとともに、使用可能なＲＡＭサイズに制約がある場合においても、モジュールの初期化タイミングを制御することで、音声翻訳処理時間を最小限に抑えることが可能となる。 According to the information processing apparatus according to the present invention, the voice translation processing time can be minimized by controlling the initialization timing of the module even when there is a restriction on the usable RAM size as well as the stand-alone speech translation function. It becomes possible to limit to the limit.

本発明に係る情報処理装置（携帯電話機）の開いた状態を示す斜視図。The perspective view which shows the state which the information processing apparatus (mobile telephone) based on this invention opened. 本発明に係る情報処理装置（携帯電話機）の構成を示すブロック図。1 is a block diagram showing a configuration of an information processing apparatus (mobile phone) according to the present invention. 本発明に係る情報処理装置（携帯電話機）において翻訳制御処理が行われている際の画面も一例を示す図。The figure which shows an example of the screen at the time of the translation control process being performed in the information processing apparatus (cellular phone) which concerns on this invention. モジュールリスト情報の一例を示すデータ構成図。The data block diagram which shows an example of module list information. モジュール初期化時間情報の一例を示すデータ構成図。The data block diagram which shows an example of module initialization time information. 初期化パターン情報の一例を示すデータ構成図。The data block diagram which shows an example of initialization pattern information. 本発明に係る情報処理装置（携帯電話機）における翻訳制御処理の手順を示すフローチャート。The flowchart which shows the procedure of the translation control process in the information processing apparatus (mobile telephone) which concerns on this invention. 本発明に係る情報処理装置（携帯電話機）における事前初期化処理の手順を示すフローチャート。The flowchart which shows the procedure of the pre-initialization process in the information processing apparatus (cellular phone) concerning this invention. （Ａ）は、識別符号と初期化状態との関係を示す対応表、（Ｂ）は、モジュール初期化状態情報の一例を示すデータ構成図。(A) is a correspondence table showing the relationship between identification codes and initialization states, and (B) is a data configuration diagram showing an example of module initialization state information. 本発明に係る情報処理装置（携帯電話機）が翻訳制御処理を行う際の、演算処理時の初期化処理の手順を示すフローチャート。The flowchart which shows the procedure of the initialization process at the time of a calculation process at the time of the information processing apparatus (mobile telephone) which concerns on this invention performing a translation control process. 履歴情報の一例を示すデータ構成図。The data block diagram which shows an example of log | history information. 初期化パターンの付加情報の一例を示すデータ構成図。The data block diagram which shows an example of the additional information of an initialization pattern.

本発明に係る情報処理装置の実施形態について、添付図面を参照しながら説明する。本発明に係る情報処理装置として、携帯電話機１を例に挙げて説明する。 Embodiments of an information processing apparatus according to the present invention will be described with reference to the accompanying drawings. A mobile phone 1 will be described as an example of the information processing apparatus according to the present invention.

図１は、携帯電話機１の斜視図である。携帯電話機１は、図１に示すように、矩形の板状の上筐体１０と、この上筐体１０とほぼ同形状をした下筐体１１とが、閉じた状態で、相互に一面を覆うように積層されることにより形成されている。これらの上筐体１０及び下筐体１１は、所定方向（例えば図１のＸ方向）に相互に所定距離だけスライド可能なように結合されていて、上筐体１０を下筐体１１に対してスライドさせることにより、閉じた状態から開いた状態に、あるいは開いた状態から閉じた状態に変形する。 FIG. 1 is a perspective view of the mobile phone 1. As shown in FIG. 1, the mobile phone 1 has a rectangular plate-like upper casing 10 and a lower casing 11 that is substantially the same shape as the upper casing 10 in a closed state. It is formed by being laminated so as to cover. The upper housing 10 and the lower housing 11 are coupled so as to be slidable by a predetermined distance from each other in a predetermined direction (for example, the X direction in FIG. 1). By sliding, the deformed state is changed from the closed state to the open state, or from the open state to the closed state.

上筐体１０の外面（下筐体１１に対面しない側の面）には、データを表示する液晶ディスプレイなどの表示装置１２、音声を出力するスピーカ１３、音声を入力するマイクロフォン１４が設けられている。また、下筐体１１の内面（上筐体１０に対面する側の面）には、ユーザが押下することによりデータを入力する操作キーなどの入力装置１５が設けられている。 A display device 12 such as a liquid crystal display for displaying data, a speaker 13 for outputting sound, and a microphone 14 for inputting sound are provided on the outer surface of the upper housing 10 (the surface not facing the lower housing 11). Yes. An input device 15 such as operation keys for inputting data when the user presses is provided on the inner surface of the lower housing 11 (the surface facing the upper housing 10).

図２は、携帯電話機１の構成を示すブロック図である。携帯電話機１は、図２に示すように、ＣＰＵ（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）２０、ＲＡＭ（ＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ）２１、ＲＯＭ（ＲｅａｄＯｎｌｙＭｅｍｏｒｙ）２２、通信装置２３、及び、上述した表示装置１２、スピーカ１３、マイクロフォン１４、入力装置１５がバス２４によって相互に通信可能なように接続されて構成されている。 FIG. 2 is a block diagram showing the configuration of the mobile phone 1. As shown in FIG. 2, the cellular phone 1 includes a CPU (Central Processing Unit) 20, a RAM (Random Access Memory) 21, a ROM (Read Only Memory) 22, a communication device 23, the display device 12, and the speaker 13 described above. The microphone 14 and the input device 15 are connected by a bus 24 so that they can communicate with each other.

ＣＰＵ２０は、携帯電話機１の総括的な制御を行うとともに、後述する翻訳制御処理やその他の様々な処理などを行う。またＣＰＵ２０は、入力装置１５に対する入力インタフェースを備え、例えば入力装置１５として設けられている操作キーの押下を検出すると、その操作キーに対応する処理を行う。またＣＰＵ２０は、実行中のアプリケーションプログラムなどの制御に基づいて画面データを生成して表示装置１２に表示する。 The CPU 20 performs overall control of the mobile phone 1 and performs a translation control process, which will be described later, and various other processes. Further, the CPU 20 includes an input interface for the input device 15. When the CPU 20 detects that an operation key provided as the input device 15 is pressed, for example, the CPU 20 performs processing corresponding to the operation key. Further, the CPU 20 generates screen data based on the control of the application program being executed and displays the screen data on the display device 12.

ＣＰＵ２０は、マイクロフォン１４で集音された音声からアナログ音声信号を生成し、このアナログ音声信号をデジタル音声信号に変換する。またＣＰＵ２０は、デジタル音声信号を取得すると、このデジタル音声信号をアナログ音声信号に変換し、スピーカ１３から音声として出力する。 CPU20 produces | generates an analog audio | voice signal from the audio | voice collected with the microphone 14, and converts this analog audio | voice signal into a digital audio | voice signal. Further, when acquiring the digital audio signal, the CPU 20 converts the digital audio signal into an analog audio signal and outputs the analog audio signal as audio from the speaker 13.

ＲＡＭ２１は、ＣＰＵ２０が処理を行う際の作業領域として一時的にデータを記憶する記憶装置である。ＲＯＭ２２は、ＣＰＵ２０が処理を行う際の処理プログラム（翻訳制御処理の処理プログラムなど）、処理に使用されるデータ（後述するモジュールリスト情報４０など）などを記憶する記憶装置である。音声翻訳に使用される辞書データも、予めＲＯＭ２２に記憶されているものとする。携帯電話機１は、ＲＯＭ２２に加えて、ハードディスク、不揮発性メモリなどの記憶装置を備えていても良く、モジュールリスト情報４０などは、ＲＯＭ２２でなくそれらの記憶装置に記憶されていても良い。 The RAM 21 is a storage device that temporarily stores data as a work area when the CPU 20 performs processing. The ROM 22 is a storage device that stores a processing program (processing program for translation control processing, etc.) when the CPU 20 performs processing, data used for processing (module list information 40, which will be described later), and the like. It is assumed that dictionary data used for speech translation is also stored in the ROM 22 in advance. The mobile phone 1 may include a storage device such as a hard disk and a nonvolatile memory in addition to the ROM 22, and the module list information 40 may be stored in the storage device instead of the ROM 22.

通信装置２３は、ＣＰＵ２０の制御に基づいて、基地局（図示せず）からアンテナ２３ａを介して受信した受信信号をスペクトラム逆拡散処理してデータを復元する。このデータは、ＣＰＵ２０の制御によりスピーカ１３から出力されたり、表示装置１２に表示されたり、またはＲＡＭ２１に記録されたりする。また通信制御部２６は、ＣＰＵ２０の制御に基づいて、マイクロフォン１４で集音された音声データや表示装置１２や入力装置１５を介して入力されたデータやＲＡＭ２１またはＲＯＭ２２に記憶されたデータを取得すると、これらのデータに対してスペクトラム拡散処理を行い、基地局に対してアンテナ２３ａを介して送信する。 Based on the control of the CPU 20, the communication device 23 restores data by performing spectrum despreading processing on a received signal received from a base station (not shown) via the antenna 23 a. This data is output from the speaker 13 under the control of the CPU 20, displayed on the display device 12, or recorded in the RAM 21. In addition, the communication control unit 26 acquires sound data collected by the microphone 14, data input via the display device 12 or the input device 15, or data stored in the RAM 21 or the ROM 22 based on the control of the CPU 20. These data are subjected to spread spectrum processing and transmitted to the base station via the antenna 23a.

携帯電話機１は、スタンドアロンで例えば言語Ａを言語Ｂに翻訳したり言語Ｂを言語Ａに翻訳したりする音声翻訳機能を備えている。さらに携帯電話機１は、翻訳制御処理を行う際に、翻訳制御に使用される各々のモジュールの初期化処理に必要なＲＡＭ２１のＲＡＭサイズ、各々のモジュールの演算処理に必要なＲＡＭサイズ、各々のモジュールの初期化処理に要する時間、及びＲＡＭ２１の空きＲＡＭサイズに基づいて、各々のモジュールの初期化処理のタイミングを制御する機能を備えていて、これにより、ＲＡＭ２１の空きＲＡＭサイズが制限されている場合であっても音声翻訳処理時間を最小限に抑えることができる。 The mobile phone 1 is provided with a speech translation function that translates language A into language B or translates language B into language A, for example. Further, when performing the translation control process, the mobile phone 1 uses the RAM size of the RAM 21 required for the initialization process of each module used for translation control, the RAM size required for the calculation process of each module, and each module. A function for controlling the timing of the initialization process of each module based on the time required for the initialization process and the free RAM size of the RAM 21, thereby limiting the free RAM size of the RAM 21 Even so, speech translation processing time can be minimized.

そして、ＣＰＵ２０は、言語Ａを言語Ｂに翻訳するモジュールとして言語Ａ音声認識部３０、Ａ−Ｂ機械翻訳部３１、言語Ｂ音声合成部３２を備えているとともに、言語Ｂを言語Ａに翻訳するモジュールとして言語Ｂ音声認識部３３、Ｂ−Ａ機械翻訳部３４、言語Ａ音声合成部３５を備えている。 The CPU 20 includes a language A speech recognition unit 30, an AB machine translation unit 31, and a language B speech synthesis unit 32 as modules that translate language A into language B, and translates language B into language A. The module includes a language B speech recognition unit 33, a B-A machine translation unit 34, and a language A speech synthesis unit 35.

言語Ａ音声認識部３０は、マイクロフォン１４に言語Ａの音声が入力されると、入力された音声を示す言語Ａの文章を生成する。Ａ−Ｂ機械翻訳部３１は、生成された言語Ａの文章を言語Ｂの文章に機械翻訳する。言語Ｂ音声合成部３２は、機械翻訳された文章に基づいて音声を合成して、出力用の音声データを生成する。 When the language A speech is input to the microphone 14, the language A speech recognition unit 30 generates a language A sentence indicating the input speech. The AB machine translation unit 31 performs machine translation of the generated language A text into language B text. The language B speech synthesizer 32 synthesizes speech based on the machine-translated sentence and generates speech data for output.

言語Ｂ音声認識部３３は、マイクロフォン１４に言語Ｂの音声が入力されると、入力された音声を示す言語Ｂの文章を生成する。Ｂ−Ａ機械翻訳部３４は、生成された言語Ｂの文章を言語Ａの文章に機械翻訳する。言語Ａ音声合成部３５は、機械翻訳された文章に基づいて音声を合成して、出力用の音声データを生成する。 When the language B speech is input to the microphone 14, the language B speech recognition unit 33 generates a language B sentence indicating the input speech. The B-A machine translation unit 34 machine-translates the generated language B text into language A text. The language A speech synthesizer 35 synthesizes speech based on the machine-translated sentence and generates speech data for output.

一例として、言語Ａ（日本語）から言語Ｂ（英語）に音声翻訳する場合の動作を簡単に説明する。例えばユーザが日本語で『もうすこし安くして下さい』と発話したとすると、言語Ａ音声認識部３０は音声波形を音声認識して文字列「もうすこし安くして下さい」を生成して表示装置１２に表示させる。図３は、携帯電話機１が翻訳制御処理を行う際に表示装置１２に表示させる画面の一例を示す図である。図３に示すように、言語Ａ音声認識部３０は、表示装置１２の画面１２ａにおける翻訳前言語表示欄１２ｂに、音声認識された文字列を表示させる。また、Ａ−Ｂ機械翻訳部３１は、図３に示すように、生成された文字列を英語に機械翻訳して「Ｐｌｅａｓｅｍａｋｅｉｔａｌｉｔｔｌｅｃｈｅａｐｅｒ」の文字列を生成して、画面１２ａの翻訳後言語表示欄１２ｃに表示させる。言語Ｂ音声合成部３２は、機械翻訳された文字列に基づいて音声合成し、『Ｐｌｅａｓｅｍａｋｅｉｔａｌｉｔｔｌｅｃｈｅａｐｅｒ』の文字列を音声として出力させる。 As an example, an operation for speech translation from language A (Japanese) to language B (English) will be briefly described. For example, if the user utters “Please make it a little cheaper” in Japanese, the language A speech recognition unit 30 recognizes the speech waveform and generates a character string “Please make it a little cheaper” to display it. 12 is displayed. FIG. 3 is a diagram illustrating an example of a screen displayed on the display device 12 when the mobile phone 1 performs the translation control process. As shown in FIG. 3, the language A speech recognition unit 30 displays the speech-recognized character string in the pre-translation language display field 12 b on the screen 12 a of the display device 12. Further, as shown in FIG. 3, the AB machine translation unit 31 machine translates the generated character string into English to generate a character string “Please make it a little chainer”, and translates the screen 12a. It is displayed in the rear language display column 12c. The language B speech synthesizer 32 synthesizes speech based on the machine-translated character string, and outputs a “Please make it a little chainer” character string as speech.

英語から日本語に音声翻訳する場合は、上記の逆の処理が実行される。なお、携帯電話機１において、日本語から英語に音声翻訳する処理と、英語から日本語に音声翻訳する処理とが並列に動作することはない。 In the case of speech translation from English to Japanese, the above reverse processing is executed. Note that in the mobile phone 1, the process of translating speech from Japanese to English and the process of translating speech from English to Japanese do not operate in parallel.

言語Ａ音声認識部３０、Ａ−Ｂ機械翻訳部３１、言語Ｂ音声合成部３２、言語Ｂ音声認識部３３、Ｂ−Ａ機械翻訳部３４、言語Ａ音声合成部３５は、処理を行う際、ＲＡＭ２１において、それぞれ辞書データを読み込むための記憶領域（初期化用のＲＡＭ）と演算処理を行うための記憶領域（演算用のＲＡＭ）を占有する必要がある。演算処理前に予め初期化を実行しておくことで、その後の処理を高速に実行することが可能だが、その場合にはその分のＲＡＭ２１の記憶領域を占有し続けることになる。一方、予め初期化を実行しない場合、辞書データによりＲＡＭ２１の記憶領域を占有し続けることは回避されるが、処理を行う都度に初期化が必要となるため処理時間が長くなる。 When the language A speech recognition unit 30, the AB machine translation unit 31, the language B speech synthesis unit 32, the language B speech recognition unit 33, the BA machine translation unit 34, and the language A speech synthesis unit 35 perform processing, In the RAM 21, it is necessary to occupy a storage area for reading dictionary data (RAM for initialization) and a storage area for calculation processing (RAM for calculation). By performing initialization before the arithmetic processing in advance, it is possible to execute the subsequent processing at high speed. However, in that case, the storage area of the RAM 21 is continuously occupied. On the other hand, if the initialization is not executed in advance, it is avoided that the storage area of the RAM 21 is continuously occupied by the dictionary data. However, since the initialization is required every time processing is performed, the processing time becomes long.

ＲＯＭ２２は、ＣＰＵ２０が翻訳制御処理を行う際に使用されるモジュールがリスト化されたモジュールリスト情報４０を記憶している。図４は、モジュールリスト情報４０の一例を示すデータ構成図である。図４に示すように、モジュールリスト情報４０は、各々のモジュール情報４０ａに対して、処理の種類を示す種類情報４０ｂ、ＲＡＭにおいてモジュールが処理を行う際に使用される記憶領域の最大サイズ（ピークＲＡＭサイズ）を示すＲＡＭサイズ情報４０ｃがそれぞれ対応付けられた情報である。なお、モジュールが処理を行う際には、そのモジュールの本来の処理である演算処理に必要なデータをＲＯＭ２２からＲＡＭ２１に展開する初期化処理を行う必要があるため、処理の種類は、この「初期化」処理と、モジュールの本来の処理である「演算」処理との２種類である。 The ROM 22 stores module list information 40 in which modules used when the CPU 20 performs translation control processing are listed. FIG. 4 is a data configuration diagram illustrating an example of the module list information 40. As shown in FIG. 4, the module list information 40 includes, for each module information 40a, type information 40b indicating the type of processing, and the maximum size (peak) of the storage area used when the module performs processing in the RAM. RAM size information 40c indicating (RAM size) is associated with each other. When a module performs processing, it is necessary to perform initialization processing that expands data necessary for arithmetic processing, which is the original processing of the module, from the ROM 22 to the RAM 21. There are two types of processing: “processing” and “calculation” processing that is the original processing of the module.

具体的には、初期化用の言語Ａ音声認識部３０（ＡＳＲ＿Ａ）の初期化に対して「Ｍ＿Ａａ」のＲＡＭサイズが、演算用の言語Ａ音声認識部３０（ＡＳＲ＿Ａ）の演算に対して「Ｍ＿Ａａ＿ｗ」のＲＡＭサイズがそれぞれ対応付けられている。また、Ａ−Ｂ機械翻訳部３１（ＭＴ＿ＡＢ）の初期化に対して「Ｍ＿Ｍａ」のＲＡＭサイズが、Ａ−Ｂ機械翻訳部３１（ＭＴ＿ＡＢ）の演算に対して「Ｍ＿Ｍａ＿ｗ」のＲＡＭサイズがそれぞれ対応付けられている。また、言語Ｂ音声合成部３２（ＴＴＳ＿Ｂ）の初期化に対して「Ｍ＿Ｔａ」のＲＡＭサイズが、言語Ｂ音声合成部３２（ＴＴＳ＿Ｂ）の演算に対して「Ｍ＿Ｔａ＿ｗ」のＲＡＭサイズがそれぞれ対応付けられている。 Specifically, the RAM size of “M_Aa” for initialization of the language A speech recognition unit 30 (ASR_A) for initialization is “0” for the computation of the language A speech recognition unit 30 (ASR_A) for computation. The RAM sizes of “M_Aa_w” are associated with each other. The RAM size of “M_Ma” corresponds to the initialization of the AB machine translation unit 31 (MT_AB), and the RAM size of “M_Ma_w” corresponds to the operation of the AB machine translation unit 31 (MT_AB). It is attached. The RAM size of “M_Ta” is associated with the initialization of the language B speech synthesizer 32 (TTS_B), and the RAM size of “M_Ta_w” is associated with the operation of the language B speech synthesizer 32 (TTS_B). ing.

同様に、言語Ｂ音声認識部３３、Ｂ−Ａ機械翻訳部３４、言語Ａ音声合成部３５に対しても、初期化処理と演算処理のそれぞれに必要なＲＡＭサイズが対応付けられている。 Similarly, the language B speech recognition unit 33, the BA machine translation unit 34, and the language A speech synthesis unit 35 are also associated with the RAM sizes necessary for the initialization process and the calculation process.

また、ＲＯＭ２２は、各モジュールの初期化時間が記された初期化時間情報４１を記憶している。図５は、初期化時間情報４１の一例を示すデータ構成図である。図５に示すように、初期化時間情報４１では、各々のモジュールを示すモジュール情報４０ａに対して、初期化処理に要する時間が対応付けられている。 The ROM 22 stores initialization time information 41 in which the initialization time of each module is recorded. FIG. 5 is a data configuration diagram showing an example of the initialization time information 41. As shown in FIG. 5, in the initialization time information 41, the time required for the initialization process is associated with the module information 40a indicating each module.

図６は、言語Ａから言語Ｂへ音声翻訳するときに用いられる各々のモジュールの初期化タイミングのパターンを示す初期化パターン情報４２の一例（例えば言語Ａから言語Ｂに翻訳するモジュールの初期化タイミングのパターン）を示すデータ構成図である。図６に示すように、初期化パターン情報４２は、各々のパターン識別情報４２ａに対して、言語Ａ音声認識部３０を事前初期化するか否かを示す事前初期化可否情報４２ｂ、Ａ−Ｂ機械翻訳部３１を事前初期化するか否かを示す事前初期化可否情報４２ｃ、言語Ｂ音声合成部３２を事前初期化するか否かを示す事前初期化可否情報４２ｄ、ピークＲＡＭサイズ情報４２ｅ、各々のモジュールを演算時に初期化する際にかかる時間を示す追加処理時間情報４２ｆがそれぞれ対応付けられた情報である。 FIG. 6 shows an example of initialization pattern information 42 indicating the initialization timing pattern of each module used when speech translation is performed from language A to language B (for example, initialization timing of a module that translates from language A to language B). FIG. As shown in FIG. 6, the initialization pattern information 42 includes pre-initialization availability information 42b indicating whether or not the language A speech recognition unit 30 is pre-initialized for each pattern identification information 42a, AB. Pre-initialization availability information 42c indicating whether the machine translation unit 31 is pre-initialized, pre-initialization availability information 42d indicating whether the language B speech synthesis unit 32 is pre-initialized, peak RAM size information 42e, The additional processing time information 42f indicating the time taken to initialize each module at the time of calculation is associated with each module.

初期化パターン情報４２には、言語Ａから言語Ｂへ音声翻訳するときに用いられる各モジュールの事前初期化要否の組み合わせとして考えられる全てのパターンについて、ピークＲＡＭサイズと、モジュール演算処理実行時に初期化を行うために要する時間（追加処理時間）とが登録されている。図６では、事前初期化を行うモジュールの事前初期化可否情報を「○」、事前初期化を行わないモジュールの事前初期化可否情報を「×」で表している。なお、図示しないが、言語Ｂから言語Ａへ音声翻訳する場合の初期化パターン情報も、言語Ａから言語Ｂへ音声翻訳する場合の初期化パターンと同様に予め記憶されている。ここで、事前の初期化とは、音声翻訳機能の開始時に、前もって行われるモジュールの初期化である。 The initialization pattern information 42 includes a peak RAM size and an initial value at the time of execution of module calculation processing for all patterns that can be considered as combinations of necessity of pre-initialization of each module used when speech translation from language A to language B is performed. The time (additional processing time) required for the conversion is registered. In FIG. 6, the pre-initialization availability information of a module to be pre-initialized is represented by “◯”, and the pre-initialization availability information of a module that is not pre-initialized is represented by “×”. Although not shown, initialization pattern information for speech translation from language B to language A is also stored in advance, as is the initialization pattern for speech translation from language A to language B. Here, the prior initialization is module initialization that is performed in advance at the start of the speech translation function.

言語Ａから言語Ｂへの翻訳制御処理を実行する際のピークＲＡＭサイズと追加処理時間について説明する。音声翻訳処理において、モジュールの初期化処理を事前に行った場合、その初期化処理に用いたＲＡＭ２１の記憶領域は、各モジュールの演算処理実行時にもそのまま占有される。そのため、事前に初期化するモジュールが多いほど、多くのＲＡＭ２１の記憶領域が必要となる。初期化パターン情報４２のピークＲＡＭサイズは、各パターンで処理した場合の音声翻訳処理全体の中で必要となるＲＡＭの最大値を表している。音声翻訳処理においては、上述のとおり音声認識完了後に機械翻訳を実行し、機械翻訳完了後に音声合成が実行されるため、各モジュールは並列動作しない。そのため、ピークＲＡＭサイズは、各モジュールの演算処理にかかるＲＡＭサイズの内の最大値と、事前初期化を行う全てのモジュールの初期化に必要なＲＡＭサイズの合計として算出される。 The peak RAM size and the additional processing time when executing the translation control process from language A to language B will be described. In the speech translation process, when the module initialization process is performed in advance, the storage area of the RAM 21 used for the initialization process is occupied as it is when the calculation process of each module is executed. Therefore, the more modules to be initialized in advance, the more storage areas of the RAM 21 are required. The peak RAM size of the initialization pattern information 42 represents the maximum RAM value required in the entire speech translation processing when processing is performed with each pattern. In speech translation processing, as described above, machine translation is executed after completion of speech recognition, and speech synthesis is executed after completion of machine translation. Therefore, the modules do not operate in parallel. For this reason, the peak RAM size is calculated as the sum of the maximum RAM size required for the arithmetic processing of each module and the RAM size required for initialization of all the modules to be pre-initialized.

一方、事前に初期化されないモジュールは、各モジュールの演算処理を行う都度、初期化される。そのため、事前に初期化されないモジュールが多いほど、各モジュールの演算処理を行う都度発生する追加の初期化のための処理時間が必要となる。初期化パターン情報４２の追加処理時間は、音声翻訳処理全体の中で、追加の初期化処理のための処理時間の合計時間を表している。つまり、追加処理時間は、事前に初期化されないモジュールの初期化処理に要する時間として算出される。 On the other hand, modules that are not initialized in advance are initialized each time the arithmetic processing of each module is performed. Therefore, the more modules that are not initialized in advance, the more processing time is required for additional initialization that occurs each time the arithmetic processing of each module is performed. The additional processing time of the initialization pattern information 42 represents the total processing time for the additional initialization processing in the entire speech translation processing. That is, the additional processing time is calculated as the time required for the initialization processing of the module that is not initialized in advance.

例えば、初期化パターン情報４２に登録された各パターンの中で、ＲＡＭ２１の記憶領域を最も多く占有するパターンは、全てのモジュールが事前に初期化するパターン（図６のパターン１）であり、これは、各モジュールが演算処理を行う際のＲＡＭ２１の使用量の最大値ｍａｘ｛Ｍ＿Ａａ＿ｗ，Ｍ＿Ｍａ＿ｗ，Ｍ＿Ｔａ＿ｗ｝＝Ｍ＿ｗと定義すると、ピークＲＡＭサイズＭ＿Ｐｋ＿ｍａｘは「Ｍ＿Ａａ＋Ｍ＿Ｍａ＋Ｍ＿Ｔａ＋Ｍ＿ｗ」となる。また、各モジュールによる演算処理を行うときの追加の初期化の処理が必要無いため、追加処理時間Ｔｐａ１は０となる。 For example, among the patterns registered in the initialization pattern information 42, the pattern that occupies the most storage area of the RAM 21 is a pattern (pattern 1 in FIG. 6) that is initialized in advance by all modules. Is defined as a maximum value max {M_Aa_w, M_Ma_w, M_Ta_w} = M_w of the usage amount of the RAM 21 when each module performs arithmetic processing, the peak RAM size M_Pk_max is “M_Aa + M_Ma + M_Ta + M_w”. Further, the additional processing time Tpa1 is 0 because no additional initialization processing is required when performing the arithmetic processing by each module.

これに対して、全てのモジュールが事前に初期化されないパターン（図６のパターン８）では、初期化によってＲＡＭが占有されない。そのため、ピークＲＡＭサイズは、「Ｍｐａ８＝ｍａｘ{Ｍ＿Ａａ＋Ｍ＿Ａａ＿ｗ，Ｍ＿Ｍａ＋Ｍ＿Ｍａ＿ｗ，Ｍ＿Ｔａ＋Ｍ＿Ｔａ＿ｗ}」である。一方、各モジュールの演算処理実行時に初期化を行うため、追加処理時間が「Ｔｐａ８＝Ｔ＿Ａａ＋Ｔ＿Ｍａ＋Ｔ＿Ｔａ」となる。 On the other hand, in the pattern in which all modules are not initialized in advance (pattern 8 in FIG. 6), the RAM is not occupied by the initialization. Therefore, the peak RAM size is “Mpa8 = max {M_Aa + M_Aa_w, M_Ma + M_Ma_w, M_Ta + M_Ta_w}”. On the other hand, since initialization is performed at the time of execution of arithmetic processing of each module, the additional processing time becomes “Tpa8 = T_Aa + T_Ma + T_Ta”.

初期化パターンは、初期化パターン情報４２に記憶されたピークＲＡＭサイズと追加処理時間の情報と、翻訳制御処理のために使用できる空きＲＡＭサイズに基づいて選択される。 The initialization pattern is selected based on the peak RAM size and additional processing time information stored in the initialization pattern information 42 and the free RAM size that can be used for the translation control process.

ＲＡＭ２１の記憶領域における空きＲＡＭサイズを「Ｍ＿Ｆｒｅｅ」、翻訳制御処理の実行時に他のアプリのために最低限確保しておくＲＡＭサイズを「Ｍ＿ｅｔｃ」と定義すると、翻訳制御処理のために使用できる空きＲＡＭサイズＭ＿ｍａｘは「Ｍ＿Ｆｒｅｅ−Ｍ＿ｅｔｃ」と定義する。 If the free RAM size in the storage area of the RAM 21 is defined as “M_Free” and the minimum RAM size reserved for other applications when executing the translation control processing is defined as “M_etc”, the free space that can be used for the translation control processing The RAM size M_max is defined as “M_Free-M_etc”.

言語Ａから言語Ｂへの音声翻訳に必要なピークＲＡＭサイズＭ＿Ｐｋ＿ｍａｘが空きＲＡＭサイズＭ＿ｍａｘよりも小さければ、音声翻訳を行う前に、事前に全モジュールの初期化処理を実行しておくことが可能なので、パターン１により最小の処理時間で翻訳制御処理を実行することができる。しかしながら、ピークＲＡＭサイズＭ＿Ｐｋ＿ｍａｘが空きＲＡＭサイズＭ＿ｍａｘよりも小さい場合は、モジュールを全て事前に初期化しておくことはできないため、一部のモジュールのみを事前に初期化しておくことになる。この場合、ピークＲＡＭサイズは小さくなるが、事前初期化を行っていないモジュールの初期化処理に要する時間の分だけ処理時間が長くなるため、全てのモジュールを初期化した場合よりも処理時間が長くなる。 If the peak RAM size M_Pk_max required for speech translation from language A to language B is smaller than the free RAM size M_max, it is possible to execute initialization processing for all modules in advance before speech translation. By using pattern 1, the translation control process can be executed with a minimum processing time. However, when the peak RAM size M_Pk_max is smaller than the free RAM size M_max, all the modules cannot be initialized in advance, so only some of the modules are initialized in advance. In this case, the peak RAM size is reduced, but the processing time is longer by the time required for the initialization processing of the modules that have not been pre-initialized, so the processing time is longer than when all the modules are initialized. Become.

以下、各々のモジュールの初期化のタイミングを制御することにより処理時間を最小化するために、事前初期化を行うモジュールの選択方法について説明する。携帯電話機１で翻訳制御処理を行う際、空きＲＡＭサイズに応じて、いずれのモジュールを事前初期化するかどうか選択する。言語Ａから言語Ｂに翻訳するための３種類のモジュールについて考えると、事前初期化有無を決定するためのパターンは初期化パターン情報４２に示すようにパターン１乃至パターン８の８パターンある。 Hereinafter, in order to minimize the processing time by controlling the timing of initialization of each module, a method for selecting a module to be pre-initialized will be described. When the translation control processing is performed by the mobile phone 1, it is selected which module to pre-initialize according to the free RAM size. Considering three types of modules for translating from language A to language B, there are eight patterns 1 to 8 as shown in the initialization pattern information 42 for determining whether or not to pre-initialize.

事前に初期化するモジュール、演算処理を実行するタイミングで初期化するモジュールをそれぞれ選別する初期化処理の最適化は、初期化パターン情報４２に基づいて以下の手順で行われる。
（１）空きＲＡＭサイズＭ＿ｍａｘを算出する。
（２）パターン１乃至パターン８（Ｍｐａ１〜ＭＰａ８）から、ピークＲＡＭサイズが空きＲＡＭサイズＭ＿ｍａｘ以下であるパターンを選出する。
（３）上記（２）で選出されたパターンのうち、追加処理時間（Ｔｐａ１〜Ｔｐａ８）が最小のパターンを選択する。 The optimization of the initialization process for selecting the module to be initialized in advance and the module to be initialized at the timing of executing the arithmetic process is performed based on the initialization pattern information 42 in the following procedure.
(1) The free RAM size M_max is calculated.
(2) A pattern whose peak RAM size is equal to or less than the free RAM size M_max is selected from patterns 1 to 8 (Mpa1 to MPa8).
(3) Of the patterns selected in (2) above, select the pattern with the shortest additional processing time (Tpa1 to Tpa8).

携帯電話機１が各々のモジュールの初期化のタイミングを最適化しつつ翻訳処理を行う翻訳制御処理の手順について、図７、図８、図１０に示すフローチャートに基づいて詳細に説明する。なお、最初に言語Ａから言語Ｂへの翻訳を行う場合を例に挙げて説明する。 The procedure of the translation control process in which the mobile phone 1 performs the translation process while optimizing the initialization timing of each module will be described in detail based on the flowcharts shown in FIGS. Note that a case where translation from the language A to the language B is first performed will be described as an example.

図７は、携帯電話機１が翻訳制御処理を行う際の全体の手順を示すフローチャートであり、図８は、携帯電話機１が事前初期化処理を行う際の手順を示すフローチャートである。図７に示すように、まずＣＰＵ２０は、言語Ａから言語Ｂへの翻訳を行う各々のモジュールについて、事前初期化処理を行う（Ｓ１０１）。この事前初期化処理について、図８に示すフローチャートに基づいて説明する。 FIG. 7 is a flowchart showing an overall procedure when the mobile phone 1 performs the translation control process, and FIG. 8 is a flowchart showing a procedure when the mobile phone 1 performs the pre-initialization process. As shown in FIG. 7, first, the CPU 20 performs a pre-initialization process for each module that performs translation from language A to language B (S101). This pre-initialization process will be described based on the flowchart shown in FIG.

ＣＰＵ２０はまず空きＲＡＭサイズ（Ｍ＿ｍａｘ）を算出する（Ｓ２０１）。またＣＰＵ２０は、追加処理時間の最小値（Ｔｍｉｎ）を初期値「無限大（∞）」に設定するとともに、パターンの計数のための変数ｉに初期値「１」を、最適なパターンを示す変数ｎに初期値「０」を設定する（Ｓ２０３）。ここで、初期値「無限大」は十分に大きな値を意味するものであり、より具体的にはＴｐａ８よりも大きな値であれば良い。 The CPU 20 first calculates an empty RAM size (M_max) (S201). Further, the CPU 20 sets the minimum value (Tmin) of the additional processing time to the initial value “infinity (∞)”, sets the initial value “1” to the variable i for pattern counting, and the variable indicating the optimum pattern. An initial value “0” is set to n (S203). Here, the initial value “infinity” means a sufficiently large value, and more specifically, it may be a value larger than Tpa8.

ＣＰＵ２０は、パターンｉのピークＲＡＭサイズ（Ｍｐａ（ｉ））が、空きＲＡＭサイズ（Ｍ＿ｍａｘ）より小さいかどうかを判断する（Ｓ２０５）。このときＣＰＵ２０は、例えば変数ｉが「１」の場合にはパターン１のピークＲＡＭサイズについて判断する。パターンｉのピークＲＡＭサイズが空きＲＡＭサイズよりも小さい場合（Ｓ２０５のＹｅｓ）、ＣＰＵ２０は、パターンｉの追加処理時間（Ｔｐａ（ｉ））が追加処理時間の最小値Ｔｍｉｎよりも小さいか否かを判断する（Ｓ２０７）。 The CPU 20 determines whether or not the peak RAM size (Mpa (i)) of the pattern i is smaller than the free RAM size (M_max) (S205). At this time, for example, when the variable i is “1”, the CPU 20 determines the peak RAM size of the pattern 1. When the peak RAM size of the pattern i is smaller than the free RAM size (Yes in S205), the CPU 20 determines whether or not the additional processing time (Tpa (i)) of the pattern i is smaller than the minimum value Tmin of the additional processing time. Judgment is made (S207).

パターンｉの追加処理時間（Ｔｐａ（ｉ））が追加処理時間の最小値Ｔｍｉｎよりも小さい場合（Ｓ２０７のＹｅｓ）は、ＣＰＵ２０は、追加処理時間の最小値（Ｔｍｉｎ）にパターンｉの追加処理時間（Ｔｐａ（ｉ））を設定する（Ｓ２０９）。追加処理時間の最小値が無限大に設定されている場合には、必ずこのときのパターンｉの追加処理時間が追加処理時間の最小値に設定される。そしてＣＰＵ２０は、変数ｎにこのときの「ｉ」を設定する（Ｓ２１１）。変数ｎは最適なパターンを示す変数であり、変数ｎに「ｉ」を設定することにより現時点でパターンｉが最適であることを意味する。 When the additional processing time (Tpa (i)) of the pattern i is smaller than the minimum value Tmin of the additional processing time (Yes in S207), the CPU 20 adds the additional processing time of the pattern i to the minimum value (Tmin) of the additional processing time. (Tpa (i)) is set (S209). When the minimum value of the additional processing time is set to infinity, the additional processing time of the pattern i at this time is always set to the minimum value of the additional processing time. Then, the CPU 20 sets “i” at this time to the variable n (S211). The variable n is a variable indicating an optimal pattern, and setting “i” to the variable n means that the pattern i is optimal at the present time.

パターンｉのピークＲＡＭサイズが空きＲＡＭサイズ以上であった場合（Ｓ２０５のＮｏ）、パターンｉの追加処理時間（Ｔｐａ（ｉ））が追加処理時間の最小値以上であった場合（Ｓ２０７のＮｏ）、またはステップＳ２０９及びＳ２１１の処理を完了した後、ＣＰＵ２０は、変数ｉに「１」を加算する（Ｓ２１３）。 When the peak RAM size of the pattern i is equal to or larger than the free RAM size (No in S205), and when the additional processing time (Tpa (i)) of the pattern i is equal to or larger than the minimum value of the additional processing time (No in S207) Alternatively, after completing the processes of steps S209 and S211, the CPU 20 adds “1” to the variable i (S213).

またＣＰＵ２０は、変数ｉがパターンの総数である「８」より大きいか否かを判断する（Ｓ２１５）。変数ｉが８以下であった場合（Ｓ２１５のＮｏ）、すなわち全てのパターンについてステップＳ２０５乃至Ｓ２１１の処理を行っていない場合は、ステップＳ２０５に戻って、ＣＰＵ２０はステップＳ２０５乃至Ｓ２１５の処理を行う。 Further, the CPU 20 determines whether or not the variable i is larger than “8” which is the total number of patterns (S215). If the variable i is 8 or less (No in S215), that is, if the processes in steps S205 to S211 have not been performed for all patterns, the process returns to step S205, and the CPU 20 performs the processes in steps S205 to S215.

変数ｉが８より大きい場合（Ｓ２１５のＹｅｓ）、すなわち全てのパターンについてステップＳ２０５乃至Ｓ２１１の処理を行った場合は、ＣＰＵ２０は、変数ｎが０より大きいか否かを判断する（Ｓ２１７）。変数ｎが「０」の場合には、最適なパターンが選択されておらず、事前初期化を行えないからである。 When the variable i is greater than 8 (Yes in S215), that is, when the processes of Steps S205 to S211 have been performed for all patterns, the CPU 20 determines whether or not the variable n is greater than 0 (S217). This is because when the variable n is “0”, an optimal pattern has not been selected and pre-initialization cannot be performed.

変数ｎが０より大きい場合（Ｓ２１７のＹｅｓ）は、ＣＰＵ２０は初期化パターン情報４２に基づいて、パターンｎの初期化を実行する（Ｓ２１９）。このとき、例えば変数ｎが「１」であった場合は、ＣＰＵ２０は、初期化パターン情報４２のパターン１に基づいて、言語Ａ音声認識部３０、Ａ−Ｂ機械翻訳部３１、言語Ｂ音声合成部３２を初期化し、パターン２が選択された場合は、言語Ａ音声認識部３０、Ａ−Ｂ機械翻訳部３を初期化する。 When the variable n is larger than 0 (Yes in S217), the CPU 20 executes the initialization of the pattern n based on the initialization pattern information 42 (S219). At this time, for example, if the variable n is “1”, the CPU 20 based on the pattern 1 of the initialization pattern information 42, the language A speech recognition unit 30, the AB machine translation unit 31, the language B speech synthesis. When the unit 32 is initialized and the pattern 2 is selected, the language A speech recognition unit 30 and the AB machine translation unit 3 are initialized.

またこのとき、ＣＰＵ２０は、各々のモジュールの初期化状態を示す初期化状態情報４４に、各々のモジュールの初期化状態を記憶する（Ｓ２２１）。図９（Ａ）は、識別符号と初期化状態との関係を示す対応表であり、図９（Ｂ）は、初期化状態情報４４の一例を示すデータ構成図である。図９（Ａ）に示すように、モジュールを事前初期化すべきでないが初期化されている状態に識別符号「−１」が、モジュールを事前初期化しない状態に識別符号「０」が、モジュールを事前初期化すべきだがまだされていない状態に識別符号「１」が、モジュールが既に事前初期化されている状態に識別符号「２」がそれぞれ割り当てられている。 At this time, the CPU 20 stores the initialization state of each module in the initialization state information 44 indicating the initialization state of each module (S221). FIG. 9A is a correspondence table showing the relationship between the identification codes and the initialization states, and FIG. 9B is a data configuration diagram showing an example of the initialization state information 44. As shown in FIG. 9A, the identification code “−1” indicates that the module should not be preinitialized but is initialized, and the identification code “0” indicates that the module is not preinitialized. An identification code “1” is assigned to a state that should be pre-initialized but has not yet been assigned, and an identification code “2” is assigned to a state where the module has already been pre-initialized.

識別符号「−１」は、むやみに解放処理が行われないための識別符号である。例えば、一般的にモジュールが実行された後には即座にメモリの使用領域が解放されるが、このモジュールが繰り返し使用される可能性がある場合には何度も初期化を繰り返すのは非効率的である。そこで、繰り返し使用される可能性の高いモジュールに対して識別符号「−１」を設定しておき、別のモジュールが実行されるタイミングで当該モジュールを解放するようにすることにより、無駄な初期化処理を省くことができる。 The identification code “−1” is an identification code for unnecessarily performing the release process. For example, memory usage is generally released immediately after a module is executed, but if this module is likely to be used repeatedly, it is inefficient to repeat initialization many times. It is. Therefore, by setting an identification code “−1” for a module that is likely to be used repeatedly, and releasing the module at the timing when another module is executed, wasteful initialization is performed. Processing can be omitted.

そして、選択されたパターンに基づいて、モジュールの事前初期化を行わないが初期化されているモジュール（予め識別符号に「−１」が設定されていたモジュール）には「−１」が、モジュールの事前初期化を行わないモジュールには「０」が、既に事前初期化を行ったモジュールには「２」が設定される。また、例えば「Ａ言語からＢ言語」と「Ｂ言語からＡ言語」の双方向の翻訳を行う場合に、双方向分のモジュールのパターンが選択されたが、この時点で１方向のみの事前初期化が行われた場合などで、事前初期化を行うべきだが現時点で行われていないモジュールには「１」が設定される。 Then, based on the selected pattern, the module is not pre-initialized but is initialized (the module whose identification code is set to “−1” in advance) is “−1”, the module “0” is set for a module that is not pre-initialized, and “2” is set for a module that has already been pre-initialized. In addition, for example, when performing bidirectional translation from “A language to B language” and “B language to A language”, a module pattern for bidirectional is selected. For example, “1” is set for a module that should be pre-initialized but not currently performed.

図９（Ｂ）に示すように、初期化状態情報４４では、モジュール情報４４ａに対して初期化状態を示す状態情報４４ｂがそれぞれ対応付けられている。なお、状態情報４４ｂは上述した識別符号で記されている。例えば図９（Ｂ）によると、言語Ａ音声認識部３０（ＡＳＲ＿Ａ）に識別符号「２」（初期化されている状態）が、Ａ−Ｂ機械翻訳部３１（ＭＴ＿ＡＢ）に識別符号「１」（初期化すべきだけどされていない状態）が対応付けられている。ＣＰＵ２０は、ステップＳ２２１にて各々のモジュールについて状態情報４４ｂを更新する。 As shown in FIG. 9B, in the initialization state information 44, the state information 44b indicating the initialization state is associated with the module information 44a. The state information 44b is written with the identification code described above. For example, according to FIG. 9B, the identification code “2” (initialized state) is assigned to the language A speech recognition unit 30 (ASR_A), and the identification code “1” is assigned to the AB machine translation unit 31 (MT_AB). (A state that should be initialized but not done) is associated. In step S221, the CPU 20 updates the state information 44b for each module.

このようにしてＣＰＵ２０は、各々のモジュールに対して事前初期化処理を行った後、ユーザにより終了が指示されたか否かを判断する（Ｓ１０３）。このときＣＰＵ２０は、例えば入力装置１５を介する所定の入力に基づいて、終了が指示されたものと判断する。終了が指示された場合（Ｓ１０３のＹｅｓ）は、ＣＰＵ２０は翻訳制御処理を終了する。 In this manner, the CPU 20 determines whether or not an end is instructed by the user after performing the pre-initialization process for each module (S103). At this time, the CPU 20 determines that the termination has been instructed based on, for example, a predetermined input via the input device 15. When the end is instructed (Yes in S103), the CPU 20 ends the translation control process.

終了が指示されていない場合（Ｓ１０３のＮｏ）は、ＣＰＵ２０は、翻訳方向の切替が指示されたか否かを判断する（Ｓ１０５）。このときＣＰＵ２０は、例えば入力装置１５を介する所定の入力に基づいて、言語Ａから言語Ｂへの翻訳から、言語Ｂから言語Ａへの翻訳に切り替えられた場合に、切替が指示されたものと判断する。 If termination is not instructed (No in S103), the CPU 20 determines whether or not switching of the translation direction is instructed (S105). At this time, for example, the CPU 20 is instructed to switch when the translation from the language A to the language B is switched to the translation from the language B to the language A based on a predetermined input via the input device 15. to decide.

翻訳方向の切替が指示されていない場合（Ｓ１０５のＮｏ）は、ＣＰＵ２０は、言語Ａから言語Ｂに翻訳を行うために、ユーザにより発話されたか否かを判断する（Ｓ１０７）。ユーザは、例えば画面１２ａに表示された認識開始／終了ボタン１２を選択することにより携帯電話機１に音声認識の開始を指示し、認識開始／終了ボタン１２を再び選択することにより音声認識の終了を指示する。ＣＰＵ２０は、例えばこのボタンの選択に基づいて発話されたか否かを判断する。 When the switching of the translation direction is not instructed (No in S105), the CPU 20 determines whether or not the user has spoken in order to translate from the language A to the language B (S107). For example, the user instructs the mobile phone 1 to start voice recognition by selecting the recognition start / end button 12 displayed on the screen 12a, and selects the recognition start / end button 12 again to end the voice recognition. Instruct. The CPU 20 determines whether or not an utterance has been made based on the selection of this button, for example.

発話された場合（Ｓ１０７のＹｅｓ）は、ＣＰＵ２０は、言語Ａ音声認識部３０により言語Ａの音声認識を行う（Ｓ１０９）。例えばユーザが言語Ａ（日本語）で「こんにちは」と発話した場合には、ＣＰＵ２０は「こんにちは」の文字列を生成する。そしてＣＰＵ２０は、機械翻訳処理を実行することを示すユーザ操作を検出した場合に、Ａ−Ｂ機械翻訳部３１により、ステップＳ１０９にて音声認識された文字列を言語Ａから言語Ｂに機械翻訳する（Ｓ１１１）。例えば音声認識により「こんにちは」の文字列が生成された場合には、ＣＰＵ２０は「こんにちは」を言語Ｂ（英語）に機械翻訳して「Ｈｅｌｌｏ」の文字列を生成する。 When the utterance is made (Yes in S107), the CPU 20 performs language A speech recognition by the language A speech recognition unit 30 (S109). For example, if the user utters "Hello" in language A (Japanese) is, CPU20 generates a character string "Hello". When the CPU 20 detects a user operation indicating that machine translation processing is to be executed, the AB machine translation unit 31 machine translates the character string recognized in step S109 from language A to language B. (S111). For example, in the case where the character string "Hello" is generated by the speech recognition, CPU20 generates a string of "Hello" and then machine translation in language B (English) "Hello".

この際、画面１２ａの翻訳前言語表示欄１２ｂに音声認識された文字列である「こんにちは」の文字列が表示されると良い。また、画面１２ａの翻訳後言語表示欄１２ｃに機械翻訳された文字列である「Ｈｅｌｌｏ」の文字列が表示されると良い。 In this case, a voice recognition string to the translation language display column 12b of the screen 12a may character string "Hello" is displayed. Moreover, it is good to display the character string of "Hello" which is the character string machine-translated in the post-translation language display column 12c of the screen 12a.

ＣＰＵ２０は、音声合成処理を実行することを示すユーザ操作を検出した場合に、言語Ｂ音声合成部３２により、ステップＳ１１１にて機械翻訳された文字列について音声合成して音声データを生成する（Ｓ１１３）。例えば生成された文字列が「Ｈｅｌｌｏ」であった場合には「Ｈｅｌｌｏ」の言葉を発する音声データが生成される。そしてＣＰＵ２０は、生成された音声データの出力を示すユーザ操作を検出した場合に、ステップＳ１１３にて生成された音声データをスピーカ１３により音声出力する（Ｓ１１５）。ユーザは、例えば画面１２ａに表示された出力開始ボタン１２ｅを選択することにより、携帯電話機１に音声出力の開始を指示する。ＣＰＵ２０は、このボタンが選択されたことに基づいて音声出力を開始すると良い。その後、ステップＳ１０３に戻って、ＣＰＵ２０は再び終了が指示されたか否かを判断する。 When the CPU 20 detects a user operation indicating that the speech synthesis process is to be executed, the language B speech synthesizer 32 performs speech synthesis on the character string machine-translated in step S111 to generate speech data (S113). ). For example, in the case where the generated character string is “Hello”, voice data that generates the word “Hello” is generated. When the CPU 20 detects a user operation indicating the output of the generated audio data, the CPU 20 outputs the audio data generated in step S113 through the speaker 13 (S115). For example, the user instructs the mobile phone 1 to start audio output by selecting the output start button 12e displayed on the screen 12a. The CPU 20 may start audio output based on the selection of this button. Thereafter, the process returns to step S103, and the CPU 20 determines again whether or not the end is instructed.

一方、翻訳方向の切替が指示された場合（Ｓ１０５のＮｏ）は、ＣＰＵ２０は、翻訳方向を切り替え、言語Ｂから言語Ａへの翻訳を行う各々のモジュールに対して、ステップＳ２０１乃至Ｓ２２１に示す事前初期化処理を行う（Ｓ１１７）。その後、ＣＰＵ２０はユーザにより終了が指示されたか否かを判断する（Ｓ１１９）。このときＣＰＵ２０は、例えば入力装置１５を介する所定の入力に基づいて、終了が指示されたものと判断する。終了が指示された場合（Ｓ１１９のＹｅｓ）は、ＣＰＵ２０は翻訳制御処理を終了する。 On the other hand, when switching of the translation direction is instructed (No in S105), the CPU 20 switches the translation direction and performs advance processing shown in steps S201 to S221 for each module that performs translation from language B to language A. Initialization processing is performed (S117). Thereafter, the CPU 20 determines whether or not the user has instructed termination (S119). At this time, the CPU 20 determines that the termination has been instructed based on, for example, a predetermined input via the input device 15. When the end is instructed (Yes in S119), the CPU 20 ends the translation control process.

終了が指示されていない場合（Ｓ１１９のＮｏ）は、ＣＰＵ２０は、翻訳方向の切替が指示されたか否かを判断する（Ｓ１２１）。このときＣＰＵ２０は、例えば入力装置１５を介する所定の入力に基づいて、言語Ｂから言語Ａへの翻訳から、言語Ａから言語Ｂへの翻訳に切り替えられた場合に、切替が指示されたものと判断する。翻訳方向の切替が指示された場合（Ｓ１２１のＹｅｓ）は、ステップＳ１０１に戻って、ＣＰＵ２０は、翻訳方向を切り替え、言語Ａから言語Ｂへの翻訳を行う各々のモジュールに対して事前初期化を行う。 If termination is not instructed (No in S119), the CPU 20 determines whether or not switching of the translation direction is instructed (S121). At this time, for example, when the CPU 20 switches from translation from the language B to the language A to translation from the language A to the language B based on a predetermined input via the input device 15, the switching is instructed. to decide. If switching of the translation direction is instructed (Yes in S121), the process returns to step S101, and the CPU 20 switches the translation direction and performs pre-initialization for each module that performs translation from language A to language B. Do.

翻訳方向の切替が指示されていない場合（Ｓ１２１のＮｏ）は、ＣＰＵ２０は、言語Ｂから言語Ａに翻訳を行うために、ユーザにより発話されたか否かを判断する（Ｓ１２３）。ユーザは、例えば画面１２ａに表示された認識開始／終了ボタン１２を選択することにより携帯電話機１に音声認識の開始を指示し、認識開始／終了ボタン１２を再び選択することにより音声認識の終了を指示する。ＣＰＵ２０は、例えばこのボタンの選択に基づいて発話されたか否かを判断する。 When switching of the translation direction is not instructed (No in S121), the CPU 20 determines whether or not the user has spoken in order to translate from the language B to the language A (S123). For example, the user instructs the mobile phone 1 to start voice recognition by selecting the recognition start / end button 12 displayed on the screen 12a, and selects the recognition start / end button 12 again to end the voice recognition. Instruct. The CPU 20 determines whether or not an utterance has been made based on the selection of this button, for example.

発話された場合（Ｓ１２３のＹｅｓ）は、ＣＰＵ２０は、言語Ｂ音声認識部３３により、言語Ｂの音声認識を行う（Ｓ１２５）。例えばユーザが言語Ｂ（英語）で「Ｈｅｌｌｏ」と発話した場合には、ＣＰＵ２０は「Ｈｅｌｌｏ」の文字列を生成する。そしてＣＰＵ２０は、機械翻訳処理を実行することを示すユーザ操作を検出した場合に、Ｂ−Ａ機械翻訳部３４により、ステップＳ１２５にて音声認識された文字列を言語Ｂから言語Ａに機械翻訳する（Ｓ１２７）。例えば音声認識により「Ｈｅｌｌｏ」の文字列が生成された場合には、ＣＰＵ２０は「Ｈｅｌｌｏ」を言語Ａ（日本語）に機械翻訳して「こんにちは」の文字列を生成する。 When the utterance is made (Yes in S123), the CPU 20 performs language B speech recognition by the language B speech recognition unit 33 (S125). For example, when the user speaks “Hello” in language B (English), the CPU 20 generates a character string “Hello”. When the CPU 20 detects a user operation indicating that machine translation processing is to be executed, the B-A machine translation unit 34 machine translates the character string recognized in step S125 from language B to language A. (S127). For example, in the case where the character string "Hello" is generated by the speech recognition, CPU20 generates a string of by machine translation "Hello" to the language A (Japanese) "Hello".

ＣＰＵ２０は、音声合成処理を実行することを示すユーザ操作を検出した場合に、言語Ａ音声合成部３５により、ステップＳ１２７にて機械翻訳された文字列について音声合成して、音声データを生成する（Ｓ１２９）。例えば生成された文字列が「こんにちは」であった場合には「こんにちは」の言葉を発する音声データが生成される。そしてＣＰＵ２０は、生成された音声データの出力を示すユーザ操作を検出した場合に、ステップＳ１２９にて生成された音声データをスピーカ１３により音声出力する（Ｓ１３１）。ユーザは、例えば画面１２ａに表示された出力開始ボタン１２ｅを選択することにより、携帯電話機１に音声出力の開始を指示する。ＣＰＵ２０は、このボタンが選択されたことに基づいて音声出力を開始すると良い。その後、ステップＳ１１９に戻って、ＣＰＵ２０は再び終了が指示されたか否かを判断する。 When the CPU 20 detects a user operation indicating execution of the speech synthesis process, the language A speech synthesizer 35 performs speech synthesis on the character string machine-translated in step S127 to generate speech data ( S129). Voice data that emits the words of "Hello" is generated in the case, for example, is generated string was "Hello". When the CPU 20 detects a user operation indicating the output of the generated audio data, the CPU 20 outputs the audio data generated in step S129 through the speaker 13 (S131). For example, the user instructs the mobile phone 1 to start audio output by selecting the output start button 12e displayed on the screen 12a. The CPU 20 may start audio output based on the selection of this button. Thereafter, the process returns to step S119, and the CPU 20 determines whether or not the end is instructed again.

なお、ＣＰＵ２０は、上述したステップＳ１０９、Ｓ１１１、Ｓ１１３、Ｓ１２５、Ｓ１２７、Ｓ１２９の各々の処理（演算処理）をユーザの入力操作に応じて行うとして説明したが、ユーザの入力操作がなかった場合には、各々の処理を実行せずに次のステップの処理に進むようにしても良い。また、ユーザの入力操作に基づかず、自動的に機械翻訳、音声合成、音声出力を実行しても良い。 In addition, although CPU20 demonstrated that each process (arithmetic process) of step S109, S111, S113, S125, S127, S129 mentioned above was performed according to a user's input operation, when there is no user's input operation, May proceed to the processing of the next step without executing each processing. Further, machine translation, speech synthesis, and speech output may be automatically executed without being based on a user input operation.

また、ＣＰＵ２０は、上述したステップＳ１０９、Ｓ１１１、Ｓ１１３、Ｓ１２５、Ｓ１２７、Ｓ１２９の各々の処理（演算処理）を行う際に、各々のモジュールの初期化状態に応じて初期化処理を行う。図１０は、携帯電話機１が翻訳制御処理を行う際の、演算処理時の初期化処理の手順を示すフローチャートである。例えばステップＳ１０９にて言語Ａ音声認識部３０が音声認識を行う場合について説明する。 Further, the CPU 20 performs initialization processing according to the initialization state of each module when performing each processing (calculation processing) of steps S109, S111, S113, S125, S127, and S129 described above. FIG. 10 is a flowchart showing a procedure of initialization processing at the time of calculation processing when the mobile phone 1 performs translation control processing. For example, the case where the language A speech recognition unit 30 performs speech recognition in step S109 will be described.

図１０に示すように、ＣＰＵ２０は、初期化状態情報４４に基づいて言語Ａ音声認識部３０のモジュールの初期化状態を判断する（Ｓ３０１）。状態情報４４ｂが「０」の場合（Ｓ３０１の「０」）は、言語Ａ音声認識部３０のモジュールが事前初期化されずに演算時に初期化されるべきモジュールであるため、ＣＰＵ２０は音声Ａ音声認識部３０のモジュールを初期化する（Ｓ３０２）。 As shown in FIG. 10, the CPU 20 determines the initialization state of the module of the language A speech recognition unit 30 based on the initialization state information 44 (S301). When the status information 44b is “0” (“0” in S301), the module of the language A speech recognition unit 30 is a module that should be initialized at the time of calculation without being pre-initialized. The module of the recognition unit 30 is initialized (S302).

状態情報４４ｂが「１」の場合（Ｓ３０１の「１」）は、言語Ａ音声認識部３０のモジュールが事前初期化されるべきモジュールであるが現時点では初期化されていないため、ＣＰＵ２０は言語Ａ音声認識部３０のモジュールを初期化する（Ｓ３０３）。そしてＣＰＵ２０は、言語Ａ音声認識部３０が初期化されたので、初期化状態情報４４において、言語Ａ音声認識部３０の状態情報４４ｂを「２」に設定する（Ｓ３０４）。 When the status information 44b is “1” (“1” in S301), the module of the language A speech recognition unit 30 is a module that should be pre-initialized, but is not initialized at this time. The module of the voice recognition unit 30 is initialized (S303). Then, since the language A speech recognition unit 30 has been initialized, the CPU 20 sets the state information 44b of the language A speech recognition unit 30 to “2” in the initialization state information 44 (S304).

状態情報４４ｂが「２」の場合（Ｓ３０１の「２」）は、言語Ａ音声認識部３０のモジュールが事前初期化されるべきモジュールであり既に事前初期化されているため、ＣＰＵ２０は初期化を行わずに次のステップに進む。また、状態情報４４ｂが「−１」の場合（Ｓ３０１の「−１」）は、言語Ａ音声認識部３０のモジュールが事前初期化されるべきでないモジュールであるが既に初期化されているため、ＣＰＵ２０は初期化を行わずに次のステップに進む。 When the status information 44b is “2” (“2” in S301), the module of the language A speech recognition unit 30 is a module to be pre-initialized and has already been pre-initialized. Proceed to the next step without doing it. When the status information 44b is “−1” (“−1” in S301), the module of the language A speech recognition unit 30 is a module that should not be pre-initialized, but has already been initialized. The CPU 20 proceeds to the next step without performing initialization.

そしてＣＰＵ２０は、言語Ａ音声認識部３０に演算処理（すなわち音声認識処理）を実行させる（Ｓ３０５）。演算処理の実行後に、ＣＰＵ２０は、再びステップＳ３０１と同様に、言語Ａ音声認識部３０のモジュールの初期化状態を判断する（Ｓ３０７）。 Then, the CPU 20 causes the language A speech recognition unit 30 to execute arithmetic processing (that is, speech recognition processing) (S305). After execution of the arithmetic processing, the CPU 20 again determines the initialization state of the module of the language A speech recognition unit 30 as in step S301 (S307).

状態情報４４ｂが「０」の場合（Ｓ３０７の「０」）は、言語Ａ音声認識部３０のモジュールがステップＳ３０２にて初期化されたので、初期化によりＲＡＭ２１に展開されたデータをＲＡＭ２１から削除することにより言語Ａ音声認識部３０のモジュールを解放する（Ｓ３０９）。一方、状態情報４４ｂが「−１」または「２」の場合（Ｓ３０７の「−１」、「２」）は、ＣＰＵ２０は、言語Ａ音声認識部３０のモジュールを解放せずに処理を終了する。なお、識別符号「−１」が設定されているモジュールは、別のモジュールが実行されるタイミングで解放されると良い。 When the status information 44b is “0” (“0” in S307), the module of the language A speech recognition unit 30 has been initialized in step S302, so the data expanded in the RAM 21 by the initialization is deleted from the RAM 21. As a result, the module of the language A speech recognition unit 30 is released (S309). On the other hand, when the status information 44b is “−1” or “2” (“−1”, “2” in S307), the CPU 20 ends the process without releasing the module of the language A speech recognition unit 30. . Note that a module for which the identification code “−1” is set may be released at a timing when another module is executed.

このようなステップＳ３０１乃至Ｓ３０９の処理が、言語Ａ音声認識部３０、Ａ−Ｂ機械翻訳部３１、言語Ｂ音声合成部３２、言語Ｂ音声認識部３３、Ｂ−Ａ機械翻訳部３４、言語Ａ音声合成部３５の各々のモジュールが処理を実行する毎に行われる。 Such processing in steps S301 to S309 is performed by the language A speech recognition unit 30, the AB machine translation unit 31, the language B speech synthesis unit 32, the language B speech recognition unit 33, the BA machine translation unit 34, and the language A. This is performed every time each module of the speech synthesizer 35 executes processing.

以上のように、音声翻訳に利用できる空きＲＡＭサイズと音声翻訳の各モジュールを事前初期化するときに必要なＲＡＭサイズとの関係および、各モジュールを事前に初期化しなかった場合に追加で必要となる処理時間に基づいて、各モジュールを事前に初期化するか否かを決定する。これによって、限られたＲＡＭサイズであっても、より少ない処理時間で音声翻訳を実行することができる。 As described above, the relationship between the free RAM size that can be used for speech translation and the RAM size required when pre-initializing each speech translation module, and additionally required if each module is not pre-initialized Whether to initialize each module in advance is determined based on the processing time. As a result, even with a limited RAM size, speech translation can be executed with less processing time.

以上のような初期化処理判断処理に対して各々のモジュールの利用頻度を利用することにより、翻訳制御処理の処理時間を更に短縮することができる。図１１は、各々のモジュールの使用履歴が記録された履歴情報４５の一例を示すデータ構成図である。図１１に示すように、履歴情報４５は、例えば最近実行されたモジュールが過去１００回分について記憶された情報である。ＣＰＵ２０は、いずれかのモジュールが実行される毎に（ステップＳ１０９、Ｓ１１１、Ｓ１１３、Ｓ１２５、Ｓ１２７、Ｓ１２９の各々の処理が行われる場合など）、履歴情報４５に実行されたモジュールを記憶しておく。そして、この履歴情報４５に基づいて各モジュールの利用頻度が決定される。 By using the frequency of use of each module for the initialization process determination process as described above, the processing time of the translation control process can be further shortened. FIG. 11 is a data configuration diagram showing an example of the history information 45 in which the usage history of each module is recorded. As illustrated in FIG. 11, the history information 45 is information stored for the last 100 modules executed recently, for example. The CPU 20 stores the executed module in the history information 45 each time one of the modules is executed (for example, when each of steps S109, S111, S113, S125, S127, and S129 is performed). . Based on the history information 45, the usage frequency of each module is determined.

図１２は、初期化パターンの付加情報４６の一例を示すデータ構成図である。図１２に示すように、初期化パターンの付加情報４６は、初期化パターン情報４２と共通の各々のパターン識別情報４６ａに対して、言語Ａ音声認識部３０の利用頻度の比率を示す利用頻度情報４６ｂ、Ａ−Ｂ機械翻訳部３１の利用頻度の比率を示す利用頻度情報４６ｃ、言語Ｂ音声合成部３２の利用頻度の比率を示す利用比率情報４６ｄ、利用頻度を反映させた追加処理時間である拡張追加処理時間を示す拡張追加処理時間情報４６ｅがそれぞれ対応付けられた情報である。拡張追加処理時間は各モジュールの追加処理時間に頻度比率を重み付けした総和であり、各モジュールの利用頻度を利用する場合には、拡張追加処理時間が最小になるようにパターンが選択される。 FIG. 12 is a data configuration diagram showing an example of the additional information 46 of the initialization pattern. As illustrated in FIG. 12, the additional information 46 of the initialization pattern is usage frequency information indicating a ratio of the usage frequency of the language A speech recognition unit 30 with respect to each pattern identification information 46 a common to the initialization pattern information 42. 46b, usage frequency information 46c indicating the usage frequency ratio of the AB machine translation unit 31, usage ratio information 46d indicating the usage frequency ratio of the language B speech synthesizer 32, and additional processing time reflecting the usage frequency. The extended additional processing time information 46e indicating the extended additional processing time is information associated with each other. The extended additional processing time is a sum obtained by weighting the frequency ratio to the additional processing time of each module. When the usage frequency of each module is used, the pattern is selected so that the extended additional processing time is minimized.

例えば図１２によると、言語Ａ音声認識部３０の利用頻度の比率が「Ｒ＿Ａａ」、Ａ−Ｂ機械翻訳部３１の利用頻度の比率が「Ｒ＿Ｍａ」、言語Ｂ音声合成部３２の利用頻度の比率が「Ｒ＿Ｔａ」と定義されている。また、パターン１〜パターン８の拡張追加処理時間Ｔｐａ’〜Ｔｐａ８’が定義されている。 For example, according to FIG. 12, the usage frequency ratio of the language A speech recognition unit 30 is “R_Aa”, the usage frequency ratio of the AB machine translation unit 31 is “R_Ma”, and the usage frequency ratio of the language B speech synthesis unit 32. Is defined as “R_Ta”. Further, extended addition processing times Tpa 'to Tpa8' for patterns 1 to 8 are defined.

例えば、Ｍ＿Ａａ＝１０ＭＢ、Ｍ＿Ｍａ＝１０ＭＢ、Ｍ＿Ｔａ＝１ＭＢ、Ｔ＿Ａａ＝８秒、Ｔ＿Ｍａ＝４秒、Ｔ＿Ｔａ＝１秒、Ｍ＿Ａａ＿ｗ＝５ＭＢ、Ｍ＿Ｍａ＿ｗ＝４ＭＢ、Ｍ＿Ｔａ＿ｗ＝３ＭＢである場合について考える。各々のモジュールの利用頻度を利用しない場合、空きＲＡＭサイズＭ＿ｍａｘが１６ＭＢの場合、メモリ制約から選択されるパターンはパターン３とパターン５である。翻訳処理実行時の追加処理時間はパターン３でＴｐａ３＝４秒、パターン５でＴｐａ５＝８秒なので、追加処理時間が最小となるのはパターン３である。つまり、追加処理時間を多く要する言語Ａ音声認識部３０（８秒）を事前に初期化しておき、Ａ−Ｂ機械翻訳部３１の初期化（４秒）は、機械翻訳処理が必要なタイミングで実行する、という処理手順が選択される。 For example, consider the case where M_Aa = 10 MB, M_Ma = 10 MB, M_Ta = 1 MB, T_Aa = 8 seconds, T_Ma = 4 seconds, T_Ta = 1 second, M_Aa_w = 5 MB, M_Ma_w = 4 MB, and M_Ta_w = 3 MB. When the usage frequency of each module is not used, patterns 3 and 5 are selected from the memory constraints when the free RAM size M_max is 16 MB. The additional processing time at the time of executing the translation processing is Tpa3 = 4 seconds for pattern 3 and Tpa5 = 8 seconds for pattern 5, so that the additional processing time is the minimum for pattern 3. In other words, the language A speech recognition unit 30 (8 seconds), which requires a lot of additional processing time, is initialized in advance, and the AB machine translation unit 31 is initialized (4 seconds) at a timing that requires machine translation processing. The processing procedure to execute is selected.

しかし、例えばユーザが音声認識により得られた文字列をテキストエディタなどで編集し、これに対して機械翻訳を行わせる処理を繰り返すような状況を想定すると、機械翻訳の初期化（４秒）を何度も繰り返すのは効率が悪い。この非効率的な追加処理時間を軽減するために、各モジュールの頻度比率が用いられる。 However, for example, assuming a situation where a user edits a character string obtained by speech recognition with a text editor and repeats the process of performing machine translation on the text string, initialization of machine translation (4 seconds) is performed. Repeating many times is inefficient. In order to reduce this inefficient additional processing time, the frequency ratio of each module is used.

例えば上記ケースにおいて、言語Ａ音声認識部３０、Ａ−Ｂ機械翻訳部３１、言語Ｂ音声合成部３２の各頻度比率がそれぞれＲ＿Ａａ＝０．１、Ｒ＿Ｍａ＝０．８、Ｒ＿Ｔａ＝０．１であるとすると、拡張追加処理時間はパターン３で、Ｔｐａ３’＝４＊０．８＝３．２秒、パターン５でＴｐａ５’＝８＊０．１＝０．８秒なので、最小のものはパターン５となる。このように、ユーザの利用ケースに自動適応して、各モジュールの初期化処理のタイミングを最適化することができる。 For example, in the above case, the frequency ratios of the language A speech recognition unit 30, the AB machine translation unit 31, and the language B speech synthesis unit 32 are R_Aa = 0.1, R_Ma = 0.8, and R_Ta = 0.1, respectively. If there is, the extended additional processing time is pattern 3, Tpa3 ′ = 4 * 0.8 = 3.2 seconds, and pattern 5 has Tpa5 ′ = 8 * 0.1 = 0.8 seconds. 5 In this way, it is possible to optimize the initialization processing timing of each module by automatically adapting to the use case of the user.

なお、各モジュールの追加処理時間に頻度比率を重み付けする方法は、上述した方法に限定されず、最近の履歴ほど重みを大きくなるような重み付け和で頻度比率を算出する方法であっても良い。 Note that the method of weighting the frequency ratio to the additional processing time of each module is not limited to the above-described method, and may be a method of calculating the frequency ratio with a weighted sum that increases the weight of the recent history.

また、言語Ａ音声認識部３０、Ａ−Ｂ機械翻訳部３１、言語Ｂ音声合成部３２、言語Ｂ音声認識部３３、Ｂ−Ａ機械翻訳部３４、言語Ａ音声合成部３５の各モジュールの実行頻度が等しい場合は、初期化パターン情報４２の追加処理時間情報４２ｆを参照してパターンを選択するようにすると良い。 Also, the modules of the language A speech recognition unit 30, the AB machine translation unit 31, the language B speech synthesis unit 32, the language B speech recognition unit 33, the BA machine translation unit 34, and the language A speech synthesis unit 35 are executed. If the frequencies are equal, it is preferable to select a pattern with reference to the additional processing time information 42f of the initialization pattern information 42.

また、Ａ−Ｂ機械翻訳部３１が事前初期化されていない状態（状態「０」または「１」）で、ユーザがテキストエディタ等で翻訳原文（音声認識部における出力文）を編集した場合（編集終了後のタイミングではなく、少しでも内容が変化したタイミングで）、ユーザが次に機械翻訳処理を実行する可能性が高いので、ユーザのテキスト編集の裏で並列処理でＡ−Ｂ機械翻訳部３１を初期化しても良い。 Further, when the AB machine translation unit 31 is not pre-initialized (state “0” or “1”) and the user edits the translation original text (the output text in the speech recognition unit) with a text editor or the like ( Since it is highly likely that the user will next execute the machine translation process (not at the timing after the editing is finished, but at the timing when the content changes even a little), the AB machine translation unit is performed in parallel processing behind the user's text editing. 31 may be initialized.

同様に、言語Ｂ音声合成部３３が事前初期化されていない状態（状態「０」または「１」）で、ユーザがテキストエディタ等で翻訳結果（音声合成部における読み上げ対象文）を編集した場合（編集終了後のタイミングではなく、少しでも内容が変化したタイミングで）、ユーザが次に音声合成処理を実行する可能性が高いので、ユーザのテキスト編集の裏で並列処理で言語Ｂ音声合成部３３を初期化しても良い。 Similarly, when the language B speech synthesizer 33 is not pre-initialized (state “0” or “1”) and the user edits the translation result (the text to be read out by the speech synthesizer) with a text editor or the like. Since it is highly likely that the user will next execute the speech synthesis process (not at the timing after the editing is finished, but at the timing when the content changes even a little), the language B speech synthesis unit is performed in parallel processing behind the user's text editing. 33 may be initialized.

また、携帯電話機１において、片方向ずつの事前初期化処理のみならず、両方向で行っても良い。この場合には、現在選択されている翻訳方向の事前初期化処理を行ったときに、ＲＡＭ２１の空きＲＡＭサイズに余裕があれば、もう片方向の事前初期化処理を行うようにすると良い。ただし、現在選択されていない翻訳方向の初期化パターンは、翻訳処理に用いることができるＲＡＭ２１のサイズから現在選択されている翻訳方向のピークＲＡＭサイズを除いた値を基に判定するのではなく、翻訳処理に用いることができるＲＡＭ２１のサイズを基に判定する。 Further, in the mobile phone 1, not only the pre-initialization process for each direction but also the both directions may be performed. In this case, when the pre-initialization process for the currently selected translation direction is performed, if there is a vacant RAM size in the RAM 21, the other-direction pre-initialization process may be performed. However, the initialization pattern of the translation direction that is not currently selected is not determined based on a value obtained by removing the peak RAM size of the currently selected translation direction from the size of the RAM 21 that can be used for the translation process. The determination is made based on the size of the RAM 21 that can be used for the translation process.

これは、現在選択されていない翻訳方向の事前初期化は、翻訳方向を切替えたときに必要となるものであって、現在選択されている翻訳方向で事前に初期化されたデータは翻訳方向を切替えたときに解放しても良いためである。仮に、現在選択されていない翻訳方向の初期化パターンを、翻訳処理に用いることができるＲＡＭサイズから現在選択されている翻訳方向のピークＲＡＭサイズを除いた値を基に判定した場合、翻訳方向を切替えたときに、最も処理時間が短くなるような事前初期化パターンが選択されない場合がある。例えば、現在選択されていない翻訳方向の事前初期化パターンとして、追加処理時間が８秒のパターンＡと０秒のパターンＢがあり、翻訳処理に用いることができるＲＡＭサイズを基に判定すると、両方のパターンが選択可能であるとする。そして、翻訳処理に用いることができるＲＡＭサイズから現在選択されている翻訳方向のピークＲＡＭサイズを除いた値を基に判定して、パターンＡしか選択することができなかった時に、翻訳方向が選択されていない状態でパターンＡに基づいて現在選択されていない翻訳方向のモジュールの事前初期化を行う場合がある。しかしながら、翻訳方向を切替えたときに、選択し得るパターンの内、最も追加処理時間が短くなるパターンはパターンＢであるため、パターンＢでの事前初期化が必要となり、パターンＡで事前初期化を行っておいたモジュールを解放する必要が出てくる可能性がある。このように現在選択されていない翻訳方向の初期化パターンは、翻訳処理に用いることができるＲＡＭ２１のサイズから現在選択されている翻訳方向のピークＲＡＭサイズを除いた値を基に判定すると、非効率的な事前初期化が行われる場合があり、処理時間の増加につながる。 This is because pre-initialization of the translation direction that is not currently selected is necessary when the translation direction is switched. This is because it may be released when switched. If the initialization pattern of the translation direction that is not currently selected is determined based on the value obtained by removing the peak RAM size of the currently selected translation direction from the RAM size that can be used for the translation process, the translation direction is When switching, the pre-initialization pattern that shortens the processing time may not be selected. For example, there are a pattern A with an additional processing time of 8 seconds and a pattern B of 0 seconds as pre-initialization patterns in the translation direction that are not currently selected, and both are determined based on the RAM size that can be used for translation processing. It is assumed that these patterns can be selected. The translation direction is selected when only the pattern A can be selected based on a value obtained by removing the peak RAM size of the currently selected translation direction from the RAM size that can be used for the translation process. In some cases, a module in a translation direction that is not currently selected based on the pattern A is pre-initialized. However, when the translation direction is switched, the pattern that has the shortest additional processing time among the patterns that can be selected is the pattern B. Therefore, the pattern B needs to be pre-initialized. It may be necessary to release a module that has been done. The initialization pattern of the translation direction that is not currently selected as described above is inefficient if it is determined based on the value obtained by removing the peak RAM size of the currently selected translation direction from the size of the RAM 21 that can be used for the translation process. Pre-initialization may be performed, leading to an increase in processing time.

現在選択されていない翻訳方向の初期化パターンを、翻訳処理に用いることができるＲＡＭ２１のサイズを基に選択する場合の処理を具体的に説明する。例えば、現在選択されている翻訳方向が言語Ｂから言語Ａへの方向である場合であって、翻訳処理に用いることができるＲＡＭ２１のサイズを２０ＭＢ、現在選択されている翻訳方向で初期化パターンを判定し、その初期化パターンで必要なピークＲＡＭサイズが１４ＭＢ必要であるとする。また、現在選択されていない翻訳方向（言語Ａ→言語Ｂ）での各モジュールの初期化処理に必要なＲＡＭサイズをＭ＿Ａａ＝１０ＭＢ、Ｍ＿Ｍａ＝１０ＭＢ、Ｍ＿Ｔａ＝６ＭＢとし、現在選択されていない翻訳方向（言語Ａ→言語Ｂ）での各モジュールの演算処理に必要なＲＡＭサイズをＭ＿Ａａ＿ｗ＝５ＢＭ、Ｍ＿Ｍａ＿ｗ＝４ＭＢ、Ｍ＿Ｔａ＿ｗ＝３ＭＢとし、各モジュールの初期化処理にかかる時間をＴ＿Ａａ＝８秒、Ｔ＿Ｍａ＝４秒、Ｔ＿Ｔａ＝１秒とする。この場合、現在選択されていない翻訳方向の初期化パターンの選択は、翻訳処理に用いることができるＲＡＭ２１のサイズ２０ＭＢの範囲内に収まるピークＲＡＭサイズを有し、追加初期化処理時間が短いパターンが選択される。各モジュールの演算処理に必要なＲＡＭサイズの最大値Ｍ＿ｗはｍａｘ（５ＭＢ、４ＭＢ、３ＭＢ）＝５ＭＢであるため、翻訳処理に用いることができるＲＡＭ２１のサイズ２０ＭＢの範囲内に収まるピークＲＡＭサイズを有する事前初期化のパターンは、現在選択されていない翻訳方向での各モジュールのいずれか１つを事前初期化するパターンとなる。これらのパターンの中で、追加初期化処理時間が短いパターンとして、言語Ａ音声認識部３０が初期化されるパターンが選択される。ただし、言語Ｂ→言語Ａの翻訳方向が選択されている時点で、現在選択されていない翻訳方向（言語Ａ→言語Ｂ）の初期化のためには、翻訳処理に用いることができるＲＡＭサイズ２０ＭＢから現在選択されている翻訳方向（言語Ｂ→言語Ａ）のピークＲＡＭサイズ１４ＭＢを除いた６ＭＢである。そのため、１０ＭＢ必要な言語Ａ音声認識部３０の初期化を実行することができず、初期化状態情報４４の言語Ａ音声認識部３０に対応する初期化状態を「１」に設定し、翻訳方向が切替えられた後に、事前初期化を行う。 A process for selecting an initialization pattern in the translation direction that is not currently selected based on the size of the RAM 21 that can be used for the translation process will be described in detail. For example, when the currently selected translation direction is the direction from language B to language A, the size of the RAM 21 that can be used for the translation process is 20 MB, and the initialization pattern is set in the currently selected translation direction. It is determined that the peak RAM size required for the initialization pattern is 14 MB. Further, the RAM size required for the initialization process of each module in the translation direction not currently selected (language A → language B) is M_Aa = 10 MB, M_Ma = 10 MB, M_Ta = 6 MB, and the translation direction not currently selected The RAM size required for the calculation processing of each module in (Language A → Language B) is M_Aa_w = 5BM, M_Ma_w = 4MB, M_Ta_w = 3MB. 4 seconds and T_Ta = 1 second. In this case, the selection of the initialization pattern in the translation direction that is not currently selected has a peak RAM size that falls within the range of the size of the RAM 21 that can be used for the translation process, and a pattern with a short additional initialization process time. Selected. Since the maximum value M_w of the RAM size necessary for the calculation processing of each module is max (5 MB, 4 MB, 3 MB) = 5 MB, the RAM size of the RAM 21 that can be used for the translation processing has a peak RAM size that falls within the range of 20 MB. The pre-initialization pattern is a pattern for pre-initializing any one of the modules in the translation direction not currently selected. Among these patterns, a pattern for initializing the language A speech recognition unit 30 is selected as a pattern with a short additional initialization processing time. However, when the translation direction of language B → language A is selected, a RAM size of 20 MB that can be used for translation processing is used to initialize a translation direction that is not currently selected (language A → language B). Is 6 MB excluding the peak RAM size of 14 MB in the currently selected translation direction (language B → language A). Therefore, the initialization of the language A speech recognition unit 30 that requires 10 MB cannot be executed, the initialization state corresponding to the language A speech recognition unit 30 in the initialization state information 44 is set to “1”, and the translation direction After is switched, pre-initialization is performed.

なお、上述の例では、一方の翻訳方向選択時に他方の翻訳方向の事前初期化ができない場合を示したが、一方の翻訳方向選択時に他方の翻訳方向の事前初期化ができる場合も考えられる。例えば、上述の例においてＭ＿Ｔａ＝１ＭＢの場合には、初期化パターンとして、言語Ａ音声認識部３０と言語Ｂ音声合成部３２とが初期化されるパターンが選択される。この場合、言語Ｂ音声合成部３２の事前初期化は、言語Ｂ→言語Ａが選択されている場合にも実行できるため、言語Ｂ→言語Ａが選択されているときに事前初期化を行っても良い。また、一方の翻訳方向選択時に他方の翻訳方向の事前初期化ができる場合、どちらの翻訳方向時であっても事前初期化をしておくことができるモジュールについては、翻訳方向の切り替えのたびに初期化したデータを解放する必要が無い。例えば、言語Ｂ音声合成部３２の事前初期化を言語Ｂ→言語Ａが選択されている場合にも実行できる場合、翻訳方向を言語Ａ→言語Ｂから言語Ｂ→言語Ａに切替えたとしても、言語Ｂ音声合成部３２の初期化データを解放しなくても良い。 In the above-described example, the case where the pre-initialization of the other translation direction cannot be performed when one translation direction is selected is shown. However, the case where the other translation direction can be pre-initialized when one translation direction is selected is also conceivable. For example, in the above example, when M_Ta = 1 MB, a pattern for initializing the language A speech recognition unit 30 and the language B speech synthesis unit 32 is selected as the initialization pattern. In this case, since the pre-initialization of the language B speech synthesizer 32 can be executed even when the language B → language A is selected, the pre-initialization is performed when the language B → language A is selected. Also good. In addition, if one of the translation directions can be pre-initialized when the other translation direction is selected, a module that can be pre-initialized in either translation direction is There is no need to release initialized data. For example, when the pre-initialization of the language B speech synthesizer 32 can be executed even when language B → language A is selected, even if the translation direction is switched from language A → language B to language B → language A, The initialization data of the language B speech synthesizer 32 need not be released.

なお、初期化状態情報４４の更新は、翻訳機能の起動時／翻訳方向切り替え時／アプリ画面アクティブ時などに行うと良い。 The initialization state information 44 may be updated when the translation function is activated / when the translation direction is switched / when the application screen is active.

また、１方向（例えば言語Ａから言語Ｂへの翻訳）分の初期化タイミング決定して、そのタイミングに基づいて初期化を行った後、ＲＡＭ２１の空きＲＡＭサイズに余裕が発生した段階で他方向（例えば言語Ｂから言語Ａへの翻訳）分の初期化を行うようにしても良い。例えば、言語Ａから言語Ｂへの翻訳を行っている間に、並列させて、処理が走っていない（例えばＣＰＵ２０が表示処理のみを行っていて、ユーザの入力操作を待っている状態などの）ときに、他方向の初期化を行うようにしてもよい。 Further, after determining initialization timing for one direction (for example, translation from language A to language B) and performing initialization based on the timing, the other direction is generated when there is a margin in the free RAM size of the RAM 21. Initialization (for example, translation from language B to language A) may be performed. For example, while the translation from the language A to the language B is being performed in parallel, the processing is not running (for example, the CPU 20 is only performing the display processing and is waiting for the user's input operation). Sometimes, initialization in the other direction may be performed.

あるいは、事前初期化は必ずしも事前に行わなくても良い。実際に演算処理（音声認識処理や機械翻訳処理など）が必要なタイミングで初期化処理を実行し、翻訳方向が変わるまでその初期化データの解放を行わないようにしても良い。 Alternatively, the pre-initialization is not necessarily performed in advance. It is also possible to execute the initialization process at a timing when an arithmetic process (speech recognition process, machine translation process, etc.) is actually required, and not release the initialization data until the translation direction changes.

また、携帯電話機１において翻訳制御処理を行う際、ＲＡＭ２１の空きＲＡＭサイズに応じて、初期化されていないモジュールの初期化が可能であれば、初期化を促す情報を表示させてユーザに初期化の指示を促すようにしても良い。例えば、ＲＡＭ２１にあと３ＭＢの空きＲＡＭサイズがあれば、もう一つ分のモジュールの事前初期化が可能な場合には、ユーザにその旨を表示しても良い。 In addition, when performing translation control processing in the mobile phone 1, if initialization of a module that has not been initialized is possible according to the free RAM size of the RAM 21, information for prompting initialization is displayed and initialization is performed for the user. You may be made to prompt the instruction. For example, if there is another 3 MB of free RAM size in the RAM 21, if it is possible to pre-initialize another module, the fact may be displayed to the user.

具体的には、例えば実質空きＲＡＭサイズＭ＿ｍａｘに３ＭＢ加算した値を算出し、この値に基づいて、図８のフローチャートに示す方法で事前初期化パターン（パターン１〜８）を判定した結果が、３ＭＢ加算しない場合（Ｍ＿ｍａｘに基づいて事前初期化パターンを判定する場合）と同じであるか否かを判定する。例えば、Ｍ＿ｍａｘに３ＭＢ加算した値に基づいて判定した事前初期化パターンがパターン３であり、Ｍ＿ｍａｘに基づいて判定した事前初期化パターンもパターン３である場合には、３ＭＢの加算の有無に関わらず事前に初期化できるモジュールは変わらない。それに対して、例えば、Ｍ＿ｍａｘに３ＭＢ加算した値に基づいて判定した事前初期化パターンがパターン２であり、Ｍ＿ｍａｘに基づいて判定した事前初期化パターンもパターン３である場合には、空きＲＡＭサイズがあと３ＭＢ確保できると事前初期化できるモジュールが増え、処理時間が削減される。Ｍ＿ｍａｘに３ＭＢ加算した値に基づいて判定した事前初期化パターン（パターン２）の追加処理時間からＭ＿ｍａｘに基づいて判定した事前初期化パターン(パターン３)の追加処理時間を減算し得られた値を、あと３ＭＢ確保することによって短縮できる処理時間であるとし、『空きＲＡＭサイズをあと３ＭＢ確保すれば音声翻訳動作時間を２秒早く実行することができます。』などの表示を行う。 Specifically, for example, a value obtained by adding 3 MB to the actual free RAM size M_max is calculated, and based on this value, the result of determining the pre-initialization pattern (patterns 1 to 8) by the method shown in the flowchart of FIG. It is determined whether it is the same as when 3 MB is not added (when the pre-initialization pattern is determined based on M_max). For example, when the pre-initialization pattern determined based on the value obtained by adding 3 MB to M_max is pattern 3 and the pre-initialization pattern determined based on M_max is also pattern 3, regardless of whether 3 MB is added or not. The modules that can be initialized in advance do not change. On the other hand, for example, when the pre-initialization pattern determined based on the value obtained by adding 3 MB to M_max is pattern 2 and the pre-initialization pattern determined based on M_max is also pattern 3, the free RAM size is If 3 MB can be secured, the number of modules that can be pre-initialized increases and the processing time is reduced. A value obtained by subtracting the additional processing time of the pre-initialization pattern (pattern 3) determined based on M_max from the additional processing time of the pre-initialization pattern (pattern 2) determined based on the value obtained by adding 3 MB to M_max. It is assumed that the processing time can be shortened by securing 3 MB. “If you reserve 3 MB of free RAM, the speech translation operation time can be increased by 2 seconds. "Is displayed.

以上の例では、実質空きＲＡＭサイズに加算する値を３ＭＢとして説明したが、３ＭＢに限らず任意の値で良い。例えば、予め指定された値でも良いし、ユーザによって指定された値でも良い。また、１種類の値で判断するのではなく、複数段階の値（３ＭＢ、５ＭＢ、１０ＭＢなど）を用いて事前初期化パターンを判定し、判定結果を表示しても良い。さらに、実質空きＲＡＭサイズに加算する値を０ＭＢから徐々に（例えば１ＭＢずつ）増やして事前初期化パターンを判定し、実質空きＲＡＭサイズＭ＿ｍａｘに基づいて判定される事前初期化パターンよりも事前に初期化されるモジュールが多い全てのパターンについて、実質空きＲＡＭサイズに加算する最小値を求めても良い。 In the above example, the value added to the actual free RAM size has been described as 3 MB. However, the value is not limited to 3 MB, and may be any value. For example, a value designated in advance or a value designated by the user may be used. Further, instead of making a determination based on one type of value, a pre-initialization pattern may be determined using a plurality of values (3 MB, 5 MB, 10 MB, etc.), and the determination result may be displayed. Further, the value to be added to the actual free RAM size is gradually increased from 0 MB (for example, by 1 MB) to determine the pre-initialization pattern, and the initial value is initialized in advance from the pre-initialization pattern determined based on the real free RAM size M_max. The minimum value to be added to the substantially empty RAM size may be obtained for all patterns having many modules to be converted.

例えば、実質空きＲＡＭサイズＭ＿ｍａｘで判定した事前初期化パターンがパターン３であるときに、パターン３よりも事前初期化されるモジュールの多いパターン２とパターン１を選択するために実質空きＲＡＭサイズＭ＿ｍａｘに少なくとも何ＭＢが必要であるかを判定し、パターン１の事前初期化を行うために必要な追加の空きＲＡＭサイズとパターン２の事前初期化を行うために必要な追加の空きＲＡＭサイズとを表示する。 For example, when the pre-initialization pattern determined by the real free RAM size M_max is the pattern 3, the real free RAM size M_max is selected to select the pattern 2 and the pattern 1 that have more modules to be pre-initialized than the pattern 3. Judge at least how many MBs are needed and display the additional free RAM size required to pre-initialize pattern 1 and the additional free RAM size required to pre-initialize pattern 2 To do.

またこのとき、実行中のアプリケーションプログラム（タスク）のいずれかを終了させることによりＲＡＭ２１の空きＲＡＭサイズを３ＭＢ以上確保できるかをユーザに表示しても良い。あるいは、実行中のアプリケーションプログラム（タスク）を強制終了させて、ＲＡＭ１の空きＲＡＭサイズを確保するようにしても良い。これらは、ユーザが事前に動作モードを設定できるようにしても良い。 At this time, it may be displayed to the user whether or not the free RAM size of the RAM 21 can be secured by 3 MB or more by terminating any of the running application programs (tasks). Alternatively, the application program (task) being executed may be forcibly terminated to secure the free RAM size of the RAM 1. These may be configured so that the user can set the operation mode in advance.

本発明に係る情報処理装置（携帯電話機１）によると、スタンドアロンでの音声翻訳機能を備えているとともに、使用可能なＲＡＭ２１のＲＡＭサイズに制約がある場合においても、各々のモジュールの初期化タイミングを制御することで、音声翻訳処理時間を最小限に抑えることが可能となる。 The information processing apparatus (mobile phone 1) according to the present invention has a stand-alone speech translation function and the initialization timing of each module even when the RAM size of the usable RAM 21 is limited. By controlling, it is possible to minimize the speech translation processing time.

本発明の説明として、携帯電話機１について説明したが、これに限らず、ＰＨＳ（ＰｅｒｓｏｎａｌＨａｎｄｙｐｈｏｎｅＳｙｓｔｅｍ）、ＰＤＡ（ＰｅｒｓｏｎａｌＤｉｇｉｔａｌＡｓｓｉｓｔａｎｔｓ）、携帯ゲーム機、携帯音楽録再機など、翻訳機能を備えている情報処理装置であれば、任意の情報処理装置であっても良い。また、本発明は全ての低スペック機器に適用できるものであって、携帯端末に限定されず、組み込み機器一般や、テレビやパソコンの電気機器一般にも適用できる。 As the description of the present invention, the mobile phone 1 has been described. However, the present invention is not limited to this, and has a translation function such as a PHS (Personal Handyphone System), a PDA (Personal Digital Assistant), a portable game machine, and a portable music recording / reproducing machine. Any information processing apparatus may be used as long as it is an information processing apparatus. Further, the present invention can be applied to all low-spec devices, and is not limited to portable terminals, and can be applied to general embedded devices and general electric devices such as televisions and personal computers.

１…携帯電話機，１０…上筐体，１１…下筐体，１２…表示装置，１２ａ…画面，１２ｂ…翻訳前言語表示欄，１２ｃ…翻訳後言語表示欄，１２ｄ…認識開始／終了ボタン，１２ｅ…出力開始ボタン，１３…スピーカ，１４…マイクロフォン、１５…入力装置，２０…ＣＰＵ，２１…ＲＡＭ，２２…ＲＯＭ，２３…通信装置，２３ａ…アンテナ，３０…言語Ａ音声認識部（ＡＳＲ＿Ａ），３１…Ａ−Ｂ機械翻訳部（ＭＴ＿ＡＢ），３２…言語Ｂ音声合成部（ＴＴＳ＿Ｂ），３３…言語Ｂ音声認識部（ＡＳＲ＿Ｂ），３４…Ｂ−Ａ機械翻訳部（ＭＴ＿Ｂ），３５…言語Ａ音声合成部（ＴＴＳ＿Ａ），４０…モジュールリスト情報，４１…初期化時間情報，４２…初期化パターン情報，４３…識別符号と初期化状態との関係を示す対応表，４４…初期化状態情報，４５…履歴情報，４６…初期化パターンの付加情報。 DESCRIPTION OF SYMBOLS 1 ... Mobile phone, 10 ... Upper housing | casing, 11 ... Lower housing | casing, 12 ... Display apparatus, 12a ... Screen, 12b ... Language display column before translation, 12c ... Language display column after translation, 12d ... Recognition start / end button, 12e ... Output start button, 13 ... Speaker, 14 ... Microphone, 15 ... Input device, 20 ... CPU, 21 ... RAM, 22 ... ROM, 23 ... Communication device, 23a ... Antenna, 30 ... Language A speech recognition unit (ASR_A) , 31 ... AB machine translation unit (MT_AB), 32 ... language B speech synthesis unit (TTS_B), 33 ... language B speech recognition unit (ASR_B), 34 ... BA machine translation unit (MT_B), 35 ... language A speech synthesizer (TTS_A), 40 ... module list information, 41 ... initialization time information, 42 ... initialization pattern information, 43 ... correspondence table showing relationship between identification codes and initialization states, 44 ... Synchronize state information, 45 ... history information 46 ... additional information initialization pattern.

Claims

An information processing apparatus having a plurality of modules related to translation processing,
A first calculating means for calculating an available RAM area that can be used for the translation process when the translation process is executed;
The free space and the RAM size required when each of the plurality of modules performs processing, the RAM size occupied when each of the plurality of modules is initialized in advance, and each of the plurality of modules are determined in advance. Selection means for selecting a module that can be initialized in advance from the plurality of modules based on the time required for initialization
First initialization means for initializing a module selected by the selection means before translation processing is executed;
When any module performs processing in accordance with the execution of the translation processing, if this module has not been initialized by the first initialization means, the second module that initializes this module at the start of processing of this module Initialization means;
An information processing apparatus comprising:

The selection means is a module or a plurality of modules that can be initialized in advance in a free RAM area that can be used for translation processing, and the module or modules that have the shortest additional processing time are initialized in advance. The information processing apparatus according to claim 1, wherein the information processing apparatus is selected as a module that performs conversion.

The information processing apparatus according to claim 1, wherein the plurality of modules include a module that performs speech recognition, a module that performs machine translation, and a module that performs speech synthesis.

Storage means for storing the frequency of use of a plurality of modules related to the translation processing for each module;
The selecting means selects one module or a plurality of modules having the shortest additional processing time based on the time required for the modules to perform initialization in advance and the frequency of use of each of the plurality of modules. The information processing apparatus according to claim 2, wherein the determination is made.

The translation process is a bidirectional translation process,
A translation direction selection means for selecting a direction of translation processing;
The selection means is a RAM free space that can be used for the translation processing calculated by the first calculation means to select a module to be initialized in advance among a plurality of modules used for translation processing in a direction not selected. The information processing apparatus according to claim 1, wherein the information processing apparatus uses a region.

The first initialization means is a module used for translation processing in a direction not selected when translation processing is executed, and is a module that performs initialization in advance selected by the selection means. Initialize a module that can be initialized while executing a translation process in the selected direction, and a module that cannot be initialized while executing a translation process in the selected direction 6. The information processing apparatus according to claim 5, wherein prior initialization is performed after a direction is selected.