JP2021056810A

JP2021056810A - Information processing device, image forming device, information processing method, and program

Info

Publication number: JP2021056810A
Application number: JP2019179836A
Authority: JP
Inventors: 勝彦穐田; Katsuhiko Akita
Original assignee: Konica Minolta Inc
Current assignee: Konica Minolta Inc
Priority date: 2019-09-30
Filing date: 2019-09-30
Publication date: 2021-04-08

Abstract

To provide a technique that enables users, even those unfamiliar with information processing devices, to execute jobs based on voice input. SOLUTION: An information processing device includes an acquisition unit 102 configured to acquire setting contents of a job, a generation unit 104 configured to generate voice data of speech of wording indicating the setting contents acquired by the acquisition unit, and a transmission unit 106 configured to transmit the voice data to an external device that outputs the speech based on the voice data. SELECTED DRAWING: Figure 11

Description

The present disclosure relates to an information processing device, an image forming device, an information processing method, and a program.

Some information processing devices display a setting screen on which a user inputs various setting contents. Such a device executes a job based on the input setting contents. An information processing device can also execute a job based on voice input from the user (see, for example, Patent Document 1).

Patent Document 1: Japanese Unexamined Patent Publication No. 2015-166912

In general, having an information processing device execute a job based on voice input places a lighter burden on the user than manually entering the setting contents on a setting screen. To have the device execute a job based on voice input, however, the user must speak the wording of the setting contents aloud, and therefore must remember the setting contents. A user who is unfamiliar with the information processing device does not remember them. With the invention described in Patent Document 1, such an unfamiliar user may be unable to have the information processing device execute a job based on voice input.

The present disclosure has been made to solve the problem described above. An object in one aspect is to provide a technique that allows even a user who is unfamiliar with an information processing device to have the device execute a job based on voice input.

According to one aspect of the present disclosure, there is provided an information processing device including: an acquisition unit that acquires setting contents of a job; a generation unit that generates voice data of speech of wording indicating the setting contents acquired by the acquisition unit; and a transmission unit that transmits the voice data to an external device that outputs speech based on the voice data.

In one aspect, the acquisition unit acquires a plurality of setting contents, the generation unit generates voice data for each of the plurality of setting contents, and the transmission unit transmits the voice data for each of the plurality of setting contents to an external device that outputs speech based on each piece of that voice data.

In one aspect, the information processing device further includes a microphone to which voice is input, and a first promotion unit that executes a first promotion process prompting the user to input the user's voice into the microphone. The generation unit generates new voice data by adding voice data of the user's voice, input to the microphone after the first promotion process is executed, to the voice data generated by the generation unit. The transmission unit transmits the new voice data to an external device that outputs speech based on the new voice data.

In one aspect, when speech based on the new voice data is input to the microphone from the external device, the first promotion unit executes a second promotion process prompting the user to input the user's voice into the microphone. The information processing device further includes a first execution unit that, when the feature information of the user's voice input to the microphone after the second promotion process is executed is identical to the feature information of the user's voice contained in the speech based on the new voice data, executes a job based on the setting contents indicated by the speech based on the new voice data.

According to another aspect of the present disclosure, there is provided an information processing device including: a microphone to which voice is input; a first promotion unit that, when speech of wording indicating setting contents of a job and the user's voice are input to the microphone, prompts the user to input the user's voice into the microphone; and a first execution unit that, when the feature information of the user's voice input to the microphone before the prompting by the first promotion unit is identical to the feature amount of the user's voice input to the microphone after the prompting, executes a job based on the job setting contents indicated by the speech input to the microphone before the prompting.

In one aspect, the feature information of the voice is a voiceprint.
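The voiceprint check described in the preceding aspects can be sketched as a gate placed in front of job execution. This is only an illustration under stated assumptions: the disclosure says the two pieces of feature information must be "identical", while the sketch treats a voiceprint as a numeric feature vector and accepts a match when the cosine similarity exceeds a threshold. The vector representation, the threshold value, and the function names are all hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def is_same_speaker(print_a, print_b, threshold=0.95):
    """Assumption: 'identical' feature information is approximated by a
    similarity threshold, because two recordings never match exactly."""
    return cosine_similarity(print_a, print_b) >= threshold


def execute_if_verified(embedded_print, live_print, settings, run_job):
    """Run the job only when the voice embedded in the played-back voice data
    and the voice spoken after the prompt belong to the same speaker."""
    if is_same_speaker(embedded_print, live_print):
        run_job(settings)
        return True
    return False


executed = []
ok = execute_if_verified([1.0, 0.0, 0.2], [0.98, 0.05, 0.21],
                         {"job_type": "scan"}, executed.append)
rejected = execute_if_verified([1.0, 0.0, 0.2], [0.0, 1.0, 0.0],
                               {"job_type": "scan"}, executed.append)
```

With these sample vectors the first call runs the job and the second is refused, so a recording played back by someone other than the original speaker does not trigger a job.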

In one aspect, the generation unit generates voice data based on content having fewer characters than the wording indicating the setting contents acquired by the acquisition unit, and the transmission unit transmits the voice data to the external device so that the external device outputs speech based on the voice data with the fewer characters. The information processing device according to any one of claims 1 to 4.
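One way to picture the shortened wording is a lookup table that maps the full wording to a shorter phrase, together with the reverse lookup used when the shorter phrase is heard. The table contents and the function names below are hypothetical; the disclosure does not specify how the shorter content is chosen.

```python
# Hypothetical conversion table: full wording -> shorter spoken phrase.
CONVERSION_TABLE = {
    "resolution 300 dpi scan job": "scan preset one",
}
REVERSE_TABLE = {short: full for full, short in CONVERSION_TABLE.items()}


def shorten(wording):
    """Return the shorter phrase when one exists and is actually shorter."""
    short = CONVERSION_TABLE.get(wording, wording)
    return short if len(short) < len(wording) else wording


def expand(spoken):
    """Recover the full wording from a recognized short phrase."""
    return REVERSE_TABLE.get(spoken, spoken)


full = "resolution 300 dpi scan job"
short = shorten(full)
restored = expand(short)
```

Wording that has no table entry passes through both functions unchanged, so the shortening stays optional per setting.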

In one aspect, the information processing device includes a second execution unit that executes a job based on the setting contents indicated by the speech output from the external device. The acquisition unit acquires a change to the setting contents of the job, and the second execution unit executes the job based on the setting contents in which the change acquired by the acquisition unit is reflected.

According to another aspect of the present disclosure, there is provided an information processing device including: a microphone to which voice is input; an execution unit that, when speech of wording indicating setting contents of a job is input to the microphone, executes a job based on the setting contents indicated by the speech; and an acquisition unit that acquires a change to the setting contents of the job, wherein the execution unit executes the job based on the setting contents in which the change input to the acquisition unit is reflected.

In one aspect, the information processing device further includes a second promotion unit that prompts the user to input a change to the setting contents into the acquisition unit.
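Reflecting a change in the setting contents before execution can be sketched as a dictionary merge: the settings recognized from the speech form the base, and any change entered by the user overrides the matching item. The dictionary layout and the function name are assumptions made for illustration.

```python
def apply_changes(recognized, changes):
    """Return new setting contents with the user's changes reflected.

    The recognized settings are left untouched so that the original voice
    data still describes the first job.
    """
    merged = dict(recognized)
    merged.update(changes)
    return merged


recognized = {"job_type": "scan", "resolution": "300dpi"}
changed = apply_changes(recognized, {"resolution": "600dpi"})
unchanged = apply_changes(recognized, {})
```

Passing an empty change set returns the recognized settings as they were, which corresponds to the user declining the prompt to make a change.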

According to another aspect of the present disclosure, there is provided a program for causing a computer to execute: a selection acceptance procedure of accepting selection of setting contents of a job to be executed by an information processing device; a decision acceptance procedure of accepting confirmation of the selected setting contents; and an output procedure of outputting speech of wording indicating the confirmed setting contents.

In one aspect, the selection acceptance procedure accepts the selection of setting contents when the user designates setting contents displayed in a display area.

In one aspect, the program further causes the computer to execute a request procedure of requesting, from a server device, voice data based on the confirmed setting contents. The output procedure outputs speech from the voice data received from the server device through the request procedure. The program further causes the computer to execute a storage procedure of storing, in a predetermined area, the voice data received from the server device through the request procedure.

In one aspect, the program further causes the computer to execute a change procedure of changing the setting contents selected in the selection acceptance procedure into setting contents suited to the jobs that the information processing device can execute.
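The change procedure can be illustrated as a function that adapts the selected setting contents to what a given device supports. The rules used here are assumptions, not taken from the disclosure: an unsupported numeric value is replaced by the nearest supported value, and a setting item the device does not have is dropped.

```python
def adapt_to_device(selected, supported):
    """Adapt selected settings to a device's capabilities.

    selected:  setting item -> requested numeric value
    supported: setting item -> list of values the device accepts
    """
    adapted = {}
    for name, value in selected.items():
        options = supported.get(name)
        if options is None:
            continue  # assumption: items the device lacks are dropped
        if value in options:
            adapted[name] = value
        else:
            # assumption: fall back to the closest supported value
            adapted[name] = min(options, key=lambda option: abs(option - value))
    return adapted


device_capabilities = {"resolution_dpi": [200, 300, 600]}
adapted = adapt_to_device({"resolution_dpi": 400, "staple": 1}, device_capabilities)
```

Here a requested 400 dpi becomes 300 dpi and the stapling item disappears, because the hypothetical device offers neither.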

According to another aspect of the present disclosure, there is provided an image forming device including the information processing device described above.

According to another aspect of the present disclosure, there is provided an information processing method including: an acquisition step of acquiring setting contents of a job; a generation step of generating voice data of speech of wording indicating the setting contents acquired in the acquisition step; and a transmission step of transmitting the voice data to an external device so that the external device outputs speech based on the voice data.

According to the present disclosure, even a user who is unfamiliar with an information processing device can have the device execute a job based on voice input.

The above and other objects, features, aspects, and advantages of the invention will become apparent from the following detailed description of the invention, which is to be understood in connection with the accompanying drawings.

A diagram showing an application example of the image forming system of the present embodiment.
A diagram showing the relationship between the MFP and the information processing device.
A block diagram showing an example of the hardware configuration of the MFP.
A block diagram showing an example of the hardware configuration of the external device.
An example of a screen for setting the resolution of image data sent by an email sending job.
An example of a screen for setting the file format of image data sent by an email sending job.
An example of a screen for setting the destination of image data sent by an email sending job.
An example of a screen on which the email send button is displayed.
An example of a voice screen displayed by the external device.
An example of a table defining the destinations of voice data.
A diagram for explaining a functional configuration example of the MFP and the external device.
A diagram showing an example of the processing flow of the image forming system.
The processing flow of the MFP and the external device of the second embodiment.
An example of a voice screen displayed by the external device of the second embodiment.
The processing flow of the MFP and the external device of the third embodiment.
A diagram showing a functional configuration example of the MFP and the external device of the fourth embodiment.
The processing flow of the MFP and the external device of the fourth embodiment.
The processing flow of the MFP and the external device of the fourth embodiment.
An example of a conversion table.
The processing flow of the MFP and the external device of the sixth embodiment.
An example of a screen for inputting a change to the setting contents.
A diagram showing a functional configuration example of the MFP and the external device of the seventh embodiment.
A diagram showing the processing flow of the MFP and the external device of the sixth embodiment.
A screen on which a list of jobs is displayed.
An example of a voice screen corresponding to an email sending job.
A diagram showing the functions of three MFPs.
An example of a voice screen corresponding to a copy transmission job.
A diagram showing a configuration example of the image forming system of the eighth embodiment.
The processing flow of the external device 20A of the eighth embodiment.

An information processing device, an image forming device, and the like according to embodiments of the present invention are described below with reference to the drawings. Where the embodiments refer to a number, quantity, or the like, the scope of the present invention is not necessarily limited to that number or quantity unless otherwise stated. Identical or equivalent parts are given the same reference numbers, and duplicate descriptions may be omitted. It is intended from the outset that at least parts of the configurations of the embodiments may be combined as appropriate. In the following, the image forming device is referred to as the MFP 10 (multifunction peripheral/product/printer).

[First Embodiment]
<Application example of the MFP of this embodiment>
First, an application example of the MFP in one aspect is described. The MFP of the present embodiment can execute a copy job, a print job, a scan job, a fax job, and an email sending job. A scan job scans image data of a document set on the MFP by the user. A copy job prints, on the MFP, the image data scanned by a scan job. A print job is, for example, a job of printing image data stored in the MFP. A fax job sends a fax to another terminal device. An email sending job sends image data specified by the user to an external device. The external device is a PC (personal computer), a tablet, a smartphone, a wearable device that the user can put on and take off, or the like. The external device is preferably a mobile terminal that the user can carry, such as a smartphone.

The MFP of the present embodiment accepts input of job setting contents from the user. The job setting contents include the job type and the setting items of that job. The MFP 10 displays a setting screen on an operation panel (that is, a touch panel) described later. The MFP 10 accepts input of the job setting contents through the user's manual input on the setting screen and through the user's voice input. In general, entering job setting contents by voice is less of a burden on the user than entering them manually on the setting screen.
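The setting contents described above, a job type together with that job's setting items, could be modeled as a small record. The class and field names below are hypothetical and serve only to make the structure concrete.

```python
from dataclasses import dataclass, field


@dataclass
class JobSettings:
    """Setting contents of one job: the job type plus its setting items."""
    job_type: str                              # e.g. "scan", "copy", "email"
    items: dict = field(default_factory=dict)  # setting item -> value


# The same record can come from manual input on the setting screen
# or from the result of voice input.
first_job = JobSettings(job_type="scan", items={"resolution": "300x300dpi"})
second_job = JobSettings(job_type="scan", items={"resolution": "300x300dpi"})
same_settings = first_job == second_job
```

Because dataclasses compare field by field, two jobs entered by different means compare equal whenever their type and items agree.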

The user looks at the setting screen and enters settings on it manually, thereby having the MFP execute a first job based on those settings. Afterwards, the user may want the MFP to execute, by voice input, a second job with the same settings. Here, the "second job" means, for example, a job executed after the first job.

A user who is accustomed to handling the MFP (hereinafter also referred to as a "first user") remembers the setting contents. The first user can therefore speak the wording of the setting contents displayed on the setting screen without looking at the screen. As a result, the first user can have the MFP execute the second job by voice input.

However, some users are unfamiliar with the MFP (hereinafter also referred to as "second users"). A second user does not remember the setting contents. When looking at the setting screen, the second user can speak the wording of the setting contents displayed there; without looking at the screen, the second user cannot. For example, when the setting contents include a difficult term such as "resolution", the second user cannot say "resolution" without seeing the word on the setting screen. If the second user cannot properly speak the wording of the setting contents, the MFP cannot be made to execute the second job based on voice input. Thus, for example, if the second user has forgotten the setting contents entered for the first job, the second job cannot be executed on the MFP by voice input. The same problem can arise when the second user does not understand the setting items entered on the setting screen for the first job.

With the MFP of the present embodiment, even if the second user has forgotten the setting contents entered for the first job, or does not understand them, the second user can have the MFP execute a second job with those setting contents by voice input.

Accordingly, the "second user" of the present embodiment is assumed to be a user who can understand the setting items of the MFP 10 while looking at the setting screen, but who cannot utter the wording of those setting items without looking at it.

FIG. 1 is a diagram showing an application example of the image forming system 1000 of the present embodiment. Referring to FIG. 1, the image forming system 1000 includes an MFP 10 and an external device 20. In the example of FIG. 1, the first job and the second job are both of the type "scan job", the setting contents are "image resolution" and "scan job", and the resolution is "300 x 300 dpi".

First, in step (1), the user enters the setting contents of the first job on the setting screen of the MFP 10 in order to have the MFP 10 execute the first job. The setting content is assumed to be a resolution of "300 x 300 dpi". The MFP 10 acquires the entered setting contents.

Next, after entering the setting contents into the MFP 10, the user performs a start operation on the MFP 10 to start the job. In step (2), when the MFP 10 accepts the start operation, it executes a job (the first job) based on the setting contents entered in step (1).

Next, the MFP 10 generates speech by synthesizing the wording of the setting contents entered by the user. In the example of FIG. 1, the setting contents are "image resolution" and "scan job", so the MFP 10 generates the speech "kaizoudo sanbyaku dii-pii-ai sukyan jobu" (Japanese for "resolution 300 dpi scan job"). The MFP 10 then generates voice data for this speech. In step (3), the MFP 10 transmits the voice data to the external device 20. The transmission means is preferably one that the external device 20 has even without the image forming system 1000 of the present embodiment being introduced, that is, an existing means of the external device 20. Examples include NFC (Near Field Communication) and email. The external device 20 receives the voice data and stores it in a predetermined storage area.
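The generation of wording from the setting contents can be sketched as a template lookup followed by synthesis. The English templates, the dictionary names, and the `synthesize` placeholder are assumptions: a real device would call a speech synthesis engine, whereas the placeholder simply encodes the wording as bytes so that the sketch stays self-contained.

```python
# Hypothetical wording templates for setting items and job types.
ITEM_WORDING = {"resolution": "resolution {value}"}
JOB_WORDING = {"scan": "scan job", "email": "email sending job"}


def settings_to_wording(job_type, items):
    """Build the spoken wording: setting items first, then the job type."""
    parts = [ITEM_WORDING[name].format(value=value) for name, value in items.items()]
    parts.append(JOB_WORDING[job_type])
    return " ".join(parts)


def synthesize(wording):
    """Placeholder for speech synthesis; returns bytes standing in for audio."""
    return wording.encode("utf-8")


wording = settings_to_wording("scan", {"resolution": "300 dpi"})
voice_data = synthesize(wording)  # this is what would be sent in step (3)
```

The resulting wording mirrors the example in the text, with the setting item spoken before the job type.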

Suppose the user later wants the MFP 10 to execute a second job by voice input. Because the second user is unfamiliar with the MFP 10, the second user cannot recall the setting contents of the first job without looking at the setting screen of the MFP 10, and in some cases cannot understand them either. In step (4), the user therefore performs a voice output operation on the external device 20. The voice output operation is an operation for causing the external device 20 to output speech based on the voice data transmitted from the MFP 10 in step (3). In the present embodiment, the speech based on the voice data is assumed to be machine speech or synthesized speech.

In step (5), when the external device 20 accepts the voice output operation, it outputs machine speech based on the voice data. In the example of FIG. 1, the external device 20 outputs the machine speech "kaizoudo sanbyaku dii-pii-ai sukyan jobu", which FIG. 1 abbreviates as "kaizoudo ...". The MFP 10 accepts this machine speech as input.

In step (6), on accepting the input of the machine speech, the MFP 10 performs speech recognition on it and executes a job based on the recognition result. That is, in step (6), the MFP 10 executes a job based on the machine speech. In the example of FIG. 1, the MFP 10 executes a scan job with a resolution of "300 x 300 dpi".

In the example of FIG. 1, even if the second user has, for instance, forgotten the setting contents entered for the first job, the second user can have the MFP execute a second job with those setting contents by voice input.
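Steps (1) to (6) of FIG. 1 can be traced end to end with two small stand-in classes. Everything here is a simplification under stated assumptions: the class and method names are hypothetical, the audio is represented by UTF-8 bytes, and speech recognition is replaced by decoding those bytes and splitting the words.

```python
class ExternalDevice:
    """Stand-in for the external device 20 (for example, a smartphone)."""

    def __init__(self):
        self.stored = None

    def receive(self, voice_data):
        self.stored = voice_data      # step (3): store the received voice data

    def play(self):
        return self.stored            # steps (4)-(5): output the machine speech


class Mfp:
    """Stand-in for the MFP 10."""

    def __init__(self):
        self.executed = []

    def run_job(self, job_type, resolution):
        self.executed.append((job_type, resolution))   # steps (2) and (6)

    def make_voice_data(self, job_type, resolution):
        # Placeholder for speech synthesis of the setting wording.
        return f"resolution {resolution} {job_type} job".encode("utf-8")

    def hear(self, audio):
        # Placeholder for speech recognition: "resolution <value> <type> job".
        words = audio.decode("utf-8").split()
        self.run_job(job_type=words[2], resolution=words[1])


mfp = Mfp()
phone = ExternalDevice()
mfp.run_job("scan", "300dpi")                           # steps (1)-(2): first job
phone.receive(mfp.make_voice_data("scan", "300dpi"))    # step (3)
mfp.hear(phone.play())                                  # steps (4)-(6): second job
```

After the last line the MFP stand-in has executed the same scan job twice, the second time without the user having to recall any setting wording.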

<Relationship between the information processing device and the MFP>
FIG. 2 is a diagram showing the relationship between the MFP 10 and the information processing device 100 of the present disclosure. As shown in FIG. 2, the MFP 10 conceptually includes the information processing device 100 of the present disclosure. The information processing device 100 of the present disclosure is not limited to MFPs and may be applied to other devices, for example a PC (personal computer), a tablet, a smartphone, or a wearable device that the user can put on and take off.

<Hardware configuration example of the MFP 10>
FIG. 3 is a block diagram showing an example of the hardware configuration of the MFP 10. Referring to FIG. 3, the MFP 10 includes a controller 31, a fixed storage device 32, a short-range wireless IF (interface) 33, a scan unit 12, an operation panel 34, a paper feed tray 14, a microphone 16, an image forming unit 11, a printer controller 35, a network IF 36, and a speaker 37. The fixed storage device 32, the short-range wireless IF 33, the scan unit 12, the operation panel 34, the paper feed tray 14, the microphone 16, the image forming unit 11, the printer controller 35, the network IF 36, and the speaker 37 are connected to the controller 31 via a bus 38.

The controller 31 includes a CPU (Central Processing Unit) 311, a ROM (Read Only Memory) 312 storing a control program, a working S-RAM (Static Random Access Memory) 313, and a battery-backed NV-RAM (Non-Volatile RAM) 314 that stores various settings related to image formation. The CPU 311, the ROM 312, the S-RAM 313, and the NV-RAM 314 are connected via the bus 38.

The operation panel 34 includes a display device 342 and an input device 344. The display device 342 displays various images, such as the setting screen, and various information. The input device 344 accepts manual input operations and the like from the user. In the operation panel 34, the display device 342 and the input device 344 are formed as a single unit. The operation panel 34 is typically a touch panel.

The network IF 36 transmits and receives various information to and from the external device 20 and other devices connected via a network 39.

The printer controller 35 generates a copy image from print data received via the network IF 36. The image forming unit 11 forms the copy image on paper.

The fixed storage device 32 is typically a hard disk device. Various types of data are stored in the fixed storage device 32.

The controller 31 may be composed of, for example, at least one integrated circuit. The integrated circuit may be composed of, for example, at least one CPU (Central Processing Unit), MPU (Micro Processing Unit), or GPU (Graphics Processing Unit), at least one ASIC (Application Specific Integrated Circuit), at least one FPGA (Field Programmable Gate Array), or a combination thereof.

＜外部装置２０のハードウェア構成例＞
図４は、外部装置２０のハードウェア構成の一例を表したブロック図である。図４を参照して、外部装置２０は、コントローラー４１と、操作パネル４４と、マイク４６と、ネットワークＩＦ４６と、スピーカー４７とを有する。コントローラー４１には、と、短距離無線ＩＦ（Inter Face）４４と、スキャンユニット１２と、操作パネル４４と、マイク１６と、ネットワークＩＦ４６と、スピーカー４７がバス４８を介して接続されている。 <Hardware configuration example of external device 20>
FIG. 4 is a block diagram showing an example of the hardware configuration of the external device 20. With reference to FIG. 4, the external device 20 includes a controller 41, an operation panel 44, a microphone 46, a network IF 46, and a speaker 47. A short-range wireless IF (Interface) 44, a scan unit 12, an operation panel 44, a microphone 16, a network IF 46, and a speaker 47 are connected to the controller 41 via a bus 48.

コントローラー４１は、ＣＰＵ４１１と、制御プログラムの格納されたＲＯＭ４１２と、作業用のＳ−ＲＡＭ４１３とを有する。ＣＰＵ４１１と、ＲＯＭ４１２と、Ｓ−ＲＡＭ４１３とは、バス４８を介して接続されている。 The controller 41 has a CPU 411, a ROM 412 in which a control program is stored, and a working S-RAM 413. The CPU 411, the ROM 412, and the S-RAM 413 are connected via the bus 48.

操作パネル４４は、表示装置４４２と入力装置４４４とを有する。表示装置４４２は、様々な画面等の様々な画像、および様々な情報を表示する。入力装置４４４は、ユーザーからの手入力の操作等を受付ける。操作パネル４４は、表示装置４４２と入力装置４４４とが一体的に構成されている。操作パネル４４は、典型的には、タッチパネルである。 The operation panel 44 has a display device 442 and an input device 444. The display device 442 displays various images such as various screens and various information. The input device 444 accepts manual input operations and the like from the user. The operation panel 44 is integrally composed of a display device 442 and an input device 444. The operation panel 44 is typically a touch panel.

ネットワークＩＦ４６は、ネットワーク３９を介して接続されたＭＦＰ１０等との間で各種の情報を送受信する。 The network IF46 transmits and receives various information to and from the MFP 10 and the like connected via the network 39.

また、コントローラー４１は、たとえば、少なくとも１つの集積回路によって構成されてもよい。集積回路は、たとえば、少なくとも１つのＣＰＵ、ＭＰＵ、少なくとも１つのＡＳＩＣ、少なくとも１つのＦＰＧＡ、またはそれらの組み合わせなどによって構成されるようにしてもよい。 Further, the controller 41 may be composed of, for example, at least one integrated circuit. The integrated circuit may be composed of, for example, at least one CPU, MPU, at least one ASIC, at least one FPGA, or a combination thereof.

＜設定画面＞
次に、ＭＦＰ１０の表示装置３４２が表示する設定画面を説明する。ユーザーが、メニュー画面を表示させる操作をＭＦＰ１０の入力装置３４４に行うと、表示装置３４２はメニュー画面（特に図示せず）を表示する。メニュー画面は、例えば、コピージョブの画像、プリントジョブの画像、スキャンジョブの画像、ファクスジョブの画像、およびメール送信ジョブの画像が表示される。ＭＦＰ１０は、これらの画像のうちユーザーにより決定された画像に対応するジョブを実行する。例えば、ユーザーが、メール送信ジョブの画像を決定した場合には、ＭＦＰ１０は、メール送信ジョブを実行する。 <Setting screen>
Next, the setting screen displayed by the display device 342 of the MFP 10 will be described. When the user performs an operation to display the menu screen on the input device 344 of the MFP 10, the display device 342 displays the menu screen (not particularly shown). On the menu screen, for example, an image of a copy job, an image of a print job, an image of a scan job, an image of a fax job, and an image of a mail sending job are displayed. The MFP 10 executes a job corresponding to an image determined by the user among these images. For example, when the user determines the image of the mail sending job, the MFP 10 executes the mail sending job.

図５〜図８は、メール送信ジョブのユーザーによる設定を行うための設定画面の一例である。コントローラー３１は、図５〜図８の設定画面を、表示装置３４２の表示領域３４２Ａに表示させる。ユーザーは、図５〜図８の設定画面において、設定内容を入力する事項が、図１のステップ（１）に対応する。 5 to 8 are examples of setting screens for setting by the user of the mail transmission job. The controller 31 displays the setting screens of FIGS. 5 to 8 in the display area 342A of the display device 342. The matter of inputting the setting contents on the setting screens of FIGS. 5 to 8 corresponds to the step (1) of FIG.

図５は、メール送信ジョブにより送信する画像データの解像度を設定するための画面である。図６は、メール送信ジョブにより送信する画像データのファイル形式を設定するための画面である。図７は、メール送信ジョブにより送信する画像データの宛先を設定するための画面である。図８は、メール送信ボタンが表示される画面である。メール送信ボタンは、ユーザーによる設定でメール送信ジョブを実行させるためのボタンである。 FIG. 5 is a screen for setting the resolution of the image data transmitted by the mail transmission job. FIG. 6 is a screen for setting the file format of the image data transmitted by the mail transmission job. FIG. 7 is a screen for setting the destination of the image data to be transmitted by the mail transmission job. FIG. 8 is a screen on which the mail transmission button is displayed. The mail sending button is a button for executing the mail sending job by the setting by the user.

図５の設定画面では、解像度に関する画像が表示される。図５の例では、２００×２００ｄｐｉの解像度に関する画像、３００×３００ｄｐｉの解像度に関する画像、４００×４００ｄｐｉの解像度に関する画像、６００×６００ｄｐｉの解像度に関する画像と、およびＯＫボタン５０４とが表示されている。 On the setting screen of FIG. 5, an image related to the resolution is displayed. In the example of FIG. 5, an image having a resolution of 200 × 200 dpi, an image having a resolution of 300 × 300 dpi, an image having a resolution of 400 × 400 dpi, an image having a resolution of 600 × 600 dpi, and an OK button 504 are displayed.

図５に示される解像度に関する複数の画像のうち、ユーザーにより選択された（つまり、ユーザーによりタッチされた）画像の枠線が太く表示される。図５の例では、３００×３００ｄｐｉの解像度に関する画像５０３が選択されているとする。また、解像度に関する画像が選択されている状態で、ユーザーが、ＯＫボタン５０４をタッチすると、ＭＦＰ１０は、該選択されている画像に対応する解像度を設定する。図５の例では、ＭＦＰ１０は、３００×３００ｄｐｉの解像度を設定する。また、解像度に関する画像が選択されている状態で、ユーザーが、ＯＫボタン５０４をタッチすると、表示装置３４２は、図６の設定画面を表示する。 Of the plurality of images with respect to resolution shown in FIG. 5, the border of the image selected by the user (that is, touched by the user) is displayed thickly. In the example of FIG. 5, it is assumed that the image 503 with respect to the resolution of 300 × 300 dpi is selected. Further, when the user touches the OK button 504 while the image related to the resolution is selected, the MFP 10 sets the resolution corresponding to the selected image. In the example of FIG. 5, the MFP 10 sets a resolution of 300 × 300 dpi. Further, when the user touches the OK button 504 while the image related to the resolution is selected, the display device 342 displays the setting screen of FIG.

図６の設定画面では、画像データのファイル形式に関する画像が表示される。図６の例では、ＰＤＦ（Portable Document Format）形式に関する画像、コンパクトＰＤＦ形式に関する画像等が表示されている。 On the setting screen of FIG. 6, an image related to the file format of the image data is displayed. In the example of FIG. 6, an image related to the PDF (Portable Document Format) format, an image related to the compact PDF format, and the like are displayed.

図６に示されるファイル形式に関する複数の画像のうち、ユーザーにより選択された（つまり、ユーザーによりタッチされた）画像の枠線が太く表示される。図６の例では、コンパクトＰＤＦに関する画像５０６が選択されているとする。また、ファイル形式に関する画像が選択されている状態で、ユーザーが、ＯＫボタン５０４をタッチすると、ＭＦＰ１０は、該選択されている画像に対応するファイル形式を設定する。図６の例では、ＭＦＰ１０は、コンパクトＰＤＦのファイル形式を設定する。また、ファイル形式に関する画像が選択されている状態で、ユーザーが、ＯＫボタン５０４をタッチすると、表示装置３４２は、図７の設定画面を表示する。 Of the plurality of images related to the file format shown in FIG. 6, the border of the image selected by the user (that is, touched by the user) is displayed thickly. In the example of FIG. 6, it is assumed that the image 506 relating to the compact PDF is selected. Further, when the user touches the OK button 504 while the image related to the file format is selected, the MFP 10 sets the file format corresponding to the selected image. In the example of FIG. 6, the MFP 10 sets the file format of the compact PDF. Further, when the user touches the OK button 504 while the image related to the file format is selected, the display device 342 displays the setting screen of FIG. 7.

図７の設定画面は、画像データの宛先を設定するための画面である。図７の設定画面は、表示領域５０８と、ソフトキーボード５１０とを含む。図７の例でのソフトキーボード５１０は、「ａ〜ｚ」の英字、「１〜９」の数字、および記号等に対応する複数のソフトキーから構成される。複数の文字は、例えば、それぞれの数字のソフトキー、および「ａ〜ｚ」それぞれの文字のソフトキー等を含む。 The setting screen of FIG. 7 is a screen for setting the destination of the image data. The setting screen of FIG. 7 includes a display area 508 and a soft keyboard 510. The soft keyboard 510 in the example of FIG. 7 is composed of a plurality of soft keys corresponding to the letters "a" to "z", the numbers "1" to "9", symbols, and the like. The soft keys include, for example, a soft key for each number and a soft key for each of the letters "a" to "z".

ユーザーによりソフトキーがタッチされると、表示領域５０８に、該タッチされたソフトキーに対応する英字、数字、記号のいずれかが表示される。図７の例では、ユーザーは、ソフトキーボード５１０を用いて、「user@test.jp」を入力したとする。この場合には、表示領域５０８に「user@test.jp」が表示される。ユーザーがＯＫボタン５０４をタッチしたときに、ＭＦＰ１０は、表示領域５０８に表示されている情報（つまり、ユーザーがソフトキーボード５１０に入力した情報）を、メールの宛先として設定する。なお、表示領域５０８に表示されている情報が、メールの宛先とはならない情報である場合において、ユーザーがＯＫボタン５０４をタッチしたときには、宛先設定できない旨をユーザーに示すポップアップ画面（特に図示せず）が表示される。 When the user touches a soft key, the letter, number, or symbol corresponding to the touched soft key is displayed in the display area 508. In the example of FIG. 7, it is assumed that the user inputs "user@test.jp" using the soft keyboard 510. In this case, "user@test.jp" is displayed in the display area 508. When the user touches the OK button 504, the MFP 10 sets the information displayed in the display area 508 (that is, the information the user input with the soft keyboard 510) as the mail destination. If the information displayed in the display area 508 cannot serve as a mail destination when the user touches the OK button 504, a pop-up screen (not particularly shown) is displayed to inform the user that the destination cannot be set.
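The destination check described above can be sketched as follows. This is only an illustrative sketch: the description does not say how the MFP 10 decides that the entered text cannot serve as a mail destination, so the regular expression, the function name `try_set_destination`, and the `settings` dictionary are all assumptions introduced here.

```python
import re

# Assumed validity rule (not from the description): "local@domain.tld"
# with no whitespace and exactly one '@'.
_DESTINATION_PATTERN = re.compile(r"[^@\s]+@[^@\s]+\.[^@\s]+")


def try_set_destination(entered_text: str, settings: dict) -> bool:
    """Store the text from display area 508 as the mail destination.

    Returns False when the text cannot serve as a destination; in that
    case the caller would show the "destination cannot be set" pop-up.
    """
    if _DESTINATION_PATTERN.fullmatch(entered_text) is None:
        return False
    settings["destination"] = entered_text
    return True


if __name__ == "__main__":
    job_settings = {}
    print(try_set_destination("user@test.jp", job_settings), job_settings)
    print(try_set_destination("user", {}))
```

Returning a flag instead of raising keeps the pop-up decision with the caller, which mirrors the screen flow in which the OK button 504 either advances to the next screen or shows the pop-up.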

図７の設定画面のＯＫボタン５０４がタッチされた場合において、ＭＦＰ１０が、適切なアドレス先を設定した場合には、ＭＦＰ１０は、図８の画面を表示する。図８の画面には、表示領域５１６と、第１ボタン５１２と、第２ボタン５１４とが表示される。 When the OK button 504 on the setting screen of FIG. 7 is touched and the MFP 10 has set a valid destination address, the MFP 10 displays the screen of FIG. 8. On the screen of FIG. 8, the display area 516, the first button 512, and the second button 514 are displayed.

表示領域５１６は、図７の設定画面で設定された宛先が表示される。ユーザーは、表示領域５１６に表示された宛先を視認することで、図７の設定画面で設定した宛先が正確か否かを確認することができる。第１ボタン５１２の領域内には「音声データ生成、出力Ｅ−ｍａｉｌ送信」という文字が表示されている。第１ボタン５１２は、ＭＦＰ１０が画像をメール送信すること、およびＭＦＰ１０が図５〜図７の画面で設定された設定項目の文字の音声の音声データをユーザーの外部装置２０に送信することを示すボタンである。ユーザーにより第１ボタン５１２が操作されると、ＭＦＰ１０は、設定内容に基づくジョブを実行する。図５〜図８の例では、ＭＦＰ１０は、メール送信ジョブを実行する。これとともに、ＭＦＰ１０は、設定内容に係る文字の音声の音声データを生成する。その後、ＭＦＰは、音声データを外部装置２０に送信する。 In the display area 516, the destination set on the setting screen of FIG. 7 is displayed. By visually recognizing the destination displayed in the display area 516, the user can confirm whether or not the destination set on the setting screen of FIG. 7 is accurate. In the area of the first button 512, the characters "voice data generation, output E-mail transmission" are displayed. The first button 512 indicates that the MFP 10 sends an image by e-mail, and that the MFP 10 sends the voice data of the characters of the setting items set on the screens of FIGS. 5 to 7 to the user's external device 20. It's a button. When the first button 512 is operated by the user, the MFP 10 executes a job based on the setting contents. In the example of FIGS. 5 to 8, the MFP 10 executes the mail transmission job. At the same time, the MFP 10 generates voice data of the voice of the characters related to the setting contents. After that, the MFP transmits the voice data to the external device 20.

ユーザーにより第２ボタン５１４が操作された場合には、ＭＦＰ１０は、設定内容に基づくジョブを実行する。また、ユーザーにより第２ボタン５１４が操作された場合には、ＭＦＰ１０は、音声データの生成および音声データの送信を実行しない。 When the second button 514 is operated by the user, the MFP 10 executes a job based on the setting contents. Further, when the second button 514 is operated by the user, the MFP 10 does not execute the generation of the voice data and the transmission of the voice data.

このように、第１ボタン５１２または第２ボタン５１４がユーザーにより操作された場合には、ＭＦＰ１０は、ユーザーにより設定されたジョブを実行する。このジョブの実行は、図１のステップ（２）に対応する。 In this way, when the first button 512 or the second button 514 is operated by the user, the MFP 10 executes the job set by the user. Execution of this job corresponds to step (2) in FIG.

また、第１ボタン５１２がユーザーにより操作された場合には、ＭＦＰ１０は、ユーザーにより設定された設定内容を示す文字の音声の音声データを外部装置２０に送信する。この音声データの送信は、図１のステップ（３）に対応する。 Further, when the first button 512 is operated by the user, the MFP 10 transmits, to the external device 20, the voice data of the voice of the wording indicating the setting contents set by the user. The transmission of this voice data corresponds to step (3) of FIG. 1.
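The difference between the two buttons on the screen of FIG. 8 can be sketched as below: the first button 512 runs the job and also sends voice data, while the second button 514 only runs the job. The class `MfpSketch`, its list attributes, and the plain-text stand-in for voice data are hypothetical names introduced for illustration and are not part of the description.

```python
from dataclasses import dataclass, field

FIRST_BUTTON = 512   # "voice data generation, output, E-mail transmission"
SECOND_BUTTON = 514  # job execution only


@dataclass
class MfpSketch:
    """Toy model of the MFP 10 that records what it did."""
    executed_jobs: list = field(default_factory=list)
    sent_voice_data: list = field(default_factory=list)

    def on_button(self, button: int, settings: dict) -> None:
        # Both buttons execute the job with the entered settings (step (2)).
        self.executed_jobs.append(dict(settings))
        # Only the first button also generates and sends voice data (step (3)).
        if button == FIRST_BUTTON:
            # Plain text stands in for real audio data in this sketch.
            wording = ", ".join(f"{k} {v}" for k, v in settings.items())
            self.sent_voice_data.append(wording)


if __name__ == "__main__":
    mfp = MfpSketch()
    mfp.on_button(SECOND_BUTTON, {"resolution": "300dpi"})
    mfp.on_button(FIRST_BUTTON, {"resolution": "300dpi"})
    print(len(mfp.executed_jobs), mfp.sent_voice_data)
```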

図５〜図７の例では、ＭＦＰ１０は、解像度が３００×３００ｄｐｉであり、ファイル形式がコンパクトＰＤＦ形式であり、宛先がuser@test.jpであるという設定を行った。さらに、ＭＦＰ１０は、メール送信ジョブを行った。したがって、ＭＦＰ１０は、「かいぞうど、さんびゃくでぃぴーあい、ふぁいるけいしき、こんぱくとぴーでぃーえふ、あてさき、あるふぁべっと、ゆーえすいーあーる、すうじ、いち、あっとまーく、あるふぁべっと、てぃーいーえすてぃーどっとじぇーぴー、に、いーめーるそうしんする」という音声を生成する。ＭＦＰ１０は、最小単位の複数の音声を記憶する。最小単位の音声は、設定内容の最小単位の文言の音声である。例えば、「いち」、「に」、「あてさき」、「かいぞうど」等である。ＭＦＰ１０は、これらの最小単位の音声を合成することにより、音声を生成する。その後、ＭＦＰ１０は、この音声に係る音声データを生成する。 In the examples of FIGS. 5 to 7, the MFP 10 has set a resolution of 300 × 300 dpi, the compact PDF file format, and the destination user@test.jp. Further, the MFP 10 has executed the mail transmission job. Therefore, the MFP 10 generates the voice "kaizoudo, sanbyaku dipiai, fairu keishiki, konpakuto piidiiefu, atesaki, arufabetto, yuu esu ii aaru, suuji, ichi, attomaaku, arufabetto, tii ii esu tii dotto jeepii, ni, iimeeru soushin suru" (that is, "resolution, 300 dpi, file format, compact PDF, destination, alphabet, u-s-e-r, number, 1, at mark, alphabet, t-e-s-t dot jp, send e-mail"). The MFP 10 stores a plurality of minimum-unit voices. A minimum-unit voice is the voice of a minimum unit of wording in the setting contents, for example, "ichi", "ni", "atesaki", and "kaizoudo". The MFP 10 generates the voice by synthesizing these minimum-unit voices. After that, the MFP 10 generates voice data related to this voice.
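As a rough illustration of composing the spoken wording from minimum units, the sketch below turns job settings into an ordered list of unit labels. It is a sketch under stated assumptions: the English labels stand in for the Japanese readings, the rule that groups a destination into "alphabet", "number", and "at mark" runs is inferred from the example utterance, and the function names and setting keys are hypothetical. Real audio synthesis is out of scope here.

```python
import itertools


def _char_class(ch: str) -> str:
    """Classify one destination character into a spoken group."""
    if ch.isdigit():
        return "digit"
    if ch == "@":
        return "at"
    return "alpha"  # letters and '.' are read inside the alphabet run


def spell_destination(address: str) -> list:
    """Split an address into minimum-unit labels, run by run."""
    units = []
    for cls, run in itertools.groupby(address, key=_char_class):
        text = "".join(run)
        if cls == "digit":
            units += ["number", text]
        elif cls == "at":
            units.append("at mark")
        else:
            units += ["alphabet", text]
    return units


def wording_units(settings: dict) -> list:
    """Order the minimum units the way the example utterance does."""
    return (
        ["resolution", settings["resolution"],
         "file format", settings["file_format"],
         "destination"]
        + spell_destination(settings["destination"])
        + ["send e-mail"]
    )


if __name__ == "__main__":
    print(wording_units({
        "resolution": "300 dpi",
        "file_format": "compact PDF",
        "destination": "user1@test.jp",
    }))
```

In a device that stores one audio clip per minimum unit, the returned list would be the lookup order for the clips to concatenate.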

なお、変形例として、ＭＦＰ１０は、複数の最小単位の音声のデータを記憶するようにしてもよい。ＭＦＰ１０は、複数の最小単位の音声のデータを組合せることにより、音声に係る音声データを生成するようにしてもよい。 As a modification, the MFP 10 may store a plurality of minimum units of audio data. The MFP 10 may generate voice data related to voice by combining a plurality of minimum units of voice data.

また、変形例として、ＭＦＰ１０は、複数の最小単位の音声のデータを記憶しないようにしてもよい。ＭＦＰ１０は、最小単位の音声のデータを用いずに、他の手法で設定内容の文言に基づく音声データを直接生成するようにしてもよい。 Further, as a modification, the MFP 10 may not store a plurality of minimum units of audio data. The MFP 10 may directly generate voice data based on the wording of the set contents by another method without using the voice data of the minimum unit.

ＭＦＰ１０は、生成した音声データを外部装置２０に送信する。外部装置２０は、該音声データを記憶する。また、ユーザーが、外部装置２０に対して所定の操作を行ったときに、該音声データの音声を出力させるための音声画面を、外部装置２０は表示する。 The MFP 10 transmits the generated voice data to the external device 20. The external device 20 stores the voice data. Further, when the user performs a predetermined operation on the external device 20, the external device 20 displays an audio screen for outputting the audio of the audio data.

図９は、外部装置２０が表示する音声画面の一例である。コントローラー４１は、図９の音声画面を、表示装置４４２の表示領域４４２Ａに表示させる。図９の音声画面は、音声内容５０２と、再生ボタン５０４とを含む。図９の音声内容５０２は、「解像度：３００ｄｐｉ、コンパクトＰＤＦ、宛先：user1＠test.jp、Ｅ−ｍａｉｌ送信」という内容である。また、図９の音声画面で表示される音声内容は、ユーザーにより選択可能とされている。また、図９の音声画面では、１つの音声内容が表示されているが、２以上の音声内容が表示されるようにしてもよい。 FIG. 9 is an example of an audio screen displayed by the external device 20. The controller 41 displays the audio screen of FIG. 9 in the display area 442A of the display device 442. The audio screen of FIG. 9 includes an audio content 502 and a play button 504. The audio content 502 in FIG. 9 has the content of "resolution: 300 dpi, compact PDF, destination: user1@test.jp, E-mail transmission". Further, the audio content displayed on the audio screen of FIG. 9 can be selected by the user. Further, although one audio content is displayed on the audio screen of FIG. 9, two or more audio contents may be displayed.

ユーザーが、音声内容５０２を選択し（例えば、ユーザーが音声内容５０２をタッチし）、再生ボタン５０４をタッチする。ユーザーが音声内容５０２を選択しかつ再生ボタン５０４への操作が、図１のステップ（４）の音声出力操作に対応する。 The user selects the audio content 502 (for example, the user touches the audio content 502) and touches the play button 504. The operation of the user selecting the audio content 502 and pressing the play button 504 corresponds to the audio output operation of step (4) of FIG.

ユーザーにより再生ボタン５０４が操作された場合には、外部装置２０は、ユーザーにより選択された音声内容５０２に係る音声を出力する。ここでは、外部装置２０は、「かいぞうど、さんびゃくでぃぴーあい、ふぁいるけいしき、こんぱくとぴーでぃーえふ、あてさき、あるふぁべっと、ゆーえすいーあーる、すうじ、いち、あっとまーく、あるふぁべっと、てぃーいーえすてぃーどっとじぇーぴー、に、いーめーるそうしんする」という音声を出力する。この音声の出力が、図１のステップ（５）に対応する。 When the play button 504 is operated by the user, the external device 20 outputs the voice related to the audio content 502 selected by the user. Here, the external device 20 outputs the voice "kaizoudo, sanbyaku dipiai, fairu keishiki, konpakuto piidiiefu, atesaki, arufabetto, yuu esu ii aaru, suuji, ichi, attomaaku, arufabetto, tii ii esu tii dotto jeepii, ni, iimeeru soushin suru". This voice output corresponds to step (5) of FIG. 1.

ＭＦＰ１０のマイク１６に、この音声が入力されると、ＭＦＰ１０は、該音声に対して音声認識を実行する。ＭＦＰ１０は、この音声認識により、ジョブの設定内容を特定する。ＭＦＰ１０は、この音声認識の結果に基づいたジョブを実行する。ここでは、ＭＦＰ１０は、解像度が３００×３００ｄｐｉであり、ファイル形式がコンパクトＰＤＦ形式である画像を、user@test.jpという宛先に対して送信する。この送信は、メール送信ジョブであり、ＭＦＰ１０がこのメール送信ジョブを実行することが図１のステップ（６）に対応する。 When this voice is input to the microphone 16 of the MFP 10, the MFP 10 executes voice recognition for the voice. The MFP 10 specifies the setting contents of the job by this voice recognition. The MFP 10 executes a job based on the result of this voice recognition. Here, the MFP 10 transmits an image having a resolution of 300 × 300 dpi and a file format of the compact PDF format to a destination of user@test.jp. This transmission is a mail transmission job, and the MFP 10 executing this mail transmission job corresponds to step (6) in FIG.

＜音声データの送信先＞
図１のステップ（３）および図５〜図８でも説明したように、ＭＦＰ１０は、音声データを送信する。次に、この音声データの送信先を説明する。図１０は、音声データの送信先が規定されているテーブルの一例である。ＭＦＰ１０は、図１０のテーブルを記憶する。図１０の例では、ユーザーＩＤ毎に、音声データの送信先が対応づけられている。図１０の例では、ユーザーＩＤがＡ１であるユーザーに対しては、音声データの送信先として、「Ｂ１」が対応づけられている。また、図１０の例では、ユーザーＩＤがＡ２であるユーザーに対しては、音声データの送信先として、「Ｂ２」が対応づけられている。 <Destination of audio data>
As described in step (3) of FIG. 1 and FIGS. 5 to 8, the MFP 10 transmits audio data. Next, the destination of this voice data will be described. FIG. 10 is an example of a table in which the destination of voice data is specified. The MFP 10 stores the table of FIG. In the example of FIG. 10, the destination of the voice data is associated with each user ID. In the example of FIG. 10, "B1" is associated with the user whose user ID is A1 as the destination of voice data. Further, in the example of FIG. 10, "B2" is associated with the user whose user ID is A2 as the transmission destination of the voice data.

例えば、図１のステップ（１）において、ユーザーが設定内容を入力する前に、ＭＦＰ１０は、ユーザー認証を実行可能である。ユーザー認証は、ユーザーにユーザーＩＤおよびパスワードをＭＦＰ１０に入力させて、ユーザーＩＤおよびパスワードのそれぞれが正確なものであるか否かを判断する方式としてもよい。また、ユーザー認証は、顔認証、音声認証、および指紋認証のうちのいずれであってもよい。 For example, in step (1) of FIG. 1, the MFP 10 can execute user authentication before the user inputs the setting contents. The user authentication may be a method in which the user is made to input the user ID and the password into the MFP 10 and it is determined whether or not each of the user ID and the password is accurate. Further, the user authentication may be any of face authentication, voice authentication, and fingerprint authentication.

ＭＦＰ１０は、ユーザー認証により、ユーザーＩＤを取得する。ＭＦＰ１０は、図１０のテーブルを参照して、取得したユーザーＩＤに対応する送信先を特定する。ＭＦＰ１０は、音声データを生成すると、特定した送信先に、ステップ（３）において音声データを送信する。外部装置２０は該音声データを受信した後に、外部装置２０は、この音声データを記憶する。 The MFP 10 acquires a user ID by user authentication. The MFP 10 refers to the table of FIG. 10 to specify the destination corresponding to the acquired user ID. When the MFP 10 generates the voice data, the MFP 10 transmits the voice data to the specified destination in the step (3). After the external device 20 receives the voice data, the external device 20 stores the voice data.
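The lookup against the table of FIG. 10 can be sketched as a simple mapping from user ID to destination. The entries "A1" to "B1" and "A2" to "B2" come from the description; the dictionary name, the function name, and the behavior for an unknown user ID (returning `None`) are assumptions made for this sketch.

```python
from typing import Optional

# Table of FIG. 10: user ID -> destination of the voice data.
VOICE_DATA_DESTINATIONS = {
    "A1": "B1",
    "A2": "B2",
}


def resolve_destination(user_id: str) -> Optional[str]:
    """Return the voice-data destination for an authenticated user ID.

    Returns None for an unregistered user ID (assumed behavior; the
    description does not cover this case).
    """
    return VOICE_DATA_DESTINATIONS.get(user_id)


if __name__ == "__main__":
    print(resolve_destination("A1"))
    print(resolve_destination("A9"))
```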

＜ＭＦＰ１０と、外部装置２０との機能構成例＞
図１１は、ＭＦＰ１０と、外部装置２０との機能構成例を説明するための図である。ＭＦＰ１０のコントローラー３１は、取得部１０２と、生成部１０４と、送信部１０６と、音声受付部１１０と、音声認識部１１２と、設定部１１４と、実行部１１６との機能を有する。なお、本実施形態では、便宜上、各ジョブを実行するユニットを分けて表現する場合がある。図１１のジョブユニット１２０は、コピージョブユニット、プリントジョブユニット、スキャンジョブユニット、ファクスジョブユニット、およびメール送信ジョブユニットを含む。コピージョブユニットは、コピージョブを実行するユニットである。プリントジョブユニットは、プリントジョブを実行するユニットである。スキャンジョブユニットは、スキャンジョブを実行するユニットである。ファクスジョブユニットは、ファクスジョブを実行するユニットである。メール送信ジョブユニットは、メール送信ジョブを実行するユニットである。スキャンジョブユニットは、例えば、スキャンユニット１２に対応する。 <Example of functional configuration of MFP 10 and external device 20>
FIG. 11 is a diagram for explaining a functional configuration example of the MFP 10 and the external device 20. The controller 31 of the MFP 10 has the functions of an acquisition unit 102, a generation unit 104, a transmission unit 106, a voice reception unit 110, a voice recognition unit 112, a setting unit 114, and an execution unit 116. In this embodiment, for convenience, the units that execute each job may be described separately. The job unit 120 of FIG. 11 includes a copy job unit, a print job unit, a scan job unit, a fax job unit, and a mail transmission job unit. The copy job unit is a unit that executes a copy job. The print job unit is a unit that executes a print job. The scan job unit is a unit that executes a scan job. The fax job unit is a unit that executes a fax job. The mail transmission job unit is a unit that executes a mail transmission job. The scan job unit corresponds to, for example, the scan unit 12.

外部装置２０のコントローラー４１は、受信部２０２と、記憶部２０４と、操作受付部２０６と、出力制御部２０８の機能を有する。 The controller 41 of the external device 20 has the functions of the receiving unit 202, the storage unit 204, the operation receiving unit 206, and the output control unit 208.

ユーザーが操作パネル３４に対して、ジョブの設定内容を入力すると（図１のステップ（１）参照）、該ジョブの設定内容は、取得部１０２および設定部１１４に入力される。上述の通り、ジョブの設定内容は、ジョブの種別と、該ジョブの設定項目とを受け付ける。図５〜図８で説明した例では、ジョブの種別は、「メール送信ジョブ」であり、ジョブの設定項目は、「３００×３００ｄｐｉの解像度」、「ファイル形式がコンパクトＰＤＦ形式」、および「user@test.jpという宛先」である。 When the user inputs the setting contents of a job on the operation panel 34 (see step (1) in FIG. 1), the setting contents of the job are input to the acquisition unit 102 and the setting unit 114. As described above, the setting contents of a job include the job type and the setting items of the job. In the example described with reference to FIGS. 5 to 8, the job type is "mail transmission job", and the job setting items are "a resolution of 300 × 300 dpi", "the compact PDF file format", and "the destination user@test.jp".

取得部１０２は、ジョブの設定内容を取得する。生成部１０４は、該取得部１０２が取得した設定内容に係る文字の音声の音声データを生成する。図１の例では、ジョブの設定内容のうちの「ジョブの種別」は、「スキャンジョブ」であり、ジョブの設定内容のうちの「ジョブの設定項目」は、「「３００×３００ｄｐｉ」の解像度」である。したがって、生成部１０４は、「かいぞうどさんびゃくでぃーぴーあいすきゃんじょぶ」という音声を生成する。その後、生成部１０４は、この音声に係る音声データを生成する。 The acquisition unit 102 acquires the setting contents of the job. The generation unit 104 generates voice data of the voice of the wording related to the setting contents acquired by the acquisition unit 102. In the example of FIG. 1, the "job type" in the job setting contents is "scan job", and the "job setting item" in the job setting contents is "a resolution of 300 × 300 dpi". Therefore, the generation unit 104 generates the voice "kaizoudo sanbyaku diipiiai sukyan jobu" (that is, "resolution 300 dpi, scan job"). After that, the generation unit 104 generates voice data related to this voice.

また、図５〜図８の例では、ジョブの設定内容のうちの「ジョブの種別」は、「メール送信ジョブ」である。また、図５〜図８の例では、ジョブの設定内容のうちの「ジョブの設定項目」は、解像度が３００×３００ｄｐｉであり、ファイル形式がコンパクトＰＤＦ形式であり、宛先がuser@test.jpである項目である。 Further, in the examples of FIGS. 5 to 8, the "job type" in the job setting contents is the "mail transmission job". Further, in the examples of FIGS. 5 to 8, the "job setting item" in the job setting contents has a resolution of 300 x 300 dpi, a file format of a compact PDF format, and a destination of user@test.jp. It is an item that is.

したがって、生成部１０４は、「かいぞうど、さんびゃくでぃぴーあい、ふぁいるけいしき、こんぱくとぴーでぃーえふ、あてさき、あるふぁべっと、ゆーえすいーあーる、すうじ、いち、あっとまーく、あるふぁべっと、てぃーいーえすてぃーどっとじぇーぴー、に、いーめーるそうしんする」という音声を生成する。その後、生成部１０４は、この音声に係る音声データを生成する。 Therefore, the generation unit 104 generates the voice "kaizoudo, sanbyaku dipiai, fairu keishiki, konpakuto piidiiefu, atesaki, arufabetto, yuu esu ii aaru, suuji, ichi, attomaaku, arufabetto, tii ii esu tii dotto jeepii, ni, iimeeru soushin suru". After that, the generation unit 104 generates voice data related to this voice.

送信部１０６は、生成された音声データを外部装置２０に送信する。また、ＭＦＰ１０が、ユーザー認証した場合には、図１０のテーブルを参照して、該ユーザー認証により得られるユーザーＩＤに対応する外部装置２０に音声データを送信する（図１のステップ（３）参照）。 The transmission unit 106 transmits the generated voice data to the external device 20. When the MFP 10 has performed user authentication, the transmission unit 106 refers to the table of FIG. 10 and transmits the voice data to the external device 20 corresponding to the user ID obtained by the user authentication (see step (3) in FIG. 1).

また、設定部１１４は、操作パネル３４から入力されたジョブの設定内容を設定する。例えば、ジョブの設定内容に係る情報を所定の記憶領域に記憶させる。また、実行部１１６は、設定された設定内容（つまり、ジョブの設定内容に係る情報）に基づいたジョブを、該ジョブに対応するジョブユニット１２０に実行させる。図１で説明した例では、実行部１１６は、ジョブユニット１２０のうちのスキャンジョブユニットにスキャンジョブを実行させる。また、図５〜図９で説明した例では、実行部１１６は、ジョブユニット１２０のうちのメール送信ジョブユニットにメール送信ジョブを実行させる。 Further, the setting unit 114 sets the setting contents of the job input from the operation panel 34. For example, information related to job settings is stored in a predetermined storage area. Further, the execution unit 116 causes the job unit 120 corresponding to the job to execute a job based on the set setting content (that is, information related to the setting content of the job). In the example described with reference to FIG. 1, the execution unit 116 causes the scan job unit of the job units 120 to execute the scan job. Further, in the example described with reference to FIGS. 5 to 9, the execution unit 116 causes the mail transmission job unit of the job units 120 to execute the mail transmission job.

また、外部装置２０の受信部２０２は、コントローラー３１が送信した音声データを受信する。受信部２０２により受信された音声データは、記憶部２０４に記憶される。また、図９で説明したように、ユーザーが、外部装置２０の操作パネル４４に対して音声出力操作を行う（図１のステップ（４）に対応）。 Further, the receiving unit 202 of the external device 20 receives the audio data transmitted by the controller 31. The voice data received by the receiving unit 202 is stored in the storage unit 204. Further, as described with reference to FIG. 9, the user performs a voice output operation on the operation panel 44 of the external device 20 (corresponding to step (4) of FIG. 1).

操作受付部２０６は、操作パネル４４に対して行われた音声出力操作を受付ける。出力制御部２０８は、操作受付部２０６が受付けた音声出力操作により選択された音声内容の機械音声をスピーカー４７から出力させる（図１のステップ（５）参照）。ＭＦＰ１０のマイク１６に、この音声が入力される。マイク１６は、この音声をデジタル変換することにより、音声データを生成する。音声受付部１１０は、この音声データを受付ける。音声認識部１１２は、この音声データに対して音声認識処理を実行する。音声認識部１１２による音声認識の手法は、如何なる手法であってもよい。例えば、音声認識部１１２は、音響モデルおよび言語モデルなどを備え、音響モデルおよび言語モデルに基づいて、音声認識処理を実行する。音声認識処理による音声認識結果は、設定部１１４に入力される。 The operation reception unit 206 receives the voice output operation performed on the operation panel 44. The output control unit 208 outputs the mechanical voice of the voice content selected by the voice output operation received by the operation reception unit 206 from the speaker 47 (see step (5) in FIG. 1). This sound is input to the microphone 16 of the MFP 10. The microphone 16 generates voice data by digitally converting the voice. The voice reception unit 110 receives this voice data. The voice recognition unit 112 executes voice recognition processing on this voice data. The voice recognition method by the voice recognition unit 112 may be any method. For example, the voice recognition unit 112 includes an acoustic model, a language model, and the like, and executes voice recognition processing based on the acoustic model and the language model. The voice recognition result by the voice recognition process is input to the setting unit 114.

設定部１１４は、音声認識結果に基づく設定を行う。実行部１１６は、設定部１１４により設定された設定内容に基づいたジョブを、該ジョブに対応するジョブユニット１２０に実行させる（図１のステップ（６）参照）。 The setting unit 114 makes settings based on the voice recognition result. The execution unit 116 causes the job unit 120 corresponding to the job to execute a job based on the setting contents set by the setting unit 114 (see step (6) in FIG. 1).

＜画像形成システムの処理フロー＞
図１２は、画像形成システム１０００の処理フローの一例を示す図である。図１２を参照して、ステップＳ２において、コントローラー３１は、取得部１０２がジョブの設定内容を取得したか否かを判断する。コントローラー３１は、取得部１０２がジョブの設定内容を取得したと判断するまで、ステップＳ２の処理を繰返す。コントローラー３１は、取得部１０２がジョブの設定内容を取得したと判断した場合には（ステップＳ２でＹＥＳ、図１のステップ（１）参照）、実行部１１６は、取得した設定内容でのジョブをジョブユニット１２０に実行させる（ステップＳ４、および図１のステップ（２）参照）。 <Processing flow of image formation system>
FIG. 12 is a diagram showing an example of the processing flow of the image forming system 1000. With reference to FIG. 12, in step S2, the controller 31 determines whether or not the acquisition unit 102 has acquired the job setting contents. The controller 31 repeats the process of step S2 until the acquisition unit 102 determines that the job setting contents have been acquired. When the controller 31 determines that the acquisition unit 102 has acquired the job setting contents (YES in step S2, see step (1) in FIG. 1), the execution unit 116 executes the job with the acquired setting contents. Let the job unit 120 execute the job (see step S4 and step (2) of FIG. 1).

次に、ステップＳ６において、生成部１０４は、設定された設定内容の音声データを生成する。次に、ステップＳ８において、送信部１０６は、音声データを送信する（図１のステップ（３）参照）。 Next, in step S6, the generation unit 104 generates audio data of the set contents. Next, in step S8, the transmission unit 106 transmits voice data (see step (3) in FIG. 1).

次に、ステップＳ１０において、外部装置２０の受信部２０２は、音声データを受信する。次に、ステップＳ１２において、記憶部２０４は、音声データを記憶する。また、図１２の「３点リーダー」は、任意の時間が経過したことを意味する。 Next, in step S10, the receiving unit 202 of the external device 20 receives the voice data. Next, in step S12, the storage unit 204 stores the voice data. The ellipsis (three dots) in FIG. 12 means that an arbitrary amount of time has passed.

次に、ステップＳ１４において、コントローラー４１は、操作受付部２０６が音声出力操作を受付けたか否かを判断する。コントローラー４１は、操作受付部２０６が音声出力操作を受付けたと判断するまで、ステップＳ１４の処理を繰返す（ステップＳ１４でＮＯ）。コントローラー４１は、操作受付部２０６が音声出力操作を受付けたと判断した場合には（ステップＳ１４でＹＥＳ、図１のステップ（４）参照）、スピーカー４７は、機械音声を出力する。 Next, in step S14, the controller 41 determines whether or not the operation reception unit 206 has accepted the voice output operation. The controller 41 repeats the process of step S14 until it is determined that the operation reception unit 206 has accepted the voice output operation (NO in step S14). When the controller 41 determines that the operation receiving unit 206 has received the voice output operation (YES in step S14, see step (4) in FIG. 1), the speaker 47 outputs the machine voice.

次に、ステップＳ１８において、マイク１６に機械音声が入力される。マイク１６からの機械音声の音声データを、音声受付部１１０が受付ける。次に、ステップＳ２０において、音声認識部１１２は、音声データに対して音声認識を実行する。設定部１１４は、音声認識結果に基づく設定を行う。次に、ステップＳ２２において、実行部１１６は、設定部１１４により設定された設定内容に基づいたジョブ（つまり、音声認識結果に基づいたジョブ）を、該ジョブに対応するジョブユニット１２０に実行させる。 Next, in step S18, machine voice is input to the microphone 16. The voice reception unit 110 receives the voice data of the machine voice from the microphone 16. Next, in step S20, the voice recognition unit 112 executes voice recognition on the voice data. The setting unit 114 makes settings based on the voice recognition result. Next, in step S22, the execution unit 116 causes the job unit 120 corresponding to the job to execute a job based on the setting contents set by the setting unit 114 (that is, a job based on the voice recognition result).
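The processing flow of FIG. 12 can be traced end to end with the toy model below. It is a minimal sketch, not the patented implementation: plain text stands in for voice data and for the machine voice, `generate_voice_data` and `recognize` are hypothetical stand-ins for the generation unit 104 and the voice recognition unit 112, and the `"key: value; ..."` encoding is an assumption chosen only so the round trip is easy to follow.

```python
from dataclasses import dataclass, field


def generate_voice_data(settings: dict) -> str:
    """Stand-in for step S6: encode the settings as 'key: value; ...' text."""
    return "; ".join(f"{key}: {value}" for key, value in settings.items())


def recognize(voice: str) -> dict:
    """Stand-in for step S20: recover the settings from the spoken text."""
    return dict(part.split(": ", 1) for part in voice.split("; "))


@dataclass
class ExternalDeviceSketch:
    stored: list = field(default_factory=list)

    def receive(self, voice_data: str) -> None:
        self.stored.append(voice_data)      # steps S10 and S12

    def play(self, index: int) -> str:
        return self.stored[index]           # speaker output after step S14


@dataclass
class MfpFlowSketch:
    jobs: list = field(default_factory=list)

    def run_from_panel(self, settings: dict, device: ExternalDeviceSketch) -> None:
        self.jobs.append(dict(settings))                # steps S2 and S4
        device.receive(generate_voice_data(settings))   # steps S6 and S8

    def run_from_voice(self, voice: str) -> None:
        self.jobs.append(recognize(voice))              # steps S18 to S22


if __name__ == "__main__":
    mfp, phone = MfpFlowSketch(), ExternalDeviceSketch()
    mfp.run_from_panel(
        {"job": "mail", "resolution": "300x300dpi",
         "format": "compact PDF", "destination": "user@test.jp"},
        phone,
    )
    mfp.run_from_voice(phone.play(0))
    print(mfp.jobs[0] == mfp.jobs[1])
```

The point of the round trip is that the second job is driven only by what the external device plays back, so the user never has to recall the settings.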

＜小括＞
上述のＭＦＰ１０に対して第２ユーザーが、ＭＦＰ１０の設定画面に手入力で設定項目を設定することにより、該設定に基づいた１回目のジョブをＭＦＰに実行させる。その後、ユーザーは、該設定と同一の設定の２回目のジョブを、音声入力によりＭＦＰに実行させたい場合がある。このような第２ユーザーは、設定画面を視認しなければ、該設定画面に表示される設定内容の文言の音声を発することができない。したがって、第２ユーザーが、設定内容の文言の音声を適切に発することができない場合には、「それ以降のジョブ」をＭＦＰに実行させることができないという問題が生じ得る。特に、第２ユーザーが、設定画面に入力した設定項目等を忘れた場合等に、２回目のジョブを、音声入力によりＭＦＰに実行させることができないという問題がある。 <Summary>
Suppose that a second user manually sets the setting items on the setting screen of the above-mentioned MFP 10 and thereby causes the MFP to execute a first job based on those settings. After that, the user may want to have the MFP execute, by voice input, a second job with the same settings. Such a second user cannot utter the wording of the setting contents displayed on the setting screen without looking at the setting screen. Therefore, if the second user cannot properly utter the wording of the setting contents, there may be a problem that the user cannot have the MFP execute the "subsequent jobs". In particular, when the second user has forgotten the setting items or the like entered on the setting screen, there is a problem that the user cannot have the MFP execute the second job by voice input.

そこで、第２ユーザーに第１ボタン５１２が操作された場合には、ＭＦＰ１０の送信部１０６は、第２ユーザーが１回目のジョブのために入力した設定内容の文言の音声の音声データを、該第２ユーザーの外部装置２０に送信する（図１のステップ（３）、および図１２のステップＳ８参照）。また、第２ユーザーにより第１ボタン５１２が操作された場合に、ＭＦＰ１０は、音声データを外部装置２０に対して送信する。 Therefore, when the first button 512 is operated by the second user, the transmission unit 106 of the MFP 10 uses the voice data of the wording of the setting contents input by the second user for the first job. It is transmitted to the external device 20 of the second user (see step (3) in FIG. 1 and step S8 in FIG. 12). Further, when the first button 512 is operated by the second user, the MFP 10 transmits the voice data to the external device 20.

外部装置２０は、該音声データを記憶して、外部装置２０は、図９に示すような音声画面を表示できる。第２ユーザーは、ＭＦＰ１０に対して不慣れであり、設定項目を表示する画面を視認することなく、設定項目を思い出すことができない。しかしながら、第２ユーザーは、図９の音声画面を視認することにより、設定項目を理解できる。第２ユーザーは、該音声画面での音声内容５０２に基づく音声を出力させることができる。つまり、送信部１０６は、音声データに基づいた音声を外部装置２０に出力させるように音声データを外部装置２０に送信する。ＭＦＰ１０は、該音声に基づくジョブを実行する。 The external device 20 can store the audio data, and the external device 20 can display an audio screen as shown in FIG. The second user is unfamiliar with the MFP 10 and cannot remember the setting items without visually recognizing the screen displaying the setting items. However, the second user can understand the setting items by visually recognizing the audio screen of FIG. The second user can output a voice based on the voice content 502 on the voice screen. That is, the transmission unit 106 transmits the voice data to the external device 20 so as to output the voice based on the voice data to the external device 20. The MFP 10 executes a job based on the voice.

したがって、ＭＦＰ１０に対して不慣れな第２ユーザーであっても、音声入力に基づくジョブ（本実施形態では、２回目のジョブ）をＭＦＰ１０に実行させることができる。 Therefore, even a second user who is unfamiliar with the MFP 10 can have the MFP 10 execute a job based on voice input (in the present embodiment, the second job).

For example, a first user (that is, a user who is accustomed to the MFP 10) can operate the second button 514 to have the MFP 10 execute a job without transmitting voice data. The MFP 10 of the present embodiment can therefore reduce the amount of communication from the MFP to the external device compared with an MFP that always transmits voice data to the external device. In addition, the MFP 10 of the present embodiment decides whether to transmit the voice data to the external device 20 based on the user's operation (that is, based on which of the first button 512 and the second button 514 is operated). This improves convenience for the user.

The MFP 10 may be controlled to operate in either a first mode, in which a job is executed by voice input, or a second mode, in which no job is executed even if voice input is received. While operating in the first mode, the MFP 10 may perform user authentication. Further, while operating in the first mode, the MFP 10 may execute a job by voice input if the user authentication is still within its validity period, and may refrain from executing a job by voice input once that validity period has expired.
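The mode and validity-period check described above can be sketched as follows. This is a minimal illustration only: the names `Mode`, `VoiceGate`, and `auth_expires_at`, and the use of a plain numeric timestamp, are assumptions made for the example and do not appear in the disclosure.

```python
from dataclasses import dataclass
from enum import Enum


class Mode(Enum):
    FIRST = 1   # jobs may be executed by voice input
    SECOND = 2  # voice input never triggers a job


@dataclass
class VoiceGate:
    mode: Mode
    auth_expires_at: float  # hypothetical: time at which user authentication lapses

    def may_execute(self, now: float) -> bool:
        """Allow a voice-input job only in the first mode and within the validity period."""
        if self.mode is not Mode.FIRST:
            return False
        return now <= self.auth_expires_at


gate = VoiceGate(mode=Mode.FIRST, auth_expires_at=100.0)
```

With this gate, a voice input arriving at time 50.0 would be accepted and one arriving at time 150.0 would be rejected, while a gate in the second mode rejects every voice input.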

After the external device 20 receives the voice data in step (3) of FIG. 1, the user may have an MFP different from the MFP that transmitted the voice data execute a job based on the machine voice produced from that voice data. Thus, in the image forming system 1000 of the present embodiment, the user may have the job based on the machine voice executed either by the MFP that transmitted the voice data or by a different MFP.

[Second Embodiment]
The MFP 10 of the first embodiment was described as transmitting the voice data of two or more setting contents to the external device 20 in a single batch. The MFP 10 of the second embodiment generates voice data individually for each of the two or more setting contents. The MFP 10 then transmits the voice data for each of the setting contents individually to the external device 20.

The MFP 10 and the external device 20 of the second embodiment are configured as shown in FIG. 11. FIG. 13 shows the processing flow of the MFP 10 and the external device 20 of the second embodiment. In step S102, the controller 31 determines whether the acquisition unit 102 has acquired one setting content of the job. In the example of FIGS. 5 to 8, the resolution setting of FIG. 5, the file format setting of FIG. 6, the destination setting of FIG. 7, and the job execution setting of FIG. 8 are each one setting content. The controller 31 repeats step S102 until it determines that the acquisition unit 102 has acquired one setting content of the job (NO in step S102). When the controller 31 determines that one setting content has been acquired (YES in step S102), the process proceeds to step S104.

In step S104, the generation unit 104 generates voice data for the one setting content of the job acquired by the acquisition unit 102. Next, in step S106, the controller 31 determines whether the job setting is complete.

Steps S102, S104, and S106 are described using the example of FIGS. 5 to 8. In FIG. 5, when a resolution is selected and the OK button 504 is operated, the controller 31 determines that the acquisition unit 102 has acquired one setting content of the job (YES in step S102). Next, in step S104, the generation unit 104 generates voice data of the voice of the wording indicating the resolution set in FIG. 5. In the example of FIG. 5, it generates voice data of the voice "kaizōdo, sanbyaku dī-pī-ai" ("resolution, 300 dpi"). Then, in step S106, the controller 31 determines whether the job setting is complete. When only the resolution of FIG. 5 has been set, the job setting is not yet complete, so the process returns to step S102.

In this process, the generation unit 104 generates voice data in each pass through step S104, in the order of FIGS. 5, 6, 7, and 8. In the first pass, it generates the voice data of the voice "kaizōdo, sanbyaku dī-pī-ai" ("resolution, 300 dpi") described with reference to FIG. 5. In the second pass, it generates the voice data of the voice "fairu keishiki, konpakuto pī-dī-efu" ("file format, compact PDF") described with reference to FIG. 6. In the third pass, it generates the voice data of the voice of the destination wording described with reference to FIG. 7. In the fourth pass, it generates the voice data of the voice "ī-mēru sōshin suru" ("send e-mail") described with reference to FIG. 8. Thus, in the example of FIGS. 5 to 8, the generation unit 104 generates four pieces of voice data.

In step S106, the controller 31 determines that the job setting is complete when the user performs an operation to execute the job (YES in step S106). In the example of FIGS. 5 to 8, the controller 31 determines that the job setting is complete when the first button 512 of FIG. 8 is operated.

In step S4, the execution unit 116 causes the job unit 120 to execute the job with the acquired setting contents. Next, in step S108, the generation unit 104 combines the plural pieces of voice data generated in step S104 (four in the present embodiment) into one piece of voice data. Next, in step S8, the transmission unit 106 transmits the one piece of voice data combined in step S108 to the external device 20.

In step S10, the external device 20 receives the one piece of voice data. Next, in step S112, the external device 20 stores the plural pieces of voice data contained in the voice data received in step S10 (four in the present embodiment) in mutually different storage areas.
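The flow of steps S104, S108, and S112 can be sketched as follows. This is a sketch under stated assumptions, not the disclosed implementation: speech synthesis is replaced by encoded text, and the length-prefixed packing used to combine and later separate the clips is a hypothetical format chosen only for illustration, since the disclosure does not specify how the combined voice data is structured.

```python
from typing import Dict, List, Tuple

Setting = Tuple[str, str]  # (item, value), e.g. ("resolution", "300 dpi")


def generate_voice_data(setting: Setting) -> bytes:
    """Stand-in for step S104: real speech synthesis is replaced by encoded text."""
    item, value = setting
    return f"{item}: {value}".encode("utf-8")


def combine(clips: List[bytes]) -> bytes:
    """Step S108 (assumed format): prefix each clip with its 4-byte length and concatenate."""
    return b"".join(len(clip).to_bytes(4, "big") + clip for clip in clips)


def split_and_store(payload: bytes) -> Dict[int, bytes]:
    """Step S112: unpack the payload and keep each clip in its own storage slot."""
    storage: Dict[int, bytes] = {}
    offset = 0
    while offset < len(payload):
        size = int.from_bytes(payload[offset:offset + 4], "big")
        offset += 4
        storage[len(storage)] = payload[offset:offset + size]
        offset += size
    return storage


settings = [
    ("resolution", "300 dpi"),
    ("file format", "compact PDF"),
    ("destination", "user1@test.jp"),
    ("job", "send e-mail"),
]
stored = split_and_store(combine([generate_voice_data(s) for s in settings]))
```

Keeping the clips separable on the receiving side is what allows the external device 20 to offer them for individual selection later.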

図１４は、第２実施形態の外部装置２０が表示する音声画面の一例である。コントローラー４１は、図１４の音声画面を、表示装置４４２の表示領域４４２Ａに表示させる。図９と、図１４とを対比すると、図９は、複数の音声内容がまとまって１つの音声内容５０２として表示されている。一方、図１４は、複数の音声内容（本実施形態では、４つの音声内容）それぞれが、表示されている。また、表示される複数の音声内容はそれぞれ、生成部１０４が生成した音声データに対応している。 FIG. 14 is an example of an audio screen displayed by the external device 20 of the second embodiment. The controller 41 displays the audio screen of FIG. 14 in the display area 442A of the display device 442. Comparing FIG. 9 and FIG. 14, FIG. 9 shows a plurality of audio contents as one audio content 502. On the other hand, in FIG. 14, each of a plurality of audio contents (four audio contents in the present embodiment) is displayed. Further, each of the plurality of displayed audio contents corresponds to the audio data generated by the generation unit 104.

In the example of FIG. 14, four audio contents are displayed: audio content 5021, "Resolution: 300 dpi"; audio content 5022, "File format: Compact PDF"; audio content 5023, "Destination: user1@test.jp"; and audio content 5024, which corresponds to the e-mail transmission setting.

本実施形態では、表示領域４４２Ａに表示されている複数の音声内容のうち、ユーザーにより複数選択可能となっている。例えば、複数の音声内容のうち、ユーザーにより選択された音声内容は、選択されていない音声内容とは異なる態様で表示される。例えば、ユーザーにより選択されていない音声内容の枠線は細く表示される一方、選択された音声内容の枠線は太く表示される。 In the present embodiment, a plurality of audio contents displayed in the display area 442A can be selected by the user. For example, among a plurality of audio contents, the audio content selected by the user is displayed in a mode different from the audio content not selected. For example, the border of the audio content not selected by the user is displayed thinly, while the border of the selected audio content is displayed thick.

When the play button 504 is touched while one or more audio contents are selected by the user, the external device 20 outputs the voice of the selected audio contents. For example, when the play button 504 is touched while "Resolution: 300 dpi" and "Destination: user1@test.jp" are selected, the external device 20 outputs the voice "kaizōdo, sanbyaku dī-pī-ai, atesaki, arufabetto, yū-esu-ī-āru, sūji, ichi, attomāku, arufabetto, tī-ī-esu-tī dotto jē-pī" (that is, "resolution, 300 dpi; destination: letters u-s-e-r, digit 1, at sign, letters t-e-s-t, dot, j-p").
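The selection-and-playback behavior can be illustrated with a short sketch. The function `play_selected`, the label strings, and the English stand-in utterances are assumptions for the example; an actual device would play stored voice data rather than join text.

```python
from typing import Dict, List, Set


def play_selected(clips: Dict[str, str], order: List[str], selected: Set[str]) -> str:
    """Return the utterance for the selected audio contents, in display order."""
    return ", ".join(clips[label] for label in order if label in selected)


# Display order of the audio contents on the audio screen (hypothetical labels).
order = ["resolution", "file format", "destination", "execute"]
clips = {
    "resolution": "resolution, 300 dpi",
    "file format": "file format, compact PDF",
    "destination": "destination, user1@test.jp",
    "execute": "send e-mail",
}
utterance = play_selected(clips, order, {"destination", "resolution"})
```

Iterating over the display order rather than over the selection set keeps the spoken output in the same order in which the contents appear on screen, regardless of the order in which the user tapped them.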

本実施形態のＭＦＰ１０の取得部１０２は、複数の設定内容を取得する（図１３のステップＳ１０２参照）。次に、生成部１０４は、複数の設定内容毎の音声データを生成する（図１３のステップＳ１０４参照）。次に、送信部１０６は、複数の設定内容毎の音声データを送信する（図１３のステップＳ８参照）。外部装置２０は、該複数の設定内容毎の音声データをそれぞれ異なる記憶領域に記憶させる。さらに、図１４に示すように、外部装置２０は、該記憶された複数の音声データそれぞれに対応する音声内容（図１４の例では、音声内容５０２１〜音声内容５０２４）を表示する。ユーザーは、複数の音声内容のうち１以上の音声内容を選択可能である。外部装置２０は、ユーザーにより決定された１以上の選択された音声内容の音声データを出力することができる。したがって、ユーザーは、ユーザーが所望する音声データを組み合わせた機械音声を外部装置２０から出力させることができる。よって、ユーザーは、音声入力により所望のジョブの設定をＭＦＰ１０に行わせることができ、結果としてユーザーの利便性を向上させることができる。 The acquisition unit 102 of the MFP 10 of the present embodiment acquires a plurality of setting contents (see step S102 in FIG. 13). Next, the generation unit 104 generates audio data for each of a plurality of setting contents (see step S104 in FIG. 13). Next, the transmission unit 106 transmits voice data for each of the plurality of setting contents (see step S8 in FIG. 13). The external device 20 stores the audio data for each of the plurality of setting contents in different storage areas. Further, as shown in FIG. 14, the external device 20 displays the audio content (in the example of FIG. 14, audio content 5021 to audio content 5024) corresponding to each of the plurality of stored audio data. The user can select one or more audio contents from a plurality of audio contents. The external device 20 can output audio data of one or more selected audio contents determined by the user. Therefore, the user can output the machine voice combining the voice data desired by the user from the external device 20. Therefore, the user can make the MFP 10 set a desired job by voice input, and as a result, the convenience of the user can be improved.

[Third Embodiment]
FIG. 15 shows the processing flow of the MFP 10 and the external device 20 of the third embodiment. Comparing FIG. 13 with FIG. 15: in FIG. 13, as shown in steps S108 and S8, the MFP 10 transmits the voice data of the plural setting contents together. In FIG. 15, by contrast, the MFP 10 of the present embodiment transmits the voice data of one setting content to the external device 20 each time it generates that voice data.

In step S104, the generation unit 104 generates voice data. Next, in step S150, the transmission unit 106 transmits the generated voice data to the external device 20. In step S10, the external device 20 receives the voice data transmitted from the MFP 10. In step S12, the external device 20 stores the voice data received in step S10 in a storage area.

As shown in steps S150, S10, and S12 of FIG. 15, each time the acquisition unit 102 acquires one setting content, the MFP 10 transmits the voice data corresponding to that setting content to the external device 20. The external device 20 stores the voice data in a storage area each time it receives voice data. When storing plural pieces of voice data, the external device 20 stores them in mutually different storage areas. The MFP 10 and the external device 20 of the third embodiment therefore provide the same effects as those described for the second embodiment.
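The per-setting transmission of the third embodiment can be sketched as a loop that sends each clip as soon as it is generated. The class `ExternalDevice`, the function `configure_job`, and the use of encoded text in place of synthesized speech are hypothetical and serve only to illustrate the ordering of steps S104, S150, S10, and S12.

```python
from typing import Callable, Iterable, List


class ExternalDevice:
    """Receiver that keeps every received clip in its own storage entry (steps S10, S12)."""

    def __init__(self) -> None:
        self.storage: List[bytes] = []

    def receive(self, clip: bytes) -> None:
        self.storage.append(clip)


def configure_job(settings: Iterable[str], send: Callable[[bytes], None]) -> int:
    """Generate and immediately transmit one clip per acquired setting content."""
    sent = 0
    for setting in settings:
        clip = setting.encode("utf-8")  # stand-in for step S104 (speech synthesis)
        send(clip)                      # step S150: transmit right away
        sent += 1
    return sent


device = ExternalDevice()
count = configure_job(["resolution: 300 dpi", "file format: compact PDF"], device.receive)
```

Compared with the batch approach, nothing needs to be packed or unpacked, because each transmission already carries exactly one clip.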

[Fourth Embodiment]
FIG. 16 is a diagram showing an example of the functional configuration of the MFP 10 and the external device 20 of the fourth embodiment. FIG. 16 differs from FIG. 11 in that the controller 31 of the MFP 10 has a first promotion unit 122 and in that the voice data received by the voice reception unit 110 is output to the generation unit 104. The first promotion unit 122 prompts the user to input the user's voice into the microphone 16. The "first execution unit" of the present disclosure corresponds to the execution unit 116.

FIGS. 17 and 18 show the processing flow of the MFP 10 and the external device 20 of the fourth embodiment. After the generation unit 104 generates the voice data in step S6, the first promotion unit 122 prompts the user, in step S202, to input the user's voice into the microphone 16. The prompting in step S202 is also referred to as the "first promotion process".

ここで、第１促進処理は、例えば、何らかの質問を音声でユーザーに対して問い合わせる処理である。例えば、第１促進処理は、「あなたの名前を言ってください」という質問を、スピーカー３７からの音声で出力する処理である。 Here, the first promotion process is, for example, a process of asking the user for some question by voice. For example, the first promotion process is a process of outputting the question "Please say your name" by voice from the speaker 37.

ステップＳ２０２の第１促進処理の後のステップＳ２０４において、コントローラー３１は、ユーザーの音声がマイク１６に入力されたか否かを判断する。ステップＳ２０４において、コントローラー３１は、ユーザーの音声がマイク１６に入力されたと判断するまで、コントローラー３１は、ステップＳ２０４の処理を繰り返す（ステップＳ２０４でＮＯ）。ステップＳ２０４において、コントローラー３１は、ユーザーの音声がマイク１６に入力されたと判断した場合には（ステップＳ２０４でＹＥＳ）、処理は、ステップＳ２０６に進む。 In step S204 after the first promotion process of step S202, the controller 31 determines whether or not the user's voice has been input to the microphone 16. In step S204, the controller 31 repeats the process of step S204 (NO in step S204) until the controller 31 determines that the user's voice has been input to the microphone 16. If the controller 31 determines in step S204 that the user's voice has been input to the microphone 16 (YES in step S204), the process proceeds to step S206.

ステップＳ２０６において、生成部１０４は、マイク１６に入力されたユーザーの音声の音声データを、生成部１０４が生成した音声データに対して付与することにより新たな音声データを生成する。ユーザーの音声の音声データを、「ユーザー音声データ」ともいう。「マイク１６に入力されたユーザーの音声」は、ステップＳ２０４においてＹＥＳと判断されたことに基づくユーザーの音声である。このように、新たな音声データは、ステップＳ６で生成された音声データと、ユーザーの音声の音声データとを含む。 In step S206, the generation unit 104 generates new voice data by adding the voice data of the user's voice input to the microphone 16 to the voice data generated by the generation unit 104. The voice data of the user's voice is also referred to as "user voice data". The “user's voice input to the microphone 16” is the user's voice based on the determination of YES in step S204. As described above, the new voice data includes the voice data generated in step S6 and the voice data of the user's voice.

次に、ステップＳ８において、送信部１０６は、外部装置２０に対して新たな音声データを送信する。 Next, in step S8, the transmission unit 106 transmits new voice data to the external device 20.

Next, in step S14 shown in FIG. 18, when the user performs a voice output operation on the external device 20 (YES in step S14), the external device 20 outputs, in step S16, a voice based on the new voice data. In step S16 of the present embodiment, the speaker 47 of the external device 20 outputs, as the voice based on the new voice data, the voice of the wording indicating the job setting contents (that is, the machine voice) and the user's voice described for step S204. The speaker 47 may output the user's voice after the machine voice, or the machine voice after the user's voice.
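The structure of the new voice data and the two permitted playback orders can be sketched as follows. The names `NewVoiceData`, `append_user_voice`, and `playback_order` are assumptions for illustration; the disclosure states only that the new voice data contains both the machine voice data and the user voice data.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class NewVoiceData:
    machine_voice: bytes  # voice data generated in step S6
    user_voice: bytes     # user voice data captured after the first promotion process

    def playback_order(self, user_first: bool = False) -> List[bytes]:
        """Either order is allowed: machine voice first (default) or user voice first."""
        if user_first:
            return [self.user_voice, self.machine_voice]
        return [self.machine_voice, self.user_voice]


def append_user_voice(machine_voice: bytes, user_voice: bytes) -> NewVoiceData:
    """Step S206: attach the user voice data to the generated voice data."""
    return NewVoiceData(machine_voice, user_voice)


data = append_user_voice(b"machine", b"user")
```

Keeping the two parts as separate fields, rather than one concatenated stream, makes it straightforward for the receiving MFP to treat the machine voice and the user's voice differently later.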

ステップＳ１８において、ＭＦＰ１０のコントローラー３１は新たな音声データに基づく音声が外部装置２０からマイク１６に入力されたと判断する。新たな音声データに基づく音声は、機械音声、およびユーザーの音声である。ステップＳ１８の後、ステップＳ２１０において、第１促進部１２２は、ユーザーの音声のマイク１６への入力をユーザーに促進する。ステップＳ２１０での促進処理を、「第２促進処理」ともいう。 In step S18, the controller 31 of the MFP 10 determines that the voice based on the new voice data has been input from the external device 20 to the microphone 16. The voice based on the new voice data is the machine voice and the user's voice. After step S18, in step S210, the first promotion unit 122 prompts the user to input the user's voice into the microphone 16. The accelerating process in step S210 is also referred to as "second accelerating process".

ここで、ＭＦＰ１０は、ステップＳ２１０の処理のために、複数の問いかけと、該複数の問いかけにそれぞれに対応する回答とを有している。複数の問いかけは、例えば、「日本と言ってください」という問いかけ、「アメリカと言ってください」という問いかけ、および「ドイツと言ってください」という問いかけ等を含む。また、「日本と言ってください」という問いかけに対応する回答は、「日本」である。「アメリカと言ってください」という問いかけに対応する回答は、「アメリカ」である。「ドイツと言ってください」という問いかけに対応する回答は、「ドイツ」である。 Here, the MFP 10 has a plurality of questions and answers corresponding to the plurality of questions for the processing of step S210. Multiple questions include, for example, the question "Please say Japan", the question "Please say America", and the question "Please say Germany". The answer to the question "Please say Japan" is "Japan". The answer to the question "Please say America" is "America". The answer to the question "Please say Germany" is "Germany".

ＭＦＰ１０は、これらの複数の問いかけから、１の問いかけを選択し、該選択した１の問いかけに基づく音声を出力する第２促進処理を実行する。ＭＦＰ１０が、「日本と言ってください」という問いかけを選択した場合には、ＭＦＰ１０は、「日本と言ってください」という音声を出力する第２促進処理を実行する。 The MFP 10 selects one question from these plurality of questions, and executes a second promotion process that outputs a voice based on the selected one question. When the MFP10 selects the question "Please say Japan", the MFP10 executes the second promotion process for outputting the voice "Please say Japan".

In step S212, which follows the second promotion process of step S210, the controller 31 determines whether the user's voice has been input to the microphone 16. The controller 31 repeats step S212 until it determines that the user's voice has been input to the microphone 16 (NO in step S212). When the controller 31 determines that the user's voice has been input to the microphone 16 (YES in step S212), the process proceeds to step S214.

ステップＳ２１４において、コントローラー３１は、ステップＳ２１２で入力されたと判断されたユーザーの音声の回答が正しいか否かを判断する。例えば、ステップＳ２１０において、ＭＦＰ１０が、「日本と言ってください」という音声を出力したとする。この場合には、回答は、「日本」である。ステップＳ２１４において、コントローラー３１は、ステップＳ２１２で入力されたと判断されたユーザーの音声が「日本」である場合には、「回答が正しい」と判断する。また、ステップＳ２１４において、コントローラー３１は、ステップＳ２１２で入力されたと判断されたユーザーの音声が「日本」以外の音声である場合には、「回答が間違い」と判断する。 In step S214, the controller 31 determines whether or not the voice response of the user determined to have been input in step S212 is correct. For example, in step S210, it is assumed that the MFP 10 outputs a voice saying "Please say Japan". In this case, the answer is "Japan". In step S214, the controller 31 determines that the answer is correct when the user's voice determined to have been input in step S212 is "Japan". Further, in step S214, when the voice of the user determined to have been input in step S212 is a voice other than "Japan", the controller 31 determines that the answer is incorrect.

ステップＳ２１４において、コントローラー３１が、回答が間違いであると判断した場合（ステップＳ２１４でＮＯ）、処理は終了する。また、ステップＳ２１４において、コントローラー３１が、回答が正しいであると判断した場合（ステップＳ２１４でＹＥＳ）、処理は、ステップＳ２１６に進む。 If the controller 31 determines in step S214 that the answer is incorrect (NO in step S214), the process ends. If the controller 31 determines in step S214 that the answer is correct (YES in step S214), the process proceeds to step S216.
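The question selection of step S210 and the answer check of step S214 can be sketched as follows. The table `CHALLENGES` mirrors the three example questions in the text; choosing the question at random and comparing answers case-insensitively are assumptions of this sketch, since the disclosure says only that one question is selected and that the answer is judged correct or incorrect.

```python
import random
from typing import Tuple

# Question-to-answer table held by the MFP for the second promotion process.
CHALLENGES = {
    "Please say Japan": "Japan",
    "Please say America": "America",
    "Please say Germany": "Germany",
}


def pick_challenge(rng: random.Random) -> Tuple[str, str]:
    """Select one question (assumed: at random) and return it with its expected answer."""
    prompt = rng.choice(sorted(CHALLENGES))
    return prompt, CHALLENGES[prompt]


def answer_is_correct(expected: str, recognized: str) -> bool:
    """Step S214: compare the recognized answer with the expected one."""
    return recognized.strip().casefold() == expected.casefold()


prompt, expected = pick_challenge(random.Random(0))
```

Because the question is not known in advance, a recording of an earlier answer is unlikely to match the question that is actually asked.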

ステップＳ２１６において、コントローラー３１は、「第２促進処理の実行後にマイク１６に入力されたユーザーの音声」の特徴情報と、「新たな音声データに基づく音声に含まれるユーザーの音声」の特徴情報とが一致しているか否かを判断する。 In step S216, the controller 31 includes the feature information of "the user's voice input to the microphone 16 after the execution of the second promotion process" and the feature information of "the user's voice included in the voice based on the new voice data". Determine if they match.

ここで、「第２促進処理の実行後にマイク１６に入力されたユーザーの音声」は、ステップＳ２１２で入力されたと判断されたユーザーの音声である。「新たな音声データに基づく音声に含まれるユーザーの音声」は、ステップＳ１６で出力された音声に含まれるユーザーの音声である。換言すれば、「新たな音声データに基づく音声に含まれるユーザーの音声」は、ステップＳ２０４で入力されたと判断されたユーザーの音声である。 Here, the "user's voice input to the microphone 16 after the execution of the second promotion process" is the user's voice determined to have been input in step S212. The “user's voice included in the voice based on the new voice data” is the user's voice included in the voice output in step S16. In other words, the "user's voice included in the voice based on the new voice data" is the voice of the user determined to have been input in step S204.

つまり、ステップＳ２１６の処理は、外部装置２０からの音声（つまり、「新たな音声データに基づく音声に含まれるユーザーの音声」）の特徴情報と、回答の音声（つまり、「第２促進処理の実行後にマイク１６に入力されたユーザーの音声」）の特徴情報とを比較する処理である。 That is, in the process of step S216, the feature information of the voice from the external device 20 (that is, the “user's voice included in the voice based on the new voice data”) and the response voice (that is, “the second promotion process”). This is a process of comparing with the feature information of "user's voice input to the microphone 16 after execution").

また、特徴情報は、音声の特徴を示す情報であれば、如何なる情報であってもよい。特徴情報は、音声の声紋、音声の周波数、音声の振幅等のうち少なくとも１つを含む。本実施形態の特徴情報は、「声紋」であるとする。 Further, the feature information may be any information as long as it is information indicating the characteristics of the voice. The feature information includes at least one of voiceprints, voice frequencies, voice amplitudes, and the like. The feature information of this embodiment is assumed to be "voiceprint".

The controller 31 may determine that the two pieces of feature information match only when the feature information of the voice from the external device 20 and the feature information of the answer voice are identical. Alternatively, the controller 31 may determine that they match when the two are identical or substantially identical. For the comparison, the controller 31 calculates, for example, a score indicating the degree of agreement between the feature information of the voice from the external device 20 and the feature information of the answer voice. When this score is equal to or higher than a predetermined threshold, the controller 31 determines that the two pieces of feature information match. When the score is below the threshold, the controller 31 determines that they do not match.
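The score-and-threshold comparison can be sketched as follows. The disclosure does not say how the score is computed; this example assumes the feature information is a numeric vector and uses cosine similarity as the score, with a hypothetical default threshold of 0.9.

```python
import math
from typing import Sequence


def match_score(a: Sequence[float], b: Sequence[float]) -> float:
    """Assumed score: cosine similarity between two feature vectors."""
    if len(a) != len(b):
        raise ValueError("feature vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def features_match(a: Sequence[float], b: Sequence[float], threshold: float = 0.9) -> bool:
    """Match when the score is equal to or higher than the predetermined threshold."""
    return match_score(a, b) >= threshold
```

A threshold below the maximum score corresponds to the "identical or substantially identical" variant, whereas requiring the maximum score corresponds to the "identical only" variant.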

コントローラー３１は、外部装置２０からの音声の特徴情報と、回答の音声の特徴情報とが同一であると判断した場合には（ステップＳ２１６でＹＥＳ）、処理はステップＳ２０に進む。また、このステップＳ２０では、コントローラー３１は、ステップＳ１８で入力された機械音声に対して音声認識処理を実行することにより、ジョブの設定内容を特定する。次に、ステップＳ２２において、実行部１１６（つまり、第１実行部）は、音声認識結果に基づいたジョブを実行する。 When the controller 31 determines that the voice feature information from the external device 20 and the voice feature information of the answer are the same (YES in step S216), the process proceeds to step S20. Further, in this step S20, the controller 31 specifies the setting contents of the job by executing the voice recognition process for the machine voice input in step S18. Next, in step S22, the execution unit 116 (that is, the first execution unit) executes a job based on the voice recognition result.

一方、外部装置２０からの音声の特徴情報と、回答の音声の特徴情報とが同一ではないと判断した場合には（ステップＳ２１６でＮＯ）、ステップＳ２０およびステップＳ２２の処理を実行することなく、処理を終了する。 On the other hand, when it is determined that the voice feature information from the external device 20 and the voice feature information of the answer are not the same (NO in step S216), the processes of steps S20 and S22 are not executed. End the process.
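The decision sequence of steps S214, S216, S20, and S22 can be summarized in a short sketch. The function `run_second_process` and its callable parameters are hypothetical; voiceprint comparison and speech recognition are passed in as callables because the disclosure does not prescribe how they are implemented.

```python
from typing import Callable, Dict, Optional


def run_second_process(
    expected_answer: str,
    spoken_answer: str,
    voiceprints_match: Callable[[], bool],
    recognize_settings: Callable[[], Dict[str, str]],
) -> Optional[Dict[str, str]]:
    """Return the job settings to execute, or None when the process ends without a job."""
    if spoken_answer != expected_answer:  # NO in step S214
        return None
    if not voiceprints_match():           # NO in step S216
        return None
    return recognize_settings()           # step S20; the job runs in step S22


settings = run_second_process(
    "Japan", "Japan", lambda: True, lambda: {"resolution": "300 dpi"}
)
```

Speech recognition of the machine voice is deliberately performed last, so that no setting contents are extracted unless both checks have passed.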

The generation unit 104 of the MFP 10 of the present embodiment generates new voice data by adding the voice data of the user's voice, input to the microphone 16 after the first promotion process (step S202 in FIG. 17), to the voice data generated in step S6 (step S206 in FIG. 17). The transmission unit 106 transmits the new voice data to the external device 20 so that the external device 20 outputs a voice based on the new voice data (step S8 in FIG. 17). The user can therefore have the external device 20 output both the voice of the wording indicating the setting contents and the user's own voice.

When the voice of the wording indicating the setting contents and the user's voice are input to the microphone 16 of the MFP 10 (step S18 in FIG. 18), the first promotion unit 122 executes the second promotion process, which prompts the user to input the user's voice into the microphone 16 (step S210). The controller 31 then determines whether the feature information of the user's voice input to the microphone 16 after the second promotion process matches the feature information of the user's voice included in the voice based on the new voice data (step S216). If the determination in step S216 is YES, the execution unit 116 executes a job based on the setting contents indicated by the voice included in the voice based on the new voice data (step S22). If the determination in step S216 is NO, the execution unit 116 does not execute that job.

For example, after a legitimate user has had the MFP 10 execute the first job (the job of step S4 in FIG. 17), a malicious user other than the legitimate user may try to execute a second job using the voice of the external device 20. The legitimate user is a user who has been properly authenticated by the MFP 10. The malicious user is, for example, a user who has not been properly authenticated by the MFP 10. Even if such a malicious user carries out the processing from step S14 of FIG. 18 onward, the determination in step S216 is NO. The malicious user therefore cannot have the MFP 10 execute a second job using the voice of the external device 20. In this way, the MFP 10 can appropriately prevent a malicious user from having a second job executed on the MFP 10.

Hereinafter, steps S2, S4, S6, S202, S204, S206, and S8 of FIG. 17 are collectively referred to as the "first process". Steps S18, S210, S212, S214, S216, S20, and S22 of FIG. 18 are collectively referred to as the "second process".

第３実施形態では、ＭＦＰ１０が、第１の処理と第２の処理との双方を実行するとして説明した。しかしながら、ユーザーは、第１の処理を第１のＭＦＰに実行させることにより、設定内容を示す文言の音声と、ユーザーの音声との音声データを外部装置２０に記憶させるようにしてもよい。また、ユーザーは、外部装置２０からの音声に基づくジョブを、第１のＭＦＰとは異なる第２のＭＦＰに実行させるようにしてもよい。このような構成によれば、ユーザーは、第１の処理を実行させるＭＦＰと、第２の処理を実行させるＭＦＰとを選択することができることから、ユーザーの自由度を向上させることができる。 In the third embodiment, it has been described that the MFP 10 executes both the first process and the second process. However, the user may have the external device 20 store the voice of the wording indicating the setting content and the voice data of the user's voice by causing the first MFP to execute the first process. In addition, the user may cause a second MFP different from the first MFP to execute a job based on the voice from the external device 20. According to such a configuration, the user can select the MFP to execute the first process and the MFP to execute the second process, so that the degree of freedom of the user can be improved.

第２のＭＦＰのコントローラーは、図１６に示した設定部１１４と、音声認識部１１２と、音声受付部１１０と、実行部１１６と、第１促進部１２２との機能を有する。また、第２のＭＦＰのコントローラーは、ステップＳ２１６において、「第１促進部１２２の促進前にマイク１６に入力されたユーザーの音声」の特徴情報と、「第１促進部１２２の促進後にマイク１６に入力されたユーザーの音声」の特徴量とが同一であるか否かを判断する。ここで、「第１促進部１２２の促進前にマイク１６に入力されたユーザーの音声」は、ステップＳ１８で入力したユーザーの音声である。また、「第１促進部１２２の促進後にマイク１６に入力されたユーザーの音声」は、ステップＳ２１２でＹＥＳと判断されたユーザーの音声である。 The controller of the second MFP has the functions of the setting unit 114, the voice recognition unit 112, the voice reception unit 110, the execution unit 116, and the first promotion unit 122 shown in FIG. 16. In step S216, the controller of the second MFP determines whether the feature information of "the user's voice input to the microphone 16 before the prompt by the first promotion unit 122" is the same as the feature amount of "the user's voice input to the microphone 16 after the prompt by the first promotion unit 122". Here, "the user's voice input to the microphone 16 before the prompt by the first promotion unit 122" is the user's voice input in step S18, and "the user's voice input to the microphone 16 after the prompt by the first promotion unit 122" is the user's voice for which the determination in step S212 was YES.

［第５実施形態］
上述の実施形態では、ＭＦＰ１０の生成部１０４は、取得部１０２が取得した設定内容を示す文言のそのままの音声の音声データを生成する。第５実施形態の生成部１０４は、取得部１０２が取得した設定内容を示す文言の文字数よりも少ない文字数の内容に基づく音声データを生成する。以下では、取得部１０２が取得した設定内容を示す文言の文字数よりも少ない文字数の内容を、「短縮内容」という場合がある。 [Fifth Embodiment]
In the above-described embodiment, the generation unit 104 of the MFP 10 generates voice data of the voice as it is in the wording indicating the setting content acquired by the acquisition unit 102. The generation unit 104 of the fifth embodiment generates voice data based on the content of the number of characters smaller than the number of characters of the wording indicating the setting content acquired by the acquisition unit 102. In the following, the content having a number of characters smaller than the number of characters in the wording indicating the setting content acquired by the acquisition unit 102 may be referred to as “shortened content”.

第５実施形態のＭＦＰ１０および外部装置２０は、図１１に示した通りである。また、第５実施形態のＭＦＰ１０および外部装置２０の処理フローを図１２を用いて説明する。ステップＳ６において、生成部１０４は、変換テーブルに基づいて、取得部１０２が取得した設定内容を、該設定内容を示す文言の文字数よりも少ない文字数の内容に変換する。図１９は、変換テーブルの一例である。図１９の変換テーブルは、ＭＦＰ１０の所定の記憶領域に記憶されている。 The MFP 10 and the external device 20 of the fifth embodiment are as shown in FIG. 11. The processing flow of the MFP 10 and the external device 20 of the fifth embodiment is described with reference to FIG. 12. In step S6, the generation unit 104 uses a conversion table to convert the setting content acquired by the acquisition unit 102 into content with fewer characters than the wording indicating that setting content. FIG. 19 is an example of the conversion table. The conversion table of FIG. 19 is stored in a predetermined storage area of the MFP 10.

図１９の左欄には、変換前の設定内容が規定されており、図１９の右欄には、変換後の設定内容（つまり、文字数が少ない設定内容）が規定されている。図１９の例では、「解像度２００×２００ｄｐｉ」が、「２ｄ」に対応づけられている。「解像度３００×３００ｄｐｉ」が、「３ｄ」に対応づけられている。「解像度４００×４００ｄｐｉ」が、「４ｄ」に対応づけられている。「解像度６００×６００ｄｐｉ」が、「６ｄ」に対応づけられている。「ファイル形式コンパクトＰＤＦ」が、「ｃｄｐｆ」に対応づけられている。「ファイル形式コンパクトＸＰＳ」が、「ｃｘｐｓ」に対応づけられている。「Ｅｍａｉｌ送信」が、「ｍｓ」に対応づけられている。 The left column of FIG. 19 defines the setting contents before conversion, and the right column defines the setting contents after conversion (that is, the setting contents with fewer characters). In the example of FIG. 19, "resolution 200 x 200 dpi" is associated with "2d", "resolution 300 x 300 dpi" with "3d", "resolution 400 x 400 dpi" with "4d", and "resolution 600 x 600 dpi" with "6d". "File format compact PDF" is associated with "cdpf", "file format compact XPS" with "cxps", and "Email transmission" with "ms".

例えば、ユーザーにより、「解像度４００×４００ｄｐｉ」、「ファイル形式コンパクトＰＤＦ」、「user@test.jp」、「Ｅｍａｉｌ送信」が設定された場合を説明する。この場合には、取得部１０２は、ジョブの設定内容として、「解像度４００×４００ｄｐｉ」、「ファイル形式コンパクトＰＤＦ」、「user@test.jp」、および「Ｅｍａｉｌ送信」を取得する。このうち、変換対象の設定内容は、「解像度４００×４００ｄｐｉ」、「ファイル形式コンパクトＰＤＦ」、および「Ｅｍａｉｌ送信」である。生成部１０４は、図１９の変換テーブルを参照して、「解像度４００×４００ｄｐｉ」を「４ｄ」に変換し、「ファイル形式コンパクトＰＤＦ」を「ｃｄｐｆ」に変換し、「Ｅｍａｉｌ送信」を「ｍｓ」に変換する。 For example, consider a case where the user sets "resolution 400 x 400 dpi", "file format compact PDF", "user@test.jp", and "Email transmission". In this case, the acquisition unit 102 acquires "resolution 400 x 400 dpi", "file format compact PDF", "user@test.jp", and "Email transmission" as the job setting contents. Of these, the setting contents subject to conversion are "resolution 400 x 400 dpi", "file format compact PDF", and "Email transmission". Referring to the conversion table of FIG. 19, the generation unit 104 converts "resolution 400 x 400 dpi" to "4d", "file format compact PDF" to "cdpf", and "Email transmission" to "ms".

ステップＳ６において、生成部１０４は変換後の内容に基づいた音声データを生成する。ここでは、生成部１０４は、「にでぃ、しーでぃーぴーえふ、あるふぁべっと、ゆーえすいーあーる、すうじ、いち、あっとまーく、あるふぁべっと、てぃーいーえすてぃーどっとじぇーぴー、えむえす」という機械音声の音声データを生成する。 In step S6, the generation unit 104 generates voice data based on the converted content. Here, the generation unit 104 generates voice data of a machine voice that reads "ni-di, shii-dii-pii-efu, arufabetto, yuu-esu-ii-aaru, suuji, ichi, attomaaku, arufabetto, tii-ii-esu-tii dotto jee-pii, emu-esu".

ステップＳ８において、少ない文字数の内容（つまり、短縮内容）に基づく音声データを外部装置２０に送信する。外部装置２０は、この音声データを記憶する。ステップＳ１４において、ユーザーが、外部装置２０に対して音声出力操作を行った場合に、「にでぃ、しーでぃーぴーえふ、あるふぁべっと、ゆーえすいーあーる、すうじ、いち、あっとまーく、あるふぁべっと、てぃーいーえすてぃーどっとじぇーぴー、えむえす」という機械音声を出力する。マイク１６にこの機械音声が入力された場合には、ステップＳ２０において、音声認識部１１２は、この機械音声に対して音声認識処理を実行する。コントローラー３１は、音声認識処理の結果により示される短縮内容を、図１９の変換テーブルを参照して、変換前の設定内容に戻す。これにより、コントローラー３１は、変換前の設定内容を特定することができる。設定部１１４は、変換前の設定内容に基づく設定を行う。ステップＳ２２において、実行部１１６は、該設定に基づくジョブを実行する。 In step S8, voice data based on the content with fewer characters (that is, the shortened content) is transmitted to the external device 20. The external device 20 stores this voice data. In step S14, when the user performs a voice output operation on the external device 20, the external device 20 outputs the machine voice "ni-di, shii-dii-pii-efu, arufabetto, yuu-esu-ii-aaru, suuji, ichi, attomaaku, arufabetto, tii-ii-esu-tii dotto jee-pii, emu-esu". When this machine voice is input to the microphone 16, the voice recognition unit 112 executes voice recognition processing on it in step S20. The controller 31 refers to the conversion table of FIG. 19 and restores the shortened content indicated by the recognition result to the setting content before conversion. The controller 31 can thereby identify the setting content before conversion. The setting unit 114 makes settings based on the setting content before conversion. In step S22, the execution unit 116 executes a job based on those settings.
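The shorten-then-restore round trip described above can be sketched as a pair of dictionary lookups. The table entries follow FIG. 19 as described in the text; the function names `shorten` and `restore`, and the choice to pass unlisted values (such as the e-mail address) through unchanged, are assumptions for illustration.

```python
# Conversion table modeled on FIG. 19: full setting wording -> shortened content.
CONVERSION_TABLE = {
    "resolution 200x200dpi": "2d",
    "resolution 300x300dpi": "3d",
    "resolution 400x400dpi": "4d",
    "resolution 600x600dpi": "6d",
    "file format compact PDF": "cdpf",
    "file format compact XPS": "cxps",
    "Email transmission": "ms",
}

# Reverse lookup used on the receiving side after voice recognition.
REVERSE_TABLE = {short: full for full, short in CONVERSION_TABLE.items()}

def shorten(settings):
    """Replace each convertible setting with its shortened content (step S6)."""
    return [CONVERSION_TABLE.get(item, item) for item in settings]

def restore(shortened):
    """Map recognized shortened content back to the original settings."""
    return [REVERSE_TABLE.get(item, item) for item in shortened]

settings = [
    "resolution 400x400dpi",
    "file format compact PDF",
    "user@test.jp",
    "Email transmission",
]
short = shorten(settings)
print(short)           # ['4d', 'cdpf', 'user@test.jp', 'ms']
print(restore(short))  # the original four settings
```

One caveat of this simple pass-through design: a free-form value that happens to equal a short code (for example, a destination literally named "ms") would be restored incorrectly, so a real implementation would need to keep convertible and non-convertible items distinguishable.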

本実施形態のＭＦＰによれば、生成部１０４は、短縮内容の音声データを生成し、送信部１０６は、この短縮内容の音声データを外部装置２０に送信する。したがって、本実施形態のＭＦＰは、「取得部が取得した設定内容を示す文言のそのままの音声の音声データを外部装置に送信するＭＦＰ」と比較して、音声データの容量を削減することができる。 According to the MFP of the present embodiment, the generation unit 104 generates voice data of the shortened content, and the transmission unit 106 transmits that voice data to the external device 20. The MFP of the present embodiment can therefore reduce the size of the voice data compared with an "MFP that transmits, to the external device, voice data of the unmodified wording indicating the setting content acquired by the acquisition unit".

［第６実施形態］
第６実施形態のＭＦＰは、外部装置２０から出力された機械音声に基づくジョブの設定内容に、ユーザーが変更可能である可変内容が含まれている場合に、ユーザーがこの可変内容を変更することができる。 [Sixth Embodiment]
In the MFP of the sixth embodiment, when the job settings based on the machine voice output from the external device 20 include variable content that the user is allowed to change, the user can change that variable content.

第６実施形態のＭＦＰ１０および外部装置２０は、図１１に示した通りである。図１１の実行部１１６は、本開示の「第２実行部」に対応する。また、図２０は、第６実施形態のＭＦＰ１０および外部装置２０の処理フローを示す図である。ステップＳ２０において、音声認識部１１２が入力された機械音声に対して音声認識を実行する。次に、ステップＳ３０２において、コントローラー３１は、音声認識結果に示される設定内容に可変内容が含まれているか否かを判断する。ここで、可変内容は、ユーザーにより自由に変更できれば、ユーザーの利便性が向上する内容である。可変内容は、各ジョブ毎に予め定められている。例えば、メール送信ジョブの可変内容は、「メールの送信先」である。メールの送信先は、送信宛先と、送信フォルダとの少なくとも一方を含む。また、コピージョブおよびプリンタジョブの可変内容は、「印刷枚数」である。 The MFP 10 and the external device 20 of the sixth embodiment are as shown in FIG. 11. The execution unit 116 of FIG. 11 corresponds to the "second execution unit" of the present disclosure. FIG. 20 is a diagram showing the processing flow of the MFP 10 and the external device 20 of the sixth embodiment. In step S20, the voice recognition unit 112 executes voice recognition on the input machine voice. Next, in step S302, the controller 31 determines whether the setting content indicated by the voice recognition result includes variable content. Variable content is content for which user convenience improves if the user can change it freely. The variable content is predetermined for each job. For example, the variable content of a mail transmission job is the "mail destination", which includes at least one of a destination address and a destination folder. The variable content of a copy job and a printer job is the "number of prints".

コントローラー３１は、音声認識結果に示される設定内容に可変内容が含まれていると判断すると（ステップＳ３０２でＹＥＳ）、処理はステップＳ３０４に進む。ステップＳ３０４において、コントローラー３１は、設定内容の変更を入力させるための画面を、操作パネル３４に表示する。取得部１０２は、この画面から、設定内容の変更が入力された場合には、取得部１０２は、設定内容の変更を取得する。 When the controller 31 determines that the setting content shown in the voice recognition result includes variable content (YES in step S302), the process proceeds to step S304. In step S304, the controller 31 displays a screen for inputting a change in the setting contents on the operation panel 34. When the change of the setting content is input from this screen, the acquisition unit 102 acquires the change of the setting content.

図２１は、設定内容の変更を入力させるための画面の一例である。図２１の画面には、表示領域５１６と、第３ボタン５５２と、第４ボタン５５４とが表示される。表示領域５１６は、メール送信ジョブの宛先が表示される。第３ボタン５５２の領域内には「宛先入力完了」という文字が表示されている。第４ボタン５５４の領域内には「音声データの宛先を使用」という文字が表示されている。 FIG. 21 is an example of a screen for inputting a change in the setting contents. On the screen of FIG. 21, the display area 516, the third button 552, and the fourth button 554 are displayed. In the display area 516, the destination of the mail transmission job is displayed. The characters "destination input completed" are displayed in the area of the third button 552. In the area of the fourth button 554, the characters "use voice data destination" are displayed.

ユーザーが、外部装置２０から出力された機械音声の宛先を設定する場合には、第４ボタン５５４を操作する。また、ユーザーが、外部装置２０から出力された機械音声の宛先を変更する場合には、第３ボタン５５２を操作する。 When the user sets the destination of the machine voice output from the external device 20, the user operates the fourth button 554. Further, when the user changes the destination of the machine voice output from the external device 20, the user operates the third button 552.

図２１の画面が表示されたときにおいて、第３ボタン５５２および第４ボタン５５４のいずれも操作されていない状態では、ユーザーは、表示領域５１６に新たな宛先を入力可能となる。ユーザーは、表示領域５１６に宛先を入力して、第３ボタン５５２を操作すると、設定部１１４は、外部装置２０から出力された機械音声の宛先を削除して表示領域５１６に入力された宛先を新たに設定する。また、ユーザーが、第４ボタン５５４を操作すると、設定部１１４は、外部装置２０から出力された機械音声の宛先を設定する。 When the screen of FIG. 21 is displayed, if neither the third button 552 nor the fourth button 554 is operated, the user can input a new destination in the display area 516. When the user inputs a destination to the display area 516 and operates the third button 552, the setting unit 114 deletes the destination of the machine voice output from the external device 20 and sets the destination input to the display area 516. Set anew. Further, when the user operates the fourth button 554, the setting unit 114 sets the destination of the machine voice output from the external device 20.

その後、ステップＳ２２において、実行部１１６は、設定内容が変更されなかった場合には、音声認識結果に基づいたジョブを実行する。また、ステップＳ２２において、実行部１１６は、設定内容が変更された場合には、「音声認識結果からの設定内容」および「取得部１０２が取得した変更が反映された設定内容」に基づくジョブを実行する。 After that, in step S22, if the setting content has not been changed, the execution unit 116 executes a job based on the voice recognition result. If the setting content has been changed, the execution unit 116 executes, in step S22, a job based on both "the setting content from the voice recognition result" and "the setting content reflecting the change acquired by the acquisition unit 102".
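The decision at steps S302 through S22 can be sketched as follows. The per-job variable content mirrors the examples in the text (mail destination, number of prints); the dictionary keys, job-type names, and function names are hypothetical and chosen only for this illustration.

```python
# Variable content predetermined for each job type (names are assumptions).
VARIABLE_CONTENT = {
    "mail_send": {"destination"},
    "copy": {"copies"},
    "print": {"copies"},
}

def variable_keys(job_type, recognized):
    """Step S302 analogue: which recognized settings may the user change?"""
    allowed = VARIABLE_CONTENT.get(job_type, set())
    return [key for key in recognized if key in allowed]

def final_settings(job_type, recognized, changes=None):
    """Step S22 analogue: recognized settings plus any permitted changes.

    Changes to settings that are not variable content for this job type
    are ignored, so fixed settings always come from the recognized voice.
    """
    allowed = VARIABLE_CONTENT.get(job_type, set())
    merged = dict(recognized)
    for key, value in (changes or {}).items():
        if key in allowed:
            merged[key] = value
    return merged

recognized = {
    "resolution": "400x400dpi",
    "file_format": "compact PDF",
    "destination": "user@test.jp",
}
print(variable_keys("mail_send", recognized))  # ['destination']
print(final_settings("mail_send", recognized, {"destination": "XXX@test.jp"}))
```

Passing no changes corresponds to the "use voice data destination" button: the job then runs with the recognized settings as they are.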

本実施形態のＭＦＰ１０は、外部装置２０から出力された音声の設定内容のうち可変内容については、ユーザーは変更することができる。したがって、ユーザーの利便性を向上させることができる。 In the MFP 10 of the present embodiment, the user can change the variable content of the audio setting content output from the external device 20. Therefore, the convenience of the user can be improved.

以下では、図２０のステップＳ２、ステップＳ４、ステップＳ６、およびステップＳ８の処理をまとめて「第３の処理」という。また、図２０のステップＳ１８、ステップＳ２０、ステップＳ３０２、ステップＳ３０４、およびステップＳ２２の処理をまとめて「第４の処理」という。 Hereinafter, the processes of step S2, step S4, step S6, and step S8 of FIG. 20 are collectively referred to as a "third process". Further, the processes of step S18, step S20, step S302, step S304, and step S22 in FIG. 20 are collectively referred to as a "fourth process".

第６実施形態では、ＭＦＰ１０が、第３の処理と第４の処理との双方を実行するとして説明した。しかしながら、ユーザーは、第３の処理を第１のＭＦＰに実行させることにより、設定内容を示す文言の音声と、ユーザーの音声との音声データを外部装置２０に記憶させるようにしてもよい。また、ユーザーは、外部装置２０からの音声に基づくジョブを、第１のＭＦＰとは異なる第２のＭＦＰに実行させるようにしてもよい。このような構成によれば、ユーザーは、第３の処理を実行させるＭＦＰと、第４の処理を実行させるＭＦＰとを選択することができることから、ユーザーの自由度を向上させることができる。 In the sixth embodiment, the MFP 10 has been described as executing both the third process and the fourth process. However, the user may cause a first MFP to execute the third process so that the voice data of the wording indicating the setting content and of the user's voice is stored in the external device 20. The user may then cause a second MFP, different from the first MFP, to execute a job based on the voice from the external device 20. With this configuration, the user can choose which MFP executes the third process and which MFP executes the fourth process, which increases the user's freedom.

第２のＭＦＰのコントローラーは、図１１に示したマイク１６と、取得部１０２と、設定部１１４と、音声認識部１１２と、音声受付部１１０と、実行部１１６との機能を有する。なお、取得部１０２は、設定内容の変更を取得する。また、第２のＭＦＰのコントローラーは、図２０のステップＳ２０の処理を終了したときに、ステップＳ３０２、ステップＳ３０４，およびステップＳ２２の処理を実行する。 The controller of the second MFP has the functions of the microphone 16 shown in FIG. 11, the acquisition unit 102, the setting unit 114, the voice recognition unit 112, the voice reception unit 110, and the execution unit 116. The acquisition unit 102 acquires the change of the setting content. Further, the controller of the second MFP executes the processes of steps S302, S304, and S22 when the process of step S20 of FIG. 20 is completed.

［第７実施形態］
第６実施形態のＭＦＰは、ユーザーがこの可変内容を変更することができるとして説明した。第７実施形態のＭＦＰは、この可変内容の変更を促進することができる。図２２は、第７実施形態のＭＦＰ１０と外部装置２０との機能構成例を示す図である。図１１と、図２２とを比較すると、図２２に示すＭＦＰ１０のコントローラー３１は、第２促進部１３０を有する点で、図２２と、図１１とは異なる。第２促進部１３０は、設定内容の変更の取得部１０２への入力をユーザーに促進する。 [Seventh Embodiment]
The MFP of the sixth embodiment has been described as allowing the user to change the variable content. The MFP of the seventh embodiment can additionally prompt the user to change the variable content. FIG. 22 is a diagram showing a functional configuration example of the MFP 10 and the external device 20 according to the seventh embodiment. FIG. 22 differs from FIG. 11 in that the controller 31 of the MFP 10 shown in FIG. 22 has a second promotion unit 130. The second promotion unit 130 prompts the user to input a change of the setting content to the acquisition unit 102.

図２３は、第７実施形態のＭＦＰ１０および外部装置２０の処理フローを示す図である。図２３と、図２０とを比較すると、ステップＳ３０４が、ステップＳ３１０に代替されている点で、両図面は異なる。 FIG. 23 is a diagram showing the processing flow of the MFP 10 and the external device 20 of the seventh embodiment. FIG. 23 differs from FIG. 20 in that step S304 is replaced by step S310.

ステップＳ３０２でＹＥＳと判断されると、ステップＳ３１０において、第２促進部１３０は、設定内容の変更の入力を促進する。例えば、第２促進部１３０は、設定内容の変更の入力を、音声で促進する。例えば、第２促進部１３０は、ステップＳ２０での音声認識結果により、メール送信ジョブの音声が含まれていると判断された場合には、「宛先を変更しますか」といった音声を出力する。 If YES is determined in step S302, in step S310, the second promotion unit 130 promotes the input of the change of the setting content. For example, the second promotion unit 130 promotes the input of the change of the setting content by voice. For example, the second promotion unit 130 outputs a voice such as "Do you want to change the destination?" When it is determined that the voice of the mail transmission job is included based on the voice recognition result in step S20.

この音声が出力された後に、ユーザーは、「宛先を変更します。宛先は「XXX@test.jp」です」といった音声をマイク１６に入力させる。この場合には、設定部１１４は、該入力された音声に基づく設定を行う。また、第２促進部１３０は、音声による促進を実行するとともに、図２１の画面を表示するようにしてもよい。この場合には、取得部１０２は、図２１の画面からの設定内容の変更を取得する。 After this voice is output, the user inputs a voice such as "Change the destination. The destination is 'XXX@test.jp'." to the microphone 16. In this case, the setting unit 114 makes settings based on the input voice. The second promotion unit 130 may also display the screen of FIG. 21 in addition to prompting by voice. In this case, the acquisition unit 102 acquires the change of the setting content from the screen of FIG. 21.

本実施形態のＭＦＰ１０は、外部装置２０から出力された音声の設定内容のうちの可変内容の変更を、ユーザーに促進することができる。したがって、例えば、ユーザーがＭＦＰ１０の扱いに慣れていない場合であっても、可変内容の変更をユーザーに認識させることができる。 The MFP 10 of the present embodiment can prompt the user to change the variable content of the audio setting content output from the external device 20. Therefore, for example, even if the user is not accustomed to handling the MFP 10, the user can be made aware of the change in the variable content.

［第８実施形態］
第１実施形態〜第７実施形態では、外部装置２０は、ＭＦＰ１０からの音声データを記憶し、該記憶した音声データに基づく音声を出力するとして説明した。第８実施形態の外部装置２０Ａは、例えば、設定内容の音声を出力するアプリケーションをダウンロード可能とする。以下では、このアプリケーションを、「ＭＦＰアプリケーション」という。このＭＦＰアプリケーションは、所定のサーバー装置（特に図示せず）が提供するアプリケーションである。外部装置２０Ａは、該ＭＦＰアプリケーションを起動させている状態で、ユーザーの操作に基づいて機械音声を出力する。 [8th Embodiment]
In the first to seventh embodiments, the external device 20 has been described as storing the voice data from the MFP 10 and outputting the voice based on the stored voice data. The external device 20A of the eighth embodiment makes it possible to download, for example, an application that outputs audio of the setting contents. Hereinafter, this application is referred to as an "MFP application". This MFP application is an application provided by a predetermined server device (not particularly shown). The external device 20A outputs the machine voice based on the user's operation while the MFP application is running.

図２４は、外部装置２０ＡがＭＦＰアプリケーションを起動させた場合に外部装置２０Ａの操作パネル４４２の表示領域４４２Ａに表示される画面である。図２４は、ジョブの一覧が表示される画面である。図２４の例では、ジョブの一覧として、メール送信ジョブ、コピージョブなどが表示される。ユーザーは、ジョブの一覧から、機械音声を出力させたいジョブ、つまり、ＭＦＰ１０に実行させたいジョブを選択できる。ユーザーによりジョブが選択された状態で、決定ボタン６１１を操作されると、外部装置２０Ａは、該選択されたジョブに対応する設定画面を表示する。この設定画面は、選択されたジョブに対応する１以上の設定項目が表示される画面である。ユーザーにより該設定項目が選択されて、該設定項目が決定されると、外部装置２０Ａは、該設定項目に対応する音声を出力する。 FIG. 24 is a screen displayed in the display area 442A of the operation panel 442 of the external device 20A when the external device 20A activates the MFP application. FIG. 24 is a screen on which a list of jobs is displayed. In the example of FIG. 24, a mail sending job, a copy job, and the like are displayed as a list of jobs. From the list of jobs, the user can select a job for which machine voice is to be output, that is, a job for which the MFP 10 is to be executed. When the enter button 611 is operated while the job is selected by the user, the external device 20A displays the setting screen corresponding to the selected job. This setting screen is a screen on which one or more setting items corresponding to the selected job are displayed. When the setting item is selected by the user and the setting item is determined, the external device 20A outputs the sound corresponding to the setting item.

図２５は、メール送信ジョブに対応する音声画面の一例である。図２５の例では、設定項目の一覧６１０と、実行ボタン６１２と、保存ボタン６１４とが表示される。図２５の設定項目の一覧６１０は、「解像度２００×２００ｄｐｉ」、「解像度３００×３００ｄｐｉ」、「解像度４００×４００ｄｐｉ」、「解像度６００×６００ｄｐｉ」、「ファイル形式：コンパクトＰＤＦ」、「ファイル形式：ＴＩＦＦ」、「ファイル形式：ＸＰＳ」、「ファイル形式：ＰＰＴＸ」、「ファイル形式：コンパクトＰＤＦ」、「ファイル形式：ＪＰＥＧ」、および「ファイル形式：コンパクトＸＰＳ」からなる。 FIG. 25 is an example of a voice screen corresponding to a mail transmission job. In the example of FIG. 25, a list 610 of setting items, an execute button 612, and a save button 614 are displayed. The list 610 of setting items in FIG. 25 consists of "resolution 200 x 200 dpi", "resolution 300 x 300 dpi", "resolution 400 x 400 dpi", "resolution 600 x 600 dpi", "file format: compact PDF", "file format: TIFF", "file format: XPS", "file format: PPTX", "file format: compact PDF", "file format: JPEG", and "file format: compact XPS".

図２５の例では、設定項目が表示されている状態で、ユーザーにより指定（例えば、タッチ）されることにより、表示されている設定項目が選択される。図２５の例では、「解像度：３００×３００ｄｐｉ」と、「ファイル形式：コンパクトＰＤＦ」と、がユーザーにより選択されている状態である。なお、属性が同一の設定項目については、２以上の設定項目が選択不可能となっている。本実施形態では、属性とは、「解像度」、「ファイル形式」等である。したがって、例えば、属性が「解像度」である設定項目については、１つの設定項目が選択できるようになっており、２以上の設定項目が選択できないようになっている。例えば、「解像度：３００×３００ｄｐｉ」が選択されている状態において、「解像度：４００×４００ｄｐｉ」が選択された場合には、「解像度：３００×３００ｄｐｉ」の選択は解除される。 In the example of FIG. 25, the displayed setting item is selected by being specified (for example, touched) by the user while the setting item is displayed. In the example of FIG. 25, "resolution: 300 x 300 dpi" and "file format: compact PDF" are selected by the user. For setting items with the same attributes, two or more setting items cannot be selected. In the present embodiment, the attributes are "resolution", "file format", and the like. Therefore, for example, for a setting item whose attribute is "resolution", one setting item can be selected, and two or more setting items cannot be selected. For example, if "resolution: 400 x 400 dpi" is selected while "resolution: 300 x 300 dpi" is selected, the selection of "resolution: 300 x 300 dpi" is canceled.
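The one-selection-per-attribute rule can be sketched by keying the current selection on the attribute, so that choosing a new value for an attribute automatically cancels the previous one. The function name and the dictionary representation are assumptions for illustration, not the disclosed implementation.

```python
def select(selection, attribute, value):
    """Return a new selection in which `attribute` holds only `value`.

    Because the selection is a mapping from attribute to a single value,
    two setting items with the same attribute can never be selected at once.
    """
    updated = dict(selection)
    updated[attribute] = value
    return updated

selection = {}
selection = select(selection, "resolution", "300x300dpi")
selection = select(selection, "file format", "compact PDF")
# Selecting another resolution cancels the earlier "300x300dpi" choice.
selection = select(selection, "resolution", "400x400dpi")
print(selection)  # {'resolution': '400x400dpi', 'file format': 'compact PDF'}
```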

ユーザーにより設定項目が選択されている状態で、実行ボタン６１２が操作されると、外部装置２０Ａは、該選択されている選択項目を示す文言の音声（例えば、機械音声）を出力する。「解像度３００×３００ｄｐｉ」と、「ファイル形式：コンパクトＰＤＦ」と、がユーザーにより選択されている状態で、実行ボタン６１２が操作された場合には、外部装置２０Ａは、要求信号をサーバー装置に対して送信する。要求信号は、選択されている設定項目を示す文言の音声の音声データをサーバー装置に対して要求するための信号である。サーバー装置は、要求信号を受信すると、該要求信号で要求されている音声データを要求元の外部装置２０Ａに送信する。外部装置２０Ａは、該音声データを受信すると、該音声データに基づく機械音声を出力する。図２５の例では、外部装置２０Ａは、「かいぞうど、さんびゃくでぃぴーあい、ふぁいるけいしき、こんぱくとぴーでぃーえふ」という機械音声を出力する。 When the execution button 612 is operated while the setting item is selected by the user, the external device 20A outputs a voice (for example, machine voice) of the wording indicating the selected selection item. When the execute button 612 is operated while "resolution 300 x 300 dpi" and "file format: compact PDF" are selected by the user, the external device 20A sends a request signal to the server device. And send. The request signal is a signal for requesting the server device for voice data of the voice of the wording indicating the selected setting item. When the server device receives the request signal, it transmits the voice data requested by the request signal to the request source external device 20A. When the external device 20A receives the voice data, it outputs a machine voice based on the voice data. In the example of FIG. 25, the external device 20A outputs a mechanical sound such as "kaizoudo, sanbyakudipeeai, file keishiki, konpakutopideiefu".

また、ユーザーにより設定項目が選択されている状態で、保存ボタン６１４が操作されると、外部装置２０Ａは、要求信号をサーバー装置に送信する。サーバー装置は、要求信号を受信すると、該要求信号で要求されている音声データを要求元の外部装置２０Ａに送信する。外部装置２０Ａは、該音声データを受信すると、外部装置２０Ａの所定の記憶領域に該音声データを記憶する。これにより、ユーザーは、ＭＦＰアプリケーションを起動させなくても、音声を出力するアプリケーションを該記憶されている音声データの音声を出力させることができる。 When the save button 614 is operated while setting items are selected by the user, the external device 20A transmits a request signal to the server device. On receiving the request signal, the server device transmits the requested voice data to the requesting external device 20A. On receiving the voice data, the external device 20A stores it in a predetermined storage area of the external device 20A. The user can then have an application that outputs audio play the stored voice data without starting the MFP application.

このように、本実施形態の外部装置２０Ａは、ＭＦＰアプリケーションをダウンロードした場合には、音声データを記憶せずに、保存ボタン６１４が操作された場合に、選択された設定項目の音声データを記憶する。したがって、本実施形態の外部装置２０Ａは、「ＭＦＰアプリケーションをダウンロードした場合は、全ての設定項目の音声データを記憶する外部装置」と比較して、記憶容量の削減を図ることができる。 As described above, the external device 20A of the present embodiment does not store voice data when the MFP application is downloaded; it stores the voice data of the selected setting items only when the save button 614 is operated. The external device 20A of the present embodiment can therefore reduce the required storage capacity compared with an "external device that stores the voice data of all setting items when the MFP application is downloaded".
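The difference between the execute button and the save button can be sketched as follows. The class name, method names, and the stand-in `fake_server` function are hypothetical; a real external device would send the request signal to the server device over a network rather than call a local function.

```python
class ExternalDevice:
    """Sketch of an external device that fetches voice data on demand."""

    def __init__(self, fetch_voice_data):
        # fetch_voice_data stands in for "send a request signal to the
        # server device and receive the requested voice data".
        self._fetch = fetch_voice_data
        self.saved = {}  # voice data stored only via the save button

    def on_execute(self, selected_items):
        """Execute button: fetch voice data for playback without storing it."""
        return self._fetch(tuple(selected_items))

    def on_save(self, selected_items):
        """Save button: fetch voice data and keep it in local storage."""
        key = tuple(selected_items)
        self.saved[key] = self._fetch(key)

def fake_server(items):
    """Stand-in for the server device; returns placeholder bytes, not audio."""
    return ("voice:" + ", ".join(items)).encode("utf-8")

device = ExternalDevice(fake_server)
items = ["resolution: 300x300dpi", "file format: compact PDF"]

audio = device.on_execute(items)
print(len(device.saved))  # 0: nothing is stored just by playing the voice

device.on_save(items)
print(len(device.saved))  # 1: only the selected combination is stored
```

Storing only what the user explicitly saves is what keeps the storage footprint small compared with preloading voice data for every setting item.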

ところで、機能が異なる複数のＭＦＰが存在する場合がある。図２６は、３台のＭＦＰ１０Ｘ、１０Ｙ、１０Ｚの機能を示す図である。図２６の例では、ＭＦＰ１０Ｘは、カラー印刷およびモノクロ印刷を実行可能である。また、ＭＦＰ１０Ｙは、カラー印刷およびモノクロ印刷を実行可能である。また、ＭＦＰ１０Ｚは、カラー印刷を実行できずモノクロ印刷を実行可能である。図２６の情報は、ＭＦＰアプリケーションに含まれる情報である。外部装置２０Ａは、ＭＦＰアプリケーションをダウンロードしたときに、図２６の情報を、外部装置２０Ａの所定領域に記憶する。 There may be a plurality of MFPs with different functions. FIG. 26 is a diagram showing the functions of three MFPs 10X, 10Y, and 10Z. In the example of FIG. 26, the MFP 10X can execute color printing and monochrome printing. The MFP 10Y can also execute color printing and monochrome printing. The MFP 10Z cannot execute color printing but can execute monochrome printing. The information in FIG. 26 is included in the MFP application. When the MFP application is downloaded, the external device 20A stores the information of FIG. 26 in a predetermined area of the external device 20A.

図２７は、コピー送信ジョブに対応する音声画面の一例である。図２７の例では、設定項目の一覧６５０と、ＭＦＰ１０Ｘ実行ボタン６５２と、ＭＦＰ１０Ｙ実行ボタン６５４と、ＭＦＰ１０Ｚ実行ボタン６５６と、保存ボタン６１４とが表示される。図２７の設定項目の一覧６５０は、「部数：３」、「部数：４」、「部数：５」、「部数：６」、「部数：７」、「部数：８」、「部数：９」、「部数：１０」、「部数：設定する」、「カラーコピー」、「モノクロコピー」からなる。部数は、コピージョブによるコピー枚数である。 FIG. 27 is an example of a voice screen corresponding to a copy transmission job. In the example of FIG. 27, a list 650 of setting items, an MFP 10X execution button 652, an MFP 10Y execution button 654, an MFP 10Z execution button 656, and a save button 614 are displayed. The list 650 of setting items in FIG. 27 consists of "number of copies: 3", "number of copies: 4", "number of copies: 5", "number of copies: 6", "number of copies: 7", "number of copies: 8", "number of copies: 9", "number of copies: 10", "number of copies: set", "color copy", and "monochrome copy". The number of copies is the number of sheets copied by the copy job.

図２７の例では、「部数：５」、および「カラーコピー」が選択されている。ここで、例えば、ＭＦＰ１０Ｘ実行ボタン６５２またはＭＦＰ１０Ｙ実行ボタン６５４が操作された場合には、外部装置２０Ａは、選択されている設定内容を決定するとともに、該決定された設定内容を示す文言の音声を出力する。したがって、外部装置２０Ａは、「ぶすうごまい、からーこぴー」という機械音声を出力する。 In the example of FIG. 27, "number of copies: 5" and "color copy" are selected. Here, for example, when the MFP10X execution button 652 or the MFP10Y execution button 654 is operated, the external device 20A determines the selected setting content and emits a voice of a word indicating the determined setting content. Output. Therefore, the external device 20A outputs the machine voice "Busuugomai, Karakopee".

次に、「部数：５」、および「カラーコピー」が選択された状態でＭＦＰ１０Ｚ実行ボタン６５６が操作された場合を説明する。図２６でも説明したように、ＭＦＰ１０Ｚは、カラーコピーを実行することができない。したがって、外部装置２０Ａが、「からーこぴー」という機械音声を出力したとしても、ＭＦＰ１０Ｚは、カラーコピーを実行することができない。よって、ユーザーは、設定内容をカラーコピーからモノクロコピーに変更しなければならず、ユーザーの負担が増大することになる。 Next, a case where the MFP10Z execution button 656 is operated with "number of copies: 5" and "color copy" selected will be described. As also described in FIG. 26, the MFP10Z cannot perform color copying. Therefore, even if the external device 20A outputs the machine voice "Karakopee", the MFP10Z cannot execute the color copy. Therefore, the user has to change the setting content from the color copy to the monochrome copy, which increases the burden on the user.

そこで、本実施形態では、外部装置２０Ａは、ユーザーにより選択された設定内容から、ＭＦＰが実行可能なジョブに応じた設定内容に変更する。図２６および図２７の例では、外部装置２０Ａは、ユーザーにより選択された設定内容（ここでは、カラーコピー）から、ＭＦＰ１０Ｚが実行可能なジョブに応じた設定内容（ここでは、モノクロコピー）に変更する。例えば、外部装置２０Ａは、「ＭＦＰ１０Ｚは、カラーコピーを実行不可能であることから、モノクロコピーに変更してもよろしいですか」という画像と、ＹＥＳボタンと、ＮＯボタンとをポップアップ表示する。ユーザーによりＮＯボタンが操作されると、コピー送信ジョブに対応する音声画面の初期状態に戻す。 Therefore, in the present embodiment, the external device 20A changes the setting content selected by the user to setting content that matches a job the MFP can execute. In the examples of FIGS. 26 and 27, the external device 20A changes the setting content selected by the user (here, color copy) to setting content that matches a job the MFP 10Z can execute (here, monochrome copy). For example, the external device 20A pops up an image reading "The MFP 10Z cannot perform color copying. May the setting be changed to monochrome copy?" together with a YES button and a NO button. When the user operates the NO button, the voice screen corresponding to the copy transmission job returns to its initial state.

一方、ユーザーによりＹＥＳボタンが操作されると、外部装置２０Ａは、カラーコピーが選択されている状態からモノクロコピーが選択されている状態に変更する。例えば、カラーコピーの枠線が太線となっている状態から、モノクロコピーの枠線が太線となっている状態に切換える。その後、外部装置２０Ａは、「ぶすうごまい、ものくろこぴー」という機械音声を出力する。 On the other hand, when the YES button is operated by the user, the external device 20A changes from the state in which the color copy is selected to the state in which the monochrome copy is selected. For example, the state in which the border of the color copy is a thick line is switched to the state in which the border of the monochrome copy is a thick line. After that, the external device 20A outputs a mechanical voice saying "Busuugomai, Monokurokopee".
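The capability check and the YES/NO confirmation can be sketched as below. The capability table follows the description of FIG. 26 for the MFPs 10X, 10Y, and 10Z; the identifiers, the `confirm` callback standing in for the pop-up, and the use of `None` for "return to the initial screen" are assumptions for illustration.

```python
# Functions of each MFP, modeled on FIG. 26 (identifiers are assumptions).
CAPABILITIES = {
    "MFP10X": {"color", "monochrome"},
    "MFP10Y": {"color", "monochrome"},
    "MFP10Z": {"monochrome"},
}

def resolve_copy_mode(target, requested, confirm):
    """Return the copy mode to announce, or None if the user declines.

    confirm(target, requested, fallback) stands in for the pop-up with the
    YES and NO buttons and must return True (YES) or False (NO).
    """
    supported = CAPABILITIES[target]
    if requested in supported:
        return requested
    fallback = "monochrome" if "monochrome" in supported else None
    if fallback is not None and confirm(target, requested, fallback):
        return fallback
    return None  # NO pressed: caller returns the screen to its initial state

always_yes = lambda target, requested, fallback: True
always_no = lambda target, requested, fallback: False

print(resolve_copy_mode("MFP10X", "color", always_yes))  # color
print(resolve_copy_mode("MFP10Z", "color", always_yes))  # monochrome
print(resolve_copy_mode("MFP10Z", "color", always_no))   # None
```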

図２８は、画像形成システム１０００Ａの構成例を示す図である。図２８の例では、サーバー装置７０と、ＭＦＰ１０と、ＭＦＰアプリケーションをダウンロードした外部装置２０Ａ（コンピューター）とを有する。 FIG. 28 is a diagram showing a configuration example of the image forming system 1000A. In the example of FIG. 28, the server device 70, the MFP 10, and the external device 20A (computer) to which the MFP application is downloaded are provided.

外部装置２０Ａのコントローラー４１Ａは、記憶部２０４と、操作受付部２０６と、出力制御部２０８と、表示制御部２１０と、要求部２１２とを有する。 The controller 41A of the external device 20A has a storage unit 204, an operation reception unit 206, an output control unit 208, a display control unit 210, and a request unit 212.

記憶部２０４は、ＭＦＰアプリケーションのプログラムを記憶する。操作受付部２０６は、ユーザーによる操作パネル４４の操作を受け付ける。本実施形態では、操作受付部２０６は、ＭＦＰ１０に実行させるジョブの設定内容の選択を受付ける。「設定内容の選択」は、例えば、図２５で説明した設定項目の一覧６１０からのユーザーの選択である。また、「設定内容の選択」は、例えば、図２７で説明した設定項目の一覧６５０からのユーザーの選択である。 The storage unit 204 stores the program of the MFP application. The operation reception unit 206 receives an operation of the operation panel 44 by the user. In the present embodiment, the operation reception unit 206 accepts the selection of the setting contents of the job to be executed by the MFP 10. The "selection of setting contents" is, for example, a user's selection from the list 610 of the setting items described with reference to FIG. 25. Further, the "selection of setting contents" is, for example, a user's selection from the list of setting items 650 described with reference to FIG. 27.

また、操作受付部２０６は、選択された設定内容の決定を受付ける。「選択内容の決定」は、例えば、図２５で説明した実行ボタン６１２への操作である。また、「選択内容の決定」は、例えば、図２７で説明したＭＦＰ１０Ｘ実行ボタン６５２と、ＭＦＰ１０Ｙ実行ボタン６５４と、ＭＦＰ１０Ｚ実行ボタン６５６とのうちのいずれかのボタンへの操作である。 Further, the operation reception unit 206 accepts the determination of the selected setting content. “Determining the selected content” is, for example, an operation on the execution button 612 described with reference to FIG. 25. Further, the "determination of the selected content" is, for example, an operation on any one of the MFP10X execution button 652, the MFP10Y execution button 654, and the MFP10Z execution button 656 described with reference to FIG. 27.

Further, the display control unit 210 displays the screens described with reference to FIGS. 25 and 27 in the display area 442A of the display device 442 of the operation panel 44. The request unit 212 requests, from the server device 70, voice data based on the setting contents determined by the user. The output control unit 208 causes the speaker 47 to output machine voice based on the voice data received from the server device 70 in response to the request from the request unit 212. The MFP 10 executes a job based on that machine voice.

Further, when the save button 614 is operated, the controller 41A stores, in the storage unit 204, the voice data received from the server device 70 in response to the request from the request unit 212.

FIG. 29 shows the processing flow of the external device 20A of the present embodiment. The external device 20A starts the process of FIG. 29 when the user performs an operation on the external device 20A to start the MFP application.

ステップＳ４０２において、表示制御部２１０は、操作パネルに図２４の一覧画面を表示する。また、表示制御部２１０は、選択されたジョブに対応する設定画面（例えば、図２５または図２７の画面）を表示する。 In step S402, the display control unit 210 displays the list screen of FIG. 24 on the operation panel. Further, the display control unit 210 displays a setting screen (for example, the screen of FIG. 25 or FIG. 27) corresponding to the selected job.

In step S404, the controller 41A determines whether or not the operation reception unit 206 has accepted a selection of the setting contents. The controller 41A repeats the process of step S404 until it determines that the operation reception unit 206 has accepted the selection of the setting contents (NO in step S404). When the controller 41A determines that the operation reception unit 206 has accepted the selection of the setting contents (YES in step S404), the process proceeds to step S406.

In step S406, the controller 41A determines whether or not the operation reception unit 206 has accepted an operation of the save button 614. When the controller 41A determines in step S406 that the operation reception unit 206 has accepted the operation of the save button 614 (YES in step S406), the process proceeds to step S416. In step S416, the request unit 212 requests, from the server device 70, voice data based on the determined setting contents. Upon receiving the requested voice data in step S416, the request unit 212 stores the voice data in the storage unit 204.

When the controller 41A determines in step S406 that the operation reception unit 206 has not accepted the operation of the save button 614 (NO in step S406), the process proceeds to step S408.

In step S408, the controller 41A determines whether or not the operation reception unit 206 has accepted an operation of an execution button. Here, the execution buttons are the execution button 612 described with reference to FIG. 25, and the MFP10X execution button 652, the MFP10Y execution button 654, and the MFP10Z execution button 656 described with reference to FIG. 27. When the controller 41A determines in step S408 that the operation reception unit 206 has accepted the operation of an execution button (YES in step S408), the process proceeds to step S410.

In step S410, the controller 41A determines whether or not the "setting contents determined to have been selected in step S404" are "setting contents corresponding to a job that can be executed by the MFP specified by the execution button". For example, when the MFP10X execution button 652 is operated on the screen of FIG. 27 while color copy is selected, color copy is a setting content corresponding to a job that can be executed by the MFP 10X specified by the MFP10X execution button 652. In this case, YES is determined in step S410, and the process proceeds to step S412.

Further, when the MFP10Z execution button 656 is operated on the screen of FIG. 27 while color copy is selected, color copy is not a setting content corresponding to a job that can be executed by the MFP 10Z specified by the MFP10Z execution button 656. In this case, NO is determined in step S410, and the process proceeds to step S414.

In step S412, the request unit 212 requests, from the server device 70, voice data based on the selected setting contents. Upon receiving the requested voice data in step S412, the output control unit 208 causes the speaker 47 to output voice based on the voice data.

In step S414, the controller 41A changes the setting contents. This change is from the "setting contents determined to have been selected in step S404" to "setting contents corresponding to a job that can be executed by the MFP specified by the execution button determined to have been operated in step S408". In the subsequent step S412, the request unit 212 requests, from the server device 70, voice data based on the selected setting contents and the changed setting contents. Upon receiving the requested voice data in step S412, the output control unit 208 causes the speaker 47 to output voice based on the voice data.
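The branch structure of steps S406 to S416 can be illustrated with a minimal Python sketch. It is not the implementation described in this disclosure: the function name `handle_button`, the `CAPABILITIES` and `FALLBACK` tables, and the callback parameters are all hypothetical. The description states only that the MFP 10X can execute color copy and the MFP 10Z cannot; the remaining table entries and the substitution rule are assumptions. The sketch covers only the per-MFP execution buttons of FIG. 27 and, for simplicity, requests voice data for the changed setting alone after step S414.

```python
from typing import Callable, Dict, Optional, Set

# Hypothetical capability table. The description only states that MFP 10X can
# execute color copy and MFP 10Z cannot; everything else here is assumed.
CAPABILITIES: Dict[str, Set[str]] = {
    "MFP10X": {"color copy", "monochrome copy"},
    "MFP10Z": {"monochrome copy"},
}

# Hypothetical substitution rule applied in step S414.
FALLBACK: Dict[str, str] = {"color copy": "monochrome copy"}


def handle_button(
    selected: str,
    button: str,
    request_voice: Callable[[str], bytes],
    play: Callable[[bytes], None],
    store: Callable[[str, bytes], None],
    target_mfp: Optional[str] = None,
) -> Optional[str]:
    """Run steps S406 to S416 once for a setting already selected in S404.

    Returns the setting whose voice data was requested, or None when the
    button is neither "save" nor "execute".
    """
    if button == "save":  # S406: YES
        store(selected, request_voice(selected))  # S416: request and store
        return selected
    if button == "execute" and target_mfp is not None:  # S408: YES
        setting = selected
        if setting not in CAPABILITIES[target_mfp]:  # S410: NO
            setting = FALLBACK[setting]  # S414: change the setting
        play(request_voice(setting))  # S412: request and output voice
        return setting
    return None
```

In this sketch, `request_voice` stands in for the request to the server device 70, `play` for output from the speaker 47, and `store` for writing to the storage unit 204, so each can be replaced with a stub when trying the flow out.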

As described above, a second user who views the setting screen can utter the wording of the setting contents displayed on it, but cannot utter that wording without viewing the setting screen. Therefore, the external device 20A of the present embodiment, by downloading the MFP application, accepts the selection and determination of the job setting contents (that is, the job setting items and the job type) (see FIGS. 24, 25, and 27, and steps S404 and S408 of FIG. 29). Further, the output control unit 208 of the external device 20A causes the speaker 47 to output the voice of the wording indicating the determined setting contents (step S412). Therefore, the second user can appropriately cause the MFP 10 to execute a job based on voice.

Alternatively, a configuration is conceivable in which the external device 20A transmits the setting contents to the MFP 10 when the setting contents displayed on the external device 20A are determined. However, such a configuration requires a communication means between the external device 20A and the MFP 10, which increases cost. The external device 20A of the present embodiment does not require such a communication means. Therefore, the external device 20A and the image forming system 1000A can suppress an increase in cost.

また、外部装置２０Ａは、表示領域４４２Ａに表示された設定内容がユーザーにより指定されることにより、設定内容の選択を受付ける（ステップＳ４０４参照）。したがって、ユーザーは手軽に設定内容を選択できる。 Further, the external device 20A accepts the selection of the setting contents when the setting contents displayed in the display area 442A are specified by the user (see step S404). Therefore, the user can easily select the setting contents.

Further, when the setting contents are determined by the user, the request unit 212 requests, from the server device 70, voice data based on the selected setting contents. Upon receiving the requested voice data, the output control unit 208 causes the speaker 47 to output voice based on the voice data. When the save button 614 is operated by the user, the request unit 212 likewise requests, from the server device 70, voice data based on the selected setting contents, and the received voice data is stored in the storage unit 204 (step S416 in FIG. 29). After the voice data has been stored, the external device 20A can output voice based on the stored voice data even if the MFP application is not started (that is, even if the user does not perform an operation to start the MFP application). For example, for setting contents that the user uses frequently (that is, setting contents for which voice is output frequently), the user preferably stores the voice data of those setting contents in the external device 20A. As a result, the user can have the external device 20A output voice based on the voice data of the setting contents without performing the operation to start the MFP application. Therefore, the operation load on the user can be reduced.
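The save-and-replay behavior described above can be sketched as follows. This is only an illustration under stated assumptions: the class name `SavedVoices` and its methods are hypothetical, the dictionary stands in for the storage unit 204, and `request_voice` stands in for the request that the request unit 212 sends to the server device 70.

```python
from typing import Callable, Dict


class SavedVoices:
    """Keeps voice data for saved settings so it can be replayed later."""

    def __init__(self, request_voice: Callable[[str], bytes]) -> None:
        # Hypothetical stand-in for the request to the server device 70.
        self._request_voice = request_voice
        # Hypothetical stand-in for the storage unit 204.
        self._saved: Dict[str, bytes] = {}

    def save(self, setting: str) -> None:
        """Save button 614: request the voice data once and keep it."""
        self._saved[setting] = self._request_voice(setting)

    def replay(self, setting: str, play: Callable[[bytes], None]) -> bool:
        """Play stored voice data without a new request.

        Returns False when no voice data is stored for the setting.
        """
        data = self._saved.get(setting)
        if data is None:
            return False
        play(data)
        return True
```

Because `replay` never calls `request_voice`, a frequently used setting costs one request at save time and none afterwards, which mirrors the reduced operation load described in the text.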

Further, the controller 41A changes the setting contents selected by the user to setting contents corresponding to a job that can be executed by the MFP specified by the execution button operated by the user (step S414). Therefore, the external device 20A spares the user the burden of changing the setting contents manually.

また、第８実施形態の外部装置２０Ａは、ＭＦＰアプリケーションをダウンロードすることにより、全ての設定内容の音声データを記憶するようにしてもよい。 Further, the external device 20A of the eighth embodiment may store the audio data of all the setting contents by downloading the MFP application.

The processing in the external device 20A is realized by the hardware and by software executed by the CPU 411. Such software may be stored in advance in a predetermined storage device (for example, the ROM 412 or a flash memory (not shown)). The software may also be stored on a memory card or other storage medium and distributed as a program product. Alternatively, the software may be provided as a downloadable program product by an information provider connected to the Internet. Such software is read from the storage medium by an IC card reader/writer or other reading device, or is downloaded via a communication IF, and is then temporarily stored in the ROM 412. The software is read from the ROM 412 by the CPU 411, and the CPU 411 executes the program.

The recording medium is not limited to a DVD-ROM, a CD-ROM, an FD (Flexible Disk), or a hard disk, and may be any medium that carries the program in a fixed manner, such as a magnetic tape, a cassette tape, an optical disc (MO (Magneto-Optical Disc) / MD (Mini Disc) / DVD (Digital Versatile Disc)), an optical card, or a semiconductor memory such as a mask ROM, an EPROM (Erasable Programmable Read-Only Memory), an EEPROM (Electrically Erasable Programmable Read-Only Memory), or a flash ROM. The recording medium is a non-transitory medium from which a computer can read the program and the like.

また、プログラムとは、ＣＰＵにより直接実行可能なプログラムだけでなく、ソースプログラム形式のプログラム、圧縮処理されたプログラム、暗号化されたプログラム等を含む。 The program includes not only a program that can be directly executed by the CPU, but also a source program format program, a compressed program, an encrypted program, and the like.

また、今回開示された各実施形態は全ての点で例示であって制限的なものではないと考えられるべきである。本発明の範囲は上記した説明ではなくて特許請求の範囲によって示され、特許請求の範囲と均等の意味および範囲内での全ての変更が含まれることが意図される。また、実施形態および各変形例において説明された発明は、可能な限り、単独でも、組合わせても、実施することが意図される。 In addition, each embodiment disclosed this time should be considered to be exemplary in all respects and not restrictive. The scope of the present invention is shown by the scope of claims rather than the above description, and it is intended to include all modifications within the meaning and scope equivalent to the scope of claims. In addition, the inventions described in the embodiments and the modifications are intended to be implemented, either alone or in combination, wherever possible.

１００情報処理装置、１０２取得部、１０４生成部、１０６送信部、１１０音声受付部、１１２音声認識部、１１４設定部、１１６実行部、１２０ジョブユニット、１２２第１促進部、２０２受信部、２０４記憶部、２０６操作受付部、２０８出力制御部。 100 Information processing device, 102 Acquisition unit, 104 Generation unit, 106 Transmission unit, 110 Voice reception unit, 112 Voice recognition unit, 114 Setting unit, 116 Execution unit, 120 Job unit, 122 First promotion unit, 202 Reception unit, 204 Storage unit, 206 operation reception unit, 208 output control unit.

Claims

An information processing device comprising:
an acquisition unit that acquires setting contents of a job;
a generation unit that generates voice data of voice of wording indicating the setting contents acquired by the acquisition unit; and
a transmission unit that transmits the voice data to an external device that outputs voice based on the voice data.

The information processing device according to claim 1, wherein
the acquisition unit acquires a plurality of setting contents,
the generation unit generates the voice data for each of the plurality of setting contents, and
the transmission unit transmits the voice data for each of the plurality of setting contents to the external device, which outputs voice based on the voice data for each of the plurality of setting contents.

The information processing device according to claim 1 or 2, further comprising:
a microphone to which voice is input; and
a first promotion unit that executes a first promotion process that prompts the user to input the user's voice to the microphone, wherein
the generation unit generates new voice data by adding voice data of the user's voice, input to the microphone after execution of the first promotion process, to the voice data generated by the generation unit, and
the transmission unit transmits the new voice data to the external device, which outputs voice based on the new voice data.

The information processing device according to claim 3, wherein
the first promotion unit executes a second promotion process that prompts the user to input the user's voice to the microphone when voice based on the new voice data is input to the microphone from the external device, and
the information processing device further comprises a first execution unit that executes a job based on the setting contents indicated by the voice based on the new voice data when feature information of the user's voice input to the microphone after execution of the second promotion process matches feature information of the user's voice included in the voice based on the new voice data.

An information processing device comprising:
a microphone to which voice is input;
a first promotion unit that prompts the user to input the user's voice to the microphone when voice of wording indicating setting contents of a job and the user's voice are input to the microphone; and
a first execution unit that executes a job based on the setting contents of the job indicated by the voice input to the microphone before the prompting by the first promotion unit, when feature information of the user's voice input to the microphone before the prompting matches feature information of the user's voice input to the microphone after the prompting.

The information processing device according to any one of claims 3 to 5, wherein the voice feature information is a voiceprint.

The information processing device according to any one of claims 1 to 4, wherein
the generation unit generates voice data based on content having fewer characters than the wording indicating the setting contents acquired by the acquisition unit, and
the transmission unit transmits the voice data to the external device so that the external device outputs voice based on the voice data generated from the content having fewer characters.

The information processing device according to any one of claims 1 to 7, further comprising a second execution unit that executes a job based on the setting contents indicated by the voice output from the external device, wherein
the acquisition unit acquires a change to the setting contents of the job, and
the second execution unit executes the job based on the setting contents reflecting the change acquired by the acquisition unit.

An information processing device comprising:
a microphone to which voice is input;
an execution unit that, when voice of wording indicating setting contents of a job is input to the microphone, executes a job based on the setting contents indicated by the voice; and
an acquisition unit that acquires a change to the setting contents of the job, wherein
the execution unit executes the job based on the setting contents reflecting the change input to the acquisition unit.

The information processing apparatus according to claim 8 or 9, further comprising a second promotion unit that prompts the user to input a change in the setting content to the acquisition unit.

A program for causing a computer to execute:
a selection acceptance procedure that accepts a selection of setting contents of a job to be executed by an information processing device;
a determination acceptance procedure that accepts a determination of the selected setting contents; and
an output procedure that outputs voice of wording indicating the determined setting contents.

The program according to claim 11, wherein the selection acceptance procedure accepts the selection of the setting contents when the user designates setting contents displayed in a display area.

The program according to claim 11 or 12, wherein
the program causes the computer to further execute a request procedure for requesting, from a server device, voice data based on the determined setting contents,
the output procedure is a procedure for outputting voice of the voice data received from the server device in response to the request procedure, and
the program causes the computer to further execute a storage procedure for storing, in a predetermined area, the voice data received from the server device in response to the request procedure.

The program causes the computer to further execute a change procedure for changing the setting contents selected in the selection acceptance procedure to the setting contents according to a job that can be executed by the computer. The program described in any one of the items.

An image forming apparatus including the information processing apparatus according to any one of claims 1 to 10.

An information processing method comprising:
an acquisition step of acquiring setting contents of a job;
a generation step of generating voice data of voice of wording indicating the setting contents acquired in the acquisition step; and
a transmission step of transmitting the voice data to an external device so that the external device outputs voice based on the voice data.