JP4259363B2

JP4259363B2 - Video encoding device

Info

Publication number: JP4259363B2
Application number: JP2004081253A
Authority: JP
Inventors: 敬西; 潔福井
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 2004-03-19
Filing date: 2004-03-19
Publication date: 2009-04-30
Anticipated expiration: 2024-03-19
Also published as: JP2005269428A

Description

本発明は動画像符号化装置に関し、例えば、携帯電話機に設けられたカメラなどから入力された動画像の符号化を行う装置に適用し得るものである。 The present invention relates to a moving image encoding apparatus, and can be applied to, for example, an apparatus that encodes a moving image input from a camera or the like provided in a mobile phone.

例えば、ビデオカメラなどで撮影された映像を、通信回線を用いて送信する場合には、一般に符号化という手法が用いられる。符号化とは、映像（画像）データを圧縮する処理のことであり、フレーム内符号化、フレーム間符号化、予測符号化などが知られている。また、さまざまな符号化方法のうちから、バッファの残量などに対応した方法を適宜に選択して用いるという技術なども一般に知られている。 For example, when video captured by a video camera or the like is transmitted using a communication line, a technique called encoding is generally used. Encoding is a process of compressing video (image) data, and intraframe encoding, interframe encoding, predictive encoding, and the like are known. In addition, a technique of appropriately selecting and using a method corresponding to the remaining amount of buffer among various encoding methods is generally known.

上記のような技術では、その時々の状況や、各システムに特有の制御方法に対応した符号化を行うことが要請されるが、さまざまな符号化方法に対して、外部から入力される１種類のコントロール信号のみで適切な対応を行うことは困難である場合が多い。 In the technology as described above, it is required to perform encoding corresponding to the situation at each time and a control method peculiar to each system, but one type input from the outside for various encoding methods. In many cases, it is difficult to perform an appropriate response using only the control signal.

ここで、屋外で怪我人がでたときに、携帯電話機を用いて病院に連絡を取り、動画像を利用してそのときの怪我の状態を伝えるというような場合を一例として考える。この場合、怪我の発生時の状況等を言葉で説明しているときには、特に画質を優先する必要はないが、怪我の患部の状態を一目瞭然に病院側に伝えたいときには、その患部の画像を病院へと伝送することが極めて有効である。従って、その場合には、患部の画像をビデオカメラなどで撮影し、その映像信号を携帯電話機を介して病院へと伝送することが要請される。このような患部の全体的な状況をまず伝達したい場合などには、動画像の画質を最優先にすることが望まれる。 Here, when the injured people came out in the outdoors, consider contact the hospital by using a mobile phone, a case such as that using a moving picture tell the state of the injury at that time as an example. In this case, it is not necessary to give priority to the image quality when describing the situation at the time of the injury in words, but if you want to tell the state of the affected area of the injury clearly to the hospital, the image of the affected area is displayed in the hospital. It is extremely effective to transmit to. Therefore, in that case, it is required to take an image of the affected area with a video camera or the like and transmit the video signal to the hospital via the mobile phone. For example, when it is desired to first communicate the overall situation of the affected area, it is desirable to give the highest priority to the quality of moving images.

あるいは、より詳細に患部の状態を伝達するためには、カメラの光学的なズーム機能を使うなどして、患部の拡大画像を撮影することが必要となる場合も多い。 Alternatively, in order to transmit the state of the affected area in more detail, it is often necessary to take an enlarged image of the affected area by using an optical zoom function of the camera.

ところが、そのように患部画像をズームインすると、フレーム間の動きが大きくなって、画質が劣化することがある。 However, when the affected area image is zoomed in in this manner, the motion between frames becomes large, and the image quality may deteriorate.

そこで、このようなズームインの際に生じることの多い拡大画像の画質の劣化を回避するために、従来の技術では、ビデオカメラにおける、光学的なズーム制御と符号化方法の選択とを、連動させるようにしている（特許文献１）。 Therefore, in order to avoid such deterioration of the image quality of an enlarged image that often occurs during zoom-in, in the conventional technique, optical zoom control and selection of an encoding method in a video camera are linked. (Patent Document 1).

すなわち、ズームイン時には、圧縮率を下げて高精細な画像を復元できるような方式の符号化を選択して行い、ズームアウト時には、１フレーム当たりの圧縮率を上げてフレーム数を多くすることで動きを優先することができるような方式の符号化を選択して行うようにしている。そして、このようにして最適な方式によって符号化された画像信号は、携帯電話機及びその通信回線等を伝送手段として経由して、病院などの送信目的地へと転送される。
特開平８−４６８５７号公報 In other words, when zooming in, select a coding method that can restore a high-definition image by reducing the compression rate, and during zoom-out, increase the compression rate per frame to increase the number of frames. Is selected and performed so that priority can be given. Then, the image signal encoded by the optimum method in this way is transferred to a transmission destination such as a hospital via a mobile phone and its communication line as a transmission means.
JP-A-8-46857

しかし、上記のような最適な符号化方式の選択は、光学的ズーム機能を有するビデオカメラ等においては可能であるが、例えば、カメラ付携帯電話機に用いられているような、光学的ズーム機能の付いていないカメラにおいては適用できない。 However, the optimum encoding method as described above can be selected in a video camera or the like having an optical zoom function. For example, an optical zoom function such as that used in a camera-equipped mobile phone is used. It cannot be applied to cameras that are not attached.

また、敢えてカメラ付携帯電話機のカメラに光学的ズーム機構等を付加すると、少なくともその部分の構造が煩雑なものとなり、ひいては携帯電話機全体の寸法の小型・薄型化の著しい妨げとなるという問題や、製造コストや部品コストが高額化する恐れがあるという問題がある。 In addition, when an optical zoom mechanism or the like is added to the camera of the camera-equipped mobile phone, the structure of at least that part becomes complicated, and as a result, the size and overall thickness of the mobile phone are significantly hindered. There is a problem that manufacturing costs and component costs may increase.

さらに、仮に、そのような光学的ズーム機構等を付加できたとしても、その光学的ズームの機能等の使い方に不慣れなユーザにとっては、その操作が難しいものと感じられるという、別段の問題が発生する。例えば、カメラ付携帯電話機の場合には、一般に、撮影する画像のズームイン等のような加工処理は、入力操作用ボタン等を操作することによって電子的に行われるようになっており、光学的に行っているわけではない。このため、一般的なユーザにとっては、入力操作用ボタン等の操作とは別に、光学的ズームのような光学的な機能の操作を行うことに対しては、不慣れである場合が多く、その操作の簡便さや使い勝手の点で、種々の不都合が生じる恐れがある。 Furthermore, even if such an optical zoom mechanism or the like can be added, there is another problem that it is difficult for a user who is unfamiliar with how to use the optical zoom function or the like. To do. For example, in the case of a camera-equipped mobile phone, processing such as zooming in on an image to be taken is generally performed electronically by operating an input operation button or the like, and optically I'm not going. For this reason, it is often unfamiliar for general users to operate optical functions such as optical zoom separately from operations such as input operation buttons. Various inconveniences may occur in terms of simplicity and ease of use.

また、光学的ズーム機能を備えていないカメラ付携帯電話機などで、光学的ズーム機能を用いること以外の方法によって、撮影した映像を拡大する方法としては、撮影された画像を信号処理によって拡大する、という方法がある。 In addition, as a method of enlarging a captured image by a method other than using the optical zoom function in a camera-equipped mobile phone that does not have an optical zoom function, the captured image is expanded by signal processing. There is a method.

ところが、そのように信号処理によって画像を拡大すると、画像の精細度が劣化してしまう。また、そのような信号処理によって拡大されて精細度が劣化した画像に圧縮率が高くなるような符号化処理を施すと、画像の精細度はさらに劣化してしまうという問題がある。 However, when the image is enlarged by such signal processing, the definition of the image deteriorates. In addition, if an encoding process that increases the compression rate is performed on an image that has been enlarged by such signal processing and has deteriorated in definition, the definition of the image is further deteriorated.

本発明は、上述した課題を解決するためになされたもので、光学的ズーム機能を有さない動画像撮像手段を備えた電子機器等における、画像の撮像状態に対応して最適な符号化方式を選択して実行することを可能とする動画像符号化装置を提供することを目的とする。 The present invention has been made to solve the above-described problem, and is an optimal encoding method corresponding to an image capturing state in an electronic apparatus or the like including a moving image capturing unit that does not have an optical zoom function. An object of the present invention is to provide a moving picture coding apparatus that can select and execute the above.

かかる課題を解決するために、本発明に係る動画像符号化装置は、動画像信号を入力する動画像信号入力手段と、所定の複数の符号化方式のうちからいずれかの符号化方式を選択し、当該符号化方式によって、入力された動画像信号を符号化する符号化手段とを有する、光学的ズーム機能を有さない動画像撮像手段を備えた電子機器に設けられた動画像符号化装置であって、入力された動画像信号によって再現される画像の状態を検出する画像状態検出手段と、画像状態検出手段によって検出された画像状態に対応して、複数の符号化方式のうちから符号化方式を選択し、当該符号化方式によって、入力された動画像信号を符号化するように、符号化手段を制御する符号化制御手段とを備え、上記画像状態検出手段は、入力された現画像と当該現画像よりも時間的に前の前画像とを比較する画像比較部と、前画像を格納する前画像記憶部と、画像状態の判定結果の情報を記憶する画像状態記憶部と、画像比較部による比較結果及び画像状態記憶部に記憶されている過去の画像状態に基づいて、画像状態を判定する画像状態判定部とを備え、この画像状態判定部は、現画像が前画像と比較して拡大されたものであると判定した場合、及び、現画像が、拡大状態が続いた後でなった静止状態の画像と判定した場合には、符号化制御手段に対して、動画像信号を画質優先で符号化する符号化方式を選択するように指示することを特徴とする。 In order to solve such a problem, a moving image encoding apparatus according to the present invention selects a moving image signal input means for inputting a moving image signal and one of a plurality of predetermined encoding methods. In addition, according to the encoding method, the moving image encoding provided in the electronic apparatus including the moving image capturing unit that does not have the optical zoom function, and the encoding unit that encodes the input moving image signal. An image state detection unit that detects a state of an image reproduced by an input moving image signal, and a plurality of encoding methods corresponding to the image state detected by the image state detection unit select the encoding method, by the coding method, to encode the input moving image signal, e Bei a coding control means for controlling the encoding means, said image condition detection means is input The current image An image comparison unit that compares the previous image temporally before the current image, a previous image storage unit that stores the previous image, an image state storage unit that stores information on the determination result of the image state, and an image comparison unit And an image state determination unit that determines an image state based on the comparison result of the past and the past image state stored in the image state storage unit. The image state determination unit compares the current image with the previous image. If it is determined that the image has been enlarged, and if the current image is determined to be a still image after the enlargement has continued, the image signal is transmitted to the encoding control means. An instruction is given to select a coding method to be coded with priority .

かかる構成により、入力された動画像信号が表す画像の状態に適応した符号化方式を自動的に選択し、その最適な符号化方式によって、最適な動画像信号の符号化が行われる。 With such a configuration, an encoding method adapted to the state of the image represented by the input moving image signal is automatically selected, and the optimal moving image signal is encoded by the optimal encoding method.

本発明の動画像符号化装置によれば、入力された動画像信号が表す画像の状態に適応した符号化方式を自動的に選択し、その最適な符号化方式によって、画像の状態に適応した最適な動画像信号の符号化を行うことができるので、光学的ズーム機能を有さない動画像撮像手段を備えた電子機器（例えば、カメラ付携帯電話機）においても、ズームイン状態か静止画像状態か動画像状態か、といった画像の状態に対して最適な符号化方式を選択して実行することが可能となる。 According to the moving image encoding apparatus of the present invention, an encoding method adapted to the state of the image represented by the input moving image signal is automatically selected, and the image state is adapted to the optimum encoding method. Since it is possible to encode an optimal moving image signal, whether it is in a zoomed-in state or a still-image state even in an electronic device (for example, a camera-equipped mobile phone) equipped with a moving image capturing means that does not have an optical zoom function. It is possible to select and execute an optimal encoding method for an image state such as a moving image state.

以下、本発明による動画像符号化装置の最良の形態について図面を参照して説明する。 Hereinafter, the best mode of a moving picture coding apparatus according to the present invention will be described with reference to the drawings.

（Ａ）第１の実施形態
図１は、本発明の第１の実施形態に係る動画像符号化装置の主要部の構成を示すブロック図である。 (A) First Embodiment FIG. 1 is a block diagram illustrating a configuration of a main part of a video encoding device according to a first embodiment of the present invention.

この動画像符号化装置は、入力された画像データの状態を検出する画像状態検出部１００と、カメラ装置などから画像データを入力する動画像信号入力部１０１と、入力された画像データを符号化する動画像符号化部１０２と、入力された画像データの符号化方法を制御する符号化制御部１０３とから、その主要部が構成されている。 The moving image encoding apparatus includes an image state detection unit 100 that detects the state of input image data, a moving image signal input unit 101 that inputs image data from a camera device, and the like, and encodes the input image data. The main part is comprised from the moving image encoding part 102 which performs, and the encoding control part 103 which controls the encoding method of the input image data.

画像状態検出部１００は、さらに詳細には、入力された画像データと前フレームの画像データとを比較する画像比較部１０４と、前画像データを記憶する前画像記憶部１０５と、入力された画像データと前フレームの画像データとの比較結果より画像の状態を判定する画像状態判定部１０６とを備えている。 More specifically, the image state detection unit 100 compares the input image data with the image data of the previous frame, the previous image storage unit 105 that stores the previous image data, and the input image. An image state determination unit 106 that determines the state of the image based on the comparison result between the data and the image data of the previous frame is provided.

動画像信号入力部１０１は、画像符号化部１０２及び画像比較部１０４に接続されており、入力された画像データを動画像符号化部１０２及び画像比較部１０４に送信する機能を有している。 The moving image signal input unit 101 is connected to the image encoding unit 102 and the image comparison unit 104 and has a function of transmitting the input image data to the moving image encoding unit 102 and the image comparison unit 104. .

動画像符号化部１０２は、動画像信号入力部１０１及び符号化制御部１０３に接続され、動画像信号入力部１０１から入力された画像データを、符号化制御部１０３から指示される符号化方式によって符号化して、その符号化されたデータを出力する機能を有している。 The moving image encoding unit 102 is connected to the moving image signal input unit 101 and the encoding control unit 103, and the image data input from the moving image signal input unit 101 is encoded by the encoding control unit 103. And a function of outputting the encoded data.

画像比較部１０４は、動画像信号入力部１０１、前画像記憶部１０５及び画像状態判定部１０６に接続され、動画像信号入力部１０１から入力された画像データと前画像記憶部１０５に記憶されている前フレームの画像との比較を行い、その結果を、画像状態判定部１０６へと送出する機能を有している。 The image comparison unit 104 is connected to the moving image signal input unit 101, the previous image storage unit 105, and the image state determination unit 106, and is stored in the previous image storage unit 105 with the image data input from the moving image signal input unit 101. It has a function of comparing the image with the previous frame image and sending the result to the image state determination unit 106.

前画像記憶部１０５は、画像比較部１０４に接続され、その画像比較部１０４で現フレームの画像データとして用いられた画像データを、次回の比較のために、前フレーム画像データとして記憶する機能を有している。 The previous image storage unit 105 is connected to the image comparison unit 104 and has a function of storing the image data used as the image data of the current frame by the image comparison unit 104 as the previous frame image data for the next comparison. Have.

画像状態判定部１０６は、画像比較部１０４及び符号化制御部１０３に接続され、画像比較部１０４の処理結果を受けて画像の状態を判定し、その判定結果に基づいて、符号化方法を決定し、それを選択する指令を符号化制御部１０３へと送信する機能を有している。 The image state determination unit 106 is connected to the image comparison unit 104 and the encoding control unit 103, receives the processing result of the image comparison unit 104, determines the state of the image, and determines the encoding method based on the determination result. And has a function of transmitting a command for selecting it to the encoding control unit 103.

符号化制御部１０３は、画像状態判定部１０６及び動画像符号化部１０２に接続され、画像状態検出部１００内の画像状態判定部１０６から出力される指令に基づいて、最適な符号化方式を選択して、その符号化方式に則した符号化処理を動画像符号化部１０２に実行させる、という制御を行う機能を有している。 The encoding control unit 103 is connected to the image state determination unit 106 and the moving image encoding unit 102, and selects an optimal encoding method based on a command output from the image state determination unit 106 in the image state detection unit 100. It has a function of performing control to select and cause the moving image encoding unit 102 to execute encoding processing according to the encoding method.

次に、この第１の実施形態に係る動画像符号化装置の動作を、図２のフローチャートに基づいて説明する。 Next, the operation of the moving picture coding apparatus according to the first embodiment will be described based on the flowchart of FIG.

まず、ステップＳ１０１において、フレーム数Ｎをカウントする変数を０（Ｎ＝０）とする。 First, in step S101, a variable for counting the number of frames N is set to 0 (N = 0).

次に、カメラなどから動画像信号入力部１０１を通じて入力された画像データがある場合には（ステップＳ１０２のＹＥＳ）、その画像データを、画像比較部１０４に入力する（ステップＳ１０３）。入力データがない場合には（ステップＳ１０２のＮＯ）処理を終了する（ＥＮＤ）。 Next, when there is image data input from the camera or the like through the moving image signal input unit 101 (YES in step S102), the image data is input to the image comparison unit 104 (step S103). If there is no input data (NO in step S102), the process ends (END).

入力された画像が、１フレーム目である場合（ステップＳ１０４のＮＯ：Ｎ＝０のとき）には、その画像データを、前画像記憶部１０５に記憶する（ステップＳ１０５）。そして、次の画像データが入力されるのを待つ（ステップＳ１０５〜ステップＳ１０２）。 When the input image is the first frame (NO in step S104: N = 0), the image data is stored in the previous image storage unit 105 (step S105). Then, it waits for the next image data to be input (steps S105 to S102).

一方、ステップＳ１０４で、２フレーム目以降の画像であると判定された場合には（ステップＳ１０４のＹＥＳ：Ｎ≠０のとき）、フレーム数Ｎの１インクリメントを行なった後（ステップＳ１０６）、前画像記憶部１０５に記憶されている前画像を読み出して、その前画像と今入力された画像との比較を、画像比較部１０４で行う（ステップＳ１０７）。 On the other hand, if it is determined in step S104 that the image is the second frame or later (YES in step S104: N ≠ 0), after incrementing the frame number N by 1 (step S106), the previous The previous image stored in the image storage unit 105 is read, and the image comparison unit 104 compares the previous image with the currently input image (step S107).

画像比較部１０４での比較の結果は、画像状態判定部１０６に入力されて、そのときの画像の状態がズームイン画像状態であるか否かが判定される（ステップＳ１０８）。ここで、ズームイン画像であると判定された場合には（ステップＳ１０８のＹＥＳ）、画質重視の符号化を選択することを、符号化制御手段１０３に対して指示する（ステップＳ１０９）。この比較方法及び判定方法についてのさらに詳細な説明については、図３及び図４を用いて後述する。これに対して、ステップＳ１０８でズームイン画像ではないと判定された場合には（ステップＳ１０８のＮＯ）、動き重視の符号化を選択することを、符号化制御部１０３に対して指示する（ステップＳ１１０）。 The result of the comparison in the image comparison unit 104 is input to the image state determination unit 106, and it is determined whether or not the state of the image at that time is a zoomed-in image state (step S108). If it is determined that the image is a zoomed-in image (YES in step S108), the encoding control unit 103 is instructed to select encoding that emphasizes image quality (step S109). A more detailed description of the comparison method and the determination method will be described later with reference to FIGS. On the other hand, if it is determined in step S108 that the image is not a zoomed-in image (NO in step S108), the encoding control unit 103 is instructed to select motion-oriented encoding (step S110). ).

いずれにしても、ここで符号化方法の指示が完了すると（ステップＳ１０９又はＳ１１０）、入力された画像データを前画像記憶部１０５に記憶する（ステップＳ１０５）。そして、次の画像データが入力されるのを待つ（ステップＳ１０２）。 In any case, when the encoding method instruction is completed (step S109 or S110), the input image data is stored in the previous image storage unit 105 (step S105). Then, it waits for the next image data to be input (step S102).

一方、符号化制御部１０３は、画像の状態に適応した符号化方式を指示されると、それが画質重視の符号化方式を選択する指示である場合には（ステップＳ１０９の場合には）、フレームレートを下げる、１フレーム当りの圧縮率を下げる、量子化ステップサイズを小さくするなど、より高精細な画像を再現（復元）できるような符号化方式を選択して実行するという制御を、動画像符号化部１０２に対して行う。あるいは、動き重視の符号化方式を選択する指示の場合には（ステップＳ１１０の場合には）、フレームレートを上げる、１フレーム当りの圧縮率を上げる、量子化ステップサイズを大きくするなどの符号化方式を選択して実行するという制御を、動画像符号化部１０２に対して行う。 On the other hand, when the encoding control unit 103 is instructed to select an encoding method adapted to the state of the image, when it is an instruction to select an encoding method emphasizing image quality (in the case of step S109), Controls to select and execute a coding method that can reproduce (restore) higher-definition images, such as lowering the frame rate, lowering the compression rate per frame, and reducing the quantization step size. This is performed on the image encoding unit 102. Alternatively, in the case of an instruction to select a motion-oriented coding method (in the case of step S110), encoding such as increasing the frame rate, increasing the compression rate per frame, or increasing the quantization step size. Control for selecting and executing the method is performed on the moving image encoding unit 102.

そして、動画像符号化部１０２では、符号化制御部１０３によって制御されて、最適なものとして選択された符号化方法による符号化を、動画像信号入力部１０１から入力された画像データに対して実行して、その符号化されたデータを出力する。 Then, the moving image coding unit 102 performs coding on the image data input from the moving image signal input unit 101 by the coding method selected by the coding control unit 103 and selected as the optimum one. Execute and output the encoded data.

次に、画像比較部１０４及び画像状態判定部１０６で行われる比較方法及び判定方法（主にステップＳ１０７、Ｓ１０８で行われる動作）について、図３及び図４を用いて、さらに詳細に説明する。 Next, the comparison method and determination method (operations mainly performed in steps S107 and S108) performed by the image comparison unit 104 and the image state determination unit 106 will be described in more detail with reference to FIGS.

図３は、通常のいわゆる広角の画像の状態から次第にズームイン（ズームアップ又はクローズアップとも呼ぶ）して行く、時系列的に連続した３フレーム分の画像データの一例を表したものである。この図３では、フレーム番号の若い順に（前の画像から後の画像への順に）、［０］、［１］、［２］と番号を付している。 FIG. 3 shows an example of image data for three consecutive frames in a time series that is gradually zoomed in (also called zoom-up or close-up) from a normal so-called wide-angle image state. In FIG. 3, [0], [1], and [2] are assigned numbers in ascending order of frame numbers (in order from the previous image to the subsequent image).

また、図４は、画像比較・判定処理の主要な流れを示すフローチャートである。このフローチャートを説明する前提として、フレーム［０］の画像データ（以降［０］と呼ぶ）は、既に前画像記憶部１０５に記憶されているものとして説明する。 FIG. 4 is a flowchart showing the main flow of image comparison / determination processing. As a premise for explaining this flowchart, it is assumed that the image data of frame [0] (hereinafter referred to as [0]) is already stored in the previous image storage unit 105.

カウンタＣを０にセットした後（ステップＳ４０１：Ｃ＝０）、図３に示した前画像（ここでは［０］）における、点線部分に示したような画面中央部の画像を切り出す（ステップＳ４０２）。そして、切り出された画像の拡大画像のデータを、信号処理上の手法によって作成する（ステップＳ４０３）。この拡大処理は、拡大用のフィルタを使って行っても良く、あるいは単に、縦、横に所定の倍率で拡大した画像データを作成しても良い。 After the counter C is set to 0 (step S401: C = 0), the image at the center of the screen as shown by the dotted line portion in the previous image shown in FIG. 3 (here, [0]) is cut out (step S402). ). Then, enlarged image data of the clipped image is created by a signal processing technique (step S403). This enlargement process may be performed using an enlargement filter, or image data enlarged simply at a predetermined magnification in the vertical and horizontal directions may be created.

その［０］を拡大してなる画像と、現画像（ここでは［１］）との、マッチングを行う（ステップＳ４０４）。その比較した結果、両者の差分が規定値よりも小さいなど、両者がマッチした場合には（ステップＳ４０４のＹＥＳ）、ズームイン画像であると判定し、符号化制御部１０３に画質重視の符号化方式を選択して実行する制御を行う指令を出力して（ステップＳ４０５）、一連の処理を終了する（ＥＮＤ）。一方、両者がマッチしていない場合には（ステップＳ４０４のＮＯ）、カウンタＣを１インクリメントする（ステップＳ４０６：例えばＣ＝０をＣ＝１に書き替えるなどする）。 The image obtained by enlarging [0] is matched with the current image ([1] here) (step S404). As a result of the comparison, if the two match, for example, if the difference between the two is smaller than the specified value (YES in step S404), it is determined that the image is a zoomed-in image, and the encoding control unit 103 instructs the encoding control unit 103 to focus on image quality. A command for performing control to select and execute is output (step S405), and a series of processing ends (END). On the other hand, if the two do not match (NO in step S404), the counter C is incremented by 1 (step S406: for example, C = 0 is rewritten to C = 1).

カウンタＣの示す数値が、所定の回数（ここではα回）以上の値になっていなければ（ステップＳ４０７のＮＯ：換言すれば所定の回数未満の場合には）、ステップＳ４０２に戻り、再度、上記と同様の処理（ステップＳ４０１〜Ｓ４０７）を行う。但し、次回のステップＳ４０３では、上記とは異なった拡大率で拡大画像を作成する。 If the numerical value indicated by the counter C is not equal to or greater than the predetermined number of times (here α times) (NO in step S407: in other words, less than the predetermined number), the process returns to step S402, and again, Processing similar to the above (steps S401 to S407) is performed. However, in the next step S403, an enlarged image is created at an enlargement rate different from the above.

あるいは、カウンタＣの示す数値が、所定の回数以上になったことが判定された場合には（ステップＳ４０７のＹＥＳ）、動き重視の符号化方式を選択する指令を符号化制御部１０３へと出力して（ステップＳ４０８）、一連の処理を終了する（ＥＮＤ）。 Alternatively, when it is determined that the numerical value indicated by the counter C is equal to or greater than the predetermined number of times (YES in step S407), a command for selecting a motion-oriented coding method is output to the coding control unit 103. (Step S408), and a series of processing ends (END).

以上のような処理は、さらに次の画像に対しても上記と同様にして続けられる。すなわち、次の画像である［２］の画像についても、それを現画像とし、それよりも時間的に一つ前の画像である［１］の画像を前画像として、上記同様の処理を行う（ステップＳ４０１〜Ｓ４０４）。そして、［２］の画像がズームイン画像であると判定された場合には（ステップＳ４０４のＹＥＳ）、前画像である［１］の画像の場合よりも、さらに画質を重視する制御を［２］に対して行う命令を、符号化制御部１０３へと出力する（ステップＳ４０５）。 The above processing is continued in the same manner as described above for the next image. That is, the same process as described above is performed on the next image [2] as the current image and the previous image [1] as the previous image. (Steps S401 to S404). If it is determined that the image [2] is a zoomed-in image (YES in step S404), control that places more importance on image quality than the image [1] that is the previous image is performed [2]. Is output to the encoding control unit 103 (step S405).

次に、この第１の実施形態に係る動画像符号化装置の作用、効果について説明する。 Next, operations and effects of the moving picture coding apparatus according to the first embodiment will be described.

図３に［０］として示したような通常の状態（広角の状態）のままで、撮影対象が動く、という動画像の信号が入力されている場合には、それを検知して（ステップＳ１０８のＮＯ〜ステップＳ１１０）、一つ一つのフレームごとの画質重視の符号化よりも、動き重視の符号化を選択して実行することで（例えば、フレームレートの高い符号化を選択して行うことで）、撮影対象の動きがスムーズに再現されるようにすることができる。 If a moving image signal indicating that the subject is moving is input in the normal state (wide angle state) as shown in FIG. 3 as [0], it is detected (step S108). NO to step S110), by selecting and executing motion-oriented coding rather than image-oriented coding for each frame (for example, selecting and performing coding with a high frame rate) ), The movement of the object to be photographed can be reproduced smoothly.

あるいはまた、図３に［０］→［１］→［２］として示したようなズームインの状態の動画像の信号が入力されている場合には、それを検知して（ステップＳ１０８のＹＥＳ〜ステップＳ１０９）、動き重視の符号化よりも、一つ一つのフレームごとの画質重視の符号化を選択して実行することで（例えば、量子化ステップサイズの小さい符号化を選択して行うことで）、ズームイン状態に耐え得る高精細な動画像が再現されるようにすることができる。 Alternatively, if a moving image signal in a zoom-in state as shown in [0] → [1] → [2] in FIG. 3 is input, it is detected (YES in step S108). In step S109, by selecting and executing encoding with emphasis on image quality for each frame rather than encoding with emphasis on motion (for example, by selecting and performing encoding with a small quantization step size). ), A high-definition moving image that can withstand a zoom-in state can be reproduced.

以上のように、第１の実施形態に係る動画像符号化装置によれば、光学的ズーム機能が付いていない、又は構造上取り付けることが困難な、カメラや電子機器等においても、信号処理によって画像のズームインやズームアウトを行うと共に、そのときの画像の状態を判定することで、ズームインのときには画質重視の符号化方式を選択し、それ以外のときは動き重視の符号化方式を選択するなど、画像の状態に応じて適切な符号化方式を自動的に的確に選択して、常に良好な表示品質の画像を再現することのできる符号化を実行することができる。 As described above, according to the moving picture coding apparatus according to the first embodiment, even in a camera, electronic device, or the like that does not have an optical zoom function or is difficult to attach structurally, signal processing is performed. In addition to zooming in and out of the image, determining the state of the image at that time, selecting an encoding method that emphasizes image quality when zooming in, and selecting an encoding method that emphasizes motion otherwise Thus, it is possible to automatically select an appropriate encoding method in accordance with the state of the image and execute encoding that can always reproduce an image with good display quality.

（Ｂ）第２の実施形態
図５は、第２の実施形態に係る動画像符号化装置の主要部の構成を表したものである。この動画像符号化装置は、画像状態検出部２００内で、画像の状態を記憶する画像状態記憶部５０１を備えている点、及び画像状態判定部５０２が、画像の状態を判定する機能及び静止状態を判別する機能をさらに備えている点が、第１の実施形態と異なっている。その他については、第１の実施形態と同様である。なお、第１の実施形態と同様の構成部位には同じ符号を付し、その詳細な説明については省略する。 (B) Second Embodiment FIG. 5 illustrates a configuration of a main part of a video encoding device according to a second embodiment. The moving image encoding apparatus includes an image state storage unit 501 that stores the state of an image in the image state detection unit 200, and a function that the image state determination unit 502 determines the state of the image and a still image. The point which is further provided with the function which discriminate | determines a state differs from 1st Embodiment. About others, it is the same as that of 1st Embodiment. In addition, the same code | symbol is attached | subjected to the component similar to 1st Embodiment, and the detailed description is abbreviate | omitted.

画像状態記憶部５０１は、画像状態判定部５０２に接続されており、画像状態判定部５０２で判定された画像の状態を、時系列的に順次記憶する機能を有している。 The image state storage unit 501 is connected to the image state determination unit 502 and has a function of sequentially storing the image states determined by the image state determination unit 502 in time series.

画像状態判定部５０２は、画像比較部１０４、画像状態記憶部５０１及び符号化制御手段１０３に接続されており、画像比較部１０４での比較の結果に基づいて、画像の状態を判定した後、必要に応じて静止状態も判定し、その結果を符号化制御部１０３に伝送する機能を有している。 The image state determination unit 502 is connected to the image comparison unit 104, the image state storage unit 501, and the encoding control unit 103. After determining the image state based on the comparison result in the image comparison unit 104, It has a function of determining a stationary state as necessary and transmitting the result to the encoding control unit 103.

次に、この第２の実施形態に係る動画像符号化装置の動作を、図５及び図６に基づいて説明する。 Next, the operation of the moving picture coding apparatus according to the second embodiment will be described with reference to FIGS.

まず、フレーム数Ｎをカウントする変数を０とする（ステップＳ６０１：Ｎ＝０）。次に、カメラ（図示省略）などから動画像信号入力部１０１を通して入力された画像データがある場合には（ステップＳ６０２のＹＥＳ）、その画像データは画像比較部１０４に入力される（ステップＳ６０４）。またそれと共に、ズームインが何フレーム連続しているかをカウントする変数Ｚと、静止状態が何フレーム連続しているかをカウントする変数Ｓとを、それぞれ０にセットする（ステップＳ６０３）。以後、変数Ｓ及びＺは、画像状態記憶部５０１にて記憶、更新される。入力データがない場合には、処理を終了する（ＥＮＤ）。 First, a variable for counting the number N of frames is set to 0 (step S601: N = 0). Next, when there is image data input through the moving image signal input unit 101 from a camera (not shown) or the like (YES in step S602), the image data is input to the image comparison unit 104 (step S604). . At the same time, a variable Z that counts how many frames the zoom-in continues and a variable S that counts how many frames the stationary state continues are set to 0 (step S603). Thereafter, the variables S and Z are stored and updated in the image state storage unit 501. If there is no input data, the process ends (END).

画像データがあり、その入力された画像が１フレーム目である場合には（Ｎ＝０のとき：ステップ６０５のＮＯ）、その画像データを前画像記憶部１０５に記憶する（ステップＳ６０６）。そして次フレームの画像データが入力されるのを待つ（ステップＳ６０６〜Ｓ６０２）。 If there is image data and the input image is the first frame (when N = 0: NO in step 605), the image data is stored in the previous image storage unit 105 (step S606). Then, it waits for image data of the next frame to be input (steps S606 to S602).

あるいは、２フレーム目以降の画像である場合には（Ｎ≠０のとき：ステップＳ６０５のＹＥＳ）、フレーム数Ｎのインクリメントを行なった後（ステップＳ６０７）、前画像記憶部１０５に記憶されている画像と、今入力された画像との比較を画像比較部１０４で行う（ステップＳ６０８）。 Alternatively, in the case of an image after the second frame (when N ≠ 0: YES in step S605), after incrementing the number of frames N (step S607), it is stored in the previous image storage unit 105. The image comparison unit 104 compares the image with the currently input image (step S608).

画像比較部１０４で行われた比較の結果は、画像状態判定部５０２に入力されて、ズームイン画像かどうかが判定される（ステップＳ６０９）。ここでズームイン画像であると判定された場合には（ステップＳ６０９のＹＥＳ）、変数Ｚをインクリメントすると共に、変数Ｓを０にセットし（ステップＳ６１０）、ステップＳ６０６に移る。ステップＳ６０６では、入力された画像データを前画像記憶部１０５に記憶し、次の画像データが入力されるのを待つ（ステップＳ６０２）。そして上記の流れを繰り返す。 The result of the comparison performed by the image comparison unit 104 is input to the image state determination unit 502, and it is determined whether the image is a zoomed-in image (step S609). If it is determined that the image is a zoomed-in image (YES in step S609), the variable Z is incremented, the variable S is set to 0 (step S610), and the process proceeds to step S606. In step S606, the input image data is stored in the previous image storage unit 105 and waits for the next image data to be input (step S602). Then, the above flow is repeated.

あるいは、ステップＳ６０９で、ズームイン画像ではないと判定された場合には（ステップＳ６０９のＮＯ）、過去にズームイン画像が連続したか（Ｚ＞０）、又は静止状態が連続しているか（Ｓ＞０）を確かめる（ステップＳ６１１）。そのいずれかの場合には（ステップＳ６１１のＹＥＳ）、さらに、現画像が前画像と大差のない状態（静止状態）であるかどうかを判定する（ステップＳ６１２）。 Alternatively, if it is determined in step S609 that the image is not a zoomed-in image (NO in step S609), whether the zoomed-in image has continued in the past (Z> 0) or whether the stationary state has continued (S> 0). ) Is confirmed (step S611). In either case (YES in step S611), it is further determined whether or not the current image is in a state (still state) that is not significantly different from the previous image (step S612).

そして、ステップＳ６１２にて静止状態ではないと判定された場合（ステップＳ６１２のＮＯ）、あるいはステップＳ６１１においてどちらの条件（Ｚ＞０又はＳ＞０）も満たさないと判定された場合には（ステップＳ６１１のＮＯ）、動き重視の符号化方式を選択して実行する制御を行う指令を、符号化制御部１０３に対して出力する（ステップＳ６１５）。 If it is determined in step S612 that the vehicle is not stationary (NO in step S612), or if it is determined in step S611 that neither condition (Z> 0 or S> 0) is satisfied (step S612) In step S <b> 611, a command for performing control to select and execute a motion-oriented coding method is output to the coding control unit 103 (step S <b> 615).

あるいは、ステップＳ６１２にて静止画像であると判定された場合には（ステップＳ６１２のＹＥＳ）、変数Ｓをインクリメントすると共に、変数Ｚを０にセットし（ステップＳ６１３）、画質重視の符号化を選択して実行する制御を行う指令を、符号化制御部１０３に対して出力する（ステップ６１４）。 Alternatively, if it is determined in step S612 that the image is a still image (YES in step S612), the variable S is incremented and the variable Z is set to 0 (step S613), and image quality-oriented encoding is selected. Then, a command for performing the control to be executed is output to the encoding control unit 103 (step 614).

続いて、入力された画像データを前画像記憶部１０５に記憶し（ステップＳ６０６）、次の画像データが入力されるのを待つ（ステップＳ６０６〜Ｓ６０２）。 Subsequently, the input image data is stored in the previous image storage unit 105 (step S606), and the next image data is waited for input (steps S606 to S602).

他方、符号化制御部１０３は、符号化方式を指示されると、それが画質重視の符号化方式を選択する指示の場合には、フレームレートを下げる、フレーム圧縮率を下げる、量子化ステップサイズを小さくするなどの符号化方式を選択してそれによる符号化を実行する制御を動画像符号化部１０２に対して行う。あるいは、動き重視の符号化方式を選択する指示の場合には、フレームレートを上げる、フレーム圧縮率を上げる、量子化ステップサイズを大きくするなどの符号化方式を選択してそれによる符号化を実行する制御を動画像符号化部１０２に対して行う。 On the other hand, when the encoding control unit 103 is instructed to select an encoding method, the encoding control unit 103 decreases the frame rate, decreases the frame compression rate, and the quantization step size when it is an instruction to select an encoding method emphasizing image quality. The moving image encoding unit 102 is controlled to select an encoding method such as reducing the image size and execute encoding based on the selected encoding method. Or, in the case of an instruction to select a motion-oriented coding method, select a coding method such as increasing the frame rate, increasing the frame compression rate, or increasing the quantization step size, and execute the encoding by that. The moving image encoding unit 102 is controlled.

そして、動画像符号化部１０２では、動画像入力部１０１から入力されて来た画像データに対して、符号化制御部１０３によって制御された符号化方法による符号化を行って、その符号化されたデータを出力する。 Then, the moving image encoding unit 102 encodes the image data input from the moving image input unit 101 by the encoding method controlled by the encoding control unit 103, and the encoded data is encoded. Output data.

次に、画像比較部１０４及び画像状態判定部５０２で行われる比較方法及び判定方法の一例について、図６及び図７に基づいて説明する。 Next, an example of the comparison method and determination method performed by the image comparison unit 104 and the image state determination unit 502 will be described with reference to FIGS.

図７は、時系列的に連続する４フレーム分の画像データの一例を表したものである。フレーム番号の若い順に、［０］、［１］、［２］、［３］と番号を付している。なお、この比較方法及び判定方法の主要な流れについては、図６に示したフローチャートのほぼ右半分を用いて説明する。また、その前提として［０］の画像データは既に前画像記憶部１０５に記憶されているものとする。 FIG. 7 shows an example of image data for four frames continuous in time series. Numbers [0], [1], [2], and [3] are assigned in ascending order of frame numbers. The main flow of the comparison method and the determination method will be described using the almost right half of the flowchart shown in FIG. In addition, it is assumed that the image data [0] is already stored in the previous image storage unit 105.

まず、前フレーム画像と現フレーム画像との比較を行う（ステップＳ６０８）。その結果、ズームイン画像であると判定された場合には、変数Ｚをインクリメントする（ステップＳ６１０）。ここでは、例えばＺ＝１にする。また、図７の［０］→［１］のときには、まだ静止状態は検知されていないので、Ｓ＝０とセットされる（ステップＳ６１０）。このズームインか否かの判定方法については、第１の実施形態で説明した方法と同様であるのでここでは省略する。 First, the previous frame image and the current frame image are compared (step S608). As a result, when it is determined that the image is a zoomed-in image, the variable Z is incremented (step S610). Here, for example, Z = 1. Further, in the case of [0] → [1] in FIG. 7, since the stationary state has not been detected yet, S = 0 is set (step S610). The method for determining whether or not this zoom-in is performed is the same as the method described in the first embodiment, and is therefore omitted here.

次の入力画像（図７の［２］）もズームイン画像である場合（図７の［１］→［２］；Ｚ＝２の状態）、その旨が判定されるが、次の入力画像（図７の［２］→［３］）では、ズームイン画像ではないと判定される。 If the next input image ([2] in FIG. 7) is also a zoomed-in image ([1] → [2] in FIG. 7; state Z = 2), this is determined, but the next input image ( In [2] → [3] in FIG. 7, it is determined that the image is not a zoom-in image.

より詳細には、ステップＳ６１１では、変数Ｚ＞０（Ｚ＝２）であることから、ステップＳ６１２に進む。ステップＳ６１２では、前画像と現画像の比較を行うが、この一例では静止画像であるため、静止画像と判定されて（ステップＳ６１２のＹＥＳ）、ステップＳ６１３に進む。 More specifically, in step S611, since variable Z> 0 (Z = 2), the process proceeds to step S612. In step S612, the previous image and the current image are compared. In this example, since the image is a still image, it is determined to be a still image (YES in step S612), and the process proceeds to step S613.

ステップＳ６１３では、静止状態の連続数を示す変数Ｓを１インクリメント（ここではＳ＝１）し、ステップＳ６１４において符号化制御部１０３に画質重視の制御を選択して実行する旨の指令を出力する。画質重視の制御の指令を受けた符号化制御部１０３は、量子化ステップサイズを小さくするなどの符号化方法を選択する旨の制御信号を動画像符号化部１０２に伝達し、動画像符号化部１０２では、その制御信号に基づいて、静止画像に適応した高精細な画像を再現できるような画質重視の符号化を行う。 In step S613, the variable S indicating the number of consecutive stationary states is incremented by 1 (here, S = 1), and in step S614, a command to select and execute image quality-oriented control is output to the encoding control unit 103. . The encoding control unit 103 that has received a control instruction emphasizing image quality transmits a control signal for selecting an encoding method, such as reducing the quantization step size, to the moving image encoding unit 102, so that the moving image encoding is performed. Based on the control signal, unit 102 performs encoding with emphasis on image quality so that a high-definition image adapted to a still image can be reproduced.

以上のように、第２の実施形態の動画像符号化装置によれば、ズームイン状態が連続した後に静止画像状態となった場合にも、画質重視の符号化方式を選択するようにしたので、カメラを単に動かしただけでは画質重視とはならないが、目的の被写体にターゲットを合わせてから静止画状態になったと推定される場合に、そのズームインされて静止状態となっている画像について適切な符号化方式である、画質重視の符号化方式を選択することができる。 As described above, according to the moving image encoding apparatus of the second embodiment, since the still image state is selected after the zoom-in state continues, the image quality-oriented encoding method is selected. Just moving the camera does not give priority to image quality, but if it is presumed that the camera has entered a still image after the target is placed on the target subject, an appropriate code for the image zoomed in and still An encoding method emphasizing image quality can be selected.

具体的には、例えば屋外で怪我をした際に、カメラ付携帯電話機を用いて病院に電話を掛け、そのときの怪我の状況を、動画像を利用して伝えるというような場合に、患部の状態を見せるときには一度大まかに（広角画像的に）患部を写しその後、患部へと徐々にズームインして行き、最大拡大率になった後、暫くはその拡大された静止画像の状態で患部を撮影し続ける、というような撮影の仕方が成されることがある。このような場合、そのズームイン中も、静止中も、常に良好な画質を再現することのできる符号化を適切に選択して実行することが可能となる。 More specifically, for example, when an injury is made outdoors, a phone with a camera is used to call the hospital, and the situation of the injury at that time is conveyed using moving images. When showing the condition, copy the affected area roughly (in a wide-angle image) and then zoom in gradually to the affected area. After reaching the maximum enlargement ratio, the affected area is photographed in the state of the enlarged still image for a while. There are times when the way of shooting is made, such as continuing. In such a case, it is possible to appropriately select and execute an encoding that can always reproduce a good image quality both during zooming in and still.

（Ｃ）第３の実施形態
図８は、第３の実施形態に係る動画像符号化装置の主要部の構成を表したものである。この動画像符号化装置は、音声指令検出部８００と変更指令出力部８０５とを、さらに備えている点が、第２の実施形態と異なっている部分である。なお、第２の実施形態と同様の構成部位には同じ符号を付して、その詳細な説明については省略する。 (C) Third Embodiment FIG. 8 illustrates a configuration of a main part of a video encoding device according to a third embodiment. This moving image encoding apparatus is different from the second embodiment in that it further includes a voice command detection unit 800 and a change command output unit 805. In addition, the same code | symbol is attached | subjected to the component similar to 2nd Embodiment, and the detailed description is abbreviate | omitted.

音声指令検出部８００は、通話中の音声を入力する音声入力部８０１と、音声認識部８０６とを備えている。 The voice command detection unit 800 includes a voice input unit 801 that inputs voice during a call and a voice recognition unit 806.

より具体的には、音声認識部８０６は、音声入力部８０１を介して入力された音声を、テキストデータに変換する音声−テキスト変換部８０２と、その変換されたテキストデータのうちから、所定のキーワードに合致するテキストデータを識別して抽出する音声指令判定部８０３と、所定のキーワードをテキストデータとして記憶しているキーワード記憶部８０４とから、その主要部が構成されている。 More specifically, the speech recognizing unit 806 selects a predetermined value from the speech-text converting unit 802 that converts speech input via the speech input unit 801 into text data, and the converted text data. A voice command determination unit 803 that identifies and extracts text data that matches a keyword and a keyword storage unit 804 that stores a predetermined keyword as text data constitute the main part.

音声入力部８０１は、音声−テキスト変換部８０２に接続されており、携帯電話機の通話機能にて通話中の音声を、音声−テキスト変換部８０２に送信する機能を有している。 The voice input unit 801 is connected to the voice-text conversion unit 802 and has a function of transmitting voice during a call using the call function of the mobile phone to the voice-text conversion unit 802.

音声−テキスト変換部８０２は、音声入力部８０１及び音声指令判定部８０３に接続されており、音声入力部８０１から入力された音声データをテキストデータに変換し、音声指令判定部８０３に送信する機能を有している。 The voice-text conversion unit 802 is connected to the voice input unit 801 and the voice command determination unit 803, converts voice data input from the voice input unit 801 into text data, and transmits the text data to the voice command determination unit 803. have.

音声指令判定部８０３は、音声−テキスト変換部８０２、キーワード記憶部８０４及び変更指令出力部８０５に接続されており、音声−テキスト変換部８０２から得られたテキストデータの中に、キーワード記憶部８０４に予め記憶されているテキストデータと同等のものがあるかを識別し、その結果を変更指令出力部８０５へと送出する機能を有している。 The voice command determination unit 803 is connected to the voice-text conversion unit 802, the keyword storage unit 804, and the change command output unit 805, and the keyword storage unit 804 is included in the text data obtained from the voice-text conversion unit 802. Has a function of identifying whether there is equivalent to text data stored in advance and sending the result to the change command output unit 805.

変更指令出力部８０５は、動画像入力部１０１、音声指令判定部８０３及び画像比較部１０４に接続されており、音声指令判定部８０３の判定結果を受信し、その結果に基づいて、スイッチをＯＮ／ＯＦＦする機能を有する、スイッチである。この変更指令出力部８０５のスイッチがＯＮの状態の場合には、動画像入力部１０１の画像データが画像比較部１０４に送信される。あるいはＯＦＦの場合には、送信されない。 The change command output unit 805 is connected to the moving image input unit 101, the voice command determination unit 803, and the image comparison unit 104, receives the determination result of the voice command determination unit 803, and turns on the switch based on the result. A switch having a function of turning off / off. When the switch of the change command output unit 805 is ON, the image data of the moving image input unit 101 is transmitted to the image comparison unit 104. Or when it is OFF, it is not transmitted.

上記以外の構成要素、接続状態及び機能については、第２の実施形態のそれと同様であるので、ここでの詳述は省略する。 The other components, connection states, and functions are the same as those of the second embodiment, and thus detailed description thereof is omitted here.

次に、この第３の実施形態に係る動画像符号化装置の主要な動作について、図８及び図９に基づいて説明する。なお、主要な動作は概ね第２の実施形態とほぼ同様であるので、ここでは異なった部分のみについて説明する。 Next, main operations of the moving picture coding apparatus according to the third embodiment will be described with reference to FIGS. Since the main operation is substantially the same as that of the second embodiment, only different parts will be described here.

携帯電話機の通話機能を使用して通話が行われるが、その通話中の音声が、音声入力部８０１に入力されると、その音波がキャプチャされてデジタル化され、音声データとなる（ステップＳ９０１）。 A call is performed using the call function of the mobile phone. When the voice during the call is input to the voice input unit 801, the sound wave is captured and digitized to become voice data (step S901). .

続いて、音声−テキスト変換部８０２で、入力された音声データをテキストデータに変換する（ステップＳ９０２）。ここで、音声−テキスト変換とは、音声データを基本的な言語単位（音素）に変換し、音素を単語に構築し、さらに、文脈から単語を判断して発音が似ている単語（例えばｗｒｉｔｅとｒｉｇｈｔなど）の綴りを正しく修正すること等を言う。 Subsequently, the voice-text conversion unit 802 converts the input voice data into text data (step S902). Here, the speech-to-text conversion means that speech data is converted into basic language units (phonemes), phonemes are constructed into words, and words that are pronounced similarly by judging the words from the context (for example, write). And right etc.).

変換されたテキストデータは、音声指令判定部８０３に送信され、キーワード記憶部８０４に予め記憶されたテキストデータとのマッチングが行われる（ステップＳ９０３）。 The converted text data is transmitted to the voice command determination unit 803, and matching with the text data stored in advance in the keyword storage unit 804 is performed (step S903).

ここで予め記憶されていたテキストデータ（キーワード）とマッチすれば、音声指令判定部８０３は変更指令出力部８０５に対してスイッチをＯＮするように指示を出す（ステップＳ９０４）。あるいはマッチしなければ、スイッチをＯＦＦするように指示を出し（ステップＳ９０５）、次の音声入力を待つ（ステップＳ９０１）。 If the text data (keyword) stored in advance is matched, the voice command determination unit 803 instructs the change command output unit 805 to turn on the switch (step S904). Otherwise, an instruction to turn off the switch is issued (step S905), and the next voice input is awaited (step S901).

音声指令判定部８０３から指示を受けた変更指令出力部８０５は、その指示に従って、スイッチをＯＮもしくはＯＦＦにする。スイッチをＯＮにした場合は、動画像入力部１０１から得られた画像データを画像比較部１０４に送信する。 In response to the instruction from the voice command determination unit 803, the change command output unit 805 turns the switch ON or OFF according to the instruction. When the switch is turned on, the image data obtained from the moving image input unit 101 is transmitted to the image comparison unit 104.

以降の処理については、第２の実施形態と同様であるので（ステップＳ９０６〜）、ここでの詳述は省略する。 Since the subsequent processing is the same as that of the second embodiment (step S906 and subsequent steps), detailed description thereof is omitted here.

上記の動作を、具体例に則して説明する。例えば、キーワード記憶部８０４には、「患部」、「写す」、「拡大」などのテキストデータが、キーワードとして予め記憶されていたとする。 The above operation will be described with reference to a specific example. For example, it is assumed that text data such as “affected part”, “copy”, and “enlarge” is stored in advance in the keyword storage unit 804 as keywords.

音声入力部８０１に入力された音声が「これから患部を写します」という言葉であったとすると、その音声をデータ化してなる音声データは、音声−テキスト変換部８０２に送信されて、テキストデータに変換される。そのテキストデータから単語を抽出すると、例えば「これから」、「患部」、「写します」となる。これらのテキストデータとキーワード記憶部８０４に予め記憶されたキーワードとのマッチングをとり、「患部」、「写す」がマッチすることが判明すると、変更指令出力部８０５のスイッチをＯＮにし、以降、第２の実施形態に示したように、画像符号化方法を適応的に変更する処理に移る。 If the voice input to the voice input unit 801 is the word “I will copy the affected part from now on”, the voice data obtained by converting the voice into data is transmitted to the voice-text conversion unit 802 and converted into text data. Is done. When a word is extracted from the text data, for example, “from now on”, “affected part”, and “copy”. These text data and the keyword stored in the keyword storage unit 804 are matched, and when it is found that “affected part” and “copy” match, the switch of the change command output unit 805 is turned on. As shown in the second embodiment, the processing proceeds to adaptively changing the image encoding method.

以上のようにして、第３の実施形態によれば、画像符号化方法を適応的に変更する方法として通話中の音声を利用するようにしたので、ユーザの意思を自動的かつ効果的に取り入れることができ、ユーザが必要なときにのみ画質優先の符号化を行うことが可能になる。 As described above, according to the third embodiment, since the voice during a call is used as a method for adaptively changing the image encoding method, the user's intention is automatically and effectively incorporated. It is possible to perform encoding with priority on image quality only when the user needs it.

（Ｄ）第４の実施形態
図１０は、第４の実施形態に係る動画像符号化装置の主要部の構成を表したものである。この動画像符号化装置は、画像状態検出部３００が、前画像記憶部１０５の代りに画像記憶部１１５を備えている点、及び画像状態判定部１０６の代りに画像類似度判定部１１６を備えている点が、第１の実施形態と異なっている部分である。なお、第１の実施形態と同様の構成部位には同じ符号を付して、その詳細な説明については省略する。 (D) Fourth Embodiment FIG. 10 illustrates a configuration of a main part of a video encoding device according to a fourth embodiment. In this moving image encoding apparatus, the image state detection unit 300 includes an image storage unit 115 instead of the previous image storage unit 105, and an image similarity determination unit 116 instead of the image state determination unit 106. This is a different point from the first embodiment. In addition, the same code | symbol is attached | subjected to the component similar to 1st Embodiment, and the detailed description is abbreviate | omitted.

画像記憶部１１５は、画像比較部１０４に接続されており、予めサンプルの画像データを記憶しておき、それを画像比較部１０４に対して提供する機能を有している。 The image storage unit 115 is connected to the image comparison unit 104, and has a function of storing sample image data in advance and providing it to the image comparison unit 104.

画像類似度判定部１１６は、画像比較部１０４及び符号化制御部１０３に接続されており、画像比較部１０４における、入力された画像データと、予め記憶されていたサンプルの画像データとの比較の結果を受けて、画像の類似度を判定し、その結果を符号化制御部１０３へと送出する機能を有している。 The image similarity determination unit 116 is connected to the image comparison unit 104 and the encoding control unit 103. The image comparison unit 104 compares the input image data with the sample image data stored in advance. It has a function of receiving the result, determining the similarity of the images, and sending the result to the encoding control unit 103.

上記以外の構成要素、接続状態及び機能については第１の実施形態と同様であるので、ここではそれらの詳述は省略する。 Since components, connection states, and functions other than those described above are the same as those in the first embodiment, detailed descriptions thereof are omitted here.

次に、この動画像符号化装置の動作について、図１０を参照して説明する。なお、その動作は概ね第１の実施形態と同様であるので、ここでは第１の実施形態の動画像符号化装置とは異なった部分について特に説明するものとする。 Next, the operation of this moving picture coding apparatus will be described with reference to FIG. Since the operation is substantially the same as that of the first embodiment, here, a different part from the moving picture encoding apparatus of the first embodiment will be particularly described.

まず、動画像入力部１０１より入力された画像データは、画像比較部１０４にて、画像記憶部１１５に記憶されているサンプルの画像データと比較される。このときの比較の方法としては、例えば、第１の実施形態でズームインしているかどうかの判定を行う際のマッチング方法を用いる、ということがある。また、ここで追加すべき機能としては、ここでは拡大しているかどうかではなく類似しているかどうかを判定しなければならないのであるから、マッチングを取る際には、縮小画像も作成することも必要となる。 First, the image data input from the moving image input unit 101 is compared with the sample image data stored in the image storage unit 115 by the image comparison unit 104. As a comparison method at this time, for example, there is a method of using a matching method when determining whether or not the zoom-in is performed in the first embodiment. In addition, as a function to be added here, it is necessary to determine whether they are similar, not enlarged, so it is necessary to create a reduced image when matching is performed. It becomes.

画像比較部１０４で比較した結果は、画像類似度判定部１１６に送信されて、判定される。例えば、画像比較部１０４の比較演算結果（演算値）が、所定の閾値よりも低ければ類似しているものと判定する。この比較結果の情報を、符号化制御部１０３に送信する。 The result of comparison by the image comparison unit 104 is transmitted to the image similarity determination unit 116 for determination. For example, if the comparison calculation result (calculation value) of the image comparison unit 104 is lower than a predetermined threshold, it is determined that they are similar. The comparison result information is transmitted to the encoding control unit 103.

符号化制御部１０３では、類似しているという結果の場合には、画質優先の符号化方式を選択し実行する旨の制御を行い、類似していないという結果の場合には、動き優先の符号化方式を選択し実行する旨の制御を行う。 If the result is similar, the encoding control unit 103 performs control to select and execute the image quality priority encoding method. If the result is not similar, the motion priority code Control to select and execute the system.

そして動画像符号化部１０２では、符号化制御部１０３によって指示された（選択され制御された）符号化方式によって、例えば量子化ステップサイズの変更などのような、そのときの画像の状態に適応した符号化を行って、その符号化された画像データを出力する。 Then, the moving image encoding unit 102 adapts to the state of the image at that time, such as a change of the quantization step size, by the encoding method designated (selected and controlled) by the encoding control unit 103. The encoded image data is output and the encoded image data is output.

上述の動作を、具体例を用いて説明する。例えば、動画像入力部１０１に入力された画像データが、足の怪我を写している画像データであったとする。その画像データは、画像比較部１０４に入力されて、予め画像記憶部１１５に記憶された別の足の怪我の画像データとの比較が行われる。 The above operation will be described using a specific example. For example, it is assumed that the image data input to the moving image input unit 101 is image data showing a foot injury. The image data is input to the image comparison unit 104 and compared with image data of another foot injury stored in advance in the image storage unit 115.

その比較の結果は、画像類似度判定部１１６に伝送される。画像類似度判定部１１６では、比較の結果が所定の類似度以上の類似度で「類似している」、すなわち入力された画像が足の怪我である、と判定した場合には、符号化制御部１０３に画質優先の指示を出力して、怪我の様子がさらに詳細かつ明確な画像として再現できるような符号化方式を選択するように、符号化制御部１０３に対して指令を送る。 The result of the comparison is transmitted to the image similarity determination unit 116. When the image similarity determination unit 116 determines that the comparison result is “similar” with a similarity equal to or higher than a predetermined similarity, that is, the input image is a foot injury, encoding control is performed. The image quality priority instruction is output to the unit 103, and a command is sent to the encoding control unit 103 so as to select an encoding method that can reproduce the state of injury as a more detailed and clear image.

以上のように、第４の実施形態に係る動画像符号化装置によれば、予め画質優先の符号化方式を選択すべき画像を記憶しておき、その画像と、カメラ等からの入力画像との比較を行うようにしたので、画質優先が必要とされる画像を自動的かつ的確に識別することができ、効率的に画質優先の符号化制御を行うことが可能となる。 As described above, according to the moving image encoding apparatus according to the fourth embodiment, an image for which an image quality priority encoding method should be selected is stored in advance, and the image and an input image from a camera or the like are stored. Therefore, it is possible to automatically and accurately identify an image that requires image quality priority, and to efficiently perform image quality priority encoding control.

（Ｅ）他の実施形態
なお、上記各実施形態では、画面の中央部分に注目して、その部分を拡大する場合について説明したが、注目する部分を画面内の特定の領域に指定したり、画像から所定のオブジェクトのみを抽出してその画像にのみ注目するようにすることなども可能であることは勿論である。 (E) Other Embodiments In each of the above-described embodiments, the case where the center portion of the screen is focused and the portion is enlarged has been described. However, the focused portion is designated as a specific area in the screen, Of course, it is possible to extract only a predetermined object from an image so as to focus only on the image.

上記第２の実施形態においては、ズームインが所定の回数以上続き、続いて静止画になった場合に画質優先とする、という判定は、時系列的に連続した複数のフレームを対象として行うようにしたが、何フレーム毎かにすることや、画素数を間引くなどして、判定のために要する演算量を減らすようにしても良い。 In the second embodiment, the determination that the image quality is given priority when the zoom-in continues for a predetermined number of times and subsequently becomes a still image is performed for a plurality of frames that are continuous in time series. However, the amount of calculation required for the determination may be reduced by making it every several frames or by thinning out the number of pixels.

上記第３の実施形態においては、音声−テキスト変換の後、予め用意したテキストとのマッチングを取るようにしたが、例えばユーザ自身の声を音声データとして記憶し、音声データ毎にマッチングを取るようにしても良い。 In the third embodiment, after speech-to-text conversion, matching with text prepared in advance is performed. For example, the user's own voice is stored as speech data, and matching is performed for each speech data. Anyway.

また、通話中の音声を、通常の通話機能とは別に、符号化方式の選択のために（判断材料として）用いるようにしているが、それ以外の音声でも良く、例えば「画質優先」などと言葉で命令を出すことで、その命令に従って、画像符号化方法を変更できるようにしても良いことは勿論である。 In addition, the voice during a call is used for selecting an encoding method (as a judgment material) separately from the normal call function, but other voices may be used, for example, “image quality priority”. Of course, it is also possible to change the image encoding method according to the command by issuing the command in words.

また、電話を掛ける側（当方側）の音声を、符号化方式の選択のために用いるようにしているが、電話を掛けた相手先から当方への音声を用いるようにしても良い。例えば、電話を掛けた先が病院である場合、一般に、「はい、○○病院です」などの音声（言葉）が当方側に伝送されて来る。そこで、相手先からの音声のうちから、「病院」という単語が検出された場合には、画像符号化方法の変更を行うようにする、ということなども可能である。 In addition, although the voice on the side that makes the call (us side) is used for selecting the encoding method, the voice from the other party that made the call to us may be used. For example, when the destination is a hospital, generally a voice (word) such as “Yes, XX hospital” is transmitted to us. Therefore, it is possible to change the image encoding method when the word “hospital” is detected from the speech from the other party.

また、第４の実施形態において、比較する画像を検索する際に、患部を説明している言葉のうちからサンプルとなる画像を推測するようにすれば、その後のサンプル画像と入力画像との比較処理がさらに大幅に軽減されるので、さらに望ましい。 In the fourth embodiment, when searching for images to be compared, if so infer the images to be samples from among the words describing the affected area, with the subsequent sample image and the input image It is further desirable since the comparison process is further greatly reduced.

第１の実施形態に係る動画像符号化装置の主要部の構成を示すブロック図である。It is a block diagram which shows the structure of the principal part of the moving image encoder which concerns on 1st Embodiment. 第１の実施形態に係る動画像符号化装置の動作の流れを表したフローチャートである。It is a flowchart showing the flow of operation | movement of the moving image encoder which concerns on 1st Embodiment. 次第にズームインして行く、時系列的に連続した３フレーム分の画像データの一例を表した図である。It is a figure showing an example of the image data for 3 frames which time-sequentially zoomed in gradually. 第１の実施形態に係る動画像符号化装置における画像比較、判定処理の主要な流れを示すフローチャートである。It is a flowchart which shows the main flows of the image comparison and determination process in the moving image encoder which concerns on 1st Embodiment. 第２の実施形態に係る動画像符号化装置の主要部の構成を表した図である。It is a figure showing the structure of the principal part of the moving image encoder which concerns on 2nd Embodiment. 第２の実施形態に係る動画像符号化装置における画像比較・判定処理の主要な流れを示すフローチャートである。It is a flowchart which shows the main flows of the image comparison and determination process in the moving image encoder which concerns on 2nd Embodiment. 次第にズームインして行き、静止画像状態に至る、時系列的に連続した４フレーム分の画像データの一例を表した図である。It is a figure showing an example of image data for four frames which are time-sequentially continued to zoom in gradually and reach a still image state. 第３の実施形態に係る動画像符号化装置の主要部の構成を表した図である。It is a figure showing the structure of the principal part of the moving image encoder which concerns on 3rd Embodiment. 第３の実施形態に係る動画像符号化装置の主要な動作の流れを示すフローチャートである。It is a flowchart which shows the flow of main operation | movement of the moving image encoder which concerns on 3rd Embodiment. 第４の実施形態に係る動画像符号化装置の主要部の構成を表した図である。It is a figure showing the structure of the principal part of the moving image encoder which concerns on 4th Embodiment.

Explanation of symbols

１００、２００、３００…画像状態検出部、１０１…動画像信号入力部、１０２…動画像符号化部、１０３…符号化制御部、１０４…画像比較部、１０５…前画像記憶部、１０６…画像状態判定部、１１５…画像記憶部、１１６…画像類似度判定部、５０１…画像状態記憶部、５０２…画像状態判定部、８００…音声指令検出部、８０１…音声入力部、８０２…音声−テキスト変換部、８０３…音声指令判定部、８０４…キーワード記憶部、８０５…変更指令出力部、８０６…音声認識部。
DESCRIPTION OF SYMBOLS 100, 200, 300 ... Image state detection part, 101 ... Moving image signal input part, 102 ... Moving image encoding part, 103 ... Encoding control part, 104 ... Image comparison part, 105 ... Previous image storage part, 106 ... Image State determination unit 115 ... Image storage unit 116 116 Image similarity determination unit 501 Image state storage unit 502 Image state determination unit 800 Voice command detection unit 801 Voice input unit 802 Voice-text Conversion unit, 803 ... voice command determination unit, 804 ... keyword storage unit, 805 ... change command output unit, 806 ... voice recognition unit.

Claims

A moving image signal input unit for inputting a moving image signal and any one of a plurality of predetermined encoding methods are selected, and the input moving image signal is encoded by the encoding method. A moving image encoding device provided in an electronic apparatus including a moving image capturing unit that does not have an optical zoom function .
Image state detecting means for detecting the state of an image reproduced by the input moving image signal;
Corresponding to the image state detected by the image state detecting means, an encoding method is selected from the plurality of encoding methods, and the input moving image signal is encoded by the encoding method. as, example Bei a coding control means for controlling the encoding means,
The image state detection means includes: an image comparison unit that compares the input current image with a previous image temporally prior to the current image; a previous image storage unit that stores the previous image; An image state storage unit for storing determination result information; an image state determination unit for determining the image state based on a comparison result by the image comparison unit and a past image state stored in the image state storage unit; With
The image state determination unit determines that the current image is an enlarged image compared to the previous image, and the current image is a still state image after an enlarged state continues. If determined , the moving picture coding apparatus instructing the coding control means to select a coding method for coding the moving picture signal with priority on image quality .

In the case where the frame rate or the quantization step size or the compression rate per frame is the encoding method used for the previous image, the image state determination unit encodes the moving image signal with priority on image quality. The moving picture coding apparatus according to claim 1, wherein an instruction is given to select a coding method that reproduces a higher-definition image that is lower than that of the moving picture.

Voice command detection means for detecting a predetermined keyword contained in the input voice and outputting a command for selecting an encoding method corresponding to the keyword;
A change command output means for outputting a command for changing the selection of the encoding method to the image state detection means in response to a command output from the voice command detection means. The moving image encoding device according to 1 or 2 .

The voice command detection means converts the voice input unit to which the voice is input and the voice input to the voice input unit into character information, and selects character information that matches a predetermined keyword from the character information. The moving image encoding apparatus according to claim 3 , further comprising: a voice recognition unit that extracts and outputs a command for selecting an encoding method corresponding to the character information.

The speech recognition unit includes: a speech-text conversion unit that outputs the speech as text information obtained by converting the speech data; a keyword storage unit that stores the predetermined keyword; and an output from the speech-text conversion unit. Compares the keyword stored in the keyword storage unit, extracts character information that matches the keyword from the character information, and outputs a command for selecting an encoding method corresponding to the character information The moving image encoding apparatus according to claim 4 , further comprising a voice command determination unit that performs the operation.

The voice command detecting means outputs a command for selecting an encoding method corresponding to the character information when character information matching the predetermined keyword is detected, and the command is output. 6. The moving picture encoding apparatus according to claim 4 , wherein the encoding control means changes the selection of the encoding method only.

The image state detection means includes an image storage unit that stores a sample image to be compared in advance, an image reproduced by the input moving image signal, and a sample image stored in the image storage unit. From the comparison result of the image comparison unit to be compared and the image comparison unit, the similarity to both compared images is determined. If the similarity is equal to or higher than the predetermined similarity, the encoding method corresponding to the sample image at that time moving image coding apparatus according to any one of claims 1 to 6, characterized in that an instruction to select and a image similarity determination unit that outputs to the encoding control means.

A voice input unit for inputting voice and the voice input to the voice input unit are converted into character information, and character information that matches a predetermined keyword is extracted from the character information and corresponds to the character information. A speech recognition unit that outputs a command for selecting the encoded encoding method,
When the voice recognition unit extracts character information that matches the predetermined keyword, the image state detection unit selects an image corresponding to the keyword from the images stored in the image storage unit. The moving image coding apparatus according to claim 7 , wherein the moving image coding apparatus is used as the sample image that is searched and compared with the input image by the image comparison unit.