JP5600649B2

JP5600649B2 - Video processing apparatus, method, and program

Info

Publication number: JP5600649B2
Application number: JP2011185308A
Authority: JP
Inventors: 太一河野; 和久山岸
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2011-08-26
Filing date: 2011-08-26
Publication date: 2014-10-01
Anticipated expiration: 2031-08-26
Also published as: JP2013046400A

Description

本発明は、映像処理装置及び方法及びプログラムに係り、特に、入力される映像コンテンツ（以下、「入力映像コンテンツ」と記す）について、一定の映像品質を維持したまま映像コンテンツの解像度を下げることにより、映像コンテンツのデータ量を削減する映像処理装置及び方法及びプログラムに関する。 The present invention relates to a video processing apparatus, method, and program, and in particular, by reducing the resolution of video content while maintaining constant video quality for input video content (hereinafter referred to as “input video content”). The present invention relates to a video processing apparatus, method, and program for reducing the data amount of video content.

地上テレビ放送のデジタル化により、一般家庭にもFull-HDサイズの映像コンテンツを視聴可能な環境が整ってきた。それに伴い、映像配信サービスやパッケージメディアにおいてもFull-HDサイズの映像コンテンツの提供が開始されている。高解像度の映像コンテンツは画質の高さがユーザへの訴求点になっているが、データ量が多くなるため映像データを保存するためのストレージや伝送するためのネットワークの設備コストが大きな負担になる。 Digitalization of terrestrial television broadcasting has created an environment in which general homes can view full-HD video content. Along with this, provision of full-HD video content has started in video distribution services and package media. High-resolution video content is appealing to users because of the high image quality, but the amount of data increases, so storage costs for storing video data and network equipment costs for transmission become a heavy burden. .

ここで解像度と映像品質の関係について説明する。図１は、７種類の映像コンテンツそれぞれにおいて、解像度変換（Full-HD 1920×1080⇒Half-HD 960×1080）が与える品質への影響の差異を表している。図1において、映像コンテンツ５〜７は解像度の低下に伴い映像品質の低下の度合いが大きくなる。すなわち、解像度の低下によって映像品質が低下しているのに対し、映像コンテンツ１〜４は解像度の差がほとんど映像品質に影響を及ぼしていない。図１が示すように、映像コンテンツによっては、映像コンテンツの解像度の低下が映像品質の低下に影響を及ぼさない閾値があるため、映像コンテンツ毎に、映像品質の低下に影響を及ぼさない必要最低限の解像度に設定することにより、映像コンテンツのデータ量を削減することができる。 Here, the relationship between resolution and video quality will be described. FIG. 1 shows the difference in the influence on the quality given by the resolution conversion (Full-HD 1920 × 1080 → Half-HD 960 × 1080) in each of the seven types of video content. In FIG. 1, the video contents 5 to 7 increase in the degree of video quality deterioration as the resolution decreases. That is, while the video quality is reduced due to the decrease in resolution, the difference in resolution hardly affects the video quality in the video contents 1 to 4. As shown in FIG. 1, depending on the video content, there is a threshold at which the reduction in the resolution of the video content does not affect the reduction in the video quality. Therefore, the minimum necessary amount that does not affect the reduction in the video quality for each video content. By setting the resolution to, the data amount of the video content can be reduced.

従来、非特許文献１では映像の製作者が事前にシーン毎に優先度や動きの激しさをメタ情報として付与し、その情報に基づいて解像度を変化させる技術が記載されている。 Conventionally, Non-Patent Document 1 describes a technique in which a video producer gives priorities and intensity of motion as meta information in advance for each scene, and changes the resolution based on the information.

村田嘉利、増田彰久、太田賢、石原進、水野忠則、"モバイルマルチメディアストリーミングサービスのためのコンテンツ指向時空間的解像度制御方式"，情報処理学会論文誌，Vol.43，No．12，pp．3838−3847、2002（12）.Yoshitoshi Murata, Akihisa Masuda, Ken Ota, Susumu Ishihara, Tadanori Mizuno, "Content Oriented Spatio-Temporal Resolution Control Method for Mobile Multimedia Streaming Service", Transactions of Information Processing Society of Japan, Vol.43, No. 12, pp. 3838-3847, 2002 (12).

しかしながら、映像製作者が事前にシーン毎に優先度や動きの激しさをメタ情報として付与し、その情報に基づいて映像品質を考慮して変動させる技術(非特許文献１)は、シーン毎に製作者が対応する必要があり、人的・時間的なコストが増大するという問題がある。 However, a technique (Non-Patent Document 1) in which a video producer assigns priorities and intensity of motion as meta information in advance for each scene and changes the video quality in consideration of the information (non-patent document 1) There is a problem that it is necessary for the producer to cope with it, and human and time costs increase.

本発明は、上記に鑑みてなされたもので、映像コンテンツのデータから得られるコンテンツ特徴量を用いることにより、人的・時間的なコストをかけずに、人間が知覚する映像品質(映像品質の低下の度合い)について、指定された条件を満たす、すなわち指定された閾値内に映像品質を保つ、映像コンテンツの解像度に解像度変換させてデータ量を削減することであり、指定された映像品質の条件に応じた映像コンテンツの解像度に変換する、映像処理装置、方法およびプログラムを提供することを目的とする。 The present invention has been made in view of the above, and by using content feature amounts obtained from video content data, video quality (video quality perceived by humans) can be obtained without incurring human and time costs. The degree of degradation) is to satisfy the specified condition, that is, to maintain the video quality within the specified threshold, to reduce the amount of data by converting the resolution to the resolution of the video content, and to the specified video quality condition It is an object of the present invention to provide a video processing apparatus, method, and program for converting to a video content resolution according to the above.

上記の課題を解決するため、本発明は、入力された映像データ（以下、「入力映像データ」と記す）について、指定された映像品質の条件を満たす解像度に、該入力映像データの入力解像度を変換する映像処理装置であって、
前記入力映像データについて、指定された映像品質の条件を満たす解像度の中から最小の解像度である最適解像度を算出する最適映像パラメータ導出手段と、
前記最適映像パラメータ導出手段で算出した前記最適解像度に基づいて、前記入力映像データの解像度を変換して最適化映像データを出力する映像最適化処理手段と、
を備え、
前記最適映像パラメータ導出手段は、
前記入力映像データに基づいて、エッジ量を算出するエッジ量算出手段と、
解像度変換の前と後の解像度の組み合わせ毎に、前記エッジ量と解像度の解像度変換時の映像品質の関係を表したモデル式を設定し、前記入力解像度を解像度変換前の解像度として、当該解像度変換の前と後の解像度の組み合わせ毎に設定された該モデル式に基づいて、前記エッジ量に対する映像品質の低下の度合いを算出し、当該指定された映像品質の条件を満たす解像度の組み合わせの内、解像度変換後の解像度が最小になる解像度を前記最適解像度として算出する最適解像度算出手段と、
を含む。 In order to solve the above problems, the present invention reduces the input resolution of the input video data to the resolution that satisfies the specified video quality condition for the input video data (hereinafter referred to as “input video data”). A video processing device for conversion,
Optimum video parameter derivation means for calculating the optimum resolution that is the minimum resolution among the resolutions that satisfy the specified video quality conditions for the input video data;
Video optimization processing means for converting the resolution of the input video data and outputting optimized video data based on the optimal resolution calculated by the optimal video parameter derivation means;
With
The optimum video parameter derivation means includes:
Based on the input image data, an edge amount calculation means for calculating an edge amount,
For each combination of resolutions before and after resolution conversion, a model formula representing the relationship between the edge amount and the video quality at the time of resolution conversion is set, and the input resolution is set as the resolution before resolution conversion, and the resolution conversion is performed. Based on the model formula set for each combination of resolution before and after, the degree of degradation of video quality with respect to the edge amount is calculated, and among the combinations of resolutions that satisfy the specified video quality condition, An optimal resolution calculating means for calculating a resolution at which the resolution after resolution conversion is minimized as the optimal resolution;
including.

本発明によれば、入力映像コンテンツの映像品質を一定に保ったまま映像コンテンツの解像度を下げることにより、映像コンテンツのデータ量を削減することができる。そのため、映像品質を指定された閾値を越えて低下させて、映像品質に影響を及ぼすことなく、ストレージの容量削減や通信のトラヒック削減を可能とし、コスト削減に繋がる。 According to the present invention, it is possible to reduce the data amount of video content by lowering the resolution of the video content while keeping the video quality of the input video content constant. Therefore, it is possible to reduce storage capacity and communication traffic without affecting the video quality by reducing the video quality beyond a specified threshold value, leading to cost reduction.

映像コンテンツ毎の解像度変換時の映像品質低下の度合いである。This is the degree of video quality degradation at the time of resolution conversion for each video content. エッジ映像データである。Edge video data. 映像コンテンツのエッジ量と解像度変換時の映像品質低下の度合いの関係である。This is the relationship between the edge amount of video content and the degree of video quality degradation during resolution conversion. 本発明の一実施の形態における映像処理装置の構成図である。It is a block diagram of the video processing apparatus in one embodiment of this invention. 本発明の一実施の形態におけるエッジ量と解像度変換時の映像品質低下の度合いの関係を表したモデル式である。6 is a model formula showing the relationship between the edge amount and the degree of video quality degradation during resolution conversion in an embodiment of the present invention. 本発明の一実施の形態における最適解像度算出部の構成図である。It is a block diagram of the optimal resolution calculation part in one embodiment of this invention. 本発明の一実施の形態における映像処理装置の動作のフローチャートである。It is a flowchart of operation | movement of the video processing apparatus in one embodiment of this invention.

以下図面と共に、本発明の実施の形態を説明する。 Embodiments of the present invention will be described below with reference to the drawings.

本発明は、指定された映像品質の条件に応じた映像コンテンツの解像度に変換するための装置、方法を提供するものであるが、具体的には、映像コンテンツの解像度(変換前)を低下させる解像度変換による映像品質の低下の度合いの値が、予め指定された閾値内になるような変換後の解像度の内、最小の解像度(最適解像度)となる解像度を算出し、当該解像度で実際に映像コンテンツの解像度を変換する処理を行う。 The present invention provides an apparatus and a method for converting video content resolution according to specified video quality conditions. Specifically, the present invention reduces the video content resolution (before conversion). Calculate the resolution that will be the minimum resolution (optimal resolution) of the resolutions after conversion so that the degree of degradation in video quality due to resolution conversion will be within the pre-specified threshold, and the actual video at that resolution. Performs processing to convert the resolution of the content.

本願発明は、解像度の低下が「人間が知覚する映像品質の低下の度合い」に与える影響に関する実験結果に基づき定義した、「解像度変換前後の映像品質の低下の度合いのモデル式」を基に、最適解像度を算出する点が特徴となる。 The present invention is based on "a model formula for the degree of degradation in video quality before and after resolution conversion", which is defined based on the experimental results on the effect of degradation in resolution on the "degree of degradation in video quality perceived by humans" A feature is that the optimum resolution is calculated.

本発明の解決手段を説明する。本発明では、入力される映像コンテンツの画素情報から得られるコンテンツの特徴量であるエッジ量を用いることにより、指定された映像品質の条件を満たすことが可能な最小の映像コンテンツの解像度（最適解像度）を算出し、入力された映像コンテンツの解像度を当該最適解像度に解像度変換することで、指定された一定の映像品質を維持した状態で、映像コンテンツのデータ量の変換を可能とする。 The solving means of the present invention will be described. In the present invention, by using the edge amount that is the feature amount of the content obtained from the pixel information of the input video content, the minimum resolution (optimum resolution) of the video content that can satisfy the specified video quality condition. ) And converting the resolution of the input video content to the optimum resolution, the data amount of the video content can be converted in a state where the specified constant video quality is maintained.

指定された映像品質の条件を満たすとは、「映像品質の低下の度合い」を指定された閾値内に保つことである。 Satisfying the designated video quality condition means keeping the “degree of degradation in video quality” within a designated threshold.

映像コンテンツにおける、解像度の変換による映像品質の低下の原因として、解像度の変換によって映像コンテンツのエッジ（輪郭線）が消失するもしくは歪んでしまうことが挙げられる。このことから、エッジを多く含む映像コンテンツほど、解像度の変換時に映像品質の低下の度合いが大きくなると考えられるため、エッジの量から最適解像度を算出する技術を提供する。 As a cause of deterioration in video quality due to resolution conversion in video content, the edge (contour line) of the video content is lost or distorted due to resolution conversion. For this reason, it is considered that a video content including more edges has a higher degree of deterioration in video quality at the time of resolution conversion. Therefore, a technique for calculating an optimum resolution from the amount of edges is provided.

エッジの量は例えばソーベルフィルタ、ラプラシアンフィルタ、プレヴィットフィルタ、キャニーエッジフィルタなどの既知のエッジ抽出フィルタを適用し得られるエッジ映像データ（図２）に対して、フレーム毎の画素値の平均値または全てのフレームにおける、フレーム毎の画素値の標準偏差の平均値もしくは最大値として計算される。 The amount of edge is, for example, an average value of pixel values for each frame with respect to edge video data (FIG. 2) obtained by applying a known edge extraction filter such as a Sobel filter, Laplacian filter, Previt filter, and Canny edge filter. Alternatively, it is calculated as an average value or a maximum value of standard deviations of pixel values for each frame in all frames.

図３は映像コンテンツのエッジ量と解像度の変換時の映像品質低下の度合いの関係を示している。図３が示すようにエッジ量が大きくなると、解像度の変換時の映像品質の低下の度合いが大きくなる関係がある。この関係を、解像度の変換の前後の解像度の組み合わせ（例えば、1920×1080から1440×1080、1920×1080から960×1080、1920×1080から1280×720など）毎にモデル化する。このモデルを用いることで、入力映像コンテンツの解像度と解像度の変換時の解像度との組み合わせ毎に、エッジ量と解像度変換時の映像品質の低下の度合いを算出し、映像品質の低下の度合いが指定された閾値内を保ち、一定の映像品質を維持する最小の解像度を最適解像度として算出できることになる。 FIG. 3 shows the relationship between the edge amount of video content and the degree of video quality degradation at the time of resolution conversion. As shown in FIG. 3, as the edge amount increases, there is a relationship in which the degree of deterioration in video quality during resolution conversion increases. This relationship is modeled for each combination of resolutions before and after resolution conversion (for example, 1920 × 1080 to 1440 × 1080, 1920 × 1080 to 960 × 1080, 1920 × 1080 to 1280 × 720, etc.). Using this model, for each combination of the resolution of the input video content and the resolution at the time of resolution conversion, the edge amount and the degree of video quality degradation at the time of resolution conversion are calculated, and the degree of video quality degradation is specified It is possible to calculate the minimum resolution that keeps within the set threshold and maintains a constant video quality as the optimum resolution.

図４は、本発明の一実施の形態における映像処理装置の構成を示す。 FIG. 4 shows the configuration of the video processing apparatus in one embodiment of the present invention.

同図に示す映像処理装置は、最適映像パラメータ導出部１、映像最適化処理部２から構成される。映像コンテンツのデータ(以下、「映像データ」と記す)を入力とし、映像コンテンツの解像度を最適解像度に解像度変換した映像データ（以下、「最適化映像データ」と記す）を出力とする。本映像処理装置に入力された映像データは、最適化映像パラメータ導出部１と映像最適化処理部２への入力とされる。 The video processing apparatus shown in FIG. 1 includes an optimal video parameter derivation unit 1 and a video optimization processing unit 2. Video content data (hereinafter referred to as “video data”) is input, and video data obtained by converting the resolution of the video content to the optimal resolution (hereinafter referred to as “optimized video data”) is output. The video data input to the video processing apparatus is input to the optimized video parameter deriving unit 1 and the video optimization processing unit 2.

また、「映像品質」とは映像品質の低下の度合いとし、「指定された映像品質の条件」とは、指定された閾値以下に、解像度変換による映像品質の低下の度合いを保つことである。 The “video quality” is the degree of degradation of video quality, and the “designated video quality condition” is to keep the degree of degradation of video quality due to resolution conversion below a specified threshold.

最適化映像パラメータ導出部１は、エッジ量算出部１１と最適解像度算出部１２で構成される。最適化映像パラメータ導出部１は映像データを入力とし、エッジ量算出部１１において、入力された映像データをもとにエッジ量、入力解像度を算出して、最適解像度算出部１２への入力として出力され、最適解像度算出部１２において、エッジ量算出部１１で算出されたエッジ量、入力解像度をもとに、最適解像度を算出し、入力解像度と最適化解像度を出力して、映像最適化処理部２への入力として出力される。 The optimized video parameter derivation unit 1 includes an edge amount calculation unit 11 and an optimal resolution calculation unit 12. The optimized video parameter derivation unit 1 receives the video data, and the edge amount calculation unit 11 calculates the edge amount and the input resolution based on the input video data, and outputs it as an input to the optimal resolution calculation unit 12. The optimum resolution calculation unit 12 calculates the optimum resolution based on the edge amount and the input resolution calculated by the edge amount calculation unit 11, outputs the input resolution and the optimization resolution, and outputs the image optimization processing unit. 2 is output as an input to 2.

エッジ量算出部１１では、入力された映像データに含まれる輝度レイヤの各フレームに対して、例えばソーベルフィルタ、ラプラシアンフィルタ、プレヴィットフィルタ、キャニーエッジフィルタ（文献１「"A Computational Approach to Edge Detection," by J.F. Canny, IEEE Transactions on Pattern Analysis and Machine Intelligence, 8, pp.769-798, 1986」）などの既知のエッジ抽出フィルタを適用して、エッジ映像データを生成する。本実施の形態では次式のようにソーベルフィルタを用いてエッジ映像データE（w,h,f）を算出する。 In the edge amount calculation unit 11, for example, a Sobel filter, a Laplacian filter, a Previt filter, a Canny edge filter (reference document 1 ““ A Computational Approach to Edge Detection ”is applied to each frame of the luminance layer included in the input video data. , "by JF Canny, IEEE Transactions on Pattern Analysis and Machine Intelligence, 8, pp.769-798, 1986"), and so on, to generate edge video data. In this embodiment, edge video data E (w, h, f) is calculated using a Sobel filter as in the following equation.

ここでy(w,h,f)はｆ番目のフレームの画素位置（w,h）の輝度値を示す。

Here, y (w, h, f) represents the luminance value at the pixel position (w, h) of the f-th frame.

生成したエッジ映像データの各フレームの画素値の標準偏差（もしくは平均値）を基に、全フレームの平均値（もしくは最大値）をエッジ量として算出する。本実施の形態では次式のように標準偏差の平均値を用いてエッジ量ｅを算出する。 Based on the standard deviation (or average value) of the pixel values of each frame of the generated edge video data, the average value (or maximum value) of all frames is calculated as the edge amount. In this embodiment, the edge amount e is calculated using an average value of standard deviations as in the following equation.

ここで、上記式中の関数std_space[ ]は、フレーム毎にフレーム内の画素値の標準偏差を出力する関数、Nはフレームの総数を示す。

Here, the function std _space [] in the above equation is a function for outputting the standard deviation of the pixel values in the frame for each frame, and N indicates the total number of frames.

エッジ量算出部１１では、算出したエッジ量ｅを最適解像度算出部１２に入力すると共に、入力された映像データに含まれるメタデータをもとに、入力された映像データの解像度（以下、「入力解像度」と記す）を参照し、入力解像度を最適解像度算出部１２に入力する。なお、メタデータはAVIやmp4などのファイルフォーマット毎にデータ構造が定められている。また、メタデータを含まない映像データの場合は、入力を受け付けない、もしくはデフォルトの入力解像度を用いる、もしくは手動で設定する必要がある。 The edge amount calculation unit 11 inputs the calculated edge amount e to the optimum resolution calculation unit 12, and based on the metadata included in the input video data, the resolution of the input video data (hereinafter referred to as “input”). The input resolution is input to the optimum resolution calculation unit 12 with reference to “resolution”. The metadata has a data structure defined for each file format such as AVI or mp4. In the case of video data that does not include metadata, it is necessary to accept no input, to use a default input resolution, or to set manually.

最適解像度算出部１２は、解像度の変換の前後の解像度の組み合わせ（例えば、1920×1080と1440×1080，1920×080と960×1080，1920×1080と720×1080，1920×1080と720×480など）毎に、図５に示すようなエッジ量と解像度の変換時の映像品質の低下の度合いとの関係を表したモデル式及びモデル式の係数（モデル係数）を備える。本実施の形態では、モデル式は統一のものを用いるが、解像度の組み合わせ毎にモデル式を変えてもよい。モデル係数は事前に実験によって得られた品質の実測値、もしくは文献２「ITU-T勧告J.247」のような映像品質客観評価モデルを用いて映像データから得られる品質の推定値を用いて、最小二乗法などにより解像度の組み合わせ毎に求められる。モデル式及びモデル係数は図６に示すように記憶部１２２に蓄積されており、演算処理部１２１から参照される。なお初期設定後にモデル式及びモデル係数を変更してもよい。 The optimum resolution calculation unit 12 is a combination of resolutions before and after resolution conversion (for example, 1920 × 1080 and 1440 × 1080, 1920 × 080 and 960 × 1080, 1920 × 1080 and 720 × 1080, 1920 × 1080 and 720 × 480). Etc.), a model formula and a coefficient of the model formula (model coefficient) representing the relationship between the edge amount and the degree of deterioration of the video quality at the time of resolution conversion as shown in FIG. 5 are provided. In the present embodiment, a unified model formula is used, but the model formula may be changed for each combination of resolutions. Model coefficients are measured quality values obtained by experiments in advance, or estimated quality values obtained from video data using a video quality objective evaluation model such as ITU-T Recommendation J.247. It is obtained for each combination of resolutions by the least square method. The model formulas and model coefficients are stored in the storage unit 122 as shown in FIG. The model formula and model coefficient may be changed after the initial setting.

最適解像度算出部１２における、最適解像度の算出方法を説明する。 A method for calculating the optimum resolution in the optimum resolution calculator 12 will be described.

入力されたエッジ量と入力解像度をもとに、入力解像度と解像度の変換後の解像度との組み合わせに対応する予め策定されたモデル式毎に、入力されたエッジ量に対応する映像品質の低下の度合いを算出し、解像度の変換時の映像品質の低下の度合いが予め指定された特定の閾値以下になる入力解像度と解像度変換後の解像度の組み合わせの内、変換後の解像度が最小になる解像度を最適解像度として算出し、当該最適解像度及び入力された入力解像度を映像最適化処理部２への入力として出力する。閾値は図６に示すように記憶部１２２に蓄積され、演算処理部から参照される。閾値を低く設定すれば品質の低下が小さくなるがデータ量の削減効果も小さくなり、閾値を高く設定すれば品質の低下が大きくなるがデータ量の削減効果も大きくなる。閾値の設定は本発明を組み込んだ装置等を用いるサービス事業者などの運用ポリシーに基づいて決定される。なお、初期設定後に閾値を変更してもよい。本実施の形態では、以下のモデル式を用いる。 Based on the input edge amount and input resolution, the video quality corresponding to the input edge amount is reduced for each pre-defined model formula corresponding to the combination of the input resolution and resolution after conversion. Calculate the resolution, and select the resolution that minimizes the resolution after conversion from among the combination of the input resolution and the resolution after resolution conversion where the degree of video quality degradation at the time of resolution conversion is below a specified threshold. The optimum resolution is calculated and the input resolution and the input resolution that have been input are output as an input to the video optimization processing unit 2. The threshold is accumulated in the storage unit 122 as shown in FIG. 6 and is referred to by the arithmetic processing unit. If the threshold value is set low, the deterioration in quality is reduced, but the effect of reducing the data amount is also reduced, and if the threshold value is set high, the reduction in quality is increased but the effect of reducing the data amount is increased. The setting of the threshold value is determined based on an operation policy of a service provider using an apparatus incorporating the present invention. Note that the threshold value may be changed after the initial setting. In the present embodiment, the following model formula is used.

ここでdVQは解像度の変換時の映像品質の低下の度合い、ｅはエッジ量を表す。また、ｖ₁，ｖ₂，ｖ₃は係数であり解像度変換前後の解像度の組み合わせ毎に、事前に主観品質評価実験等で得られる映像品質と、入力された映像データから得られるエッジ量を用いて、最小二乗法などによりモデル式を最適化する必要がある。本モデル式は、エッジ量が低下するとdVQが0に収束し、エッジ量が増加するとdVQがｖ₁に収束する特性を持つ。

Here, dVQ represents the degree of degradation of video quality at the time of resolution conversion, and e represents the edge amount. Further, v ₁ , v ₂ , and v ₃ are coefficients, and for each combination of resolutions before and after resolution conversion, the video quality obtained by a subjective quality evaluation experiment or the like and the edge amount obtained from input video data are used. Therefore, it is necessary to optimize the model formula by the least square method. This model formula has a characteristic that when the edge amount decreases, dVQ converges to 0, and when the edge amount increases, dVQ converges to v ₁ .

映像最適化処理部２は、最適映像パラメータ導出部１で出力された最適解像度と入力解像度、本映像処理装置に入力された映像データを入力とし、入力された最適解像度よりも入力解像度が大きい場合は、例えば、ニアレストネイバー法、バイリニア法、バイキュービック法、平均画素法などの既知の画像縮小アルゴリズムを用いて入力映像データの解像度を最適解像度に解像度変換し、最適化映像データを生成する。また、入力された最適解像度が入力解像度よりも小さい場合は、画像縮小アルゴリズムを用いて入力された映像データを最適化映像データとして出力する。 The video optimization processing unit 2 receives the optimal resolution and input resolution output from the optimal video parameter deriving unit 1 and the video data input to the video processing apparatus, and the input resolution is larger than the input optimal resolution. Uses, for example, a known image reduction algorithm such as a nearest neighbor method, a bilinear method, a bicubic method, or an average pixel method to convert the resolution of the input video data to the optimal resolution, thereby generating optimized video data. If the input optimum resolution is smaller than the input resolution, the input video data is output as optimized video data using an image reduction algorithm.

次に図７のフローチャートを用いて本実施形態の処理フローを説明する。 Next, the processing flow of this embodiment will be described using the flowchart of FIG.

まず、エッジ量算出部１１は、入力された映像データからエッジ量と入力解像度を算出し最適解像度算出部１２に算出したエッジ量と入力解像度を入力する（ステップ１）。 First, the edge amount calculation unit 11 calculates an edge amount and input resolution from the input video data, and inputs the calculated edge amount and input resolution to the optimum resolution calculation unit 12 (step 1).

次に、最適解像度算出部１２は、入力されたエッジ量と入力解像度から最適解像度を算出し、映像最適化処理部２に算出した最適解像度と入力された入力解像度を入力する(ステップ２)。 Next, the optimal resolution calculation unit 12 calculates the optimal resolution from the input edge amount and the input resolution, and inputs the calculated optimal resolution and the input resolution to the video optimization processing unit 2 (step 2).

そして、映像最適化処理部２は入力された映像データと最適解像度と入力解像度から最適化映像データを生成する（ステップ３）。 Then, the video optimization processing unit 2 generates optimized video data from the input video data, the optimal resolution, and the input resolution (step 3).

上記のように、指定された映像品質の条件(映像品質の低下の度合い)に応じて映像コンテンツの解像度を変換することにより、指定された閾値内に映像品質を保つことが可能となる。これにより、人的・時間的なコストをかけずに映像品質を考慮して解像度を変動させる処理を実現することができる。 As described above, the video quality can be maintained within the specified threshold value by converting the resolution of the video content in accordance with the specified video quality condition (the degree of deterioration of the video quality). As a result, it is possible to realize a process of changing the resolution in consideration of video quality without incurring human / time costs.

なお、上記の図４に示す映像処理装置の構成要素の各動作をプログラムとして構築し、映像処理装置として利用されるコンピュータにインストールして実行させる、または、ネットワークを介して流通させることが可能である。 It should be noted that each operation of the components of the video processing apparatus shown in FIG. 4 can be constructed as a program, installed in a computer used as the video processing apparatus and executed, or distributed via a network. is there.

本発明は、上記の実施の形態に限定されることなく、特許請求の範囲内において、種々変更・応用が可能である。 The present invention is not limited to the above-described embodiments, and various modifications and applications are possible within the scope of the claims.

１最適映像パラメータ導出部
２映像最適化処理部
１１エッジ量算出部
１２最適解像度算出部
１２１演算処理部
１２２記憶部 DESCRIPTION OF SYMBOLS 1 Optimal video parameter derivation | leading-out part 2 Video | video optimization processing part 11 Edge amount calculation part 12 Optimal resolution calculation part 121 Operation processing part 122 Storage part

Claims

A video processing device that converts input resolution of input video data to a resolution that satisfies a specified video quality condition for input video data (hereinafter referred to as “input video data”),
Optimum video parameter derivation means for calculating the optimum resolution that is the minimum resolution among the resolutions that satisfy the specified video quality conditions for the input video data;
Video optimization processing means for converting the resolution of the input video data and outputting optimized video data based on the optimal resolution calculated by the optimal video parameter derivation means;
With
The optimum video parameter derivation means includes:
Based on the input image data, an edge amount calculation means for calculating an edge amount,
For each combination of resolutions before and after resolution conversion, a model formula representing the relationship between the edge amount and the video quality at the time of resolution conversion is set, and the input resolution is set as the resolution before resolution conversion, and the resolution conversion is performed. Based on the model formula set for each combination of resolution before and after, the degree of degradation of video quality with respect to the edge amount is calculated, and among the combinations of resolutions that satisfy the specified video quality condition, An optimal resolution calculating means for calculating a resolution at which the resolution after resolution conversion is minimized as the optimal resolution;
A video processing apparatus comprising:

The edge amount calculating means includes
Means for applying an edge extraction filter to each frame of the luminance layer of the input video data to convert it to edge video data;
Means for calculating a feature value of a pixel value for each frame of the edge video data, and calculating as an edge amount of all frames based on the calculated feature value of the pixel value for each frame;
The video processing apparatus according to claim 1, further comprising:

A video processing method for converting input resolution of input video data to resolution satisfying a specified video quality condition for input video data (hereinafter referred to as “input video data”),
Optimum video parameter derivation means, for the input video data, an optimal video parameter derivation step for calculating an optimal resolution that is a minimum resolution among resolutions that satisfy a specified video quality condition;
A video optimization processing step, wherein the video optimization processing means converts the resolution of the input video data based on the optimal resolution calculated in the optimal video parameter derivation step, and outputs optimized video data;
With
In the optimum video parameter derivation step,
Based on the input image data, an edge amount calculating step of calculating an edge amount,
For each combination of resolutions before and after resolution conversion, a model formula representing the relationship between the edge amount and the video quality at the time of resolution conversion is set, and the input resolution is set as the resolution before resolution conversion, and the resolution conversion is performed. Based on the model formula set for each combination of resolution before and after, the degree of degradation of video quality with respect to the edge amount is calculated, and among the combinations of resolutions that satisfy the specified video quality condition, An image processing method comprising: an optimum resolution calculating step of calculating, as the optimum resolution, a resolution at which the resolution after resolution conversion is minimized.

In the edge amount calculating step,
Applying an edge extraction filter to each frame of the luminance layer of the input video data to convert it to edge video data;
Calculating a feature value of a pixel value for each frame of the edge video data, and calculating as an edge amount of all frames based on the calculated feature value of the pixel value for each frame;
The video processing method according to claim 3, further comprising:

Computer
A video processing program for functioning as each unit of the video processing device according to claim 1.