JP2005535200A

JP2005535200A - Personal TV optimization

Info

Publication number: JP2005535200A
Application number: JP2004525618A
Authority: JP
Inventors: カールウィッティグ
Original assignee: Koninklijke Philips Electronics NV
Current assignee: Koninklijke Philips NV
Priority date: 2002-07-31
Filing date: 2003-07-07
Publication date: 2005-11-17
Also published as: KR20050026039A; WO2004014076A3; EP1527604A2; CN1672413A; US20040025183A1; AU2003281822A8; AU2003281822A1; WO2004014076A2

Abstract

パーソナルテレビシステムをモデル化するための方法、システムおよびコンピュータ読取可能な媒体が提供される。パーソナルテレビシステムにより受信映像信号に実行される処理工程を含む、映像処理チェーンのコンピュータ実装モデルが形成され、パーソナルテレビシステム内における記憶前の映像信号の符号化に相当する第１の機能部分と、パーソナルテレビシステム内における記憶された符号化済映像データの復号に相当する第２の機能部分とを含む、１つまたは複数の追加機能が、そのモデルに挿入される。Methods, systems, and computer readable media are provided for modeling personal television systems. A first functional portion corresponding to encoding of a video signal before storage in a personal television system, wherein a computer-implemented model of a video processing chain is formed, including processing steps performed on a received video signal by a personal television system; One or more additional functions are inserted into the model, including a second functional part corresponding to the decoding of the stored encoded video data in the personal television system.

Description

本発明は、テレビのためのモデル化および最適化技術に関するものである。 The present invention relates to modeling and optimization techniques for televisions.

無線送信されているかケーブルテレビ（ＣＡＴＶ）システムを介して配信されているかにかかわらず、アナログテレビ放送信号を受信することができ、後に視聴者が再生できるようにそれらを記録するため、デジタル圧縮および記憶技術を使用することができる、数多くの「パーソナルテレビ」（ＰＴＶ）システムが、市場に投入されている。３つのそのようなシステムとして、フィリップスおよびソニーが販売している「ＴｉＶｏ^ＴＭ」、松下（パナソニック）が販売している「Ｒｅｐｌａｙ−ＴＶ^ＴＭ」、およびマイクロソフトが販売している「Ｕｌｔｉｍａｔｅ−ＴＶ^ＴＭ」が挙げられる。これらのシステムは、視聴者が従来はビデオ・カセット・レコーダー（ＶＣＲ）を用いてきた機能である同一の主機能、すなわち放送番組のタイムシフトを実行するが、視聴者が第２の番組を録画しながら第１の番組を見ることを可能にするというさらなる利点を提供し、また、直線的なビデオテープ内における録画された番組の位置や、新しい番組を録画できる使用可能なテープの位置および長さを覚えておくことから、ユーザーを解放するものである。そのため、これらのシステムは、ますます人気が出てきており、アナログ形式の放送映像の送信および配信が利用され続ける限り、この傾向はおそらく続くであろう。 Regardless of whether it is being transmitted wirelessly or distributed via a cable television (CATV) system, analog television broadcast signals can be received and recorded for later playback by the viewer using digital compression and A number of “personal television” (PTV) systems that can use storage technology are on the market. Three such systems include “TiVo ^™ ” sold by Philips and Sony, “Replay-TV ^™ ” sold by Matsushita (Panasonic), and “Ultimate-TV ^™ ” sold by Microsoft. Is mentioned. These systems perform the same main function that viewers have traditionally used video cassette recorders (VCRs), ie time-shifting broadcast programs, but viewers record a second program. While providing the additional advantage of allowing the first program to be viewed, and the position of the recorded program within a linear videotape, and the position and length of the tape available to record a new program It's freed up by remembering that. As such, these systems are becoming increasingly popular and this trend will likely continue as long as analog form broadcast video transmission and distribution continues to be utilized.

ＰＴＶシステムは、典型的には、アナログ高周波（ＲＦ）放送信号を受信し（衛星テレビ・デコーダのアナログ出力を用いてもよいが）、それらの信号を、復調およびアナログ映像復号処理（ＮＴＳＣ、ＰＡＬ等）の後、デジタルフォーマットに変換する。これらの信号は、その後、ＭＰＥＧ１や２等の、損失を伴うデジタル映像圧縮方式を用いて圧縮され、高密度ハードディスクドライブ（ＨＤＤ）等の大量記憶装置に記録される。近年の記憶装置の高いデータ容量（時と共に非常に急速に増え続けている）と共に圧縮技術を使用することにより、何時間もの放送映像をＨＤＤに記録することが可能となり、それにより、アナログビデオ録画用テープの必要がなくなり、上記のデジタル記憶の利点が提供されるようになった。 A PTV system typically receives analog radio frequency (RF) broadcast signals (although the analog output of a satellite television decoder may be used), and those signals are demodulated and analog video decoded (NTSC, PAL). Etc.) and then convert to digital format. These signals are then compressed using a lossy digital video compression scheme, such as MPEG 1 or 2, and recorded in a mass storage device such as a high density hard disk drive (HDD). By using compression technology along with the high data capacity of recent storage devices (which continues to increase very rapidly with time), it becomes possible to record hours of broadcast video on the HDD, thereby enabling analog video recording. The need for tape for use has been eliminated and the advantages of digital storage described above have been provided.

一般的に、映像処理システムは、アナログ放送またはデジタル映像のいずれかとして送信されたが、両方としてではないソースの、画質を改善するように要請されている。アナログ放送映像の場合には、画質は、典型的には、たとえば、送信チャネルと映像信号の復調およびアナログ復号とによってスペクトル形状が決まるガウシアンノイズを含み得る、チャネルノイズによって劣化させられる。色精度や画像の鮮鋭度といった他の特徴も、この処理によって影響を受け得る。デジタル映像の場合には、ノイズは存在せず（映像信号のデジタル化前に取り込まれた少量のノイズを除く）、色精度は影響を受けない。しかしながら、損失を伴う圧縮およびそれに続く復号処理により、ブロック劣化（ｂｌｏｃｋｉｍｐａｉｒｍｅｎｔ）等の多くの映像アーティファクトが取り込まれる可能性があり、また、画像の鮮鋭度も影響を受け得る。逆に、そのようなアーティファクトは、アナログ放送映像ソースには存在し得ない（そのソースがもとはデジタル形式である場合を除く。そのような場合、ソースは通常、低い圧縮比を用いて符号化されたものであり、したがって、高画質で、上記の画質劣化がほとんどないものである）。したがって、従来は、どちらか一方のカテゴリーの画質劣化のみが、受信された映像放送に現れ、両方が現れることはなかった。 In general, video processing systems are required to improve the image quality of sources that are transmitted as either analog broadcast or digital video, but not both. In the case of analog broadcast video, image quality is typically degraded by channel noise, which can include, for example, Gaussian noise whose spectral shape is determined by the transmission channel and the demodulation and analog decoding of the video signal. Other features such as color accuracy and image sharpness can also be affected by this process. In the case of digital video, there is no noise (except for a small amount of noise captured before digitization of the video signal), and color accuracy is not affected. However, lossy compression and subsequent decoding can introduce many video artifacts, such as block impairments, and can also affect image sharpness. Conversely, such artifacts cannot exist in an analog broadcast video source (except when the source is originally in digital form. In such cases, the source is typically encoded using a low compression ratio. Therefore, the image quality is high and there is almost no deterioration of the image quality). Therefore, conventionally, only the image quality degradation of one of the categories has appeared in the received video broadcast, but not both.

しかしながら、ＰＴＶシステムでは、アナログ放送信号は、従来のやり方で復調および復号されるので、従来のアナログ形式の画質劣化のうち、任意のまたはすべての画質劣化を有し得る。その後、その信号は、可能な限り少ないＨＤＤ領域を使用するために、許容できる最も高い圧縮比で圧縮される。これにより、後に映像がＨＤＤから読み出され復号される際に、上記で述べたデジタル形式の画質劣化のうち、任意のまたはすべての画質劣化が映像に取り込まれる。かかる画像劣化は、これらのシステムで一般に使用される高い圧縮比では顕著となり得る。結果として、ユーザーが見る際には、同一の映像シーケンスが、アナログ放送の画質劣化とデジタル符号化の画質劣化との両方を持つこととなり、それらの画質劣化のいずれかまたは両方が極めて顕著なものとなるかもしれない。これは、従来の状況とは明らかに異なる状況である。 However, in a PTV system, analog broadcast signals are demodulated and decoded in a conventional manner, and thus can have any or all of the image quality degradation of conventional analog formats. The signal is then compressed at the highest acceptable compression ratio to use as little HDD space as possible. As a result, when the video is later read from the HDD and decoded, any or all of the image quality degradations in the digital format described above are captured in the video. Such image degradation can be significant at the high compression ratios commonly used in these systems. As a result, when viewed by the user, the same video sequence will have both analog broadcast image quality degradation and digital encoding image quality degradation, and either or both of these image quality degradations will be extremely prominent. It may become. This is a situation that is clearly different from the conventional situation.

映像処理システムにおいて画質を最適化するため、多くの方法が提案されてきた。自動映像チェーン最適化のための一般的な方法が、ＶａｎＺｏｎ，ＫｅｅｓならびにＡｌｉ，Ｗａｌｉｄの「自動映像チェーン最適化（ＡｕｔｏｍａｔｅｄＶｉｄｅｏＣｈａｉｎＯｐｔｉｍｉｚａｔｉｏｎ）」、コンシューマー・エレクトロニクス国際会議（ＩｎｔｅｒｎａｔｉｏｎａｌＣｏｎｆｅｒｅｎｃｅｏｎＣｏｎｓｕｍｅｒＥｌｅｃｔｒｏｎｉｃｓ；ＩＣＣＥ）、２００１年、および、Ａｌｉ，ＷａｌｉｄならびにＶａｎＺｏｎ，Ｋｅｅｓの「並列的な進化モデル化による、直列接続された映像処理モジュールのランダムなシステムの最適化（ＯｐｔｉｍｉｚｉｎｇａＲａｎｄｏｍＳｙｓｔｅｍｏｆＣａｓｃａｄｅｄＶｉｄｅｏＰｒｏｃｅｓｓｉｎｇＭｏｄｕｌｅｓｂｙＰａｒａｌｌｅｌＥｖｏｌｕｔｉｏｎＭｏｄｅｌｉｎｇ）」、画像処理国際会議（ＩｎｔｅｒｎａｔｉｏｎａｌＣｏｎｆｅｒｅｎｃｅｏｎＩｍａｇｅＰｒｏｃｅｓｓｉｎｇ）、２００１年に記載されている。これらの文献はいずれも、参照によりその全内容が本明細書に含まれているものとする。参照映像シーケンスが、処理動作の初期映像処理チェーンを用いて処理され、画質が測定される。通常、大きなパラメータ探索空間に亘って全体的な最適化を行うことを可能とする遺伝アルゴリズム等の最適化技術が、結果として得られる映像の客観的画質（ｏｂｊｅｃｔｉｖｅｉｍａｇｅｑｕａｌｉｔｙ；ＯＩＱ）を特定する方法と共に適用される。かかる方法は、アナログ放送映像の画質を改善するように設計されたアナログ映像処理システム、またはデジタル符号化された映像用に設計されたデジタル映像システムに、適用または提案されてきた。 Many methods have been proposed to optimize image quality in video processing systems. General methods for automatic video chain optimization are “Automated Video Chain Optimization” by Van Zon, Kees and Ali, Walid, International Conference on Consumer Electronics; ICCE ), 2001, and Ali, Walid and Van Zon, Kees, “Optimizing a Random System of Cascaded Video Processing Modules by Serially Connected Video Processing Modules.” Parallel Evoluti on Modeling ”, International Conference on Image Processing, 2001. All of these documents are hereby incorporated by reference in their entirety. The reference video sequence is processed using the initial video processing chain of processing operations and the image quality is measured. Usually, an optimization technique such as a genetic algorithm that makes it possible to perform overall optimization over a large parameter search space is a method for specifying an objective image quality (OIQ) of a resultant image. Applied with. Such methods have been applied or proposed for analog video processing systems designed to improve the quality of analog broadcast video or digital video systems designed for digitally encoded video.

アナログ映像の場合、チャネルノイズ低減（時間的および／または空間的）や、鮮鋭度強調のためのピーキング（水平方向、および場合によっては垂直方向）といった技術が、画質の改善によく用いられ、典型的には、映像処理の「チェーン」がこれらおよびその他のビデオ機能を実施する。そのようなチェーンの最適化は、所与の機能に対して可能な限りの最良の画質を同時にもたらすような、これらすべての機能についてのパラメータの設定を必要とする。 For analog video, techniques such as channel noise reduction (temporal and / or spatial) and peaking for sharpness enhancement (horizontal and sometimes vertical) are often used to improve image quality. Specifically, a “chain” of video processing performs these and other video functions. Such chain optimization requires the setting of parameters for all these functions so as to simultaneously provide the best possible image quality for a given function.

デジタル映像については、異なる画像の対応領域間にノイズの結果として微小差分（ｓｍａｌｌｄｉｆｆｅｒｅｎｔｉａｌｓ）が現れるというよく知られた現象があり、それがやはり符号化を免れず、それにより限られたデータ帯域幅を無駄にしてしまうため、圧縮符号化に先立って何らかの形式のノイズ低減が使用されることが多い。ノイズの保持に加えてのこの結果は、有効符号化データレートを減少させることと等価であり、さらには圧縮比を増大させることと等価である。通常、かかる低いノイズレベルの直接的な結果である画質の視覚上の劣化は、これらのシステムにおいて考えるべき副次的な事項である。また、いくつかのシステムでは、復号後において高い圧縮比での符号化の結果として現れるブロック劣化の効果を、低減する方法が用いられている。やはりこれらの高い比において生じ得る高周波成分を破棄することにより、画像の鮮鋭度も顕著に減少し、何らかの形式の鮮鋭度強調が望ましくなる可能性がある。 For digital video, there is a well-known phenomenon that small differentials appear as a result of noise between corresponding regions of different images, which are still subject to encoding and thereby limited data bandwidth In some cases, some form of noise reduction is used prior to compression coding. This result in addition to noise retention is equivalent to reducing the effective encoded data rate and is equivalent to increasing the compression ratio. The visual degradation of image quality, which is usually a direct result of such low noise levels, is a secondary matter to consider in these systems. In some systems, a method of reducing the effect of block deterioration that appears as a result of encoding at a high compression ratio after decoding is used. By discarding the high frequency components that can also occur at these high ratios, the sharpness of the image is also significantly reduced, and some form of sharpness enhancement may be desirable.

したがって、ＰＴＶのための最適化方法が望まれている。 Therefore, an optimization method for PTV is desired.

パーソナルテレビシステムをモデル化するための方法、システムおよびコンピュータ読取可能な媒体が提供される。パーソナルテレビシステムにより受信映像信号に実行される処理工程を含む、映像処理チェーンのコンピュータ実装モデルが形成され、パーソナルテレビシステム内における記憶前の映像信号の符号化に相当する第１の機能部分と、パーソナルテレビシステム内における記憶された符号化済映像データの復号に相当する第２の機能部分とを含む、１つまたは複数の追加機能が上記のモデルに挿入される。 Methods, systems, and computer readable media are provided for modeling personal television systems. A first functional portion corresponding to encoding of a video signal before storage in a personal television system, wherein a computer-implemented model of a video processing chain is formed, including processing steps performed on a received video signal by a personal television system; One or more additional functions are inserted into the model, including a second functional part corresponding to the decoding of the stored encoded video data in the personal television system.

デジタル映像の符号化、記憶、読出し、復号および処理の機能を有する完全なＰＴＶシステムは、上記の最初から４つ目までの要素（ＰＴＶ内における符号化、記憶、読出しおよび復号）を映像処理チェーンの一部として取り扱うことにより、アナログ映像システムについて上記に述べたことに基づく技術を用いて最適化することができる。特に符号化は、典型的には、主たる（かつ、オプションとして唯一の）システムパラメータとして、所望のデータレート（または圧縮比）を用いて行われる。一方、復号は、あくまで取り出された映像データストリームの関数であって、したがって、調整可能なパラメータを有さない。 A complete PTV system having the functions of encoding, storing, reading, decoding and processing of digital video has the above-mentioned first to fourth elements (encoding, storing, reading and decoding in PTV) as a video processing chain. , The analog video system can be optimized using techniques based on what has been described above. In particular, encoding is typically performed using the desired data rate (or compression ratio) as the main (and optionally only) system parameter. On the other hand, decoding is a function of the extracted video data stream, and therefore has no adjustable parameters.

最後に、最近ますます一般的となってきているように、映像データのデータフロー表現が当該最適化方法で使用される場合には、記憶媒体の書込みおよび読出しは、単に映像データストリームの時間遅延を構成するものであり、したがって完全に無視できる。ＭＰＥＧ形式の符号化および復号は、この最適化の状況においてそれとしてモデル化され得るデジタル処理アルゴリズムであるが、それに加えて、この方法はデータ主導の方法であるので、ハードディスクにデータを記憶させてその後データを読み出す際には、時間の概念がない。したがって、後に読出しが続くデータの書込みは、時間遅延に過ぎない。符号化、記憶、読出しおよび復号は、常に直接連続したものとして実行されるので、デジタル処理システムのこれら４つの要素を、単一の機能として取り扱うことが可能である。 Finally, as the data flow representation of video data is used in the optimization method, as recently more and more common, storage media writing and reading are simply time delays in the video data stream. Therefore, it can be completely ignored. MPEG format encoding and decoding is a digital processing algorithm that can be modeled as such in the context of this optimization, but in addition, since this method is a data driven method, the data can be stored on a hard disk. When reading data thereafter, there is no concept of time. Therefore, writing data that is followed by reading is only a time delay. Since the encoding, storing, reading and decoding are always performed as directly continuous, these four elements of the digital processing system can be treated as a single function.

さらには、ＰＴＶの符号化、記憶、読出しおよび復号の処理動作の比較的単純なモデルでは、最適化処理中において単一のパラメータを変化させて、有効な最適化をもたらすことが可能である。かかる単一のパラメータの一例は、圧縮比である。圧縮比とデータレートとの間には１対１の関係があるので、圧縮比の代わりとしてデータレートを用いてもよい。あるいは、最適化処理中において２つ以上のパラメータを変化させる、より複雑なＰＴＶの符号化、記憶、読出しおよび復号の処理動作のモデルを用いてもよい。図１は、ＰＴＶシステムの最適化のためのシステムを示したブロック図である。 Furthermore, a relatively simple model of PTV encoding, storing, reading and decoding processing operations can change a single parameter during the optimization process to provide effective optimization. An example of such a single parameter is the compression ratio. Since there is a one-to-one relationship between the compression ratio and the data rate, the data rate may be used instead of the compression ratio. Alternatively, a more complex model of PTV encoding, storage, reading and decoding processing operations that changes two or more parameters during the optimization process may be used. FIG. 1 is a block diagram illustrating a system for optimizing a PTV system.

ＰＴＶシステムは、各タイプのシステム（すなわち、アナログおよびデジタル）について上記に述べたそれぞれの機能の組合せを用いて、モデル化され得る。しかしながら、ＰＴＶシステムでは、ノイズ低減は、アナログチャネルノイズの低減とデジタル符号化効率の改善との２つの機能を果たす。また、鮮鋭度強調は、チャネル、復調および復号によるアナログ周波数の歪みと、デジタル圧縮による高周波情報の破棄との、両方の効果を打ち消す必要がある。 A PTV system can be modeled using a combination of the respective functions described above for each type of system (ie, analog and digital). However, in PTV systems, noise reduction serves two functions: reducing analog channel noise and improving digital coding efficiency. In addition, the sharpness enhancement needs to cancel both the effects of the distortion of the analog frequency due to channel, demodulation and decoding, and the discarding of high frequency information due to digital compression.

そのようなシステムの適切な最適化は、アナログシステムとデジタルシステムとの個々の最適化を、単純に組み合わせるだけよりも多くの要請を必然的に伴う。映像処理チェーン内で実行される機能は非直線的であるので、モデル内において複数の機能が実行される順序は、実際の映像処理チェーン内でそれらの機能が実行される順序を反映したものとされるべきであり、処理動作の順序（それらのアルゴリズム・パラメータおよびデータビット精度に加えて）は最適化され得る。ＰＴＶシステムは、従来型の映像処理チェーンの要素、すなわち、符号化、記憶、読出しおよび復号機能の要素に対して、たった１つの追加要素を加えることによって、モデル化することができる。この要素はまた、たった１つのパラメータ（具体的には圧縮比）しか有していない。したがって、最適化の複雑さを著しく増大させることなく、従来型の映像処理チェーンにおいて提案され使用されてきたモデル化方法を変更して、ＰＴＶシステムをモデル化することができる。そのため、かかる方法を用いて、商業的意義がますます増大していると見られるタイプの映像処理システムの、全体的な画質を最適化することができる。 Proper optimization of such a system entails more demand than simply combining individual optimizations of analog and digital systems. Since the functions executed in the video processing chain are non-linear, the order in which multiple functions are executed in the model reflects the order in which those functions are executed in the actual video processing chain. Should be done and the order of processing operations (in addition to their algorithm parameters and data bit precision) can be optimized. The PTV system can be modeled by adding only one additional element to the elements of the conventional video processing chain, ie the elements of encoding, storing, reading and decoding functions. This element also has only one parameter, specifically the compression ratio. Thus, the PTV system can be modeled by modifying the modeling methods that have been proposed and used in conventional video processing chains without significantly increasing the optimization complexity. For this reason, such methods can be used to optimize the overall image quality of a type of video processing system that is expected to be of increasing commercial significance.

図１を参照すると、例示的な最適化システムが示されている。このシステムは、映像処理チェーン・シミュレータ１００を含んでいる。シミュレータ１００は、イベント主導のシミュレーションとは対照的に、データ主導のものである。シミュレータ１００は、入力参照映像に対して、ＰＴＶ内の実際の映像処理システムが実行する処理動作と実質的に同一のデジタル信号処理動作を実行する。ただし、シミュレータ１００は、それらの処理動作をリアルタイムで実行しなくてもよい点のみが、実際の映像処理システムと異なる。 Referring to FIG. 1, an exemplary optimization system is shown. This system includes a video processing chain simulator 100. The simulator 100 is data driven as opposed to event driven simulation. The simulator 100 performs a digital signal processing operation substantially the same as the processing operation performed by the actual video processing system in the PTV on the input reference video. However, the simulator 100 differs from an actual video processing system only in that these processing operations do not have to be executed in real time.

映像処理チェーン・シミュレータ１００に入力されるデータは、デジタルフォーマットのデータである。ＰＴＶがアナログ送信信号を受信する場合は、映像入力データは、ベース帯域映像信号への復調とデジタル化とが実行された後のアナログ送信信号を表すものとなる。最適化の際には、反復モデル化および最適化の処理全体に亘って、同一の映像シーケンスが入力として用いられる。 Data input to the video processing chain simulator 100 is data in a digital format. When the PTV receives an analog transmission signal, the video input data represents the analog transmission signal after being demodulated and digitized into a baseband video signal. During optimization, the same video sequence is used as input throughout the iterative modeling and optimization process.

映像処理チェーン・シミュレータ１００は、複数の映像処理アルゴリズム１１０ａ−１１０ｎを含んでいる。たとえば、アルゴリズム１（ブロック１１０ａ）はノイズ低減であってもよく、アルゴリズム２（ブロック１１０ｂ）は鮮鋭度強調であってもよく、そのようにしてアルゴリズムＮ（ブロック１１０ｎ）まで続き、アルゴリズムＮ（ブロック１１０ｎ）はＭＰＥＧブロッキング効果の除去であってもよい。アルゴリズムの選択は、モデル化されるべきＰＴＶシステム内で用いられているアルゴリズムに主導される。 The video processing chain simulator 100 includes a plurality of video processing algorithms 110a to 110n. For example, algorithm 1 (block 110a) may be noise reduction and algorithm 2 (block 110b) may be sharpness enhancement, thus continuing to algorithm N (block 110n) and algorithm N (block 110n) may be removal of the MPEG blocking effect. The choice of algorithm is driven by the algorithm used in the PTV system to be modeled.

符号化、記憶、読出しおよび復号機能の間に実行される処理を表すため、少なくとも１つの追加機能１２０が追加される。この機能１２０の符号化部分は、最適化されたＰＴＶシステム内で使用される実際の符号化ハードウェアをモデル化したものとされるべきである。実際には、ＭＰＥＧ形式のエンコーダ・チップが使用され得る。かかるチップは、ある決まった符号化アルゴリズムを用いるものであるので、そのチップにより実装されるアルゴリズムのソフトウェアモデルが用いられる。ＭＰＥＧの１つの目的はできるだけ良好な圧縮を得ること、すなわちできるだけ良好な画像の忠実度を得ることであるので、ＭＰＥＧ形式のエンコーダを用いた、種々の異なるモデルが存在する。このことは、できるだけ多くの画素を効率的に処理すること、すなわち、最小限の数の符号化されたビットを用いてできるだけ多くの画素情報を処理することを必要とする。任意の映像シーケンスに対して、可能かつ適正な符号化は多数存在する。そのため、エンコーダ・モデルは、システム内で使用される実際のハードウェアをモデル化したものとされるべきである。 At least one additional function 120 is added to represent the processing performed during the encoding, storing, reading and decoding functions. The coding portion of this function 120 should be modeled on the actual coding hardware used in the optimized PTV system. In practice, an MPEG format encoder chip may be used. Since such a chip uses a certain encoding algorithm, a software model of an algorithm implemented by the chip is used. Since one purpose of MPEG is to obtain as good compression as possible, i.e. to obtain as good image fidelity as possible, there are a variety of different models using MPEG-style encoders. This requires that as many pixels as possible be processed efficiently, that is, as much pixel information as possible using a minimum number of encoded bits. There are many possible and proper encodings for any video sequence. Therefore, the encoder model should be modeled on the actual hardware used in the system.

機能１２０のデコーダ部分は、ＭＰＥＧ１規格（ＩＳＯ／ＩＥＣ１１１７２−１（から−５まで）：１９９３）またはＭＰＥＧＩＩ規格（ＩＳＯ／ＩＥＣ１３８１８−４：１９９８／Ａｍｄ２：２０００）から直接実装されたものであってもよい。これらの規格は、参照により、その内容が本明細書に含まれているものとする。ＭＰＥＧ規格の様々な部分は、Ｃコードのセグメントまたはブロックとして規定されているので、当業者においては、ＭＰＥＧデコーダをモデル化するためのデコーダ・Ｃコードを容易に採用できるであろう。 The decoder part of function 120 was implemented directly from the MPEG1 standard (ISO / IEC 11172-1 (from -5): 1993) or MPEGII standard (ISO / IEC 13818-4: 1998 / Amd2: 2000). May be. The contents of these standards are hereby incorporated by reference. Since the various parts of the MPEG standard are defined as C code segments or blocks, one skilled in the art could easily employ a decoder C code to model an MPEG decoder.

上記に述べたように、記憶および取出しの処理動作は遅延に相当するのみであり、シミュレーション中ではモデル化されない。いずれの処理動作も、データを変換したり、画質に影響を与えたりするものではない。映像処理チェーン・シミュレータ１００からの復号された出力データは、客観的画質測定ブロック１３０へと供給される。客観的画質測定基準は、典型的には、予め決められた画像または映像シーケンスの組を用いた画像または映像シーケンスについての、客観的測定基準により与えられた画質尺度と、視聴者の集団による主観的な画像評価との間に、良好な相関付けを与えることにより、選択され調整される。客観的画質測定基準は、たとえば、アナログノイズ、ブロッキング、リンギング、モスキート状アーティファクトおよびその他のタイプのアーティファクトを考慮に入れた、画質の尺度を付与するものであってもよい。この測定基準は、上記の要素の各々が画質に及ぼす影響に対する互いに異なる重み付けにも適用可能なものである。その場合、重みを調整することにより、客観的な測定基準と主観的な評価とによって与えられた尺度の間の相関付けを、改善することができる。 As described above, the storage and retrieval processing operations only correspond to delays and are not modeled during simulation. Neither processing operation converts data or affects image quality. The decoded output data from the video processing chain simulator 100 is supplied to the objective image quality measurement block 130. An objective image quality metric typically consists of an image quality measure given by the objective metric for an image or video sequence using a predetermined set of images or video sequences, and a subjective Selected and adjusted by giving a good correlation with the typical image evaluation. An objective image quality metric may provide a measure of image quality that takes into account, for example, analog noise, blocking, ringing, mosquito artifacts and other types of artifacts. This metric can also be applied to different weightings for the effects of each of the above factors on image quality. In that case, adjusting the weights can improve the correlation between the measures given by the objective metric and the subjective assessment.

アルゴリズム・パラメータ、処理の順序付け、データビット精度、および結果として得られる客観的画質測定値が、最適化の目的に関係するパラメータである。実際の映像出力データは、最適化目的のために記憶されなくてもよい。 Algorithm parameters, processing ordering, data bit accuracy, and resulting objective image quality measurements are parameters that are relevant to the purpose of the optimization. Actual video output data may not be stored for optimization purposes.

ブロック１４０は、映像処理システム最適化ブロックである。この最適化ブロック１４０の望ましい属性には、精度とスピードが含まれる。パラメータ値と処理の順序付けとの異なる組合せは極めて多数存在するので、全体的な最適条件を見つけるために１つ１つすべての組合せを実行してみるのは現実的でない。一方、より大きな全体的な最大値またはより小さな全体的な最小値がある場合でも、極大値または極小値に収束する傾向のあるアルゴリズム（たとえばニュートン法）も、使用されるべきではない。 Block 140 is a video processing system optimization block. Desirable attributes of this optimization block 140 include accuracy and speed. Since there are so many different combinations of parameter values and processing ordering, it is not practical to try every single combination to find the overall optimum. On the other hand, even if there is a larger overall maximum or a smaller overall minimum, algorithms that tend to converge to local maxima or minima (eg, Newton's method) should not be used.

遺伝アルゴリズムは、映像処理チェーン最適化ブロック１４０のための、１つの好ましいアプローチである。遺伝アルゴリズムは、探索空間に関する事前情報を要さずに、全体的な最適条件に向かって進化していくことのできる、進化論に基づいた反復的かつ非限定的なアプローチを用いている。遺伝アルゴリズムは、それぞれが複数の「遺伝子」からなる「染色体」と呼ばれる解候補の組を生成する。各遺伝子は、特定のアルゴリズム・パラメータ値（またはアルゴリズム・パラメータ値のサブセット）、処理の順序付け、またはデータビット精度に対応する。 A genetic algorithm is one preferred approach for the video processing chain optimization block 140. Genetic algorithms use an iterative and non-restrictive approach based on evolution that can evolve towards the overall optimum without requiring prior information about the search space. The genetic algorithm generates a set of solution candidates called “chromosomes” each consisting of a plurality of “genes”. Each gene corresponds to a particular algorithm parameter value (or a subset of algorithm parameter values), processing ordering, or data bit precision.

１つの世代内の各染色体について、映像チェーンが形成される。これは、所与の所望の順序に並べられたすべてのブロック、所与のデータビット精度値の組、および所与のアルゴリズム・パラメータ値の組が選択され、シミュレーション１００が実行され、客観的画質測定１３０の番号（「適合値（ｆｉｔｎｅｓｓｖａｌｕｅ）」）がその遺伝アルゴリズムに与えられることを意味する。客観的画質測定１３０の値に基づいて、最適な画質測定値を有する染色体のサブセットが選択され、「クロスオーバー」によって組み合わされて、次の世代が形成される。クロスオーバーでは、アルゴリズム・パラメータ値のサブセット、パラメータの順序付け、およびデータビット精度（すなわち遺伝子のサブセット）が、染色体間で交換される。その後、ユーザー定義された何らかの（通常は低い）確率によって遺伝子のいくつかを乱すことにより、「変異（ｍｕｔａｔｉｏｎ）」が導入される。変異は、探索空間において探索される可能性がゼロである部分がなくなるように保証し、それによって、よりよい全体的な最適条件の解が存在しているときに極小値または極大値に収束してしまう可能性を低減させる。次の世代の染色体の全体の組は、直ちに処理され評価される準備が整っている。 A video chain is formed for each chromosome within a generation. This is because all blocks in a given desired order, a given set of data bit precision values, and a given set of algorithm parameter values are selected, and a simulation 100 is performed to provide objective image quality. It means that the number of the measurement 130 (“fitness value”) is given to the genetic algorithm. Based on the values of the objective image quality measurement 130, the subset of chromosomes with the optimal image quality measurement is selected and combined by “crossover” to form the next generation. In crossover, a subset of algorithm parameter values, parameter ordering, and data bit precision (ie, a subset of genes) are exchanged between chromosomes. A “mutation” is then introduced by perturbing some of the genes with some (usually low) probability defined by the user. Mutation ensures that no part of the search space is likely to be searched, thereby converging to a local minimum or local maximum when a better overall optimal solution exists. To reduce the possibility of losing. The entire set of next generation chromosomes is ready for immediate processing and evaluation.

次の世代に使用される値が選択されると、映像処理チェーン制御ブロック１５０が、処理の順序付けおよびデータビット精度に加え、映像処理アルゴリズム１１０ａ−１１０ｎのそれぞれについて、さらに符号化、記憶、読出しおよび復号機能１２０について、パラメータ値を設定する。これにより、次の世代の染色体のそれぞれについて、同一の入力映像データの組を用いて、映像処理チェーン・シミュレータ１００を再実行させることが可能となる。 Once the value to be used for the next generation is selected, the video processing chain control block 150 further encodes, stores, reads and stores each of the video processing algorithms 110a-110n in addition to processing ordering and data bit precision. A parameter value is set for the decoding function 120. Thus, the video processing chain simulator 100 can be re-executed using the same set of input video data for each of the next generation chromosomes.

反復処理の終了基準は、許容できる近似解または安定解への到達に基づく基準であってもよいし、予め決められた回数の変異分の反復や、予め決められた数の世代分の反復等に基づく基準であってもよい。 The criterion for termination of the iterative process may be a criterion based on the arrival of an acceptable approximate solution or a stable solution, or a predetermined number of mutation iterations, a predetermined number of generation iterations, etc. It may be a criterion based on.

遺伝アルゴリズムは、探索空間全体の比較的小さな部分の評価しか必要としないにもかかわらず（その結果、早い収束をもたらす）、精確な結果を与えるので、魅力的である。しかしながら、当業者においては、映像処理システム最適化ブロック１４０に、他の最適化アルゴリズムを使用することもできるであろう。 Genetic algorithms are attractive because they give accurate results despite requiring only a relatively small portion of the entire search space to be evaluated (resulting in fast convergence). However, those skilled in the art could use other optimization algorithms for the video processing system optimization block 140.

図２は、パーソナルテレビシステムをモデル化するための１つの方法を示している。 FIG. 2 illustrates one method for modeling a personal television system.

ステップ２００において、パーソナルテレビシステムにより受信映像信号に実行される処理工程１１０ａ−１１０ｎを含む、映像処理チェーンのコンピュータ実装モデル１００が形成される。このモデルは、アルゴリズム・パラメータ、処理の順序付けおよびデータビット精度の、調整を許容するものである。 In step 200, a computer-implemented model 100 of a video processing chain is formed that includes processing steps 110a-110n performed on a received video signal by a personal television system. This model allows for adjustment of algorithm parameters, processing ordering and data bit accuracy.

ステップ２０２では、パーソナルテレビシステム内における記憶前の映像信号の符号化に相当する、エンコーダ固有の機能部分が、映像処理チェーン・モデル１００に挿入される。 In step 202, an encoder-specific functional part corresponding to the encoding of the video signal before storage in the personal television system is inserted into the video processing chain model 100.

ステップ２０４では、シミュレーションは、記憶および取出しを無視することができる。これは、これらの処理動作のいずれも、画像データを変換したり、画質に影響を与えたりしないためである。 In step 204, the simulation can ignore storage and retrieval. This is because none of these processing operations convert image data or affect image quality.

ステップ２０６では、ＰＴＶ内に記憶されているデータの、ＭＰＥＧ１形式またはＭＥＰＧ２形式の復号に相当する機能部分が、映像処理チェーン・モデル１００に挿入される。実施形態によっては、ステップ２０２および２０６が統合されて、符号化および復号に相当する単一の機能が形成され、記憶および読出しは無視される。 In step 206, a functional portion corresponding to the decoding of the MPEG1 format or the MPEG2 format of the data stored in the PTV is inserted into the video processing chain model 100. In some embodiments, steps 202 and 206 are integrated to form a single function corresponding to encoding and decoding, and storage and reading are ignored.

ステップ２０８−２１４は、反復による最適化を提供する。ステップ２０８では、映像処理チェーン・モデル１００が実行される。 Steps 208-214 provide iterative optimization. In step 208, the video processing chain model 100 is executed.

ステップ２１０では、客観的画質測定基準と対照して、シミュレーションの結果が評価される。 In step 210, the simulation results are evaluated against an objective image quality metric.

ステップ２１２では、たとえば遺伝アルゴリズムであってもよい最適化アルゴリズムに従って、モデルが調整される。この調整は、ステップ２１４において、調整パラメータとして圧縮比（またはデータレート）を用いた符号化／復号機能の調整であって、この圧縮比（またはデータレート）が唯一の調整パラメータであってもよい調整を含んでいてもよい。 In step 212, the model is adjusted according to an optimization algorithm, which may be, for example, a genetic algorithm. This adjustment is an adjustment of the encoding / decoding function using the compression ratio (or data rate) as an adjustment parameter in step 214, and this compression ratio (or data rate) may be the only adjustment parameter. Adjustments may be included.

ステップ２０８−２１４は、終了基準が満たされるまで繰り返される。 Steps 208-214 are repeated until the termination criteria are met.

本発明は、コンピュータ実装された処理、およびそれらの処理を実行するための装置の形態で、実施されてもよい。本発明はまた、ランダム・アクセス・メモリ（ＲＡＭ）、フロッピー・ディスク、読出専用メモリ（ＲＯＭ）、ＣＤ−ＲＯＭ、ＤＶＤ−ＲＯＭ、ハードドライブ、高密度（たとえば「ＺＩＰ^ＴＭ」または「ＪＡＺＺ^ＴＭ」）リムーバブル・ディスク、または、コンピュータがその媒体からコンピュータ・プログラム・コードをロードし実行すると、そのコンピュータが本発明を実施する装置となるような、その他の任意のコンピュータ読取可能な記憶媒体といった、有形の媒体内に組み入れられたコンピュータ・プログラム・コードの形態で実施されてもよい。本発明はまた、たとえば、記憶媒体に記憶されたものであるか、コンピュータによりロードおよび／または実行されるものであるか、電気的な導線またはケーブル、光ファイバー、または電磁放射といった、何らかの伝送媒体を介して伝送されるものであるかにかかわらず、コンピュータがそのコンピュータ・プログラム・コードをロードし実行すると、そのコンピュータが本発明を実施する装置となるような、コンピュータ・プログラム・コードの形態で実施されてもよい。汎用プロセッサに実装される場合には、コンピュータ・プログラム・コードのセグメントによって、特定の論理回路が形成されるように、そのプロセッサが設定される。本発明は、例示的な実施形態の形で説明されてきたが、それらの形態に限定されるものではない。むしろ、本発明の範囲およびその均等の範囲から逸脱することなく当業者が考えつくであろう、本発明の他のバリエーションおよび実施形態を包含するように、各請求項は広く解釈されるべきである。 The present invention may be implemented in the form of computer-implemented processes and apparatuses for performing the processes. The present invention also includes random access memory (RAM), floppy disk, read only memory (ROM), CD-ROM, DVD-ROM, hard drive, high density (eg, “ZIP ^™ ” or “JAZZ ^™ ”). Tangible, such as a removable disk or any other computer-readable storage medium that, when the computer loads and executes the computer program code from the medium, makes the computer an apparatus embodying the invention It may be implemented in the form of computer program code embedded in the medium. The invention also includes any transmission medium, such as stored in a storage medium, loaded and / or executed by a computer, electrical conductors or cables, optical fiber, or electromagnetic radiation. Implemented in the form of computer program code such that when the computer loads and executes the computer program code, whether or not it is transmitted via the computer, the computer becomes an apparatus embodying the present invention. May be. When implemented on a general-purpose processor, the computer program code segments configure the processor so that a specific logic circuit is formed. Although the invention has been described in the form of exemplary embodiments, it is not limited to those forms. Rather, the claims should be construed broadly to include other variations and embodiments of the invention that would occur to those skilled in the art without departing from the scope of the invention and its equivalents. .

パーソナルテレビの映像処理チェーンを最適化するシステムのブロック図Block diagram of a system that optimizes the video processing chain of a personal television パーソナルテレビの映像処理チェーンを最適化する処理のフローチャートFlow chart of processing to optimize the video processing chain of personal TV

Claims

A method of modeling a personal television system,
(A) forming a computer-implemented model of a video processing chain including processing steps performed on the received video signal by the personal television system;
(B) a first functional part corresponding to the encoding of the video signal before storage in the personal television system, and a second functional part corresponding to decoding of the encoded video data stored in the personal television system. Inserting one or more additional functions, including functional parts, into the model.

The method of claim 1, wherein the first functional portion includes an encoder specific model corresponding to a given encoder included in the personal television system.

3. The method according to claim 2, wherein the second functional part includes an MPEG1 format decoder model or an MPEG2 format decoder model.

The method of claim 1, wherein the first functional part and the second functional part are contained in a single function.

5. The method of claim 4, wherein the model ignores storage and retrieval of the encoded video signal within the personal television system.

5. A method according to claim 4, characterized in that the compression ratio of the stored encoded video signal is used as a parameter for adjusting the single function.

(C) after the step (b), executing the computer-implemented model;
The method of claim 1, further comprising: (d) adjusting the model of the video processing chain based on the output generated by step (c).

The method of claim 7, wherein step (d) comprises selecting the adjustment using a genetic algorithm.

8. The method of claim 7, further comprising evaluating the output generated by step (c) using an objective image quality metric.

A computer on which the computer program code is encoded and stored such that when the computer program code is executed by a processor, the processor executes a method for modeling a video processing chain in a personal television system. A readable medium, the method comprising:
(A) forming a computer-implemented model of a video processing chain including processing steps performed on the received video signal by the personal television system;
(B) a first functional part corresponding to the encoding of the video signal before storage in the personal television system, and a second functional part corresponding to decoding of the encoded video data stored in the personal television system. A computer-readable medium comprising: inserting one or more additional functions, including functional parts, into the model.

The computer-readable medium of claim 10, wherein the first functional portion includes an encoder specific model corresponding to a given encoder included in the personal television system.

12. The computer readable medium of claim 11, wherein the second functional portion includes an MPEG1 format decoder model or an MPEG2 format decoder model.

The computer-readable medium of claim 10, wherein the first functional portion and the second functional portion are contained in a single function.

The computer-readable medium of claim 13, wherein the model ignores storage and retrieval of the encoded video signal in the personal television system.

14. The computer readable medium of claim 13, wherein a compression ratio of the stored encoded video signal is used as a parameter for adjusting the single function.

Said method comprises
(C) after the step (b), executing the computer-implemented model;
11. The computer readable medium of claim 10, further comprising: (d) adjusting the model of the video processing chain based on the output generated by the step (c).

The computer-readable medium of claim 16, wherein step (d) comprises selecting the adjustment using a genetic algorithm.

The computer-readable medium of claim 16, wherein the method further comprises evaluating the output generated by step (c) using an objective image quality metric.

A system for modeling a personal television system,
Including a computer programmed with a model of a video processing chain, including processing steps performed on the received video signal by the personal television system;
The computer includes, in the model, a first functional part corresponding to encoding of the video signal before storage in the personal television system, and decoding of encoded video data stored in the personal television system. A system comprising one or more additional functions including a second functional part corresponding to.

20. The system of claim 19, wherein the first functional part includes an encoder specific model corresponding to a given encoder included in the personal television system.

21. The system of claim 20, wherein the second functional part includes an MPEG1 format decoder model or an MPEG2 format decoder model.

20. The system of claim 19, wherein the first functional part and the second functional part are included in a single function.

23. The system of claim 22, wherein the model ignores storage and retrieval of the encoded video signal within the personal television system.

23. The system of claim 22, wherein a compression ratio of the stored encoded video signal is used as a parameter for adjusting the single function.

20. The system of claim 19, further comprising automatic means for adjusting the model of the video processing chain based on output generated by executing the model.

26. The system of claim 25, wherein the automatic means selects the adjustment using a genetic algorithm.

The system of claim 25, further comprising means for evaluating an output generated by executing the model using an objective image quality metric.