JP2006101323A

JP2006101323A - Information processing apparatus and program used for the processing apparatus

Info

Publication number: JP2006101323A
Application number: JP2004286543A
Authority: JP
Inventors: Kosuke Uchida; 耕輔内田; Noriaki Kitada; 典昭北田; Satoshi Hoshina; 聡保科; Yoshihiro Kikuchi; 義浩菊池; Yuji Kawashima; 裕司川島
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 2004-09-30
Filing date: 2004-09-30
Publication date: 2006-04-13

Abstract

<P>PROBLEM TO BE SOLVED: To provide an information processing apparatus capable of smoothly executing decoding of a moving picture stream. <P>SOLUTION: A video playback application program 201 detects a present load amount of a computer 10. When the computer 10 is not in a high load state, the video playback application program 201 executes ordinary decode processing for decoding all coded images. When the computer 10 reaches the high load state, the video playback application program 201 executes particular decode processing. In the particular decode processing, decoding applied to either a top field or a bottom field is skipped and only the other field is decoded. <P>COPYRIGHT: (C)2006,JPO&NCIPI

Description

本発明はパーソナルコンピュータのような情報処理装置および同装置で用いられるデコード用のプログラムに関する。 The present invention relates to an information processing apparatus such as a personal computer and a decoding program used in the apparatus.

近年、ＤＶＤ（Digital Versatile Disc）プレーヤ、ＴＶ装置のようなオーディオ・ビデオ（ＡＶ）機器と同様のＡＶ機能を備えたパーソナルコンピュータが開発されている。 In recent years, personal computers having the same AV function as audio / video (AV) devices such as DVD (Digital Versatile Disc) players and TV devices have been developed.

このようなパーソナルコンピュータにおいては、圧縮符号化された動画像ストリームをソフトウェアによってデコードするソフトウェアデコーダが用いられている。ソフトウェアデコーダの使用により、専用のハードウェアを設けることなく、圧縮符号化された動画像ストリームをプロセッサ（ＣＰＵ）によってデコードすることが可能になる。 In such a personal computer, a software decoder that decodes a compression-coded moving image stream by software is used. By using a software decoder, it is possible to decode a compression-coded moving image stream by a processor (CPU) without providing dedicated hardware.

また、圧縮符号化された動画像ストリームをデコードする装置としては、トップフィールドおよびボトムフィールドの一方のみを表示してスロー再生、静止画再生のような特殊再生を行う場合に、表示することが必要なトップフィールドおよびボトムフィールドの一方のみをデコードするシステムが知られている（例えば、特許文献１参照）。
特開平１１−１３６６７９号公報 Also, as a device that decodes a compressed and encoded video stream, it is necessary to display only one of the top field and bottom field and perform special playback such as slow playback and still image playback. A system that decodes only one of the top field and the bottom field is known (see, for example, Patent Document 1).
JP-A-11-136679

しかし、トップフィールドおよびボトムフィールドの一方のみをデコードする処理が行われるのは特殊再生時のみである。このため、通常再生を行う場合には、トップフィールドおよびボトムフィールドの双方が常にデコードされることになる。 However, the process of decoding only one of the top field and the bottom field is performed only during special playback. For this reason, in normal playback, both the top field and the bottom field are always decoded.

ところで、最近では、次世代の動画像圧縮符号化技術として、Ｈ．２６４／ＡＶＣ（ＡＶＣ：ＡｄｖａｎｃｅｄＶｉｄｅｏＣｏｄｉｎｇ）規格が注目されている。Ｈ．２６４／ＡＶＣ規格は、ＭＰＥＧ２、ＭＰＥＧ４のような従来の圧縮符号化技術よりも高能率の圧縮符号化技術である。このため、Ｈ．２６４／ＡＶＣ規格に対応するエンコード処理およびデコード処理の各々においては、ＭＰＥＧ２、ＭＰＥＧ４のような従来の圧縮符号化技術よりも多くの処理量が必要とされる。 Recently, as a next-generation moving image compression coding technology, H.264 has been introduced. The H.264 / AVC (AVC: Advanced Video Coding) standard has attracted attention. H. The H.264 / AVC standard is a compression encoding technique that is more efficient than conventional compression encoding techniques such as MPEG2 and MPEG4. For this reason, H.C. Each of the encoding process and the decoding process corresponding to the H.264 / AVC standard requires a larger amount of processing than the conventional compression encoding techniques such as MPEG2 and MPEG4.

したがって、Ｈ．２６４／ＡＶＣ規格で圧縮符号化された動画像ストリームをソフトウェアによってデコードするように設計されたパーソナルコンピュータにおいては、システムの負荷が増大すると、デコード処理自体に遅れが生じ、これによってスムーズな動画再生を実行できなくなる危険がある。 Therefore, H.H. In a personal computer designed to decode a video stream compressed and encoded by the H.264 / AVC standard by software, if the system load increases, the decoding process itself will be delayed, thereby enabling smooth video playback. There is a risk that it cannot be executed.

本発明は上述の事情を考慮してなされたものであり、動画像ストリームのデコードをスムーズに実行することが可能な情報処理装置およびプログラムを提供することを目的とする。 The present invention has been made in consideration of the above-described circumstances, and an object thereof is to provide an information processing apparatus and a program that can smoothly decode a moving image stream.

上述の課題を解決するため、本発明は、圧縮符号化された動画像ストリームをデコードするためのデコード処理を実行する情報処理装置において、前記情報処理装置の負荷を検出する負荷検出手段と、前記負荷検出手段によって検出された負荷が所定の基準値よりも大きい場合、前記動画像ストリームに含まれるシンタックス情報に基づいて前記動画像ストリームに含まれる各符号化画面がフィールド画像およびフレーム画像のいずれであるか否かを判別する手段と、前記動画像ストリームに含まれる各符号化画面がフィールド画像である場合、トップフィールドおよびボトムフィールドのいずれか一方に対する前記デコード処理の実行をスキップする制御手段とを具備することを特徴とする。 In order to solve the above-described problem, the present invention provides an information processing apparatus that executes a decoding process for decoding a compression-encoded moving image stream, a load detection unit that detects a load of the information processing apparatus, When the load detected by the load detection unit is larger than a predetermined reference value, each encoded screen included in the moving image stream is a field image or a frame image based on syntax information included in the moving image stream. And a control means for skipping the execution of the decoding process for one of a top field and a bottom field when each coding screen included in the moving image stream is a field image; It is characterized by comprising.

本発明によれば、動画像ストリームのデコードをスムーズに実行することが可能となる。 According to the present invention, it is possible to smoothly decode a moving image stream.

以下、図面を参照して、本発明の実施形態を説明する。
まず、図１および図２を参照して、本発明の一実施形態に係る情報処理装置の構成について説明する。この情報処理装置は、例えば、ノートブック型パーソナルコンピュータ１０として実現されている。 Hereinafter, embodiments of the present invention will be described with reference to the drawings.
First, the configuration of an information processing apparatus according to an embodiment of the present invention will be described with reference to FIG. 1 and FIG. This information processing apparatus is realized as, for example, a notebook personal computer 10.

図１はノートブック型パーソナルコンピュータ１０のディスプレイユニットを開いた状態における正面図である。本コンピュータ１０は、コンピュータ本体１１と、ディスプレイユニット１２とから構成されている。ディスプレイユニット１２にはＬＣＤ（Liquid Crystal Display）１７から構成される表示装置が組み込まれており、そのＬＣＤ１７の表示画面はディスプレイユニット１２のほぼ中央に位置されている。 FIG. 1 is a front view of the notebook personal computer 10 with the display unit opened. The computer 10 includes a computer main body 11 and a display unit 12. The display unit 12 incorporates a display device composed of an LCD (Liquid Crystal Display) 17, and the display screen of the LCD 17 is positioned substantially at the center of the display unit 12.

ディスプレイユニット１２は、コンピュータ本体１１に対して開放位置と閉塞位置との間を回動自在に取り付けられている。コンピュータ本体１１は薄い箱形の筐体を有しており、その上面にはキーボード１３、本コンピュータ１を電源オン／オフするためのパワーボタン１４、入力操作パネル１５、およびタッチパッド１６などが配置されている。 The display unit 12 is attached to the computer main body 11 so as to be rotatable between an open position and a closed position. The computer main body 11 has a thin box-shaped casing, and a keyboard 13, a power button 14 for turning on / off the computer 1, an input operation panel 15, and a touch pad 16 are arranged on the upper surface. Has been.

入力操作パネル１５は、押されたボタンに対応するイベントを入力する入力装置であり、複数の機能をそれぞれ起動するための複数のボタンを備えている。これらボタン群には、ＴＶ起動ボタン１５Ａ、ＤＶＤ（Digital Versatile Disc）起動ボタン１５Ｂも含まれている。ＴＶ起動ボタン１５Ａは、デジタルＴＶ放送番組のような放送番組データの再生及び記録を行うためのＴＶ機能を起動するためのボタンである。ＴＶ起動ボタン１５Ａがユーザによって押下された時、ＴＶ機能を実行するためのアプリケーションプログラムが自動的に起動される。ＤＶＤ起動ボタン１５Ｂは、ＤＶＤに記録されたビデオコンテンツを再生するためのボタンである。ＤＶＤ起動ボタン１５Ｂがユーザによって押下された時、ビデオコンテンツを再生するためのアプリケーションプログラムが自動的に起動される。 The input operation panel 15 is an input device that inputs an event corresponding to a pressed button, and includes a plurality of buttons for starting a plurality of functions. These button groups also include a TV start button 15A and a DVD (Digital Versatile Disc) start button 15B. The TV activation button 15A is a button for activating a TV function for reproducing and recording broadcast program data such as a digital TV broadcast program. When the TV activation button 15A is pressed by the user, an application program for executing the TV function is automatically activated. The DVD start button 15B is a button for playing back video content recorded on a DVD. When the DVD activation button 15B is pressed by the user, an application program for reproducing video content is automatically activated.

次に、図２を参照して、本コンピュータ１０のシステム構成について説明する。 Next, the system configuration of the computer 10 will be described with reference to FIG.

本コンピュータ１０は、図２に示されているように、ＣＰＵ１１１、ノースブリッジ１１２、主メモリ１１３、グラフィクスコントローラ１１４、サウスブリッジ１１９、ＢＩＯＳ−ＲＯＭ１２０、ハードディスクドライブ（ＨＤＤ）１２１、光ディスクドライブ（ＯＤＤ）１２２、デジタルＴＶ放送チューナ１２３、エンベデッドコントローラ／キーボードコントローラＩＣ（ＥＣ／ＫＢＣ）１２４、およびネットワークコントローラ１２５等を備えている。 As shown in FIG. 2, the computer 10 includes a CPU 111, a north bridge 112, a main memory 113, a graphics controller 114, a south bridge 119, a BIOS-ROM 120, a hard disk drive (HDD) 121, and an optical disk drive (ODD) 122. , A digital TV broadcast tuner 123, an embedded controller / keyboard controller IC (EC / KBC) 124, a network controller 125, and the like.

ＣＰＵ１１１は本コンピュータ１０の動作を制御するために設けられたプロセッサであり、ハードディスクドライブ（ＨＤＤ）１２１から主メモリ１１３にロードされる、オペレーティングシステム（ＯＳ）、およびビデオ再生アプリケーションプログラム２０１のような各種アプリケーションプログラムを実行する。 The CPU 111 is a processor provided to control the operation of the computer 10, and various types such as an operating system (OS) and a video playback application program 201 loaded from the hard disk drive (HDD) 121 to the main memory 113. Run the application program.

ビデオ再生アプリケーションプログラム２０１は、圧縮符号化された動画像データをデコードおよび再生するためのソフトウェアである。このビデオ再生アプリケーションプログラム２０１は、Ｈ．２６４／ＡＶＣ規格に対応するソフトウェアデコーダである。ビデオ再生アプリケーションプログラム２０１は、Ｈ．２６４／ＡＶＣ規格で定義された符号化方式で圧縮符号化されている動画像ストリーム（例えば、デジタルＴＶ放送チューナ１２３によって受信されたデジタルＴＶ放送番組、光ディスクドライブ（ＯＤＤ）１２２から読み出されるＨＤ（High Definition）規格のビデオコンテンツ、など）をデコードするための機能を有している。 The video reproduction application program 201 is software for decoding and reproducing the compression-coded moving image data. This video playback application program 201 is an H.264 file. This is a software decoder corresponding to the H.264 / AVC standard. The video playback application program 201 is an H.264 file. A moving image stream (for example, a digital TV broadcast program received by the digital TV broadcast tuner 123, an HD (High) read out from the optical disc drive (ODD) 122) that is compressed and encoded by the encoding method defined in the H.264 / AVC standard. Definition) standard video content, etc.).

このビデオ再生アプリケーションプログラム２０１は、図３に示すように、負荷検出モジュール２１１、デコード制御モジュール２１２、およびデコード実行モジュール２１３を備えている。 As shown in FIG. 3, the video playback application program 201 includes a load detection module 211, a decode control module 212, and a decode execution module 213.

デコード実行モジュール２１３は、Ｈ．２６４／ＡＶＣ規格で定義されたデコード処理を実行するデコーダである。負荷検出モジュール２１１は、コンピュータ１０の負荷を検出するモジュールである。この負荷検出モジュール２１１は、例えば、オペレーティングシステム（ＯＳ）２００にコンピュータ１０の現在の負荷を問い合わせることによって、コンピュータ１０の現在の負荷量を検出する。コンピュータ１０の負荷量は、例えば、ＣＰＵ１１１の使用率に基づいて決定される。 The decode execution module 213 is an H.264 implementation. It is a decoder that executes a decoding process defined in the H.264 / AVC standard. The load detection module 211 is a module that detects the load of the computer 10. For example, the load detection module 211 detects the current load amount of the computer 10 by inquiring the current load of the computer 10 to the operating system (OS) 200. The load amount of the computer 10 is determined based on the usage rate of the CPU 111, for example.

また、コンピュータ１０の負荷量は、ＣＰＵ１１１の使用率とメモリ１１３の使用率との組み合わせによって決定することもできる。通常、ソフトウェアデコーダをスムーズに実行するためには、ある一定サイズ以上のメモリが必要である。システムのメモリ使用率が高くなると、ＯＳのページングにより、ソフトウェアデコーダのデコードパフォーマンスは低下する。よって、ＣＰＵ１１１の使用率とメモリ１１３の使用率との組み合わせによってコンピュータ１０の負荷量を検出することにより、コンピュータ１０の現在の負荷量がソフトウェアデコーダの実行に支障を来す負荷量（高負荷状態）であるかどうかをより精度よく判別することができる。 Further, the load amount of the computer 10 can be determined by a combination of the usage rate of the CPU 111 and the usage rate of the memory 113. Usually, in order to smoothly execute the software decoder, a memory having a certain size or more is required. As the memory usage rate of the system increases, the decoding performance of the software decoder decreases due to paging of the OS. Therefore, by detecting the load amount of the computer 10 based on the combination of the usage rate of the CPU 111 and the usage rate of the memory 113, the current load amount of the computer 10 interferes with the execution of the software decoder (high load state). ) Can be more accurately determined.

デコード制御モジュール２１２は、負荷検出モジュール２１１によって検出されたコンピュータ１０の負荷に応じて、デコード実行モジュール２１３によって実行されるデコード処理の内容を制御する。 The decode control module 212 controls the content of the decoding process executed by the decode execution module 213 according to the load on the computer 10 detected by the load detection module 211.

具体的には、デコード制御モジュール２１２は、コンピュータ１０の負荷量が予め決められた基準値以下である場合には、Ｈ．２６４／ＡＶＣ規格で定義されたデコード処理がＣＰＵ１１１によって実行されるように、デコード実行モジュール２１３によって実行すべきデコード処理の内容を制御する。一方、コンピュータ１０の負荷量が基準値よりも大きい場合には（高負荷状態）、デコード制御モジュール２１２は、Ｈ．２６４／ＡＶＣ規格で定義されたデコード処理の一部が省略または簡略された処理に置換されるように、デコード実行モジュール２１３によって実行すべきデコード処理の内容を制御する。 Specifically, when the load amount of the computer 10 is equal to or less than a predetermined reference value, the decode control module 212 determines the H.264 standard. The contents of the decoding process to be executed by the decoding execution module 213 are controlled so that the decoding process defined by the H.264 / AVC standard is executed by the CPU 111. On the other hand, when the load amount of the computer 10 is larger than the reference value (high load state), the decode control module 212 determines that the H.264 is in the H.264 standard. The content of the decoding process to be executed by the decoding execution module 213 is controlled so that a part of the decoding process defined in the H.264 / AVC standard is replaced with a process that is omitted or simplified.

ビデオ再生アプリケーションプログラム２０１によってデコードされた動画像データは、表示ドライバ２０２を介してグラフィクスコントローラ１１４のビデオメモリ１１４Ａに順次書き込まれる。これにより、デコードされた動画像データはＬＣＤ１７に表示される。表示ドライバ２０２はグラフィクスコントローラ１１４を制御するためのソフトウェアである。 The moving image data decoded by the video playback application program 201 is sequentially written into the video memory 114A of the graphics controller 114 via the display driver 202. As a result, the decoded moving image data is displayed on the LCD 17. The display driver 202 is software for controlling the graphics controller 114.

また、ＣＰＵ１１１は、ＢＩＯＳ−ＲＯＭ１２０に格納されたシステムＢＩＯＳ（Basic Input Output System）も実行する。システムＢＩＯＳはハードウェア制御のためのプログラムである。 The CPU 111 also executes a system BIOS (Basic Input Output System) stored in the BIOS-ROM 120. The system BIOS is a program for hardware control.

ノースブリッジ１１２はＣＰＵ１１１のローカルバスとサウスブリッジ１１９との間を接続するブリッジデバイスである。ノースブリッジ１１２には、主メモリ１１３をアクセス制御するメモリコントローラも内蔵されている。また、ノースブリッジ１１２は、ＡＧＰ（Accelerated Graphics Port）バスなどを介してグラフィクスコントローラ１１４との通信を実行する機能も有している。 The north bridge 112 is a bridge device that connects the local bus of the CPU 111 and the south bridge 119. The north bridge 112 also includes a memory controller that controls access to the main memory 113. The north bridge 112 also has a function of executing communication with the graphics controller 114 via an AGP (Accelerated Graphics Port) bus or the like.

グラフィクスコントローラ１１４は本コンピュータ１０のディスプレイモニタとして使用されるＬＣＤ１７を制御する表示コントローラである。このグラフィクスコントローラ１１４はビデオメモリ（ＶＲＡＭ）１１４Ａに書き込まれた画像データからＬＣＤ１７に送出すべき表示信号を生成する。 The graphics controller 114 is a display controller that controls the LCD 17 used as a display monitor of the computer 10. The graphics controller 114 generates a display signal to be sent to the LCD 17 from the image data written in the video memory (VRAM) 114A.

サウスブリッジ１１９は、ＬＰＣ（Low Pin Count）バス上の各デバイス、およびＰＣＩ（Peripheral Component Interconnect）バス上の各デバイスを制御する。また、サウスブリッジ１１９は、ＨＤＤ１２１、ＯＤＤ１２２を制御するためのＩＤＥ（Integrated Drive Electronics）コントローラを内蔵している。さらに、サウスブリッジ１１９は、デジタルＴＶ放送チューナ１２３を制御する機能、およびＢＩＯＳ−ＲＯＭ１２０をアクセス制御するための機能も有している。 The south bridge 119 controls each device on an LPC (Low Pin Count) bus and each device on a PCI (Peripheral Component Interconnect) bus. The south bridge 119 incorporates an IDE (Integrated Drive Electronics) controller for controlling the HDD 121 and the ODD 122. Further, the south bridge 119 has a function of controlling the digital TV broadcast tuner 123 and a function of controlling access to the BIOS-ROM 120.

ＨＤＤ１２１は、各種ソフトウェア及びデータを格納する記憶装置である。光ディスクドライブ（ＯＤＤ）１２３は、ビデオコンテンツが格納されたＤＶＤなどの記憶メディアを駆動するためのドライブユニットである。デジタルＴＶ放送チューナ１２３は、デジタルＴＶ放送番組のような放送番組データを外部から受信するための受信装置である。 The HDD 121 is a storage device that stores various software and data. The optical disk drive (ODD) 123 is a drive unit for driving a storage medium such as a DVD in which video content is stored. The digital TV broadcast tuner 123 is a receiving device for receiving broadcast program data such as a digital TV broadcast program from the outside.

エンベデッドコントローラ／キーボードコントローラＩＣ（ＥＣ／ＫＢＣ）１２４は、電力管理のためのエンベデッドコントローラと、キーボード（ＫＢ）１３およびタッチパッド１６を制御するためのキーボードコントローラとが集積された１チップマイクロコンピュータである。このエンベデッドコントローラ／キーボードコントローラＩＣ（ＥＣ／ＫＢＣ）１２４は、ユーザによるパワーボタン１４の操作に応じて本コンピュータ１０をパワーオン／パワーオフする機能を有している。さらに、エンベデッドコントローラ／キーボードコントローラＩＣ（ＥＣ／ＫＢＣ）１２４は、ユーザによるＴＶ起動ボタン１５Ａ、ＤＶＤ起動ボタン１５Ｂの操作に応じて、本コンピュータ１０をパワーオンすることもできる。ネットワークコントローラ１２５は、例えばインターネットなどの外部ネットワークとの通信を実行する通信装置である。 The embedded controller / keyboard controller IC (EC / KBC) 124 is a one-chip microcomputer in which an embedded controller for power management and a keyboard controller for controlling the keyboard (KB) 13 and the touch pad 16 are integrated. . The embedded controller / keyboard controller IC (EC / KBC) 124 has a function of powering on / off the computer 10 in accordance with the operation of the power button 14 by the user. Furthermore, the embedded controller / keyboard controller IC (EC / KBC) 124 can also power on the computer 10 in accordance with the operation of the TV start button 15A and the DVD start button 15B by the user. The network controller 125 is a communication device that executes communication with an external network such as the Internet.

次に、図４を参照して、ビデオ再生アプリケーションプログラム２０１によって実現されるソフトウェアデコーダの機能構成を説明する。 Next, the functional configuration of the software decoder realized by the video playback application program 201 will be described with reference to FIG.

ビデオ再生アプリケーションプログラム２０１のデコード実行モジュール２１３は、Ｈ．２６４／ＡＶＣ規格に対応しており、図示のように、エントロピー復号部３０１、逆量子化部３０２、逆ＤＣＴ部（DCT：Discrete Cosine Transform）３０３、加算部３０４、デブロッキングフィルタ部３０５、フレームメモリ３０６、動きベクトル予測部３０７、補間予測部３０８、重み付き予測部３０９、イントラ予測部３１０、およびモード切替スイッチ部３１１を含む。Ｈ．２６４の直交変換は整数精度であり、従来のＤＣＴとは異なるが、ここではＤＣＴと称することとする。 The decode execution module 213 of the video playback application program 201 is an H.264 file. As shown in the figure, the entropy decoding unit 301, the inverse quantization unit 302, the inverse DCT unit (DCT: Discrete Cosine Transform) 303, the addition unit 304, the deblocking filter unit 305, the frame memory 306, a motion vector prediction unit 307, an interpolation prediction unit 308, a weighted prediction unit 309, an intra prediction unit 310, and a mode changeover switch unit 311. H. The H.264 orthogonal transform has integer precision and is different from the conventional DCT, but is referred to as DCT here.

各画面（ピクチャ）の符号化は、たとえば１６ｘ１６画素のマクロブロック単位で実行される。各マクロブロックごとに、フレーム内符号化モード（イントラ符号化モード）および動き補償フレーム間予測符号化モード（インター符号化モード）のいずれか一方が選択される。 Each screen (picture) is encoded in units of macroblocks of 16 × 16 pixels, for example. For each macroblock, either the intraframe coding mode (intra coding mode) or the motion compensated interframe prediction coding mode (inter coding mode) is selected.

動き補償フレーム間予測符号化モードにおいては、既に符号化された画面（ピクチャ）からの動きを推定することによって、符号化対象画面に対応する動き補償フレーム間予測信号が定められた形状単位で生成される。そして、符号化対象画面（ピクチャ）から動き補償フレーム間予測信号を引いた予測誤差信号が、直交変換（ＤＣＴ）、量子化、およびエントロピー符号化によって、符号化される。また、イントラ符号化モードにおいては、符号化対象画面（ピクチャ）から予測信号が生成され、その予測信号が直交変換（ＤＣＴ）、量子化、およびエントロピー符号化によって、符号化される。 In motion-compensated interframe predictive coding mode, motion-predicted interframe prediction signals corresponding to the current picture are generated in a defined shape unit by estimating the motion from the already coded picture (picture). Is done. A prediction error signal obtained by subtracting the motion compensation inter-frame prediction signal from the encoding target screen (picture) is encoded by orthogonal transform (DCT), quantization, and entropy encoding. Further, in the intra coding mode, a prediction signal is generated from a coding target screen (picture), and the prediction signal is encoded by orthogonal transform (DCT), quantization, and entropy coding.

Ｈ．２６４／ＡＶＣ規格に対応するコーデックは、さらに圧縮率を高めるために、
（１）従来のＭＰＥＧよりも高い画素精度（１／４画素精度）の動き補償
（２）フレーム内符号化を効率的に行うためのフレーム内予測
（３）ブロック歪みを低減するためのデブロッキングフィルタ
（４）４ｘ４画素単位の整数ＤＣＴ
（５）任意の位置の複数の画面（ピクチャ）を参照画面として使用可能なマルチリファレンスフレーム
（６）重み付け予測
等の技術を利用する。 H. The codec corresponding to the H.264 / AVC standard,
(1) Motion compensation with higher pixel accuracy (1/4 pixel accuracy) than conventional MPEG (2) Intraframe prediction for efficient intraframe coding (3) Deblocking to reduce block distortion Filter (4) 4x4 pixel integer DCT
(5) A technique such as a multi-reference frame (6) weighted prediction that can use a plurality of screens (pictures) at arbitrary positions as a reference screen is used.

以下、図４のソフトウェアデコーダの動作を説明する。 Hereinafter, the operation of the software decoder of FIG. 4 will be described.

Ｈ．２６４／ＡＶＣ規格にしたがって圧縮符号化された動画像ストリームは、まず、エントロピー復号部３０１に入力される。圧縮符号化された動画像ストリームには、符号化された画像情報の他に、動き補償フレーム間予測符号化（インター予測符号化）で用いられた動きベクトル情報、フレーム内予測符号化（イントラ予測符号化）で用いられたフレーム内予測情報、予測モード（インター予測符号化／イントラ予測符号化）を示すモード情報等が含まれている。 H. A moving image stream compression-encoded according to the H.264 / AVC standard is first input to the entropy decoding unit 301. In addition to encoded image information, motion vector information used in motion-compensated interframe prediction encoding (inter-prediction encoding), intra-frame prediction encoding (intra prediction encoding) Intraframe prediction information used in (encoding), mode information indicating a prediction mode (inter prediction encoding / intra prediction encoding), and the like are included.

デコード処理は、たとえば１６ｘ１６画素のマクロブロック単位で実行される。エントロピー復号部３０１は動画像ストリームに対して可変長復号のようなエントロピー復号処理を施して、動画像ストリームから、量子化ＤＣＴ係数、動きベクトル情報（動きベクトル差分情報）、フレーム内予測情報、およびモード情報を分離する。この場合、例えば、デコード対象画面（ピクチャ）内の各マクロブロックは４ｘ４画素（または８ｘ８画素）のブロック毎にエントロピー復号処理され、各ブロックは４ｘ４（または８ｘ８画素）の量子化ＤＣＴ係数に変換される。以下では、各ブロックが４ｘ４である場合を想定する。 The decoding process is executed in units of macro blocks of 16 × 16 pixels, for example. The entropy decoding unit 301 performs entropy decoding processing such as variable length decoding on the moving image stream, and from the moving image stream, the quantized DCT coefficient, motion vector information (motion vector difference information), intra-frame prediction information, and Separate mode information. In this case, for example, each macroblock in the decoding target screen (picture) is subjected to entropy decoding processing for each block of 4 × 4 pixels (or 8 × 8 pixels), and each block is converted into 4 × 4 (or 8 × 8 pixels) quantized DCT coefficients. The In the following, it is assumed that each block is 4 × 4.

動きベクトル情報は、動きベクトル予測部３０７に送られる。フレーム内予測情報は、イントラ予測部３１０に送られる。モード情報はモード切替スイッチ部３１１に送られる。 The motion vector information is sent to the motion vector prediction unit 307. The intra-frame prediction information is sent to the intra prediction unit 310. The mode information is sent to the mode changeover switch unit 311.

各デコード対象ブロックの４ｘ４の量子化ＤＣＴ係数は、逆量子化部３０２による逆量子化処理により４ｘ４のＤＣＴ係数（直交変換係数）に変換される。この４ｘ４のＤＣＴ係数は、逆ＤＣＴ部３０３による逆整数ＤＣＴ（逆直交変換）処理によって、周波数情報から、４ｘ４の画素値に変換される。この４ｘ４の画素値は、デコード対象ブロックに対応する予測誤差信号である。この予測誤差信号は加算部３０４に送られ、そこでデコード対象ブロックに対応する予測信号（動き補償フレーム間予測信号またはフレーム内予測信号）が加算され、これによってデコード対象ブロックに対応する４ｘ４の画素値がデコードされる。 The 4 × 4 quantized DCT coefficients of each decoding target block are converted into 4 × 4 DCT coefficients (orthogonal transform coefficients) by the inverse quantization process by the inverse quantization unit 302. The 4 × 4 DCT coefficients are converted from frequency information into 4 × 4 pixel values by an inverse integer DCT (inverse orthogonal transform) process by the inverse DCT unit 303. This 4 × 4 pixel value is a prediction error signal corresponding to the decoding target block. This prediction error signal is sent to the adding unit 304, where a prediction signal (motion compensation inter-frame prediction signal or intra-frame prediction signal) corresponding to the decoding target block is added, and thereby a 4 × 4 pixel value corresponding to the decoding target block. Is decoded.

イントラ予測モードにおいては、モード切替スイッチ部３１１によってイントラ予測部３１０が選択され、これによってイントラ予測部３１０からのフレーム内予測信号が予測誤差信号に加算される。インター予測モードにおいては、モード切替スイッチ部３１１によって重み付き予測部３０９が選択され、これによって、動きベクトル予測部３０７、補間予測部３０８、および重み付き予測部３０９によって得られる動き補償フレーム間予測信号が予測誤差信号に加算される。 In the intra prediction mode, the intra prediction unit 310 is selected by the mode changeover switch unit 311, and thereby the intraframe prediction signal from the intra prediction unit 310 is added to the prediction error signal. In the inter prediction mode, the weighted prediction unit 309 is selected by the mode changeover switch unit 311, and thereby the motion compensation interframe prediction signal obtained by the motion vector prediction unit 307, the interpolation prediction unit 308, and the weighted prediction unit 309. Is added to the prediction error signal.

このように、デコード対象画面に対応する予測誤差信号に予測信号（動き補償フレーム間予測信号またはフレーム内予測信号）を加算してデコード対象画面をデコードする処理が所定のブロック単位で実行される。 As described above, the process of adding the prediction signal (motion-compensated inter-frame prediction signal or intra-frame prediction signal) to the prediction error signal corresponding to the decoding target screen and decoding the decoding target screen is executed in predetermined block units.

デコードされた各画面（ピクチャ）は、デブロッキングフィルタ部３０５によってデブロッキングフィルタ処理が施された後に、フレームメモリ３０６に格納される。このデブロッキングフィルタ部３０５は、例えば４ｘ４画素のブロック単位で、デコードされた各画面に対してブロックノイズを低減するためのデブロッキングフィルタ処理を施す。このデブロッキングフィルタ処理は、ブロック歪みが参照画像に含まれてしまい、これによってブロック歪みが復号画像に伝搬してしまうことを防止する。デブロッキングフィルタ処理のための処理量は膨大であり、ソフトウェアデコーダの全処理量の５０パーセントを占める場合もある。デブロッキングフィルタ処理は、ブロック歪みが生じやすい箇所に対してはより強いフィリタリングが施され、ブロック歪みが生じにくい箇所に対しては弱いフィリタリングが施されるように、適応的に実行される。デブロッキングフィルタ処理はループフィルタ処理によって実現されている。 Each decoded screen (picture) is subjected to deblocking filter processing by the deblocking filter unit 305 and then stored in the frame memory 306. The deblocking filter unit 305 performs a deblocking filter process for reducing block noise on each decoded screen, for example, in units of 4 × 4 pixel blocks. This deblocking filter processing prevents block distortion from being included in the reference image, and thereby block distortion propagates to the decoded image. The amount of processing for the deblocking filter processing is enormous and may occupy 50% of the total processing amount of the software decoder. The deblocking filter process is adaptively executed such that a stronger filtering is performed on a portion where block distortion is likely to occur, and a weak filtering is performed on a portion where block distortion is less likely to occur. The deblocking filter process is realized by a loop filter process.

そして、デブロッキングフィルタ処理された各画面は、フレームメモリ３０６から出力画像フレーム（または出力画像フィールド）として読み出される。また、動き補償フレーム間予測のための参照画像として使用されるべき各画面（参照画面）は、フレームメモリ３０６内に一定期間保持される。Ｈ．２６４／ＡＶＣ規格の動き補償フレーム間予測符号化においては、複数の画面を参照画面として使用することができる。このため、フレームメモリ３０６は、複数画面分の画像を記憶するための複数個のフレームメモリ部を備えている。 Each screen subjected to the deblocking filter processing is read out from the frame memory 306 as an output image frame (or output image field). Each screen (reference screen) to be used as a reference image for motion compensation inter-frame prediction is held in the frame memory 306 for a certain period. H. In motion compensation interframe predictive coding according to the H.264 / AVC standard, a plurality of screens can be used as reference screens. For this reason, the frame memory 306 includes a plurality of frame memory units for storing images for a plurality of screens.

動きベクトル予測部３０７は、デコード対象ブロックに対応する動きベクトル差分情報に基づいて、動きベクトル情報を生成する。補間予測部３０８は、デコード対象ブロックに対応する動きベクトル情報に基づいて、参照画面内の、整数精度の画素群および１／４画素精度の予測補間画素群から、動き補償フレーム間予測信号を生成する。１／４画素精度の予測補間画素の生成においては、６タップフィルタ（入力６つ、出力１つ）が用いられる。このため、高周波成分まで考慮した高精度の予測補間処理を実行できるが、その分、動き補償には多くの処理量が必要となる。 The motion vector prediction unit 307 generates motion vector information based on the motion vector difference information corresponding to the decoding target block. The interpolation prediction unit 308 generates a motion compensation inter-frame prediction signal from the integer precision pixel group and the quarter pixel precision prediction interpolation pixel group in the reference screen based on the motion vector information corresponding to the decoding target block. To do. A 6-tap filter (six inputs, one output) is used in the generation of the prediction interpolation pixel with 1/4 pixel accuracy. For this reason, it is possible to execute highly accurate predictive interpolation processing considering even high frequency components. However, a large amount of processing is required for motion compensation.

重み付け予測部３０９は、動き補償フレーム間予測信号に対して重み係数を乗じる処理を動き補償ブロック単位で実行することにより、重み付けされた動き補償フレーム間予測信号を生成する。この重み付け予測は、デコード対象画面の明るさを予測する処理である。この重み付け予測処理により、フェード・イン、フェード・アウトのように、明るさが時間の経過と共に変化する画像の画質を向上することができる。しかし、その分、ソフトウェアデコードに必要な処理量は増大する。 The weighted prediction unit 309 generates a weighted motion compensation inter-frame prediction signal by executing a process of multiplying the motion compensation inter-frame prediction signal by a weighting coefficient for each motion compensation block. This weighted prediction is a process for predicting the brightness of the decoding target screen. This weighted prediction process can improve the image quality of an image whose brightness changes over time, such as fade-in and fade-out. However, the amount of processing required for software decoding increases accordingly.

イントラ予測部３１０は、デコード対象画面からその画面内に含まれるデコード対象ブロックのフレーム内予測信号を生成するものである。このイントラ予測部３１０は、上述のフレーム内予測情報に従って画面内予測処理を実行して、デコード対象ブロックと同一画面内に存在する、当該デコード対象ブロックに近接する既にデコードされた他のブロック内の画素値からフレーム内予測信号を生成する。このフレーム内予測（イントラ予測）は、ブロック間の画素相関を利用して圧縮率を高める技術である。このフレーム内予測においては、フレーム内予測情報に従って、垂直予測（予測モード０）、水平予測（予測モード１）、平均値予測（予測モード３）、平面予測（予測モード４）を含む４種類の予測モードの内の一つが、フレーム内予測ブロック（例えば１６ｘ１６画素）単位で選択される。平面予測が選択される頻度は他のフレーム内予測モードよりも低いが、平面予測のために必要とされる処理量は、他のどのフレーム内予測モードの処理量よりも多い。 The intra prediction unit 310 generates an intra-frame prediction signal of a decoding target block included in the screen from the decoding target screen. The intra prediction unit 310 performs intra-screen prediction processing according to the intra-frame prediction information described above, and exists in the same screen as the decoding target block, in other already decoded blocks close to the decoding target block. An intra-frame prediction signal is generated from the pixel value. This intra-frame prediction (intra prediction) is a technique for increasing the compression rate using pixel correlation between blocks. In this intraframe prediction, four types including vertical prediction (prediction mode 0), horizontal prediction (prediction mode 1), average value prediction (prediction mode 3), and plane prediction (prediction mode 4) are determined according to the intraframe prediction information. One of the prediction modes is selected in units of intra-frame prediction blocks (for example, 16 × 16 pixels). The frequency with which plane prediction is selected is lower than in other intra-frame prediction modes, but the amount of processing required for plane prediction is greater than the processing amount in any other intra-frame prediction mode.

本実施形態においては、たとえコンピュータ１０の負荷が増大しても時間制約内に動画像ストリームをリアルタイムにデコードできるようにするために、コンピュータ１０の負荷に応じて、図４で説明したデコード処理（以下、通常デコード処理と称する）と、特殊デコード処理とを選択的に実行する。特殊デコード処理は、トップフィールドおよびボトムフィールドのいずれか一方のフィールドに対するデコード（図４のエントロピー復号よりも後の全ての処理）をスキップし、他方のフィールドのみをデコードするデコード処理である。 In the present embodiment, even if the load on the computer 10 increases, the decoding process (FIG. 4) described in FIG. Hereinafter, the normal decoding process) and the special decoding process are selectively executed. The special decoding process is a decoding process that skips decoding (all processes after entropy decoding in FIG. 4) for one of the top field and the bottom field and decodes only the other field.

以下、図５のフローチャートを参照して、ビデオ再生アプリケーションプログラム２０１によって実行されるデコード処理の手順を説明する。 Hereinafter, the procedure of the decoding process executed by the video playback application program 201 will be described with reference to the flowchart of FIG.

ビデオ再生アプリケーションプログラム２０１は、デコード処理の実行期間中、ＯＳに対してコンピュータ１０の現在の負荷を問い合わせることによってコンピュータ１０の現在の負荷を検出する処理を定期的に繰り返し実行する（ステップＳ１０１）。このステップＳ１０１においては、ビデオ再生アプリケーションプログラム２０１は、ＣＰＵ１１１の現在の使用率（プロセッサ使用率）と主メモリ１１３の現在の使用率（メモリ使用率）とをＯＳから取得する。 During the execution of the decoding process, the video playback application program 201 periodically and repeatedly executes a process for detecting the current load on the computer 10 by inquiring about the current load on the computer 10 from the OS (step S101). In this step S101, the video playback application program 201 acquires the current usage rate (processor usage rate) of the CPU 111 and the current usage rate (memory usage rate) of the main memory 113 from the OS.

そして、ビデオ再生アプリケーションプログラム２０１は、コンピュータ１０の現在の負荷量が所定の基準値よりも大きいかどうかによって、コンピュータ１０が高負荷状態であるかどうかを判別する（ステップＳ１０２）。ステップＳ１０２においては、例えば、ビデオ再生アプリケーションプログラム２０１は、現在のプロセッサ使用率が予め決められたプロセッサ基準使用率よりも大きいか否かを判別するとともに、現在のメモリ使用率が予め決められたメモリ基準使用率よりも大きいか否かを判別する。現在のプロセッサ使用率および現在のメモリ使用率のいずれか一方でもそれに対応する基準使用率よりも大きい場合には、ビデオ再生アプリケーションプログラム２０１は、コンピュータ１０が高負荷状態であると判定する。現在のプロセッサ使用率および現在のメモリ使用率の双方がそれらに対応する基準使用率以下ならば、ビデオ再生アプリケーションプログラム２０１は、コンピュータ１０が高負荷状態ではないと判定する。 Then, the video reproduction application program 201 determines whether or not the computer 10 is in a high load state depending on whether or not the current load amount of the computer 10 is larger than a predetermined reference value (step S102). In step S102, for example, the video playback application program 201 determines whether or not the current processor usage rate is larger than a predetermined processor reference usage rate, and the current memory usage rate is a predetermined memory. It is determined whether it is larger than the reference usage rate. If any one of the current processor usage rate and the current memory usage rate is larger than the corresponding reference usage rate, the video playback application program 201 determines that the computer 10 is in a high load state. If both the current processor usage rate and the current memory usage rate are equal to or lower than the corresponding reference usage rate, the video playback application program 201 determines that the computer 10 is not in a high load state.

コンピュータ１０が高負荷状態ではないならば（ステップＳ１０２のＮＯ）、ビデオ再生アプリケーションプログラム２０１は、ＣＰＵ１１１に実行させるべきデコード処理として上述の通常デコード処理を選択し、これによって図４で説明した一連の処理をＣＰＵ１１１上で実行する（ステップＳ１０３）。 If the computer 10 is not in a high load state (NO in step S102), the video playback application program 201 selects the above-described normal decoding process as the decoding process to be executed by the CPU 111, and thereby the series of processes described in FIG. Processing is executed on the CPU 111 (step S103).

通常デコード処理においては、圧縮符号化された動画像ストリームに含まれる符号化画面（ピクチャ）群それぞれのデコードが順次実行される。コンピュータ１０が高負荷状態にならない限り、つまりデコードパフォーマンスが低下しない限り、動画像ストリームは通常デコード処理によってデコードされる。 In the normal decoding process, decoding of each encoded screen (picture) group included in the compressed and encoded moving image stream is sequentially executed. Unless the computer 10 is in a high load state, that is, unless the decoding performance is deteriorated, the moving image stream is normally decoded by the decoding process.

一方、コンピュータ１０が高負荷状態であるならば（ステップＳ１０２のＹＥＳ）、ビデオ再生アプリケーションプログラム２０１は、ＣＰＵ１１１に実行させるべきデコード処理として上述の特殊デコード処理を選択し、これによってトップフィールドおよびボトムフィールドのいずれか一方のフィールドに対するデコードを省略したデコード処理をＣＰＵ１１１上で実行する（ステップＳ１０４，Ｓ１０５）。この特殊デコード処理においては、ビデオ再生アプリケーションプログラム２０１は、動画像ストリームに含まれるシンタックス情報を解析して、デコード対象画面の構造を判別する（ステップＳ１０４）。シンタックス情報は動画像ストリームのシーケンス構造を示す情報である。上述の動きベクトル情報、フレーム内予測情報、およびモード情報等もシンタックス情報の一部である。 On the other hand, if the computer 10 is in a high load state (YES in step S102), the video playback application program 201 selects the above-described special decoding process as the decoding process to be executed by the CPU 111, thereby the top field and the bottom field. A decoding process in which decoding for any one of the fields is omitted is executed on the CPU 111 (steps S104 and S105). In this special decoding process, the video playback application program 201 analyzes the syntax information included in the moving image stream to determine the structure of the decoding target screen (step S104). The syntax information is information indicating the sequence structure of the moving image stream. The above-described motion vector information, intraframe prediction information, mode information, and the like are also part of the syntax information.

デコード対象画面の構造が、フィールド画像、またはフィールドモードのマクロブロック適用型フレーム／フィールド（ＭＢＡＦＦ：ＭａｃｒｏｂｌｏｃｋＡｄａｐｔｉｖｅＦｒａｍｅ−Ｆｉｅｌｄ）画像である場合、ビデオ再生アプリケーションプログラム２０１は、トップフィールドおよびボトムフィールドのいずれか一方のフィールドのデコードを省略し、他方のフィールドのデコードのみを実行する（ステップＳ１０５）。 When the structure of the screen to be decoded is a field image or a macroblock adaptive frame-field (MBAFF) image in the field mode, the video playback application program 201 is one of the top field and the bottom field. The decoding of one field is omitted, and only the decoding of the other field is executed (step S105).

ＭＢＡＦＦ画像はＭＢＡＦＦ符号化によって符号化された画面である。ＭＢＡＦＦ画像においては、１つの画面内に、フィールドモードで符号化されたマクロブロックペア（フィールドモードマクロブロックペア）とフレームモードモードで符号化されたマクロブロックペア（フレームモードマクロブロックペア）とを混在することができる。フィールドモードマクロブロックペアにおいては、一方のマクロブロックがトップフィールドに対応する画像から構成され、他方のマクロブロックがボトムフィールドに対応する画像から構成されている。つまり、フィールドモードマクロブロックペアにおいては、１６画素ｘ３２ラインのフレーム画像領域内の１６本の奇数ラインに対応する画像は一方のマクロブロックに集められており、また１６画素ｘ３２ラインのフレーム画像領域内の１６本の偶数ラインに対応する画像は他方のマクロブロックに集められている。ステップＳ１０５においては、各フィールドモードマクロブロックペアの中のトップフィールドおよびボトムフィールドのいずれか一方に対応するマクロブロックに対するデコード処理の実行がスキップされ、他方のマクロブロックに対するデコード処理のみが実行される。これにより、符号化画面がフィールド符号化された画像でない場合にも、トップフィールドおよびボトムフィールドのいずれか一方のフィールドのデコードを省略することができる。 The MBAFF image is a screen encoded by MBAFF encoding. In an MBAFF image, a macroblock pair encoded in the field mode (field mode macroblock pair) and a macroblock pair encoded in the frame mode mode (frame mode macroblock pair) are mixed in one screen. can do. In the field mode macroblock pair, one macroblock is composed of an image corresponding to the top field, and the other macroblock is composed of an image corresponding to the bottom field. In other words, in the field mode macroblock pair, images corresponding to 16 odd lines in the frame image area of 16 pixels × 32 lines are collected in one macroblock, and in the frame image area of 16 pixels × 32 lines. The images corresponding to the 16 even lines are collected in the other macroblock. In step S105, execution of the decoding process for the macroblock corresponding to one of the top field and the bottom field in each field mode macroblock pair is skipped, and only the decoding process for the other macroblock is executed. Thereby, even when the encoding screen is not a field-encoded image, decoding of one of the top field and the bottom field can be omitted.

このようにトップフィールドおよびボトムフィールドのいずれか一方のフィールドのデコード処理を省略することにより、ソフトウェアデコードに必要な処理量は大幅に低下する。よって、ソフトウェアデコードの実行中にたとえ他のプログラムが実行されてコンピュータ１０が高負荷状態となっても、コマ落ちの発生や、オブジェクトの動きが極端に遅くなるなどの不具合を招くことなく、動画像データのデコードおよび再生をスムーズに継続して実行することができる。なお、特殊デコード処理によってデコードされた動画像をコンピュータ１０の表示画面上に表示する場合には、デコードされたトップフィールドおよびボトムフィールドの一方から、例えば補間処理等によってフレーム画像を生成すればよい。 In this way, by omitting the decoding process of either the top field or the bottom field, the processing amount required for software decoding is greatly reduced. Therefore, even if another program is executed during software decoding and the computer 10 is in a heavy load state, the moving image does not cause a problem such as frame dropping or extremely slow movement of the object. The decoding and reproduction of the image data can be executed smoothly and continuously. When a moving image decoded by the special decoding process is displayed on the display screen of the computer 10, a frame image may be generated from one of the decoded top field and bottom field by, for example, an interpolation process.

動画像ストリーム全てのデコードが完了するまで、上述のステップＳ１０１〜Ｓ１０４の処理は繰り返し実行される（ステップＳ１０６）。他のプログラムの実行が終了されること等によってコンピュータ１０の負荷が下がると、デコード処理は、特殊デコード処理から通常デコード処理に再び切り替えられる。 Until the decoding of all the moving image streams is completed, the processes in steps S101 to S104 described above are repeatedly executed (step S106). When the load on the computer 10 decreases due to termination of execution of other programs, the decoding process is switched again from the special decoding process to the normal decoding process.

Ｈ．２６４のシーケンスの構造は、図６のように複数のアクセスユニット（ＡＵ）から構成されている。各アクセスユニットは１つの画面に対応している。各アクセスユニットは複数のＮＡＬ（ＮｅｔｗｏｒｋＡｂｓｔｒａｃｔｉｏｎＬａｙｅｒ）ユニットから構成されている。各ＮＡＬユニットは図７に示すようにヘッダ部とデータ部に分かれている。ＮＡＬユニットは図１１のように３２種類あり、ヘッダ部を解析することによってその種類を判別することができる。図８は図６のＡＵの構造に具体的なＮＡＬユニットの種類を当てはめて示した図である。図８中の各ブロックはＮＡＬユニットを示している。 H. The structure of the H.264 sequence is composed of a plurality of access units (AU) as shown in FIG. Each access unit corresponds to one screen. Each access unit is composed of a plurality of NAL (Network Abstraction Layer) units. Each NAL unit is divided into a header part and a data part as shown in FIG. There are 32 types of NAL units as shown in FIG. 11, and the types can be discriminated by analyzing the header part. FIG. 8 is a diagram showing a specific NAL unit type applied to the AU structure of FIG. Each block in FIG. 8 represents a NAL unit.

Ｈ．２６４／ＡＶＣでは、符号化画面の構造は、図１２に示すように、フィールド画像、フレーム画像、ＭＢＡＦＦ（フレーム画像）の３通り存在する。本実施形態では、ビデオ再生アプリケーションプログラム２０１は、符号化画面の構造がフィールド画像、フレーム画像、ＭＢＡＦＦ（フレーム画像）のいずれであるかを判別するために、図８に示されているSliceヘッダ中のfield_pic_flagと、図８に示されているシーケンスパラメタセットSPS中のmb_adaptive_frame_field_flagを参照する。また、ビデオ再生アプリケーションプログラム２０１は、各フィールド画像がトップフィールド画像およびボトムフィールド画像のどちらであるかを判別するために、図８に示されているSliceヘッダ中のbottom_field_flagを参照する。図１３に示すように、bottom_field_flag=0はトップフィールド画像であることを示し、bottom_field_flag=1はボトムフィールド画像であることを示す。また、ビデオ再生アプリケーションプログラム２０１は、各マクロブロックペアがフィールドモードマクロブロックペアおよびフレームモードマクロブロックペアのどちらであるかを判別するために、図８に示されているSliceヘッダ中のmb_field_decoding_flagを参照する。図１４に示すように、mb_field_decoding_flag=0はフレームモードマクロブロックペア（フレームＭＢペア）であることを示し、mb_field_decoding_flag=1はフィールドモードマクロブロックペア（フィールドＭＢペア）であることを示す。 H. In H.264 / AVC, as shown in FIG. 12, there are three types of coding screen structures: field images, frame images, and MBAFF (frame images). In the present embodiment, the video playback application program 201 determines whether the structure of the encoded screen is a field image, a frame image, or an MBAFF (frame image) in the Slice header shown in FIG. Field_pic_flag and mb_adaptive_frame_field_flag in the sequence parameter set SPS shown in FIG. 8 are referred to. The video playback application program 201 refers to the bottom_field_flag in the Slice header shown in FIG. 8 in order to determine whether each field image is a top field image or a bottom field image. As illustrated in FIG. 13, bottom_field_flag = 0 indicates a top field image, and bottom_field_flag = 1 indicates a bottom field image. Also, the video playback application program 201 refers to mb_field_decoding_flag in the Slice header shown in FIG. 8 in order to determine whether each macroblock pair is a field mode macroblock pair or a frame mode macroblock pair. To do. As shown in FIG. 14, mb_field_decoding_flag = 0 indicates a frame mode macroblock pair (frame MB pair), and mb_field_decoding_flag = 1 indicates a field mode macroblock pair (field MB pair).

次に、図９のフローチャートを参照して、特殊デコード処理の具体的な手順を説明する。 Next, a specific procedure of the special decoding process will be described with reference to the flowchart of FIG.

ビデオ再生アプリケーションプログラム２０１は、シンタックス情報を解析し（ステップＳ２０１）、その解析結果に応じてデコード処理を実行すべきか否かを判断する。具体的には、ビデオ再生アプリケーションプログラム２０１は、まず、field_pic_flagを参照して、デコード対象画面がフィールド画像およびフレーム画像のどちらであるかを判断する（ステップＳ２０２）。フィールド画像であれば（ステップＳ２０２のＮＯ）、ビデオ再生アプリケーションプログラム２０１は、bottom_field_flagを参照して、デコード対象画面がトップフィールドおよびボトムフィールドのどちらであるかを判別する（ステップＳ２０３）。デコード対象画面がトップフィールドであるならば（ステップＳ２０３のＹＥＳ）、ビデオ再生アプリケーションプログラム２０１は、デコード対象画面に対するデコード処理を実行する（ステップＳ２０４）。一方、デコード対象画面がボトムフィールドであるならば（ステップＳ２０３のＮＯ）、ビデオ再生アプリケーションプログラム２０１は、ステップＳ２０４のデコード処理の実行をスキップする。 The video playback application program 201 analyzes the syntax information (step S201), and determines whether or not to execute the decoding process according to the analysis result. Specifically, the video playback application program 201 first refers to field_pic_flag to determine whether the decoding target screen is a field image or a frame image (step S202). If it is a field image (NO in step S202), the video playback application program 201 refers to bottom_field_flag to determine whether the decoding target screen is a top field or a bottom field (step S203). If the decoding target screen is the top field (YES in step S203), the video playback application program 201 executes a decoding process on the decoding target screen (step S204). On the other hand, if the decoding target screen is the bottom field (NO in step S203), the video playback application program 201 skips the decoding process in step S204.

デコード対象画面がフレーム画像である場合には（ステップＳ２０２のＹＥＳ）、ビデオ再生アプリケーションプログラム２０１は、mb_adaptive_frame_field_flagを参照して、デコード対象画面がＭＢＡＦＦ画像であるか否かを判断する（ステップＳ２０５）。ＭＢＡＦＦ画像ではない場合、つまりデコード対象画面が通常のフレーム画像であるならば（ステップＳ２０５のＮＯ）、ビデオ再生アプリケーションプログラム２０１は、デコード対象画面に対するデコード処理を実行する（ステップＳ２０４）。 When the decoding target screen is a frame image (YES in step S202), the video playback application program 201 refers to mb_adaptive_frame_field_flag to determine whether or not the decoding target screen is an MBAFF image (step S205). If it is not an MBAFF image, that is, if the decoding target screen is a normal frame image (NO in step S205), the video playback application program 201 executes a decoding process on the decoding target screen (step S204).

一方、デコード対象画面がＭＢＡＦＦ画像であるならば（ステップＳ２０５のＹＥＳ）、ビデオ再生アプリケーションプログラム２０１は、mb_field_decoding_flagを参照して、デコード対象マクロブロックを含むマクロブロックペアの構造がフィールドＭＢペアおよびフレームＭＢペアのいずれであるかを判断する（ステップＳ２０６）。フレームＭＢペアであるならば（ステップＳ２０６のＮＯ）、ビデオ再生アプリケーションプログラム２０１は、デコード対象マクロブロックに対するデコード処理を実行する（ステップＳ２０４）。 On the other hand, if the decoding target screen is an MBAFF image (YES in step S205), the video playback application program 201 refers to mb_field_decoding_flag so that the structure of the macroblock pair including the decoding target macroblock is a field MB pair and a frame MB. It is determined which of the pair is present (step S206). If it is a frame MB pair (NO in step S206), the video playback application program 201 executes a decoding process on the decoding target macroblock (step S204).

フィールドＭＢペアであるならば（ステップＳ２０６のＹＥＳ）、ビデオ再生アプリケーションプログラム２０１は、デコード対象マクロブロックがトップフィールドおよびボトムフィールドのどちらに対応するマクロブロックであるかを判断する（ステップＳ２０７）。ＭＢＡＦＦにおいては、マクロブロック番号は、図１０に示すように画像の左上のマクロブロックから順に番号が割り当てられる。またフィールドＭＢペアの場合、トップ画像はＭＢペアのうち番号の小さい方のＭＢに集められ、ボトム画像は番号の大きい方のＭＢに集められる。つまりＭＢ番号が偶数であれば、そのＭＢはトップフィールドに対応するＭＢであることがわかる。デコード対象のＭＢがトップフィールドに対応するＭＢであれば（ステップＳ２０７のＹＥＳ）、ビデオ再生アプリケーションプログラム２０１は、そのデコード対象ＭＢに対してデコード処理を実行する（ステップＳ２０４）。一方、デコード対象のＭＢがボトムフィールドに対応するＭＢであれば（ステップＳ２０７のＮＯ）、ビデオ再生アプリケーションプログラム２０１は、そのデコード対象ＭＢに対するステップＳ２０４のデコード処理の実行をスキップする。 If it is a field MB pair (YES in step S206), the video playback application program 201 determines whether the decoding target macroblock is a macroblock corresponding to the top field or the bottom field (step S207). In MBAFF, macroblock numbers are assigned in order from the macroblock at the upper left of the image as shown in FIG. In the case of a field MB pair, the top image is collected in the MB with the smaller number of the MB pairs, and the bottom image is collected in the MB with the larger number. That is, if the MB number is an even number, the MB is an MB corresponding to the top field. If the decoding target MB is an MB corresponding to the top field (YES in step S207), the video playback application program 201 executes a decoding process on the decoding target MB (step S204). On the other hand, if the decoding target MB is an MB corresponding to the bottom field (NO in step S207), the video playback application program 201 skips the execution of the decoding process in step S204 for the decoding target MB.

なお、本実施形態では、ボトムフィールドのデコードを省略したが、ボトムフィールドの代わりにトップフィールドのデコードをスキップしてもよい。 In the present embodiment, decoding of the bottom field is omitted, but decoding of the top field may be skipped instead of the bottom field.

また、上述のデコード制御処理は全てコンピュータプログラムによって実現されているので、このコンピュータプログラムをコンピュータ読み取り可能な記憶媒体を通じて通常のコンピュータに導入するだけで、本実施形態と同様の効果を容易に実現することができる。 Further, since all the decoding control processes described above are realized by a computer program, the same effects as in the present embodiment can be easily realized simply by introducing the computer program into a normal computer through a computer-readable storage medium. be able to.

また、本実施形態のソフトウェアデコーダは、パーソナルコンピュータに限らず、ＰＤＡ、携帯型電話機等にも適用することができる。 Further, the software decoder of the present embodiment can be applied not only to a personal computer but also to a PDA, a mobile phone, and the like.

また、本発明は、上記実施形態そのままに限定されるものではなく、実施段階ではその要旨を逸脱しない範囲で構成要素を変形して具体化できる。また、上記実施形態に開示されている複数の構成要素の適宜な組み合わせにより種々の発明を形成できる。例えば、実施形態に示される全構成要素から幾つかの構成要素を削除してもよい。更に、異なる実施形態に構成要素を適宜組み合わせてもよい。 Further, the present invention is not limited to the above-described embodiments as they are, and can be embodied by modifying the constituent elements without departing from the scope of the invention in the implementation stage. In addition, various inventions can be formed by appropriately combining a plurality of components disclosed in the embodiment. For example, some components may be deleted from all the components shown in the embodiment. Furthermore, you may combine a component suitably in different embodiment.

本発明の一実施形態に係るコンピュータの概観を示す斜視図。The perspective view showing the general view of the computer concerning one embodiment of the present invention. 図１のコンピュータのシステム構成を示すブロック図。The block diagram which shows the system configuration | structure of the computer of FIG. 図１のコンピュータで用いられるビデオ再生アプリケーションプログラムの機能構成を示すブロック図。The block diagram which shows the function structure of the video reproduction application program used with the computer of FIG. 図３のビデオ再生アプリケーションプログラムによって実現されるソフトウェアデコーダの構成を示すブロック図。The block diagram which shows the structure of the software decoder implement | achieved by the video reproduction application program of FIG. 図３のビデオ再生アプリケーションプログラムによって実行されるデコード処理の手順を示すフローチャート。FIG. 4 is a flowchart showing a procedure of decoding processing executed by the video playback application program of FIG. 3. FIG. 図３のビデオ再生アプリケーションプログラムによってデコードされる動画像ストリームの構造を示す図。The figure which shows the structure of the moving image stream decoded by the video reproduction application program of FIG. 図６の動画像ストリームのＮＡＬユニットの構造を示す図。The figure which shows the structure of the NAL unit of the moving image stream of FIG. 図６の動画像ストリームのアクセスユニットの構造を示す図。The figure which shows the structure of the access unit of the moving image stream of FIG. 図３のビデオ再生アプリケーションプログラムによって実行される特殊デコード処理の手順を示すフローチャート。4 is a flowchart showing a procedure of special decoding processing executed by the video playback application program of FIG. 3. 図３のビデオ再生アプリケーションプログラムによってデコードされるＭＢＡＦＦ画像を説明するための図。The figure for demonstrating the MBAFF image decoded by the video reproduction application program of FIG. 図６の動画像ストリームに含まれるＮＡＬユニットの種類を説明するための図。The figure for demonstrating the kind of NAL unit contained in the moving image stream of FIG. 図６の動画像ストリームに含まれる画面の種類を説明するための図。The figure for demonstrating the kind of screen contained in the moving image stream of FIG. 図６の動画像ストリームに含まれるフィールド画像の種類を説明するための図。The figure for demonstrating the kind of field image contained in the moving image stream of FIG. 図６の動画像ストリームに含まれるマクロブロックペアの種類を説明するための図。The figure for demonstrating the kind of macroblock pair contained in the moving image stream of FIG.

Explanation of symbols

1０…コンピュータ、１１１…ＣＰＵ、１１３…メモリ、１１４…グラフィクスコントローラ、２０１…負荷検出モジュール、２１２…デコード制御モジュール、２１３…デコード実行モジュール、３０１…エントロピー復号部、３０２…逆量子化部、３０３…逆ＤＣＴ部、３０４…加算部、３０５…デブロッキングフィルタ部、３０６…フレームメモリ、３０７…動きベクトル予測部、３０８…補間予測部、３０９…重み付き予測部、３１０…イントラ予測部、３１１…モード切替スイッチ部。 DESCRIPTION OF SYMBOLS 10 ... Computer, 111 ... CPU, 113 ... Memory, 114 ... Graphics controller, 201 ... Load detection module, 212 ... Decode control module, 213 ... Decode execution module, 301 ... Entropy decoding part, 302 ... Inverse quantization part, 303 ... Inverse DCT unit, 304 ... adding unit, 305 ... deblocking filter unit, 306 ... frame memory, 307 ... motion vector prediction unit, 308 ... interpolation prediction unit, 309 ... weighted prediction unit, 310 ... intra prediction unit, 311 ... mode Changeover switch part.

Claims

In an information processing apparatus that executes a decoding process for decoding a compression-encoded moving image stream,
Load detecting means for detecting a load of the information processing apparatus;
When the load detected by the load detection unit is larger than a predetermined reference value, each encoded screen included in the moving image stream is based on syntax information included in the moving image stream. Means for determining whether or not,
An information processing apparatus comprising: control means for skipping execution of the decoding process for one of a top field and a bottom field when each encoded screen included in the moving image stream is a field image; .

When each encoded screen included in the moving image stream is a frame image, the control unit determines whether the frame image is a macroblock application type frame / field image based on the syntax information. And a macroblock application type frame / field image corresponding to one of a top field and a bottom field in each field mode macroblock pair included in the macroblock application type frame / field image. The information processing apparatus according to claim 1, further comprising means for skipping execution of the decoding process for a macroblock.

2. The information processing apparatus according to claim 1, wherein the load detecting means includes means for inquiring of an operating system executed on the information processing apparatus about a load of the information processing apparatus.

The information processing apparatus according to claim 1, wherein the load detection unit includes a unit that detects a load of the information processing apparatus based on a usage rate of a processor provided in the information processing apparatus.

The load detecting means includes means for detecting a load on the information processing device based on a usage rate of a processor provided in the information processing device and a usage rate of a memory provided in the information processing device. The information processing apparatus according to claim 1.

A program for causing a computer to execute a decoding process for decoding a compression-coded moving image stream,
A procedure for causing the computer to execute a process of detecting a load on the computer;
If the detected load is larger than a predetermined reference value, whether each encoded screen included in the moving image stream is a field image or a frame image based on syntax information included in the moving image stream A procedure for causing the computer to execute a process of determining whether or not;
And a procedure for causing the computer to execute a control process for skipping the execution of the decoding process for one of a top field and a bottom field when each encoded screen included in the moving image stream is a field image. A program characterized by

The control processing determines whether or not the frame image is a macroblock application type frame / field image based on the syntax information when each encoded screen included in the moving image stream is a frame image. If the frame / field image is a macroblock application type frame / field image, it corresponds to either the top field or the bottom field in each field mode macroblock pair included in the macroblock application type frame / field image. The program according to claim 6, further comprising a process of skipping execution of the decoding process for a macroblock.

The step of causing the computer to execute the process of detecting the load of the computer includes the step of causing the computer to execute a process of inquiring the operating system executed on the computer about the load of the computer. The program according to claim 6.

The procedure for causing the computer to execute a process for detecting the load on the computer includes a procedure for causing the computer to execute a process for detecting the load on the computer based on a usage rate of a processor provided in the computer. The program according to claim 6, wherein:

The procedure for causing the computer to execute a process for detecting the load on the computer includes a process for detecting the load on the computer based on a usage rate of a processor provided in the computer and a usage rate of a memory provided in the computer. The program according to claim 6, further comprising a procedure to be executed by the computer.