JP2011123894A

JP2011123894A - Antialiasing using multiple display heads of graphics processor

Info

Publication number: JP2011123894A
Application number: JP2010275767A
Authority: JP
Inventors: Duncan A Riach; エー．リアチダンカン; Brijesh Tripathi; トリパシブリジェッシュ; Brett T Hannigan; ティー．ハンニガンブレット; Philip Browning Johnson; ブラウニングジョンソンフィリップ; Brian M Kelleher; エム．ケラハーブライアン; Franck R Diard; アール．ディアードフランク
Original assignee: Nvidia Corp
Current assignee: Nvidia Corp
Priority date: 2006-05-12
Filing date: 2010-12-10
Publication date: 2011-06-23
Anticipated expiration: 2027-05-14
Also published as: JP5116125B2; CN102693712B; TWI343020B; TW200816039A; CN102693712A

Abstract

<P>PROBLEM TO BE SOLVED: To provide the antialiasing of image data using multiple display heads of one graphic processor. <P>SOLUTION: Two display heads 206 of the same graphics processor 122 are coupled to each other in a master/slave configuration via a pixel transfer path. The "master" display head receives pixels from the "slave" display head in addition to its own pixels, and pixel selection logic in the master display head blends the two pixels or select one to the exclusion of the other. When the two pixels correspond to different sampling locations in the same display pixel, the blended pixel is an antialiased pixel. <P>COPYRIGHT: (C)2011,JPO&INPIT

Description

Cross-reference of related applications

[0001]本願は、２００６年５月１２日に出願された「ＡｎｔｉａｌｉａｓｉｎｇＵｓｉｎｇＭｕｌｔｉｐｌｅＤｉｓｐｌａｙＨｅａｄｓｏｆａＧｒａｐｈｉｃｓ
Ｐｒｏｃｅｓｓｏｒ」と題する米国仮出願第６０／７４７，１５４号、および同一出願人による同時係属中の２００６年５月１２日に出願された「ＤｉｓｔｒｉｂｕｔｅｄＡｎｔｉａｌｉａｓｉｎｇｉｎａＭｕｌｔｉｐｒｏｃｅｓｓｏｒＧｒａｐｈｉｃｓ
Ｓｙｓｔｅｍ」と題する米国特許出願第１１／３８３，０４８号の利益を主張するものである。 [0001] This application is filed on May 12, 2006, entitled “Antialiasing Using Multiple Display Heads of a Graphics.
US Provisional Application No. 60 / 747,154 entitled "Processor" and "Distributed Antialiasing in a Multiprocessor Graphics" filed May 12, 2006, co-pending by the same applicant.
Claims the benefit of US patent application Ser. No. 11 / 383,048 entitled “System”.

Background of the Invention

[0002]本発明は、概してコンピュータグラフィックスに関し、特にグラフィックスプロセッサの複数のディスプレイヘッドを用いた画像データのアンチエイリアシングに関する。 [0002] The present invention relates generally to computer graphics, and more particularly to anti-aliasing of image data using multiple display heads of a graphics processor.

[0003]当分野で既知のように、コンピュータ生成画像は、画像データを離散カラーサンプル（画素）のアレイへと変換するのに用いる有限サンプリング解像度から生じる様々なビジュアルアーチファクトに影響を受けやすい。一般に「エイリアシング」と呼ばれるこのようなアーチファクトには、平滑なラインのジャギー、規則的なパターンのムラ等が含まれる。 [0003] As is known in the art, computer-generated images are susceptible to various visual artifacts that result from the finite sampling resolution used to convert image data into an array of discrete color samples (pixels). Such artifacts commonly referred to as “aliasing” include smooth line jaggy, regular pattern irregularities, and the like.

[0004]エイリアシングを減らすために、カラーを「オーバーサンプリング」する、すなわち最終（例えばディスプレイまたは記憶）画像を構成する画素数を上回る数のサンプリング位置でサンプリングすることが多い。例えば、画素数の２倍または４倍で画像をサンプリングすることもある。当分野では、各サンプリング位置を別個の画素として扱うスーパーサンプリングや、画素の少なくとも一部をカバーする基本形状毎に１つのカラー値を計算するが、この基本形状による画素のカバレージは複数の位置で決定するマルチサンプリングを含む、各種のオーバーサンプリングが既知である。 [0004] To reduce aliasing, colors are often "oversampled", that is, sampled at a number of sampling locations that exceeds the number of pixels that make up the final (eg, display or storage) image. For example, an image may be sampled at twice or four times the number of pixels. In this field, super-sampling in which each sampling position is treated as a separate pixel, or one color value is calculated for each basic shape covering at least a part of the pixel. The pixel coverage by this basic shape is calculated at a plurality of positions. Various oversamplings are known, including multisampling to determine.

[0005]アンチエイリアシング（ＡＡ）フィルタは、１画素あたり複数のサンプルを混合して１つのカラー値を決定する。従来、ＡＡフィルタは、画素を生成してフレームバッファに記憶するレンダリングパイプライン内、または、画素をフレームバッファから読み出してディスプレイ装置に送るディスプレイパイプライン内のいずれかに適用される。 An anti-aliasing (AA) filter mixes multiple samples per pixel to determine one color value. Conventionally, AA filters are applied either in a rendering pipeline that generates pixels and stores them in a frame buffer, or in a display pipeline that reads pixels from the frame buffer and sends them to a display device.

[0006]本発明の実施形態は、１つのグラフィックスプロセッサの複数のディスプレイヘッドを利用してアンチエイリアシングおよび他の処理タスクを行うシステムおよび方法を提供するものである。一実施形態では、同じグラフィックスプロセッサの２つのディスプレイヘッドが画素転送パスを介してマスター／スレーブ形式で互いに結合されている。「マスター」ディスプレイヘッドは、それ自体の画素に加えて「スレーブ」ディスプレイヘッドから画素を受信し、マスターディスプレイヘッド中の画素選択論理回路がこの２画素を混合するか、いずれか一方を選択して他方を除外する。２画素が同じ画像の異なるサンプリング位置に対応する場合には、混合した画素がＡＡフィルタ処理画素となる。 [0006] Embodiments of the present invention provide systems and methods that utilize multiple display heads of a single graphics processor to perform anti-aliasing and other processing tasks. In one embodiment, two display heads of the same graphics processor are coupled together in a master / slave fashion via a pixel transfer path. The “master” display head receives pixels from the “slave” display head in addition to its own pixels, and the pixel selection logic in the master display head mixes the two pixels and selects one of them. Exclude the other. When two pixels correspond to different sampling positions of the same image, the mixed pixel is an AA filter processing pixel.

[0007]本発明の一態様によれば、グラフィックス処理装置が、第１のディスプレイヘッドと、第２のディスプレイヘッドと、画素転送パスとを含む。第１のディスプレイヘッドは、第１の出力画素を生成するように構成され、集積回路内に配置される。第２のディスプレイヘッドは、第２の出力画素を生成するように構成されており、これも集積回路内に配置されている。第２のディスプレイヘッドは、外部画素を受信するように構成された第１の入力パスと、内部画素を受信するように構成された第２の入力パスと、上記第１の入力パスおよび上記第２の入力パスに結合され、上記外部画素と上記内部画素を混合して、上記混合画素を生成するように構成された画素合成器と、上記外部画素、上記内部画素、または上記混合画素の１つを第２の出力画素として選択するように構成された選択回路とを有利に含む。上記画素転送パスは、上記第１の出力画素が上記外部画素として上記第１の入力パスにより受信されるように、上記第１の出力画素を上記第１のディスプレイヘッドから上記第２のディスプレイヘッドの上記第１の入力パスへと送るように設定可能である。 [0007] According to one aspect of the invention, a graphics processing apparatus includes a first display head, a second display head, and a pixel transfer path. The first display head is configured to generate a first output pixel and is disposed in the integrated circuit. The second display head is configured to generate a second output pixel, which is also disposed in the integrated circuit. The second display head includes a first input path configured to receive external pixels, a second input path configured to receive internal pixels, the first input path, and the first input path. A pixel synthesizer coupled to two input paths and configured to mix the external pixel and the internal pixel to generate the mixed pixel; and one of the external pixel, the internal pixel, or the mixed pixel And a selection circuit configured to select one as the second output pixel. The pixel transfer path moves the first output pixel from the first display head to the second display head such that the first output pixel is received as the external pixel by the first input path. To the first input path.

[0008]いくつかの実施形態では、上記画素転送パスも集積回路内に配置される。他の実施形態では、上記画素転送パスの少なくとも一部が上記集積回路の外部にある。例えば、上記画素転送パスが取り外し可能なコネクタを含む。 [0008] In some embodiments, the pixel transfer path is also located in an integrated circuit. In other embodiments, at least a portion of the pixel transfer path is external to the integrated circuit. For example, the pixel transfer path includes a removable connector.

[0009]本発明の別の態様によれば、グラフィックスサブシステムが、画素出力コネクタおよび画素入力コネクタを有するグラフィックスアダプタを含む。グラフィックスプロセッサは、上記グラフィックスアダプタ上に実装することもできるが、上記画素出力コネクタに通信可能に結合された画素出力ポートと上記画素入力コネクタに通信可能に結合された画素入力ポートとを有する。グラフィックスサブシステムは、上記グラフィックスアダプタの上記画素出力コネクタを上記グラフィックスアダプタの上記画素入力コネクタに接続するように適合された取り外し可能なコネクタユニットも含む。 [0009] According to another aspect of the invention, a graphics subsystem includes a graphics adapter having a pixel output connector and a pixel input connector. A graphics processor may be implemented on the graphics adapter, but has a pixel output port communicatively coupled to the pixel output connector and a pixel input port communicatively coupled to the pixel input connector. . The graphics subsystem also includes a removable connector unit adapted to connect the pixel output connector of the graphics adapter to the pixel input connector of the graphics adapter.

[0010]本発明のまた別の態様によれば、画像を生成する方法が、グラフィックスプロセッサのレンダリングパイプラインを用いて、画像用の入力画素の第１セットおよび入力画素の第２セットをレンダリングするステップを含む。入力画素の第１セットをレンダリングするのに用いられる第１のレンダリング動作は、少なくとも一点で上記入力画素の第２セットをレンダリングするのに用いられる第２のレンダリング動作と異なる。例えば、上記２つのレンダリング動作は、各画素に適用されるサンプリングパターンに関して異なっても、またはレンダリングされる画像の視野域オフセットに関して異なってもよい。上記入力画素の第１セットが上記グラフィックスプロセッサの第１のディスプレイヘッドに送られ、上記入力画素の第２セットが上記グラフィックスプロセッサの第２のディスプレイヘッドに送られる。上記入力画素の第１セットは、さらに上記第１のディスプレイヘッドから上記第２のディスプレイヘッドに送られる。上記第２のディスプレイヘッドで、上記入力画素の第１セットおよび上記入力画素の第２セットの対応する画素が混合され出力画素のセットを生成する。 [0010] According to yet another aspect of the invention, a method for generating an image renders a first set of input pixels and a second set of input pixels for an image using a graphics processor's rendering pipeline. Including the steps of: The first rendering operation used to render the first set of input pixels differs from the second rendering operation used to render the second set of input pixels at least at one point. For example, the two rendering operations may differ with respect to the sampling pattern applied to each pixel or with respect to the field of view offset of the rendered image. The first set of input pixels is sent to the first display head of the graphics processor, and the second set of input pixels is sent to the second display head of the graphics processor. The first set of input pixels is further sent from the first display head to the second display head. In the second display head, corresponding pixels of the first set of input pixels and the second set of input pixels are mixed to generate a set of output pixels.

[0011]以下の詳細な説明が、添付の図面と併せて本発明の性質および利点のより良い理解を与えるであろう。 [0011] The following detailed description, together with the accompanying drawings, will provide a better understanding of the nature and advantages of the present invention.

Detailed Description of the Invention

[0020]本発明の実施形態は、１つのグラフィックスプロセッサの複数のディスプレイヘッドを利用してアンチエイリアシングおよび他の処理タスクを行うシステムおよび方法を提供するものである。一実施形態では、同じグラフィックスプロセッサの２つのディスプレイヘッドが画素転送パスを介してマスター／スレーブ形式で互いに結合されている。「マスター」ディスプレイヘッドは、それ自体の画素に加えて「スレーブ」ディスプレイヘッドから画素を受信し、マスターディスプレイヘッド中の画素選択論理回路がこの２画素を混合するか、いずれか一方を選択して他方を除外する。２画素が同じ画像の異なるサンプリング位置に対応する場合には、混合した画素がＡＡフィルタ処理画素となる。[システム概観] [0020] Embodiments of the present invention provide systems and methods that utilize multiple display heads of a graphics processor to perform anti-aliasing and other processing tasks. In one embodiment, two display heads of the same graphics processor are coupled together in a master / slave fashion via a pixel transfer path. The “master” display head receives pixels from the “slave” display head in addition to its own pixels, and the pixel selection logic in the master display head mixes the two pixels and selects one of them. Exclude the other. When two pixels correspond to different sampling positions of the same image, the mixed pixel is an AA filter processing pixel. [System overview]

[0021]図１は、本発明の実施形態によるコンピュータシステム１００のブロック図である。コンピュータシステム１００は、中央演算処理装置（ＣＰＵ）１０２と、ノースブリッジチップ等のメモリブリッジ１０５を含むバスパスを介して通信するシステムメモリ１０４とを含む。メモリブリッジ１０５は、バスまたは他の通信パス１０６を介してサウスブリッジチップ等のＩ／Ｏ（入力／出力）ブリッジ１０７と接続されている。Ｉ／Ｏブリッジ１０７は、一又は複数のユーザ入力デバイス１０８（キーボード、マウス等）からユーザ入力を受信し、バス１０６およびメモリブリッジ１０５を介してその入力をＣＰＵ１０２へと転送する。ビジュアル出力は、バスまたは他の通信パス１１３を介してメモリブリッジ１０５に結合されたグラフィックスサブシステム１１２の制御下で動作する画素ベースディスプレイ装置１１０（従来のＣＲＴまたはＬＣＤベースモニタ）上で提供される。システムディスク１１４は、Ｉ／Ｏブリッジ１０７にも接続されている。スイッチ１１６は、Ｉ／Ｏブリッジ１０７とネットワークアダプタ１１８および各種アドインカード１２０、１２１等の他のコンポーネントとの間を接続する。ＵＳＢまたは他のポート接続、ＣＤドライブ、ＤＶＤドライブ等の他のコンポーネント（明示せず）をＩ／Ｏブリッジ１０７に接続することもできる。各種コンポーネント間の通信パスは、ＰＣＩ（ＰｅｒｉｐｈｅｒａｌＣｏｍｐｏｎｅｎｔＩｎｔｅｒｃｏｎｎｅｃｔ）、ＰＣＩＥｘｐｒｅｓｓ（ＰＣＩ−Ｅ）、ＡＧＰ（ＡｃｃｅｌｅｒａｔｅｄＧｒａｐｈｉｃｓＰｏｒｔ）、ＨｙｐｅｒＴｒａｎｓｐｏｒｔ、または任意の他のバスまたはポイントツーポイントプロトコルを用いて実施することができ、異なるデバイス間の接続は当分野で既知の異なるプロトコルを用いることができる。 [0021] FIG. 1 is a block diagram of a computer system 100 according to an embodiment of the invention. The computer system 100 includes a central processing unit (CPU) 102 and a system memory 104 that communicates via a bus path including a memory bridge 105 such as a north bridge chip. The memory bridge 105 is connected to an I / O (input / output) bridge 107 such as a south bridge chip via a bus or other communication path 106. The I / O bridge 107 receives user input from one or more user input devices 108 (keyboard, mouse, etc.), and transfers the input to the CPU 102 via the bus 106 and the memory bridge 105. The visual output is provided on a pixel-based display device 110 (conventional CRT or LCD-based monitor) that operates under the control of a graphics subsystem 112 coupled to the memory bridge 105 via a bus or other communication path 113. The The system disk 114 is also connected to the I / O bridge 107. The switch 116 connects between the I / O bridge 107 and other components such as the network adapter 118 and various add-in cards 120 and 121. Other components (not explicitly shown) such as USB or other port connection, CD drive, DVD drive, etc. can also be connected to the I / O bridge 107. The communication path between the various components may be implemented using PCI (Peripheral Component Interconnect), PCI Express (PCI-E), AGP (Accelerated Graphics Port), HyperTransport, or any other bus or point-to-point protocol. The connection between different devices can use different protocols known in the art.

[0022]グラフィックスサブシステム１１２は、Ｎ個（一又は複数）のグラフィックス処理装置（ＧＰＵ）１２２を含んでいる。（本明細書では、同様の物の複数の例は、その物を特定する参照番号および必要に応じてその例を特定する括弧付き番号で示す。）各ＧＰＵ１２２は、関連付けられたグラフィックスメモリ１２４を有する。ＧＰＵ１２２およびグラフィックスメモリ１２４は、例えば、プログラマブルプロセッサ、特定用途向け集積回路（ＡＳＩＣ）およびメモリデバイス等の一又は複数の集積回路デバイスを用いて実施することができる。いくつかの実施形態では、ＧＰＵ１２２およびグラフィックスメモリ１２４が、システム１００の拡張スロット（ＰＣＩ−Ｅスロット等）に挿入または当該拡張スロットから取り外し可能な一又は複数の拡張カードまたは他のアダプタで実施される。任意の数ＮのＧＰＵ１２２を用いることができる。 The graphics subsystem 112 includes N (one or more) graphics processing units (GPUs) 122. (In this document, examples of similar objects are indicated by a reference number identifying the object and, optionally, parenthesized numbers identifying the example.) Each GPU 122 is associated with an associated graphics memory 124. Have GPU 122 and graphics memory 124 may be implemented using one or more integrated circuit devices such as, for example, programmable processors, application specific integrated circuits (ASICs), and memory devices. In some embodiments, the GPU 122 and graphics memory 124 are implemented with one or more expansion cards or other adapters that can be inserted into or removed from expansion slots (such as PCI-E slots) in the system 100. The Any number N of GPUs 122 may be used.

[0023]各ＧＰＵ１２２は、メモリブリッジ１０５およびバス１１３を介してＣＰＵ１０２および／またはシステムメモリ１０４から供給されるグラフィックスデータから画素データ（本明細書では「画素」ともいう）を生成することに関連する各種タスクを行うように構成することができ、各グラフィックスメモリ１２４と情報をやりとりして画素データ等を記憶したり更新したりする。例えば、ＧＰＵ１２２は、ＣＰＵ１０２上で実行する各種プログラムにより提供される２Ｄまたは３Ｄシーンデータから画素データを生成することができる。ＧＰＵ１２２は、メモリブリッジ１０５を介して受信した画素データをさらなる処理の有無を問わずグラフィックスメモリ１２４に書き込むこともできる。各ＧＰＵ１２２は、画素データをグラフィックスメモリ１２４から後述するＧＰＵ１２２の出力ポートへと送るように構成可能なスキャンアウトモジュール（本明細書ではディスプレイパイプラインともいう）も含んでいる。出力ポートは、モニタまたは別のＧＰＵ１２２に接続してもしなくてもよい。 [0023] Each GPU 122 is associated with generating pixel data (also referred to herein as "pixels") from graphics data supplied from CPU 102 and / or system memory 104 via memory bridge 105 and bus 113. It can be configured to perform various tasks, and exchanges information with each graphics memory 124 to store or update pixel data or the like. For example, the GPU 122 can generate pixel data from 2D or 3D scene data provided by various programs executed on the CPU 102. The GPU 122 can also write pixel data received via the memory bridge 105 into the graphics memory 124 with or without further processing. Each GPU 122 also includes a scan-out module (also referred to herein as a display pipeline) that can be configured to send pixel data from the graphics memory 124 to an output port of the GPU 122 described below. The output port may or may not be connected to a monitor or another GPU 122.

[0024]分散レンダリングモードでの動作のために、１つのＧＰＵ（例えばＧＰＵ１２２（０））がスキャンアウトされた画素を別のＧＰＵ（例えばＧＰＵ１２２（Ｎ−１））に送るように好適に構成され、その後者のＧＰＵ（例えばＧＰＵ１２２（Ｎ−１））は、それ自体のディスプレイパイプラインからの内部画素およびＧＰＵ１２２（０）から受信した外部画素間で選択する。３つ以上のＧＰＵ１２２を、スレーブＧＰＵ１２２がその画素を中間ＧＰＵ１２２に送るように「デイジーチェーン」式に相互接続可能であり、中間ＧＰＵ１２２は、それ自体の内部画素およびスレーブからの外部画素間で選択して、最終のマスターＧＰＵ（すなわちモニタに接続されたＧＰＵ）が最終の選択画素をディスプレイ装置に送るまで、選択した画素を別のＧＰＵ等に転送する。 [0024] For operation in a distributed rendering mode, one GPU (eg, GPU 122 (0)) is preferably configured to send scanned out pixels to another GPU (eg, GPU 122 (N-1)). The latter GPU (eg, GPU 122 (N-1)) selects between the internal pixels from its own display pipeline and the external pixels received from GPU 122 (0). More than two GPUs 122 can be interconnected in a “daisy chain” fashion so that the slave GPU 122 sends its pixels to the intermediate GPU 122, which selects between its own internal pixels and external pixels from the slaves. Thus, the selected pixel is transferred to another GPU or the like until the final master GPU (that is, the GPU connected to the monitor) sends the final selected pixel to the display device.

[0025]いくつかの実施形態では、任意のＧＰＵ１２２を任意の他のＧＰＵ１２２のスレーブとなれるように、物理的接続を何ら変えることなくＧＰＵ１２２の配置設定を調整することによってＧＰＵ１２２を互いに相互接続可能である。例えば、ＧＰＵ１２２は、単方向または双方向リングトポロジーで接続可能である。 [0025] In some embodiments, GPUs 122 can be interconnected with each other by adjusting the configuration settings of GPUs 122 without changing any physical connection so that any GPU 122 can be a slave of any other GPU 122. is there. For example, the GPU 122 can be connected in a unidirectional or bidirectional ring topology.

[0026]各種の分散レンダリングモードがサポート可能である。例えば、分割フレームレンダリングでは、同じ画像の異なる部分をレンダリングするために異なるＧＰＵ１２２が割り当てられ、交互フレームレンダリングでは、表示される一連の画像中の異なる画像に異なるＧＰＵ１２２が割り当てられる。本発明には、特定の分散レンダリングモードが必須ということはない。 [0026] Various distributed rendering modes can be supported. For example, in split frame rendering, different GPUs 122 are assigned to render different portions of the same image, and in alternating frame rendering, different GPUs 122 are assigned to different images in the displayed sequence of images. The invention does not require a particular distributed rendering mode.

[0027]本発明の実施形態によれば、ＧＰＵ１２２は、「外部分散」ＡＡモードでも動作可能である。このモードでは、ＧＰＵ１２２の画素選択論理回路が、内部画素および外部画素の一方を選択して他方を除外するのではなくこれらを混合する。内部画素および外部画素が異なるサンプリング位置で同じ画像を表す場合には、画素混合の結果がＡＡ解像動作（本明細書ではＡＡフィルタともいう）に相当する。本発明の別の実施形態によれば、１つのＧＰＵ１２２が、「内部分散」ＡＡモードでも動作可能である。このモードでは、ＧＰＵ１２２の画素選択論理回路が、同じＧＰＵ１２２の２つのディスプレイヘッドにより生成された画素を混合する。（外部分散モードおよび内部分散モードを含む）分散ＡＡモードの例および関連する画素選択論理回路については後述する。 [0027] According to embodiments of the present invention, GPU 122 is also operable in "externally distributed" AA mode. In this mode, the pixel selection logic of the GPU 122 selects one of the internal and external pixels and mixes them instead of excluding the other. When the internal pixel and the external pixel represent the same image at different sampling positions, the result of pixel mixing corresponds to an AA resolution operation (also referred to as an AA filter in this specification). According to another embodiment of the invention, one GPU 122 can also operate in “internally distributed” AA mode. In this mode, the pixel selection logic of the GPU 122 mixes the pixels generated by two display heads of the same GPU 122. Examples of distributed AA modes (including external and internal distribution modes) and associated pixel selection logic are described below.

[0028]いくつかの実施形態では、ＧＰＵ１２２のいくつかまたは全てを、複数のＧＰＵ１２２のうち異なるものが異なるディスプレイ装置用の画像をレンダリングする「独立レンダリング」モードでも動作可能とすることができる。独立レンダリングモードで異なるＧＰＵ１２２によってレンダリングされた画像を互いに関連させてもさせなくてもよい。当然のことながら、ＧＰＵ１２２は、上記または他のモードのいずれでも動作するように設定可能である。 [0028] In some embodiments, some or all of the GPUs 122 may be operable in an "independent rendering" mode in which different ones of the plurality of GPUs 122 render images for different display devices. Images rendered by different GPUs 122 in independent rendering mode may or may not be related to each other. Of course, the GPU 122 can be configured to operate in any of the above or other modes.

[0029]ＣＰＵ１０２は、システム１００のマスタープロセッサとして動作し、他のシステムコンポーネントの動作を制御および調整する。特に、ＣＰＵ１０２は、ＧＰＵ１２２の動作を制御するコマンドを発する。いくつかの実施形態では、ＣＰＵ１０２はＧＰＵ１２２用のコマンドストリームをコマンドバッファに書き込むが、これはシステムメモリ１０４、グラフィックスメモリ１２４、またはＣＰＵ１０２およびＧＰＵ１２２の両方にアクセス可能な別の記憶場所とすることができる。ＧＰＵ１２２は、コマンドバッファからコマンドストリームを読み出し、ＣＰＵ１０２の動作と非同期でコマンドを実行する。コマンドは、画像を生成するための従来のレンダリングコマンドと、ＣＰＵ１０２上で実行するアプリケーションがＧＰＵ１２２の画像生成に関連しないようなデータ処理のための処理能力を活用できるようにする汎用コンピュータコマンドとを含むことができる。 [0029] The CPU 102 operates as a master processor of the system 100 and controls and coordinates the operation of other system components. In particular, the CPU 102 issues a command for controlling the operation of the GPU 122. In some embodiments, CPU 102 writes a command stream for GPU 122 to a command buffer, which may be system memory 104, graphics memory 124, or another storage location accessible to both CPU 102 and GPU 122. it can. The GPU 122 reads the command stream from the command buffer and executes the command asynchronously with the operation of the CPU 102. The commands include conventional rendering commands for generating images and general-purpose computer commands that allow applications running on the CPU 102 to take advantage of processing capabilities for data processing that are not related to GPU 122 image generation. be able to.

[0030]本明細書に示すシステムは例示的であって、変更および修正が可能であることが理解されよう。ブリッジの数および配列を含む相互接続トポロジーは、所望のとおりに修正可能である。例えば、いくつかの実施形態では、システムメモリ１０４がブリッジを介するのではなく直接ＣＰＵ１０２に接続され、他のデバイスがメモリブリッジ１０５およびＣＰＵ１０２を介してシステムメモリ１０４と通信する。他の代替的なトポロジーでは、グラフィックスサブシステム１１２が、メモリブリッジ１０５ではなくＩ／Ｏブリッジ１０７に接続される。また他の実施形態では、Ｉ／Ｏブリッジ１０７およびメモリブリッジ１０５を１つのチップ内に集積することもできる。本明細書で示す特定のコンポーネントは任意であって、例えば、任意の数のアドインカードまたは周辺デバイスをサポートすることもできる。いくつかの実施形態では、スイッチ１１６をなくし、ネットワークアダプタ１１８およびアドインカード１２０、１２１が直接Ｉ／Ｏブリッジ１０７に接続する。 [0030] It will be appreciated that the system shown herein is illustrative and that changes and modifications are possible. The interconnect topology, including the number and arrangement of bridges, can be modified as desired. For example, in some embodiments, system memory 104 is connected directly to CPU 102 rather than via a bridge, and other devices communicate with system memory 104 via memory bridge 105 and CPU 102. In other alternative topologies, the graphics subsystem 112 is connected to the I / O bridge 107 instead of the memory bridge 105. In another embodiment, the I / O bridge 107 and the memory bridge 105 can be integrated in one chip. The particular components shown herein are optional, and may support any number of add-in cards or peripheral devices, for example. In some embodiments, switch 116 is eliminated and network adapter 118 and add-in cards 120, 121 connect directly to I / O bridge 107.

[0031]システム１００の他の部分へのＧＰＵ１２２の接続も変更することができる。いくつかの実施形態では、グラフィックスシステム１１２が、システム１００の拡張スロットに挿入可能な一又は複数の拡張またはアドインカードとして実施される。他の実施形態では、ＧＰＵがメモリブリッジ１０５またはＩ／Ｏブリッジ１０７等のバスブリッジとともに１つのチップ上に集積される。 [0031] The connection of the GPU 122 to other parts of the system 100 may also be changed. In some embodiments, graphics system 112 is implemented as one or more expansion or add-in cards that can be inserted into expansion slots of system 100. In other embodiments, the GPU is integrated on a single chip with a bus bridge such as the memory bridge 105 or I / O bridge 107.

[0032]各ＧＰＵには、任意の量のローカルグラフィックスメモリを設けることができ（ローカルメモリ無しの状況を含む）、ローカルメモリとシステムメモリを任意の組み合わせで用いることができる。例えば、統合メモリアーキテクチャ（ＵＭＡ）実施形態では、専用グラフィックスメモリデバイスは設けないが、ＧＰＵのいくつかまたは全てがシステムメモリを専用またはほとんど専用で用いる。ＵＭＡ実施形態では、ＧＰＵをバスブリッジチップに集積するか、またはＧＰＵをブリッジチップおよびシステムメモリに接続する高速バス（例えばＰＣＩ−Ｅ）を持った個別チップとして設けることができる。 [0032] Each GPU can be provided with any amount of local graphics memory (including situations without local memory), and local memory and system memory can be used in any combination. For example, in a unified memory architecture (UMA) embodiment, no dedicated graphics memory device is provided, but some or all of the GPUs use system memory exclusively or almost exclusively. In UMA embodiments, the GPU can be integrated into a bus bridge chip, or it can be provided as a separate chip with a high speed bus (eg, PCI-E) that connects the GPU to the bridge chip and system memory.

[0033]加えて、本発明の態様を具現化するＧＰＵは、汎用コンピュータシステム、ビデオゲームコンソールおよび他の特殊用途コンピュータシステム、ＤＶＤプレーヤー、携帯電話または携帯情報端末等のハンドヘルドデバイス等の各種デバイスに組み込むことができる。[複数のディスプレイヘッドを備えたＧＰＵ] [0033] In addition, GPUs embodying aspects of the present invention may be used in various devices such as general purpose computer systems, video game consoles and other special purpose computer systems, handheld devices such as DVD players, mobile phones or personal digital assistants. Can be incorporated. [GPU with multiple display heads]

[0034]図２は、本発明の実施に使用可能なＧＰＵ１２２内の画素出力パスのブロック図である。マルチＧＰＵグラフィックスシステムが本発明の実施形態には必要ない場合もあるが、ＧＰＵ１２２は、このようなシステムで使用可能なように好適に構成されている。 [0034] FIG. 2 is a block diagram of a pixel output path within GPU 122 that can be used to implement the present invention. Although a multi-GPU graphics system may not be required for embodiments of the present invention, GPU 122 is preferably configured for use in such a system.

[0035]特に、図２に示すように、ＧＰＵ１２２は、メモリインターフェース２０４に結合されたディスプレイ（またはスキャンアウト）パイプライン２０２を含んでいる。ディスプレイパイプライン２０２は、ディスプレイヘッド２０６ａ（「ヘッドＡ」）およびディスプレイヘッド２０６ｂ（「ヘッドＢ」）にも結合されている。ＧＰＵ１２２は、デジタル出力ポート２１０、２１１およびアナログ出力ポート２１２、２１３を含む複数の出力ポート２１０〜２１３を有する。ＧＰＵ１２２は、別のＧＰＵまたは別の外部デジタルデバイスとの通信を含む様々な目的用に設定可能な２つの多目的入力／出力（ＭＩＯ）ポート２１４ａ（「ＭＩＯＡ」）および２１４ｂ（「ＭＩＯＢ」）も有している。ディスプレイヘッド２０６ａおよび２０６ｂは、クロスバー２２０を介してそれぞれ出力ポート２１０〜２１３およびＭＩＯポート２１４ａ、２１４ｂに接続されている。 In particular, as shown in FIG. 2, GPU 122 includes a display (or scan-out) pipeline 202 coupled to memory interface 204. Display pipeline 202 is also coupled to display head 206a ("Head A") and display head 206b ("Head B"). The GPU 122 has a plurality of output ports 210 to 213 including digital output ports 210 and 211 and analog output ports 212 and 213. The GPU 122 also has two multipurpose input / output (MIO) ports 214a ("MIOA") and 214b ("MIOB") that can be configured for a variety of purposes, including communication with another GPU or another external digital device. is doing. The display heads 206a and 206b are connected to the output ports 210 to 213 and the MIO ports 214a and 214b via the crossbar 220, respectively.

[0036]メモリインターフェース２０４は、ＧＰＵ１２２により生成される画素データを記憶するメモリ（図２には図示せず）、例えば図１のグラフィックスメモリ１２４に結合されている。ディスプレイパイプライン２０２は、メモリインターフェース２０４と通信して記憶された画素データにアクセスする。ディスプレイパイプライン２０２は、画素データをディスプレイヘッド２０６ａ、２０６ｂのいずれかまたは両方に送る。いくつかの実施形態では、ディスプレイパイプライン２０２がディスプレイヘッド２０６ａ、２０６ｂに送る前に画素データに各種の処理動作を施すが、ディスプレイヘッド２０６ａに送られる画素データは、ディスプレイヘッド２０６ｂ宛の画素データとは異なって処理されてもされなくてもよい。加えて、処理およびディスプレイヘッド２０６ａへの送達用のディスプレイパイプライン２０２に提供される画素データは、処理およびディスプレイヘッド２０６ｂへの送達用のディスプレイパイプライン２０２に提供される画素データと同じでも異なってもよい。本発明には、ディスプレイパイプライン２０２およびメモリインターフェース２０４の特定の構成が必須ということはなく、詳細な説明は省略する。 [0036] The memory interface 204 is coupled to a memory (not shown in FIG. 2) that stores pixel data generated by the GPU 122, such as the graphics memory 124 of FIG. Display pipeline 202 communicates with memory interface 204 to access stored pixel data. The display pipeline 202 sends pixel data to either or both display heads 206a, 206b. In some embodiments, the display pipeline 202 performs various processing operations on the pixel data before sending it to the display heads 206a, 206b. The pixel data that is sent to the display head 206a includes pixel data addressed to the display head 206b and May or may not be treated differently. In addition, the pixel data provided to the display pipeline 202 for processing and delivery to the display head 206a is the same or different from the pixel data provided to the display pipeline 202 for processing and delivery to the display head 206b. Also good. In the present invention, specific configurations of the display pipeline 202 and the memory interface 204 are not essential, and a detailed description thereof will be omitted.

[0037]デジタル出力ポート２１０、２１１は、概して従来の設計のものとしてよく、画素データを修正してデジタル出力標準に準拠する回路を含むことができる。例えば、一実施形態では、ポート２１０、２１１のそれぞれが標準ＤＶＩ（ＤｉｇｉｔａｌＶｉｄｅｏＩｎｔｅｒｆａｃｅ）コネクタ用のＴＭＤＳ（ＴｒａｎｓｉｔｉｏｎＭｉｎｉｍｉｚｅｄＤｉｆｆｅｒｅｎｔｉａｌＳｉｇｎａｌｉｎｇ）を実施する。同様に、アナログ出力ポート２１２、２１３は、概して従来の設計のものとしてよく、例えば、多数の例が当分野で既知のあらゆるアナログビデオ標準に準拠するデジタル／アナログコンバータを含むことができる。本発明には、特定のデジタルまたはアナログ出力ポートの有無、その数または性質が必須ということはないことが理解されよう。 [0037] The digital output ports 210, 211 may generally be of conventional design and may include circuitry that modifies pixel data to comply with a digital output standard. For example, in one embodiment, each of the ports 210 and 211 implements TMDS (Transition Minimized Differential Signaling) for a standard DVI (Digital Video Interface) connector. Similarly, the analog output ports 212, 213 may generally be of conventional design and may include, for example, digital to analog converters, many examples compliant with any analog video standard known in the art. It will be appreciated that the present invention does not require the presence, number or nature of a particular digital or analog output port.

[0038]ＭＩＯＡポート２１４ａおよびＭＩＯＢポート２１４ｂは、ディスプレイヘッド２０６ａ、２０６ｂのいずれかにより生成された画素データをＧＰＵ１２２の出力ライン上へと送り出す出力ポートとして設定可能である。ＭＩＯＡポート２１４ａおよびＭＩＯＢポート２１４ｂは、ディスプレイヘッドＡ２０６ａまたはディスプレイヘッドＢ２０６ｂに外部画素データを送る入力ポートとしても設定可能である。いくつかの実施形態では、ＭＩＯＡポート２１４ａおよびＭＩＯＢポート２１４ｂをそれぞれ個別に入力ポートまたは出力ポートのいずれかとして設定可能である。ＭＩＯＡポート２１４ａおよびＭＩＯＢポート２１４ｂの設定は、システム起動中に決定するか、またはシステム動作中の様々な時点で動的に修正が可能である。例えば、各ＭＩＯポートは、ポート設定を特定する値を記憶する制御レジスターを含むことができ、新たな値を所望のとおりシステム起動時または他の時点でレジスターに書き込むことができる。 [0038] The MIOA port 214a and the MIOB port 214b can be set as output ports that send pixel data generated by either of the display heads 206a, 206b onto the output line of the GPU 122. The MIOA port 214a and the MIOB port 214b can also be set as input ports for sending external pixel data to the display head A 206a or the display head B 206b. In some embodiments, the MIOA port 214a and the MIOB port 214b can each be individually configured as either an input port or an output port. The settings of the MIOA port 214a and MIOB port 214b can be determined during system startup or dynamically modified at various times during system operation. For example, each MIO port can include a control register that stores a value that specifies the port setting, and a new value can be written to the register at system startup or at other times as desired.

[0039]ヘッドＡ２０６ａおよびヘッドＢ２０６ｂは、クロスバー２２０を介してそれぞれＭＩＯポート２１４ａ、２１４ｂだけでなく出力ポート２１０〜２１３に結合される。この実施形態では、クロスバー２２０が、ヘッドＡ２０６ａからポート２１０〜２１３、２１４ａまたは２１４ｂのいずれか１つへの任意の接続をサポートするように、また同時にヘッドＢ２０６ｂから現状ではクロスバー２２０によりヘッドＡ２０６ａに接続されていないポート２１０〜２１３、２１４ａまたは２１４ｂのいずれか１つへの任意の接続をサポートするように設定可能である。例えば、ＧＰＵ１２２が、ヘッド２０６ａ、２０６ｂから２つの異なるモニタに（例えば、デジタル出力ポート２１０、２１１および／またはアナログ出力ポート２１２、２１３のいずれか２つを介して）同時に画素データを送り出すことが可能である。あるいは、ＧＰＵ１２２が、ポート２１０〜２１３の１つを介してモニタに、またＭＩＯＡポート２１４ａまたはＭＩＯＢポート２１４ｂを介して別のＧＰＵに同時に画素データを送り出すことが可能である。例によっては、ディスプレイヘッド２０６ａ、２０６ｂの一方または両方をアイドル状態、すなわちどの出力ポートにも画素を送っていない状態にすることもできる。 [0039] Head A 206a and Head B 206b are coupled to output ports 210-213 as well as MIO ports 214a, 214b, respectively, via crossbar 220. In this embodiment, crossbar 220 supports any connection from head A 206a to any one of ports 210-213, 214a or 214b, and at the same time from head B 206b by crossbar 220 to head A 206a. Can be configured to support any connection to any one of ports 210-213, 214a, or 214b that are not connected to. For example, GPU 122 can send pixel data simultaneously from heads 206a, 206b to two different monitors (eg, via any two of digital output ports 210, 211 and / or analog output ports 212, 213). It is. Alternatively, the GPU 122 can send pixel data simultaneously to a monitor via one of the ports 210-213 and to another GPU via the MIOA port 214a or MIOB port 214b. In some examples, one or both of the display heads 206a, 206b can be idle, i.e., not sending pixels to any output port.

[0040]ＭＩＯポート２１４ａ、２１４ｂは、画素データをＧＰＵ１２２の別の１つから受信し、受信した画素データをディスプレイヘッド２０６ａ、２０６ｂ内へと通信するようにも設定可能である。各ＧＰＵ１２２は、各ディスプレイヘッド２０６ａ、２０６ｂ内にＭＩＯポート２１４ａ、２１４ｂの一方から受信した「外部」画素、それ自体のディスプレイパイプライン２０２から受信した「内部」画素、または内部画素および外部画素の組み合わせを選択するための画素選択論理回路（後述する）も有している。 [0040] The MIO ports 214a, 214b can also be configured to receive pixel data from another one of the GPUs 122 and communicate the received pixel data into the display heads 206a, 206b. Each GPU 122 has an “external” pixel received from one of the MIO ports 214a, 214b in each display head 206a, 206b, an “internal” pixel received from its own display pipeline 202, or a combination of internal and external pixels. A pixel selection logic circuit (to be described later).

[0041]いくつかの実施形態では、クロスバー２２０がシステム起動で設定され、また他の実施形態では、システム動作中に接続を変更可能なようにクロスバー２２０が動的に設定可能である。クロスバー２２０は、ＭＩＯポート２１４ａ、２１４ｂの一方で受信された入力された画素データをディスプレイヘッド２０６ａ、２０６ｂのいずれかに結合するように設定可能とすることもできる。 [0041] In some embodiments, the crossbar 220 is configured at system startup, and in other embodiments, the crossbar 220 can be dynamically configured so that connections can be changed during system operation. Crossbar 220 may also be configurable to couple input pixel data received on one of MIO ports 214a, 214b to either display head 206a, 206b.

[0042]図３Ａは、本発明の実施形態によるＧＰＵ１２２のディスプレイヘッド２０６ａ内の画素選択論理回路３００のブロック図である。当然のことながら、ディスプレイヘッド２０６ｂは類似の設計の画素選択論理回路を有することができる。いくつかの実施形態では、ＧＰＵ１２２の各ディスプレイヘッド２０６ａ、２０６ｂがそれ自体の画素選択論理回路３００を有している。 [0042] FIG. 3A is a block diagram of a pixel selection logic circuit 300 in the display head 206a of GPU 122 according to an embodiment of the present invention. Of course, the display head 206b can have a similar design of pixel selection logic. In some embodiments, each display head 206 a, 206 b of GPU 122 has its own pixel selection logic 300.

[0043]画素選択論理回路３００は、第１のパス３０２上で図２のディスプレイパイプライン２０２から内部画素を受信する。図２のＭＩＯＡポート２１４ａ（または、いくつかの実施形態ではＭＩＯＢポート２１４ｂ）が入力ポートとして設定される場合には、画素選択論理回路３００は第２のパス３０４上で外部画素も受信する。 [0043] The pixel selection logic 300 receives internal pixels from the display pipeline 202 of FIG. If the MIOA port 214a of FIG. 2 (or MIOB port 214b in some embodiments) is configured as an input port, the pixel selection logic 300 also receives external pixels on the second path 304.

[0044]外部画素および内部画素は、それぞれ画素合成回路３０６へと伝播され、これが外部画素および内部画素を混合して混合画素を生成する。画素合成回路３０６は、例えば従来の演算論理回路を用いて実施することができる。一実施形態では、画素合成回路３０６が、内部画素を多くの候補除数（例えば１、２、４等）の１つにより除する第１の除算回路３０８と、（除算後の）内部画素を外部画素に加算して合計画素を生成する加算回路３１０と、制御信号（ＰＳＥＬ１）に応答して内部画素および合計画素間で選択をする選択回路３１２と、選択された画素を多くの候補除数（例えば１、２等）の１つにより除する第２の除算回路３１４とを含んでおり、パス３１６上に混合画素としての結果を与える。 [0044] The external and internal pixels are each propagated to the pixel composition circuit 306, which mixes the external and internal pixels to produce a mixed pixel. The pixel synthesis circuit 306 can be implemented using, for example, a conventional arithmetic logic circuit. In one embodiment, the pixel synthesis circuit 306 includes a first divider circuit 308 that divides the internal pixel by one of a number of candidate divisors (eg, 1, 2, 4, etc.) and the internal pixel (after division) as an external An addition circuit 310 that adds to the pixels to generate a total pixel, a selection circuit 312 that selects between the internal pixels and the total pixels in response to a control signal (PSEL1), and a number of candidate divisors (for example, And a second divider circuit 314 that divides by one of the two, and gives the result as a mixed pixel on path 316.

[0045]パス３０４上の外部画素およびパス３１６上の混合画素は、選択回路３１８（マルチプレクサ等）に渡される。制御信号（ＰＳＥＬ２）に応答して、選択回路３１８が図２のクロスバー２２０に接続する出力パス３２０に送るために内部画素、混合画素、または外部画素のいずれかを選択する。 [0045] External pixels on path 304 and mixed pixels on path 316 are passed to selection circuit 318 (such as a multiplexer). In response to the control signal (PSEL2), the selection circuit 318 selects either the internal pixel, the mixed pixel, or the external pixel to send to the output path 320 connected to the crossbar 220 of FIG.

[0046]ＰＳＥＬ１およびＰＳＥＬ２信号は、ディスプレイヘッド２０６ａ中の制御論理回路（明示せず）により好適に生成される。いくつかの実施形態では、概して従来の設計のものとしてよいこの制御論理回路が、図１のＧＰＵ１０２上で実行するグラフィックスドライバプログラムにより生成される制御情報に応答する。例えば、除算回路３０８、３１４用の候補除数間で選択して特定用途向けの適当な加重平均を生成することによって、類似の制御情報を画素合成器３０８の動作を制御するためにも用いることができる。本発明の教示を利用可能な当業者であれば、ＰＳＥＬおよび画素合成器制御信号を生成するのに適した制御論理回路を実施可能なので、制御論理回路の詳細な説明は省略する。 [0046] The PSEL1 and PSEL2 signals are preferably generated by control logic (not explicitly shown) in the display head 206a. In some embodiments, this control logic, which may be of a generally conventional design, is responsive to control information generated by a graphics driver program executing on the GPU 102 of FIG. For example, similar control information can also be used to control the operation of the pixel synthesizer 308 by selecting between candidate divisors for the divider circuits 308, 314 to generate a suitable weighted average for a particular application. it can. Those skilled in the art who can utilize the teachings of the present invention can implement control logic suitable for generating PSEL and pixel synthesizer control signals, and thus a detailed description of the control logic is omitted.

[0047]図３Ｂは、本発明の代替的な実施形態による画素選択論理回路３５０のブロック図である。画素選択論理回路３５０は、図３Ａの画素選択論理回路３００と概ね同様であるが、ガンマ補正画素を混合する能力を持って好適に設計されている。画素選択論理回路３５０は、第１のパス３５２上で図２のディスプレイパイプライン２０２から内部画素を受信する。内部画素は画素合成器３５８へと伝播される。ＭＩＯＡポート２１４ａが入力ポートとして設定される場合には、画素選択論理回路３５０は第２のパス３５４上で外部画素も受信する。この外部画素も画素合成器３５８へと伝播される。 [0047] FIG. 3B is a block diagram of pixel selection logic 350 according to an alternative embodiment of the present invention. The pixel selection logic circuit 350 is generally similar to the pixel selection logic circuit 300 of FIG. 3A, but is suitably designed with the ability to mix gamma correction pixels. Pixel selection logic 350 receives internal pixels from display pipeline 202 of FIG. 2 on first path 352. Internal pixels are propagated to the pixel synthesizer 358. If the MIOA port 214a is set as an input port, the pixel selection logic circuit 350 also receives external pixels on the second path 354. This external pixel is also propagated to the pixel synthesizer 358.

[0048]画素合成器３５８は、内部画素および外部画素を混合し、結果として単一のパス３６０上に混合画素を供給する。画素合成器３５８は、図３Ａの除算回路３０６に類似の除算回路および／または図３Ａの加算回路３０８に類似の加算器を含むことができる。加えて（または代わりに）、画素合成器３５８は、パス３５２および３５４上で受信したガンマ補正画素を混合する演算論理回路を含むこともできる。当分野で既知のように、ガンマ補正は、例えば画素値Ｐを定数γについてＰ^γに変換することによりディスプレイ装置で線形強度応答を生成する非線形スケールに合わせて画素値を調整する。（典型的なディスプレイ装置については、定数γがおよそ２．０〜２．５である。）このような一実施形態では、γ≒２．２についてγ補正出力画素Ｐ_ｏ ^γが、以下の式を用いて算出可能である。
Ｐ_ｏ ^γ＝（４Ｐ_ｉ ^γ+４Ｐ_ｅ ^γ+｜Ｐ_ｉ ^γ-Ｐ_ｅ ^γ｜）／４（式１） [0048] The pixel synthesizer 358 mixes the internal and external pixels, resulting in a mixed pixel on a single pass 360. Pixel synthesizer 358 may include a divider circuit similar to divider circuit 306 in FIG. 3A and / or an adder similar to adder circuit 308 in FIG. 3A. In addition (or alternatively), pixel synthesizer 358 may also include arithmetic logic that mixes the gamma correction pixels received on paths 352 and 354. As is known in the art, gamma correction adjusts pixel values to a non-linear scale that produces a linear intensity response at the display device, for example, by converting pixel value P to P ^γ for a constant γ. (For a typical display device, the constant γ is approximately 2.0 to 2.5.) In such an embodiment, for γ≈2.2, the γ corrected output pixel P _o ^γ is It is possible to calculate using
P _o ^γ = (4P _i ^γ + 4P _e ^γ + | P _i ^γ -P _e ^γ |) / 4

[0049]ここで、Ｐ_ｉ ^γおよび４Ｐ_ｅ ^γは、パス３５２および３５４上に供給されるガンマ補正画素を表す。当業者であれば、厳密な結果に必要な計算ではなく簡単なハードウェアを用いて式１が許容可能な近似を与えることが分かるだろう。（例えば、４による乗算および除算はビットシフトとして実施可能である）また、他の近似で代用可能なことも理解されよう。 [0049] where P _i ^γ and 4P _e ^γ represent the gamma correction pixels provided on paths 352 and 354. One skilled in the art will recognize that Equation 1 provides an acceptable approximation using simple hardware rather than the calculations required for exact results. (For example, multiplication and division by 4 can be implemented as a bit shift.) It will also be appreciated that other approximations can be substituted.

[0050]図３Ｂを再び参照すると、選択マルチプレクサ３６２が、パス３５２上で内部画素を、パス３５４上で外部画素を、及び、パス３６０上で混合画素を受信する。画素選択信号（ＰＳＥＬ）に応答して、選択マルチプレクサ３６２は、図２のクロスバー２２０に接続する出力パス３６４へ送るためにこれらの候補画素の１つを選択する。出力パス３６４は、第２の除算回路３６６を含むことができるが、これは図３Ａの除算回路３１４と同様であってもよい。代替的な実施形態では、図３Ａに示すものと同様の２つの選択マルチプレクサを出力画素の選択に用いるが、除算回路３６６を第２の選択マルチプレクサ前方に配置してもよい。他の選択回路構成も用いることができる。 [0050] Referring back to FIG. 3B, the selection multiplexer 362 receives internal pixels on path 352, external pixels on path 354, and mixed pixels on path 360. In response to the pixel selection signal (PSEL), the selection multiplexer 362 selects one of these candidate pixels for transmission to the output path 364 that connects to the crossbar 220 of FIG. The output path 364 can include a second divider circuit 366, which may be similar to the divider circuit 314 of FIG. 3A. In an alternative embodiment, two selection multiplexers similar to those shown in FIG. 3A are used for output pixel selection, but a divider circuit 366 may be placed in front of the second selection multiplexer. Other selection circuit configurations can also be used.

[0051]いくつかの実施形態では、画素合成器３５０は、画素がガンマ補正されない場合には簡単な加算を用いて、ガンマ補正画素および非ガンマ補正（線形）画素のいずれかに動作するように設定可能である。この設定は、起動動作中（システム起動時）にグラフィックスドライバにより有利に行われる。ガンマ補正フィルタが必要ないのは当然であるが、いくつかの実施形態では最終の画素選択後に任意のガンマ補正を適用することができる。[複数のＧＰＵを用いた分散アンチエイリアシング] [0051] In some embodiments, the pixel synthesizer 350 operates on either a gamma corrected pixel and a non-gamma corrected (linear) pixel using simple addition if the pixel is not gamma corrected. It can be set. This setting is advantageously performed by the graphics driver during the startup operation (system startup). Of course, no gamma correction filter is required, but in some embodiments, any gamma correction can be applied after final pixel selection. [Distributed anti-aliasing using multiple GPUs]

[0052]画素選択論理回路３００または画素選択論理回路３５０を備えた図２のＧＰＵ１２２は、２つ以上のＧＰＵ１２２を備えたシステムでの分散アンチエイリアシング動作について好適に使用可能である。図４Ａは、マスター／スレーブ読み出し構成の２つのＧＰＵ１２２（０）および１２２（１）を有するグラフィックスサブシステム４００の簡略ブロック図である。（明確にするために、アクティブポートとディスプレイヘッドのみを示す。この例では、ＧＰＵ１つにつきディスプレイヘッドが１つだけ用いられる。）スレーブＧＰＵ１２２（１）は、そのＭＩＯＡポート２１４ａ（１）が出力ポートとして設定され、マスターＧＰＵ１２２（０）は、そのＭＩＯＡポート２１４ａ（０）が入力ポートとして設定されている。ＭＩＯＡポート２１４ａ（１）は、ＭＩＯＡポート２１４ａ（０）に結合され、出力画素Ｐ_ｏ１をスレーブＧＰＵ１２２（１）からマスターＧＰＵ１２２（０）へと流す。 [0052] The GPU 122 of FIG. 2 with the pixel selection logic 300 or the pixel selection logic 350 can be suitably used for distributed anti-aliasing operations in a system with more than one GPU 122. FIG. 4A is a simplified block diagram of a graphics subsystem 400 having two GPUs 122 (0) and 122 (1) in a master / slave read configuration. (For clarity, only active ports and display heads are shown. In this example, only one display head is used per GPU.) Slave GPU 122 (1) has its MIOA port 214a (1) as its output port. The master GPU 122 (0) has its MIOA port 214a (0) set as an input port. MIOA port 214a (1) is coupled to the MIOA port 214a (0), passing the output pixel _{P o1} from the slave GPU 122 (1) and to the master GPU 122 (0).

[0053]スレーブＧＰＵ１２２（１）のヘッドＡ２０６ａ（１）は、スレーブＧＰＵ１２２（１）のディスプレイパイプライン２０２（１）により与えられた画素Ｐ_ｉ１を出力画素としてＭＩＯＡポート２１４ａ（１）に転送する。スレーブＧＰＵ１２２（１）からの出力画素Ｐ_ｏ１は、マスターＧＰＵ１２２（０）のＭＩＯＡポート２１４ａ（０）により受信され、これが画素をディスプレイヘッドＡ２０６ａ（０）に転送する。ヘッドＡ２０６ａ（０）では、画素選択論理回路３００（０）（図３の画素選択論理回路３００の一例）が、マスターＧＰＵ１２２（０）のディスプレイパイプライン２０２（０）からの内部画素Ｐｉ０、スレーブＧＰＵ１２２（１）のディスプレイパイプライン２０２（１）由来の外部画素Ｐ_ｏ１、または、加算回路３０８により供給される合計（例えば、Ｐ_ｉ０＋Ｐ_ｏ１）を選択するように動作する。 [0053] The head A 206a (1) of the slave GPU 122 (1) transfers the pixel P _i1 provided by the display pipeline 202 (1) of the slave GPU 122 (1) to the MIOA port 214a (1) as an output pixel. The output pixel P _o1 from the slave GPU 122 (1) is received by the MIOA port 214a (0) of the master GPU 122 (0), which transfers the pixel to the display head A 206a (0). In the head A 206 a (0), the pixel selection logic circuit 300 (0) (an example of the pixel selection logic circuit 300 in FIG. 3) is connected to the internal pixel Pi 0 and the slave GPU 122 from the display pipeline 202 (0) of the master GPU 122 (0). It operates to select the external pixel P _o1 derived from the display pipeline 202 (1) of (1) or the sum (eg, P _i0 + P _o1 ) supplied by the adder circuit 308.

[0054]マスターＧＰＵ１２２（０）のヘッドＡ２０６ａ（０）は、選択された画素（Ｐ_{ｆｉｎａｌ}）を出力ポート、この場合はデジタル出力ポート２１０（０）に送る。ＧＰＵ１２２（０）のヘッドＡ２０６ａ（０）がＭＩＯＡポート２１４ｂ（０）（図２Ｃに明示せず）に画素データを送るように設定することもでき、これを第３のＧＰＵ１２２のＭＩＯポートに接続し、それで第３のＧＰＵ１２２がＧＰＵ１２２（０）をマスターとすることが理解されよう。このように、任意の数のＧＰＵ１２２をデイジーチェーン式に接続することができる。 [0054] Head A 206a (0) of master GPU 122 (0) sends the selected pixel (P _final ) to an output port, in this case, digital output port 210 (0). The head A 206a (0) of the GPU 122 (0) can also be set to send pixel data to the MIOA port 214b (0) (not explicitly shown in FIG. 2C), which is connected to the MIO port of the third GPU 122. Thus, it will be appreciated that the third GPU 122 is mastered by GPU 122 (0). In this way, any number of GPUs 122 can be connected in a daisy chain manner.

[0055]本発明の実施形態によれば、ＧＰＵ１２２（０）および１２２（１）は、外部分散アンチエイリアシング（ＡＡ）モードで用いることが可能である。このモードでは、各ＧＰＵ１２２が、ＧＰＵ１２２（０）によって用いられるサンプリング位置がＧＰＵ１２２（１）によって用いられるサンプリング位置と異なるように、表示パラメータまたはサンプリングパラメータにいくらかの変化量をもって、同じ画像をレンダリングする。例えば、若干異なる表示域または表示面の法線を２つのＧＰＵ１２２について画成し、２つの画像の画素境界に小さいオフセットを作り出すこともできる。あるいは、画素内のサンプリング位置が（例えばグラフィックスドライバにより）設定可能な場合には、各ＧＰＵ１２２を同じ表示パラメータの組と各画素内の異なるサンプリング位置を用いるように設定することもできる。 [0055] According to embodiments of the present invention, GPUs 122 (0) and 122 (1) may be used in an external distributed anti-aliasing (AA) mode. In this mode, each GPU 122 renders the same image with some change in display parameters or sampling parameters such that the sampling position used by GPU 122 (0) is different from the sampling position used by GPU 122 (1). For example, a slightly different display area or display surface normal may be defined for the two GPUs 122 to create a small offset at the pixel boundary of the two images. Alternatively, if the sampling position within a pixel can be set (eg, by a graphics driver), each GPU 122 can be set to use the same set of display parameters and a different sampling position within each pixel.

[0056]分散ＡＡモードでは、ＧＰＵ１２２（０）のディスプレイヘッド２０６ａで画素選択論理回路３００（０）により受信された外部画素Ｐ_ｏ１および内部画素Ｐ_ｉ０が、最終画像の同じ画素についての異なるサンプリング位置に対応する。内部画素および外部画素を平均することによって、表示解像度の２倍のアンチエイリアシングが得られる。より具体的には、選択マルチプレクサ３１０が加算回路３０８により提供される画素合計Ｐ_ｏ１＋Ｐ_ｉ０を選択するように設定されており、除算回路３１６が選択された画素合計を２で除するように設定されているため、最終画素がＰ_{ｆｉｎａｌ}＝（Ｐ_ｏ１＋Ｐ_ｉ０）／２となる。このようにして、画素選択論理回路３００は２×ＡＡフィルタを実施可能である。 [0056] In the distributed AA mode, the external pixel P _o1 and the internal pixel P _i0 received by the pixel selection logic circuit 300 (0) at the display head 206a of the GPU 122 (0) are different sampling positions for the same pixel of the final image. Corresponding to By averaging the internal and external pixels, anti-aliasing twice the display resolution is obtained. More specifically, the selection multiplexer 310 is set to select the pixel sum P _o1 + P _i0 provided by the addition circuit 308, and the division circuit 316 is set to divide the selected pixel sum by two. _Therefore , the final pixel is P _final = (P _o1 + P _i0 ) / 2. In this way, the pixel selection logic circuit 300 can implement a 2 × AA filter.

[0057]ＧＰＵ１２２（０）および１２２（１）が、内部画素または外部画素の一方が他方を除外するように選択されるモードを含む他の分散レンダリングモードでも動作可能であることに留意されたい。特定の選択は、分散レンダリングモードの詳細、例えば、異なるＧＰＵ１２２が同じフレーム異なる部分と異なる連続的なフレームのいずれをレンダリングするかに依存することになるが、本発明とは関係ない。また、３つ以上のＧＰＵ１２２があれば、より高度なアンチエイリアシングを達成可能である。例えば、任意の数のＧＰＵ１２２をそれぞれのＭＩＯＡおよびＭＩＯＢポートを用いてデイジーチェーン式に接続可能であり、デイジーチェーン内の各ＧＰＵが、適切な重み係数を用いてそれ自体の内部画像を上流のＧＰＵから受信する外部画像と混合可能である。 [0057] Note that GPUs 122 (0) and 122 (1) are also operable in other distributed rendering modes, including a mode in which one of the internal or external pixels is selected to exclude the other. The particular choice will depend on details of the distributed rendering mode, for example whether different GPUs 122 render different parts of the same frame or different consecutive frames, but are not relevant to the present invention. If there are three or more GPUs 122, more advanced anti-aliasing can be achieved. For example, any number of GPUs 122 can be daisy-chained using their respective MIOA and MIOB ports, and each GPU in the daisy chain uses its appropriate weighting factor to transmit its own internal image to the upstream GPU. Can be mixed with external images received from

[0058]本明細書に記載のディスプレイヘッド、画素選択論理回路、および分散ＡＡ動作は例示的なものであり、変更および修正が可能であることが理解されよう。例えば、本明細書の除算回路は、少数の離散除数による除算をサポートするものである。他の実施形態では、広範囲のアンチエイリアシングフィルタがサポートされるように、除算回路が（任意に選択された除数を含む）より多くの離散除数をサポートすることもある。また、除算回路は、本明細書に記載のものとは異なる位置に配置することができ、除算回路の数を修正することもできる。例えば、除算回路を内部画素パスに加えて、またはその代わりに外部画素パス上に配置することもできる。 [0058] It will be appreciated that the display head, pixel selection logic, and distributed AA operation described herein are exemplary and can be changed and modified. For example, the division circuit of the present specification supports division by a small number of discrete divisors. In other embodiments, the divider circuit may support more discrete divisors (including arbitrarily selected divisors) so that a wide range of anti-aliasing filters are supported. In addition, the division circuits can be arranged at positions different from those described in this specification, and the number of division circuits can be modified. For example, a divider circuit can be placed on the external pixel path in addition to or instead of the internal pixel path.

[0059]選択回路３１８の特定の構成を修正することもできる。当業者であれば、内部画素および外部画素の両方から得られた内部画素、外部画素および混合画素間で制御可能に選択できるあらゆる回路素子または回路素子の組み合わせを選択回路として使用可能であることが分かるであろう。 [0059] The particular configuration of the selection circuit 318 may be modified. A person skilled in the art can use any circuit element or combination of circuit elements that can be controllably selected between internal pixels, external pixels and mixed pixels obtained from both internal and external pixels as a selection circuit. You will understand.

[0060]本明細書で用いるように、「画素」は、概して画像内のある位置でサンプリングされたカラー値のあらゆる表現またはこのような値（例えば図３の加算回路３０８により生成されたような）の組み合わせを指す。ＧＰＵでパイプラインをレンダリングすることにより、名目上の解像度（ここで解像度は画像内の画素数を指す）で画素が生成されるが、これがディスプレイ装置の解像度と一致することもしないこともある。いくつかの実施形態では、ディスプレイパイプラインが、名目上の解像度をディスプレイ解像度に変換するために必要とされるあらゆるアップフィルタリングまたはダウンフィルタリングを行う。 [0060] As used herein, a "pixel" is generally any representation of a color value sampled at a location in an image or such value (eg, as generated by the adder circuit 308 of FIG. 3). ). By rendering the pipeline with the GPU, pixels are generated at a nominal resolution (where resolution refers to the number of pixels in the image), which may or may not match the resolution of the display device. In some embodiments, the display pipeline performs any up-filtering or down-filtering required to convert nominal resolution to display resolution.

[0061]本明細書中のＭＩＯポートおよびディスプレイヘッドを本明細書中の「Ａ」および「Ｂ」とするラベリングは、単に説明の利便性のためである。当然のことながら、任意のＭＩＯポートを任意の他のＭＩＯポートに接続可能であり、そのポートが出力ポートとして設定される場合にいずれかのディスプレイヘッドがいずれかのＭＩＯポートを駆動可能である。加えて、いくつかのＧＰＵが２つ超または２つ未満のＭＩＯポートおよび／または２つ超または２つ未満のディスプレイヘッドを含むことができる。 [0061] The labeling of the MIO port and display head herein as "A" and "B" herein is for convenience of description only. Of course, any MIO port can be connected to any other MIO port, and any display head can drive any MIO port when that port is configured as an output port. In addition, some GPUs may include more than two or less than two MIO ports and / or more than two or less than two display heads.

[0062]一般に、１つのＧＰＵに別のＧＰＵとの画素データ通信を可能にする任意の１つのポートまたは複数のポートをＩ／Ｏポートとして用いて本発明を実施することができる。いくつかの実施形態では、上述のように、ＭＩＯポートが別のＧＰＵと通信する以外の目的のためにも設定可能である。例えば、ＭＩＯポートは、ＴＶエンコーダ等の各種外部デバイスと通信するように設定可能である。いくつかの実施形態では、ＤＶＯ（インテル社製ＤｉｇｉｔａｌＶｉｄｅｏＯｕｔｐｕｔＩｎｔｅｒｆａｃｅ）または他のビデオ出力標準をサポート可能である。いくつかの実施形態では、グラフィックスアダプタをアセンブルする際に各ＭＩＯポートの設定を決定する。システム起動時に、アダプタがシステムにそのＭＩＯポートの設定について通知する。他の実施形態では、ＭＩＯポートを専用入力または出力ポートを置き換えることもできる。 [0062] In general, the present invention can be implemented using any one port or multiple ports allowing one GPU to communicate pixel data with another GPU as an I / O port. In some embodiments, as described above, the MIO port can also be configured for purposes other than communicating with another GPU. For example, the MIO port can be set to communicate with various external devices such as a TV encoder. Some embodiments may support DVO (Intel Digital Video Output Interface) or other video output standards. In some embodiments, the settings for each MIO port are determined when assembling the graphics adapter. At system startup, the adapter informs the system about its MIO port settings. In other embodiments, the MIO port can be replaced with a dedicated input or output port.

[0063]Ｉ／Ｏポート、ディスプレイヘッド、および他のグラフィックスサブシステムのアスペクトの設定は、全てのグラフィックスプロセッサと通信するように構成されたシステム設定ユニットにより達成することができる。いくつかの実施形態では、システム設定ユニットが、マルチプロセッサグラフィックスサブシステムを含むシステムのＣＰＵ上で実行するグラフィックスドライバプログラムで実施される。ハードウェアおよび／またはソフトウェアコンポーネントのあらゆる組み合わせを含む他の任意の適当なエージェントを、システム設定ユニットとして用いることができる。[内部分散アンチエイリアシング] [0063] Aspect settings for I / O ports, display heads, and other graphics subsystems can be accomplished by a system settings unit configured to communicate with all graphics processors. In some embodiments, the system configuration unit is implemented in a graphics driver program that runs on the CPU of a system that includes a multiprocessor graphics subsystem. Any other suitable agent including any combination of hardware and / or software components can be used as the system configuration unit. [Internally distributed anti-aliasing]

[0064]本発明の実施形態によれば、１つのＧＰＵ１２２の２つのディスプレイヘッド２０６ａ、２０６ｂをマスター／スレーブ形式で互いに結合することができる。この形式では、ＧＰＵ１２２が、マスターとして動作するディスプレイヘッド（例えばディスプレイヘッドＡ２０６ａ）の画素選択論理回路３００を用いて「内部分散」ＡＡフィルタリングを実行可能である。 [0064] According to embodiments of the present invention, two display heads 206a, 206b of one GPU 122 can be coupled together in a master / slave fashion. In this format, the GPU 122 can perform “internally distributed” AA filtering using the pixel selection logic 300 of the display head (eg, display head A 206a) operating as a master.

[0065]図４Ｂは、本発明の実施形態によるマスター／スレーブ形式でディスプレイヘッド２０６ｂに接続されたディスプレイヘッド２０６ａを示すＧＰＵ１２２のブロック図である。当然のことながら、図４ＢのＧＰＵ１２２は図２のＧＰＵ１２２と同一である。図４ＢではアクティブなＩ／Ｏポートのみを示しており、クロスバー２２０は示していない。図４Ｂのディスプレイパイプライン２０２は、２つの平行なセクションを有して示されている。画素をディスプレイヘッド２０６ａへと送るディスプレイパイプラインＡ４０２ａと、画素をディスプレイヘッド２０６ｂへと送るディスプレイパイプラインＢ４０２ｂである。ディスプレイパイプラインＡ４０２ａおよびＢ４０２ｂは、それぞれ従来の設計とすることができ、それぞれ各種の画素処理動作を行うようにすることができる。ディスプレイパイプライン４０２ａおよび４０２ｂは、所望のとおりに同じ動作を行うことも異なる動作を行うこともできる。 [0065] FIG. 4B is a block diagram of GPU 122 showing display head 206a connected to display head 206b in a master / slave format in accordance with an embodiment of the present invention. Of course, the GPU 122 of FIG. 4B is identical to the GPU 122 of FIG. In FIG. 4B, only active I / O ports are shown, and the crossbar 220 is not shown. The display pipeline 202 of FIG. 4B is shown having two parallel sections. A display pipeline A 402a that sends pixels to the display head 206a and a display pipeline B 402b that sends pixels to the display head 206b. The display pipelines A 402a and B 402b can each be a conventional design, and each can perform various pixel processing operations. Display pipelines 402a and 402b can perform the same or different operations as desired.

[0066]ＭＩＯＢポート２１４ｂは、画素転送パス４００を介して同じＧＰＵ１２２のＭＩＯＡポート２１４ａに結合されている。画素転送パス４００は、ディスプレイヘッドＢ２０６ｂにより生成された画素をＭＩＯＢポート２１４ｂからＭＩＯＡポート２１４ａへ転送する。ＭＩＯＡポート２１４ａは、受信した画素をＧＰＵ１２２のディスプレイヘッドＡ２０６ａに送る。画素転送パス４００は、任意の適当な信号転送技術を用いて実施することができる。以下で例を説明する。 [0066] The MIOB port 214b is coupled to the MIOA port 214a of the same GPU 122 via the pixel transfer path 400. The pixel transfer path 400 transfers the pixels generated by the display head B 206b from the MIOB port 214b to the MIOA port 214a. The MIOA port 214 a sends the received pixel to the display head A 206 a of the GPU 122. The pixel transfer path 400 can be implemented using any suitable signal transfer technique. An example is described below.

[0067]ディスプレイヘッドＡ２０６ａの観点からは、ディスプレイヘッドＢ２０６ｂから受信した画素は、異なるＧＰＵから受信した画素と区別ができない。よって、例えば、ヘッドＡ２０６ａ由来の「内部」画素（Ｐ_Ａ）、ヘッドＢ２０６ｂ由来の「外部」画素（Ｐ_Ｂ）、または、画素合成回路３０８により画素Ｐ_ＡおよびＰ_Ｂから作り出した混合画素のいずれか１つを出力画素として選択するようにディスプレイヘッドＡ２０６ａの画素選択論理回路３００が動作可能である。（画素Ｐ_Ｂが画素Ｐ_Ａとは異なりディスプレイパイプラインＡ４０２ａによりディスプレイヘッドＡ２０６ａに提供されないという意味で、画素Ｐ_ＢはディスプレイヘッドＡ２０６ａに対して「外部」である。） [0067] From the perspective of display head A 206a, pixels received from display head B 206b are indistinguishable from pixels received from different GPUs. Thus, for example, any of the “internal” pixel (P _A ) derived from the head _A 206 a, the “external” pixel (P _B ) derived from the head _B 206 _b , or a mixed pixel created from the pixels P _A and P _B by the pixel synthesis circuit 308 The pixel selection logic circuit 300 of the display head A 206a is operable to select one as the output pixel. (Pixel P _B is “external” to display head A 206a in the sense that pixel P _B is not provided to display head A 206a by display pipeline _A 402a, unlike pixel PA.)

[0068]この構成では、ＧＰＵ１２２が、ディスプレイヘッドＡ２０６の画素選択論理回路３００により混合されるサンプル値を供給する２つのディスプレイパイプライン４０２ａ、４０２ｂによる「内部分散」ＡＡの実行に使用可能である。動作においては、ＧＰＵ１２２のレンダリングパイプライン（明示せず）が、２つの画像について用いられるサンプリング位置が互いに異なるように、表示パラメータまたはサンプリングパラメータにいくらかの変化量をもって、同じシーンの２つの画像をレンダリングする。例えば、若干異なる表示域または表示面の法線を２つのＧＰＵ１２２について画成し、２つの画像の画素境界に小さいオフセットを作り出すこともできる。あるいは、画素内のサンプリング位置が（例えばグラフィックスドライバにより）設定可能な場合には、各ＧＰＵ１２２を同じ表示パラメータの組と各画素内の異なるサンプリング位置を用いて各画像を生成することもできる。 [0068] In this configuration, the GPU 122 can be used to perform "internal dispersion" AA by two display pipelines 402a, 402b that supply sample values that are mixed by the pixel selection logic 300 of the display head A206. In operation, the rendering pipeline (not explicitly shown) of the GPU 122 renders two images of the same scene with some change in display parameters or sampling parameters so that the sampling positions used for the two images are different from each other. To do. For example, a slightly different display area or display surface normal may be defined for the two GPUs 122 to create a small offset at the pixel boundary of the two images. Alternatively, if the sampling position in the pixel can be set (for example, by a graphics driver), each GPU 122 can generate each image using the same set of display parameters and a different sampling position in each pixel.

[0069]レンダリングした画像の一方がフレームバッファ「Ａ」４０４に記憶され、他方がフレームバッファ「Ｂ」４０６に記憶される。フレームバッファＡ４０４およびＢ４０６は、ＧＰＵ１２２のオンチップメモリを含む任意の１つのメモリデバイスまたは複数のメモリデバイス、図１のグラフィックスメモリ１２４および／またはシステムメモリ１０４で実施可能である。この２つのフレームバッファは所望のとおりに同じメモリデバイス中にも異なるデバイス中にも配置することができる。 [0069] One of the rendered images is stored in frame buffer “A” 404 and the other is stored in frame buffer “B” 406. Frame buffers A 404 and B 406 may be implemented in any one or more memory devices, including GPU 122 on-chip memory, graphics memory 124 and / or system memory 104 of FIG. The two frame buffers can be located in the same memory device or in different devices as desired.

[0070]ディスプレイパイプラインＢ４０２ｂは、フレームバッファＢ４０６から画素を読み出し、各種の処理動作（概して従来の性質のものでよい）を画素に施し、得られた画素Ｐ_ＢをディスプレイヘッドＢ２０６ｂに転送する。ディスプレイヘッドＢ２０６ｂは、画素Ｐ_Ｂを選択するように動作する画素選択論理回路３００を有しており、それらの画素がクロスバー２２０（図４に明示せず）を介してＭＩＯＢポート２１４ｂへと転送される。画素Ｐ_Ｂは、画素パス４００を介して同じＧＰＵ１２２のＭＩＯＡポート２１４ａへと転送され、これが画素Ｐ_ＢをディスプレイヘッドＡ２０６ａに転送する。 [0070] The display pipeline B 402b reads the pixels from the frame buffer B 406, performs various processing operations (generally of conventional nature) on the pixels, and transfers the resulting pixels P _B to the display head _B 206b. The display head _B 206b has a pixel selection logic circuit 300 that operates to select the pixel P _B and these pixels are transferred to the MIOB port 214b via the crossbar 220 (not explicitly shown in FIG. 4). Is done. Pixel P _B is transferred via pixel path 400 to MIOA port 214a of the same GPU 122, which transfers pixel P _B to display head A 206a.

[0071]この動作と平行して、ディスプレイパイプラインＡ４０２ａがフレームバッファＡから画素を読み出し、画素に（概して従来の性質のものでよい）各種の処理動作を施し、得られた画素Ｐ_ＡをディスプレイヘッドＡ２０６ａに転送する。ディスプレイパイプラインＢ４０２ｂ、ディスプレイヘッドＢ２０６ｂ、および画素パス４００は、同じスクリーン画素に対応する画素値Ｐ_ＡおよびＰ_Ｂが同時に（例えば、同じクロックサイクルで）ディスプレイヘッドＡ２０６ａの画素選択論理回路３００に送られるように、適切なタイミングで好適に設定される。 [0071] In parallel with this operation, the display pipeline A402a reads the pixel from the frame buffer A, pixel (generally may be of conventional nature) performs various processing operations, the pixel P _A obtained display Transfer to head A 206a. Display pipeline B 402b, display head B 206b, and pixel path 400 send pixel values P _A and P _B corresponding to the same screen pixel to pixel selection logic 300 of display head A 206a simultaneously (eg, in the same clock cycle). Thus, it is suitably set at an appropriate timing.

[0072]画素合成回路３０８内では、加算回路３１０が画素Ｐ_ＡとＰ_Ｂを加算し、マルチプレクサ３１２が合計画素を選択し、除算回路３１４が合計を２で除し、こうしてパス３１６上の混合画素が画素Ｐ_ＡおよびＰ_Ｂの平均となる。マルチプレクサ３１８は、混合画素を出力画素Ｐ_{ｆｉｎａｌ}として選択する。ディスプレイヘッドＡ２０６ａが出力画素Ｐ_{ｆｉｎａｌ}をディスプレイ装置への送信のために出力ポート（例えばデジタル出力ポート２１０）に送る。 [0072] Within the pixel combining circuit 308, adding circuit 310 adds the pixel _{P A} and _{P B,} the multiplexer 312 selects the sum pixel, the division circuit 314 by dividing the total of 2, thus mixed on the path 316 pixel is the average of the pixel _{P a} and _{P B.} The multiplexer 318 selects the mixed pixel as the output pixel P _final . Display head A 206a sends output pixel P _final to an output port (eg, digital output port 210) for transmission to the display device.

[0073]ＧＰＵ１２２のレンダリングパイプラインが各フレームを２回レンダリングするため、本明細書に記載の内部分散ＡＡモードで動作する際のＧＰＵ１２２の最大フレームレートが非ＡＡモードで動作する際の最大フレームレートよりも概して低いことに留意されたい。いくつかの実施形態では、この内部分散ＡＡモードのフレームレートはおよそ非ＡＡモードのフレームレートの約１／２である。リアルタイムアニメーションについては、内部分散ＡＡモードのフレームレートが約毎秒３０フレーム（以上）であれば、フレームレートの低下はアニメーションの平滑性にほとんどあるいは全く悪影響を与えない。また、非ＡＡモードで生成される画質は、内部分散ＡＡモードで生成される画質よりも概して低くなる。このように、内部分散ＡＡは高画質化と引き換えにフレームレートが低下する。 [0073] Since the rendering pipeline of the GPU 122 renders each frame twice, the maximum frame rate of the GPU 122 when operating in the internally distributed AA mode described herein is the maximum frame rate when operating in the non-AA mode. Note that it is generally lower than. In some embodiments, the frame rate of this internally distributed AA mode is approximately half that of the non-AA mode. For real-time animation, if the frame rate of the internal distributed AA mode is about 30 frames per second (or more), a decrease in the frame rate has little or no adverse effect on the smoothness of the animation. Also, the image quality generated in the non-AA mode is generally lower than the image quality generated in the internal dispersion AA mode. In this way, the internal dispersion AA has a reduced frame rate in exchange for higher image quality.

[0074]本明細書に記載の内部分散ＡＡモードを用いて得られるフレームレートが、単一のＧＰＵにおいて従来のＡＡ技術（例えば、レンダリングパイプラインおよび／またはディスプレイパイプラインでのフィルタリング）を用いて得られるフレームレートに匹敵することについても留意されたい。単一のＧＰＵを用いた従来のＡＡには、単一の画像を生成するが、１画素あたり複数のサンプルを用いるＧＰＵのレンダリングパイプラインが必要となる。１画素あたりより多くのサンプルを処理することによって、画質の改善と引き換えに非ＡＡモードに対するフレームレートも低下する。２画像のレンダリングの方法によって、レンダリングパイプラインにおいて管理され、内部分散ＡＡモードを用いたＧＰＵのスループットは、従来のＡＡを用いるＧＰＵのスループットに匹敵することができる。 [0074] The frame rate obtained using the internally distributed AA mode described herein can be achieved using conventional AA techniques (eg, filtering in the rendering pipeline and / or the display pipeline) on a single GPU. Note also that it is comparable to the resulting frame rate. Conventional AA using a single GPU generates a single image, but requires a GPU rendering pipeline that uses multiple samples per pixel. By processing more samples per pixel, the frame rate for the non-AA mode is also reduced in exchange for improved image quality. A two-image rendering method manages in the rendering pipeline and the throughput of the GPU using the internal distributed AA mode can be comparable to the throughput of the GPU using the conventional AA.

[0075]より高次のＡＡフィルタを実施することもできるが、このようなフィルタは単一パイプラインおよび内部分散アンチエイリアシング動作の組み合わせを採用することができる。一実施形態では、ディスプレイパイプラインＡ４０２ａおよびディスプレイパイプラインＢ４０２ｂがそれぞれ内部Ｎ×ＡＡフィルタを実施するフィルタオンスキャン（ＦＯＳ）モジュール（明示せず）を含んでいる。より具体的には、レンダリングされる画像の各バージョンについて、ＧＰＵ１２２のレンダリングパイプラインが、例えば従来のスーパーサンプリングおよび／またはマルチサンプリング技術を用いて、１画素あたりＮ個（例えば２、４または１より大きい任意の他の数）のサンプルを生成する。画像の一方のバージョン用のサンプルがフレームバッファＡ４０４に記憶され、画像の他方のバージョン用のサンプルがフレームバッファＢ４０６に記憶される。 [0075] Although higher order AA filters may be implemented, such filters may employ a combination of single pipeline and internal distributed anti-aliasing operations. In one embodiment, display pipeline A 402a and display pipeline B 402b each include a filter on scan (FOS) module (not explicitly shown) that implements an internal N × AA filter. More specifically, for each version of the rendered image, the GPU 122 renders a pipeline of N (e.g., 2, 4, or 1 per pixel) using, for example, conventional supersampling and / or multisampling techniques. Generate any other number of samples). Samples for one version of the image are stored in frame buffer A 404, and samples for the other version of the image are stored in frame buffer B 406.

[0076]ディスプレイパイプラインＡ４０２ａは、各画素についてのＮ個のサンプル全てをフレームバッファＡから受信する。ディスプレイパイプラインＡ４０２ａ内では、第１のフィルタオンスキャン（ＦＯＳ）モジュール（図４には明示せず）が、Ｎ×ＡＡフィルタを実施し、Ｎ個のサンプルを混合して１画素あたり１つのカラー値を生成する。ＦＯＳモジュールにより決定されたカラー値は、画素Ｐ_ＡとしてディスプレイヘッドＡ２０６ａに（場合によってはさらなる処理の後で）供給される。 [0076] Display pipeline A 402a receives all N samples for each pixel from frame buffer A. Within the display pipeline A 402a, a first filter-on-scan (FOS) module (not explicitly shown in FIG. 4) implements an N × AA filter, mixing N samples, one color per pixel. Generate a value. The color value determined by the FOS module is supplied to the display head _A 206a (possibly after further processing) as a pixel PA.

[0077]同様に、ディスプレイパイプライン４０２ｂは、各画素についてのＮ個のサンプル全てをフレームバッファＢ４０６から受信する。ディスプレイパイプライン４０２ｂ内では、第２のＦＯＳモジュール（これも図４には明示せず）が、Ｎ×ＡＡフィルタを実施し、Ｎ個のサンプルを混合して１画素あたり１つのカラー値を決定する。第２のＦＯＳモジュールにより決定されたカラー値は、画素Ｐ_ＢとしてディスプレイヘッドＢ２０６ｂに（場合によってはさらなる処理の後で）供給される。 [0077] Similarly, the display pipeline 402b receives all N samples for each pixel from the frame buffer B406. Within the display pipeline 402b, a second FOS module (also not explicitly shown in FIG. 4) performs an N × AA filter and mixes N samples to determine one color value per pixel. To do. The color value determined by the second FOS module is supplied to the display head _B 206b (possibly after further processing) as a pixel P _B.

[0078]このように、ディスプレイパイプ４０２ａおよび４０２ｂによりそれぞれ生成された画素Ｐ_ＡおよびＰ_Ｂは、それぞれＮ×オーバーサンプリング画像からのフィルタ処理画素とすることが可能である。フレームバッファＡ４０４を埋めるために用いられるサンプリング点がフレームバッファＢ４０６を埋めるために用いられるものと一致しない限り、各ディスプレイパイプ４０２ａ、４０２ｂ内のＮ×ＡＡフィルタを上述の内部分散ＡＡフィルタ技術に組み合わせることによって、（２Ｎ）×ＡＡフィルタが得られる。例えば、各ディスプレイパイプ４０２ａ、４０２ｂ内のＦＯＳモジュールが４×ＡＡフィルタであれば、ＧＰＵ１２２は８×ＡＡを提供可能である。 [0078] Thus, the pixel P _A and P _B respectively generated by the display pipes 402a and 402b, it is possible respectively to filtered pixels from N × oversampling image. As long as the sampling points used to fill frame buffer A 404 do not match those used to fill frame buffer B 406, combine the N × AA filters in each display pipe 402a, 402b with the internal variance AA filter technique described above. (2N) × AA filter is obtained. For example, if the FOS module in each display pipe 402a, 402b is a 4 × AA filter, the GPU 122 can provide 8 × AA.

[0079]本発明には、特定のＦＯＳモジュールまたはＡＡフィルタリングアルゴリズムが必須ということはなく、従来のモジュールおよびアルゴリズムを用いることができる。したがって、詳細な説明は省略する。いくつかの実施形態では、得られる最終画像が特定のディスプレイパイプラインによって処理される画像のバージョンに依存しないように、ディスプレイパイプラインＡ４０２ａおよび４０２ｂ内のＦＯＳモジュールが同一のフィルタアルゴリズムに適合する。また、画像生成プロセスの初期にＮ×ＡＡフィルタリングを行うことが可能である。例えば、代替的な一実施形態では、従来の技術を用いてＧＰＵ１２２のレンダリングパイプライン内でＮ×ＡＡフィルタリングを行うこともできる。 [0079] The present invention does not require a specific FOS module or AA filtering algorithm, and conventional modules and algorithms can be used. Therefore, detailed description is omitted. In some embodiments, the FOS modules in display pipelines A 402a and 402b are adapted to the same filter algorithm so that the final image obtained does not depend on the version of the image processed by a particular display pipeline. It is also possible to perform N × AA filtering early in the image generation process. For example, in an alternative embodiment, N × AA filtering may be performed within the rendering pipeline of GPU 122 using conventional techniques.

[0080]いくつかの実施形態では、異なるフレームバッファを埋めるサンプリング点が、一致するサンプリング点がないように選択される。例えば、図５Ａは画素５００に適用される「グリッド」サンプリングパターンを図示している。画素は、画素内の位置５０１〜５０４（丸で示す）で４回サンプリングされる。図５Ｂは画素５００に適用される「回転グリッド」サンプリングパターンを図示している。画素は、画素内の位置５０１〜５０４とは異なる位置５１１〜５１４（ひし形で示す）で４回サンプリングされる。 [0080] In some embodiments, sampling points that fill different frame buffers are selected such that there are no matching sampling points. For example, FIG. 5A illustrates a “grid” sampling pattern applied to pixel 500. Pixels are sampled four times at positions 501-504 (shown as circles) within the pixel. FIG. 5B illustrates a “rotating grid” sampling pattern applied to pixel 500. The pixels are sampled four times at positions 511-514 (indicated by diamonds) that are different from the positions 501-504 within the pixels.

[0081]一実施形態では、フレームバッファＡ４０４内の画素データが図５Ａのグリッドサンプリングパターンを用いて生成され、フレームバッファＢ４０６内の画素データが図５Ｂの回転グリッドサンプリングパターンを用いて生成される。ディスプレイパイプＡ４０２ａのＦＯＳモジュールは、４つのサンプル値（５０１〜５０４）にフィルタをかけて１つの値Ｐ_Ａとし、ディスプレイパイプＢ４０２ｂのＦＯＳモジュールは、４つのサンプル値（５１１〜５１４）にフィルタをかけて１つの値Ｐ_Ｂとする。ディスプレイヘッド２０６ａの画素選択論理回路３００が上述のように値Ｐ_ＡおよびＰ_Ｂを混合して、全部で８つのサンプルの平均に相当する最終画像を得る。この手順は、図５Ｃの８点パターンを用いて各画素をサンプリングする単一のレンダリングプロセスと同じアンチエイリアシングパワーをもたらす。 [0081] In one embodiment, pixel data in frame buffer A 404 is generated using the grid sampling pattern of FIG. 5A, and pixel data in frame buffer B 406 is generated using the rotated grid sampling pattern of FIG. 5B. FOS module display pipe A402a may filter the four sample values (501 to 504) as a single value _{P A,} FOS module displays pipe B402b filters the four sample values (511 to 514) a single value _{P B} Te. Pixel selection logic 300 in display head 206a is a mixture of values P _A and P _B as described above, to obtain a final image corresponding to an average of eight samples in total. This procedure results in the same anti-aliasing power as a single rendering process that samples each pixel using the 8-point pattern of FIG. 5C.

[0082]本明細書に記載の内部分散ＡＡ技術は例示的なものであって、変更および修正が可能であることが理解されよう。例えば、本明細書に記載のＧＰＵ１２２は、それぞれが最大限で１つの出力ポートを駆動可能な丁度２つのディスプレイヘッドを有しており、その結果、両方のディスプレイヘッドを内部分散アンチエイリアシングに用いれば、ＧＰＵ１２２がディスプレイ装置に最大限で１つの画素ストリームを送ることができる。しかしながら、本発明の実施形態は、少なくとも２つのディスプレイヘッドならびに適当な画素選択論理回路およびＩ／Ｏポートを有する任意のＧＰＵで実施することができる。ＧＰＵが３つ以上のディスプレイヘッドを有する場合には、ＧＰＵが内部分散ＡＡをサポート可能であり、２つ以上のディスプレイデバイスに個々の画素ストリームを供給することも可能である。加えて、ＧＰＵが３つ以上のディスプレイヘッドを有する場合には、ＧＰＵのＡＡパワーを一層高めるために、ＧＰＵのディスプレイヘッドの全てをマスター／スレーブデイジーチェーン式に互いに接続できるようになる。 [0082] It will be appreciated that the internally distributed AA technique described herein is exemplary and that changes and modifications are possible. For example, the GPU 122 described herein has exactly two display heads each capable of driving a maximum of one output port, so that both display heads can be used for internal distributed anti-aliasing. , GPU 122 can send a maximum of one pixel stream to the display device. However, embodiments of the present invention can be implemented with any GPU having at least two display heads and appropriate pixel selection logic and I / O ports. If the GPU has more than two display heads, the GPU can support internally distributed AA and can supply individual pixel streams to more than one display device. In addition, if the GPU has more than two display heads, all of the GPU display heads can be connected together in a master / slave daisy chain fashion to further increase the AA power of the GPU.

[0083]また、本明細書に記載のＧＰＵ１２２は、いずれも内部分散ＡＡに用いられる２つのＭＩＯポートを有している。この実施形態では、ヘッドＡ２０６ａとヘッドＢ２０６ｂのどちらも任意の他のＧＰＵまたはディスプレイヘッドに対してマスターあるいはスレーブとして使用できなくなる。他の実施形態では、ＧＰＵが追加のＭＩＯポートを有することができ、このＭＩＯポートが１つのポートに画素の受信送信を同時に行わせ、内部分散ＡＡと組み合わせて他のＧＰＵとの相互接続を可能にする動作モードを有することができる。例えば、第３のＭＩＯポートがある場合、そのポートは、外部画素を別のＧＰＵからディスプレイヘッドＢ２０６ｂに送る入力ポートまたはディスプレイヘッドＡ２０６ａにより生成された画素を別のＧＰＵに送る出力ポートとして設定することもできる。このような実施形態において、他のＧＰＵはそれ自体の内部分散ＡＡフィルタリングを行うように設定されることもされないこともある。[画素転送パス] [0083] Also, the GPU 122 described in this specification has two MIO ports, both of which are used for internal distributed AA. In this embodiment, neither head A 206a nor head B 206b can be used as a master or slave to any other GPU or display head. In other embodiments, the GPU can have an additional MIO port that allows one port to simultaneously receive and transmit pixels and combine with internal distributed AA to interconnect with other GPUs. Can have an operation mode. For example, if there is a third MIO port, that port should be set as an input port that sends external pixels from another GPU to display head B 206b or an output port that sends pixels generated by display head A 206a to another GPU. You can also. In such embodiments, other GPUs may or may not be configured to perform their own internal distributed AA filtering. [Pixel transfer path]

[0084]本発明の実施形態による画素転送パス実施の例について説明する。明らかなように、画素転送パスはＧＰＵの外部にあっても内部にあってもよい。 [0084] An example of a pixel transfer path implementation according to an embodiment of the present invention will be described. As is apparent, the pixel transfer path may be external or internal to the GPU.

[0085]図６は、本発明の実施形態によるプリント基板カードとして実施され、外部画素転送パスを用いて構成されたグラフィックスアダプタ６００を示す。グラフィックスアダプタ６００は、ＰＣＩ−Ｅまたは別の相互接続標準に適合するプリント回路基板（ＰＣＢ）６０２を用いて拡張カードとして実施される。ＧＰＵ１２２は、ＰＣＢ６０２上に実装され、ＰＣＢ６０２上で配線（図示せず）を介してシステムコネクタ６０４と電気的に結合される。システムコネクタ６０４は、ＰＣＩ−Ｅ拡張スロット（または任意の他種の拡張スロット）に挿入可能なように設計され、ＧＰＵ１２２と図１のシステム１００等のコンピュータシステムの他の部分の間の通信を可能にする。ＧＰＵ１２２は、ＰＣＢ６０２上で配線（図示せず）を介してディスプレイ出力コネクタ６０６とも電気的に結合されている。ディスプレイ出力コネクタ６０６は、ＧＰＵ１２２のデジタル出力ポート２１０、２１１またはアナログ出力ポート２１２、２１３（図２参照）の１つと好適に結合される。いくつかの実施形態では、当分野で既知のように、ＰＣＢ６０２が、それぞれが出力ポート２１０〜２１３の異なる１つに結合された複数のディスプレイ出力コネクタ６０６を提供することができる。 [0085] FIG. 6 illustrates a graphics adapter 600 implemented as a printed circuit board card according to an embodiment of the invention and configured with an external pixel transfer path. Graphics adapter 600 is implemented as an expansion card using a printed circuit board (PCB) 602 that conforms to PCI-E or another interconnect standard. The GPU 122 is mounted on the PCB 602 and electrically coupled to the system connector 604 on the PCB 602 through wiring (not shown). System connector 604 is designed to be insertable into a PCI-E expansion slot (or any other type of expansion slot) to allow communication between GPU 122 and other portions of a computer system such as system 100 of FIG. To. The GPU 122 is also electrically coupled to the display output connector 606 via wiring (not shown) on the PCB 602. Display output connector 606 is preferably coupled to one of digital output ports 210, 211 or analog output ports 212, 213 (see FIG. 2) of GPU 122. In some embodiments, as is known in the art, the PCB 602 can provide multiple display output connectors 606, each coupled to a different one of the output ports 210-213.

[0086]ＰＣＢ６０２は、同一設計が可能な２つのグラフィックスエッジコネクタ６１４ａ、６１４ｂも含んでいる。グラフィックスエッジコネクタ６１４ａは、配線６１６を介してＧＰＵ１２２のＭＩＯＡポート２１４ａに接続しており、グラフィックスエッジコネクタ６１４ｂは、配線６１８を介してＧＰＵ１２２のＭＩＯＢポート２１４ｂに接続している。各グラフィックスエッジコネクタ６１４ａ、６１４ｂは、取り外し可能な相互接続デバイスとの電気的かつ機械的接続用に構成されている。いくつかの実施形態では、グラフィックスエッジコネクタ６１４ａ、６１４ｂが同一の構成を有し、交換して用いることができる。 [0086] The PCB 602 also includes two graphics edge connectors 614a, 614b that can be designed identically. The graphics edge connector 614a is connected to the MIOA port 214a of the GPU 122 via a wiring 616, and the graphics edge connector 614b is connected to the MIOB port 214b of the GPU 122 via a wiring 618. Each graphics edge connector 614a, 614b is configured for electrical and mechanical connection with a removable interconnect device. In some embodiments, graphics edge connectors 614a, 614b have the same configuration and can be used interchangeably.

[0087]一実施形態のグラフィックスアダプタ６００は、２つ以上のＧＰＵが協働してレンダリングタスクの異なる部分を行う分散レンダリングシステムでの使用のために設計されている。このようなシステムは、例えば、各ＣＰＵが画像の異なる部分をレンダリングする分割フレームモード、各ＣＰＵが一連の画像中の異なる画像をレンダリングする代替フレームモード、または分散アンチエイリアシングモードで動作させることができる。これらモードのそれぞれにおいて、１つのＧＰＵ（マスター）がもう１つのＧＰＵ（スレーブ）から画素を受信し、マスターＧＰＵ内の画素選択論理回路３００が上述のようにディスプレイ用の画素を選択する。異なるグラフィックスアダプタ６００のＧＰＵは、適当な相互接続デバイスを用いてそれぞれのグラフィックスエッジコネクタ６１４ａ、６１４ｂを介して有利に接続される。 [0087] The graphics adapter 600 of one embodiment is designed for use in a distributed rendering system in which two or more GPUs cooperate to perform different parts of a rendering task. Such a system can be operated, for example, in a split frame mode where each CPU renders a different portion of the image, an alternate frame mode where each CPU renders a different image in a series of images, or a distributed anti-aliasing mode. . In each of these modes, one GPU (master) receives pixels from another GPU (slave), and the pixel selection logic circuit 300 in the master GPU selects pixels for display as described above. The GPUs of different graphics adapters 600 are advantageously connected through their respective graphics edge connectors 614a, 614b using a suitable interconnect device.

[0088]本発明の実施形態では、取り外し可能な相互接続デバイス６２０が、図６に示すように、同じグラフィックスアダプタ６００のグラフィックスエッジコネクタ６１４ａ、６１４ｂに接続可能なように構成かつ形成されている。相互接続デバイス６２０は、例えば、グラフィックスエッジコネクタ６１４ａ、６１４ｂを受容するレセプタクルを片側に有するリボンケーブルや長さに沿ってプリントされた配線付きＰＣＢとすることが可能であり、２つのグラフィックスエッジコネクタ６１４ａ、６１４ｂを互いに接続する。 [0088] In an embodiment of the present invention, a removable interconnect device 620 is configured and configured to be connectable to graphics edge connectors 614a, 614b of the same graphics adapter 600, as shown in FIG. Yes. The interconnect device 620 can be, for example, a ribbon cable having a receptacle on one side for receiving graphics edge connectors 614a, 614b or a PCB with wiring printed along its length, such as two graphics edges. Connectors 614a and 614b are connected to each other.

[0089]この実施形態では、相互接続デバイス６２０が、グラフィックスアダプタ６００によりサポートされる分散レンダリングシステムのタイミング特性を利用して画素転送パス４００（図４）を構築する。より具体的には、分散レンダリング構成において、ＧＰＵ１２２のＭＩＯＢポート２１４ｂから異なるグラフィックスアダプタ６００上のＧＰＵのＭＩＯＡポート（またはＭＩＯＢポート）への画素転送パスが、転送パスの任意のセグメントに沿って含まれ得る任意の電子コンポーネント（ＦＩＦＯ、ラッチ等）だけでなく、配線６１８および／または６１６の長さおよび２つのアダプタ間の相互接続デバイス由来の特性転送時間を有する。分散レンダリング動作では、スレーブＧＰＵおよびマスターＧＰＵからの画素がマスターＧＰＵのディスプレイヘッドにほぼ同時に（例えば同じクロックサイクル中に）到達するように、スレーブＧＰＵのディスプレイヘッドからマスターＧＰＵのディスプレイヘッドへの画素の転送が有利に調整される。 [0089] In this embodiment, the interconnect device 620 builds the pixel transfer path 400 (FIG. 4) utilizing the timing characteristics of the distributed rendering system supported by the graphics adapter 600. More specifically, in a distributed rendering configuration, a pixel transfer path from the MIOB port 214b of the GPU 122 to the MIOA port (or MIOB port) of the GPU on a different graphics adapter 600 is included along any segment of the transfer path. As well as any electronic components that can be (FIFO, latches, etc.), the length of the wiring 618 and / or 616 and the characteristic transfer time from the interconnect device between the two adapters. In a distributed rendering operation, the pixels from the slave GPU display head to the master GPU display head so that the pixels from the slave GPU and master GPU reach the master GPU display head almost simultaneously (eg, during the same clock cycle). The transfer is advantageously adjusted.

[0090]相互接続デバイス６２０は、異なるＧＰＵを接続する分散レンダリング相互接続デバイスとの送信時間マッチングを与え、相互接続デバイス６２０により提供される画素転送パスは、信号を正しいタイミングでＭＩＯＡポート２１４ａに送る。このように、外部相互接続デバイスを用いた内部分散ＡＡの実施には、元々分散レンダリング用に設計されたＧＰＵ１２２またはアダプタカード６００に何ら内部修正を必要とするものではない。 [0090] The interconnect device 620 provides transmission time matching with distributed rendering interconnect devices that connect different GPUs, and the pixel transfer path provided by the interconnect device 620 sends the signal to the MIOA port 214a at the correct timing. . Thus, implementation of internal distributed AA using external interconnect devices does not require any internal modifications to GPU 122 or adapter card 600 originally designed for distributed rendering.

[0091]本明細書に記載のグラフィックスアダプタおよび相互接続デバイスは、例示的なものであって、変更および修正が可能であることが理解されよう。アダプタおよび相互接続デバイスの形状、レイアウト、および材料組成は、本明細書で示し記載したものから修正することができ、ＭＩＯポート間のデータ転送用にあらゆる通信プロトコルを実施することができる。 [0091] It will be appreciated that the graphics adapters and interconnect devices described herein are illustrative and that changes and modifications are possible. The shape, layout, and material composition of the adapters and interconnect devices can be modified from those shown and described herein, and any communication protocol can be implemented for data transfer between MIO ports.

[0092]代替的な一実施形態では、例えばパス６１８〜パス６１６を接続する配線を用いて、相互接続デバイス６２０をＰＣＢ６０２の一部として実施することもできる。この実施形態では、パス６１８からパス６１６へまたは逆方向のデータ転送を有効または無効とするように、制御デバイス（例えば取り外し可能なジャンパまたはドライバ制御スイッチ）が有利に用いられる。 [0092] In an alternative embodiment, the interconnect device 620 may be implemented as part of the PCB 602, for example using wiring connecting paths 618 to 616. In this embodiment, a control device (eg, a removable jumper or driver control switch) is advantageously used to enable or disable data transfer from path 618 to path 616 or in the reverse direction.

[0093]いくつかの実施形態では、同じＧＰＵの２つのＭＩＯポート間の相互接続デバイス６２０または他の外部接続の存在が自動的に内部分散ＡＡを有効にするわけではないことにも留意されたい。上述のように、画素選択論理回路３００の動作が内部分散ＡＡを行うか否かを決定するが、画素選択論理回路３００の動作はグラフィックスドライバを介して制御される。 [0093] Note also that in some embodiments, the presence of an interconnect device 620 or other external connection between two MIO ports of the same GPU does not automatically enable internal distributed AA. . As described above, the operation of the pixel selection logic circuit 300 determines whether or not to perform internal distribution AA, but the operation of the pixel selection logic circuit 300 is controlled via the graphics driver.

[0094]代替的な別の実施形態では、内部分散ＡＡに用いられる画素転送パスがＧＰＵ内に構築される。図７は、本発明のこのような一実施形態によるＧＰＵ７００のブロック図である。ＧＰＵ７００は、図４のＧＰＵ１２２とほぼ同様であり、同じ参照番号が対応するコンポーネントを識別するのに用いられている。ＧＰＵ１２２とは異なり、ＧＰＵ７００は、ディスプレイヘッドＢ２０６ｂの出力パス７０２をディスプレイヘッドＡ２０６ａの外部画素入力パス７０４に接続する内部画素転送パスを含んでいる。 [0094] In another alternative embodiment, the pixel transfer path used for internal distribution AA is built in the GPU. FIG. 7 is a block diagram of a GPU 700 according to one such embodiment of the invention. GPU 700 is substantially similar to GPU 122 of FIG. 4, and the same reference numerals are used to identify corresponding components. Unlike GPU 122, GPU 700 includes an internal pixel transfer path that connects output path 702 of display head B 206b to external pixel input path 704 of display head A 206a.

[0095]この実施形態では、画素転送パスが、ディスプレイヘッドＢ２０６ｂからの画素およびクロスバー２２０を介してＭＩＯポート（例えばＭＩＯＡポート２１４ａ）の一方からのパス７０８上で受信した画素の間で選択する選択ユニット（例えばマルチプレクサ）７０６を含んでいる。選択された画素は、ディスプレイヘッドＡ２０６ａの外部画素入力パス７０４に提供される。 [0095] In this embodiment, the pixel transfer path selects between pixels from display head B 206b and pixels received on path 708 from one of the MIO ports (eg, MIOA port 214a) via crossbar 220. A selection unit (eg, multiplexer) 706 is included. The selected pixel is provided to the external pixel input path 704 of the display head A 206a.

[0096]選択ユニット７０６は、制御信号（明示せず）に応答して動作する。制御信号は、ＧＰＵ７００が内部分散ＡＡモードで動作している場合には選択ユニット７０６がパス７０２上の画素を選択するように、ＧＰＵ７００のディスプレイヘッドＡ２０６ａが別のＧＰＵに対してマスターとして動作している場合にはパス７０８上の画素を選択するように設定する。この制御信号は、グラフィックスドライバから発せられたコマンドに応答して生成することができ、グラフィックソフトウェアにアクセスする必要なく適切なソフトウェアインターフェースを通じて、ユーザ（またはアプリケーション開発者）が内部分散ＡＡを有効または無効にすることを可能にする。 [0096] The selection unit 706 operates in response to a control signal (not explicitly shown). The control signal is such that when the GPU 700 is operating in the internal distributed AA mode, the display head A 206a of the GPU 700 operates as a master for another GPU so that the selection unit 706 selects a pixel on the path 702. If so, the pixel on the path 708 is set to be selected. This control signal can be generated in response to a command issued from the graphics driver, enabling the user (or application developer) to enable the internal distributed AA through an appropriate software interface without having to access the graphics software. Enable to disable.

[0097]この実施形態では、分散レンダリングモードにおいて外部ＧＰＵから画素が到着するのと同じタイミングでディスプレイヘッドＢ２０６ｂからの画素が選択回路７０６に到達するように、ディスプレイヘッドＢ２０６ｂから選択回路７０６へのパス７０２がＦＩＦＯ、ラッチ、および他のタイミング制御デバイスを含むことができることに留意されたい。この場合、ディスプレイヘッドＢ２０６ｂおよびディスプレイヘッドＡ２０６ａの動作タイミングは、ＧＰＵが分散レンダリングモードまたは内部分散ＡＡモードに依存しない。 [0097] In this embodiment, the path from the display head B 206b to the selection circuit 706 so that the pixel from the display head B 206b reaches the selection circuit 706 at the same timing as the pixel arrives from the external GPU in the distributed rendering mode. Note that 702 can include FIFOs, latches, and other timing control devices. In this case, the operation timing of the display head B 206b and the display head A 206a does not depend on the GPU in the distributed rendering mode or the internal distributed AA mode.

[0098]内部画素転送パスは、ＧＰＵに修正を要するものの、ＧＰＵのＩ／Ｏポートのいずれの使用も必要としない。故に、例えば、ＧＰＵ７００が内部分散ＡＡフィルタリングを実行し続ける間、ＧＰＵ７００のディスプレイヘッドＡ２０６ａは別のＧＰＵのディスプレイヘッドに対してスレーブ化可能であり、あるいはＧＰＵ７００のディスプレイヘッドＢ２０６ｂは別のＧＰＵのディスプレイヘッドに対してマスター化可能である。 [0098] Although the internal pixel transfer path requires modification to the GPU, it does not require the use of any of the GPU's I / O ports. Thus, for example, while GPU 700 continues to perform internal distributed AA filtering, GPU 700 display head A 206a can be slaved to another GPU display head, or GPU 700 display head B 206b can be another GPU display head. Can be mastered.

[0099]本明細書に記載の内部画素転送パスは例示的なものであって、変更および修正が可能であることが理解されよう。例えば、（ディスプレイヘッドＡ２０６ａからディスプレイヘッドＢ２０６ｂへの）「逆方向」画素転送パスを示したパスに追加して設けることもできる。[他の実施形態] [0099] It will be appreciated that the internal pixel transfer paths described herein are exemplary and can be changed and modified. For example, a “reverse” pixel transfer path (from display head A 206a to display head B 206b) may be provided in addition to the path indicated. [Other embodiments]

[0100]上述のように、本発明の実施形態は、ＡＡフィルタ処理画像を生成するために、１つのＧＰＵだけでなく、複数のＧＰＵにわたって分散レンダリングに広く関連付けられた読み出し技術およびコンポーネントを用いることが可能なマルチＧＰＵシステム用の分散アンチエイリアシング技術を提供する。適当なグラフィックスドライバインターフェースを介して、任意のグラフィックスプログラムについてプログラムそれ自体に与えられたＡＡ（またはその欠如）に関係なく、適切に設定されたＧＰＵのエンドユーザが内部分散ＡＡを有効にすることを選択可能である。プログラムがＡＡを提供する場合、本明細書に記載の内部分散ＡＡは、ＡＡ解像度を上げる（例えば２倍にする）ために用いることが可能である。 [0100] As noted above, embodiments of the present invention use readout techniques and components that are widely associated with distributed rendering across multiple GPUs, not just one GPU, to generate AA filtered images. To provide distributed anti-aliasing technology for multi-GPU systems. Appropriately configured GPU end-users enable internal distributed AA, regardless of the AA (or lack thereof) given to the program itself for any graphics program via the appropriate graphics driver interface You can choose that. If the program provides AA, the internally distributed AA described herein can be used to increase (eg, double) AA resolution.

[0101]本発明を特定の実施形態に関連して説明してきたが、当業者であれば多数の修正が可能であることが理解されよう。例えば、本発明はＡＡフィルタリングを参照して説明してきたが、本明細書に記載の単一ＧＰＵのディスプレイヘッド間または複数ＧＰＵのディスプレイヘッド間の結合を別の方法で用いることもできる。 [0101] Although the present invention has been described with reference to particular embodiments, those skilled in the art will recognize that numerous modifications are possible. For example, although the present invention has been described with reference to AA filtering, the coupling between single GPU display heads or multiple GPU display heads described herein may be used in other ways.

[0102]代替的な一実施形態では、ステレオアナグリフを生成するために分散フィルタリングを用いることが可能である。当分野で既知のように、ステレオアナグリフはシーンの左目視野および右目視野を重ねて１つの画像を作り出す。通常、左目画素と右目画素には異なるカラーフィルタが適用されるが、例えば、右目画素を赤色パスフィルタでフィルタ処理し、左目画素を青／緑色パスフィルタを用いてフィルタ処理することができる。左目視野および右目視野間の視野域または視点オフセットにより、シーンの同じ点に対応する左目画素と右目画素はアナグリフの異なる位置にくる。故に、裸眼には、アナグリフが歪んだ色の二重画像に見える。画像を正確に観るには、観察者が、右目画素用に用いた色をフィルタで除外する左レンズと左目画素用に用いた色をフィルタで除外する左レンズの特殊なメガネを着用する。 [0102] In an alternative embodiment, distributed filtering can be used to generate stereo anaglyphs. As is known in the art, stereo anaglyphs superimpose the left and right eye views of a scene to create a single image. Normally, different color filters are applied to the left eye pixel and the right eye pixel. For example, the right eye pixel can be filtered with a red pass filter, and the left eye pixel can be filtered with a blue / green pass filter. Due to the field of view or viewpoint offset between the left and right eye fields, the left and right eye pixels corresponding to the same point in the scene are at different positions in the anaglyph. Therefore, the naked eye can see an anaglyph double-distorted color image. In order to view the image accurately, the observer wears special glasses for the left lens that excludes the color used for the right eye pixel with a filter and the left lens that excludes the color used for the left eye pixel with a filter.

[0103]図４Ａを参照すると、シーンの右目視野を生成するためにＧＰＵ１２２（０）のレンダリングパイプライン（図示せず）を用い、シーンの左目視野を生成するためにＧＰＵ１２２（１）のレンダリングパイプライン（図示せず）を用いることができる。右目および左目視野用のレンダリングパラメータを決定するのに既知の技術を用いることができる。 [0103] Referring to FIG. 4A, the rendering pipeline (not shown) of GPU 122 (0) is used to generate the right eye view of the scene, and the rendering pipe of GPU 122 (1) is used to generate the left eye view of the scene. A line (not shown) can be used. Known techniques can be used to determine the rendering parameters for the right and left eye views.

[0104]右目画素Ｐ_ｉ１および左目画素Ｐ_ｉ０は、ＧＰＵ１２２（０）およびＧＰＵ１２２（１）のそれぞれのレンダリングパイプラインまたはそれぞれのディスプレイパイプライン２０２（０）および２０２（１）のいずれかにおいて、有利にカラーフィルタ処理される。一実施形態では、異なる赤、緑および青色成分を用いて画素色が特定される。右目画素は、例えば赤色成分をゼロまで減らし、緑および青色成分を不変のままにすることによりフィルタ処理可能である。同様に、左目画素は、緑および青色成分をゼロまで減らし、赤色成分を不変のままにすることによりフィルタ処理可能である。 [0104] The right eye pixel P _i1 and the left eye pixel P _i0 are advantageous in either the rendering pipeline of GPU 122 (0) and GPU 122 (1) or in the respective display pipelines 202 (0) and 202 (1). Color filter processing. In one embodiment, pixel colors are identified using different red, green and blue components. The right eye pixel can be filtered, for example, by reducing the red component to zero and leaving the green and blue components unchanged. Similarly, the left eye pixel can be filtered by reducing the green and blue components to zero and leaving the red component unchanged.

[0105]右目画素Ｐ_ｉ１は、ＧＰＵ１２２（１）のディスプレイヘッドＡ２０６ａに送られる。ディスプレイヘッド２０６（ａ）は、画素Ｐ_ｉ１をＭＩＯＡポート２１４ａ（１）に転送し、これが画素をＰ_ｏ１としてＧＰＵ１２２（０）のＭＩＯＡポート２１４ａ（０）へと送る。ディスプレイヘッド２０６ａ（０）は、このようにして右目画素を外部画素として受信する。 [0105] The right-eye pixel P _i1 is sent to the display head A 206a of the GPU 122 (1). The display head 206 (a) transfers the pixel P _i1 to the MIOA port 214a (1), which sends the pixel as P _{o1 to} the MIOA port 214a (0) of the GPU 122 (0). In this way, the display head 206a (0) receives the right eye pixel as an external pixel.

[0106]左目画素Ｐ_ｉ０は、内部画素としてディスプレイヘッド２０６ａ（０）に送られる。一般に、対応する左目画素および右目画素はシーンの異なる位置を再生するが、右目視野および左目視野を生成するのに用いられる視野域または視点オフセットによって、画素選択論理回路３００により処理される対応する左目画素および右目画素がアナグリフフレーム内の同じ位置にある画素となることに留意されたい。 [0106] The left-eye pixel P _i0 is sent to the display head 206a (0) as an internal pixel. In general, the corresponding left eye pixel and right eye pixel reproduce different positions in the scene, but the corresponding left eye processed by the pixel selection logic 300 depending on the field of view or viewpoint offset used to generate the right eye field and the left eye field. Note that the pixel and the right eye pixel are the pixels at the same position in the anaglyph frame.

[0107]一実施形態では、ディスプレイヘッド２０６ａ（０）が図３の画素選択論理回路３００を含んでいる。アナグリフを生成するために、除算回路３０６および３１４がともに１の除数を選択するように設定される。加算回路３０８は、左目画素Ｐ_ｉ０と右目画素Ｐ_ｉ１を加算し、初期のカラーフィルタリングの結果として、左目画素の赤色成分と右目画素の青色および緑色成分を有する合計画素をパス３１０上に生成する。選択マルチプレクサ３１２がパス３１０から合計画素を選択し、選択マルチプレクサ３１６がパス３１５上で混合画素を出力画素として選択する。 [0107] In one embodiment, the display head 206a (0) includes the pixel selection logic 300 of FIG. To generate an anaglyph, both divider circuits 306 and 314 are set to select a divisor of one. The addition circuit 308 adds the left eye pixel P _i0 and the right eye pixel P _i1 , and generates a total pixel on the path 310 having the red component of the left eye pixel and the blue and green components of the right eye pixel as a result of the initial color filtering. . The selection multiplexer 312 selects the total pixels from the path 310, and the selection multiplexer 316 selects the mixed pixels as output pixels on the path 315.

[0108]他の実施形態では、画素選択論理回路３００の前のカラーフィルタは用いない。例えば、選択が各色成分について独立に制御可能であるように選択マルチプレクサ３１２および／または３１６を設定可能である。このような一実施形態では、選択マルチプレクサ３１２がパス３０２から左目画素Ｐ_ｉ０の全ての色成分を通過し、選択マルチプレクサ３１６が左目画素Ｐ_ｉ０の赤色成分と右目画素Ｐ_ｉ１の青色および緑色成分を通過する。結果はパス３１８上の左目画素の赤色成分と右目画素の青色および緑色成分を有する出力画素となる。 [0108] In other embodiments, the color filter in front of the pixel selection logic circuit 300 is not used. For example, the selection multiplexers 312 and / or 316 can be set such that the selection can be controlled independently for each color component. In one such embodiment, the selection multiplexer 312 passes all the color components of the left eye pixel P _i0 from the path 302, and the selection multiplexer 316 passes the red component of the left eye pixel P _i0 and the blue and green components of the right eye pixel P _i1. pass. The result is an output pixel having a red component of the left eye pixel and a blue and green component of the right eye pixel on path 318.

[0109]当業者であれば、アナグリフレンダリングに３つ以上のＧＰＵを使用可能であることを理解することができる。４つのＧＰＵ（例えば図４Ｂ）の実施形態では、２つのＧＰＵを右目視野生成用に用い、２つのＧＰＵを左目視野生成用に用いることが可能である。各視野を生成する２つのＧＰＵは、アナグリフの画質を高めるために上述したような分散アンチエイリアシング技術を採用することが可能である。 [0109] One skilled in the art can appreciate that more than two GPUs can be used for anaglyph rendering. In an embodiment of four GPUs (eg, FIG. 4B), two GPUs can be used for right eye field generation and two GPUs can be used for left eye field generation. The two GPUs that generate each field of view can employ distributed anti-aliasing techniques as described above to enhance the image quality of anaglyphs.

[0110]ステレオアナグリフは、内部分散フィルタリングを用いてもレンダリング可能である。図４Ｂを参照すると、ＧＰＵ１２２（０）のレンダリングパイプライン（図示せず）が両方の視野を生成可能であり、左目視野をフレームバッファＡに、右目視野をフレームバッファＢに（あるいはその反対に）記憶する。ディスプレイパイプラインＢ４０２ｂおよびディスプレイヘッドＢ２０６ｂが右目視野を画素Ｐ_ＢとしてディスプレイヘッドＡ２０６ａに送り、ディスプレイパイプラインＡ４０２ａが左目視野を画素Ｐ_ＡとしてディスプレイヘッドＡ２０６ａに送る。ディスプレイヘッドＡ２０６ａでは、画素合成器３０８（図３）がアナグリフを生成するように適切に画素を混合する。 [0110] Stereo anaglyphs can also be rendered using internal distributed filtering. Referring to FIG. 4B, the rendering pipeline (not shown) of GPU 122 (0) can generate both views, the left eye view into frame buffer A and the right eye view into frame buffer B (or vice versa). Remember. Display pipeline _B 402b and display head _B 206b send right eye field of view as pixel P _B to display head A 206a, and display pipeline _A 402a sends left eye field of view as pixel PA to display head _A 206a. In display head A 206a, pixel combiner 308 (FIG. 3) mixes the pixels appropriately to produce an anaglyph.

[0111]フェードイン、フェードアウト、またはディゾルブ等の遷移効果を発生させるためにも分散フィルタリングを用いることが可能である。例えば、内部分散の場合、フレームバッファＢはフェードアウトする画像を記憶することができ、フレームバッファＡはフェードインする画像を記憶する。各フレームでは、画素合成器３０８が画素の相対的な重みをフレームバッファＡおよびフレームバッファＢから調整し、それによってフレームＡからの画像が徐々に最高強度まで上がり、フレームバッファＢからの画像がゼロ強度まで弱まる。（フレームバッファＢの画像がベタ色領域であったら効果はフェードインとなり、フレームバッファＡの画像がベタ色領域であったら効果はフェードアウトとなる。）遷移の平滑性は、一部には、画素合成器３０８が形成可能な画素Ｐ_Ａおよび画素Ｐ_Ｂの異なる加重平均の数に依存するが、これは設計上の選択の問題である。外部分散フィルタリングの複数のＧＰＵも同様の効果を達成するのに用いることが可能である。 [0111] Distributed filtering can also be used to generate transition effects such as fade-in, fade-out, or dissolve. For example, in the case of internal dispersion, the frame buffer B can store an image that fades out, and the frame buffer A stores an image that fades in. For each frame, the pixel synthesizer 308 adjusts the relative weights of the pixels from frame buffer A and frame buffer B, so that the image from frame A gradually increases to maximum intensity and the image from frame buffer B is zero. It weakens to strength. (If the image in the frame buffer B is a solid color area, the effect is fade-in, and if the image in the frame buffer A is a solid color area, the effect is fade-out.) depends on the number of different weighted average of the synthesizer 308 is capable of forming pixel P _a and the pixel P _B, which is a matter of design choice. Multiple GPUs with external distributed filtering can be used to achieve the same effect.

[0112]別の実施形態では、分散フィルタリングを各ディスプレイヘッドのルックアップテーブルと組み合わせて用いることにより、このような遷移効果を達成可能である。当分野で既知のように、ディスプレイヘッドは、内部画素表現をディスプレイ装置に適したカラー強度に変換するルックアップテーブルを含んでいることが多く、時には異なる値をルックアップテーブルにロードしたりリロードしたりすることが可能である。ルックアップテーブル内の値のカラー強度をあるフレームから次のフレームへと下げる（または上げる）ことによってフェードアウト（またはフェードイン）を達成可能である。故に、フレームバッファＢ内の画像からフレームバッファＡ内の画像へとディゾルブするには、従来のフェードアウトルックアップテーブルをディスプレイヘッドＢに適用することも可能であり、従来のフェードインルックアップテーブルをディスプレイヘッドＡに適用する。画素合成器３０８は、２つの画像を一定の（例えば等しい）重みで合成し、ディゾルブ効果を作り出す。 [0112] In another embodiment, such transition effects can be achieved by using distributed filtering in combination with a look-up table for each display head. As is known in the art, display heads often include a look-up table that converts the internal pixel representation to a color intensity suitable for the display device, sometimes loading and reloading different values into the look-up table. It is possible to Fade out (or fade in) can be achieved by lowering (or increasing) the color intensity of values in the lookup table from one frame to the next. Thus, to dissolve from the image in frame buffer B to the image in frame buffer A, a conventional fade-out look-up table can be applied to display head B, and the conventional fade-in look-up table is displayed. Applies to head A. The pixel synthesizer 308 synthesizes two images with a constant (eg, equal) weight to create a dissolve effect.

[0113]他の実施形態では、同じＧＰＵのディスプレイヘッド間の画素転送が、混合とは関係ないディスプレイ特性を実施するために用いられる。例えば、ディスプレイヘッド間の画素転送は、ＬＣＤオーバードライブ（当分野では「ＬＣＤフィードフォワード」または「応答時間補正」（ＲＴＣ）ともいう）機能を制御するために用いることが可能である。当分野で既知のように、ＬＣＤ画面は、画素を駆動する信号が部分的には所望の新たな強度また部分的には所望の新たな強度と前の強度との差分に基づいて、フレーム毎に調整される場合に、より速く応答するようにすることが可能である。 [0113] In other embodiments, pixel transfers between display heads of the same GPU are used to implement display characteristics that are independent of mixing. For example, pixel transfer between display heads can be used to control LCD overdrive (also referred to in the art as “LCD feedforward” or “response time correction” (RTC)) functions. As is known in the art, LCD screens are based on a frame-by-frame basis where the signal driving the pixel is based in part on the desired new intensity or in part on the difference between the desired new intensity and the previous intensity. It is possible to respond faster when adjusted.

[0114]ＬＣＤオーバードライブ機能を実施するには、フレームバッファＢが前の画像の画素を記憶するのに対し、フレームバッファＡを新たな画像の画素を記憶するように用いることが可能である。ディスプレイヘッドＢは前の画素値をディスプレイヘッドＡに送り、ディスプレイヘッドＡの画素合成器３０８は、例えば従来のＬＣＤオーバードライブ信号を計算するための記述を用いて新たな値および前の値に基づいてオーバードライブ値を計算するように設定可能である。 [0114] To implement the LCD overdrive function, the frame buffer B can store the pixels of the previous image, whereas the frame buffer A can be used to store the pixels of the new image. Display head B sends the previous pixel value to display head A, and pixel synthesizer 308 of display head A is based on the new value and the previous value using, for example, a description for calculating a conventional LCD overdrive signal. Can be set to calculate the overdrive value.

[0115]あるＧＰＵのディスプレイヘッド間の画素転送は、合成画像を生成するためにも用いることができる。例えば、フレームバッファＢは、フレームバッファＡに記憶された画像の一部に重ねられるオーバーレイ画像用の画素を含むことができる。ディスプレイヘッドＢがディスプレイヘッドＡにオーバーレイ画素を送り、ディスプレイヘッドＡの画素選択論理回路３００が、外部画素が選択されるオーバーレイ領域以外で内部画素を選択する。 [0115] Pixel transfer between display heads of a GPU can also be used to generate a composite image. For example, the frame buffer B can include pixels for an overlay image that is overlaid on a portion of the image stored in the frame buffer A. The display head B sends an overlay pixel to the display head A, and the pixel selection logic circuit 300 of the display head A selects an internal pixel outside the overlay region where the external pixel is selected.

[0116]このように、本発明を特定の実施形態に関連して説明してきたが、本発明が特許請求項の範囲内の全ての変形物および均等物を対象とするように意図されていることが理解されよう。 [0116] Thus, while the invention has been described with reference to specific embodiments, the invention is intended to cover all modifications and equivalents within the scope of the claims. It will be understood.

本発明の実施形態によるコンピュータシステムのブロック図である。1 is a block diagram of a computer system according to an embodiment of the present invention. 本発明の実施形態で使用可能なグラフィックス処理ユニット（ＧＰＵ）内の画素出力パスのブロック図である。FIG. 3 is a block diagram of a pixel output path within a graphics processing unit (GPU) that can be used with embodiments of the present invention. 本発明の実施形態で使用可能なＧＰＵのディスプレイヘッド内の画素選択論理回路のブロック図である。FIG. 3 is a block diagram of a pixel selection logic circuit in a GPU display head that can be used in embodiments of the present invention. 本発明の実施形態で使用可能なＧＰＵのディスプレイヘッド内の画素選択論理回路のブロック図である。FIG. 3 is a block diagram of a pixel selection logic circuit in a GPU display head that can be used in embodiments of the present invention. 本発明の実施形態による２つのＧＰＵを有するグラフィックスサブシステムのブロック図である。1 is a block diagram of a graphics subsystem having two GPUs according to an embodiment of the invention. FIG. 本発明の実施形態によるマスター／スレーブ形式で結合された２つのディスプレイヘッドを示すＧＰＵのブロック図である。FIG. 3 is a block diagram of a GPU showing two display heads combined in a master / slave format according to an embodiment of the present invention. 本発明のいくつかの実施形態で使用可能なサンプリングパターンを示す図である。FIG. 3 illustrates a sampling pattern that can be used in some embodiments of the present invention. 本発明のいくつかの実施形態で使用可能なサンプリングパターンを示す図である。FIG. 3 illustrates a sampling pattern that can be used in some embodiments of the present invention. 本発明のいくつかの実施形態で使用可能なサンプリングパターンを示す図である。FIG. 3 illustrates a sampling pattern that can be used in some embodiments of the present invention. 本発明の実施形態によるプリント基板カードとして実施され、外部画素転送パスを用いて構成されたグラフィックスアダプタである。1 is a graphics adapter implemented as a printed circuit board card according to an embodiment of the present invention and configured using an external pixel transfer path. 本発明の実施形態による内部画素転送パスを持ったＧＰＵのブロック図である。2 is a block diagram of a GPU having an internal pixel transfer path according to an embodiment of the present invention; FIG.

１００…コンピュータシステム、１０２…中央演算処理装置（ＣＰＵ）、１０４…システムメモリ、１０５…メモリブリッジ、１０６、１１３…バス、１０７…Ｉ／Ｏ（入力／出力）ブリッジ、１１２…グラフィックスサブシステム、１１０…ディスプレイ装置、１１４…システムディスク、１２２…グラフィックス処理装置（ＧＰＵ）、１２４…グラフィックスメモリ、２０２、４０２…ディスプレイパイプライン、２０６ａ、２０６ｂ…ディスプレイヘッド、２１０、２１１…デジタル出力ポート、２１２、２１３…アナログ出力ポート、２１４ａ、２１４ｂ…多目的入力／出力（ＭＩＯ）ポート、２２０…クロスバー、３００、３５０…画素選択論理回路、３０２、３５２…第１のパス、３０４、３５４…第２のパス、３０６、３５８…画素合成回路、３０８…第１の除算回路、３１０…加算回路、３１２…選択回路、３１４、３６６…第２の除算回路、３１６、３６０…パス、３１８…選択回路、３２０、３６４…出力パス、３６２…選択マルチプレクサ、４００…画素転送パス、４０４…フレームバッファＡ、４０６…フレームバッファＢ。

DESCRIPTION OF SYMBOLS 100 ... Computer system, 102 ... Central processing unit (CPU), 104 ... System memory, 105 ... Memory bridge, 106, 113 ... Bus, 107 ... I / O (input / output) bridge, 112 ... Graphics subsystem, DESCRIPTION OF SYMBOLS 110 ... Display apparatus, 114 ... System disk, 122 ... Graphics processing unit (GPU), 124 ... Graphics memory, 202, 402 ... Display pipeline, 206a, 206b ... Display head, 210, 211 ... Digital output port, 212 213 ... Analog output port, 214a, 214b ... Multi-purpose input / output (MIO) port, 220 ... Crossbar, 300, 350 ... Pixel selection logic, 302, 352 ... First pass, 304, 354 ... Second Path, 306, 358 Pixel synthesis circuit, 308... First division circuit, 310... Addition circuit, 312... Selection circuit, 314, 366... Second division circuit, 316, 360. 362 ... selection multiplexer, 400 ... pixel transfer path, 404 ... frame buffer A, 406 ... frame buffer B.

Claims

A display head for a graphics processor,
A first input path configured to propagate a gamma corrected first pixel generated by a first graphics processor;
A second input path configured to propagate a gamma corrected second pixel generated by the second graphics processor;
Coupled to the first input path and the second input path and configured to mix the gamma corrected first pixel and the gamma corrected second pixel to generate a mixed pixel. A pixel synthesizer,
And a selection circuit configured to select one of the gamma corrected first pixel, the gamma corrected second pixel, or the mixed pixel as an output pixel.

The graphics processor of claim 1, wherein the display head further comprises a divider circuit configured to divide the mixed pixel by a divisor.

The graphics processor of claim 2, wherein the divisor is selected from candidate divisors including 1 and 2 as divisors.

The graphics processor of claim 1, wherein the divisor is selected from candidate divisors including 1, 2, and 4 as divisors.

The graphics of claim 1, wherein the pixel synthesizer is configured to generate the mixed pixel by adding the gamma corrected first pixel and the gamma corrected second pixel. Processor.

The first pixel and the second pixel are gamma correction pixels;
The graphics processor of claim 1, wherein the pixel synthesizer is configured to generate the blended pixel by calculating a gamma correction blend of the first pixel and the second pixel.

The gamma correction approximation of mixing, the gamma-corrected first pixel _{P i,} the gamma-corrected second pixel when _{P e,} and the following formula _{_{(4P i + 4P e + |}} P i -P _e |) / 4
The graphics processor of claim 6, calculated by:

The pixel synthesizer may divide the gamma corrected first pixel using a divisor prior to mixing the gamma corrected first pixel and the gamma corrected second pixel. The graphics processor of claim 1, comprising a configured divider circuit.

A display pipeline configured to generate a gamma corrected first pixel;
An input port configured to receive a gamma corrected second pixel from a source of external pixels;
With a display head,
The display head is
A first input path coupled to the display pipeline and configured to receive the first gamma corrected pixel from the display pipeline;
A second input path coupled to the input port and configured to receive the second gamma corrected pixel from the input port;
Coupled to the first input path and the second input path and configured to mix the gamma corrected first pixel and the gamma corrected second pixel to generate a mixed pixel. A pixel synthesizer,
A graphics processor comprising: a selection circuit configured to select one of the gamma corrected first pixel, the gamma corrected second pixel, or the mixed pixel as an output pixel.

The graphics processor of claim 9, wherein the display pipeline includes a filter unit adapted to an anti-aliasing filter for a plurality of sample values associated with the gamma corrected first pixel.

Multiple output ports,
An external circuit coupled between the display head and a plurality of output ports;
The graphics processor of claim 9, wherein the external circuit is configured to selectively transfer an output pixel to one of the output ports.

The graphics processor of claim 11, wherein the plurality of output ports includes a first output port configured to be connected to an input port of another graphics processor.

A method for generating an image, comprising:
Rendering a first set of gamma corrected input pixels for an image using a first graphics processor;
Rendering a second set of gamma corrected input pixels for an image using a second graphics processor, each rendering of the first graphics processor and the second graphics processor The operation differs in at least one point and
Sending the first set of gamma corrected input pixels and the second set of gamma corrected input pixels to a first display head;
Mixing corresponding pixels of the first set of gamma corrected input pixels and the second set of gamma corrected input pixels with the first display head to generate a first set of output pixels. And a method comprising:

The method of claim 13, wherein the first display head is provided in the first graphics processor.

The method of claim 13, wherein a rendering operation of each of the first graphics processor and the second graphics processor is different with respect to a sampling pattern applied to each pixel.

The method of claim 13, wherein the rendering operation of each of the first graphics processor and the second graphics processor is different with respect to a field of view offset of the rendered image.

The rendering operation of each of the first graphics processor and the second graphics processor is such that the rendering operation by the first graphics processor generates a left eye field of stereo anaglyph, and the second graphics processor 14. The method of claim 13, wherein the rendering operation according to is different in that it produces a right eye field of stereo anaglyphs.

The method of claim 13, further comprising sending the first set of output pixels to a display device.