EP1971959A2 - Procédé et appareil de traitement de sous-blocs de données multimédia dans des systèmes de traitement en parallèle - Google Patents

Procédé et appareil de traitement de sous-blocs de données multimédia dans des systèmes de traitement en parallèle

Info

Publication number
EP1971959A2
EP1971959A2 EP07716561A EP07716561A EP1971959A2 EP 1971959 A2 EP1971959 A2 EP 1971959A2 EP 07716561 A EP07716561 A EP 07716561A EP 07716561 A EP07716561 A EP 07716561A EP 1971959 A2 EP1971959 A2 EP 1971959A2
Authority
EP
European Patent Office
Prior art keywords
blocks
data
image data
sub
rows
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
EP07716561A
Other languages
German (de)
English (en)
Inventor
Lazar Bivolarski
Bogdan Mitu
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Brightscale Inc
Original Assignee
Brightscale Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Brightscale Inc filed Critical Brightscale Inc
Publication of EP1971959A2 publication Critical patent/EP1971959A2/fr
Withdrawn legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F15/00Digital computers in general; Data processing equipment in general
    • G06F15/76Architectures of general purpose stored program computers
    • G06F15/80Architectures of general purpose stored program computers comprising an array of processing units with common control, e.g. single instruction multiple data processors
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T1/00General purpose image data processing
    • G06T1/20Processor architectures; Processor configuration, e.g. pipelining
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F15/00Digital computers in general; Data processing equipment in general
    • G06F15/76Architectures of general purpose stored program computers
    • G06F15/80Architectures of general purpose stored program computers comprising an array of processing units with common control, e.g. single instruction multiple data processors
    • G06F15/8007Architectures of general purpose stored program computers comprising an array of processing units with common control, e.g. single instruction multiple data processors single instruction multiple data [SIMD] multiprocessors
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/30Arrangements for executing machine instructions, e.g. instruction decode
    • G06F9/38Concurrent instruction execution, e.g. pipeline or look ahead
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5061Partitioning or combining of resources
    • G06F9/5066Algorithms for mapping a plurality of inter-dependent sub-tasks onto a plurality of physical CPUs
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/20Image preprocessing
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/176Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/42Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation
    • H04N19/436Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation using parallelised computational arrangements

Definitions

  • the invention relates generally to parallel processing. More specifically, the invention relates to methods and apparatuses for processing of multimedia data in parallel processing systems.
  • a method includes generating blocks of image data, wherein each of the blocks of image data are divided into sub-blocks, and a first data point of each sub-block flags a beginning position of the sub-block, and generating a block of type data
  • 354726-990400 1 Atty Dckt No.: 354726-5405 PATENT for each of the blocks of image data, wherein each of the blocks of type data contains the first data point for all of the sub-blocks in the block of image data.
  • a computer readable medium having computer executable instructions thereon for a method of processing in a parallel processing array having computing elements configured to process blocks of data of an image, the method including generating blocks of image data, wherein each of the blocks of image data are divided into sub-blocks, and a first data point of each sub-block flags a beginning position of the sub- block, and generating a block of type data for each of the blocks of image data, wherein each of the blocks of type data contains the first data point for all of the sub-blocks in the block of image data.
  • FIG. 1 conceptually illustrates macroblocks of a 1080i high definition (HD) frame.
  • FIGS. 2A-2B further illustrate the arrangement of blocks such as macroblocks within an image frame.
  • FIGS. 3A-3C illustrate the mapping of macroblocks from their arrangement within an image to individual parallel processors.
  • FIGS. 4A-4E illustrate the mapping of images to individual parallel processors, for various image formats.
  • FIGS. 5A-5B illustrate 16x8 mapping for mapping subdivisions of images to individual parallel processors.
  • FIGS. 6A-6B illustrate 16*4 mapping for mapping subdivisions of images to individual parallel processors.
  • FIGS. 7A-7C illustrate an alternative approach to mapping image blocks to parallel processors, in accordance with an embodiment of the present invention.
  • FIGS. 8A-8C illustrate further details of the data structure of an image format, including luma and chroma information.
  • FIGS. 9A-9C illustrate various alternative approaches to mapping multiple image blocks to parallel processors, in accordance with an embodiment of the present invention.
  • FIGS. lOA-lOC illustrate data block data locations, sub-block locations, sub-block flag data positions, and a block of type data, in accordance with an embodiment of the present invention.
  • FIGS. 1 IA-I IB illustrate algorithm processing steps and selection codes for identifying which processing steps are applied to which data variables.
  • FIG. 12 illustrates a parallel processor.
  • Like reference numerals refer to corresponding parts throughout the drawings.
  • this innovation relates to a more efficient method for the parallel processing of multimedia data. It is known that, in various image formats, the images are subdivided into blocks, with the "later" blocks, or those blocks that fall generally below and to the right of other blocks in the image as it is typically viewed in matrix form, dependent upon information from the "earlier" blocks, i.e. those images above and to the left of the later blocks.
  • the earlier blocks must be processed before the later ones, as the later ones require information, often called dependency data, from the earlier blocks. Accordingly, blocks (or portions thereof) are transmitted to various parallel processors, in the order of their dependency data. Earlier blocks are sent to the parallel processors first, with later blocks sent later.
  • the blocks are stored in the parallel processors in specific locations, and shifted around as necessary, so that every block, when it is processed, has its dependency data located in a specific set of earlier blocks with specified positions. In this manner, its dependency data can be retrieved with the same commands. That is, earlier blocks are shifted around so that later blocks can be processed with a single set of commands that instructs each processor to retrieve its dependency data from specific locations.
  • FIG. 1 conceptually illustrates an exemplary frame of an image, in its matrix form as it is typically viewed and/or stored in memory.
  • a 1080i HD image matrix 10 is subdivided into 68 lines of 120 macroblocks 12 each.
  • images such as this 108Oi frame are processed by individual macroblock 12.
  • one or more macroblocks 12 are processed by each computing element (or processor) of a parallel processing array.
  • the invention is often discussed in the context of the processing of macroblocks 12, it should be recognized that the invention includes the division of images and other data into any portions, often referred to as blocks, that can be processed in parallel.
  • the macroblocks of images such as the 1080i HD frame of FIG. 1 include dependency data, as further illustrated in FIGS. 2A-2B.
  • dependency data e.g., data required for interpolation, etc.
  • the processing of each block of an image requires dependency data from the block immediately to the left, as well as the block diagonally to the immediate upper left, the block immediately above, and the block diagonally to .the immediate upper right.
  • Block a therefore also depends upon information from blocks d and b, block b depends upon information from block d, and so forth, while block d does not depend on information from any other blocks. It can therefore be seen that parallel processing of these blocks requires processing in diagonals, with block d processed first, followed by blocks a and b as they depend upon information from block d, then blocks R and c as they depend upon information from blocks a, d, and b, and so forth.
  • FIG. 3 A illustrates the macroblock structure of an exemplary image, as the image appears to a viewer. As above, the blocks of FIG. 3 A are processed in an order that retains their dependency data for later blocks.
  • FIG. 3B illustrates the diagonals that must be processed, in the order they must be processed to preserve their dependency data for later blocks. Each row illustrates a separate diagonal, with each
  • block ( )o is processed first, as it is located in the uppermost left corner of the image, and thus has no dependency data.
  • Block Oo is processed next, and thus appears in the next row, as it requires dependency data only from block ( )o.
  • Blocks 11 and Io are processed next, and therefore appear in the following row, as block 1 1 requires dependency data from blocks ( )o and OQ, and block Io requires dependency data from block Oo. It can therefore be seen that each diagonal of blocks in FIG. 3A 5 highlighted by the dashed lines, can be mapped into rows of a parallel processing array as shown in FIG. 3B.
  • mapping blocks into rows of computing elements as shown in FIG. 3B preserves all required dependency data above each row, difficulties still exist. More specifically, the dependency data for each block is still often located in different positions relative to that block.
  • block 4i has dependency data located in the following blocks, in clockwise order: 3i, lo > 2o, and 3o-
  • these processors are located as shown by the arrows, with processors 3i, lo, 2 0 , and 3o arranged in an "L" shape above block 4
  • the dependency data for block 9 3 is located in blocks 83, 82, 1 2 , and 6 2 , which are arranged as shown by the arrows.
  • each computing element will require its own commands directing it to retrieve dependency data.
  • dependency data for each block is arranged differently for each block (as shown by blocks 4i and 9 3 )
  • separate data retrieval commands must be pushed to each processor, slowing down the speed at which images can be processed.
  • this problem is overcome by shifting the dependency data for each block prior to the processing of that block.
  • the dependency data can be shifted in any fashion.
  • FIG. 3C one convenient approach to shifting dependency data is illustrated in FIG. 3C, in which the blocks containing dependency data are shifted into the "L" shape described above. That is, when block X is processed, it requires dependency data from blocks A-D. Within the image, these blocks are located directly above X, to the immediate upper left, directly to the left, and to the immediate upper right, respectively. Within the parallel processing array, these blocks can then be shifted to two processor positions above X, three processor positions above, one processor position above, and the processor position to the immediate
  • FIGS. 4A-4E illustrate this point, showing how diagonals of various types of frames can be mapped into varying numbers of processor rows.
  • the diagonals of an HD frame can be mapped into consecutive rows of processors as shown, creating a trapezoidal (or alternately a rhomboid, or possibly even a combination of both) layout where 257 rows of processors are employed, with a maximum of 61 processors being used in a single row.
  • Smaller frames utilize fewer rows, and fewer processors.
  • a GIF frame utilizes 59 rows of processors, with a maximum of 19 processors employed in any row.
  • a 625 SD frame would occupy 117 rows, and a maximum of 36 processors per row, when mapped into a parallel processing array.
  • an SIF frame would occupy 51 rows, and 16 processors maximum per row, when mapped into the same array.
  • a 525 SD frame would occupy 107 rows, and 30 processors maximum per row.
  • the invention can be employed to
  • 354726-990400 6 Attv Dckt No.: 354726-5405 PATENT map any image to a parallel processing array, where data can be shifted within rows as described above, allowing for processing of blocks with a single command or command set.
  • FIGS. 5A-5B illustrate one such embodiment, in which blocks of an image are divided in two. Each of these divisions is then processed as above, except that each division is mapped into, and processed by, one half of a processor.
  • blocks are divided into a top half and a bottom half as shown. That is, the upper left hand block is divided into two sub-blocks, 0 and 2. Similarly, the block next to it is divided into sub-blocks 1 and 3, and so forth.
  • each sub-block behaves the same as a full block for dependency purposes, i.e., sub-block 1 requires dependency data only from block 0, the leftmost sub-block 2 requires dependency data from blocks 0 and 1 , etc.
  • these sub-blocks are then mapped into halves of processors as shown, with sub-blocks 0 and 1 mapped into the first row, sub-blocks 2 and sub-blocks 3 mapped into the second row, and so on.
  • the processes of the invention can then be employed in the same manner as above, with sub-blocks shifted along rows of processors as necessary.
  • FIG. 5B illustrates that its embodiment increases the number of processors utilized by one for every row: the first row utilizes one processor, the second row two, and so forth. The embodiment of FIGS. 5A-5B thus utilize more processors at a time, resulting in even faster processing.
  • FIGS. 6A-6B illustrate another such embodiment, in which blocks of an image are divided into four subdivisions.
  • the upper left block of an image is divided into sub-blocks 0, 2, 4, and 6.
  • These sub-blocks are then mapped into portions of a processor in the order required by their dependency data. That is, each processor can be divided into four "sub-rows" each capable of processing a row of sub-blocks.
  • the various sub-blocks are described below.
  • 354726-990400 7 Attv Dckt No.: 354726-5405 PATENT can then be mapped into the sub-rows of the processors as shown.
  • the 0, 1, 2, and 3 sub-blocks can all be mapped into two processors in the first row (with the first processor processing sub-blocks 0, 1, one 2 sub-block, and one 3 sub-block, and the second processor processing the other 2 and 3 sub-blocks), and processed accordingly.
  • this embodiment employs two processors in the first row instead of one, and that the number of processors grows by two per row, thus allowing even more processors to be utilized per row.
  • the invention also encompasses the division of blocks and processors into 16 subdivisions.
  • the invention includes the processing of multiple blocks "side by side,” i.e., the processing of multiple blocks per row.
  • FIGS. 7A-7C illustrate both these concepts.
  • FIG. 7 A illustrates the division of a block into 16 sub-blocks ( )o — 8o, as shown.
  • FIG. 7B illustrates the fact that unrelated blocks, i.e. blocks that do not require dependency data from each other, can be processed in parallel. Each block is divided as in FIG. 7A, with sub-blocks shown without subscripts for simplicity.
  • the first block is divided into 16 sub-blocks labeled 0 through 9, with like numbers processed simultaneously as above. So long as the blocks in each row do not require dependency data from each other, they can be processed together, in the same row. Accordingly, one group of processors can process multiple unrelated blocks simultaneously. For example, the top row of four blocks in FIG. 7B (with sub-blocks labeled 0-9, 10-19, 20-29, and 30-39, respectively) can be processed in a single set of processors.
  • FIG. 7C a chart of processors (numbered along the left hand side) and the corresponding sub-blocks loaded into them, illustrates this point.
  • sub-blocks 0-9 can be loaded into subdivisions of processors 0-9 (where processors are labeled along the left hand side) to form the diamond-like pattern shown. Further blocks can then be loaded into overlapping sets of processors, with sub-blocks 10-19 loaded into processors 4-13, etc. In this manner, both further subdivisions of blocks, as well as the "chaining" of multiple blocks into overlapping sets of processors, allows mote processors to be utilized more quickly, yielding faster processing.
  • Figs. 7A-7C illustrate four by four processing. It should be understood that this same technique can be implemented in a eight by eight processing as well.
  • the invention encompasses the separate processing of intensity information, luma information, and chroma information from the same block. That is, intensity information from one block can be processed separately from the luma information from that block, which can be processed separately from the chroma information from that block.
  • intensity information from one block can be processed separately from the luma information from that block, which can be processed separately from the chroma information from that block.
  • luma and chroma information can be mapped to processors and processed as above (i.e., shifted as necessary, etc.), and can also be subdivided, with subdivisions mapped to different processors, for increased efficiency in processing.
  • FIGS. 8A-8C illustrate this. In FIG.
  • one block of luma data can be mapped to one processor, with the corresponding "half-block" of chroma data mapped to the same processor or a different one.
  • the intensity, luma, and chroma data can be mapped to adjacent sets of processors, perhaps in at least partially overlapping sets of rows, similar to FIG. 7B.
  • the luma and chroma information can also be divided into sub-blocks, for processing in subdivisions of individual computing elements, as described in connection with FIGS. 5A-5B, and 6A-6B.
  • FIGS, 8B-8C illustrate the division of one frame's luma and chroma data into two and four sub-blocks, respectively. The two sub-blocks of FIG.
  • FIG. 8B can then be processed in different halves of processors, as described in connection with FIGS. 5A-5B.
  • the four sub-blocks of FIG. 8C can be processed in different quarters of processors, like that described in FIGS. 6A-6B.
  • FIGS. 9A-9C which conceptually illustrate processors occupied by various blocks, describe embodiments of the latter concept.
  • rows of processors extend along the vertical axis, while columns extend along the horizontal axis.
  • regions 100-104 regions 100-104.
  • the region(s) 104 do not occupy many processors, thus reducing the overall utilization of the processing array. This can be at least partially remedied by processing another block of data right below the block that occupies
  • 354726-990400 9 Any Dckt No.: 354726-540.5 PATENT regions 100-104.
  • This block can occupy regions 106-112, allowing more processors to be utilized, particularly in the "transition" regions 104-106 between subsequent blocks. In this manner, processing can be accomplished quicker and with more array utilization than if users were to process the block of regions 106-112 only after processing of the block in regions 100-104 was completed.
  • FIGS. 9B-9C illustrate further extensions of this concept.
  • this vertical "chaining" of mapped blocks can be continued over two or more blocks, resulting in significantly higher array utilization.
  • blocks can be mapped into adjacent columns one after another, with regions 116-120 occupied by one block, regions 122-126 occupied by another block, etc.
  • rhomboid shapes can be used instead of or in conjunction with the trapezoidal shapes.
  • any combination of mappings of different formats could be achieved by different sizes or combinations of rhomboids and/or trapezoids to facilitate the processing of multiple streams simultaneously.
  • the invention contemplates use by any parallel processor having multiple computing elements capable of each processing a block of image data, and shifting such data to preserve dependencies. While many such parallel processors are contemplated, one suitable example is described in U.S. Patent Application No. 11/584,480 entitled "Integrated Processor Array, Instruction Sequencer And I/O Controller," filed on October 19, 2006, the disclosure of which is hereby incorporated by reference in its entirety and for all purposes.
  • FIGS. 10A- 1OC illustrate the innovations relating to sub-block parallel processing.
  • each macroblock 12 is a matrix of 16 rows by 16 columns (16x16) of data bits (i.e. pixels), broken up into 4 or more sub-blocks 20. Specifically, each matrix is broken into at least four equal quadrant sub-blocks 20 that are 8x8 in size. Each quadrant sub-block 20 can be further broken up into sub-blocks 20 having sizes that are 8x4, 4x8 and 4x4. Thus, any given block 12 can be broken up into sub-blocks 20 having sizes that are 8x8, 4x8, 8x4 and 4x4.
  • FIG. 1OA illustrates a block 12 with one 8x8 sub-block 20a, two 4x8 sub-blocks 20b, two 8x4 sub-blocks 20c, and four 4x4 sub-blocks 2Od.
  • the numbers of each sized sub- block 20, if any, can vary, as well as their locations within the block 12. Further, the numbers and locations of the various sized sub-blocks 20 can vary from block 12 to block 12.
  • FIG. 1OB illustrates the block 12, and shows the sixteen data locations 22 that could possibly form the first data location for any given sub-block 20 (first meaning the most upper left entry of the sub-block 20). For each block 12, these sixteen positions 22 will contain the data necessary to flag whether this data position constitutes the first entry of a new sub-block 20.
  • this position is considered the starting point of a data-block 20, and the position to its immediate left (if any) is considered the last column of the sub-block 20 immediately to the left, and the position immediately above (if any) is considered the last row of the sub-block 20 immediately above. If it is not flagged, then this entry signifies a continuation of a same sub-block 20. Thus, it can be seen that these sixteen flag data locations 22 contain all the data necessary to determine the locations and sizes of the sub- blocks 20.
  • FIG. 1OC illustrates the type data block according to this innovation, where a block of type data 24, which has a 16x4 size, is associated with each block 12.
  • the four rows of block 24 correspond to the four rows in the block 12 that contain the flag data positions 22.
  • 354726-990400 1 1 Attv Dckt No.: 354726-5405 PATENT analysis of the block 12 is needed for this purpose.
  • remaining data positions in the block 20 can be used to store other data, such as sub-block type (I-locally predicted, P- predicted with motion vectors, and B-bidirectionally predicted), block vectors, etc.
  • sub-block type I-locally predicted, P- predicted with motion vectors, and B-bidirectionally predicted
  • block vectors etc.
  • Another source of parallel processing optimization involves simultaneously processing algorithms having certain similarities (e.g. similar calculations).
  • Computer processing involves two basic calculations: numerical computations and data movements. These calculations are achieved by processing algorithms that either compute the numerical computations or move (or copy) the desired data to a new location.
  • Such algorithms are traditionally processing using a series of "IF" statements, where if a certain criteria is met, then a one calculation is made, whereas if not then either that calculation is not made or a different calculation is made. By navigating through a plurality of IF statements, the desired total calculation is performed in each data.
  • IF IF
  • the same code (algorithm) is generally applied to all data, and only the selection codes need to be tailored for each data to determine how each calculation is made.
  • the advantage here is that if plural data are being processed in which many of the processing steps are the same, then applying one algorithm code with both the calculations in common and those that are not in common simplifies the system. In order to apply this
  • Figs. 1 IA and 1 IB illustrate an example of the above described concept.
  • This example involves bilinear filters used to generate intermediate values between pixels, in which certain number computations are made (although this technique can be used for any data algorithms).
  • the algorithms need to compute the various values use the same basic set of numerical additions and data shifting steps, but the order and numbering of these steps differ based upon the computation being made.
  • the first computation for the 1/2 and 3/4 Bi-Cubic equation is the number 53, which requires 7 computation steps to make.
  • the second computation is the number 18, which requires 6 computation steps, four of which are in common with, and in the same order as, the same four steps as they occur in the previous computation.
  • the last two computations for the first equation again have overlapping computation steps with the first two calculations.
  • Additional computations for 1/2 Bi-Cubic equation, as well as the three Bi-Linear equations of Fig. 1 IB all involve various combinations of the same calculation steps, and all have four computations to make.
  • a selection code of "0011" dictates that the step will only be applied to the third and fourth variables, but not the first and second variables.
  • the second step is applied only to the second variable, as dictated by the selection code "0100".
  • the same methodology is applied for all the steps and variables of all the equations using the selection codes shown.
  • the advantage of using selection codes is that instead of generating twenty algorithm codes to make the twenty various computations illustrated in Figs. 1 IA and 1 IB (or at the very least eight different algorithm codes to make the eight distinct numerical
  • FIGs. 1 IA and 1 IB illustrate the use of selection codes for a data computation application
  • selection codes used for selectively dictating which algorithm steps to apply to data is equally applicable for algorithms used to move data.
  • the invention can be employed to process any subdivisions of any image format. That is, the invention can process in parallel images of any format, whether they be 1080i HD images, CIF images, SIF images, or any other. These images can also be broken into any subdivisions, whether they be macroblocks of an image, or any other.
  • any image data can be so processed, whether it be intensity information, luma information, chroma information, or any other.
  • the embodiments were chosen and described in order to best explain the principles of the invention and its practical applications, to thereby enable others skilled in the art to best utilize the invention and various embodiments with various modifications as are suited to the particular use contemplated.
  • the present invention can be embodied in the form of methods and apparatus for practicing those methods.
  • the present invention can also be embodied in the form of program code embodied in tangible media, such as floppy diskettes, CD-ROMs, hard drives, firmware, or any other machine-readable storage medium, wherein, when the
  • PATENT program code is loaded into and executed by a machine, such as a computer, the machine becomes an apparatus for practicing the invention.
  • the present invention can also be embodied in the form of program code, for example, whether stored in a storage medium, loaded into and/or executed by a machine, or transmitted over some transmission medium, such as over electrical wiring or cabling, through fiber optics, or via electromagnetic radiation, wherein, when the program code is loaded into and executed by a machine, such as a computer, the machine becomes an apparatus for practicing the invention.
  • the program code segments combine with the processor to provide a unique device that operates analogously to specific logic circuits.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Physics & Mathematics (AREA)
  • Computer Hardware Design (AREA)
  • Software Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Computing Systems (AREA)
  • Signal Processing (AREA)
  • Image Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)
  • Multi Processors (AREA)
  • Image Input (AREA)

Abstract

L'invention concerne un procédé et un dispositif efficaces dans le traitement en parallèle de sous-blocs de données. Un réseau de traitement parallèle comprend des éléments de calcul configurés pour traiter des blocs de données d'une image en parallèle. Des blocs de données d'image sont produites, chacun desdits blocs de données d'image est divisé en sous-blocs, un premier point de données de chaque sous-bloc marquant une position de commencement du sous-bloc. Un bloc de données types est créé pour chacun des blocs de données d'image. Chacun des blocs des données types renferme le premier point de données pour tous les sous-blocs dans le bloc de données d'image, de telle manière que les numéros et les emplacements de tous les sous-blocs dans chaque bloc de données d'image peuvent être déterminés sans avoir, d'abord, à traiter le bloc de données d'image.
EP07716561A 2006-01-10 2007-01-10 Procédé et appareil de traitement de sous-blocs de données multimédia dans des systèmes de traitement en parallèle Withdrawn EP1971959A2 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US75806506P 2006-01-10 2006-01-10
PCT/US2007/000771 WO2007082042A2 (fr) 2006-01-10 2007-01-10 Procédé et appareil de traitement de sous-blocs de données multimédia dans des systèmes de traitement en parallèle

Publications (1)

Publication Number Publication Date
EP1971959A2 true EP1971959A2 (fr) 2008-09-24

Family

ID=38257031

Family Applications (3)

Application Number Title Priority Date Filing Date
EP07716562A Withdrawn EP1971956A2 (fr) 2006-01-10 2007-01-10 Procédé et appareil d'ordonnancement du traitement de données multimédia dans des systèmes de traitement en parallèle
EP07716561A Withdrawn EP1971959A2 (fr) 2006-01-10 2007-01-10 Procédé et appareil de traitement de sous-blocs de données multimédia dans des systèmes de traitement en parallèle
EP07716563A Withdrawn EP1971958A2 (fr) 2006-01-10 2007-01-10 Procédé et appareil de traitement d'étapes d'algorithme de données multimédia dans des systèmes de traitement en parallèle

Family Applications Before (1)

Application Number Title Priority Date Filing Date
EP07716562A Withdrawn EP1971956A2 (fr) 2006-01-10 2007-01-10 Procédé et appareil d'ordonnancement du traitement de données multimédia dans des systèmes de traitement en parallèle

Family Applications After (1)

Application Number Title Priority Date Filing Date
EP07716563A Withdrawn EP1971958A2 (fr) 2006-01-10 2007-01-10 Procédé et appareil de traitement d'étapes d'algorithme de données multimédia dans des systèmes de traitement en parallèle

Country Status (7)

Country Link
US (4) US20070189618A1 (fr)
EP (3) EP1971956A2 (fr)
JP (3) JP2009523293A (fr)
KR (3) KR20080085189A (fr)
CN (3) CN101371264A (fr)
TW (3) TW200806039A (fr)
WO (3) WO2007082043A2 (fr)

Families Citing this family (32)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7383421B2 (en) * 2002-12-05 2008-06-03 Brightscale, Inc. Cellular engine for a data processing system
US7451293B2 (en) * 2005-10-21 2008-11-11 Brightscale Inc. Array of Boolean logic controlled processing elements with concurrent I/O processing and instruction sequencing
KR20080085189A (ko) * 2006-01-10 2008-09-23 브라이트스케일, 인크. 병렬 프로세싱 시스템에서 멀티미디어 데이터의 알고리즘단계들을 프로세싱하기 위한 방법 및 장치
US8976870B1 (en) * 2006-08-30 2015-03-10 Geo Semiconductor Inc. Block and mode reordering to facilitate parallel intra prediction and motion vector prediction
US20080059763A1 (en) * 2006-09-01 2008-03-06 Lazar Bivolarski System and method for fine-grain instruction parallelism for increased efficiency of processing compressed multimedia data
WO2008027567A2 (fr) * 2006-09-01 2008-03-06 Brightscale, Inc. Machine parallèle d'une seule pièce
US20080244238A1 (en) * 2006-09-01 2008-10-02 Bogdan Mitu Stream processing accelerator
US20080059467A1 (en) * 2006-09-05 2008-03-06 Lazar Bivolarski Near full motion search algorithm
US8165224B2 (en) * 2007-03-22 2012-04-24 Research In Motion Limited Device and method for improved lost frame concealment
US8996846B2 (en) 2007-09-27 2015-03-31 Nvidia Corporation System, method and computer program product for performing a scan operation
US8284188B1 (en) 2007-10-29 2012-10-09 Nvidia Corporation Ray tracing system, method, and computer program product for simultaneously traversing a hierarchy of rays and a hierarchy of objects
US8264484B1 (en) 2007-10-29 2012-09-11 Nvidia Corporation System, method, and computer program product for organizing a plurality of rays utilizing a bounding volume
US8065288B1 (en) 2007-11-09 2011-11-22 Nvidia Corporation System, method, and computer program product for testing a query against multiple sets of objects utilizing a single instruction multiple data (SIMD) processing architecture
US8661226B2 (en) 2007-11-15 2014-02-25 Nvidia Corporation System, method, and computer program product for performing a scan operation on a sequence of single-bit values using a parallel processor architecture
US8243083B1 (en) 2007-12-04 2012-08-14 Nvidia Corporation System, method, and computer program product for converting a scan algorithm to a segmented scan algorithm in an operator-independent manner
US8773422B1 (en) 2007-12-04 2014-07-08 Nvidia Corporation System, method, and computer program product for grouping linearly ordered primitives
CN102957914B (zh) 2008-05-23 2016-01-06 松下知识产权经营株式会社 图像解码装置、图像解码方法、图像编码装置、以及图像编码方法
US8340194B2 (en) * 2008-06-06 2012-12-25 Apple Inc. High-yield multi-threading method and apparatus for video encoders/transcoders/decoders with dynamic video reordering and multi-level video coding dependency management
WO2010052837A1 (fr) * 2008-11-10 2010-05-14 パナソニック株式会社 Dispositif de décodage d'image, procédé de décodage d'image, circuit intégré, et programme
KR101010954B1 (ko) * 2008-11-12 2011-01-26 울산대학교 산학협력단 오디오 데이터 처리방법 및 이를 적용한 오디오 데이터 처리장치
US8321492B1 (en) 2008-12-11 2012-11-27 Nvidia Corporation System, method, and computer program product for converting a reduction algorithm to a segmented reduction algorithm
KR101673186B1 (ko) * 2010-06-09 2016-11-07 삼성전자주식회사 매크로블록의 연관관계를 고려하여 영상 데이터의 부호화 및 복호화를 병렬 처리하는 장치 및 방법
KR101698797B1 (ko) * 2010-07-27 2017-01-23 삼성전자주식회사 영상 데이터를 분할하여 부호화 및 복호화를 병렬 처리하는 장치 및 상기 장치의 동작 방법
EP2606424A4 (fr) * 2010-08-17 2014-10-29 Massively Parallel Tech Inc Système et procédé d'exécution d'applications de calcul haute performance
US9262166B2 (en) * 2011-11-30 2016-02-16 Intel Corporation Efficient implementation of RSA using GPU/CPU architecture
US9172923B1 (en) * 2012-12-20 2015-10-27 Elemental Technologies, Inc. Sweep dependency based graphics processing unit block scheduling
US9747563B2 (en) 2013-11-27 2017-08-29 University-Industry Cooperation Group Of Kyung Hee University Apparatus and method for matching large-scale biomedical ontologies
KR101585980B1 (ko) * 2014-04-11 2016-01-19 전자부품연구원 멀티-프로세서의 공유 메모리를 적극 활용한 cr 알고리즘 처리 방법 및 이를 적용한 프로세서
US20160119649A1 (en) * 2014-10-22 2016-04-28 PathPartner Technology Consulting Pvt. Ltd. Device and Method for Processing Ultra High Definition (UHD) Video Data Using High Efficiency Video Coding (HEVC) Universal Decoder
CN105991250B (zh) * 2015-02-10 2020-08-07 华为技术有限公司 一种基站、用户终端及载波调度指示方法和装置
CN108182579B (zh) * 2017-12-18 2020-12-18 东软集团股份有限公司 用于规则判断的数据处理方法、装置、存储介质和设备
CN115756841B (zh) * 2022-11-15 2023-07-11 重庆数字城市科技有限公司 一种基于并行处理高效数据生成系统及方法

Family Cites Families (108)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3308436A (en) * 1963-08-05 1967-03-07 Westinghouse Electric Corp Parallel computer system control
US4212076A (en) * 1976-09-24 1980-07-08 Giddings & Lewis, Inc. Digital computer structure providing arithmetic and boolean logic operations, the latter controlling the former
US4575818A (en) * 1983-06-07 1986-03-11 Tektronix, Inc. Apparatus for in effect extending the width of an associative memory by serial matching of portions of the search pattern
JPS6224366A (ja) * 1985-07-03 1987-02-02 Hitachi Ltd ベクトル処理装置
US4907148A (en) * 1985-11-13 1990-03-06 Alcatel U.S.A. Corp. Cellular array processor with individual cell-level data-dependent cell control and multiport input memory
US4783738A (en) * 1986-03-13 1988-11-08 International Business Machines Corporation Adaptive instruction processing by array processor having processor identification and data dependent status registers in each processing element
GB2211638A (en) * 1987-10-27 1989-07-05 Ibm Simd array processor
US4873626A (en) * 1986-12-17 1989-10-10 Massachusetts Institute Of Technology Parallel processing system with processor array having memory system included in system memory
US5122984A (en) * 1987-01-07 1992-06-16 Bernard Strehler Parallel associative memory system
US4943909A (en) * 1987-07-08 1990-07-24 At&T Bell Laboratories Computational origami
EP0309669B1 (fr) * 1987-09-30 1992-12-30 Siemens Aktiengesellschaft Procédé de réduction de données d'images se basant sur des modèles de scènes pour signaux de télévision numériques
US4876644A (en) * 1987-10-30 1989-10-24 International Business Machines Corp. Parallel pipelined processor
US4983958A (en) * 1988-01-29 1991-01-08 Intel Corporation Vector selectable coordinate-addressable DRAM array
US5241635A (en) * 1988-11-18 1993-08-31 Massachusetts Institute Of Technology Tagged token data processing system with operand matching in activation frames
AU624205B2 (en) * 1989-01-23 1992-06-04 General Electric Capital Corporation Variable length string matcher
US5497488A (en) * 1990-06-12 1996-03-05 Hitachi, Ltd. System for parallel string search with a function-directed parallel collation of a first partition of each string followed by matching of second partitions
US5319762A (en) * 1990-09-07 1994-06-07 The Mitre Corporation Associative memory capable of matching a variable indicator in one string of characters with a portion of another string
EP0485690B1 (fr) * 1990-11-13 1999-05-26 International Business Machines Corporation Système à processeur associatif parallèle
US5963746A (en) * 1990-11-13 1999-10-05 International Business Machines Corporation Fully distributed processing memory element
US5765011A (en) * 1990-11-13 1998-06-09 International Business Machines Corporation Parallel processing system having a synchronous SIMD processing with processing elements emulating SIMD operation using individual instruction streams
US5150430A (en) * 1991-03-15 1992-09-22 The Board Of Trustees Of The Leland Stanford Junior University Lossless data compression circuit and method
US5228098A (en) * 1991-06-14 1993-07-13 Tektronix, Inc. Adaptive spatio-temporal compression/decompression of video image signals
US5706290A (en) * 1994-12-15 1998-01-06 Shaw; Venson Method and apparatus including system architecture for multimedia communication
US5373290A (en) * 1991-09-25 1994-12-13 Hewlett-Packard Corporation Apparatus and method for managing multiple dictionaries in content addressable memory based data compression
US5640582A (en) * 1992-05-21 1997-06-17 Intel Corporation Register stacking in a computer system
US5450599A (en) * 1992-06-04 1995-09-12 International Business Machines Corporation Sequential pipelined processing for the compression and decompression of image data
US5288593A (en) * 1992-06-24 1994-02-22 Eastman Kodak Company Photographic material and process comprising a coupler capable of forming a wash-out dye (Q/Q)
US5818873A (en) * 1992-08-03 1998-10-06 Advanced Hardware Architectures, Inc. Single clock cycle data compressor/decompressor with a string reversal mechanism
US5440753A (en) * 1992-11-13 1995-08-08 Motorola, Inc. Variable length string matcher
US5446915A (en) * 1993-05-25 1995-08-29 Intel Corporation Parallel processing system virtual connection method and apparatus with protection and flow control
JPH07114577A (ja) * 1993-07-16 1995-05-02 Internatl Business Mach Corp <Ibm> データ検索装置、データ圧縮装置及び方法
US6073185A (en) * 1993-08-27 2000-06-06 Teranex, Inc. Parallel data processor
US5490264A (en) * 1993-09-30 1996-02-06 Intel Corporation Generally-diagonal mapping of address space for row/column organizer memories
US6085283A (en) * 1993-11-19 2000-07-04 Kabushiki Kaisha Toshiba Data selecting memory device and selected data transfer device
US5602764A (en) * 1993-12-22 1997-02-11 Storage Technology Corporation Comparing prioritizing memory for string searching in a data compression system
US5758176A (en) * 1994-09-28 1998-05-26 International Business Machines Corporation Method and system for providing a single-instruction, multiple-data execution unit for performing single-instruction, multiple-data operations within a superscalar data processing system
US5631849A (en) * 1994-11-14 1997-05-20 The 3Do Company Decompressor and compressor for simultaneously decompressing and compressng a plurality of pixels in a pixel array in a digital image differential pulse code modulation (DPCM) system
US5682491A (en) * 1994-12-29 1997-10-28 International Business Machines Corporation Selective processing and routing of results among processors controlled by decoding instructions using mask value derived from instruction tag and processor identifier
US6128720A (en) * 1994-12-29 2000-10-03 International Business Machines Corporation Distributed processing array with component processors performing customized interpretation of instructions
US5867726A (en) * 1995-05-02 1999-02-02 Hitachi, Ltd. Microcomputer
US5926642A (en) * 1995-10-06 1999-07-20 Advanced Micro Devices, Inc. RISC86 instruction set
US6317819B1 (en) * 1996-01-11 2001-11-13 Steven G. Morton Digital signal processor containing scalar processor and a plurality of vector processors operating from a single instruction
US5963210A (en) * 1996-03-29 1999-10-05 Stellar Semiconductor, Inc. Graphics processor, system and method for generating screen pixels in raster order utilizing a single interpolator
US5828593A (en) * 1996-07-11 1998-10-27 Northern Telecom Limited Large-capacity content addressable memory
US5867598A (en) * 1996-09-26 1999-02-02 Xerox Corporation Method and apparatus for processing of a JPEG compressed image
US6212237B1 (en) * 1997-06-17 2001-04-03 Nippon Telegraph And Telephone Corporation Motion vector search methods, motion vector search apparatus, and storage media storing a motion vector search program
US5909686A (en) * 1997-06-30 1999-06-01 Sun Microsystems, Inc. Hardware-assisted central processing unit access to a forwarding database
US5951672A (en) * 1997-07-02 1999-09-14 International Business Machines Corporation Synchronization method for work distribution in a multiprocessor system
EP0905651A3 (fr) * 1997-09-29 2000-02-23 Canon Kabushiki Kaisha Procédé et appareil de traitement d'images
US6089453A (en) * 1997-10-10 2000-07-18 Display Edge Technology, Ltd. Article-information display system using electronically controlled tags
US6167502A (en) * 1997-10-10 2000-12-26 Billions Of Operations Per Second, Inc. Method and apparatus for manifold array processing
US6226710B1 (en) * 1997-11-14 2001-05-01 Utmc Microelectronic Systems Inc. Content addressable memory (CAM) engine
US6101592A (en) * 1998-12-18 2000-08-08 Billions Of Operations Per Second, Inc. Methods and apparatus for scalable instruction set architecture with dynamic compact instructions
US6145075A (en) * 1998-02-06 2000-11-07 Ip-First, L.L.C. Apparatus and method for executing a single-cycle exchange instruction to exchange contents of two locations in a register file
US6295534B1 (en) * 1998-05-28 2001-09-25 3Com Corporation Apparatus for maintaining an ordered list
US6088044A (en) * 1998-05-29 2000-07-11 International Business Machines Corporation Method for parallelizing software graphics geometry pipeline rendering
US6119215A (en) * 1998-06-29 2000-09-12 Cisco Technology, Inc. Synchronization and control system for an arrayed processing engine
EP0992916A1 (fr) * 1998-10-06 2000-04-12 Texas Instruments Inc. Processeur de signaux numériques
US6269354B1 (en) * 1998-11-30 2001-07-31 David W. Arathorn General purpose recognition e-circuits capable of translation-tolerant recognition, scene segmentation and attention shift, and their application to machine vision
US6173386B1 (en) * 1998-12-14 2001-01-09 Cisco Technology, Inc. Parallel processor with debug capability
FR2788873B1 (fr) * 1999-01-22 2001-03-09 Intermec Scanner Technology Ct Procede et dispositif de detection de segments de droites dans un flot de donnees numeriques representatives d'une image, dans lequel sont identifies les points contours de ladite image
WO2000062182A2 (fr) * 1999-04-09 2000-10-19 Clearspeed Technology Limited Appareil de traitement de donnees parallele
US6542989B2 (en) * 1999-06-15 2003-04-01 Koninklijke Philips Electronics N.V. Single instruction having op code and stack control field
US6611524B2 (en) * 1999-06-30 2003-08-26 Cisco Technology, Inc. Programmable data packet parser
AU6175500A (en) * 1999-07-30 2001-02-19 Indinell Sociedad Anonima Method and apparatus for processing digital images and audio data
US6745317B1 (en) * 1999-07-30 2004-06-01 Broadcom Corporation Three level direct communication connections between neighboring multiple context processing elements
US7072398B2 (en) * 2000-12-06 2006-07-04 Kai-Kuang Ma System and method for motion vector generation and analysis of digital video clips
US20020107990A1 (en) * 2000-03-03 2002-08-08 Surgient Networks, Inc. Network connected computing system including network switch
GB0019341D0 (en) * 2000-08-08 2000-09-27 Easics Nv System-on-chip solutions
US6898304B2 (en) * 2000-12-01 2005-05-24 Applied Materials, Inc. Hardware configuration for parallel data processing without cross communication
US6772268B1 (en) * 2000-12-22 2004-08-03 Nortel Networks Ltd Centralized look up engine architecture and interface
US7013302B2 (en) * 2000-12-22 2006-03-14 Nortel Networks Limited Bit field manipulation
US20020133688A1 (en) * 2001-01-29 2002-09-19 Ming-Hau Lee SIMD/MIMD processing on a reconfigurable array
US20020159466A1 (en) * 2001-02-14 2002-10-31 John Rhoades Lookup engine
US6985633B2 (en) * 2001-03-26 2006-01-10 Ramot At Tel Aviv University Ltd. Device and method for decoding class-based codewords
US6782054B2 (en) * 2001-04-20 2004-08-24 Koninklijke Philips Electronics, N.V. Method and apparatus for motion vector estimation
JP2003069535A (ja) * 2001-06-15 2003-03-07 Mitsubishi Electric Corp 誤り訂正多重化装置、誤り訂正多重分離装置、これらを用いた光伝送システムおよび誤り訂正多重化伝送方法
US6760821B2 (en) * 2001-08-10 2004-07-06 Gemicer, Inc. Memory engine for the inspection and manipulation of data
US7383421B2 (en) * 2002-12-05 2008-06-03 Brightscale, Inc. Cellular engine for a data processing system
US6938183B2 (en) * 2001-09-21 2005-08-30 The Boeing Company Fault tolerant processing architecture
JP2003100086A (ja) * 2001-09-25 2003-04-04 Fujitsu Ltd 連想メモリ回路
US7116712B2 (en) * 2001-11-02 2006-10-03 Koninklijke Philips Electronics, N.V. Apparatus and method for parallel multimedia processing
US6968445B2 (en) * 2001-12-20 2005-11-22 Sandbridge Technologies, Inc. Multithreaded processor with efficient processing for convergence device applications
US6901476B2 (en) * 2002-05-06 2005-05-31 Hywire Ltd. Variable key type search engine and method therefor
US7000091B2 (en) * 2002-08-08 2006-02-14 Hewlett-Packard Development Company, L.P. System and method for independent branching in systems with plural processing elements
US20040081238A1 (en) * 2002-10-25 2004-04-29 Manindra Parhy Asymmetric block shape modes for motion estimation
US7120195B2 (en) * 2002-10-28 2006-10-10 Hewlett-Packard Development Company, L.P. System and method for estimating motion between images
JP4496209B2 (ja) * 2003-03-03 2010-07-07 モービリゲン コーポレーション メモリワードアレイ構成およびメモリアクセス予測結合
US7581080B2 (en) * 2003-04-23 2009-08-25 Micron Technology, Inc. Method for manipulating data in a group of processing elements according to locally maintained counts
US9292904B2 (en) * 2004-01-16 2016-03-22 Nvidia Corporation Video image processing with parallel processing
JP4511842B2 (ja) * 2004-01-26 2010-07-28 パナソニック株式会社 動きベクトル検出装置及び動画撮影装置
GB2411745B (en) * 2004-03-02 2006-08-02 Imagination Tech Ltd Method and apparatus for management of control flow in a simd device
US20060002474A1 (en) * 2004-06-26 2006-01-05 Oscar Chi-Lim Au Efficient multi-block motion estimation for video compression
DE602005020218D1 (de) * 2004-07-29 2010-05-12 St Microelectronics Pvt Ltd Videodekoder mit Parallelprozessoren für die Dekodierung von Makroblocks
JP2006140601A (ja) * 2004-11-10 2006-06-01 Canon Inc 画像処理装置及び画像処理装置の制御方法
US7644255B2 (en) * 2005-01-13 2010-01-05 Sony Computer Entertainment Inc. Method and apparatus for enable/disable control of SIMD processor slices
US7725691B2 (en) * 2005-01-28 2010-05-25 Analog Devices, Inc. Method and apparatus for accelerating processing of a non-sequential instruction stream on a processor with multiple compute units
AU2006223416A1 (en) * 2005-03-10 2006-09-21 Qualcomm Incorporated Content adaptive multimedia processing
US8149926B2 (en) * 2005-04-11 2012-04-03 Intel Corporation Generating edge masks for a deblocking filter
US8619860B2 (en) * 2005-05-03 2013-12-31 Qualcomm Incorporated System and method for scalable encoding and decoding of multimedia data using multiple layers
US20070071404A1 (en) * 2005-09-29 2007-03-29 Honeywell International Inc. Controlled video event presentation
US7451293B2 (en) * 2005-10-21 2008-11-11 Brightscale Inc. Array of Boolean logic controlled processing elements with concurrent I/O processing and instruction sequencing
KR20080085189A (ko) * 2006-01-10 2008-09-23 브라이트스케일, 인크. 병렬 프로세싱 시스템에서 멀티미디어 데이터의 알고리즘단계들을 프로세싱하기 위한 방법 및 장치
US20080059762A1 (en) * 2006-09-01 2008-03-06 Bogdan Mitu Multi-sequence control for a data parallel system
WO2008027567A2 (fr) * 2006-09-01 2008-03-06 Brightscale, Inc. Machine parallèle d'une seule pièce
US20080059763A1 (en) * 2006-09-01 2008-03-06 Lazar Bivolarski System and method for fine-grain instruction parallelism for increased efficiency of processing compressed multimedia data
US20080059467A1 (en) * 2006-09-05 2008-03-06 Lazar Bivolarski Near full motion search algorithm
US20080126278A1 (en) * 2006-11-29 2008-05-29 Alexander Bronstein Parallel processing motion estimation for H.264 video codec

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See references of WO2007082042A3 *

Also Published As

Publication number Publication date
EP1971956A2 (fr) 2008-09-24
JP2009523293A (ja) 2009-06-18
US20070189618A1 (en) 2007-08-16
WO2007082044A2 (fr) 2007-07-19
WO2007082043A2 (fr) 2007-07-19
TW200737983A (en) 2007-10-01
WO2007082042A2 (fr) 2007-07-19
KR20080094005A (ko) 2008-10-22
KR20080094006A (ko) 2008-10-22
US20070188505A1 (en) 2007-08-16
CN101371262A (zh) 2009-02-18
US20100066748A1 (en) 2010-03-18
JP2009523292A (ja) 2009-06-18
TW200803464A (en) 2008-01-01
WO2007082043A3 (fr) 2008-04-17
WO2007082044A3 (fr) 2008-04-17
CN101371263A (zh) 2009-02-18
US20070162722A1 (en) 2007-07-12
TW200806039A (en) 2008-01-16
WO2007082042A3 (fr) 2008-04-17
EP1971958A2 (fr) 2008-09-24
CN101371264A (zh) 2009-02-18
KR20080085189A (ko) 2008-09-23
JP2009523291A (ja) 2009-06-18

Similar Documents

Publication Publication Date Title
EP1971959A2 (fr) Procédé et appareil de traitement de sous-blocs de données multimédia dans des systèmes de traitement en parallèle
JP2010527194A (ja) 動的運動ベクトル分析方法
CN1175750A (zh) 用于块结构图像处理系统的图形存储器存储系统和方法
US7409528B2 (en) Digital signal processing architecture with a wide memory bandwidth and a memory mapping method thereof
US20060161720A1 (en) Image data transmission method and system with DMAC
US7750916B2 (en) Memory addressing techniques
KR20080044102A (ko) 단일 명령 다중 자료 프로세서 및 그것을 위한 메모리어레이 구조
JP2007535267A (ja) 画像処理装置及び方法
JP5533330B2 (ja) データ処理装置
US6938105B2 (en) Data apparatus and method having DMA circuitry to efficiently transfer multivalued bit-plane data
US20100074336A1 (en) Fractional motion estimation engine
US20170270632A1 (en) Caching Method Of Graphic Processing Unit
US20120254573A1 (en) System, data structure, and method for collapsing multi-dimensional data
CN116109468B (zh) 图形处理单元及指令编译方法、存储介质、终端设备
KR100327712B1 (ko) 거대병렬컴퓨터장치
EP4198724A1 (fr) Dispositif de traitement et procédé de distribution de données à une pluralité d&#39;unités de traitement
CN116737409A (zh) 超大数据流的实时处理方法以及数据处理系统
JP4244619B2 (ja) 画像データ処理装置
CN103995739B (zh) 共享内存映射页的处理器系统产生进程号的方法及装置
JP2005110124A (ja) 画像転送方法、画像転送装置
JP2004165766A (ja) 画像処理方法
JPH0415743A (ja) 並列演算装置

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

17P Request for examination filed

Effective date: 20080630

AK Designated contracting states

Kind code of ref document: A2

Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LI LT LU LV MC NL PL PT RO SE SI SK TR

AX Request for extension of the european patent

Extension state: AL BA HR MK RS

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION HAS BEEN WITHDRAWN

18W Application withdrawn

Effective date: 20090422