CN101640795B - Video decoding optimization method and device - Google Patents

Video decoding optimization method and device Download PDF

Info

Publication number
CN101640795B
CN101640795B CN 200910039252 CN200910039252A CN101640795B CN 101640795 B CN101640795 B CN 101640795B CN 200910039252 CN200910039252 CN 200910039252 CN 200910039252 A CN200910039252 A CN 200910039252A CN 101640795 B CN101640795 B CN 101640795B
Authority
CN
China
Prior art keywords
data
algorithm
module
state machine
computing
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN 200910039252
Other languages
Chinese (zh)
Other versions
CN101640795A (en
Inventor
陈勇
陈弟虎
蔡阳波
石南
夏平华
邓素华
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NANJING LONGYUAN MICROELECTRONIC CO Ltd
Original Assignee
NANJING LONGYUAN MICROELECTRONIC TECHNOLOGY Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by NANJING LONGYUAN MICROELECTRONIC TECHNOLOGY Co Ltd filed Critical NANJING LONGYUAN MICROELECTRONIC TECHNOLOGY Co Ltd
Priority to CN 200910039252 priority Critical patent/CN101640795B/en
Publication of CN101640795A publication Critical patent/CN101640795A/en
Application granted granted Critical
Publication of CN101640795B publication Critical patent/CN101640795B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Abstract

The invention provides a video decoding optimization method, including the following steps: reading instruction for processing data, analyzing the algorithm formula adopted in data processing; selecting corresponding algorithm to realize data processing according to analysis result; and executing the corresponding algorithm by adopting pipeline control manner; wherein the corresponding algorithm is dimensional IDCT algorithm or YCbCr2RGB algorithm. The invention also provides a video decoding optimization device corresponding to the method. By the invention, speed of hardware in video data processing can be greatly improved under low-frequency and low-power consumption condition.

Description

A kind of video decoding optimization method and device
Technical field
The present invention relates to the multimedia field, relating in particular to 2-D discrete cosine inverse transformation-IDCT (InverseDiscrete Transform) and YCbCr changes integrated video decoding optimization method and the device of RGB (hereinafter to be referred as YCbCr2RGB).
Background technology
Generally speaking, in video decode, available pure software is realized the decoding of video, such as MPEG2, H264 video decode; Pure software realizes that video decode requires the operating frequency of decoding chip than higher.Operating frequency as fruit chip is not high enough, and that is decoded with software fully almost is impossible.Generally speaking, can be with time-consuming and need the processing procedure (such as 2-D discrete cosine inverse transformation IDCT etc.) of complex calculation to realize with hardware, thus increase substantially decoding efficiency, realize the real-time video decoding.But in the prior art, generally all be to come executed in parallel above-mentioned time-consuming and need the processing procedure of complex calculation with a plurality of multiplication or addition, this just takes resource very much.In addition, the data that general video decode comes out are the YCbCr form, also need further be converted to rgb format by all the other hardware circuits, and this has just further increased the hardware complexity of video decoding chip.
Summary of the invention
In order to solve existing technical problem in the above-mentioned prior art, primary and foremost purpose of the present invention is to provide a kind of video decoding optimization method, format conversion to the most time taking two-dimentional IDCT and YCbCr adopts streamline control respectively, increased substantially the efficient of video decode, under lower decoding chip operating frequency, realized the real-time video decoding.
Another object of the present invention is to provide a kind of video decoding optimization device, realize above-mentioned two-dimentional IDCT and the integrated video decoding optimization method of YCbCr2RGB.
Primary and foremost purpose of the present invention is achieved through the following technical solutions: a kind of video decoding optimization method, it is characterized in that, and may further comprise the steps: step 101, reading instruction for processing data is analyzed the algorithmic formula that deal with data adopted; Step 102 according to the described analysis result of step 101, selects respective algorithms to realize the processing of data, and described respective algorithms is two-dimentional IDCT algorithm or YCbCr2RGB algorithm; Step 103 adopts pipeline control mode, carries out described respective algorithms.
Preferably, step 103 comprises following step: step 113, to command unit input clear command; Step 123 is to initial address, address change amplitude, the operational order of command unit input access number; Step 133 according to the operational order that step 123 is imported, is selected first state machine or second state machine, enters two-dimentional IDCT operation stages or YCbCr2RGB operation stages respectively; Adopt pipeline control mode, carry out reading of data in IDCT computing or the YCbCr2RGB computing, operation result is handled, stored to the data that read.
Preferably, step 133 adopts pipeline control mode, and the step of carrying out the IDCT computing is: at first by first state machine data select signal and latch signal in algoritic module and the storage address module are operated; In each state of first state machine,, give different data respectively to data select signal in the algoritic module and latch signal; Again from the different addresses peek of memory, carry out the row, column repeatedly computing that circulates successively by an adder and multiplier, the intermediate operations result cache is in memory; Row or column circulates repeatedly in the calculating process, during the flag bit sign=1 of first state machine, the computing of one dimension IDCT is finished, and operation result is stored in the memory.
Preferably, step 133 adopts pipeline control mode, and the step of carrying out the YCbCr2RGB computing is: at first by second state machine data select signal in the algoritic module and latch signal are operated; In each state of second state machine,, give different data respectively to data select signal in the algoritic module and latch signal; Read in two groups of YCbCr values of the computing of wanting again by input/output register from the outside, carry out the format conversion of YcbCr to RGB by two adders and a multiplier; When the flag bit sign=1 of second state machine, the YCbCr2RGB computing finishes, and operation result is stored in the input/output register.
Another object of the present invention is achieved through the following technical solutions: a kind of video decoding optimization device is characterized in that comprising: the command unit that is used to read the deal with data instruction; Be used for the algorithmic formula that analyzing and processing data adopts, the kernel control module of selecting IDCT algorithm or YCbCr2RGB algorithm that data are handled, kernel control module comprises first state machine and second state machine of the streamline control that is used for realizing respectively two-dimentional IDCT algorithm and YCbCr2RGB algorithm; Be used for input/output register and memory for described first state machine and the second state machine computing peek and kernel control module storage operation result; Be used to carry out the rudimentary algorithm module of IDCT algorithm or YCbCr2RGB algorithm; Be used for the result that operation result with the rudimentary algorithm module exports input/output register or memory to and select output module; And the access numerical control molding piece that is used for the access of control storage data; Described result selects output module to be connected with rudimentary algorithm module, input/output register, memory respectively, command unit, input/output register, rudimentary algorithm module are connected with kernel control module respectively, and access numerical control molding piece is connected between memory and the kernel control module.
The present invention has the following advantages with respect to prior art: two-dimentional IDCT the most time-consuming in the video decode and YCbCr2RGB format conversion are integrated in the system, move two-dimentional IDCT algorithm or YCbCr2RGB algorithm according to different instructions again.In the process of two-dimentional IDCT algorithm of operation or YCbCr2RGB algorithm,, thereby on hardware circuit, only need a multiplier and two additions to the implementation employing pipeline control mode of these two algorithms; Pipeline control mode in the algorithm implementation has not only increased substantially arithmetic speed, has also reduced the complexity of hardware circuit, under lower frequency, realizes the real-time video decoding transmission.
Description of drawings
Fig. 1 is a workflow diagram of the present invention;
Fig. 2 is the schematic flow sheet of pipeline operation of the present invention;
Fig. 3 is concrete streamline schematic diagram;
Fig. 4 is the structural representation of apparatus of the present invention.
Embodiment
Below in conjunction with embodiment and accompanying drawing, the present invention is described in further detail, but embodiments of the present invention are not limited thereto.
As shown in Figure 1, video decoding optimization method of the present invention, its implementation process may further comprise the steps:
Step 101, reading instruction for processing data is analyzed the algorithmic formula that deal with data adopted.In the present embodiment, the algorithmic formula that deal with data adopted is two-dimentional IDCT or YCbCr2RGB algorithmic formula.
Step 102 according to the described analysis result of step 101, selects respective algorithms to realize the processing of data.If analysis result is two-dimentional IDCT algorithmic formula, then select two-dimentional IDCT algorithm to realize the inverse discrete cosine transform of vision signal; If analysis result is the YCbCr2RGB algorithmic formula, then select the YCbCr2RGB algorithm to realize the format conversion of vision signal, become rgb format from the YCbCr format conversion.Described two-dimentional IDCT algorithm and YCbCr2RGB algorithm all meet the design frequency height, hardware occupies few these two conditions of resource.
Step 103 adopts pipeline control mode, carries out described respective algorithms.Described pipeline control mode is specially, to the following key step that algorithm is carried out, adopt pipeline system to come computing: to read data in memory RAM or the input/output register, operation result is handled, stored to the data that read to memory RAM or input/output register.In the present embodiment, operation result mainly refers to the processed video data.
This step 103 adopts pipeline control mode, as shown in Figure 2, carries out respective algorithms to realize the implementation procedure of data processing, specifically comprises following step:
Step 113, the command unit input clear command in input/output register.Mainly be to remove the operational order of carrying out last computing.
Step 123, initial address, address change amplitude, the operational order of the command unit input access number in input/output register.With the operational order of non-zero signal to start with.
Step 133 according to the operational order that step 123 is imported, is selected first state machine or second state machine, enters two-dimentional IDCT operation stages or YCbCr2RGB operation stages respectively; Adopt pipeline control mode, carry out several key steps of IDCT computing or YCbCr2RGB computing: reading of data, operation result is handled, stored to the data that read.
(1) be first state machine as if what select, what then enter is two-dimentional IDCT operation stages, then at first comes data select signal and latch signal in algoritic module and the memory RAM address module are operated by first state machine; In each state of first state machine,, give different data respectively to data select signal in the algoritic module and latch signal; Again from the different addresses peek of memory RAM, carry out the row, column repeatedly computing that circulates successively by an adder and multiplier, the intermediate operations result cache is in memory RAM; Circulate repeatedly in the calculating process, when the flag bit sign=1 of first state machine, the computing of one dimension IDCT just finished, with the processed video storage in memory RAM, thereby realized the streamline control of complicated algorithm function.Because algoritic module recycles, thereby only needs an adder and a multiplier, has reduced the shared chip area of module.
Below be the IDCT algorithm that adopts coding/decoding method of the present invention to finish, and the concrete implementation procedure of this algorithm:
The IDCT algorithmic formula is as follows:
f ( x , y ) = 2 N Σ x = 0 N - 1 Σ y = 0 N - 1 F ( u , v ) cos ( 2 x + 1 ) uπ 2 N cos ( 2 y + 1 ) uπ 2 N
C ( u ) , C ( v ) = 1 / 2 u , v = 0 1 u , v = 1,2 , . . . , N - 1
This algorithm is realized inverse discrete cosine transform, has used the row, column method of computing successively, finishes the once IDCT computing of two dimension.The transformation matrix of acquiescence input is with row order input from top to bottom, need move twice the IDCT algorithm in DSP_VFU.Special feature of the present invention is to support the matrix operation of 4~255 types, is carried out the computing of dissimilar matrixes by input instruction.
When operation IDCT algorithm,, deposit in the memory RAM at first with the cosine value of needs etc.Two-dimentional IDCT computing with 8 * 8 matrixes is an example, and one group of cosine value of needs is deposited among the RAM that determines the address.The MPEG2 cosine value of acquiescence is the value after having enlarged 65536 times, so that higher precision is arranged; And what follow in H264 is " IDCT " computing of integer, and the value of acquiescence is for enlarging twice.In row order input store RAM 3 from top to bottom, the integer cosine value A1 of MPEG2 is as follows:
23170 32138 30274 27246 23170 18205 12540 6393 23170 27246 12540 - 6393 - 23170 - 32138 - 30274 - 18205 23170 18205 - 12540 - 32138 - 23170 6393 30274 27246 23170 6393 - 30274 - 18205 23170 27246 - 12540 - 32138 23170 - 6393 - 30274 18205 23170 - 27246 - 12540 32138 23170 - 18205 - 12540 32138 - 23170 - 6393 30274 - 27246 23170 - 27246 12540 6393 - 23170 32138 - 30274 18205 23170 - 32138 30274 - 27246 23170 - 18205 12540 - 6393
A2 is as follows for the pairing hexadecimal cosine value of above-mentioned integer cosine value A1:
5 A 82 7 D 8 A 7642 6 A 6 E 5 A 82 471 D 30 FC 18 F 9 5 A 82 6 A 6 E 30 FC E 707 A 800 8276 89 BE B 8 E 3 5 A 82 471 D CF 04 8276 A 800 18 F 9 7642 6 A 6 E 5 A 82 18 F 9 89 BE B 8 E 3 5 A 82 6 A 6 E CF 04 8276 5 A 82 E 707 89 BE 471 D 5 A 82 9592 CF 04 7 D 8 A 5 A 82 B 8 E 3 CF 04 7 D 8 A A 800 E 707 7642 9592 5 A 82 9592 30 FC 18 F 9 A 800 7 D 8 A 89 BE 471 D 5 A 82 8276 7642 9592 5 A 82 B 8 E 3 30 FC E 707
The step of computing is: after treating the input IDCT operational order first time, kernel control module is peek automatically from memory RAM, after finishing the IDCT computing of specified type matrix by control first adder and multiplier then, the result selects successively that computing is the intact data of output module to deposit among the RAM with matrix form, after finishing IDCT computing for the first time, with the flag bit sign set of first state machine.The flag bit sign of initialization first state machine then, operation is the IDCT computing for the second time, kernel control module is peek automatically from memory RAM, computing, at last operation result is deposited in the address ram of appointment, the flag bit sig n set of first state machine, so far once complete two-dimentional IDCT computing is finished.
(2) be second state machine as if what select, what then enter is the YCbCr2RGB operation stages, then at first comes data select signal in the algoritic module and latch signal are operated by second state machine; In each state of second state machine,, give different data respectively to data select signal in the algoritic module and latch signal; Read in two groups of YCbCr values of the computing of wanting again by input/output register from the outside, carry out the format conversion of YcbCr to RGB by two adders and a multiplier; When the flag bit sign=1 of second state machine, the YCbCr2RGB computing finishes, and operation result is stored in the input/output register.
Below be that the YCbCr that adopts coding/decoding method of the present invention to finish changes the RGB algorithm, and the concrete implementation procedure of this algorithm:
The form of format conversion is respectively:
R=1.164(Y-16)+1.596(Cr-128);
G=1.164(Y-16)-0.813(Cr-128)-0.392(Cb-128);
B=1.164(Y-16)+2.017(Cb-128);
This algorithm, in once command, the YCbCr that has promptly finished two groups changes the RGB conversion, and the rgb format that obtains is RGB565, represents color with 16, and wherein red R is 5, and green G is 6, blue B is 5.
Calculating process is: two groups of YCbCr values of computing are put into input/output register, import operational order then, kernel control module 2 is peeked from input/output register, again by control first adder 31, second adder 32 and multiplier 33 computings, two groups of RGB565 values are put into input/output register 1 after finishing, and with the flag bit set of second state machine.Do not use memory RAM in this algorithm.
The concrete streamline of step 133 as shown in Figure 3.For idct transform,, from memory RAM, get the A number at first clock of operational order; Get the B number at second clock then, simultaneously the A number is carried out primary computing; Then, from memory RAM, get the C number, simultaneously the B number is carried out primary computing, and simultaneously the A number operation result first time is carried out the computing second time at the 3rd clock; The 4th clock, peek D carries out the computing first time to the C data simultaneously, and the B data are carried out the computing second time, and the operation result of A number is stored in the memory RAM, by that analogy.For the YCbCr2RGB format conversion, its streamline and idct transform similar; Different just peeks read to input/output register, and operation result also is to be stored in the input/output register.Like this, just realized the acceleration of hardware by the algorithm of streamline control, video two dimension IDCT processing and transmission and YCbCr2RGB format conversion just do not need to write the complicated algorithm software program and realize, only need input control order to tell this processing of kernel control module which data, kernel control module and peripheral hardware circuit thereof will read automatically in the mode of high performance pipeline, processing, write-back operation result.
Apparatus of the present invention mainly adopt DSP_VFU (Digital Singnal Processor-Video FunctionalUnit) to examine existing video decoding optimization method.When performing step 101, command unit by input/output register 1 reads the instruction that needs to handle which kind of data from the outside, and the instruction of being read is transferred to kernel control module 2, analyze by 2 pairs of algorithmic formulas that deal with data adopted of kernel control module.During performing step 102, select corresponding algorithm, data are handled by kernel control module 2.
When performing step 103, described command unit is arranged in the input/output register 1, selects first state machine 22 or second state machine 23 by the big state machine in the kernel control module 2 21.In addition, kernel control module 2 is also coordinated the relation between two-dimentional IDCT and the YCbCr2RGB algorithm, control the state variation of first state machine 21, second state machine 22 according to calculating process, to first adder 31, second adder 32 and multiplier 33 output control signals, realize the streamline control of two-dimentional IDCT and YCbCr2RGB respectively.
(1) a bit concrete, the structure of realization IDCT computing as shown in Figure 4, comprises that mainly kernel control module 2, rudimentary algorithm module, memory RAM 3, access numerical control molding piece, result select output module 4, input/output register 1.The rudimentary algorithm module is selected output module 4 to be connected, and mainly is made up of first adder 31 and multiplier 33 with kernel control module 2, access numerical control molding piece, result respectively.Access numerical control molding piece mainly is made up of RAM Data Control module 51 and address ram control module 52.
First adder 31 and multiplier 33 all have data select signal and data latch signal.51 major controls of RAM Data Control module are from the RAM reading of data; The storage address of address ram control module 52 major control RAM data, this address control module 52 can be implemented in access number in the different addresses of height 16 bit wides of 32 bit wide RAM.
The result selects output module 4, and main instruction according to the command unit output in input/output register 1 is selected operation result is exported to input/output register 1, or external memory RAM 3.Input/output register 1, RAM initial address, address change amplitude and the instruction etc. of main temporary outside input.
What kernel control module was mainly realized is to peek from memory RAM, according to IDCT algorithm and operational order, data and control signal in rudimentary algorithm module output RAM, behind the rudimentary algorithm module arithmetic, operation result is write among the RAM, repeatedly computing circulates, at most can be in an instruction, carry out 65535 computings (255x255 matrix) continuously, can realize being up to the data manipulation of 16 bit wides, be applied as sequence circuit.
(2) a bit concrete, the structure of realization YCbCr2RGB format conversion as shown in Figure 4, comprises that mainly kernel control module 2, rudimentary algorithm module, result select output module 4, input/output register 1.The rudimentary algorithm module mainly is made up of the first adder 31 that has data select signal and data latch signal, second adder 32 and multiplier 33.Kernel control module 2 reads the YcbCr formatted data from input/output register 1, according to YcbCr algorithm and operational order, to rudimentary algorithm module output YcbCr formatted data and control signal, behind the rudimentary algorithm module arithmetic, operation result is write input/output register 1.

Claims (7)

1. a video decoding optimization method is characterized in that, may further comprise the steps:
Step 101, reading instruction for processing data is analyzed the algorithmic formula that deal with data adopted;
Step 102 according to the described analysis result of step 101, selects respective algorithms to realize the processing of data, and described respective algorithms is two-dimentional IDCT algorithm or YCbCr2RGB algorithm;
Step 103 adopts pipeline control mode, carries out described respective algorithms;
Described step 103 comprises following step:
Step 113 is to command unit input clear command;
Step 123 is to initial address, address change amplitude, the operational order of command unit input access number;
Step 133 according to the operational order that step 123 is imported, is selected first state machine or second state machine, enters two-dimentional IDCT operation stages or YCbCr2RGB operation stages respectively; Adopt pipeline control mode, carry out reading of data in IDCT computing or the YCbCr2RGB computing, operation result is handled, stored to the data that read.
2. video decoding optimization method according to claim 1, it is characterized in that, step 133 adopts pipeline control mode, and the step of carrying out the IDCT computing is: at first by first state machine data select signal and latch signal in algoritic module and the storage address module are operated; In each state of first state machine,, give different data respectively to data select signal in the algoritic module and latch signal; Again from the different addresses peek of memory, carry out the row, column repeatedly computing that circulates successively by an adder and multiplier, the intermediate operations result cache is in memory; Row or column circulates repeatedly in the calculating process, during the flag bit sign=1 of first state machine, the computing of one dimension IDCT is finished, and operation result is stored in the memory.
3. video decoding optimization method according to claim 1, it is characterized in that, step 133 adopts pipeline control mode, and the step of carrying out the YCbCr2RGB computing is: at first by second state machine data select signal in the algoritic module and latch signal are operated; In each state of second state machine,, give different data respectively to data select signal in the algoritic module and latch signal; Read in two groups of YCbCr values of the computing of wanting again by input/output register from the outside, carry out the format conversion of YcbCr to RGB by two adders and a multiplier; When the flag bit sign=1 of second state machine, the YCbCr2RGB computing finishes, and operation result is stored in the input/output register.
4. video decoding optimization device according to the described method of claim 1 is characterized in that comprising:
Be used to read the command unit of deal with data instruction;
Be used for the algorithmic formula that analyzing and processing data adopts, the kernel control module of selecting IDCT algorithm or YCbCr2RGB algorithm that data are handled, kernel control module comprises first state machine and second state machine of the streamline control that is used for realizing respectively two-dimentional IDCT algorithm and YCbCr2RGB algorithm;
Be used for input/output register and memory for described first state machine and the second state machine computing peek and kernel control module storage operation result;
Be used to carry out the rudimentary algorithm module of IDCT algorithm or YCbCr2RGB algorithm;
Be used for the result that operation result with the rudimentary algorithm module exports input/output register or memory to and select output module;
And the access numerical control molding piece that is used for the access of control storage data;
Described result selects output module to be connected with rudimentary algorithm module, input/output register, memory respectively, command unit, input/output register, rudimentary algorithm module are connected with kernel control module respectively, and access numerical control molding piece is connected between memory and the kernel control module.
5. video decoding optimization device according to claim 4 is characterized in that: described rudimentary algorithm module comprises first adder, second adder and the multiplier that has data select signal and data latch signal.
6. video decoding optimization device according to claim 4 is characterized in that: described command unit is arranged in the input/output register.
7. video decoding optimization device according to claim 4 is characterized in that: described access numerical control molding piece comprises Data Control module and address control module.
CN 200910039252 2009-05-06 2009-05-06 Video decoding optimization method and device Expired - Fee Related CN101640795B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN 200910039252 CN101640795B (en) 2009-05-06 2009-05-06 Video decoding optimization method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN 200910039252 CN101640795B (en) 2009-05-06 2009-05-06 Video decoding optimization method and device

Publications (2)

Publication Number Publication Date
CN101640795A CN101640795A (en) 2010-02-03
CN101640795B true CN101640795B (en) 2011-05-18

Family

ID=41615546

Family Applications (1)

Application Number Title Priority Date Filing Date
CN 200910039252 Expired - Fee Related CN101640795B (en) 2009-05-06 2009-05-06 Video decoding optimization method and device

Country Status (1)

Country Link
CN (1) CN101640795B (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8680888B2 (en) 2011-12-15 2014-03-25 Micron Technologies, Inc. Methods and systems for routing in a state machine
US9979983B2 (en) 2015-03-16 2018-05-22 Microsoft Technology Licensing, Llc Application- or context-guided video decoding performance enhancements
US10129566B2 (en) 2015-03-16 2018-11-13 Microsoft Technology Licensing, Llc Standard-guided video decoding performance enhancements
US10582259B2 (en) 2015-06-30 2020-03-03 Gopro, Inc. Pipelined video interface for remote controlled aerial vehicle with camera

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1874517A (en) * 2006-06-19 2006-12-06 西安交通大学 Control method for decoding MPEG2 video based on multiple pipeline control and optimizes buffer
CN1912925A (en) * 2006-08-29 2007-02-14 西安交通大学 Design and implementing method of multimedia expansion instructionof flow input read

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1874517A (en) * 2006-06-19 2006-12-06 西安交通大学 Control method for decoding MPEG2 video based on multiple pipeline control and optimizes buffer
CN1912925A (en) * 2006-08-29 2007-02-14 西安交通大学 Design and implementing method of multimedia expansion instructionof flow input read

Also Published As

Publication number Publication date
CN101640795A (en) 2010-02-03

Similar Documents

Publication Publication Date Title
CN109657782B (en) Operation method, device and related product
Possa et al. A multi-resolution FPGA-based architecture for real-time edge and corner detection
CN101179671B (en) Image scaling device and image scaling method
JP5522893B2 (en) Image processing apparatus, image processing method, and program
US11915058B2 (en) Video processing method and device, electronic equipment and storage medium
CN109255438B (en) Method and apparatus for adjusting tensor data
CN101640795B (en) Video decoding optimization method and device
CN109359556B (en) Face detection method and system based on low-power-consumption embedded platform
CN102819818A (en) Method for realizing image processing based on dynamic reconfigurable technology of field programmable gate array (FPGA) chip
CN111523652A (en) Processor, data processing method thereof and camera device
Park et al. Programmable multimedia platform based on reconfigurable processor for 8K UHD TV
US7876330B2 (en) Color conversion apparatus, filter processing apparatus and color conversion method
WO2019206161A1 (en) Pooling operation device
JP6970827B2 (en) Arithmetic processing unit
CN112765540B (en) Data processing method and device and related products
CN109711538B (en) Operation method, device and related product
CN110414672B (en) Convolution operation method, device and system
US20200159495A1 (en) Processing apparatus and method of processing add operation therein
US11194490B1 (en) Data formatter for convolution
CN101562691B (en) Image processing device and method
KR20220083820A (en) 3D Convolution in Neural Network Processors
EP4174644A1 (en) Computing apparatus, integrated circuit chip, board card, electronic device, and computing method
US11423600B2 (en) Methods and apparatus for configuring a texture filter pipeline for deep learning operation
CN111597886B (en) Hardware accelerator, system and acceleration method for fingerprint image processing
US20220222509A1 (en) Processing non-power-of-two work unit in neural processor circuit

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
ASS Succession or assignment of patent right

Owner name: NANJING LONGYUAN MICROELECTRONICS TECHNOLOGY CO.,

Free format text: FORMER OWNER: GUANGZHOU HISAND IC DESIGN CO., LTD.

Effective date: 20110314

C41 Transfer of patent application or patent right or utility model
C53 Correction of patent of invention or patent application
CB03 Change of inventor or designer information

Inventor after: Chen Yong

Inventor after: Chen Dihu

Inventor after: Cai Yangbo

Inventor after: Shi Nan

Inventor after: Xia Pinghua

Inventor after: Deng Suhua

Inventor before: Wu Kangwen

Inventor before: Chen Dihu

Inventor before: Liu Xurong

COR Change of bibliographic data

Free format text: CORRECT: INVENTOR; FROM: WU KANGWEN CHEN DIHU LIU XURONG TO: CHEN YONG CHEN DIHU CAI YANGBO SHI NAN XIA PINGHUA DENG SUHUA

Free format text: CORRECT: ADDRESS; FROM: 510663 UNIT 701, AREA C1, NO. 182, KEXUE AVENUE, SCIENCE CITY, DEVELOPMENT AREA, GUANGZHOU CITY, GUANGDONG PROVINCE TO: 211100 NO. 68, SHENGTAI ROAD, JIANGNING ECONOMIC AND TECHNOLOGICAL DEVELOPMENT ZONE, NANJING CITY, JIANGSU PROVINCE

TA01 Transfer of patent application right

Effective date of registration: 20110314

Address after: 211100 Jiangning Province Economic and Technological Development Zone, Jiangsu City, No. Sheng Tai Road, No. 68

Applicant after: NANJING LONGYUAN MICROELECTRONIC Co.,Ltd.

Address before: 510663 C1 District 182, science Avenue, Science Town, Guangzhou Development Zone, Guangdong 701, China

Applicant before: Guangzhou HiSand IC Design Co.,Ltd.

C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20110518

CF01 Termination of patent right due to non-payment of annual fee