CN101783021B - Method for speeding up DR image processing by using operation of GPU - Google Patents

Method for speeding up DR image processing by using operation of GPU Download PDF

Info

Publication number
CN101783021B
CN101783021B CN2010101073177A CN201010107317A CN101783021B CN 101783021 B CN101783021 B CN 101783021B CN 2010101073177 A CN2010101073177 A CN 2010101073177A CN 201010107317 A CN201010107317 A CN 201010107317A CN 101783021 B CN101783021 B CN 101783021B
Authority
CN
China
Prior art keywords
fft
gpu
size
computing
cuda
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN2010101073177A
Other languages
Chinese (zh)
Other versions
CN101783021A (en
Inventor
杜静
杜碧
万洪晓
陈永洒
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
SHENZHEN ANGELL TECHNOLOGY CO., LTD.
Original Assignee
SHENZHEN ANGELL TECHNOLOGY Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by SHENZHEN ANGELL TECHNOLOGY Co Ltd filed Critical SHENZHEN ANGELL TECHNOLOGY Co Ltd
Priority to CN2010101073177A priority Critical patent/CN101783021B/en
Publication of CN101783021A publication Critical patent/CN101783021A/en
Application granted granted Critical
Publication of CN101783021B publication Critical patent/CN101783021B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Medical Treatment And Welfare Office Work (AREA)
  • Apparatus For Radiation Diagnosis (AREA)
  • Measuring And Recording Apparatus For Diagnosis (AREA)

Abstract

The invention discloses a method for speeding up DR image processing by using the operation of a GPU, which comprises the following steps: reading original medical image data and expanding and filling an original medical image with image edge pixels; adopting a fast Fourier transform FFT program written by a CUDA to directly call the computation resources of the GPU to perform FFT operation on the expanded and filled original medical image data and a convolution kernel respectively to correspondingly obtain a first FFT result and a second FFT result; adopting a convolution program written by the CUDA to directly call the computation resources of the GPU to perform frequency-domain multiplication on the first and second FFT results to obtain a product; and adopting a reverse FFT program written by the CUDA to directly call the computation resources of the GPU to perform a reverse FFT operation on the product. The method has the advantages of simply and conveniently fulfilling the aim of utilizing the GPU to perform medical image processing on popular PCs of ordinary consumption levels with low cost.

Description

Realize that the GPU computing improves the method for DR image processing speed
Technical field
The present invention relates to the medical image processing technical field, relate in particular to the method that a kind of GPU of realization computing improves the DR image processing speed.
Background technology
In field of medical image processing, the technology of utilizing Console software to carry out DR (DigitalRadiography, Direct Digital X ray photography) Medical Image Processing at present mainly contains: Gain corrections, lumbar vertebrae isostatic correction, usm sharpening, slightly smoothly, moderate smoothing processing etc.
Above-mentioned DR Medical Image Processing generally adopts the CPU of microcomputer to handle, the Console software that moves among the CPU, its most of image processing techniquess have all been used the convolution operator processing, convolution operator is little 3*3,5*5,39*39,101*101 or the like, and these spatial domain convolution operators can pass through FFT (fast fourier transform) and realize at frequency filtering in addition.By optimized Algorithm flow process, optimizer code, the two-dimensional convolution operator is alternative with the one dimension convolution operator, the processing time can be reduced, but because the console aftertreatment is more, bulk treatment speed is still too slow.From the Pentium4 epoch, the lifting of cpu clock frequency meets with bottleneck, rests on about 2GHz to 3GHz always.Therefore, utilize GPU (Graphic Processing Unit, graphic processing circuit) to replace or the technology that cooperates CPU to carry out Medical Image Processing also engenders, as:
Based on the hardware-accelerated object plotting method of GPU, may further comprise the steps: (1) carries out being stored as texture after the pre-service to volume data in No. 200510110665.9 disclosed a kind of medical image of disclosed Chinese invention patent application on May 24th, 2006; (2) for calculating, GPU generates and provides parameter: the 1. generation of benchmark tangent plane; 2. in the ray cast process, the increment of texture and the step number that need advance of light altogether; 3. in GPU handles, redundant points corresponding redundant department is removed; (3) GPU calculates, and generate image: pixel rendering is the ray cast calculating section, comprising: determine light two ends and blend of colors.This patented claim technology is that light projecting algorithm, the traditional texture volume rendering algorithm realized in conjunction with traditional C PU are one, uses GPU to finish the ray cast process, uses the blend of colors method to generate image, sends to then in the color buffer district.Its basic ideas are with graphics card hardware graphic plotting to be carried out speed-up computation, obtain real-time render speed when keeping picture quality, satisfy the needs of digital medical system.
Do not put down in writing the technical matters that concrete measure realizes utilizing the GPU acceleration for overcoming above-mentioned No. 200510110665.9 patented claim, No. 200910131429.3 Chinese invention patent application disclosed on September 2nd, 2009, this application discloses a kind of GPU that utilizes and quickens CR/DR/CT image demonstration and image process method and specialized equipment, it comprises one in PC computer, CR/DR workstation or PACS server.Wherein PC computer chain is followed CR/DR workstation or PACS server, utilize the general-purpose computations ability of novel GPU, realize the quick demonstration of CR/DR/CT image, the demonstration of image and image processing function all utilize GPU to finish, proposition is with the disposal route of image segmentation amalgamation, GDI two dimension to PACS use is in the past quickened different, this method basic ideas are to utilize the general-purpose computations ability of GPU, CPU is freed from the heavy calculating of Flame Image Process, be absorbed in the processing of data communication and data security, to improve system effectiveness.
Must use graphics workstation to carry out GPU to quicken to cause the high technical matters of complex structure cost for overcoming above-mentioned No. 200910131429.3 patented claim, the disclosed a kind of three-dimensional medical image display method that quickens based on GPU of disclosed Chinese invention patent application on Dec 2nd, 2009 200910059864.X number, this method are that medical science DICOM image sequence file is saved in Installed System Memory in the mode of volume data; Utilize the 3 d graphic library programming expansion interface function API of OPENGL or DIRECTX then, volume data is loaded into the GPU video memory; Calculate again to generate and act on behalf of solid, and the polygonal slices of acting on behalf of in the solid is carried out illumination calculation and color calculation by pixel; Mix by ALPHA at last and will act on behalf of the synthetic 3 d medical images of polygonal slices all in the solid.These method basic ideas are to utilize the certain programmed interface, realize the real-time interactive demonstration and need not use graphics workstation on the popular PC of ordinary consumption levels, to reduce cost.
But, above-mentioned 200910059864.X patented claim technology is utilized the 3 d graphic library programming expansion interface function API of OPENGL or DIRECTX, and the method that the volume data of medical science DICOM image sequence file is loaded into the GPU video memory only is suitable for 3-D view and handles.Program that this method need customize or circuit, process is comparatively complicated, and requiring very that the research staff of familiar with computers software engineering, hardware technology designs and researches and develops, R﹠D costs are higher, efficient is lower, stability is difficult to guarantee and be inconvenient to safeguard when breaking down.
Summary of the invention
The technical matters that the present invention mainly solves provides the method that a kind of GPU of realization computing improves the DR image processing speed, can be simple and easy, realize directly calling the function that GPU carries out medical image processing at low cost on the popular PC of ordinary consumption levels.
For solving the problems of the technologies described above, the technical scheme that the present invention adopts is: provide a kind of GPU of realization computing to improve the method for DR image processing speed, comprise: read the primitive medicine view data, with image edge pixels described primitive medicine image is expanded and polishing, wherein establishing described original image size is imageW * imageH, convolution kernel is of a size of kernelW * kernelH, picture size is fftW * fftH behind the polishing, fftW=imageW+kernelW-1 wherein, fftH=imageH+kernelH-1; The Fast Fourier Transform (FFT) FFT program that adopts the unified equipment framework of calculating programming model CUDA to write is directly called the computational resource of GPU, primitive medicine view data behind described expansion and the polishing and convolution kernel are carried out the FFT computing respectively, correspondingly obtain a FFT result and the 2nd FFT result; Adopt and calculate the convolution program that unified equipment framework programming model CUDA writes, directly call the computational resource of GPU a described FFT result and the 2nd FFT result are asked multiplication at frequency domain, obtain multiplied result; Adopt and calculate the contrary FFT program that unified equipment framework programming model CUDA writes, the computational resource that directly calls GPU carries out contrary FFT computing to described multiplied result.
Wherein, described FFT computing comprises an imageW * imageH size two-dimensional FFT operation, a fftW * fftH size two-dimensional FFT operation, described frequency domain that multiplication comprises imageW * imageH size and the fftW * fftH size multiplication mutually of asking, described contrary FFT computing comprises a fftW * contrary FFT computing of fftH size two dimension.
Wherein, described primitive medicine view data behind described expansion and the polishing and convolution kernel are carried out respectively in the step of FFT computing, select for use the CUFFT storehouse of nVidia exploitation to carry out described FFT computing.
The invention has the beneficial effects as follows: be different from the program that prior art need customize or circuit carries out the GPU Flame Image Process and make design process comparatively complicated, have relatively high expectations, the more high situation of cost, the present invention adopts the unified equipment framework of calculating programming model CUDA to write the various programs of carrying out Medical Image Processing, the program of these CUDA exploitations can directly be called GPU when handling image computational resource carries out Flame Image Process, key is to write the concurrent operation program relevant with moving Medical Treatment on the personal computer of cheapness, do not need to build complicated platform, do not need special hardware device or software, make implement very simple and low-cost; Simultaneously, CUDA is a kind of multiple programming model and software environment, and by CUDA, the user can utilize GPU to carry out general-purpose computations, and it adopts the C language development of expansion, and except that can directly calling the computational resource of GPU, operation efficiency is higher; The present invention is directed to medical image and need carry out the characteristics that image edge pixels is handled, with image edge pixels described primitive medicine image is expanded and polishing, implement with aforementioned employing CUDA technology, common effect is to make not need the computing machine of higher level to research and develop operating personnel, do not need special platform and equipment, just can carry out described Medical Image Processing, therefore, the present invention can be simple and easy, on the popular PC of ordinary consumption levels, realize directly calling the function that GPU carries out medical image processing at low cost.
Description of drawings
Fig. 1 is that the present invention realizes that the GPU computing improves the process flow diagram of the method embodiment of DR image processing speed.
Embodiment
By describing technology contents of the present invention, structural attitude in detail, realized purpose and effect, give explanation below in conjunction with embodiment and conjunction with figs. are detailed.
See also Fig. 1, the present invention realizes that the method embodiment of GPU computing raising DR image processing speed mainly comprises step:
Step 101: read the primitive medicine view data, with image edge pixels described primitive medicine image is expanded and polishing, wherein establishing described original image size is imageW * imageH, convolution kernel is of a size of kernelW * kernelH, picture size is fftW * fftH behind the polishing, fftW=imageW+kernelW-1 wherein, fftH=imageH+kernelH-1;
Step 102: the Fast Fourier Transform (FFT) FFT program that adopts the unified equipment framework of calculating programming model CUDA to write is directly called the computational resource of GPU, primitive medicine view data behind described expansion and the polishing and convolution kernel are carried out the FFT computing respectively, correspondingly obtain a FFT result and the 2nd FFT result;
Step 103: adopt and calculate the convolution program that unified equipment framework programming model CUDA writes, directly call the computational resource of GPU a described FFT result and the 2nd FFT result are asked multiplication at frequency domain, obtain multiplied result;
Step 104: adopt and calculate the contrary FFT program that unified equipment framework programming model CUDA writes, the computational resource that directly calls GPU carries out contrary FFT computing to described multiplied result.
Below as can be seen, the present invention adopts the unified equipment framework of calculating programming model CUDA to write the various programs of carrying out Medical Image Processing, the program of these CUDA exploitations can directly be called GPU when handling image computational resource carries out Flame Image Process, key is to write the concurrent operation program relevant with moving Medical Treatment on the personal computer of cheapness, do not need to build complicated platform, do not need special hardware device or software, make implement very simple and low-cost; Simultaneously, CUDA is a kind of multiple programming model and software environment, and by CUDA, the user can utilize GPU to carry out general-purpose computations, and it adopts the C language development of expansion, and except that can directly calling the computational resource of GPU, operation efficiency is higher; The present invention is directed to medical image and need carry out the characteristics that image edge pixels is handled, with image edge pixels described primitive medicine image is expanded and polishing, implement with aforementioned employing CUDA technology, common effect is to make not need the computing machine of higher level to research and develop operating personnel, do not need special platform and equipment, just can carry out described Medical Image Processing, therefore, the present invention can be simple and easy, on the popular PC of ordinary consumption levels, realize directly calling the function that GPU carries out medical image processing at low cost.
In addition, the present invention can also realize following technique effect:
1) the present invention can make full use of the powerful calculating ability of GPU, can reach near 1Tflops/s even higher as GT200;
2) GPU has high bandwidth of memory, and image calculation efficient is higher;
3) under the equal computing power, GPU is more more cheap than CPU;
4), can on the personal computer of cheapness, write and move concurrent program based on the CUDA technology.With respect to the parallel computation environment based on network of workstations and cluster of MPI technology, the present invention has significant cost advantage.
For ease of GPU computing and conserve bandwidth, describedly described primitive medicine image is expanded in the step with polishing with image edge pixels, picture size behind the polishing is set at the multiple that is less than or equal to 1024 2n number or nearest 512.Such as, original image is of a size of 1220 * 120, and convolution kernel is of a size of 7 * 7, calculates to such an extent that fftW and fffH are respectively 1226 and 126, and it is 1536 * 128 that the picture size fftW * fftH that then finally participates in computing need supply.
Image operation in the embodiment of the invention, the FFT computing comprises an imageW * imageH size two-dimensional FFT operation, a fftW * fftH size two-dimensional FFT operation as described, described frequency domain that multiplication comprises imageW * imageH size and the fftW * fftH size multiplication mutually of asking, described contrary FFT computing comprises a fftW * contrary FFT computing of fftH size two dimension.
Wherein, described primitive medicine view data behind described expansion and the polishing and convolution kernel are carried out respectively in the step of FFT computing, select for use the CUFFT storehouse of nVidia exploitation to carry out described FFT computing.
The above only is embodiments of the invention; be not so limit claim of the present invention; every equivalent structure or equivalent flow process conversion that utilizes instructions of the present invention and accompanying drawing content to be done; or directly or indirectly be used in other relevant technical fields, all in like manner be included in the scope of patent protection of the present invention.

Claims (3)

1. realize that the GPU computing improves the method for DR image processing speed for one kind, it is characterized in that, comprising:
Read the primitive medicine view data, with image edge pixels described primitive medicine image is expanded and polishing, wherein establishing described original image size is imageW * imageH, convolution kernel is of a size of kernelW * kernelH, picture size is fftW * fftH behind the polishing, fftW=imageW+kernelW-1 wherein, fftH=imageH+kernelH-1;
The Fast Fourier Transform (FFT) FFT program that adopts the unified equipment framework of calculating programming model CUDA to write is directly called the computational resource of GPU, primitive medicine view data behind described expansion and the polishing and convolution kernel are carried out the FFT computing respectively, correspondingly obtain a FFT result and the 2nd FFT result;
Adopt and calculate the convolution program that unified equipment framework programming model CUDA writes, directly call the computational resource of GPU a described FFT result and the 2nd FFT result are asked multiplication at frequency domain, obtain multiplied result;
Adopt and calculate the contrary FFT program that unified equipment framework programming model CUDA writes, the computational resource that directly calls GPU carries out contrary FFT computing to described multiplied result.
2. method according to claim 1, it is characterized in that: described FFT computing comprises an imageW * imageH size two-dimensional FFT operation, a fftW * fftH size two-dimensional FFT operation, described frequency domain that multiplication comprises imageW * imageH size and the fftW * fftH size multiplication mutually of asking, described contrary FFT computing comprises a fftW * contrary FFT computing of fftH size two dimension.
3. method according to claim 2 is characterized in that: described primitive medicine view data behind described expansion and the polishing and convolution kernel are carried out respectively in the step of FFT computing, select for use the CUFFT storehouse of nVidia exploitation to carry out described FFT computing.
CN2010101073177A 2010-02-02 2010-02-02 Method for speeding up DR image processing by using operation of GPU Active CN101783021B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2010101073177A CN101783021B (en) 2010-02-02 2010-02-02 Method for speeding up DR image processing by using operation of GPU

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2010101073177A CN101783021B (en) 2010-02-02 2010-02-02 Method for speeding up DR image processing by using operation of GPU

Publications (2)

Publication Number Publication Date
CN101783021A CN101783021A (en) 2010-07-21
CN101783021B true CN101783021B (en) 2011-09-28

Family

ID=42523006

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2010101073177A Active CN101783021B (en) 2010-02-02 2010-02-02 Method for speeding up DR image processing by using operation of GPU

Country Status (1)

Country Link
CN (1) CN101783021B (en)

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102023838B (en) * 2010-12-17 2012-08-15 浪潮(北京)电子信息产业有限公司 Processing method and system of MRC picture file
CN102609978B (en) * 2012-01-13 2014-01-22 中国人民解放军信息工程大学 Method for accelerating cone-beam CT (computerized tomography) image reconstruction by using GPU (graphics processing unit) based on CUDA (compute unified device architecture) architecture
CN102662612B (en) * 2012-02-29 2014-12-10 浪潮(北京)电子信息产业有限公司 A method and a system for displaying MRC-form picture files by using Qt bank
CN102663759A (en) * 2012-04-20 2012-09-12 中国科学院遥感应用研究所 Remote-sensing image rapid-processing method
CN103099635B (en) * 2012-12-27 2014-12-24 李华 Graphics processing unit (GPU) digital image system for microscopic captive test (CT) machine and control method of GPU digital image system for microscopic CT machine
CN104750560B (en) * 2015-03-06 2018-12-14 联想(北京)有限公司 A kind of information processing method and electronic equipment
US20160379109A1 (en) 2015-06-29 2016-12-29 Microsoft Technology Licensing, Llc Convolutional neural networks on hardware accelerators

Also Published As

Publication number Publication date
CN101783021A (en) 2010-07-21

Similar Documents

Publication Publication Date Title
CN101783021B (en) Method for speeding up DR image processing by using operation of GPU
US11010858B2 (en) Mechanism to accelerate graphics workloads in a multi-core computing architecture
CN106575442B (en) Use the bandwidth reducing of the texture lookups by self-adaption paint
US11593069B2 (en) Use of a single instruction set architecture (ISA) instruction for vector normalization
US20200042834A1 (en) System and method for 3d blob classification and transmission
US10776156B2 (en) Thread priority mechanism
US10748323B2 (en) GPU based shader constant folding
CN105023234B (en) Figure accelerated method based on embedded system storage optimization
WO2016200540A1 (en) Facilitating efficient graphics command generation and execution for improved graphics performance at computing devices
CN105144243A (en) Data visualization
CN105550974A (en) GPU-based acceleration method of image feature extraction algorithm
US20180096446A1 (en) Graphics Processor Register Renaming Mechanism
WO2017201676A1 (en) Self-adaptive window mechanism
Ma et al. Optimized Laplacian image sharpening algorithm based on graphic processing unit
CN109844802B (en) Mechanism for improving thread parallelism in a graphics processor
US11934797B2 (en) Mechanism to perform single precision floating point extended math operations
US20170337728A1 (en) Triangle Rendering Mechanism
WO2017172032A1 (en) System and method of caching for pixel synchronization-based graphics techniques
CN108604386A (en) Multistep texture processing is carried out with the feedback in texture cell
WO2018057091A1 (en) Static data sharing mechanism for a heterogeneous processing environment
US20230146390A1 (en) Trainable visual quality metrics for measuring rendering quality in a graphics environment
US20210256714A1 (en) Dense motion tracking mechanism
US20240126357A1 (en) Power optimized blend
US20240134604A1 (en) Constant modulo via recirculant reduction
US20230306551A1 (en) Compression using a flat mapping in virtual address space

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
C56 Change in the name or address of the patentee
CP01 Change in the name or title of a patent holder

Address after: 518057, A, Hua Han Innovation Park, 16 Lang Shan Road, North hi tech Industrial Park, Guangdong, Shenzhen, 3A

Patentee after: SHENZHEN ANGELL TECHNOLOGY CO., LTD.

Address before: 518057, A, Hua Han Innovation Park, 16 Lang Shan Road, North hi tech Industrial Park, Guangdong, Shenzhen, 3A

Patentee before: Shenzhen Angell Technology Co., Ltd.

PE01 Entry into force of the registration of the contract for pledge of patent right

Denomination of invention: Method for speeding up DR image processing by using operation of GPU

Effective date of registration: 20190620

Granted publication date: 20110928

Pledgee: Shenzhen high tech investment and financing Company limited by guarantee

Pledgor: SHENZHEN ANGELL TECHNOLOGY CO., LTD.

Registration number: 2019990000591

PE01 Entry into force of the registration of the contract for pledge of patent right
PC01 Cancellation of the registration of the contract for pledge of patent right

Date of cancellation: 20200929

Granted publication date: 20110928

Pledgee: Shenzhen high tech investment and financing Company limited by guarantee

Pledgor: SHENZHEN ANGELL TECHNOLOGY Co.,Ltd.

Registration number: 2019990000591

PC01 Cancellation of the registration of the contract for pledge of patent right