CN108563519A - 基于gpgpu中的2d桌面块拷贝填充操作的实现 - Google Patents

基于gpgpu中的2d桌面块拷贝填充操作的实现 Download PDF

Info

Publication number
CN108563519A
CN108563519A CN201810340825.6A CN201810340825A CN108563519A CN 108563519 A CN108563519 A CN 108563519A CN 201810340825 A CN201810340825 A CN 201810340825A CN 108563519 A CN108563519 A CN 108563519A
Authority
CN
China
Prior art keywords
copy
block
block copy
gpgpu
frame
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201810340825.6A
Other languages
English (en)
Other versions
CN108563519B (zh
Inventor
杨盼
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Changsha In Blx Ic Design Corp
Original Assignee
Changsha In Blx Ic Design Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Changsha In Blx Ic Design Corp filed Critical Changsha In Blx Ic Design Corp
Priority to CN201810340825.6A priority Critical patent/CN108563519B/zh
Publication of CN108563519A publication Critical patent/CN108563519A/zh
Application granted granted Critical
Publication of CN108563519B publication Critical patent/CN108563519B/zh
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/54Interprogram communication
    • G06F9/543User-generated data transfer, e.g. clipboards, dynamic data exchange [DDE], object linking and embedding [OLE]
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Image Generation (AREA)
  • Image Processing (AREA)

Abstract

本发明公开了一种GPGPU的2D桌面块拷贝和填充实现方式,包括重叠区域拷贝块拷贝操作,从帧存读取源数据写入双口RAM,从RAM读取数据进行字节对齐,再送入块拷贝单元,进行色域转换后将像素写入FIFO,再通过控制进行Burst写回帧存。块填充为高效模式,由控制单元直接产生写帧存操作,同时产生相应的Burst操作。整个光栅化操作过程简单,高效,极大的加速了复杂3D应用场景下2D桌面流畅度。

Description

基于GPGPU中的2D桌面块拷贝填充操作的实现
技术领域
本发明主要涉及到基于GPGPU中的2D桌面设计领域,特指基于GPGPU中的2D桌面块拷贝和块填充操作的实现。
背景技术
操作系统按应用领域分为桌面操作系统,服务器操作系统,嵌入式操作系统。桌面操作系统应用最为普遍,桌面操作系统基本功能是实现人机交互,人机交互过程中追求流畅的桌面窗口显示效果从而带来友好的体验,目前主流桌面系统有MAC OS,LINUX,WINDOWS,桌面系统主要操作对象为像素操作,而像素操作的特点在于数据量庞大,操作次数极其频繁。
实现2D桌面硬件加速通常由软件直接实现,2D桌面性能极大的依赖CPU性能,为了摆脱对CPU的完全依赖,通过GPGPU实现硬件加速。然而在某些应用情景下,GPGPU绘图引擎被负责的3D绘图占据时,2D桌面使用时会造成卡顿,而独立2D桌面块拷贝及块填充操作实现2D桌面的全加速,不占用3D绘图引擎资源。
发明内容
本发明要解决的问题就在于:针对现有的桌面应用需求,本发明提供一种相对简单、硬件资源占比很小、性能极高的块拷贝和填充操作的实现,为2D桌面实现加速,不受限于复杂的3D应用场景,达到流畅的2D桌面显示效果。
与现有技术相比,本发明的优点就在于:1、性能极高:本发明提出的块拷贝操作的实现采用流水方式实现以及高效率Burst方式写帧存,Burst操作可达到128x256bit,相比1024x768显示分辨率,一次可连续操作一行像素;2、逻辑资源少,本发明支持的块拷贝和填充操作占用资源少,电路结构相对简单;3、可复用性强:本发明采用独立的2D块拷贝和填充实现,采用标准的本地总线配置接口接收2D操作命令,采用标准的内部存储总线接口访问帧存,可重用性强,能在GPGPU高性能通用型图形芯片设计重复使用。
附图说明
图1块拷贝相对关系示意图;
图2是本发明提出的块填充与拷贝结构框图。
具体实施方式
以下将结合附图和具体实线对本发明做进一步详细说明。
如图1所示,本发明中块拷贝操作支持重叠区域拷贝,具体实现方式有4种重叠方式,仅需通过判断源于目标的Y其实坐标大小,第一第二种情况以Y坐标递增的方式进行,第三第四种情况以Y坐标递减的情况进行逐行拷贝操作。
如图2所示,本发明的2D桌面块拷贝和填充的实现分为三个阶段,第一阶段:快拷贝需要从帧存读取源数据写入双口RAM。第二阶段:取双口RAM的数据,进行字节对齐再送入块拷贝单元,再进行相应的色域转换。第三阶段:流水转换后依据目标对象起始地址进行对齐,再将像素写入FIFO,FIFO非空状态可直接启动写帧存Burst操作,此处Burst操作若存储带宽足够的情况下,也可以达到全流水操作性能。块填充为高效模式,由控制单元直接产生写帧存操作,同时产生相应的Burst操作,以及字节掩码控制写帧存。每周期可以输出4个像素。整个块拷贝填充过程简单,高效,极大的加速了复杂3D应用场景下2D桌面流畅度。

Claims (1)

1.GPGPU芯片设计中2D桌面快拷贝填充操作的实现方法,其特征在于支持块拷贝操作支持重叠区域拷贝以及高效率填充,采用了128bit全流水操作,块拷贝依据源和目标其实地址采用了流水方式对其进行拷贝操作,实现了单周期4个像素(RGBA,32bit)的光栅化操作,块填充技术仅需按照木目标起始地址进行流水操作,块拷贝流水操作包括从RAM读取像素,两级缓存对齐,两级流水执行光栅化操作,流水写回,填充仅需流水写入控制写回。
CN201810340825.6A 2018-04-17 2018-04-17 基于gpgpu中的2d桌面块拷贝填充操作的实现 Active CN108563519B (zh)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810340825.6A CN108563519B (zh) 2018-04-17 2018-04-17 基于gpgpu中的2d桌面块拷贝填充操作的实现

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810340825.6A CN108563519B (zh) 2018-04-17 2018-04-17 基于gpgpu中的2d桌面块拷贝填充操作的实现

Publications (2)

Publication Number Publication Date
CN108563519A true CN108563519A (zh) 2018-09-21
CN108563519B CN108563519B (zh) 2023-06-09

Family

ID=63535457

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810340825.6A Active CN108563519B (zh) 2018-04-17 2018-04-17 基于gpgpu中的2d桌面块拷贝填充操作的实现

Country Status (1)

Country Link
CN (1) CN108563519B (zh)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111813481A (zh) * 2020-07-09 2020-10-23 长沙景嘉微电子股份有限公司 Gpgpu中2d桌面位图扩展方法、装置及图形处理器
CN111813482A (zh) * 2020-07-09 2020-10-23 长沙景嘉微电子股份有限公司 Gpgpu中的2d桌面色键方法、装置及图形处理器
CN111813483A (zh) * 2020-07-09 2020-10-23 长沙景嘉微电子股份有限公司 Gpgpu中的2d桌面旋转方法、装置及图形处理器

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120185671A1 (en) * 2011-01-14 2012-07-19 Qualcomm Incorporated Computational resource pipelining in general purpose graphics processing unit
CN103888771A (zh) * 2013-12-30 2014-06-25 中山大学深圳研究院 基于gpgpu技术的并行视频图像处理方法
US20160364898A1 (en) * 2015-06-11 2016-12-15 Bimal Poddar Optimizing for rendering with clear color

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120185671A1 (en) * 2011-01-14 2012-07-19 Qualcomm Incorporated Computational resource pipelining in general purpose graphics processing unit
CN103888771A (zh) * 2013-12-30 2014-06-25 中山大学深圳研究院 基于gpgpu技术的并行视频图像处理方法
US20160364898A1 (en) * 2015-06-11 2016-12-15 Bimal Poddar Optimizing for rendering with clear color

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111813481A (zh) * 2020-07-09 2020-10-23 长沙景嘉微电子股份有限公司 Gpgpu中2d桌面位图扩展方法、装置及图形处理器
CN111813482A (zh) * 2020-07-09 2020-10-23 长沙景嘉微电子股份有限公司 Gpgpu中的2d桌面色键方法、装置及图形处理器
CN111813483A (zh) * 2020-07-09 2020-10-23 长沙景嘉微电子股份有限公司 Gpgpu中的2d桌面旋转方法、装置及图形处理器

Also Published As

Publication number Publication date
CN108563519B (zh) 2023-06-09

Similar Documents

Publication Publication Date Title
CN108563519A (zh) 基于gpgpu中的2d桌面块拷贝填充操作的实现
US20200020067A1 (en) Concurrent binning and rendering
US11037358B1 (en) Methods and apparatus for reducing memory bandwidth in multi-pass tessellation
CN108492242A (zh) 基于gpgpu中的2d桌面混合操作的实现
US8823715B2 (en) Efficient writing of pixels to tiled planar pixel arrays
KR102645239B1 (ko) Gpu 캐시를 활용한 다운스케일링을 위한 simo 접근 방식으로의 gpu 커널 최적화
TW202230287A (zh) 用於遮擋處理技術的方法和裝置
CN108520490A (zh) 基于gpgpu中的2d桌面光栅化操作的实现
WO2023087827A1 (zh) 渲染方法及装置
US11657471B2 (en) Methods and apparatus for constant data storage
US11893654B2 (en) Optimization of depth and shadow pass rendering in tile based architectures
US9239699B2 (en) Enabling hardware acceleration in a computing device during a mosaic display mode of operation thereof
WO2023164792A1 (en) Checkerboard mask optimization in occlusion culling
US11373267B2 (en) Methods and apparatus for reducing the transfer of rendering information
US20220172695A1 (en) Methods and apparatus for plane planning for overlay composition
TW202318332A (zh) 動態可變速率著色
KR20240069100A (ko) 영역 분리를 이용한 영상 처리 장치, 영상 처리 방법, 및 이를 포함하는 전자 시스템
WO2023158532A1 (en) Adaptive block-based frame similarity encoding

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant