BR112023017122A2 - Textura em looping intraonda - Google Patents
Textura em looping intraondaInfo
- Publication number
- BR112023017122A2 BR112023017122A2 BR112023017122A BR112023017122A BR112023017122A2 BR 112023017122 A2 BR112023017122 A2 BR 112023017122A2 BR 112023017122 A BR112023017122 A BR 112023017122A BR 112023017122 A BR112023017122 A BR 112023017122A BR 112023017122 A2 BR112023017122 A2 BR 112023017122A2
- Authority
- BR
- Brazil
- Prior art keywords
- thread
- subgroups
- group
- intrawave
- looping
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T1/00—General purpose image data processing
- G06T1/20—Processor architectures; Processor configuration, e.g. pipelining
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F8/00—Arrangements for software engineering
- G06F8/40—Transformation of program code
- G06F8/41—Compilation
- G06F8/44—Encoding
- G06F8/443—Optimisation
- G06F8/4441—Reducing the execution time required by the program code
- G06F8/4442—Reducing the number of cache misses; Data prefetching
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F12/00—Accessing, addressing or allocating within memory systems or architectures
- G06F12/02—Addressing or allocation; Relocation
- G06F12/08—Addressing or allocation; Relocation in hierarchically structured memory systems, e.g. virtual memory systems
- G06F12/0802—Addressing of a memory level in which the access to the desired data or data block requires associative addressing means, e.g. caches
- G06F12/0862—Addressing of a memory level in which the access to the desired data or data block requires associative addressing means, e.g. caches with prefetch
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/46—Multiprogramming arrangements
- G06F9/48—Program initiating; Program switching, e.g. by interrupt
- G06F9/4806—Task transfer initiation or dispatching
- G06F9/4843—Task transfer initiation or dispatching by program, e.g. task dispatcher, supervisor, operating system
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T1/00—General purpose image data processing
- G06T1/60—Memory management
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T15/00—3D [Three Dimensional] image rendering
- G06T15/005—General purpose rendering architectures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F2212/00—Indexing scheme relating to accessing, addressing or allocation within memory systems or architectures
- G06F2212/60—Details of cache memory
- G06F2212/6024—History based prefetching
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Software Systems (AREA)
- Computer Graphics (AREA)
- Image Generation (AREA)
Abstract
textura em looping intraonda. a presente revelação refere-se aos métodos e dispositivos para processamento gráfico incluindo um aparelho, por exemplo, uma gpu. o aparelho pode determinar se deve dividir um grupo de threads em uma pluralidade de subgrupos de threads, cada thread do grupo de threads sendo associado a um programa shader. o aparelho também pode dividir, ao determinar a divisão do grupo de threads na pluralidade de subgrupos de threads, o grupo de threads na pluralidade de subgrupos de threads. além disso, o aparelho pode executar, ao dividir o grupo de threads na pluralidade de subgrupos de threads, uma subseção do programa shader para cada subgrupo de threads da pluralidade de subgrupos de threads.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US17/191,439 US11640647B2 (en) | 2021-03-03 | 2021-03-03 | Methods and apparatus for intra-wave texture looping |
PCT/US2022/015336 WO2022186948A1 (en) | 2021-03-03 | 2022-02-04 | Intra-wave texture looping |
Publications (1)
Publication Number | Publication Date |
---|---|
BR112023017122A2 true BR112023017122A2 (pt) | 2023-09-26 |
Family
ID=80446190
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
BR112023017122A BR112023017122A2 (pt) | 2021-03-03 | 2022-02-04 | Textura em looping intraonda |
Country Status (7)
Country | Link |
---|---|
US (1) | US11640647B2 (pt) |
EP (1) | EP4302181A1 (pt) |
KR (1) | KR20230130157A (pt) |
CN (1) | CN116940922A (pt) |
BR (1) | BR112023017122A2 (pt) |
TW (1) | TW202301116A (pt) |
WO (1) | WO2022186948A1 (pt) |
Family Cites Families (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7542043B1 (en) | 2005-05-23 | 2009-06-02 | Nvidia Corporation | Subdividing a shader program |
US9648325B2 (en) * | 2007-06-30 | 2017-05-09 | Microsoft Technology Licensing, Llc | Video decoding implementations for a graphics processing unit |
US9648299B2 (en) * | 2013-01-04 | 2017-05-09 | Qualcomm Incorporated | Indication of presence of texture and depth views in tracks for multiview coding plus depth |
US9652284B2 (en) * | 2013-10-01 | 2017-05-16 | Qualcomm Incorporated | GPU divergence barrier |
US9799094B1 (en) * | 2016-05-23 | 2017-10-24 | Qualcomm Incorporated | Per-instance preamble for graphics processing |
US10558460B2 (en) | 2016-12-14 | 2020-02-11 | Qualcomm Incorporated | General purpose register allocation in streaming processor |
US10325341B2 (en) * | 2017-04-21 | 2019-06-18 | Intel Corporation | Handling pipeline submissions across many compute units |
US11442795B2 (en) * | 2018-09-11 | 2022-09-13 | Nvidia Corp. | Convergence among concurrently executing threads |
US11816500B2 (en) * | 2019-03-15 | 2023-11-14 | Intel Corporation | Systems and methods for synchronization of multi-thread lanes |
US11016774B1 (en) * | 2019-11-26 | 2021-05-25 | Arm Limited | Issuing execution threads in a data processor |
-
2021
- 2021-03-03 US US17/191,439 patent/US11640647B2/en active Active
-
2022
- 2022-02-04 EP EP22705673.6A patent/EP4302181A1/en active Pending
- 2022-02-04 BR BR112023017122A patent/BR112023017122A2/pt unknown
- 2022-02-04 WO PCT/US2022/015336 patent/WO2022186948A1/en active Application Filing
- 2022-02-04 CN CN202280017523.XA patent/CN116940922A/zh active Pending
- 2022-02-04 KR KR1020237029282A patent/KR20230130157A/ko active IP Right Grant
- 2022-02-07 TW TW111104250A patent/TW202301116A/zh unknown
Also Published As
Publication number | Publication date |
---|---|
WO2022186948A1 (en) | 2022-09-09 |
US11640647B2 (en) | 2023-05-02 |
TW202301116A (zh) | 2023-01-01 |
CN116940922A (zh) | 2023-10-24 |
US20220284537A1 (en) | 2022-09-08 |
KR20230130157A (ko) | 2023-09-11 |
EP4302181A1 (en) | 2024-01-10 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
BR112023015620A2 (pt) | Métodos e aparelhos para suporte de objeto tensor em cargas de trabalho de aprendizagem por máquina | |
IN2012DN00929A (pt) | ||
US8645963B2 (en) | Clustering threads based on contention patterns | |
BR112014018434A8 (pt) | Gerenciamento de buffer para unidade de processamento paralelo de gráficos | |
WO2008127623A3 (en) | Parallel runtime execution on multiple processors | |
JP2016532180A5 (pt) | ||
KR102631479B1 (ko) | 그래픽스 처리 | |
BR112016002637A2 (pt) | sincronização de barreira com cálculo dinâmico de largura | |
ES2570653A1 (es) | Método implementado por ordenador, sistema y producto de programa de ordenador para simular el comportamiento de un tejido entretejido a nivel de hilo | |
BR112016012541A2 (pt) | Método e dispositivo para seleção de âncora mestra | |
BR112018007430A2 (pt) | aceleração de subgráficos de tarefa pela sincronização de remapeamento | |
BR112018016913A2 (pt) | predicados uniformes em shaders para unidades de processamento gráfico | |
BR112015026327A8 (pt) | método para alterar visibilidade de tempo de execução de uma interface de programação de aplicativo (api), método para reduzir a visibilidade preestabelecida dentro de uma interface de programação de aplicativo (api) acessível e método para proporcionar a um consumidor visibilidade especificada dentro de uma interface de programação de aplicativo (api) | |
BR112017005424A2 (pt) | tecido, processo para a produção de um tecido elástico e artigo de vestuário | |
GB2540070A (en) | Hardware counters to track utilization in a multithreading computer system | |
BR112018001970A2 (pt) | mecanismo de regras para dispositivos conectados | |
BR112014019090A8 (pt) | Método e aparelho | |
BR112023017122A2 (pt) | Textura em looping intraonda | |
TW201712630A (en) | Apparatus and method to improve the scalability of graphics processor unit (GPU) virtualization | |
BR112018007923A2 (pt) | método para produção de polpa de dissolução, polpa de dissolução, fibra têxtil, celulose derivatizada, uso de uma polpa de dissolução, e, uso de celulase. | |
Aguilera et al. | Process variation-aware workload partitioning algorithms for GPUs supporting spatial-multitasking | |
BR112015000367A2 (pt) | método e dispositivo de comutação de imagem | |
US20190147986A1 (en) | Methods for assessing neoadjuvant therapies | |
BR112017001840A2 (pt) | método e aparelho para desenvolvimento de operação, administração e manutenção virtual, e sistema de rede virtualizada | |
BR112015015217A2 (pt) | método e dispositivo para otimização de memória |