BR112023017122A2 - Textura em looping intraonda - Google Patents

Textura em looping intraonda

Info

Publication number
BR112023017122A2
BR112023017122A2 BR112023017122A BR112023017122A BR112023017122A2 BR 112023017122 A2 BR112023017122 A2 BR 112023017122A2 BR 112023017122 A BR112023017122 A BR 112023017122A BR 112023017122 A BR112023017122 A BR 112023017122A BR 112023017122 A2 BR112023017122 A2 BR 112023017122A2
Authority
BR
Brazil
Prior art keywords
thread
subgroups
group
intrawave
looping
Prior art date
Application number
BR112023017122A
Other languages
English (en)
Inventor
evan gruber Andrew
Original Assignee
Qualcomm Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Qualcomm Inc filed Critical Qualcomm Inc
Publication of BR112023017122A2 publication Critical patent/BR112023017122A2/pt

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T1/00General purpose image data processing
    • G06T1/20Processor architectures; Processor configuration, e.g. pipelining
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F8/00Arrangements for software engineering
    • G06F8/40Transformation of program code
    • G06F8/41Compilation
    • G06F8/44Encoding
    • G06F8/443Optimisation
    • G06F8/4441Reducing the execution time required by the program code
    • G06F8/4442Reducing the number of cache misses; Data prefetching
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F12/00Accessing, addressing or allocating within memory systems or architectures
    • G06F12/02Addressing or allocation; Relocation
    • G06F12/08Addressing or allocation; Relocation in hierarchically structured memory systems, e.g. virtual memory systems
    • G06F12/0802Addressing of a memory level in which the access to the desired data or data block requires associative addressing means, e.g. caches
    • G06F12/0862Addressing of a memory level in which the access to the desired data or data block requires associative addressing means, e.g. caches with prefetch
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/48Program initiating; Program switching, e.g. by interrupt
    • G06F9/4806Task transfer initiation or dispatching
    • G06F9/4843Task transfer initiation or dispatching by program, e.g. task dispatcher, supervisor, operating system
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T1/00General purpose image data processing
    • G06T1/60Memory management
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T15/003D [Three Dimensional] image rendering
    • G06T15/005General purpose rendering architectures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2212/00Indexing scheme relating to accessing, addressing or allocation within memory systems or architectures
    • G06F2212/60Details of cache memory
    • G06F2212/6024History based prefetching

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Computer Graphics (AREA)
  • Image Generation (AREA)

Abstract

textura em looping intraonda. a presente revelação refere-se aos métodos e dispositivos para processamento gráfico incluindo um aparelho, por exemplo, uma gpu. o aparelho pode determinar se deve dividir um grupo de threads em uma pluralidade de subgrupos de threads, cada thread do grupo de threads sendo associado a um programa shader. o aparelho também pode dividir, ao determinar a divisão do grupo de threads na pluralidade de subgrupos de threads, o grupo de threads na pluralidade de subgrupos de threads. além disso, o aparelho pode executar, ao dividir o grupo de threads na pluralidade de subgrupos de threads, uma subseção do programa shader para cada subgrupo de threads da pluralidade de subgrupos de threads.
BR112023017122A 2021-03-03 2022-02-04 Textura em looping intraonda BR112023017122A2 (pt)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US17/191,439 US11640647B2 (en) 2021-03-03 2021-03-03 Methods and apparatus for intra-wave texture looping
PCT/US2022/015336 WO2022186948A1 (en) 2021-03-03 2022-02-04 Intra-wave texture looping

Publications (1)

Publication Number Publication Date
BR112023017122A2 true BR112023017122A2 (pt) 2023-09-26

Family

ID=80446190

Family Applications (1)

Application Number Title Priority Date Filing Date
BR112023017122A BR112023017122A2 (pt) 2021-03-03 2022-02-04 Textura em looping intraonda

Country Status (7)

Country Link
US (1) US11640647B2 (pt)
EP (1) EP4302181A1 (pt)
KR (1) KR20230130157A (pt)
CN (1) CN116940922A (pt)
BR (1) BR112023017122A2 (pt)
TW (1) TW202301116A (pt)
WO (1) WO2022186948A1 (pt)

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7542043B1 (en) 2005-05-23 2009-06-02 Nvidia Corporation Subdividing a shader program
US9648325B2 (en) * 2007-06-30 2017-05-09 Microsoft Technology Licensing, Llc Video decoding implementations for a graphics processing unit
US9648299B2 (en) * 2013-01-04 2017-05-09 Qualcomm Incorporated Indication of presence of texture and depth views in tracks for multiview coding plus depth
US9652284B2 (en) * 2013-10-01 2017-05-16 Qualcomm Incorporated GPU divergence barrier
US9799094B1 (en) * 2016-05-23 2017-10-24 Qualcomm Incorporated Per-instance preamble for graphics processing
US10558460B2 (en) 2016-12-14 2020-02-11 Qualcomm Incorporated General purpose register allocation in streaming processor
US10325341B2 (en) * 2017-04-21 2019-06-18 Intel Corporation Handling pipeline submissions across many compute units
US11442795B2 (en) * 2018-09-11 2022-09-13 Nvidia Corp. Convergence among concurrently executing threads
US11816500B2 (en) * 2019-03-15 2023-11-14 Intel Corporation Systems and methods for synchronization of multi-thread lanes
US11016774B1 (en) * 2019-11-26 2021-05-25 Arm Limited Issuing execution threads in a data processor

Also Published As

Publication number Publication date
WO2022186948A1 (en) 2022-09-09
US11640647B2 (en) 2023-05-02
TW202301116A (zh) 2023-01-01
CN116940922A (zh) 2023-10-24
US20220284537A1 (en) 2022-09-08
KR20230130157A (ko) 2023-09-11
EP4302181A1 (en) 2024-01-10

Similar Documents

Publication Publication Date Title
BR112023015620A2 (pt) Métodos e aparelhos para suporte de objeto tensor em cargas de trabalho de aprendizagem por máquina
IN2012DN00929A (pt)
US8645963B2 (en) Clustering threads based on contention patterns
BR112014018434A8 (pt) Gerenciamento de buffer para unidade de processamento paralelo de gráficos
WO2008127623A3 (en) Parallel runtime execution on multiple processors
JP2016532180A5 (pt)
KR102631479B1 (ko) 그래픽스 처리
BR112016002637A2 (pt) sincronização de barreira com cálculo dinâmico de largura
ES2570653A1 (es) Método implementado por ordenador, sistema y producto de programa de ordenador para simular el comportamiento de un tejido entretejido a nivel de hilo
BR112016012541A2 (pt) Método e dispositivo para seleção de âncora mestra
BR112018007430A2 (pt) aceleração de subgráficos de tarefa pela sincronização de remapeamento
BR112018016913A2 (pt) predicados uniformes em shaders para unidades de processamento gráfico
BR112015026327A8 (pt) método para alterar visibilidade de tempo de execução de uma interface de programação de aplicativo (api), método para reduzir a visibilidade preestabelecida dentro de uma interface de programação de aplicativo (api) acessível e método para proporcionar a um consumidor visibilidade especificada dentro de uma interface de programação de aplicativo (api)
BR112017005424A2 (pt) tecido, processo para a produção de um tecido elástico e artigo de vestuário
GB2540070A (en) Hardware counters to track utilization in a multithreading computer system
BR112018001970A2 (pt) mecanismo de regras para dispositivos conectados
BR112014019090A8 (pt) Método e aparelho
BR112023017122A2 (pt) Textura em looping intraonda
TW201712630A (en) Apparatus and method to improve the scalability of graphics processor unit (GPU) virtualization
BR112018007923A2 (pt) método para produção de polpa de dissolução, polpa de dissolução, fibra têxtil, celulose derivatizada, uso de uma polpa de dissolução, e, uso de celulase.
Aguilera et al. Process variation-aware workload partitioning algorithms for GPUs supporting spatial-multitasking
BR112015000367A2 (pt) método e dispositivo de comutação de imagem
US20190147986A1 (en) Methods for assessing neoadjuvant therapies
BR112017001840A2 (pt) método e aparelho para desenvolvimento de operação, administração e manutenção virtual, e sistema de rede virtualizada
BR112015015217A2 (pt) método e dispositivo para otimização de memória