JP2022541144A5 - - Google Patents
Info
- Publication number
- JP2022541144A5 JP2022541144A5 JP2022500757A JP2022500757A JP2022541144A5 JP 2022541144 A5 JP2022541144 A5 JP 2022541144A5 JP 2022500757 A JP2022500757 A JP 2022500757A JP 2022500757 A JP2022500757 A JP 2022500757A JP 2022541144 A5 JP2022541144 A5 JP 2022541144A5
- Authority
- JP
- Japan
- Prior art keywords
- operations
- routine
- hardware accelerator
- subset
- task
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US16/511,689 US11250107B2 (en) | 2019-07-15 | 2019-07-15 | Method for interfacing with hardware accelerators |
| US16/511,689 | 2019-07-15 | ||
| PCT/EP2020/068377 WO2021008868A1 (en) | 2019-07-15 | 2020-06-30 | A method for interfacing with hardware accelerators |
Publications (3)
| Publication Number | Publication Date |
|---|---|
| JP2022541144A JP2022541144A (ja) | 2022-09-22 |
| JP2022541144A5 true JP2022541144A5 (https=) | 2022-11-18 |
| JP7361192B2 JP7361192B2 (ja) | 2023-10-13 |
Family
ID=71409414
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| JP2022500757A Active JP7361192B2 (ja) | 2019-07-15 | 2020-06-30 | ハードウェア・アクセラレータとインターフェースするための方法 |
Country Status (5)
| Country | Link |
|---|---|
| US (1) | US11250107B2 (https=) |
| EP (1) | EP3999957B1 (https=) |
| JP (1) | JP7361192B2 (https=) |
| CN (1) | CN114127689B (https=) |
| WO (1) | WO2021008868A1 (https=) |
Families Citing this family (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP3933709A1 (en) * | 2020-06-30 | 2022-01-05 | Upstride | Graph processing method and system |
| EP4285286A1 (en) * | 2021-02-01 | 2023-12-06 | Microsoft Technology Licensing, LLC | Semi-programmable and reconfigurable co-accelerator for a deep neural network with normalization or non-linearity |
| CN115904696A (zh) * | 2021-09-30 | 2023-04-04 | 想象技术有限公司 | 用于配置具有可配置流水线的神经网络加速器的方法和设备 |
| TR2021020689A2 (tr) * | 2021-12-22 | 2023-07-21 | Havelsan Hava Elektronik Sanayi Ve Ticaret Anonim Sirketi | Gömülü ve bütünleşi̇k si̇stemlerde paralel yapay si̇ni̇r ağlari i̇le topluluk öğrenmesi̇ |
| US20240161222A1 (en) * | 2022-11-16 | 2024-05-16 | Nvidia Corporation | Application programming interface to indicate image-to-column transformation |
| US12455900B1 (en) | 2023-03-07 | 2025-10-28 | QEngine LLC | Method for executing a query in a multi-dimensional data space using vectorization and a related system |
| KR102740239B1 (ko) * | 2023-03-24 | 2024-12-10 | 한국과학기술원 | 다중 신경망 가속을 위한 확장가능 벡터-어레이 이종 가속기 구조 및 스케쥴링 기법 |
Family Cites Families (19)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6081890A (en) * | 1998-11-30 | 2000-06-27 | Intel Corporation | Method of communication between firmware written for different instruction set architectures |
| US7219085B2 (en) * | 2003-12-09 | 2007-05-15 | Microsoft Corporation | System and method for accelerating and optimizing the processing of machine learning techniques using a graphics processing unit |
| US8250578B2 (en) | 2008-02-22 | 2012-08-21 | International Business Machines Corporation | Pipelining hardware accelerators to computer systems |
| US7984267B2 (en) * | 2008-09-04 | 2011-07-19 | International Business Machines Corporation | Message passing module in hybrid computing system starting and sending operation information to service program for accelerator to execute application program |
| US8751556B2 (en) | 2010-06-11 | 2014-06-10 | Massachusetts Institute Of Technology | Processor for large graph algorithm computations and matrix operations |
| US8752064B2 (en) * | 2010-12-14 | 2014-06-10 | Advanced Micro Devices, Inc. | Optimizing communication of system call requests |
| JP2012256150A (ja) * | 2011-06-08 | 2012-12-27 | Renesas Electronics Corp | コンパイル装置、コンパイル方法、及びプログラム |
| US9411853B1 (en) | 2012-08-03 | 2016-08-09 | Healthstudio, LLC | In-memory aggregation system and method of multidimensional data processing for enhancing speed and scalability |
| US9471388B2 (en) * | 2013-03-14 | 2016-10-18 | Altera Corporation | Mapping network applications to a hybrid programmable many-core device |
| US20150006341A1 (en) * | 2013-06-27 | 2015-01-01 | Metratech Corp. | Billing transaction scheduling |
| US10540588B2 (en) | 2015-06-29 | 2020-01-21 | Microsoft Technology Licensing, Llc | Deep neural network processing on hardware accelerators with stacked memory |
| JP2018173672A (ja) * | 2015-09-03 | 2018-11-08 | 株式会社Preferred Networks | 実装装置 |
| JP6658033B2 (ja) * | 2016-02-05 | 2020-03-04 | 富士通株式会社 | 演算処理回路、および情報処理装置 |
| US9646243B1 (en) | 2016-09-12 | 2017-05-09 | International Business Machines Corporation | Convolutional neural networks using resistive processing unit array |
| JP6724869B2 (ja) * | 2017-06-19 | 2020-07-15 | 株式会社デンソー | 多層ニューラルネットワークのニューロンの出力レベル調整方法 |
| US11620490B2 (en) * | 2017-10-17 | 2023-04-04 | Xilinx, Inc. | Multi-layer neural network processing by a neural network accelerator using host communicated merged weights and a package of per-layer instructions |
| US10698766B2 (en) * | 2018-04-18 | 2020-06-30 | EMC IP Holding Company LLC | Optimization of checkpoint operations for deep learning computing |
| CN108876702A (zh) * | 2018-06-21 | 2018-11-23 | 北京邮电大学 | 一种加速分布式深度神经网络的训练方法及装置 |
| US10620951B2 (en) * | 2018-06-22 | 2020-04-14 | Intel Corporation | Matrix multiplication acceleration of sparse matrices using column folding and squeezing |
-
2019
- 2019-07-15 US US16/511,689 patent/US11250107B2/en active Active
-
2020
- 2020-06-30 CN CN202080051285.5A patent/CN114127689B/zh active Active
- 2020-06-30 WO PCT/EP2020/068377 patent/WO2021008868A1/en not_active Ceased
- 2020-06-30 EP EP20735563.7A patent/EP3999957B1/en active Active
- 2020-06-30 JP JP2022500757A patent/JP7361192B2/ja active Active
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| JP2022541144A5 (https=) | ||
| Gravina et al. | Anti-symmetric DGN: a stable architecture for deep graph networks | |
| JP6840827B2 (ja) | ニューラルネットワークプロセッサにおけるバッチ処理 | |
| CN109740747B (zh) | 运算方法、装置及相关产品 | |
| JP2025111477A5 (ja) | データ生成方法、半導体集積回路、プログラム、装置及びシステム | |
| KR102302609B1 (ko) | 신경망 아키텍처 최적화 | |
| JP2023515556A5 (https=) | ||
| CN114127689B (zh) | 用于与硬件加速器接口的方法 | |
| Drozdowski | Scheduling for parallel processing | |
| JP2020129404A (ja) | 計算グラフの処理 | |
| Diekmann et al. | Load balancing strategies for distributed memory machines | |
| KR20190093932A (ko) | 딥러닝 시스템에서의 연산 처리 장치 및 방법 | |
| Nachaoui et al. | On the numerical approximation of some inverse problems governed by nonlinear delay differential equation | |
| Manickam et al. | Novel Lagrange sense exponential stability criteria for time-delayed stochastic Cohen–Grossberg neural networks with Markovian jump parameters: a graph-theoretic approach | |
| WO2023125857A1 (zh) | 基于机器学习框架系统的模型训练方法及相关设备 | |
| Pati et al. | Demystifying bert: Implications for accelerator design | |
| WO2024239971A1 (zh) | 神经网络模型的编译方法、推理方法、装置、设备和介质 | |
| CN110825380A (zh) | 核函数的生成方法、目标代码的生成方法和组合处理装置 | |
| US20240354612A1 (en) | Caching Matrix Representations of Repeated Quantum Gates | |
| Ivutin et al. | Design efficient schemes of applied algorithms parallelization based on semantic Petri-Markov net | |
| US20220012573A1 (en) | Neural network accelerators | |
| Blochinger et al. | A Universal Parallel SAT Checking Kernel. | |
| Sumner et al. | Speed-up of machine learning for sound localization via high-performance computing | |
| CN119473317B (zh) | 一种云际环境中成本感知的大模型部署优化方法及系统 | |
| Arora | Test Case Generation Using Progressively Refined Genetic Algorithm for Ajax Web Application Testing |