CA2468800A1 - Floating point intensive reconfigurable computing system for iterative applications - Google Patents
Floating point intensive reconfigurable computing system for iterative applications
- Publication number
- CA2468800A1
- Authority
- CA
- Canada
- Prior art keywords
- instruction
- floating point
- computing system
- processing elements
- processing element
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Classifications
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/76—Architectures of general purpose stored program computers
- G06F15/80—Architectures of general purpose stored program computers comprising an array of processing units with common control, e.g. single instruction multiple data processors
- G06F15/8007—Architectures of general purpose stored program computers comprising an array of processing units with common control, e.g. single instruction multiple data processors; single instruction multiple data [SIMD] multiprocessors
- G06F15/8023—Two dimensional arrays, e.g. mesh, torus
Landscapes
- Engineering & Computer Science (AREA)
- Computer Hardware Design (AREA)
- Theoretical Computer Science (AREA)
- Computing Systems (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Advance Control (AREA)
Abstract
The invention relates to a reconfigurable computing system for accelerated execution of floating point intensive iterative applications. The reconfigurable computing system comprises a plurality of mounted, interconnected processing elements (20), a host processing system for displaying real-time outputs of the floating point computations performed by the processing elements (20), and an interface for connecting the processing elements to the host system. Each of the interconnected processing elements (20) comprises a floating point functional unit (22), an operand memory (24), a control memory (26), and a control unit (28). The floating point functional unit (22) includes a multiply/accumulate function. The operand memory (24) comprises a plurality of banks of static RAM. The processing elements (20) are interconnected through a nearest-neighbor or hierarchical implementation. The instruction set executed by the floating point functional unit (22) includes arithmetic instructions as well as control and communication instructions. The interface can be implemented as a PCI bus interface using a field-programmable gate array, or as an AGP bus interface.
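The structure described in the abstract can be illustrated with a small software model. The sketch below is only a toy under stated assumptions: the class name `ProcessingElement`, the opcodes `CLR`, `MAC`, `RECV`, and `SEND`, the two-bank layout, and the helper `chained_dot` are all hypothetical and do not come from the patent. It models a one-dimensional chain of processing elements, each with banked operand memory, a stored program standing in for control memory, and a multiply/accumulate unit, with partial results passed only between nearest neighbors.

```python
from dataclasses import dataclass, field


@dataclass
class ProcessingElement:
    """Toy model of one processing element (20). All names are hypothetical."""
    banks: list                                   # operand memory (24): banks of floats
    program: list = field(default_factory=list)   # control memory (26): instruction list
    acc: float = 0.0                              # accumulator of the floating point unit (22)
    inbox: list = field(default_factory=list)     # values received from a neighbor

    def step(self, instr, neighbors):
        """Stand-in for the control unit (28): decode and execute one instruction."""
        op, *args = instr
        if op == "CLR":        # control: clear the accumulator
            self.acc = 0.0
        elif op == "MAC":      # arithmetic: acc += bank_a[i] * bank_b[j]
            (ba, ia), (bb, ib) = args
            self.acc += self.banks[ba][ia] * self.banks[bb][ib]
        elif op == "RECV":     # communication: add a value sent by a neighbor
            self.acc += self.inbox.pop(0)
        elif op == "SEND":     # communication: pass the accumulator to a neighbor
            neighbors[args[0]].inbox.append(self.acc)
        else:
            raise ValueError(f"unknown opcode {op!r}")

    def run(self, neighbors):
        for instr in self.program:
            self.step(instr, neighbors)


def chained_dot(x, y, n_pe):
    """Dot product split across a 1-D chain of PEs with nearest-neighbor reduction."""
    chunk = (len(x) + n_pe - 1) // n_pe
    pes = []
    for k in range(n_pe):
        xs = list(x[k * chunk:(k + 1) * chunk])
        ys = list(y[k * chunk:(k + 1) * chunk])
        prog = [("CLR",)] + [("MAC", (0, i), (1, i)) for i in range(len(xs))]
        if k < n_pe - 1:
            prog.append(("RECV",))          # wait for the partial sum from the east
        if k > 0:
            prog.append(("SEND", "west"))   # forward the running sum westward
        pes.append(ProcessingElement(banks=[xs, ys], program=prog))
    # Sequential simulation: run from the far end so each RECV finds its value.
    for k in reversed(range(n_pe)):
        neighbors = {"west": pes[k - 1]} if k > 0 else {}
        pes[k].run(neighbors)
    return pes[0].acc


print(chained_dot([1.0, 2.0, 3.0, 4.0], [5.0, 6.0, 7.0, 8.0], n_pe=2))
```

Real hardware would run the processing elements concurrently; the reversed loop here is merely a convenient way to simulate the data dependency in software.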
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US33834701P | 2001-12-06 | 2001-12-06 | |
US60/338,347 | 2001-12-06 | | |
PCT/US2002/038645 WO2003050697A1 (en) | 2001-12-06 | 2002-12-06 | Floating point intensive reconfigurable computing system for iterative applications |
Publications (1)
Publication Number | Publication Date |
---|---|
CA2468800A1 | 2003-06-19 |
Family
ID=23324454
Family Applications (1)
Application Number | Status | Publication | Priority Date | Filing Date | Title |
---|---|---|---|---|---|
CA002468800A | Abandoned | CA2468800A1 (en) | 2001-12-06 | 2002-12-06 | Floating point intensive reconfigurable computing system for iterative applications |
Country Status (4)
Country | Link |
---|---|
EP (1) | EP1451701A1 (en) |
AU (1) | AU2002360469A1 (en) |
CA (1) | CA2468800A1 (en) |
WO (1) | WO2003050697A1 (en) |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7840826B2 (en) | 2007-05-31 | 2010-11-23 | Vns Portfolio Llc | Method and apparatus for using port communications to switch processor modes |
CN113760817B (en) * | 2017-03-28 | 2024-05-24 | 上海山里智能科技有限公司 | Integrated computing system |
CN118092853B (en) * | 2024-04-26 | 2024-07-19 | 中科亿海微电子科技(苏州)有限公司 | RISC-V-based floating-point transcendental function instruction set extension method and device |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5802290A (en) * | 1992-07-29 | 1998-09-01 | Virtual Computer Corporation | Computer network of distributed virtual computers which are EAC reconfigurable in response to instruction to be executed |
US5892962A (en) * | 1996-11-12 | 1999-04-06 | Lucent Technologies Inc. | FPGA-based processor |
US6289434B1 (en) * | 1997-02-28 | 2001-09-11 | Cognigine Corporation | Apparatus and method of implementing systems on silicon using dynamic-adaptive run-time reconfigurable circuits for processing multiple, independent data and control streams of varying rates |
US6507947B1 (en) * | 1999-08-20 | 2003-01-14 | Hewlett-Packard Company | Programmatic synthesis of processor element arrays |
US6408382B1 (en) * | 1999-10-21 | 2002-06-18 | Bops, Inc. | Methods and apparatus for abbreviated instruction sets adaptable to configurable processor architecture |
2002
- 2002-12-06 EP EP20020795726 patent/EP1451701A1/fr not_active Withdrawn
- 2002-12-06 AU AU2002360469A patent/AU2002360469A1/en not_active Abandoned
- 2002-12-06 WO PCT/US2002/038645 patent/WO2003050697A1/fr not_active Application Discontinuation
- 2002-12-06 CA CA002468800A patent/CA2468800A1/fr not_active Abandoned
Also Published As
Publication number | Publication date |
---|---|
WO2003050697A1 (en) | 2003-06-19 |
AU2002360469A1 (en) | 2003-06-23 |
EP1451701A1 (en) | 2004-09-01 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
EP3637265B1 (en) | Memory device performing in-memory prefetching and system including the same | |
CN111630502B (en) | Unified memory organization for neural network processors | |
CN112463719A (en) | In-memory computing method implemented on a coarse-grained reconfigurable array | |
US11593069B2 (en) | Use of a single instruction set architecture (ISA) instruction for vector normalization | |
US20240256274A1 (en) | Supporting 8-bit floating point format operands in a computing architecture | |
Fan et al. | Stream processing dual-track CGRA for object inference | |
US20090158013A1 (en) | Method and Apparatus Implementing a Minimal Area Consumption Multiple Addend Floating Point Summation Function in a Vector Microprocessor | |
US5053986A (en) | Circuit for preservation of sign information in operations for comparison of the absolute value of operands | |
KR20220051006A (en) | Method for performing PIM (processing-in-memory) operations, and related memory device and system | |
US20240320185A1 (en) | Deterministic memory for tensor streaming processors | |
US20050171990A1 (en) | Floating point intensive reconfigurable computing system for iterative applications | |
US10185560B2 (en) | Multi-functional execution lane for image processor | |
Kwon et al. | A 1ynm 1.25V 8Gb 16Gb/s/pin GDDR6-based accelerator-in-memory supporting 1TFLOPS MAC operation and various activation functions for deep learning application | |
Abdelhamid et al. | A highly-efficient and tightly-connected many-core overlay architecture | |
US7769981B2 (en) | Row of floating point accumulators coupled to respective PEs in uppermost row of PE array for performing addition operation | |
KR20210113099A (en) | Adjustable function-in-memory computing system | |
CN113590198A (en) | Computing efficient cross-channel operations in parallel computing machines using systolic arrays | |
CA2468800A1 (en) | Floating point intensive reconfigurable computing system for iterative applications | |
US8539207B1 (en) | Lattice-based computations on a parallel processor | |
US20230289398A1 (en) | Efficient Matrix Multiply and Add with a Group of Warps | |
Sohn et al. | Low-power 3D graphics processors for mobile terminals | |
Gayles et al. | The design of the MGAP-2: A micro-grained massively parallel array | |
Todaro et al. | Enhanced soft GPU architecture for FPGAs | |
Yun et al. | CLAY: CXL-based Scalable NDP Architecture Accelerating Embedding Layers | |
Kim et al. | Cache Register Sharing Structure for Channel-level Near-memory Processing in NAND Flash Memory |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
| FZDE | Discontinued | |