JP2010532905A5 - - Google Patents

Download PDF

Info

Publication number
JP2010532905A5
JP2010532905A5 JP2010518258A JP2010518258A JP2010532905A5 JP 2010532905 A5 JP2010532905 A5 JP 2010532905A5 JP 2010518258 A JP2010518258 A JP 2010518258A JP 2010518258 A JP2010518258 A JP 2010518258A JP 2010532905 A5 JP2010532905 A5 JP 2010532905A5
Authority
JP
Japan
Prior art keywords
processor
processors
parallel
operable
memory
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP2010518258A
Other languages
Japanese (ja)
Other versions
JP2010532905A (en
Filing date
Publication date
Priority claimed from US12/147,332 external-priority patent/US8984256B2/en
Application filed filed Critical
Publication of JP2010532905A publication Critical patent/JP2010532905A/en
Publication of JP2010532905A5 publication Critical patent/JP2010532905A5/ja
Pending legal-status Critical Current

Links

Claims (18)

ダイナミックランダムアクセスメモリ(DRAM)のダイに埋め込まれた複数の並列プロセッサを備えていて、
前記複数の並列プロセッサは、外部メモリコントローラおよび外部プロセッサと通信し、
前記並列プロセッサの各々は、スレッドレベルの並列処理のために最適化された命令セットを処理するように動作可能であり、前記プロセッサの各々は、単一のスレッドを処理するように動作可能であり、複数のスレッドが、共有メモリまたは1つ以上の共有変数を通してデータを共有することを特徴とするシステム。
A plurality of parallel processors embedded in a dynamic random access memory (DRAM) die;
The plurality of parallel processors communicate with an external memory controller and an external processor;
Each of the parallel processors is operable to process an instruction set optimized for thread-level parallel processing, and each of the processors is operable to process a single thread. A system wherein multiple threads share data through shared memory or one or more shared variables.
前記ダイは、DRAMピン配列を有するパッケージに入れられていることを特徴とする請求項1に記載のシステム。   The system of claim 1, wherein the die is in a package having a DRAM pin array. 前記並列プロセッサは、デュアルインラインメモリモジュール上に搭載されていることを特徴とする請求項1に記載のシステム。   The system according to claim 1, wherein the parallel processor is mounted on a dual in-line memory module. 前記システムは、前記プロセッサがDRAMモードレジスタを通してイネーブルにされる時以外は、DRAMとして動作することを特徴とする請求項1に記載のシステム。   The system of claim 1, wherein the system operates as a DRAM except when the processor is enabled through a DRAM mode register. 前記外部プロセッサは、関連する永久記憶装置から前記DRAMにデータおよび命令を転送するように動作可能であることを特徴とする請求項1に記載のシステム。   The system of claim 1, wherein the external processor is operable to transfer data and instructions from an associated permanent storage device to the DRAM. 前記永久記憶装置は、フラッシュメモリであることを特徴とする請求項5に記載のシステム。   6. The system of claim 5, wherein the permanent storage device is a flash memory. 前記外部プロセッサは、前記並列プロセッサと外部装置との間の入出力インターフェースを提供するように動作可能であることを特徴とする請求項1に記載のシステム。   The system of claim 1, wherein the external processor is operable to provide an input / output interface between the parallel processor and an external device. メモリモードレジスタに割り当てられる1以上のビットは、前記並列プロセッサのうちの1つ以上をイネーブルまたはディスエーブルにするように動作可能であることを特徴とする請求項1に記載のシステム。   The system of claim 1, wherein one or more bits assigned to a memory mode register are operable to enable or disable one or more of the parallel processors. 少なくとも一つの前記並列プロセッサがマスタプロセッサとみなされ、他の前記並列プロセッサはスレーブプロセッサとみなされることを特徴とする請求項1に記載のシステム。   The system of claim 1, wherein at least one of the parallel processors is considered a master processor and the other parallel processors are considered slave processors. 各プロセッサは、クロック速度を有していて、前記マスタプロセッサ以外の各プロセッサは、性能または電力消費を最適化するように調整された前記プロセッサのクロック速度を有するように動作可能であることを特徴とする請求項9に記載のシステム。   Each processor has a clock speed, and each processor other than the master processor is operable to have the clock speed of the processor adjusted to optimize performance or power consumption. The system according to claim 9. 各プロセッサは、マスタプロセッサまたはスレーブプロセッサとみなされるように動作可能であることを特徴とする請求項9に記載のシステム。   The system of claim 9, wherein each processor is operable to be considered a master processor or a slave processor. 前記マスタプロセッサは、いくつかのスレーブプロセッサによる処理を要求し、前記いくつかのスレーブプロセッサからの出力を待ち、かつ前記出力を結合することを特徴とする請求項9に記載のシステム。   The system of claim 9, wherein the master processor requests processing by a number of slave processors, waits for output from the number of slave processors, and combines the outputs. 前記マスタプロセッサは、前記出力が前記いくつかのプロセッサの各々から受信されるとき、前記いくつかのプロセッサからの出力を結合することを特徴とする請求項12に記載のシステム。   The system of claim 12, wherein the master processor combines outputs from the several processors as the outputs are received from each of the several processors. 停止されるべき前記並列プロセッサのうちの1つ以上をイネーブルにすることによって、低電力消費が提供されることを特徴とする請求項1に記載のシステム。   The system of claim 1, wherein low power consumption is provided by enabling one or more of the parallel processors to be halted. 前記並列プロセッサの各々は、プログラムカウンタを伴っていて、前記並列プロセッサが伴っているプログラムカウンタに全て1を書き込むことによって停止されるように動作可能であることを特徴とする請求項14に記載のシステム。   15. Each of the parallel processors is associated with a program counter and is operable to be stopped by writing all ones to the program counter associated with the parallel processor. system. 複数のプロセッサは、モノリシックメモリデバイスのために設計された半導体製造プロセスを用いて前記チップ上に配置されるコンピュータメモリと共に前記チップ上に製造されることを特徴とする請求項1に記載のシステム。   The system of claim 1, wherein the plurality of processors are fabricated on the chip along with computer memory disposed on the chip using a semiconductor fabrication process designed for monolithic memory devices. 半導体製造プロセスは、4層未満のメタル相互接続を用いることを特徴とする請求項16に記載のシステム。   The system of claim 16, wherein the semiconductor manufacturing process uses less than four metal interconnects. 半導体製造プロセスは、3層未満のメタル相互接続を用いることを特徴とする請求項17に記載のシステム。   The system of claim 17, wherein the semiconductor manufacturing process uses less than three metal interconnects.
JP2010518258A 2008-06-26 2008-06-27 Thread-optimized multiprocessor architecture Pending JP2010532905A (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US12/147,332 US8984256B2 (en) 2006-02-03 2008-06-26 Thread optimized multiprocessor architecture
PCT/US2008/068566 WO2009157943A1 (en) 2008-06-26 2008-06-27 Thread optimized multiprocessor architecture

Publications (2)

Publication Number Publication Date
JP2010532905A JP2010532905A (en) 2010-10-14
JP2010532905A5 true JP2010532905A5 (en) 2012-06-07

Family

ID=41444820

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2010518258A Pending JP2010532905A (en) 2008-06-26 2008-06-27 Thread-optimized multiprocessor architecture

Country Status (10)

Country Link
US (2) US8984256B2 (en)
EP (1) EP2288988A4 (en)
JP (1) JP2010532905A (en)
KR (1) KR101121606B1 (en)
CN (2) CN101796484B (en)
AU (1) AU2008355072C1 (en)
BR (1) BRPI0811497A2 (en)
CA (1) CA2684753A1 (en)
RU (1) RU2450339C2 (en)
WO (1) WO2009157943A1 (en)

Families Citing this family (33)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8984256B2 (en) * 2006-02-03 2015-03-17 Russell Fish Thread optimized multiprocessor architecture
CN101395578B (en) 2006-02-03 2013-01-09 拉塞尔·H·菲什三世 Multiprocessor System and Thread-level Parallel Processing Method
US8209597B2 (en) * 2009-03-23 2012-06-26 Cognitive Electronics, Inc. System and method for achieving improved accuracy from efficient computer architectures
US8612687B2 (en) 2010-05-26 2013-12-17 International Business Machines Corporation Latency-tolerant 3D on-chip memory organization
US20120151232A1 (en) * 2010-12-12 2012-06-14 Fish Iii Russell Hamilton CPU in Memory Cache Architecture
US9823928B2 (en) 2011-09-30 2017-11-21 Qualcomm Incorporated FIFO load instruction
KR101946455B1 (en) * 2013-03-14 2019-02-11 삼성전자주식회사 System on-Chip and operating method of the same
RU2538920C2 (en) * 2013-05-06 2015-01-10 Общество с ограниченной ответственностью "Аби ИнфоПоиск" Method for task distribution by computer system server, computer-readable data medium and system for implementing said method
US10521387B2 (en) * 2014-02-07 2019-12-31 Toshiba Memory Corporation NAND switch
US9870401B2 (en) * 2014-04-17 2018-01-16 Wisoncsin Alumni Research Foundation Database system with highly denormalized database structure
RU2579899C1 (en) * 2014-09-30 2016-04-10 Общество с ограниченной ответственностью "Аби Девелопмент" Document processing using multiple processing flows
US20160147667A1 (en) * 2014-11-24 2016-05-26 Samsung Electronics Co., Ltd. Address translation in memory
US11567911B2 (en) * 2014-12-19 2023-01-31 Sergey Anatol'evich GORISHNIY System and method for management of functionally linked data
CN104598319B (en) * 2015-01-13 2017-06-30 浪潮电子信息产业股份有限公司 Node distribution method for realizing application performance optimization
RU2612569C2 (en) * 2015-01-27 2017-03-09 Акционерное общество "Научно-исследовательский институт Авиационного оборудования" Method for automatic control of redundancy of heterogeneous computer system and devices for its implementation
US9959208B2 (en) 2015-06-02 2018-05-01 Goodrich Corporation Parallel caching architecture and methods for block-based data processing
CN105808256B (en) * 2016-03-08 2017-06-23 武汉斗鱼网络科技有限公司 It is a kind of to construct the method and system that legal storehouse return value bypasses function call detection
RU2644535C2 (en) * 2016-06-01 2018-02-12 Владимир Викторович Ермишин Parallel computing architecture
US20180113840A1 (en) * 2016-10-25 2018-04-26 Wisconsin Alumni Research Foundation Matrix Processor with Localized Memory
DE102016224747A1 (en) * 2016-12-12 2018-06-14 Robert Bosch Gmbh control unit
US10459771B2 (en) 2017-02-22 2019-10-29 Red Hat Israel, Ltd. Lightweight thread synchronization using shared memory state
KR102782323B1 (en) 2017-07-30 2025-03-18 뉴로블레이드, 리미티드. Memory-based distributed processor architecture
RU2665224C1 (en) * 2017-11-09 2018-08-28 Российская Федерация, от имени которой выступает Государственная корпорация по космической деятельности "РОСКОСМОС" Method of dynamic control of conflict situations in complex technical systems with environmental cloud computing
US10613955B2 (en) * 2017-12-28 2020-04-07 Intel Corporation Platform debug and testing with secured hardware
US11030148B2 (en) 2018-04-04 2021-06-08 Lawrence Livermore National Security, Llc Massively parallel hierarchical control system and method
CN111382091A (en) * 2018-12-30 2020-07-07 德克萨斯仪器股份有限公司 Broadside random access memory for low cycle memory access and additional functions
JP7107275B2 (en) * 2019-04-25 2022-07-27 株式会社デンソー PARALLELIZATION METHOD, SEMICONDUCTOR CONTROL DEVICE, AND VEHICLE CONTROL DEVICE
CN111209042B (en) * 2020-01-06 2022-08-26 北京字节跳动网络技术有限公司 Method, device, medium and electronic equipment for establishing function stack
KR20210156058A (en) 2020-06-17 2021-12-24 삼성전자주식회사 Memory device for performing in-memory processing
EP4075275A1 (en) * 2021-04-16 2022-10-19 GrAl Matter Labs S.A.S. Message based processor, message based processing method and record carrier
KR102620843B1 (en) * 2021-11-22 2024-01-03 리벨리온 주식회사 Reconfigurable on-chip memory bank, Reconfigurable on-chip memory, System on Chip mounted same and Method for using Reconfigurable on-chip memory
CN117573709B (en) * 2023-10-23 2024-09-20 昆易电子科技(上海)有限公司 Data processing system, electronic device, and medium
CN117555599B (en) * 2024-01-10 2024-04-05 睿思芯科(成都)科技有限公司 Chip design method, system and related equipment for accelerating key data access speed

Family Cites Families (33)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US553890A (en) * 1896-02-04 zeidlee
US3990054A (en) * 1974-11-05 1976-11-02 Honeywell Inc. Microprogram organization techniques
US4661900A (en) * 1983-04-25 1987-04-28 Cray Research, Inc. Flexible chaining in vector processor with selective use of vector registers as operand and result registers
US4641238A (en) * 1984-12-10 1987-02-03 Itt Corporation Multiprocessor system employing dynamically programmable processing elements controlled by a master processor
US5440749A (en) 1989-08-03 1995-08-08 Nanotronics Corporation High performance, low cost microprocessor architecture
US5590345A (en) 1990-11-13 1996-12-31 International Business Machines Corporation Advanced parallel array processor(APAP)
DE69425377T2 (en) * 1994-11-29 2001-02-15 International Business Machines Corp., Armonk Single cycle processor for real time processing
US6524019B1 (en) * 1995-03-27 2003-02-25 Nec Corporation Inter-cluster data transfer system and data transfer method
JP3075184B2 (en) * 1996-08-02 2000-08-07 日本電気株式会社 Main storage system with arithmetic processing function and control method thereof
JPH1111021A (en) 1997-06-23 1999-01-19 Nippon Kayaku Co Ltd Thermal recording material
JP3099290B2 (en) * 1997-10-03 2000-10-16 啓介 進藤 Information processing device using multi-thread program
US6076152A (en) * 1997-12-17 2000-06-13 Src Computers, Inc. Multiprocessor computer architecture incorporating a plurality of memory algorithm processors in the memory subsystem
US7373440B2 (en) * 1997-12-17 2008-05-13 Src Computers, Inc. Switch/network adapter port for clustered computers employing a chain of multi-adaptive processors in a dual in-line memory module format
NO308149B1 (en) * 1998-06-02 2000-07-31 Thin Film Electronics Asa Scalable, integrated data processing device
JP3156670B2 (en) * 1998-06-26 2001-04-16 日本電気株式会社 Packet type memory system with arithmetic processing function and control method thereof
FI117523B (en) * 1998-10-07 2006-11-15 Nokia Corp Procedure for regulating power consumption
JP2000207248A (en) * 1999-01-14 2000-07-28 Toshiba Corp Apparatus and method for generating behavior of parallel program, and recording medium recording software for generating behavior of parallel program
US6606704B1 (en) 1999-08-31 2003-08-12 Intel Corporation Parallel multithreaded processor with plural microengines executing multiple threads each microengine having loadable microcode
JP2002108691A (en) * 2000-09-29 2002-04-12 Mitsubishi Electric Corp Semiconductor storage device and method of controlling semiconductor storage device
US7043416B1 (en) * 2001-07-27 2006-05-09 Lsi Logic Corporation System and method for state restoration in a diagnostic module for a high-speed microprocessor
US7064579B2 (en) * 2002-07-08 2006-06-20 Viciciv Technology Alterable application specific integrated circuit (ASIC)
US6803786B1 (en) * 2003-03-11 2004-10-12 Xilinx, Inc. Structures and methods providing columns of tightly coupled processor and RAM blocks within an array of logic blocks
US7558920B2 (en) * 2004-06-30 2009-07-07 Intel Corporation Apparatus and method for partitioning a shared cache of a chip multi-processor
US7676646B2 (en) 2005-03-02 2010-03-09 Cisco Technology, Inc. Packet processor with wide register set architecture
US20070028010A1 (en) * 2005-08-01 2007-02-01 Texas Instruments, Inc. Peripheral device utilization monitoring
US7421566B2 (en) * 2005-08-12 2008-09-02 International Business Machines Corporation Implementing instruction set architectures with non-contiguous register file specifiers
US8984256B2 (en) 2006-02-03 2015-03-17 Russell Fish Thread optimized multiprocessor architecture
CN101395578B (en) * 2006-02-03 2013-01-09 拉塞尔·H·菲什三世 Multiprocessor System and Thread-level Parallel Processing Method
TW200737188A (en) * 2006-03-27 2007-10-01 Memocom Corp Complex memory chip
EP1855181A2 (en) * 2006-05-10 2007-11-14 Marvell World Trade Ltd. System with high power and low power processors and thread transfer
US7870551B2 (en) * 2006-05-18 2011-01-11 International Business Machines Corporation Optimization of thread wake up for shared processor partitions
US7917788B2 (en) * 2006-11-01 2011-03-29 Freescale Semiconductor, Inc. SOC with low power and performance modes
US20120151232A1 (en) 2010-12-12 2012-06-14 Fish Iii Russell Hamilton CPU in Memory Cache Architecture

Similar Documents

Publication Publication Date Title
JP2010532905A5 (en)
TWI869578B (en) System and method for computing
Asghari-Moghaddam et al. Chameleon: Versatile and practical near-DRAM acceleration architecture for large memory systems
TWI703440B (en) Memory system, processing system thereof and method for operating memory stack
EP2645263B1 (en) Memory device and method of operating the same
Fick et al. Centip3De: A 3930DMIPS/W configurable near-threshold 3D stacked system with 64 ARM Cortex-M3 cores
US11276459B2 (en) Memory die including local processor and global processor, memory device, and electronic device
RU2009145519A (en) MULTI-PROCESSOR ARCHITECTURE OPTIMIZED FOR STREAMS
Farmahini-Farahani et al. DRAMA: An architecture for accelerated processing near memory
US10572339B2 (en) Memory latency management
US11507301B2 (en) Memory module implementing memory centric architecture
JP6303247B2 (en) Techniques for accessing a dynamic random access memory array
US11474950B2 (en) Memory controller including plurality of address mapping tables, system on chip, and electronic device
CN105247617A (en) Hybrid memory device
CN102609312A (en) Method for dispatching shortest-job-first memory requests on basis of fairness consideration
JP7632865B2 (en) Electronic device and memory device
JP2008176699A5 (en)
Lee et al. Buffered compares: Excavating the hidden parallelism inside DRAM architectures with lightweight logic
Zhang et al. Pm3: Power modeling and power management for processing-in-memory
EP4071583A1 (en) Avoiding processor stall when accessing coherent memory device in low power
US9257152B2 (en) Memory architectures having wiring structures that enable different access patterns in multiple dimensions
TWI642055B (en) Nonvolatile memory module
Fang et al. Memory architecture for integrating emerging memory technologies
CN106569968A (en) Inter-array data transmission structure and scheduling method used for reconfigurable processor
TW200933385A (en) Microcontroller having dual-core architecture