JP2022070955A5 - - Google Patents

Download PDF

Info

Publication number
JP2022070955A5
JP2022070955A5 JP2022019764A JP2022019764A JP2022070955A5 JP 2022070955 A5 JP2022070955 A5 JP 2022070955A5 JP 2022019764 A JP2022019764 A JP 2022019764A JP 2022019764 A JP2022019764 A JP 2022019764A JP 2022070955 A5 JP2022070955 A5 JP 2022070955A5
Authority
JP
Japan
Prior art keywords
superlayer
neural network
processing
integrated circuit
inputs
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
JP2022019764A
Other languages
English (en)
Japanese (ja)
Other versions
JP2022070955A (ja
JP7439149B2 (ja
Filing date
Publication date
Priority claimed from US15/599,559 external-priority patent/US10019668B1/en
Application filed filed Critical
Publication of JP2022070955A publication Critical patent/JP2022070955A/ja
Publication of JP2022070955A5 publication Critical patent/JP2022070955A5/ja
Application granted granted Critical
Publication of JP7439149B2 publication Critical patent/JP7439149B2/ja
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

JP2022019764A 2017-05-19 2022-02-10 ニューラルネットワーク処理のスケジューリング Active JP7439149B2 (ja)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US15/599,559 2017-05-19
US15/599,559 US10019668B1 (en) 2017-05-19 2017-05-19 Scheduling neural network processing
PCT/US2018/013939 WO2018212799A1 (en) 2017-05-19 2018-01-17 Scheduling neural network processing
JP2019552217A JP7025441B2 (ja) 2017-05-19 2018-01-17 ニューラルネットワーク処理のスケジューリング

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
JP2019552217A Division JP7025441B2 (ja) 2017-05-19 2018-01-17 ニューラルネットワーク処理のスケジューリング

Publications (3)

Publication Number Publication Date
JP2022070955A JP2022070955A (ja) 2022-05-13
JP2022070955A5 true JP2022070955A5 (enExample) 2022-08-08
JP7439149B2 JP7439149B2 (ja) 2024-02-27

Family

ID=61157323

Family Applications (2)

Application Number Title Priority Date Filing Date
JP2019552217A Active JP7025441B2 (ja) 2017-05-19 2018-01-17 ニューラルネットワーク処理のスケジューリング
JP2022019764A Active JP7439149B2 (ja) 2017-05-19 2022-02-10 ニューラルネットワーク処理のスケジューリング

Family Applications Before (1)

Application Number Title Priority Date Filing Date
JP2019552217A Active JP7025441B2 (ja) 2017-05-19 2018-01-17 ニューラルネットワーク処理のスケジューリング

Country Status (7)

Country Link
US (4) US10019668B1 (enExample)
EP (1) EP3577605B1 (enExample)
JP (2) JP7025441B2 (enExample)
KR (1) KR102346636B1 (enExample)
CN (2) CN117291239A (enExample)
TW (2) TWI699712B (enExample)
WO (1) WO2018212799A1 (enExample)

Families Citing this family (85)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11437032B2 (en) 2017-09-29 2022-09-06 Shanghai Cambricon Information Technology Co., Ltd Image processing apparatus and method
US11741346B2 (en) 2018-02-08 2023-08-29 Western Digital Technologies, Inc. Systolic neural network engine with crossover connection optimization
US10796198B2 (en) 2018-02-08 2020-10-06 Western Digital Technologies, Inc. Adjusting enhancement coefficients for neural network engine
EP3651073B1 (en) 2018-02-13 2021-10-27 Shanghai Cambricon Information Technology Co., Ltd Computation device and method
US11630666B2 (en) 2018-02-13 2023-04-18 Shanghai Cambricon Information Technology Co., Ltd Computing device and method
US11720357B2 (en) 2018-02-13 2023-08-08 Shanghai Cambricon Information Technology Co., Ltd Computing device and method
CN116991226A (zh) 2018-02-14 2023-11-03 上海寒武纪信息科技有限公司 处理器的控制装置、方法及设备
JP6961640B2 (ja) * 2018-03-22 2021-11-05 南京地平▲線▼机▲器▼人技▲術▼有限公司 データ処理のシステムおよび方法
US11461631B2 (en) * 2018-03-22 2022-10-04 Amazon Technologies, Inc. Scheduling neural network computations based on memory capacity
US11475306B2 (en) 2018-03-22 2022-10-18 Amazon Technologies, Inc. Processing for multiple input data sets
US20210042621A1 (en) * 2018-04-17 2021-02-11 Shenzhen Corerain Technologies Co., Ltd. Method for operation of network model and related product
US11562213B2 (en) * 2018-04-17 2023-01-24 Intel Corporation Methods and arrangements to manage memory in cascaded neural networks
EP3624020B1 (en) 2018-05-18 2025-07-02 Shanghai Cambricon Information Technology Co., Ltd Computation method and product thereof
US10970120B2 (en) * 2018-06-26 2021-04-06 Advanced Micro Devices, Inc. Method and system for opportunistic load balancing in neural networks using metadata
CN110728364B (zh) 2018-07-17 2024-12-17 上海寒武纪信息科技有限公司 一种运算装置和运算方法
EP3798850B1 (en) 2018-06-27 2025-11-19 Shanghai Cambricon Information Technology Co., Ltd On-chip code breakpoint debugging method, on-chip processor, and chip breakpoint debugging system
CN109117949A (zh) * 2018-08-01 2019-01-01 南京天数智芯科技有限公司 用于人工智能设备的灵活数据流处理器和处理方法
WO2020042739A1 (zh) * 2018-08-28 2020-03-05 中科寒武纪科技股份有限公司 数据预处理方法、装置、计算机设备和存储介质
EP4530933A3 (en) 2018-08-29 2025-07-02 QUALCOMM Incorporated Method, apparatus, and system for an architecture for machine learning acceleration
CN110956252B (zh) * 2018-09-27 2025-09-16 第四范式(北京)技术有限公司 执行多个神经网络的计算的方法和计算装置
WO2020062392A1 (zh) 2018-09-28 2020-04-02 上海寒武纪信息科技有限公司 信号处理装置、信号处理方法及相关产品
US11263529B2 (en) * 2018-10-10 2022-03-01 Google Llc Modifying machine learning models to improve locality
CN111383637A (zh) 2018-12-28 2020-07-07 上海寒武纪信息科技有限公司 信号处理装置、信号处理方法及相关产品
JP7379821B2 (ja) * 2019-01-09 2023-11-15 日本電信電話株式会社 推論処理装置および推論処理方法
US12254400B2 (en) * 2019-01-10 2025-03-18 Mipsology SAS Optimizing artificial neural network computations based on automatic determination of a batch size
US11586929B2 (en) 2019-02-15 2023-02-21 Wipro Limited Method and system for optimizing memory requirement for training an artificial neural network model
CN111667046A (zh) * 2019-03-08 2020-09-15 富泰华工业(深圳)有限公司 深度学习加速方法及用户终端
US11783176B2 (en) 2019-03-25 2023-10-10 Western Digital Technologies, Inc. Enhanced storage device memory architecture for machine learning
US10929058B2 (en) 2019-03-25 2021-02-23 Western Digital Technologies, Inc. Enhanced memory device architecture for machine learning
CN111831543B (zh) 2019-04-18 2024-07-16 中科寒武纪科技股份有限公司 一种数据处理方法及相关产品
US11934940B2 (en) 2019-04-18 2024-03-19 Cambricon Technologies Corporation Limited AI processor simulation
US11175898B2 (en) * 2019-05-31 2021-11-16 Apple Inc. Compiling code for a machine learning model for execution on a specialized processor
WO2020245937A1 (ja) * 2019-06-05 2020-12-10 日本電信電話株式会社 推論処理装置および推論処理方法
KR102609719B1 (ko) 2019-06-12 2023-12-04 상하이 캠브리콘 인포메이션 테크놀로지 컴퍼니 리미티드 신경망의 양자화 파라미터 확정방법 및 관련제품
US11676029B2 (en) 2019-06-12 2023-06-13 Shanghai Cambricon Information Technology Co., Ltd Neural network quantization parameter determination method and related products
US11556798B2 (en) * 2019-06-18 2023-01-17 Qualcomm Incorporated Optimizing machine learning model performance
EP3994621A1 (en) * 2019-07-03 2022-05-11 Huaxia General Processor Technologies Inc. Instructions for operating accelerator circuit
US11436019B2 (en) 2019-07-15 2022-09-06 Microsoft Technology Licensing, Llc Data parallelism in distributed training of artificial intelligence models
US11520592B2 (en) * 2019-07-15 2022-12-06 Microsoft Technology Licensing, Llc Executing large artificial intelligence models on memory-constrained devices
US11354579B2 (en) * 2019-07-15 2022-06-07 Microsoft Technology Licensing, Llc Dynamic multi-layer execution for artificial intelligence modeling
CN114258538B (zh) 2019-08-16 2024-04-12 谷歌有限责任公司 片上操作的显式调度
JP7146955B2 (ja) 2019-08-23 2022-10-04 安徽寒武紀信息科技有限公司 データ処理方法、装置、コンピュータデバイス、及び記憶媒体
JP7146952B2 (ja) 2019-08-23 2022-10-04 安徽寒武紀信息科技有限公司 データ処理方法、装置、コンピュータデバイス、及び記憶媒体
CN112434781B (zh) 2019-08-26 2024-09-10 上海寒武纪信息科技有限公司 用于处理数据的方法、装置以及相关产品
WO2021036905A1 (zh) 2019-08-27 2021-03-04 安徽寒武纪信息科技有限公司 数据处理方法、装置、计算机设备和存储介质
US11573828B2 (en) * 2019-09-16 2023-02-07 Nec Corporation Efficient and scalable enclave protection for machine learning programs
US11651209B1 (en) 2019-10-02 2023-05-16 Google Llc Accelerated embedding layer computations
DE102019127795A1 (de) * 2019-10-15 2021-04-15 Infineon Technologies Ag Schaltung und ein Verfahren zum Bestimmen einer Lage eines Magneten und Joystick
CN110515739B (zh) * 2019-10-23 2020-01-31 上海燧原智能科技有限公司 深度学习神经网络模型负载计算方法、装置、设备及介质
US12379933B2 (en) * 2019-10-25 2025-08-05 Samsung Electronics Co., Ltd. Ultra pipelined accelerator for machine learning inference
CN112862085B (zh) * 2019-11-27 2023-08-22 杭州海康威视数字技术股份有限公司 存储空间优化方法及装置
CN114424174A (zh) * 2019-12-18 2022-04-29 谷歌有限责任公司 用于神经网络加速器的参数高速缓存
CN111338816B (zh) * 2020-02-18 2023-05-12 深圳鲲云信息科技有限公司 基于神经网络的指令交互方法、系统、设备及存储介质
CN113298843B (zh) 2020-02-24 2024-05-14 中科寒武纪科技股份有限公司 数据量化处理方法、装置、电子设备和存储介质
CN113408717B (zh) 2020-03-17 2025-09-09 安徽寒武纪信息科技有限公司 计算装置、方法、板卡和计算机可读存储介质
JP6834097B1 (ja) * 2020-05-15 2021-02-24 エッジコーティックス ピーティーイー. リミテッド 推論のニューラルネットワークアクセラレータのハードウェア固有分割
CN115668225A (zh) * 2020-05-29 2023-01-31 华为技术有限公司 神经网络调度方法及装置
WO2021243489A1 (zh) * 2020-05-30 2021-12-09 华为技术有限公司 一种神经网络的数据处理方法及装置
US11288097B2 (en) * 2020-06-12 2022-03-29 Disney Enterprises, Inc. Automated hardware resource optimization
US20220012635A1 (en) * 2020-06-18 2022-01-13 Texas Instruments Incorporated Analytic techniques for improved super tiling machine learning processing
MX2023000126A (es) * 2020-07-02 2023-02-09 Interdigital Patent Holdings Inc Metodos, aparato y sistemas para autocodificador acondicionado por graficas (gcae) usando representaciones que facilitan la topologia.
KR102647690B1 (ko) * 2020-08-21 2024-03-14 주식회사 딥엑스 최적화된 인공신경망 모델을 구동하도록 구성된 신경망 프로세싱 유닛
KR102299084B1 (ko) * 2020-08-24 2021-09-07 오픈엣지테크놀로지 주식회사 하드웨어 가속기의 출력 데이터를 메모리에 저장하는 방법, 하드웨어 가속기의 입력 데이터를 메모리로부터 읽는 방법, 및 이를 위한 하드웨어 가속기
KR102384587B1 (ko) * 2020-08-25 2022-04-08 오픈엣지테크놀로지 주식회사 하드웨어 가속기의 출력 데이터를 압축하는 방법, 하드웨어 가속기로의 입력 데이터를 디코딩하는 방법, 및 이를 위한 하드웨어 가속기
KR20220027500A (ko) * 2020-08-27 2022-03-08 에스케이하이닉스 주식회사 가속 장치, 데이터 저장 장치, 데이터 처리 시스템 및 가속 장치의 동작방법
KR102883346B1 (ko) 2020-09-09 2025-11-07 삼성전자주식회사 호스트 프로세서 및 가속기의 동작 방법 및 이들을 포함한 전자 장치
WO2022116051A1 (en) * 2020-12-02 2022-06-09 Alibaba Group Holding Limited Neural network near memory processing
KR20220078290A (ko) * 2020-12-03 2022-06-10 삼성전자주식회사 뉴럴 네트워크 연산 스케줄링 방법 및 장치
US11734072B2 (en) * 2020-12-31 2023-08-22 Nuvolo Technologies Corporation Stream-based job processing
KR20220124551A (ko) 2021-03-03 2022-09-14 삼성전자주식회사 이종 하드웨어 타입의 가속기들을 포함한 전자 장치
KR102506613B1 (ko) * 2021-04-30 2023-03-06 주식회사 딥엑스 이종의 센서로 제공되는 이종의 데이터를 처리하기 위한 퓨전-인공신경망을 위해 구현되는 npu
US11511772B2 (en) 2021-04-30 2022-11-29 Deepx Co., Ltd. NPU implemented for artificial neural networks to process fusion of heterogeneous data received from heterogeneous sensors
EP4099609A1 (en) * 2021-06-04 2022-12-07 Zama SAS Computational network conversion for fully homomorphic evaluation
US12450478B2 (en) * 2021-09-10 2025-10-21 Maxim Integrated Products, Inc. Dynamic data-dependent neural network processing systems and methods
KR102869720B1 (ko) * 2021-09-17 2025-10-14 구글 엘엘씨 기계 학습 모델의 분할 추론 연산 수행
US11657260B2 (en) * 2021-10-26 2023-05-23 Edgecortix Pte. Ltd. Neural network hardware accelerator data parallelism
TWI802070B (zh) * 2021-11-03 2023-05-11 大陸商星宸科技股份有限公司 積體電路及其配置方法
US20230153583A1 (en) * 2021-11-15 2023-05-18 Xilinx, Inc. Compilation of neural networks into subgraphs for processing by multiple compute circuits
US11514370B1 (en) 2021-12-03 2022-11-29 FriendliAI Inc. Selective batching for inference system for transformer-based generation tasks
US11442775B1 (en) * 2021-12-03 2022-09-13 FriendliAI Inc. Dynamic batching for inference system for transformer-based generation tasks
US12493931B2 (en) 2021-12-08 2025-12-09 Deepx Co., Ltd. Neural processing unit and artificial neural network system for image fusion
KR102651559B1 (ko) * 2021-12-08 2024-03-26 주식회사 딥엑스 영상 융합을 위한 신경 프로세싱 유닛 및 인공신경망 시스템
KR20240102798A (ko) 2022-12-26 2024-07-03 리벨리온 주식회사 뉴럴 프로세서 및 이의 명령어 페치 방법
KR102548582B1 (ko) * 2022-12-26 2023-06-29 리벨리온 주식회사 뉴럴 프로세서 및 이의 명령어 페치 방법
US20240256285A1 (en) * 2023-01-31 2024-08-01 Microsoft Technology Licensing, Llc Parallelizing multi-phase kernels with cross-phase dependency on heterogenous hardware

Family Cites Families (32)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7089185B2 (en) 2002-06-27 2006-08-08 Intel Corporation Embedded multi-layer coupled hidden Markov model
US7171043B2 (en) 2002-10-11 2007-01-30 Intel Corporation Image recognition using hidden markov models and coupled hidden markov models
US7203368B2 (en) 2003-01-06 2007-04-10 Intel Corporation Embedded bayesian network for pattern recognition
KR100486735B1 (ko) 2003-02-28 2005-05-03 삼성전자주식회사 최적구획 분류신경망 구성방법과 최적구획 분류신경망을이용한 자동 레이블링방법 및 장치
US7639727B1 (en) * 2004-10-05 2009-12-29 Cingular Wireless Ii, L.L.C. System and method for selecting wireless signal bandwidth based on signal strength measurements provided by wireless receivers
TWI525558B (zh) * 2011-01-17 2016-03-11 Univ Nat Taipei Technology Resilient high - speed hardware reverse transfer and feedback type neural network system
US8462018B1 (en) * 2011-05-26 2013-06-11 Rockwell Collins, Inc. Systems and method for controlling the simultaneous display of multi-level classified information on the same surface of an aircraft display unit
US8725658B2 (en) * 2011-09-21 2014-05-13 Brain Corporation Elementary network description for efficient memory management in neuromorphic systems
US8914315B2 (en) * 2012-01-27 2014-12-16 International Business Machines Corporation Multi-compartment neuron suitable for implementation in a distributed hardware model by reducing communication bandwidth
US9477925B2 (en) * 2012-11-20 2016-10-25 Microsoft Technology Licensing, Llc Deep neural networks training for speech and pattern recognition
US10331823B2 (en) * 2013-10-24 2019-06-25 Mentor Graphics Corporation Method and system of fast nested-loop circuit verification for process and environmental variation and hierarchical circuits
US10095917B2 (en) * 2013-11-04 2018-10-09 Facebook, Inc. Systems and methods for facial representation
US20160026912A1 (en) * 2014-07-22 2016-01-28 Intel Corporation Weight-shifting mechanism for convolutional neural networks
EP3035204B1 (en) * 2014-12-19 2018-08-15 Intel Corporation Storage device and method for performing convolution operations
US20160335119A1 (en) 2015-05-12 2016-11-17 minds.ai inc Batch-based neural network system
US10049322B2 (en) * 2015-05-21 2018-08-14 Google Llc Prefetching weights for use in a neural network processor
US9747546B2 (en) * 2015-05-21 2017-08-29 Google Inc. Neural network processor
US10438117B1 (en) * 2015-05-21 2019-10-08 Google Llc Computing convolutions using a neural network processor
US10083395B2 (en) * 2015-05-21 2018-09-25 Google Llc Batch processing in a neural network processor
EP3304437B1 (en) * 2015-06-05 2021-05-26 DeepMind Technologies Limited Whitened neural network layers
EP3104309B1 (en) * 2015-06-10 2020-04-01 Samsung Electronics Co., Ltd. Spiking neural network with reduced memory access and reduced in-network bandwidth consumption
US10387770B2 (en) 2015-06-10 2019-08-20 Samsung Electronics Co., Ltd. Spiking neural network with reduced memory access and reduced in-network bandwidth consumption
US9582726B2 (en) * 2015-06-24 2017-02-28 Qualcomm Incorporated Systems and methods for image processing in a deep convolution network
US10452971B2 (en) 2015-06-29 2019-10-22 Microsoft Technology Licensing, Llc Deep neural network partitioning on servers
CN106599991B (zh) * 2015-10-08 2019-04-09 上海兆芯集成电路有限公司 具有神经存储器的神经网络单元和集体将来自神经存储器的数据列移位的神经处理单元阵列
KR102204887B1 (ko) 2015-10-28 2021-01-19 구글 엘엘씨 연산 그래프 수정
US20170154262A1 (en) * 2015-11-30 2017-06-01 Google Inc. Resizing neural networks
CN105426517B (zh) * 2015-12-02 2020-02-18 上海越峰信息科技有限公司 一种具有图像处理功能的智能存储设备
US10482380B2 (en) * 2015-12-30 2019-11-19 Amazon Technologies, Inc. Conditional parallel processing in fully-connected neural networks
KR102459854B1 (ko) * 2016-05-26 2022-10-27 삼성전자주식회사 심층 신경망용 가속기
AU2016203619A1 (en) * 2016-05-31 2017-12-14 Canon Kabushiki Kaisha Layer-based operations scheduling to optimise memory for CNN applications
US10922610B2 (en) * 2017-09-14 2021-02-16 Intel Corporation Synchronization scheduler of distributed neural network training

Similar Documents

Publication Publication Date Title
JP2022070955A5 (enExample)
US20240419967A1 (en) Asynchronous neural network training
He et al. GPU-accelerated parallel sparse LU factorization method for fast circuit analysis
US8762655B2 (en) Optimizing output vector data generation using a formatted matrix data structure
JP2020521195A5 (enExample)
US20170212968A1 (en) Circuit Verification
WO2019119301A1 (zh) 在卷积神经网络模型中确定特征图像的方法和装置
JP2020513176A5 (enExample)
US11663452B2 (en) Processor array for processing sparse binary neural networks
Lee et al. Fast matrix-vector multiplications for large-scale logistic regression on shared-memory systems
Mushtaq et al. Cluster-based Apache Spark implementation of the GATK DNA analysis pipeline
Wu et al. Skeletongcn: a simple yet effective accelerator for gcn training
CN118897938A (zh) 基于稀疏矩阵算子的lu分解方法、装置、设备及介质
US20240086719A1 (en) Sparse encoding and decoding at mixture-of-experts layer
CN115480919A (zh) 卷积优化运算方法、装置、计算机设备及存储介质
JP5269137B2 (ja) 演算装置
Liu et al. Data-transfer-bottleneck-less architecture for FPGA-based quantum annealing simulation
JP2016139391A (ja) テンソルデータ計算装置、テンソルデータ計算方法、及びプログラム
CN111723913A (zh) 一种数据处理方法、装置、设备及可读存储介质
Sahin et al. Usability of Markov chain Monte Carlo preconditioners in practical problems
CN104360898B (zh) 运行任务的方法和装置
Siládi et al. Adapted parallel Quine-McCluskey algorithm using GPGPU
EP4650952A1 (en) Vector processor tile array with input and output streams
US20210216374A1 (en) Cluster update accelerator circuit
Block et al. A hardware acceleration of a phylogenetic tree reconstruction with maximum parsimony algorithm using FPGA