CN108563510A - The architecture method for sensing and optimizing calculated towards E grades - Google Patents

The architecture method for sensing and optimizing calculated towards E grades Download PDF

Info

Publication number
CN108563510A
CN108563510A CN201810418522.1A CN201810418522A CN108563510A CN 108563510 A CN108563510 A CN 108563510A CN 201810418522 A CN201810418522 A CN 201810418522A CN 108563510 A CN108563510 A CN 108563510A
Authority
CN
China
Prior art keywords
sensing
optimizing
code section
grades
program code
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201810418522.1A
Other languages
Chinese (zh)
Other versions
CN108563510B (en
Inventor
刘彦
刘尧
黄智�
黄一智
李仁发
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hunan University
Original Assignee
Hunan University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hunan University filed Critical Hunan University
Priority to CN201810418522.1A priority Critical patent/CN108563510B/en
Publication of CN108563510A publication Critical patent/CN108563510A/en
Application granted granted Critical
Publication of CN108563510B publication Critical patent/CN108563510B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5005Allocation of resources, e.g. of the central processing unit [CPU] to service a request
    • G06F9/5027Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5083Techniques for rebalancing the load in a distributed system

Landscapes

  • Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses a kind of architecture method for sensing and optimizing calculated towards E grades, include the steps that task flow sensing and optimizing;The step of program code sensing and optimizing;The step of the step of programmed algorithm sensing and optimizing and vectorial sensing and optimizing.This architecture method for sensing and optimizing calculated towards E grades provided by the invention, using a variety of sensing and optimizing algorithms, when magnanimity computing resource that can be when being calculated towards E grade, rationally adjustment and distribution hardware resource, raising computational efficiency.

Description

The architecture method for sensing and optimizing calculated towards E grades
Technical field
Present invention relates particularly to a kind of architecture method for sensing and optimizing calculated towards E grades.
Background technology
With the arrival of the development of economic technology, the improvement of people's living standards and information age, E grades calculate (million Trillion times) epoch of supercomputer arrived.E grades calculate have superpower computing capability, can give computational science and Scientific research brings revolutionary variation.
With the reach of science, the scale of various numerical models is also increasing, and model is also more nearly reality Situation.But more accurate model means more complicated model solution.Currently, the reasearch funds of scientific research institutions are opposite Anxiety can not undertake a large amount of required funds of server of purchase.So even if scientific research institutions successfully establish more accurately Numerical model, its own also can not independently solve the numerical model.
If scientific research institutions can be by the code migrating of more accurate numerical model to the super calculation platform calculated towards E grades On, then scientific research institutions are then not necessarily to purchase the powerful server of a large amount of performances, it is only necessary to calculation code is transplanted to super calculation platform, It is long-range to check calculating progress and effect, the expense of the super calculation of calculating of payment.And the expense and a large amount of performances of super calculation of calculating are powerful Server compare, undoubtedly cost wants cheap many.
But how the calculation code of numerical model is effectively transplanted to the super calculation platform calculated towards E grades, to most The computing capabilitys for playing the super calculation platform of E grades of calculating of degree greatly, there has been no relevant researchs at present, thus also to a certain extent Constrain the development in the field.
Invention content
The purpose of the present invention is to provide one kind to calculate towards E grades, can be perceived to journey according to task architecture The architecture method for sensing and optimizing calculated towards E grades that sequence optimizes.
This architecture method for sensing and optimizing calculated towards E grades provided by the invention, includes the following steps:
The step of task flow sensing and optimizing, is perceived and is optimized for all tasks to required execution, and adjustment is simultaneously The processing sequence of each task is obtained, to accelerate the treatment effeciency of task flow;
The step of program code sensing and optimizing, carries out for the program code for needing to execute to be distributed to most suitable system It carries out Distributed Calculation and carries out result and summarize, to improve the treatment effeciency of program code;
The step of programmed algorithm sensing and optimizing, for being analyzed programmed algorithm and being distributed corresponding computing resource, from And improve the treatment effeciency of programmed algorithm;
The step of vectorial sensing and optimizing, selects maximum compression to store lattice for calculating the vectorization in calculating process Formula, to improve the treatment effeciency of vectorization calculating.
The task flow sensing and optimizing, specifically comprises the following steps:
(1) task of institute's execution in need is obtained, and confirms that there is no relation of interdependence between all tasks;
(2) subtask of all tasks to being obtained in step (1) is identified, and unifies in a period of time to each Computing resource is distributed in a subtask;
(3) subtask of calculation amount minimum in each subtask, and priority processing are found using most short optimization of job algorithm The subtask of calculation amount minimum.
The program code sensing and optimizing, specifically comprises the following steps:
1) test code is tested, the best match found between each typical independent code section and system hardware is closed System;
2) program code is identified, finds the independent code section in program code;
3) the independent code section that step 2) obtains is matched with typical independent code section, to obtain each independent generation Correspondence between code section and typical independent code section;
4) according in the independent code section in step 3) and the correspondence between typical independent code section and step 1) Optimum matching relation between each typical case's independent code section and system hardware, independent code section is distributed hard to corresponding system Part, to improve the treatment effeciency of program code.
The programmed algorithm sensing and optimizing, specifically comprises the following steps:
A. before programmed algorithm operation, the mean allocation of computing resource is carried out to different programmed algorithms;
B. when programmed algorithm is run, if occurring calculating the case where waiting for, the resource requirement of programmed algorithm carries out again Analysis;
C. the analysis result obtained according to step B balances (DRF) algorithm using dynamic resource and is carried out again to computing resource Distribution.
Being redistributed to computing resource described in step C is specially redistributed using following principle:
R1. user cannot obtain resources more more than other users;
R2. user cannot obtain more resources by lying about its resource requirement;
R3. all utilizable resources are distributed, do not have to replace existing resource allocation;
R4. user will not prefer the resource allocation of other users.
This architecture method for sensing and optimizing calculated towards E grades provided by the invention, is calculated using a variety of sensing and optimizings Method, can towards E grades calculate when magnanimity computing resource when, rationally adjustment and distribution hardware resource, improve computational efficiency.
Description of the drawings
Fig. 1 is the method schematic diagram of the method for the present invention.
Specific implementation mode
It is the method flow schematic diagram of the method for the present invention as shown in Figure 1:This body calculated towards E grades provided by the invention Architecture method for sensing and optimizing, includes the following steps:
The step of task flow sensing and optimizing, is perceived and is optimized for all tasks to required execution, and adjustment is simultaneously The processing sequence of each task is obtained, to accelerate the treatment effeciency of task flow;Specifically comprise the following steps:
(1) task of institute's execution in need is obtained, and confirms that there is no relation of interdependence between all tasks;
(2) subtask of all tasks to being obtained in step (1) is identified, and unifies in a period of time to each Computing resource is distributed in a subtask;
(3) subtask of calculation amount minimum in each subtask, and priority processing are found using most short optimization of job algorithm The subtask of calculation amount minimum;
The step of program code sensing and optimizing, carries out for the program code for needing to execute to be distributed to most suitable system It carries out Distributed Calculation and carries out result and summarize, to improve the treatment effeciency of program code;Specifically comprise the following steps:
1) test code is tested, the best match found between each typical independent code section and system hardware is closed System;
2) program code is identified, finds the independent code section in program code;
3) the independent code section that step 2) obtains is matched with typical independent code section, to obtain each independent generation Correspondence between code section and typical independent code section;
4) according in the independent code section in step 3) and the correspondence between typical independent code section and step 1) Optimum matching relation between each typical case's independent code section and system hardware, independent code section is distributed hard to corresponding system Part, to improve the treatment effeciency of program code;
The step of programmed algorithm sensing and optimizing, for being analyzed programmed algorithm and being distributed corresponding computing resource, from And improve the treatment effeciency of programmed algorithm;Specifically comprise the following steps:
A. before programmed algorithm operation, the mean allocation of computing resource is carried out to different programmed algorithms;
B. when programmed algorithm is run, if occurring calculating the case where waiting for, the resource requirement of programmed algorithm carries out again Analysis;
C. the analysis result obtained according to step B balances (DRF) algorithm using dynamic resource and is carried out again to computing resource Distribution;
The step of vectorial sensing and optimizing, selects maximum compression to store lattice for calculating the vectorization in calculating process Formula, to improve the treatment effeciency of vectorization calculating;Specially redistributed using following principle:
R1. user cannot obtain resources more more than other users;
R2. user cannot obtain more resources by lying about its resource requirement;
R3. all utilizable resources are distributed, do not have to replace existing resource allocation;
R4. user will not prefer the resource allocation of other users.

Claims (5)

1. a kind of architecture method for sensing and optimizing calculated towards E grades, includes the following steps:
The step of task flow sensing and optimizing, is perceived for all tasks to required execution and is optimized, adjusted and obtained The processing sequence of each task, to accelerate the treatment effeciency of task flow;
The step of program code sensing and optimizing, carries out for the program code for needing to execute to be distributed to most suitable system Distributed Calculation simultaneously carries out result and summarizes, to improve the treatment effeciency of program code;
The step of programmed algorithm sensing and optimizing, for being analyzed programmed algorithm and being distributed corresponding computing resource, to carry The treatment effeciency of high programmed algorithm;
The step of vectorial sensing and optimizing, selects maximum compression storage format for calculating the vectorization in calculating process, from And improve the treatment effeciency of vectorization calculating.
2. the architecture method for sensing and optimizing according to claim 1 calculated towards E grades, it is characterised in that described appoints Business stream sensing and optimizing, specifically comprises the following steps:
(1) task of institute's execution in need is obtained, and confirms that there is no relation of interdependence between all tasks;
(2) subtask of all tasks to being obtained in step (1) is identified, and unifies in a period of time to each height Task distributes computing resource;
(3) subtask of calculation amount minimum in each subtask is found using most short optimization of job algorithm, and the priority processing meter The subtask of calculation amount minimum.
3. the architecture method for sensing and optimizing according to claim 1 or 2 calculated towards E grades, it is characterised in that described Program code sensing and optimizing, specifically comprise the following steps:
1) test code is tested, finds the optimum matching relation between each typical independent code section and system hardware;
2) program code is identified, finds the independent code section in program code;
3) the independent code section that step 2) obtains is matched with typical independent code section, to obtain each independent code section With the correspondence between typical independent code section;
4) according to each in the independent code section in step 3) and the correspondence between typical independent code section and step 1) Optimum matching relation between typical independent code section and system hardware distributes independent code section to corresponding system hardware, To improve the treatment effeciency of program code.
4. the architecture method for sensing and optimizing according to claim 1 or 2 calculated towards E grades, it is characterised in that described Programmed algorithm sensing and optimizing, specifically comprise the following steps:
A. before programmed algorithm operation, the mean allocation of computing resource is carried out to different programmed algorithms;
B. when programmed algorithm is run, if occurring calculating the case where waiting for, the resource requirement of programmed algorithm is reanalysed;
C. the analysis result obtained according to step B divides computing resource using dynamic resource balance (DRF) algorithm again Match.
5. the architecture method for sensing and optimizing according to claim 1 or 2 calculated towards E grades, it is characterised in that step C Described redistributes computing resource, is specially redistributed using following principle:
R1. user cannot obtain resources more more than other users;
R2. user cannot obtain more resources by lying about its resource requirement;
R3. all utilizable resources are distributed, do not have to replace existing resource allocation;
R4. user will not prefer the resource allocation of other users.
CN201810418522.1A 2018-05-04 2018-05-04 E-level calculation-oriented architecture perception optimization method Active CN108563510B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810418522.1A CN108563510B (en) 2018-05-04 2018-05-04 E-level calculation-oriented architecture perception optimization method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810418522.1A CN108563510B (en) 2018-05-04 2018-05-04 E-level calculation-oriented architecture perception optimization method

Publications (2)

Publication Number Publication Date
CN108563510A true CN108563510A (en) 2018-09-21
CN108563510B CN108563510B (en) 2021-07-13

Family

ID=63537648

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810418522.1A Active CN108563510B (en) 2018-05-04 2018-05-04 E-level calculation-oriented architecture perception optimization method

Country Status (1)

Country Link
CN (1) CN108563510B (en)

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101453398A (en) * 2007-12-06 2009-06-10 怀特威盛软件公司 Novel distributed grid super computer system and method
US20130080482A1 (en) * 2003-04-09 2013-03-28 Gary Charles Berkowitz Virtual Supercomputer
CN103885839A (en) * 2014-04-06 2014-06-25 孙凌宇 Cloud computing task scheduling method based on multilevel division method and empowerment directed hypergraphs
US8874879B2 (en) * 2010-11-11 2014-10-28 Fujitsu Limited Vector processing circuit, command issuance control method, and processor system
CN104969207A (en) * 2012-10-22 2015-10-07 英特尔公司 High performance interconnect coherence protocol
CN107147517A (en) * 2017-03-24 2017-09-08 上海交通大学 A kind of adaptive polo placement resource allocation methods for virtual network function
CN107977270A (en) * 2017-11-22 2018-05-01 用友金融信息技术股份有限公司 Peers distribution method, peers distribution system and computer installation

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130080482A1 (en) * 2003-04-09 2013-03-28 Gary Charles Berkowitz Virtual Supercomputer
CN101453398A (en) * 2007-12-06 2009-06-10 怀特威盛软件公司 Novel distributed grid super computer system and method
US8874879B2 (en) * 2010-11-11 2014-10-28 Fujitsu Limited Vector processing circuit, command issuance control method, and processor system
CN104969207A (en) * 2012-10-22 2015-10-07 英特尔公司 High performance interconnect coherence protocol
CN103885839A (en) * 2014-04-06 2014-06-25 孙凌宇 Cloud computing task scheduling method based on multilevel division method and empowerment directed hypergraphs
CN107147517A (en) * 2017-03-24 2017-09-08 上海交通大学 A kind of adaptive polo placement resource allocation methods for virtual network function
CN107977270A (en) * 2017-11-22 2018-05-01 用友金融信息技术股份有限公司 Peers distribution method, peers distribution system and computer installation

Also Published As

Publication number Publication date
CN108563510B (en) 2021-07-13

Similar Documents

Publication Publication Date Title
Ben-Nun et al. A modular benchmarking infrastructure for high-performance and reproducible deep learning
US10032114B2 (en) Predicting application performance on hardware accelerators
CN106484777B (en) Multimedia data processing method and device
Meswani et al. Modeling and predicting performance of high performance computing applications on hardware accelerators
CN109523022B (en) Terminal data processing method and device and terminal
CN106991006B (en) Support the cloud workflow task clustering method relied on and the time balances
CN114902273A (en) System and method for optimizing resource allocation using GPU
KR101374120B1 (en) System and method for coinstructing virtual cluster using the exectution history of analysis job
US20170177544A1 (en) Method and system for solving a problem involving a hypergraph partitioning
EP3614312A1 (en) Calculation technique determining system, calculation technique determining device, processing device, calculation technique determining method, processing method, calculation technique determining program, and processing program
Baughman et al. Profiling and predicting application performance on the cloud
CN104793993A (en) Cloud computing task scheduling method of artificial bee colony particle swarm algorithm based on Levy flight
CN113962704A (en) Transfer verification method, device and equipment based on face recognition and storage medium
CN112328869A (en) User loan willingness prediction method and device and computer system
CN108563510A (en) The architecture method for sensing and optimizing calculated towards E grades
Ríos et al. Dynamically adapting floating-point precision to accelerate deep neural network training
CN109769031A (en) A kind of dynamic self-adapting load-balancing method and system
JP6154491B2 (en) Computer and graph data generation method
CN113657724B (en) Resource allocation method and device based on multi-source heterogeneous data and electronic equipment
CN108170404A (en) A kind of Web service combination verification method based on parameterized model
Li et al. Adaptive multi-objective swarm crossover optimization for imbalanced data classification
US20210350280A1 (en) Model modification and deployment
CN108932459A (en) Face recognition model training method and device and recognition algorithms
CN115527626A (en) Molecular processing method, molecular processing apparatus, electronic device, storage medium, and program product
Liu et al. Block-Wise Dynamic-Precision Neural Network Training Acceleration via Online Quantization Sensitivity Analytics

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant