CN106293949A - Resource dispatching strategy based on baseline analysis under a kind of computing environment - Google Patents

Resource dispatching strategy based on baseline analysis under a kind of computing environment Download PDF

Info

Publication number
CN106293949A
CN106293949A CN201610689781.9A CN201610689781A CN106293949A CN 106293949 A CN106293949 A CN 106293949A CN 201610689781 A CN201610689781 A CN 201610689781A CN 106293949 A CN106293949 A CN 106293949A
Authority
CN
China
Prior art keywords
resource
data
dispatching strategy
resource dispatching
scheduling
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201610689781.9A
Other languages
Chinese (zh)
Inventor
左强
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Inspur Electronic Information Industry Co Ltd
Original Assignee
Inspur Electronic Information Industry Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Inspur Electronic Information Industry Co Ltd filed Critical Inspur Electronic Information Industry Co Ltd
Priority to CN201610689781.9A priority Critical patent/CN106293949A/en
Publication of CN106293949A publication Critical patent/CN106293949A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5061Partitioning or combining of resources
    • G06F9/5077Logical partitioning of resources; Management or configuration of virtualized resources

Abstract

The present invention be more particularly directed to resource dispatching strategy based on baseline analysis under a kind of computing environment.Resource dispatching strategy based on baseline analysis under this computing environment, sets up lexical analysis system, and described lexical analysis system includes log analyzing module, data analysis module and scheduling of resource module;Described log analyzing module includes daily record uploading unit and daily record resolution unit, described data analysis module includes Data Integration and analytic unit and checks analysis result unit, and described scheduling of resource module includes resource dispatching strategy signal generating unit and resource dispatching strategy performance element.Resource dispatching strategy based on baseline analysis under this computing environment, propose a lexical analysis system, the collection by daily record data of the lexical analysis system, data are analyzed, thus generate the scheduling strategy being conducive to improving overall efficiency, by the execution of scheduling strategy, improve the effective rate of utilization of node, it is achieved thereby that the lifting of overall efficiency.

Description

Resource dispatching strategy based on baseline analysis under a kind of computing environment
Technical field
The present invention relates to field of cloud computer technology, particularly to scheduling of resource based on baseline analysis under a kind of computing environment Strategy.
Background technology
The essence of " cloud " is the non-physical of system itself, it is believed that cloud computing is for the customized void of user Intending calculating system, therefore Intel Virtualization Technology becomes the bottom foundation stone that cloud computing realizes.Virtualization is to represent taking out of computer resource As method, can be by the resource after abstract with the way access accessing abstract front resource consistence, this money by Intel Virtualization Technology The abstract method in source is not limited by the physical configuration of realization, geographical position and underlying resource.
Along with the maturation of Intel Virtualization Technology and developing rapidly of the Internet, each big business information system more turns to various Privately owned cloud or publicly-owned cloud, the data of generation also increase rapidly.People recognize the data importance to enterprise, carry out data Analysis mining can provide purpose and information for the decision-making of enterprise.The data generally created and produce are destructuring or half structure The data changed, are analyzed if these data are downloaded to relevant database, can send out the substantial amounts of time and money of expense.At this moment Create such as Map/Reduce distributed computing framework, thus carry out analysis and the excavation of large data collection.
Cloud computing, utilizes system architecture technology that thousands of station servers are integrated, provides the user and provide flexibly Source distribution and task scheduling ability.Intel Virtualization Technology is as one of key technology in cloud computing, and it can be by a physics meter Calculation machine becomes the virtual computer system of multiple stage.The bottom architecture such as physical resource are carried out abstract by Intel Virtualization Technology, make hardware set Difference between Bei and compatibility are transparent to upper layer application, thus realize the unified management of resources all kinds of to bottom.Virtualization skill Art is a kind of method allocated and calculate resource, and it can be by different levels (hardware, software, data, network, the storage of application system Deng) isolate, thus break data center, network, server, data, apply and store in physical equipment between draw Point, it is achieved unified management and dynamically use physical resource and virtual resource, improve motility and the elasticity of system structure.
Accompanying drawing 1 is virtualized system architecture schematic diagram.Virtual machine monitoring software VMM in accompanying drawing, in Intel Virtualization Technology (Virtual Machine Monitor), or referred to as Hypervisor, it has access to that all hardware equipment on server. When startup of server and when calling Hypervisor, it can load the operating system on all virtual-machine client, gives every simultaneously The physical resources such as network, CPU, disk and the internal memory that the distribution of individual virtual machine is appropriate.Hypervisor is responsible for coordinating these hardware money The access in source, the most also applies security protection between each virtual machine.
In large-scale cloud computing environment, physical host is ten hundreds of, and the resource consumed is the hugest.Virtual Change technology provides fictitious host computer, fictitious host computer to be the main objects under cloud computing environment for cloud computing, needs to consume main frame The resources such as cpu, internal memory, disk and network interface card.But virtual machine is different by the physical resource shared by the configuration of virtual machine, the most empty The consumption of resource is also not quite similar by plan machine by the difference of purposes.Based on above-mentioned situation, the present invention proposes a kind of computing environment Under resource dispatching strategy based on baseline analysis.
Summary of the invention
The present invention is in order to make up the defect of prior art, it is provided that divide based on baseline under a kind of simple efficient computing environment The resource dispatching strategy of analysis.
The present invention is achieved through the following technical solutions:
Resource dispatching strategy based on baseline analysis under a kind of computing environment, it is characterised in that: set up lexical analysis system, described Lexical analysis system includes log analyzing module, data analysis module and scheduling of resource module;Described log analyzing module includes Daily record uploading unit and daily record resolution unit, described data analysis module includes Data Integration and analytic unit and checks analysis knot Really unit, described scheduling of resource module includes resource dispatching strategy signal generating unit and resource dispatching strategy performance element.
Described log analyzing module uses Map/Reduce programming model, and Mapper is responsible for reading daily record and resolving, and generates The log object being prone to read gives Reducer process, and Reducer is responsible for merging all daily record datas, and exports HDFS Storage.
Described data analysis module first passes through to be integrated data, and form as requested merges;Then utilize K-means logarithm shows cluster analysis factually, improves computational load amount and efficiency, utilizes Map/Reduce programming framework to K- Means realizes parallelization.
Described scheduling of resource module, according to Data Integration result, arranges virtual machine related information, by generating phase The scheduling strategy closed, improves the utilization rate of node, and simultaneously after scheduling strategy generates, resource dispatching strategy performance element can be held The scheduler task of row associated virtual machine, thus improve usefulness on the whole.
The invention has the beneficial effects as follows: resource dispatching strategy based on baseline analysis under this computing environment, it is proposed that Lexical analysis system, the collection by daily record data of the lexical analysis system, data are analyzed, thus generate and be conducive to improving The scheduling strategy of overall efficiency, by the execution of scheduling strategy, improves the effective rate of utilization of node, it is achieved thereby that overall effect The lifting of energy.
Accompanying drawing explanation
Accompanying drawing 1 is virtualized system architecture schematic diagram.
Accompanying drawing 2 is the simple architecture schematic diagram of Hadoop.
Accompanying drawing 3 is lexical analysis system architecture schematic diagram of the present invention.
Detailed description of the invention
In order to make the technical problem to be solved, technical scheme and beneficial effect clearer, below tie Closing drawings and Examples, the present invention will be described in detail.It should be noted that, specific embodiment described herein is only used To explain the present invention, it is not intended to limit the present invention.
Resource dispatching strategy based on baseline analysis under this computing environment, sets up lexical analysis system, described lexical analysis System includes log analyzing module, data analysis module and scheduling of resource module;Described log analyzing module includes that daily record is uploaded Unit and daily record resolution unit, described data analysis module includes Data Integration and analytic unit and checks analysis result unit, Described scheduling of resource module includes resource dispatching strategy signal generating unit and resource dispatching strategy performance element.
Described log analyzing module uses Hadoop framework, for realizing carrying out unformatted daily record data parsing lattice Formulaization stores, and utilizes Map/Reduce programming framework to carry out parallelization and resolves daily record, extracts virtual machine performance number in daily record Data are used, according to certain form storage after statistics according to (CPU, internal memory usage amount) and virtual machine process.Specifically, Described log analyzing module uses Map/Reduce programming model, and Mapper is responsible for reading daily record and resolving, and generates and is prone to read Log object give Reducer process, Reducer is responsible for merging all daily record datas, and export to HDFS store.
Described data analysis module first passes through to be integrated data, and form as requested merges;Then utilize K-means algorithm logarithm shows cluster analysis factually, improves computational load amount and efficiency, utilizes Map/Reduce programming framework to K- Means realizes parallelization.
Described scheduling of resource module, according to Data Integration result, arranges virtual machine related information, by generating phase The scheduling strategy closed, improves the utilization rate of node, and simultaneously after scheduling strategy generates, resource dispatching strategy performance element can be held The scheduler task of row associated virtual machine, thus improve usefulness on the whole.
1, about Hadoop framework
The core of Hadoop is made up of Map/Reduce and Hadoop Distributed File System.The bottom is HDFS, it stores the file on all nodes of Hadoop, is made up of NameNode and DataNode.Map/Reduce is in The last layer of HDFS, is made up of JobTracker and TaskTracker.
2, about K-means algorithm
K-means algorithm is as being clustering algorithm based on division the most classical.What this algorithm was maximum is a little to be succinctly Quickly, and just can be applied on distributed computing framework by simple amendment, become what large-scale data was analyzed Instrument.
K-means algorithm input numerical value is K and N number of data object, then N number of data object is divided into K cluster, makes to obtain Cluster meet following condition:
(1), in same cluster, data object similarity is higher;
(2), in different clusters, data object similarity is relatively low.
Each cluster has a cluster centre, and the barycenter of the most each cluster is typically equal with the data object in each cluster Value calculates cluster centre and embodies the feature of this cluster.
The basic thought of K-means algorithm is: set up K central point, calculates the similarity of object and central point, with in The object that heart point similarity is high divides corresponding classification into, updates central point until obtaining preferable cluster result.

Claims (4)

1. resource dispatching strategy based on baseline analysis under a computing environment, it is characterised in that: set up lexical analysis system, institute State lexical analysis system and include log analyzing module, data analysis module and scheduling of resource module;Described log analyzing module bag Including daily record uploading unit and daily record resolution unit, described data analysis module includes Data Integration and analytic unit and checks analysis Result unit, described scheduling of resource module includes resource dispatching strategy signal generating unit and resource dispatching strategy performance element.
Resource dispatching strategy based on baseline analysis under computing environment the most according to claim 1, it is characterised in that: described Log analyzing module uses Map/Reduce programming model, and Mapper is responsible for reading daily record and resolving, and generates the day being prone to read Will object gives Reducer process, and Reducer is responsible for merging all daily record datas, and exports HDFS storage.
Resource dispatching strategy based on baseline analysis under computing environment the most according to claim 1, it is characterised in that: described Data analysis module first passes through to be integrated data, and form as requested merges;Then utilize K-means to data Realize cluster analysis, improve computational load amount and efficiency, utilize Map/Reduce programming framework that K-means is realized parallelization.
Resource dispatching strategy based on baseline analysis under computing environment the most according to claim 1, it is characterised in that: described Scheduling of resource module, according to Data Integration result, arranges virtual machine related information, by generating relevant scheduling strategy, Improving the utilization rate of node, simultaneously after scheduling strategy generates, resource dispatching strategy performance element can perform associated virtual machine Scheduler task, thus improve usefulness on the whole.
CN201610689781.9A 2016-08-19 2016-08-19 Resource dispatching strategy based on baseline analysis under a kind of computing environment Pending CN106293949A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610689781.9A CN106293949A (en) 2016-08-19 2016-08-19 Resource dispatching strategy based on baseline analysis under a kind of computing environment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610689781.9A CN106293949A (en) 2016-08-19 2016-08-19 Resource dispatching strategy based on baseline analysis under a kind of computing environment

Publications (1)

Publication Number Publication Date
CN106293949A true CN106293949A (en) 2017-01-04

Family

ID=57660747

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610689781.9A Pending CN106293949A (en) 2016-08-19 2016-08-19 Resource dispatching strategy based on baseline analysis under a kind of computing environment

Country Status (1)

Country Link
CN (1) CN106293949A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107277080A (en) * 2017-08-23 2017-10-20 深信服科技股份有限公司 A kind of is the internet risk management method and system of service based on safety
CN107784440A (en) * 2017-10-23 2018-03-09 国网辽宁省电力有限公司 A kind of power information system resource allocation system and method
CN109783211A (en) * 2018-12-14 2019-05-21 成都四方伟业软件股份有限公司 A kind of batch task scheduling system and dispatching method based on business diary

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140189703A1 (en) * 2012-12-28 2014-07-03 General Electric Company System and method for distributed computing using automated provisoning of heterogeneous computing resources
CN104111996A (en) * 2014-07-07 2014-10-22 山大地纬软件股份有限公司 Health insurance outpatient clinic big data extraction system and method based on hadoop platform

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140189703A1 (en) * 2012-12-28 2014-07-03 General Electric Company System and method for distributed computing using automated provisoning of heterogeneous computing resources
CN104111996A (en) * 2014-07-07 2014-10-22 山大地纬软件股份有限公司 Health insurance outpatient clinic big data extraction system and method based on hadoop platform

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
匡华等: "移动通信运营商办公云", 《移动通信》 *
李梓萌等: "基于基线分析的云计算资源自动分配技术", 《现代计算机》 *
王鹏: "云平台下日志分析系统的设计与实现", 《中国优秀硕士学位论文全文数据库信息科技辑》 *

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107277080A (en) * 2017-08-23 2017-10-20 深信服科技股份有限公司 A kind of is the internet risk management method and system of service based on safety
CN107784440A (en) * 2017-10-23 2018-03-09 国网辽宁省电力有限公司 A kind of power information system resource allocation system and method
CN109783211A (en) * 2018-12-14 2019-05-21 成都四方伟业软件股份有限公司 A kind of batch task scheduling system and dispatching method based on business diary

Similar Documents

Publication Publication Date Title
Fernández et al. Big Data with Cloud Computing: an insight on the computing environment, MapReduce, and programming frameworks
Cano et al. Towards geo-distributed machine learning
CN103873498B (en) Cloud platform resource-adaptive method for early warning and system
CN105049218B (en) PhiCloud clouds charging method and system
CN109492002A (en) A kind of storage of smart grid big data and analysis system and processing method
CN103838617A (en) Method for constructing data mining platform in big data environment
Taft et al. STeP: Scalable tenant placement for managing database-as-a-service deployments
Agneeswaran et al. Paradigms for realizing machine learning algorithms
Costa et al. The SusCity big data warehousing approach for smart cities
Ye et al. Cloud-based big data mining & analyzing services platform integrating R
CN106293949A (en) Resource dispatching strategy based on baseline analysis under a kind of computing environment
Zhang et al. Splitting large medical data sets based on normal distribution in cloud environment
Vrbić Data mining and cloud computing
Singh et al. Big data: technologies, trends and applications
Piccialli et al. S-InTime: A social cloud analytical service oriented system
Arvanitis et al. Automated Performance Management for the Big Data Stack.
Zhang et al. Integrated design and development of intelligent scenic area rural tourism information service based on hybrid cloud
Sridhar et al. A study of big data analytics in clouds with a security perspective
Ribeiro et al. A scalable data integration architecture for smart cities: implementation and evaluation
Khan et al. Computational performance analysis of cluster-based technologies for big data analytics
Naga Rama Devi Emerging trends in big data analytics—A study
Dadheech et al. Performance improvement of heterogeneous Hadoop clusters using MapReduce for big data
Zhao et al. Simple parallel genetic algorithm using cloud computing
Mrozek et al. Technological Roadmap
Kumar et al. A New Framework for Effective Processing of WSN Application Generated Big Data

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20170104

RJ01 Rejection of invention patent application after publication