CN110363280A - Algorithm model training analysis system - Google Patents

Algorithm model training analysis system Download PDF

Info

Publication number
CN110363280A
CN110363280A CN201910430820.7A CN201910430820A CN110363280A CN 110363280 A CN110363280 A CN 110363280A CN 201910430820 A CN201910430820 A CN 201910430820A CN 110363280 A CN110363280 A CN 110363280A
Authority
CN
China
Prior art keywords
module
algorithm model
model
data
training
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910430820.7A
Other languages
Chinese (zh)
Inventor
杨和平
张志强
杨笛
张强
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
National Meteorological Information Center
Original Assignee
National Meteorological Information Center
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by National Meteorological Information Center filed Critical National Meteorological Information Center
Priority to CN201910430820.7A priority Critical patent/CN110363280A/en
Publication of CN110363280A publication Critical patent/CN110363280A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/54Interprogram communication
    • G06F9/547Remote procedure calls [RPC]; Web services
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Computational Linguistics (AREA)
  • Biophysics (AREA)
  • Evolutionary Computation (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Biomedical Technology (AREA)
  • Artificial Intelligence (AREA)
  • Mathematical Physics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Debugging And Monitoring (AREA)

Abstract

The invention discloses a kind of algorithm model training analysis systems, including data access management module, algorithm model management module, model visualization load module, algorithm model training module, visual control module and algorithm model application module.Ability for algorithm model training neural network based, debugging, monitoring is provided, data access management, model management, model visualization assembly, model training, visual control and the model application function of algorithm model needs are provided.

Description

Algorithm model training analysis system
Technical field
The present invention relates to algorithm model training field more particularly to algorithm model training analysis systems.
Background technique
Artificial intelligence is the trend studied now, and algorithm model training analysis is the core of artificial intelligence platform, existing skill Lack complete set, system algorithm model training analysis tool in art, is trained and analyzes to be directed to algorithm model.It calculates Method model system should have the function to algorithm model training neural network based, debugging, monitoring, and providing algorithm model needs Data access management, model management, model visualization assembly, model training, visual control and the model application function wanted.
Summary of the invention
The purpose of the present invention is to overcome the shortcomings of the existing technology, provides a kind of algorithm model training analysis system, can needle To algorithm model training, debugging and monitoring neural network based, data access management, the model pipe of algorithm model needs are provided Reason, model visualization assembly, model training, visual control and model application function.
The purpose of the present invention is achieved through the following technical solutions: algorithm model training analysis system, including data Access Management, algorithm model management module, model visualization load module, algorithm model training module, visual control Module and algorithm model application module;
Data access management module is the data of algorithm model access training and test;
Algorithm model management module provides the management function to algorithm model;
Each process in algorithm model is encapsulated as component by model visualization load module, and by way of visualization pulls It is assembled into a complete algorithm model;
Algorithm model training module is used for the training of algorithm model;
Visual control module is checked for model training state;
Algorithm model application module is for applying trained model.
Further, data access module data source include local data, api interface data, database data and HDFS data, data access obtain initial data, and the file cached as csv format from each data source, are used for follow-up data Cleaning, refining or model training, test.
Further, local data, api interface data, database data and HDFS data pass through interface encapsulation dependency number According to operation.
Further, algorithm model management module includes algorithm model creation module, algorithm model update module, algorithm mould Type removing module, algorithm model import modul, algorithm model export module and algorithm model version management module.
Further, model visualization load module include Component encapsulating module based on job stream and based on Web can Depending on changing load module;It is disassembled and execute that algorithm model logical definition is converted system by Component encapsulating module based on job stream Job stream, the Visual assembly module based on Web provides model development tool for user.
Further, algorithm model training module includes distributed scheduling module and conditioning training module;Distribution is adjusted Degree module includes scheduling system, training actuator and registration center, and conditioning training module includes shape in newly-built state, training State, physical training condition, halted state and halted state.
Further, model training state includes the summary information, learning rate and iterative relation figure, weight of "current" model Updated value and the ratio between parameter with the number of iterations relational graph, change over time the mark of situation activation primitive, gradient and updated value The histogram of quasi- difference and parameter and updated value.
Further, visual control module further includes in the system to business algorithm model actuator place server It deposits, the monitoring of JVM memory, JVM heap memory, CPU usage variation diagram and hardware information and software information.
Further, algorithm model application module includes modelling effect evaluation module and model application release module;Model Recruitment evaluation module uses trained model, carries out prediction application to data-oriented, and returns to the data after prediction, with user Assessment data be compared, and computation model accuracy rate;Model after training is issued as one by model application release module A RESTFul interface, user are passed to specified parameter, can carry out far call by calling the RESTFul interface.
The beneficial effects of the present invention are: the algorithm model training analysis system of complete set is provided, for based on nerve net Algorithm model training, debugging, the monitoring of network, provide data access management, the model management, model visualization of algorithm model needs Assembly, model training, visual control and model application function.
Detailed description of the invention
Fig. 1 is this system structural block diagram;
Fig. 2 is overall architecture of the present invention;
Fig. 3 is data access flow chart;
Fig. 4 is algorithm model management activity figure;
Fig. 5 is algorithm model and Component encapsulating schematic diagram;
Fig. 6 is distributed scheduling schematic diagram;
Fig. 7 is algorithm model training state diagram;
Fig. 8 is model evaluation flow chart.
Specific embodiment
Technical solution of the present invention is described in further detail with reference to the accompanying drawing, but protection scope of the present invention is not limited to It is as described below.
As shown in figures 1-8, algorithm model training analysis system, including data access management module, algorithm model manage mould Block, model visualization load module, algorithm model training module, visual control module and algorithm model application module;
Data access management module is the data of algorithm model access training and test;
Algorithm model management module provides the management function to algorithm model;
Each process in algorithm model is encapsulated as component by model visualization load module, and by way of visualization pulls It is assembled into a complete algorithm model;
Algorithm model training module is used for the training of algorithm model;
Visual control module is checked for model training state;
Algorithm model application module is for applying trained model.
Data access module data source includes local data, api interface data, database data and HDFS data, number According to access from each data source obtain initial data, and cache be csv format file, for follow-up data cleaning, refining or Model training, test.
Data access management module provides cleaning after data access, refining tool, so that access data are from non-knot Structure, nonstandard numbers are according to the normal data for switching to structuring.Including following functions:
File format conversion: supporting to switch to csv file from formats such as txt, xls, xlsx, supports customized separator, support more Kind character encoding format.
Column count: basic calculating is carried out to column, it is made to meet user demand.It supports to carry out four fundamental rules to single-row or two column datas Operation, including add, subtract, multiplication and division, support power operation and exponent arithmetic, support logic operation, including be greater than, be less than, being equal to, greatly In be equal to, be less than or equal to, be not equal to, between etc..As a result certain column can be covered or increased as new column.
Statistics: carrying out basic statistics to each column (feature), facilitates user to be based on statistical result and carries out decision.Including with it is flat Mean value, median, maximum value, minimum value, variance, standard deviation, relative standard deviation, mean difference, opposite mean difference etc..
Filtering: missing data, the abnormal data, the data for meeting or being unsatisfactory for condition in filter data are crossed.Support filtering number According to the data that there is missing are concentrated, abnormal data meet the data of logical condition;Support direction filtered model.
Sequence: according to the entire data set of ascending or descending order sequence.It supports to sort to single-row and multi-column data collection;It supports by spy The formula that fixes (such as time) is ranked up
Standardization: in terms of weather or model training usually requires that data are after standardizing, and standardized tool is provided to data mark The function of standardization.Support min-max standardization, z-score standardization.
File separator: the function by a file separator for multiple files is provided.It supports to separate by file size.
File mergences: it provides to multiple file mergencess into the function of a file.It supports to merge by row and by column.
Column split: if user with customized separator group organization data, can be according to specified separation by this tool Symbol or regular expression, which separate, is divided into multi-column data for single-row data.
Column merge: it is corresponding with column split, multi-column data can be merged into a column data.
Data type conversion: the function that data conversion is carried out to single-row or multiple row is provided.Support character string, integer, floating-point Conversion between type.
Column operation: data column operation is provided.It supports to increase column, delete column, train value conversion.
Local data, api interface data, database data and HDFS data pass through interface encapsulation associated data operation.
Algorithm model management module includes algorithm model creation module, algorithm model update module, algorithm model deletion mould Block, algorithm model import modul, algorithm model export module and algorithm model version management module.
Algorithm model creation: algorithm model is the basic unit of algorithm model training tool, is the object of model training, is The concentrated reflection of algorithm.Algorithm model creates the information such as the title of clear algorithm model, description.
Algorithm model updates: user edits an already present model, the information such as modification model name, description.
Algorithm model is deleted: user passes through after Authority Verification, deletes an already present model.
Algorithm model export: the model after a training can be exported to local by user.Derived model remains model The information such as parameter.
Algorithm model imports: the model after export can be imported into system by user again, the model of importing keep with The identical parameter information of model before export.
Version management: algorithm model editor provides historical record function, and modification can all automatically record the content of modification every time, User can fall back on specified old version at any time.
Model visualization load module includes the Component encapsulating module based on job stream and the Visual assembly mould based on Web Block;Component encapsulating module based on job stream converts algorithm model logical definition to the job stream that system is disassembled and executes, Visual assembly module based on Web provides model development tool for user.
Component encapsulating based on job stream is to convert algorithm model logical definition to the operation that system is disassembled and executes Stream, this process are referred to as job stream definition.It is defined by the job stream, algorithm can be bound, divide job stream, from job stream Deng.Unified component standard can be provided for the design and manufacturing method of algorithm model based on this mode, improve algorithm model Reusability forms the capitalization of software function feature.
One algorithm model is usually made of multiple general algorithms and logic process flow, the component based on job stream Encapsulation is that these general-purpose algorithms and logical process encapsulation are independent component, is trainable algorithm model for assembling.
The definition of job stream by the process that encapsulates step by step, by fine granularity to coarseness, by surround and watch it is macroscopical, by bottom to Using come the interaction that carries out data.
Visual assembly based on Web provides the model development tool of friendly interface for user.What a user creating After algorithm model, into web Visual assembly interface, algorithm component is pulled, each component, formation algorithm model are connected.
Algorithm model training module includes distributed scheduling module and conditioning training module;Distributed scheduling module includes Scheduling system, training actuator and registration center, conditioning training module include newly-built state, state in training, have trained shape State, halted state and halted state.
Scheduling system, that is, operation system, user initiate each order of model training by operation system;Training actuator It is also known as server in a distributed system, provides service for model training;Distributed registry center is by Zookeeper cluster structure At the main registration and discovery for completing service.When operation, scheduling system first accesses registration center, inquires available server, That is algorithm model training actuator, registration center return to available actuator, and control centre initiates model instruction to the actuator again Practice request.
Distributed scheduling passes through the communication between RPC agreement and RESTFul protocol realization difference process.RPC can be used for same The high performance communication of language, and RESTFul then meets the communication between different language type.
Visual control module further include the Installed System Memory to server where business algorithm model actuator, JVM memory, The monitoring of JVM heap memory, CPU usage variation diagram and hardware information and software information.
Algorithm model application module includes modelling effect evaluation module and model application release module;Modelling effect assesses mould Block uses trained model, carries out prediction application to data-oriented, and returns to the data after prediction, the assessment data with user It is compared, and computation model accuracy rate;Model after training is issued as a RESTFul and connect by model application release module Mouthful, user is passed to specified parameter, can carry out far call by calling the RESTFul interface.
The interface of publication provides following parameter:
Apply Names: the model that defining interface calls also can customize.
The address REST: the address that defining interface calls is the calling interface of service.
Parameter: the parameter of defining interface is divided into optional and necessary.
Request type: include the types such as common GET, POST, PUT, DELETE.
Returned data format: including JSON, file type, general character string type
Request example: friendly calling example is provided for user.
Algorithm model training analysis system proposed by the invention, can for algorithm model neural network based training, Debugging and monitoring, provide algorithm model needs data access management, model management, model visualization assembly, model training, can Depending on changing monitoring and model application function.

Claims (9)

1. algorithm model training analysis system, it is characterised in that: including data access management module, algorithm model management module, Model visualization load module, algorithm model training module, visual control module and algorithm model application module;
Data access management module is the data of algorithm model access training and test;
Algorithm model management module provides the management function to algorithm model;
Each process in algorithm model is encapsulated as component by model visualization load module, and by way of visualization pulls It is assembled into a complete algorithm model;
Algorithm model training module is used for the training of algorithm model;
Visual control module is checked for model training state;
Algorithm model application module is for applying trained model.
2. algorithm model training analysis system according to claim 1, it is characterised in that: the data access module data Source includes local data, api interface data, database data and HDFS data, and data access obtains former from each data source Beginning data, and the file cached as csv format, for follow-up data cleaning, refining or model training, test.
3. algorithm model training analysis system according to claim 2, it is characterised in that: the local data, api interface Data, database data and HDFS data pass through interface encapsulation associated data operation.
4. algorithm model training analysis system according to claim 1, it is characterised in that: the algorithm model management module Including algorithm model creation module, algorithm model update module, algorithm model removing module, algorithm model import modul, algorithm Model export module and algorithm model version management module.
5. algorithm model training analysis system according to claim 1, it is characterised in that: the model visualization fit drawing die Block includes the Component encapsulating module based on job stream and the Visual assembly module based on Web;Component encapsulating based on job stream Module converts algorithm model logical definition to the job stream that system is disassembled and executes, the Visual assembly module based on Web Model development tool is provided for user.
6. algorithm model training analysis system according to claim 1, it is characterised in that: the algorithm model training module Including distributed scheduling module and conditioning training module;Distributed scheduling module includes scheduling system, training actuator and note Volume center, conditioning training module include state, physical training condition, halted state and halted state in newly-built state, training.
7. algorithm model training analysis system according to claim 1, it is characterised in that: the model training state includes The pass of the ratio between summary information, learning rate and the iterative relation figure of "current" model, the updated value of weight and parameter with the number of iterations System's figure changes over time the standard deviation of situation activation primitive, gradient and updated value and the histogram of parameter and updated value.
8. algorithm model training analysis system according to claim 1, it is characterised in that: the visual control module is also Including changing to the Installed System Memory of server, JVM memory, JVM heap memory, CPU usage where business algorithm model actuator The monitoring of figure and hardware information and software information.
9. algorithm model training analysis system according to claim 1, it is characterised in that: the algorithm model application module Including modelling effect evaluation module and model application release module;Modelling effect evaluation module uses trained model, to giving Fixed number returns to the data after prediction according to carrying out prediction application, is compared with the assessment data of user, and computation model is accurate Rate;Model after training is issued as a RESTFul interface by model application release module, and user should by calling RESTFul interface is passed to specified parameter, can carry out far call.
CN201910430820.7A 2019-09-02 2019-09-02 Algorithm model training analysis system Pending CN110363280A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910430820.7A CN110363280A (en) 2019-09-02 2019-09-02 Algorithm model training analysis system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910430820.7A CN110363280A (en) 2019-09-02 2019-09-02 Algorithm model training analysis system

Publications (1)

Publication Number Publication Date
CN110363280A true CN110363280A (en) 2019-10-22

Family

ID=68215328

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910430820.7A Pending CN110363280A (en) 2019-09-02 2019-09-02 Algorithm model training analysis system

Country Status (1)

Country Link
CN (1) CN110363280A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110955470A (en) * 2019-12-06 2020-04-03 深圳前海环融联易信息科技服务有限公司 Algorithm model interfacing method, apparatus, computer device and storage medium
CN111259064A (en) * 2020-01-10 2020-06-09 同方知网(北京)技术有限公司 Visual natural language analysis mining system and modeling method thereof
CN111898742A (en) * 2020-08-05 2020-11-06 上海眼控科技股份有限公司 Method and equipment for monitoring training state of neural network model
CN111913715A (en) * 2020-07-30 2020-11-10 上海数策软件股份有限公司 Micro-service based machine learning automation process management and optimization system and method

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106202192A (en) * 2016-06-28 2016-12-07 浪潮软件集团有限公司 Workflow-based big data analysis method
CN108073582A (en) * 2016-11-08 2018-05-25 中移(苏州)软件技术有限公司 A kind of Computational frame selection method and device
CN109189750A (en) * 2018-09-06 2019-01-11 北京九章云极科技有限公司 Operation method, data analysis system and the storage medium of data analysis workflow
CN109213482A (en) * 2018-06-28 2019-01-15 清华大学天津高端装备研究院 The graphical application platform of artificial intelligence and application method based on convolutional neural networks
CN109710383A (en) * 2018-12-29 2019-05-03 上海晏鼠计算机技术股份有限公司 A kind of method of intelligent algorithm containerization application
CN109840111A (en) * 2019-02-26 2019-06-04 广州衡昊数据科技有限公司 A kind of patterned transaction processing system and method

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106202192A (en) * 2016-06-28 2016-12-07 浪潮软件集团有限公司 Workflow-based big data analysis method
CN108073582A (en) * 2016-11-08 2018-05-25 中移(苏州)软件技术有限公司 A kind of Computational frame selection method and device
CN109213482A (en) * 2018-06-28 2019-01-15 清华大学天津高端装备研究院 The graphical application platform of artificial intelligence and application method based on convolutional neural networks
CN109189750A (en) * 2018-09-06 2019-01-11 北京九章云极科技有限公司 Operation method, data analysis system and the storage medium of data analysis workflow
CN109710383A (en) * 2018-12-29 2019-05-03 上海晏鼠计算机技术股份有限公司 A kind of method of intelligent algorithm containerization application
CN109840111A (en) * 2019-02-26 2019-06-04 广州衡昊数据科技有限公司 A kind of patterned transaction processing system and method

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110955470A (en) * 2019-12-06 2020-04-03 深圳前海环融联易信息科技服务有限公司 Algorithm model interfacing method, apparatus, computer device and storage medium
CN110955470B (en) * 2019-12-06 2024-01-19 深圳前海环融联易信息科技服务有限公司 Algorithm model interfacing method, device, computer equipment and storage medium
CN111259064A (en) * 2020-01-10 2020-06-09 同方知网(北京)技术有限公司 Visual natural language analysis mining system and modeling method thereof
CN111913715A (en) * 2020-07-30 2020-11-10 上海数策软件股份有限公司 Micro-service based machine learning automation process management and optimization system and method
CN111898742A (en) * 2020-08-05 2020-11-06 上海眼控科技股份有限公司 Method and equipment for monitoring training state of neural network model

Similar Documents

Publication Publication Date Title
CN110363280A (en) Algorithm model training analysis system
CN104954453B (en) Data mining REST service platform based on cloud computing
CN111355606B (en) Web application-oriented container cluster self-adaptive expansion and contraction system and method
CN107103064B (en) Data statistical method and device
CN111061788A (en) Multi-source heterogeneous data conversion integration system based on cloud architecture and implementation method thereof
CN108092813A (en) Data center's total management system server hardware Governance framework and implementation method
CN103390066A (en) Database overall automation optimizing early warning device and processing method thereof
CN108038239A (en) A kind of heterogeneous data source method of standardization management, device and server
US11119989B1 (en) Data aggregation with schema enforcement
CN106557470A (en) data extraction method and device
CN107590181A (en) A kind of intelligent analysis system of big data
CN108733532A (en) Health degree management-control method, device, medium and the electronic equipment of big data platform
CN108777637A (en) A kind of data center's total management system and method for supporting server isomery
CN108108986A (en) A kind of design method of CRM system, device and electronic equipment
CN109669976A (en) Data service method and equipment based on ETL
CN109308309B (en) Data service quality assessment method and terminal
Lundberg et al. Quality attributes in software architecture design
WO2016036386A1 (en) Dynamically generating an aggregation routine
CN103198099A (en) Cloud-based data mining application method facing telecommunication service
CN115408381A (en) Data processing method and related equipment
CN102411757B (en) Method and system for forecasting capacity of large host central processing unit (CPU)
CN108763323A (en) Meteorological lattice point file application process based on resource set and big data technology
CN106657282B (en) Method and device for integrating running state information of converter station equipment
CN109358842A (en) A kind of service implementing method, electronic equipment and storage medium
Ribeiro et al. A data integration architecture for smart cities

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20191022

RJ01 Rejection of invention patent application after publication