WO2022007434A1 - Visualization method and related device - Google Patents

Visualization method and related device Download PDF

Info

Publication number
WO2022007434A1
WO2022007434A1 PCT/CN2021/082348 CN2021082348W WO2022007434A1 WO 2022007434 A1 WO2022007434 A1 WO 2022007434A1 CN 2021082348 W CN2021082348 W CN 2021082348W WO 2022007434 A1 WO2022007434 A1 WO 2022007434A1
Authority
WO
WIPO (PCT)
Prior art keywords
information
visualization
target
training data
data
Prior art date
Application number
PCT/CN2021/082348
Other languages
French (fr)
Chinese (zh)
Inventor
朱雁博
Original Assignee
上海商汤智能科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 上海商汤智能科技有限公司 filed Critical 上海商汤智能科技有限公司
Priority to JP2021570729A priority Critical patent/JP2022543180A/en
Priority to KR1020217039065A priority patent/KR20220011134A/en
Publication of WO2022007434A1 publication Critical patent/WO2022007434A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/904Browsing; Visualisation therefor
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/10Protocols in which an application is distributed across nodes in the network

Definitions

  • the present application relates to the field of computer technology, and in particular, to a visualization method and related equipment.
  • deep learning models have been widely used in the field of image and video processing.
  • deep learning models are obtained through training.
  • the training process of deep learning models involves complex and esoteric calculations, and needs to be implemented through multiple iterations.
  • the training time is long, and developers can only evaluate the performance of the deep learning model after training.
  • Embodiments of the present application provide a visualization method and related equipment, which are used to visualize training data of a deep learning model.
  • an embodiment of the present application provides a visualization method, which is applied to a user equipment, and the method includes:
  • the visualization information is displayed.
  • an embodiment of the present application provides a visualization apparatus, which is applied to a user equipment, and the apparatus includes:
  • a sending unit configured to send a visualization request to the cloud server, where the visualization request is used to request visualization of the target training data of the target deep learning model
  • a receiving unit configured to receive a visualization response sent by the cloud server, where the visualization response carries the visualization information of the target training data
  • a display unit for displaying the visual information.
  • an embodiment of the present application provides a visualization method, which is applied to a cloud server, and the method includes:
  • a visualization response is sent to the user equipment, and the visualization response carries the visualization information.
  • an embodiment of the present application provides a visualization device, which is applied to a cloud server, and the device includes:
  • a receiving unit configured to receive a visualization request from the user equipment, where the visualization request is used to request visualization of the target training data of the target deep learning model;
  • a sending unit configured to send a visualization response to the user equipment, where the visualization response carries the visualization information.
  • the present application provides a computer device comprising a processor, a memory, a communication interface, and one or more programs, wherein the one or more programs are stored in the memory and configured by
  • the above-mentioned processor is executed, and the above-mentioned program includes instructions for executing the steps in the method described in the first aspect or the third aspect of the embodiments of the present application.
  • an embodiment of the present application provides a computer-readable storage medium, wherein the computer-readable storage medium stores a computer program for electronic data exchange, wherein the computer program enables a computer to execute the computer program as described in the first embodiment of the present application.
  • an embodiment of the present application provides a computer program product, wherein the computer program product includes a non-transitory computer-readable storage medium storing a computer program, and the computer program is operable to cause a computer to execute as implemented in the present application.
  • the computer program product may be a software installation package.
  • the user equipment first sends a visualization request to the cloud server, then receives the visualization response sent by the cloud server, the visualization response carries the visualization information of the target training data, and finally displays the visualization information.
  • the visualization information of the data can intuitively understand the training status of the deep learning model, which helps to judge the feasibility of the current training strategy in time, and provides a basis for decision-making such as early stopping.
  • FIG. 1 is a schematic diagram of the architecture of a visualization system provided by an embodiment of the present application.
  • FIG. 2 is a schematic flowchart of a visualization method provided by an embodiment of the present application.
  • FIG. 3 is a schematic structural diagram of a computer device provided by an embodiment of the present application.
  • FIG. 4 is a schematic structural diagram of a visualization device provided by an embodiment of the present application.
  • FIG. 5 is a schematic structural diagram of another visualization apparatus provided by an embodiment of the present application.
  • FIG. 1 is a schematic diagram of the architecture of a visualization system provided by an embodiment of the application, where the visualization system includes a supercomputing cluster, a cloud server, a cloud storage, and user equipment.
  • the visualization system can realize the visualization of multi-dimensional and multi-class intermediate training data based on javascript and svg technology, and supports fast and efficient visualization rendering of a large amount of intermediate training data.
  • the visualization system is based on the kubernetes container orchestration system, which provides stable visualization system deployment at the production level and provides dynamic and controllable service capabilities.
  • the kubernetes container is used to manage containerized applications on multiple hosts in the cloud service.
  • the cloud server provides visualization services, and the visualization services are used to perform data operations such as preprocessing on the acquired intermediate training data, and visualize and render the intermediate training data after data operations to obtain visualization information.
  • the cloud server may further provide at least one of the following services: a service gateway, a training management service, a data storage service, and a permission service.
  • the service gateway can be used as the entrance of at least one of the following services: the entrance of training management service, data storage service, data visualization service and authority service.
  • the service gateway can be an application.
  • the service gateway may have a current limiting function.
  • the training management service can provide at least one of the following services: monitoring whether the training tasks of the deep learning model are completed, recording, indexing and searching for abnormal training tasks, comparing the training process of multiple training tasks of the same deep learning model, and Share the training results of deep learning models.
  • the data storage service can provide at least one of the following services: data persistence, using data threads to store intermediate training data generated by deep learning model training, and using cache to store intermediate training data generated by deep learning model training.
  • the rights management service is used to ensure the security of the visualization system, and the visualization system can only be used after passing the rights management service.
  • shared training is also authenticated through the rights management service, which supports unified authentication account login and/or ordinary There are two types of login methods to register.
  • the cloud server can interact with the developer toolkit through HyperText Transfer Protocol (HTTP), Google Remote Procedure Call (GRPC) protocol or other protocols.
  • HTTP HyperText Transfer Protocol
  • GRPC Google Remote Procedure Call
  • the cloud server can directly store the training data in the cloud storage.
  • the supercomputing cluster provides development tools such as python and other programming language toolkits by running the developer toolkit for users to use with tensorflow, pytorch, caffe frameworks, etc., and/or open a new thread as a dedicated thread for data uploading, and
  • the intermediate training data generated by the training of the deep learning model is uploaded to the cloud server through the HTTP communication protocol and the GRPC protocol on the dedicated thread for data uploading.
  • the uploaded intermediate training data types can include vectors, scalars, pictures, videos, audios, etc. Data is stored in the developer kit's data cache.
  • opening a new thread as a dedicated thread for data uploading can ensure the computing speed of the deep learning model.
  • cloud storage can optionally be distributed storage, which is used for unified management of intermediate training data generated by deep learning models, providing massive storage functions for intermediate training data, and as the amount of intermediate training data increases, it can dynamically
  • the capacity can be expanded to meet the storage requirements of intermediate training data and provide follow-up guarantee for the growth of visualization business.
  • the cloud storage may also be data block-level cloud storage, file-level cloud storage, object-level cloud storage and/or other forms of cloud storage.
  • the user equipment may communicate with the cloud server through preset application software, or communicate with the cloud server through a preset network address.
  • the user equipment may include various handheld devices with communication functions, vehicle-mounted devices, wearable devices, computing devices or other processing devices connected to the wireless modem, and various forms of user equipment (User Equipment, UE), Mobile station (Mobile Station, MS), terminal device (terminal device) and so on.
  • UE User Equipment
  • MS Mobile Station
  • terminal device terminal device
  • FIG. 2 is a schematic flowchart of a visualization method provided by an embodiment of the present application, which is applied to the above-mentioned user equipment and cloud server, and specifically includes the following steps:
  • Step 201 The user equipment sends a visualization request to the cloud server, where the visualization request is used to request visualization of the target training data of the target deep learning model.
  • Step 202 The cloud server receives a visualization request from the user equipment, where the visualization request is used to request visualization of the target training data of the target deep learning model.
  • the visualization request carries the training task identifier of the target deep learning model.
  • the cloud server can obtain the relevant training data of the training task based on the training task identifier of the target deep learning model.
  • the stored training data related to the training tasks of each deep learning model may be associated with the training task identifier of the deep learning model.
  • training data may be stored according to training tasks, and relevant training data of different training tasks are stored in different locations.
  • training data may be stored according to deep learning models or data types.
  • the data type can be a two-dimensional heat map, a three-dimensional histogram, a feature map, a call sequence diagram, a scalar line graph, a directed acyclic graph, and so on.
  • the training tasks are all located on the supercomputing cluster, and the training of the deep learning model can be accelerated by using the central processing unit of the supercomputing cluster.
  • the training data may be stored in association with the task identifier of the training task to which the training data belongs, and so on, which is not limited in this embodiment of the present disclosure.
  • the visualization request carries the training task identification of the target deep learning model and the index information of the target training data, and the index information is used to find the target training data to determine the storage location of the target training data.
  • the index information may be implemented in multiple ways.
  • the index information may include any one or more of data creation time, data type, and data label.
  • the index information includes data type and data label.
  • the same data type can have multiple data labels, and the data labels can be customized.
  • the data labels can be exact values, loss values, and so on.
  • the visualization request may be a request to sample all the data in the intermediate data set corresponding to the index information or randomly sample the data in the intermediate data set according to the training task identifier and the index information.
  • the target deep learning model includes at least one deep neural network, and the target training data may be intermediate data generated by one of the deep neural networks in the target deep learning model, or may be intermediate data generated by multiple deep neural networks in the target deep learning model. .
  • the visualization request may be a request to visualize the intermediate data generated by the target deep learning model in the first period of time, wherein the duration of the first period of time may be other values such as 3min, 5min, 9min, and 15min, and the termination time of the first moment may be is the current time, it can be earlier than the current time, or it can be later than the current time.
  • the target training data is stored in cloud storage.
  • the target training data includes at least one of the following: model performance change trend information, model loss trend information, model parameter distribution information, model processing intermediate results, model structure information, current progress information of model training, Comparing information for different trainings of the same model, and time series of scheduling time information for various processes.
  • the category of information is a scalar line chart.
  • the target training data is the model performance change trend information
  • the target training data is the model loss trend information
  • the target training data is the current progress information of model training
  • the target training data is the comparison information of different trainings of the same model
  • the category of visualization information is a three-dimensional histogram, and it can be determined whether the model parameter distribution is abnormal through the three-dimensional histogram. If the model parameter distribution is abnormal, the model training is stopped.
  • the category of the visual information is the feature map, and through the feature map, it can be determined whether the features extracted by the model are correct, and if the extracted features are wrong, rebuild the model.
  • the category of the visualization information is a directed acyclic graph
  • the directed acyclic graph can determine whether the structure of the model is correct, and if the structure of the model is wrong, rebuild the The structure of the model.
  • the architecture of the deep learning model supported by the directed acyclic graph can be a standard architecture such as Open Neural Network Exchange (ONNX), or other types of architectures, in which ONNX does not need to perform intermediate training data.
  • ONNX Open Neural Network Exchange
  • the types of visualization information are the calling sequence diagrams of various processes, and the time-consuming situation of each operator can be determined through the calling sequence diagram.
  • the time consumption is greater than the preset time, the operators whose time consumption is greater than the preset time are optimized.
  • the category of the visualization information is a two-dimensional heat map. Through the heat map, it can be determined whether the features extracted by the model are correct. Build the model.
  • the feature map and the heat map are different representations of the target training data for the model to process the intermediate results.
  • the types of target training data include at least one of the following: scalar, vector, picture, video, and audio.
  • Step 203 The cloud server obtains target training data corresponding to the visualization request.
  • the acquiring target training data corresponding to the visualization request includes:
  • the target training data stored in the cloud storage is acquired based on the storage location information of the target training data.
  • a list of intermediate training data is obtained based on the training task identification and index information, and the intermediate training data is obtained in cloud storage based on the list of intermediate training data.
  • the visualization request carries the identification information of the target training data of the target deep learning model, and the identification information is used by the cloud server to obtain the target training data from the cloud storage; wherein, the identification information is preset, and the identification Information is unique in the cloud storage.
  • the training tasks of the target deep learning model include training task A and training task B
  • the training data generated by training task A includes training data A1 and training data A2
  • the training data generated by training task B includes training data B1 and training data B2
  • training data B1 and training data B2 are all stored in the cloud platform
  • the identification information of training data A1 is 1, the identification information of training data A2 is 2
  • the identification information of training data B1 is 3
  • the identification information of the training data B2 is 4, if the identification information is 1, the target training data obtained by the cloud server from the cloud storage is the training data A1; if the identification information is 2, the target training data obtained by the cloud server from the cloud storage
  • the data is training data A2; if the identification information is 3, the target training data obtained by the cloud server from the cloud storage is training data B1; if the identification information is 4, the target training data obtained by the cloud server from the cloud storage is training data B2.
  • Step 204 The cloud server preprocesses the target training data to obtain visualization information.
  • Step 205 The cloud server sends a visualization response to the user equipment, where the visualization response carries the visualization information.
  • Step 206 The user equipment receives a visualization response sent by the cloud server, where the visualization response carries the visualization information of the target training data.
  • Step 207 The user equipment displays the visualized information.
  • the visualization information includes at least one of the following categories: directed acyclic graphs, three-dimensional histograms, call sequence diagrams of various processes, feature maps, two-dimensional heat maps, and scalar line graphs.
  • the visualization information includes the following information: topology information of at least a part of the target deep learning model, wherein at least a part of the target deep learning model includes: multiple modules and/or multiple operators of the target deep learning model; resource occupation information of each operator in at least one operator included in the target deep learning model.
  • the topology information of multiple modules includes at least one of the following: identification information of the multiple modules, dependencies between the multiple modules, data size of each module in the multiple modules, Information of at least one operator included in each of the modules.
  • the identification information of the module is unique and can be preset.
  • the information of the operators includes at least one of the following: identification information of the operators, dependencies between operators, and data size of the operators.
  • the dependency relationship can be a sequential relationship or a parallel relationship.
  • the resource occupation information is determined by the cloud server based on at least one of the data type of the operator, the input data information of the operator, and the output data information of the operator.
  • the input of the operator can be a picture, video, audio, scalar, vector, etc., and different inputs correspond to different dimensions.
  • a picture is equivalent to a two-dimensional matrix
  • an audio is equivalent to a one-dimensional matrix.
  • the data type of the operator can be a double-precision type or a single-precision type.
  • the operator can be at least one of the following: convolution, batch normalization (BatchNorm, BN), full link, pooling, matrix multiplication and division, dropout (DropOut), activation, etc.
  • the input data information may be the size of the input data
  • the output data information may be the size of the output data
  • the visualization information includes the performance change trend information of the target deep model, the loss trend information of the target deep model, the training progress information of the target deep learning model, and the difference in the target deep learning model. At least one of the training comparison information.
  • the visualization information includes features extracted by the target deep learning model.
  • the visualization information includes the parameter distribution of each operator in the plurality of operators.
  • the visualization information includes the running time of each operator in at least one process of the multiple operators.
  • the process may be at least one of an interpreted language process, a local process, and an AI chip process.
  • the interpreted language process, the local process, and the AI chip process alternately execute the calling sequence diagram according to time.
  • the user equipment first sends a visualization request to the cloud server, then receives the visualization response sent by the cloud server, the visualization response carries the visualization information of the target training data, and finally displays the visualization information.
  • the visual information of the data can intuitively understand the training status of the deep learning model, which helps to improve the feasibility of timely judgment of the current training strategy, and provides a basis for decision-making such as early stopping.
  • the displaying the visualized information includes:
  • a module in the target deep learning model is used as the minimum display unit, wherein the module includes at least one operator; and/or
  • the operator in the target deep learning model is used as the minimum display unit.
  • the visualization information when the category of the visualization information is a directed acyclic graph, the visualization information can be displayed in the collapsed display mode, and the visualization information can also be displayed in the expanded display mode. , which displays visual information by expanding the display mode.
  • the expanded display mode may be an operator in the expanded display module after the module receives an operation instruction.
  • the expanded display mode may be to directly display the parameter distribution of each operator in each module.
  • the expanded display mode may be to directly display the running time of each operator in each module in at least one process.
  • the target deep learning model includes 2 modules (module A and module B), and module A includes 3 operators (A1, A2, and A3), the model B includes 2 operators (B1 and B2). If the visualized information is displayed in the folded display mode, module A and module B are displayed; if the visualized information is displayed in the expanded display mode, after the module A receives the operation instruction, the operators A1, A2 and A3 in the module A are displayed, and the module B receives the operation After the instruction, the operators B1 and B2 in module B are displayed.
  • the target deep learning model includes 2 modules (module A and module B), module A includes 3 operators (A1, A2, and A3), and model B includes Including two operators (B1 and B2), module A corresponds to 3D histogram 1, and module B corresponds to 3D histogram B, then 3D histogram 1 includes the parameter distribution of A1, the parameter distribution of A2 and the parameter distribution of A3
  • 3D histogram 1 includes the parameter distribution of A1, the parameter distribution of A2 and the parameter distribution of A3
  • the three-dimensional histogram 2 includes the parameter distribution of B1 and the parameter distribution of B2.
  • the target deep learning model includes 2 modules (module A and module B) and 2 processes (process C1 and C2), and module A includes 2 operators ( A1 and A2), model B includes 2 operators (B1 and B2), module A corresponds to call sequence diagram 1, and module B corresponds to call sequence diagram B, then the call sequence diagram 1 includes the running time of A1 in process C1 , the running duration of A1 in process C2, including the running duration of A2 in process C1 and the running duration of A2 in process C2; the calling sequence diagram 2 includes the running duration of B1 in process C1 and the running duration of B1 in process C2 The running duration includes the running duration of B2 in process C1 and the running duration of B2 in process C2.
  • a display mode of visualized information may be preset, which may be to display the visualized information in a folded display mode first, and then display the visualized information in an expanded display mode after a first duration, which may be preset.
  • the type of visualization information is a scalar line graph
  • the expandable display mode may be to directly display the features extracted by the target deep learning model.
  • the user equipment displays visual information in different ways, which is beneficial to improve the application scope of the user equipment.
  • the method before the sending the visualization request to the cloud server, the method further includes:
  • the intermediate training data of the target deep learning model is sent to the cloud server through a data upload thread, and the intermediate training data is used by the cloud server to obtain the index information and training task identifier of the target training data, and based on the
  • the training task identifier stores the index information in a database, and stores the target training data in cloud storage based on the index information.
  • the data uploading thread is a newly opened thread.
  • the training task identification is unique, and the training task identification may be determined after the intermediate training data is generated, or may be determined at the beginning of training.
  • the intermediate training data includes target training data, training task identifiers, and index information.
  • the intermediate training data of the target deep learning model is sent to the cloud server through a data upload thread, and the intermediate data is used for the cloud server to obtain the identification information of the target training data, and to store the identification information in a database.
  • the target training data is stored in cloud storage based on the identification information.
  • the identification information is unique.
  • the method before sending the intermediate training data of the target deep learning model to the cloud server through a data upload thread, the method further includes:
  • the training task identifier and the index information of the target training data determine the training task identifier and the index information of the target training data, and carry the training task information and the index information of the target training data in the intermediate training data;
  • the identification information of the target training data is determined, and the identification information is carried on the intermediate training data.
  • the identification information may be stored in the first buffer area of the database, and the target training data may be stored in the second buffer area of the database based on the identification information.
  • the target data is stored in the cloud storage by the cloud server, which is beneficial for the cloud server to quickly obtain the target training data after receiving the visualization request.
  • the preprocessing of the target training data to obtain visualization information includes:
  • the preprocessing of the picture or video includes one or more of the following processes: insensitive area removal processing, image precision enhancement processing, image noise reduction processing and image processing Binarization processing.
  • the text preprocessing includes one or more of the following processes: document segmentation, text segmentation, and removal of stop words (including punctuation, numbers, monads, and other Meaningless words) processing, text feature extraction, word frequency statistics processing and text vectorization processing.
  • stop words including punctuation, numbers, monads, and other Meaningless words
  • the visual rendering is to assemble the preprocessed data into a hypertext markup language (Hypertext Markup Language, HTML).
  • HTML Hypertext Markup Language
  • the type of visualization information obtained by preprocessing is non-directed acyclic graph.
  • the target training data is preprocessed to obtain preprocessed data, and then the preprocessed data is rendered to obtain visual information, which is beneficial to the target deep learning model and analysis based on the visual information.
  • the preprocessing of the target training data to obtain visualization information includes:
  • the target training data is parsed through a binary tree to obtain the visualization information.
  • the type of the target deep learning model that processes the target training data through the binary tree is parrots.
  • the type of visualization information obtained by parsing the target training data through a binary tree is a directed acyclic graph.
  • the visualization information is obtained by analyzing the target training data, which is beneficial to analyze and analyze the target deep learning model based on the visualization information.
  • FIG. 3 is a schematic structural diagram of a computer device provided by an embodiment of the present application.
  • the computer device includes a processor, a memory, a communication interface, and one or more programs, wherein one or more of the above A plurality of programs are stored in the above-mentioned memory, and are configured to be executed by the above-mentioned processor.
  • the computer device is user equipment
  • the above program includes instructions for performing the following steps:
  • the visualization information is displayed.
  • the visualization information includes at least one of the following categories: directed acyclic graph, three-dimensional histogram, call sequence diagram of various processes, feature map, two-dimensional heat map, and scalar line graph; and/or
  • the target training data includes at least one of the following: model performance change trend information, model loss trend information, model parameter distribution information, model processing intermediate results, model structure information, current progress information of model training, and information on different trainings for the same model. Comparison information, scheduling time information of various processes.
  • the visualization information includes at least one of the following information:
  • topology information of at least a part of the target deep learning model wherein at least a part of the target deep learning model includes: multiple modules and/or multiple operators of the target deep learning model;
  • the topology information of the multiple modules includes at least one of the following:
  • the identification information of the multiple modules the dependencies between the multiple modules, the data size of each module in the multiple modules, and the information of at least one operator included in each of the modules.
  • the resource occupation information is determined by the cloud server based on at least one of the data type of the operator, the input data information of the operator, and the output data information of the operator.
  • the visualization request carries a training task identifier of the target deep learning model and index information of the target training data, where the index information includes: data type and data label.
  • the above-mentioned program includes instructions for executing the following steps:
  • the display of visual data in different modes is supported, wherein, in some embodiments, the visual information may be displayed in a folding display mode or a presentation display mode, or other display modes may also be defined. Embodiments are not limited thereto.
  • a module in the target deep learning model is used as the smallest display unit, wherein the module includes at least one operator.
  • the modules here may be divided in a default division manner, or may also be set by a user, which is not limited in this embodiment of the present disclosure.
  • the operator in the target deep learning model is used as the minimum display unit. At this point, the information of all operators in the model can be displayed.
  • it can also be displayed in a mixed display mode, that is, some modules are folded and displayed, and other modules are displayed in an expanded manner, which can optionally be displayed based on user settings, which is not covered by the embodiments of the present disclosure. Do limit.
  • the above program before sending the visualization request to the cloud server, the above program includes an instruction for performing the following steps:
  • the intermediate training data of the target deep learning model is sent to the cloud server through a data upload thread, and the intermediate training data is used by the cloud server to obtain the index information and training task identifier of the target training data, and based on the
  • the training task identifier stores the index information in a database, and stores the target training data in cloud storage based on the index information.
  • the computer device is a cloud server
  • the above program includes instructions for performing the following steps:
  • a visualization response is sent to the user equipment, and the visualization response carries the visualization information.
  • the visualization information includes at least one of the following categories: directed acyclic graph, three-dimensional histogram, call sequence diagram of various processes, feature map, two-dimensional heat map, and scalar line graph; and/or
  • the target training data includes at least one of the following: model performance change trend information, model loss trend information, model parameter distribution information, model processing intermediate results, model structure information, current progress information of model training, and information on different trainings for the same model. Comparison information, scheduling time information of various processes.
  • the visualization information includes at least one of the following information:
  • topology information of at least a part of the target deep learning model wherein at least a part of the target deep learning model includes: multiple modules and/or multiple operators of the target deep learning model;
  • the topology information of the multiple modules includes at least one of the following:
  • the identification information of the multiple modules the dependencies between the multiple modules, the data size of each module in the multiple modules, and the information of at least one operator included in each of the modules.
  • the above program includes instructions for executing the following steps:
  • the visualization request carries a training task identifier of the target deep learning model and index information of the target training data, where the index information includes: data type and data label;
  • the above-mentioned program includes an instruction for executing the following steps: searching an index database based on the training task identifier and index information carried in the visualization request, so as to obtain the target training data. storage location information; based on the storage location information of the target training data, obtain the target training data stored in the cloud storage.
  • the above program includes instructions for executing the following steps:
  • the above program includes instructions for executing the following steps:
  • the target training data is parsed through a binary tree to obtain the visualization information.
  • FIG. 4 is a visualization apparatus provided by an embodiment of the present application, applied to user equipment, and the apparatus includes:
  • a sending unit 401 configured to send a visualization request to a cloud server, where the visualization request is used to request visualization of target training data of a target deep learning model;
  • a receiving unit 402 configured to receive a visualization response sent by the cloud server, where the visualization response carries the visualization information of the target training data;
  • the display unit 403 is configured to display the visualized information.
  • the visualization information includes at least one of the following categories: directed acyclic graph, three-dimensional histogram, call sequence diagram of various processes, feature map, two-dimensional heat map, and scalar polyline Figures; and/or
  • the target training data includes at least one of the following: model performance change trend information, model loss trend information, model parameter distribution information, model processing intermediate results, model structure information, current progress information of model training, and information on different trainings for the same model. Comparison information, scheduling time information of various processes.
  • the visualized information includes at least one of the following information:
  • topology information of at least a part of the target deep learning model wherein at least a part of the target deep learning model includes: multiple modules and/or multiple operators of the target deep learning model;
  • the topology information of the multiple modules includes at least one of the following:
  • the identification information of the multiple modules the dependencies between the multiple modules, the data size of each module in the multiple modules, and the information of at least one operator included in each of the modules.
  • the resource occupation information is at least one of the cloud server based on the data type of the operator, the input data information of the operator, and the output data information of the operator definite.
  • the visualization request carries a training task identifier of the target deep learning model and index information of the target training data, where the index information includes: data type and data label.
  • the above-mentioned display unit 403 is specifically used for:
  • a module in the target deep learning model is used as the minimum display unit, wherein the module includes at least one operator; and/or
  • the operator in the target deep learning model is used as the minimum display unit.
  • the above-mentioned sending unit 401 is specifically further configured to send the intermediate training data of the target deep learning model to the cloud server through a data upload thread before sending the visualization request to the cloud server, and the The intermediate training data is used for the cloud server to obtain the index information and training task identifier of the target training data;
  • the apparatus further includes: a data storage unit 404, configured to store the index information in a database based on the training task identifier, and store the target training data in cloud storage based on the index information.
  • a data storage unit 404 configured to store the index information in a database based on the training task identifier, and store the target training data in cloud storage based on the index information.
  • the sending unit 401 and the receiving unit 402 can be implemented through a communication interface
  • the display unit 403 can be implemented through a display screen
  • the data storage unit 404 can be implemented through a processor screen.
  • FIG. 5 is a visualization device provided by an embodiment of the present application, applied to a cloud server, and the device includes:
  • a receiving unit 501 configured to receive a visualization request from a user equipment, where the visualization request is used to request visualization of target training data of a target deep learning model;
  • an obtaining unit 502 configured to obtain target training data corresponding to the visualization request
  • Determining unit 503, configured to preprocess the target training data to obtain visualization information
  • the sending unit 504 is configured to send a visualization response to the user equipment, where the visualization response carries the visualization information.
  • the visualization information includes at least one of the following categories: directed acyclic graph, three-dimensional histogram, call sequence diagram of various processes, feature map, two-dimensional heat map, and scalar polyline Figures; and/or
  • the target training data includes at least one of the following: model performance change trend information, model loss trend information, model parameter distribution information, model processing intermediate results, model structure information, current progress information of model training, and information on different trainings for the same model. Comparison information, scheduling time information of various processes.
  • the visualized information includes at least one of the following information:
  • topology information of at least a part of the target deep learning model wherein at least a part of the target deep learning model includes: multiple modules and/or multiple operators of the target deep learning model;
  • the topology information of the multiple modules includes at least one of the following:
  • the identification information of the multiple modules the dependencies between the multiple modules, the data size of each module in the multiple modules, and the information of at least one operator included in each of the modules.
  • the above program includes instructions for executing the following steps:
  • the visualization request carries a training task identifier of the target deep learning model and index information of the target training data, where the index information includes: data type and data label;
  • the above acquiring unit 502 is specifically used for:
  • the target training data stored in the cloud storage is acquired based on the storage location information of the target training data.
  • the above determining unit 503 is specifically configured to:
  • the above determining unit 503 is specifically configured to:
  • the target training data is parsed through a binary tree to obtain the visualization information.
  • the receiving unit 501 and the sending unit 504 may be implemented by a communication interface
  • the acquiring unit 502 and the determining unit 503 may be implemented by a processor.
  • Embodiments of the present application further provide a computer-readable storage medium, wherein the computer-readable storage medium stores a computer program for electronic data exchange, wherein the computer program causes a computer to execute the electronic Some or all of the steps described by the device or cloud server.
  • Embodiments of the present application further provide a computer program product, wherein the computer program product includes a non-transitory computer-readable storage medium storing a computer program, and the computer program is operable to cause the computer to execute the electronic method as described above. Some or all of the steps described by the device or cloud server.
  • the computer program product may be a software installation package.
  • the steps of the method or algorithm described in the embodiments of the present application may be implemented in a hardware manner, or may be implemented in a manner in which a processor executes software instructions.
  • Software instructions can be composed of corresponding software modules, and software modules can be stored in random access memory (Random Access Memory, RAM), flash memory, read only memory (Read Only Memory, ROM), erasable programmable read only memory ( Erasable Programmable ROM, EPROM), Electrically Erasable Programmable Read-Only Memory (Electrically EPROM, EEPROM), registers, hard disk, removable hard disk, CD-ROM, or any other form of storage medium known in the art.
  • An exemplary storage medium is coupled to the processor, such that the processor can read information from, and write information to, the storage medium.
  • the storage medium can also be an integral part of the processor.
  • the processor and storage medium may reside in an ASIC.
  • the ASIC may reside in access network equipment, target network equipment or core network equipment.
  • the processor and the storage medium may also exist in the access network device, the target network device or the core network device as discrete components.
  • the functions described in the embodiments of the present application may be implemented in whole or in part by software, hardware, firmware, or any combination thereof.
  • software it can be implemented in whole or in part in the form of a computer program product.
  • the computer program product includes one or more computer instructions. When the computer program instructions are loaded and executed on a computer, all or part of the processes or functions described in the embodiments of the present application are generated.
  • the computer may be a general purpose computer, special purpose computer, computer network, or other programmable device.
  • the computer instructions may be stored in or transmitted from one computer readable storage medium to another computer readable storage medium, for example, the computer instructions may be downloaded from a website site, computer, server or data center Transmission to another website site, computer, server or data center via wired (eg coaxial cable, optical fiber, Digital Subscriber Line, DSL) or wireless (eg infrared, wireless, microwave, etc.) means.
  • the computer-readable storage medium may be any available medium that can be accessed by a computer or a data storage device such as a server, data center, etc. that includes an integration of one or more available media.
  • the available media may be magnetic media (eg, floppy disks, hard disks, magnetic tapes), optical media (eg, Digital Video Disc (DVD)), or semiconductor media (eg, Solid State Disk (SSD)) )Wait.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • General Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • Biophysics (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Biomedical Technology (AREA)
  • Computational Linguistics (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Signal Processing (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Electrically Operated Instructional Devices (AREA)

Abstract

A visualization method and a related device. The method comprises: a user equipment sending a visualization request to a cloud server, wherein the visualization request is used for requesting visualization of target training data of a target deep learning model; the user equipment receiving a visualization response sent by the cloud server, wherein the visualization response carries visualization information of the target training data; and the user equipment displaying the visualization information. By means of the present method, training data of a deep learning model can be visualized, thereby facilitating the timely determination of the feasibility of the current training strategy, and providing a basis for decisions such as stopping early.

Description

可视化方法及相关设备Visualization method and related equipment
本申请要求于2020年07月09日提交中国专利局、申请号为202010656553.8、申请名称为“可视化方法及相关设备”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims the priority of the Chinese patent application with the application number of 202010656553.8 and the application title of "Visualization Method and Related Equipment" filed with the China Patent Office on July 9, 2020, the entire contents of which are incorporated into this application by reference.
技术领域technical field
本申请涉及计算机技术领域,尤其涉及一种可视化方法及相关设备。The present application relates to the field of computer technology, and in particular, to a visualization method and related equipment.
背景技术Background technique
近年来,深度学习模型在图像和视频处理领域中得到广泛应用,一般地,深度学习模型通过训练得到,然而,深度学习模型的训练过程涉及复杂且深奥的计算,并且需要经过多次迭代实现,训练时间较长,研发人员只能在训练结束后对训练得到的深度学习模型进行性能评估。In recent years, deep learning models have been widely used in the field of image and video processing. Generally, deep learning models are obtained through training. However, the training process of deep learning models involves complex and esoteric calculations, and needs to be implemented through multiple iterations. The training time is long, and developers can only evaluate the performance of the deep learning model after training.
发明内容SUMMARY OF THE INVENTION
本申请实施例提供一种可视化方法及相关设备,用于实现深度学习模型的训练数据的可视化。Embodiments of the present application provide a visualization method and related equipment, which are used to visualize training data of a deep learning model.
第一方面,本申请实施例提供一种可视化方法,应用于用户设备,所述方法包括:In a first aspect, an embodiment of the present application provides a visualization method, which is applied to a user equipment, and the method includes:
向云服务器发送可视化请求,所述可视化请求用于请求对目标深度学习模型的目标训练数据进行可视化;sending a visualization request to the cloud server, where the visualization request is used to request visualization of the target training data of the target deep learning model;
接收所述云服务器发送的可视化响应,所述可视化响应携带有所述目标训练数据的可视化信息;receiving a visualization response sent by the cloud server, where the visualization response carries the visualization information of the target training data;
显示所述可视化信息。The visualization information is displayed.
第二方面,本申请实施例提供一种可视化装置,应用于用户设备,所述装置包括:In a second aspect, an embodiment of the present application provides a visualization apparatus, which is applied to a user equipment, and the apparatus includes:
发送单元,用于向云服务器发送可视化请求,所述可视化请求用于请求对目标深度学习模型的目标训练数据进行可视化;a sending unit, configured to send a visualization request to the cloud server, where the visualization request is used to request visualization of the target training data of the target deep learning model;
接收单元,用于接收所述云服务器发送的可视化响应,所述可视化响应携带有所述目标训练数据的可视化信息;a receiving unit, configured to receive a visualization response sent by the cloud server, where the visualization response carries the visualization information of the target training data;
显示单元,用于显示所述可视化信息。a display unit for displaying the visual information.
第三方面,本申请实施例提供一种可视化方法,应用于云服务器,所述方法包括:In a third aspect, an embodiment of the present application provides a visualization method, which is applied to a cloud server, and the method includes:
接收来自用户设备的可视化请求,所述可视化请求用于请求对目标深度学习模型的目标训练数据进行可视化;receiving a visualization request from the user equipment, where the visualization request is used to request visualization of the target training data of the target deep learning model;
获取所述可视化请求对应的目标训练数据;obtaining target training data corresponding to the visualization request;
对所述目标训练数据进行预处理,得到可视化信息;Preprocessing the target training data to obtain visualization information;
向所述用户设备发送可视化响应,所述可视化响应携带所述可视化信息。A visualization response is sent to the user equipment, and the visualization response carries the visualization information.
第四方面,本申请实施例提供一种可视化装置,应用于云服务器,所述装置包括:In a fourth aspect, an embodiment of the present application provides a visualization device, which is applied to a cloud server, and the device includes:
接收单元,用于接收来自用户设备的可视化请求,所述可视化请求用于请求对目标深度学习模型的目标训练数据进行可视化;a receiving unit, configured to receive a visualization request from the user equipment, where the visualization request is used to request visualization of the target training data of the target deep learning model;
获取单元,用于获取所述可视化请求对应的目标训练数据;an obtaining unit for obtaining target training data corresponding to the visualization request;
确定单元,用于对所述目标训练数据进行预处理,得到可视化信息;a determining unit for preprocessing the target training data to obtain visualization information;
发送单元,用于向所述用户设备发送可视化响应,所述可视化响应携带所述可视化信息。A sending unit, configured to send a visualization response to the user equipment, where the visualization response carries the visualization information.
第五方面,本申请提供一种计算机设备,所述计算机设备包括处理器、存储器、通信接口以及一个或多个程序,其中,上述一个或多个程序被存储在上述存储器中,并且被配置由上述处理器执行,上述程序包括用于执行本申请实施例第一方面或第三方面所述的方法中的步骤的指令。In a fifth aspect, the present application provides a computer device comprising a processor, a memory, a communication interface, and one or more programs, wherein the one or more programs are stored in the memory and configured by The above-mentioned processor is executed, and the above-mentioned program includes instructions for executing the steps in the method described in the first aspect or the third aspect of the embodiments of the present application.
第六方面,本申请实施例提供了一种计算机可读存储介质,其中,上述计算机可读存储介质存储用于电子数据交换的计算机程序,其中,上述计算机程序使得计算机执行如本申请实施例第一方面或第三方面所述的方法中所描述的部分或全部步骤。In a sixth aspect, an embodiment of the present application provides a computer-readable storage medium, wherein the computer-readable storage medium stores a computer program for electronic data exchange, wherein the computer program enables a computer to execute the computer program as described in the first embodiment of the present application. Some or all of the steps described in the method of the first aspect or the third aspect.
第七方面,本申请实施例提供了一种计算机程序产品,其中,上述计算机程序产品包括存储了计算机程序的非瞬时性计算机可读存储介质,上述计算机程序可操作来使计算机执行如本申请实施例第一方面或第三方面所述的方法中所描述的部分或全部步骤。该计算机程序产品可以为一个软件安装包。In a seventh aspect, an embodiment of the present application provides a computer program product, wherein the computer program product includes a non-transitory computer-readable storage medium storing a computer program, and the computer program is operable to cause a computer to execute as implemented in the present application. For example, some or all of the steps described in the method described in the first aspect or the third aspect. The computer program product may be a software installation package.
可以看出,在本申请实施例中,用户设备首先向云服务器发送可视化请求,然后接收云服务器发送的可视化响应,可视化响应携带有目标训练数据的可视化信息,最后显示可视化信息,由于通过目标训练数据的可视化信息,可以直观地了解深度学习模型的训练状态,有助于及时判断当前训练策略的可行性,为早停等决策提供依据。It can be seen that, in this embodiment of the present application, the user equipment first sends a visualization request to the cloud server, then receives the visualization response sent by the cloud server, the visualization response carries the visualization information of the target training data, and finally displays the visualization information. The visualization information of the data can intuitively understand the training status of the deep learning model, which helps to judge the feasibility of the current training strategy in time, and provides a basis for decision-making such as early stopping.
附图说明Description of drawings
为了更清楚地说明本发明实施例或现有技术中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本发明的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to explain the embodiments of the present invention or the technical solutions in the prior art more clearly, the following briefly introduces the accompanying drawings that need to be used in the description of the embodiments or the prior art. Obviously, the accompanying drawings in the following description are only These are some embodiments of the present invention. For those of ordinary skill in the art, other drawings can also be obtained according to these drawings without creative efforts.
图1是本申请实施例提供的一种可视化系统的架构示意图;1 is a schematic diagram of the architecture of a visualization system provided by an embodiment of the present application;
图2是本申请实施例提供的一种可视化方法的流程示意图;2 is a schematic flowchart of a visualization method provided by an embodiment of the present application;
图3本申请实施例提供的一种计算机设备的结构示意图;3 is a schematic structural diagram of a computer device provided by an embodiment of the present application;
图4本申请实施例提供的一种可视化装置的结构示意图;FIG. 4 is a schematic structural diagram of a visualization device provided by an embodiment of the present application;
图5本申请实施例提供的另一种可视化装置的结构示意图。FIG. 5 is a schematic structural diagram of another visualization apparatus provided by an embodiment of the present application.
具体实施方式detailed description
为了使本技术领域的人员更好地理解本发明方案,下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本发明一部分的实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员 在没有做出创造性劳动前提下所获得的所有其他实施例,都应当属于本发明保护的范围。In order to make those skilled in the art better understand the solutions of the present invention, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only Embodiments are part of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative work shall fall within the protection scope of the present invention.
以下分别进行详细说明。Each of them will be described in detail below.
本发明的说明书和权利要求书及所述附图中的术语“第一”、“第二”、“第三”和“第四”等是用于区别不同对象,而不是用于描述特定顺序。此外,术语“包括”和“具有”以及它们任何变形,意图在于覆盖不排他的包含。例如包含了一系列步骤或模块的过程、方法、系统、产品或设备没有限定于已列出的步骤或模块,而是可选地还包括没有列出的步骤或模块,或可选地还包括对于这些过程、方法、产品或设备固有的其它步骤或模块。The terms "first", "second", "third" and "fourth" in the description and claims of the present invention and the accompanying drawings are used to distinguish different objects, rather than to describe a specific order. . Furthermore, the terms "comprising" and "having" and any variations thereof are intended to cover non-exclusive inclusion. For example, a process, method, system, product or device comprising a series of steps or modules is not limited to the listed steps or modules, but optionally also includes unlisted steps or modules, or optionally also includes Other steps or modules inherent to these processes, methods, products or devices.
在本文中提及“实施例”意味着,结合实施例描述的特定特征、结构或特性可以包含在本发明的至少一个实施例中。在说明书中的各个位置出现该短语并不一定均是指相同的实施例,也不是与其它实施例互斥的独立的或备选的实施例。本领域技术人员显式地和隐式地理解的是,本文所描述的实施例可以与其它实施例相结合。Reference herein to an "embodiment" means that a particular feature, structure, or characteristic described in connection with the embodiment can be included in at least one embodiment of the present invention. The appearances of the phrase in various places in the specification are not necessarily all referring to the same embodiment, nor a separate or alternative embodiment that is mutually exclusive of other embodiments. It is explicitly and implicitly understood by those skilled in the art that the embodiments described herein may be combined with other embodiments.
请参见图1,图1是申请实施例提供的一种可视化系统的架构示意图,该可视化系统包括超算集群、云服务器、云存储及用户设备。需要说明的是,图1中所示的超算集群、云服务器、云存储及用户设备的形态和数量仅用于举例,并不构成对本申请实施例的限定。其中,该可视化系统基于javascript和svg技术可以实现了多维多类中间训练数据的可视化,支持大量中间训练数据的快速高效的可视化渲染。Please refer to FIG. 1 . FIG. 1 is a schematic diagram of the architecture of a visualization system provided by an embodiment of the application, where the visualization system includes a supercomputing cluster, a cloud server, a cloud storage, and user equipment. It should be noted that the shapes and numbers of supercomputing clusters, cloud servers, cloud storages, and user equipment shown in FIG. 1 are only for examples, and do not constitute limitations to the embodiments of the present application. Among them, the visualization system can realize the visualization of multi-dimensional and multi-class intermediate training data based on javascript and svg technology, and supports fast and efficient visualization rendering of a large amount of intermediate training data.
其中,该可视化系统基于kubernetes容器编排系统,提供生产级别稳定的可视化系统部署,提供动态可控的服务能力,kubernetes容器用于管理云服务中多个主机上的容器化的应用。Among them, the visualization system is based on the kubernetes container orchestration system, which provides stable visualization system deployment at the production level and provides dynamic and controllable service capabilities. The kubernetes container is used to manage containerized applications on multiple hosts in the cloud service.
其中,云服务器提供可视化服务,可视化服务用于对获取的中间训练数据进行预处理等数据操作,对进行数据操作后的中间训练数据进行可视化渲染得到可视化信息。The cloud server provides visualization services, and the visualization services are used to perform data operations such as preprocessing on the acquired intermediate training data, and visualize and render the intermediate training data after data operations to obtain visualization information.
可选地,云服务器还可提供以下服务中的至少一种:服务网关、训练管理服务、数据存储服务及权限服务。Optionally, the cloud server may further provide at least one of the following services: a service gateway, a training management service, a data storage service, and a permission service.
其中,服务网关可作为以下至少一种服务的入口:训练管理服务、数据存储服务、数据可视化服务及权限服务的入口。Wherein, the service gateway can be used as the entrance of at least one of the following services: the entrance of training management service, data storage service, data visualization service and authority service.
可选地,服务网关可以是应用程序。Optionally, the service gateway can be an application.
可选地,服务网关可以具备限流功能。Optionally, the service gateway may have a current limiting function.
其中,训练管理服务可提供以下至少一种服务:监控深度学习模型的训练任务是否完成,对产生异常的训练任务进行记录、索引和查找,比较同一深度学习模型的多个训练任务的训练过程以及对深度学习模型的训练结果进行共享。Among them, the training management service can provide at least one of the following services: monitoring whether the training tasks of the deep learning model are completed, recording, indexing and searching for abnormal training tasks, comparing the training process of multiple training tasks of the same deep learning model, and Share the training results of deep learning models.
其中,数据存储服务可提供以下至少一种服务:数据持久化,采用数据线程将深度学习模型训练产生的中间训练数据进行存储以及采用缓存的方式将深度学习模型训练产生的中间训练数据进行存储。The data storage service can provide at least one of the following services: data persistence, using data threads to store intermediate training data generated by deep learning model training, and using cache to store intermediate training data generated by deep learning model training.
其中,权限管理服务用于保证可视化系统的安全性,在通过权限管理服务后才可使用可视化系统,此外,共享训练也是经由权限管理服务进行认证,权限管理服务支持统一认证账号登录和/或普通注册两种类型的登录方式。Among them, the rights management service is used to ensure the security of the visualization system, and the visualization system can only be used after passing the rights management service. In addition, shared training is also authenticated through the rights management service, which supports unified authentication account login and/or ordinary There are two types of login methods to register.
其中,云服务器可以通过超文本传输协议(HyperText Transfer Protocol,HTTP)、谷歌远程过程调用(Google Remote Procedure Call,GRPC)协议或其他协议与开发者工具包进 行交互。Among them, the cloud server can interact with the developer toolkit through HyperText Transfer Protocol (HTTP), Google Remote Procedure Call (GRPC) protocol or other protocols.
其中,云服务器可直接将训练数据存储在云存储中。Among them, the cloud server can directly store the training data in the cloud storage.
其中,超算集群通过运行开发者工具包提供python等编程语言的工具包等开发工具,供用户与tensorflow,pytorch,caffe框架等一起使用,和/或开启新的线程作为数据上传专用线程,以及通过HTTP通信协议和GRPC协议在数据上传专用线程上向云服务器上传深度学习模型训练产生的中间训练数据,上传的中间训练数据类型可以包括向量、标量、图片、视频、音频等,上传的中间训练数据存储在开发者工具包的数据缓存中。Among them, the supercomputing cluster provides development tools such as python and other programming language toolkits by running the developer toolkit for users to use with tensorflow, pytorch, caffe frameworks, etc., and/or open a new thread as a dedicated thread for data uploading, and The intermediate training data generated by the training of the deep learning model is uploaded to the cloud server through the HTTP communication protocol and the GRPC protocol on the dedicated thread for data uploading. The uploaded intermediate training data types can include vectors, scalars, pictures, videos, audios, etc. Data is stored in the developer kit's data cache.
其中,开启新的线程作为数据上传专用线程可以保证深度学习模型的计算速度。Among them, opening a new thread as a dedicated thread for data uploading can ensure the computing speed of the deep learning model.
其中,云存储可选地可以为分布式存储,用于统一管理深度学习模型产生的中间训练数据,为中间训练数据提供海量的存储功能,并且随着中间训练数据的数据量的增长,能够动态扩容以满足中间训练数据的存储需求,为可视化业务的增长提供后续保障。Among them, cloud storage can optionally be distributed storage, which is used for unified management of intermediate training data generated by deep learning models, providing massive storage functions for intermediate training data, and as the amount of intermediate training data increases, it can dynamically The capacity can be expanded to meet the storage requirements of intermediate training data and provide follow-up guarantee for the growth of visualization business.
可选地,云存储还可以是数据块级云存储、文件级云存储、对象级云存储和/或其他形式的云存储。Optionally, the cloud storage may also be data block-level cloud storage, file-level cloud storage, object-level cloud storage and/or other forms of cloud storage.
其中,用户设备可以通过预设的应用软件与云服务器进行通信,也可以通过预设的网络地址与云服务器进行通信。The user equipment may communicate with the cloud server through preset application software, or communicate with the cloud server through a preset network address.
可选地,用户设备可以包括各种具有通信功能的手持设备、车载设备、可穿戴设备、计算设备或连接到无线调制解调器的其他处理设备,以及各种形式的用户设备(User Equipment,UE),移动台(Mobile Station,MS),终端设备(terminal device)等等。Optionally, the user equipment may include various handheld devices with communication functions, vehicle-mounted devices, wearable devices, computing devices or other processing devices connected to the wireless modem, and various forms of user equipment (User Equipment, UE), Mobile station (Mobile Station, MS), terminal device (terminal device) and so on.
请参见图2所示,图2是本申请实施例提供的一种可视化方法的流程示意图,应用于上述用户设备和云服务器,具体包括以下步骤:Please refer to FIG. 2. FIG. 2 is a schematic flowchart of a visualization method provided by an embodiment of the present application, which is applied to the above-mentioned user equipment and cloud server, and specifically includes the following steps:
步骤201:用户设备向云服务器发送可视化请求,所述可视化请求用于请求对目标深度学习模型的目标训练数据进行可视化。Step 201: The user equipment sends a visualization request to the cloud server, where the visualization request is used to request visualization of the target training data of the target deep learning model.
步骤202:云服务器接收来自用户设备的可视化请求,所述可视化请求用于请求对目标深度学习模型的目标训练数据进行可视化。Step 202: The cloud server receives a visualization request from the user equipment, where the visualization request is used to request visualization of the target training data of the target deep learning model.
在一种可能的实现方式中,可视化请求携带目标深度学习模型的训练任务标识,云服务器接收到该可视化请求之后,可以基于目标深度学习模型的训练任务标识,获取该训练任务的相关训练数据。作为一个例子,可以将存储的各个深度学习模型的训练任务的相关训练数据与深度学习模型的训练任务标识关联。例如,可以按照训练任务进行训练数据的存储,不同训练任务的相关训练数据存储在不同位置。再例如,可以按照深度学习模型或者数据类型进行训练数据的存储。In a possible implementation manner, the visualization request carries the training task identifier of the target deep learning model. After receiving the visualization request, the cloud server can obtain the relevant training data of the training task based on the training task identifier of the target deep learning model. As an example, the stored training data related to the training tasks of each deep learning model may be associated with the training task identifier of the deep learning model. For example, training data may be stored according to training tasks, and relevant training data of different training tasks are stored in different locations. For another example, training data may be stored according to deep learning models or data types.
其中,数据类型可以是二维热力图、三位直方图、特征图、调用时序图,标量折线图,有向无环图,等等。Among them, the data type can be a two-dimensional heat map, a three-dimensional histogram, a feature map, a call sequence diagram, a scalar line graph, a directed acyclic graph, and so on.
其中,不同的数据类型对应不同的可视化图标。Among them, different data types correspond to different visualization icons.
其中,训练任务都位于超算集群之上,可以通过利用超算集群的中央处理器来加速训练深度学习模型。Among them, the training tasks are all located on the supercomputing cluster, and the training of the deep learning model can be accelerated by using the central processing unit of the supercomputing cluster.
此时,可选地,可以将训练数据和训练数据所属的训练任务的任务标识关联存储,等等,本公开实施例对此不做限定。At this time, optionally, the training data may be stored in association with the task identifier of the training task to which the training data belongs, and so on, which is not limited in this embodiment of the present disclosure.
在一种可能的实现方式中,可视化请求携带目标深度学习模型的训练任务标识和目标 训练数据的索引信息,该索引信息用于查找该目标训练数据,以确定该目标训练数据的存储位置。该索引信息可以有多种实现方式,例如,索引信息可以包括数据创建时间、数据类型、数据标签等任意一种或多种,再例如,索引信息包括数据类型和数据标签。In a possible implementation, the visualization request carries the training task identification of the target deep learning model and the index information of the target training data, and the index information is used to find the target training data to determine the storage location of the target training data. The index information may be implemented in multiple ways. For example, the index information may include any one or more of data creation time, data type, and data label. For another example, the index information includes data type and data label.
其中,同一数据类型可以有多个数据标签,数据标签可以是自定义的。Among them, the same data type can have multiple data labels, and the data labels can be customized.
举例来说,若数据类型为标量折线图,则数据标签可以是准确值,也可以是损失值,等等。For example, if the data type is a scalar line chart, the data labels can be exact values, loss values, and so on.
其中,可视化请求可以是请求根据训练任务标识和索引信息,在索引信息对应的中间数据集合中采样该中间数据集合中的所有数据或随机采样中间数据集合中的数据。The visualization request may be a request to sample all the data in the intermediate data set corresponding to the index information or randomly sample the data in the intermediate data set according to the training task identifier and the index information.
其中,目标深度学习模型包括至少一个深度神经网络,目标训练数据可以是目标深度学习模型中其中一个深度神经网络产生的中间数据,也可以是目标深度学习模型中多个深度神经网络产生的中间数据。The target deep learning model includes at least one deep neural network, and the target training data may be intermediate data generated by one of the deep neural networks in the target deep learning model, or may be intermediate data generated by multiple deep neural networks in the target deep learning model. .
可选地,可视化请求可以是请求可视化第一时段内目标深度学习模型产生的中间数据,其中,第一时段的时长可以是3min,5min,9min及15min等其他值,第一时刻的终止时刻可以是当前时刻,也可以早于当前时刻,也可以晚于当前时刻。Optionally, the visualization request may be a request to visualize the intermediate data generated by the target deep learning model in the first period of time, wherein the duration of the first period of time may be other values such as 3min, 5min, 9min, and 15min, and the termination time of the first moment may be is the current time, it can be earlier than the current time, or it can be later than the current time.
其中,目标训练数据存储在云存储中。Among them, the target training data is stored in cloud storage.
在一种可能的实现方式中,目标训练数据包括以下至少一种:模型性能变化趋势信息、模型损失趋势信息、模型参数分布信息、模型处理中间结果、模型结构信息、模型训练的当前进度信息、针对同一模型的不同训练的比较信息、多种进程的调度时间信息时序。In a possible implementation manner, the target training data includes at least one of the following: model performance change trend information, model loss trend information, model parameter distribution information, model processing intermediate results, model structure information, current progress information of model training, Comparing information for different trainings of the same model, and time series of scheduling time information for various processes.
在一种可能的实现方式中,在目标训练数据为模型性能变化趋势信息、模型损失趋势信息、模型训练的当前进度信息以及针对同一模型的不同训练的比较信息中至少一种的情况下,可视化信息的类别为标量折线图。In a possible implementation manner, when the target training data is at least one of model performance change trend information, model loss trend information, current progress information of model training, and comparison information for different trainings of the same model, the visualization The category of information is a scalar line chart.
其中,在目标训练数据为模型性能变化趋势信息的情况下,基于标量折线图可以确定性能变化趋势是否满足第一预设条件,在满足第一预设条件的情况下,停止模型训练,在不满足第一预设条件的情况下,继续模型训练。Wherein, in the case where the target training data is the model performance change trend information, it can be determined whether the performance change trend satisfies the first preset condition based on the scalar line graph, and when the first preset condition is met, the model training is stopped, and when the first preset condition is met, the model training is stopped. When the first preset condition is met, model training is continued.
其中,目标训练数据为模型损失趋势信息的情况下,基于标量折线图可以确定损失趋势是否满足第二预设条件,在满足第二预设条件的情况下,停止模型训练,在不满足预设条件的情况下,继续模型训练。Wherein, when the target training data is the model loss trend information, it can be determined whether the loss trend satisfies the second preset condition based on the scalar line graph, and when the second preset condition is met, the model training is stopped, and when the preset condition is not met, the model training is stopped. Continue model training if conditions are met.
其中,目标训练数据为模型训练的当前进度信息的情况下,基于标量折线图可以确定训练进度是否满足预设进度,在满足预设进度的情况下,继续模型训练,在不满足预设条件的情况下,停止模型训练。In the case where the target training data is the current progress information of model training, it can be determined whether the training progress meets the preset progress based on the scalar line graph, and if the preset progress is met, continue model training, and if the preset conditions are not met case, stop model training.
其中,目标训练数据为同一模型的不同训练的比较信息的情况下,基于标量折线图可以确定不同训练提取到的特征是否相同,在特征相同的情况下,继续模型训练,在特征不相同的情况下,停止模型训练。Among them, when the target training data is the comparison information of different trainings of the same model, it can be determined whether the features extracted by different trainings are the same based on the scalar line graph. If the features are the same, continue the model training. If the features are different , stop model training.
其中,在目标训练数据为模型参数分布信息的情况下,可视化信息的类别为三维直方图,通过三维直方图可以确定模型参数分布是否异常,在模型参数分布异常的情况下,停止模型训练。Wherein, when the target training data is model parameter distribution information, the category of visualization information is a three-dimensional histogram, and it can be determined whether the model parameter distribution is abnormal through the three-dimensional histogram. If the model parameter distribution is abnormal, the model training is stopped.
其中,在目标训练数据为模型处理中间结果的情况下,可视化信息的类别为特征图,通过特征图可以确定模型提取到的特征是否正确,在提取到的特征错误的情况下,重新构 建模型。Among them, when the target training data is the intermediate result of the model processing, the category of the visual information is the feature map, and through the feature map, it can be determined whether the features extracted by the model are correct, and if the extracted features are wrong, rebuild the model.
其中,在目标训练数据为模型结构信息的情况下,可视化信息的类别为有向无环图,通过有向无环图可以确定模型的结构是否正确,在模型的结构错误的情况下,重新构建模型的结构。Among them, when the target training data is the model structure information, the category of the visualization information is a directed acyclic graph, and the directed acyclic graph can determine whether the structure of the model is correct, and if the structure of the model is wrong, rebuild the The structure of the model.
其中,有向无环图支持的深度学习模型的架构可以是如开放神经网络交换(Open Neural Network Exchange,ONNX)等标准架构,也可以是其他类型的架构,其中,ONNX不需要对中间训练数据做数据处理得到有向无环图,而某些架构则需要对中间训练数据进行预处理得到有向无环图。Among them, the architecture of the deep learning model supported by the directed acyclic graph can be a standard architecture such as Open Neural Network Exchange (ONNX), or other types of architectures, in which ONNX does not need to perform intermediate training data. Do data processing to obtain a directed acyclic graph, while some architectures need to preprocess the intermediate training data to obtain a directed acyclic graph.
其中,在目标训练数据为多种进程的调度时间信息的情况下,可视化信息的类别为多种进程的调用时序图,通过调用时序图可以确定每个算子的耗时情况,在算子的耗时情况大于预设时长时,对耗时情况大于预设时长的算子进行优化。Among them, when the target training data is the scheduling time information of various processes, the types of visualization information are the calling sequence diagrams of various processes, and the time-consuming situation of each operator can be determined through the calling sequence diagram. When the time consumption is greater than the preset time, the operators whose time consumption is greater than the preset time are optimized.
其中,在目标训练数据为针对模型处理中间结果的情况下,可视化信息的类别为二维热力图,通过热力图可以确定模型提取到的特征是否正确,在提取到的特征错误的情况下,重新构建模型。Among them, in the case where the target training data is an intermediate result of model processing, the category of the visualization information is a two-dimensional heat map. Through the heat map, it can be determined whether the features extracted by the model are correct. Build the model.
其中,特征图和热力图是目标训练数据为模型处理中间结果的不同表现形式。Among them, the feature map and the heat map are different representations of the target training data for the model to process the intermediate results.
其中,目标训练数据的类型包括以下至少一种:标量、向量、图片、视频、音频。The types of target training data include at least one of the following: scalar, vector, picture, video, and audio.
步骤203:云服务器获取所述可视化请求对应的目标训练数据。Step 203: The cloud server obtains target training data corresponding to the visualization request.
在一种可能的实现方式中,所述获取所述可视化请求对应的目标训练数据,包括:In a possible implementation manner, the acquiring target training data corresponding to the visualization request includes:
基于所述可视化请求中携带的训练任务标识和索引信息查找索引数据库,以得到所述目标训练数据的存储位置信息;Search the index database based on the training task identifier and index information carried in the visualization request to obtain the storage location information of the target training data;
基于所述目标训练数据的存储位置信息,获取云存储中存储的所述目标训练数据。The target training data stored in the cloud storage is acquired based on the storage location information of the target training data.
其中,基于训练任务标识和索引信息得到中间训练数据的列表,基于中间训练数据的列表在云存储中获取中间训练数据。Wherein, a list of intermediate training data is obtained based on the training task identification and index information, and the intermediate training data is obtained in cloud storage based on the list of intermediate training data.
可选地,可视化请求携带目标深度学习模型的目标训练数据的标识信息,所述标识信息用于云服务器从云存储中获取目标训练数据;其中,所述标识信息是预设的,所述标识信息在所述云存储中具有唯一性。Optionally, the visualization request carries the identification information of the target training data of the target deep learning model, and the identification information is used by the cloud server to obtain the target training data from the cloud storage; wherein, the identification information is preset, and the identification Information is unique in the cloud storage.
举例来说,目标深度学习模型的训练任务包括训练任务A和训练任务B,训练任务A产生的训练数据包括训练数据A1和训练数据A2,训练任务B产生的训练数据包括训练数据B1和训练数据B2,训练数据A1、训练数据A2、训练数据B1和训练数据B2均存储在云平台中,训练数据A1的标识信息为1,训练数据A2的标识信息为2,训练数据B1的标识信息为3,训练数据B2的标识信息为4,若标识信息为1,则云服务器从云存储中获取的目标训练数据为训练数据A1;若标识信息为2,则云服务器从云存储中获取的目标训练数据为训练数据A2;若标识信息为3,则云服务器从云存储中获取的目标训练数据为训练数据B1;若标识信息为4,则云服务器从云存储中获取的目标训练数据为训练数据B2。For example, the training tasks of the target deep learning model include training task A and training task B, the training data generated by training task A includes training data A1 and training data A2, and the training data generated by training task B includes training data B1 and training data B2, training data A1, training data A2, training data B1 and training data B2 are all stored in the cloud platform, the identification information of training data A1 is 1, the identification information of training data A2 is 2, and the identification information of training data B1 is 3 , the identification information of the training data B2 is 4, if the identification information is 1, the target training data obtained by the cloud server from the cloud storage is the training data A1; if the identification information is 2, the target training data obtained by the cloud server from the cloud storage The data is training data A2; if the identification information is 3, the target training data obtained by the cloud server from the cloud storage is training data B1; if the identification information is 4, the target training data obtained by the cloud server from the cloud storage is training data B2.
步骤204:云服务器对所述目标训练数据进行预处理,得到可视化信息。Step 204: The cloud server preprocesses the target training data to obtain visualization information.
步骤205:云服务器向用户设备发送可视化响应,所述可视化响应携带所述可视化信息。Step 205: The cloud server sends a visualization response to the user equipment, where the visualization response carries the visualization information.
步骤206:用户设备接收云服务器发送的可视化响应,所述可视化响应携带有所述目 标训练数据的可视化信息。Step 206: The user equipment receives a visualization response sent by the cloud server, where the visualization response carries the visualization information of the target training data.
步骤207:用户设备显示所述可视化信息。Step 207: The user equipment displays the visualized information.
在一种可能的实现方式中,可视化信息包括以下类别中的至少一种:有向无环图、三维直方图、多种进程的调用时序图、特征图、二维热力图以及标量折线图。In a possible implementation manner, the visualization information includes at least one of the following categories: directed acyclic graphs, three-dimensional histograms, call sequence diagrams of various processes, feature maps, two-dimensional heat maps, and scalar line graphs.
可选地,在可视化信息的类别为有向无环图的情况下,可视化信息包括以下信息:目标深度学习模型的至少一部分的拓扑信息,其中,所述目标深度学习模型的至少一部分包括:所述目标深度学习模型的多个模块和/或多个算子;所述目标深度学习模型所包括的至少一个算子中每个算子的资源占用信息。Optionally, when the category of the visualization information is a directed acyclic graph, the visualization information includes the following information: topology information of at least a part of the target deep learning model, wherein at least a part of the target deep learning model includes: multiple modules and/or multiple operators of the target deep learning model; resource occupation information of each operator in at least one operator included in the target deep learning model.
可选地,多个模块的拓扑信息包括以下至少一种:所述多个模块的标识信息、所述多个模块之间的依赖关系、所述多个模块中每个模块的数据量大小、每个所述模块包括的至少一个算子的信息。Optionally, the topology information of multiple modules includes at least one of the following: identification information of the multiple modules, dependencies between the multiple modules, data size of each module in the multiple modules, Information of at least one operator included in each of the modules.
其中,模块的标识信息具有唯一性,可以是预设的。The identification information of the module is unique and can be preset.
可选地,算子的信息包括以下至少一种:算子的标识信息,算子之间的依赖关系,算子的数据量大小。Optionally, the information of the operators includes at least one of the following: identification information of the operators, dependencies between operators, and data size of the operators.
其中,依赖关系可以是先后关系,并行关系。Among them, the dependency relationship can be a sequential relationship or a parallel relationship.
可选地,资源占用信息是云服务器基于算子的数据类型、算子的输入数据信息、算子的输出数据信息中的至少一种确定的。Optionally, the resource occupation information is determined by the cloud server based on at least one of the data type of the operator, the input data information of the operator, and the output data information of the operator.
其中,算子的输入可以是图片、视频、音频、标量、向量,等等,不同的输入对应的维度不同,例如图片相当于一个二维的矩阵,音频相当于一个一维的矩阵。The input of the operator can be a picture, video, audio, scalar, vector, etc., and different inputs correspond to different dimensions. For example, a picture is equivalent to a two-dimensional matrix, and an audio is equivalent to a one-dimensional matrix.
其中,经过算子的运算后,算子的输入信息的维度会发生变化。Among them, after the operation of the operator, the dimension of the input information of the operator will change.
其中,算子的数据类型可以是双精度类型,也可以是单精度类型。The data type of the operator can be a double-precision type or a single-precision type.
其中,算子可以是以下至少一种:卷积,批标准化(BatchNorm,BN),全链接,池化,矩阵乘除,丢弃(DropOut),激活等。Among them, the operator can be at least one of the following: convolution, batch normalization (BatchNorm, BN), full link, pooling, matrix multiplication and division, dropout (DropOut), activation, etc.
其中,输入数据信息可以是输入数据量大小,输出数据信息可以是输出数据量大小。The input data information may be the size of the input data, and the output data information may be the size of the output data.
可选地,在可视化信息的类别为标量折线图的情况下,可视化信息包括目标深度模型性能变化趋势信息、目标深度模型损失趋势信息、目标深度学习模型的训练进度信息以及目标深度学习模型中不同训练的比较信息中的至少一种。Optionally, in the case where the category of the visualization information is a scalar line graph, the visualization information includes the performance change trend information of the target deep model, the loss trend information of the target deep model, the training progress information of the target deep learning model, and the difference in the target deep learning model. At least one of the training comparison information.
可选地,在可视化信息的类别为特征图或二维热力图的情况下,可视化信息包括目标深度学习模型提取到的特征。Optionally, when the category of the visualization information is a feature map or a two-dimensional heat map, the visualization information includes features extracted by the target deep learning model.
可选地,在可视化信息的类别为三维直方图的情况下,可视化信息包括多个算子中每个算子的参数分布。Optionally, when the category of the visualization information is a three-dimensional histogram, the visualization information includes the parameter distribution of each operator in the plurality of operators.
可选地,在可视化信息的类别为多种进程的调用时序图的情况下,可视化信息包括多个算子中每个算子在至少一个进程中每个进程的运行时长。Optionally, in the case where the category of the visualization information is the calling sequence diagram of multiple processes, the visualization information includes the running time of each operator in at least one process of the multiple operators.
可选地,进程可以是解释性语言进程、本地进程、AI芯片进程中的至少一种。Optionally, the process may be at least one of an interpreted language process, a local process, and an AI chip process.
其中,解释性语言进程、本地进程、AI芯片进程按照时间交替执行调用时序图。Among them, the interpreted language process, the local process, and the AI chip process alternately execute the calling sequence diagram according to time.
可以看出,在本申请实施例中,用户设备首先向云服务器发送可视化请求,然后接收云服务器发送的可视化响应,可视化响应携带有目标训练数据的可视化信息,最后显示可视化信息,由于通过目标训练数据的可视化信息,可以直观地了解深度学习模型的训练状 态,有助于提升及时判断当前训练策略的可行性,为早停等决策提供依据。It can be seen that, in this embodiment of the present application, the user equipment first sends a visualization request to the cloud server, then receives the visualization response sent by the cloud server, the visualization response carries the visualization information of the target training data, and finally displays the visualization information. The visual information of the data can intuitively understand the training status of the deep learning model, which helps to improve the feasibility of timely judgment of the current training strategy, and provides a basis for decision-making such as early stopping.
在本申请的一实现方式中,所述显示所述可视化信息,包括:In an implementation manner of the present application, the displaying the visualized information includes:
以折叠显示模式或展示显示模式显示所述可视化信息,其中,displaying the visual information in a collapsed display mode or a revealing display mode, wherein,
在所述折叠显示模式中,以所述目标深度学习模型中的模块为最小显示单元,其中,所述模块包括至少一个算子;和/或In the folding display mode, a module in the target deep learning model is used as the minimum display unit, wherein the module includes at least one operator; and/or
在所述展开显示模式中,以所述目标深度学习模型中的算子为最小显示单元。In the expanded display mode, the operator in the target deep learning model is used as the minimum display unit.
其中,在可视化信息的类别是有向无环图的情况下,可以通过折叠显示模式显示可视化信息,也可以通过展开显示模式显示可视化信息,在可视化信息的类别非有向无环图的情况下,通过展开显示模式显示可视化信息。Among them, when the category of the visualization information is a directed acyclic graph, the visualization information can be displayed in the collapsed display mode, and the visualization information can also be displayed in the expanded display mode. , which displays visual information by expanding the display mode.
其中,展开显示模式可以是模块在接收到操作指令后展开显示模块中的算子。The expanded display mode may be an operator in the expanded display module after the module receives an operation instruction.
其中,在可视化信息的类别为三维直方图的情况下,展开显示模式可以是直接显示每个模块中每个算子的参数分布情况。Wherein, when the category of the visualization information is a three-dimensional histogram, the expanded display mode may be to directly display the parameter distribution of each operator in each module.
其中,在可视化信息的类别为调用时序图的情况下,展开显示模式可以是直接显示每个模块中每个算子在至少一个进程中的运行时长。Wherein, when the category of the visual information is a call sequence diagram, the expanded display mode may be to directly display the running time of each operator in each module in at least one process.
举例来说,假设可视化信息的类别为有向无环图,目标深度学习模型中包括2个模块(模块A和模块B),模块A中包括3个算子(A1、A2以及A3),模型B中包括2个算子(B1和B2)。若以折叠显示模式显示可视化信息,则显示模块A和模块B;若以展开显示模式显示可视化信息,模块A接收操作指令后,显示模块A中的算子A1、A2以及A3,模块B接收操作指令后,显示模块B中的算子B1和B2。For example, assuming that the category of visualization information is a directed acyclic graph, the target deep learning model includes 2 modules (module A and module B), and module A includes 3 operators (A1, A2, and A3), the model B includes 2 operators (B1 and B2). If the visualized information is displayed in the folded display mode, module A and module B are displayed; if the visualized information is displayed in the expanded display mode, after the module A receives the operation instruction, the operators A1, A2 and A3 in the module A are displayed, and the module B receives the operation After the instruction, the operators B1 and B2 in module B are displayed.
举例来说,假设可视化信息的类别为三维直方图,目标深度学习模型中包括2个模块(模块A和模块B),模块A中包括3个算子(A1、A2以及A3),模型B中包括2个算子(B1和B2),模块A对应三维直方图1,模块B对应三维直方图B,则三维直方图1中包括A1的参数分布情况、A2的参数分布情况以及A3的参数分布情况,三维直方图2中包括B1的参数分布情况和B2的参数分布情况。For example, assuming that the type of visualization information is a three-dimensional histogram, the target deep learning model includes 2 modules (module A and module B), module A includes 3 operators (A1, A2, and A3), and model B includes Including two operators (B1 and B2), module A corresponds to 3D histogram 1, and module B corresponds to 3D histogram B, then 3D histogram 1 includes the parameter distribution of A1, the parameter distribution of A2 and the parameter distribution of A3 The three-dimensional histogram 2 includes the parameter distribution of B1 and the parameter distribution of B2.
举例来说,假设可视化信息的类别为调用时序图,目标深度学习模型中包括2个模块(模块A和模块B)和2个进程(进程C1和C2),模块A中包括2个算子(A1和A2),模型B中包括2个算子(B1和B2),模块A对应调用时序图1,模块B对应调用时序图B,则调用时序图1中包括A1在进程C1中的运行时长、A1在进程C2中的运行时长,包括A2在进程C1中的运行时长、A2在进程C2中的运行时长;调用时序图2中包括B1在进程C1中的运行时长、B1在进程C2中的运行时长,包括B2在进程C1中的运行时长、B2在进程C2中的运行时长。For example, assuming that the category of visualization information is the call sequence diagram, the target deep learning model includes 2 modules (module A and module B) and 2 processes (process C1 and C2), and module A includes 2 operators ( A1 and A2), model B includes 2 operators (B1 and B2), module A corresponds to call sequence diagram 1, and module B corresponds to call sequence diagram B, then the call sequence diagram 1 includes the running time of A1 in process C1 , the running duration of A1 in process C2, including the running duration of A2 in process C1 and the running duration of A2 in process C2; the calling sequence diagram 2 includes the running duration of B1 in process C1 and the running duration of B1 in process C2 The running duration includes the running duration of B2 in process C1 and the running duration of B2 in process C2.
可选地,可以预先设定可视化信息的展示模式,可以是先使用折叠显示模式显示可视化信息,在第一时长后再使用展开显示模式显示可视化信息,第一时长可以是预设的。Optionally, a display mode of visualized information may be preset, which may be to display the visualized information in a folded display mode first, and then display the visualized information in an expanded display mode after a first duration, which may be preset.
可选地,在可视化信息的类别为标量折线图的情况下,折线图可以是多条,可以在接收到操作指令后缩放或平移,标量折线图展开显示模式可以是直接显示目标深度模型性能变化趋势、目标深度模型损失趋势、目标深度学习模型的训练进度以及目标深度学习模型中不同训练提取到的特征中的至少一种。Optionally, when the type of visualization information is a scalar line graph, there can be multiple line graphs, which can be zoomed or panned after receiving an operation instruction, and the expanded display mode of the scalar line graph can be to directly display the performance change of the target depth model. At least one of the trend, the loss trend of the target deep learning model, the training progress of the target deep learning model, and the features extracted by different trainings in the target deep learning model.
可选地,在可视化信息的类别为特征图或二维热力图的情况下,可展开显示模式可以 是直接显示目标深度学习模型提取到的特征。Optionally, when the category of the visual information is a feature map or a two-dimensional heat map, the expandable display mode may be to directly display the features extracted by the target deep learning model.
其中,在二维热力图中,热度越高代表是特征的概率越大。Among them, in the two-dimensional heat map, the higher the heat, the greater the probability of being a feature.
可以看出,在本申请实施例中,用户设备通过不同的方式展示可视化信息,有利于提升用户设备的应用范围。It can be seen that, in the embodiments of the present application, the user equipment displays visual information in different ways, which is beneficial to improve the application scope of the user equipment.
在本申请的一实现方式中,所述向云服务器发送可视化请求之前,所述方法还包括:In an implementation manner of the present application, before the sending the visualization request to the cloud server, the method further includes:
通过数据上传线程向所述云服务器发送所述目标深度学习模型的中间训练数据,所述中间训练数据用于所述云服务器得到所述目标训练数据的索引信息和训练任务标识,以及基于所述训练任务标识将所述索引信息存储在数据库,基于所述索引信息将所述目标训练数据存储到云存储中。The intermediate training data of the target deep learning model is sent to the cloud server through a data upload thread, and the intermediate training data is used by the cloud server to obtain the index information and training task identifier of the target training data, and based on the The training task identifier stores the index information in a database, and stores the target training data in cloud storage based on the index information.
其中,数据上传线程是新开启的线程。The data uploading thread is a newly opened thread.
其中,训练任务标识具有唯一性,训练任务标识可以是在产生中间训练数据后确定的,也可以是在开始训练是确定的。The training task identification is unique, and the training task identification may be determined after the intermediate training data is generated, or may be determined at the beginning of training.
其中,中间训练数据包括目标训练数据、训练任务标识以及索引信息。The intermediate training data includes target training data, training task identifiers, and index information.
可选地,通过数据上传线程向云服务器发送所述目标深度学习模型的中间训练数据,所述中间数据用于所述云服务器得到所述目标训练数据的标识信息,以及将标识信息存储在数据库中,基于所述标识信息将目标训练数据存储在云存储中。Optionally, the intermediate training data of the target deep learning model is sent to the cloud server through a data upload thread, and the intermediate data is used for the cloud server to obtain the identification information of the target training data, and to store the identification information in a database. , the target training data is stored in cloud storage based on the identification information.
其中,标识信息具有唯一性。Among them, the identification information is unique.
可选地,在通过数据上传线程向云服务器发送所述目标深度学习模型的中间训练数据之前,所述方法还包括:Optionally, before sending the intermediate training data of the target deep learning model to the cloud server through a data upload thread, the method further includes:
确定所述中间训练数据中目标训练数据的数据量;determining the data volume of the target training data in the intermediate training data;
在所述数据量大于预设数据量的情况下,确定训练任务标识和所述目标训练数据的索引信息,以及将所述训练任务信息和所述目标训练数据的索引信息承载在所述中间训练数据上;In the case where the data amount is greater than the preset data amount, determine the training task identifier and the index information of the target training data, and carry the training task information and the index information of the target training data in the intermediate training data;
在所述数据量小于预设数据量的情况下,确定所述目标训练数据的标识信息,所述标识信息承载在所述中间训练数据上。In the case that the data amount is less than the preset data amount, the identification information of the target training data is determined, and the identification information is carried on the intermediate training data.
可选地,在云服务器接收到标识信息的情况下,可以将标识信息存储到数据库的第一缓冲区,以及基于标识信息将目标训练数据存储到所述数据库的第二缓冲区。Optionally, when the cloud server receives the identification information, the identification information may be stored in the first buffer area of the database, and the target training data may be stored in the second buffer area of the database based on the identification information.
可以看出,在本申请实施例中,通过云服务器将目标数据存储在云存储中,有利于云服务器在接收到可视化请求后快速获取目标训练数据。It can be seen that, in the embodiment of the present application, the target data is stored in the cloud storage by the cloud server, which is beneficial for the cloud server to quickly obtain the target training data after receiving the visualization request.
在本申请的一实现方式中,所述对所述目标训练数据进行预处理,得到可视化信息,包括:In an implementation manner of the present application, the preprocessing of the target training data to obtain visualization information includes:
对所述目标训练数据进行至少一项预处理,得到预处理数据;Perform at least one preprocessing on the target training data to obtain preprocessing data;
对所述预处理数据进行渲染处理,得到所述可视化信息。Perform rendering processing on the preprocessed data to obtain the visualization information.
其中,若目标训练数据的数据类型为图片或视频,对图片或视频的预处理包括如下处理之中的一种或多种:不敏感区域剔除处理、图像精度增强处理、图像降噪处理和图像二值化处理。Wherein, if the data type of the target training data is a picture or a video, the preprocessing of the picture or video includes one or more of the following processes: insensitive area removal processing, image precision enhancement processing, image noise reduction processing and image processing Binarization processing.
其中,若目标训练数据的数据类型为文本,对文本预处理包括如下处理之中的一种或多种:文档切分处理、文本分词处理、去停用词(包括标点、数字、单子和其他无意义的 词)处理、文本特征提取、词频统计处理及文本向量化处理。Among them, if the data type of the target training data is text, the text preprocessing includes one or more of the following processes: document segmentation, text segmentation, and removal of stop words (including punctuation, numbers, monads, and other Meaningless words) processing, text feature extraction, word frequency statistics processing and text vectorization processing.
其中,可视化渲染是将预处理数据组装成超文本链接标示语言(Hypertext Markup Language,HTML)。Among them, the visual rendering is to assemble the preprocessed data into a hypertext markup language (Hypertext Markup Language, HTML).
其中,通过预处理得到的可视化信息的类型非有向无环图。Among them, the type of visualization information obtained by preprocessing is non-directed acyclic graph.
可以看出,在本申请实施例中,对目标训练数据进行预处理得到预处理数据,然后对预处理数据进行渲染得到可视化信息,有利于基于可视化信息对目标深度学习模型及进行分析。It can be seen that, in the embodiment of the present application, the target training data is preprocessed to obtain preprocessed data, and then the preprocessed data is rendered to obtain visual information, which is beneficial to the target deep learning model and analysis based on the visual information.
在本申请的一实现方式中,所述对所述目标训练数据进行预处理,得到可视化信息,包括:In an implementation manner of the present application, the preprocessing of the target training data to obtain visualization information includes:
在所述目标深度学习模型的训练任务在目标框架下执行的情况下,通过二叉树对所述目标训练数据进行解析处理,得到所述可视化信息。When the training task of the target deep learning model is executed under the target framework, the target training data is parsed through a binary tree to obtain the visualization information.
其中,通过二叉树对目标训练数据进行处理的目标深度学习模型的类型为parrots。Among them, the type of the target deep learning model that processes the target training data through the binary tree is parrots.
其中,通过二叉树对目标训练数据进行解析处理得到的可视化信息的类型为有向无环图。The type of visualization information obtained by parsing the target training data through a binary tree is a directed acyclic graph.
可以看出,在本申请实施例中,对目标训练数据进行解析处理得到可视化信息,有利于基于可视化信息对目标深度学习模型及进行分析。It can be seen that, in the embodiment of the present application, the visualization information is obtained by analyzing the target training data, which is beneficial to analyze and analyze the target deep learning model based on the visualization information.
请参阅图3,图3是本申请实施例提供的一种计算机设备的结构示意图,如图所示,该计算机设备包括处理器、存储器、通信接口以及一个或多个程序,其中,上述一个或多个程序被存储在上述存储器中,并且被配置由上述处理器执行。Please refer to FIG. 3. FIG. 3 is a schematic structural diagram of a computer device provided by an embodiment of the present application. As shown in the figure, the computer device includes a processor, a memory, a communication interface, and one or more programs, wherein one or more of the above A plurality of programs are stored in the above-mentioned memory, and are configured to be executed by the above-mentioned processor.
在一个实现方式中,计算机设备为用户设备,上述程序包括用于执行以下步骤的指令:In one implementation, the computer device is user equipment, and the above program includes instructions for performing the following steps:
向云服务器发送可视化请求,所述可视化请求用于请求对目标深度学习模型的目标训练数据进行可视化;sending a visualization request to the cloud server, where the visualization request is used to request visualization of the target training data of the target deep learning model;
接收所述云服务器发送的可视化响应,所述可视化响应携带有所述目标训练数据的可视化信息;receiving a visualization response sent by the cloud server, where the visualization response carries the visualization information of the target training data;
显示所述可视化信息。The visualization information is displayed.
可选地,所述可视化信息包括以下类别中的至少一种:有向无环图、三维直方图、多种进程的调用时序图、特征图、二维热力图以及标量折线图;和/或Optionally, the visualization information includes at least one of the following categories: directed acyclic graph, three-dimensional histogram, call sequence diagram of various processes, feature map, two-dimensional heat map, and scalar line graph; and/or
所述目标训练数据包括以下至少一种:模型性能变化趋势信息、模型损失趋势信息、模型参数分布信息、模型处理中间结果、模型结构信息、模型训练的当前进度信息、针对同一模型的不同训练的比较信息、多种进程的调度时间信息。The target training data includes at least one of the following: model performance change trend information, model loss trend information, model parameter distribution information, model processing intermediate results, model structure information, current progress information of model training, and information on different trainings for the same model. Comparison information, scheduling time information of various processes.
可选地,所述可视化信息包括以下至少一种信息:Optionally, the visualization information includes at least one of the following information:
所述目标深度学习模型的至少一部分的拓扑信息,其中,所述目标深度学习模型的至少一部分包括:所述目标深度学习模型的多个模块和/或多个算子;topology information of at least a part of the target deep learning model, wherein at least a part of the target deep learning model includes: multiple modules and/or multiple operators of the target deep learning model;
所述目标深度学习模型所包括的至少一个算子中每个算子的资源占用信息。Resource occupation information of each operator in the at least one operator included in the target deep learning model.
可选地,所述多个模块的拓扑信息包括以下至少一种:Optionally, the topology information of the multiple modules includes at least one of the following:
所述多个模块的标识信息、所述多个模块之间的依赖关系、所述多个模块中每个模块的数据量大小、每个所述模块包括的至少一个算子的信息。The identification information of the multiple modules, the dependencies between the multiple modules, the data size of each module in the multiple modules, and the information of at least one operator included in each of the modules.
可选地,所述资源占用信息是所述云服务器基于所述算子的数据类型、所述算子的输 入数据信息、所述算子的输出数据信息中的至少一种确定的。Optionally, the resource occupation information is determined by the cloud server based on at least one of the data type of the operator, the input data information of the operator, and the output data information of the operator.
可选地,所述可视化请求携带所述目标深度学习模型的训练任务标识和所述目标训练数据的索引信息,其中,所述索引信息包括:数据类型和数据标签。Optionally, the visualization request carries a training task identifier of the target deep learning model and index information of the target training data, where the index information includes: data type and data label.
可选地,在显示所述可视化信息方面,上述程序包括具体用于执行以下步骤指令:Optionally, in terms of displaying the visualized information, the above-mentioned program includes instructions for executing the following steps:
在本公开实施例中,支持以不同模式进行可视化数据的显示,其中,在一些实施例中,可以以折叠显示模式或展示显示模式显示所述可视化信息,或者也可以定义其他显示模式,本公开实施例不限于此。In the embodiments of the present disclosure, the display of visual data in different modes is supported, wherein, in some embodiments, the visual information may be displayed in a folding display mode or a presentation display mode, or other display modes may also be defined. Embodiments are not limited thereto.
在一些实施例中,在所述折叠显示模式中,以所述目标深度学习模型中的模块为最小显示单元,其中,所述模块包括至少一个算子。这里的模块可以是默认的划分方式划分的,或者也可以是用户设置的,本公开实施例对此不做限定。In some embodiments, in the folding display mode, a module in the target deep learning model is used as the smallest display unit, wherein the module includes at least one operator. The modules here may be divided in a default division manner, or may also be set by a user, which is not limited in this embodiment of the present disclosure.
在一些实施例中,在所述展开显示模式中,以所述目标深度学习模型中的算子为最小显示单元。此时,可以显示模型中的所有算子的信息。In some embodiments, in the expanded display mode, the operator in the target deep learning model is used as the minimum display unit. At this point, the information of all operators in the model can be displayed.
在一些实施例中,还可以以混合显示模式进行显示,即对一些模块进行折叠显示,而对另一些模块进行展开显示,其可以可选地基于用户设置进行显示,本公开实施例对此不做限定。In some embodiments, it can also be displayed in a mixed display mode, that is, some modules are folded and displayed, and other modules are displayed in an expanded manner, which can optionally be displayed based on user settings, which is not covered by the embodiments of the present disclosure. Do limit.
可选地,在向云服务器发送可视化请求之前,上述程序包括还用于执行以下步骤指令:Optionally, before sending the visualization request to the cloud server, the above program includes an instruction for performing the following steps:
通过数据上传线程向所述云服务器发送所述目标深度学习模型的中间训练数据,所述中间训练数据用于所述云服务器得到所述目标训练数据的索引信息和训练任务标识,以及基于所述训练任务标识将所述索引信息存储在数据库中,基于所述索引信息将所述目标训练数据存储到云存储中。The intermediate training data of the target deep learning model is sent to the cloud server through a data upload thread, and the intermediate training data is used by the cloud server to obtain the index information and training task identifier of the target training data, and based on the The training task identifier stores the index information in a database, and stores the target training data in cloud storage based on the index information.
在另一个实现方式中,计算机设备为云服务器,上述程序包括用于执行以下步骤的指令:In another implementation, the computer device is a cloud server, and the above program includes instructions for performing the following steps:
接收来自用户设备的可视化请求,所述可视化请求用于请求对目标深度学习模型的目标训练数据进行可视化;receiving a visualization request from the user equipment, where the visualization request is used to request visualization of the target training data of the target deep learning model;
获取所述可视化请求对应的目标训练数据;obtaining target training data corresponding to the visualization request;
对所述目标训练数据进行预处理,得到可视化信息;Preprocessing the target training data to obtain visualization information;
向所述用户设备发送可视化响应,所述可视化响应携带所述可视化信息。A visualization response is sent to the user equipment, and the visualization response carries the visualization information.
可选地,所述可视化信息包括以下类别中的至少一种:有向无环图、三维直方图、多种进程的调用时序图、特征图、二维热力图以及标量折线图;和/或Optionally, the visualization information includes at least one of the following categories: directed acyclic graph, three-dimensional histogram, call sequence diagram of various processes, feature map, two-dimensional heat map, and scalar line graph; and/or
所述目标训练数据包括以下至少一种:模型性能变化趋势信息、模型损失趋势信息、模型参数分布信息、模型处理中间结果、模型结构信息、模型训练的当前进度信息、针对同一模型的不同训练的比较信息、多种进程的调度时间信息。The target training data includes at least one of the following: model performance change trend information, model loss trend information, model parameter distribution information, model processing intermediate results, model structure information, current progress information of model training, and information on different trainings for the same model. Comparison information, scheduling time information of various processes.
可选地,所述可视化信息包括以下至少一种信息:Optionally, the visualization information includes at least one of the following information:
所述目标深度学习模型的至少一部分的拓扑信息,其中,所述目标深度学习模型的至少一部分包括:所述目标深度学习模型的多个模块和/或多个算子;topology information of at least a part of the target deep learning model, wherein at least a part of the target deep learning model includes: multiple modules and/or multiple operators of the target deep learning model;
所述目标深度学习模型所包括的至少一个算子中每个算子的资源占用信息。Resource occupation information of each operator in the at least one operator included in the target deep learning model.
可选地,所述多个模块的拓扑信息包括以下至少一种:Optionally, the topology information of the multiple modules includes at least one of the following:
所述多个模块的标识信息、所述多个模块之间的依赖关系、所述多个模块中每个模块 的数据量大小、每个所述模块包括的至少一个算子的信息。The identification information of the multiple modules, the dependencies between the multiple modules, the data size of each module in the multiple modules, and the information of at least one operator included in each of the modules.
可选地,在对所述目标训练数据进行预处理,得到可视化信息方面,上述程序包括具体用于执行以下步骤指令:Optionally, in terms of preprocessing the target training data to obtain visualization information, the above program includes instructions for executing the following steps:
基于所述目标深度学习模型中至少一个算子的数据类型、所述算子的输入数据信息、所述算子的输出数据信息中的至少一种,确定所述至少一个算子中每个算子的资源占用信息。Determine each operator in the at least one operator based on at least one of the data type of the at least one operator in the target deep learning model, the input data information of the operator, and the output data information of the operator Child resource occupancy information.
可选地,所述可视化请求携带所述目标深度学习模型的训练任务标识、所述目标训练数据的索引信息,所述索引信息包括:数据类型和数据标签;Optionally, the visualization request carries a training task identifier of the target deep learning model and index information of the target training data, where the index information includes: data type and data label;
在获取所述可视化请求对应的目标训练数据方面,上述程序包括具体用于执行以下步骤指令:基于所述可视化请求中携带的训练任务标识和索引信息查找索引数据库,以得到所述目标训练数据的存储位置信息;基于所述目标训练数据的存储位置信息,获取云存储中存储的所述目标训练数据。In terms of acquiring the target training data corresponding to the visualization request, the above-mentioned program includes an instruction for executing the following steps: searching an index database based on the training task identifier and index information carried in the visualization request, so as to obtain the target training data. storage location information; based on the storage location information of the target training data, obtain the target training data stored in the cloud storage.
可选地,在对所述目标训练数据进行预处理,得到可视化信息方面,上述程序包括具体用于执行以下步骤指令:Optionally, in terms of preprocessing the target training data to obtain visualization information, the above program includes instructions for executing the following steps:
对所述目标训练数据进行至少一项预处理,得到预处理数据;Perform at least one preprocessing on the target training data to obtain preprocessing data;
对所述预处理数据进行渲染处理,得到所述可视化信息。Perform rendering processing on the preprocessed data to obtain the visualization information.
可选地,在对所述目标训练数据进行预处理,得到可视化信息方面,上述程序包括具体用于执行以下步骤指令:Optionally, in terms of preprocessing the target training data to obtain visualization information, the above program includes instructions for executing the following steps:
在所述目标深度学习模型的训练任务在目标框架下执行的情况下,通过二叉树对所述目标训练数据进行解析处理,得到所述可视化信息。When the training task of the target deep learning model is executed under the target framework, the target training data is parsed through a binary tree to obtain the visualization information.
需要说明的是,本实施例的具体实现过程可参见上述方法实施例所述的具体实现过程,在此不再叙述。It should be noted that, for the specific implementation process of this embodiment, reference may be made to the specific implementation process described in the foregoing method embodiment, which is not described herein again.
请参阅图4,图4是本申请实施例提供的一种可视化装置,应用于用户设备,该装置包括:Please refer to FIG. 4. FIG. 4 is a visualization apparatus provided by an embodiment of the present application, applied to user equipment, and the apparatus includes:
发送单元401,用于向云服务器发送可视化请求,所述可视化请求用于请求对目标深度学习模型的目标训练数据进行可视化;A sending unit 401, configured to send a visualization request to a cloud server, where the visualization request is used to request visualization of target training data of a target deep learning model;
接收单元402,用于接收所述云服务器发送的可视化响应,所述可视化响应携带有所述目标训练数据的可视化信息;A receiving unit 402, configured to receive a visualization response sent by the cloud server, where the visualization response carries the visualization information of the target training data;
显示单元403,用于显示所述可视化信息。The display unit 403 is configured to display the visualized information.
在本申请的一实现方式中,所述可视化信息包括以下类别中的至少一种:有向无环图、三维直方图、多种进程的调用时序图、特征图、二维热力图以及标量折线图;和/或In an implementation of the present application, the visualization information includes at least one of the following categories: directed acyclic graph, three-dimensional histogram, call sequence diagram of various processes, feature map, two-dimensional heat map, and scalar polyline Figures; and/or
所述目标训练数据包括以下至少一种:模型性能变化趋势信息、模型损失趋势信息、模型参数分布信息、模型处理中间结果、模型结构信息、模型训练的当前进度信息、针对同一模型的不同训练的比较信息、多种进程的调度时间信息。The target training data includes at least one of the following: model performance change trend information, model loss trend information, model parameter distribution information, model processing intermediate results, model structure information, current progress information of model training, and information on different trainings for the same model. Comparison information, scheduling time information of various processes.
在本申请的一实现方式中,所述可视化信息包括以下至少一种信息:In an implementation manner of the present application, the visualized information includes at least one of the following information:
所述目标深度学习模型的至少一部分的拓扑信息,其中,所述目标深度学习模型的至少一部分包括:所述目标深度学习模型的多个模块和/或多个算子;topology information of at least a part of the target deep learning model, wherein at least a part of the target deep learning model includes: multiple modules and/or multiple operators of the target deep learning model;
所述目标深度学习模型所包括的至少一个算子中每个算子的资源占用信息。Resource occupation information of each operator in the at least one operator included in the target deep learning model.
在本申请的一实现方式中,所述多个模块的拓扑信息包括以下至少一种:In an implementation manner of the present application, the topology information of the multiple modules includes at least one of the following:
所述多个模块的标识信息、所述多个模块之间的依赖关系、所述多个模块中每个模块的数据量大小、每个所述模块包括的至少一个算子的信息。The identification information of the multiple modules, the dependencies between the multiple modules, the data size of each module in the multiple modules, and the information of at least one operator included in each of the modules.
在本申请的一实现方式中,所述资源占用信息是所述云服务器基于所述算子的数据类型、所述算子的输入数据信息、所述算子的输出数据信息中的至少一种确定的。In an implementation manner of the present application, the resource occupation information is at least one of the cloud server based on the data type of the operator, the input data information of the operator, and the output data information of the operator definite.
在本申请的一实现方式中,所述可视化请求携带所述目标深度学习模型的训练任务标识和所述目标训练数据的索引信息,其中,所述索引信息包括:数据类型和数据标签。In an implementation manner of the present application, the visualization request carries a training task identifier of the target deep learning model and index information of the target training data, where the index information includes: data type and data label.
在本申请的一实现方式中,在显示所述可视化信息方面,上述显示单元403具体用于:In an implementation manner of the present application, in terms of displaying the visualized information, the above-mentioned display unit 403 is specifically used for:
以折叠显示模式或展示显示模式显示所述可视化信息,其中,displaying the visual information in a collapsed display mode or a revealing display mode, wherein,
在所述折叠显示模式中,以所述目标深度学习模型中的模块为最小显示单元,其中,所述模块包括至少一个算子;和/或In the folding display mode, a module in the target deep learning model is used as the minimum display unit, wherein the module includes at least one operator; and/or
在所述展开显示模式中,以所述目标深度学习模型中的算子为最小显示单元。In the expanded display mode, the operator in the target deep learning model is used as the minimum display unit.
在本申请的一实现方式中,上述发送单元401具体,还用于在向云服务器发送可视化请求之前,通过数据上传线程向所述云服务器发送所述目标深度学习模型的中间训练数据,所述中间训练数据用于所述云服务器得到所述目标训练数据的索引信息和训练任务标识;In an implementation manner of the present application, the above-mentioned sending unit 401 is specifically further configured to send the intermediate training data of the target deep learning model to the cloud server through a data upload thread before sending the visualization request to the cloud server, and the The intermediate training data is used for the cloud server to obtain the index information and training task identifier of the target training data;
所述装置还包括:数据存储单元404,用于基于所述训练任务标识将所述索引信息存储在数据库中,基于所述索引信息将所述目标训练数据存储到云存储中。The apparatus further includes: a data storage unit 404, configured to store the index information in a database based on the training task identifier, and store the target training data in cloud storage based on the index information.
需要说明的是,发送单元401、接收单元402可通过通信接口实现,显示单元403可通过显示屏实现,数据存储单元404可通过处理器屏实现。It should be noted that the sending unit 401 and the receiving unit 402 can be implemented through a communication interface, the display unit 403 can be implemented through a display screen, and the data storage unit 404 can be implemented through a processor screen.
请参阅图5,图5是本申请实施例提供的一种可视化装置,应用于云服务器,该装置包括:Please refer to FIG. 5. FIG. 5 is a visualization device provided by an embodiment of the present application, applied to a cloud server, and the device includes:
接收单元501,用于接收来自用户设备的可视化请求,所述可视化请求用于请求对目标深度学习模型的目标训练数据进行可视化;A receiving unit 501, configured to receive a visualization request from a user equipment, where the visualization request is used to request visualization of target training data of a target deep learning model;
获取单元502,用于获取所述可视化请求对应的目标训练数据;an obtaining unit 502, configured to obtain target training data corresponding to the visualization request;
确定单元503,用于对所述目标训练数据进行预处理,得到可视化信息;Determining unit 503, configured to preprocess the target training data to obtain visualization information;
发送单元504,用于向所述用户设备发送可视化响应,所述可视化响应携带所述可视化信息。The sending unit 504 is configured to send a visualization response to the user equipment, where the visualization response carries the visualization information.
在本申请的一实现方式中,所述可视化信息包括以下类别中的至少一种:有向无环图、三维直方图、多种进程的调用时序图、特征图、二维热力图以及标量折线图;和/或In an implementation of the present application, the visualization information includes at least one of the following categories: directed acyclic graph, three-dimensional histogram, call sequence diagram of various processes, feature map, two-dimensional heat map, and scalar polyline Figures; and/or
所述目标训练数据包括以下至少一种:模型性能变化趋势信息、模型损失趋势信息、模型参数分布信息、模型处理中间结果、模型结构信息、模型训练的当前进度信息、针对同一模型的不同训练的比较信息、多种进程的调度时间信息。The target training data includes at least one of the following: model performance change trend information, model loss trend information, model parameter distribution information, model processing intermediate results, model structure information, current progress information of model training, and information on different trainings for the same model. Comparison information, scheduling time information of various processes.
在本申请的一实现方式中,所述可视化信息包括以下至少一种信息:In an implementation manner of the present application, the visualized information includes at least one of the following information:
所述目标深度学习模型的至少一部分的拓扑信息,其中,所述目标深度学习模型的至少一部分包括:所述目标深度学习模型的多个模块和/或多个算子;topology information of at least a part of the target deep learning model, wherein at least a part of the target deep learning model includes: multiple modules and/or multiple operators of the target deep learning model;
所述目标深度学习模型所包括的至少一个算子中每个算子的资源占用信息。Resource occupation information of each operator in the at least one operator included in the target deep learning model.
在本申请的一实现方式中,所述多个模块的拓扑信息包括以下至少一种:In an implementation manner of the present application, the topology information of the multiple modules includes at least one of the following:
所述多个模块的标识信息、所述多个模块之间的依赖关系、所述多个模块中每个模块 的数据量大小、每个所述模块包括的至少一个算子的信息。The identification information of the multiple modules, the dependencies between the multiple modules, the data size of each module in the multiple modules, and the information of at least one operator included in each of the modules.
在本申请的一实现方式中,在对所述目标训练数据进行预处理,得到可视化信息方面,上述程序包括具体用于执行以下步骤指令:In an implementation manner of the present application, in terms of preprocessing the target training data to obtain visualization information, the above program includes instructions for executing the following steps:
基于所述目标深度学习模型中至少一个算子的数据类型、所述算子的输入数据信息、所述算子的输出数据信息中的至少一种,确定所述至少一个算子中每个算子的资源占用信息。Determine each operator in the at least one operator based on at least one of the data type of the at least one operator in the target deep learning model, the input data information of the operator, and the output data information of the operator Child resource occupancy information.
在本申请的一实现方式中,所述可视化请求携带所述目标深度学习模型的训练任务标识、所述目标训练数据的索引信息,所述索引信息包括:数据类型和数据标签;In an implementation manner of the present application, the visualization request carries a training task identifier of the target deep learning model and index information of the target training data, where the index information includes: data type and data label;
在获取所述可视化请求对应的目标训练数据方面,上述获取单元502具体用于:In terms of acquiring the target training data corresponding to the visualization request, the above acquiring unit 502 is specifically used for:
基于所述可视化请求中携带的训练任务标识和索引信息查找索引数据库,以得到所述目标训练数据的存储位置信息;Search the index database based on the training task identifier and index information carried in the visualization request to obtain the storage location information of the target training data;
基于所述目标训练数据的存储位置信息,获取云存储中存储的所述目标训练数据。The target training data stored in the cloud storage is acquired based on the storage location information of the target training data.
在本申请的一实现方式中,在对所述目标训练数据进行预处理,得到可视化信息方面,上述确定单元503具体用于:In an implementation manner of the present application, in terms of preprocessing the target training data to obtain visualization information, the above determining unit 503 is specifically configured to:
对所述目标训练数据进行至少一项预处理,得到预处理数据;Perform at least one preprocessing on the target training data to obtain preprocessing data;
对所述预处理数据进行渲染处理,得到所述可视化信息。Perform rendering processing on the preprocessed data to obtain the visualization information.
在本申请的一实现方式中,在对所述目标训练数据进行预处理,得到可视化信息方面,上述确定单元503具体用于:In an implementation manner of the present application, in terms of preprocessing the target training data to obtain visualization information, the above determining unit 503 is specifically configured to:
在所述目标深度学习模型的训练任务在目标框架下执行的情况下,通过二叉树对所述目标训练数据进行解析处理,得到所述可视化信息。When the training task of the target deep learning model is executed under the target framework, the target training data is parsed through a binary tree to obtain the visualization information.
需要说明的是,接收单元501和发送单元504可通过通信接口实现,获取单元502和确定单元503可通过处理器实现。It should be noted that, the receiving unit 501 and the sending unit 504 may be implemented by a communication interface, and the acquiring unit 502 and the determining unit 503 may be implemented by a processor.
本申请实施例还提供了一种计算机可读存储介质,其中,所述计算机可读存储介质存储用于电子数据交换的计算机程序,其中,所述计算机程序使得计算机执行如上述方法实施例中电子设备或云服务器所描述的部分或全部步骤。Embodiments of the present application further provide a computer-readable storage medium, wherein the computer-readable storage medium stores a computer program for electronic data exchange, wherein the computer program causes a computer to execute the electronic Some or all of the steps described by the device or cloud server.
本申请实施例还提供了一种计算机程序产品,其中,所述计算机程序产品包括存储了计算机程序的非瞬时性计算机可读存储介质,所述计算机程序可操作来使计算机执行如上述方法中电子设备或云服务器所描述的部分或全部步骤。该计算机程序产品可以为一个软件安装包。Embodiments of the present application further provide a computer program product, wherein the computer program product includes a non-transitory computer-readable storage medium storing a computer program, and the computer program is operable to cause the computer to execute the electronic method as described above. Some or all of the steps described by the device or cloud server. The computer program product may be a software installation package.
本申请实施例所描述的方法或者算法的步骤可以以硬件的方式来实现,也可以是由处理器执行软件指令的方式来实现。软件指令可以由相应的软件模块组成,软件模块可以被存放于随机存取存储器(Random Access Memory,RAM)、闪存、只读存储器(Read Only Memory,ROM)、可擦除可编程只读存储器(Erasable Programmable ROM,EPROM)、电可擦可编程只读存储器(Electrically EPROM,EEPROM)、寄存器、硬盘、移动硬盘、只读光盘(CD-ROM)或者本领域熟知的任何其它形式的存储介质中。一种示例性的存储介质耦合至处理器,从而使处理器能够从该存储介质读取信息,且可向该存储介质写入信息。当然,存储介质也可以是处理器的组成部分。处理器和存储介质可以位于ASIC中。另外,该ASIC可以位于接入网设备、目标网络设备或核心网设备中。当然,处理器和存储介质 也可以作为分立组件存在于接入网设备、目标网络设备或核心网设备中。The steps of the method or algorithm described in the embodiments of the present application may be implemented in a hardware manner, or may be implemented in a manner in which a processor executes software instructions. Software instructions can be composed of corresponding software modules, and software modules can be stored in random access memory (Random Access Memory, RAM), flash memory, read only memory (Read Only Memory, ROM), erasable programmable read only memory ( Erasable Programmable ROM, EPROM), Electrically Erasable Programmable Read-Only Memory (Electrically EPROM, EEPROM), registers, hard disk, removable hard disk, CD-ROM, or any other form of storage medium known in the art. An exemplary storage medium is coupled to the processor, such that the processor can read information from, and write information to, the storage medium. Of course, the storage medium can also be an integral part of the processor. The processor and storage medium may reside in an ASIC. Additionally, the ASIC may reside in access network equipment, target network equipment or core network equipment. Of course, the processor and the storage medium may also exist in the access network device, the target network device or the core network device as discrete components.
本领域技术人员应该可以意识到,在上述一个或多个示例中,本申请实施例所描述的功能可以全部或部分地通过软件、硬件、固件或者其任意组合来实现。当使用软件实现时,可以全部或部分地以计算机程序产品的形式实现。所述计算机程序产品包括一个或多个计算机指令。在计算机上加载和执行所述计算机程序指令时,全部或部分地产生按照本申请实施例所述的流程或功能。所述计算机可以是通用计算机、专用计算机、计算机网络、或者其他可编程装置。所述计算机指令可以存储在计算机可读存储介质中,或者从一个计算机可读存储介质向另一个计算机可读存储介质传输,例如,所述计算机指令可以从一个网站站点、计算机、服务器或数据中心通过有线(例如同轴电缆、光纤、数字用户线(Digital Subscriber Line,DSL))或无线(例如红外、无线、微波等)方式向另一个网站站点、计算机、服务器或数据中心进行传输。所述计算机可读存储介质可以是计算机能够存取的任何可用介质或者是包含一个或多个可用介质集成的服务器、数据中心等数据存储设备。所述可用介质可以是磁性介质(例如,软盘、硬盘、磁带)、光介质(例如,数字视频光盘(Digital Video Disc,DVD))、或者半导体介质(例如,固态硬盘(Solid State Disk,SSD))等。Those skilled in the art should realize that, in one or more of the above examples, the functions described in the embodiments of the present application may be implemented in whole or in part by software, hardware, firmware, or any combination thereof. When implemented in software, it can be implemented in whole or in part in the form of a computer program product. The computer program product includes one or more computer instructions. When the computer program instructions are loaded and executed on a computer, all or part of the processes or functions described in the embodiments of the present application are generated. The computer may be a general purpose computer, special purpose computer, computer network, or other programmable device. The computer instructions may be stored in or transmitted from one computer readable storage medium to another computer readable storage medium, for example, the computer instructions may be downloaded from a website site, computer, server or data center Transmission to another website site, computer, server or data center via wired (eg coaxial cable, optical fiber, Digital Subscriber Line, DSL) or wireless (eg infrared, wireless, microwave, etc.) means. The computer-readable storage medium may be any available medium that can be accessed by a computer or a data storage device such as a server, data center, etc. that includes an integration of one or more available media. The available media may be magnetic media (eg, floppy disks, hard disks, magnetic tapes), optical media (eg, Digital Video Disc (DVD)), or semiconductor media (eg, Solid State Disk (SSD)) )Wait.
以上所述的具体实施方式,对本申请实施例的目的、技术方案和有益效果进行了进一步详细说明,所应理解的是,以上所述仅为本申请实施例的具体实施方式而已,并不用于限定本申请实施例的保护范围,凡在本申请实施例的技术方案的基础之上,所做的任何修改、等同替换、改进等,均应包括在本申请实施例的保护范围之内。The specific embodiments described above further describe in detail the purposes, technical solutions and beneficial effects of the embodiments of the present application. It should be understood that the above descriptions are only specific implementations of the embodiments of the present application, and are not intended to be used for The protection scope of the embodiments of the present application is limited, and any modifications, equivalent replacements, improvements, etc. made on the basis of the technical solutions of the embodiments of the present application should be included within the protection scope of the embodiments of the present application.

Claims (20)

  1. 一种可视化方法,其特征在于,应用于用户设备,所述方法包括:A visualization method, characterized in that, applied to a user equipment, the method comprising:
    向云服务器发送可视化请求,所述可视化请求用于请求对目标深度学习模型的目标训练数据进行可视化;sending a visualization request to the cloud server, where the visualization request is used to request visualization of the target training data of the target deep learning model;
    接收所述云服务器发送的可视化响应,所述可视化响应携带有所述目标训练数据的可视化信息;receiving a visualization response sent by the cloud server, where the visualization response carries the visualization information of the target training data;
    显示所述可视化信息。The visualization information is displayed.
  2. 根据权利要求1所述的方法,其特征在于,所述可视化信息包括以下类别中的至少一种:有向无环图、三维直方图、多种进程的调用时序图、特征图、二维热力图以及标量折线图;和/或The method according to claim 1, wherein the visualization information includes at least one of the following categories: directed acyclic graph, three-dimensional histogram, calling sequence diagram of various processes, feature map, two-dimensional thermal graphs and scalar line graphs; and/or
    所述目标训练数据包括以下至少一种:模型性能变化趋势信息、模型损失趋势信息、模型参数分布信息、模型处理中间结果、模型结构信息、模型训练的当前进度信息、针对同一模型的不同训练的比较信息、多种进程的调度时间信息。The target training data includes at least one of the following: model performance change trend information, model loss trend information, model parameter distribution information, model processing intermediate results, model structure information, current progress information of model training, and information on different trainings for the same model. Comparison information, scheduling time information of various processes.
  3. 根据权利要求1或2所述的方法,所述可视化信息包括以下至少一种信息:The method according to claim 1 or 2, the visualization information includes at least one of the following information:
    所述目标深度学习模型的至少一部分的拓扑信息,其中,所述目标深度学习模型的至少一部分包括:所述目标深度学习模型的多个模块和/或多个算子;topology information of at least a part of the target deep learning model, wherein at least a part of the target deep learning model includes: multiple modules and/or multiple operators of the target deep learning model;
    所述目标深度学习模型所包括的至少一个算子中每个算子的资源占用信息。Resource occupation information of each operator in the at least one operator included in the target deep learning model.
  4. 根据权利要求3所述的方法,其特征在于,所述多个模块的拓扑信息包括以下至少一种:The method according to claim 3, wherein the topology information of the multiple modules includes at least one of the following:
    所述多个模块的标识信息、所述多个模块之间的依赖关系、所述多个模块中每个模块的数据量大小、每个所述模块包括的至少一个算子的信息。The identification information of the multiple modules, the dependencies between the multiple modules, the data size of each module in the multiple modules, and the information of at least one operator included in each of the modules.
  5. 根据权利要求3或4所述的方法,其特征在于,所述资源占用信息是所述云服务器基于所述算子的数据类型、所述算子的输入数据信息、所述算子的输出数据信息中的至少一种确定的。The method according to claim 3 or 4, wherein the resource occupation information is based on the data type of the operator, the input data information of the operator, and the output data of the operator by the cloud server. At least one of the information is determined.
  6. 根据权利要求1至5中任一项所述的方法,其特征在于,所述可视化请求携带所述目标深度学习模型的训练任务标识和所述目标训练数据的索引信息,其中,所述索引信息包括:数据类型和数据标签。The method according to any one of claims 1 to 5, wherein the visualization request carries a training task identifier of the target deep learning model and index information of the target training data, wherein the index information Includes: data types and data labels.
  7. 根据权利要求1-6中任一项所述的方法,其特征在于,所述显示所述可视化信息,包括:The method according to any one of claims 1-6, wherein the displaying the visualized information comprises:
    以折叠显示模式或展示显示模式显示所述可视化信息,其中,displaying the visual information in a collapsed display mode or a revealing display mode, wherein,
    在所述折叠显示模式中,以所述目标深度学习模型中的模块为最小显示单元,其中,所述模块包括至少一个算子;和/或In the folding display mode, a module in the target deep learning model is used as the minimum display unit, wherein the module includes at least one operator; and/or
    在所述展开显示模式中,以所述目标深度学习模型中的算子为最小显示单元。In the expanded display mode, the operator in the target deep learning model is used as the minimum display unit.
  8. 根据权利要求1-7任一项所述的方法,其特征在于,所述向云服务器发送可视化请求之前,所述方法还包括:The method according to any one of claims 1-7, wherein before the sending the visualization request to the cloud server, the method further comprises:
    通过数据上传线程向所述云服务器发送所述目标深度学习模型的中间训练数据,所述中间训练数据用于所述云服务器得到所述目标训练数据的索引信息和训练任务标识,以及基于所述训练任务标识将所述索引信息存储在索引数据库中,基于所述索引信息将所述目标训练数据存储到云存储中。The intermediate training data of the target deep learning model is sent to the cloud server through a data upload thread, and the intermediate training data is used by the cloud server to obtain the index information and training task identifier of the target training data, and based on the The training task identifier stores the index information in an index database, and stores the target training data in cloud storage based on the index information.
  9. 一种可视化方法,其特征在于,应用于云服务器,所述方法包括:A visualization method, characterized in that, applied to a cloud server, the method comprising:
    接收来自用户设备的可视化请求,所述可视化请求用于请求对目标深度学习模型的目标训练数据进行可视化;receiving a visualization request from the user equipment, where the visualization request is used to request visualization of the target training data of the target deep learning model;
    获取所述可视化请求对应的目标训练数据;obtaining target training data corresponding to the visualization request;
    对所述目标训练数据进行预处理,得到可视化信息;Preprocessing the target training data to obtain visualization information;
    向所述用户设备发送可视化响应,所述可视化响应携带所述可视化信息。A visualization response is sent to the user equipment, and the visualization response carries the visualization information.
  10. 根据权利要求9所述的方法,其特征在于,所述可视化信息包括以下类别中的至少一种:有向无环图、三维直方图、多种进程的调用时序图、特征图、二维热力图以及标量折线图;和/或The method according to claim 9, wherein the visualization information includes at least one of the following categories: directed acyclic graph, three-dimensional histogram, calling sequence diagram of various processes, feature map, two-dimensional thermal graphs and scalar line graphs; and/or
    所述目标训练数据包括以下至少一种:模型性能变化趋势信息、模型损失趋势信息、模型参数分布信息、模型处理中间结果、模型结构信息、模型训练的当前进度信息、针对同一模型的不同训练的比较信息、多种进程的调度时间信息。The target training data includes at least one of the following: model performance change trend information, model loss trend information, model parameter distribution information, model processing intermediate results, model structure information, current progress information of model training, and information on different trainings for the same model. Comparison information, scheduling time information of various processes.
  11. 根据权利要求9或10所述的方法,所述可视化信息包括以下至少一种信息:The method according to claim 9 or 10, the visualization information includes at least one of the following information:
    所述目标深度学习模型的至少一部分的拓扑信息,其中,所述目标深度学习模型的至少一部分包括:所述目标深度学习模型的多个模块和/或多个算子;topology information of at least a part of the target deep learning model, wherein at least a part of the target deep learning model includes: multiple modules and/or multiple operators of the target deep learning model;
    所述目标深度学习模型所包括的至少一个算子中每个算子的资源占用信息。Resource occupation information of each operator in the at least one operator included in the target deep learning model.
  12. 根据权利要求11所述的方法,其特征在于,所述多个模块的拓扑信息包括以下至 少一种:The method according to claim 11, wherein the topology information of the plurality of modules comprises at least one of the following:
    所述多个模块的标识信息、所述多个模块之间的依赖关系、所述多个模块中每个模块的数据量大小、每个所述模块包括的至少一个算子的信息。The identification information of the multiple modules, the dependencies between the multiple modules, the data size of each module in the multiple modules, and the information of at least one operator included in each of the modules.
  13. 根据权利要求9至12中任一项所述的方法,其特征在于,所述对所述目标训练数据进行预处理,得到可视化信息,包括:The method according to any one of claims 9 to 12, wherein the preprocessing of the target training data to obtain visualization information includes:
    基于所述目标深度学习模型中至少一个算子的数据类型、所述算子的输入数据信息、所述算子的输出数据信息中的至少一种,确定所述至少一个算子中每个算子的资源占用信息。Determine each operator in the at least one operator based on at least one of the data type of the at least one operator in the target deep learning model, the input data information of the operator, and the output data information of the operator Child resource occupancy information.
  14. 根据权利要求9至13中任一项所述的方法,其特征在于,所述可视化请求携带所述目标深度学习模型的训练任务标识、所述目标训练数据的索引信息,所述索引信息包括:数据类型和数据标签;The method according to any one of claims 9 to 13, wherein the visualization request carries a training task identifier of the target deep learning model and index information of the target training data, and the index information includes: data types and data labels;
    所述获取所述可视化请求对应的目标训练数据,包括:The acquiring target training data corresponding to the visualization request includes:
    基于所述可视化请求中携带的训练任务标识和索引信息查找索引数据库,以得到所述目标训练数据的存储位置信息;Search the index database based on the training task identifier and index information carried in the visualization request to obtain the storage location information of the target training data;
    基于所述目标训练数据的存储位置信息,获取云存储中存储的所述目标训练数据。The target training data stored in the cloud storage is acquired based on the storage location information of the target training data.
  15. 根据权利要求9至14中任一项所述的方法,其特征在于,所述对所述目标训练数据进行预处理,得到可视化信息,包括:The method according to any one of claims 9 to 14, wherein the preprocessing of the target training data to obtain visualization information includes:
    对所述目标训练数据进行至少一项预处理,得到预处理数据;Perform at least one preprocessing on the target training data to obtain preprocessing data;
    对所述预处理数据进行渲染处理,得到所述可视化信息。Perform rendering processing on the preprocessed data to obtain the visualization information.
  16. 根据权利要求9至15中任一项所述的方法,其特征在于,所述对所述目标训练数据进行预处理,得到可视化信息,包括:The method according to any one of claims 9 to 15, wherein the preprocessing of the target training data to obtain visualization information includes:
    在所述目标深度学习模型的训练任务在目标框架下执行的情况下,通过二叉树对所述目标训练数据进行解析处理,得到所述可视化信息。When the training task of the target deep learning model is executed under the target framework, the target training data is parsed through a binary tree to obtain the visualization information.
  17. 一种可视化装置,其特征在于,应用于用户设备,所述装置包括:A visualization device, characterized in that, applied to user equipment, the device comprising:
    发送单元,用于向云服务器发送可视化请求,所述可视化请求用于请求对目标深度学习模型的目标训练数据进行可视化;a sending unit, configured to send a visualization request to the cloud server, where the visualization request is used to request visualization of the target training data of the target deep learning model;
    接收单元,用于接收所述云服务器发送的可视化响应,所述可视化响应携带有所述目标训练数据的可视化信息;a receiving unit, configured to receive a visualization response sent by the cloud server, where the visualization response carries the visualization information of the target training data;
    显示单元,用于显示所述可视化信息。a display unit for displaying the visual information.
  18. 一种可视化装置,其特征在于,应用于云服务器,所述装置包括:A visualization device, characterized in that, applied to a cloud server, the device comprising:
    接收单元,用于接收来自用户设备的可视化请求,所述可视化请求用于请求对目标深度学习模型的目标训练数据进行可视化;a receiving unit, configured to receive a visualization request from the user equipment, where the visualization request is used to request visualization of the target training data of the target deep learning model;
    获取单元,用于获取所述可视化请求对应的目标训练数据;an obtaining unit for obtaining target training data corresponding to the visualization request;
    确定单元,用于对所述目标训练数据进行预处理,得到可视化信息;a determining unit for preprocessing the target training data to obtain visualization information;
    发送单元,用于向所述用户设备发送可视化响应,所述可视化响应携带所述可视化信息。A sending unit, configured to send a visualization response to the user equipment, where the visualization response carries the visualization information.
  19. 一种计算机设备,其特征在于,所述用户设备包括处理器、存储器、通信接口,以及一个或多个程序,所述一个或多个程序被存储在所述存储器中,并且被配置由所述处理器执行,所述程序包括用于执行如权利要求1-8或9-16任一项所述的方法中的步骤的指令。A computer device, characterized in that the user equipment includes a processor, a memory, a communication interface, and one or more programs, the one or more programs being stored in the memory and configured by the The processor executes the program comprising instructions for performing the steps in the method of any of claims 1-8 or 9-16.
  20. 一种计算机可读存储介质,其特征在于,所述计算机可读存储介质存储有计算机程序,其中,所述计算机程序被处理执行如权利要求1-8或9-16任意一项所述的方法。A computer-readable storage medium, characterized in that the computer-readable storage medium stores a computer program, wherein the computer program is processed to execute the method according to any one of claims 1-8 or 9-16 .
PCT/CN2021/082348 2020-07-09 2021-03-23 Visualization method and related device WO2022007434A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
JP2021570729A JP2022543180A (en) 2020-07-09 2021-03-23 Visualization method and related device
KR1020217039065A KR20220011134A (en) 2020-07-09 2021-03-23 Visualization method and related device

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202010656553.8 2020-07-09
CN202010656553.8A CN111797294A (en) 2020-07-09 2020-07-09 Visualization method and related equipment

Publications (1)

Publication Number Publication Date
WO2022007434A1 true WO2022007434A1 (en) 2022-01-13

Family

ID=72810505

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/082348 WO2022007434A1 (en) 2020-07-09 2021-03-23 Visualization method and related device

Country Status (4)

Country Link
JP (1) JP2022543180A (en)
KR (1) KR20220011134A (en)
CN (1) CN111797294A (en)
WO (1) WO2022007434A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114707654A (en) * 2022-06-06 2022-07-05 浙江大学 Algorithm training reasoning performance visualization method and device based on artificial intelligence framework

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111797294A (en) * 2020-07-09 2020-10-20 上海商汤智能科技有限公司 Visualization method and related equipment
CN112199345A (en) * 2020-10-28 2021-01-08 上海大学 Deep learning training real-time visualization method and device
CN112733892A (en) * 2020-12-28 2021-04-30 北京聚云科技有限公司 Data interaction method and device for model training
CN113326113B (en) * 2021-05-25 2022-11-22 北京市商汤科技开发有限公司 Task processing method and device, electronic equipment and storage medium
CN113658289A (en) * 2021-07-30 2021-11-16 国网天津市电力公司 Secondary equipment editing method
CN114357253A (en) * 2021-12-27 2022-04-15 上海商汤科技开发有限公司 Data visualization method and device, electronic equipment and storage medium
CN116360759A (en) * 2023-03-10 2023-06-30 青软创新科技集团股份有限公司 Visual system and method of artificial intelligence algorithm

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2012012194A2 (en) * 2010-07-21 2012-01-26 Microsoft Corporation Smart defaults for data visualizations
CN104615735A (en) * 2015-02-11 2015-05-13 中科九度(北京)空间信息技术有限责任公司 Space-time information visualizing method based on geographic information space system
CN109241141A (en) * 2018-09-04 2019-01-18 北京百度网讯科技有限公司 The training data treating method and apparatus of deep learning
CN111797294A (en) * 2020-07-09 2020-10-20 上海商汤智能科技有限公司 Visualization method and related equipment

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR101719278B1 (en) * 2015-04-14 2017-04-04 (주)한국플랫폼서비스기술 Deep learnig framework and image recognition method for content-based visual image recognition
CN111079892A (en) * 2019-10-30 2020-04-28 华为技术有限公司 Deep learning model training method, device and system
CN111190805A (en) * 2019-12-29 2020-05-22 广东浪潮大数据研究有限公司 Training log visualization system, method and equipment of deep learning training platform

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2012012194A2 (en) * 2010-07-21 2012-01-26 Microsoft Corporation Smart defaults for data visualizations
CN104615735A (en) * 2015-02-11 2015-05-13 中科九度(北京)空间信息技术有限责任公司 Space-time information visualizing method based on geographic information space system
CN109241141A (en) * 2018-09-04 2019-01-18 北京百度网讯科技有限公司 The training data treating method and apparatus of deep learning
CN111797294A (en) * 2020-07-09 2020-10-20 上海商汤智能科技有限公司 Visualization method and related equipment

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114707654A (en) * 2022-06-06 2022-07-05 浙江大学 Algorithm training reasoning performance visualization method and device based on artificial intelligence framework
CN114707654B (en) * 2022-06-06 2022-08-23 浙江大学 Algorithm training reasoning performance visualization method and device based on artificial intelligence framework

Also Published As

Publication number Publication date
JP2022543180A (en) 2022-10-11
KR20220011134A (en) 2022-01-27
CN111797294A (en) 2020-10-20

Similar Documents

Publication Publication Date Title
WO2022007434A1 (en) Visualization method and related device
US20210256320A1 (en) Machine learning artificialintelligence system for identifying vehicles
CN111931064B (en) Information analysis method based on big data and artificial intelligence and cloud service information platform
CN107766940B (en) Method and apparatus for generating a model
US11132114B2 (en) Method and apparatus for generating customized visualization component
CN109862100B (en) Method and device for pushing information
US20170308620A1 (en) Making graph pattern queries bounded in big graphs
CN111967467A (en) Image target detection method and device, electronic equipment and computer readable medium
CN108885633A (en) For finding and being connected to the technology of REST interface automatically
CN113807926A (en) Recommendation information generation method and device, electronic equipment and computer readable medium
US11676345B1 (en) Automated adaptive workflows in an extended reality environment
CN112883088B (en) Data processing method, device, equipment and storage medium
CN117093619A (en) Rule engine processing method and device, electronic equipment and storage medium
CN116956117A (en) Method, device, equipment, storage medium and program product for identifying label
CN113808134B (en) Oil tank layout information generation method, oil tank layout information generation device, electronic apparatus, and medium
WO2021232860A1 (en) Communication method, apparatus and system
WO2015078124A1 (en) Network data processing method and device
CN116263659A (en) Data processing method, apparatus, computer program product, device and storage medium
US10817313B2 (en) Machine learning techniques for adaptation of initial workspace
CN112379967A (en) Simulator detection method, device, equipment and medium
CN112394809A (en) Method, device and communication equipment for realizing augmented reality
CN111814044A (en) Recommendation method and device, terminal equipment and storage medium
US20240127406A1 (en) Image quality adjustment method and apparatus, device, and medium
CN112783986B (en) Object grouping compiling method and device based on label, storage medium and terminal
US11038765B2 (en) Cloud software defined networking application programming interface converter

Legal Events

Date Code Title Description
ENP Entry into the national phase

Ref document number: 2021570729

Country of ref document: JP

Kind code of ref document: A

Ref document number: 20217039065

Country of ref document: KR

Kind code of ref document: A

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21837004

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 21837004

Country of ref document: EP

Kind code of ref document: A1

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 03.07.20023)

122 Ep: pct application non-entry in european phase

Ref document number: 21837004

Country of ref document: EP

Kind code of ref document: A1