WO2023155839A1 - Online learning method and apparatus for ai model, and communication device and readable storage medium - Google Patents

Online learning method and apparatus for ai model, and communication device and readable storage medium Download PDF

Info

Publication number
WO2023155839A1
WO2023155839A1 PCT/CN2023/076492 CN2023076492W WO2023155839A1 WO 2023155839 A1 WO2023155839 A1 WO 2023155839A1 CN 2023076492 W CN2023076492 W CN 2023076492W WO 2023155839 A1 WO2023155839 A1 WO 2023155839A1
Authority
WO
WIPO (PCT)
Prior art keywords
information
model
online learning
following
measurement information
Prior art date
Application number
PCT/CN2023/076492
Other languages
French (fr)
Chinese (zh)
Inventor
贾承璐
孙布勒
王园园
Original Assignee
维沃移动通信有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 维沃移动通信有限公司 filed Critical 维沃移动通信有限公司
Publication of WO2023155839A1 publication Critical patent/WO2023155839A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/14Network analysis or design
    • H04L41/145Network analysis or design involving simulating, designing, planning or modelling of a network
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/14Network analysis or design
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/14Network analysis or design
    • H04L41/147Network analysis or design for predicting network behaviour
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/16Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks using machine learning or artificial intelligence

Definitions

  • the application belongs to the field of electronic information technology, and specifically relates to an AI model online learning method, device, communication device and readable storage medium.
  • AI artificial intelligence
  • neural network neural network, NN
  • decision tree decision tree
  • support vector machine support vector machine
  • GA genetic algorithm
  • the AI model is usually trained offline, and then the trained AI model is deployed in a wireless communication system.
  • the wireless communication environment changes, the accuracy of the output result of the AI model is low. In this way, the calculation accuracy of the AI model is poor.
  • Embodiments of the present application provide an online AI model learning method, device, communication device, and readable storage medium, which can solve the problem of invalidation of the AI model caused by dynamic changes in the wireless communication environment in actual scenarios.
  • an online learning method of an AI model includes: a second device configures a first AI model for a first device; the second device configures online learning information of the first AI model for the first device .
  • an online AI model learning device which includes: a configuration module, wherein: the configuration module is used for the second device to configure the first AI model for the first device; the configuration module also The online learning information is used for the second device to configure the first AI model for the first device.
  • an online learning method of an AI model includes: an acquisition module and an execution module, wherein: the acquisition module is used for the first device to acquire the first AI model; the execution module, The first device performs online learning on the first AI model based on the online learning information of the first AI model.
  • an online learning device for an AI model includes: a first device acquires a first AI model; the first device, based on the online learning information of the first AI model, AI models for online learning.
  • a communication device in a fifth aspect, includes a processor and a memory, the memory stores programs or instructions that can run on the processor, and the programs or instructions are implemented when executed by the processor The steps of the method as described in the first aspect.
  • a communication device including a processor and a communication interface, wherein the processor is configured to configure a first AI model for a first device; and configure online learning of the first AI model for the first device information.
  • a communication device in a seventh aspect, includes a processor and a memory, the memory stores programs or instructions that can run on the processor, and the programs or instructions are implemented when executed by the processor The steps of the method as described in the first aspect.
  • a network side device including a processor and a communication interface, wherein the above-mentioned processor is used to obtain a first AI model, and based on the online learning information of the first AI model, the first AI model Take online learning.
  • a readable storage medium is provided, and programs or instructions are stored on the readable storage medium, and when the programs or instructions are executed by a processor, the steps of the method described in the first aspect are realized, or the steps of the method described in the first aspect are realized, or The steps of the method described in the third aspect.
  • a chip in a tenth aspect, includes a processor and a communication interface, the communication interface is coupled to the processor, and the processor is used to run programs or instructions to implement the method as described in the first aspect , or implement the method described in the third aspect.
  • a computer program/program product is provided, and the computer program/program product is stored in a storage medium
  • the computer program/program product is executed by at least one processor to implement the first aspect, or to implement the steps of the online learning method of the AI model as described in the third aspect.
  • the first device acquires a first AI model, and performs online learning on the first AI model based on online learning information of the first AI model.
  • the first AI model can be continuously adjusted online on the first device side, thereby maintaining The predictive performance of the first AI model, thereby ensuring the service quality of the first device.
  • FIG. 1 is a block diagram of a wireless communication system provided by an embodiment of the present application.
  • Fig. 2 is one of the schematic flow charts of the online learning method of the AI model provided by the embodiment of the present application;
  • Fig. 3 is the second schematic flow diagram of the online learning method of the AI model provided by the embodiment of the present application.
  • Fig. 4 is one of the structural schematic diagrams of the online learning device of the AI model provided by the embodiment of the present application.
  • Fig. 5 is the second structural schematic diagram of the online learning device of the AI model provided by the embodiment of the present application.
  • FIG. 6 is a schematic structural diagram of a communication device provided by an embodiment of the present application.
  • FIG. 7 is a schematic diagram of a hardware structure of a terminal provided by an embodiment of the present application.
  • FIG. 8 is one of the schematic diagrams of the hardware structure of the network side device provided by the embodiment of the present application.
  • FIG. 9 is the second schematic diagram of the hardware structure of the network side device provided by the embodiment of the present application.
  • first, second and the like in the specification and claims of the present application are used to distinguish similar objects, and are not used to describe a specific sequence or sequence. It is to be understood that the terms so used are interchangeable under appropriate circumstances such that the embodiments of the application are capable of operation in sequences other than those illustrated or described herein and that "first" and “second” distinguish objects. It is usually one category, and the number of objects is not limited. For example, there may be one or more first objects.
  • “and/or” in the description and claims means at least one of the connected objects, and the character “/” generally means that the related objects are an "or” relationship.
  • LTE Long Term Evolution
  • LTE-Advanced LTE-Advanced
  • LTE-A Long Term Evolution-Advanced
  • CDMA Code Division Multiple Access
  • TDMA Time Division Multiple Access
  • FDMA Frequency Division Multiple Access
  • OFDMA Orthogonal Frequency Division Multiple Access
  • SC-FDMA Single-carrier Frequency Division Multiple Access
  • system and “network” in the embodiments of the present application are often used interchangeably, and the described technology can be used for the above-mentioned system and radio technology, and can also be used for other systems and radio technologies.
  • the following description describes the New Radio (New Radio, NR) system for example purposes, and uses NR terminology in most of the following descriptions, but these techniques can also be applied to applications other than NR system applications, such as the 6th generation (6th Generation , 6G) communication system.
  • 6G 6th generation
  • Fig. 1 shows a block diagram of a wireless communication system to which the embodiment of the present application is applicable.
  • the wireless communication system includes a terminal 11 and a network side device 12 .
  • the terminal 11 can be a mobile phone, a tablet computer (Tablet Personal Computer), a laptop computer (Laptop Computer) or a notebook computer, a personal digital assistant (Personal Digital Assistant, PDA), a palmtop computer, a netbook, a super mobile personal computer (ultra-mobile personal computer, UMPC), mobile Internet device (Mobile Internet Device, MID), augmented reality (augmented reality, AR) / virtual reality (virtual reality, VR) equipment, robot, wearable device (Wearable Device) , vehicle equipment (VUE), pedestrian terminal (PUE), smart home (home equipment with wireless communication functions, such as refrigerators, TVs, washing machines or furniture, etc.), game consoles, personal computers (personal computers, PCs), teller machines or self-service Wearable devices include: smart watches, smart bracelets, smart headphones, smart glasses, smart jewelry (
  • the network side device 12 may include an access network device or a core network device, where the access network device 12 may also be called a radio access network device, a radio access network (Radio Access Network, RAN), a radio access network function, or Wireless access network unit.
  • RAN Radio Access Network
  • RAN Radio Access Network
  • Wireless access network unit Wireless access network unit
  • the access network device 12 may include a base station, a WLAN access point, or a WiFi node, etc., and the base station may be called a node B, an evolved node B (eNB), an access point, a base transceiver station (Base Transceiver Station, BTS), a radio Base station, radio transceiver, Basic Service Set (BSS), Extended Service Set (ESS), Home Node B, Home Evolved Node B, Transmitting Receiving Point (TRP) or all other in the field
  • eNB evolved node B
  • BTS Base Transceiver Station
  • BTS Base Transceiver station
  • BTS Base Transceiver station
  • radio Base station radio transceiver, Basic Service Set (BSS), Extended Service Set (ESS), Home Node B, Home Evolved Node B, Transmitting Receiving Point (TRP) or all other in the field
  • BSS Basic Service Set
  • ESS Extended Service Set
  • TRP Transmitting Receiving Point
  • the core network equipment may include but not limited to at least one of the following: core network node, core network function, mobility management entity (Mobility Management Entity, MME), access mobility management function (Access and Mobility Management Function, AMF), session management function (Session Management Function, SMF), user plane function (User Plane Function, UPF), policy control function (Policy Control Function, PCF), policy and charging rules function unit (Policy and Charging Rules Function, PCRF), edge application service Discovery function (Edge Application Server Discovery Function, EASDF), unified data management (Unified Data Management, UDM), unified data storage (Unified Data Repository, UDR), home subscriber server (Home Subscriber Server, HSS), centralized network configuration ( Centralized network configuration, CNC), network storage function (Network Repository Function, NRF), network exposure function (Network Exposure Function, NEF), local NEF
  • AI Artificial Intelligence
  • Machine learning is an important branch of artificial intelligence, which mainly studies how to make computers have the ability to learn by themselves.
  • Machine learning algorithms include neural network (neural network, NN) decision tree (decision tree, DT), support vector machine (support vector machine, SVM), genetic algorithm (genetic algorithm, GA) and so on.
  • Neural network A neural network consists of a large number of nodes, which are called neurons. Among them, the composition information of neurons includes: input (a1, a2,...aK) weight/multiplicative coefficient (w), bias/additive coefficient (b), activation function ( ⁇ (.)). Common activation functions include Sigmoid, tanh, ReLU (Rectified Linear Unit), linear rectification function, corrected linear unit) and so on.
  • the parameters of the neural network can be optimized by gradient optimization algorithm.
  • the gradient optimization algorithm is a class of algorithms that minimize or maximize an objective function (sometimes called a loss function), and the objective function is often a mathematical combination of model parameters and data. For example, given data X and its corresponding label Y, after constructing a neural network model f(.), with the model, the predicted output f(x) can be obtained according to the input x, and the predicted value and the real value can be calculated The gap between (f(x)-Y), which is the loss function. If a suitable W,b is found to minimize the value of the above loss function, the smaller the loss value, the closer the model is to the real situation.
  • BP error Back Propagation, error back propagation
  • the basic idea of the BP algorithm is that the learning process consists of two processes: the forward propagation of the signal and the back propagation of the error.
  • the input samples are passed in from the input layer, processed layer by layer by each hidden layer, and passed to the output layer. If the actual output of the output layer does not match the expected output, it will enter the error backpropagation stage.
  • Error backpropagation is to transmit the output error layer by layer through the hidden layer to the input layer in some form, and distribute the error to all the units of each layer, so as to obtain the error signal of each layer unit, and this error signal is used as the correction unit Basis for weight.
  • This weight adjustment process of each layer of signal forward propagation and error back propagation is carried out repeatedly.
  • the process of continuously adjusting the weights is also the learning and training process of the network. This process has been carried out until the error of the network output is reduced to an acceptable level, or until the preset number of learning times.
  • common optimization algorithms include gradient descent (Gradient Descent), stochastic gradient descent (Stochastic Gradient Descent, SGD), mini-batch gradient descent (mini-batch gradient descent), momentum method (Momentum), stochastic gradient descent with momentum (Nesterov), adaptive gradient descent (ADAptive GRADient descent, Adagrad), Adadelta, root mean square error deceleration (root mean square prop, RMSprop), adaptive momentum estimation (Adaptive Moment Estimation, Adam), etc.
  • the above optimization algorithm is based on the error/loss obtained by the loss function when the error is backpropagated, and calculates the derivative/partial derivative of the current neuron, plus the learning rate, the previous gradient/derivative/partial derivative, etc., Get the gradient and pass the gradient to the previous layer.
  • the current AI research in the field of wireless communication mainly focuses on offline learning and deployment. Since the wireless environment is constantly changing, the fixed AI model obtained through offline training will gradually fail in the dynamic environment. How to improve the model in the new changing environment The adaptability among them has become an urgent problem to be solved.
  • This application is an example to solve the above problems, and proposes an online learning method for AI models.
  • the realization of online learning has the following difficulties: 1) limited by the storage capacity and data collection capacity of the device (for example, the time cost and hardware cost of collecting data are relatively high), it is usually difficult to obtain a large enough data set for online training; 2) Limited by the computing power of the device and the limited data set, it may not be possible to perform multiple rounds of model fine-tuning or over-fitting will result after multiple rounds of model fine-tuning; 3)
  • For wireless communication there are also communication delay limitations and communication The problem of continuity, which puts forward requirements on the time of the first equipment data collection and the time of online learning.
  • FIG. 2 shows a flow chart of an online learning method for an AI model provided by an embodiment of the present application.
  • the online learning method of the AI model provided by the example may include the following steps 201 and 202:
  • Step 201 the second device configures the first AI model for the first device.
  • the above-mentioned second device may include at least one of the following: core network equipment, access network equipment, and terminal;
  • the above-mentioned first device may include at least one of the following: core network equipment, access network equipment, and terminal .
  • the second device is a core network device, and correspondingly, the first device may be an access network device or a terminal.
  • the second device is an access network device, and correspondingly, the first device may be a core network device or a terminal.
  • the second device is a terminal, and correspondingly, the first device may be a core network device or an access network device.
  • the above-mentioned first AI model is an AI model obtained by offline training on the second device side.
  • the algorithm of the first AI model may include at least one of the following: a neural network, a decision tree, a support vector machine, and a Bayesian classifier.
  • the above-mentioned first AI model may be an AI model used for terminal positioning, network optimization, processing of large input data sets, and network recommendation for users.
  • the second device may train the AI model based on a preset learning framework, so as to obtain the above-mentioned first AI model.
  • the first AI model is a neural network model as an example.
  • the second device can train the neural network model based on a preset learning framework to obtain the first neural network model.
  • the second device may send the trained first AI model to the first device, and deploy the first AI model on the side of the first device.
  • Step 202 the second device configures online learning information of the first AI model for the first device.
  • the second device can send the information required for the online learning of the first AI model to the first device, and the first device can, based on the online learning information sent by the second device, The learning information performs online learning on the first AI model.
  • the online learning information is configured by the network device, or determined independently by the second device.
  • the foregoing first device may be a terminal
  • the second device may be a core network device.
  • the core network device sends the first AI model to the terminal, and sends the online learning information of the first AI model to the terminal, and the terminal receives the first AI model sent by the core network device, and the online learning information of the first AI model information.
  • the second device configures the first AI model for the first device, and configures the online learning information of the first AI model for the first device.
  • the first AI model can be continuously adjusted online on the first device side, thereby maintaining The predictive performance of the first AI model, thereby ensuring the service quality of the first device.
  • the online learning information includes at least one of the following:
  • the above-mentioned triggering manner is related to the state information of the first device and the channel information of a channel related to the first device. For example, when the moving speed of the first device is fast, or when the channel environment of the working channel of the first device changes, online learning of the first AI model may be triggered, so that the first AI model continuously adapts to changing environment.
  • the first device may suspend the online learning of the first AI model when the number of online learning of the first AI model is greater than the preset number of iterations; or, when the first AI model reaches the preset accuracy , suspending the online learning of the first AI model; or, in the case that the error information of the output result of the first AI model is small, suspending the online learning of the first AI model. Since the number of online learning of the first AI model is greater than the preset number of iterations, or the accuracy of the first AI model reaches the preset accuracy, indicating that the current first AI model is valid, the online learning of the first AI model can be terminated, to save power consumption.
  • the trigger condition corresponding to the above trigger mode includes at least one of the following:
  • the state information of the above-mentioned first device satisfies a first preset condition
  • the amount of data collected by the first device is greater than a first threshold
  • the measurement information of the above-mentioned first device satisfies a second preset condition
  • the error information of the output result of the first AI model is greater than the second threshold
  • the statistical information of the first information corresponding to the above-mentioned first AI model satisfies a third preset condition
  • the above status information includes at least one of the following: moving speed, beam switching information, and cell switching information.
  • the amount of data collected by the above-mentioned first device may be the amount of online data collected by the first device, for example, the first device collects in real time received channel information.
  • the first device when the first device is moving fast, the first device is triggered to learn the first AI model online; or, when the amount of data collected by the first device is large, the first device is triggered to learn the first AI model.
  • the AI model performs online learning; or, when the measurement information of the first device indicates that the channel environment of the current channel changes, the first device is triggered to perform online learning on the first AI model; or, when the output result of the first AI model
  • the error of the first AI model is relatively large or the accuracy of the first AI model is low, the first device is triggered to perform online learning on the first AI model.
  • the first AI model may fail.
  • the first device can learn the first AI model online based on information such as its own moving speed, the channel environment of the relevant channel, and the accuracy value of the AI model when the first AI model fails, thereby improving the first AI model.
  • the prediction accuracy of the AI model is a measure of the accuracy of the AI model.
  • the channel information may include at least one of the following: signal launch angle information, signal arrival angle angle information, signal delay information in the channel, signal quality in the channel, and so on.
  • the above-mentioned first threshold may be 3000, 5000 or 7000 and so on.
  • the measurement information includes at least one of the following: first measurement information of a reference signal received by the first device, and second measurement information collected by a sensor of the first device.
  • the first measurement information includes at least one of the following: instantaneous measurement information of the reference signal, and statistical measurement information of the reference signal.
  • the instantaneous measurement information of the reference signal may be: measurement information of the reference signal at a specific moment; the statistical measurement information of the reference signal may be: measurement information of the reference signal within a period of time.
  • the aforementioned reference signal includes at least one of the following: a synchronization signal block SSB, a CSI reference signal CSI-RS, a sounding reference signal SRS, and a positioning reference signal PRS.
  • the aforementioned sensors may include at least one of the following: vision sensors, radar sensors, position sensors and the like.
  • the above-mentioned first information includes at least one of the following: input information of the above-mentioned first AI model, and output information of the above-mentioned first AI model.
  • the first information in the case where the first information includes the input information of the first AI model, the first information may be the working channel or the channel information of the surrounding channel of the terminal. In the first information In the case where the output information of the first AI model is included, the first information may be location information of the terminal.
  • the statistical information of the above-mentioned first information includes at least one of the following:
  • the above-mentioned second statistic is calculated based on the statistic in each second time window.
  • the above statistical information may include at least one of the following: mean value, variance and so on.
  • the foregoing statistical information may include temporal statistical information and spatial statistical information.
  • the statistical information on time may be: statistical information on channels of the same terminal within a continuous period of time
  • the statistical information on space may be statistical information on channels of multiple different terminals under one cell.
  • the statistical information of the above-mentioned first information is used to represent whether the wireless network environment within the action area of the first AI model changes.
  • the first information is channel information as an example.
  • the average value of channel information in a certain continuous time window is less than a certain threshold, or the correlation index of channel information in two time windows is lower than a certain threshold, or, the correlation between the front and rear data in the current time window is lower than
  • a certain threshold if the channel environment representing the relevant channel of the first device changes, the first device can be triggered to perform online learning of the first AI model to adapt to the current channel environment, thereby improving the prediction accuracy of the first AI model .
  • the statistical information of the above-mentioned first information is explained below by taking the first information as channel information as an example.
  • the statistical information of the first information when the statistical information of the first information includes the first statistical quantity of the first information in the first time window, the statistical information of the first information may be: The mean or variance of the channel information.
  • the above-mentioned statistical information of the first information may be: The mean value determined by the mean value of the channel information in each consecutive time window in consecutive time windows, for example, the mean value of the channel information in time window 1 is a, the mean value of channel information in time window 2 is b, and the mean value of channel information in time window 3 is The mean value of is c, then the statistical information of the channel information is the mean value of a, b and c.
  • the statistical information of the first information when the statistical information of the first information includes the statistical information of the first information of at least two terminals under the first cell at the first moment, the statistical information of the above-mentioned first information may be:
  • the mean value of the channel information of different terminals at a certain moment for example, the mean values of the channel information of terminal A, terminal B and terminal C in the same cell at time 1 are d, e and f respectively, then the statistical information of the channel information is Means of d, e and f.
  • the statistical information of the first information includes the correlation information of the first information
  • the statistical information of the above-mentioned first information It may be: the correlation index of the channel information of the two time windows is lower than a certain threshold, for example, the distance between the previous and subsequent data in the current time window is smaller than a certain threshold.
  • the statistical information of the measurement information of the first device includes at least one of the following:
  • the third statistic of the measurement information in the third time window the fourth statistic of the measurement information corresponding to at least two consecutive fourth time windows, and the correlation information of the measurement information.
  • the above fourth statistic is calculated based on the statistic in each fourth time window.
  • the above correlation information includes at least one of the following: distance between data, covariance, and correlation coefficient.
  • the measurement information is channel measurement information of a reference signal received by the first device as an example.
  • the statistical information of the above measurement information may be: the variance of the channel measurement information in a certain continuous time window, or the average of the mean values of the channel measurement information in two consecutive time windows, or the channel measurement information in the current time window distance between the data.
  • the above statistical information of the measurement information is used to represent whether the wireless network environment within the active area of the first AI model changes.
  • the measurement information is channel information as an example.
  • the mean value of the measurement information in a continuous time window is less than a certain threshold, or the correlation index of the measurement information in two time windows is lower than a certain threshold, or, the correlation between the front and rear data in the current time window is lower than
  • a certain threshold if the wireless environment representing the role of the first AI model changes, the first device can be triggered to perform online learning of the first AI model to adapt to the current wireless environment, thereby improving the prediction accuracy of the first AI model .
  • the trigger condition includes: the status information of the first device meets a first preset condition; the satisfaction of the first preset condition includes at least one of the following:
  • the moving speed of the first device is greater than a third threshold
  • the beam switching information indicates that beam switching occurs to the first device, and the beam switching frequency is greater than a fourth threshold
  • the above cell switching information indicates that the cell switching occurs to the first device.
  • the above-mentioned third threshold may be 60km/h, 80km/h, 100km/h and so on.
  • the above-mentioned trigger condition includes: the measurement information of the above-mentioned first device meets the second preset condition; optionally, the above-mentioned meeting the second preset condition includes: the measurement information of the first device It indicates that the channel environment of the relevant channel of the first device changes.
  • the measurement information is a reference signal received by the first device as an example.
  • the above-mentioned second preset condition may be: the first device estimates the downlink channel according to the measurement of CSI-RS, and detects that the channel environment changes, such as changing from a line of sight (LOS) environment to a non-line of sight (not line of sight) environment. sight, NLOS) environment, such as the signal-to-noise ratio SINR is lower than a certain threshold and so on.
  • LOS line of sight
  • NLOS non-line of sight
  • SINR signal-to-noise ratio
  • the measurement information is the measurement information collected by the sensor of the first device as an example.
  • the above-mentioned second preset condition may be: the measurement information obtained by the visual sensor indicates that the first device is in an LOS environment.
  • the above-mentioned trigger conditions include: the statistical information of the first information corresponding to the above-mentioned first AI model satisfies a third preset condition; the above-mentioned meeting the third preset condition includes at least one of the following:
  • the above-mentioned first statistic is greater than the maximum value of the first threshold interval
  • the above-mentioned second statistic is greater than the maximum value of the second threshold interval
  • the correlation information of the first information collected in at least two time windows satisfies the first condition
  • Correlation information between different first pieces of information collected within the current time window satisfies the second condition.
  • the first statistic is an average value of channel information of the terminal within a certain continuous time window.
  • the foregoing third preset condition may be: the first device detects that the statistics of channel information in a certain continuous time window exceed the maximum value of a certain threshold interval.
  • the second statistic is an average value of average values of channel information of the terminal in multiple consecutive time windows as an example.
  • the foregoing third preset condition may be: the first device detects that the average value of the average values of the channel information in multiple consecutive time windows exceeds the maximum value of a certain threshold interval.
  • the first information is channel information as an example.
  • the above-mentioned third preset condition may be: the correlation index of the channel information in the two time windows is lower than a certain threshold, or the correlation of the previous and subsequent data in the current time window is lower than a certain threshold.
  • the trigger condition includes: the statistical information of the measurement information of the first device satisfies a fourth preset condition; the satisfaction of the fourth preset condition includes at least one of the following:
  • the above third statistic is greater than the maximum value of the third threshold interval
  • the above fourth statistic is greater than the maximum value of the fourth threshold interval
  • the correlation information of the measurement information collected in at least two time windows satisfies the third condition
  • the difference between the distribution of the measurement information and the reference distribution is greater than a fifth threshold, and the reference distribution is information configured by the second device for the first device.
  • the above-mentioned fourth preset condition It may be: the mean value of the channel measurement information in a certain continuous time window exceeds the maximum value of a certain threshold interval.
  • the above fourth preset condition may be: based on the average value of the channel measurement information in each fourth time window
  • the mean calculated by the mean exceeds the maximum value of a certain threshold interval.
  • the above third condition may be: the covariance of the data of the measurement information respectively collected in at least two time windows is smaller than a certain threshold.
  • the above fourth condition may be: the distance between different data of the measurement information collected within the current time window is smaller than a certain threshold.
  • the above-mentioned distribution of the measurement information is a statistical distribution of the measurement information.
  • the above reference distribution is the statistical distribution of the first AI model. It can be understood that the training set for offline training of the first AI model obeys the benchmark distribution, and the performance of the first AI model is best when the measurement information obeys the benchmark distribution.
  • the index describing the difference between the distribution of the measurement information and the reference distribution may include at least one of the following: Wasserstein distance; Kullback-Leibler divergence; Hellinger distance and the like.
  • the trigger condition of the trigger mode includes at least one of the following:
  • the second device instructs the first device to perform online learning
  • the output accuracy of the first AI model is less than or equal to the sixth threshold.
  • the above-mentioned second device instructs the first device to perform online learning, including at least one of the following:
  • the second device instructs the first device to perform online learning periodically
  • the second device instructs the first device to conduct online learning semi-periodically
  • the second device instructs the first device to perform online learning aperiodically.
  • the cycle adopted by the above-mentioned first device to perform online learning periodically or semi-periodically is: a cycle pre-configured by the above-mentioned second device, or a cycle independently configured by the above-mentioned first device.
  • the above-mentioned second device instructs the first device to perform online learning half-periodically, including:
  • the second device instructs the first device to perform online learning semi-periodically through the first signaling, and the first signaling includes at least one of the following: medium access control-control element MAC-CE, downlink control information DCI.
  • the first signaling includes at least one of the following: medium access control-control element MAC-CE, downlink control information DCI.
  • the above suspension conditions include at least one of the following:
  • the number of online learning of the first AI model is greater than the preset number of iterations
  • the first AI model reaches the preset accuracy
  • the error information of the output result of the first AI model is smaller than the seventh threshold
  • the second device abruptly instructs the first device to end the current online learning process
  • the difference information between the measurement information distribution of the first device and the reference distribution is smaller than the eighth threshold.
  • the target task associated with the first AI model may be a task currently performed by the first AI model, such as locating a terminal, making network recommendations for a user, and so on.
  • the above parameter configuration information includes at least one of the following:
  • Composition information of the first data set of the first AI model
  • the above-mentioned first data set is at least one of the following: the original data set used by the above-mentioned first AI model, the data set newly collected by the above-mentioned first AI model; the parameter information of the above-mentioned reference distribution includes at least one of the following: variance, mean , standard deviation; the size of the above sample batch refers to the size of the number of samples included in a sample batch (Batch).
  • the above-mentioned original data set is: a data set used for offline training of the first AI model (that is, an old data set), and the above-mentioned newly collected data set is: deploying the first AI model online on the first device side, That is, after the first AI model is configured for the first device, the data set (ie, the new data set) collected by the first device in a new environment.
  • the above-mentioned online learning mode includes any one of the following: an instantaneous training mode (ie, One-shot mode), and a continuous learning mode.
  • an instantaneous training mode ie, One-shot mode
  • a continuous learning mode the first device performs online learning when the collected data reaches a specified amount
  • the continuous learning mode the first device continuously performs online learning as the amount of collected data increases.
  • the size of the above batch (Batch) is N, and N is a positive integer.
  • the state of the optimizer of the above-mentioned first AI model may include a loss function, a learning rate, and the like.
  • the first data set division method may include division ratios of the training set, verification set, and test set, and the like.
  • composition information of the above data sets includes the ratio between the number of original data sets and the number of newly collected data sets. It should be noted that the use of the original data set during training in the embodiment of the present application can effectively prevent over-fitting of newly collected data during the online learning process, thereby effectively improving the performance of the AI model.
  • the contribution weight of the first data set to the update of the first AI model may be: the contribution weights of the old data set and the new data set to the update of the first AI model.
  • the first AI model when it performs online learning, it may be Assign smaller weights to the original dataset and larger weights to the new dataset.
  • the parameter configuration information includes an online learning mode of the first AI model, and the online learning mode is an instantaneous training mode, and the parameter configuration information further includes at least one of the following: the first device The amount of data collected, and the length of time for collecting the above data amount.
  • the parameter configuration information includes the online learning mode of the first AI model, and the online learning mode is a continuous learning mode, and the parameter configuration information further includes at least one of the following: The time interval between two online learning, the data volume interval between two adjacent online learning.
  • the data volume interval between two adjacent online learning sessions may be 100, for example, online training is performed every time 100 sets of data are collected.
  • Fig. 3 shows a flowchart of an online learning method for an AI model provided by an embodiment of the present application.
  • the online learning method of the AI model provided by the embodiment of the present application may include the following steps 301 and 302:
  • Step 301 the first device acquires a first AI model.
  • the above-mentioned first AI model is an AI model obtained by offline training on the second device side.
  • the algorithm of the first AI model may include at least one of the following: a neural network, a decision tree, a support vector machine, and a Bayesian classifier.
  • the above-mentioned first AI model may be an AI model used for terminal positioning, network optimization, processing of large input data sets, and network recommendation for users.
  • the second device may train the AI model based on a preset learning framework, so as to obtain the above-mentioned first AI model.
  • the first AI model is a neural network model as an example.
  • the second device can train the neural network model based on a preset learning framework to obtain the first neural network model.
  • Step 302 The first device performs online learning on the first AI model based on the online learning information of the first AI model.
  • the first device may perform online learning on the first AI model obtained through offline training based on the online learning information of the first AI model, and obtain the first AI model after parameter adjustment.
  • the online learning information is information determined by the first device.
  • the first device acquires the first AI model, and performs online learning on the first AI model based on the online learning information of the first AI model.
  • the first AI model can be continuously adjusted online on the first device side, thereby maintaining The predictive performance of the first AI model, thereby ensuring the service quality of the first device.
  • the online learning information includes at least one of the following:
  • the above step 301 may include the following step 301a:
  • Step 301a the first device receives the first AI model configured by the second device.
  • the above-mentioned second device includes at least one of the following: core network equipment, access network equipment, and terminal;
  • the above-mentioned first device includes at least one of the following: core network equipment, access network equipment equipment, and terminals.
  • the second device may send the first AI model obtained through offline training to the first device, and the first device may receive the first AI model sent by the second device.
  • the online learning method provided in the embodiment of the present application further includes the following step A1:
  • Step A1 the first device obtains the online learning information of the first AI model from the second device.
  • the first device can receive the online learning information sent by the second device, and based on the online learning information, can The first AI model for online learning.
  • the above-mentioned first device may be a terminal
  • the second device may be a core network device.
  • the core network device sends the first AI model to the terminal, and sends the online learning information of the first AI model to the terminal, and the terminal receives the first AI model sent by the core network device, and the online learning information of the first AI model information.
  • the second device configures the first AI model for the first device, and configures the parameters required for online learning of the first AI model, so that the first AI model can be continuously adjusted online on the first device side, thereby maintaining The predictive performance of the first AI model, thereby ensuring the service quality of the first device.
  • the online learning method provided in the embodiment of the present application further includes the following step 303:
  • Step 303 The first device configures online learning information of the first AI model for the third device.
  • the third device includes at least one of the following: a core network device, an access network device, and a terminal.
  • the above-mentioned first device may be a core network device, and the third device may be a terminal.
  • the core network device sends the first AI model to the terminal, and sends the online learning information of the first AI model to the terminal, and the terminal receives the first AI model sent by the core network device, and the online learning information of the first AI model information.
  • the above-mentioned second device can autonomously execute the online learning method of the AI model, or, the second device can deploy the AI model to the first device, and configure the information required for the online learning of the AI model for the first device, by The first device executes the online learning of the AI model, or the first device can deploy the AI model to the third device, and configure the information required for the online learning of the AI model for the first device, and the third device executes the online learning of the AI model. study.
  • the trigger condition corresponding to the above trigger mode includes at least one of the following:
  • the state information of the first device satisfies a first preset condition
  • the amount of data collected by the first device is greater than a first threshold
  • the measurement information of the first device satisfies a second preset condition
  • the error information of the output result of the first AI model is greater than the second threshold
  • the statistical information of the first information corresponding to the first AI model satisfies a third preset condition
  • the above state information includes at least one of the following: moving speed, beam switching information, and cell switching information.
  • the amount of data collected by the first device may be the amount of online data collected by the first device, for example, channel information collected by the first device in real time.
  • the above channel information may include at least one of the following: signal emission angle information, time delay information of signals in the channel, signal quality in the channel, and so on.
  • the above-mentioned first threshold may be 3000, 5000 or 7000 and so on.
  • the measurement information includes at least one of the following: first measurement information of a reference signal received by the first device, and second measurement information collected by a sensor of the first device.
  • the first measurement information includes at least one of the following: instantaneous measurement information of the reference signal, and statistical measurement information of the reference signal.
  • the instantaneous measurement information of the reference signal may be: measurement information of the reference signal at a specific moment; the statistical measurement information of the reference signal may be: measurement information of the reference signal within a period of time.
  • the aforementioned reference signal includes at least one of the following: a synchronization signal block SSB, a CSI reference signal CSI-RS, a sounding reference signal SRS, and a positioning reference signal PRS.
  • the aforementioned sensors may include at least one of the following: vision sensors, radar sensors, position sensors and the like.
  • the above-mentioned first information includes at least one of the following: input information of the above-mentioned first AI model, and output information of the above-mentioned first AI model.
  • the first information in the case where the first information includes the input information of the first AI model, the first information may be the working channel or the channel information of the surrounding channel of the terminal. In the first information In the case where the output information of the first AI model is included, the first information may be location information of the terminal.
  • the first device may obtain status information in real time or periodically, and perform online learning on the first AI model when the status information satisfies a first preset condition.
  • the first device may obtain the error information of the output result of the first AI model in real time or periodically, or the prediction accuracy of the first AI model, and when the error information is greater than the second In the case of the threshold value, online learning is performed on the first AI model.
  • the first device may collect statistics on input information and output information of the first AI model, and perform online learning on the first AI model when the statistical information of the input information or output information satisfies a third preset condition.
  • the first device may detect the reference signal or the measurement information collected by the sensor, and when the measurement information satisfies the second preset condition, and/or the statistical information of the measurement information satisfies the fourth preset condition, the first device An AI model for online learning.
  • the statistical information of the above-mentioned first information includes at least one of the following: the first statistical quantity of the first information in the first time window, the first statistical quantity of the first information in at least two consecutive second time windows
  • the second statistical quantity corresponding to the information is the statistical information of the first information of at least two terminals under the first cell at the first moment, and the correlation information of the first information.
  • the above-mentioned second statistic is calculated based on the statistic in each second time window.
  • the above statistical information may include at least one of the following: mean value, variance and so on.
  • the foregoing statistical information may include temporal statistical information and spatial statistical information.
  • the statistical information on time may be: statistical information on channels of the same terminal within a continuous period of time
  • the statistical information on space may be statistical information on channels of multiple different terminals under one cell.
  • the statistical information of the above-mentioned first information is explained below by using the first information as channel information.
  • the statistical information of the first information when the statistical information of the first information includes the first statistical quantity of the first information in the first time window, the statistical information of the first information may be: The mean or variance of the channel information.
  • the above-mentioned statistical information of the first information may be: The mean value of the mean value of channel information in continuous time windows, for example, the mean value of channel information in time window 1 is a, the mean value of channel information in time window 2 is b, and the mean value of channel information in time window 3 is c, then the channel information
  • the statistic for is the mean of a, b, and c.
  • the statistical information of the first information when the statistical information of the first information includes the statistical information of the first information of at least two terminals under the first cell at the first moment, the statistical information of the above-mentioned first information may be:
  • the mean value of the channel information of different terminals at a certain moment for example, the mean values of the channel information of terminal A, terminal B and terminal C in the same cell at time 1 are d, e and f respectively, then the statistical information of the channel information is Means of d, e and f.
  • the statistical information of the first information when the statistical information of the first information includes the correlation information of the first information, the statistical information of the first information may be: the correlation index of the channel information of the two time windows is lower than a certain threshold, For example, the distance between the previous and subsequent data in the current time window is smaller than a certain threshold.
  • the statistical information of the measurement information of the first device includes at least one of the following: a third statistic for the measurement information within a third time window, and the measurement information is in at least two consecutive
  • the fourth statistic corresponding to the fourth time window is the correlation information of the above measurement information.
  • the above fourth statistic is calculated based on the statistic in each fourth time window.
  • the above correlation information includes at least one of the following: distance between data, covariance, and correlation coefficient.
  • the measurement information is channel measurement information of a reference signal received by the first device as an example.
  • the statistical information of the above measurement information may be: the variance of the channel measurement information in a certain continuous time window, or the average of the mean values of the channel measurement information in two consecutive time windows, or the channel measurement information in the current time window distance between the data.
  • the trigger condition includes: the state information of the first device meets a first preset condition; the satisfaction of the first preset condition includes at least one of the following:
  • the moving speed of the first device is greater than a third threshold
  • the beam switching information indicates that beam switching occurs to the first device, and the beam switching frequency is greater than a fourth threshold
  • the above cell switching information indicates that the cell switching occurs to the first device.
  • the above-mentioned third threshold may be 60km/h, 80km/h, 100km/h and so on.
  • the trigger condition includes: the measurement information of the first device meets a second preset condition; the meeting the second preset condition includes: the measurement information of the first device indicates that the first device The channel environment of the relevant channel changes.
  • the measurement information is a reference signal received by the first device as an example.
  • the above-mentioned second preset condition may be: the first device estimates the downlink channel according to the measurement of CSI-RS, and detects that the channel environment changes, such as changing from a line of sight (LOS) environment to a non-line of sight (not line of sight) environment. sight, NLOS) environment, such as the signal-to-noise ratio SINR is lower than a certain threshold and so on.
  • LOS line of sight
  • NLOS non-line of sight
  • SINR signal-to-noise ratio
  • the measurement information is the measurement information collected by the sensor of the first device as an example.
  • the above-mentioned second preset condition may be: the measurement information obtained by the visual sensor indicates that the first device is in an LOS environment.
  • the above-mentioned measurement information includes at least one of the following: first measurement information of the reference signal received by the first device, and second measurement information collected by the sensor of the first device; optionally, the above-mentioned first measurement information includes at least one of the following : Instantaneous measurement information of the reference signal, statistical measurement information of the reference signal;
  • the above-mentioned reference signal includes at least one of the following: a synchronization signal block SSB, a CSI reference signal CSI-RS, a sounding reference signal SRS, and a positioning reference signal PRS.
  • the above-mentioned trigger conditions include: statistics of the second information corresponding to the above-mentioned first AI model The information satisfies the third preset condition; the above-mentioned meeting of the third preset condition includes at least one of the following:
  • the above-mentioned first statistic is greater than the maximum value of the first threshold interval
  • the above-mentioned second statistic is greater than the maximum value of the second threshold interval
  • the correlation information of the first information collected in at least two time windows satisfies the first condition
  • Correlation information between different first pieces of information collected within the current time window satisfies the second condition.
  • the first statistic is an average value of channel information of the terminal within a certain continuous time window.
  • the foregoing third preset condition may be: the first device detects that the statistics of channel information in a certain continuous time window exceed the maximum value of a certain threshold interval.
  • the second statistic is an average value of average values of channel information of the terminal in multiple consecutive time windows as an example.
  • the foregoing third preset condition may be: the first device detects that the average value of the average values of the channel information in multiple consecutive time windows exceeds the maximum value of a certain threshold interval.
  • the first information is channel information as an example.
  • the above-mentioned third preset condition may be: the correlation index of the channel information in the two time windows is lower than a certain threshold, or the correlation of the previous and subsequent data in the current time window is lower than a certain threshold.
  • the trigger condition includes: the statistical information of the measurement information of the first device satisfies a fourth preset condition; the satisfaction of the fourth preset condition includes at least one of the following:
  • the above third statistic is greater than the maximum value of the third threshold interval
  • the above fourth statistic is greater than the maximum value of the fourth threshold interval
  • the correlation information of the measurement information collected in at least two time windows satisfies the third condition
  • Correlation information among different measurement information collected in the current time window satisfies the fourth condition
  • the difference between the distribution of the measurement information and the reference distribution is greater than the fifth threshold, and the above reference distribution is information configured by the second device for the first device.
  • the above fourth preset condition may be: the mean value of the channel measurement information in a certain continuous time window exceeds a certain The maximum value of the threshold interval.
  • the above fourth preset condition may be: based on the average value of the channel measurement information in each fourth time window
  • the mean calculated by the mean exceeds the maximum value of a certain threshold interval.
  • the above third condition may be: the covariance of the data of the measurement information respectively collected in at least two time windows is smaller than a certain threshold.
  • the above fourth condition may be: the distance between different data of the measurement information collected within the current time window is smaller than a certain threshold.
  • the above-mentioned distribution of the measurement information is a statistical distribution of the measurement information.
  • the above reference distribution is the statistical distribution of the first AI model. It can be understood that the training set for offline training of the first AI model obeys the benchmark distribution, and the performance of the first AI model is best when the measurement information obeys the benchmark distribution.
  • the index describing the difference between the distribution of the measurement information and the reference distribution may include at least one of the following: Wasserstein distance; Kullback-Leibler divergence; Hellinger distance and the like.
  • the trigger condition of the above trigger method includes at least one of the following:
  • the second device instructs the first device to perform online learning
  • the output accuracy of the first AI model is less than or equal to the sixth threshold.
  • the above-mentioned second device instructs the first device to perform online learning, including at least one of the following:
  • the second device instructs the first device to perform online learning periodically
  • the second device instructs the first device to conduct online learning semi-periodically
  • the second device instructs the first device to perform online learning aperiodically.
  • the cycle adopted by the above-mentioned first device to perform online learning periodically or semi-periodically is: a cycle preconfigured by the second device, or a cycle independently configured by the first device.
  • the second device instructs the first device to perform online learning semi-periodically, including:
  • the second device instructs the first device to perform online learning semi-periodically through the first signaling, and the first signaling includes at least one of the following: medium access control-control element MAC-CE, and downlink control information DCI.
  • the above suspension conditions include at least one of the following:
  • the number of online learning of the first AI model is greater than the preset number of iterations
  • the first AI model reaches the preset accuracy
  • the error information of the output result of the first AI model is smaller than the seventh threshold
  • the second device abruptly instructs the first device to end the current online learning process
  • the difference information between the measurement information distribution of the first device and the reference distribution is smaller than the eighth threshold.
  • the first device may detect in real time or periodically whether the suspension condition is met, and if the suspension condition is satisfied, suspend the online learning of the first AI model.
  • the aforementioned target task associated with the first AI model may be a task currently performed by the first AI model, such as locating a terminal, making network recommendations for a user, and so on.
  • the first device may stop online learning of the first AI model when it detects that the number of iterations of the first AI model is greater than a preset number of iterations (eg, 10,000).
  • the second device may stop online learning of the first AI model when detecting that the accuracy value of the first AI model reaches a preset accuracy.
  • the second device may stop the online learning of the first AI model when it detects that the error of the output result of the first AI model is smaller than a preset error value.
  • the first device can stop the online learning of the AI model based on the number of times of online learning of the above-mentioned AI model, the achieved accuracy, the error of the output result, and the difference from the reference distribution, so as to improve the prediction accuracy of the model. case, saving power consumption.
  • suspending the online learning of the first AI model may be temporarily stopping the online learning of the first AI model, or ending the online learning of the first AI model.
  • the above parameter configuration information includes at least one of the following:
  • Composition information of the first data set of the first AI model
  • AI model identification associated with online learning information
  • the above-mentioned first data set is at least one of the following: the original data set used by the first AI model, and the newly collected data set by the first AI model.
  • the above parameter information of the benchmark distribution includes at least one of the following: variance, mean, and standard deviation.
  • the above-mentioned original data set is: a data set used for offline training of the first AI model (that is, an old data set), and the above-mentioned newly collected data set is: deployed online on the first device side, that is, the first AI model.
  • the data set ie, the new data set
  • the data set collected by the first device in a new environment.
  • the above-mentioned online learning mode includes any one of the following: an instantaneous training mode (ie, One-shot mode), and a continuous learning mode.
  • an instantaneous training mode ie, One-shot mode
  • a continuous learning mode ie, the first device performs online learning when the collected data reaches a specified amount
  • the continuous learning mode the first device continuously performs online learning as the amount of collected data increases .
  • the size of the above batch (Batch) is N, and N is a positive integer.
  • the state of the optimizer of the above-mentioned first AI model may include a loss function, a learning rate, and the like.
  • the first data set division method may include division ratios of the training set, verification set, and test set, and the like.
  • composition information of the above data sets includes the ratio between the number of original data sets and the number of newly collected data sets. It should be noted that the use of the original data set during training in the embodiment of the present application can effectively prevent over-fitting of newly collected data during the online learning process, thereby effectively improving the performance of the AI model.
  • the contribution weight of the first data set to the update of the first AI model may be: the contribution weights of the old data set and the new data set to the update of the first AI model.
  • the first AI model when it performs online learning, it may be Assign smaller weights to the original dataset and larger weights to the new dataset.
  • the first device may obtain parameter configuration information of the first AI model, and perform online learning on the first AI model based on the parameter configuration information, so as to ensure the prediction accuracy of the AI model in a changing environment.
  • the parameter configuration information includes an online learning mode of the first AI model, and the online learning mode is an instantaneous training mode, and the parameter configuration information further includes at least one of the following: The amount of data collected and the length of time for collecting data.
  • the above-mentioned parameter configuration information includes the online learning mode of the first AI model, and the online learning mode is a continuous learning mode, and the above-mentioned parameter configuration information also includes at least one of the following: The time interval of online learning, the data volume interval between two adjacent online learning.
  • the data volume interval between two adjacent online learning sessions may be 100, for example, online training is performed every time 100 sets of data are collected.
  • the online learning method of the AI model provided in the embodiment of the present application may be executed by an online learning device of the AI model.
  • the online method of executing the AI model by the online device of the AI model is taken as an example to illustrate the online method of the AI model provided by the embodiment of the application. learning device.
  • An embodiment of the present application provides an online AI model learning device 400. As shown in FIG. A device deploys a first AI model; the configuration 401 is further used for the second device to configure online learning information of the first AI model for the first device.
  • the online learning information includes at least one of the following:
  • the trigger condition corresponding to the trigger mode includes at least one of the following:
  • the state information of the first device satisfies a first preset condition
  • the amount of data collected by the first device is greater than a first threshold
  • the measurement information of the first device satisfies a second preset condition
  • the error information of the output result of the first AI model is greater than a second threshold
  • the statistical information of the first information corresponding to the first AI model satisfies a third preset condition
  • the state information includes at least one of the following: moving speed, beam switching information, cell switching information;
  • the first information includes at least one of the following: input information of the first AI model, and output information of the first AI model.
  • the statistical information of the first information includes at least one of the following: a first statistical quantity of the first information within a first time window, at least two consecutive second time windows A second statistic corresponding to the first information, statistical information of the first information of at least two terminals under the first cell at the first moment, and correlation information of the first information;
  • the second statistics are calculated based on statistics in each of the second time windows
  • the statistical information of the measurement information of the first device includes at least one of the following: a third statistic for the above measurement information within a third time window, where the measurement information corresponds to at least two consecutive fourth time windows. Four statistics, correlation information of the measurement information;
  • the fourth statistic is calculated based on the statistic in each of the fourth time windows
  • the correlation information includes at least one of the following: distance between data, covariance, and correlation coefficient.
  • the trigger condition includes: the status information of the first device meets a first preset condition
  • Said meeting the first preset condition includes at least one of the following:
  • the moving speed of the first device is greater than a third threshold
  • the beam switching information indicates that beam switching occurs for the first device, and the beam switching frequency is greater than a fourth threshold
  • the cell switching information indicates that a cell switching occurs to the first device.
  • the trigger condition includes: the measurement information of the reference signal received by the first device satisfies a second preset condition
  • the meeting the second preset condition includes: the measurement information of the reference signal indicates that the channel environment of the relevant channel of the first device changes;
  • the measurement information includes at least one of the following: first measurement information of a reference signal received by the first device, and second measurement information collected by a sensor of the first device;
  • the first measurement information includes at least one of the following: instantaneous measurement information of the reference signal, statistical measurement information of the reference signal;
  • the reference signal includes at least one of the following: a synchronization signal block SSB, a CSI reference signal CSI-RS, a sounding reference signal SRS, and a positioning reference signal PRS.
  • the trigger condition includes: the statistical information of the first information corresponding to the first AI model satisfies a third preset condition;
  • Said meeting the third preset condition includes at least one of the following:
  • the first statistic is greater than the maximum value of the first threshold interval
  • the second statistic is greater than the maximum value of the second threshold interval
  • the correlation information of the first information collected in at least two time windows satisfies a first condition
  • Correlation information between different first pieces of information collected within the current time window satisfies the second condition.
  • the trigger condition includes: statistical information of the measurement information of the first device meets a fourth preset condition;
  • Said meeting the fourth preset condition includes at least one of the following:
  • the third statistic is greater than the maximum value of the third threshold interval
  • the fourth statistic is greater than the maximum value of the fourth threshold interval
  • the correlation information of the measurement information collected in at least two time windows satisfies the third condition
  • Correlation information among different measurement information collected in the current time window satisfies the fourth condition
  • a difference between the distribution of the measurement information and a reference distribution is greater than a fifth threshold, where the reference distribution is information configured by the second device for the first device.
  • the trigger condition of the trigger mode includes at least one of the following:
  • the second device instructs the first device to perform online learning
  • the output accuracy of the first AI model is less than or equal to the sixth threshold.
  • the second device instructs the first device to perform online learning, including at least one of the following:
  • the second device instructs the first device to periodically perform online learning
  • the second device instructs the first device to perform online learning semi-periodically
  • the second device instructs the first device to perform online learning aperiodically.
  • the period adopted by the first device to perform online learning periodically or semi-periodically is: a period preconfigured by the second device, or a period independently configured by the first device.
  • the second device instructs the first device to perform online learning semi-periodically, including:
  • the second device instructs the first device to perform online learning semi-periodically through a first signaling, and the first signaling includes at least one of the following: medium access control-control element MAC-CE, downlink control information DCI .
  • the termination condition includes at least one of the following:
  • the number of online learning of the first AI model is greater than the preset number of iterations
  • the first AI model reaches a preset accuracy
  • the error information of the output result of the first AI model is less than the seventh threshold
  • the second device abruptly instructs the first device to end the current online learning process
  • the difference information between the measurement information distribution of the first device and the reference distribution is smaller than an eighth threshold.
  • the parameter configuration information includes at least one of the following:
  • Composition information of the first data set of the AI model
  • the first data set is at least one of the following: the original data set used by the first AI model, the data set newly collected by the first AI model;
  • the parameter information of the reference distribution includes at least one of the following: variance, mean, and standard deviation.
  • the parameter configuration information includes the online learning mode of the first AI model, and the online learning mode is an instantaneous training mode, and the parameter configuration information further includes at least one of the following : the amount of data collected by the first device, and the length of time for collecting the data.
  • the parameter configuration information includes the online learning mode of the first AI model, and the online learning mode is a continuous learning mode, and the parameter configuration information further includes at least one of the following : the time interval between two adjacent online learning, and the data volume interval between two adjacent online learning.
  • the second device includes at least one of the following: core network equipment, access network equipment, and terminal; the first device includes at least one of the following: core network equipment, access Network equipment, terminals.
  • the second device configures the first AI model for the first device, and configures the online learning information of the first AI model for the first device.
  • the first AI model can be continuously adjusted online on the first device side, thereby maintaining The predictive performance of the first AI model, thereby ensuring the service quality of the first device.
  • An embodiment of the present application provides an online learning device 500 for an AI model.
  • the first device acquires a first AI model; the execution module 502 is configured to enable the first device to learn online the first AI model based on the online learning information of the first AI model.
  • the first device acquires the first AI model, including:
  • the first device receives the first AI model configured by the second device.
  • the acquisition module is specifically used to
  • the apparatus further includes: a configuration module, configured to configure the online learning information of the first AI model for the third device.
  • the online learning information includes at least one of the following:
  • the trigger condition corresponding to the trigger mode includes at least one of the following:
  • the state information of the first device satisfies a first preset condition
  • the amount of data collected by the first device is greater than a first threshold
  • the measurement information of the first device satisfies a second preset condition
  • the error information of the output result of the first AI model is greater than a second threshold
  • the statistical information of the first information corresponding to the first AI model satisfies a third preset condition
  • the state information includes at least one of the following: moving speed, beam switching information, cell switching information;
  • the first information includes at least one of the following: input information of the first AI model, and output information of the first AI model.
  • the statistical information of the first information includes at least one of the following: a first statistical quantity of the first information within a first time window, at least two consecutive second time windows A second statistic corresponding to the first information, statistical information of the first information of at least two terminals under the first cell at the first moment, and correlation information of the first information;
  • the second statistics are calculated based on statistics in each of the second time windows
  • the statistical information of the measurement information of the first device includes at least one of the following: a third statistic for the above measurement information within a third time window, where the measurement information corresponds to at least two consecutive fourth time windows. Four statistics, correlation information of the measurement information;
  • the fourth statistic is calculated based on the statistic in each of the fourth time windows
  • the correlation information includes at least one of the following: distance between data, covariance, and correlation coefficient.
  • the trigger condition includes: the status information of the first device meets a first preset condition
  • Said meeting the first preset condition includes at least one of the following:
  • the moving speed of the first device is greater than a third threshold
  • the beam switching information indicates that beam switching occurs for the first device, and the beam switching frequency is greater than a fourth threshold
  • the cell switching information indicates that a cell switching occurs to the first device.
  • the trigger condition includes: the measurement information of the reference signal received by the first device satisfies a second preset condition
  • the meeting the second preset condition includes: the measurement information of the reference signal indicates that the channel environment of the relevant channel of the first device changes;
  • the measurement information includes at least one of the following: first measurement information of a reference signal received by the first device, and second measurement information collected by a sensor of the first device;
  • the first measurement information includes at least one of the following: instantaneous measurement information of the reference signal, statistical measurement information of the reference signal;
  • the reference signal includes at least one of the following: a synchronization signal block SSB, a CSI reference signal CSI-RS, a sounding reference signal SRS, and a positioning reference signal PRS.
  • the trigger condition includes: the statistical information of the second information corresponding to the first AI model satisfies a third preset condition;
  • Said meeting the third preset condition includes at least one of the following:
  • the first statistic is greater than the maximum value of the first threshold interval
  • the second statistic is greater than the maximum value of the second threshold interval
  • the correlation information of the first information collected in at least two time windows satisfies a first condition
  • Correlation information between different first pieces of information collected within the current time window satisfies the second condition.
  • the trigger condition includes: statistical information of the measurement information of the first device meets a fourth preset condition;
  • Said meeting the fourth preset condition includes at least one of the following:
  • the third statistic is greater than the maximum value of the third threshold interval
  • the fourth statistic is greater than the maximum value of the fourth threshold interval
  • the correlation information of the measurement information collected in at least two time windows satisfies the third condition
  • Correlation information among different measurement information collected in the current time window satisfies the fourth condition
  • a difference between the distribution of the measurement information and a reference distribution is greater than a fifth threshold, where the reference distribution is information configured by the second device for the first device.
  • the trigger condition of the trigger mode includes at least one of the following:
  • the second device instructs the first device to perform online learning
  • the output accuracy of the first AI model is less than or equal to the sixth threshold.
  • the second device instructs the first device to perform online learning, including at least one of the following:
  • the second device instructs the first device to periodically perform online learning
  • the second device instructs the first device to perform online learning semi-periodically
  • the second device instructs the first device to perform online learning aperiodically.
  • the period adopted by the first device to perform online learning periodically or semi-periodically is: a period preconfigured by the second device, or a period independently configured by the first device.
  • the second device instructs the first device to perform online learning semi-periodically, including:
  • the second device instructs the first device to perform online learning semi-periodically through a first signaling, and the first signaling includes at least one of the following: medium access control-control element MAC-CE, downlink control information DCI .
  • the termination condition includes at least one of the following:
  • the number of online learning of the first AI model is greater than the preset number of iterations
  • the first AI model reaches a preset accuracy
  • the error information of the output result of the first AI model is less than the seventh threshold
  • the second device abruptly instructs the first device to end the current online learning process
  • the difference information between the measurement information distribution of the first device and the reference distribution is smaller than an eighth threshold.
  • the parameter configuration information includes at least one of the following:
  • Composition information of the first data set of the AI model
  • the first data set is at least one of the following: the original data set used by the first AI model, the data set newly collected by the first AI model;
  • the parameter information of the reference distribution includes at least one of the following: variance, mean, and standard deviation.
  • the parameter configuration information includes the online learning mode of the first AI model, and the online learning mode is an instantaneous training mode, and the parameter configuration information further includes at least one of the following : the amount of data collected by the first device, and the length of time for collecting the data.
  • the parameter configuration information includes the online learning mode of the first AI model, and the online learning mode is a continuous learning mode, and the parameter configuration information further includes at least one of the following : The time interval between two adjacent online learning, and the data volume interval between two adjacent online learning.
  • the second device, the first device, and the third device include at least one of the following: a core network device, an access network device, and a terminal.
  • the first device acquires a first AI model, and performs online learning on the first AI model based on online learning information of the first AI model.
  • the first AI model can be continuously adjusted online on the first device side, thereby maintaining The predictive performance of the first AI model, thereby ensuring the service quality of the first device.
  • the online learning apparatus for the AI model in the embodiment of the present application may be an electronic device, such as an electronic device with an operating system, or a component in the electronic device, such as an integrated circuit or a chip.
  • the electronic device may be a terminal, or other devices other than the terminal.
  • the terminal may include, but not limited to, the types of terminal 11 listed above, and other devices may be servers, Network Attached Storage (NAS), etc., which are not specifically limited in this embodiment of the present application.
  • NAS Network Attached Storage
  • the AI model online learning device provided by the embodiment of the present application can realize the various processes realized by the method embodiments in Fig. 1 to Fig. 3 and achieve the same technical effect. To avoid repetition, details are not repeated here.
  • this embodiment of the present application also provides a communication device 600, including a processor 601 and a memory 602, and the memory 602 stores programs or instructions that can run on the processor 601, such as
  • the communication device 600 is a terminal, when the program or instruction is executed by the processor 601, each step of the above embodiment of the online learning method of the AI model can be realized, and the same technical effect can be achieved.
  • the communication device 600 is a network-side device, when the program or instruction is executed by the processor 601, the steps of the above-mentioned online learning method for the AI model are implemented, and the same technical effect can be achieved. To avoid repetition, details are not repeated here. .
  • the embodiment of the present application further provides a terminal, including a processor and a communication interface, and the processor is configured to acquire a first AI model, and perform online learning on the first AI model based on online learning information of the first AI model.
  • This terminal embodiment corresponds to the above-mentioned terminal-side method embodiment, and each implementation process and implementation mode of the above-mentioned method embodiment can be applied to this terminal embodiment, and can achieve the same technical effect.
  • FIG. 7 is a schematic diagram of a hardware structure of a terminal implementing an embodiment of the present application.
  • the terminal 700 includes, but is not limited to: a radio frequency unit 701, a network module 702, an audio output unit 703, an input unit 704, a sensor 705, a display unit 706, a user input unit 707, an interface unit 708, a memory 709, and a processor 710. At least some parts.
  • the terminal 700 may also include a power supply (such as a battery) for supplying power to various components, and the power supply may be logically connected to the processor 710 through the power management system, so as to manage charging, discharging, and power consumption through the power management system. Management and other functions.
  • a power supply such as a battery
  • the terminal structure shown in FIG. 7 does not constitute a limitation on the terminal, and the terminal may include more or fewer components than shown in the figure, or combine some components, or arrange different components, which will not be repeated here.
  • the input unit 704 may include a graphics processing unit (Graphics Processing Unit, GPU) 7041 and a microphone 7042, and the graphics processor 7041 is used by the image capture device (such as the image data of the still picture or video obtained by the camera) for processing.
  • the display unit 706 may include a display panel 7061, and the display panel 7061 may be configured in the form of a liquid crystal display, an organic light emitting diode, or the like.
  • the user input unit 707 includes at least one of a touch panel 7071 and other input devices 7072 .
  • the touch panel 7071 is also called a touch screen.
  • the touch panel 7071 may include two parts, a touch detection device and a touch controller.
  • Other input devices 7072 may include, but are not limited to, physical keyboards, function keys (such as volume control buttons, switch buttons, etc.), trackballs, mice, and joysticks, which will not be described in detail here.
  • the radio frequency unit 701 may transmit the downlink data from the network side device to the processor 710 for processing after receiving the downlink data; in addition, the radio frequency unit 701 may send uplink data to the network side device.
  • the radio frequency unit 701 includes, but is not limited to, an antenna, an amplifier, a transceiver, a coupler, a low noise amplifier, a duplexer, and the like.
  • the memory 709 can be used to store software programs or instructions as well as various data.
  • the memory 709 may mainly include a first storage area for storing programs or instructions and a second storage area for storing data, wherein the first storage area may store an operating system, an application program or instructions required by at least one function (such as a sound playing function, image playback function, etc.), etc.
  • memory 709 may include volatile memory or nonvolatile memory, or, memory 709 may include both volatile and nonvolatile memory.
  • the non-volatile memory can be read-only memory (Read-Only Memory, ROM), programmable read-only memory (Programmable ROM, PROM), erasable programmable read-only memory (Erasable PROM, EPROM), electronically programmable Erase Programmable Read-Only Memory (Electrically EPROM, EEPROM) or Flash.
  • ROM Read-Only Memory
  • PROM programmable read-only memory
  • Erasable PROM Erasable PROM
  • EPROM erasable programmable read-only memory
  • Electrical EPROM Electrical EPROM
  • EEPROM electronically programmable Erase Programmable Read-Only Memory
  • Volatile memory can be random access memory (Random Access Memory, RAM), static random access memory (Static RAM, SRAM), dynamic random access memory (Dynamic RAM, DRAM), synchronous dynamic random access memory (Synchronous DRAM, SDRAM), double data rate synchronous dynamic random access memory (Double Data Rate SDRAM, DDRSDRAM), enhanced synchronous dynamic random access memory (Enhanced SDRAM, ESDRAM), synchronous connection dynamic random access memory (Synch link DRAM , SLDRAM) and Direct Memory Bus Random Access Memory (Direct Rambus RAM, DRRAM).
  • RAM Random Access Memory
  • SRAM static random access memory
  • DRAM dynamic random access memory
  • DRAM synchronous dynamic random access memory
  • SDRAM double data rate synchronous dynamic random access memory
  • Double Data Rate SDRAM Double Data Rate SDRAM
  • DDRSDRAM double data rate synchronous dynamic random access memory
  • Enhanced SDRAM, ESDRAM enhanced synchronous dynamic random access memory
  • Synch link DRAM , SLDRAM
  • Direct Memory Bus Random Access Memory Direct Rambus
  • the processor 710 may include one or more processing units; optionally, the processor 710 integrates an application processor and a modem processor, wherein the application processor mainly processes operations related to the operating system, user interface, and application programs, etc.,
  • the modem processor mainly handles the line communication signals, such as baseband processors. It can be understood that the foregoing modem processor may not be integrated into the processor 710 .
  • the processor 710 is used for the first device to acquire the first AI model, and the processor 710 is also used for the first device to perform the first AI model based on the online learning information of the first AI model.
  • An AI model for online learning is used for the first device to acquire the first AI model, and the processor 710 is also used for the first device to perform the first AI model based on the online learning information of the first AI model.
  • the radio frequency unit 701 is configured to receive the first AI model configured by the second device.
  • the processor 710 is specifically configured to acquire online learning information of the first AI model from the second device.
  • the processor 710 is further configured to configure online learning information of the first AI model for the third device.
  • the online learning information includes at least one of the following:
  • the trigger condition corresponding to the trigger mode includes at least one of the following:
  • the state information of the first device satisfies a first preset condition
  • the amount of data collected by the first device is greater than a first threshold
  • the measurement information of the first device satisfies a second preset condition
  • the error information of the output result of the first AI model is greater than a second threshold
  • the statistical information of the first information corresponding to the first AI model satisfies a third preset condition
  • the state information includes at least one of the following: moving speed, beam switching information, cell switching information;
  • the first information includes at least one of the following: input information of the first AI model, and output information of the first AI model.
  • the statistical information of the first information includes at least one of the following: a first statistical quantity of the first information within a first time window, at least two consecutive second time windows A second statistic corresponding to the first information, statistical information of the first information of at least two terminals under the first cell at the first moment, and correlation information of the first information;
  • the second statistics are calculated based on statistics in each of the second time windows
  • the statistical information of the measurement information of the first device includes at least one of the following: a third statistic for the above measurement information within a third time window, where the measurement information corresponds to at least two consecutive fourth time windows. Four statistics, correlation information of the measurement information;
  • the fourth statistic is calculated based on the statistic in each of the fourth time windows
  • the correlation information includes at least one of the following: distance between data, covariance, and correlation coefficient.
  • the trigger condition includes: the status information of the first device meets a first preset condition
  • Said meeting the first preset condition includes at least one of the following:
  • the moving speed of the first device is greater than a third threshold
  • the beam switching information indicates that beam switching occurs for the first device, and the beam switching frequency is greater than a fourth threshold
  • the cell switching information indicates that a cell switching occurs to the first device.
  • the trigger condition includes: the measurement information of the reference signal received by the first device satisfies a second preset condition
  • the meeting the second preset condition includes: the measurement information of the reference signal indicates that the channel environment of the relevant channel of the first device changes;
  • the measurement information includes at least one of the following: first measurement information of a reference signal received by the first device, and second measurement information collected by a sensor of the first device;
  • the first measurement information includes at least one of the following: instantaneous measurement information of the reference signal, statistical measurement information of the reference signal;
  • the reference signal includes at least one of the following: a synchronization signal block SSB, a CSI reference signal CSI-RS, a sounding reference signal SRS, and a positioning reference signal PRS.
  • the trigger condition includes: the statistical information of the second information corresponding to the first AI model satisfies a third preset condition;
  • Said meeting the third preset condition includes at least one of the following:
  • the first statistic is greater than the maximum value of the first threshold interval
  • the second statistic is greater than the maximum value of the second threshold interval
  • the correlation information of the first information collected in at least two time windows satisfies a first condition
  • Correlation information between different first pieces of information collected within the current time window satisfies the second condition.
  • the trigger condition includes: statistical information of the measurement information of the first device meets a fourth preset condition;
  • Said meeting the fourth preset condition includes at least one of the following:
  • the third statistic is greater than the maximum value of the third threshold interval
  • the fourth statistic is greater than the maximum value of the fourth threshold interval
  • the correlation information of the measurement information collected in at least two time windows satisfies the third condition
  • Correlation information among different measurement information collected in the current time window satisfies the fourth condition
  • a difference between the distribution of the measurement information and a reference distribution is greater than a fifth threshold, where the reference distribution is information configured by the second device for the first device.
  • the trigger condition of the trigger mode includes at least one of the following:
  • the second device instructs the first device to perform online learning
  • the output accuracy of the first AI model is less than or equal to the sixth threshold.
  • the second device instructs the first device to perform online learning, including at least one of the following:
  • the second device instructs the first device to periodically perform online learning
  • the second device instructs the first device to perform online learning semi-periodically
  • the second device instructs the first device to perform online learning aperiodically.
  • the period adopted by the first device to perform online learning periodically or semi-periodically is: a period preconfigured by the second device, or a period independently configured by the first device.
  • the second device instructs the first device to perform online learning semi-periodically, including:
  • the second device instructs the first device to perform online learning semi-periodically through a first signaling, and the first signaling includes at least one of the following: medium access control-control element MAC-CE, downlink control information DCI .
  • the termination condition includes at least one of the following:
  • the number of online learning of the first AI model is greater than the preset number of iterations
  • the first AI model reaches a preset accuracy
  • the error information of the output result of the first AI model is less than the seventh threshold
  • the second device abruptly instructs the first device to end the current online learning process
  • the difference information between the measurement information distribution of the first device and the reference distribution is smaller than an eighth threshold.
  • the parameter configuration information includes at least one of the following:
  • Composition information of the first data set of the AI model
  • the first data set is at least one of the following: the original data set used by the first AI model, the data set newly collected by the first AI model;
  • the parameter information of the reference distribution includes at least one of the following: variance, mean, and standard deviation.
  • the parameter configuration information includes the online learning mode of the first AI model, and the online learning mode is an instantaneous training mode, and the parameter configuration information further includes at least one of the following : the amount of data collected by the first device, and the length of time for collecting the data.
  • the parameter configuration information includes the online learning mode of the first AI model, and the online learning mode is a continuous learning mode, and the parameter configuration information further includes at least one of the following : The time interval between two adjacent online learning, and the data volume interval between two adjacent online learning.
  • the second device, the first device, and the third device include at least one of the following: a core network device, an access network device, and a terminal.
  • the terminal acquires a first AI model, and performs online learning on the first AI model based on online learning information of the first AI model.
  • this method by deploying the first AI model on the terminal side, and configuring the first model on The parameters required for online learning enable continuous online adjustment of the first AI model on the terminal side, thereby maintaining the predictive performance of the first AI model and ensuring the service quality of the terminal.
  • the second device being the network side device as an example.
  • the embodiment of the present application also provides a network side device, including a processor and a communication interface, the processor is configured to configure a first AI model for a first device; and configure online learning information of the first AI model for the first device.
  • the network-side device embodiment corresponds to the above-mentioned network-side device method embodiment, and each implementation process and implementation mode of the above-mentioned method embodiment can be applied to this network-side device embodiment, and can achieve the same technical effect.
  • the embodiment of the present application also provides a network side device.
  • the network side device 800 includes: an antenna 81 , a radio frequency device 82 , a baseband device 83 , a processor 84 and a memory 85 .
  • the antenna 81 is connected to a radio frequency device 82 .
  • the radio frequency device 82 receives information through the antenna 81, and sends the received information to the baseband device 83 for processing.
  • the baseband device 83 processes the information to be sent and sends it to the radio frequency device 82
  • the radio frequency device 82 processes the received information and sends it out through the antenna 81 .
  • the method performed by the network side device in the above embodiments may be implemented in the baseband device 83, where the baseband device 83 includes a baseband processor.
  • the baseband device 83 can include at least one baseband board, for example, a plurality of chips are arranged on the baseband board, as shown in FIG.
  • the program executes the network device operations shown in the above method embodiments.
  • the network side device may also include a network interface 86, such as a common public radio interface (common public radio interface, CPRI).
  • a network interface 86 such as a common public radio interface (common public radio interface, CPRI).
  • the network side device 800 in this embodiment of the present invention further includes: instructions or programs stored in the memory 85 and operable on the processor 84, and the processor 84 invokes the instructions or programs in the memory 85 to execute the various programs shown in FIG.
  • the method of module execution achieves the same technical effect, so in order to avoid repetition, it is not repeated here.
  • the embodiment of the present application also provides a network side device.
  • the network side device 900 includes: a processor 901 , a network interface 902 and a memory 903 .
  • the network interface 902 is, for example, a common public radio interface (common public radio interface, CPRI).
  • the network-side device 900 in this embodiment of the present invention also includes: instructions or programs stored in the memory 903 and executable on the processor 901, and the processor 901 invokes the instructions or programs in the memory 903 to execute the various programs shown in FIG.
  • the method of module execution achieves the same technical effect, so in order to avoid repetition, it is not repeated here.
  • the embodiment of the present application also provides a readable storage medium, the readable storage medium stores a program or an instruction, and when the program or instruction is executed by the processor, each process of the above-mentioned online learning method embodiment of the AI model is realized, and The same technical effect can be achieved, so in order to avoid repetition, details will not be repeated here.
  • the processor is the processor in the terminal described in the foregoing embodiments.
  • the readable storage medium includes a computer-readable storage medium, such as a computer read-only memory ROM, a random access memory RAM, a magnetic disk or an optical disk, and the like.
  • the embodiment of the present application further provides a chip, the chip includes a processor and a communication interface, the communication interface is coupled to the processor, and the processor is used to run programs or instructions to realize the online learning method of the AI model
  • the chip includes a processor and a communication interface
  • the communication interface is coupled to the processor
  • the processor is used to run programs or instructions to realize the online learning method of the AI model
  • the chip mentioned in the embodiment of the present application may also be called a system-on-chip, a system-on-chip, a system-on-a-chip, or a system-on-a-chip.
  • the embodiment of the present application further provides a computer program/program product, the computer program/program product is stored in a storage medium, and the computer program/program product is executed by at least one processor to realize the above-mentioned online learning of the AI model
  • Each process of the method embodiment can achieve the same technical effect, and will not be repeated here to avoid repetition.
  • the embodiment of the present application also provides a communication system, including: a terminal and a network-side device, the terminal can be used to execute the steps of the online learning method of the AI model as described above, and the network-side device can be used to execute the above-mentioned The steps of the online learning method of the AI model.
  • the term “comprising”, “comprising” or any other variation thereof is intended to cover a non-exclusive inclusion such that a process, method, article or apparatus comprising a set of elements includes not only those elements, It also includes other elements not expressly listed, or elements inherent in the process, method, article, or device. Without further limitations, an element defined by the phrase “comprising a " does not preclude the presence of additional identical elements in the process, method, article, or apparatus comprising that element.
  • the scope of the methods and devices in the embodiments of the present application is not limited to performing functions in the order shown or discussed, and may also include performing functions in a substantially simultaneous manner or in reverse order according to the functions involved. Functions are performed, for example, the described methods may be performed in an order different from that described, and various steps may also be added, omitted, or combined. Additionally, features described with reference to certain examples may be combined in other examples.
  • the methods of the above embodiments can be implemented by means of software plus a necessary general-purpose hardware platform, and of course also by hardware, but in many cases the former is better implementation.
  • the technical solution of the present application is essentially or the part that contributes to the prior art can be embodied in the form of computer software products, and the computer software products are stored in a storage medium (such as ROM/RAM, disk, CD) contains several instructions to enable a terminal (which may be a mobile phone, a computer, a server, an air conditioner, or a network device, etc.) to execute the methods described in various embodiments of the present application.
  • a terminal which may be a mobile phone, a computer, a server, an air conditioner, or a network device, etc.

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Software Systems (AREA)
  • Artificial Intelligence (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Computation (AREA)
  • Medical Informatics (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Computing Systems (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Mobile Radio Communication Systems (AREA)
  • Feedback Control In General (AREA)
  • Communication Control (AREA)
  • Stored Programmes (AREA)

Abstract

The present application discloses an online learning method and apparatus for an artificial intelligence (AI) model, and a communication device and a readable storage medium. The online learning method for the AI model of embodiments of the present application comprises: a second device configures a first AI model for a first device; and the second device configures online learning information of the first AI model for the first device.

Description

AI模型的在线学习方法、装置、通信设备及可读存储介质AI model online learning method, device, communication device and readable storage medium
相关申请的交叉引用Cross References to Related Applications
本申请主张在2022年02月21日在中国提交的中国专利申请号No.202210157466.7的优先权,其全部内容通过引用包含于此。This application claims priority to Chinese Patent Application No. 202210157466.7 filed in China on February 21, 2022, the entire contents of which are hereby incorporated by reference.
技术领域technical field
本申请属于电子信息技术领域,具体涉及一种AI模型的在线学习方法、装置、通信设备及可读存储介质。The application belongs to the field of electronic information technology, and specifically relates to an AI model online learning method, device, communication device and readable storage medium.
背景技术Background technique
随着人工智能(Artificial Intelligence,AI)在各个领域的广泛的应用,将AI融入无线通信网络,来提升网络吞吐量、时延以及用户容量等技术指标成为无线通信网络的重要任务。目前,无线通信网络中的AI模块有多种实现方式。例如,神经网络(neural network,NN)决策树(decision tree,DT)、支持向量机(support vector machine,SVM)、遗传算法(genetic algorithm,GA)等。With the widespread application of artificial intelligence (AI) in various fields, it has become an important task for wireless communication networks to integrate AI into wireless communication networks to improve technical indicators such as network throughput, delay, and user capacity. At present, there are many ways to realize the AI module in the wireless communication network. For example, neural network (neural network, NN) decision tree (decision tree, DT), support vector machine (support vector machine, SVM), genetic algorithm (genetic algorithm, GA), etc.
在相关技术中,通常是离线训练AI模型,然后将训练好的AI模型部署于无线通信系统。然而,在无线通信环境发生变化时,使得AI模型的输出结果的准确性较低。如此,导致AI模型的计算精准较差。In related technologies, the AI model is usually trained offline, and then the trained AI model is deployed in a wireless communication system. However, when the wireless communication environment changes, the accuracy of the output result of the AI model is low. In this way, the calculation accuracy of the AI model is poor.
发明内容Contents of the invention
本申请实施例提供一种AI模型的在线学习方法、装置、通信设备以及可读存储介质,能够解决解决实际场景中,由无线通信环境的动态变化导致的AI模型失效的问题。Embodiments of the present application provide an online AI model learning method, device, communication device, and readable storage medium, which can solve the problem of invalidation of the AI model caused by dynamic changes in the wireless communication environment in actual scenarios.
第一方面,提供了一种AI模型的在线学习方法,该方法包括:第二设备为第一设备配置第一AI模型;第二设备为该第一设备配置上述第一AI模型的在线学习信息。In the first aspect, an online learning method of an AI model is provided, the method includes: a second device configures a first AI model for a first device; the second device configures online learning information of the first AI model for the first device .
第二方面,提供了一种AI模型的在线学习装置,该装置包括:配置模块,其中:所述配置模块,用于第二设备为第一设备配置第一AI模型;所述配置模块,还用于所述第二设备为所述第一设备配置所述第一AI模型的在线学习信息。In a second aspect, an online AI model learning device is provided, which includes: a configuration module, wherein: the configuration module is used for the second device to configure the first AI model for the first device; the configuration module also The online learning information is used for the second device to configure the first AI model for the first device.
第三方面,提供了一种AI模型的在线学习方法,该方法包括:获取模块和执行模块,其中:所述获取模块,用于所述第一设备获取第一AI模型;所述执行模块,用于所述第一设备基于所述第一AI模型的在线学习信息,对所述第一AI模型进行在线学习。In a third aspect, an online learning method of an AI model is provided, the method includes: an acquisition module and an execution module, wherein: the acquisition module is used for the first device to acquire the first AI model; the execution module, The first device performs online learning on the first AI model based on the online learning information of the first AI model.
第四方面,提供了一种AI模型的在线学习装置,该装置包括:第一设备获取第一AI模型;所述第一设备基于所述第一AI模型的在线学习信息,对所述第一AI模型进行在线学习。In a fourth aspect, an online learning device for an AI model is provided, the device includes: a first device acquires a first AI model; the first device, based on the online learning information of the first AI model, AI models for online learning.
第五方面,提供了一种通信设备,该通信设备包括处理器和存储器,所述存储器存储可在所述处理器上运行的程序或指令,所述程序或指令被所述处理器执行时实现如第一方面所述的方法的步骤。In a fifth aspect, a communication device is provided, the communication device includes a processor and a memory, the memory stores programs or instructions that can run on the processor, and the programs or instructions are implemented when executed by the processor The steps of the method as described in the first aspect.
第六方面,提供了一种通信设备,包括处理器及通信接口,其中,所述处理器用于为第一设备配置第一AI模型;以及为该第一设备配置上述第一AI模型的在线学习信息。In a sixth aspect, a communication device is provided, including a processor and a communication interface, wherein the processor is configured to configure a first AI model for a first device; and configure online learning of the first AI model for the first device information.
第七方面,提供了一种通信设备,该通信设备包括处理器和存储器,所述存储器存储可在所述处理器上运行的程序或指令,所述程序或指令被所述处理器执行时实现如第一方面所述的方法的步骤。In a seventh aspect, a communication device is provided, the communication device includes a processor and a memory, the memory stores programs or instructions that can run on the processor, and the programs or instructions are implemented when executed by the processor The steps of the method as described in the first aspect.
第八方面,提供了一种网络侧设备,包括处理器及通信接口,其中,上述处理器用于获取第一AI模型,基于所述第一AI模型的在线学习信息,对所述第一AI模型进行在线学习。In an eighth aspect, a network side device is provided, including a processor and a communication interface, wherein the above-mentioned processor is used to obtain a first AI model, and based on the online learning information of the first AI model, the first AI model Take online learning.
第九方面,提供了一种可读存储介质,所述可读存储介质上存储程序或指令,所述程序或指令被处理器执行时实现如第一方面所述的方法的步骤,或者实现如第三方面所述的方法的步骤。In the ninth aspect, a readable storage medium is provided, and programs or instructions are stored on the readable storage medium, and when the programs or instructions are executed by a processor, the steps of the method described in the first aspect are realized, or the steps of the method described in the first aspect are realized, or The steps of the method described in the third aspect.
第十方面,提供了一种芯片,所述芯片包括处理器和通信接口,所述通信接口和所述处理器耦合,所述处理器用于运行程序或指令,实现如第一方面所述的方法,或实现如第三方面所述的方法。In a tenth aspect, a chip is provided, the chip includes a processor and a communication interface, the communication interface is coupled to the processor, and the processor is used to run programs or instructions to implement the method as described in the first aspect , or implement the method described in the third aspect.
第十一方面,提供了一种计算机程序/程序产品,所述计算机程序/程序产品被存储在存储介质 中,所述计算机程序/程序产品被至少一个处理器执行以实现如第一方面,或实现如第三方面所述的AI模型的在线学习方法的步骤。In an eleventh aspect, a computer program/program product is provided, and the computer program/program product is stored in a storage medium In the above, the computer program/program product is executed by at least one processor to implement the first aspect, or to implement the steps of the online learning method of the AI model as described in the third aspect.
在本申请实施例中,第一设备获取第一AI模型,并基于该第一AI模型的在线学习信息对该第一AI模型进行在线学习。通过该方法,通过在第一设备侧部署第一AI模型,并对该第一模型配置在线学习所需的参数,使得可以在第一设备侧对该第一AI模型进行连续在线调整,从而维持第一AI模型的预测性能,进而保证第一设备的服务质量。In this embodiment of the present application, the first device acquires a first AI model, and performs online learning on the first AI model based on online learning information of the first AI model. With this method, by deploying the first AI model on the first device side and configuring the first model with parameters required for online learning, the first AI model can be continuously adjusted online on the first device side, thereby maintaining The predictive performance of the first AI model, thereby ensuring the service quality of the first device.
附图说明Description of drawings
图1是本申请实施例提供的无线通信系统的框图;FIG. 1 is a block diagram of a wireless communication system provided by an embodiment of the present application;
图2是本申请实施例提供的AI模型的在线学习方法的流程示意图之一;Fig. 2 is one of the schematic flow charts of the online learning method of the AI model provided by the embodiment of the present application;
图3是本申请实施例提供的AI模型的在线学习方法的流程示意图之二;Fig. 3 is the second schematic flow diagram of the online learning method of the AI model provided by the embodiment of the present application;
图4是本申请实施例提供的AI模型的在线学习装置的结构示意图之一;Fig. 4 is one of the structural schematic diagrams of the online learning device of the AI model provided by the embodiment of the present application;
图5是本申请实施例提供的AI模型的在线学习装置的结构示意图之二;Fig. 5 is the second structural schematic diagram of the online learning device of the AI model provided by the embodiment of the present application;
图6是本申请实施例提供通信设备的结构示意图;FIG. 6 is a schematic structural diagram of a communication device provided by an embodiment of the present application;
图7是本申请实施例提供的终端的硬件结构示意图;FIG. 7 is a schematic diagram of a hardware structure of a terminal provided by an embodiment of the present application;
图8是本申请实施例提供的网络侧设备的硬件结构示意图之一;FIG. 8 is one of the schematic diagrams of the hardware structure of the network side device provided by the embodiment of the present application;
图9是本申请实施例提供的网络侧设备的硬件结构示意图之二。FIG. 9 is the second schematic diagram of the hardware structure of the network side device provided by the embodiment of the present application.
具体实施方式Detailed ways
下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行清楚描述,显然,所描述的实施例是本申请一部分实施例,而不是全部的实施例。基于本申请中的实施例,本领域普通技术人员所获得的所有其他实施例,都属于本申请保护的范围。The technical solutions in the embodiments of the present application will be clearly described below in conjunction with the drawings in the embodiments of the present application. Obviously, the described embodiments are part of the embodiments of the present application, but not all of them. All other embodiments obtained by persons of ordinary skill in the art based on the embodiments in this application belong to the protection scope of this application.
本申请的说明书和权利要求书中的术语“第一”、“第二”等是用于区别类似的对象,而不用于描述特定的顺序或先后次序。应该理解这样使用的术语在适当情况下可以互换,以便本申请的实施例能够以除了在这里图示或描述的那些以外的顺序实施,且“第一”、“第二”所区别的对象通常为一类,并不限定对象的个数,例如第一对象可以是一个,也可以是多个。此外,说明书以及权利要求中“和/或”表示所连接对象的至少其中之一,字符“/”一般表示前后关联对象是一种“或”的关系。The terms "first", "second" and the like in the specification and claims of the present application are used to distinguish similar objects, and are not used to describe a specific sequence or sequence. It is to be understood that the terms so used are interchangeable under appropriate circumstances such that the embodiments of the application are capable of operation in sequences other than those illustrated or described herein and that "first" and "second" distinguish objects. It is usually one category, and the number of objects is not limited. For example, there may be one or more first objects. In addition, "and/or" in the description and claims means at least one of the connected objects, and the character "/" generally means that the related objects are an "or" relationship.
值得指出的是,本申请实施例所描述的技术不限于长期演进型(Long Term Evolution,LTE)/LTE的演进(LTE-Advanced,LTE-A)系统,还可用于其他无线通信系统,诸如码分多址(Code Division Multiple Access,CDMA)、时分多址(Time Division Multiple Access,TDMA)、频分多址(Frequency Division Multiple Access,FDMA)、正交频分多址(Orthogonal Frequency Division Multiple Access,OFDMA)、单载波频分多址(Single-carrier Frequency Division Multiple Access,SC-FDMA)和其他系统。本申请实施例中的术语“系统”和“网络”常被可互换地使用,所描述的技术既可用于以上提及的系统和无线电技术,也可用于其他系统和无线电技术。以下描述出于示例目的描述了新空口(New Radio,NR)系统,并且在以下大部分描述中使用NR术语,但是这些技术也可应用于NR系统应用以外的应用,如第6代(6th Generation,6G)通信系统。It is worth noting that the technology described in the embodiment of this application is not limited to the Long Term Evolution (Long Term Evolution, LTE)/LTE-Advanced (LTE-Advanced, LTE-A) system, and can also be used in other wireless communication systems, such as code Code Division Multiple Access (CDMA), Time Division Multiple Access (TDMA), Frequency Division Multiple Access (FDMA), Orthogonal Frequency Division Multiple Access, OFDMA), Single-carrier Frequency Division Multiple Access (Single-carrier Frequency Division Multiple Access, SC-FDMA) and other systems. The terms "system" and "network" in the embodiments of the present application are often used interchangeably, and the described technology can be used for the above-mentioned system and radio technology, and can also be used for other systems and radio technologies. The following description describes the New Radio (New Radio, NR) system for example purposes, and uses NR terminology in most of the following descriptions, but these techniques can also be applied to applications other than NR system applications, such as the 6th generation (6th Generation , 6G) communication system.
图1示出本申请实施例可应用的一种无线通信系统的框图。无线通信系统包括终端11和网络侧设备12。其中,终端11可以是手机、平板电脑(Tablet Personal Computer)、膝上型电脑(Laptop Computer)或称为笔记本电脑、个人数字助理(Personal Digital Assistant,PDA)、掌上电脑、上网本、超级移动个人计算机(ultra-mobile personal computer,UMPC)、移动上网装置(Mobile Internet Device,MID)、增强现实(augmented reality,AR)/虚拟现实(virtual reality,VR)设备、机器人、可穿戴式设备(Wearable Device)、车载设备(VUE)、行人终端(PUE)、智能家居(具有无线通信功能的家居设备,如冰箱、电视、洗衣机或者家具等)、游戏机、个人计算机(personal computer,PC)、柜员机或者自助机等终端侧设备,可穿戴式设备包括:智能手表、智能手环、智能耳机、智能眼镜、智能首饰(智能手镯、智能手链、智能戒指、智能项链、智能脚镯、智能脚链等)、智能腕带、智能服装等。需要说明的是,在本申请实施例并不限定终端11的具体类型。网络侧设备12可以包括接入网设备或核心网设备,其中,接入网设备12也可以称为无线接入网设备、无线接入网(Radio Access Network,RAN)、无线接入网功能或无线接入网单元。接入网设备12可以包括基站、WLAN接入点或WiFi节点等,基站可被称为节点B、演进节点B(eNB)、接入点、基收发机站(Base Transceiver Station,BTS)、无线电基站、无线电收发机、基本服务集(Basic Service Set,BSS)、扩展服务集(Extended Service Set,ESS)、家用B节点、家用演进型B节点、发送接收点(Transmitting Receiving Point,TRP)或所述领域中其他某 个合适的术语,只要达到相同的技术效果,所述基站不限于特定技术词汇,需要说明的是,在本申请实施例中仅以NR系统中的基站为例进行介绍,并不限定基站的具体类型。核心网设备可以包含但不限于如下至少一项:核心网节点、核心网功能、移动管理实体(Mobility Management Entity,MME)、接入移动管理功能(Access and Mobility Management Function,AMF)、会话管理功能(Session Management Function,SMF)、用户平面功能(User Plane Function,UPF)、策略控制功能(Policy Control Function,PCF)、策略与计费规则功能单元(Policy and Charging Rules Function,PCRF)、边缘应用服务发现功能(Edge Application Server Discovery Function,EASDF)、统一数据管理(Unified Data Management,UDM),统一数据仓储(Unified Data Repository,UDR)、归属用户服务器(Home Subscriber Server,HSS)、集中式网络配置(Centralized network configuration,CNC)、网络存储功能(Network Repository Function,NRF),网络开放功能(Network Exposure Function,NEF)、本地NEF(Local NEF,或L-NEF)、绑定支持功能(Binding Support Function,BSF)、应用功能(Application Function,AF)等。需要说明的是,在本申请实施例中仅以NR系统中的核心网设备为例进行介绍,并不限定核心网设备的具体类型。Fig. 1 shows a block diagram of a wireless communication system to which the embodiment of the present application is applicable. The wireless communication system includes a terminal 11 and a network side device 12 . Wherein, the terminal 11 can be a mobile phone, a tablet computer (Tablet Personal Computer), a laptop computer (Laptop Computer) or a notebook computer, a personal digital assistant (Personal Digital Assistant, PDA), a palmtop computer, a netbook, a super mobile personal computer (ultra-mobile personal computer, UMPC), mobile Internet device (Mobile Internet Device, MID), augmented reality (augmented reality, AR) / virtual reality (virtual reality, VR) equipment, robot, wearable device (Wearable Device) , vehicle equipment (VUE), pedestrian terminal (PUE), smart home (home equipment with wireless communication functions, such as refrigerators, TVs, washing machines or furniture, etc.), game consoles, personal computers (personal computers, PCs), teller machines or self-service Wearable devices include: smart watches, smart bracelets, smart headphones, smart glasses, smart jewelry (smart bracelets, smart bracelets, smart rings, smart necklaces, smart anklets, smart anklets, etc.), Smart wristbands, smart clothing, etc. It should be noted that, the embodiment of the present application does not limit the specific type of the terminal 11 . The network side device 12 may include an access network device or a core network device, where the access network device 12 may also be called a radio access network device, a radio access network (Radio Access Network, RAN), a radio access network function, or Wireless access network unit. The access network device 12 may include a base station, a WLAN access point, or a WiFi node, etc., and the base station may be called a node B, an evolved node B (eNB), an access point, a base transceiver station (Base Transceiver Station, BTS), a radio Base station, radio transceiver, Basic Service Set (BSS), Extended Service Set (ESS), Home Node B, Home Evolved Node B, Transmitting Receiving Point (TRP) or all other in the field An appropriate term, as long as the same technical effect is achieved, the base station is not limited to a specific technical vocabulary. It should be noted that in the embodiment of this application, only the base station in the NR system is used as an example to introduce, and the specific details of the base station are not limited. type. The core network equipment may include but not limited to at least one of the following: core network node, core network function, mobility management entity (Mobility Management Entity, MME), access mobility management function (Access and Mobility Management Function, AMF), session management function (Session Management Function, SMF), user plane function (User Plane Function, UPF), policy control function (Policy Control Function, PCF), policy and charging rules function unit (Policy and Charging Rules Function, PCRF), edge application service Discovery function (Edge Application Server Discovery Function, EASDF), unified data management (Unified Data Management, UDM), unified data storage (Unified Data Repository, UDR), home subscriber server (Home Subscriber Server, HSS), centralized network configuration ( Centralized network configuration, CNC), network storage function (Network Repository Function, NRF), network exposure function (Network Exposure Function, NEF), local NEF (Local NEF, or L-NEF), binding support function (Binding Support Function, BSF), Application Function (Application Function, AF), etc. It should be noted that, in the embodiment of the present application, only the core network equipment in the NR system is used as an example for introduction, and the specific type of the core network equipment is not limited.
下面对本发明实施例中所涉及的部分术语进行解释:Some terms involved in the embodiments of the present invention are explained below:
(1)人工智能(Artificial Intelligence,AI):人工智能是包括十分广泛的科学,它由不同的领域组成,如机器学习,计算机视觉等等。(1) Artificial Intelligence (AI): Artificial intelligence is a very broad science, which consists of different fields, such as machine learning, computer vision and so on.
(2)机器学习(Machine Learning,ML):机器学习是人工智能的一个重要分支,主要研究如何让计算机具有能够自我学习的能力。机器学习的算法包括神经网络(neural network,NN)决策树(decision tree,DT)、支持向量机(support vector machine,SVM)、遗传算法(genetic algorithm,GA)等等。(2) Machine Learning (Machine Learning, ML): Machine learning is an important branch of artificial intelligence, which mainly studies how to make computers have the ability to learn by themselves. Machine learning algorithms include neural network (neural network, NN) decision tree (decision tree, DT), support vector machine (support vector machine, SVM), genetic algorithm (genetic algorithm, GA) and so on.
(3)神经网络:神经网络由大量节点组成,这些节点称为神经元。其中,神经元的构成信息包括:输入(a1,a2,…aK)权值/乘性系数(w),为偏置/加性系数(b),激活函数(σ(.))。常见的激活函数包括Sigmoid、tanh、ReLU(Rectified Linear Unit),线性整流函数,修正线性单元)等等。(3) Neural network: A neural network consists of a large number of nodes, which are called neurons. Among them, the composition information of neurons includes: input (a1, a2,...aK) weight/multiplicative coefficient (w), bias/additive coefficient (b), activation function (σ(.)). Common activation functions include Sigmoid, tanh, ReLU (Rectified Linear Unit), linear rectification function, corrected linear unit) and so on.
进一步地,神经网络的参数可以通过梯度优化算法进行优化。梯度优化算法是一类最小化或者最大化目标函数(有时候也叫损失函数)的算法,而目标函数往往是模型参数和数据的数学组合。例如给定数据X和其对应的标签Y,在构建一个神经网络模型f(.),有了模型后,根据输入x就可以得到预测输出f(x),并且可以计算出预测值和真实值之间的差距(f(x)-Y),即损失函数。若找到合适的W,b使上述的损失函数的值达到最小,损失值越小,则说明模型越接近于真实情况。Further, the parameters of the neural network can be optimized by gradient optimization algorithm. The gradient optimization algorithm is a class of algorithms that minimize or maximize an objective function (sometimes called a loss function), and the objective function is often a mathematical combination of model parameters and data. For example, given data X and its corresponding label Y, after constructing a neural network model f(.), with the model, the predicted output f(x) can be obtained according to the input x, and the predicted value and the real value can be calculated The gap between (f(x)-Y), which is the loss function. If a suitable W,b is found to minimize the value of the above loss function, the smaller the loss value, the closer the model is to the real situation.
示例地,目前常见的优化算法,通常都是基于BP(error Back Propagation,误差反向传播)算法。BP算法的基本思想是,学习过程由信号的正向传播与误差的反向传播两个过程组成。正向传播时,输入样本从输入层传入,经各隐层逐层处理后,传向输出层。若输出层的实际输出与期望的输出不符,则转入误差的反向传播阶段。误差反传是将输出误差以某种形式通过隐层向输入层逐层反传,并将误差分摊给各层的所有单元,从而获得各层单元的误差信号,此误差信号即作为修正各单元权值的依据。这种信号正向传播与误差反向传播的各层权值调整过程,是周而复始地进行的。权值不断调整的过程,也就是网络的学习训练过程。此过程一直进行到网络输出的误差减少到可接受的程度,或进行到预先设定的学习次数为止。For example, currently common optimization algorithms are usually based on BP (error Back Propagation, error back propagation) algorithm. The basic idea of the BP algorithm is that the learning process consists of two processes: the forward propagation of the signal and the back propagation of the error. During forward propagation, the input samples are passed in from the input layer, processed layer by layer by each hidden layer, and passed to the output layer. If the actual output of the output layer does not match the expected output, it will enter the error backpropagation stage. Error backpropagation is to transmit the output error layer by layer through the hidden layer to the input layer in some form, and distribute the error to all the units of each layer, so as to obtain the error signal of each layer unit, and this error signal is used as the correction unit Basis for weight. This weight adjustment process of each layer of signal forward propagation and error back propagation is carried out repeatedly. The process of continuously adjusting the weights is also the learning and training process of the network. This process has been carried out until the error of the network output is reduced to an acceptable level, or until the preset number of learning times.
示例地,常见的优化算法有梯度下降(Gradient Descent)、随机梯度下降(Stochastic Gradient Descent,SGD)、小批量梯度下降(mini-batch gradient descent)、动量法(Momentum)、带动量的随机梯度下降(Nesterov)、自适应梯度下降(ADAptive GRADient descent,Adagrad)、Adadelta、均方根误差降速(root mean square prop,RMSprop)、自适应动量估计(Adaptive Moment Estimation,Adam)等。For example, common optimization algorithms include gradient descent (Gradient Descent), stochastic gradient descent (Stochastic Gradient Descent, SGD), mini-batch gradient descent (mini-batch gradient descent), momentum method (Momentum), stochastic gradient descent with momentum (Nesterov), adaptive gradient descent (ADAptive GRADient descent, Adagrad), Adadelta, root mean square error deceleration (root mean square prop, RMSprop), adaptive momentum estimation (Adaptive Moment Estimation, Adam), etc.
示例地,以上优化算法在误差反向传播时,都是根据损失函数得到的误差/损失,对当前神经元求导数/偏导,加上学习速率、之前的梯度/导数/偏导等影响,得到梯度,将梯度传给上一层。As an example, the above optimization algorithm is based on the error/loss obtained by the loss function when the error is backpropagated, and calculates the derivative/partial derivative of the current neuron, plus the learning rate, the previous gradient/derivative/partial derivative, etc., Get the gradient and pass the gradient to the previous layer.
下面结合附图,通过一些实施例及其应用场景对本申请实施例提供的在线学习方法进行详细地说明。The online learning method provided by the embodiment of the present application will be described in detail below through some embodiments and application scenarios with reference to the accompanying drawings.
当前无线通信领域对的AI研究主要集中来离线学习和部署,而由于无线环境是不断变化的,通过离线训练得到的固定AI模型会在动态环境中逐渐失效,如何提升模型在变化的新的环境之中的适应能力,成为亟待解决的问题。The current AI research in the field of wireless communication mainly focuses on offline learning and deployment. Since the wireless environment is constantly changing, the fixed AI model obtained through offline training will gradually fail in the dynamic environment. How to improve the model in the new changing environment The adaptability among them has become an urgent problem to be solved.
本申请为例解决上述问题,提出一种AI模型的在线学习方法。进一步地,实现在线学习有以下难点:1)受限于设备的存储能力和数据收集能力(如,收集数据的时间成本和硬件成本比较高),通常难以获取足够大量的数据集进行在线训练;2)受限于设备的计算能力和有限的数据集,可能无法进行多轮的模型微调或经过多轮模型微调后会导致过拟合;3)对于无线通信,还存在通信时延限制以及通信的连续性的问题,这对第一设备数据采集的时间和在线学习的时间提出了要求。This application is an example to solve the above problems, and proposes an online learning method for AI models. Further, the realization of online learning has the following difficulties: 1) limited by the storage capacity and data collection capacity of the device (for example, the time cost and hardware cost of collecting data are relatively high), it is usually difficult to obtain a large enough data set for online training; 2) Limited by the computing power of the device and the limited data set, it may not be possible to perform multiple rounds of model fine-tuning or over-fitting will result after multiple rounds of model fine-tuning; 3) For wireless communication, there are also communication delay limitations and communication The problem of continuity, which puts forward requirements on the time of the first equipment data collection and the time of online learning.
图2示出了本申请实施例提供的一种AI模型的在线学习方法的流程图。如图2所示,本申请实施 例提供的AI模型的在线学习方法可以包括如下步骤201和步骤202:FIG. 2 shows a flow chart of an online learning method for an AI model provided by an embodiment of the present application. As shown in Figure 2, the implementation of this application The online learning method of the AI model provided by the example may include the following steps 201 and 202:
步骤201:第二设备为第一设备配置第一AI模型。Step 201: the second device configures the first AI model for the first device.
在本申请实施例中,上述第二设备可以包括以下至少之一:核心网设备,接入网设备,终端;上述第一设备可以包括以下至少之一:核心网设备,接入网设备,终端。In the embodiment of the present application, the above-mentioned second device may include at least one of the following: core network equipment, access network equipment, and terminal; the above-mentioned first device may include at least one of the following: core network equipment, access network equipment, and terminal .
示例性地,第二设备为核心网设备,相应的,第一设备可以为接入网设备或终端。Exemplarily, the second device is a core network device, and correspondingly, the first device may be an access network device or a terminal.
示例性地,第二设备为接入网设备,相应的,第一设备可以为核心网设备或终端。Exemplarily, the second device is an access network device, and correspondingly, the first device may be a core network device or a terminal.
示例性地,第二设备为终端,相应的,第一设备可以为核心网设备或接入网设备。Exemplarily, the second device is a terminal, and correspondingly, the first device may be a core network device or an access network device.
在本申请实施例中,上述第一AI模型为在第二设备侧离线训练得到的AI模型。In the embodiment of the present application, the above-mentioned first AI model is an AI model obtained by offline training on the second device side.
可选地,在本申请实施例中,上述第一AI模型的算法可以包括以下至少之一:神经网络、决策树、支持向量机、贝叶斯分类器。Optionally, in the embodiment of the present application, the algorithm of the first AI model may include at least one of the following: a neural network, a decision tree, a support vector machine, and a Bayesian classifier.
可选地,在本申请实施例中,上述第一AI模型可以为用于终端定位、网络优化、大型输入数据集处理、以及为用户进行网络推荐的AI模型。Optionally, in this embodiment of the present application, the above-mentioned first AI model may be an AI model used for terminal positioning, network optimization, processing of large input data sets, and network recommendation for users.
可选地,在本申请实施例中,第二设备可以基于预设的学习框架对AI模型进行训练,以得到上述第一AI模型。示例性地,以第一AI模型为神经网络模型为例。第二设备可以基于预设的学习框架对神经网络模型进行训练,以得到第一神经网络模型。Optionally, in this embodiment of the present application, the second device may train the AI model based on a preset learning framework, so as to obtain the above-mentioned first AI model. Exemplarily, the first AI model is a neural network model as an example. The second device can train the neural network model based on a preset learning framework to obtain the first neural network model.
可选地,在本申请实施例中,第二设备可以将训练得到的第一AI模型发送至第一设备,并在第一设备侧部署该第一AI模型。Optionally, in this embodiment of the present application, the second device may send the trained first AI model to the first device, and deploy the first AI model on the side of the first device.
步骤202:第二设备为第一设备配置第一AI模型的在线学习信息。Step 202: the second device configures online learning information of the first AI model for the first device.
在本申请实施例中,第二设备可以将第一AI模型在线学习所需的信息发送至第一设备,第一设备在接收到第二设备发送的在线学习信息的情况下,可以基于该在线学习信息对第一AI模型进行在线学习。In this embodiment of the application, the second device can send the information required for the online learning of the first AI model to the first device, and the first device can, based on the online learning information sent by the second device, The learning information performs online learning on the first AI model.
在本申请实施例中,所述在线学习信息为网络设备配置的,或者由第二设备自主确定的。In this embodiment of the present application, the online learning information is configured by the network device, or determined independently by the second device.
可选地,在本申请实施例中,上述第一设备可以为终端,第二设备可以为核心网设备。Optionally, in this embodiment of the present application, the foregoing first device may be a terminal, and the second device may be a core network device.
示例性地,核心网设备将第一AI模型发送至终端,以及将第一AI模型的在线学习信息发送至终端,终端接收核心网设备发送的第一AI模型,以及第一AI模型的在线学习信息。Exemplarily, the core network device sends the first AI model to the terminal, and sends the online learning information of the first AI model to the terminal, and the terminal receives the first AI model sent by the core network device, and the online learning information of the first AI model information.
在本申请实施例提供的在线学习方法中,第二设备为第一设备配置第一AI模型,以及为该第一设备配置上述第一AI模型的在线学习信息。通过该方法,通过在第一设备侧部署第一AI模型,并对该第一模型配置在线学习所需的参数,使得可以在第一设备侧对该第一AI模型进行连续在线调整,从而维持第一AI模型的预测性能,进而保证第一设备的服务质量。In the online learning method provided in the embodiment of the present application, the second device configures the first AI model for the first device, and configures the online learning information of the first AI model for the first device. With this method, by deploying the first AI model on the first device side and configuring the first model with parameters required for online learning, the first AI model can be continuously adjusted online on the first device side, thereby maintaining The predictive performance of the first AI model, thereby ensuring the service quality of the first device.
可选地,在本申请实施例中,上述在线学习信息包括以下至少之一:Optionally, in this embodiment of the application, the online learning information includes at least one of the following:
在线学习的触发方式;How eLearning is triggered;
在线学习的中止条件;Conditions for discontinuation of online learning;
在线学习的参数配置信息;Parameter configuration information for online learning;
在线学习的数据集。Datasets for online learning.
示例性地,上述触发方式与第一设备的状态信息以及第一设备相关信道的信道信息相关。例如,在第一设备的移动速度较快的情况下,或者在第一设备工作信道的信道环境发生变化的情况下,可以触发对第一AI模型进行在线学习,以使得第一AI模型不断适应变化的环境。Exemplarily, the above-mentioned triggering manner is related to the state information of the first device and the channel information of a channel related to the first device. For example, when the moving speed of the first device is fast, or when the channel environment of the working channel of the first device changes, online learning of the first AI model may be triggered, so that the first AI model continuously adapts to changing environment.
示例性地,第一设备可以在第一AI模型的在线学习次数大于预设迭代次数的情况下,中止对第一AI模型进行在线学习;或者,在第一AI模型达到预设精度的情况下,中止对第一AI模型进行在线学习;或者,在第一AI模型的输出结果的误差信息较小的情况下,中止对第一AI模型进行在线学习。由于第一AI模型的在线学习次数大于预设迭代次数,或者第一AI模型的精度达到预设精度,表征当前的第一AI模型具备有效性,则可以终止对第一AI模型的在线学习,以节省功耗。Exemplarily, the first device may suspend the online learning of the first AI model when the number of online learning of the first AI model is greater than the preset number of iterations; or, when the first AI model reaches the preset accuracy , suspending the online learning of the first AI model; or, in the case that the error information of the output result of the first AI model is small, suspending the online learning of the first AI model. Since the number of online learning of the first AI model is greater than the preset number of iterations, or the accuracy of the first AI model reaches the preset accuracy, indicating that the current first AI model is valid, the online learning of the first AI model can be terminated, to save power consumption.
可选地,在本申请实施例中,上述触发方式对应的触发条件包括以下至少之一:Optionally, in this embodiment of the application, the trigger condition corresponding to the above trigger mode includes at least one of the following:
上述第一设备的状态信息满足第一预设条件;The state information of the above-mentioned first device satisfies a first preset condition;
上述第一设备采集到的数据量大于第一阈值;The amount of data collected by the first device is greater than a first threshold;
上述第一设备的测量信息满足第二预设条件;The measurement information of the above-mentioned first device satisfies a second preset condition;
上述第一AI模型的输出结果的误差信息大于第二阈值;The error information of the output result of the first AI model is greater than the second threshold;
上述第一AI模型对应的第一信息的统计信息满足第三预设条件;The statistical information of the first information corresponding to the above-mentioned first AI model satisfies a third preset condition;
上述第一设备的测量信息的统计信息满足第四预设条件;The above-mentioned statistical information of the measurement information of the first device satisfies a fourth preset condition;
可选地,上述状态信息包括以下至少一项:移动速度、波束切换信息、小区切换信息。Optionally, the above status information includes at least one of the following: moving speed, beam switching information, and cell switching information.
可选地,上述第一设备采集到的数据量可以为第一设备在线数据采集数量,如,第一设备实时采集 到的信道信息。Optionally, the amount of data collected by the above-mentioned first device may be the amount of online data collected by the first device, for example, the first device collects in real time received channel information.
示例性地,在第一设备快速移动的情况下,触发第一设备对第一AI模型进行在线学习;或者,在第一设备采集的数据量较大的情况下,触发第一设备对第一AI模型进行在线学习;或者,在第一设备的测量信息指示当前信道的信道环境发生变化的情况下,触发第一设备对第一AI模型进行在线学习;或者,在第一AI模型的输出结果的误差较大或者第一AI模型的精度较低的情况下,触发第一设备对第一AI模型进行在线学习。由于终端快速移动或者信道环境发生变化时,可能会导致第一AI模型失效。如此,使得第一设备可以基于自身的移动速度、相关信道的信道环境以及AI模型的精度值等信息,在第一AI模型失效的情况下对该第一AI模型进行在线学习,从而提高第一AI模型的预测精度。For example, when the first device is moving fast, the first device is triggered to learn the first AI model online; or, when the amount of data collected by the first device is large, the first device is triggered to learn the first AI model. The AI model performs online learning; or, when the measurement information of the first device indicates that the channel environment of the current channel changes, the first device is triggered to perform online learning on the first AI model; or, when the output result of the first AI model When the error of the first AI model is relatively large or the accuracy of the first AI model is low, the first device is triggered to perform online learning on the first AI model. When the terminal moves rapidly or the channel environment changes, the first AI model may fail. In this way, the first device can learn the first AI model online based on information such as its own moving speed, the channel environment of the relevant channel, and the accuracy value of the AI model when the first AI model fails, thereby improving the first AI model. The prediction accuracy of the AI model.
示例性地,上述信道信息可以包括以下至少之一:信号的发射角角度信息,信号的到达角角度信息,信道中信号的时延信息,信道中的信号质量等等。Exemplarily, the channel information may include at least one of the following: signal launch angle information, signal arrival angle angle information, signal delay information in the channel, signal quality in the channel, and so on.
可选地,上述第一阈值可以为3000,5000或者7000等等。Optionally, the above-mentioned first threshold may be 3000, 5000 or 7000 and so on.
可选地,上述测量信息包括以下至少之一:上述第一设备接收的参考信号的第一测量信息,上述第一设备的传感器采集的第二测量信息。示例性地,上述第一测量信息包括以下至少之一:参考信号的瞬时测量信息,参考信号的统计测量信息。示例性地,上述参考信号的瞬时测量信息可以为:在某一特定时刻的参考信号的测量信息;上述参考信号的统计测量信息可以为:在一段时间内的参考信号的测量信息。Optionally, the measurement information includes at least one of the following: first measurement information of a reference signal received by the first device, and second measurement information collected by a sensor of the first device. Exemplarily, the first measurement information includes at least one of the following: instantaneous measurement information of the reference signal, and statistical measurement information of the reference signal. Exemplarily, the instantaneous measurement information of the reference signal may be: measurement information of the reference signal at a specific moment; the statistical measurement information of the reference signal may be: measurement information of the reference signal within a period of time.
示例性地,上述参考信号包括以下至少之一:同步信号块SSB、CSI参考信号CSI-RS、探测参考信号SRS、定位参考信号PRS。可选地,上述传感器可以包括以下至少之一:视觉传感器、雷达传感器、位置传感器等等。Exemplarily, the aforementioned reference signal includes at least one of the following: a synchronization signal block SSB, a CSI reference signal CSI-RS, a sounding reference signal SRS, and a positioning reference signal PRS. Optionally, the aforementioned sensors may include at least one of the following: vision sensors, radar sensors, position sensors and the like.
可选地,上述第一信息包括以下至少之一:上述第一AI模型的输入信息,上述第一AI模型的输出信息。Optionally, the above-mentioned first information includes at least one of the following: input information of the above-mentioned first AI model, and output information of the above-mentioned first AI model.
示例性地,以第一设备为终端为例,在第一信息包括第一AI模型的输入信息的情况下,该第一信息可以为终端的工作信道或者周边信道的信道信息,在第一信息包括第一AI模型的输出信息的情况下,上述第一信息可以为终端的位置信息。Exemplarily, taking the first device as a terminal as an example, in the case where the first information includes the input information of the first AI model, the first information may be the working channel or the channel information of the surrounding channel of the terminal. In the first information In the case where the output information of the first AI model is included, the first information may be location information of the terminal.
可选地,在本申请实施例中,上述第一信息的统计信息包括以下至少之一:Optionally, in this embodiment of the present application, the statistical information of the above-mentioned first information includes at least one of the following:
第一时间窗内上述第一信息的第一统计量,至少两个连续的第二时间窗内上述第一信息对应的第二统计量,第一小区下的至少两个终端在第一时刻的第一信息的统计信息,以及,上述第一信息的相关性信息。The first statistic of the above-mentioned first information in the first time window, the second statistic corresponding to the above-mentioned first information in at least two consecutive second time windows, and the at least two terminals under the first cell at the first moment Statistical information of the first information, and correlation information of the above-mentioned first information.
其中,上述第二统计量是基于各个第二时间窗内的统计量计算出的。Wherein, the above-mentioned second statistic is calculated based on the statistic in each second time window.
可选地,上述统计信息可以包括以下至少之一;均值,方差等等。Optionally, the above statistical information may include at least one of the following: mean value, variance and so on.
可选地,上述统计信息可以包括时间上的统计信息和空间上的统计信息。例如,时间上的统计信息可以为:同一终端在一段连续时间内的信道的统计信息,空间上的统计信息可以为一个小区下多个不同终端的信道的统计信息。Optionally, the foregoing statistical information may include temporal statistical information and spatial statistical information. For example, the statistical information on time may be: statistical information on channels of the same terminal within a continuous period of time, and the statistical information on space may be statistical information on channels of multiple different terminals under one cell.
示例性地,上述第一信息的统计信息用于表征第一AI模型的作用区域内的无线网络环境是否发生变化。Exemplarily, the statistical information of the above-mentioned first information is used to represent whether the wireless network environment within the action area of the first AI model changes.
示例性地,以第一信息为信道信息为例。在某一连续时间窗内针对信道信息的均值小于某一阈值,或者在两个时间窗内的信道信息的相关性指标低于某一阈值,或者,当前时间窗内的前后数据相关性低于某一阈值的情况下,表征第一设备相关信道的信道环境发生变化,则可以触发第一设备对第一AI模型进行在线学习,以适应当前的信道环境,从而提升第一AI模型的预测精度。Exemplarily, the first information is channel information as an example. The average value of channel information in a certain continuous time window is less than a certain threshold, or the correlation index of channel information in two time windows is lower than a certain threshold, or, the correlation between the front and rear data in the current time window is lower than In the case of a certain threshold, if the channel environment representing the relevant channel of the first device changes, the first device can be triggered to perform online learning of the first AI model to adapt to the current channel environment, thereby improving the prediction accuracy of the first AI model .
以下以第一信息为信道信息为例,对上述第一信息的统计信息进行解释说明。The statistical information of the above-mentioned first information is explained below by taking the first information as channel information as an example.
示例性地,在第一信息的统计信息包括第一时间窗内上述第一信息的第一统计量的情况下,上述第一信息的统计信息可以为:同一终端在某一连续时间窗内的信道信息的均值或者方差。Exemplarily, when the statistical information of the first information includes the first statistical quantity of the first information in the first time window, the statistical information of the first information may be: The mean or variance of the channel information.
示例性地,在第一信息的统计信息包括至少两个连续的第二时间窗内上述第一信息对应的第二统计量的情况下,上述第一信息的统计信息可以为:基于终端在多个连续时间窗内的每个连续时间窗内信道信息的均值确定的均值,如,时间窗1内信道信息的均值为a,时间窗2内信道信息的均值为b,时间窗3内信道信息的均值为c,则信道信息的统计信息为a,b和c的均值。Exemplarily, when the statistical information of the first information includes the second statistical quantity corresponding to the above-mentioned first information in at least two consecutive second time windows, the above-mentioned statistical information of the first information may be: The mean value determined by the mean value of the channel information in each consecutive time window in consecutive time windows, for example, the mean value of the channel information in time window 1 is a, the mean value of channel information in time window 2 is b, and the mean value of channel information in time window 3 is The mean value of is c, then the statistical information of the channel information is the mean value of a, b and c.
示例性地,在第一信息的统计信息包括第一小区下的至少两个终端在第一时刻的第一信息的统计信息的情况下,上述第一信息的统计信息可以为:一个小区下多个不同终端在某一时刻的信道信息的均值,如,同一小区下的终端A,终端B和终端C在时刻1的信道信息的均值分别为d,e和f,则信道信息的统计信息为d,e和f的均值。Exemplarily, when the statistical information of the first information includes the statistical information of the first information of at least two terminals under the first cell at the first moment, the statistical information of the above-mentioned first information may be: The mean value of the channel information of different terminals at a certain moment, for example, the mean values of the channel information of terminal A, terminal B and terminal C in the same cell at time 1 are d, e and f respectively, then the statistical information of the channel information is Means of d, e and f.
示例性地,在第一信息的统计信息包括第一信息的相关性信息的情况下,上述第一信息的统计信息 可以为:两个时间窗的信道信息的相关性指标低于某一阈值,如,当前时间窗内的前后数据之间的距离小于某一阈值。Exemplarily, when the statistical information of the first information includes the correlation information of the first information, the statistical information of the above-mentioned first information It may be: the correlation index of the channel information of the two time windows is lower than a certain threshold, for example, the distance between the previous and subsequent data in the current time window is smaller than a certain threshold.
可选地,在本申请实施例中,上述第一设备的测量信息的统计信息包括以下至少之一:Optionally, in this embodiment of the present application, the statistical information of the measurement information of the first device includes at least one of the following:
在第三时间窗内针对上述测量信息的第三统计量,上述测量信息在至少两个连续的第四时间窗对应的第四统计量,以及,上述测量信息的相关性信息。The third statistic of the measurement information in the third time window, the fourth statistic of the measurement information corresponding to at least two consecutive fourth time windows, and the correlation information of the measurement information.
其中,上述第四统计量是基于各个第四时间窗内的统计量计算出的。Wherein, the above fourth statistic is calculated based on the statistic in each fourth time window.
可选地,上述相关性信息包括以下至少之一:数据之间的距离,协方差,相关系数。Optionally, the above correlation information includes at least one of the following: distance between data, covariance, and correlation coefficient.
示例性地,以测量信息为第一设备接收的参考信号的信道测量信息为例。上述测量信息的统计信息可以为:信道测量信息在某一连续时间窗内的方差,或者,两个连续的时间窗内的信道测量信息的均值的均值,或者为当前时间窗内的信道测量信息的数据之间的距离。Exemplarily, it is taken that the measurement information is channel measurement information of a reference signal received by the first device as an example. The statistical information of the above measurement information may be: the variance of the channel measurement information in a certain continuous time window, or the average of the mean values of the channel measurement information in two consecutive time windows, or the channel measurement information in the current time window distance between the data.
示例性地,上述测量信息的统计信息用于表征第一AI模型的作用区域内的无线网络环境是否发生变化。Exemplarily, the above statistical information of the measurement information is used to represent whether the wireless network environment within the active area of the first AI model changes.
示例性地,以测量信息为信道信息为例。在某一连续时间窗内针对测量信息的均值小于某一阈值,或者在两个时间窗内的测量信息的相关性指标低于某一阈值,或者,当前时间窗内的前后数据相关性低于某一阈值的情况下,表征第一AI模型作用的无线环境发生变化,则可以触发第一设备对第一AI模型进行在线学习,以适应当前的无线环境,从而提升第一AI模型的预测精度。Exemplarily, the measurement information is channel information as an example. The mean value of the measurement information in a continuous time window is less than a certain threshold, or the correlation index of the measurement information in two time windows is lower than a certain threshold, or, the correlation between the front and rear data in the current time window is lower than In the case of a certain threshold, if the wireless environment representing the role of the first AI model changes, the first device can be triggered to perform online learning of the first AI model to adapt to the current wireless environment, thereby improving the prediction accuracy of the first AI model .
进一步可选地,在本申请实施例中,上述触发条件包括:上述第一设备的状态信息满足第一预设条件;上述满足第一预设条件包括以下至少之一:Further optionally, in the embodiment of the present application, the trigger condition includes: the status information of the first device meets a first preset condition; the satisfaction of the first preset condition includes at least one of the following:
上述第一设备的移动速度大于第三阈值;The moving speed of the first device is greater than a third threshold;
上述波束切换信息指示第一设备发生波束切换、且波束切换频率大于第四阈值;The beam switching information indicates that beam switching occurs to the first device, and the beam switching frequency is greater than a fourth threshold;
上述小区切换信息指示第一设备发生小区切换。The above cell switching information indicates that the cell switching occurs to the first device.
示例性地,上述第三阈值可以为60km/h,80km/h,100km/等等。Exemplarily, the above-mentioned third threshold may be 60km/h, 80km/h, 100km/h and so on.
进一步可选地,在本申请实施例中,上述触发条件包括:上述第一设备的测量信息满足第二预设条件;可选地,上述满足第二预设条件包括:第一设备的测量信息指示第一设备的相关信道的信道环境发生变化。Further optionally, in the embodiment of the present application, the above-mentioned trigger condition includes: the measurement information of the above-mentioned first device meets the second preset condition; optionally, the above-mentioned meeting the second preset condition includes: the measurement information of the first device It indicates that the channel environment of the relevant channel of the first device changes.
示例性地,以测量信息为第一设备接收的参考信号为例。上述第二预设条件可以为:第一设备根据CSI-RS的测量估计下行信道,检测到信道环境发生变化,如由视距(line of sight,LOS)环境变成非视距(not line of sight,NLOS)环境,如信噪比SINR低于一定阈值等等。Exemplarily, it is taken that the measurement information is a reference signal received by the first device as an example. The above-mentioned second preset condition may be: the first device estimates the downlink channel according to the measurement of CSI-RS, and detects that the channel environment changes, such as changing from a line of sight (LOS) environment to a non-line of sight (not line of sight) environment. sight, NLOS) environment, such as the signal-to-noise ratio SINR is lower than a certain threshold and so on.
示例性地,以测量信息为第一设备的传感器采集的测量信息为例。上述第二预设条件可以为:通过视觉传感器得到测量信息指示第一设备处于LOS环境下。Exemplarily, it is taken that the measurement information is the measurement information collected by the sensor of the first device as an example. The above-mentioned second preset condition may be: the measurement information obtained by the visual sensor indicates that the first device is in an LOS environment.
进一步可选地,在本申请实施例中,上述触发条件包括:上述第一AI模型对应的第一信息的统计信息满足第三预设条件;上述满足第三预设条件包括以下至少之一:Further optionally, in the embodiment of the present application, the above-mentioned trigger conditions include: the statistical information of the first information corresponding to the above-mentioned first AI model satisfies a third preset condition; the above-mentioned meeting the third preset condition includes at least one of the following:
上述第一统计量大于第一阈值区间的最大值;The above-mentioned first statistic is greater than the maximum value of the first threshold interval;
上述第二统计量大于第二阈值区间的最大值;The above-mentioned second statistic is greater than the maximum value of the second threshold interval;
在至少两个时间窗内采集的第一信息的相关性信息满足第一条件;The correlation information of the first information collected in at least two time windows satisfies the first condition;
在当前时间窗内采集的不同第一信息之间的相关性信息满足第二条件。Correlation information between different first pieces of information collected within the current time window satisfies the second condition.
示例性地,以第一统计量为终端在某一连续时间窗内的信道信息的均值为例。上述第三预设条件可以为:第一设备检测到某一连续时间窗内的信道信息的统计量超过某一阈值区间的最大值。Exemplarily, it is assumed that the first statistic is an average value of channel information of the terminal within a certain continuous time window. The foregoing third preset condition may be: the first device detects that the statistics of channel information in a certain continuous time window exceed the maximum value of a certain threshold interval.
示例性地,以第二统计量为终端在多个连续时间窗内的信道信息的均值的均值为例。上述第三预设条件可以为:第一设备检测到多个连续时间窗内信道信息的均值的均值超过某一阈值区间的最大值。Exemplarily, the second statistic is an average value of average values of channel information of the terminal in multiple consecutive time windows as an example. The foregoing third preset condition may be: the first device detects that the average value of the average values of the channel information in multiple consecutive time windows exceeds the maximum value of a certain threshold interval.
示例性地,以第一信息为信道信息为例。上述第三预设条件可以为:两个时间窗内的信道信息的相关性指标低于某一阈值,或者,当前时间窗内的前后数据相关性低于某一阈值。Exemplarily, the first information is channel information as an example. The above-mentioned third preset condition may be: the correlation index of the channel information in the two time windows is lower than a certain threshold, or the correlation of the previous and subsequent data in the current time window is lower than a certain threshold.
进一步可选地,在本申请实施例中,上述触发条件包括:上述第一设备的测量信息的统计信息满足第四预设条件;上述满足第四预设条件包括以下至少之一:Further optionally, in the embodiment of the present application, the trigger condition includes: the statistical information of the measurement information of the first device satisfies a fourth preset condition; the satisfaction of the fourth preset condition includes at least one of the following:
上述第三统计量大于第三阈值区间的最大值;The above third statistic is greater than the maximum value of the third threshold interval;
上述第四统计量大于第四阈值区间的最大值;The above fourth statistic is greater than the maximum value of the fourth threshold interval;
在至少两个时间窗内采集的测量信息的相关性信息满足第三条件;The correlation information of the measurement information collected in at least two time windows satisfies the third condition;
在当前时间窗内采集的不同测量信息之间的相关性信息满足第四条件;Correlation information between different measurement information collected in the current time window satisfies the fourth condition;
上述测量信息的分布与基准分布的差异大于第五阈值,上述基准分布为第二设备为第一设备配置的信息。The difference between the distribution of the measurement information and the reference distribution is greater than a fifth threshold, and the reference distribution is information configured by the second device for the first device.
示例性地,在第三统计量为信道测量信息在某一连续时间窗内的均值的情况下,上述第四预设条件 可以为:信道测量信息在某一连续时间窗内的均值超过某一阈值区间的最大值。Exemplarily, in the case where the third statistic is the mean value of the channel measurement information within a certain continuous time window, the above-mentioned fourth preset condition It may be: the mean value of the channel measurement information in a certain continuous time window exceeds the maximum value of a certain threshold interval.
示例性地,在第四统计量为基于各个第四时间窗内信道测量信息的均值计算得到的均值的情况下,上述第四预设条件可以为:基于各个第四时间窗内信道测量信息的均值计算得到的均值超过某一阈值区间的最大值。Exemplarily, in the case where the fourth statistic is an average value calculated based on the average value of the channel measurement information in each fourth time window, the above fourth preset condition may be: based on the average value of the channel measurement information in each fourth time window The mean calculated by the mean exceeds the maximum value of a certain threshold interval.
示例性地,上述第三条件可以为:在至少两个时间窗内分别采集的测量信息的数据的协方差小于某一阈值。Exemplarily, the above third condition may be: the covariance of the data of the measurement information respectively collected in at least two time windows is smaller than a certain threshold.
示例性地,上述第四条件可以为:在当前时间窗内采集的测量信息的不同数据之间的距离小于某一阈值。Exemplarily, the above fourth condition may be: the distance between different data of the measurement information collected within the current time window is smaller than a certain threshold.
示例性地,上述测量信息的分布为测量信息的统计分布。示例性地,上述基准分布为第一AI模型的统计分布。可以理解的是,第一AI模型离线训练时的训练集服从基准分布,第一AI模型在测量信息服从基准分布的时候性能最好。Exemplarily, the above-mentioned distribution of the measurement information is a statistical distribution of the measurement information. Exemplarily, the above reference distribution is the statistical distribution of the first AI model. It can be understood that the training set for offline training of the first AI model obeys the benchmark distribution, and the performance of the first AI model is best when the measurement information obeys the benchmark distribution.
示例性地,描述上述测量信息的分布与基准分布的差异的指标可以包括以下至少之一:Wasserstein距离;Kullback-Leibler散度;Hellinger距离等。Exemplarily, the index describing the difference between the distribution of the measurement information and the reference distribution may include at least one of the following: Wasserstein distance; Kullback-Leibler divergence; Hellinger distance and the like.
可选地,在本申请实施例中,所述触发方式的触发条件包括以下至少之一:Optionally, in this embodiment of the present application, the trigger condition of the trigger mode includes at least one of the following:
第二设备指示第一设备进行在线学习;The second device instructs the first device to perform online learning;
第一AI模型的输出精度小于或者等于第六阈值。The output accuracy of the first AI model is less than or equal to the sixth threshold.
进一步可选地,在本申请实施例中,上述第二设备指示第一设备进行在线学习,包括以下至少之一:Further optionally, in this embodiment of the present application, the above-mentioned second device instructs the first device to perform online learning, including at least one of the following:
第二设备指示第一设备周期性进行在线学习;The second device instructs the first device to perform online learning periodically;
第二设备指示第一设备半周期性进行在线学习;The second device instructs the first device to conduct online learning semi-periodically;
第二设备指示所述第一设备非周期性进行在线学习。The second device instructs the first device to perform online learning aperiodically.
其中,上述第一设备周期性或半周期性进行在线学习时采用的周期为:上述第二设备预配置的周期,或者,上述第一设备自主配置的周期。Wherein, the cycle adopted by the above-mentioned first device to perform online learning periodically or semi-periodically is: a cycle pre-configured by the above-mentioned second device, or a cycle independently configured by the above-mentioned first device.
进一步可选地,上述第二设备指示第一设备半周期性进行在线学习,包括:Further optionally, the above-mentioned second device instructs the first device to perform online learning half-periodically, including:
上述第二设备通过第一信令指示第一设备半周期性进行在线学习,上述第一信令包括以下至少之一:媒体接入控制-控制单元MAC-CE,下行控制信息DCI。The second device instructs the first device to perform online learning semi-periodically through the first signaling, and the first signaling includes at least one of the following: medium access control-control element MAC-CE, downlink control information DCI.
可选地,在本申请实施例中,上述中止条件包括以下至少之一:Optionally, in this embodiment of the application, the above suspension conditions include at least one of the following:
第一AI模型的在线学习次数大于预设迭代次数;The number of online learning of the first AI model is greater than the preset number of iterations;
第一AI模型达到预设精度;The first AI model reaches the preset accuracy;
第一AI模型的输出结果的误差信息小于第七阈值;The error information of the output result of the first AI model is smaller than the seventh threshold;
第二设备突发性指示所述第一设备结束当前在线学习过程;The second device abruptly instructs the first device to end the current online learning process;
与第一AI模型关联的目标任务中止;the target task associated with the first AI model is aborted;
第一设备的测量信息分布与基准分布的差异信息小于第八阈值。The difference information between the measurement information distribution of the first device and the reference distribution is smaller than the eighth threshold.
示例性地,上述与第一AI模型关联的目标任务可以为第一AI模型当前执行的任务,如对终端进行定位,为用户进行网络推荐等。Exemplarily, the target task associated with the first AI model may be a task currently performed by the first AI model, such as locating a terminal, making network recommendations for a user, and so on.
可选地,在本申请实施例中,上述参数配置信息包括以下至少之一:Optionally, in this embodiment of the application, the above parameter configuration information includes at least one of the following:
第一AI模型的在线学习模式;The online learning mode of the first AI model;
第一AI模型的样本批量的大小;the size of the sample batch of the first AI model;
第一AI模型的优化器的状态;the state of the optimizer of the first AI model;
第一AI模型的第一数据集的划分方式;A division method of the first data set of the first AI model;
第一AI模型的第一数据集的构成信息;Composition information of the first data set of the first AI model;
第一AI模型的第一数据集对第一AI模型更新的贡献权重;The contribution weight of the first data set of the first AI model to the update of the first AI model;
与第一信息关联的AI模型标识;AI model identification associated with the first information;
所述第一AI模型的基准分布;a baseline distribution of the first AI model;
其中,上述第一数据集以下至少之一:上述第一AI模型所使用的原始数据集,上述第一AI模型新采集的数据集;上述基准分布的参数信息包括以下至少之一:方差,均值,标准差;上述样本批量的大小是指一个样本批量(Batch)中包括的样本数量的大小。Wherein, the above-mentioned first data set is at least one of the following: the original data set used by the above-mentioned first AI model, the data set newly collected by the above-mentioned first AI model; the parameter information of the above-mentioned reference distribution includes at least one of the following: variance, mean , standard deviation; the size of the above sample batch refers to the size of the number of samples included in a sample batch (Batch).
可选地,上述原始数据集为:在对第一AI模型进行离线训练使用的数据集(即旧数据集),上述新采集的数据集为:在第一设备侧在线部署第一AI模型,即为第一设备配置第一AI模型之后,在新环境下第一设备采集的数据集(即新数据集)。Optionally, the above-mentioned original data set is: a data set used for offline training of the first AI model (that is, an old data set), and the above-mentioned newly collected data set is: deploying the first AI model online on the first device side, That is, after the first AI model is configured for the first device, the data set (ie, the new data set) collected by the first device in a new environment.
可选地,上述在线学习模式包括以下任意一项:瞬时训练模式(即One-shot模式),连续学习模式。示例性地,在One-shot模式下,第一设备在采集的数据达到指定数量的情况下进行在线学习;在 连续学习模式下,第一设备随着采集的数据的数量的增加不断进行在线学习。Optionally, the above-mentioned online learning mode includes any one of the following: an instantaneous training mode (ie, One-shot mode), and a continuous learning mode. Exemplarily, in the One-shot mode, the first device performs online learning when the collected data reaches a specified amount; In the continuous learning mode, the first device continuously performs online learning as the amount of collected data increases.
示例性地,上述批量(Batch)大小为N,且N为正整数。Exemplarily, the size of the above batch (Batch) is N, and N is a positive integer.
示例性地,上述第一AI模型的优化器的状态可以包括损失函数,学习率等。Exemplarily, the state of the optimizer of the above-mentioned first AI model may include a loss function, a learning rate, and the like.
示例性地,第一数据集划分方式可以包括训练集、验证集和测试集划分比例等。Exemplarily, the first data set division method may include division ratios of the training set, verification set, and test set, and the like.
示例性地,上述数据集的构成信息,包括原始数据集的数量与新采集的数据集的数量之间的比例。需要说明是,本申请实施例中采用训练时的原始数据集可以有效防止在线学习过程中,对新采集的数据的过拟合,从而有效提升AI模型的性能。Exemplarily, the composition information of the above data sets includes the ratio between the number of original data sets and the number of newly collected data sets. It should be noted that the use of the original data set during training in the embodiment of the present application can effectively prevent over-fitting of newly collected data during the online learning process, thereby effectively improving the performance of the AI model.
示例性地,第一数据集对第一AI模型更新的贡献权重可以为:旧数据集和新数据集对上述第一AI模型更新的贡献权重,如在第一AI模型进行在线学习时,可以对原始数据集分配较小的权重,对新数据集分配较大的权重。Exemplarily, the contribution weight of the first data set to the update of the first AI model may be: the contribution weights of the old data set and the new data set to the update of the first AI model. For example, when the first AI model performs online learning, it may be Assign smaller weights to the original dataset and larger weights to the new dataset.
进一步可选地,在本申请实施例中,上述参数配置信息包括第一AI模型的在线学习模式、且在线学习模式为瞬时训练模式,上述参数配置信息还包括以下至少之一:上述第一设备所采集的数据量,上述数据量的采集时间长度。Further optionally, in the embodiment of the present application, the parameter configuration information includes an online learning mode of the first AI model, and the online learning mode is an instantaneous training mode, and the parameter configuration information further includes at least one of the following: the first device The amount of data collected, and the length of time for collecting the above data amount.
进一步可选地,在本申请实施例中,上述参数配置信息包括上述第一AI模型的在线学习模式、且上述在线学习模式为连续学习模式,上述参数配置信息还包括以下至少之一:相邻两次在线学习的时间间隔,相邻两次在线学习的数据量间隔。Further optionally, in the embodiment of the present application, the parameter configuration information includes the online learning mode of the first AI model, and the online learning mode is a continuous learning mode, and the parameter configuration information further includes at least one of the following: The time interval between two online learning, the data volume interval between two adjacent online learning.
示例性地,上述相邻两次在线学习的数据量间隔可以为100,如每采集100组数据进行一次在线训练。Exemplarily, the data volume interval between two adjacent online learning sessions may be 100, for example, online training is performed every time 100 sets of data are collected.
图3示出了本申请实施例提供的一种AI模型的在线学习方法的流程图。如图3所示,本申请实施例提供的AI模型的在线学习方法可以包括如下步骤301和步骤302:Fig. 3 shows a flowchart of an online learning method for an AI model provided by an embodiment of the present application. As shown in Figure 3, the online learning method of the AI model provided by the embodiment of the present application may include the following steps 301 and 302:
步骤301:第一设备获取第一AI模型。Step 301: the first device acquires a first AI model.
在本申请实施例中,上述第一AI模型为在第二设备侧离线训练得到的AI模型。In the embodiment of the present application, the above-mentioned first AI model is an AI model obtained by offline training on the second device side.
可选地,在本申请实施例中,上述第一AI模型的算法可以包括以下至少之一:神经网络、决策树、支持向量机、贝叶斯分类器。Optionally, in the embodiment of the present application, the algorithm of the first AI model may include at least one of the following: a neural network, a decision tree, a support vector machine, and a Bayesian classifier.
可选地,在本申请实施例中,上述第一AI模型可以为用于终端定位、网络优化、大型输入数据集处理、以及为用户进行网络推荐的AI模型。Optionally, in this embodiment of the present application, the above-mentioned first AI model may be an AI model used for terminal positioning, network optimization, processing of large input data sets, and network recommendation for users.
可选地,在本申请实施例中,第二设备可以基于预设的学习框架对AI模型进行训练,以得到上述第一AI模型。示例性地,以第一AI模型为神经网络模型为例。第二设备可以基于预设的学习框架对神经网络模型进行训练,以得到第一神经网络模型。Optionally, in this embodiment of the present application, the second device may train the AI model based on a preset learning framework, so as to obtain the above-mentioned first AI model. Exemplarily, the first AI model is a neural network model as an example. The second device can train the neural network model based on a preset learning framework to obtain the first neural network model.
步骤302:第一设备基于第一AI模型的在线学习信息,对第一AI模型进行在线学习。Step 302: The first device performs online learning on the first AI model based on the online learning information of the first AI model.
在本申请实施例中,第一设备可以基于第一AI模型的在线学习信息,对离线训练得到的第一AI模型进行在线学习,并得到参数调整后的第一AI模型。In this embodiment of the present application, the first device may perform online learning on the first AI model obtained through offline training based on the online learning information of the first AI model, and obtain the first AI model after parameter adjustment.
在本申请实施例中,上述在线学习信息是由所述第一设备确定的信息。In this embodiment of the present application, the online learning information is information determined by the first device.
在本申请实施例提供的AI模型的在线学习方法中,第一设备获取第一AI模型,并基于该第一AI模型的在线学习信息对该第一AI模型进行在线学习。通过该方法,通过在第一设备侧部署第一AI模型,并对该第一模型配置在线学习所需的参数,使得可以在第一设备侧对该第一AI模型进行连续在线调整,从而维持第一AI模型的预测性能,进而保证第一设备的服务质量。In the online learning method of the AI model provided in the embodiment of the present application, the first device acquires the first AI model, and performs online learning on the first AI model based on the online learning information of the first AI model. With this method, by deploying the first AI model on the first device side and configuring the first model with parameters required for online learning, the first AI model can be continuously adjusted online on the first device side, thereby maintaining The predictive performance of the first AI model, thereby ensuring the service quality of the first device.
可选地,在本申请实施例中,上述在线学习信息包括以下至少之一:Optionally, in this embodiment of the application, the online learning information includes at least one of the following:
在线学习的触发方式;How eLearning is triggered;
在线学习的中止条件;Conditions for discontinuation of online learning;
在线学习的参数配置信息;Parameter configuration information for online learning;
在线学习的数据集。Datasets for online learning.
可选地,在本申请实施例中,上述步骤301可以包括如下步骤301a:Optionally, in this embodiment of the application, the above step 301 may include the following step 301a:
步骤301a:第一设备接收第二设备配置的第一AI模型。Step 301a: the first device receives the first AI model configured by the second device.
可选地,在本申请实施例中,上述第二设备包括以下至少一项:核心网设备,接入网设备,以及终端;上述第一设备包括以下至少一项:核心网设备,接入网设备,以及终端。Optionally, in the embodiment of the present application, the above-mentioned second device includes at least one of the following: core network equipment, access network equipment, and terminal; the above-mentioned first device includes at least one of the following: core network equipment, access network equipment equipment, and terminals.
可选地,第二设备可以将离线训练得到的第一AI模型发送至第一设备,第一设备可以接收第二设备发送的该第一AI模型。Optionally, the second device may send the first AI model obtained through offline training to the first device, and the first device may receive the first AI model sent by the second device.
可选地,在本申请实施例中,上述步骤302之前,本申请实施例提供的在线学习方法还包括如下步骤A1:Optionally, in the embodiment of the present application, before the above step 302, the online learning method provided in the embodiment of the present application further includes the following step A1:
步骤A1:第一设备从第二设备获取第一AI模型的在线学习信息。 Step A1: the first device obtains the online learning information of the first AI model from the second device.
可选地,在第二设备可以将第一AI模型在线学习所需的信息发送至第一设备的情况下,第一设备可以接收第二设备发送的在线学习信息,可以基于该在线学习信息对第一AI模型进行在线学习。Optionally, in the case where the second device can send the information required for the online learning of the first AI model to the first device, the first device can receive the online learning information sent by the second device, and based on the online learning information, can The first AI model for online learning.
可选地,上述第一设备可以为终端,第二设备可以为核心网设备。Optionally, the above-mentioned first device may be a terminal, and the second device may be a core network device.
示例性地,核心网设备将第一AI模型发送至终端,以及将第一AI模型的在线学习信息发送至终端,终端接收核心网设备发送的第一AI模型,以及第一AI模型的在线学习信息。Exemplarily, the core network device sends the first AI model to the terminal, and sends the online learning information of the first AI model to the terminal, and the terminal receives the first AI model sent by the core network device, and the online learning information of the first AI model information.
如此,通过第二设备为第一设备配置第一AI模型,以及配置该第一AI模型在线学习所需的参数,使得可以在第一设备侧对该第一AI模型进行连续在线调整,从而维持第一AI模型的预测性能,进而保证第一设备的服务质量。In this way, the second device configures the first AI model for the first device, and configures the parameters required for online learning of the first AI model, so that the first AI model can be continuously adjusted online on the first device side, thereby maintaining The predictive performance of the first AI model, thereby ensuring the service quality of the first device.
可选地,在本申请实施例中,本申请实施例提供的在线学习方法还包括如下步骤303:Optionally, in the embodiment of the present application, the online learning method provided in the embodiment of the present application further includes the following step 303:
步骤303:第一设备为第三设备配置第一AI模型的在线学习信息。Step 303: The first device configures online learning information of the first AI model for the third device.
可选地,第三设备包括以下至少一项:核心网设备,接入网设备,以及终端。Optionally, the third device includes at least one of the following: a core network device, an access network device, and a terminal.
可选地,上述第一设备可以为核心网设备,第三设备可以为终端。Optionally, the above-mentioned first device may be a core network device, and the third device may be a terminal.
示例性地,核心网设备将第一AI模型发送至终端,以及将第一AI模型的在线学习信息发送至终端,终端接收核心网设备发送的第一AI模型,以及第一AI模型的在线学习信息。Exemplarily, the core network device sends the first AI model to the terminal, and sends the online learning information of the first AI model to the terminal, and the terminal receives the first AI model sent by the core network device, and the online learning information of the first AI model information.
可选地,上述第二设备可以自主执行AI模型的在线学习方法,或者,第二设备可以将AI模型部署至第一设备,并为第一设备配置该AI模型在线学习所需的信息,由第一设备执行AI模型的在线学习,或者,第一设备可以将AI模型部署至第三设备,并为第一设备配置该AI模型在线学习所需的信息,由第三设备执行AI模型的在线学习。Optionally, the above-mentioned second device can autonomously execute the online learning method of the AI model, or, the second device can deploy the AI model to the first device, and configure the information required for the online learning of the AI model for the first device, by The first device executes the online learning of the AI model, or the first device can deploy the AI model to the third device, and configure the information required for the online learning of the AI model for the first device, and the third device executes the online learning of the AI model. study.
可选地,在本申请实施例中,上述触发方式对应的触发条件包括以下至少之一:Optionally, in this embodiment of the application, the trigger condition corresponding to the above trigger mode includes at least one of the following:
第一设备的状态信息满足第一预设条件;The state information of the first device satisfies a first preset condition;
第一设备采集到的数据量大于第一阈值;The amount of data collected by the first device is greater than a first threshold;
第一设备的测量信息满足第二预设条件;The measurement information of the first device satisfies a second preset condition;
第一AI模型的输出结果的误差信息大于第二阈值;The error information of the output result of the first AI model is greater than the second threshold;
第一AI模型对应的第一信息的统计信息满足第三预设条件;The statistical information of the first information corresponding to the first AI model satisfies a third preset condition;
第一设备的测量信息的统计信息满足第四预设条件;Statistical information of the measurement information of the first device satisfies a fourth preset condition;
其中,上述状态信息包括以下至少一项:移动速度、波束切换信息、小区切换信息。Wherein, the above state information includes at least one of the following: moving speed, beam switching information, and cell switching information.
可选地,上述第一设备采集到的数据量可以为第一设备在线数据采集数量,如,第一设备实时采集到的信道信息。Optionally, the amount of data collected by the first device may be the amount of online data collected by the first device, for example, channel information collected by the first device in real time.
示例性地,上述信道信息可以包括以下至少之一:信号的发射角角度信息,信道中信号的时延信息,信道中的信号质量等等。Exemplarily, the above channel information may include at least one of the following: signal emission angle information, time delay information of signals in the channel, signal quality in the channel, and so on.
可选地,上述第一阈值可以为3000,5000或者7000等等。Optionally, the above-mentioned first threshold may be 3000, 5000 or 7000 and so on.
可选地,上述测量信息包括以下至少之一:上述第一设备接收的参考信号的第一测量信息,上述第一设备的传感器采集的第二测量信息。示例性地,上述第一测量信息包括以下至少之一:参考信号的瞬时测量信息,参考信号的统计测量信息。示例性地,上述参考信号的瞬时测量信息可以为:在某一特定时刻的参考信号的测量信息;上述参考信号的统计测量信息可以为:在一段时间内的参考信号的测量信息。Optionally, the measurement information includes at least one of the following: first measurement information of a reference signal received by the first device, and second measurement information collected by a sensor of the first device. Exemplarily, the first measurement information includes at least one of the following: instantaneous measurement information of the reference signal, and statistical measurement information of the reference signal. Exemplarily, the instantaneous measurement information of the reference signal may be: measurement information of the reference signal at a specific moment; the statistical measurement information of the reference signal may be: measurement information of the reference signal within a period of time.
示例性地,上述参考信号包括以下至少之一:同步信号块SSB、CSI参考信号CSI-RS、探测参考信号SRS、定位参考信号PRS。可选地,上述传感器可以包括以下至少之一:视觉传感器、雷达传感器、位置传感器等等。Exemplarily, the aforementioned reference signal includes at least one of the following: a synchronization signal block SSB, a CSI reference signal CSI-RS, a sounding reference signal SRS, and a positioning reference signal PRS. Optionally, the aforementioned sensors may include at least one of the following: vision sensors, radar sensors, position sensors and the like.
可选地,上述第一信息包括以下至少之一:上述第一AI模型的输入信息,上述第一AI模型的输出信息。Optionally, the above-mentioned first information includes at least one of the following: input information of the above-mentioned first AI model, and output information of the above-mentioned first AI model.
示例性地,以第一设备为终端为例,在第一信息包括第一AI模型的输入信息的情况下,该第一信息可以为终端的工作信道或者周边信道的信道信息,在第一信息包括第一AI模型的输出信息的情况下,上述第一信息可以为终端的位置信息。Exemplarily, taking the first device as a terminal as an example, in the case where the first information includes the input information of the first AI model, the first information may be the working channel or the channel information of the surrounding channel of the terminal. In the first information In the case where the output information of the first AI model is included, the first information may be location information of the terminal.
示例性地,第一设备可以实时或者周期性获取状态信息,并在状态信息满足第一预设条件的情况下,对第一AI模型进行在线学习。Exemplarily, the first device may obtain status information in real time or periodically, and perform online learning on the first AI model when the status information satisfies a first preset condition.
示例性地,在第一AI模型的计算过程中,第一设备可以实时或者周期性获取第一AI模型的输出结果的误差信息,或者第一AI模型的预测精度,并在误差信息大于第二阈值的情况下,对第一AI模型进行在线学习。For example, during the calculation process of the first AI model, the first device may obtain the error information of the output result of the first AI model in real time or periodically, or the prediction accuracy of the first AI model, and when the error information is greater than the second In the case of the threshold value, online learning is performed on the first AI model.
示例性地,第一设备可以统计第一AI模型的输入信息和输出信息,并在输入信息或者输出信息的统计信息满足第三预设条件的情况下,对第一AI模型进行在线学习。 Exemplarily, the first device may collect statistics on input information and output information of the first AI model, and perform online learning on the first AI model when the statistical information of the input information or output information satisfies a third preset condition.
示例性地,第一设备可以检测参考信号或者传感器采集的测量信息,并在测量信息满足第二预设条件,和/或,测量信息的统计信息满足第四预设条件的情况下,对第一AI模型进行在线学习。Exemplarily, the first device may detect the reference signal or the measurement information collected by the sensor, and when the measurement information satisfies the second preset condition, and/or the statistical information of the measurement information satisfies the fourth preset condition, the first device An AI model for online learning.
可选地,在本申请实施例中,上述第一信息的统计信息包括以下至少之一:第一时间窗内第一信息的第一统计量,至少两个连续的第二时间窗内第一信息对应的第二统计量,第一小区下的至少两个终端在第一时刻的第一信息的统计信息,第一信息的相关性信息。Optionally, in this embodiment of the present application, the statistical information of the above-mentioned first information includes at least one of the following: the first statistical quantity of the first information in the first time window, the first statistical quantity of the first information in at least two consecutive second time windows The second statistical quantity corresponding to the information is the statistical information of the first information of at least two terminals under the first cell at the first moment, and the correlation information of the first information.
其中,上述第二统计量是基于各个第二时间窗内的统计量计算出的。Wherein, the above-mentioned second statistic is calculated based on the statistic in each second time window.
可选地,上述统计信息可以包括以下至少之一;均值,方差等等。Optionally, the above statistical information may include at least one of the following: mean value, variance and so on.
可选地,上述统计信息可以包括时间上的统计信息和空间上的统计信息。例如,时间上的统计信息可以为:同一终端在一段连续时间内的信道的统计信息,空间上的统计信息可以为一个小区下多个不同终端的信道的统计信息。Optionally, the foregoing statistical information may include temporal statistical information and spatial statistical information. For example, the statistical information on time may be: statistical information on channels of the same terminal within a continuous period of time, and the statistical information on space may be statistical information on channels of multiple different terminals under one cell.
以下以第一信息为信道信息对上述第一信息的统计信息进行解释说明。The statistical information of the above-mentioned first information is explained below by using the first information as channel information.
示例性地,在第一信息的统计信息包括第一时间窗内上述第一信息的第一统计量的情况下,上述第一信息的统计信息可以为:同一终端在某一连续时间窗内的信道信息的均值或者方差。Exemplarily, when the statistical information of the first information includes the first statistical quantity of the first information in the first time window, the statistical information of the first information may be: The mean or variance of the channel information.
示例性地,在第一信息的统计信息包括至少两个连续的第二时间窗内上述第一信息对应的第二统计量的情况下,上述第一信息的统计信息可以为:终端在多个连续时间窗内的信道信息的均值的均值,如,时间窗1内信道信息的均值为a,时间窗2内信道信息的均值为b,时间窗3内信道信息的均值为c,则信道信息的统计信息为a,b和c的均值。Exemplarily, when the statistical information of the first information includes the second statistical quantity corresponding to the above-mentioned first information in at least two consecutive second time windows, the above-mentioned statistical information of the first information may be: The mean value of the mean value of channel information in continuous time windows, for example, the mean value of channel information in time window 1 is a, the mean value of channel information in time window 2 is b, and the mean value of channel information in time window 3 is c, then the channel information The statistic for is the mean of a, b, and c.
示例性地,在第一信息的统计信息包括第一小区下的至少两个终端在第一时刻的第一信息的统计信息的情况下,上述第一信息的统计信息可以为:一个小区下多个不同终端在某一时刻的信道信息的均值,如,同一小区下的终端A,终端B和终端C在时刻1的信道信息的均值分别为d,e和f,则信道信息的统计信息为d,e和f的均值。Exemplarily, when the statistical information of the first information includes the statistical information of the first information of at least two terminals under the first cell at the first moment, the statistical information of the above-mentioned first information may be: The mean value of the channel information of different terminals at a certain moment, for example, the mean values of the channel information of terminal A, terminal B and terminal C in the same cell at time 1 are d, e and f respectively, then the statistical information of the channel information is Means of d, e and f.
示例性地,在第一信息的统计信息包括第一信息的相关性信息的情况下,上述第一信息的统计信息可以为:两个时间窗的信道信息的相关性指标低于某一阈值,如,当前时间窗内的前后数据之间的距离小于某一阈值。Exemplarily, when the statistical information of the first information includes the correlation information of the first information, the statistical information of the first information may be: the correlation index of the channel information of the two time windows is lower than a certain threshold, For example, the distance between the previous and subsequent data in the current time window is smaller than a certain threshold.
可选地,在本申请实施例中,上述第一设备的测量信息的统计信息包括以下至少之一:第三时间窗内针对上述测量信息的第三统计量,上述测量信息在至少两个连续的第四时间窗对应的第四统计量,上述测量信息的相关性信息。Optionally, in this embodiment of the present application, the statistical information of the measurement information of the first device includes at least one of the following: a third statistic for the measurement information within a third time window, and the measurement information is in at least two consecutive The fourth statistic corresponding to the fourth time window is the correlation information of the above measurement information.
其中,上述第四统计量是基于各个第四时间窗内的统计量计算出的。Wherein, the above fourth statistic is calculated based on the statistic in each fourth time window.
可选地,上述相关性信息包括以下至少之一:数据之间的距离,协方差,相关系数。Optionally, the above correlation information includes at least one of the following: distance between data, covariance, and correlation coefficient.
示例性地,以测量信息为第一设备接收的参考信号的信道测量信息为例。上述测量信息的统计信息可以为:信道测量信息在某一连续时间窗内的方差,或者,两个连续的时间窗内的信道测量信息的均值的均值,或者为当前时间窗内的信道测量信息的数据之间的距离。Exemplarily, the measurement information is channel measurement information of a reference signal received by the first device as an example. The statistical information of the above measurement information may be: the variance of the channel measurement information in a certain continuous time window, or the average of the mean values of the channel measurement information in two consecutive time windows, or the channel measurement information in the current time window distance between the data.
进一步可选地,在本申请实施例中,上述触发条件包括:上述第一设备的状态信息满足第一预设条件;上述满足第一预设条件包括以下至少之一:Further optionally, in the embodiment of the present application, the trigger condition includes: the state information of the first device meets a first preset condition; the satisfaction of the first preset condition includes at least one of the following:
上述第一设备的移动速度大于第三阈值;The moving speed of the first device is greater than a third threshold;
上述波束切换信息指示第一设备发生波束切换、且波束切换频率大于第四阈值;The beam switching information indicates that beam switching occurs to the first device, and the beam switching frequency is greater than a fourth threshold;
上述小区切换信息指示第一设备发生小区切换。The above cell switching information indicates that the cell switching occurs to the first device.
示例性地,上述第三阈值可以为60km/h,80km/h,100km/等等。Exemplarily, the above-mentioned third threshold may be 60km/h, 80km/h, 100km/h and so on.
进一步可选地,在本申请实施例中,上述触发条件包括:上述第一设备的测量信息满足第二预设条件;上述满足第二预设条件包括:第一设备的测量信息指示第一设备的相关信道的信道环境发生变化。Further optionally, in the embodiment of the present application, the trigger condition includes: the measurement information of the first device meets a second preset condition; the meeting the second preset condition includes: the measurement information of the first device indicates that the first device The channel environment of the relevant channel changes.
示例性地,以测量信息为第一设备接收的参考信号为例。上述第二预设条件可以为:第一设备根据CSI-RS的测量估计下行信道,检测到信道环境发生变化,如由视距(line of sight,LOS)环境变成非视距(not line of sight,NLOS)环境,如信噪比SINR低于一定阈值等等。Exemplarily, it is taken that the measurement information is a reference signal received by the first device as an example. The above-mentioned second preset condition may be: the first device estimates the downlink channel according to the measurement of CSI-RS, and detects that the channel environment changes, such as changing from a line of sight (LOS) environment to a non-line of sight (not line of sight) environment. sight, NLOS) environment, such as the signal-to-noise ratio SINR is lower than a certain threshold and so on.
示例性地,以测量信息为第一设备的传感器采集的测量信息为例。上述第二预设条件可以为:通过视觉传感器得到测量信息指示第一设备处于LOS环境下。Exemplarily, it is taken that the measurement information is the measurement information collected by the sensor of the first device as an example. The above-mentioned second preset condition may be: the measurement information obtained by the visual sensor indicates that the first device is in an LOS environment.
其中,上述测量信息包括以下至少之一:第一设备接收的参考信号的第一测量信息,第一设备的传感器采集的第二测量信息;可选地,上述第一测量信息包括以下至少之一:参考信号的瞬时测量信息,参考信号的统计测量信息;Wherein, the above-mentioned measurement information includes at least one of the following: first measurement information of the reference signal received by the first device, and second measurement information collected by the sensor of the first device; optionally, the above-mentioned first measurement information includes at least one of the following : Instantaneous measurement information of the reference signal, statistical measurement information of the reference signal;
其中,上述参考信号包括以下至少之一:同步信号块SSB、CSI参考信号CSI-RS、探测参考信号SRS、定位参考信号PRS。Wherein, the above-mentioned reference signal includes at least one of the following: a synchronization signal block SSB, a CSI reference signal CSI-RS, a sounding reference signal SRS, and a positioning reference signal PRS.
进一步可选地,在本申请实施例中,上述触发条件包括:上述第一AI模型对应的第二信息的统计 信息满足第三预设条件;上述满足第三预设条件包括以下至少之一:Further optionally, in this embodiment of the present application, the above-mentioned trigger conditions include: statistics of the second information corresponding to the above-mentioned first AI model The information satisfies the third preset condition; the above-mentioned meeting of the third preset condition includes at least one of the following:
上述第一统计量大于第一阈值区间的最大值;The above-mentioned first statistic is greater than the maximum value of the first threshold interval;
上述第二统计量大于第二阈值区间的最大值;The above-mentioned second statistic is greater than the maximum value of the second threshold interval;
在至少两个时间窗内采集的第一信息的相关性信息满足第一条件;The correlation information of the first information collected in at least two time windows satisfies the first condition;
在当前时间窗内采集的不同第一信息之间的相关性信息满足第二条件。Correlation information between different first pieces of information collected within the current time window satisfies the second condition.
示例性地,以第一统计量为终端在某一连续时间窗内的信道信息的均值为例。上述第三预设条件可以为:第一设备检测到某一连续时间窗内的信道信息的统计量超过某一阈值区间的最大值。Exemplarily, it is assumed that the first statistic is an average value of channel information of the terminal within a certain continuous time window. The foregoing third preset condition may be: the first device detects that the statistics of channel information in a certain continuous time window exceed the maximum value of a certain threshold interval.
示例性地,以第二统计量为终端在多个连续时间窗内的信道信息的均值的均值为例。上述第三预设条件可以为:第一设备检测到多个连续时间窗内信道信息的均值的均值超过某一阈值区间的最大值。Exemplarily, the second statistic is an average value of average values of channel information of the terminal in multiple consecutive time windows as an example. The foregoing third preset condition may be: the first device detects that the average value of the average values of the channel information in multiple consecutive time windows exceeds the maximum value of a certain threshold interval.
示例性地,以第一信息为信道信息为例。上述第三预设条件可以为:两个时间窗内的信道信息的相关性指标低于某一阈值,或者,当前时间窗内的前后数据相关性低于某一阈值。Exemplarily, the first information is channel information as an example. The above-mentioned third preset condition may be: the correlation index of the channel information in the two time windows is lower than a certain threshold, or the correlation of the previous and subsequent data in the current time window is lower than a certain threshold.
进一步可选地,在本申请实施例中,上述触发条件包括:上述第一设备的测量信息的统计信息满足第四预设条件;上述满足第四预设条件包括以下至少之一:Further optionally, in the embodiment of the present application, the trigger condition includes: the statistical information of the measurement information of the first device satisfies a fourth preset condition; the satisfaction of the fourth preset condition includes at least one of the following:
上述第三统计量大于第三阈值区间的最大值;The above third statistic is greater than the maximum value of the third threshold interval;
上述第四统计量大于第四阈值区间的最大值;The above fourth statistic is greater than the maximum value of the fourth threshold interval;
在至少两个时间窗内采集的测量信息的相关性信息满足第三条件;The correlation information of the measurement information collected in at least two time windows satisfies the third condition;
在当前时间窗内采集的不同所述测量信息之间的相关性信息满足第四条件;Correlation information among different measurement information collected in the current time window satisfies the fourth condition;
测量信息的分布与基准分布的差异大于第五阈值,上述基准分布为第二设备为第一设备配置的信息。The difference between the distribution of the measurement information and the reference distribution is greater than the fifth threshold, and the above reference distribution is information configured by the second device for the first device.
示例性地,在第三统计量为信道测量信息在某一连续时间窗内的均值的情况下,上述第四预设条件可以为:信道测量信息在某一连续时间窗内的均值超过某一阈值区间的最大值。Exemplarily, in the case where the third statistic is the mean value of the channel measurement information in a certain continuous time window, the above fourth preset condition may be: the mean value of the channel measurement information in a certain continuous time window exceeds a certain The maximum value of the threshold interval.
示例性地,在第四统计量为基于各个第四时间窗内信道测量信息的均值计算得到的均值的情况下,上述第四预设条件可以为:基于各个第四时间窗内信道测量信息的均值计算得到的均值超过某一阈值区间的最大值。Exemplarily, in the case where the fourth statistic is an average value calculated based on the average value of the channel measurement information in each fourth time window, the above fourth preset condition may be: based on the average value of the channel measurement information in each fourth time window The mean calculated by the mean exceeds the maximum value of a certain threshold interval.
示例性地,上述第三条件可以为:在至少两个时间窗内分别采集的测量信息的数据的协方差小于某一阈值。Exemplarily, the above third condition may be: the covariance of the data of the measurement information respectively collected in at least two time windows is smaller than a certain threshold.
示例性地,上述第四条件可以为:在当前时间窗内采集的测量信息的不同数据之间的距离小于某一阈值。Exemplarily, the above fourth condition may be: the distance between different data of the measurement information collected within the current time window is smaller than a certain threshold.
示例性地,上述测量信息的分布为测量信息的统计分布。示例性地,上述基准分布为第一AI模型的统计分布。可以理解的是,第一AI模型离线训练时的训练集服从基准分布,第一AI模型在测量信息服从基准分布的时候性能最好。Exemplarily, the above-mentioned distribution of the measurement information is a statistical distribution of the measurement information. Exemplarily, the above reference distribution is the statistical distribution of the first AI model. It can be understood that the training set for offline training of the first AI model obeys the benchmark distribution, and the performance of the first AI model is best when the measurement information obeys the benchmark distribution.
示例性地,描述上述测量信息的分布与基准分布的差异的指标可以包括以下至少之一:Wasserstein距离;Kullback-Leibler散度;Hellinger距离等。Exemplarily, the index describing the difference between the distribution of the measurement information and the reference distribution may include at least one of the following: Wasserstein distance; Kullback-Leibler divergence; Hellinger distance and the like.
可选地,在本申请实施例中,上述触发方式的触发条件包括以下至少之一:Optionally, in this embodiment of the present application, the trigger condition of the above trigger method includes at least one of the following:
第二设备指示第一设备进行在线学习;The second device instructs the first device to perform online learning;
第一AI模型的输出精度小于或者等于第六阈值。The output accuracy of the first AI model is less than or equal to the sixth threshold.
进一步可选地,在本申请实施例中,上述第二设备指示第一设备进行在线学习,包括以下至少之一:Further optionally, in this embodiment of the present application, the above-mentioned second device instructs the first device to perform online learning, including at least one of the following:
第二设备指示第一设备周期性进行在线学习;The second device instructs the first device to perform online learning periodically;
第二设备指示第一设备半周期性进行在线学习;The second device instructs the first device to conduct online learning semi-periodically;
第二设备指示第一设备非周期性进行在线学习。The second device instructs the first device to perform online learning aperiodically.
其中,上述第一设备周期性或半周期性进行在线学习时采用的周期为:第二设备预配置的周期,或者,第一设备自主配置的周期。Wherein, the cycle adopted by the above-mentioned first device to perform online learning periodically or semi-periodically is: a cycle preconfigured by the second device, or a cycle independently configured by the first device.
进一步可选地,第二设备指示第一设备半周期性进行在线学习,包括:Further optionally, the second device instructs the first device to perform online learning semi-periodically, including:
第二设备通过第一信令指示第一设备半周期性进行在线学习,上述第一信令包括以下至少之一:媒体接入控制-控制单元MAC-CE,下行控制信息DCI。The second device instructs the first device to perform online learning semi-periodically through the first signaling, and the first signaling includes at least one of the following: medium access control-control element MAC-CE, and downlink control information DCI.
可选地,在本申请实施例中,上述中止条件包括以下至少之一:Optionally, in this embodiment of the application, the above suspension conditions include at least one of the following:
第一AI模型的在线学习次数大于预设迭代次数;The number of online learning of the first AI model is greater than the preset number of iterations;
第一AI模型达到预设精度;The first AI model reaches the preset accuracy;
第一AI模型的输出结果的误差信息小于第七阈值;The error information of the output result of the first AI model is smaller than the seventh threshold;
第二设备突发性指示第一设备结束当前在线学习过程;The second device abruptly instructs the first device to end the current online learning process;
与第一AI模型关联的目标任务中止; the target task associated with the first AI model is aborted;
第一设备的测量信息分布与基准分布的差异信息小于第八阈值。The difference information between the measurement information distribution of the first device and the reference distribution is smaller than the eighth threshold.
示例性地,在对第一AI模型进行在线学习的过程中,第一设备可以实时或者周期性检测是否满足中止条件,并在满足中止条件的情况下,中止第一AI模型的在线学习。Exemplarily, during the process of online learning of the first AI model, the first device may detect in real time or periodically whether the suspension condition is met, and if the suspension condition is satisfied, suspend the online learning of the first AI model.
示例性地,上述与第一AI模型关联的目标任务可以为第一AI模型当前执行的任务,如对终端进行定位,为用户进行网络推荐等等。Exemplarily, the aforementioned target task associated with the first AI model may be a task currently performed by the first AI model, such as locating a terminal, making network recommendations for a user, and so on.
在一种示例中,第一设备可以在检测到第一AI模型的迭代次数大于预设迭代次数(如10000次)的情况下,中止第一AI模型的在线学习。在另一种示例中,第二设备可以在检测到第一AI模型的精度值达到预设精度的情况下,中止第一AI模型的在线学习。在又一种示例中,第二设备可以在检测到第一AI模型的输出结果的误差小于预设误差值的情况下,中止第一AI模型的在线学习。如此,第一设备可以基于上述AI模型的在线学习次数、达到的精度、输出结果的误差、以及与基准分布的差异满足条件的情况下,中止AI模型的在线学习,从而在提升模型预测精度的情况下,节省功耗。In an example, the first device may stop online learning of the first AI model when it detects that the number of iterations of the first AI model is greater than a preset number of iterations (eg, 10,000). In another example, the second device may stop online learning of the first AI model when detecting that the accuracy value of the first AI model reaches a preset accuracy. In yet another example, the second device may stop the online learning of the first AI model when it detects that the error of the output result of the first AI model is smaller than a preset error value. In this way, the first device can stop the online learning of the AI model based on the number of times of online learning of the above-mentioned AI model, the achieved accuracy, the error of the output result, and the difference from the reference distribution, so as to improve the prediction accuracy of the model. case, saving power consumption.
需要说明的是,中止第一AI模型的在线学习可以为暂时停止第一AI模型的在线学习,或者结束第一AI模型的在线学习。It should be noted that suspending the online learning of the first AI model may be temporarily stopping the online learning of the first AI model, or ending the online learning of the first AI model.
可选地,在本申请实施例中,上述参数配置信息包括以下至少之一:Optionally, in this embodiment of the application, the above parameter configuration information includes at least one of the following:
第一AI模型的在线学习模式;The online learning mode of the first AI model;
第一AI模型的样本批量的大小;the size of the sample batch of the first AI model;
第一AI模型的优化器的状态;the state of the optimizer of the first AI model;
第一AI模型的第一数据集的划分方式;A division method of the first data set of the first AI model;
第一AI模型的第一数据集的构成信息;Composition information of the first data set of the first AI model;
第一AI模型的第一数据集对所述AI模型更新的贡献权重;The contribution weight of the first data set of the first AI model to the update of the AI model;
与在线学习信息关联的AI模型标识;AI model identification associated with online learning information;
第一AI模型的基准分布;The baseline distribution of the first AI model;
其中,上述第一数据集以下至少之一:第一AI模型所使用的原始数据集,第一AI模型新采集的数据集。上述基准分布的参数信息包括以下至少之一:方差,均值,标准差。Wherein, the above-mentioned first data set is at least one of the following: the original data set used by the first AI model, and the newly collected data set by the first AI model. The above parameter information of the benchmark distribution includes at least one of the following: variance, mean, and standard deviation.
可选地,上述原始数据集为:在对第一AI模型进行离线训练使用的数据集(即旧数据集),上述新采集的数据集为:在第一设备侧在线部署,即为第一设备配置第一AI模型之后,在新环境下第一设备采集的数据集(即新数据集)。Optionally, the above-mentioned original data set is: a data set used for offline training of the first AI model (that is, an old data set), and the above-mentioned newly collected data set is: deployed online on the first device side, that is, the first AI model. After the device configures the first AI model, the data set (ie, the new data set) collected by the first device in a new environment.
可选地,上述在线学习模式包括以下任意一项:瞬时训练模式(即One-shot模式),连续学习模式。示例性地,在One-shot模式下,第一设备在采集的数据达到指定数量的情况下进行在线学习;在连续学习模式下,第一设备随着采集的数据的数量的增加不断进行在线学习。Optionally, the above-mentioned online learning mode includes any one of the following: an instantaneous training mode (ie, One-shot mode), and a continuous learning mode. Exemplarily, in the One-shot mode, the first device performs online learning when the collected data reaches a specified amount; in the continuous learning mode, the first device continuously performs online learning as the amount of collected data increases .
示例性地,上述批量(Batch)大小为N,且N为正整数。Exemplarily, the size of the above batch (Batch) is N, and N is a positive integer.
示例性地,上述第一AI模型的优化器的状态可以包括损失函数,学习率等。Exemplarily, the state of the optimizer of the above-mentioned first AI model may include a loss function, a learning rate, and the like.
示例性地,第一数据集划分方式可以包括训练集、验证集和测试集划分比例等。Exemplarily, the first data set division method may include division ratios of the training set, verification set, and test set, and the like.
示例性地,上述数据集的构成信息,包括原始数据集的数量与新采集的数据集的数量之间的比例。需要说明是,本申请实施例中采用训练时的原始数据集可以有效防止在线学习过程中,对新采集的数据的过拟合,从而有效提升AI模型的性能。Exemplarily, the composition information of the above data sets includes the ratio between the number of original data sets and the number of newly collected data sets. It should be noted that the use of the original data set during training in the embodiment of the present application can effectively prevent over-fitting of newly collected data during the online learning process, thereby effectively improving the performance of the AI model.
示例性地,第一数据集对第一AI模型更新的贡献权重可以为:旧数据集和新数据集对上述第一AI模型更新的贡献权重,如在第一AI模型进行在线学习时,可以对原始数据集分配较小的权重,对新数据集分配较大的权重。Exemplarily, the contribution weight of the first data set to the update of the first AI model may be: the contribution weights of the old data set and the new data set to the update of the first AI model. For example, when the first AI model performs online learning, it may be Assign smaller weights to the original dataset and larger weights to the new dataset.
示例性地,第一设备可以获取第一AI模型的参数配置信息,并基于该参数配置信息对该第一AI模型进行在线学习,从而保证该AI模型在变化的环境中的预测精度。Exemplarily, the first device may obtain parameter configuration information of the first AI model, and perform online learning on the first AI model based on the parameter configuration information, so as to ensure the prediction accuracy of the AI model in a changing environment.
进一步可选地,在本申请实施例中,上述参数配置信息包括第一AI模型的在线学习模式、且在线学习模式为瞬时训练模式,上述参数配置信息还包括以下至少之一:第一设备所采集的数据量,数据量的采集时间长度。Further optionally, in the embodiment of the present application, the parameter configuration information includes an online learning mode of the first AI model, and the online learning mode is an instantaneous training mode, and the parameter configuration information further includes at least one of the following: The amount of data collected and the length of time for collecting data.
进一步可选地,在本申请实施例中,上述参数配置信息包括第一AI模型的在线学习模式、且在线学习模式为连续学习模式,上述参数配置信息还包括以下至少之一:相邻两次在线学习的时间间隔,相邻两次在线学习的数据量间隔。Further optionally, in the embodiment of the present application, the above-mentioned parameter configuration information includes the online learning mode of the first AI model, and the online learning mode is a continuous learning mode, and the above-mentioned parameter configuration information also includes at least one of the following: The time interval of online learning, the data volume interval between two adjacent online learning.
示例性地,上述相邻两次在线学习的数据量间隔可以为100,如每采集100组数据进行一次在线训练。Exemplarily, the data volume interval between two adjacent online learning sessions may be 100, for example, online training is performed every time 100 sets of data are collected.
本申请实施例提供的AI模型的在线学习方法,执行主体可以为AI模型的在线学习装置。本申请实施例中以AI模型的在线装置执行AI模型的在线方法为例,说明本申请实施例提供的AI模型的在线 学习装置。The online learning method of the AI model provided in the embodiment of the present application may be executed by an online learning device of the AI model. In the embodiment of this application, the online method of executing the AI model by the online device of the AI model is taken as an example to illustrate the online method of the AI model provided by the embodiment of the application. learning device.
本申请实施例提供一种AI模型的在线学习装置400,如图4所示,该AI模型的在线学习装置400包括:配置模块401,其中:所述配置模块401,用于第二设备为第一设备部署第一AI模型;所述配置401,还用于所述第二设备为所述第一设备配置所述第一AI模型的在线学习信息。An embodiment of the present application provides an online AI model learning device 400. As shown in FIG. A device deploys a first AI model; the configuration 401 is further used for the second device to configure online learning information of the first AI model for the first device.
可选地,在本申请实施例中,所述在线学习信息包括以下至少之一:Optionally, in this embodiment of the application, the online learning information includes at least one of the following:
在线学习的触发方式;How eLearning is triggered;
在线学习的中止条件;Conditions for discontinuation of online learning;
在线学习的参数配置信息;Parameter configuration information for online learning;
在线学习的数据集。Datasets for online learning.
可选地,在本申请实施例中,Optionally, in the embodiment of this application,
所述触发方式对应的触发条件包括以下至少之一:The trigger condition corresponding to the trigger mode includes at least one of the following:
所述第一设备的状态信息满足第一预设条件;The state information of the first device satisfies a first preset condition;
所述第一设备采集到的数据量大于第一阈值;The amount of data collected by the first device is greater than a first threshold;
所述第一设备的测量信息满足第二预设条件;The measurement information of the first device satisfies a second preset condition;
所述第一AI模型的输出结果的误差信息大于第二阈值;The error information of the output result of the first AI model is greater than a second threshold;
所述第一AI模型对应的第一信息的统计信息满足第三预设条件;The statistical information of the first information corresponding to the first AI model satisfies a third preset condition;
所述第一设备的测量信息的统计信息满足第四预设条件;Statistical information of the measurement information of the first device satisfies a fourth preset condition;
其中,所述状态信息包括以下至少一项:移动速度、波束切换信息、小区切换信息;Wherein, the state information includes at least one of the following: moving speed, beam switching information, cell switching information;
所述第一信息包括以下至少之一:所述第一AI模型的输入信息,所述第一AI模型的输出信息。The first information includes at least one of the following: input information of the first AI model, and output information of the first AI model.
可选地,在本申请实施例中,所述第一信息的统计信息包括以下至少之一:第一时间窗内所述第一信息的第一统计量,至少两个连续的第二时间窗内所述第一信息对应的第二统计量,第一小区下的至少两个终端在第一时刻的第一信息的统计信息,所述第一信息的相关性信息;Optionally, in this embodiment of the present application, the statistical information of the first information includes at least one of the following: a first statistical quantity of the first information within a first time window, at least two consecutive second time windows A second statistic corresponding to the first information, statistical information of the first information of at least two terminals under the first cell at the first moment, and correlation information of the first information;
所述第二统计量是基于各个所述第二时间窗内的统计量计算出的;The second statistics are calculated based on statistics in each of the second time windows;
所述第一设备的测量信息的统计信息包括以下至少之一:在第三时间窗内针对上述测量信息的第三统计量,所述测量信息在至少两个连续的第四时间窗对应的第四统计量,所述测量信息的相关性信息;The statistical information of the measurement information of the first device includes at least one of the following: a third statistic for the above measurement information within a third time window, where the measurement information corresponds to at least two consecutive fourth time windows. Four statistics, correlation information of the measurement information;
所述第四统计量是基于各个所述第四时间窗内的统计量计算出的;The fourth statistic is calculated based on the statistic in each of the fourth time windows;
所述相关性信息包括以下至少之一:数据之间的距离,协方差,相关系数。The correlation information includes at least one of the following: distance between data, covariance, and correlation coefficient.
可选地,在本申请实施例中,所述触发条件包括:所述第一设备的状态信息满足第一预设条件;Optionally, in this embodiment of the present application, the trigger condition includes: the status information of the first device meets a first preset condition;
所述满足第一预设条件包括以下至少之一:Said meeting the first preset condition includes at least one of the following:
所述第一设备的移动速度大于第三阈值;The moving speed of the first device is greater than a third threshold;
所述波束切换信息指示所述第一设备发生波束切换、且波束切换频率大于第四阈值;The beam switching information indicates that beam switching occurs for the first device, and the beam switching frequency is greater than a fourth threshold;
所述小区切换信息指示所述第一设备发生小区切换。The cell switching information indicates that a cell switching occurs to the first device.
可选地,在本申请实施例中,所述触发条件包括:所述第一设备接收的参考信号的测量信息满足第二预设条件;Optionally, in this embodiment of the present application, the trigger condition includes: the measurement information of the reference signal received by the first device satisfies a second preset condition;
所述满足第二预设条件包括:所述参考信号的测量信息指示所述第一设备的相关信道的信道环境发生变化;The meeting the second preset condition includes: the measurement information of the reference signal indicates that the channel environment of the relevant channel of the first device changes;
其中,所述测量信息包括以下至少之一:所述第一设备接收的参考信号的第一测量信息,所述第一设备的传感器采集的第二测量信息;Wherein, the measurement information includes at least one of the following: first measurement information of a reference signal received by the first device, and second measurement information collected by a sensor of the first device;
所述第一测量信息包括以下至少之一:所述参考信号的瞬时测量信息,所述参考信号的统计测量信息;The first measurement information includes at least one of the following: instantaneous measurement information of the reference signal, statistical measurement information of the reference signal;
所述参考信号包括以下至少之一:同步信号块SSB、CSI参考信号CSI-RS、探测参考信号SRS、定位参考信号PRS。The reference signal includes at least one of the following: a synchronization signal block SSB, a CSI reference signal CSI-RS, a sounding reference signal SRS, and a positioning reference signal PRS.
可选地,在本申请实施例中,所述触发条件包括:所述第一AI模型对应的第一信息的统计信息满足第三预设条件;Optionally, in the embodiment of the present application, the trigger condition includes: the statistical information of the first information corresponding to the first AI model satisfies a third preset condition;
所述满足第三预设条件包括以下至少之一:Said meeting the third preset condition includes at least one of the following:
所述第一统计量大于第一阈值区间的最大值;The first statistic is greater than the maximum value of the first threshold interval;
所述第二统计量大于第二阈值区间的最大值;The second statistic is greater than the maximum value of the second threshold interval;
在至少两个时间窗内采集的所述第一信息的相关性信息满足第一条件;The correlation information of the first information collected in at least two time windows satisfies a first condition;
在当前时间窗内采集的不同所述第一信息之间的相关性信息满足第二条件。Correlation information between different first pieces of information collected within the current time window satisfies the second condition.
可选地,在本申请实施例中,所述触发条件包括:所述第一设备的测量信息的统计信息满足第四预设条件; Optionally, in this embodiment of the present application, the trigger condition includes: statistical information of the measurement information of the first device meets a fourth preset condition;
所述满足第四预设条件包括以下至少之一:Said meeting the fourth preset condition includes at least one of the following:
所述第三统计量大于第三阈值区间的最大值;The third statistic is greater than the maximum value of the third threshold interval;
所述第四统计量大于第四阈值区间的最大值;The fourth statistic is greater than the maximum value of the fourth threshold interval;
在至少两个时间窗内采集的所述测量信息的相关性信息满足第三条件;The correlation information of the measurement information collected in at least two time windows satisfies the third condition;
在当前时间窗内采集的不同所述测量信息之间的相关性信息满足第四条件;Correlation information among different measurement information collected in the current time window satisfies the fourth condition;
所述测量信息的分布与基准分布的差异大于第五阈值,所述基准分布为所述第二设备为所述第一设备配置的信息。A difference between the distribution of the measurement information and a reference distribution is greater than a fifth threshold, where the reference distribution is information configured by the second device for the first device.
可选地,在本申请实施例中,Optionally, in the embodiment of this application,
所述触发方式的触发条件包括以下至少之一:The trigger condition of the trigger mode includes at least one of the following:
所述第二设备指示所述第一设备进行在线学习;The second device instructs the first device to perform online learning;
所述第一AI模型的输出精度小于或者等于第六阈值。The output accuracy of the first AI model is less than or equal to the sixth threshold.
可选地,在本申请实施例中,所述第二设备指示所述第一设备进行在线学习,包括以下至少之一:Optionally, in this embodiment of the present application, the second device instructs the first device to perform online learning, including at least one of the following:
所述第二设备指示所述第一设备周期性进行在线学习;The second device instructs the first device to periodically perform online learning;
所述第二设备指示所述第一设备半周期性进行在线学习;The second device instructs the first device to perform online learning semi-periodically;
所述第二设备指示所述第一设备非周期性进行在线学习。The second device instructs the first device to perform online learning aperiodically.
其中,所述第一设备周期性或半周期性进行在线学习时采用的周期为:所述第二设备预配置的周期,或者,所述第一设备自主配置的周期。Wherein, the period adopted by the first device to perform online learning periodically or semi-periodically is: a period preconfigured by the second device, or a period independently configured by the first device.
可选地,在本申请实施例中,所述第二设备指示所述第一设备半周期性进行在线学习,包括:Optionally, in this embodiment of the present application, the second device instructs the first device to perform online learning semi-periodically, including:
所述第二设备通过第一信令指示所述第一设备半周期性进行在线学习,所述第一信令包括以下至少之一:媒体接入控制-控制单元MAC-CE,下行控制信息DCI。The second device instructs the first device to perform online learning semi-periodically through a first signaling, and the first signaling includes at least one of the following: medium access control-control element MAC-CE, downlink control information DCI .
可选地,在本申请实施例中,所述中止条件包括以下至少之一:Optionally, in this embodiment of the application, the termination condition includes at least one of the following:
所述第一AI模型的在线学习次数大于预设迭代次数;The number of online learning of the first AI model is greater than the preset number of iterations;
所述第一AI模型达到预设精度;The first AI model reaches a preset accuracy;
所述第一AI模型的输出结果的误差信息小于第七阈值;The error information of the output result of the first AI model is less than the seventh threshold;
所述第二设备突发性指示所述第一设备结束当前在线学习过程;The second device abruptly instructs the first device to end the current online learning process;
与所述第一AI模型关联的目标任务中止;a target task associated with the first AI model is aborted;
所述第一设备的测量信息分布与基准分布的差异信息小于第八阈值。The difference information between the measurement information distribution of the first device and the reference distribution is smaller than an eighth threshold.
可选地,在本申请实施例中,所述参数配置信息包括以下至少之一:Optionally, in this embodiment of the present application, the parameter configuration information includes at least one of the following:
所述AI模型的在线学习模式;The online learning mode of the AI model;
所述AI模型的样本批量的大小;the size of the sample batch of the AI model;
所述AI模型的优化器的状态;the state of the optimizer of the AI model;
所述AI模型的第一数据集的划分方式;The division method of the first data set of the AI model;
所述AI模型的第一数据集的构成信息;Composition information of the first data set of the AI model;
所述AI模型的第一数据集对所述AI模型更新的贡献权重;The contribution weight of the first data set of the AI model to the update of the AI model;
与所述第一信息关联的AI模型标识;An AI model identifier associated with the first information;
所述第一AI模型的基准分布;a baseline distribution of the first AI model;
其中,所述第一数据集以下至少之一:所述第一AI模型所使用的原始数据集,所述第一AI模型新采集的数据集;Wherein, the first data set is at least one of the following: the original data set used by the first AI model, the data set newly collected by the first AI model;
所述基准分布的参数信息包括以下至少之一:方差,均值,标准差。The parameter information of the reference distribution includes at least one of the following: variance, mean, and standard deviation.
可选地,在本申请实施例中,所述参数配置信息包括所述第一AI模型的在线学习模式、且所述在线学习模式为瞬时训练模式,所述参数配置信息还包括以下至少之一:所述第一设备所采集的数据量,所述数据量的采集时间长度。Optionally, in this embodiment of the present application, the parameter configuration information includes the online learning mode of the first AI model, and the online learning mode is an instantaneous training mode, and the parameter configuration information further includes at least one of the following : the amount of data collected by the first device, and the length of time for collecting the data.
可选地,在本申请实施例中,所述参数配置信息包括所述第一AI模型的在线学习模式、且所述在线学习模式为连续学习模式,所述参数配置信息还包括以下至少之一:相邻两次在线学习的时间间隔,相邻两次在线学习的数据量间隔。Optionally, in this embodiment of the present application, the parameter configuration information includes the online learning mode of the first AI model, and the online learning mode is a continuous learning mode, and the parameter configuration information further includes at least one of the following : the time interval between two adjacent online learning, and the data volume interval between two adjacent online learning.
可选地,在本申请实施例中,所述第二设备包括以下至少一项:核心网设备,接入网设备,终端;所述第一设备包括以下至少一项:核心网设备,接入网设备,终端。Optionally, in this embodiment of the present application, the second device includes at least one of the following: core network equipment, access network equipment, and terminal; the first device includes at least one of the following: core network equipment, access Network equipment, terminals.
在本申请实施例提供的AI模型的在线学习装置中,第二设备为第一设备配置第一AI模型,以及为该第一设备配置上述第一AI模型的在线学习信息。通过该方法,通过在第一设备侧部署第一AI模型,并对该第一模型配置在线学习所需的参数,使得可以在第一设备侧对该第一AI模型进行连续在线调整,从而维持第一AI模型的预测性能,进而保证第一设备的服务质量。 In the online AI model learning apparatus provided in the embodiment of the present application, the second device configures the first AI model for the first device, and configures the online learning information of the first AI model for the first device. With this method, by deploying the first AI model on the first device side and configuring the first model with parameters required for online learning, the first AI model can be continuously adjusted online on the first device side, thereby maintaining The predictive performance of the first AI model, thereby ensuring the service quality of the first device.
本申请实施例提供一种AI模型的在线学习装置500,如图5所示,该AI模型的在线学习装置500包括:获取模块501和执行模块502,其中:所述获取模块501,用于所述第一设备获取第一AI模型;所述执行模块502,用于所述第一设备基于所述第一AI模型的在线学习信息,对所述第一AI模型进行在线学习。An embodiment of the present application provides an online learning device 500 for an AI model. As shown in FIG. The first device acquires a first AI model; the execution module 502 is configured to enable the first device to learn online the first AI model based on the online learning information of the first AI model.
可选地,在本申请实施例中,所述第一设备获取第一AI模型,包括:Optionally, in this embodiment of the application, the first device acquires the first AI model, including:
第一设备接收第二设备配置的第一AI模型。The first device receives the first AI model configured by the second device.
可选地,在本申请实施例中,所述获取模块,具体用于Optionally, in the embodiment of this application, the acquisition module is specifically used to
从所述第二设备获取所述第一AI模型的在线学习信息。Obtain online learning information of the first AI model from the second device.
可选地,在本申请实施例中,所述装置还包括:配置模块,所述配置模块用于为第三设备配置所述第一AI模型的在线学习信息。Optionally, in the embodiment of the present application, the apparatus further includes: a configuration module, configured to configure the online learning information of the first AI model for the third device.
可选地,在本申请实施例中,所述在线学习信息包括以下至少之一:Optionally, in this embodiment of the application, the online learning information includes at least one of the following:
在线学习的触发方式;How eLearning is triggered;
在线学习的中止条件;Conditions for discontinuation of online learning;
在线学习的参数配置信息;Parameter configuration information for online learning;
在线学习的数据集。Datasets for online learning.
可选地,在本申请实施例中,所述触发方式对应的触发条件包括以下至少之一:Optionally, in this embodiment of the present application, the trigger condition corresponding to the trigger mode includes at least one of the following:
所述第一设备的状态信息满足第一预设条件;The state information of the first device satisfies a first preset condition;
所述第一设备采集到的数据量大于第一阈值;The amount of data collected by the first device is greater than a first threshold;
所述第一设备的测量信息满足第二预设条件;The measurement information of the first device satisfies a second preset condition;
所述第一AI模型的输出结果的误差信息大于第二阈值;The error information of the output result of the first AI model is greater than a second threshold;
所述第一AI模型对应的第一信息的统计信息满足第三预设条件;The statistical information of the first information corresponding to the first AI model satisfies a third preset condition;
所述第一设备的测量信息的统计信息满足第四预设条件;Statistical information of the measurement information of the first device satisfies a fourth preset condition;
其中,所述状态信息包括以下至少一项:移动速度、波束切换信息、小区切换信息;Wherein, the state information includes at least one of the following: moving speed, beam switching information, cell switching information;
所述第一信息包括以下至少之一:所述第一AI模型的输入信息,所述第一AI模型的输出信息。The first information includes at least one of the following: input information of the first AI model, and output information of the first AI model.
可选地,在本申请实施例中,所述第一信息的统计信息包括以下至少之一:第一时间窗内所述第一信息的第一统计量,至少两个连续的第二时间窗内所述第一信息对应的第二统计量,第一小区下的至少两个终端在第一时刻的第一信息的统计信息,所述第一信息的相关性信息;Optionally, in this embodiment of the present application, the statistical information of the first information includes at least one of the following: a first statistical quantity of the first information within a first time window, at least two consecutive second time windows A second statistic corresponding to the first information, statistical information of the first information of at least two terminals under the first cell at the first moment, and correlation information of the first information;
所述第二统计量是基于各个所述第二时间窗内的统计量计算出的;The second statistics are calculated based on statistics in each of the second time windows;
所述第一设备的测量信息的统计信息包括以下至少之一:在第三时间窗内针对上述测量信息的第三统计量,所述测量信息在至少两个连续的第四时间窗对应的第四统计量,所述测量信息的相关性信息;The statistical information of the measurement information of the first device includes at least one of the following: a third statistic for the above measurement information within a third time window, where the measurement information corresponds to at least two consecutive fourth time windows. Four statistics, correlation information of the measurement information;
所述第四统计量是基于各个所述第四时间窗内的统计量计算出的;The fourth statistic is calculated based on the statistic in each of the fourth time windows;
所述相关性信息包括以下至少之一:数据之间的距离,协方差,相关系数。The correlation information includes at least one of the following: distance between data, covariance, and correlation coefficient.
可选地,在本申请实施例中,所述触发条件包括:所述第一设备的状态信息满足第一预设条件;Optionally, in this embodiment of the present application, the trigger condition includes: the status information of the first device meets a first preset condition;
所述满足第一预设条件包括以下至少之一:Said meeting the first preset condition includes at least one of the following:
所述第一设备的移动速度大于第三阈值;The moving speed of the first device is greater than a third threshold;
所述波束切换信息指示所述第一设备发生波束切换、且波束切换频率大于第四阈值;The beam switching information indicates that beam switching occurs for the first device, and the beam switching frequency is greater than a fourth threshold;
所述小区切换信息指示所述第一设备发生小区切换。The cell switching information indicates that a cell switching occurs to the first device.
可选地,在本申请实施例中,所述触发条件包括:所述第一设备接收的参考信号的测量信息满足第二预设条件;Optionally, in this embodiment of the present application, the trigger condition includes: the measurement information of the reference signal received by the first device satisfies a second preset condition;
所述满足第二预设条件包括:所述参考信号的测量信息指示所述第一设备的相关信道的信道环境发生变化;The meeting the second preset condition includes: the measurement information of the reference signal indicates that the channel environment of the relevant channel of the first device changes;
其中,所述测量信息包括以下至少之一:所述第一设备接收的参考信号的第一测量信息,所述第一设备的传感器采集的第二测量信息;Wherein, the measurement information includes at least one of the following: first measurement information of a reference signal received by the first device, and second measurement information collected by a sensor of the first device;
所述第一测量信息包括以下至少之一:所述参考信号的瞬时测量信息,所述参考信号的统计测量信息;The first measurement information includes at least one of the following: instantaneous measurement information of the reference signal, statistical measurement information of the reference signal;
所述参考信号包括以下至少之一:同步信号块SSB、CSI参考信号CSI-RS、探测参考信号SRS、定位参考信号PRS。The reference signal includes at least one of the following: a synchronization signal block SSB, a CSI reference signal CSI-RS, a sounding reference signal SRS, and a positioning reference signal PRS.
可选地,在本申请实施例中,所述触发条件包括:所述第一AI模型对应的第二信息的统计信息满足第三预设条件;Optionally, in this embodiment of the present application, the trigger condition includes: the statistical information of the second information corresponding to the first AI model satisfies a third preset condition;
所述满足第三预设条件包括以下至少之一:Said meeting the third preset condition includes at least one of the following:
所述第一统计量大于第一阈值区间的最大值; The first statistic is greater than the maximum value of the first threshold interval;
所述第二统计量大于第二阈值区间的最大值;The second statistic is greater than the maximum value of the second threshold interval;
在至少两个时间窗内采集的所述第一信息的相关性信息满足第一条件;The correlation information of the first information collected in at least two time windows satisfies a first condition;
在当前时间窗内采集的不同所述第一信息之间的相关性信息满足第二条件。Correlation information between different first pieces of information collected within the current time window satisfies the second condition.
可选地,在本申请实施例中,所述触发条件包括:所述第一设备的测量信息的统计信息满足第四预设条件;Optionally, in this embodiment of the present application, the trigger condition includes: statistical information of the measurement information of the first device meets a fourth preset condition;
所述满足第四预设条件包括以下至少之一:Said meeting the fourth preset condition includes at least one of the following:
所述第三统计量大于第三阈值区间的最大值;The third statistic is greater than the maximum value of the third threshold interval;
所述第四统计量大于第四阈值区间的最大值;The fourth statistic is greater than the maximum value of the fourth threshold interval;
在至少两个时间窗内采集的所述测量信息的相关性信息满足第三条件;The correlation information of the measurement information collected in at least two time windows satisfies the third condition;
在当前时间窗内采集的不同所述测量信息之间的相关性信息满足第四条件;Correlation information among different measurement information collected in the current time window satisfies the fourth condition;
所述测量信息的分布与基准分布的差异大于第五阈值,所述基准分布为所述第二设备为所述第一设备配置的信息。A difference between the distribution of the measurement information and a reference distribution is greater than a fifth threshold, where the reference distribution is information configured by the second device for the first device.
可选地,在本申请实施例中,所述触发方式的触发条件包括以下至少之一:Optionally, in this embodiment of the present application, the trigger condition of the trigger mode includes at least one of the following:
所述第二设备指示所述第一设备进行在线学习;The second device instructs the first device to perform online learning;
所述第一AI模型的输出精度小于或者等于第六阈值。The output accuracy of the first AI model is less than or equal to the sixth threshold.
可选地,在本申请实施例中,所述第二设备指示所述第一设备进行在线学习,包括以下至少之一:Optionally, in this embodiment of the present application, the second device instructs the first device to perform online learning, including at least one of the following:
所述第二设备指示所述第一设备周期性进行在线学习;The second device instructs the first device to periodically perform online learning;
所述第二设备指示所述第一设备半周期性进行在线学习;The second device instructs the first device to perform online learning semi-periodically;
所述第二设备指示所述第一设备非周期性进行在线学习。The second device instructs the first device to perform online learning aperiodically.
其中,所述第一设备周期性或半周期性进行在线学习时采用的周期为:所述第二设备预配置的周期,或者,所述第一设备自主配置的周期。Wherein, the period adopted by the first device to perform online learning periodically or semi-periodically is: a period preconfigured by the second device, or a period independently configured by the first device.
可选地,在本申请实施例中,所述第二设备指示所述第一设备半周期性进行在线学习,包括:Optionally, in this embodiment of the present application, the second device instructs the first device to perform online learning semi-periodically, including:
所述第二设备通过第一信令指示所述第一设备半周期性进行在线学习,所述第一信令包括以下至少之一:媒体接入控制-控制单元MAC-CE,下行控制信息DCI。The second device instructs the first device to perform online learning semi-periodically through a first signaling, and the first signaling includes at least one of the following: medium access control-control element MAC-CE, downlink control information DCI .
可选地,在本申请实施例中,所述中止条件包括以下至少之一:Optionally, in this embodiment of the application, the termination condition includes at least one of the following:
所述第一AI模型的在线学习次数大于预设迭代次数;The number of online learning of the first AI model is greater than the preset number of iterations;
所述第一AI模型达到预设精度;The first AI model reaches a preset accuracy;
所述第一AI模型的输出结果的误差信息小于第七阈值;The error information of the output result of the first AI model is less than the seventh threshold;
所述第二设备突发性指示所述第一设备结束当前在线学习过程;The second device abruptly instructs the first device to end the current online learning process;
与所述第一AI模型关联的目标任务中止;a target task associated with the first AI model is aborted;
所述第一设备的测量信息分布与基准分布的差异信息小于第八阈值。The difference information between the measurement information distribution of the first device and the reference distribution is smaller than an eighth threshold.
可选地,在本申请实施例中,所述参数配置信息包括以下至少之一:Optionally, in this embodiment of the present application, the parameter configuration information includes at least one of the following:
所述AI模型的在线学习模式;The online learning mode of the AI model;
所述AI模型的样本批量的大小;The size of the sample batch of the AI model;
所述AI模型的优化器的状态;the state of the optimizer of the AI model;
所述AI模型的第一数据集的划分方式;The division method of the first data set of the AI model;
所述AI模型的第一数据集的构成信息;Composition information of the first data set of the AI model;
所述AI模型的第一数据集对所述AI模型更新的贡献权重;The contribution weight of the first data set of the AI model to the update of the AI model;
与所述第一信息关联的AI模型标识;An AI model identifier associated with the first information;
所述第一AI模型的基准分布;a baseline distribution of the first AI model;
其中,所述第一数据集以下至少之一:所述第一AI模型所使用的原始数据集,所述第一AI模型新采集的数据集;Wherein, the first data set is at least one of the following: the original data set used by the first AI model, the data set newly collected by the first AI model;
所述基准分布的参数信息包括以下至少之一:方差,均值,标准差。The parameter information of the reference distribution includes at least one of the following: variance, mean, and standard deviation.
可选地,在本申请实施例中,所述参数配置信息包括所述第一AI模型的在线学习模式、且所述在线学习模式为瞬时训练模式,所述参数配置信息还包括以下至少之一:所述第一设备所采集的数据量,所述数据量的采集时间长度。Optionally, in this embodiment of the present application, the parameter configuration information includes the online learning mode of the first AI model, and the online learning mode is an instantaneous training mode, and the parameter configuration information further includes at least one of the following : the amount of data collected by the first device, and the length of time for collecting the data.
可选地,在本申请实施例中,所述参数配置信息包括所述第一AI模型的在线学习模式、且所述在线学习模式为连续学习模式,所述参数配置信息还包括以下至少之一:相邻两次在线学习的时间间隔,相邻两次在线学习的数据量间隔。Optionally, in this embodiment of the present application, the parameter configuration information includes the online learning mode of the first AI model, and the online learning mode is a continuous learning mode, and the parameter configuration information further includes at least one of the following : The time interval between two adjacent online learning, and the data volume interval between two adjacent online learning.
可选地,在本申请实施例中,所述第二设备、第一设备以及第三设备包括以下至少一项:核心网设备,接入网设备,以及终端。 Optionally, in this embodiment of the present application, the second device, the first device, and the third device include at least one of the following: a core network device, an access network device, and a terminal.
在本申请实施例提供的AI模型的在线学习装置中,第一设备获取第一AI模型,并基于该第一AI模型的在线学习信息对该第一AI模型进行在线学习。通过该方法,通过在第一设备侧部署第一AI模型,并对该第一模型配置在线学习所需的参数,使得可以在第一设备侧对该第一AI模型进行连续在线调整,从而维持第一AI模型的预测性能,进而保证第一设备的服务质量。In the apparatus for online learning of an AI model provided in the embodiment of the present application, the first device acquires a first AI model, and performs online learning on the first AI model based on online learning information of the first AI model. With this method, by deploying the first AI model on the first device side and configuring the first model with parameters required for online learning, the first AI model can be continuously adjusted online on the first device side, thereby maintaining The predictive performance of the first AI model, thereby ensuring the service quality of the first device.
本申请实施例中的AI模型的在线学习装置可以是电子设备,例如具有操作系统的电子设备,也可以是电子设备中的部件,例如集成电路或芯片。该电子设备可以是终端,也可以为除终端之外的其他设备。示例性的,终端可以包括但不限于上述所列举的终端11的类型,其他设备可以为服务器、网络附属存储器(Network Attached Storage,NAS)等,本申请实施例不作具体限定。The online learning apparatus for the AI model in the embodiment of the present application may be an electronic device, such as an electronic device with an operating system, or a component in the electronic device, such as an integrated circuit or a chip. The electronic device may be a terminal, or other devices other than the terminal. Exemplarily, the terminal may include, but not limited to, the types of terminal 11 listed above, and other devices may be servers, Network Attached Storage (NAS), etc., which are not specifically limited in this embodiment of the present application.
本申请实施例提供的AI模型的在线学习装置能够实现图1至图3的方法实施例实现的各个过程,并达到相同的技术效果,为避免重复,这里不再赘述。The AI model online learning device provided by the embodiment of the present application can realize the various processes realized by the method embodiments in Fig. 1 to Fig. 3 and achieve the same technical effect. To avoid repetition, details are not repeated here.
可选的,如图6所示,本申请实施例还提供一种通信设备600,包括处理器601和存储器602,存储器602上存储有可在所述处理器601上运行的程序或指令,例如,该通信设备600为终端时,该程序或指令被处理器601执行时实现上述AI模型的在线学习方法实施例的各个步骤,且能达到相同的技术效果。该通信设备600为网络侧设备时,该程序或指令被处理器601执行时实现上述AI模型的在线学习方法实施例的各个步骤,且能达到相同的技术效果,为避免重复,这里不再赘述。Optionally, as shown in FIG. 6 , this embodiment of the present application also provides a communication device 600, including a processor 601 and a memory 602, and the memory 602 stores programs or instructions that can run on the processor 601, such as When the communication device 600 is a terminal, when the program or instruction is executed by the processor 601, each step of the above embodiment of the online learning method of the AI model can be realized, and the same technical effect can be achieved. When the communication device 600 is a network-side device, when the program or instruction is executed by the processor 601, the steps of the above-mentioned online learning method for the AI model are implemented, and the same technical effect can be achieved. To avoid repetition, details are not repeated here. .
以第一设备为终端为例。Take the first device as a terminal as an example.
本申请实施例还提供一种终端,包括处理器和通信接口,处理器用于获取第一AI模型,基于第一AI模型的在线学习信息,对第一AI模型进行在线学习。该终端实施例与上述终端侧方法实施例对应,上述方法实施例的各个实施过程和实现方式均可适用于该终端实施例中,且能达到相同的技术效果。具体地,图7为实现本申请实施例的一种终端的硬件结构示意图。The embodiment of the present application further provides a terminal, including a processor and a communication interface, and the processor is configured to acquire a first AI model, and perform online learning on the first AI model based on online learning information of the first AI model. This terminal embodiment corresponds to the above-mentioned terminal-side method embodiment, and each implementation process and implementation mode of the above-mentioned method embodiment can be applied to this terminal embodiment, and can achieve the same technical effect. Specifically, FIG. 7 is a schematic diagram of a hardware structure of a terminal implementing an embodiment of the present application.
该终端700包括但不限于:射频单元701、网络模块702、音频输出单元703、输入单元704、传感器705、显示单元706、用户输入单元707、接口单元708、存储器709以及处理器710等中的至少部分部件。The terminal 700 includes, but is not limited to: a radio frequency unit 701, a network module 702, an audio output unit 703, an input unit 704, a sensor 705, a display unit 706, a user input unit 707, an interface unit 708, a memory 709, and a processor 710. At least some parts.
本领域技术人员可以理解,终端700还可以包括给各个部件供电的电源(比如电池),电源可以通过电源管理系统与处理器710逻辑相连,从而通过电源管理系统实现管理充电、放电、以及功耗管理等功能。图7中示出的终端结构并不构成对终端的限定,终端可以包括比图示更多或更少的部件,或者组合某些部件,或者不同的部件布置,在此不再赘述。Those skilled in the art can understand that the terminal 700 may also include a power supply (such as a battery) for supplying power to various components, and the power supply may be logically connected to the processor 710 through the power management system, so as to manage charging, discharging, and power consumption through the power management system. Management and other functions. The terminal structure shown in FIG. 7 does not constitute a limitation on the terminal, and the terminal may include more or fewer components than shown in the figure, or combine some components, or arrange different components, which will not be repeated here.
应理解的是,本申请实施例中,输入单元704可以包括图形处理单元(Graphics Processing Unit,GPU)7041和麦克风7042,图形处理器7041对在视频捕获模式或图像捕获模式中由图像捕获装置(如摄像头)获得的静态图片或视频的图像数据进行处理。显示单元706可包括显示面板7061,可以采用液晶显示器、有机发光二极管等形式来配置显示面板7061。用户输入单元707包括触控面板7071以及其他输入设备7072中的至少一种。触控面板7071,也称为触摸屏。触控面板7071可包括触摸检测装置和触摸控制器两个部分。其他输入设备7072可以包括但不限于物理键盘、功能键(比如音量控制按键、开关按键等)、轨迹球、鼠标、操作杆,在此不再赘述。It should be understood that, in this embodiment of the present application, the input unit 704 may include a graphics processing unit (Graphics Processing Unit, GPU) 7041 and a microphone 7042, and the graphics processor 7041 is used by the image capture device ( Such as the image data of the still picture or video obtained by the camera) for processing. The display unit 706 may include a display panel 7061, and the display panel 7061 may be configured in the form of a liquid crystal display, an organic light emitting diode, or the like. The user input unit 707 includes at least one of a touch panel 7071 and other input devices 7072 . The touch panel 7071 is also called a touch screen. The touch panel 7071 may include two parts, a touch detection device and a touch controller. Other input devices 7072 may include, but are not limited to, physical keyboards, function keys (such as volume control buttons, switch buttons, etc.), trackballs, mice, and joysticks, which will not be described in detail here.
本申请实施例中,射频单元701接收来自网络侧设备的下行数据后,可以传输给处理器710进行处理;另外,射频单元701可以向网络侧设备发送上行数据。通常,射频单元701包括但不限于天线、放大器、收发信机、耦合器、低噪声放大器、双工器等。In the embodiment of the present application, the radio frequency unit 701 may transmit the downlink data from the network side device to the processor 710 for processing after receiving the downlink data; in addition, the radio frequency unit 701 may send uplink data to the network side device. Generally, the radio frequency unit 701 includes, but is not limited to, an antenna, an amplifier, a transceiver, a coupler, a low noise amplifier, a duplexer, and the like.
存储器709可用于存储软件程序或指令以及各种数据。存储器709可主要包括存储程序或指令的第一存储区和存储数据的第二存储区,其中,第一存储区可存储操作系统、至少一个功能所需的应用程序或指令(比如声音播放功能、图像播放功能等)等。此外,存储器709可以包括易失性存储器或非易失性存储器,或者,存储器709可以包括易失性和非易失性存储器两者。其中,非易失性存储器可以是只读存储器(Read-Only Memory,ROM)、可编程只读存储器(Programmable ROM,PROM)、可擦除可编程只读存储器(Erasable PROM,EPROM)、电可擦除可编程只读存储器(Electrically EPROM,EEPROM)或闪存。易失性存储器可以是随机存取存储器(Random Access Memory,RAM),静态随机存取存储器(Static RAM,SRAM)、动态随机存取存储器(Dynamic RAM,DRAM)、同步动态随机存取存储器(Synchronous DRAM,SDRAM)、双倍数据速率同步动态随机存取存储器(Double Data Rate SDRAM,DDRSDRAM)、增强型同步动态随机存取存储器(Enhanced SDRAM,ESDRAM)、同步连接动态随机存取存储器(Synch link DRAM,SLDRAM)和直接内存总线随机存取存储器(Direct Rambus RAM,DRRAM)。本申请实施例中的存储器709包括但不限于这些和任意其它适合类型的存储器。The memory 709 can be used to store software programs or instructions as well as various data. The memory 709 may mainly include a first storage area for storing programs or instructions and a second storage area for storing data, wherein the first storage area may store an operating system, an application program or instructions required by at least one function (such as a sound playing function, image playback function, etc.), etc. Furthermore, memory 709 may include volatile memory or nonvolatile memory, or, memory 709 may include both volatile and nonvolatile memory. Among them, the non-volatile memory can be read-only memory (Read-Only Memory, ROM), programmable read-only memory (Programmable ROM, PROM), erasable programmable read-only memory (Erasable PROM, EPROM), electronically programmable Erase Programmable Read-Only Memory (Electrically EPROM, EEPROM) or Flash. Volatile memory can be random access memory (Random Access Memory, RAM), static random access memory (Static RAM, SRAM), dynamic random access memory (Dynamic RAM, DRAM), synchronous dynamic random access memory (Synchronous DRAM, SDRAM), double data rate synchronous dynamic random access memory (Double Data Rate SDRAM, DDRSDRAM), enhanced synchronous dynamic random access memory (Enhanced SDRAM, ESDRAM), synchronous connection dynamic random access memory (Synch link DRAM , SLDRAM) and Direct Memory Bus Random Access Memory (Direct Rambus RAM, DRRAM). The memory 709 in the embodiment of the present application includes but is not limited to these and any other suitable types of memory.
处理器710可包括一个或多个处理单元;可选的,处理器710集成应用处理器和调制解调处理器,其中,应用处理器主要处理涉及操作系统、用户界面和应用程序等的操作,调制解调处理器主要处理无 线通信信号,如基带处理器。可以理解的是,上述调制解调处理器也可以不集成到处理器710中。The processor 710 may include one or more processing units; optionally, the processor 710 integrates an application processor and a modem processor, wherein the application processor mainly processes operations related to the operating system, user interface, and application programs, etc., The modem processor mainly handles the line communication signals, such as baseband processors. It can be understood that the foregoing modem processor may not be integrated into the processor 710 .
其中,所述处理器710用于所述第一设备获取第一AI模型,所述处理器710,还用于所述第一设备基于所述第一AI模型的在线学习信息,对所述第一AI模型进行在线学习。Wherein, the processor 710 is used for the first device to acquire the first AI model, and the processor 710 is also used for the first device to perform the first AI model based on the online learning information of the first AI model. An AI model for online learning.
可选地,在本申请实施例中,所述射频单元701,用于接收第二设备配置的第一AI模型。Optionally, in this embodiment of the present application, the radio frequency unit 701 is configured to receive the first AI model configured by the second device.
可选地,在本申请实施例中,所述处理器710,具体用于从所述第二设备获取所述第一AI模型的在线学习信息。Optionally, in this embodiment of the present application, the processor 710 is specifically configured to acquire online learning information of the first AI model from the second device.
可选地,在本申请实施例中,所述处理器710,还用于为第三设备配置所述第一AI模型的在线学习信息。Optionally, in this embodiment of the present application, the processor 710 is further configured to configure online learning information of the first AI model for the third device.
可选地,在本申请实施例中,所述在线学习信息包括以下至少之一:Optionally, in this embodiment of the application, the online learning information includes at least one of the following:
在线学习的触发方式;How eLearning is triggered;
在线学习的中止条件;Conditions for discontinuation of online learning;
在线学习的参数配置信息;Parameter configuration information for online learning;
在线学习的数据集。Datasets for online learning.
可选地,在本申请实施例中,所述触发方式对应的触发条件包括以下至少之一:Optionally, in this embodiment of the present application, the trigger condition corresponding to the trigger mode includes at least one of the following:
所述第一设备的状态信息满足第一预设条件;The state information of the first device satisfies a first preset condition;
所述第一设备采集到的数据量大于第一阈值;The amount of data collected by the first device is greater than a first threshold;
所述第一设备的测量信息满足第二预设条件;The measurement information of the first device satisfies a second preset condition;
所述第一AI模型的输出结果的误差信息大于第二阈值;The error information of the output result of the first AI model is greater than a second threshold;
所述第一AI模型对应的第一信息的统计信息满足第三预设条件;The statistical information of the first information corresponding to the first AI model satisfies a third preset condition;
所述第一设备的测量信息的统计信息满足第四预设条件;Statistical information of the measurement information of the first device satisfies a fourth preset condition;
其中,所述状态信息包括以下至少一项:移动速度、波束切换信息、小区切换信息;Wherein, the state information includes at least one of the following: moving speed, beam switching information, cell switching information;
所述第一信息包括以下至少之一:所述第一AI模型的输入信息,所述第一AI模型的输出信息。The first information includes at least one of the following: input information of the first AI model, and output information of the first AI model.
可选地,在本申请实施例中,所述第一信息的统计信息包括以下至少之一:第一时间窗内所述第一信息的第一统计量,至少两个连续的第二时间窗内所述第一信息对应的第二统计量,第一小区下的至少两个终端在第一时刻的第一信息的统计信息,所述第一信息的相关性信息;Optionally, in this embodiment of the present application, the statistical information of the first information includes at least one of the following: a first statistical quantity of the first information within a first time window, at least two consecutive second time windows A second statistic corresponding to the first information, statistical information of the first information of at least two terminals under the first cell at the first moment, and correlation information of the first information;
所述第二统计量是基于各个所述第二时间窗内的统计量计算出的;The second statistics are calculated based on statistics in each of the second time windows;
所述第一设备的测量信息的统计信息包括以下至少之一:在第三时间窗内针对上述测量信息的第三统计量,所述测量信息在至少两个连续的第四时间窗对应的第四统计量,所述测量信息的相关性信息;The statistical information of the measurement information of the first device includes at least one of the following: a third statistic for the above measurement information within a third time window, where the measurement information corresponds to at least two consecutive fourth time windows. Four statistics, correlation information of the measurement information;
所述第四统计量是基于各个所述第四时间窗内的统计量计算出的;The fourth statistic is calculated based on the statistic in each of the fourth time windows;
所述相关性信息包括以下至少之一:数据之间的距离,协方差,相关系数。The correlation information includes at least one of the following: distance between data, covariance, and correlation coefficient.
可选地,在本申请实施例中,所述触发条件包括:所述第一设备的状态信息满足第一预设条件;Optionally, in this embodiment of the present application, the trigger condition includes: the status information of the first device meets a first preset condition;
所述满足第一预设条件包括以下至少之一:Said meeting the first preset condition includes at least one of the following:
所述第一设备的移动速度大于第三阈值;The moving speed of the first device is greater than a third threshold;
所述波束切换信息指示所述第一设备发生波束切换、且波束切换频率大于第四阈值;The beam switching information indicates that beam switching occurs for the first device, and the beam switching frequency is greater than a fourth threshold;
所述小区切换信息指示所述第一设备发生小区切换。The cell switching information indicates that a cell switching occurs to the first device.
可选地,在本申请实施例中,所述触发条件包括:所述第一设备接收的参考信号的测量信息满足第二预设条件;Optionally, in this embodiment of the present application, the trigger condition includes: the measurement information of the reference signal received by the first device satisfies a second preset condition;
所述满足第二预设条件包括:所述参考信号的测量信息指示所述第一设备的相关信道的信道环境发生变化;The meeting the second preset condition includes: the measurement information of the reference signal indicates that the channel environment of the relevant channel of the first device changes;
其中,所述测量信息包括以下至少之一:所述第一设备接收的参考信号的第一测量信息,所述第一设备的传感器采集的第二测量信息;Wherein, the measurement information includes at least one of the following: first measurement information of a reference signal received by the first device, and second measurement information collected by a sensor of the first device;
所述第一测量信息包括以下至少之一:所述参考信号的瞬时测量信息,所述参考信号的统计测量信息;The first measurement information includes at least one of the following: instantaneous measurement information of the reference signal, statistical measurement information of the reference signal;
所述参考信号包括以下至少之一:同步信号块SSB、CSI参考信号CSI-RS、探测参考信号SRS、定位参考信号PRS。The reference signal includes at least one of the following: a synchronization signal block SSB, a CSI reference signal CSI-RS, a sounding reference signal SRS, and a positioning reference signal PRS.
可选地,在本申请实施例中,所述触发条件包括:所述第一AI模型对应的第二信息的统计信息满足第三预设条件;Optionally, in this embodiment of the present application, the trigger condition includes: the statistical information of the second information corresponding to the first AI model satisfies a third preset condition;
所述满足第三预设条件包括以下至少之一:Said meeting the third preset condition includes at least one of the following:
所述第一统计量大于第一阈值区间的最大值;The first statistic is greater than the maximum value of the first threshold interval;
所述第二统计量大于第二阈值区间的最大值;The second statistic is greater than the maximum value of the second threshold interval;
在至少两个时间窗内采集的所述第一信息的相关性信息满足第一条件; The correlation information of the first information collected in at least two time windows satisfies a first condition;
在当前时间窗内采集的不同所述第一信息之间的相关性信息满足第二条件。Correlation information between different first pieces of information collected within the current time window satisfies the second condition.
可选地,在本申请实施例中,所述触发条件包括:所述第一设备的测量信息的统计信息满足第四预设条件;Optionally, in this embodiment of the present application, the trigger condition includes: statistical information of the measurement information of the first device meets a fourth preset condition;
所述满足第四预设条件包括以下至少之一:Said meeting the fourth preset condition includes at least one of the following:
所述第三统计量大于第三阈值区间的最大值;The third statistic is greater than the maximum value of the third threshold interval;
所述第四统计量大于第四阈值区间的最大值;The fourth statistic is greater than the maximum value of the fourth threshold interval;
在至少两个时间窗内采集的所述测量信息的相关性信息满足第三条件;The correlation information of the measurement information collected in at least two time windows satisfies the third condition;
在当前时间窗内采集的不同所述测量信息之间的相关性信息满足第四条件;Correlation information among different measurement information collected in the current time window satisfies the fourth condition;
所述测量信息的分布与基准分布的差异大于第五阈值,所述基准分布为所述第二设备为所述第一设备配置的信息。A difference between the distribution of the measurement information and a reference distribution is greater than a fifth threshold, where the reference distribution is information configured by the second device for the first device.
可选地,在本申请实施例中,所述触发方式的触发条件包括以下至少之一:Optionally, in this embodiment of the present application, the trigger condition of the trigger mode includes at least one of the following:
所述第二设备指示所述第一设备进行在线学习;The second device instructs the first device to perform online learning;
所述第一AI模型的输出精度小于或者等于第六阈值。The output accuracy of the first AI model is less than or equal to the sixth threshold.
可选地,在本申请实施例中,所述第二设备指示所述第一设备进行在线学习,包括以下至少之一:Optionally, in this embodiment of the present application, the second device instructs the first device to perform online learning, including at least one of the following:
所述第二设备指示所述第一设备周期性进行在线学习;The second device instructs the first device to periodically perform online learning;
所述第二设备指示所述第一设备半周期性进行在线学习;The second device instructs the first device to perform online learning semi-periodically;
所述第二设备指示所述第一设备非周期性进行在线学习。The second device instructs the first device to perform online learning aperiodically.
其中,所述第一设备周期性或半周期性进行在线学习时采用的周期为:所述第二设备预配置的周期,或者,所述第一设备自主配置的周期。Wherein, the period adopted by the first device to perform online learning periodically or semi-periodically is: a period preconfigured by the second device, or a period independently configured by the first device.
可选地,在本申请实施例中,所述第二设备指示所述第一设备半周期性进行在线学习,包括:Optionally, in this embodiment of the present application, the second device instructs the first device to perform online learning semi-periodically, including:
所述第二设备通过第一信令指示所述第一设备半周期性进行在线学习,所述第一信令包括以下至少之一:媒体接入控制-控制单元MAC-CE,下行控制信息DCI。The second device instructs the first device to perform online learning semi-periodically through a first signaling, and the first signaling includes at least one of the following: medium access control-control element MAC-CE, downlink control information DCI .
可选地,在本申请实施例中,所述中止条件包括以下至少之一:Optionally, in this embodiment of the application, the termination condition includes at least one of the following:
所述第一AI模型的在线学习次数大于预设迭代次数;The number of online learning of the first AI model is greater than the preset number of iterations;
所述第一AI模型达到预设精度;The first AI model reaches a preset accuracy;
所述第一AI模型的输出结果的误差信息小于第七阈值;The error information of the output result of the first AI model is less than the seventh threshold;
所述第二设备突发性指示所述第一设备结束当前在线学习过程;The second device abruptly instructs the first device to end the current online learning process;
与所述第一AI模型关联的目标任务中止;a target task associated with the first AI model is aborted;
所述第一设备的测量信息分布与基准分布的差异信息小于第八阈值。The difference information between the measurement information distribution of the first device and the reference distribution is smaller than an eighth threshold.
可选地,在本申请实施例中,所述参数配置信息包括以下至少之一:Optionally, in this embodiment of the present application, the parameter configuration information includes at least one of the following:
所述AI模型的在线学习模式;The online learning mode of the AI model;
所述AI模型的样本批量的大小;The size of the sample batch of the AI model;
所述AI模型的优化器的状态;the state of the optimizer of the AI model;
所述AI模型的第一数据集的划分方式;The division method of the first data set of the AI model;
所述AI模型的第一数据集的构成信息;Composition information of the first data set of the AI model;
所述AI模型的第一数据集对所述AI模型更新的贡献权重;The contribution weight of the first data set of the AI model to the update of the AI model;
与所述第一信息关联的AI模型标识;An AI model identifier associated with the first information;
所述第一AI模型的基准分布;a baseline distribution of the first AI model;
其中,所述第一数据集以下至少之一:所述第一AI模型所使用的原始数据集,所述第一AI模型新采集的数据集;Wherein, the first data set is at least one of the following: the original data set used by the first AI model, the data set newly collected by the first AI model;
所述基准分布的参数信息包括以下至少之一:方差,均值,标准差。The parameter information of the reference distribution includes at least one of the following: variance, mean, and standard deviation.
可选地,在本申请实施例中,所述参数配置信息包括所述第一AI模型的在线学习模式、且所述在线学习模式为瞬时训练模式,所述参数配置信息还包括以下至少之一:所述第一设备所采集的数据量,所述数据量的采集时间长度。Optionally, in this embodiment of the present application, the parameter configuration information includes the online learning mode of the first AI model, and the online learning mode is an instantaneous training mode, and the parameter configuration information further includes at least one of the following : the amount of data collected by the first device, and the length of time for collecting the data.
可选地,在本申请实施例中,所述参数配置信息包括所述第一AI模型的在线学习模式、且所述在线学习模式为连续学习模式,所述参数配置信息还包括以下至少之一:相邻两次在线学习的时间间隔,相邻两次在线学习的数据量间隔。Optionally, in this embodiment of the present application, the parameter configuration information includes the online learning mode of the first AI model, and the online learning mode is a continuous learning mode, and the parameter configuration information further includes at least one of the following : The time interval between two adjacent online learning, and the data volume interval between two adjacent online learning.
可选地,在本申请实施例中,所述第二设备、第一设备以及第三设备包括以下至少一项:核心网设备,接入网设备,以及终端。Optionally, in this embodiment of the present application, the second device, the first device, and the third device include at least one of the following: a core network device, an access network device, and a terminal.
在本申请实施例提供的终端中,终端获取第一AI模型,并基于该第一AI模型的在线学习信息对该第一AI模型进行在线学习。通过该方法,通过在终端侧部署第一AI模型,并对该第一模型配置在 线学习所需的参数,使得可以在终端侧对该第一AI模型进行连续在线调整,从而维持第一AI模型的预测性能,进而保证终端的服务质量。In the terminal provided in the embodiment of the present application, the terminal acquires a first AI model, and performs online learning on the first AI model based on online learning information of the first AI model. With this method, by deploying the first AI model on the terminal side, and configuring the first model on The parameters required for online learning enable continuous online adjustment of the first AI model on the terminal side, thereby maintaining the predictive performance of the first AI model and ensuring the service quality of the terminal.
以第二设备为网络侧设备为例。Take the second device being the network side device as an example.
本申请实施例还提供一种网络侧设备,包括处理器和通信接口,处理器用于为第一设备配置第一AI模型;以及为该第一设备配置上述第一AI模型的在线学习信息。该网络侧设备实施例与上述网络侧设备方法实施例对应,上述方法实施例的各个实施过程和实现方式均可适用于该网络侧设备实施例中,且能达到相同的技术效果。The embodiment of the present application also provides a network side device, including a processor and a communication interface, the processor is configured to configure a first AI model for a first device; and configure online learning information of the first AI model for the first device. The network-side device embodiment corresponds to the above-mentioned network-side device method embodiment, and each implementation process and implementation mode of the above-mentioned method embodiment can be applied to this network-side device embodiment, and can achieve the same technical effect.
具体地,本申请实施例还提供了一种网络侧设备。如图8所示,该网络侧设备800包括:天线81、射频装置82、基带装置83、处理器84和存储器85。天线81与射频装置82连接。在上行方向上,射频装置82通过天线81接收信息,将接收的信息发送给基带装置83进行处理。在下行方向上,基带装置83对要发送的信息进行处理,并发送给射频装置82,射频装置82对收到的信息进行处理后经过天线81发送出去。Specifically, the embodiment of the present application also provides a network side device. As shown in FIG. 8 , the network side device 800 includes: an antenna 81 , a radio frequency device 82 , a baseband device 83 , a processor 84 and a memory 85 . The antenna 81 is connected to a radio frequency device 82 . In the uplink direction, the radio frequency device 82 receives information through the antenna 81, and sends the received information to the baseband device 83 for processing. In the downlink direction, the baseband device 83 processes the information to be sent and sends it to the radio frequency device 82 , and the radio frequency device 82 processes the received information and sends it out through the antenna 81 .
以上实施例中网络侧设备执行的方法可以在基带装置83中实现,该基带装置83包括基带处理器。The method performed by the network side device in the above embodiments may be implemented in the baseband device 83, where the baseband device 83 includes a baseband processor.
基带装置83例如可以包括至少一个基带板,该基带板上设置有多个芯片,如图8所示,其中一个芯片例如为基带处理器,通过总线接口与存储器85连接,以调用存储器85中的程序,执行以上方法实施例中所示的网络设备操作。The baseband device 83 can include at least one baseband board, for example, a plurality of chips are arranged on the baseband board, as shown in FIG. The program executes the network device operations shown in the above method embodiments.
该网络侧设备还可以包括网络接口86,该接口例如为通用公共无线接口(common public radio interface,CPRI)。The network side device may also include a network interface 86, such as a common public radio interface (common public radio interface, CPRI).
具体地,本发明实施例的网络侧设备800还包括:存储在存储器85上并可在处理器84上运行的指令或程序,处理器84调用存储器85中的指令或程序执行图4所示各模块执行的方法,并达到相同的技术效果,为避免重复,故不在此赘述。Specifically, the network side device 800 in this embodiment of the present invention further includes: instructions or programs stored in the memory 85 and operable on the processor 84, and the processor 84 invokes the instructions or programs in the memory 85 to execute the various programs shown in FIG. The method of module execution achieves the same technical effect, so in order to avoid repetition, it is not repeated here.
具体地,本申请实施例还提供了一种网络侧设备。如图9所示,该网络侧设备900包括:处理器901、网络接口902和存储器903。其中,网络接口902例如为通用公共无线接口(common public radio interface,CPRI)。Specifically, the embodiment of the present application also provides a network side device. As shown in FIG. 9 , the network side device 900 includes: a processor 901 , a network interface 902 and a memory 903 . Wherein, the network interface 902 is, for example, a common public radio interface (common public radio interface, CPRI).
具体地,本发明实施例的网络侧设备900还包括:存储在存储器903上并可在处理器901上运行的指令或程序,处理器901调用存储器903中的指令或程序执行图4所示各模块执行的方法,并达到相同的技术效果,为避免重复,故不在此赘述。Specifically, the network-side device 900 in this embodiment of the present invention also includes: instructions or programs stored in the memory 903 and executable on the processor 901, and the processor 901 invokes the instructions or programs in the memory 903 to execute the various programs shown in FIG. The method of module execution achieves the same technical effect, so in order to avoid repetition, it is not repeated here.
本申请实施例还提供一种可读存储介质,所述可读存储介质上存储有程序或指令,该程序或指令被处理器执行时实现上述AI模型的在线学习方法实施例的各个过程,且能达到相同的技术效果,为避免重复,这里不再赘述。The embodiment of the present application also provides a readable storage medium, the readable storage medium stores a program or an instruction, and when the program or instruction is executed by the processor, each process of the above-mentioned online learning method embodiment of the AI model is realized, and The same technical effect can be achieved, so in order to avoid repetition, details will not be repeated here.
其中,所述处理器为上述实施例中所述的终端中的处理器。所述可读存储介质,包括计算机可读存储介质,如计算机只读存储器ROM、随机存取存储器RAM、磁碟或者光盘等。Wherein, the processor is the processor in the terminal described in the foregoing embodiments. The readable storage medium includes a computer-readable storage medium, such as a computer read-only memory ROM, a random access memory RAM, a magnetic disk or an optical disk, and the like.
本申请实施例另提供了一种芯片,所述芯片包括处理器和通信接口,所述通信接口和所述处理器耦合,所述处理器用于运行程序或指令,实现上AI模型的在线学习方法实施例的各个过程,且能达到相同的技术效果,为避免重复,这里不再赘述。The embodiment of the present application further provides a chip, the chip includes a processor and a communication interface, the communication interface is coupled to the processor, and the processor is used to run programs or instructions to realize the online learning method of the AI model The various processes of the embodiment can achieve the same technical effect, so in order to avoid repetition, details are not repeated here.
应理解,本申请实施例提到的芯片还可以称为系统级芯片,系统芯片,芯片系统或片上系统芯片等。It should be understood that the chip mentioned in the embodiment of the present application may also be called a system-on-chip, a system-on-chip, a system-on-a-chip, or a system-on-a-chip.
本申请实施例另提供了一种计算机程序/程序产品,所述计算机程序/程序产品被存储在存储介质中,所述计算机程序/程序产品被至少一个处理器执行以实现上述AI模型的在线学习方法实施例的各个过程,且能达到相同的技术效果,为避免重复,这里不再赘述。The embodiment of the present application further provides a computer program/program product, the computer program/program product is stored in a storage medium, and the computer program/program product is executed by at least one processor to realize the above-mentioned online learning of the AI model Each process of the method embodiment can achieve the same technical effect, and will not be repeated here to avoid repetition.
本申请实施例还提供了一种通信系统,包括:终端及网络侧设备,所述终端可用于执行如上所述的AI模型的在线学习方法的步骤,所述网络侧设备可用于执行如上所述的AI模型的在线学习方法的步骤。The embodiment of the present application also provides a communication system, including: a terminal and a network-side device, the terminal can be used to execute the steps of the online learning method of the AI model as described above, and the network-side device can be used to execute the above-mentioned The steps of the online learning method of the AI model.
需要说明的是,在本文中,术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、物品或者装置不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、物品或者装置所固有的要素。在没有更多限制的情况下,由语句“包括一个……”限定的要素,并不排除在包括该要素的过程、方法、物品或者装置中还存在另外的相同要素。此外,需要指出的是,本申请实施方式中的方法和装置的范围不限按示出或讨论的顺序来执行功能,还可包括根据所涉及的功能按基本同时的方式或按相反的顺序来执行功能,例如,可以按不同于所描述的次序来执行所描述的方法,并且还可以添加、省去、或组合各种步骤。另外,参照某些示例所描述的特征可在其他示例中被组合。It should be noted that, in this document, the term "comprising", "comprising" or any other variation thereof is intended to cover a non-exclusive inclusion such that a process, method, article or apparatus comprising a set of elements includes not only those elements, It also includes other elements not expressly listed, or elements inherent in the process, method, article, or device. Without further limitations, an element defined by the phrase "comprising a ..." does not preclude the presence of additional identical elements in the process, method, article, or apparatus comprising that element. In addition, it should be pointed out that the scope of the methods and devices in the embodiments of the present application is not limited to performing functions in the order shown or discussed, and may also include performing functions in a substantially simultaneous manner or in reverse order according to the functions involved. Functions are performed, for example, the described methods may be performed in an order different from that described, and various steps may also be added, omitted, or combined. Additionally, features described with reference to certain examples may be combined in other examples.
通过以上的实施方式的描述,本领域的技术人员可以清楚地了解到上述实施例方法可借助软件加必需的通用硬件平台的方式来实现,当然也可以通过硬件,但很多情况下前者是更佳的实施方式。基于这 样的理解,本申请的技术方案本质上或者说对现有技术做出贡献的部分可以以计算机软件产品的形式体现出来,该计算机软件产品存储在一个存储介质(如ROM/RAM、磁碟、光盘)中,包括若干指令用以使得一台终端(可以是手机,计算机,服务器,空调器,或者网络设备等)执行本申请各个实施例所述的方法。Through the description of the above embodiments, those skilled in the art can clearly understand that the methods of the above embodiments can be implemented by means of software plus a necessary general-purpose hardware platform, and of course also by hardware, but in many cases the former is better implementation. Based on this In this understanding, the technical solution of the present application is essentially or the part that contributes to the prior art can be embodied in the form of computer software products, and the computer software products are stored in a storage medium (such as ROM/RAM, disk, CD) contains several instructions to enable a terminal (which may be a mobile phone, a computer, a server, an air conditioner, or a network device, etc.) to execute the methods described in various embodiments of the present application.
上面结合附图对本申请的实施例进行了描述,但是本申请并不局限于上述的具体实施方式,上述的具体实施方式仅仅是示意性的,而不是限制性的,本领域的普通技术人员在本申请的启示下,在不脱离本申请宗旨和权利要求所保护的范围情况下,还可做出很多形式,均属于本申请的保护之内。。 The embodiments of the present application have been described above in conjunction with the accompanying drawings, but the present application is not limited to the above-mentioned specific implementations. The above-mentioned specific implementations are only illustrative and not restrictive. Those of ordinary skill in the art will Under the inspiration of this application, without departing from the purpose of this application and the scope of protection of the claims, many forms can also be made, all of which belong to the protection of this application. .

Claims (42)

  1. 一种人工智能AI模型的在线学习方法,所述方法包括:An online learning method of an artificial intelligence AI model, said method comprising:
    第一设备获取第一AI模型;The first device obtains the first AI model;
    所述第一设备基于所述第一AI模型的在线学习信息,对所述第一AI模型进行在线学习。The first device performs online learning on the first AI model based on the online learning information of the first AI model.
  2. 根据权利要求1所述的方法,其中,所述第一设备获取第一AI模型,包括:The method according to claim 1, wherein the acquiring the first AI model by the first device comprises:
    第一设备接收第二设备配置的第一AI模型。The first device receives the first AI model configured by the second device.
  3. 根据权利要求1或2所述的方法,其中,所述第一设备基于所述第一AI模型的在线学习信息,对所述第一AI模型进行在线学习之前,所述方法还包括:The method according to claim 1 or 2, wherein, before the first device performs online learning on the first AI model based on the online learning information of the first AI model, the method further includes:
    所述第一设备从所述第二设备获取所述第一AI模型的在线学习信息。The first device acquires online learning information of the first AI model from the second device.
  4. 根据权利要求1所述的方法,其中,所述方法还包括:The method according to claim 1, wherein the method further comprises:
    所述第一设备为第三设备配置所述第一AI模型的在线学习信息。The first device configures the online learning information of the first AI model for the third device.
  5. 根据权利要求1所述的方法,其中,所述在线学习信息包括以下至少之一:The method according to claim 1, wherein the online learning information includes at least one of the following:
    在线学习的触发方式;How eLearning is triggered;
    在线学习的中止条件;Conditions for discontinuation of online learning;
    在线学习的参数配置信息;Parameter configuration information for online learning;
    在线学习的数据集。Datasets for online learning.
  6. 根据权利要求5所述的方法,其中,The method according to claim 5, wherein,
    所述触发方式对应的触发条件包括以下至少之一:The trigger condition corresponding to the trigger mode includes at least one of the following:
    所述第一设备的状态信息满足第一预设条件;The state information of the first device satisfies a first preset condition;
    所述第一设备采集到的数据量大于第一阈值;The amount of data collected by the first device is greater than a first threshold;
    所述第一设备的测量信息满足第二预设条件;The measurement information of the first device satisfies a second preset condition;
    所述第一AI模型的输出结果的误差信息大于第二阈值;The error information of the output result of the first AI model is greater than a second threshold;
    所述第一AI模型对应的第一信息的统计信息满足第三预设条件;The statistical information of the first information corresponding to the first AI model satisfies a third preset condition;
    所述第一设备的测量信息的统计信息满足第四预设条件;Statistical information of the measurement information of the first device satisfies a fourth preset condition;
    其中,所述状态信息包括以下至少一项:移动速度、波束切换信息、小区切换信息;Wherein, the state information includes at least one of the following: moving speed, beam switching information, cell switching information;
    所述第一信息包括以下至少之一:所述第一AI模型的输入信息,所述第一AI模型的输出信息。The first information includes at least one of the following: input information of the first AI model, and output information of the first AI model.
  7. 根据权利要求6所述的方法,其中,The method of claim 6, wherein,
    所述第一信息的统计信息包括以下至少之一:第一时间窗内所述第一信息的第一统计量,至少两个连续的第二时间窗内所述第一信息对应的第二统计量,第一小区下的至少两个终端在第一时刻的第一信息的统计信息,所述第一信息的相关性信息;The statistical information of the first information includes at least one of the following: a first statistic of the first information in a first time window, a second statistic corresponding to the first information in at least two consecutive second time windows Quantity, statistical information of the first information of at least two terminals under the first cell at the first moment, and correlation information of the first information;
    所述第二统计量是基于各个所述第二时间窗内的统计量计算出的。The second statistics are calculated based on statistics in each of the second time windows.
  8. 根据权利要求7所述的方法,其中,所述第三预设条件包括以下至少之一:The method according to claim 7, wherein the third preset condition includes at least one of the following:
    所述第一统计量大于第一阈值区间的最大值;The first statistic is greater than the maximum value of the first threshold interval;
    所述第二统计量大于第二阈值区间的最大值;The second statistic is greater than the maximum value of the second threshold interval;
    在至少两个时间窗内采集的所述第一信息的相关性信息满足第一条件;The correlation information of the first information collected in at least two time windows satisfies a first condition;
    在当前时间窗内采集的不同所述第一信息之间的相关性信息满足第二条件。Correlation information between different first pieces of information collected within the current time window satisfies the second condition.
  9. 根据权利要求6所述的方法,其中,The method of claim 6, wherein,
    所述第一设备的测量信息的统计信息包括以下至少之一:在第三时间窗内针对所述测量信息的第三统计量,所述测量信息在至少两个连续的第四时间窗对应的第四统计量,所述测量信息的相关性信息;The statistical information of the measurement information of the first device includes at least one of the following: a third statistic for the measurement information within a third time window, where the measurement information corresponds to at least two consecutive fourth time windows The fourth statistic, the correlation information of the measurement information;
    所述第四统计量是基于各个所述第四时间窗内的统计量计算出的;The fourth statistic is calculated based on the statistic in each of the fourth time windows;
    所述相关性信息包括以下至少之一:数据之间的距离,协方差,相关系数。The correlation information includes at least one of the following: distance between data, covariance, and correlation coefficient.
  10. 根据权利要求9所述的方法,其中,所述满足第四预设条件包括以下至少之一:The method according to claim 9, wherein said meeting the fourth preset condition includes at least one of the following:
    所述第三统计量大于第三阈值区间的最大值;The third statistic is greater than the maximum value of the third threshold interval;
    所述第四统计量大于第四阈值区间的最大值;The fourth statistic is greater than the maximum value of the fourth threshold interval;
    在至少两个时间窗内采集的所述测量信息的相关性信息满足第三条件;The correlation information of the measurement information collected in at least two time windows satisfies the third condition;
    在当前时间窗内采集的不同所述测量信息之间的相关性信息满足第四条件;Correlation information among different measurement information collected in the current time window satisfies the fourth condition;
    所述测量信息的分布与基准分布的差异大于第五阈值,所述基准分布为所述第二设备为所述第一设备配置的信息。 A difference between the distribution of the measurement information and a reference distribution is greater than a fifth threshold, where the reference distribution is information configured by the second device for the first device.
  11. 根据权利要求6所述的方法,其中,所述触发条件包括:所述第一设备的状态信息满足第一预设条件;The method according to claim 6, wherein the trigger condition comprises: the status information of the first device meets a first preset condition;
    所述满足第一预设条件包括以下至少之一:Said meeting the first preset condition includes at least one of the following:
    所述第一设备的移动速度大于第三阈值;The moving speed of the first device is greater than a third threshold;
    所述波束切换信息指示所述第一设备发生波束切换、且波束切换频率大于第四阈值;The beam switching information indicates that beam switching occurs for the first device, and the beam switching frequency is greater than a fourth threshold;
    所述小区切换信息指示所述第一设备发生小区切换。The cell switching information indicates that a cell switching occurs to the first device.
  12. 根据权利要求6所述的方法,其中,所述触发条件包括:所述第一设备的测量信息满足第二预设条件;The method according to claim 6, wherein the trigger condition comprises: the measurement information of the first device meets a second preset condition;
    所述满足第二预设条件包括:所述测量信息指示所述第一设备的相关信道的信道环境发生变化;The meeting the second preset condition includes: the measurement information indicates that the channel environment of the relevant channel of the first device changes;
    其中,所述测量信息包括以下至少之一:所述第一设备接收的参考信号的第一测量信息,所述第一设备的传感器采集的第二测量信息;Wherein, the measurement information includes at least one of the following: first measurement information of a reference signal received by the first device, and second measurement information collected by a sensor of the first device;
    所述第一测量信息包括以下至少之一:所述参考信号的瞬时测量信息,所述参考信号的统计测量信息;The first measurement information includes at least one of the following: instantaneous measurement information of the reference signal, statistical measurement information of the reference signal;
    所述参考信号包括以下至少之一:同步信号块SSB、CSI参考信号CSI-RS、探测参考信号SRS、定位参考信号PRS。The reference signal includes at least one of the following: a synchronization signal block SSB, a CSI reference signal CSI-RS, a sounding reference signal SRS, and a positioning reference signal PRS.
  13. 根据权利要求5所述的方法,其中,The method according to claim 5, wherein,
    所述触发方式的触发条件包括以下至少之一:The trigger condition of the trigger mode includes at least one of the following:
    所述第二设备指示所述第一设备进行在线学习;The second device instructs the first device to perform online learning;
    所述第一AI模型的输出精度小于或者等于第六阈值。The output accuracy of the first AI model is less than or equal to the sixth threshold.
  14. 根据权利要求13所述的方法,其中,The method of claim 13, wherein,
    所述第二设备指示所述第一设备进行在线学习,包括以下至少之一:The second device instructs the first device to perform online learning, including at least one of the following:
    所述第二设备指示所述第一设备周期性进行在线学习;The second device instructs the first device to periodically perform online learning;
    所述第二设备指示所述第一设备半周期性进行在线学习;The second device instructs the first device to perform online learning semi-periodically;
    所述第二设备指示所述第一设备非周期性进行在线学习;The second device instructs the first device to perform online learning aperiodically;
    其中,所述第一设备周期性或半周期性进行在线学习时采用的周期为:所述第二设备预配置的周期,或者,所述第一设备自主配置的周期。Wherein, the period adopted by the first device to perform online learning periodically or semi-periodically is: a period preconfigured by the second device, or a period independently configured by the first device.
  15. 根据权利要求5所述的方法,其中,所述中止条件包括以下至少之一:The method according to claim 5, wherein the termination condition includes at least one of the following:
    所述第一AI模型的在线学习次数大于预设迭代次数;The number of online learning of the first AI model is greater than the preset number of iterations;
    所述第一AI模型达到预设精度;The first AI model reaches a preset accuracy;
    所述第一AI模型的输出结果的误差信息小于第七阈值;The error information of the output result of the first AI model is less than the seventh threshold;
    所述第二设备突发性指示所述第一设备结束当前在线学习过程;The second device abruptly instructs the first device to end the current online learning process;
    与所述第一AI模型关联的目标任务中止;a target task associated with the first AI model is aborted;
    所述第一设备的测量信息分布与基准分布的差异信息小于第八阈值。The difference information between the measurement information distribution of the first device and the reference distribution is smaller than an eighth threshold.
  16. 根据权利要求5所述的方法,其中,所述参数配置信息包括以下至少之一:The method according to claim 5, wherein the parameter configuration information includes at least one of the following:
    所述第一AI模型的在线学习模式;The online learning mode of the first AI model;
    所述第一AI模型的样本批量的大小;the size of the sample batch of the first AI model;
    所述第一AI模型的优化器的状态;the state of the optimizer of the first AI model;
    所述第一AI模型的第一数据集的划分方式;A division method of the first data set of the first AI model;
    所述第一AI模型的第一数据集的构成信息;Composition information of the first data set of the first AI model;
    所述第一AI模型的第一数据集对所述第一AI模型更新的贡献权重;The contribution weight of the first data set of the first AI model to the update of the first AI model;
    与所述在线学习信息关联的AI模型标识;AI model identification associated with the online learning information;
    所述第一AI模型的基准分布;a baseline distribution of the first AI model;
    其中,所述第一数据集以下至少之一:所述第一AI模型所使用的原始数据集,所述第一AI模型新采集的数据集;Wherein, the first data set is at least one of the following: the original data set used by the first AI model, the data set newly collected by the first AI model;
    所述基准分布的参数信息包括以下至少之一:方差,均值,标准差。The parameter information of the reference distribution includes at least one of the following: variance, mean, and standard deviation.
  17. 根据权利要求16所述的方法,其中,所述参数配置信息包括所述第一AI模型的在线学习模式、且所述在线学习模式为瞬时训练模式,所述参数配置信息还包括以下至少之一:所述第一设备所采集的数据量,所述数据量的采集时间长度。The method according to claim 16, wherein the parameter configuration information includes the online learning mode of the first AI model, and the online learning mode is an instantaneous training mode, and the parameter configuration information further includes at least one of the following : the amount of data collected by the first device, and the length of time for collecting the data.
  18. 根据权利要求16所述的方法,其中,所述参数配置信息包括所述第一AI模型的在线学习模式、且所述在线学习模式为连续学习模式,所述参数配置信息还包括以下至少之一:相邻两 次在线学习的时间间隔,相邻两次在线学习的数据量间隔。The method according to claim 16, wherein the parameter configuration information includes the online learning mode of the first AI model, and the online learning mode is a continuous learning mode, and the parameter configuration information further includes at least one of the following : adjacent two The time interval between online learning and the data volume interval between two adjacent online learning.
  19. 根据权利要求1所述的方法,其中,所述第二设备、第一设备以及第三设备包括以下至少一项:核心网设备,接入网设备,以及终端。The method according to claim 1, wherein the second device, the first device and the third device comprise at least one of the following: a core network device, an access network device, and a terminal.
  20. 一种人工智能AI模型的在线学习方法,所述方法包括:An online learning method of an artificial intelligence AI model, said method comprising:
    第二设备为第一设备配置第一AI模型;The second device configures the first AI model for the first device;
    所述第二设备为所述第一设备配置所述第一AI模型的在线学习信息。The second device configures online learning information of the first AI model for the first device.
  21. 根据权利要求20所述的方法,其中,所述在线学习信息包括以下至少之一:The method according to claim 20, wherein the online learning information includes at least one of the following:
    在线学习的触发方式;How eLearning is triggered;
    在线学习的中止条件;Conditions for discontinuation of online learning;
    在线学习的参数配置信息;Parameter configuration information for online learning;
    在线学习的数据集。Datasets for online learning.
  22. 根据权利要求21所述的方法,其中,The method of claim 21, wherein,
    所述触发方式对应的触发条件包括以下至少之一:The trigger condition corresponding to the trigger mode includes at least one of the following:
    所述第一设备的状态信息满足第一预设条件;The state information of the first device satisfies a first preset condition;
    所述第一设备采集到的数据量大于第一阈值;The amount of data collected by the first device is greater than a first threshold;
    所述第一设备的测量信息满足第二预设条件;The measurement information of the first device satisfies a second preset condition;
    所述第一AI模型的输出结果的误差信息大于第二阈值;The error information of the output result of the first AI model is greater than a second threshold;
    所述第一AI模型对应的第一信息的统计信息满足第三预设条件;The statistical information of the first information corresponding to the first AI model satisfies a third preset condition;
    所述第一设备的测量信息的统计信息满足第四预设条件;Statistical information of the measurement information of the first device satisfies a fourth preset condition;
    其中,所述状态信息包括以下至少一项:移动速度、波束切换信息、小区切换信息;Wherein, the state information includes at least one of the following: moving speed, beam switching information, cell switching information;
    所述第一信息包括以下至少之一:所述第一AI模型的输入信息,所述第一AI模型的输出信息。The first information includes at least one of the following: input information of the first AI model, and output information of the first AI model.
  23. 根据权利要求22所述的方法,其中,The method of claim 22, wherein,
    所述第一信息的统计信息包括以下至少之一:第一时间窗内所述第一信息的第一统计量,至少两个连续的第二时间窗内所述第一信息对应的第二统计量,第一小区下的至少两个终端在第一时刻的第一信息的统计信息,所述第一信息的相关性信息;The statistical information of the first information includes at least one of the following: a first statistic of the first information in a first time window, a second statistic corresponding to the first information in at least two consecutive second time windows Quantity, statistical information of the first information of at least two terminals under the first cell at the first moment, and correlation information of the first information;
    所述第二统计量是基于各个所述第二时间窗内的统计量计算出的。The second statistics are calculated based on statistics in each of the second time windows.
  24. 根据权利要求23所述的方法,其中,所述第三预设条件包括以下至少之一:The method according to claim 23, wherein the third preset condition includes at least one of the following:
    所述第一统计量大于第一阈值区间的最大值;The first statistic is greater than the maximum value of the first threshold interval;
    所述第二统计量大于第二阈值区间的最大值;The second statistic is greater than the maximum value of the second threshold interval;
    在至少两个时间窗内采集的所述第一信息的相关性信息满足第一条件;The correlation information of the first information collected in at least two time windows satisfies a first condition;
    在当前时间窗内采集的不同所述第一信息之间的相关性信息满足第二条件。Correlation information between different first pieces of information collected within the current time window satisfies the second condition.
  25. 根据权利要求22所述的方法,其中,The method of claim 22, wherein,
    所述第一设备的测量信息的统计信息包括以下至少之一:在第三时间窗内针对所述测量信息的第三统计量,所述测量信息在至少两个连续的第四时间窗对应的第四统计量,所述测量信息的相关性信息;The statistical information of the measurement information of the first device includes at least one of the following: a third statistic for the measurement information within a third time window, where the measurement information corresponds to at least two consecutive fourth time windows The fourth statistic, the correlation information of the measurement information;
    所述第四统计量是基于各个所述第四时间窗内的统计量计算出的;The fourth statistic is calculated based on the statistic in each of the fourth time windows;
    所述相关性信息包括以下至少之一:数据之间的距离,协方差,相关系数。The correlation information includes at least one of the following: distance between data, covariance, and correlation coefficient.
  26. 根据权利要求25所述的方法,其中,所述满足第四预设条件包括以下至少之一:The method according to claim 25, wherein said meeting the fourth preset condition includes at least one of the following:
    所述第三统计量大于第三阈值区间的最大值;The third statistic is greater than the maximum value of the third threshold interval;
    所述第四统计量大于第四阈值区间的最大值;The fourth statistic is greater than the maximum value of the fourth threshold interval;
    在至少两个时间窗内采集的所述测量信息的相关性信息满足第三条件;The correlation information of the measurement information collected in at least two time windows satisfies the third condition;
    在当前时间窗内采集的不同所述测量信息之间的相关性信息满足第四条件;Correlation information among different measurement information collected in the current time window satisfies the fourth condition;
    所述测量信息的分布与基准分布的差异大于第五阈值,所述基准分布为所述第二设备为所述第一设备配置的信息。A difference between the distribution of the measurement information and a reference distribution is greater than a fifth threshold, where the reference distribution is information configured by the second device for the first device.
  27. 根据权利要求22所述的方法,其中,所述触发条件包括:所述第一设备的状态信息满足第一预设条件;The method according to claim 22, wherein the trigger condition comprises: the status information of the first device meets a first preset condition;
    所述满足第一预设条件包括以下至少之一:Said meeting the first preset condition includes at least one of the following:
    所述第一设备的移动速度大于第三阈值;The moving speed of the first device is greater than a third threshold;
    所述波束切换信息指示所述第一设备发生波束切换、且波束切换频率大于第四阈值;The beam switching information indicates that beam switching occurs for the first device, and the beam switching frequency is greater than a fourth threshold;
    所述小区切换信息指示所述第一设备发生小区切换。 The cell switching information indicates that a cell switching occurs to the first device.
  28. 根据权利要求22所述的方法,其中,所述触发条件包括:所述第一设备的测量信息满足第二预设条件;The method according to claim 22, wherein the trigger condition comprises: the measurement information of the first device meets a second preset condition;
    所述满足第二预设条件包括:所述测量信息指示所述第一设备的相关信道的信道环境发生变化;The meeting the second preset condition includes: the measurement information indicates that the channel environment of the relevant channel of the first device changes;
    其中,所述测量信息包括以下至少之一:所述第一设备接收的参考信号的第一测量信息,所述第一设备的传感器采集的第二测量信息;Wherein, the measurement information includes at least one of the following: first measurement information of a reference signal received by the first device, and second measurement information collected by a sensor of the first device;
    所述第一测量信息包括以下至少之一:所述参考信号的瞬时测量信息,所述参考信号的统计测量信息;The first measurement information includes at least one of the following: instantaneous measurement information of the reference signal, statistical measurement information of the reference signal;
    所述参考信号包括以下至少之一:同步信号块SSB、CSI参考信号CSI-RS、探测参考信号SRS、定位参考信号PRS。The reference signal includes at least one of the following: a synchronization signal block SSB, a CSI reference signal CSI-RS, a sounding reference signal SRS, and a positioning reference signal PRS.
  29. 根据权利要求21所述的方法,其中,The method of claim 21, wherein,
    所述触发方式的触发条件包括以下至少之一:The trigger condition of the trigger mode includes at least one of the following:
    所述第二设备指示所述第一设备进行在线学习;The second device instructs the first device to perform online learning;
    所述第一AI模型的输出精度小于或者等于第六阈值。The output accuracy of the first AI model is less than or equal to the sixth threshold.
  30. 根据权利要求29所述的方法,其中,The method of claim 29, wherein,
    所述第二设备指示所述第一设备进行在线学习,包括以下至少之一:The second device instructs the first device to perform online learning, including at least one of the following:
    所述第二设备指示所述第一设备周期性进行在线学习;The second device instructs the first device to periodically perform online learning;
    所述第二设备指示所述第一设备半周期性进行在线学习;The second device instructs the first device to perform online learning semi-periodically;
    所述第二设备指示所述第一设备非周期性进行在线学习;The second device instructs the first device to perform online learning aperiodically;
    其中,所述第一设备周期性或半周期性进行在线学习时采用的周期为:所述第二设备预配置的周期,或者,所述第一设备自主配置的周期。Wherein, the period adopted by the first device to perform online learning periodically or semi-periodically is: a period preconfigured by the second device, or a period independently configured by the first device.
  31. 根据权利要求21所述的方法,其中,所述中止条件包括以下至少之一:The method according to claim 21, wherein the termination condition comprises at least one of the following:
    所述第一AI模型的在线学习次数大于预设迭代次数;The number of online learning of the first AI model is greater than the preset number of iterations;
    所述第一AI模型达到预设精度;The first AI model reaches a preset accuracy;
    所述第一AI模型的输出结果的误差信息小于第七阈值;The error information of the output result of the first AI model is less than the seventh threshold;
    所述第二设备突发性指示所述第一设备结束当前在线学习过程;The second device abruptly instructs the first device to end the current online learning process;
    与所述第一AI模型关联的目标任务中止;a target task associated with the first AI model is aborted;
    所述第一设备的测量信息分布与基准分布的差异信息小于第八阈值。The difference information between the measurement information distribution of the first device and the reference distribution is smaller than an eighth threshold.
  32. 根据权利要求21所述的方法,其中,所述参数配置信息包括以下至少之一:The method according to claim 21, wherein the parameter configuration information includes at least one of the following:
    所述第一AI模型的在线学习模式;The online learning mode of the first AI model;
    所述第一AI模型的样本批量的大小;the size of the sample batch of the first AI model;
    所述第一AI模型的优化器的状态;the state of the optimizer of the first AI model;
    所述第一AI模型的第一数据集的划分方式;A division method of the first data set of the first AI model;
    所述第一AI模型的第一数据集的构成信息;Composition information of the first data set of the first AI model;
    所述第一AI模型的第一数据集对所述AI模型更新的贡献权重;The contribution weight of the first data set of the first AI model to the update of the AI model;
    与所述在线学习信息关联的AI模型标识;AI model identification associated with the online learning information;
    所述第一AI模型的基准分布;a baseline distribution of the first AI model;
    其中,所述第一数据集以下至少之一:所述第一AI模型所使用的原始数据集,所述第一AI模型新采集的数据集;Wherein, the first data set is at least one of the following: the original data set used by the first AI model, the data set newly collected by the first AI model;
    所述基准分布的参数信息包括以下至少之一:方差,均值,标准差。The parameter information of the reference distribution includes at least one of the following: variance, mean, and standard deviation.
  33. 根据权利要求32所述的方法,其中,所述参数配置信息包括所述第一AI模型的在线学习模式、且所述在线学习模式为瞬时训练模式,所述参数配置信息还包括以下至少之一:所述第一设备所采集的数据量,所述数据量的采集时间长度。The method according to claim 32, wherein the parameter configuration information includes the online learning mode of the first AI model, and the online learning mode is an instantaneous training mode, and the parameter configuration information further includes at least one of the following : the amount of data collected by the first device, and the length of time for collecting the data.
  34. 根据权利要求33所述的方法,其中,所述参数配置信息包括所述第一AI模型的在线学习模式、且所述在线学习模式为连续学习模式,所述参数配置信息还包括以下至少之一:相邻两次在线学习的时间间隔,相邻两次在线学习的数据量间隔。The method according to claim 33, wherein the parameter configuration information includes the online learning mode of the first AI model, and the online learning mode is a continuous learning mode, and the parameter configuration information further includes at least one of the following : The time interval between two adjacent online learning, and the data volume interval between two adjacent online learning.
  35. 根据权利要求20所述的方法,其中,所述第二设备包括以下至少一项:核心网设备,接入网设备,终端;所述第一设备包括以下至少一项:核心网设备,接入网设备,终端。The method according to claim 20, wherein the second device includes at least one of the following: core network equipment, access network equipment, and terminal; the first device includes at least one of the following: core network equipment, access Network equipment, terminals.
  36. 一种AI模型的在线学习装置,所述装置包括:配置模块,其中:An online learning device of an AI model, the device comprising: a configuration module, wherein:
    所述配置模块,用于第二设备为第一设备配置第一AI模型;The configuration module is used for the second device to configure the first AI model for the first device;
    所述配置模块,还用于所述第二设备为所述第一设备配置所述第一AI模型的在线学习信息。 The configuration module is further used for the second device to configure online learning information of the first AI model for the first device.
  37. 一种人工智能AI模型的在线学习装置,所述装置包括:获取模块和执行模块,其中:An online learning device of an artificial intelligence AI model, said device comprising: an acquisition module and an execution module, wherein:
    所述获取模块,用于所述第一设备获取第一AI模型;The obtaining module is used for the first device to obtain a first AI model;
    所述执行模块,用于所述第一设备基于所述第一AI模型的在线学习信息,对所述第一AI模型进行在线学习。The execution module is used for the first device to perform online learning on the first AI model based on the online learning information of the first AI model.
  38. 一种通信设备,包括处理器和存储器,所述存储器存储可在所述处理器上运行的程序或指令,所述程序或指令被所述处理器执行时实现如权利要求1至19任一项所述的AI模型的在线学习方法的步骤。A communication device, comprising a processor and a memory, the memory stores programs or instructions that can run on the processor, and when the programs or instructions are executed by the processor, any one of claims 1 to 19 is implemented The steps of the online learning method of the AI model.
  39. 一种通信设备,包括处理器和存储器,所述存储器存储可在所述处理器上运行的程序或指令,所述程序或指令被所述处理器执行时实现如权利要求20至35任一项所述的AI模型的在线学习方法的步骤。A communication device, comprising a processor and a memory, the memory stores programs or instructions that can run on the processor, and when the programs or instructions are executed by the processor, any one of claims 20 to 35 is implemented The steps of the online learning method of the AI model.
  40. 一种可读存储介质,所述可读存储介质上存储程序或指令,所述程序或指令被处理器执行时实现如权利要求1-19任一项所述的AI模型的在线学习方法,或者实现如权利要求20至35任一项所述的AI模型的在线学习方法的步骤。A readable storage medium, on which a program or instruction is stored, and when the program or instruction is executed by a processor, the online learning method of the AI model according to any one of claims 1-19 is implemented, or The steps of realizing the online learning method of the AI model as described in any one of claims 20 to 35.
  41. 一种计算机程序产品,所述程序产品被至少一个处理器执行以实现如权利要求1至19任一项所述的AI模型的在线学习方法,或者实现如权利要求20至35任一项所述的AI模型的在线学习方法的步骤。A computer program product, the program product is executed by at least one processor to realize the online learning method of the AI model according to any one of claims 1 to 19, or to realize the method according to any one of claims 20 to 35 The steps of the online learning method of the AI model.
  42. 一种芯片,所述芯片包括处理器和通信接口,所述通信接口和所述处理器耦合,所述处理器用于运行程序或指令,实现如权利要求1至19中任一项所述的AI模型的在线学习方法,或者实现如权利要求20至35任一项所述的AI模型的在线学习方法的步骤。 A chip, the chip includes a processor and a communication interface, the communication interface is coupled to the processor, the processor is used to run programs or instructions, and realize the AI described in any one of claims 1 to 19 The online learning method of the AI model, or the steps of realizing the online learning method of the AI model according to any one of claims 20 to 35.
PCT/CN2023/076492 2022-02-21 2023-02-16 Online learning method and apparatus for ai model, and communication device and readable storage medium WO2023155839A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202210157466.7 2022-02-21
CN202210157466.7A CN116668309A (en) 2022-02-21 2022-02-21 Online learning method and device of AI model, communication equipment and readable storage medium

Publications (1)

Publication Number Publication Date
WO2023155839A1 true WO2023155839A1 (en) 2023-08-24

Family

ID=87577561

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2023/076492 WO2023155839A1 (en) 2022-02-21 2023-02-16 Online learning method and apparatus for ai model, and communication device and readable storage medium

Country Status (2)

Country Link
CN (1) CN116668309A (en)
WO (1) WO2023155839A1 (en)

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109861942A (en) * 2019-01-22 2019-06-07 东南大学 A kind of on-line study method of artificial intelligence auxiliary OFDM receiver
CN111626436A (en) * 2020-05-28 2020-09-04 支付宝(杭州)信息技术有限公司 Method and device for online learning
US20200372402A1 (en) * 2019-05-24 2020-11-26 Bank Of America Corporation Population diversity based learning in adversarial and rapid changing environments
US20200387836A1 (en) * 2019-06-04 2020-12-10 Accenture Global Solutions Limited Machine learning model surety
CN113191502A (en) * 2021-04-21 2021-07-30 烽火通信科技股份有限公司 Artificial intelligence model on-line training method and system
CN113255935A (en) * 2020-02-13 2021-08-13 京东城市(北京)数字科技有限公司 Model updating method and device based on federal learning and storage medium
CN113469206A (en) * 2020-03-31 2021-10-01 华为技术有限公司 Method, device, equipment and storage medium for obtaining artificial intelligence model

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109861942A (en) * 2019-01-22 2019-06-07 东南大学 A kind of on-line study method of artificial intelligence auxiliary OFDM receiver
US20200372402A1 (en) * 2019-05-24 2020-11-26 Bank Of America Corporation Population diversity based learning in adversarial and rapid changing environments
US20200387836A1 (en) * 2019-06-04 2020-12-10 Accenture Global Solutions Limited Machine learning model surety
CN113255935A (en) * 2020-02-13 2021-08-13 京东城市(北京)数字科技有限公司 Model updating method and device based on federal learning and storage medium
CN113469206A (en) * 2020-03-31 2021-10-01 华为技术有限公司 Method, device, equipment and storage medium for obtaining artificial intelligence model
CN111626436A (en) * 2020-05-28 2020-09-04 支付宝(杭州)信息技术有限公司 Method and device for online learning
CN113191502A (en) * 2021-04-21 2021-07-30 烽火通信科技股份有限公司 Artificial intelligence model on-line training method and system

Also Published As

Publication number Publication date
CN116668309A (en) 2023-08-29

Similar Documents

Publication Publication Date Title
WO2023143572A1 (en) Positioning method based on artificial intelligence (ai) model, and communication device
WO2023134650A1 (en) Information interaction method and apparatus, and communication device
WO2023155839A1 (en) Online learning method and apparatus for ai model, and communication device and readable storage medium
WO2023066288A1 (en) Model request method, model request processing method and related device
WO2024067281A1 (en) Ai model processing method and apparatus, and communication device
WO2023103911A1 (en) Measurement method and apparatus, device, and storage medium
WO2023040888A1 (en) Data transmission method and apparatus
WO2023179540A1 (en) Channel prediction method and apparatus, and wireless communication device
WO2024008111A1 (en) Data acquisition method and device
WO2023098662A1 (en) Positioning method and communication device
WO2024032695A1 (en) Csi prediction processing method and apparatus, communication device, and readable storage medium
WO2024067280A1 (en) Method and apparatus for updating ai model parameter, and communication device
WO2024083004A1 (en) Ai model configuration method, terminal, and network side device
WO2023098661A1 (en) Positioning method and communication device
WO2023125951A1 (en) Communication model configuration method and apparatus, and communication device
WO2024067665A1 (en) Csi prediction processing method and apparatus, and communication device and readable storage medium
WO2024027576A1 (en) Performance supervision method and apparatus for ai network model, and communication device
WO2024120358A1 (en) Information transmission method, information transmission apparatus, and communication device
WO2024032694A1 (en) Csi prediction processing method and apparatus, communication device, and readable storage medium
WO2023125855A1 (en) Model updating method and communication device
WO2024120447A1 (en) Model supervision trigger method and apparatus, and ue, network-side device, readable storage medium and communication system
WO2024017176A1 (en) Model training method and apparatus, network side device, and terminal device
WO2024067434A1 (en) Rs configuration method and apparatus, terminal, and network side device
WO2023213270A1 (en) Model training processing methods, apparatus, terminal and network side device
WO2024125525A1 (en) Ai computing power reporting method, and terminal and network-side device

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 23755840

Country of ref document: EP

Kind code of ref document: A1