WO2023092310A1 - Information processing method, model generation method, and devices - Google Patents

Information processing method, model generation method, and devices Download PDF

Info

Publication number
WO2023092310A1
WO2023092310A1 PCT/CN2021/132593 CN2021132593W WO2023092310A1 WO 2023092310 A1 WO2023092310 A1 WO 2023092310A1 CN 2021132593 W CN2021132593 W CN 2021132593W WO 2023092310 A1 WO2023092310 A1 WO 2023092310A1
Authority
WO
WIPO (PCT)
Prior art keywords
model
preset
information
sub
compressed
Prior art date
Application number
PCT/CN2021/132593
Other languages
French (fr)
Chinese (zh)
Inventor
田文强
Original Assignee
Oppo广东移动通信有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Oppo广东移动通信有限公司 filed Critical Oppo广东移动通信有限公司
Priority to CN202180100859.8A priority Critical patent/CN117678257A/en
Priority to PCT/CN2021/132593 priority patent/WO2023092310A1/en
Publication of WO2023092310A1 publication Critical patent/WO2023092310A1/en

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W24/00Supervisory, monitoring or testing arrangements

Definitions

  • the present application relates to the communication field, and more specifically, relates to an information processing method, a model generation method, a terminal device, a network device, an electronic device, a chip, a computer-readable storage medium, a computer program product, and a computer program.
  • wireless communication systems mainly rely on basic models and pre-configured feedback parameter sets for channel information determination and feedback.
  • the error between the feedback channel information and the real channel information is relatively large. Therefore, a wireless communication solution based on artificial intelligence (AI, Artificial Intelligence) is proposed in some studies to make up for the above-mentioned deficiency.
  • AI Artificial Intelligence
  • how to ensure the overall performance of the network becomes a problem that needs to be solved.
  • Embodiments of the present application provide an information processing method, a model generation method, a terminal device, a network device, an electronic device, a chip, a computer-readable storage medium, a computer program product, and a computer program, which can at least solve the above problems.
  • An embodiment of the present application provides an information processing method, including:
  • the terminal device receives the first information
  • the terminal device sends second information obtained based on the first information
  • the second information is obtained by processing the first information through the first model, and the second information is used for processing through the second model to obtain channel information; the first model and the second model are a joint obtained by training.
  • An embodiment of the present application provides an information processing method, including:
  • the network device sends the first information
  • the network device receives second information; wherein, the second information is obtained by processing the first information through a first model;
  • the network device processes the second information based on a second model to obtain channel information; wherein, the first model and the second model are obtained through joint training.
  • the embodiment of the present application provides a method for generating a model, including:
  • the first model is the first preset model after training, and the second model is the second preset model after training; the first model is used to process the first information to obtain Second information; the second model is used to process the second information to obtain channel information.
  • An embodiment of the present application provides a terminal device, including:
  • a first communication unit configured to receive first information; send second information obtained based on the first information;
  • the second information is obtained by processing the first information through the first model, and the second information is used for processing through the second model to obtain channel information; the first model and the second model are a joint obtained by training.
  • An embodiment of the present application provides a network device, including:
  • the second communication unit is configured to send the first information; receive the second information; wherein, the second information is obtained by processing the first information through the first model;
  • the second processing unit is configured to process the second information based on a second model to obtain channel information; wherein, the first model and the second model are obtained through joint training.
  • An embodiment of the present application provides an electronic device, including:
  • the third processing unit is configured to use training samples to jointly train the first preset model and the second preset model to obtain the trained first model and the second model;
  • the first model is the first preset model after training, and the second model is the second preset model after training; the first model is used to process the first information to obtain Second information; the second model is used to process the second information to obtain channel information.
  • An embodiment of the present application provides a terminal device, including a processor and a memory.
  • the memory is used to store computer programs
  • the processor is used to call and run the computer programs stored in the memory, so that the terminal device executes the above information processing method.
  • An embodiment of the present application provides a network device, including a processor and a memory.
  • the memory is used to store a computer program
  • the processor is used to call and run the computer program stored in the memory, so that the network device executes the above-mentioned information processing method.
  • An embodiment of the present application provides an electronic device, including a processor and a memory.
  • the memory is used to store a computer program
  • the processor is used to call and run the computer program stored in the memory, so that the network device executes the above-mentioned model generation method.
  • An embodiment of the present application provides a chip configured to implement the above information processing method or model generation method.
  • the chip includes: a processor, configured to call and run a computer program from the memory, so that the device installed with the chip executes the above-mentioned information processing method or model generation method.
  • An embodiment of the present application provides a computer-readable storage medium for storing a computer program, and when the computer program is run by a device, the device is made to execute the above-mentioned information processing method or model generation method.
  • An embodiment of the present application provides a computer program product, including computer program instructions, which enable a computer to execute the above-mentioned information processing method or model generation method.
  • An embodiment of the present application provides a computer program that, when run on a computer, causes the computer to execute the above-mentioned information processing method or model generation method.
  • the terminal device when the terminal device receives the first information, it can process the first information through the first model to obtain the second information and send it, so that the receiving end can use the second model to process the second information
  • the channel information obtained through processing is obtained through joint training of the first model and the second model. Since the processing, transmission, and analysis of the second information are realized by using the first model and the second model obtained through joint training, the performance requirements of the entire information processing, transmission, and analysis can be taken into account, ensuring the overall performance of the network.
  • Fig. 1 is a schematic diagram of an application scenario according to an embodiment of the present application.
  • Fig. 2 is a schematic flowchart of a method for sending and receiving information in a wireless communication system according to an embodiment of the present application.
  • Fig. 3 is a schematic diagram of transmission and reception scenarios of pilot signals according to an embodiment of the present application.
  • Fig. 4 is a schematic flowchart of a channel information feedback method according to an embodiment of the present application.
  • FIG. 5 is a schematic diagram of the basic structure of a neural network according to an embodiment of the present application.
  • FIG. 6 is a schematic flowchart 1 of an information processing method according to an embodiment of the present application.
  • FIG. 7 is a schematic diagram of eigenvector information of channel information according to an embodiment of the present application.
  • Figures 8a to 8d are schematic diagrams of the composition and structure of the model according to the embodiment of the present application.
  • FIG. 9 is a schematic diagram of a second training sample according to an embodiment of the present application.
  • FIG. 10 is another schematic diagram of a second training sample according to an embodiment of the present application.
  • FIG. 11 is a schematic flowchart II of an information processing method according to an embodiment of the present application.
  • FIG. 12 is a schematic flowchart three of an information processing method according to an embodiment of the present application.
  • FIG. 13 is a schematic flowchart 4 of an information processing method according to an embodiment of the present application.
  • Fig. 14 is a schematic flowchart of a model generation method according to an embodiment of the present application.
  • FIG. 15 is a schematic diagram of a first model and a second model according to an embodiment of the present application.
  • FIG. 16 is a schematic flow of channel estimation for a reference signal according to an embodiment of the present application.
  • FIG. 17 is a schematic flowchart of channel state information feedback according to an embodiment of the present application.
  • FIG. 18 is a first schematic block diagram of a terminal device according to an embodiment of the present application.
  • FIG. 19 is a second schematic block diagram of a terminal device according to an embodiment of the present application.
  • FIG. 20 is a schematic block diagram of a network device according to an embodiment of the present application.
  • Fig. 21 is a schematic block diagram of an electronic device according to an embodiment of the present application.
  • Fig. 22 is a schematic block diagram of a communication device according to an embodiment of the present application.
  • Fig. 23 is a schematic block diagram of a chip according to an embodiment of the present application.
  • Fig. 24 is a schematic block diagram of a communication system according to an embodiment of the present application.
  • the technical solution of the embodiment of the present application can be applied to various communication systems, such as: Global System of Mobile communication (Global System of Mobile communication, GSM) system, code division multiple access (Code Division Multiple Access, CDMA) system, broadband code division multiple access (Wideband Code Division Multiple Access, WCDMA) system, General Packet Radio Service (GPRS), Long Term Evolution (LTE) system, Advanced long term evolution (LTE-A) system , New Radio (NR) system, evolution system of NR system, LTE (LTE-based access to unlicensed spectrum, LTE-U) system on unlicensed spectrum, NR (NR-based access to unlicensed spectrum) on unlicensed spectrum unlicensed spectrum (NR-U) system, Non-Terrestrial Networks (NTN) system, Universal Mobile Telecommunications System (UMTS), Wireless Local Area Networks (WLAN), Wireless Fidelity (Wireless Fidelity, WiFi), fifth-generation communication (5th-Generation, 5G) system or other communication systems, etc.
  • GSM Global System of Mobile
  • D2D Device to Device
  • M2M Machine to Machine
  • MTC Machine Type Communication
  • V2V Vehicle to Vehicle
  • V2X Vehicle to everything
  • the communication system in the embodiment of the present application may be applied to a carrier aggregation (Carrier Aggregation, CA) scenario, may also be applied to a dual connectivity (Dual Connectivity, DC) scenario, and may also be applied to an independent (Standalone, SA) deployment Web scene.
  • Carrier Aggregation, CA Carrier Aggregation
  • DC Dual Connectivity
  • SA independent deployment Web scene
  • the communication system in the embodiment of the present application may be applied to an unlicensed spectrum, where the unlicensed spectrum may also be considered as a shared spectrum; or, the communication system in the embodiment of the present application may also be applied to a licensed spectrum, where, Licensed spectrum can also be considered as non-shared spectrum.
  • the embodiments of the present application describe various embodiments in conjunction with network equipment and terminal equipment, wherein the terminal equipment may also be referred to as user equipment (User Equipment, UE), access terminal, user unit, user station, mobile station, mobile station, remote station, remote terminal, mobile device, user terminal, terminal, wireless communication device, user agent or user device, etc.
  • user equipment User Equipment, UE
  • access terminal user unit
  • user station mobile station
  • mobile station mobile station
  • remote station remote terminal
  • mobile device user terminal
  • terminal wireless communication device
  • wireless communication device user agent or user device
  • the terminal device can be a station (STAION, ST) in the WLAN, a cellular phone, a cordless phone, a Session Initiation Protocol (Session Initiation Protocol, SIP) phone, a wireless local loop (Wireless Local Loop, WLL) station, a personal digital processing (Personal Digital Assistant, PDA) devices, handheld devices with wireless communication functions, computing devices or other processing devices connected to wireless modems, vehicle-mounted devices, wearable devices, next-generation communication systems such as terminal devices in NR networks, or future Terminal equipment in the evolved public land mobile network (Public Land Mobile Network, PLMN) network, etc.
  • STAION, ST Session Initiation Protocol
  • SIP Session Initiation Protocol
  • WLL Wireless Local Loop
  • PDA Personal Digital Assistant
  • the terminal device can be deployed on land, including indoor or outdoor, handheld, wearable or vehicle-mounted; it can also be deployed on water (such as ships, etc.); it can also be deployed in the air (such as aircraft, balloons and satellites) superior).
  • the terminal device may be a mobile phone (Mobile Phone), a tablet computer (Pad), a computer with a wireless transceiver function, a virtual reality (Virtual Reality, VR) terminal device, an augmented reality (Augmented Reality, AR) terminal Equipment, wireless terminal equipment in industrial control, wireless terminal equipment in self driving, wireless terminal equipment in remote medical, wireless terminal equipment in smart grid , wireless terminal equipment in transportation safety, wireless terminal equipment in smart city, or wireless terminal equipment in smart home.
  • a virtual reality (Virtual Reality, VR) terminal device an augmented reality (Augmented Reality, AR) terminal Equipment
  • wireless terminal equipment in industrial control wireless terminal equipment in self driving
  • wireless terminal equipment in remote medical wireless terminal equipment in smart grid
  • wireless terminal equipment in transportation safety wireless terminal equipment in smart city, or wireless terminal equipment in smart home.
  • the terminal device may also be a wearable device.
  • Wearable devices can also be called wearable smart devices, which is a general term for the application of wearable technology to intelligently design daily wear and develop wearable devices, such as glasses, gloves, watches, clothing and shoes.
  • a wearable device is a portable device that is worn directly on the body or integrated into the user's clothing or accessories. Wearable devices are not only a hardware device, but also achieve powerful functions through software support, data interaction, and cloud interaction.
  • Generalized wearable smart devices include full-featured, large-sized, complete or partial functions without relying on smart phones, such as smart watches or smart glasses, etc., and only focus on a certain type of application functions, and need to cooperate with other devices such as smart phones Use, such as various smart bracelets and smart jewelry for physical sign monitoring.
  • the network device may be a device for communicating with the mobile device, and the network device may be an access point (Access Point, AP) in WLAN, a base station (Base Transceiver Station, BTS) in GSM or CDMA , or a base station (NodeB, NB) in WCDMA, or an evolved base station (Evolutional Node B, eNB or eNodeB) in LTE, or a relay station or access point, or a vehicle-mounted device, a wearable device, and an NR network
  • BTS Base Transceiver Station
  • NodeB, NB base station
  • Evolutional Node B, eNB or eNodeB evolved base station
  • LTE Long Term Evolutional Node B, eNB or eNodeB
  • gNB network equipment in the network or the network equipment in the future evolved PLMN network or the network equipment in the NTN network, etc.
  • the network device may have a mobile feature, for example, the network device may be a mobile device.
  • the network equipment may be a satellite or a balloon station.
  • the satellite can be a low earth orbit (low earth orbit, LEO) satellite, a medium earth orbit (medium earth orbit, MEO) satellite, a geosynchronous earth orbit (geosynchronous earth orbit, GEO) satellite, a high elliptical orbit (High Elliptical Orbit, HEO) satellite. ) Satellite etc.
  • the network device may also be a base station installed on land, water, and other locations.
  • the network device may provide services for a cell, and the terminal device communicates with the network device through the transmission resources (for example, frequency domain resources, or spectrum resources) used by the cell, and the cell may be a network device ( For example, a cell corresponding to a base station), the cell may belong to a macro base station, or may belong to a base station corresponding to a small cell (Small cell), and the small cell here may include: a metro cell (Metro cell), a micro cell (Micro cell), a pico cell ( Pico cell), Femto cell, etc. These small cells have the characteristics of small coverage and low transmission power, and are suitable for providing high-speed data transmission services.
  • the transmission resources for example, frequency domain resources, or spectrum resources
  • the cell may be a network device (
  • the cell may belong to a macro base station, or may belong to a base station corresponding to a small cell (Small cell)
  • the small cell here may include: a metro cell (Metro cell), a micro cell (Micro
  • FIG. 1 exemplarily shows a communication system 100 .
  • the communication system includes a network device 110 and two terminal devices 120 .
  • the communication system 100 may include multiple network devices 110, and the coverage of each network device 110 may include other numbers of terminal devices 120, which is not limited in this embodiment of the present application.
  • the communication system 100 may also include other network entities such as a mobility management entity (Mobility Management Entity, MME), an access and mobility management function (Access and Mobility Management Function, AMF), etc. Not limited.
  • MME Mobility Management Entity
  • AMF Access and Mobility Management Function
  • the network equipment may further include access network equipment and core network equipment. That is, the wireless communication system also includes multiple core networks for communicating with access network devices.
  • the access network device may be a long-term evolution (long-term evolution, LTE) system, a next-generation (mobile communication system) (next radio, NR) system or an authorized auxiliary access long-term evolution (LAA- Evolved base station (evolutional node B, abbreviated as eNB or e-NodeB) macro base station, micro base station (also called “small base station”), pico base station, access point (access point, AP), Transmission point (transmission point, TP) or new generation base station (new generation Node B, gNodeB), etc.
  • LTE long-term evolution
  • NR next-generation
  • LAA- Evolved base station evolutional node B, abbreviated as eNB or e-NodeB
  • eNB next-generation
  • NR next-generation
  • a device with a communication function in the network/system in the embodiment of the present application may be referred to as a communication device.
  • the communication equipment may include network equipment and terminal equipment with communication functions. It may include other devices in the communication system, such as network controllers, mobility management entities and other network entities, which are not limited in this embodiment of the present application.
  • the workflow of the wireless communication system may include: the sending end performs operations such as encoding, modulating, and encrypting on the information source to form the sending information to be transmitted; the sending information is transmitted through the wireless space, and at this time the sending information will be received by the channel The impact of the environment and interference noise; at the receiving end, the received information is decoded, decrypted and demodulated, and finally the source information is restored.
  • the sending end may be a network device in the aforementioned communication system shown in FIG. 1, and the receiving end may be a terminal device in the aforementioned communication system shown in FIG. 1; or, the sending end It may be a terminal device in the aforementioned communication system shown in FIG. 1 , and the receiving end may be a network device in the aforementioned communication system shown in FIG. 1 .
  • the sending end (such as a network device) will send some pilot signals, such as sending some channel state information reference signal (CSI-RS, Channel State Information Reference Signal), demodulation reference signal (DMRS, Demodulation Reference Signal), phase tracking reference signal (PT-RS, Phase Tracking Reference Signal), synchronization signal and PBCH block (SSB, Synchronization Signal and PBCH block), etc., used to assist the receiving end (such as terminal equipment) to obtain and estimate the current channel characteristics.
  • CSI-RS channel state information reference signal
  • DMRS demodulation reference signal
  • PT-RS Phase Tracking Reference Signal
  • SSB Synchronization Signal and PBCH block
  • the receiving end (such as a terminal device) can feed back corresponding channel information to the sending end (such as a network device) based on the estimated and recovered channel characteristics, and finally the sending end (such as a network device) performs corresponding coding according to the acquired channel information , Modulation, etc.
  • the sending end (such as a network device) sends out a specific pilot signal, and the pilot signal is received by the receiving end (such as a terminal device) after being transmitted through a channel.
  • the receiving end (such as a terminal device) can The pilot signal and the actual pilot signal estimate the channel condition through which the pilot signal passes and determine the channel information based on the channel condition.
  • the receiving end (such as a terminal device) obtains the channel information, it can also feed back the channel information to the sending end (such as a network device), and then the sending end (such as a network device) can perform subsequent data Scheduling etc.
  • the channel information acquired by the receiving end may be the channel information where the pilot signal is located (such as the time domain and/or frequency domain resource where the pilot signal is located); correspondingly, basic interpolation, etc.
  • the method restores the channel information in each time slot of the complete broadband based on the received channel information where the pilot signal is located, and then performs corresponding data scheduling and other processing.
  • the processing method for channel information feedback may include: after the terminal device estimates the channel information, it feeds back the channel information to the network device in a channel state information feedback manner in the current communication system.
  • the feedback of channel information is very important in LTE system and NR system, which determines the performance of MIMO transmission. Taking the CSI feedback process as an example in conjunction with FIG.
  • S410 The network device configures the feedback parameter information indicated by channel state information (CSI, Channel State information), for example, the network device configures the terminal device to feedback channel quality Which information in the indication (CQI, Channel Quality Indicator), precoding matrix indication (PMI, Precoding Matrix Indicator), rank indication (RI, Rank Indication) and other information; at the same time, the network device will configure some reference signals for CSI measurement , such as SSB or CSI-RS.
  • S430 The terminal device generates CSI by measuring the above reference signal.
  • S440 The terminal device feeds back the CSI to the network device.
  • the network device configures a data transmission mode based on the CSI, that is, the network device can configure a reasonable and efficient data transmission mode based on the CSI.
  • the CSI may include indications of information such as CQI, PMI, and RI.
  • the above-mentioned processing method of channel information feedback can further introduce research and design of artificial intelligence represented by neural network, for example, the estimation of wireless channel can be realized through the design of neural network.
  • the basic structure of the neural network includes: an input layer, a hidden layer and an output layer; the input layer is responsible for receiving data, the hidden layer processes the data, and the final result is produced at the output layer .
  • Each node in Figure 5 represents a processing unit, and each processing unit can simulate a neuron, and multiple neurons form a layer of neural network, and multi-layer information transmission and processing construct an overall neural network.
  • neural network deep learning algorithms have been proposed in recent years, more hidden layers have been introduced, and feature learning is performed through layer-by-layer training of neural networks with multiple hidden layers, which greatly improves the learning of neural networks.
  • processing capabilities and are widely used in pattern recognition, signal processing, optimization combination, anomaly detection, etc.
  • the "indication" mentioned in the embodiments of the present application may be a direct indication, may also be an indirect indication, and may also mean that there is an association relationship.
  • a indicates B which can mean that A directly indicates B, for example, B can be obtained through A; it can also indicate that A indirectly indicates B, for example, A indicates C, and B can be obtained through C; it can also indicate that there is an association between A and B relation.
  • the term "corresponding" may indicate that there is a direct or indirect correspondence between the two, or that there is an association between the two, or that it indicates and is indicated, configuration and is configuration etc.
  • Fig. 6 is a schematic flowchart of an information processing method 600 according to an embodiment of the present application.
  • the method can optionally be applied to the system shown in Fig. 1, but is not limited thereto.
  • the method includes at least some of the following.
  • the terminal device receives first information.
  • the terminal device sends second information obtained based on the first information
  • the second information is obtained by processing the first information through the first model, and the second information is used for processing through the second model to obtain channel information; the first model and the second model are a joint obtained by training.
  • the first information may be a reference signal
  • the reference signal may be a reference signal of the current channel, such as a downlink reference signal of the current channel.
  • the downlink reference signal may include at least one of CSI-RS, DMRS, and PT-RS.
  • the first information may be distributed in the first dimension and/or the second dimension.
  • the first dimension is a time domain dimension; the first information is distributed in at least one time unit in the time domain dimension.
  • Each time unit in the at least one time unit may include one of the following: 1 time slot and 1 Orthogonal Frequency Division Multiplexing (OFDM, Orthogonal Frequency Division Multiplexing) symbol.
  • the first signal is a downlink reference signal, and the downlink reference signal may be distributed in one time slot in the time domain dimension, or the downlink reference signal may be distributed in two or on 4 time slots.
  • the second dimension is a frequency domain dimension; the first information is distributed on at least one frequency domain resource in the frequency domain dimension; wherein, each frequency domain resource in the at least one frequency domain resource can be one of the following One: one resource block (RB, Resource Block), one subcarrier.
  • RB resource block
  • the first signal is a downlink reference signal, and the downlink reference signal may be distributed in 1 RB in the frequency domain dimension, or the downlink reference signal may be distributed in 2 or 4 RBs in the time domain dimension. on RBs.
  • the first dimension and the second dimension above can be used in combination, that is, the first information can be distributed on the first dimension and the second dimension; for example, the first information can be distributed on a RBs in the frequency domain dimension , distributed in b time slots in the time domain dimension; both a and b are positive integers.
  • the first information is a downlink reference signal, and the downlink reference signal may be distributed in 4 RBs in the frequency domain, and may be distributed in 6 time slots in the time domain dimension.
  • the first information can also be expressed as a complex number, that is, the first information is also distributed in the third dimension; the third dimension is a complex dimension; the first information includes the first information sample The real part of and the imaginary part of the first information sample.
  • the real part of the first information is distributed on the a RBs of the frequency domain resources and the b time slots of the time domain resources
  • the imaginary part of the first information is distributed on the a RBs of the frequency domain resources. b time slots of the time domain resource.
  • the terminal device may first receive configuration information sent by the network device, and the configuration information may be configured with the first information for the terminal device to measure.
  • the configuration information may be configuring the terminal device to measure SSB or CSI-RS and so on.
  • the terminal device may process the first information based on the first model to obtain the second information.
  • the second information is channel compression information
  • the first model is configured to process the input first information to obtain channel compression information. That is, the input information of the first model is the first information, and the second information output by the first model is the channel compression information.
  • the first model may also be called an encoding model or an encoding network, as long as the input information is the first information and the output information is the channel compression information, the model or neural network is within the protection scope of this embodiment.
  • the first model may specifically include the following sub-models: an estimation sub-model and a compression sub-model;
  • the estimation sub-model is used to perform channel estimation based on the first information to obtain channel estimation information
  • the compression sub-model is used to compress the channel estimation information to obtain channel compression information.
  • the estimation sub-model can also be called a channel estimation sub-model or a channel estimation sub-neural network, and the estimation sub-model can use one of a fully connected network, a convolutional neural network, a residual network, and a self-attention mechanism network. Or a variety of network structure construction.
  • the compression sub-model may be called a channel compression sub-model or a channel compression sub-neural network, and the compression sub-model may use one of a fully connected network, a convolutional neural network, a residual network, a self-attention mechanism network, or A variety of network structure construction.
  • the estimation method adopted by the estimation sub-model may include algorithms such as minimum mean square error (MMSE).
  • MMSE minimum mean square error
  • the compression sub-model can compress the data volume of the input information.
  • the compression rate between the output information of the compressed sub-model and the input information can be determined during training, for example, the compression rate can be 5/1000, 2/1000, 10%, etc. lift.
  • the terminal device processes the first information based on the first model to obtain the second information, which may specifically be:
  • the terminal device inputs the first information into the estimation sub-model, and obtains channel estimation information output by the estimation sub-model;
  • the terminal device inputs the channel estimation information into the compression sub-model, and obtains channel compression information output by the compression sub-model.
  • the first information may be a reference signal, specifically, the first information may be a reference signal of a current channel, for example, the first information may be a downlink reference signal of a current channel.
  • the channel information may be used to characterize at least one of channel quality, channel state, and channel estimation result obtained based on the first information.
  • the channel information may be represented by a matrix of T dimensions, where T is an integer greater than or equal to 2.
  • the channel estimation information may also be represented by a matrix of T dimensions.
  • the channel information is used as an example for description below, and the description of the channel estimation information is similar to that and will not be repeated.
  • the matrix of the T dimensions may specifically be a two-dimensional matrix of M ⁇ N; wherein, M represents the number of first granularities in the fourth dimension, and N represents the number of second granularities in the fifth dimension; M and N are all positive integers. That is to say, the channel information may be composed of a two-dimensional matrix with a size of M ⁇ N, which has M first granularities in the fourth dimension and N second granularities in the fifth dimension; the above M and N May or may not be equal.
  • the specific numerical indication in the two-dimensional matrix represents the received signal strength at a certain first granularity of the channel quality, where the unit of the numerical value in the two-dimensional matrix may be dBm, or the numerical value in the two-dimensional matrix has no unit It is the value obtained after normalization.
  • the two-dimensional matrix of M ⁇ N can also be synthesized into one-dimensional data of size 1 ⁇ (M ⁇ N) or (M ⁇ N) ⁇ 1.
  • the specific transformation can be the fourth dimension first and then the fifth dimension. It may also be the fifth dimension first and then the fourth dimension, which is not limited in this embodiment.
  • the fourth dimension may be a frequency domain dimension; the first granularity includes one of the following: L1 resource blocks RB, L2 subcarriers; L1 and L2 are positive integers.
  • the fourth dimension may be a time-domain dimension; the first granularity includes one of the following: K1 microseconds, K2 symbol lengths, and the number of sampling points of K3 symbols; K1, K2, and K3 are positive integers .
  • the symbols are Orthogonal Frequency Division Multiplexing (OFDM, Orthogonal Frequency Division Multiplexing).
  • OFDM Orthogonal Frequency Division Multiplexing
  • the first granularity may be L1 RBs (L1 is greater than or equal to 1, such as 2RB, 4RB, 8RB), then the channel information in the frequency domain dimension
  • the distribution range is the frequency domain range corresponding to M ⁇ L1 RBs; or the first granularity can be L2 subcarriers (L2 is greater than 1, such as 4 subcarriers, 6 subcarriers, and 18 subcarriers), then the channel
  • the distribution of information in the frequency domain dimension is the frequency domain range corresponding to M ⁇ L2 subcarriers.
  • the first granularity may be a delay granularity, for example, a first granularity is the number of sampling points of K1 microseconds, or K2 symbol lengths, or K3 symbols,
  • the symbol may be an OFDM symbol;
  • the fourth dimension is the time domain dimension and the first granularity is K1 microseconds
  • the distribution range of the channel information on the time domain dimension is M ⁇ K1 The time domain range corresponding to microseconds.
  • the fifth dimension may be a space domain dimension; correspondingly, the second granularity is an interval between a pair of transmitting and receiving antennas or an angle of arrival. That is to say, the fifth dimension is the space domain dimension, specifically, it may be an antenna dimension, and the second granularity may be a pair of transmitting and receiving antennas. Alternatively, the fifth dimension is a space domain dimension, specifically, an angle domain dimension, and the second granularity may be an interval of arrival angles.
  • the value of the ijth position in the two-dimensional matrix representing the channel information is used to represent the i-th first granularity in the fourth dimension and the j-th second granularity in the fifth dimension
  • the channel quality of ; i and j are both positive integers. That is to say, a numerical value (or an indicator value) at a certain position in the two-dimensional matrix used to represent the channel information represents the channel quality under the combination of the fourth dimension and the fifth dimension.
  • the channel quality may be characterized by a signal strength value; the unit of the signal strength value may be dBm, or the signal strength value has no unit but a value obtained after normalization.
  • the T dimensions may also include a sixth dimension.
  • the matrix of T dimensions may be a three-dimensional matrix of M ⁇ N ⁇ W; wherein, M represents the number of first granularities in the fourth dimension, N represents the number of second granularities in the fifth dimension, and W represents The number of third granularities under the sixth dimension; M, N and W are all positive integers.
  • the sixth dimension may be a complex dimension, the third granularity is 1, and the quantity W of the third granularity under the complex dimension is 2.
  • the fourth dimension represents the time domain dimension
  • the first granularity is the delay granularity
  • the fifth dimension is the spatial domain dimension, specifically the angle dimension
  • the second granularity is the interval of arrival angles
  • the sixth dimension is a complex dimension, W is 2, k is 1 to indicate the real part, and k is 2 to indicate the imaginary part.
  • the above channel information is illustrated by using a two-dimensional matrix formed by the fourth dimension and the fifth dimension.
  • the dimension of the above channel information matrix is not limited to two dimensions.
  • the second information may be channel compression information; the channel compression information includes eigenvector information of compressed channel estimation information; correspondingly, the first model is used for the input first The information is processed to obtain the eigenvector information of the compressed channel estimation information.
  • the first information may be a reference signal, specifically, the first information may be a reference signal of a current channel, for example, the first information may be a downlink reference signal of a current channel.
  • the channel information output by the second model may specifically be feature vector information of the channel information.
  • the first model can also be called an encoding model or an encoding neural network, etc., as long as the input information is the first information and the output information is the eigenvector information of the compressed channel estimation information or the neural network.
  • the input information is the first information
  • the output information is the eigenvector information of the compressed channel estimation information or the neural network.
  • the first model may specifically include the following sub-models: estimation sub-model, channel generation sub-model and compression sub-model;
  • the estimation sub-model is used to perform channel estimation on the first information to obtain channel estimation information
  • the channel generation sub-model is used to perform eigendecomposition on the channel estimation information to obtain eigenvector information of the channel estimation information;
  • the compression sub-model is used to compress the eigenvector information of the channel estimation information to obtain compressed eigenvector information of the channel estimation information.
  • the eigenvector information of the channel information includes R groups of eigenvector sequence information; R is a positive integer.
  • R may be 1, then the eigenvector information of the channel information includes a set of eigenvector sequence information.
  • R may be 2, then the eigenvector information of the channel information includes 2 sets of eigenvector sequence information.
  • the above value of R may be determined according to the actual situation, or may be specified during the training of the first model.
  • the eigenvector information of the channel estimation information may also include R groups of eigenvector sequence information, which will not be described in detail.
  • each set of feature vector sequence information may include a feature sequence of a preset length.
  • the preset length can be set according to the actual situation or can be set during training, for example, it can be any one of 16, 32, 48, 64, 128, 256, and of course it can be longer or shorter.
  • the embodiment does not exhaustively list all possible values of the preset length. In conjunction with FIG. 7, for example, the preset length is 32 (but it can be a bit), and R is 4, that is, the feature vector information of the channel information includes 4 sets of feature vector sequence information, wherein each set of feature vector sequence information contains a feature sequence of length 32.
  • the manner of performing eigendecomposition in the channel generation sub-model may specifically be a singular value decomposition (SVD, Singular Value Decomposition) manner.
  • SVD singular value decomposition
  • the channel generation sub-model performs SVD eigendecomposition on the input channel estimation information to obtain eigenvector information of the channel estimation information after eigendecomposition.
  • the channel estimation information may be represented by a matrix, and the specific description is the same as that of the foregoing embodiment, and will not be repeated here.
  • the compression sub-model may be to compress the data volume of the input information.
  • the compression rate between the second information output by the compression sub-model and the input information can be determined during training, for example, the compression rate can be five thousandths, two thousandths, ten percent, etc., not here Exhaustive.
  • the channel estimation information output by the estimation sub-model may be different from the channel information output by the second model, and the channel estimation information output by the estimation sub-model may specifically be a matrix of channel information, such as It is represented by a matrix of T dimensions; the channel information output by the second model may be eigenvector information of the channel information, for example, may include R groups of eigenvector sequence information.
  • the channel information output by the second model in this example may also be the same as the channel estimation information output by the estimation sub-model.
  • the terminal device may process the first information based on the first model to obtain the second information, which may specifically be:
  • the terminal device inputs the first information into the estimation sub-model, and obtains channel estimation information output by the estimation sub-model;
  • the terminal device inputs the channel estimation information into the channel generation sub-model, and obtains eigenvector information of the channel estimation information output by the channel generation sub-model;
  • the terminal device inputs the eigenvector information of the channel estimation information into the compression sub-model, and obtains the eigenvector information of the compressed channel estimation information output by the compression sub-model.
  • the terminal device may execute S620, where the terminal device sends second information obtained based on the first information.
  • the terminal device sending the second information obtained based on the first information may be: the terminal device sends the second information obtained based on the first information to a network device.
  • the network device may be an access network device (such as a base station, or eNB, or gNB) serving the terminal device, or the network device may refer to an access network device that communicates with the terminal device (such as a base station, or eNB, or gNB).
  • the second information may be carried by one of the following information: information included in the random access process, radio resource control (RRC, Radio Resource Control) message, uplink control information (UCI, Uplink Control Information).
  • RRC Radio Resource Control
  • UCI Uplink Control Information
  • the information contained in the random access process includes one of the following: message A in the two-step random access process; Msg1 in the four-step random access process; Msg3 in the four-step random access process.
  • the above describes how the terminal device uses the first model in detail.
  • the first way the terminal device obtains the first model directly
  • the second One way obtained by the terminal device through training. The two methods are described below:
  • the terminal device receives the first model.
  • the terminal device receives the first model sent by the electronic device; for example, the terminal device may receive model parameters of the first model sent by the electronic device.
  • the electronic device may be an electronic device that obtains the first model and the second model through joint training.
  • the electronic device may be a network device, and the network device may be an access network device serving the terminal device, such as a base station, eNB, gNB, and so on.
  • the electronic device may be other devices than the network device serving the terminal device, for example, it may be a server, or a desktop computer, or a notebook or other device with data processing capabilities, which is not described in this embodiment. Exhaustive.
  • receiving the first model sent by the electronic device by the terminal device may specifically be: the terminal device receives the first model sent by the network device. a model.
  • the first model (or the model parameters of the first model) may be carried by at least one of the following: downlink control signaling, media access control (MAC, Media Access Control) control element (CE, Control Element) Messages, RRC messages, broadcast messages, downlink data transmission, and downlink data transmission for artificial intelligence business transmission requirements.
  • the first model (or the model parameters of the first model) may be transmitted through a wired connection, or other transmitted over a wireless connection.
  • the electronic device transmits the first model (or the model parameters of the first model) to the terminal device through a wired connection with the terminal device.
  • the electronic device transmits the first model (or the model parameters of the first model) to the terminal device through other wireless connections with the terminal device; wherein, the other wireless connection method may be a Bluetooth connection Ways or wireless fidelity (Wi-Fi, Wireless Fidelity) connection ways, etc., are not exhaustive here.
  • the terminal device may receive multiple sub-models respectively, and then combine the multiple received sub-models to obtain the first model.
  • the first model includes an estimation sub-model and a compression sub-model.
  • the terminal device receives the estimated sub-model and the compressed sub-model; the terminal device generates the first model based on the estimated sub-model and the compressed sub-model. Specifically, the terminal device receives the model parameters of the estimated sub-model and the model parameters of the compressed sub-model sent by the electronic device; parameters to obtain the first model.
  • the terminal device may receive the estimated sub-model and the compressed sub-model sent by the electronic device at the same time; or, it may receive the estimated sub-model and the compressed sub-model sent by the electronic device separately, for example, it may be received first
  • the estimated sub-model sent by the electronic device then receives the compressed sub-model sent by the electronic device, or first receives the compressed sub-model sent by the electronic device and then receives the estimated sub-model sent by the electronic device.
  • the estimated sub-model (or the model parameters of the estimated sub-model) and the compressed sub-model (or the compressed sub-model) may be carried simultaneously or separately by one of the following information Model parameters of the model): downlink control signaling, MAC CE message, RRC message, broadcast message, downlink data transmission, downlink data transmission for artificial intelligence business transmission requirements.
  • the electronic device may use the above estimated sub-model (or the model of the estimated sub-model) through a wired connection with the terminal device parameters) and the compressed sub-model (or the model parameters of the compressed sub-model) are sent to the terminal device at the same time or separately.
  • the electronic device sends the above-mentioned estimated sub-model (or the model parameters of the estimated sub-model) and the compressed sub-model (or the model parameters of the compressed sub-model) to the terminal at the same time or separately through other wireless connections with the terminal device device; wherein, the other wireless connection methods may be a Bluetooth connection method or a WIFI connection method, etc., which are not exhaustive here.
  • the first model includes an estimation sub-model, a channel generation sub-model, and a compression sub-model.
  • the terminal device receives the estimation sub-model, the compression sub-model and the channel generation sub-model; the terminal device generates the first model. Specifically, the terminal device receives the model parameters of the estimated sub-model, the model parameters of the compression sub-model and the model parameters of the channel generation sub-model sent by the electronic device; , model parameters of the compression sub-model, and model parameters of the channel generation sub-model to obtain the first model.
  • the terminal device may simultaneously receive the estimation sub-model, compression sub-model and channel generation sub-model sent by the electronic device.
  • the terminal device may respectively receive the estimated sub-model, the compressed sub-model and the channel generation sub-model sent by the electronic device, for example, the estimated sub-model, the compressed sub-model and the channel generation sub-model are respectively received; or, the estimated Any two of the sub-model, compression sub-model and channel generation sub-model are received separately from the remaining one.
  • the terminal device may first receive the estimated submodel sent by the electronic device, then receive the channel generation submodel sent by the electronic device, and finally receive the compressed submodel sent by the electronic device; or, the terminal The device first receives the compressed submodel and the channel generation submodel sent by the electronic device, and then receives the estimated submodel sent by the electronic device.
  • the above is only an exemplary description, and does not mean that there are only several combinations of the above-mentioned exemplary sub-models, compression sub-models, and channel generation sub-models that are actually sent or received respectively, but this embodiment is not exhaustive. lift.
  • the electronic device is a network device serving the terminal device
  • the terminal device can simultaneously receive or separately receive the estimation submodel, compression submodel and channel generation submodel sent by the network device
  • the The estimation sub-model, compression sub-model and channel generation sub-model are carried by one of the following at the same time or separately: downlink control signaling, MAC CE message, RRC message, broadcast message, downlink data transmission, downlink for artificial intelligence business transmission requirements data transmission.
  • the electronic device can generate the above-mentioned estimated sub-model, compressed sub-model and channel through a wired connection with the terminal device.
  • the sub-models are sent to the terminal devices simultaneously or separately.
  • the electronic device sends the above-mentioned estimation submodel, compression submodel and channel generation submodel to the terminal device simultaneously or separately through other wireless connections with the terminal device; wherein, the other wireless connection methods may be Bluetooth or WIFI, etc., are not exhaustive here.
  • the terminal device may receive the first model, and then may execute the foregoing processing of S610-S620.
  • the method may further include: the terminal device receiving the second model. Specifically, the terminal device may receive model parameters of the second model. Still further, the terminal device may receive the second model sent by the electronic device, specifically, the terminal device may receive model parameters of the second model sent by the electronic device.
  • the second model is carried by one of the following: downlink control signaling, MAC CE message, RRC message, broadcast message, downlink data transmission, manual Downlink data transmission required for intelligent service transmission.
  • the second model (or the model parameters of the second model) may be transmitted through a wired connection, or other transmitted over a wireless connection.
  • the electronic device transmits the second model (or the model parameters of the second model) to the terminal device through a wired connection with the terminal device.
  • the electronic device transmits the second model (or the model parameters of the second model) to the terminal device through other wireless connections with the terminal device; wherein, the other wireless connection methods may be Bluetooth or WIFI, etc., are not exhaustive here.
  • the above-mentioned second model and the first model may also be sent at the same time, or may be sent separately.
  • the method may further include: the terminal device receiving the third model. Specifically, the terminal device may receive model parameters of the third model. Still further, the terminal device may receive the third model sent by the electronic device, specifically, the terminal device may receive model parameters of the third model sent by the electronic device.
  • the third model is used to perform data transformation processing on the second information output by the first model and input it into the second model; the first model, the second model and the third model are obtained through joint training .
  • the data transformation processing includes: convolution processing or Fourier transform processing.
  • the Fourier transform processing may specifically include: converting to the frequency domain through Fourier transform, multiplication, and then converting to the time domain through inverse Fourier transform.
  • the third model is carried by one of the following: downlink control signaling, MAC CE message, RRC message, broadcast message, downlink data transmission, manual Downlink data transmission required for intelligent business transmission.
  • the third model (or the model parameters of the third model) may be transmitted through a wired connection, or other transmitted over a wireless connection.
  • the electronic device transmits the third model (or the model parameters of the third model) to the terminal device through a wired connection with the terminal device.
  • the electronic device transmits the third model (or the model parameters of the third model) to the terminal device through other wireless connections with the terminal device; wherein, the other wireless connection methods may be Bluetooth or WIFI, etc., are not exhaustive here.
  • first model, second model, and third model can be sent separately or simultaneously; or the above-mentioned first model, second model, and third model can also be sent separately; or, it is also possible It is any two combinations of which are sent at the same time, and the remaining one is sent separately, and so on.
  • the foregoing first model is a model that the terminal device needs to use when receiving the first information.
  • the terminal device can also receive the second model and/or the third model, and the reasons are as follows:
  • the terminal device may decide whether to use the first model and the second model received this time after completing the overall evaluation of the first model and the second model. If the overall evaluation result of the terminal device on the first model and the second model is poor (for example, the compression rate is low or the accuracy rate of recovered channel information is low, etc.), the first model may not be used.
  • the terminal device may re-train the first model and the second model jointly to update the model parameters of the first model and the second model, or the terminal device Train yourself to get a new first model and a new second model.
  • the terminal device obtains the new first model and the new second model after performing joint training or updating again, it can also synchronize the new first model and the new second model to the network device; correspondingly, After the network device receives the new first model and the new second model, it can also replace the original first model and the second model used by itself, and can also send the new first model and the new second model to other terminal equipment.
  • this embodiment does not exhaustively enumerate them. Through the above processing, it can be ensured that the first model with the best performance and its corresponding second model are used in the whole system, so that the communication performance of the whole system can be further guaranteed.
  • the terminal device can also perform an overall evaluation on the first model, the second model, and the third model after receiving it and subsequent Processing, the specific processing method is the same as the above, and will not be repeated.
  • the terminal device trains itself to obtain the above-mentioned first model.
  • the terminal device uses training samples to jointly train the first preset model and the second preset model to obtain the trained first model and the second model;
  • the first model is the first preset model after training
  • the second model is the second preset model after training
  • the training may use the first loss function or the second loss function.
  • the following describes the training using the above two loss functions:
  • the loss function used in the training is a first loss function; the first loss function is based on the difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model degree of difference is constructed.
  • the degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model is determined based on a distance, or determined based on a degree of similarity.
  • the specific calculation method for determining the degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model based on the distance can use mean square error (MSE, Mean Squared Error ) or normalized mean square error (NMSE), etc., which are not exhaustive in this embodiment.
  • MSE mean square error
  • NMSE normalized mean square error
  • the output information of the second preset model may be a matrix
  • the input information of the compressed preset sub-model may also be a matrix
  • the output matrix of the second preset model It is called matrix 1
  • the matrix of the input of the compressed preset submodel is called matrix 2
  • the output information of the second preset model and the compressed preset submodel of the first preset model are determined based on the distance
  • the way of the degree of difference between the input information is the MSE way, for example, its calculation may include: calculating the difference between matrix 1 and matrix 2, and taking the square of the difference as the difference degree.
  • the specific calculation method for determining the degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model based on the degree of similarity may use cosine similarity or cosine similarity squared etc., which are not exhaustive in this embodiment.
  • the output information of the second preset model may be R sets of feature vector sequence information
  • the input information of the compressed preset sub-model may also be R sets of feature vector sequence information.
  • the The R sets of feature vector sequence information output by the second preset model are called feature vector sequence 1
  • the R sets of feature vector sequence information input by the compressed preset sub-model are called feature vector sequence 2.
  • the method of determining the degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model based on the degree of similarity may be cosine similarity, for example, its calculation may include: The cosine angle between the eigenvector sequence 1 and the eigenvector sequence 2 is used to determine the degree of similarity, and the degree of similarity is used as the degree of difference.
  • the first preset model includes an estimation preset sub-model and a compression preset sub-model.
  • FIG. 8 a it illustrates a first preset model 800 , a second preset model 810 , and an estimated preset sub-model 801 and a compressed preset sub-model 802 included in the first preset model 800 .
  • the above-mentioned first preset model 800, the second preset model 810, and the input-output relationship between the estimated preset sub-model 801 and the compressed preset sub-model 802 contained in the first preset model 800 can be: estimated The input information of the preset submodel 801 is the first training sample 920; the output information of the estimated preset submodel 801 is used as the input information of the compressed preset submodel 802; the output information of the compressed preset submodel 802 As the input information of the second preset model 810 .
  • the terminal device uses training samples to jointly train the first preset model and the second preset model, including:
  • the terminal device inputs the first training sample into the estimated preset sub-model of the first preset model, and obtains initial information output by the estimated preset sub-model;
  • the first training samples may be reference signal samples.
  • the reference signal samples may be original reference signals or processed reference signals obtained through historical acquisition. More specifically, the reference signal samples may be downlink reference signal samples. It should be understood that this embodiment does not limit that the first training samples must be the downlink reference signal samples, and uplink reference signal samples or other reference signal samples may also be used, which are not exhaustive in this embodiment.
  • the information input into the estimated preset sub-model of the first preset model may also be wireless channel or other scene-related information, for example, may include at least one of the following: Channel signal-to-noise ratio, signal-to-interference-noise ratio, channel type, bandwidth information, delay information, etc. Whether one or more of the above information is input during the joint training process may be determined according to actual conditions or actual scenarios, and is not limited here.
  • a specific function of the estimation preset sub-model of the first preset model may be: perform channel estimation based on the first training samples to obtain initial information.
  • the channel estimation may adopt algorithms such as minimum mean square error (MMSE).
  • the aforementioned initial information may be a matrix, and the dimension of the matrix is not limited here, and may be a matrix of two or more dimensions.
  • the value at each position in the matrix is used to represent the corresponding channel quality at the corresponding granularity corresponding to multiple dimensions.
  • the channel quality may be characterized by a signal strength value; the unit of the signal strength value may be dBm, or the signal strength value has no unit but a value obtained after normalization.
  • the compressed preset sub-model of the first preset model compresses the data volume of the input information.
  • the compression rate between the output information of the compressed sub-model and the input information can be determined during training, for example, the compression rate can be 5/1000, 2/1000, 10%, etc. lift.
  • the compressed information obtained by compressing the preset sub-model contains less data than the input initial information.
  • the form of the above-mentioned compressed information is the same as that of the initial information, for example, the initial information is a matrix, and the corresponding compressed information is also a matrix, and the matrix dimensions of the initial information and the compressed information are the same but the amount of data is different.
  • the function of the second preset model may be to decompress its input information.
  • the input information of the second preset model is the compressed information
  • the output of the second preset model is the restored information.
  • the decompression rate of the second preset model should make the obtained restored information contain the same data content as the original information.
  • the performing reverse conduction update of the first preset model and the second preset model based on the first loss function may specifically refer to performing reverse conduction update based on the degree of difference determined by the first loss function.
  • the model parameters of the estimated preset sub-model, the model parameters of the compressed preset sub-model, and the model parameters of the second preset model may specifically refer to performing reverse conduction update based on the degree of difference determined by the first loss function.
  • the manner of the above training convergence may include at least one of the following: judging whether the number of iterative training reaches a preset number, and judging whether the degree of difference is smaller than a preset threshold.
  • the preset number of times and the preset threshold value can be set according to actual conditions, and are not exhaustive. That is to say, when it is determined that the training is completed based on the above manner, the first preset model after the training can be used as the first model, and the second preset model after the training can be used as the second model.
  • the first preset model includes an estimation preset submodel, a preset channel generation submodel, and a compression preset submodel.
  • FIG. 8b it shows a first preset model 800, a second preset model 810, and the estimation preset sub-model 801, compression preset sub-model 802 and channel generation preset included in the first preset model 800.
  • Set sub-model 803 it shows a first preset model 800, a second preset model 810, and the estimation preset sub-model 801, compression preset sub-model 802 and channel generation preset included in the first preset model 800.
  • first preset model 800 second preset model 810, and the estimation preset submodel 801
  • the input-output relationship of can be as follows: the input information of the estimated preset sub-model 801 is the first training sample 920; the output information of the estimated preset sub-model 801 is used as the input information of the channel generation preset sub-model 803; The output information of the channel generation preset submodel 803 is used as the input information of the compression preset submodel 802 ; the output information of the compression preset submodel 802 is used as the input information of the second preset model 810 .
  • the terminal device uses training samples to jointly train the first preset model and the second preset model, which may include:
  • the terminal device inputs the first training sample into the estimated preset sub-model of the first preset model, and obtains initial information output by the estimated preset sub-model;
  • the information input into the estimated preset sub-model of the first preset model may also be wireless channel or other scene-related information, for example, may include at least one of the following: Channel signal-to-noise ratio, signal-to-interference-noise ratio, channel type, bandwidth information, delay information, etc. Whether one or more of the above information is input during the joint training process may be determined according to actual conditions or actual scenarios, and is not limited here.
  • a specific function of the estimation preset sub-model of the first preset model may be: perform channel estimation based on the first training samples to obtain the initial information.
  • the channel estimation may adopt algorithms such as minimum mean square error (MMSE).
  • the initial information output by the estimated preset sub-model above may be a matrix, and the dimension of the matrix is not limited here, and may be a two-dimensional or more dimensional matrix.
  • the value at each position in the matrix is used to represent the corresponding channel quality at the corresponding granularity corresponding to multiple dimensions.
  • the channel quality may be characterized by a signal strength value; the unit of the signal strength value may be dBm, or the signal strength value has no unit but a value obtained after normalization.
  • a function of the channel generation preset submodel may be to perform eigendecomposition on the initial information to obtain eigenvector information of the initial information.
  • the eigenvector information of the initial information may include R groups of eigenvector sequences.
  • the method of performing eigendecomposition on the initial information may adopt a method of Singular Value Decomposition (SVD, Singular Value Decomposition).
  • the compressed preset sub-model of the first preset model compresses the data volume of the input information.
  • the compression rate between the output information of the compressed sub-model and the input information can be determined during training, for example, the compression rate can be 5/1000, 2/1000, 10%, etc. lift.
  • the compressed eigenvector information obtained by compressing the preset sub-model contains less data than the eigenvector information of the input initial information.
  • the above compressed feature vector information is in the same form as the feature vector information of the initial information.
  • the feature vector information of the initial information is a sequence of R groups of feature vectors
  • the compressed feature vector information is also a sequence of feature vectors of groups R but The amount of data contained in the two is different.
  • the function of the second preset model may be to decompress its input information.
  • the input information of the second preset model is compressed feature vector information
  • the output of the second preset model is restored feature vector information.
  • the decompression rate of the second preset model should be such that the obtained restored feature vector information contains the same or substantially the same data as the feature vector information of the initial information.
  • the function of the above-mentioned second preset model can also include recovering the input information to obtain the restored initial information. At this time, the decompression rate of the second preset model should make the recovered initial information contain the same content as the initial information. or basically the same data.
  • the performing reverse conduction to update the first preset model and the second preset model based on the degree of difference determined by the first loss function may specifically refer to: performing a reverse conduction based on the degree of difference determined by the first loss function performing reverse conduction to update model parameters of the estimated preset submodel, model parameters of the channel generation preset submodel, model parameters of the compression preset submodel, and model parameters of the second preset model.
  • the method for determining the convergence of the above training is the same as that of the above-mentioned case 1, and repeated explanations are not repeated.
  • the terminal device uses training samples to jointly train the first preset model and the second preset model to obtain the trained first model and the second model, including:
  • the terminal device uses training samples to jointly train the first preset model, the second preset model and the third preset model to obtain the trained first model, the second model and the third model; wherein , the third model is a trained third preset model.
  • FIG. 8c it shows a first preset model 800, a second preset model 810, a third preset model 830, and the estimated preset sub-model 801 contained in the first preset model 800, the compression preset Submodel 802.
  • the above-mentioned first preset model 800, the second preset model 810, and the input-output relationship between the estimated preset sub-model 801 and the compressed preset sub-model 802 contained in the first preset model 800 can be: estimated The input information of the preset submodel 801 is the first training sample 920; the output information of the estimated preset submodel 801 is used as the input information of the compressed preset submodel 802; the output information of the compressed preset submodel 802 is used as the The input information of the third preset model 830; the output information of the third preset model is used as the output information of the second preset model 810.
  • the terminal device uses training samples to jointly train the first preset model, the second preset model and the third preset model, including:
  • the terminal device inputs the first training sample into the estimated preset sub-model of the first preset model, and obtains initial information output by the estimated preset sub-model;
  • the information input into the estimated preset sub-model of the first preset model can also be wireless channel or other information related to the scene, for example, it can include at least one of the following: channel Signal-to-noise ratio, signal-to-interference-noise ratio, channel type, bandwidth information, delay information, etc.
  • channel Signal-to-noise ratio signal-to-interference-noise ratio
  • channel type bandwidth information
  • delay information etc.
  • estimation preset sub-model of the first preset model and the compression preset sub-model of the first preset model are the same as those in the first case, so the description will not be repeated.
  • a third preset model is added relative to case one, and the function of the third preset model is to simulate the channel environment, and the specific processing can be to perform data transformation on input information to obtain data transformed information as output information .
  • the specific processing method of the data transformation may include convolution processing or data processing equivalent to convolution; wherein, the data processing equivalent to convolution may be multiple Fourier transform processing, for example, may In order to transform the input information of the third preset model into the frequency domain through Fourier transform, multiply and then transform into the time domain through inverse Fourier transform, the process of convolution in the time domain is equivalent.
  • the function of the second preset model may be to decompress its input information.
  • the input information of the second preset model is transformed information, and the output of the second preset model is restored information.
  • the decompression rate of the second preset model should make the obtained restored information contain the same data as the original information.
  • the updating of the first preset model, the second preset model, and the third preset model based on the first loss function may specifically refer to: performing reverse conduction based on the first loss function Conductively updating model parameters of the estimated preset sub-model, model parameters of the compressed preset sub-model, model parameters of the second preset model, and model parameters of the third preset model.
  • Case 4 is different from the above case 3 in that the first preset model includes an estimation preset sub-model, a preset channel generation sub-model, and a compression preset sub-model.
  • FIG. 8d it shows a first preset model 800, a second preset model 810, a third preset model 830, and the estimated preset sub-model 801 contained in the first preset model 800, the compression preset Submodel 802 and channel generation preset submodel 803 .
  • the input-output relationship between the preset sub-models 803 may be: the input information of the estimated preset sub-model 801 is the first training sample 920; the output information of the estimated preset sub-model 801 is used as the channel generation preset sub-model The input information of 803; the output information of the channel generation preset submodel 803 is used as the input information of the compression preset submodel 802; the output information of the compression preset submodel 802 is used as the input of the third preset model 830 information; the output information of the third preset model 830 is used as the input information of the second preset model 810 .
  • the terminal device uses training samples to jointly train the first preset model, the second preset model and the third preset model, including:
  • the terminal device inputs the first training sample into the estimated preset sub-model of the first preset model, and obtains initial information output by the estimated preset sub-model;
  • the specific description about the first training sample is the same as any one of the foregoing case 1, case 2, and case 3, so no repeated description is given.
  • the specific function of the estimated preset sub-model of the first preset model is the same as any one of the foregoing case 1, case 2, and case 3.
  • a function of the channel generation preset submodel may be to perform eigendecomposition on the initial information to obtain eigenvector information of the initial information.
  • the eigenvector information of the initial information may include R groups of eigenvector sequences.
  • the method of performing eigendecomposition on the initial information may adopt a method of Singular Value Decomposition (SVD, Singular Value Decomposition).
  • the compressed preset sub-model of the first preset model compresses the data volume of the input information.
  • the compression rate between the output information of the compressed sub-model and the input information can be determined during training, for example, the compression rate can be 5/1000, 2/1000, 10%, etc. lift.
  • the function of the third preset model is to simulate the wireless channel environment, and the specific processing may be to perform data transformation on input information to obtain information after data transformation as output information.
  • the specific processing method of the data transformation may include convolution processing or data processing equivalent to convolution; wherein, the data processing equivalent to convolution may be multiple Fourier transform processing, for example, may In order to transform the input information of the third preset model into the frequency domain through Fourier transform, multiply and then transform into the time domain through inverse Fourier transform, the process of convolution in the time domain is equivalent.
  • the function of the second preset model may be to decompress its input information.
  • the input information of the second preset model is transformed feature vector information, and the output of the second preset model is restored feature vector information.
  • the decompression rate of the second preset model should make the obtained restored feature vector information and the feature vector information of the initial information contain close to or the same data.
  • the performing reverse conduction update based on the degree of difference determined by the first loss function to update the first preset model, the second preset model, and the third preset model may specifically refer to: based on the first The degree of difference determined by a loss function is used to perform reverse conduction to update the model parameters of the estimated preset sub-model, the model parameters of the channel generation preset sub-model, the model parameters of the compressed preset sub-model, the The model parameters of the second preset model and the model parameters of the third preset model.
  • the scenario where the first loss function is used for joint training is described above.
  • the scenario where the second loss function is used for joint training can also be provided, as follows:
  • the loss function used in the training is a second loss function; the second loss function is based on the difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model The first difference degree of the first preset model and the second difference degree between the output information of the estimated preset sub-model of the first preset model and the second training sample; wherein, the second training sample and the input of the estimated Corresponds to the first training sample of the preset sub-model.
  • the first degree of difference is determined based on a distance, or is determined based on a degree of similarity; and/or, the second degree of difference is determined based on a distance, or is determined based on a degree of similarity.
  • the specific calculation method for determining the first degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model based on the distance can use a mean square error (MSE, Mean Squared Error) or normalized mean square error (NMSE), etc., this embodiment is not exhaustive.
  • MSE mean square error
  • NMSE normalized mean square error
  • the output information of the second preset model may be a matrix
  • the input information of the compressed preset sub-model may also be a matrix
  • the output matrix of the second preset model It is called matrix 3
  • the matrix of the input of the compressed preset submodel is called matrix 4
  • the output information of the second preset model and the compressed preset submodel of the first preset model are determined based on the distance
  • the way of the degree of difference between the input information is the MSE way, for example: calculate the difference between matrix 3 and matrix 4, and use the square of the difference as the degree of difference.
  • the specific calculation method for determining the first degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model based on the degree of similarity may use cosine similarity or cosine similarity Degree square and other methods are not exhaustive in this embodiment.
  • the output information of the second preset model may be R sets of feature vector sequence information
  • the input information of the compressed preset sub-model may also be R sets of feature vector sequence information.
  • the The R group of feature vector sequence information output by the second preset model is called feature vector sequence 3
  • the R group of feature vector sequence information input by the compressed preset sub-model is called feature vector sequence 4.
  • the method of determining the degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model based on the degree of similarity may be cosine similarity, for example: feature vector sequence 3 and the cosine angle of the eigenvector sequence 4 to determine the degree of similarity, and use the degree of similarity as the degree of difference.
  • the specific calculation method for determining the second degree of difference between the output information of the estimated preset sub-model of the first preset model based on the distance and the second training sample can use mean square error (MSE, Mean Squared Error) or normalization
  • MSE mean square error
  • NMSE normalized mean square error
  • the output information of the estimated preset sub-model may be a matrix, and correspondingly, the second training sample may also be a matrix.
  • the output matrix of the estimated preset sub-model is called matrix 5
  • the matrix of the input of the compressed preset sub-model is called matrix 6
  • the second degree of difference between the output information of the estimated preset sub-model of the first preset model and the second training sample is determined based on the distance In the MSE mode, for example: calculate the difference between matrix 5 and matrix 6, and use the square of the difference as the degree of difference.
  • the specific calculation method can use cosine similarity or cosine similarity squared, etc.
  • the methods are not exhaustive in this embodiment.
  • the method of connection may be to add the weights of the first degree of difference and the second degree of difference, for example, the two each account for 50%; or , the joint method can be the addition of unequal weights between the first difference degree and the second difference degree.
  • the difference before and after the compression and recovery between the two can be assigned a greater weight, or the above-mentioned second degree of difference can be assigned a larger weight, that is, the accuracy of the output information of the above-mentioned estimated preset sub-model is assigned a larger weight; or, its combination
  • the method can be in the form of multiplying the first degree of difference and the second degree of difference; or the joint method can be that the first degree of difference and the second degree of difference can be calculated by cross entropy, such as p1*log(first degree of difference)+p2*log (the second degree of difference), where both p1 and p2 can be set according to actual conditions, and are not limited here.
  • the terminal device uses training samples to jointly train the first preset model and the second preset model to obtain the trained first model and the second model; wherein, the first preset model includes estimated preset sub-models Model and compression preset submodels.
  • composition of each model and the input-output relationship between each model are the same as the previous case 1.
  • FIG. 8 a please refer to FIG. 8 a , which will not be repeated here.
  • the terminal device uses training samples to jointly train the first preset model and the second preset model, including:
  • the terminal device inputs the first training sample into the estimated preset sub-model of the first preset model, and obtains initial information output by the estimated preset sub-model;
  • the first training samples may be reference signal samples.
  • the reference signal samples may be original reference signals or processed reference signals obtained through historical acquisition. More specifically, the reference signal samples may be downlink reference signal samples. It should be understood that this embodiment does not limit that the first training samples must be the downlink reference signal samples, and uplink reference signal samples or other reference signal samples may also be used, which are not exhaustive in this embodiment.
  • the information input into the estimated preset sub-model of the first preset model may also be wireless channel or other scene-related information, for example, may include at least one of the following: Channel signal-to-noise ratio, signal-to-interference-noise ratio, channel type, bandwidth information, delay information, etc. Whether one or more of the above information is input during the joint training process may be determined according to actual conditions or actual scenarios, and is not limited here.
  • the specific function of the estimation preset sub-model of the first preset model may be: perform channel estimation based on the first training samples to obtain initial information.
  • the channel estimation may adopt algorithms such as minimum mean square error (MMSE).
  • the aforementioned initial information may be a matrix, and the dimension of the matrix is not limited here, and may be a matrix of two or more dimensions.
  • the value at each position in the matrix is used to represent the corresponding channel quality at the corresponding granularity corresponding to multiple dimensions.
  • the channel quality can be characterized by a signal strength value; the unit of the signal strength value can be decibel milliwatt (dBm, decibel relative to one milliwatt"), or the signal strength value has no unit but is normalized values obtained afterwards.
  • the compressed preset sub-model of the first preset model compresses the data volume of the input information.
  • the compression rate between the output information of the compressed sub-model and the input information can be determined during training, for example, the compression rate can be 5/1000, 2/1000, 10%, etc. lift.
  • the compressed information obtained by compressing the preset sub-model contains less data than the input initial information.
  • the form of the above-mentioned compressed information is the same as that of the initial information, for example, the initial information is a matrix, and the corresponding compressed information is also a matrix, and the matrix dimensions of the initial information and the compressed information are the same but the amount of data is different.
  • the function of the second preset model may be to decompress its input information.
  • the decompression rate of the second preset model should make the obtained restored information contain the same data as the original information.
  • the performing reverse conduction based on the second loss function to update the first preset model and the second preset model may specifically refer to: performing reverse conduction based on the second loss function to update the estimated preset sub-model , the model parameters of the compressed preset sub-model and the model parameters of the second preset model.
  • the way of the above-mentioned training convergence can include at least one of the following: judging whether the number of iterative training reaches the preset number of times, judging whether the first difference degree is less than the first preset threshold value, judging the second difference Whether the degree is smaller than the second preset threshold value.
  • the preset times, the first preset threshold value and the second preset threshold value can be set according to actual conditions, and are not exhaustive. That is to say, when it is determined that the training is completed based on the above manner, the first preset model after the training can be used as the first model, and the second preset model after the training can be used as the second model.
  • Case 6 is different from Case 5 in that the first preset model includes an estimation preset sub-model, a preset channel generation sub-model, and a compression preset sub-model.
  • composition of each model in this case and the input-output relationship between each model are the same as those in the second case, for details, please refer to FIG. 8 b , which will not be repeated here.
  • the terminal device uses training samples to jointly train the first preset model and the second preset model, including:
  • the terminal device inputs the first training sample into the estimated preset sub-model of the first preset model, and obtains initial information output by the estimated preset sub-model;
  • the second training sample corresponds to the first training sample
  • the information input into the estimated preset sub-model of the first preset model may also include other information related to wireless channels or scenes, for example, may include at least one of the following: Signal-to-noise ratio, signal-to-interference-noise ratio, channel type, bandwidth information, delay information, etc. Whether one or more of the above information is input during the joint training process may be determined according to actual conditions or actual scenarios, and is not limited here.
  • the specific function of the estimation preset sub-model of the first preset model is the same as the fifth case above.
  • a function of the channel generation preset submodel may be to perform eigendecomposition on the initial information to obtain eigenvector information of the initial information.
  • the eigenvector information of the initial information may include R groups of eigenvector sequences.
  • the method of performing eigendecomposition on the initial information may adopt a method of Singular Value Decomposition (SVD, Singular Value Decomposition).
  • the compressed preset sub-model of the first preset model compresses the data volume of the input information.
  • the compression rate between the output information of the compressed sub-model and the input information can be determined during training, for example, the compression rate can be 5/1000, 2/1000, 10%, etc. lift.
  • the compressed eigenvector information obtained by compressing the preset sub-model contains less data than the eigenvector information of the input initial information.
  • the above compressed feature vector information is in the same form as the feature vector information of the initial information.
  • the feature vector information of the initial information is a sequence of R groups of feature vectors
  • the compressed feature vector information is also a sequence of feature vectors of groups R but The amount of data contained in the two is different.
  • the function of the second preset model may be to decompress its input information.
  • the input information of the second preset model is compressed feature vector information
  • the output of the second preset model is restored feature vector information.
  • the decompression rate of the second preset model should be such that the obtained restored feature vector information contains the same or substantially the same data as the feature vector information of the initial information.
  • Performing reverse conduction according to the second loss function to update the first preset model and the second preset model may specifically refer to: performing reverse conduction based on the second loss function to update the estimated preset sub-model model parameters of the channel generation preset sub-model, model parameters of the compression preset sub-model and model parameters of the second preset model.
  • the method for determining the above-mentioned training convergence is the same as that of the above-mentioned case five, and repeated explanations are not repeated.
  • the terminal device uses training samples to jointly train the first preset model and the second preset model to obtain the trained first model and the second model, including:
  • the terminal device uses training samples to jointly train the first preset model, the second preset model and the third preset model to obtain the trained first model, the second model and the third model; wherein , the third model is a trained third preset model.
  • the terminal device uses training samples to jointly train the first preset model, the second preset model and the third preset model, including:
  • the terminal device inputs the first training sample into the estimated preset sub-model of the first preset model, and obtains initial information output by the estimated preset sub-model;
  • composition of each model in this case and the input-output relationship between each model are the same as those in the third case above, which can be referred to FIG. 8c, and repeated explanations are not repeated here.
  • the specific description about the first training sample is the same as the foregoing case five or six, so no repeated description is given.
  • estimation preset sub-model of the first preset model and the compression preset sub-model of the first preset model are the same as those of the fifth case above, so repeated descriptions will not be made.
  • a third preset model is added relative to case five.
  • the function of the third preset model is to simulate the channel environment, and the specific processing can be to perform data transformation on input information to obtain data transformed information as output information .
  • the specific processing method of the data transformation may include convolution processing or data processing equivalent to convolution; wherein, the data processing equivalent to convolution may be multiple Fourier transform processing, for example, may In order to transform the input information of the third preset model into the frequency domain through Fourier transform, multiply and then transform into the time domain through inverse Fourier transform, the process of convolution in the time domain is equivalent.
  • the function of the second preset model may be to decompress its input information.
  • the performing reverse conduction update based on the degree of difference determined by the second loss function to update the first preset model, the second preset model, and the third preset model may specifically refer to: based on the first
  • the second loss function performs reverse conduction to update the model parameters of the estimated preset sub-model, the model parameters of the compressed preset sub-model, the model parameters of the second preset model, and the model of the third preset model parameter.
  • Case 8 is different from the above case 7 in that the first preset model includes an estimation preset submodel, a preset channel generation submodel, and a compression preset submodel.
  • the terminal device uses training samples to jointly train the first preset model, the second preset model and the third preset model, including:
  • the terminal device inputs the first training sample into the estimated preset sub-model of the first preset model, and obtains initial information output by the estimated preset sub-model;
  • the second training sample corresponds to the first training sample
  • composition of each model and the input-output relationship between each model are the same as the foregoing case 4, which can be referred to FIG. 8d , and repeated descriptions are not repeated here.
  • the specific description about the first training sample is the same as any one of the above-mentioned case 5, case 6, and case 7, so the description will not be repeated.
  • the specific function of the estimated preset sub-model of the first preset model is the same as any one of the fifth, sixth, and seventh cases.
  • a function of the channel generation preset submodel may be to perform eigendecomposition on the initial information to obtain eigenvector information of the initial information.
  • the eigenvector information of the initial information may include R groups of eigenvector sequences.
  • the method of performing eigendecomposition on the initial information may adopt a method of Singular Value Decomposition (SVD, Singular Value Decomposition).
  • the compressed preset sub-model of the first preset model compresses the data volume of the input information.
  • the compression rate between the output information of the compressed sub-model and the input information can be determined during training, for example, the compression rate can be 5/1000, 2/1000, 10%, etc. lift.
  • the function of the third preset model is to simulate the wireless channel environment, and the specific processing may be to perform data transformation on input information to obtain information after data transformation as output information.
  • the specific processing method of the data transformation may include convolution processing or data processing equivalent to convolution; wherein, the data processing equivalent to convolution may be multiple Fourier transform processing, for example, may In order to transform the input information of the third preset model into the frequency domain through Fourier transform, multiply and then transform into the time domain through inverse Fourier transform, the process of convolution in the time domain is equivalent.
  • the function of the second preset model may be to decompress its input information.
  • the input information of the second preset model is transformed feature vector information, and the output of the second preset model is restored feature vector information.
  • the decompression rate of the second preset model should be such that the obtained restored feature vector information and the feature vector information of the initial information contain data that are close to or identical.
  • the performing reverse conduction update based on the degree of difference determined by the first loss function to update the first preset model, the second preset model, and the third preset model may specifically refer to: based on the first The degree of difference determined by a loss function is used to perform reverse conduction to update the model parameters of the estimated preset sub-model, the model parameters of the channel generation preset sub-model, the model parameters of the compressed preset sub-model, the The model parameters of the second preset model and the model parameters of the third preset model.
  • the terminal device may obtain the first model and the second model after its own joint training, or obtain the first model, the second model and the third model after joint training by adopting the above second method. Furthermore, the above-mentioned processing of S610 to S620 may be performed.
  • the training samples are used in the processing of the joint training of the terminal device itself to obtain the first model and the second model, and the joint training of the terminal device itself to obtain the first model, the second model and the third model provided by the second method above. , the following is a detailed description of the training samples:
  • the training samples may include a first training sample.
  • the first training samples may be reference signal samples.
  • the reference signal samples may be original reference signals or processed reference signals obtained through historical acquisition.
  • the original reference signal may refer to a reference signal that has not been transmitted through a wireless channel.
  • the method for acquiring and processing the reference signal may include: using the reference signal received after the original reference signal passes through the wireless channel (or the real wireless channel, or the real wireless channel) as the processed reference signal.
  • the method for obtaining a processed reference signal may include: using a reference signal received after the original reference signal passes through a simulated wireless channel as a processed reference signal.
  • the original reference signal may be a downlink reference signal or an uplink reference signal.
  • the first training samples are distributed in the first dimension and/or the second dimension.
  • the first dimension is a time domain dimension; the first training samples include first information samples distributed in m time units in the time domain dimension; m is a positive integer.
  • first information samples may be distributed in each of the m time units, where n is a positive integer.
  • Each time unit may include at least one time slot, or at least one symbol (such as an OFDM symbol).
  • the first information sample is a downlink reference signal sample
  • the number of time slots contained in each time unit may be c (c is a positive integer)
  • n downlink reference signals in each c time slot A sample, combination of c and n can be e.g. (1,1)(1,2)(1,3)(1,4)(1,6)(2,1)(4,1)(5,1) (8,1)(10,1).
  • the second dimension is a frequency domain dimension; the first training samples include first information samples distributed on x frequency domain resources in the frequency domain dimension; x is a positive integer.
  • y first information samples may be distributed in each of the x frequency domain resources, and y is a positive integer.
  • Each frequency domain resource may include at least one resource block (RB), or at least one subcarrier.
  • the first information sample is a downlink reference signal sample
  • the number of time slots contained in each frequency domain resource may be d (d is a positive integer)
  • d is a positive integer
  • y time slots in every d RBs in the frequency domain
  • the combination of d and y can be, for example, (1,1)(1,2)(2,1)(1,3)(3,1)(1,4)(4,1)(1 ,6)(6,1).
  • the above-mentioned first training samples are distributed in the first dimension and/or the second dimension. It can be understood that the subsequent training can be performed only according to the distribution of the first training samples in the frequency domain dimension, or only based on the distribution of the first training samples in the frequency domain. The subsequent training may be performed according to the distribution of the first training sample in the frequency domain and the time domain. For example, a first training sample contains 10 RBs in the frequency domain dimension and 1 time slot in the time domain dimension, each RB has 3 first signal samples, and each time slot has 1 first signal samples, the first training samples include a total of 30 first signal samples.
  • the sizes of the first dimension and the second dimension, the time domain dimension and the frequency domain dimension may be equal or unequal.
  • the above-mentioned time-domain dimension and frequency-domain dimension can also be combined into one dimension. Specifically, the combination can be the time-domain dimension first and then the frequency-domain dimension, or the frequency-domain dimension first and then the time-domain dimension, which is not implemented in this embodiment limited.
  • the solution provided by this embodiment can be based on the above-mentioned first dimension and second dimension.
  • Increase the presentation form of complex numbers or it can be understood as adding a dimension, which is caused by the independent presentation of the imaginary part and real part data of the original reference signal or the processed reference signal
  • the first training The samples are also distributed in the third dimension.
  • the third dimension is a complex dimension; the first training samples include the real part of the first information sample and the imaginary part of the first information sample.
  • each first training sample contains 1 time unit (such as 1 time slot) in the time domain dimension, and contains 10 frequency domain resources (such as 10 RBs) in the frequency domain dimension
  • each first The information sample can be expressed as a real part and an imaginary part, and the first training sample can be a 1 ⁇ 10 ⁇ 2 matrix.
  • the training samples also include a second training sample corresponding to the first training sample; the second training sample is composed of a matrix of T dimensions; T is an integer greater than or equal to 2.
  • the second training samples may be used to characterize the expected channel quality based on the first training samples, or channel response, or channel state, or channel estimation results, or channel information .
  • the T dimensions include a fourth dimension and a fifth dimension.
  • the matrix of the T dimensions may specifically be a two-dimensional matrix of M ⁇ N; wherein, M represents the number of first granularities in the fourth dimension, and N represents the number of second granularities in the fifth dimension; M and N are all positive integers.
  • a second training sample consists of a two-dimensional matrix with a size of M ⁇ N, which has M first granularities in the fourth dimension and N second granularities in the fifth dimension; the above M and N May or may not be equal.
  • the specific numerical indication in the two-dimensional matrix represents the received signal strength at a certain first granularity of the channel quality.
  • the specific numerical value in the two-dimensional matrix here may refer to the signal strength value, and its unit may be dBm, or There is no unit but the value obtained after normalization.
  • the two-dimensional matrix of M ⁇ N can also be synthesized into one-dimensional data of size 1 ⁇ (M ⁇ N) or (M ⁇ N) ⁇ 1.
  • the specific transformation can be the fourth dimension first and then the fifth dimension. It may also be the fifth dimension first and then the fourth dimension, which is not limited in this embodiment.
  • the fourth dimension is a frequency domain dimension; the first granularity includes one of the following: L1 resource blocks RB, L2 subcarriers; L1 and L2 are positive integers.
  • the fourth dimension is a time domain dimension; the first granularity includes one of the following: K1 microseconds, K2 symbol length, and K3 symbol sampling points; K1, K2, and K3 are positive integers.
  • the symbols are Orthogonal Frequency Division Multiplexing (OFDM, Orthogonal Frequency Division Multiplexing).
  • OFDM Orthogonal Frequency Division Multiplexing
  • the second training sample may be a channel information sample corresponding to the reference signal sample, or may also be called a channel state sample Wait, I'm not going to exhaust the names here.
  • the first granularity can be L1 RBs (L1 is greater than or equal to 1, such as 2RB, 4RB, 8RB), and the distribution range of a second training sample in the frequency domain dimension is M ⁇ L1
  • the frequency domain range corresponding to each RB; or the first granularity can be L2 subcarriers (L2 is greater than 1, such as 4 subcarriers, 6 subcarriers, and 18 subcarriers), then the distribution of a second training sample on the frequency domain dimension is the frequency domain range corresponding to M ⁇ L2 subcarriers.
  • the first granularity may be a delay granularity, for example, a first granularity is the number of sampling points of K1 microseconds, or K2 symbol lengths, or K3 symbols, where the symbols It can be an OFDM symbol;
  • the fourth dimension is the time domain dimension and the first granularity is K1 microseconds
  • the distribution range of a second training sample in the time domain dimension is the time domain range corresponding to M ⁇ K1 microseconds .
  • the fifth dimension is a space domain dimension; correspondingly, the second granularity is an interval between a pair of transmitting and receiving antennas or an angle of arrival.
  • the fifth dimension is the space domain dimension, specifically, the antenna dimension, for example, the fifth dimension is composed of N antenna pairs, and correspondingly, the second granularity is a pair of transmitting and receiving antennas.
  • the fifth dimension is a space domain dimension, specifically an angle domain dimension, for example, the fifth dimension is composed of N arrival angles, and the second granularity is the interval between the above N arrival angles.
  • the value at the ijth position in the two-dimensional matrix is used to represent the channel quality at the i-th first granularity in the fourth dimension and the j-th second granularity in the fifth dimension; Both i and j are positive integers. That is to say, in the case of using a first training sample, the value (or referred to as an indicator value) at a certain position in the two-dimensional matrix used to represent the second training sample represents the The expected channel quality situation under such a combination of five dimensions.
  • the channel quality or the channel quality situation can be characterized by signal strength, and the unit of the value (or indicator value) can be dBm, or there is no unit but a value obtained after normalization.
  • the fourth dimension represents the frequency domain dimension
  • the fifth dimension is the space domain dimension, specifically the antenna dimension
  • the first granularity is 2RB
  • the second granularity is 1
  • the value (or indicator value) at this position can be used to represent the channel quality (or channel quality situation) on the third 2RB bandwidth (that is, the fifth RB to the sixth RB) on the sixth pair of transceiver antennas ).
  • S may also be used to represent the number of second training samples, and S may be an integer greater than or equal to 1, that is, the second training samples may include one or more.
  • S may also be used to represent the number of second training samples, and S may be an integer greater than or equal to 1, that is, the second training samples may include one or more.
  • the T dimensions also include a sixth dimension.
  • the matrix of T dimensions is a three-dimensional matrix of M ⁇ N ⁇ W; wherein, M represents the number of first granularities in the fourth dimension, and N represents the number of second granularities in the fifth dimension, W represents the quantity of the third granularity under the sixth dimension; M, N and W are all positive integers.
  • the value of the ijkth position in the three-dimensional matrix is used to represent the i-th first granularity in the fourth dimension, the j-th second granularity in the fifth dimension, and the k-th in the sixth dimension channel quality corresponding to the first training sample at a third granularity; i, j and k are all positive integers.
  • the sixth dimension may be a complex dimension.
  • the second training samples can be used to characterize the expected channel quality based on the first training samples (or called channel response, or called channel state, or called channel estimation result, or called channel information), and the above-mentioned channel quality can also be presented by a complex number, so a sixth dimension, that is, a complex number dimension, can be added on the basis of the above two dimensions of the second training sample, and the complex number dimension is the second training sample.
  • the imaginary and real parts of the channel quality in the samples are presented independently generated.
  • the sixth dimension is a complex dimension; the third granularity is 1, and the quantity W of the third granularity under the complex dimension is 2.
  • the third granularity being 1 specifically refers to a real part or an imaginary part, and the number of the third granularity being 2 means that there may be two third granularities in the complex dimension.
  • the value of the ijkth position in the three-dimensional matrix is used to represent the i-th first granularity in the fourth dimension and the j-th granularity in the fifth dimension The real part of the channel quality at the second granularity;
  • the value of the ijkth position in the three-dimensional matrix is used to represent the i-th first granularity in the fourth dimension and the j-th granularity in the fifth dimension The imaginary part of the channel quality at the second granularity.
  • the first value is different from the second value, for example, the first value can be set to 1 and the second value can be 2, or the first value can be 0 and the second value can be 1, or the first value It can be 1 and the second value can be 0, as long as the first value is different from the second value, it is within the protection scope of this embodiment.
  • the above-mentioned second training samples can also be split and combined on the basis of the above-mentioned fourth dimension, fifth dimension, and sixth dimension.
  • the fifth dimension is an antenna pair dimension
  • the It can be split into sending antenna sub-dimensions and receiving antenna sub-dimensions, thereby expanding the dimension of the second training sample.
  • This embodiment does not exhaustively enumerate various possible sub-dimensions after splitting.
  • the terminal device itself performs joint training on the first preset model and the second preset model to obtain the trained first model and the second model; or the terminal device itself trains the first preset model
  • the model, the second preset model and the third preset module are jointly trained to obtain the trained first model, the second model and the third model.
  • the terminal device can at least send the trained second model.
  • the following is an example of how the terminal device sends the model:
  • the terminal device itself performs joint training on the first preset model and the second preset model to obtain the trained first model and the second model. After the foregoing training is completed, the terminal device sends the second model. Specifically, it may be: the terminal device sends the second model to the network device. Still further, it may also be: the terminal device sends the model parameters of the second model to the network device.
  • the network device may be a network device that provides services for the terminal device, such as an access network device, and specifically may be a base station, eNB, gNB, and the like.
  • the second model (or model parameters of the second model) is carried by at least one of the following: uplink control signaling, RRC message, uplink data transmission, and uplink data transmission for the transmission requirements of artificial intelligence services.
  • the terminal device may retain the first model itself for processing the first information to obtain the second information; correspondingly, since the network device can receive the second model sent by the terminal device, Therefore, the network device may process the second information based on the second model to obtain channel information.
  • the channel information may also be feature vector information of the channel information.
  • the network device can store the second models sent by the multiple terminal devices.
  • base station 1 can serve three mobile phones, namely mobile phone 1, mobile phone 2 and mobile phone 3. of the second model.
  • the base station 1 receives the second information sent by the mobile phone 2, the base station 1 can process the second information of the mobile phone 2 based on the second model sent by the mobile phone 2 to obtain the channel information corresponding to the mobile phone 2.
  • the terminal device itself performs joint training on the first preset model and the second preset model to obtain the trained first model and the second model. After the above training is completed, on the basis that the terminal device sends the second model, the method may further include: the terminal device also sends the first model.
  • the terminal device sends the first model to the network device. Still further, it may also be: the terminal device sends the model parameters of the first model to the network device.
  • the network device may be a network device that provides services for the terminal device, such as an access network device, and specifically may be a base station, eNB, gNB, and the like.
  • the first model (or model parameters of the first model) is carried by at least one of the following: uplink control signaling, RRC message, uplink data transmission, and uplink data transmission for the transmission requirements of artificial intelligence services.
  • first model and the second model may be sent at the same time, or the above-mentioned first model and the second model may be sent separately, which is not limited in this embodiment.
  • the terminal device can process the first information through the first model to obtain the second information; correspondingly, since the network device can receive the second model sent by the terminal device, the The network device may process the second information based on the second model to obtain channel information.
  • the channel information may also be feature vector information of the channel information.
  • the network device can conduct an overall evaluation of the first model and the second model, and after completing the overall evaluation of the first model and the second model After the evaluation, you can decide whether to use the first model and the second model received this time. If the overall evaluation result is poor (for example, the compression rate is low or the accuracy of the restored channel information is low, etc.), the above first model may not be used. A model and a second model. If the network device decides not to use the above-mentioned first model and the second model, it can also re-train the first model and the second model to update the model parameters of the first model and the second model, or the network device trains itself to obtain New first model as well as second model.
  • the network device jointly trains the first model and the second model again, or updates the first model and the second model, the network device also needs to send the new first model and the second model to the terminal device , or the network device sends the new first model to the terminal device.
  • the network device can also save the first model and the second model sent by the multiple terminal devices. Furthermore, the network device can conduct an overall evaluation of the first model and the second model sent by each terminal device, and can select the target first model with the best overall evaluation result and its corresponding target second model, and then the network device can itself The target second model is reserved, and the target first model is sent to the above-mentioned multiple terminal devices. Whether the overall evaluation result is optimal can be judged by indicators such as compression ratio and recovery accuracy.
  • the network device in this example saves the first model and the second model sent by multiple terminal devices, it does not process the first model and the second model sent by each terminal device, but only After receiving the second information sent by any terminal device, the second information is processed based on the second model of the terminal device that sent the second information.
  • the terminal device itself performs joint training on the first preset model and the second preset model to obtain the trained first model and the second model.
  • the difference between this example and Example 2 is that after the above training is completed, on the basis of the terminal device sending the second model, the terminal device can send the estimation sub-model and the compression sub-model in the first model Model.
  • the terminal device may send the estimated sub-model and the compressed sub-model in the first model at the same time; or, the terminal device may send the estimated sub-model and the compressed sub-model in the first model respectively.
  • the terminal device may send the estimated sub-model and the compressed sub-model in the first model to the network device at the same time; or, the terminal device may send the estimated sub-model in the first model to the network device respectively.
  • Submodels and compressed submodels are examples of the terminal device.
  • the terminal device may send the model parameters of the estimated sub-model and the model parameters of the compressed sub-model in the first model to the network device at the same time; or, the terminal device may send the first model parameters to the network device respectively.
  • Model parameters for the estimated submodel and model parameters for the compressed submodel in the model may be sent to the network device at the same time; or, the terminal device may send the first model parameters to the network device respectively.
  • the estimation sub-model and the compression sub-model may be simultaneously carried by at least one of the following: uplink control signaling, RRC message, uplink data transmission, uplink data transmission for artificial intelligence business type transmission requirements;
  • the estimation sub-model and the compression sub-model may be respectively carried by at least one of the following: uplink control signaling, RRC message, uplink data transmission, and uplink data transmission for the transmission requirements of artificial intelligence services.
  • the terminal device can process the first information through the estimation sub-model and the compression sub-model in the first model to obtain the second information; correspondingly, the network device can receive the A second model. Therefore, the network device may process the second information based on the second model to obtain channel information.
  • the channel information may also be feature vector information of the channel information.
  • the network device may integrate the estimated sub-model, the compressed sub-model and the second model Evaluation, after completing the overall evaluation of the estimated sub-model, compressed sub-model and the second model, you can decide whether to use the estimated sub-model, compressed sub-model and the second model received this time, if the overall evaluation results are poor (such as compressed rate is low or the accuracy of recovering channel information is low, etc.), the estimation sub-model, the compression sub-model and the second model may not be used.
  • the network device decides not to use the above estimation sub-model, compression sub-model and second model, it can also re-train the estimation sub-model, compression sub-model and second model by itself to update the estimation sub-model, compression sub-model and second
  • the model parameters of the model, or the network device trains itself to obtain a new estimated sub-model, a compressed sub-model and a second model.
  • the network device jointly trains or updates the estimated sub-model, the compressed sub-model and the second model, the network device also needs to send the new estimated sub-model, the new compressed sub-model and the new second model to The terminal device or the network device sends the new estimated sub-model and the new compressed sub-model to the terminal device.
  • the network device may store the estimated sub-model, the compressed sub-model and the second model sent by the multiple terminal devices. Furthermore, the network device can conduct an overall evaluation of the estimation sub-model, compression sub-model and second model sent by each terminal device, and can select the target estimation sub-model, target compression sub-model and their corresponding target sub-models with the best overall evaluation results. The second model, and then the network device can reserve and use the target second model, and send the target estimation sub-model and the target compression sub-model to the above-mentioned multiple terminal devices. Whether the overall evaluation result is optimal can be judged by indicators such as compression ratio and recovery accuracy.
  • the network device in this example saves all the models sent by multiple terminal devices, it does not process the models sent by each terminal device, but only receives the first model sent by any terminal device. After receiving the second information, process the second information based on the second model of the terminal device that sent the second information.
  • the terminal device itself performs joint training on the first preset model and the second preset model to obtain the trained first model and the second model; wherein, the first model includes an estimation sub-model, a compression sub-model and a channel generation sub-model .
  • the difference between this example and Example 3 is that after the above training is completed, on the basis of the terminal device sending the second model, the terminal device can send the estimation sub-model, compression sub-model in the first model model and the channel generation submodel.
  • the terminal device may simultaneously send the estimation sub-model, the compression sub-model and the channel generation sub-model in the first model; or, the terminal device may separately send the estimation sub-model, the compression sub-model and the channel generation sub-model submodel.
  • the terminal device may send the estimation sub-model, the compression sub-model and the channel generation sub-model in the first model to the network device at the same time; or, the terminal device may send the first model to the network device respectively.
  • the estimation sub-model, compression sub-model and channel generation sub-model may be simultaneously carried by at least one of the following: uplink control signaling, RRC message, uplink data transmission, uplink data transmission for artificial intelligence service class transmission requirements;
  • the estimation sub-model, compression sub-model and channel generation sub-model may be respectively carried by at least one of the following: uplink control signaling, RRC message, uplink data transmission, and uplink data transmission for the transmission requirements of artificial intelligence services.
  • any two of the estimation sub-model, compression sub-model and channel generation sub-model may be simultaneously carried by at least one of the following: uplink control signaling, RRC messages, uplink data transmission, and uplink transmission requirements for artificial intelligence services data transmission.
  • the remaining sub-model can be carried by at least one of the following: uplink control signaling, RRC message, uplink data transmission, and uplink data transmission for the transmission requirements of artificial intelligence services.
  • the terminal device can process the first information through the estimation sub-model, compression sub-model and channel generation sub-model in the first model to obtain the second information; correspondingly, the network device can receive The second model sent by the terminal device, therefore, the network device may process the second information based on the second model to obtain channel information.
  • the channel information may specifically be: channel information, or may be eigenvector information of the channel information.
  • the network device may perform the estimated sub-model, the compressed sub-model and the second model.
  • the channel generation sub-model and the second model perform an overall evaluation, and the processing after the overall evaluation is similar to the third example above, and will not be repeated here.
  • the terminal device itself performs joint training on the first preset model, the second preset model and the third preset model to obtain the trained first model, the second model and the third model. After the above training is completed, on the basis that the terminal device sends the second model and the first model, the terminal device may send the third model.
  • the terminal device may send the first model, the second model and the third model to the network device at the same time; or, the terminal device may send the first model and the second model to the network device respectively and the third model; or, the terminal device may first send any two of the first model, the second model, and the third model to the network device, and then send the remaining one model to the network device.
  • the first model, the second model, and the third model may be simultaneously carried by at least one of the following: uplink control signaling, RRC messages, uplink data transmission, and uplink data transmission for the transmission requirements of artificial intelligence services;
  • the first model, the second model and the third model may be respectively carried by at least one of the following: uplink control signaling, RRC message, uplink data transmission, uplink data transmission for artificial intelligence service class transmission requirements;
  • any two of the first model, the second model, the third model, and the remaining one model are respectively carried by at least one of the following: uplink control signaling, RRC messages, uplink data transmission, and artificial intelligence services Uplink data transmission for class transmission requirements.
  • the first model may include an estimation sub-model and a compression sub-model.
  • sending the first model may refer to simultaneously or separately sending the estimation sub-model and the compression sub-model.
  • the carrying manner of the estimation sub-model and the compression sub-model is the same as that of the previous example and will not be repeated here.
  • the first model may include an estimation submodel, a channel generation submodel, and a compression submodel.
  • sending the first model may refer to sending the estimation submodel, the The channel generation sub-model and the compression sub-model, the carrying manners of the estimation sub-model, the channel generation sub-model and the compression sub-model are the same as the previous examples and will not be repeated here.
  • the terminal device can process the first information through the estimation sub-model and the compression sub-model in the first model to obtain the second information; correspondingly, the network device can receive the A second model. Therefore, the network device may process the second information based on the second model to obtain channel information.
  • the channel information may also be feature vector information of the channel information.
  • the network device may perform an overall evaluation on the first model, the second model and the third model , after completing the overall evaluation of the first model, the second model and the third model, it may be decided whether to use the first model, the second model and the third model received this time, if the overall evaluation result is poor (For example, the compression rate is low or the accuracy rate of recovering channel information is low, etc.), the above-mentioned first model, second model and third model may not be used.
  • the network device decides not to use the above-mentioned first model, second model and third model, it can also re-train the first model, second model and third model by itself to update the estimation sub-model, compression sub-model model and model parameters of the second model, or the network device trains itself to obtain a new first model, a new second model, and a new third model. It should be pointed out that if the network device jointly trains or updates the estimation sub-model, the compression sub-model and the second model, the network device also needs to send the new first model, the new second model and the new third model to The terminal device or the network device sends the new first model to the terminal device.
  • the network device can store the first model, the second model and the third model sent by the multiple terminal devices. Furthermore, the network device can conduct an overall evaluation of the first model, the second model, and the third model sent by each terminal device, and can select the target first model, the target second model, and the target third model with the best overall evaluation results. , and then the network device can reserve and use the target second model, and send the target first model to the above-mentioned multiple terminal devices. Whether the overall evaluation result is optimal can be judged by indicators such as compression ratio and recovery accuracy.
  • the terminal device when the terminal device receives the first information, it can process the first information through the first model to obtain the second information and send it, so that the receiving end can use the second model to obtain the second information.
  • the channel information obtained by processing the information is obtained through joint training of the first model and the second model. Since the processing, transmission, and analysis of the second information are realized by using the first model and the second model obtained through joint training, the performance requirements in the entire information processing, transmission, and analysis can be taken into account, and the overall performance of the network is guaranteed.
  • the functions between the first model and the second model can be made compatible with each other, so that the performance of the first model and the second model can reach In a better state, when the processing, transmission and analysis process of the second information is processed as a whole based on the first model and the second model, the performance of the whole processing can be guaranteed, thereby ensuring the performance of the whole network.
  • Fig. 11 is a schematic flowchart of an information processing method 1100 according to an embodiment of the present application.
  • the method can optionally be applied to the system shown in Fig. 1, but is not limited thereto.
  • the method includes at least some of the following.
  • the network device sends first information.
  • the network device receives second information; wherein, the second information is obtained by processing the first information through a first model.
  • the network device processes the second information based on a second model to obtain channel information; wherein, the first model and the second model are obtained through joint training.
  • the first information may be a reference signal, specifically, the first information may be a reference signal of the current channel, such as a downlink reference signal of the current channel.
  • the downlink reference signal may include at least one of CSI-RS, DMRS, and PT-RS.
  • the first information may be distributed in the first dimension and/or the second dimension.
  • the first dimension is a time domain dimension; the first information is distributed in at least one time unit in the time domain dimension.
  • Each time unit in the at least one time unit may include one of the following: 1 time slot and 1 Orthogonal Frequency Division Multiplexing (OFDM, Orthogonal Frequency Division Multiplexing) symbol.
  • the first signal is a downlink reference signal, and the downlink reference signal may be distributed in one time slot in the time domain dimension, or the downlink reference signal may be distributed in two or on 4 time slots.
  • the second dimension is a frequency domain dimension; the first information is distributed on at least one frequency domain resource in the frequency domain dimension; wherein, each frequency domain resource can be one of the following: one RB, one subcarrier .
  • the first signal is a downlink reference signal, and the downlink reference signal may be distributed in 1 RB in the frequency domain dimension, or the downlink reference signal may be distributed in 2 or 4 RBs in the time domain dimension. on RBs.
  • the first dimension and the second dimension above can be used in combination, that is, the first information can be distributed on the first dimension and the second dimension; for example, the first information can be distributed on a RBs in the frequency domain dimension , distributed in b time slots in the time domain dimension; both a and b are positive integers.
  • the first information is a downlink reference signal, and the downlink reference signal may be distributed in 4 RBs in the frequency domain, and may be distributed in 6 time slots in the time domain dimension.
  • the first information can also be expressed as a complex number, that is, the first information is also distributed in the third dimension; the third dimension is a complex dimension; the first information includes the first information sample The real part of and the imaginary part of the first information sample.
  • the real part of the first information is distributed on the a RBs of the frequency domain resources and the b time slots of the time domain resources
  • the imaginary part of the first information is distributed on the a RBs of the frequency domain resources. b time slots of the time domain resource.
  • the network device may also send configuration information first, and the configuration information may be configured with first information for terminal device measurement.
  • the configuration information may be to configure the terminal device to measure SSB or CSI-RS and so on.
  • the network device executes S1120.
  • the second information may be carried by one of the following information: information included in the random access process, radio resource control (RRC, Radio Resource Control) signaling, and uplink control information (UCI, Uplink Control Information).
  • RRC Radio Resource Control
  • UCI Uplink Control Information
  • the information contained in the random access process includes one of the following: message A in the two-step random access process; Msg1 in the four-step random access process; Msg3 in the four-step random access process.
  • the second information in S1120 is channel compression information; the second model is used to decompress the channel compression information to obtain channel information.
  • the network device processes the second information based on the second model to obtain channel information, including: the network device inputs the channel compression information into the second model, and obtains the second model output The channel information of .
  • the second information is obtained by processing the first information through the first model. That is to say, the first model is used to process the input first information to obtain channel compression information.
  • the first model may also be called an encoding model or an encoding network, as long as the input information is the first information and the output information is the channel compression information, the model or neural network is within the protection scope of this embodiment.
  • the channel information may be used to characterize the channel quality, or channel response, or channel state, or channel estimation result obtained based on the first information.
  • the channel information may be represented by a matrix of T dimensions, where T is an integer greater than or equal to 2.
  • the matrix of the T dimensions may specifically be a two-dimensional matrix of M ⁇ N; wherein, M represents the number of first granularities in the fourth dimension, and N represents the number of second granularities in the fifth dimension; M and N are all positive integers. That is to say, the channel information may be composed of a two-dimensional matrix with a size of M ⁇ N, which has M first granularities in the fourth dimension and N second granularities in the fifth dimension; the above M and N May or may not be equal.
  • the specific numerical indication in the two-dimensional matrix represents the received signal strength at a certain first granularity of the channel quality, where the unit of the numerical value in the two-dimensional matrix may be dBm, or the numerical value in the two-dimensional matrix has no unit It is the value obtained after normalization.
  • the two-dimensional matrix of M ⁇ N can also be synthesized into one-dimensional data of size 1 ⁇ (M ⁇ N) or (M ⁇ N) ⁇ 1.
  • the specific transformation can be the fourth dimension first and then the fifth dimension. It may also be the fifth dimension first and then the fourth dimension, which is not limited in this embodiment.
  • the fourth dimension may be a frequency domain dimension; the first granularity includes one of the following: L1 resource blocks RB, L2 subcarriers; L1 and L2 are positive integers.
  • the fourth dimension may be a time-domain dimension; the first granularity includes one of the following: K1 microseconds, K2 symbol lengths, and the number of sampling points of K3 symbols; K1, K2, and K3 are positive integers .
  • the symbols are Orthogonal Frequency Division Multiplexing (OFDM, Orthogonal Frequency Division Multiplexing).
  • OFDM Orthogonal Frequency Division Multiplexing
  • the first granularity may be L1 RBs (L1 is greater than or equal to 1, such as 2RB, 4RB, 8RB), then the channel information in the frequency domain dimension
  • the distribution range is the frequency domain range corresponding to M ⁇ L1 RBs; or the first granularity can be L2 subcarriers (L2 is greater than 1, such as 4 subcarriers, 6 subcarriers, and 18 subcarriers), then the channel
  • the distribution of information in the frequency domain dimension is the frequency domain range corresponding to M ⁇ L2 subcarriers.
  • the first granularity may be a delay granularity, for example, a first granularity is the number of sampling points of K1 microseconds, or K2 symbol lengths, or K3 symbols, The symbol here may be an OFDM symbol; when the fourth dimension is the time domain dimension and the first granularity is K1 microseconds, the distribution range of the channel information on the time domain dimension is M ⁇ K1 The time domain range corresponding to microseconds.
  • the fifth dimension may be a space domain dimension; correspondingly, the second granularity is an interval between a pair of transmitting and receiving antennas or an angle of arrival. That is to say, the fifth dimension is the space domain dimension, specifically, it may be an antenna dimension, and the second granularity may be a pair of transmitting and receiving antennas. Alternatively, the fifth dimension is a space domain dimension, specifically, an angle domain dimension, and the second granularity may be an interval of arrival angles.
  • the value of the ijth position in the two-dimensional matrix representing the channel information is used to represent the i-th first granularity in the fourth dimension and the j-th second granularity in the fifth dimension
  • the channel quality of ; i and j are both positive integers. That is to say, a numerical value (or an indicator value) at a certain position in the two-dimensional matrix used to represent the channel information represents the channel quality under the combination of the fourth dimension and the fifth dimension.
  • the channel quality may be characterized by a signal strength value; the unit of the signal strength value may be dBm, or the signal strength value has no unit but a value obtained after normalization.
  • the T dimensions may also include a sixth dimension.
  • the matrix of T dimensions may be a three-dimensional matrix of M ⁇ N ⁇ W; wherein, M represents the number of first granularities in the fourth dimension, N represents the number of second granularities in the fifth dimension, and W represents The number of third granularities under the sixth dimension; M, N and W are all positive integers.
  • the sixth dimension may be a complex dimension, the third granularity is 1, and the quantity W of the third granularity under the complex dimension is 2.
  • the fourth dimension represents the time domain dimension
  • the first granularity is the delay granularity
  • the fifth dimension is the spatial domain dimension, specifically the angle dimension
  • the second granularity is the interval of arrival angles
  • the sixth dimension is a complex dimension, W is 2, k is 1 to indicate the real part, and k is 2 to indicate the imaginary part.
  • the above channel information is illustrated by using a two-dimensional matrix formed by the fourth dimension and the fifth dimension.
  • the dimension of the above channel information matrix is not limited to two dimensions.
  • the second information in S1120 is channel compression information; the channel compression information includes eigenvector information of compressed channel estimation information.
  • the channel information is eigenvector information of the channel information; the second model is used to decompress the compressed eigenvector information of the channel estimation information to obtain the eigenvector information of the channel information.
  • the eigenvector information of the channel information includes R groups of eigenvector sequence information; R is a positive integer.
  • the network device processes the second information based on the second model to obtain channel information, which may include:
  • the network device inputs the eigenvector information of the compressed channel estimation information into the second model, and obtains the eigenvector information of the channel information output by the second model.
  • the second information may be channel compression information; the channel compression information includes eigenvector information of compressed channel estimation information. That is to say, the first model is used to process the input first information to obtain eigenvector information of compressed channel estimation information.
  • the eigenvector information of the channel information includes R groups of eigenvector sequence information; R is a positive integer.
  • R may be 1, then the eigenvector information of the channel information includes a set of eigenvector sequence information.
  • R may be 2, then the eigenvector information of the channel information includes 2 sets of eigenvector sequence information.
  • the above value of R may be determined according to the actual situation, or may be specified during the training of the first model.
  • each set of feature vector sequence information may include a feature sequence of a preset length.
  • the preset length can be set according to the actual situation or can be set during training, for example, it can be any one of 16, 32, 48, 64, 128, 256, and of course it can be longer or shorter.
  • the embodiment does not exhaustively list all possible values of the preset length. In conjunction with FIG. 7, for example, the preset length is 32 (but it can be a bit), and R is 4, that is, the feature vector information of the channel information includes 4 sets of feature vector sequence information, wherein each set of feature vector sequence information contains a feature sequence of length 32.
  • the channel estimation information output by the estimation sub-model may be different from the channel information output by the second model, and the channel estimation information output by the estimation sub-model may specifically be a matrix of channel information, such as It is represented by a matrix of T dimensions; the channel information output by the second model may be eigenvector information of the channel information, for example, may include R groups of eigenvector sequence information.
  • the channel information output by the second model and the channel estimation information output by the estimation sub-model may also be the same, for example, both may be a matrix of channel information.
  • the above describes in detail how the network device uses the second model.
  • the first way the network device obtains it directly
  • the second One way obtained by the network device training. The two methods are described below:
  • the network device receives the second model.
  • the network device receives the second model sent by the electronic device; for example, the network device may receive model parameters of the second model sent by the electronic device.
  • the electronic device may be an electronic device that obtains the first model and the second model through joint training.
  • the electronic device may be the terminal device, and in this case, the network device may be an access network device serving the terminal device, such as a base station, eNB, gNB, and so on.
  • the electronic device may be other devices than the terminal device, for example, it may be a server, or a desktop computer, or a notebook, or other devices capable of data processing, which are not exhaustive in this embodiment.
  • the network device receives the second model sent by the terminal device.
  • the second model is carried by at least one of the following: uplink control signaling, RRC message, uplink data transmission, and uplink data transmission for the transmission requirements of artificial intelligence services.
  • the second model (or the model parameters of the second model) may be transmitted through a wired connection or other wireless connection.
  • the electronic device transmits the second model (or the model parameters of the second model) to the network device through a wired connection with the network device.
  • the electronic device transmits the second model (or the model parameters of the second model) to the network device through other wireless connections with the network device; wherein, the other wireless connection methods may be bluetooth or WIFI, etc., are not exhaustive here.
  • the network device may also receive the first model.
  • the network device may receive the model parameters of the first model sent by the electronic device.
  • the electronic device may be the terminal device, and in this case, the network device may be an access network device serving the terminal device, such as a base station, eNB, gNB, and so on.
  • the electronic device may be other devices than the terminal device, for example, it may be a server, or a desktop computer, or a notebook, or other devices capable of data processing, which are not exhaustive in this embodiment.
  • the network device receives the first model sent by the terminal device.
  • the first model is carried by at least one of the following: uplink control signaling, RRC message, uplink data transmission, and uplink data transmission for the transmission requirements of artificial intelligence services.
  • the first model (or the model parameters of the first model) may be transmitted through a wired connection or other wireless connection.
  • the electronic device transmits the first model (or the model parameters of the first model) to the network device through a wired connection with the network device.
  • the electronic device transmits the first model (or the model parameters of the first model) to the network device through other wireless connections with the network device; wherein, the other wireless connection methods may be Bluetooth or WIFI, etc., are not exhaustive here.
  • the foregoing first model and the foregoing second model may be received at the same time, or the foregoing first model and the foregoing second model may be received separately, which is not limited in this embodiment.
  • the foregoing first model may include: an estimation submodel and a compression submodel; or, the first model may include: an estimation submodel, a channel generation submodel, and a compression submodel.
  • the above-mentioned first model may include: an estimation sub-model and a compression sub-model;
  • the estimation sub-model is used to perform channel estimation on the first information to obtain channel estimation information
  • the compression sub-model is used to compress the channel estimation information to obtain the second information.
  • the estimation sub-model can also be called a channel estimation sub-model or a channel estimation sub-neural network, and the estimation sub-model can use one of a fully connected network, a convolutional neural network, a residual network, and a self-attention mechanism network. Or a variety of network structure construction.
  • the compression sub-model may be called a channel compression sub-model or a channel compression sub-neural network, and the compression sub-model may use one of a fully connected network, a convolutional neural network, a residual network, a self-attention mechanism network, or A variety of network structure construction.
  • the estimation method adopted by the estimation sub-model may include algorithms such as minimum mean square error (MMSE).
  • MMSE minimum mean square error
  • the compression sub-model can compress the data volume of the input information.
  • the compression rate between the output information of the compressed sub-model and the input information can be determined during training, for example, the compression rate can be 5/1000, 2/1000, 10%, etc. lift.
  • the above-mentioned first model may include: an estimation sub-model, a channel generation sub-model and a compression sub-model;
  • the estimation sub-model is used to perform channel estimation on the first information to obtain channel estimation information
  • the channel generation sub-model is used to perform eigendecomposition on the channel estimation information to obtain eigenvector information of the channel estimation information;
  • the compression sub-model is used to compress the eigenvector information of the channel estimation information to obtain compressed eigenvector information of the channel estimation information.
  • the method of performing eigendecomposition in the channel generation sub-model may be a singular value decomposition (SVD, Singular Value Decomposition) method.
  • SVD singular value decomposition
  • the input channel information may be subjected to SVD eigendecomposition to obtain eigenvector information of channel estimation information after eigendecomposition.
  • the channel information may be represented by a matrix, and the specific description is the same as that of the foregoing embodiment, and will not be repeated here.
  • the compression sub-model may be to compress the data volume of the input information.
  • the compression rate between the output information of the compressed sub-model and the input information can be determined during training, for example, the compression rate can be 5/1000, 2/1000, 10%, etc. lift.
  • the channel estimation information output by the estimation sub-model may be different from the channel information output by the second model, and the channel estimation information output by the estimation sub-model may specifically be a matrix of channel information, such as It is represented by a matrix of T dimensions; the channel information output by the second model may be eigenvector information of the channel information, for example, may include R groups of eigenvector sequence information.
  • the channel information output by the second model and the channel estimation information output by the estimation sub-model may also be the same, for example, both may be a matrix of channel information.
  • the channel estimation information output by the estimation sub-model and the channel information output by the second model may be the same or different, and the description will not be repeated.
  • the network device can directly receive the first model.
  • the network device can also receive multiple sub-models respectively, and then receive The obtained multiple sub-models are combined to obtain the first model.
  • the first model includes an estimation sub-model and a compression sub-model.
  • the network device receives the estimated sub-model and the compressed sub-model; the network device generates the first model based on the estimated sub-model and the compressed sub-model.
  • the network device receives the model parameters of the estimated sub-model and the model parameters of the compressed sub-model sent by the electronic device; the network device based on the model parameters of the estimated sub-model and the model of the compressed sub-model parameters to obtain the first model.
  • the network device may receive the estimated sub-model and the compressed sub-model sent by the electronic device at the same time; or, it may receive the estimated sub-model and the compressed sub-model sent by the electronic device separately, for example, it may be received first
  • the estimated sub-model sent by the electronic device then receives the compressed sub-model sent by the electronic device, or first receives the compressed sub-model sent by the electronic device and then receives the estimated sub-model sent by the electronic device.
  • the estimated sub-model (or the model parameters of the estimated sub-model) and the compressed sub-model (or the model parameters of the compressed sub-model) may be carried simultaneously or separately by one of the following information ): Uplink control signaling, RRC messages, uplink data transmission, and uplink data transmission for artificial intelligence business transmission requirements.
  • the electronic device can use the above-mentioned estimated sub-model (or the model parameters of the estimated sub-model) and the compressed sub-model (or the model parameters of the compressed sub-model) through a wired connection with the network device. ) are sent simultaneously or separately to the network devices.
  • the electronic device sends the above-mentioned estimated sub-model (or the model parameters of the estimated sub-model) and the compressed sub-model (or the model parameters of the compressed sub-model) to the network simultaneously or separately through other wireless connections with the network device device; wherein, the other wireless connection methods may be bluetooth or WIFI, etc., which are not exhaustive here.
  • the first model includes an estimation sub-model, a channel generation sub-model, and a compression sub-model.
  • the network device receives the estimation sub-model, the compression sub-model and the channel generation sub-model; the network device generates the first model. Specifically, the network device receives the model parameters of the estimated sub-model, the model parameters of the compression sub-model and the model parameters of the channel generation sub-model sent by the electronic device; the network device based on the model parameters of the estimated sub-model , model parameters of the compression sub-model and model parameters of the channel generation sub-model to obtain the first model.
  • the network device may simultaneously receive the estimation sub-model, compression sub-model and channel generation sub-model sent by the electronic device.
  • the estimated sub-model, the compressed sub-model and the channel generation sub-model sent by the electronic device may be respectively received, for example, the estimated sub-model, the compressed sub-model and the channel generation sub-model are all received respectively; or, the estimated sub-model, compressed Any two of the sub-models and the channel generation sub-model are received separately from the remaining one.
  • the network device may first receive the estimated submodel sent by the electronic device, then receive the channel generation submodel sent by the electronic device, and finally receive the compressed submodel sent by the electronic device; or, first receive the The compressed sub-model and the channel generation sub-model sent by the electronic device, and then receive the estimated sub-model sent by the electronic device.
  • the above is only an exemplary description, and does not mean that there are only several combinations of the above-mentioned exemplary sub-models, compression sub-models, and channel generation sub-models that are actually sent or received respectively, but this embodiment is not exhaustive. lift.
  • the network device When the electronic device is the terminal device, when the network device receives the estimated sub-model, the compressed sub-model and the channel generation sub-model simultaneously or respectively, the estimated sub-model, the compressed sub-model and the channel generation sub-model
  • the model is carried simultaneously or separately by one of the following: uplink control signaling, RRC message, uplink data transmission, and uplink data transmission for the transmission requirements of artificial intelligence services.
  • the electronic device may send the above-mentioned estimation sub-model, compression sub-model and channel generation sub-model to the network device simultaneously or separately through a wired connection with the network device .
  • the electronic device sends the above estimated sub-model, compression sub-model and channel generation sub-model to the network device simultaneously or separately through other wireless connections with the network device; wherein, the other wireless connection methods may be Bluetooth or WIFI, etc., are not exhaustive here.
  • the method may further include: the network device receiving the third model. Specifically, the network device may receive model parameters of the third model. Still further, the network device may receive the third model sent by the electronic device, for example, the network device may receive model parameters of the third model sent by the electronic device.
  • the third model is used to perform data transformation processing on the second information output by the first model and input it into the second model; the first model, the second model and the third model are obtained through joint training .
  • the data transformation processing includes: convolution processing or Fourier transform processing.
  • the Fourier transform processing may specifically include: converting to the frequency domain through Fourier transform, multiplication, and then converting to the time domain through inverse Fourier transform.
  • the third model is carried by one of the following: uplink control signaling, RRC message, uplink data transmission, and uplink data transmission for artificial intelligence business transmission requirements.
  • the third model (or the model parameters of the third model) may be transmitted through a wired connection or other wireless connection.
  • the electronic device transmits the third model (or the model parameters of the third model) to the network device through a wired connection with the network device.
  • the electronic device transmits the third model (or the model parameters of the third model) to the network device through other wireless connections with the network device; wherein, the other wireless connection manner may be Bluetooth or WIFI, etc., are not exhaustive here.
  • first model, second model, and third model can be sent separately or simultaneously; or the above-mentioned first model, second model, and third model can also be sent separately; or, it is also possible It is any two combinations of which are sent at the same time, and the remaining one is sent separately, and so on.
  • the foregoing second model is a model that the network device needs to use when receiving the second information and processing to obtain channel information.
  • the first model and/or the third model can also be received. This is because when the first model and the second model are a whole obtained through joint training, if the network device wants to The overall evaluation of the first model and the second model needs to obtain the first model and the second model, and then, after the network device completes the overall evaluation of the first model and the second model, it can decide whether to use the received
  • the first model and the second model if the overall evaluation results of the network device on the first model and the second model are poor (for example, the compression rate is low or the accuracy of recovering channel information is low, etc.), the above-mentioned model may not be used.
  • the network device may re-train the first model and the second model by itself to update the model parameters of the first model and the second model, or the network device itself A new first model and a new second model are obtained through training.
  • the network device can also evaluate the first model, the second model, and the third model as a whole and correspondingly Subsequent processing, the specific processing method is the same as the above, and will not be repeated.
  • the network device trains itself to obtain the above-mentioned first model.
  • the network device uses training samples to jointly train the first preset model and the second preset model to obtain the trained first model and the second model;
  • the first model is the first preset model after training
  • the second model is the second preset model after training
  • the training may use the first loss function or the second loss function.
  • the following describes the training using the above two loss functions:
  • the loss function used in the training is a first loss function; the first loss function is based on the difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model degree of difference is constructed.
  • the degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model is determined based on a distance, or determined based on a degree of similarity.
  • the specific calculation method for determining the degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model based on the distance can use mean square error (MSE, Mean Squared Error ) or normalized mean square error (NMSE), etc., which are not exhaustive in this embodiment.
  • MSE mean square error
  • NMSE normalized mean square error
  • the output information of the second preset model may be a matrix
  • the input information of the compressed preset sub-model may also be a matrix
  • the output matrix of the second preset model It is called matrix 1
  • the matrix of the input of the compressed preset submodel is called matrix 2
  • the output information of the second preset model and the compressed preset submodel of the first preset model are determined based on the distance
  • the way of the degree of difference between the input information is the MSE way, for example: calculate the difference between matrix 1 and matrix 2, and use the square of the difference as the degree of difference.
  • the specific calculation method for determining the degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model based on the degree of similarity may use cosine similarity or cosine similarity squared etc., which are not exhaustive in this embodiment.
  • the output information of the second preset model may be R sets of feature vector sequence information
  • the input information of the compressed preset sub-model may also be R sets of feature vector sequence information.
  • the The R sets of feature vector sequence information output by the second preset model are called feature vector sequence 1
  • the R sets of feature vector sequence information input by the compressed preset sub-model are called feature vector sequence 2.
  • the method of determining the degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model based on the degree of similarity may be cosine similarity, for example: feature vector sequence 1 and the cosine angle of the eigenvector sequence 2 to determine the degree of similarity, and use the degree of similarity as the degree of difference.
  • the first preset model includes an estimation preset sub-model and a compression preset sub-model.
  • FIG. 8 a it illustrates a first preset model 800 , a second preset model 810 , and an estimated preset sub-model 801 and a compressed preset sub-model 802 included in the first preset model 800 .
  • the above-mentioned first preset model 800, the second preset model 810, and the input-output relationship between the estimated preset sub-model 801 and the compressed preset sub-model 802 included in the first preset model 300 can be: estimated The input information of the preset submodel 801 is the first training sample 920; the output information of the estimated preset submodel 801 is used as the input information of the compressed preset submodel 802; the output information of the compressed preset submodel 802 As the input information of the second preset model 810 .
  • the network device uses training samples to jointly train the first preset model and the second preset model, including:
  • the network device inputs the first training sample into the estimated preset sub-model of the first preset model, and obtains initial information output by the estimated preset sub-model;
  • the first training samples may be reference signal samples.
  • the reference signal samples may be original reference signals or processed reference signals obtained through historical acquisition. More specifically, the reference signal samples may be downlink reference signal samples. It should be pointed out that this embodiment does not limit the first training samples to be the downlink reference signal samples, and uplink reference signal samples or other reference signal samples may also be used, but this embodiment does not make an exhaustive list.
  • the information input into the estimated preset sub-model of the first preset model can also be wireless channel or other information related to the scene, for example, it can include at least one of the following: channel Signal-to-noise ratio, signal-to-interference-noise ratio, channel type, bandwidth information, delay information, etc.
  • channel Signal-to-noise ratio signal-to-interference-noise ratio
  • channel type bandwidth information
  • delay information etc.
  • a specific function of the estimation preset sub-model of the first preset model may be: perform channel estimation based on the first training samples to obtain the initial information.
  • the channel estimation may adopt algorithms such as minimum mean square error (MMSE).
  • the aforementioned initial information may be a matrix, and the dimension of the matrix is not limited here, and may be a matrix of two or more dimensions.
  • the value at each position in the matrix is used to represent the corresponding channel quality at the corresponding granularity corresponding to multiple dimensions.
  • the channel quality may be in dBm, or may be a value after normalization processing of the channel quality.
  • the compressed preset sub-model of the first preset model compresses the data volume of the input information.
  • the compression rate between the output information of the compressed sub-model and the input information can be determined during training, for example, the compression rate can be 5/1000, 2/1000, 10%, etc. lift.
  • the compressed information obtained by compressing the preset sub-model contains less content or data volume than the input information.
  • the form of the above-mentioned compressed information is the same as that of the initial information.
  • the initial information is a matrix
  • the corresponding compressed information is also a matrix.
  • the dimensions of the matrix of the initial information and the compressed information are the same, but the data The amount is different.
  • the function of the second preset model may be to decompress its input information.
  • the input information of the second preset model is compressed information, and the output of the second preset model is restored information.
  • the decompression rate of the second preset model should make the obtained restored information contain the same data content as the original information.
  • the performing reverse conduction update of the first preset model and the second preset model based on the first loss function may specifically refer to performing reverse conduction update based on the degree of difference determined by the first loss function.
  • the model parameters of the estimated preset sub-model, the model parameters of the compressed preset sub-model, and the model parameters of the second preset model may specifically refer to performing reverse conduction update based on the degree of difference determined by the first loss function.
  • the manner of the above training convergence may include at least one of the following: judging whether the number of iterative training reaches a preset number, and judging whether the degree of difference is smaller than a preset threshold.
  • the preset number of times and the preset threshold value can be set according to actual conditions, and are not exhaustive. That is to say, when it is determined that the training is completed based on the above manner, the first preset model after the training can be used as the first model, and the second preset model after the training can be used as the second model.
  • the first preset model includes an estimation preset submodel, a preset channel generation submodel, and a compression preset submodel.
  • FIG. 8b it shows a first preset model 800, a second preset model 810, and the estimation preset sub-model 801, compression preset sub-model 802 and channel generation preset included in the first preset model 800.
  • Set sub-model 803 it shows a first preset model 800, a second preset model 810, and the estimation preset sub-model 801, compression preset sub-model 802 and channel generation preset included in the first preset model 800.
  • first preset model 800 second preset model 810, and the estimation preset submodel 801
  • the input-output relationship of can be as follows: the input information of the estimated preset sub-model 801 is the first training sample 920; the output information of the estimated preset sub-model 801 is used as the input information of the channel generation preset sub-model 803; The output information of the channel generation preset submodel 803 is used as the input information of the compression preset submodel 802 ; the output information of the compression preset submodel 802 is used as the input information of the second preset model 810 .
  • the network device uses training samples to jointly train the first preset model and the second preset model, which may include:
  • the network device inputs the first training sample into the estimated preset sub-model of the first preset model, and obtains initial information output by the estimated preset sub-model;
  • the information input into the estimated preset sub-model of the first preset model can also be wireless channel or other information related to the scene, for example, it can include at least one of the following: channel Signal-to-noise ratio, signal-to-interference-noise ratio, channel type, bandwidth information, delay information, etc.
  • channel Signal-to-noise ratio signal-to-interference-noise ratio
  • channel type bandwidth information
  • delay information etc.
  • the specific function of the estimation preset sub-model of the first preset model may be: perform channel estimation based on the first training samples to obtain initial information.
  • the channel estimation may adopt algorithms such as minimum mean square error (MMSE).
  • a function of the channel generation preset submodel may be to perform eigendecomposition on the initial information to obtain eigenvector information of the initial information.
  • the eigenvector information of the initial information may include R groups of eigenvector sequences.
  • the method of performing eigendecomposition on the initial information may adopt a method of Singular Value Decomposition (SVD, Singular Value Decomposition).
  • the compressed preset sub-model of the first preset model compresses the data volume of the input information.
  • the compression rate between the output information of the compressed sub-model and the input information can be determined during training, for example, the compression rate can be 5/1000, 2/1000, 10%, etc. lift.
  • the compressed feature vector information obtained by compressing the preset sub-model contains less content or data volume than the feature vector information of the input initial information.
  • the form of the above-mentioned compressed feature vector information is the same as that of the initial information.
  • the feature vector information of the initial information is an R group of feature vector sequences
  • the corresponding compressed feature vector information is also an R group of feature vectors. sequence, but the amount of data contained in the two is different.
  • the function of the second preset model may be to decompress its input information to obtain restoration information.
  • the input information of the second preset model is compressed feature vector information
  • the output of the second preset model is restored feature vector information.
  • the decompression rate of the second preset model should make the obtained restored feature vector information contain the same data content as the feature vector information of the initial information.
  • the performing reverse conduction to update the first preset model and the second preset model based on the degree of difference determined by the first loss function may specifically refer to: performing a reverse conduction based on the degree of difference determined by the first loss function performing reverse conduction to update model parameters of the estimated preset submodel, model parameters of the channel generation preset submodel, model parameters of the compression preset submodel, and model parameters of the second preset model.
  • the method for determining the convergence of the above training is the same as that of the above-mentioned case 1, and repeated explanations are not repeated.
  • the network device uses training samples to jointly train the first preset model and the second preset model to obtain the trained first model and the second model, including:
  • the network device uses training samples to jointly train the first preset model, the second preset model and the third preset model to obtain the trained first model, the second model and the third model; wherein , the third model is a trained third preset model.
  • FIG. 8c it shows a first preset model 800, a second preset model 810, a third preset model 830, and the estimated preset sub-model 801 contained in the first preset model 800, the compression preset Submodel 802.
  • the above-mentioned first preset model 800, the second preset model 810, and the input-output relationship between the estimated preset sub-model 801 and the compressed preset sub-model 802 contained in the first preset model 800 can be: estimated The input information of the preset submodel 801 is the first training sample 920; the output information of the estimated preset submodel 801 is used as the input information of the compressed preset submodel 802; the output information of the compressed preset submodel 802 is used as the The input information of the third preset model 830; the output information of the third preset model is used as the output information of the second preset model 810.
  • the network device uses training samples to jointly train the first preset model, the second preset model and the third preset model, including:
  • the network device inputs the first training sample into the estimated preset sub-model of the first preset model, and obtains initial information output by the estimated preset sub-model;
  • the information input into the estimated preset sub-model of the first preset model can also be wireless channel or other information related to the scene, for example, it can include at least one of the following: channel Signal-to-noise ratio, signal-to-interference-noise ratio, channel type, bandwidth information, delay information, etc.
  • channel Signal-to-noise ratio signal-to-interference-noise ratio
  • channel type bandwidth information
  • delay information etc.
  • estimation preset sub-model of the first preset model and the compression preset sub-model of the first preset model are the same as those in the first case, so the description will not be repeated.
  • a third preset model is added relative to case one, and the function of the third preset model is to simulate the channel environment, and the specific processing can be to perform data transformation on input information to obtain data transformed information as output information .
  • the specific processing method of the data transformation may include convolution processing or data processing equivalent to convolution; wherein, the data processing equivalent to convolution may be multiple Fourier transform processing, for example, may In order to transform the input information of the third preset model into the frequency domain through Fourier transform, multiply and then transform into the time domain through inverse Fourier transform, the process of convolution in the time domain is equivalent.
  • the function of the second preset model may be to decompress its input information to obtain restoration information.
  • the input information of the second preset model is transformed information, and the output of the second preset model is restored information.
  • the decompression rate of the second preset model should make the restored information obtained by it contain close to or the same data content as the original information.
  • the updating of the first preset model, the second preset model, and the third preset model based on the first loss function may specifically refer to: performing reverse conduction based on the first loss function Conductively updating model parameters of the estimated preset sub-model, model parameters of the compressed preset sub-model, model parameters of the second preset model, and model parameters of the third preset model.
  • Case 4 is different from the above case 3 in that the first preset model includes an estimation preset sub-model, a preset channel generation sub-model, and a compression preset sub-model.
  • FIG. 8d it shows a first preset model 800, a second preset model 810, a third preset model 830, and the estimated preset sub-model 801 contained in the first preset model 800, the compression preset Submodel 802 and channel generation preset submodel 803 .
  • the input-output relationship between the preset sub-models 803 may be: the input information of the estimated preset sub-model 801 is the first training sample 920; the output information of the estimated preset sub-model 801 is used as the channel generation preset sub-model The input information of 803; the output information of the channel generation preset submodel 803 is used as the input information of the compression preset submodel 802; the output information of the compression preset submodel 802 is used as the input of the third preset model 830 information; the output information of the third preset model 830 is used as the input information of the second preset model 810 .
  • the network device uses training samples to jointly train the first preset model, the second preset model and the third preset model, including:
  • the network device inputs the first training sample into the estimated preset sub-model of the first preset model, and obtains initial information output by the estimated preset sub-model;
  • the specific description about the first training sample is the same as any one of the foregoing case 1, case 2, and case 3, so no repeated description is given.
  • the specific function of the estimated preset sub-model of the first preset model is the same as any one of the foregoing case 1, case 2, and case 3.
  • a function of the channel generation preset submodel may be to perform eigendecomposition on the initial information to obtain eigenvector information of the initial information.
  • the eigenvector information of the initial information may include R groups of eigenvector sequences.
  • the method of performing eigendecomposition on the initial information may adopt a method of Singular Value Decomposition (SVD, Singular Value Decomposition).
  • the compressed preset sub-model of the first preset model compresses the data volume of the input information.
  • the compression rate between the output information of the compressed sub-model and the input information can be determined during training, for example, the compression rate can be 5/1000, 2/1000, 10%, etc. lift.
  • the function of the third preset model is to simulate the wireless channel environment, and the specific processing may be to perform data transformation on input information to obtain information after data transformation as output information.
  • the specific processing method of the data transformation may include convolution processing or data processing equivalent to convolution; wherein, the data processing equivalent to convolution may be multiple Fourier transform processing, for example, may In order to transform the input information of the third preset model into the frequency domain through Fourier transform, multiply and then transform into the time domain through inverse Fourier transform, the process of convolution in the time domain is equivalent.
  • the function of the second preset model may be to decompress its input information to obtain restoration information.
  • the input information of the second preset model is transformed feature vector information, and the output of the second preset model is restored feature vector information.
  • the decompression rate of the second preset model should make the obtained restored feature vector information and the feature vector information of the initial information contain close to or the same data content.
  • the performing reverse conduction update based on the degree of difference determined by the first loss function to update the first preset model, the second preset model, and the third preset model may specifically refer to: based on the first The degree of difference determined by a loss function is used to perform reverse conduction to update the model parameters of the estimated preset sub-model, the model parameters of the channel generation preset sub-model, the model parameters of the compressed preset sub-model, the The model parameters of the second preset model and the model parameters of the third preset model.
  • the scenario where the first loss function is used for joint training is described above.
  • the scenario where the second loss function is used for joint training can also be provided, as follows:
  • the loss function used in the training is a second loss function; the second loss function is based on the difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model The first difference degree of the first preset model and the second difference degree between the output information of the estimated preset sub-model of the first preset model and the second training sample; wherein, the second training sample and the input of the estimated Corresponds to the first training sample of the preset sub-model.
  • the first degree of difference is determined based on a distance, or is determined based on a degree of similarity; and/or, the second degree of difference is determined based on a distance, or is determined based on a degree of similarity.
  • the specific calculation method for determining the first degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model based on the distance can use a mean square error (MSE, Mean Squared Error) or normalized mean square error (NMSE), etc., this embodiment is not exhaustive.
  • MSE mean square error
  • NMSE normalized mean square error
  • the output information of the second preset model may be a matrix
  • the input information of the compressed preset sub-model may also be a matrix
  • the output matrix of the second preset model It is called matrix 3
  • the matrix of the input of the compressed preset submodel is called matrix 4
  • the output information of the second preset model and the compressed preset submodel of the first preset model are determined based on the distance
  • the way of the degree of difference between the input information is the MSE way, for example: calculate the difference between matrix 3 and matrix 4, and use the square of the difference as the degree of difference.
  • the specific calculation method for determining the first degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model based on the degree of similarity may use cosine similarity or cosine similarity Degree square and other methods are not exhaustive in this embodiment.
  • the output information of the second preset model may be R sets of feature vector sequence information
  • the input information of the compressed preset sub-model may also be R sets of feature vector sequence information.
  • the The R group of feature vector sequence information output by the second preset model is called feature vector sequence 3
  • the R group of feature vector sequence information input by the compressed preset sub-model is called feature vector sequence 4.
  • the method of determining the degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model based on the degree of similarity may be cosine similarity, for example: feature vector sequence 3 and the cosine angle of the eigenvector sequence 4 to determine the degree of similarity, and use the degree of similarity as the degree of difference.
  • the specific calculation method for determining the second degree of difference between the output information of the estimated preset sub-model of the first preset model based on the distance and the second training sample can use mean square error (MSE, Mean Squared Error) or normalization
  • MSE mean square error
  • NMSE normalized mean square error
  • the output information of the estimated preset sub-model may be a matrix, and correspondingly, the second training sample may also be a matrix.
  • the output matrix of the estimated preset sub-model is called matrix 5
  • the matrix of the input of the compressed preset sub-model is called matrix 6
  • the second degree of difference between the output information of the estimated preset sub-model of the first preset model and the second training sample is determined based on the distance In the MSE mode, for example: calculate the difference between matrix 5 and matrix 6, and use the square of the difference as the degree of difference.
  • the specific calculation method can use cosine similarity or cosine similarity squared, etc.
  • the methods are not exhaustive in this embodiment.
  • the method of connection may be to add the weights of the first degree of difference and the second degree of difference, for example, the two each account for 50%; or , the joint method can be the addition of unequal weights between the first difference degree and the second difference degree.
  • the difference before and after the compression and recovery between the two can be assigned a greater weight, or the above-mentioned second degree of difference can be assigned a larger weight, that is, the accuracy of the output information of the above-mentioned estimated preset sub-model is assigned a larger weight; or, its combination
  • the method can be in the form of multiplying the first degree of difference and the second degree of difference; or the joint method can be that the first degree of difference and the second degree of difference can be calculated by cross entropy, such as p1*log(first degree of difference)+p2*log (the second degree of difference), where both p1 and p2 can be set according to actual conditions, and are not limited here.
  • the network device uses training samples to jointly train the first preset model and the second preset model to obtain the trained first model and the second model; wherein, the first preset model includes estimated preset Set submodels and compress preset submodels.
  • composition of each model and the input-output relationship between each model are the same as the previous case 1.
  • FIG. 8 a please refer to FIG. 8 a , which will not be repeated here.
  • the network device uses training samples to jointly train the first preset model and the second preset model, including:
  • the network device inputs the first training sample into the estimated preset sub-model of the first preset model, and obtains initial information output by the estimated preset sub-model;
  • the first training samples may be reference signal samples.
  • the reference signal samples may be original reference signals or processed reference signals obtained through historical acquisition. More specifically, the reference signal samples may be downlink reference signal samples. It should be pointed out that this embodiment does not limit the first training samples to be the downlink reference signal samples, and uplink reference signal samples or other reference signal samples may also be used, but this embodiment does not make an exhaustive list.
  • the information input into the estimated preset sub-model of the first preset model can also be wireless channel or other information related to the scene, for example, it can include at least one of the following: channel Signal-to-noise ratio, signal-to-interference-noise ratio, channel type, bandwidth information, delay information, etc. In the process of training, whether one or more of the above information is input may be relevant according to the actual situation or the actual scene, and it is not limited here.
  • the specific function of the estimation preset sub-model of the first preset model may be: perform channel estimation based on the first training samples to obtain initial information.
  • the channel estimation may adopt algorithms such as minimum mean square error (MMSE).
  • the above initial information may be a matrix.
  • the dimension of the matrix is not limited here, and may be a matrix of two or more dimensions.
  • the value at each position in the matrix is used to represent the corresponding channel quality at the corresponding granularity corresponding to multiple dimensions.
  • the compressed preset sub-model of the first preset model compresses the data volume of the input information.
  • the compression rate between the output information of the compressed sub-model and the input information can be determined during training, for example, the compression rate can be 5/1000, 2/1000, 10%, etc. lift.
  • the compressed information obtained by compressing the preset sub-model contains less content or data volume than the input initial information.
  • the form of the above-mentioned compressed information is the same as that of the initial information, such as a matrix, and the matrix dimensions of the initial information and the compressed information are the same, but the data content (or data volume) is different.
  • the function of the second preset model may be to decompress its input information to obtain restoration information.
  • the decompression rate of the second preset model should make the obtained restored information contain the same data content (or data amount) as the original information.
  • the performing reverse conduction based on the second loss function to update the first preset model and the second preset model may specifically refer to: performing reverse conduction based on the second loss function to update the estimated preset sub-model , the model parameters of the compressed preset sub-model and the model parameters of the second preset model.
  • the way of the above training convergence may include at least one of the following: judging whether the number of iterative training reaches the preset number, and judging whether the degree of difference determined by the second loss function is less than a preset threshold. That is to say, when it is determined that the training is completed based on the above manner, the first preset model after the training can be used as the first model, and the second preset model after the training can be used as the second model.
  • Case 6 is different from Case 5 in that the first preset model includes an estimation preset sub-model, a preset channel generation sub-model, and a compression preset sub-model.
  • composition of each model in this case and the input-output relationship between each model are the same as those in the second case, for details, please refer to FIG. 8 b , which will not be repeated here.
  • the network device uses training samples to jointly train the first preset model and the second preset model, including:
  • the network device inputs the first training sample into the estimated preset sub-model of the first preset model, and obtains initial information output by the estimated preset sub-model;
  • the second training sample corresponds to the first training sample
  • the information input into the estimated preset sub-model of the first preset model can also be wireless channel or other information related to the scene, for example, it can include at least one of the following: channel Signal-to-noise ratio, signal-to-interference-noise ratio, channel type, bandwidth information, delay information, etc.
  • channel Signal-to-noise ratio signal-to-interference-noise ratio
  • channel type bandwidth information
  • delay information etc.
  • the specific function of the estimation preset sub-model of the first preset model is the same as the fifth case above.
  • a function of the channel generation preset submodel may be to perform eigendecomposition on the initial information to obtain eigenvector information of the initial information.
  • the eigenvector information of the initial information may include R groups of eigenvector sequences.
  • the method of performing eigendecomposition on the initial information may adopt a method of Singular Value Decomposition (SVD, Singular Value Decomposition).
  • the compressed preset sub-model of the first preset model compresses the data volume of the input information.
  • the compression rate between the output information of the compressed sub-model and the input information can be determined during training, for example, the compression rate can be 5/1000, 2/1000, 10%, etc. lift.
  • the content or data volume of the compressed feature vector information output by the compressed preset sub-model is smaller than the data volume or data content of the feature vector information of the input initial information.
  • the form of the compressed feature vector information and the feature vector information of the initial information are the same, but the amount of data contained in the two is different.
  • the function of the second preset model may be to decompress its input information to obtain restoration information.
  • the input information of the second preset model is compressed feature vector information
  • the output of the second preset model is restored feature vector information.
  • the decompression rate of the second preset model should make the obtained restored feature vector information contain the same data content (or data amount) as the feature vector information of the initial information.
  • Performing reverse conduction according to the second loss function to update the first preset model and the second preset model may specifically refer to: performing reverse conduction based on the second loss function to update the estimated preset sub-model model parameters of the channel generation preset sub-model, model parameters of the compression preset sub-model and model parameters of the second preset model.
  • the method for determining the above-mentioned training convergence is the same as that of the above-mentioned case five, and repeated explanations are not repeated.
  • the network device uses training samples to jointly train the first preset model and the second preset model to obtain the trained first model and the second model, including:
  • the network device uses training samples to jointly train the first preset model, the second preset model and the third preset model to obtain the trained first model, the second model and the third model; wherein , the third model is a trained third preset model.
  • the network device uses training samples to jointly train the first preset model, the second preset model and the third preset model, including:
  • the network device inputs the first training sample into the estimated preset sub-model of the first preset model, and obtains initial information output by the estimated preset sub-model;
  • composition of each model in this case and the input-output relationship between each model are the same as those in the third case above, which can be referred to FIG. 8c, and repeated explanations are not repeated here.
  • the specific description about the first training sample is the same as the foregoing case five or six, so no repeated description is given.
  • estimation preset sub-model of the first preset model and the compression preset sub-model of the first preset model are the same as those of the fifth case above, so repeated descriptions will not be made.
  • a third preset model is added relative to case five.
  • the function of the third preset model is to simulate the channel environment, and the specific processing can be to perform data transformation on input information to obtain data transformed information as output information .
  • the specific processing method of the data transformation may include convolution processing or data processing equivalent to convolution; wherein, the data processing equivalent to convolution may be multiple Fourier transform processing, for example, may In order to transform the input information of the third preset model into the frequency domain through Fourier transform, multiply and then transform into the time domain through inverse Fourier transform, the process of convolution in the time domain is equivalent.
  • the function of the second preset model may be to decompress its input information to obtain restoration information.
  • the performing reverse conduction update based on the degree of difference determined by the second loss function to update the first preset model, the second preset model, and the third preset model may specifically refer to: based on the first
  • the second loss function performs reverse conduction to update the model parameters of the estimated preset sub-model, the model parameters of the compressed preset sub-model, the model parameters of the second preset model, and the model of the third preset model parameter.
  • Case 8 is different from the above case 7 in that the first preset model includes an estimation preset submodel, a preset channel generation submodel, and a compression preset submodel.
  • the network device uses training samples to jointly train the first preset model, the second preset model and the third preset model, including:
  • the network device inputs the first training sample into the estimated preset sub-model of the first preset model, and obtains initial information output by the estimated preset sub-model;
  • the second training sample corresponds to the first training sample
  • composition of each model and the input-output relationship between each model are the same as the foregoing case 4, which can be referred to FIG. 8d , and repeated descriptions are not repeated here.
  • the specific description about the first training sample is the same as any one of the above-mentioned case 5, case 6, and case 7, so the description will not be repeated.
  • the specific function of the estimated preset sub-model of the first preset model is the same as any one of the fifth, sixth, and seventh cases.
  • a function of the channel generation preset submodel may be to perform eigendecomposition on the initial information to obtain eigenvector information of the initial information.
  • the eigenvector information of the initial information may include R groups of eigenvector sequences.
  • the method of performing eigendecomposition on the initial information may adopt a method of Singular Value Decomposition (SVD, Singular Value Decomposition).
  • the compressed preset sub-model of the first preset model compresses the data volume of the input information.
  • the compression rate between the output information of the compressed sub-model and the input information can be determined during training, for example, the compression rate can be 5/1000, 2/1000, 10%, etc. lift.
  • the function of the third preset model is to simulate the wireless channel environment, and the specific processing may be to perform data transformation on input information to obtain information after data transformation as output information.
  • the specific processing method of the data transformation may include convolution processing or data processing equivalent to convolution; wherein, the data processing equivalent to convolution may be multiple Fourier transform processing, for example, may In order to transform the input information of the third preset model into the frequency domain through Fourier transform, multiply and then transform into the time domain through inverse Fourier transform, the process of convolution in the time domain is equivalent.
  • the function of the second preset model may be to decompress its input information to obtain restored feature vector information.
  • the input information of the second preset model is transformed feature vector information, and the output of the second preset model is restored feature vector information.
  • the decompression rate of the second preset model should be such that the obtained restored feature vector information and the feature vector information of the initial information contain data content close to or the same.
  • the performing reverse conduction update based on the degree of difference determined by the first loss function to update the first preset model, the second preset model, and the third preset model may specifically refer to: based on the first The degree of difference determined by a loss function is used to perform reverse conduction to update the model parameters of the estimated preset sub-model, the model parameters of the channel generation preset sub-model, the model parameters of the compressed preset sub-model, the The model parameters of the second preset model and the model parameters of the third preset model.
  • the network device can obtain the first model and the second model after its own joint training, or obtain the first model, the second model and the third model after joint training by adopting the above second method. Furthermore, the above-mentioned processing of S1110 to S1130 may be performed.
  • the training samples are used in the process of the joint training of the network device itself to obtain the first model and the second model, and the joint training of the network device itself to obtain the first model, the second model and the third model provided by the second method above. , the following is a detailed description of the training samples:
  • the training samples may include a first training sample.
  • the first training samples may be reference signal samples.
  • the reference signal samples may be original reference signals or processed reference signals obtained through historical acquisition.
  • the original reference signal may refer to a reference signal that has not been transmitted through a wireless channel.
  • the method for acquiring and processing the reference signal may include: using the reference signal received after the original reference signal passes through the wireless channel (or the real wireless channel, or the real wireless channel) as the processed reference signal.
  • the method for obtaining a processed reference signal may include: using a reference signal received after the original reference signal passes through a simulated wireless channel as a processed reference signal.
  • the original reference signal may be a downlink reference signal or an uplink reference signal.
  • the first training samples are distributed in the first dimension and/or the second dimension.
  • the first dimension is a time domain dimension; the first training samples include first information samples distributed in m time units in the time domain dimension; m is a positive integer.
  • first information samples may be distributed in each of the m time units, where n is a positive integer.
  • Each time unit may include at least one time slot, or at least one symbol (such as an OFDM symbol).
  • the first information sample is a downlink reference signal sample
  • the number of time slots contained in each time unit may be c (c is a positive integer)
  • n downlink reference signals in each c time slot A sample, combination of c and n can be e.g. (1,1)(1,2)(1,3)(1,4)(1,6)(2,1)(4,1)(5,1) (8,1)(10,1).
  • the second dimension is a frequency domain dimension; the first training samples include first information samples distributed on x frequency domain resources in the frequency domain dimension; x is a positive integer.
  • y first information samples may be distributed in each of the x frequency domain resources, and y is a positive integer.
  • Each frequency domain resource may include at least one resource block (RB), or at least one subcarrier.
  • the first information sample is a downlink reference signal sample
  • the number of time slots contained in each frequency domain resource may be d (d is a positive integer)
  • d is a positive integer
  • y time slots in every d RBs in the frequency domain
  • the combination of d and y can be, for example, (1,1)(1,2)(2,1)(1,3)(3,1)(1,4)(4,1)(1 ,6)(6,1).
  • the above-mentioned first training samples are distributed in the first dimension and/or the second dimension. It can be understood that only the distribution of the first training samples in the frequency domain dimension can be used for subsequent training, or only the first training samples can be used in the frequency domain.
  • the distribution of the first training sample in the frequency domain and the time domain can also be used for subsequent training.
  • a first training sample contains 10 RBs in the frequency domain dimension and 1 time slot in the time domain dimension, each RB has 3 first signal samples, and each time slot has 1 first signal samples, the first training samples include a total of 30 first signal samples.
  • the sizes of the first dimension and the second dimension, the time domain dimension and the frequency domain dimension may be equal or unequal.
  • the above-mentioned time-domain dimension and frequency-domain dimension can also be combined into one dimension. Specifically, the combination can be the time-domain dimension first and then the frequency-domain dimension, or the frequency-domain dimension first and then the time-domain dimension, which is not implemented in this embodiment limited.
  • the solution provided by this embodiment can be based on the above-mentioned first dimension and second dimension.
  • Increase the presentation form of complex numbers or it can be understood as adding a dimension, which is caused by the independent presentation of the imaginary part and real part data of the original reference signal or the processed reference signal
  • the first training The samples are also distributed in the third dimension.
  • the third dimension is a complex dimension; the first training samples include the real part of the first information sample and the imaginary part of the first information sample.
  • each first training sample contains 1 time unit (such as 1 time slot) in the time domain dimension, and contains 10 frequency domain resources (such as 10 RBs) in the frequency domain dimension
  • each first The information sample can be expressed as a real part and an imaginary part, and the first training sample can be a 1 ⁇ 10 ⁇ 2 matrix.
  • the training samples also include a second training sample corresponding to the first training sample; the second training sample is composed of a matrix of T dimensions; T is an integer greater than or equal to 2.
  • the second training samples may be used to characterize the expected channel quality based on the first training samples, or channel response, or channel state, or channel estimation results, or channel information .
  • the T dimensions include a fourth dimension and a fifth dimension.
  • the matrix of the T dimensions may specifically be a two-dimensional matrix of M ⁇ N; wherein, M represents the number of first granularities in the fourth dimension, and N represents the number of second granularities in the fifth dimension; M and N are all positive integers.
  • a second training sample consists of a two-dimensional matrix with a size of M ⁇ N, which has M first granularities in the fourth dimension and N second granularities in the fifth dimension; the above M and N May or may not be equal.
  • the specific numerical indication in the two-dimensional matrix represents the received signal strength at a certain first granularity of the channel quality, where the unit of the numerical value in the two-dimensional matrix may be dBm, or the numerical value in the two-dimensional matrix has no unit It is the value obtained after normalization.
  • the two-dimensional matrix of M ⁇ N can also be synthesized into one-dimensional data of size 1 ⁇ (M ⁇ N) or (M ⁇ N) ⁇ 1.
  • the specific transformation can be the fourth dimension first and then the fifth dimension. It may also be the fifth dimension first and then the fourth dimension, which is not limited in this embodiment.
  • the fourth dimension is a frequency domain dimension; the first granularity includes one of the following: L1 resource blocks RB, L2 subcarriers; L1 and L2 are positive integers.
  • the fourth dimension is a time domain dimension; the first granularity includes one of the following: K1 microseconds, K2 symbol length, and K3 symbol sampling points; K1, K2, and K3 are positive integers.
  • the symbols are Orthogonal Frequency Division Multiplexing (OFDM, Orthogonal Frequency Division Multiplexing).
  • OFDM Orthogonal Frequency Division Multiplexing
  • the second training sample may be a channel information sample corresponding to the reference signal sample, or may also be called a channel state sample Wait, I'm not going to exhaust the names here.
  • the fourth dimension is the frequency domain dimension
  • the first granularity can be L1 RBs (L1 is greater than or equal to 1, such as 2RB, 4RB, 8RB), and the frequency domain range indicated by a second training sample is M ⁇ L1 or the first granularity can be L2 subcarriers (L2 is greater than 1, such as 4 subcarriers, 6 subcarriers, and 18 subcarriers), then the frequency domain range indicated by a second training sample is the frequency domain of M ⁇ L2 scope.
  • the first granularity may be a delay granularity, for example, a first granularity is the number of sampling points of K1 microseconds, or K2 symbol lengths, or K3 symbols, where the symbols It may be an OFDM symbol; when the fourth dimension is the time domain dimension and the first granularity is K1 microseconds, the time domain range indicated by a second training sample is the time domain range of M ⁇ K1.
  • the fifth dimension is a space domain dimension; correspondingly, the second granularity is an interval between a pair of transmitting and receiving antennas or an angle of arrival.
  • the fifth dimension is the space domain dimension, specifically, the antenna dimension, for example, the fifth dimension is composed of N antenna pairs, and correspondingly, the second granularity is a pair of transmitting and receiving antennas.
  • the fifth dimension is a space domain dimension, specifically an angle domain dimension, for example, the fifth dimension is composed of N arrival angles, and the second granularity is the interval between the above N arrival angles.
  • the value at the ijth position in the two-dimensional matrix is used to represent the channel quality at the i-th first granularity in the fourth dimension and the j-th second granularity in the fifth dimension; Both i and j are positive integers. That is to say, in the case of using a first training sample, the value (or referred to as an indicator value) at a certain position in the two-dimensional matrix used to represent the second training sample represents the The expected channel quality situation under such a combination of five dimensions.
  • the channel quality or the channel quality situation can be characterized by signal strength, and the unit of the value (or indicator value) can be dBm, or there is no unit but a value obtained after normalization.
  • the fourth dimension represents the frequency domain dimension
  • the fifth dimension is the space domain dimension, specifically the antenna dimension
  • the first granularity is 2RB
  • the second granularity is 1
  • the value (or indicator value) at this position can be used to represent the channel quality (or channel quality situation) on the third 2RB bandwidth (that is, the fifth RB to the sixth RB) on the sixth pair of transceiver antennas ).
  • S may also be used to represent the number of second training samples, and S may be an integer greater than or equal to 1, that is, the second training samples may include one or more.
  • the T dimensions also include a sixth dimension.
  • the matrix of T dimensions is a three-dimensional matrix of M ⁇ N ⁇ W; wherein, M represents the number of first granularities in the fourth dimension, and N represents the number of second granularities in the fifth dimension, W represents the quantity of the third granularity under the sixth dimension; M, N and W are all positive integers.
  • the value of the ijkth position in the three-dimensional matrix is used to represent the i-th first granularity in the fourth dimension, the j-th second granularity in the fifth dimension, and the k-th in the sixth dimension channel quality corresponding to the first training sample at a third granularity; i, j and k are all positive integers.
  • the sixth dimension may be a complex dimension.
  • the second training samples can be used to characterize the expected channel quality based on the first training samples (or called channel response, or called channel state, or called channel estimation result, or called channel information), and the above-mentioned channel quality can also be presented by a complex number, so a sixth dimension, that is, a complex number dimension, can be added on the basis of the above two dimensions of the second training sample, and the complex number dimension is the second training sample.
  • the imaginary and real parts of the channel quality in the samples are presented independently generated.
  • the sixth dimension is a complex dimension; the third granularity is 1, and the quantity W of the third granularity under the complex dimension is 2.
  • the third granularity being 1 specifically refers to a real part or an imaginary part, and the number of the third granularity being 2 means that there may be two third granularities in the complex dimension.
  • the value of the ijkth position in the three-dimensional matrix is used to represent the i-th first granularity in the fourth dimension and the j-th granularity in the fifth dimension The real part of the channel quality at the second granularity;
  • the value of the ijkth position in the three-dimensional matrix is used to represent the i-th first granularity in the fourth dimension and the j-th granularity in the fifth dimension The imaginary part of the channel quality at the second granularity.
  • the first value is different from the second value, for example, the first value can be set to 1 and the second value can be 2, or the first value can be 0 and the second value can be 1, or the first value It can be 1 and the second value can be 0, as long as the first value is different from the second value, it is within the protection scope of this embodiment.
  • the above-mentioned second training samples can also be split and combined on the basis of the above-mentioned fourth dimension, fifth dimension, and sixth dimension.
  • the fifth dimension is an antenna pair dimension
  • the It can be split into sending antenna sub-dimensions and receiving antenna sub-dimensions, thereby expanding the dimension of the second training sample.
  • This embodiment does not exhaustively enumerate various possible sub-dimensions after splitting.
  • the network device itself performs joint training on the first preset model and the second preset model to obtain the trained first model and the second model; or the network device itself trains the first preset model
  • the model, the second preset model and the third preset module are jointly trained to obtain the trained first model, the second model and the third model.
  • the network device can at least send the trained second model.
  • the following is an example of how network devices send models:
  • the network device itself performs joint training on the first preset model and the second preset model to obtain the trained first model and the second model. After the above training is completed, the network device sends the second model. Specifically, it may be: the network device sends the second model to the terminal device. Still further, it may also be: the network device sends the model parameters of the second model to the terminal device.
  • the network device may be a network device that provides services for the terminal device, such as an access network device, and specifically may be a base station, eNB, gNB, and the like.
  • the second model (or the model parameters of the second model) is carried by at least one of the following: downlink control signaling, MAC CE message, RRC message, broadcast message, downlink data transmission, downlink data for artificial intelligence business transmission requirements transmission.
  • the network device may retain the second model by itself to process the second information to obtain channel information; correspondingly, due to the first model that the terminal device can receive, the terminal The device may process the first information based on the first model to obtain second information.
  • the network device may send the first model to all terminal devices (or at least some terminal devices) it serves.
  • the network device Taking the network device as the base station and the terminal device as the mobile phone as an example, base station 1 can serve three mobile phones, namely mobile phone 1, mobile phone 2 and mobile phone 3, and base station 1 can send the first call to mobile phone 1, mobile phone 2 and mobile phone 3 respectively Model.
  • the network device itself performs joint training on the first preset model and the second preset model to obtain the trained first model and the second model. After the above training is completed, after the network device sends the second model, the method may further include: the network device also sends the first model.
  • the network device sends the first model to the terminal device. Still further, it may also be: the network device sends the model parameters of the first model to the terminal device.
  • the network device may be a network device that provides services for the terminal device, such as an access network device, and specifically may be a base station, eNB, gNB, and the like.
  • the first model (or the model parameters of the first model) is carried by at least one of the following: downlink control signaling, MAC CE message, RRC message, broadcast message, downlink data transmission, downlink data for artificial intelligence business transmission requirements transmission.
  • first model and the second model may be sent at the same time, or the above-mentioned first model and the second model may be sent separately, which is not limited in this embodiment.
  • the terminal device After the terminal device receives the first model and the second model sent by the network device, the terminal device can perform an overall evaluation of the first model and the second model, and after completing the overall evaluation of the first model and the second model, It can be decided whether to use the first model and the second model received this time. If the overall evaluation result is poor (for example, the compression rate is low or the accuracy of recovering channel information is low, etc.), the above-mentioned first model and the second model may not be used. Second model. If the terminal device decides not to use the above-mentioned first model and the second model, it can also re-train the first model and the second model to update the model parameters of the first model and the second model, or the terminal device trains itself to obtain New first model as well as second model.
  • Second model Second model. If the terminal device decides not to use the above-mentioned first model and the second model, it can also re-train the first model and the second model to update the model parameters of the first model and the second model, or the terminal device trains
  • the terminal device if the terminal device jointly trains the first model and the second model again, or updates the first model and the second model, the terminal device also needs to send the new first model and the second model to the network device , or send the new first model to the network device.
  • the network device determines that the overall performance of the new first model and the new second model is better after receiving the new first model and the new second model, it can also replace the first model generated by itself and the second model, and synchronize the new first model and the new second model to other terminal devices served by itself. Otherwise, the network device may specify that the terminal device does not use the new first model and the new second model, so as to ensure that information exchanged between the network device and the terminal device can be correctly transmitted and parsed.
  • the network device itself performs joint training on the first preset model and the second preset model to obtain the trained first model and the second model.
  • the difference between this example and Example 2 is that after the above training is completed, on the basis of the network device sending the second model, the network device can send the estimation sub-model and the compression sub-model in the first model Model.
  • the network device may send the estimated sub-model and the compressed sub-model in the first model at the same time; or, the network device may send the estimated sub-model and the compressed sub-model in the first model respectively.
  • the network device may send the estimated sub-model and the compressed sub-model in the first model to the terminal device at the same time; or, the network device may send the estimated sub-model in the first model to the terminal device respectively.
  • Submodels and compressed submodels are examples of the network device.
  • the network device may send the model parameters of the estimation sub-model and the model parameters of the compression sub-model in the first model to the terminal device at the same time; or, the network device may send the first model parameters to the terminal device respectively.
  • Model parameters for the estimated submodel and model parameters for the compressed submodel in the model may be sent to the terminal device at the same time; or, the network device may send the first model parameters to the terminal device respectively.
  • the estimation sub-model and the compression sub-model may be carried by at least one of the following: downlink control signaling, MAC CE message, RRC message, broadcast message, downlink data transmission, downlink data transmission for artificial intelligence business transmission requirements;
  • estimation sub-model and the compression sub-model may be respectively carried by at least one of the following: downlink control signaling, MAC CE message, RRC message, broadcast message, downlink data transmission, downlink data for artificial intelligence business transmission requirements transmission.
  • the terminal device may integrate the estimated sub-model, the compressed sub-model and the second model Evaluation, after completing the overall evaluation of the estimated sub-model, compressed sub-model, and second model, you can decide whether to use the estimated sub-model, compressed sub-model, and second model received this time.
  • the specific processing method is similar to the previous example three , which will not be repeated here.
  • the network device itself performs joint training on the first preset model and the second preset model to obtain the trained first model and the second model; wherein, the first model includes an estimation sub-model, a compression sub-model and a channel generation sub-model .
  • the difference between this example and Example 3 is that after the above training is completed, on the basis of the network device sending the second model, the network device can send the estimation sub-model, compression sub-model in the first model model and the channel generation submodel.
  • the network device may simultaneously send the estimation sub-model, the compression sub-model and the channel generation sub-model in the first model; or, the network device may separately send the estimation sub-model, the compression sub-model and the channel generation sub-model submodel.
  • the network device may send the estimation sub-model, the compression sub-model and the channel generation sub-model in the first model to the terminal device at the same time; or, the network device may send the first model to the terminal device respectively.
  • the estimation sub-model, compression sub-model and channel generation sub-model may be simultaneously carried by at least one of the following: downlink control signaling, MAC CE message, RRC message, broadcast message, downlink data transmission, downlink for artificial intelligence business transmission requirements data transmission;
  • estimation sub-model, compression sub-model and channel generation sub-model may be respectively carried by at least one of the following: downlink control signaling, MAC CE message, RRC message, broadcast message, downlink data transmission, transmission requirements for artificial intelligence services downlink data transmission.
  • any two of the estimation submodel, the compression submodel, and the channel generation submodel may be simultaneously carried by at least one of the following: downlink control signaling, MAC CE message, RRC message, broadcast message, downlink data transmission, for Downlink data transmission required for artificial intelligence business transmission.
  • the remaining sub-model can be carried by at least one of the following: downlink control signaling, MAC CE message, RRC message, broadcast message, downlink data transmission, downlink data transmission for artificial intelligence business transmission requirements.
  • the terminal device may perform the estimation sub-model, the compressed sub-model and the channel generation sub-model
  • the overall evaluation is performed on the model and the second model, and the processing after the overall evaluation is similar to that of Example 3 above, and will not be repeated here.
  • the network device itself performs joint training on the first preset model, the second preset model and the third preset model to obtain the trained first model, the second model and the third model. After the above training is completed, on the basis that the terminal device sends the second model and the first model, the network device may send the third model.
  • the network device may send the first model, the second model and the third model to the terminal device at the same time; or, the network device may send the first model and the second model to the terminal device respectively and the third model; or, the network device may first send any two of the first model, the second model, and the third model to the terminal device, and then send the remaining one model to the terminal device.
  • the first model, the second model and the third model may be simultaneously carried by at least one of the following: downlink control signaling, MAC CE message, RRC message, broadcast message, downlink data transmission, downlink data for artificial intelligence business transmission requirements transmission;
  • the first model, the second model and the third model may be respectively carried by at least one of the following: downlink control signaling, MAC CE message, RRC message, broadcast message, downlink data transmission, and information for artificial intelligence business transmission requirements downlink data transmission;
  • any two of the first model, the second model and the third model and the remaining one model are respectively carried by at least one of the following: downlink control signaling, MAC CE message, RRC message, broadcast message, downlink data Transmission, downlink data transmission for artificial intelligence business transmission requirements.
  • the first model may include an estimation sub-model and a compression sub-model.
  • sending the first model may refer to simultaneously or separately sending the estimation sub-model and the compression sub-model.
  • the carrying manner of the estimation sub-model and the compression sub-model is the same as that of the previous example and will not be repeated here.
  • the first model may include an estimation submodel, a channel generation submodel, and a compression submodel.
  • sending the first model may refer to sending the estimation submodel, the The channel generation sub-model and the compression sub-model, the carrying manners of the estimation sub-model, the channel generation sub-model and the compression sub-model are the same as the previous examples and will not be repeated here.
  • the terminal device may perform an overall evaluation on the first model, the second model and the third model, specifically The processing is similar to the foregoing example three, and repeated descriptions are not repeated.
  • the network device is a base station, which may specifically be:
  • S1201 The base station trains to obtain the first model and the second model.
  • the base station uses training samples as the main body to jointly train the first preset model and the second preset model to obtain the first model and the second model; wherein, the first model includes an estimation sub-model and a compression sub-model; or, the first The model contains an estimation submodel, a channel generation submodel, and a compression submodel.
  • the base station uses training samples as the main body to train the first preset model, the second preset model and the third preset model to obtain the first model, the second model and the third model; wherein, the first model includes An estimation submodel and a compression submodel; alternatively, the first model comprises an estimation submodel, a channel generation submodel and a compression submodel.
  • the base station sends the first model to the terminal device.
  • the terminal device may receive the first model sent by the base station.
  • This step may refer to: the base station transmits at least the first model to the terminal device; or it may be: the base station transmits the estimated sub-model and the compressed sub-model to the terminal device; or it may be: the The base station transmits the estimation sub-model, the channel generation sub-model and the compression sub-model to the terminal device.
  • the sub-models to be obtained by training determined during training of the base station relevant For example, if the base station obtains the two submodels of the estimated submodel and the compressed submodel through training, the base station may transmit to the terminal device the first model including the two submodels of the estimated submodel and the compressed submodel, or, the The base station directly transmits the two sub-models, the estimated sub-model and the compressed sub-model, to the terminal equipment.
  • this step may also include: the base station may send the second model to the terminal device.
  • the terminal device may receive the second model sent by the base station.
  • this step may further include: the base station transmitting the third model to the terminal device.
  • the terminal device may receive the third model sent by the base station.
  • the base station sends first information; correspondingly, the terminal device receives the first information.
  • the first information may be a downlink reference signal, specifically a downlink reference signal of the current channel, such as SSB or CSI-RS, and this example does not limit its specific content.
  • the terminal device processes the first information based on the first model to obtain second information.
  • the second information is compressed channel information output after being processed by an estimation sub-model and a compression sub-model.
  • the channel information can be expressed as a matrix.
  • the specific processing of the estimation sub-model and the compression sub-model in the first model is the same as the foregoing embodiment, and will not be repeated here.
  • the second information is the characteristics of the compressed channel information output after being processed by the estimation sub-model, channel generation sub-model and compression sub-model vector information.
  • the eigenvector information of the compressed channel information can be expressed as a matrix. Regarding the specific description of the matrix, the specific processing of the estimation sub-model and the compression sub-model in the first model is the same as that of the foregoing embodiment, and will not be repeated here.
  • the terminal device sends the second information; correspondingly, the base station receives the second information.
  • the base station processes the second information based on the second model to obtain channel information.
  • the channel information may specifically be a matrix representing channel information.
  • the matrix representing channel information has been described in detail in the foregoing embodiments, and here Do not repeat.
  • the channel information may specifically be the eigenvector information of the channel information, and the eigenvector information of the channel information has been described in detail in the foregoing embodiments, and will not be repeated here. .
  • the network device is a base station, and may specifically be:
  • the terminal device trains to obtain the first model and the second model
  • the terminal device is the main body and uses training samples to jointly train the first preset model and the second preset model to obtain the first model and the second model; wherein, the first model includes an estimation sub-model and a compression sub-model; or, the second A model includes an estimation submodel, a channel generation submodel and a compression submodel.
  • the terminal device uses training samples as the main body to train the first preset model, the second preset model and the third preset model to obtain the first model, the second model and the third model; wherein, the first model An estimation submodel and a compression submodel are included; alternatively, the first model includes an estimation submodel, a channel generation submodel, and a compression submodel.
  • the terminal device sends the second model to the base station.
  • the base station may receive the second model sent by the terminal device.
  • This step may refer to: the terminal device at least transmits the second model to a base station.
  • this step may also include: the terminal device sending the first model to the base station.
  • the base station may receive the first model sent by the terminal device.
  • the processing method may be the same as that provided in the foregoing embodiment, and the description will not be repeated in this example.
  • this step may further include: the terminal device transmitting the third model to the base station.
  • the base station may receive the third model sent by the terminal device.
  • the base station sends first information; correspondingly, the terminal device receives the first information.
  • the first information may specifically be a downlink reference signal, such as an SSB or a CSI-RS, and this example does not limit its specific content.
  • the terminal device processes the first information based on the first model to obtain second information.
  • the second information is compressed channel information output after being processed by an estimation sub-model and a compression sub-model.
  • the channel information can be expressed as a matrix.
  • the specific processing of the estimation sub-model and the compression sub-model in the first model is the same as the foregoing embodiment, and will not be repeated here.
  • the second information is the eigenvector information of the compressed channel information output after being processed by the estimation sub-model, the channel generation sub-model and the compression sub-model.
  • the eigenvector information of the compressed channel information can be expressed as a matrix.
  • the specific processing of the estimation sub-model and the compression sub-model in the first model is the same as that of the foregoing embodiment, and will not be repeated here.
  • the terminal device sends the second information; correspondingly, the base station receives the second information.
  • the base station processes the second information based on the second model to obtain channel information.
  • the channel information may specifically be a matrix representing channel information.
  • the matrix representing channel information has been described in detail in the foregoing embodiments, and here Do not repeat.
  • the channel information may specifically be the eigenvector information of the channel information, and the eigenvector information of the channel information has been described in detail in the foregoing embodiments, and will not be repeated here. .
  • the difference between the example shown in FIG. 13 and the aforementioned example in FIG. 12 also includes: since one base station can communicate with multiple terminal devices, the base station side may have received the second model from multiple terminal devices. In this case, the base station side may use the second model sent by the terminal device to process the second information sent by the terminal device for each terminal device to obtain channel information. Alternatively, the base station may designate one of the multiple second models sent from multiple terminal devices as the target second model, and at least send the target first model corresponding to the target second model to other terminal devices , so that all terminal devices connected or served by the base station use the same target first model and target second model for subsequent processing, which can also reduce the time consumption of searching for different second models of terminal devices on the base station side.
  • the terminal device when the terminal device receives the first information, it can process the first information through the first model to obtain the second information and send it, so that the receiving end can use the second model to obtain the second information.
  • the channel information obtained by processing the information is obtained through joint training of the first model and the second model. Since the processing, transmission, and analysis of the second information are realized by using the first model and the second model obtained through joint training, the performance requirements in the entire information processing, transmission, and analysis can be taken into account, and the overall performance of the network is guaranteed.
  • the functions between the first model and the second model can be made compatible with each other, so that the performance of the first model and the second model can reach In a better state, when the processing, transmission and analysis process of the second information is processed as a whole based on the first model and the second model, the performance of the whole processing can be guaranteed, thereby ensuring the performance of the whole network.
  • Fig. 14 is a schematic flowchart of a model generation method 1400 according to an embodiment of the present application.
  • the method can optionally be applied to the system shown in Fig. 1, but is not limited thereto.
  • the method includes at least some of the following.
  • the first model is the first preset model after training, and the second model is the second preset model after training; the first model is used to process the first information to obtain Second information; the second model is used to process the second information to obtain channel information.
  • the model generation method provided in this embodiment can be applied to electronic equipment, and the electronic equipment can be a network equipment or a terminal equipment; the network equipment can be a server, an access network equipment, etc.; the terminal equipment can be a smart Phones, tablets, laptops, desktops (or desktop computers), and more. That is to say, any electronic device capable of data processing can execute the model generation method provided in this embodiment.
  • the training may use the first loss function or the second loss function.
  • the following describes the training using the above two loss functions:
  • the loss function used in the training is a first loss function; the first loss function is based on the difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model degree of difference is constructed.
  • the degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model is determined based on a distance, or determined based on a degree of similarity.
  • the specific calculation method for determining the degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model based on the distance can use mean square error (MSE, Mean Squared Error ) or normalized mean square error (NMSE), etc., which are not exhaustive in this embodiment.
  • MSE mean square error
  • NMSE normalized mean square error
  • the output information of the second preset model may be a matrix
  • the input information of the compressed preset sub-model may also be a matrix
  • the output matrix of the second preset model It is called matrix 1
  • the matrix of the input of the compressed preset submodel is called matrix 2
  • the output information of the second preset model and the compressed preset submodel of the first preset model are determined based on the distance
  • the way of the degree of difference between the input information is the MSE way, for example, its calculation may include: calculating the difference between matrix 1 and matrix 2, and taking the square of the difference as the difference degree.
  • the specific calculation method for determining the degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model based on the degree of similarity may use cosine similarity or cosine similarity squared etc., which are not exhaustive in this embodiment.
  • the output information of the second preset model may be R sets of feature vector sequence information
  • the input information of the compressed preset sub-model may also be R sets of feature vector sequence information.
  • the The R sets of feature vector sequence information output by the second preset model are called feature vector sequence 1
  • the R sets of feature vector sequence information input by the compressed preset sub-model are called feature vector sequence 2.
  • the method of determining the degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model based on the degree of similarity may be cosine similarity, for example, its calculation may include: The cosine angle between the eigenvector sequence 1 and the eigenvector sequence 2 is used to determine the degree of similarity, and the degree of similarity is used as the degree of difference.
  • the first preset model includes an estimation preset sub-model and a compression preset sub-model.
  • FIG. 8 a it illustrates a first preset model 800 , a second preset model 810 , and an estimated preset sub-model 801 and a compressed preset sub-model 802 included in the first preset model 800 .
  • the above-mentioned first preset model 800, the second preset model 810, and the input-output relationship between the estimated preset sub-model 801 and the compressed preset sub-model 802 contained in the first preset model 800 can be: estimated The input information of the preset submodel 801 is the first training sample 920; the output information of the estimated preset submodel 801 is used as the input information of the compressed preset submodel 802; the output information of the compressed preset submodel 802 As the input information of the second preset model 810 .
  • the joint training of the first preset model and the second preset model by using training samples includes:
  • the first training samples may be reference signal samples.
  • the reference signal samples may be original reference signals or processed reference signals obtained through historical acquisition. More specifically, the reference signal samples may be downlink reference signal samples. It should be understood that this embodiment does not limit that the first training samples must be the downlink reference signal samples, and uplink reference signal samples or other reference signal samples may also be used, which are not exhaustive in this embodiment.
  • the information input into the estimated preset sub-model of the first preset model may also be wireless channel or other scene-related information, for example, may include at least one of the following: Channel signal-to-noise ratio, signal-to-interference-noise ratio, channel type, bandwidth information, delay information, etc. Whether one or more of the above information is input during the joint training process may be determined according to actual conditions or actual scenarios, and is not limited here.
  • a specific function of the estimation preset sub-model of the first preset model may be: perform channel estimation based on the first training samples to obtain initial information.
  • the channel estimation may adopt algorithms such as minimum mean square error (MMSE).
  • the aforementioned initial information may be a matrix, and the dimension of the matrix is not limited here, and may be a matrix of two or more dimensions.
  • the value at each position in the matrix is used to represent the corresponding channel quality at the corresponding granularity corresponding to multiple dimensions.
  • the channel quality may be characterized by a signal strength value; the unit of the signal strength value may be dBm, or the signal strength value has no unit but a value obtained after normalization.
  • the compressed preset sub-model of the first preset model compresses the data volume of the input information.
  • the compression rate between the output information of the compressed sub-model and the input information can be determined during training, for example, the compression rate can be 5/1000, 2/1000, 10%, etc. lift.
  • the compressed information obtained by compressing the preset sub-model contains less data than the input initial information.
  • the form of the above-mentioned compressed information is the same as that of the initial information, for example, the initial information is a matrix, and the corresponding compressed information is also a matrix, and the matrix dimensions of the initial information and the compressed information are the same but the amount of data is different.
  • the function of the second preset model may be to decompress its input information.
  • the input information of the second preset model is the compressed information
  • the output of the second preset model is the restored information.
  • the decompression rate of the second preset model should make the obtained restored information contain the same data content as the original information.
  • the performing reverse conduction update of the first preset model and the second preset model based on the first loss function may specifically refer to performing reverse conduction update based on the degree of difference determined by the first loss function.
  • the model parameters of the estimated preset sub-model, the model parameters of the compressed preset sub-model, and the model parameters of the second preset model may specifically refer to performing reverse conduction update based on the degree of difference determined by the first loss function.
  • the manner of the above training convergence may include at least one of the following: judging whether the number of iterative training reaches a preset number, and judging whether the degree of difference is smaller than a preset threshold.
  • the preset number of times and the preset threshold value can be set according to actual conditions, and are not exhaustive. That is to say, when it is determined that the training is completed based on the above manner, the first preset model after the training can be used as the first model, and the second preset model after the training can be used as the second model.
  • the first preset model includes an estimation preset submodel, a preset channel generation submodel, and a compression preset submodel.
  • FIG. 8b it shows a first preset model 800, a second preset model 810, and the estimation preset sub-model 801, compression preset sub-model 802 and channel generation preset included in the first preset model 800.
  • Set sub-model 803 it shows a first preset model 800, a second preset model 810, and the estimation preset sub-model 801, compression preset sub-model 802 and channel generation preset included in the first preset model 800.
  • first preset model 800 second preset model 810, and the estimation preset submodel 801
  • the input-output relationship of can be as follows: the input information of the estimated preset sub-model 801 is the first training sample 920; the output information of the estimated preset sub-model 801 is used as the input information of the channel generation preset sub-model 803; The output information of the channel generation preset submodel 803 is used as the input information of the compression preset submodel 802 ; the output information of the compression preset submodel 802 is used as the input information of the second preset model 810 .
  • Using training samples to jointly train the first preset model and the second preset model may include:
  • the information input into the estimated preset sub-model of the first preset model may also be wireless channel or other scene-related information, for example, may include at least one of the following: Channel signal-to-noise ratio, signal-to-interference-noise ratio, channel type, bandwidth information, delay information, etc. Whether one or more of the above information is input during the joint training process may be determined according to actual conditions or actual scenarios, and is not limited here.
  • the specific function of the estimation preset sub-model of the first preset model may be: perform channel estimation based on the first training samples to obtain initial information.
  • the channel estimation may adopt algorithms such as minimum mean square error (MMSE).
  • the initial information output by the estimated preset sub-model above may be a matrix, and the dimension of the matrix is not limited here, and may be a two-dimensional or more dimensional matrix.
  • the value at each position in the matrix is used to represent the corresponding channel quality at the corresponding granularity corresponding to multiple dimensions.
  • the channel quality may be characterized by a signal strength value; the unit of the signal strength value may be dBm, or the signal strength value has no unit but a value obtained after normalization.
  • a function of the channel generation preset submodel may be to perform eigendecomposition on the initial information to obtain eigenvector information of the initial information.
  • the eigenvector information of the initial information may include R groups of eigenvector sequences.
  • the method of performing eigendecomposition on the initial information may adopt a method of Singular Value Decomposition (SVD, Singular Value Decomposition).
  • the compressed preset sub-model of the first preset model compresses the data volume of the input information.
  • the compression rate between the output information of the compressed sub-model and the input information can be determined during training, for example, the compression rate can be 5/1000, 2/1000, 10%, etc. lift.
  • the compressed eigenvector information obtained by compressing the preset sub-model contains less data than the eigenvector information of the input initial information.
  • the above compressed feature vector information is in the same form as the feature vector information of the initial information.
  • the feature vector information of the initial information is a sequence of R groups of feature vectors
  • the compressed feature vector information is also a sequence of feature vectors of groups R but The amount of data contained in the two is different.
  • the function of the second preset model may be to decompress its input information.
  • the input information of the second preset model is compressed feature vector information
  • the output of the second preset model is restored feature vector information.
  • the decompression rate of the second preset model should be such that the obtained restored feature vector information contains the same or substantially the same data as the feature vector information of the initial information.
  • the performing reverse conduction to update the first preset model and the second preset model based on the degree of difference determined by the first loss function may specifically refer to: performing a reverse conduction based on the degree of difference determined by the first loss function performing reverse conduction to update model parameters of the estimated preset submodel, model parameters of the channel generation preset submodel, model parameters of the compression preset submodel, and model parameters of the second preset model.
  • the method for determining the convergence of the above training is the same as that of the above-mentioned case 1, and repeated explanations are not repeated.
  • the training sample is used to jointly train the first preset model and the second preset model to obtain the trained first model and the second model, including:
  • the first preset model uses training samples to jointly train the first preset model, the second preset model and the third preset model to obtain the trained first model, the second model and the third model; wherein, the first preset model
  • the three models are the third preset models after training.
  • FIG. 8c it shows a first preset model 800, a second preset model 810, a third preset model 830, and the estimated preset sub-model 801 contained in the first preset model 800, the compression preset Submodel 802.
  • the above-mentioned first preset model 800, the second preset model 810, and the input-output relationship between the estimated preset sub-model 801 and the compressed preset sub-model 802 contained in the first preset model 800 can be: estimated The input information of the preset submodel 801 is the first training sample 920; the output information of the estimated preset submodel 801 is used as the input information of the compressed preset submodel 802; the output information of the compressed preset submodel 802 is used as the The input information of the third preset model 830; the output information of the third preset model is used as the output information of the second preset model 810.
  • the information input into the estimated preset sub-model of the first preset model can also be wireless channel or other information related to the scene, for example, it can include at least one of the following: channel Signal-to-noise ratio, signal-to-interference-noise ratio, channel type, bandwidth information, delay information, etc.
  • channel Signal-to-noise ratio signal-to-interference-noise ratio
  • channel type bandwidth information
  • delay information etc.
  • estimation preset sub-model of the first preset model and the compression preset sub-model of the first preset model are the same as those in the first case, so the description will not be repeated.
  • a third preset model is added relative to the first case.
  • the function of the third preset model is to simulate the channel environment.
  • the specific processing can be to perform data conversion on the input information to obtain the information after data conversion as Output information.
  • the specific processing method of the data transformation may include convolution processing or data processing equivalent to convolution; wherein, the data processing equivalent to convolution may be multiple Fourier transform processing, for example, may In order to transform the input information of the third preset model into the frequency domain through Fourier transform, multiply and then transform into the time domain through inverse Fourier transform, the process of convolution in the time domain is equivalent.
  • the function of the second preset model may be to decompress its input information.
  • the input information of the second preset model is transformed information, and the output of the second preset model is restored information.
  • the decompression rate of the second preset model should make the obtained restored information contain the same data as the original information.
  • the updating of the first preset model, the second preset model, and the third preset model based on the first loss function may specifically refer to: performing reverse conduction based on the first loss function Conductively updating model parameters of the estimated preset sub-model, model parameters of the compressed preset sub-model, model parameters of the second preset model, and model parameters of the third preset model.
  • Case 4 is different from the above case 3 in that the first preset model includes an estimation preset sub-model, a preset channel generation sub-model, and a compression preset sub-model.
  • FIG. 8d it shows a first preset model 800, a second preset model 810, a third preset model 830, and the estimated preset sub-model 801 contained in the first preset model 800, the compression preset Submodel 802 and channel generation preset submodel 803 .
  • the input-output relationship between the preset sub-models 803 may be: the input information of the estimated preset sub-model 801 is the first training sample 920; the output information of the estimated preset sub-model 801 is used as the channel generation preset sub-model The input information of 803; the output information of the channel generation preset submodel 803 is used as the input information of the compression preset submodel 802; the output information of the compression preset submodel 802 is used as the input of the third preset model 830 information; the output information of the third preset model 830 is used as the input information of the second preset model 810 .
  • the specific description about the first training sample is the same as any one of the foregoing case 1, case 2, and case 3, so no repeated description is given.
  • the specific function of the estimated preset sub-model of the first preset model is the same as any one of the foregoing case 1, case 2, and case 3.
  • a function of the channel generation preset submodel may be to perform eigendecomposition on the initial information to obtain eigenvector information of the initial information.
  • the eigenvector information of the initial information may include R groups of eigenvector sequences.
  • the method of performing eigendecomposition on the initial information may adopt a method of Singular Value Decomposition (SVD, Singular Value Decomposition).
  • the compressed preset sub-model of the first preset model compresses the data volume of the input information.
  • the compression rate between the output information of the compressed sub-model and the input information can be determined during training, for example, the compression rate can be 5/1000, 2/1000, 10%, etc. lift.
  • the function of the third preset model is to simulate the wireless channel environment, and the specific processing may be to perform data transformation on the input information to obtain the information after data transformation as output information.
  • the specific processing method of the data transformation may include convolution processing or data processing equivalent to convolution; wherein, the data processing equivalent to convolution may be multiple Fourier transform processing, for example, may In order to transform the input information of the third preset model into the frequency domain through Fourier transform, multiply and then transform into the time domain through inverse Fourier transform, the process of convolution in the time domain is equivalent.
  • the function of the second preset model may be to decompress its input information.
  • the input information of the second preset model is transformed feature vector information, and the output of the second preset model is restored feature vector information.
  • the decompression rate of the second preset model should make the obtained restored feature vector information and the feature vector information of the initial information contain close to or the same data.
  • the performing reverse conduction update based on the degree of difference determined by the first loss function to update the first preset model, the second preset model, and the third preset model may specifically refer to: based on the first The degree of difference determined by a loss function is used to perform reverse conduction to update the model parameters of the estimated preset sub-model, the model parameters of the channel generation preset sub-model, the model parameters of the compressed preset sub-model, the The model parameters of the second preset model and the model parameters of the third preset model.
  • the scenario where the first loss function is used for joint training is described above.
  • the scenario where the second loss function is used for joint training can also be provided, as follows:
  • the loss function used in the training is a second loss function; the second loss function is based on the difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model The first difference degree of the first preset model and the second difference degree between the output information of the estimated preset sub-model of the first preset model and the second training sample; wherein, the second training sample and the input of the estimated Corresponds to the first training sample of the preset sub-model.
  • the first degree of difference is determined based on a distance, or is determined based on a degree of similarity; and/or, the second degree of difference is determined based on a distance, or is determined based on a degree of similarity.
  • the specific calculation method for determining the first degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model based on the distance can use a mean square error (MSE, Mean Squared Error) or normalized mean square error (NMSE), etc., this embodiment is not exhaustive.
  • MSE mean square error
  • NMSE normalized mean square error
  • the output information of the second preset model may be a matrix
  • the input information of the compressed preset sub-model may also be a matrix
  • the output matrix of the second preset model It is called matrix 3
  • the matrix of the input of the compressed preset submodel is called matrix 4
  • the output information of the second preset model and the compressed preset submodel of the first preset model are determined based on the distance
  • the way of the degree of difference between the input information is the MSE way, for example: calculate the difference between matrix 3 and matrix 4, and use the square of the difference as the degree of difference.
  • the specific calculation method for determining the first degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model based on the degree of similarity may use cosine similarity or cosine similarity Degree square and other methods are not exhaustive in this embodiment.
  • the output information of the second preset model may be R sets of feature vector sequence information
  • the input information of the compressed preset sub-model may also be R sets of feature vector sequence information.
  • the The R group of feature vector sequence information output by the second preset model is called feature vector sequence 3
  • the R group of feature vector sequence information input by the compressed preset sub-model is called feature vector sequence 4.
  • the method of determining the degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model based on the degree of similarity may be cosine similarity, for example: feature vector sequence 3 and the cosine angle of the eigenvector sequence 4 to determine the degree of similarity, and use the degree of similarity as the degree of difference.
  • the specific calculation method for determining the second degree of difference between the output information of the estimated preset sub-model of the first preset model based on the distance and the second training sample can use mean square error (MSE, Mean Squared Error) or normalization
  • MSE mean square error
  • NMSE normalized mean square error
  • the output information of the estimated preset sub-model may be a matrix, and correspondingly, the second training sample may also be a matrix.
  • the output matrix of the estimated preset sub-model is called matrix 5
  • the matrix of the input of the compressed preset sub-model is called matrix 6
  • the second degree of difference between the output information of the estimated preset sub-model of the first preset model and the second training sample is determined based on the distance In the MSE mode, for example: calculate the difference between matrix 5 and matrix 6, and use the square of the difference as the degree of difference.
  • the specific calculation method can use cosine similarity or cosine similarity squared, etc.
  • the methods are not exhaustive in this embodiment.
  • the method of connection may be to add the weights of the first degree of difference and the second degree of difference, for example, the two each account for 50%; or , the joint method can be the addition of unequal weights between the first difference degree and the second difference degree.
  • the difference before and after the compression and recovery between the two can be assigned a greater weight, or the above-mentioned second degree of difference can be assigned a larger weight, that is, the accuracy of the output information of the above-mentioned estimated preset sub-model is assigned a larger weight; or, its combination
  • the method can be in the form of multiplying the first degree of difference and the second degree of difference; or the joint method can be that the first degree of difference and the second degree of difference can be calculated by cross entropy, such as p1*log(first degree of difference)+p2*log (the second degree of difference), where both p1 and p2 can be set according to actual conditions, and are not limited here.
  • the terminal device uses training samples to jointly train the first preset model and the second preset model to obtain the trained first model and the second model; wherein, the first preset model includes estimated preset sub-models Model and compression preset submodels.
  • composition of each model and the input-output relationship between each model are the same as the previous case 1.
  • FIG. 8 a please refer to FIG. 8 a , which will not be repeated here.
  • training samples to jointly train the first preset model and the second preset model including:
  • the first training samples may be reference signal samples.
  • the reference signal sample may be an original reference signal obtained through historical acquisition, or a processed reference signal. More specifically, the reference signal samples may be downlink reference signal samples. It should be understood that this embodiment does not limit that the first training samples must be the downlink reference signal samples, and uplink reference signal samples or other reference signal samples may also be used, which are not exhaustive in this embodiment.
  • the information input into the estimated preset sub-model of the first preset model may also be wireless channel or other scene-related information, for example, may include at least one of the following: Channel signal-to-noise ratio, signal-to-interference-noise ratio, channel type, bandwidth information, delay information, etc. Whether one or more of the above information is input during the joint training process may be determined according to actual conditions or actual scenarios, and is not limited here.
  • the specific function of the estimation preset sub-model of the first preset model may be: perform channel estimation based on the first training samples to obtain initial information.
  • the channel estimation may adopt algorithms such as minimum mean square error (MMSE).
  • the aforementioned initial information may be a matrix, and the dimension of the matrix is not limited here, and may be a matrix of two or more dimensions.
  • the value at each position in the matrix is used to represent the corresponding channel quality at the corresponding granularity corresponding to multiple dimensions.
  • the channel quality may be characterized by a signal strength value; the unit of the signal strength value may be dBm, or the signal strength value has no unit but a value obtained after normalization.
  • the compressed preset sub-model of the first preset model compresses the data volume of the input information.
  • the compression rate between the output information of the compressed sub-model and the input information can be determined during training, for example, the compression rate can be 5/1000, 2/1000, 10%, etc. lift.
  • the compressed information obtained by compressing the preset sub-model contains less data than the input initial information.
  • the form of the above-mentioned compressed information is the same as that of the initial information, for example, the initial information is a matrix, and the corresponding compressed information is also a matrix, and the matrix dimensions of the initial information and the compressed information are the same but the amount of data is different.
  • the function of the second preset model may be to decompress its input information.
  • the decompression rate of the second preset model should make the obtained restored information contain the same data as the original information.
  • the performing reverse conduction based on the second loss function to update the first preset model and the second preset model may specifically refer to: performing reverse conduction based on the second loss function to update the estimated preset sub-model , the model parameters of the compressed preset sub-model and the model parameters of the second preset model.
  • the way of the above-mentioned training convergence can include at least one of the following: judging whether the number of iterative training reaches the preset number of times, judging whether the first difference degree is less than the first preset threshold value, judging the second difference Whether the degree is smaller than the second preset threshold value.
  • the preset times, the first preset threshold value and the second preset threshold value can be set according to actual conditions, and are not exhaustive. That is to say, when it is determined that the training is completed based on the above manner, the first preset model after the training can be used as the first model, and the second preset model after the training can be used as the second model.
  • Case 6 is different from Case 5 in that the first preset model includes an estimation preset sub-model, a preset channel generation sub-model, and a compression preset sub-model.
  • composition of each model in this case and the input-output relationship between each model are the same as those in the second case, for details, please refer to FIG. 8 b , which will not be repeated here.
  • training samples to jointly train the first preset model and the second preset model including:
  • the second training sample corresponds to the first training sample
  • the information input into the estimated preset sub-model of the first preset model may also include other information related to wireless channels or scenes, for example, may include at least one of the following: Signal-to-noise ratio, signal-to-interference-noise ratio, channel type, bandwidth information, delay information, etc. Whether one or more of the above information is input during the joint training process may be determined according to actual conditions or actual scenarios, and is not limited here.
  • the specific function of the estimation preset sub-model of the first preset model is the same as the fifth case above.
  • a function of the channel generation preset submodel may be to perform eigendecomposition on the initial information to obtain eigenvector information of the initial information.
  • the eigenvector information of the initial information may include R groups of eigenvector sequences.
  • the method of performing eigendecomposition on the initial information may adopt a method of Singular Value Decomposition (SVD, Singular Value Decomposition).
  • the compressed preset sub-model of the first preset model compresses the data volume of the input information.
  • the compression rate between the output information of the compressed sub-model and the input information can be determined during training, for example, the compression rate can be 5/1000, 2/1000, 10%, etc. lift.
  • the compressed eigenvector information obtained by compressing the preset sub-model contains less data than the eigenvector information of the input initial information.
  • the above compressed feature vector information is in the same form as the feature vector information of the initial information.
  • the feature vector information of the initial information is a sequence of R groups of feature vectors
  • the compressed feature vector information is also a sequence of feature vectors of groups R but The amount of data contained in the two is different.
  • the function of the second preset model may be to decompress its input information.
  • the input information of the second preset model is compressed feature vector information
  • the output of the second preset model is restored feature vector information.
  • the decompression rate of the second preset model should be such that the obtained restored feature vector information contains the same or substantially the same data as the feature vector information of the initial information.
  • Performing reverse conduction according to the second loss function to update the first preset model and the second preset model may specifically refer to: performing reverse conduction based on the second loss function to update the estimated preset sub-model model parameters of the channel generation preset sub-model, model parameters of the compression preset sub-model and model parameters of the second preset model.
  • the method for determining the above-mentioned training convergence is the same as that of the above-mentioned case five, and repeated explanations are not repeated.
  • Case 7 using training samples to jointly train the first preset model and the second preset model to obtain the trained first model and the second model, including:
  • the first preset model uses training samples to jointly train the first preset model, the second preset model and the third preset model to obtain the trained first model, the second model and the third model; wherein, the first preset model
  • the three models are the third preset models after training.
  • composition of each model in this case and the input-output relationship between each model are the same as those in the third case above, which can be referred to FIG. 8c, and repeated explanations are not repeated here.
  • the specific description about the first training sample is the same as the foregoing case five or six, so no repeated description is given.
  • estimation preset sub-model of the first preset model and the compression preset sub-model of the first preset model are the same as those of the fifth case above, so repeated descriptions will not be made.
  • a third preset model is added relative to case five.
  • the function of the third preset model is to simulate the channel environment, and the specific processing can be to perform data transformation on input information to obtain data transformed information as output information .
  • the specific processing method of the data transformation may include convolution processing or data processing equivalent to convolution; wherein, the data processing equivalent to convolution may be multiple Fourier transform processing, for example, may In order to transform the input information of the third preset model into the frequency domain through Fourier transform, multiply and then transform into the time domain through inverse Fourier transform, the process of convolution in the time domain is equivalent.
  • the function of the second preset model may be to decompress its input information.
  • the performing reverse conduction update based on the degree of difference determined by the second loss function to update the first preset model, the second preset model, and the third preset model may specifically refer to: based on the first
  • the second loss function performs reverse conduction to update the model parameters of the estimated preset sub-model, the model parameters of the compressed preset sub-model, the model parameters of the second preset model, and the model of the third preset model parameter.
  • Case 8 is different from the above case 7 in that the first preset model includes an estimation preset submodel, a preset channel generation submodel, and a compression preset submodel.
  • the second training sample corresponds to the first training sample
  • composition of each model and the input-output relationship between each model are the same as the foregoing case 4, which can be referred to FIG. 8d , and repeated descriptions are not repeated here.
  • the specific description about the first training sample is the same as any one of the above-mentioned case 5, case 6, and case 7, so the description will not be repeated.
  • the specific function of the estimated preset sub-model of the first preset model is the same as any one of the fifth, sixth, and seventh cases.
  • a function of the channel generation preset submodel may be to perform eigendecomposition on the initial information to obtain eigenvector information of the initial information.
  • the eigenvector information of the initial information may include R groups of eigenvector sequences.
  • the method of performing eigendecomposition on the initial information may adopt a method of Singular Value Decomposition (SVD, Singular Value Decomposition).
  • the compressed preset sub-model of the first preset model compresses the data volume of the input information.
  • the compression rate between the output information of the compressed sub-model and the input information can be determined during training, for example, the compression rate can be 5/1000, 2/1000, 10%, etc. lift.
  • the function of the third preset model is to simulate the wireless channel environment, and the specific processing may be to perform data transformation on input information to obtain information after data transformation as output information.
  • the specific processing method of the data transformation may include convolution processing or data processing equivalent to convolution; wherein, the data processing equivalent to convolution may be multiple Fourier transform processing, for example, may In order to transform the input information of the third preset model into the frequency domain through Fourier transform, multiply and then transform into the time domain through inverse Fourier transform, the process of convolution in the time domain is equivalent.
  • the function of the second preset model may be to decompress its input information.
  • the input information of the second preset model is transformed feature vector information, and the output of the second preset model is restored feature vector information.
  • the decompression rate of the second preset model should be such that the obtained restored feature vector information and the feature vector information of the initial information contain data that are close to or identical.
  • the performing reverse conduction update based on the degree of difference determined by the first loss function to update the first preset model, the second preset model, and the third preset model may specifically refer to: based on the first The degree of difference determined by a loss function is used to perform reverse conduction to update the model parameters of the estimated preset sub-model, the model parameters of the channel generation preset sub-model, the model parameters of the compressed preset sub-model, the The model parameters of the second preset model and the model parameters of the third preset model.
  • the first model and the second model after joint training, or the first model, the second model and the third model after joint training can be obtained.
  • the training samples are used, and the training samples are described in detail below :
  • the training samples may include a first training sample.
  • the first training samples may be reference signal samples.
  • the reference signal samples may be original reference signals or processed reference signals obtained through historical acquisition.
  • the original reference signal may refer to a reference signal that has not been transmitted through a wireless channel.
  • the method for acquiring and processing the reference signal may include: using the reference signal received after the original reference signal passes through the wireless channel (or the real wireless channel, or the real wireless channel) as the processed reference signal.
  • the method for obtaining and processing the reference signal may include: using the reference signal received after the original reference signal passes through the simulated wireless channel as the processed reference signal.
  • the original reference signal may be a downlink reference signal or an uplink reference signal.
  • the first training samples are distributed in the first dimension and/or the second dimension.
  • the first dimension is a time domain dimension; the first training samples include first information samples distributed in m time units in the time domain dimension; m is a positive integer.
  • first information samples may be distributed in each of the m time units, where n is a positive integer.
  • Each time unit may include at least one time slot, or at least one symbol (such as an OFDM symbol).
  • the first information sample is a downlink reference signal sample
  • the number of time slots contained in each time unit may be c (c is a positive integer)
  • n downlink reference signals in each c time slot A sample, combination of c and n can be e.g. (1,1)(1,2)(1,3)(1,4)(1,6)(2,1)(4,1)(5,1) (8,1)(10,1).
  • the second dimension is a frequency domain dimension; the first training samples include first information samples distributed on x frequency domain resources in the frequency domain dimension; x is a positive integer.
  • y first information samples may be distributed in each of the x frequency domain resources, and y is a positive integer.
  • Each frequency domain resource may include at least one resource block (RB), or at least one subcarrier.
  • the first information sample is a downlink reference signal sample
  • the number of time slots contained in each frequency domain resource may be d (d is a positive integer)
  • d is a positive integer
  • y time slots in every d RBs in the frequency domain
  • the combination of d and y can be, for example, (1,1)(1,2)(2,1)(1,3)(3,1)(1,4)(4,1)(1 ,6)(6,1).
  • the above-mentioned first training samples are distributed in the first dimension and/or the second dimension. It can be understood that the subsequent training can be performed only according to the distribution of the first training samples in the frequency domain dimension, or only based on the distribution of the first training samples in the frequency domain. The subsequent training may be performed according to the distribution of the first training sample in the frequency domain and the time domain. For example, a first training sample contains 10 RBs in the frequency domain dimension and 1 time slot in the time domain dimension, each RB has 3 first signal samples, and each time slot has 1 first signal samples, the first training samples include a total of 30 first signal samples.
  • the sizes of the first dimension and the second dimension, the time domain dimension and the frequency domain dimension may be equal or unequal.
  • the above-mentioned time-domain dimension and frequency-domain dimension can also be combined into one dimension. Specifically, the combination can be the time-domain dimension first and then the frequency-domain dimension, or the frequency-domain dimension first and then the time-domain dimension, which is not implemented in this embodiment limited.
  • the solution provided by this embodiment can be based on the above-mentioned first dimension and second dimension.
  • Increase the presentation form of complex numbers or it can be understood as adding a dimension, which is caused by the independent presentation of the imaginary part and real part data of the original reference signal or the processed reference signal
  • the first training The samples are also distributed in the third dimension.
  • the third dimension is a complex dimension; the first training samples include the real part of the first information sample and the imaginary part of the first information sample.
  • each first training sample contains 1 time unit (such as 1 time slot) in the time domain dimension, and contains 10 frequency domain resources (such as 10 RBs) in the frequency domain dimension
  • each first The information sample can be expressed as a real part and an imaginary part, and the first training sample can be a 1 ⁇ 10 ⁇ 2 matrix.
  • the training samples also include a second training sample corresponding to the first training sample; the second training sample is composed of a matrix of T dimensions; T is an integer greater than or equal to 2.
  • the second training samples may be used to characterize the expected channel quality based on the first training samples, or channel response, or channel state, or channel estimation results, or channel information .
  • the T dimensions include a fourth dimension and a fifth dimension.
  • the matrix of the T dimensions may specifically be a two-dimensional matrix of M ⁇ N; wherein, M represents the number of first granularities in the fourth dimension, and N represents the number of second granularities in the fifth dimension; M and N are all positive integers.
  • a second training sample consists of a two-dimensional matrix with a size of M ⁇ N, which has M first granularities in the fourth dimension and N second granularities in the fifth dimension; the above M and N May or may not be equal.
  • the specific numerical indication in the two-dimensional matrix represents the received signal strength at a certain first granularity of the channel quality.
  • the specific numerical value in the two-dimensional matrix here may refer to the signal strength value, and its unit may be dBm, or There is no unit but the value obtained after normalization.
  • the two-dimensional matrix of M ⁇ N can also be synthesized into one-dimensional data of size 1 ⁇ (M ⁇ N) or (M ⁇ N) ⁇ 1.
  • the specific transformation can be the fourth dimension first and then the fifth dimension. It may also be the fifth dimension first and then the fourth dimension, which is not limited in this embodiment.
  • the fourth dimension is a frequency domain dimension; the first granularity includes one of the following: L1 resource blocks RB, L2 subcarriers; L1 and L2 are positive integers.
  • the fourth dimension is a time domain dimension; the first granularity includes one of the following: K1 microseconds, K2 symbol length, and K3 symbol sampling points; K1, K2, and K3 are positive integers.
  • the symbols are Orthogonal Frequency Division Multiplexing (OFDM, Orthogonal Frequency Division Multiplexing).
  • OFDM Orthogonal Frequency Division Multiplexing
  • the second training sample may be a channel information sample corresponding to the reference signal sample, or may also be called a channel state sample Wait, I'm not going to exhaust the names here.
  • the first granularity can be L1 RBs (L1 is greater than or equal to 1, such as 2RB, 4RB, 8RB), and the distribution range of a second training sample in the frequency domain dimension is M ⁇ L1
  • the frequency domain range corresponding to each RB; or the first granularity can be L2 subcarriers (L2 is greater than 1, such as 4 subcarriers, 6 subcarriers, and 18 subcarriers), then the distribution of a second training sample on the frequency domain dimension is the frequency domain range corresponding to M ⁇ L2 subcarriers.
  • the first granularity may be a delay granularity, for example, a first granularity is the number of sampling points of K1 microseconds, or K2 symbol lengths, or K3 symbols, where the symbols It can be an OFDM symbol;
  • the fourth dimension is the time domain dimension and the first granularity is K1 microseconds
  • the distribution range of a second training sample in the time domain dimension is the time domain range corresponding to M ⁇ K1 microseconds .
  • the fifth dimension is a space domain dimension; correspondingly, the second granularity is an interval between a pair of transmitting and receiving antennas or an angle of arrival.
  • the fifth dimension is the space domain dimension, specifically, the antenna dimension, for example, the fifth dimension is composed of N antenna pairs, and correspondingly, the second granularity is a pair of transmitting and receiving antennas.
  • the fifth dimension is a space domain dimension, specifically an angle domain dimension, for example, the fifth dimension is composed of N arrival angles, and the second granularity is the interval between the above N arrival angles.
  • the value at the ijth position in the two-dimensional matrix is used to represent the channel quality at the i-th first granularity in the fourth dimension and the j-th second granularity in the fifth dimension; Both i and j are positive integers. That is to say, in the case of using a first training sample, the value (or referred to as an indicator value) at a certain position in the two-dimensional matrix used to represent the second training sample represents the The expected channel quality situation under such a combination of five dimensions.
  • the channel quality or the channel quality situation can be characterized by signal strength, and the unit of the value (or indicator value) can be dBm, or there is no unit but a value obtained after normalization.
  • the fourth dimension represents the frequency domain dimension
  • the fifth dimension is the space domain dimension, specifically the antenna dimension
  • the first granularity is 2RB
  • the second granularity is 1
  • the value (or indicator value) at this position can be used to represent the channel quality (or channel quality situation) on the third 2RB bandwidth (that is, the fifth RB to the sixth RB) on the sixth pair of transceiver antennas ).
  • S may also be used to represent the number of second training samples, and S may be an integer greater than or equal to 1, that is, the second training samples may include one or more.
  • the T dimensions also include a sixth dimension.
  • the matrix of T dimensions is a three-dimensional matrix of M ⁇ N ⁇ W; wherein, M represents the number of first granularities in the fourth dimension, and N represents the number of second granularities in the fifth dimension, W represents the quantity of the third granularity under the sixth dimension; M, N and W are all positive integers.
  • the value of the ijkth position in the three-dimensional matrix is used to represent the i-th first granularity in the fourth dimension, the j-th second granularity in the fifth dimension, and the k-th in the sixth dimension channel quality corresponding to the first training sample at a third granularity; i, j and k are all positive integers.
  • the sixth dimension may be a complex dimension.
  • the second training samples can be used to characterize the expected channel quality based on the first training samples (or called channel response, or called channel state, or called channel estimation result, or called channel information), and the above-mentioned channel quality can also be presented by a complex number, so a sixth dimension, that is, a complex number dimension, can be added on the basis of the above two dimensions of the second training sample, and the complex number dimension is the second training sample.
  • the imaginary and real parts of the channel quality in the samples are presented independently generated.
  • the sixth dimension is a complex dimension; the third granularity is 1, and the quantity W of the third granularity under the complex dimension is 2.
  • the third granularity being 1 specifically refers to a real part or an imaginary part, and the number of the third granularity being 2 means that there may be two third granularities in the complex dimension.
  • the value of the ijkth position in the three-dimensional matrix is used to represent the i-th first granularity in the fourth dimension and the j-th granularity in the fifth dimension The real part of the channel quality at the second granularity;
  • the value of the ijkth position in the three-dimensional matrix is used to represent the i-th first granularity in the fourth dimension and the j-th granularity in the fifth dimension The imaginary part of the channel quality at the second granularity.
  • the first value is different from the second value, for example, the first value can be set to 1 and the second value can be 2, or the first value can be 0 and the second value can be 1, or the first value It can be 1 and the second value can be 0, as long as the first value is different from the second value, it is within the protection scope of this embodiment.
  • the above-mentioned second training samples can also be split and combined on the basis of the above-mentioned fourth dimension, fifth dimension, and sixth dimension.
  • the fifth dimension is an antenna pair dimension
  • the It can be split into sending antenna sub-dimensions and receiving antenna sub-dimensions, thereby expanding the dimension of the second training sample.
  • This embodiment does not exhaustively enumerate various possible sub-dimensions after splitting.
  • the basic structure of the neural network includes: an input layer, a hidden layer and an output layer.
  • the input layer is responsible for receiving data
  • the hidden layer processes the data
  • the final result is generated in the output layer.
  • each node represents a processing unit, which can be regarded as simulating a neuron.
  • Multiple neurons form a layer of neural network, and multiple layers of information transmission and processing construct an overall neural network.
  • convolutional neural networks are further studied.
  • a convolutional neural network its basic structure includes: an input layer, multiple convolutional layers, multiple pooling layers, a fully connected layer, and an output layer.
  • the input and output between the estimation sub-model, the compression sub-model and the second model included in the first model obtained after the joint training in this embodiment are described. The input shown in FIG.
  • the information of the estimated sub-model of the first model may be a reference signal, specifically, a reference signal sequence with a length of 144.
  • the estimation sub-model inputs the received input information into its own fully connected layer for processing to obtain its output result.
  • the output dimension of a fully connected layer in the estimation sub-model shown in FIG. 15 is 1024 in size
  • the output dimension of the last fully connected layer is 8192.
  • the output result of the estimation sub-model can be input into the compression sub-model to obtain the output of the compression sub-model.
  • another part of the fully connected layer is processed to finally obtain an output result with an output dimension of 256.
  • the compressed sub-model inputs the output result obtained by it into the second model to obtain the final result restored by the second model.
  • the second model in the second model, it can be processed through a part of the fully connected layer to obtain the output dimension For the result of 1024 size, another part of the fully connected layer is processed to finally get the result with the output dimension of 2048. After the last part of the fully connected layer is processed, the final result with the output dimension of 8192 can be obtained.
  • the final second model can output channel information with a size of 8192, or it can be transformed into a [128,32,2]-dimensional channel information matrix.
  • the first is AI-based channel estimation.
  • the reference signal received by the terminal equipment is used as input, and the reference signal is processed by the AI-based channel estimation module (or AI-based channel estimation model) in the terminal equipment to obtain the channel estimation result .
  • the channel estimation result obtained in the processing shown in FIG. 16 takes the wireless channel to be recovered as the expected output and achieves the best estimation of the wireless channel as the target.
  • the second is AI-based channel state information feedback.
  • the channel state information processed by the encoding end is input into the encoding end neural network to obtain an output feedback vector, which may be compressed channel state information; the feedback vector is sent to the decoding end, and the encoding end Inputting the feedback vector into the neural network at the encoding end to obtain output channel state information.
  • the channel information to be fed back is taken as the output, and the above-mentioned channel information is compressed to the greatest extent at the sending end and the above-mentioned channel information is restored to the greatest extent at the receiving end as the goal, and a corresponding neural network-based solution is constructed; wherein, the to-be-feedback
  • the channel information may be complete channel information, or partial channel information, or processed channel information; the processed channel information may be a channel feature vector.
  • the best performance of a single module for the communication system does not necessarily mean the best performance of the overall communication system solution.
  • the design goal is to use information such as reference signals obtained to maximize the effective estimation of the channel and minimize the error.
  • the design goal is to achieve the best channel information feedback effect with the minimum feedback overhead.
  • the processing, transmission and analysis of the second information are realized by using the first model and the second model obtained through joint training, it can take into account the performance requirements in the entire information processing, transmission and analysis. , to ensure the overall performance of the network. Furthermore, since the above solution uses the first model and the second model obtained through joint training, the functions between the first model and the second model can be made compatible with each other, so that the performance of the first model and the second model can reach In a better state, when the processing, transmission and analysis process of the second information is processed as a whole based on the first model and the second model, the performance of the whole processing can be guaranteed, thereby ensuring the performance of the whole network.
  • the solution provided in this embodiment can avoid problems such as artificially dividing sub-modules and performing independent task training, such as training objectives and redundant information utilization, through the joint training model, and can integrate the entire channel estimation and channel information feedback.
  • the design process is always aimed at allowing the opposite end to recover channel information with the minimum cost, thereby avoiding unnecessary channel recovery appeals and waste of design and calculation overhead, and ensuring that the overall coordination of multiple models obtained in the end can achieve optimal processing As a result, the overall performance of the system is guaranteed.
  • Fig. 18 is a schematic block diagram of a terminal device 1800 according to an embodiment of the present application.
  • Can include:
  • the first communication unit 1801 is configured to receive first information; send second information obtained based on the first information;
  • the second information is obtained by processing the first information through the first model, and the second information is used for processing through the second model to obtain channel information; the first model and the second model are a joint obtained by training.
  • the second information is channel compression information
  • the first model is used to process the input first information to obtain channel compression information.
  • the first model includes: an estimation sub-model and a compression sub-model
  • the estimation sub-model is used to perform channel estimation based on the first information to obtain channel estimation information
  • the compression sub-model is used to compress the channel estimation information to obtain channel compression information.
  • the terminal device also includes:
  • the first processing unit 1802 is configured to input the first information into the estimation sub-model to obtain channel estimation information output by the estimation sub-model; input the channel estimation information to the compression sub-model to obtain the compressed Channel compression information for the submodel output.
  • the first information is a reference signal.
  • the second information is channel compression information;
  • the channel compression information includes eigenvector information of compressed channel estimation information;
  • the first model is used to process the input first information to obtain eigenvector information of compressed channel estimation information.
  • the first model includes: an estimation submodel, a channel generation submodel and a compression submodel;
  • the estimation sub-model is used to perform channel estimation on the first information to obtain channel estimation information
  • the channel generation sub-model is used to perform eigendecomposition on the channel estimation information to obtain eigenvector information of the channel estimation information;
  • the compression sub-model is used to compress the eigenvector information of the channel estimation information to obtain compressed eigenvector information of the channel estimation information.
  • the eigenvector information of the channel information includes R groups of eigenvector sequence information; R is a positive integer.
  • the first processing unit 1802 is configured to input the first information into the estimation sub-model to obtain channel estimation information output by the estimation sub-model; input the channel estimation information to the channel generation sub-model to obtain The eigenvector information of the channel estimation information output by the channel generation sub-model; input the eigenvector information of the channel estimation information into the compression sub-model, and obtain the eigenvector information of the compressed channel estimation information output by the compression sub-model .
  • the first information is a reference signal; the channel information is feature vector information of the channel information.
  • the first communication unit 1801 is configured to receive the first model.
  • the first model is carried by at least one of the following: downlink control signaling, media access control MAC control element CE message, radio resource control RRC message, broadcast message, downlink data transmission, downlink data for artificial intelligence service transmission requirements transmission.
  • the first communication unit 1801 is configured for the terminal device to receive an estimated sub-model and a compressed sub-model
  • the first processing unit 1802 is configured to generate the first model based on the estimated sub-model and the compressed sub-model.
  • the first communication unit 1801 is configured to receive an estimation sub-model, a compression sub-model and a channel generation sub-model;
  • the first processing unit 1802 is configured to generate the first model based on the estimation sub-model, the compression sub-model and the channel generation sub-model.
  • the estimation sub-model is carried by one of the following: downlink control signaling, MAC CE message, RRC message, broadcast message, downlink data transmission, downlink data transmission for artificial intelligence business transmission requirements;
  • the compressed sub-model is carried by one of the following: downlink control signaling, MAC CE message, RRC message, broadcast message, downlink data transmission, downlink data transmission for artificial intelligence business transmission requirements;
  • the channel generation sub-model is carried by one of the following: downlink control signaling, MAC CE message, RRC message, broadcast message, downlink data transmission, downlink data transmission for artificial intelligence business transmission requirements.
  • the first communication unit 1801 is configured to receive the second model.
  • the first communication unit 1801 is configured to receive the third model.
  • the third model is used to perform data conversion processing on the second information output by the first model and then input it into the second model;
  • the first model, the second model and the third model are obtained through joint training.
  • the data transformation processing includes: convolution processing or Fourier transform processing.
  • the first processing unit is configured to use training samples to jointly train the first preset model and the second preset model to obtain the trained first model and the second model;
  • the first model is the first preset model after training
  • the second model is the second preset model after training
  • the loss function used in the training is the first loss function

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Mobile Radio Communication Systems (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

The present application relates to an information processing method, a model generation method, a terminal device, a network device, an electronic device, a chip, a computer readable storage medium, a computer program product, and a computer program. The method comprises: a terminal device receives first information; and the terminal device sends second information obtained on the basis of the first information, wherein the second information is obtained by processing the first information via a first model, the second information is used for being processed via a second model to obtain channel information, and the first model and the second model are obtained by means of joint training.

Description

信息处理方法、模型生成方法及设备Information processing method, model generation method and device 技术领域technical field
本申请涉及通信领域,更具体地,涉及一种信息处理方法、模型生成方法、终端设备、网络设备、电子设备、芯片、计算机可读存储介质、计算机程序产品以及计算机程序。The present application relates to the communication field, and more specifically, relates to an information processing method, a model generation method, a terminal device, a network device, an electronic device, a chip, a computer-readable storage medium, a computer program product, and a computer program.
背景技术Background technique
对于信道信息的获得与反馈方式,无线通信系统中主要依赖于基本模型以及预先配置的反馈参数集合做信道信息的确定与反馈,这种处理中反馈的信道信息与真实的信道信息间的误差相对较大,因此,在一些研究中提出基于人工智能(AI,Artificial Intelligence)的无线通信解决方案以弥补上述不足。然而,在基于AI的无线通信解决方案中,如何保证网络的整体性能就成为需要解决的问题。For channel information acquisition and feedback methods, wireless communication systems mainly rely on basic models and pre-configured feedback parameter sets for channel information determination and feedback. In this process, the error between the feedback channel information and the real channel information is relatively large. Therefore, a wireless communication solution based on artificial intelligence (AI, Artificial Intelligence) is proposed in some studies to make up for the above-mentioned deficiency. However, in AI-based wireless communication solutions, how to ensure the overall performance of the network becomes a problem that needs to be solved.
发明内容Contents of the invention
本申请实施例提供一种信息处理方法、模型生成方法、终端设备、网络设备、电子设备、芯片、计算机可读存储介质、计算机程序产品以及计算机程序,可以至少解决上述问题。Embodiments of the present application provide an information processing method, a model generation method, a terminal device, a network device, an electronic device, a chip, a computer-readable storage medium, a computer program product, and a computer program, which can at least solve the above problems.
本申请实施例提供一种一种信息处理方法,包括:An embodiment of the present application provides an information processing method, including:
终端设备接收第一信息;The terminal device receives the first information;
所述终端设备发送基于第一信息得到的第二信息;The terminal device sends second information obtained based on the first information;
其中,所述第二信息为所述第一信息经由第一模型处理得到的,所述第二信息用于经由第二模型进行处理以得到信道信息;所述第一模型和第二模型为联合训练得到的。Wherein, the second information is obtained by processing the first information through the first model, and the second information is used for processing through the second model to obtain channel information; the first model and the second model are a joint obtained by training.
本申请实施例提供一种信息处理方法,包括:An embodiment of the present application provides an information processing method, including:
网络设备发送第一信息;The network device sends the first information;
所述网络设备接收第二信息;其中,所述第二信息为所述第一信息经由第一模型处理得到的;The network device receives second information; wherein, the second information is obtained by processing the first information through a first model;
所述网络设备基于第二模型对所述第二信息进行处理得到信道信息;其中,所述第一模型和第二模型为联合训练得到的。The network device processes the second information based on a second model to obtain channel information; wherein, the first model and the second model are obtained through joint training.
本申请实施例提供一种模型生成方法,包括:The embodiment of the present application provides a method for generating a model, including:
采用训练样本对第一预设模型和第二预设模型进行联合训练,得到训练后的第一模型和第二模型;performing joint training on the first preset model and the second preset model by using the training samples to obtain the trained first model and the second model;
其中,所述第一模型为训练后的所述第一预设模型,所述第二模型为训练后的所述第二预设模型;所述第一模型用于对第一信息进行处理得到第二信息;所述第二模型用于对所述第二信息进行处理得到信道信息。Wherein, the first model is the first preset model after training, and the second model is the second preset model after training; the first model is used to process the first information to obtain Second information; the second model is used to process the second information to obtain channel information.
本申请实施例提供一种终端设备,包括:An embodiment of the present application provides a terminal device, including:
第一通信单元,用于接收第一信息;发送基于第一信息得到的第二信息;a first communication unit, configured to receive first information; send second information obtained based on the first information;
其中,所述第二信息为所述第一信息经由第一模型处理得到的,所述第二信息用于经由第二模型进行处理以得到信道信息;所述第一模型和第二模型为联合训练得到的。Wherein, the second information is obtained by processing the first information through the first model, and the second information is used for processing through the second model to obtain channel information; the first model and the second model are a joint obtained by training.
本申请实施例提供一种网络设备,包括:An embodiment of the present application provides a network device, including:
第二通信单元,用于发送第一信息;接收第二信息;其中,所述第二信息为所述第一信息经由第一模型处理得到的;The second communication unit is configured to send the first information; receive the second information; wherein, the second information is obtained by processing the first information through the first model;
第二处理单元,用于基于第二模型对所述第二信息进行处理得到信道信息;其中,所述第一模型和第二模型为联合训练得到的。The second processing unit is configured to process the second information based on a second model to obtain channel information; wherein, the first model and the second model are obtained through joint training.
本申请实施例提供一种电子设备,包括:An embodiment of the present application provides an electronic device, including:
第三处理单元,用于采用训练样本对第一预设模型和第二预设模型进行联合训练,得到训练后的第一模型和第二模型;The third processing unit is configured to use training samples to jointly train the first preset model and the second preset model to obtain the trained first model and the second model;
其中,所述第一模型为训练后的所述第一预设模型,所述第二模型为训练后的所述第二预设模型;所述第一模型用于对第一信息进行处理得到第二信息;所述第二模型用于对所述第二信息进行处理得到信道信息。Wherein, the first model is the first preset model after training, and the second model is the second preset model after training; the first model is used to process the first information to obtain Second information; the second model is used to process the second information to obtain channel information.
本申请实施例提供一种终端设备,包括处理器和存储器。该存储器用于存储计算机程序,该处理器用于调用并运行该存储器中存储的计算机程序,以使该终端设备执行上述的信息处理方法。An embodiment of the present application provides a terminal device, including a processor and a memory. The memory is used to store computer programs, and the processor is used to call and run the computer programs stored in the memory, so that the terminal device executes the above information processing method.
本申请实施例提供一种网络设备,包括处理器和存储器。该存储器用于存储计算机程序,该处理器用于调用并运行该存储器中存储的计算机程序,以使该网络设备执行上述的信息处理方法。An embodiment of the present application provides a network device, including a processor and a memory. The memory is used to store a computer program, and the processor is used to call and run the computer program stored in the memory, so that the network device executes the above-mentioned information processing method.
本申请实施例提供一种电子设备,包括处理器和存储器。该存储器用于存储计算机程序,该处理器用于调用并运行该存储器中存储的计算机程序,以使该网络设备执行上述的模型生成方法。An embodiment of the present application provides an electronic device, including a processor and a memory. The memory is used to store a computer program, and the processor is used to call and run the computer program stored in the memory, so that the network device executes the above-mentioned model generation method.
本申请实施例提供一种芯片,用于实现上述的信息处理方法或模型生成方法。An embodiment of the present application provides a chip configured to implement the above information processing method or model generation method.
具体地,该芯片包括:处理器,用于从存储器中调用并运行计算机程序,使得安装有该芯片的设备执行上述的信息处理方法或模型生成方法。Specifically, the chip includes: a processor, configured to call and run a computer program from the memory, so that the device installed with the chip executes the above-mentioned information processing method or model generation method.
本申请实施例提供一种计算机可读存储介质,用于存储计算机程序,当该计算机程序被设备运行时使得该设备执行上述的信息处理方法或模型生成方法。An embodiment of the present application provides a computer-readable storage medium for storing a computer program, and when the computer program is run by a device, the device is made to execute the above-mentioned information processing method or model generation method.
本申请实施例提供一种计算机程序产品,包括计算机程序指令,该计算机程序指令使得计算机执行上述的信息处理方法或模型生成方法。An embodiment of the present application provides a computer program product, including computer program instructions, which enable a computer to execute the above-mentioned information processing method or model generation method.
本申请实施例提供一种计算机程序,当其在计算机上运行时,使得计算机执行上述的信息处理方法或模型生成方法。An embodiment of the present application provides a computer program that, when run on a computer, causes the computer to execute the above-mentioned information processing method or model generation method.
本申请实施例,可以在终端设备接收到第一信息的情况下,经由第一模型对第一信息进行处理得到第二信息并发送,使得接收端能够通过使用第二模型对该第二信息进行处理以得到的信道信息,由于该第一模型与该第二模型为联合训练得到的。由于第二信息的处理、传输及解析过程是采用联合训练得到的第一模型和第二模型来实现的,因此可以兼顾整 个信息处理、传输及解析中的性能要求,保证了网络整体的性能。In this embodiment of the present application, when the terminal device receives the first information, it can process the first information through the first model to obtain the second information and send it, so that the receiving end can use the second model to process the second information The channel information obtained through processing is obtained through joint training of the first model and the second model. Since the processing, transmission, and analysis of the second information are realized by using the first model and the second model obtained through joint training, the performance requirements of the entire information processing, transmission, and analysis can be taken into account, ensuring the overall performance of the network.
附图说明Description of drawings
图1是根据本申请实施例的应用场景的示意图。Fig. 1 is a schematic diagram of an application scenario according to an embodiment of the present application.
图2是根据本申请实施例的无线通信系统中收发信息的方法的示意性流程图。Fig. 2 is a schematic flowchart of a method for sending and receiving information in a wireless communication system according to an embodiment of the present application.
图3是根据本申请实施例的导频信号的传输与接收场景示意性图。Fig. 3 is a schematic diagram of transmission and reception scenarios of pilot signals according to an embodiment of the present application.
图4是根据本申请实施例的信道信息反馈方法的示意性流程图。Fig. 4 is a schematic flowchart of a channel information feedback method according to an embodiment of the present application.
图5是根据本申请实施例的神经网络的基本结构示意图;5 is a schematic diagram of the basic structure of a neural network according to an embodiment of the present application;
图6是根据本申请实施例的信息处理方法的示意性流程图一;FIG. 6 is a schematic flowchart 1 of an information processing method according to an embodiment of the present application;
图7是根据本申请实施例的信道信息的特征向量信息的示意图;FIG. 7 is a schematic diagram of eigenvector information of channel information according to an embodiment of the present application;
图8a~图8d是根据本申请实施例的模型组成结构示意图;Figures 8a to 8d are schematic diagrams of the composition and structure of the model according to the embodiment of the present application;
图9是根据本申请实施例的第二训练样本的一种示意图;FIG. 9 is a schematic diagram of a second training sample according to an embodiment of the present application;
图10是根据本申请实施例的第二训练样本的另一种示意图;FIG. 10 is another schematic diagram of a second training sample according to an embodiment of the present application;
图11是根据本申请实施例的信息处理方法的示意性流程图二;FIG. 11 is a schematic flowchart II of an information processing method according to an embodiment of the present application;
图12是根据本申请实施例的信息处理方法的示意性流程图三;FIG. 12 is a schematic flowchart three of an information processing method according to an embodiment of the present application;
图13是根据本申请实施例的信息处理方法的示意性流程图四;FIG. 13 is a schematic flowchart 4 of an information processing method according to an embodiment of the present application;
图14是根据本申请实施例的模型生成方法的示意性流程图;Fig. 14 is a schematic flowchart of a model generation method according to an embodiment of the present application;
图15是根据本申请实施例的第一模型和第二模型的示意图;15 is a schematic diagram of a first model and a second model according to an embodiment of the present application;
图16是根据本申请实施例的对参考信号进行信道估计的示意性流程;FIG. 16 is a schematic flow of channel estimation for a reference signal according to an embodiment of the present application;
图17是根据本申请实施例的信道状态信息反馈的示意性流程图;FIG. 17 is a schematic flowchart of channel state information feedback according to an embodiment of the present application;
图18是根据本申请一实施例的终端设备的示意性框图一;FIG. 18 is a first schematic block diagram of a terminal device according to an embodiment of the present application;
图19是根据本申请一实施例的终端设备的示意性框图二;FIG. 19 is a second schematic block diagram of a terminal device according to an embodiment of the present application;
图20是根据本申请一实施例的网络设备的示意性框图;FIG. 20 is a schematic block diagram of a network device according to an embodiment of the present application;
图21是根据本申请一实施例的电子设备的示意性框图;Fig. 21 is a schematic block diagram of an electronic device according to an embodiment of the present application;
图22是根据本申请实施例的通信设备示意性框图;Fig. 22 is a schematic block diagram of a communication device according to an embodiment of the present application;
图23是根据本申请实施例的芯片的示意性框图;Fig. 23 is a schematic block diagram of a chip according to an embodiment of the present application;
图24是根据本申请实施例的通信系统的示意性框图。Fig. 24 is a schematic block diagram of a communication system according to an embodiment of the present application.
具体实施方式Detailed ways
下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行描述。The technical solutions in the embodiments of the present application will be described below with reference to the drawings in the embodiments of the present application.
本申请实施例的技术方案可以应用于各种通信系统,例如:全球移动通讯(Global System of Mobile communication,GSM)系统、码分多址(Code Division Multiple Access,CDMA)系统、宽带码分多址(Wideband Code Division Multiple Access,WCDMA)系统、通用分组无线业务(General Packet Radio Service,GPRS)、长期演进(Long Term Evolution,LTE)系统、先进的长期演进(Advanced long term evolution,LTE-A)系统、新无线(New Radio,NR)系统、NR系统的演进系统、非授权频谱上的LTE(LTE-based access to unlicensed spectrum,LTE-U)系统、非授权频谱上的NR(NR-based access to unlicensed spectrum,NR-U)系统、非地面通信网络(Non-Terrestrial Networks,NTN)系统、通用移动通信系统(Universal Mobile Telecommunication System,UMTS)、无线局域网(Wireless Local Area Networks,WLAN)、无线保真(Wireless Fidelity,WiFi)、第五代通信(5th-Generation,5G)系统或其他通信系统等。The technical solution of the embodiment of the present application can be applied to various communication systems, such as: Global System of Mobile communication (Global System of Mobile communication, GSM) system, code division multiple access (Code Division Multiple Access, CDMA) system, broadband code division multiple access (Wideband Code Division Multiple Access, WCDMA) system, General Packet Radio Service (GPRS), Long Term Evolution (LTE) system, Advanced long term evolution (LTE-A) system , New Radio (NR) system, evolution system of NR system, LTE (LTE-based access to unlicensed spectrum, LTE-U) system on unlicensed spectrum, NR (NR-based access to unlicensed spectrum) on unlicensed spectrum unlicensed spectrum (NR-U) system, Non-Terrestrial Networks (NTN) system, Universal Mobile Telecommunications System (UMTS), Wireless Local Area Networks (WLAN), Wireless Fidelity (Wireless Fidelity, WiFi), fifth-generation communication (5th-Generation, 5G) system or other communication systems, etc.
通常来说,传统的通信系统支持的连接数有限,也易于实现,然而,随着通信技术的发展,移动通信系统将不仅支持传统的通信,还将支持例如,设备到设备(Device to Device,D2D)通信,机器到机器(Machine to Machine,M2M)通信,机器类型通信(Machine Type Communication,MTC),车辆间(Vehicle to Vehicle,V2V)通信,或车联网(Vehicle to everything,V2X)通信等,本申请实施例也可以应用于这些通信系统。Generally speaking, the number of connections supported by traditional communication systems is limited and easy to implement. However, with the development of communication technology, mobile communication systems will not only support traditional communication, but also support, for example, Device to Device (Device to Device, D2D) communication, Machine to Machine (M2M) communication, Machine Type Communication (MTC), Vehicle to Vehicle (V2V) communication, or Vehicle to everything (V2X) communication, etc. , the embodiments of the present application may also be applied to these communication systems.
可选地,本申请实施例中的通信系统可以应用于载波聚合(Carrier Aggregation,CA)场景,也可以应用于双连接(Dual Connectivity,DC)场景,还可以应用于独立(Standalone,SA)布网场景。Optionally, the communication system in the embodiment of the present application may be applied to a carrier aggregation (Carrier Aggregation, CA) scenario, may also be applied to a dual connectivity (Dual Connectivity, DC) scenario, and may also be applied to an independent (Standalone, SA) deployment Web scene.
可选地,本申请实施例中的通信系统可以应用于非授权频谱,其中,非授权频谱也可以认为是共享频谱;或者,本申请实施例中的通信系统也可以应用于授权频谱,其中,授权频谱也可以认为是非共享频谱。Optionally, the communication system in the embodiment of the present application may be applied to an unlicensed spectrum, where the unlicensed spectrum may also be considered as a shared spectrum; or, the communication system in the embodiment of the present application may also be applied to a licensed spectrum, where, Licensed spectrum can also be considered as non-shared spectrum.
本申请实施例结合网络设备和终端设备描述了各个实施例,其中,终端设备也可以称为用户设备(User Equipment,UE)、接入终端、用户单元、用户站、移动站、移动台、远方站、远程终端、移动设备、用户终端、终端、无线通信设备、用户代理或用户装置等。The embodiments of the present application describe various embodiments in conjunction with network equipment and terminal equipment, wherein the terminal equipment may also be referred to as user equipment (User Equipment, UE), access terminal, user unit, user station, mobile station, mobile station, remote station, remote terminal, mobile device, user terminal, terminal, wireless communication device, user agent or user device, etc.
终端设备可以是WLAN中的站点(STAION,ST),可以是蜂窝电话、无绳电话、会话启动协议(Session Initiation Protocol,SIP)电话、无线本地环路(Wireless Local Loop,WLL)站、个人数字处理(Personal Digital Assistant,PDA)设备、具有无线通信功能的手持设备、计算设备或连接到无线调制解调器的其它处理设备、车载设备、可穿戴设备、下一代通信系统例如NR网络中的终端设备,或者未来演进的公共陆地移动网络(Public Land Mobile Network,PLMN)网络中的终端设备等。The terminal device can be a station (STAION, ST) in the WLAN, a cellular phone, a cordless phone, a Session Initiation Protocol (Session Initiation Protocol, SIP) phone, a wireless local loop (Wireless Local Loop, WLL) station, a personal digital processing (Personal Digital Assistant, PDA) devices, handheld devices with wireless communication functions, computing devices or other processing devices connected to wireless modems, vehicle-mounted devices, wearable devices, next-generation communication systems such as terminal devices in NR networks, or future Terminal equipment in the evolved public land mobile network (Public Land Mobile Network, PLMN) network, etc.
在本申请实施例中,终端设备可以部署在陆地上,包括室内或室外、手持、穿戴或车载;也可以部署在水面上(如轮船等);还可以部署在空中(例如飞机、气球和卫星上等)。In the embodiment of this application, the terminal device can be deployed on land, including indoor or outdoor, handheld, wearable or vehicle-mounted; it can also be deployed on water (such as ships, etc.); it can also be deployed in the air (such as aircraft, balloons and satellites) superior).
在本申请实施例中,终端设备可以是手机(Mobile Phone)、平板电脑(Pad)、带无线收发功能的电脑、虚拟现实(Virtual Reality,VR)终端设备、增强现实(Augmented Reality,AR)终端设备、工业控制(industrial control)中的无线终端设备、无人驾驶(self driving)中的无线终端设备、远程医疗(remote medical)中的无线终端设备、智能电网(smart grid)中的无线终端设备、运输安全(transportation safety)中的无线终端设备、智慧城市(smart city)中的无线终端设备或智慧家庭(smart home)中的无线终端设备等。In this embodiment of the application, the terminal device may be a mobile phone (Mobile Phone), a tablet computer (Pad), a computer with a wireless transceiver function, a virtual reality (Virtual Reality, VR) terminal device, an augmented reality (Augmented Reality, AR) terminal Equipment, wireless terminal equipment in industrial control, wireless terminal equipment in self driving, wireless terminal equipment in remote medical, wireless terminal equipment in smart grid , wireless terminal equipment in transportation safety, wireless terminal equipment in smart city, or wireless terminal equipment in smart home.
作为示例而非限定,在本申请实施例中,该终端设备还可以是可穿戴设备。可穿戴设备也可以称为穿戴式智能设备,是应用穿戴式技术对日常穿戴进行智能化设计、开发出可以穿戴的设备的总称,如眼镜、手套、手表、服饰及鞋等。可穿戴设备即直接穿在身上,或是整合到用户的衣服或配件的一种便携式设备。可穿戴设备不仅仅是一种硬件设备,更是通过软件支持以及数据交互、云端交互来实现强大的功能。广义穿戴式智能设备包括功能全、尺寸大、可不依赖智能手机实现完整或者部分的功能,例如:智能手表或智能眼镜等,以及只专注于某一类应用功能,需要和其它设备如智能手机配合使用,如各类进行体征监测的智能手环、智能首饰等。As an example but not a limitation, in this embodiment of the present application, the terminal device may also be a wearable device. Wearable devices can also be called wearable smart devices, which is a general term for the application of wearable technology to intelligently design daily wear and develop wearable devices, such as glasses, gloves, watches, clothing and shoes. A wearable device is a portable device that is worn directly on the body or integrated into the user's clothing or accessories. Wearable devices are not only a hardware device, but also achieve powerful functions through software support, data interaction, and cloud interaction. Generalized wearable smart devices include full-featured, large-sized, complete or partial functions without relying on smart phones, such as smart watches or smart glasses, etc., and only focus on a certain type of application functions, and need to cooperate with other devices such as smart phones Use, such as various smart bracelets and smart jewelry for physical sign monitoring.
在本申请实施例中,网络设备可以是用于与移动设备通信的设备,网络设备可以是WLAN中的接入点(Access Point,AP),GSM或CDMA中的基站(Base Transceiver Station,BTS),也可以是WCDMA中的基站(NodeB,NB),还可以是LTE中的演进型基站(Evolutional Node B,eNB或eNodeB),或者中继站或接入点,或者车载设备、可穿戴设备以及NR网络中的网络设备(gNB)或者未来演进的PLMN网络中的网络设备或者NTN网络中的网络设备等。In the embodiment of the present application, the network device may be a device for communicating with the mobile device, and the network device may be an access point (Access Point, AP) in WLAN, a base station (Base Transceiver Station, BTS) in GSM or CDMA , or a base station (NodeB, NB) in WCDMA, or an evolved base station (Evolutional Node B, eNB or eNodeB) in LTE, or a relay station or access point, or a vehicle-mounted device, a wearable device, and an NR network The network equipment (gNB) in the network or the network equipment in the future evolved PLMN network or the network equipment in the NTN network, etc.
作为示例而非限定,在本申请实施例中,网络设备可以具有移动特性,例如网络设备可以为移动的设备。可选地,网络设备可以为卫星、气球站。例如,卫星可以为低地球轨道(low earth orbit,LEO)卫星、中地球轨道(medium earth orbit,MEO)卫星、地球同步轨道(geostationary earth orbit,GEO)卫星、高椭圆轨道(High Elliptical Orbit,HEO)卫星等。可选地,网络设备还可以为设置在陆地、水域等位置的基站。As an example but not a limitation, in this embodiment of the present application, the network device may have a mobile feature, for example, the network device may be a mobile device. Optionally, the network equipment may be a satellite or a balloon station. For example, the satellite can be a low earth orbit (low earth orbit, LEO) satellite, a medium earth orbit (medium earth orbit, MEO) satellite, a geosynchronous earth orbit (geosynchronous earth orbit, GEO) satellite, a high elliptical orbit (High Elliptical Orbit, HEO) satellite. ) Satellite etc. Optionally, the network device may also be a base station installed on land, water, and other locations.
在本申请实施例中,网络设备可以为小区提供服务,终端设备通过该小区使用的传输资源(例如,频域资源,或者说,频谱资源)与网络设备进行通信,该小区可以是网络设备(例如基站)对应的小区,小区可以属于宏基站,也可以属于小小区(Small cell)对应的基站,这里的小小区可以包括:城市小区(Metro cell)、微小区(Micro cell)、微微小区(Pico cell)、毫微微小区(Femto cell)等,这些小小区具有覆盖范围小、发射功率低的特点,适用于提供高速率的数据传输服务。In this embodiment of the present application, the network device may provide services for a cell, and the terminal device communicates with the network device through the transmission resources (for example, frequency domain resources, or spectrum resources) used by the cell, and the cell may be a network device ( For example, a cell corresponding to a base station), the cell may belong to a macro base station, or may belong to a base station corresponding to a small cell (Small cell), and the small cell here may include: a metro cell (Metro cell), a micro cell (Micro cell), a pico cell ( Pico cell), Femto cell, etc. These small cells have the characteristics of small coverage and low transmission power, and are suitable for providing high-speed data transmission services.
图1示例性地示出了一种通信系统100。该通信系统包括一个网络设备110和两个终端设备120。可选地,该通信系统100可以包括多个网络设备110,并且每个网络设备110的覆盖范围内可以包括其它数量的终端设备120,本申请实施例对此不做限定。FIG. 1 exemplarily shows a communication system 100 . The communication system includes a network device 110 and two terminal devices 120 . Optionally, the communication system 100 may include multiple network devices 110, and the coverage of each network device 110 may include other numbers of terminal devices 120, which is not limited in this embodiment of the present application.
可选地,该通信系统100还可以包括移动性管理实体(Mobility Management Entity,MME)、接入与移动性管理功能(Access and Mobility Management Function,AMF)等其他网络实体,本申请实施例对此不作限定。Optionally, the communication system 100 may also include other network entities such as a mobility management entity (Mobility Management Entity, MME), an access and mobility management function (Access and Mobility Management Function, AMF), etc. Not limited.
其中,网络设备又可以包括接入网设备和核心网设备。即无线通信系统还包括用于与接入网设备进行通信的多个核心网。接入网设备可以是长期演进(long-term evolution,LTE)系统、下一代(移动通信系统)(next radio,NR)系统或者授权辅助接入长期演进(authorized auxiliary access long-term evolution,LAA-LTE)系统中的演进型基站(evolutional node B,简称可以为eNB或e-NodeB)宏基站、微基站(也称为“小基站”)、微微基站、接入站点(access point,AP)、传输站点(transmission point,TP)或新一代基站(new generation Node B,gNodeB)等。Wherein, the network equipment may further include access network equipment and core network equipment. That is, the wireless communication system also includes multiple core networks for communicating with access network devices. The access network device may be a long-term evolution (long-term evolution, LTE) system, a next-generation (mobile communication system) (next radio, NR) system or an authorized auxiliary access long-term evolution (LAA- Evolved base station (evolutional node B, abbreviated as eNB or e-NodeB) macro base station, micro base station (also called "small base station"), pico base station, access point (access point, AP), Transmission point (transmission point, TP) or new generation base station (new generation Node B, gNodeB), etc.
应理解,本申请实施例中网络/系统中具有通信功能的设备可称为通信设备。以图1示出的通信系统为例,通信设备可包括具有通信功能的网络设备和终端设备,网络设备和终端设备可以为本申请实施例中的具体设备,此处不再赘述;通信设备还可包括通信系统中的其他设备,例如网络控制器、移动管理实体等其他网络实体,本申请实施例中对此不做限定。It should be understood that a device with a communication function in the network/system in the embodiment of the present application may be referred to as a communication device. Taking the communication system shown in Figure 1 as an example, the communication equipment may include network equipment and terminal equipment with communication functions. It may include other devices in the communication system, such as network controllers, mobility management entities and other network entities, which are not limited in this embodiment of the present application.
为了便于理解本申请实施例,下面对本申请实施例所涉及到的基本流程以及基本概念进行简单说明。应理解,下文所介绍的基本流程以及基本概念并不对本申请实施例产生限定。In order to facilitate the understanding of the embodiments of the present application, the following briefly describes the basic processes and basic concepts involved in the embodiments of the present application. It should be understood that the basic processes and basic concepts described below do not limit the embodiments of the present application.
无线通信系统的工作流程如图2所示可以包括:发送端对信源进行编码、调制、加密等操作形成待传输的发送信息;发送信息通过无线空间传输,此时所述发送信息会受到信道环境以及干扰噪声的影响;在接收端对接收信息进行解码、解密解调等操作最终恢复信源信息。在上述处理中,所述发送端可以为前述图1示出的通信系统中的网络设备,所述接收端可以为前述图1所示出的通信系统中的终端设备;或者,所述发送端可以为前述图1示出的通信系统中的终端设备,所述接收端可以为前述图1所示出的通信系统中的网络设备。As shown in Figure 2, the workflow of the wireless communication system may include: the sending end performs operations such as encoding, modulating, and encrypting on the information source to form the sending information to be transmitted; the sending information is transmitted through the wireless space, and at this time the sending information will be received by the channel The impact of the environment and interference noise; at the receiving end, the received information is decoded, decrypted and demodulated, and finally the source information is restored. In the above processing, the sending end may be a network device in the aforementioned communication system shown in FIG. 1, and the receiving end may be a terminal device in the aforementioned communication system shown in FIG. 1; or, the sending end It may be a terminal device in the aforementioned communication system shown in FIG. 1 , and the receiving end may be a network device in the aforementioned communication system shown in FIG. 1 .
在上述无线通信系统的工作流程中,信道环境的好坏以及是否能够准确估计出当前的信道环境,对于无线通信系统的性能来说至关重要。在做无线通信系统设计时,发送端(例如网络设备)会发送一些导频信号,例如发送一些信道状态信息参考信号(CSI-RS,Channel State Information Reference Signal)、解调参考信号(DMRS,Demodulation Reference Signal)、相位跟踪参考信号(PT-RS,Phase Tracking Reference Signal)、同步信号和PBCH块(SSB,Synchronization Signal and PBCH block)等,用于辅助接收端(例如终端设备)获取并估计出当前的信道特征。进而接收端(例如终端设备)可以基于估计、恢复出的信道特征反馈相应的信道信息给发送端(例如网络设备),最终发送端(例如网络设备)再依据所获取的信道信息做相应的编码、调制等工作。如图3所示,发送端(例如网络设备)将特定的导频信号发出,导频信号经过信道传输后在接收端(例如终端设备)被接收,接收端(例如终端设备)可以基于接收的导频信号以及实际的导频信号估计出导频信号所经过的信道情况并基于该信道情况确定信道信息。在所述接收端(例如终端设备)获得所述信道信息之后,还可以将该信道信息反馈给所述发送端(例如网络设备),进而所述发送端(例如网络设备)可以进行后续的数据调度等处理。其中,接收端获取的信道信息可以为所述导频信号所在处(比如导频信号所在的时域和/或频域资源处)的信道信息;相应的,在发送端可利用基本的插值等方法基于接收到的所述导频信号所在处的信道信息恢复出完整的宽带的各个时隙内的信道信息,再做相应的数据调度等处理。In the above-mentioned workflow of the wireless communication system, the quality of the channel environment and whether the current channel environment can be accurately estimated are crucial to the performance of the wireless communication system. When designing a wireless communication system, the sending end (such as a network device) will send some pilot signals, such as sending some channel state information reference signal (CSI-RS, Channel State Information Reference Signal), demodulation reference signal (DMRS, Demodulation Reference Signal), phase tracking reference signal (PT-RS, Phase Tracking Reference Signal), synchronization signal and PBCH block (SSB, Synchronization Signal and PBCH block), etc., used to assist the receiving end (such as terminal equipment) to obtain and estimate the current channel characteristics. Furthermore, the receiving end (such as a terminal device) can feed back corresponding channel information to the sending end (such as a network device) based on the estimated and recovered channel characteristics, and finally the sending end (such as a network device) performs corresponding coding according to the acquired channel information , Modulation, etc. As shown in Figure 3, the sending end (such as a network device) sends out a specific pilot signal, and the pilot signal is received by the receiving end (such as a terminal device) after being transmitted through a channel. The receiving end (such as a terminal device) can The pilot signal and the actual pilot signal estimate the channel condition through which the pilot signal passes and determine the channel information based on the channel condition. After the receiving end (such as a terminal device) obtains the channel information, it can also feed back the channel information to the sending end (such as a network device), and then the sending end (such as a network device) can perform subsequent data Scheduling etc. Wherein, the channel information acquired by the receiving end may be the channel information where the pilot signal is located (such as the time domain and/or frequency domain resource where the pilot signal is located); correspondingly, basic interpolation, etc. The method restores the channel information in each time slot of the complete broadband based on the received channel information where the pilot signal is located, and then performs corresponding data scheduling and other processing.
信道信息反馈的处理方法可以包括:终端设备在估计出信道信息之后,在当前的通信系统中会通过信道状态信息反馈的方式将信道信息反馈至网络设备。信道信息的反馈在LTE系统、NR系统中都非常重要,其决定了MIMO传输的性能。以CSI反馈流程为例结合图4进行说明,可以包括以下步骤:S410:网络设备配置用于信道状态信息(CSI,Channel State information)指示的反馈参数信息,例如网络设备配置终端设备需要反馈信道质量指示(CQI,Channel Quality Indicator)、预编码矩阵指示(PMI,Precoding Matrix Indicator)、秩指示(RI,Rank Indication)等信息中的哪些信息;同时,网络设备会配置一些供CSI测量用的参考信号,例如SSB或者CSI-RS。S420:网络设备向终端设备发送参考信号。S430:终端设备通过对上述参考信号的测量生成CSI。S440:所述终端设备向网络设备反馈CSI。然后执行S450:所述网络设备基于CSI配置数据传输方式,也就是说,网络设备可以基于CSI配置出合理高效的数据传输方式。CSI中 可以包括对CQI、PMI、RI等信息的指示。The processing method for channel information feedback may include: after the terminal device estimates the channel information, it feeds back the channel information to the network device in a channel state information feedback manner in the current communication system. The feedback of channel information is very important in LTE system and NR system, which determines the performance of MIMO transmission. Taking the CSI feedback process as an example in conjunction with FIG. 4 for illustration, it may include the following steps: S410: The network device configures the feedback parameter information indicated by channel state information (CSI, Channel State information), for example, the network device configures the terminal device to feedback channel quality Which information in the indication (CQI, Channel Quality Indicator), precoding matrix indication (PMI, Precoding Matrix Indicator), rank indication (RI, Rank Indication) and other information; at the same time, the network device will configure some reference signals for CSI measurement , such as SSB or CSI-RS. S420: The network device sends the reference signal to the terminal device. S430: The terminal device generates CSI by measuring the above reference signal. S440: The terminal device feeds back the CSI to the network device. Then perform S450: the network device configures a data transmission mode based on the CSI, that is, the network device can configure a reasonable and efficient data transmission mode based on the CSI. The CSI may include indications of information such as CQI, PMI, and RI.
上述信道信息反馈的处理方法中进一步可以引入以神经网络为代表的人工智能方面的研究和设计,例如,通过神经网络的设计实现对于无线信道的估计。如图5所示,所述神经网络的基本结构包括:输入层,隐藏层和输出层;所述输入层负责接收数据,所述隐藏层对数据的处理,最后的结果在所述输出层产生。图5中各个节点代表一个处理单元,每个处理单元可以是模拟了一个神经元,多个神经元组成一层神经网络,多层的信息传递与处理构造出一个整体的神经网络。随着神经网络研究的不断发展,近年来又提出了神经网络深度学习算法,较多的隐层被引入,通过多隐层的神经网络逐层训练进行特征学习,极大地提升了神经网络的学习和处理能力,并在模式识别、信号处理、优化组合、异常探测等方面广泛被应用。The above-mentioned processing method of channel information feedback can further introduce research and design of artificial intelligence represented by neural network, for example, the estimation of wireless channel can be realized through the design of neural network. As shown in Figure 5, the basic structure of the neural network includes: an input layer, a hidden layer and an output layer; the input layer is responsible for receiving data, the hidden layer processes the data, and the final result is produced at the output layer . Each node in Figure 5 represents a processing unit, and each processing unit can simulate a neuron, and multiple neurons form a layer of neural network, and multi-layer information transmission and processing construct an overall neural network. With the continuous development of neural network research, neural network deep learning algorithms have been proposed in recent years, more hidden layers have been introduced, and feature learning is performed through layer-by-layer training of neural networks with multiple hidden layers, which greatly improves the learning of neural networks. And processing capabilities, and are widely used in pattern recognition, signal processing, optimization combination, anomaly detection, etc.
应理解,本文中术语“系统”和“网络”在本文中常被可互换使用。本文中术语“和/或”,仅仅是一种描述关联对象的关联关系,表示可以存在三种关系,例如,A和/或B,可以表示:单独存在A,同时存在A和B,单独存在B这三种情况。另外,本文中字符“/”,一般表示前后关联对象是一种“或”的关系。It should be understood that the terms "system" and "network" are often used interchangeably herein. The term "and/or" in this article is just an association relationship describing associated objects, which means that there can be three relationships, for example, A and/or B can mean: A exists alone, A and B exist simultaneously, and there exists alone B these three situations. In addition, the character "/" in this article generally indicates that the contextual objects are an "or" relationship.
应理解,在本申请的实施例中提到的“指示”可以是直接指示,也可以是间接指示,还可以是表示具有关联关系。举例说明,A指示B,可以表示A直接指示B,例如B可以通过A获取;也可以表示A间接指示B,例如A指示C,B可以通过C获取;还可以表示A和B之间具有关联关系。It should be understood that the "indication" mentioned in the embodiments of the present application may be a direct indication, may also be an indirect indication, and may also mean that there is an association relationship. For example, A indicates B, which can mean that A directly indicates B, for example, B can be obtained through A; it can also indicate that A indirectly indicates B, for example, A indicates C, and B can be obtained through C; it can also indicate that there is an association between A and B relation.
在本申请实施例的描述中,术语“对应”可表示两者之间具有直接对应或间接对应的关系,也可以表示两者之间具有关联关系,也可以是指示与被指示、配置与被配置等关系。In the description of the embodiments of the present application, the term "corresponding" may indicate that there is a direct or indirect correspondence between the two, or that there is an association between the two, or that it indicates and is indicated, configuration and is configuration etc.
为便于理解本申请实施例的技术方案,以下对本申请实施例的相关技术进行说明,以下相关技术作为可选方案与本申请实施例的技术方案可以进行任意结合,其均属于本申请实施例的保护范围。In order to facilitate the understanding of the technical solutions of the embodiments of the present application, the related technologies of the embodiments of the present application are described below. The following related technologies can be combined with the technical solutions of the embodiments of the present application as optional solutions, and all of them belong to the embodiments of the present application. protected range.
图6是根据本申请一实施例的信息处理方法600的示意性流程图。该方法可选地可以应用于图1所示的系统,但并不仅限于此。该方法包括以下内容的至少部分内容。Fig. 6 is a schematic flowchart of an information processing method 600 according to an embodiment of the present application. The method can optionally be applied to the system shown in Fig. 1, but is not limited thereto. The method includes at least some of the following.
S610、终端设备接收第一信息。S610. The terminal device receives first information.
S620、所述终端设备发送基于第一信息得到的第二信息;S620. The terminal device sends second information obtained based on the first information;
其中,所述第二信息为所述第一信息经由第一模型处理得到的,所述第二信息用于经由第二模型进行处理以得到信道信息;所述第一模型和第二模型为联合训练得到的。Wherein, the second information is obtained by processing the first information through the first model, and the second information is used for processing through the second model to obtain channel information; the first model and the second model are a joint obtained by training.
上述S610中,所述第一信息可以为参考信号,具体来说,所述参考信号可以为当前信道的参考信号,比如可以为当前信道的下行参考信号。所述下行参考信号可以包括CSI-RS、DMRS、PT-RS中至少一种。In the above S610, the first information may be a reference signal, specifically, the reference signal may be a reference signal of the current channel, such as a downlink reference signal of the current channel. The downlink reference signal may include at least one of CSI-RS, DMRS, and PT-RS.
所述第一信息可以分布在第一维度和/或第二维度。The first information may be distributed in the first dimension and/or the second dimension.
其中,所述第一维度为时域维度;所述第一信息分布在所述时域维度中的至少一个时间单元内。所述至少一个时间单元中每个时间单元可以包含以下之一:1个时隙、1个正交频分复用(OFDM,Orthogonal Frequency Division Multiplexing)符号。举例来说,所述第一信号为下行参考信号,所述下行参考信号在时域维度上可以分布在1个时隙上,或者所述下行参考信号在时域维度上可以分布在2个或4个时隙上。Wherein, the first dimension is a time domain dimension; the first information is distributed in at least one time unit in the time domain dimension. Each time unit in the at least one time unit may include one of the following: 1 time slot and 1 Orthogonal Frequency Division Multiplexing (OFDM, Orthogonal Frequency Division Multiplexing) symbol. For example, the first signal is a downlink reference signal, and the downlink reference signal may be distributed in one time slot in the time domain dimension, or the downlink reference signal may be distributed in two or on 4 time slots.
所述第二维度为频域维度;所述第一信息分布在所述频域维度中的至少一个频域资源上;其中,所述至少一个频域资源中每个频域资源可以为以下之一:一个资源块(RB,Resource Block)、一个子载波。举例来说,所述第一信号为下行参考信号,所述下行参考信号在频域维度上可以分布在1个RB上,或者所述下行参考信号在时域维度上可以分布在2个或4个RB上。The second dimension is a frequency domain dimension; the first information is distributed on at least one frequency domain resource in the frequency domain dimension; wherein, each frequency domain resource in the at least one frequency domain resource can be one of the following One: one resource block (RB, Resource Block), one subcarrier. For example, the first signal is a downlink reference signal, and the downlink reference signal may be distributed in 1 RB in the frequency domain dimension, or the downlink reference signal may be distributed in 2 or 4 RBs in the time domain dimension. on RBs.
上述第一维度与第二维度可以合并使用,也就是说,所述第一信息可以分布在第一维度以及第二维度上;比如,所述第一信息在频域维度中分布在a个RB上,在时域维度中分布在b个时隙中;a和b均为正整数。举例来说,所述第一信息为下行参考信号,该下行参考信号在频域上可以分布在4个RB上,在时域维度上可以分布在6个时隙中。The first dimension and the second dimension above can be used in combination, that is, the first information can be distributed on the first dimension and the second dimension; for example, the first information can be distributed on a RBs in the frequency domain dimension , distributed in b time slots in the time domain dimension; both a and b are positive integers. For example, the first information is a downlink reference signal, and the downlink reference signal may be distributed in 4 RBs in the frequency domain, and may be distributed in 6 time slots in the time domain dimension.
再进一步地,所述第一信息还可以表示为复数,也就是说,所述第一信息还分布在第三维度;所述第三维度为复数维度;所述第一信息包括第一信息样本的实部和第一信息样本的虚部。比如,所述第一信息的实部分布在频域资源的a个RB上以及时域资源的b个时隙上,所述第一信息的虚部分布在频域资源的a个RB上以及时域资源的b个时隙上。Still further, the first information can also be expressed as a complex number, that is, the first information is also distributed in the third dimension; the third dimension is a complex dimension; the first information includes the first information sample The real part of and the imaginary part of the first information sample. For example, the real part of the first information is distributed on the a RBs of the frequency domain resources and the b time slots of the time domain resources, and the imaginary part of the first information is distributed on the a RBs of the frequency domain resources. b time slots of the time domain resource.
可选地,所述终端设备在接收第一信息之前,还可以先接收网络设备发送的配置信息,该配置信息中可以配置供终端设备测量用的第一信息。以该第一信息为下行参考信号为例,该配置信息可以是配置终端设备测量SSB或者CSI-RS等等。Optionally, before receiving the first information, the terminal device may first receive configuration information sent by the network device, and the configuration information may be configured with the first information for the terminal device to measure. Taking the first information as an example of a downlink reference signal, the configuration information may be configuring the terminal device to measure SSB or CSI-RS and so on.
在完成S610即所述终端设备接收到第一信息之后,所述终端设备可以基于第一模型对所述第一信息进行处理得到所述第二信息。After completing S610, that is, after the terminal device receives the first information, the terminal device may process the first information based on the first model to obtain the second information.
在一种示例中,所述第二信息为信道压缩信息;所述第一模型用于基于输入的所述第一信息进行处理得到信道压缩信息。也就是说,所述第一模型的输入信息为所述第一信息,所述第一模型输出的第二信息为所述信道压缩信息。In an example, the second information is channel compression information; the first model is configured to process the input first information to obtain channel compression information. That is, the input information of the first model is the first information, and the second information output by the first model is the channel compression information.
需要指出的是,所述第一模型还可以称为编码模型或编码网络等,只要输入信息为第一信息以及输出信息为信道压缩信息的模型或神经网络均在本实施例保护范围内。It should be pointed out that the first model may also be called an encoding model or an encoding network, as long as the input information is the first information and the output information is the channel compression information, the model or neural network is within the protection scope of this embodiment.
其中,所述第一模型具体可以包括以下子模型:估计子模型和压缩子模型;Wherein, the first model may specifically include the following sub-models: an estimation sub-model and a compression sub-model;
所述估计子模型用于基于所述第一信息进行信道估计得到信道估计信息;The estimation sub-model is used to perform channel estimation based on the first information to obtain channel estimation information;
所述压缩子模型用于对所述信道估计信息进行压缩得到信道压缩信息。The compression sub-model is used to compress the channel estimation information to obtain channel compression information.
所述估计子模型还可以称为信道估计子模型或称为信道估计子神经网络,该估计子模型可采用全连接网络、卷积神经网络、残差网络、自注意力机制网络中的一种或者多种网络结构构建。所述压缩子模型可以称为信道压缩子模型或称为信道压缩子神经网络,该压缩子模型可采用全连接网络、卷积神经网络、残差网络、自注意力机制网络中的一种或者多种网络结构构建。The estimation sub-model can also be called a channel estimation sub-model or a channel estimation sub-neural network, and the estimation sub-model can use one of a fully connected network, a convolutional neural network, a residual network, and a self-attention mechanism network. Or a variety of network structure construction. The compression sub-model may be called a channel compression sub-model or a channel compression sub-neural network, and the compression sub-model may use one of a fully connected network, a convolutional neural network, a residual network, a self-attention mechanism network, or A variety of network structure construction.
所述估计子模型采用的估计的方法可以包括有最小均方误差(MMSE)等算法。The estimation method adopted by the estimation sub-model may include algorithms such as minimum mean square error (MMSE).
所述压缩子模型可以对输入信息的数据量进行压缩。所述压缩子模型的输出信息与输入信息之间的压缩率可以为训练时确定的,比如,压缩率可以为千分之五、千分之二、百分之十等等,这里不进行穷举。The compression sub-model can compress the data volume of the input information. The compression rate between the output information of the compressed sub-model and the input information can be determined during training, for example, the compression rate can be 5/1000, 2/1000, 10%, etc. lift.
所述终端设备基于第一模型对所述第一信息进行处理得到所述第二信息,具体可以为:The terminal device processes the first information based on the first model to obtain the second information, which may specifically be:
所述终端设备将所述第一信息输入所述估计子模型,得到所述估计子模型输出的信道估计信息;The terminal device inputs the first information into the estimation sub-model, and obtains channel estimation information output by the estimation sub-model;
所述终端设备将所述信道估计信息输入所述压缩子模型,得到所述压缩子模型输出的信道压缩信息。The terminal device inputs the channel estimation information into the compression sub-model, and obtains channel compression information output by the compression sub-model.
本示例中,所述第一信息可以为参考信号,具体来说,所述第一信息可以为当前信道的参考信号,比如所述第一信息可以为当前信道的下行参考信号。相应的,所述信道信息可以用于表征基于所述第一信息所得到的信道质量、信道状态、信道估计结果中至少之一。In this example, the first information may be a reference signal, specifically, the first information may be a reference signal of a current channel, for example, the first information may be a downlink reference signal of a current channel. Correspondingly, the channel information may be used to characterize at least one of channel quality, channel state, and channel estimation result obtained based on the first information.
所述信道信息可以采用T个维度的矩阵表示,T为大于等于2的整数。或者所述信道估计信息同样可以采用T个维度的矩阵表示,以下以信道信息为例进行说明,信道估计信息的说明与其类似不做重复说明。The channel information may be represented by a matrix of T dimensions, where T is an integer greater than or equal to 2. Alternatively, the channel estimation information may also be represented by a matrix of T dimensions. The channel information is used as an example for description below, and the description of the channel estimation information is similar to that and will not be repeated.
所述T个维度的矩阵具体可以为M×N的二维矩阵;其中,M表示在第四维度下的第一粒度的数量,N表示在第五维度下的第二粒度的数量;M和N均为正整数。也就是说,所述信道信息可以由大小为M×N的二维矩阵构成,其在第四维度上有M个第一粒度,在第五维度上有N个第二粒度;上述M和N可以相等也可以不相等。所述二维矩阵内具体的数值指示代表信道质量某一个第一粒度下接收的信号强度,这里所述二维矩阵内的数值的单位可以是dBm,或所述二维矩阵内的数值没有单位而是归一化之后所得到的数值。此外,也可以将M×N的二维矩阵合成成为1×(M×N)大小或者(M×N)×1大小的一维数据,具体变换是可以是先第四维度再第五维度,也可以是先第五维度再第四维度,本实施例不对其进行限定。The matrix of the T dimensions may specifically be a two-dimensional matrix of M×N; wherein, M represents the number of first granularities in the fourth dimension, and N represents the number of second granularities in the fifth dimension; M and N are all positive integers. That is to say, the channel information may be composed of a two-dimensional matrix with a size of M×N, which has M first granularities in the fourth dimension and N second granularities in the fifth dimension; the above M and N May or may not be equal. The specific numerical indication in the two-dimensional matrix represents the received signal strength at a certain first granularity of the channel quality, where the unit of the numerical value in the two-dimensional matrix may be dBm, or the numerical value in the two-dimensional matrix has no unit It is the value obtained after normalization. In addition, the two-dimensional matrix of M×N can also be synthesized into one-dimensional data of size 1×(M×N) or (M×N)×1. The specific transformation can be the fourth dimension first and then the fifth dimension. It may also be the fifth dimension first and then the fourth dimension, which is not limited in this embodiment.
所述第四维度可以为频域维度;所述第一粒度包含以下之一:L1个资源块RB,L2个子载波;L1和L2为正整数。或者,所述第四维度可以为时域维度;所述第一粒度包含以下之一:K1个微秒、K2个符号长度、K3个符号的采样点个数;K1、K2和K3为正整数。所述符号为正交频分复用符号(OFDM,Orthogonal Frequency Division Multiplexing)。这里,所述第四维度为时域维度的时候,所述第一粒度还可以称为时延粒度。The fourth dimension may be a frequency domain dimension; the first granularity includes one of the following: L1 resource blocks RB, L2 subcarriers; L1 and L2 are positive integers. Alternatively, the fourth dimension may be a time-domain dimension; the first granularity includes one of the following: K1 microseconds, K2 symbol lengths, and the number of sampling points of K3 symbols; K1, K2, and K3 are positive integers . The symbols are Orthogonal Frequency Division Multiplexing (OFDM, Orthogonal Frequency Division Multiplexing). Here, when the fourth dimension is a time domain dimension, the first granularity may also be called a delay granularity.
举例来说,当所述第四维度是频域维度时,所述第一粒度可以是L1个RB(L1大于等于1,例如2RB,4RB,8RB),则所述信道信息在频域维度上的分布范围是M×L1个RB所对应的频域范围;或者所述第一粒度可以是L2个子载波(L2大于1,例如4个子载波,6个子载波,18个子载波),则所述信道信息在频域维度上的分布是M×L2个子载波对应的频域范围。当所述第四维度是时域维度时,所述第一粒度可以是时延粒度,例如一个第一粒度是K1个微秒、或者K2个符号长度、或者K3个符号的采样点个数,这里所述符号可以是一个OFDM符号;当所述第四维度是时域维度且所述第一粒度为K1个微秒时,所述信道信息在时域维度上的分布范围是M×K1个微秒对应的时域范围。For example, when the fourth dimension is the frequency domain dimension, the first granularity may be L1 RBs (L1 is greater than or equal to 1, such as 2RB, 4RB, 8RB), then the channel information in the frequency domain dimension The distribution range is the frequency domain range corresponding to M×L1 RBs; or the first granularity can be L2 subcarriers (L2 is greater than 1, such as 4 subcarriers, 6 subcarriers, and 18 subcarriers), then the channel The distribution of information in the frequency domain dimension is the frequency domain range corresponding to M×L2 subcarriers. When the fourth dimension is a time-domain dimension, the first granularity may be a delay granularity, for example, a first granularity is the number of sampling points of K1 microseconds, or K2 symbol lengths, or K3 symbols, Here, the symbol may be an OFDM symbol; when the fourth dimension is the time domain dimension and the first granularity is K1 microseconds, the distribution range of the channel information on the time domain dimension is M×K1 The time domain range corresponding to microseconds.
所述第五维度可以为空间域维度;相应的,所述第二粒度为一对收发天线或到达角度的间隔。也就是说,所述第五维度为所述空间域维度,具体地可以是天线维度,所述第二粒度可以是一对收发天线。或者,所述第五维度为空间域维度,具体的可以是角度域维度,所述第二粒度可以是到达角度的间隔大小。The fifth dimension may be a space domain dimension; correspondingly, the second granularity is an interval between a pair of transmitting and receiving antennas or an angle of arrival. That is to say, the fifth dimension is the space domain dimension, specifically, it may be an antenna dimension, and the second granularity may be a pair of transmitting and receiving antennas. Alternatively, the fifth dimension is a space domain dimension, specifically, an angle domain dimension, and the second granularity may be an interval of arrival angles.
再进一步地,表征所述信道信息的二维矩阵中的第ij个位置的数值用于表示在所述第四维度下的第i个第一粒度以及第五维度的第j个第二粒度下的信道质量;i和j均为正整数。也就是说,用于表示所述信道信息的所述二维矩阵中某一个位置处的数值(或称为指示值)代表了在第四维度以及第五维度这样的组合下的信道质量。其中,所述信道质量可以采用信号强度值来表征;所述信号强度值的单位可以是dBm,或所述信号强度值没有单位而是归一化之后所得到的数值。Still further, the value of the ijth position in the two-dimensional matrix representing the channel information is used to represent the i-th first granularity in the fourth dimension and the j-th second granularity in the fifth dimension The channel quality of ; i and j are both positive integers. That is to say, a numerical value (or an indicator value) at a certain position in the two-dimensional matrix used to represent the channel information represents the channel quality under the combination of the fourth dimension and the fifth dimension. Wherein, the channel quality may be characterized by a signal strength value; the unit of the signal strength value may be dBm, or the signal strength value has no unit but a value obtained after normalization.
所述T个维度中还可以包括第六维度。所述T个维度的矩阵可以为M×N×W的三维矩阵;其中,M表示在第四维度下的第一粒度的数量,N表示在第五维度下的第二粒度的数量,W表示在第六维度下的第三粒度的数量;M、N和W均为正整数。The T dimensions may also include a sixth dimension. The matrix of T dimensions may be a three-dimensional matrix of M×N×W; wherein, M represents the number of first granularities in the fourth dimension, N represents the number of second granularities in the fifth dimension, and W represents The number of third granularities under the sixth dimension; M, N and W are all positive integers.
示例性的,所述第六维度可以为复数维度,所述第三粒度为1,在所述复数维度下的第三粒度的数量W为2。举例来说,所述第四维度表示时域维度的时候,所述第一粒度为时延粒度;所述第五维度为空间域维度具体为角度维度,所述第二粒度为到达角度的间隔;所述第六维度为复数维度,W为2,k为1表示实部,k为2表示虚部。i=4、j=5、k=1时,上述三维矩阵的第ijk个位置处的数值(或指示值)表示第5个空间粒度(例如到达角度的间隔)内的第4个时延粒度上的信道质量的实部。若i=4、j=5、k=2时,则上述三维矩阵的第ijk个位置处的数值(或指示值)表示第5个空间粒度内的第4个时延粒度上的信道质量的虚部。Exemplarily, the sixth dimension may be a complex dimension, the third granularity is 1, and the quantity W of the third granularity under the complex dimension is 2. For example, when the fourth dimension represents the time domain dimension, the first granularity is the delay granularity; the fifth dimension is the spatial domain dimension, specifically the angle dimension, and the second granularity is the interval of arrival angles ; The sixth dimension is a complex dimension, W is 2, k is 1 to indicate the real part, and k is 2 to indicate the imaginary part. When i=4, j=5, k=1, the value (or indicator value) at the ijkth position of the above-mentioned three-dimensional matrix represents the fourth delay granularity in the fifth spatial granularity (such as the interval of arrival angle) The real part of the channel quality on . If i=4, j=5, k=2, then the value (or indicator value) at the ijkth position of the above-mentioned three-dimensional matrix represents the channel quality on the 4th delay granularity in the 5th spatial granularity imaginary part.
后续的描述中,为了描述简单起见,以第四维度和第五维度构成的二维矩阵来举例说明上述信道信息,但需要明确的是上述信道信息的矩阵的维度不局限在二维。In the subsequent description, for the sake of simplicity, the above channel information is illustrated by using a two-dimensional matrix formed by the fourth dimension and the fifth dimension. However, it should be clarified that the dimension of the above channel information matrix is not limited to two dimensions.
在另一种示例中,所述第二信息可以为信道压缩信息;该信道压缩信息包括压缩的信道估计信息的特征向量信息;相应的,所述第一模型用于对输入的所述第一信息进行处理得到压缩的信道估计信息的特征向量信息。In another example, the second information may be channel compression information; the channel compression information includes eigenvector information of compressed channel estimation information; correspondingly, the first model is used for the input first The information is processed to obtain the eigenvector information of the compressed channel estimation information.
本示例中,所述第一信息可以为参考信号,具体来说,所述第一信息可以为当前信道的参考信号,比如所述第一信息可以为当前信道的下行参考信号。所述第二模型输出的信道信息具体可以为信道信息的特征向量信息。In this example, the first information may be a reference signal, specifically, the first information may be a reference signal of a current channel, for example, the first information may be a downlink reference signal of a current channel. The channel information output by the second model may specifically be feature vector information of the channel information.
需要指出的是,所述第一模型还可以称为编码模型或编码神经网络等,只要输入信息为第一信息以及输出信息为压缩的信道估计信息的特征向量信息的模型或神经网络均在本实施例保护范围内。It should be pointed out that the first model can also be called an encoding model or an encoding neural network, etc., as long as the input information is the first information and the output information is the eigenvector information of the compressed channel estimation information or the neural network. Within the protection scope of the embodiment.
其中,所述第一模型具体可以包括以下子模型:估计子模型、信道生成子模型和压缩子模型;Wherein, the first model may specifically include the following sub-models: estimation sub-model, channel generation sub-model and compression sub-model;
所述估计子模型用于对所述第一信息进行信道估计得到信道估计信息;The estimation sub-model is used to perform channel estimation on the first information to obtain channel estimation information;
所述信道生成子模型用于对所述信道估计信息进行特征分解得到信道估计信息的特征向量信息;The channel generation sub-model is used to perform eigendecomposition on the channel estimation information to obtain eigenvector information of the channel estimation information;
所述压缩子模型用于对所述信道估计信息的特征向量信息进行压缩得到压缩的信道估计信息的特征向量信息。The compression sub-model is used to compress the eigenvector information of the channel estimation information to obtain compressed eigenvector information of the channel estimation information.
其中,所述信道信息的特征向量信息包含R组特征向量序列信息;R为正整数。比如,R可以为1,则所述信道信息的特征向量信息包含1组特征向量序列信息。R可以为2,则所述信道信息的特征向量信息包含2组特征向量序列信息。上述R的取值可以根据实际情况来确定,又或者可以是在所述第一模型训练的时候指定。相应的,所述信道估计信息的特征向量信息也可以包含R组特征向量序列信息,不做赘述。Wherein, the eigenvector information of the channel information includes R groups of eigenvector sequence information; R is a positive integer. For example, R may be 1, then the eigenvector information of the channel information includes a set of eigenvector sequence information. R may be 2, then the eigenvector information of the channel information includes 2 sets of eigenvector sequence information. The above value of R may be determined according to the actual situation, or may be specified during the training of the first model. Correspondingly, the eigenvector information of the channel estimation information may also include R groups of eigenvector sequence information, which will not be described in detail.
上述R组特征向量序列信息中,每一组特征向量序列信息中可以包括预设长度的特征序列。其中,不同组的特征向量序列信息所包含的特征序列的长度相同。所述预设长度可以根据实际情况设置或者可以是在训练的时候设置的,比如,可以为16、32、48、64、128、256中任意之一,当然还可以更长或更短,本实施例不对所述预设长度的全部可能的取值进行穷举。结合图7举例来说,预设长度为32(但是可以为比特bit),R为4,也就是信道信息的特征向量信息包含 了4组特征向量序列信息,其中,每一组特征向量序列信息中包含了长度为32的特征序列。In the above R sets of feature vector sequence information, each set of feature vector sequence information may include a feature sequence of a preset length. Wherein, the lengths of the feature sequences included in the feature vector sequence information of different groups are the same. The preset length can be set according to the actual situation or can be set during training, for example, it can be any one of 16, 32, 48, 64, 128, 256, and of course it can be longer or shorter. The embodiment does not exhaustively list all possible values of the preset length. In conjunction with FIG. 7, for example, the preset length is 32 (but it can be a bit), and R is 4, that is, the feature vector information of the channel information includes 4 sets of feature vector sequence information, wherein each set of feature vector sequence information contains a feature sequence of length 32.
所述估计子模型的功能与前述实施例相同,不做重复说明。The function of the estimation sub-model is the same as that of the foregoing embodiment, and no repeated description is given.
所述信道生成子模型中进行特征分解的方式具体可以为奇异值分解(SVD,Singular Value Decomposition)方式。比如,所述信道生成子模型将输入的信道估计信息进行SVD特征分解,得到特征分解之后的信道估计信息的特征向量信息。所述信道估计信息可以为采用矩阵表示,具体的说明与前述实施例相同,不再赘述。The manner of performing eigendecomposition in the channel generation sub-model may specifically be a singular value decomposition (SVD, Singular Value Decomposition) manner. For example, the channel generation sub-model performs SVD eigendecomposition on the input channel estimation information to obtain eigenvector information of the channel estimation information after eigendecomposition. The channel estimation information may be represented by a matrix, and the specific description is the same as that of the foregoing embodiment, and will not be repeated here.
所述压缩子模型可以为对输入信息的数据量进行压缩。所述压缩子模型输出的第二信息与输入信息之间的压缩率可以为训练时确定的,比如,压缩率可以为千分之五、千分之二、百分之十等等,这里不进行穷举。The compression sub-model may be to compress the data volume of the input information. The compression rate between the second information output by the compression sub-model and the input information can be determined during training, for example, the compression rate can be five thousandths, two thousandths, ten percent, etc., not here Exhaustive.
本示例中所述估计子模型所输出的信道估计信息与所述第二模型所输出的信道信息可以是不同的,所述估计子模型所输出的信道估计信息具体可以为信道信息的矩阵,比如采用T个维度的矩阵来表示;所述第二模型所输出的信道信息可以为信道信息的特征向量信息,比如可以包含R组特征向量序列信息。当然,本示例中所述第二模型所输出的信道信息与所述估计子模型所输出的信道估计信息也可能是相同的。In this example, the channel estimation information output by the estimation sub-model may be different from the channel information output by the second model, and the channel estimation information output by the estimation sub-model may specifically be a matrix of channel information, such as It is represented by a matrix of T dimensions; the channel information output by the second model may be eigenvector information of the channel information, for example, may include R groups of eigenvector sequence information. Certainly, the channel information output by the second model in this example may also be the same as the channel estimation information output by the estimation sub-model.
所述终端设备可以基于第一模型对所述第一信息进行处理得到所述第二信息,具体可以为:The terminal device may process the first information based on the first model to obtain the second information, which may specifically be:
所述终端设备将所述第一信息输入所述估计子模型,得到所述估计子模型输出的信道估计信息;The terminal device inputs the first information into the estimation sub-model, and obtains channel estimation information output by the estimation sub-model;
所述终端设备将所述信道估计信息输入所述信道生成子模型,得到所述信道生成子模型输出的信道估计信息的特征向量信息;The terminal device inputs the channel estimation information into the channel generation sub-model, and obtains eigenvector information of the channel estimation information output by the channel generation sub-model;
所述终端设备将所述信道估计信息的特征向量信息输入所述压缩子模型,得到所述压缩子模型输出的压缩的信道估计信息的特征向量信息。The terminal device inputs the eigenvector information of the channel estimation information into the compression sub-model, and obtains the eigenvector information of the compressed channel estimation information output by the compression sub-model.
在完成上述处理之后,所述终端设备可以执行S620,所述终端设备发送基于第一信息得到的第二信息。After the above processing is completed, the terminal device may execute S620, where the terminal device sends second information obtained based on the first information.
具体的,所述终端设备发送基于所述第一信息得到的第二信息,可以为:所述终端设备向网络设备发送基于所述第一信息得到的第二信息。其中,所述网络设备可以为服务所述终端设备的接入网设备(比如基站,或eNB,或gNB),或者所述网络设备可以指的是与所述终端设备进行通信的接入网设备(比如基站,或eNB,或gNB)。Specifically, the terminal device sending the second information obtained based on the first information may be: the terminal device sends the second information obtained based on the first information to a network device. Wherein, the network device may be an access network device (such as a base station, or eNB, or gNB) serving the terminal device, or the network device may refer to an access network device that communicates with the terminal device (such as a base station, or eNB, or gNB).
其中,所述第二信息可以由以下信息中之一携带:随机接入过程中包含的信息,无线资源控制(RRC,Radio Resource Control)消息,上行控制信息(UCI,Uplink Control Information)。所述随机接入过程中包含的信息,包括以下之一:两步随机接入过程中的消息A;四步随机接入过程中的Msg1;四步随机接入过程中的Msg3。Wherein, the second information may be carried by one of the following information: information included in the random access process, radio resource control (RRC, Radio Resource Control) message, uplink control information (UCI, Uplink Control Information). The information contained in the random access process includes one of the following: message A in the two-step random access process; Msg1 in the four-step random access process; Msg3 in the four-step random access process.
以上对终端设备如何使用所述第一模型进行了详细说明,关于所述终端设备得到所述第一模型的方式可以有以下两种:第一种方式:所述终端设备直接获取的;第二种方式:所述终端设备训练得到的。下面针对这两种方式分别进行说明:The above describes how the terminal device uses the first model in detail. There are two ways for the terminal device to obtain the first model: the first way: the terminal device obtains the first model directly; the second One way: obtained by the terminal device through training. The two methods are described below:
第一种方式、所述终端设备接收所述第一模型。In a first manner, the terminal device receives the first model.
具体来说,所述终端设备接收电子设备发送的所述第一模型;比如,可以是所述终端设备可以接收电子设备发送的所述第一模型的模型参数。Specifically, the terminal device receives the first model sent by the electronic device; for example, the terminal device may receive model parameters of the first model sent by the electronic device.
这里,所述电子设备可以为联合训练得到所述第一模型和所述第二模型的电子设备。Here, the electronic device may be an electronic device that obtains the first model and the second model through joint training.
示例性的,所述电子设备可以为网络设备,所述网络设备可以为服务所述终端设备的接入网设备,比如基站、eNB、gNB等等。或者,所述电子设备可以为除了服务所述终端设备的网络设备之外的其他设备,比如,可以是服务器、或台式机、或笔记本等等具备数据处理能力的其他设备,本实施例不进行穷举。Exemplarily, the electronic device may be a network device, and the network device may be an access network device serving the terminal device, such as a base station, eNB, gNB, and so on. Alternatively, the electronic device may be other devices than the network device serving the terminal device, for example, it may be a server, or a desktop computer, or a notebook or other device with data processing capabilities, which is not described in this embodiment. Exhaustive.
其中,在所述电子设备为服务所述终端设备的网络设备的情况下,所述终端设备接收电子设备发送的所述第一模型具体可以为:所述终端设备接收网络设备发送的所述第一模型。其中,所述第一模型(或所述第一模型的模型参数)可以由以下至少之一携带:下行控制信令、媒体接入控制(MAC,Media Access Control)控制元素(CE,Control Element)消息、RRC消息、广播消息、下行数据传输、针对人工智能类业务传输需求的下行数据传输。Wherein, when the electronic device is a network device serving the terminal device, receiving the first model sent by the electronic device by the terminal device may specifically be: the terminal device receives the first model sent by the network device. a model. Wherein, the first model (or the model parameters of the first model) may be carried by at least one of the following: downlink control signaling, media access control (MAC, Media Access Control) control element (CE, Control Element) Messages, RRC messages, broadcast messages, downlink data transmission, and downlink data transmission for artificial intelligence business transmission requirements.
在所述电子设备为除了服务所述终端设备的网络设备之外的其他设备的情况下,该第一模型(或所述第一模型的模型参数)可以是通过有线连接方式传输的、或其他无线连接方式传输的。比如,电子设备通过与终端设备之间的有线连接将所述第一模型(或所述第一模型的模型参数)传输给所述终端设备。或者,电子设备通过与终端设备之间的其他无线连接将所述第一模型(或所述第一模型的模型参数)传输给所述终端设备;其中,所述其他无线连接方式可以是蓝牙连接方式或无线保真(Wi-Fi,Wireless Fidelity)连接方式等等,这里不进行穷举。In the case that the electronic device is other than the network device serving the terminal device, the first model (or the model parameters of the first model) may be transmitted through a wired connection, or other transmitted over a wireless connection. For example, the electronic device transmits the first model (or the model parameters of the first model) to the terminal device through a wired connection with the terminal device. Alternatively, the electronic device transmits the first model (or the model parameters of the first model) to the terminal device through other wireless connections with the terminal device; wherein, the other wireless connection method may be a Bluetooth connection Ways or wireless fidelity (Wi-Fi, Wireless Fidelity) connection ways, etc., are not exhaustive here.
或者,所述终端设备可以分别接收多个子模型,然后将接收到的所述多个子模型合并得到所述第一模型。Alternatively, the terminal device may receive multiple sub-models respectively, and then combine the multiple received sub-models to obtain the first model.
在一种情况中,所述第一模型包含有估计子模型以及压缩子模型。相应的,所述终端设备接收估计子模型以及压缩子模型;所述终端设备基于所述估计子模型以及所述压缩子模型,生成所述第一模型。具体可以是:所述终端设备接收所述电子设备发送的估计子模型的模型参数以及压缩子模型的模型参数;所述终端设备基于所述估计子模型的模型参数以及所述压缩子模型的模型参数得到所述第一模型。In one case, the first model includes an estimation sub-model and a compression sub-model. Correspondingly, the terminal device receives the estimated sub-model and the compressed sub-model; the terminal device generates the first model based on the estimated sub-model and the compressed sub-model. Specifically, the terminal device receives the model parameters of the estimated sub-model and the model parameters of the compressed sub-model sent by the electronic device; parameters to obtain the first model.
这里,所述终端设备可以为同时接收所述电子设备发送的估计子模型以及压缩子模型;又或者,可以分别接收所述电子设备发送的估计子模型以及压缩子模型,比如,可以为先接收所述电子设备发送的估计子模型再接收所述电子设备发送的压缩子模型,或者,先接收所述电子设备发送的压缩子模型再接收所述电子设备发送的估计子模型。Here, the terminal device may receive the estimated sub-model and the compressed sub-model sent by the electronic device at the same time; or, it may receive the estimated sub-model and the compressed sub-model sent by the electronic device separately, for example, it may be received first The estimated sub-model sent by the electronic device then receives the compressed sub-model sent by the electronic device, or first receives the compressed sub-model sent by the electronic device and then receives the estimated sub-model sent by the electronic device.
在所述电子设备为服务所述终端设备的网络设备的情况下,可以是由以下信息之一同时携带或分别携带估计子模型(或估计子模型的模型参数)以及压缩子模型(或压缩子模型的模型参数):下行控制信令、MAC CE消息、RRC消息、广播消息、下行数据传输、针对人工智能类业务传输需求的下行数据传输。In the case that the electronic device is a network device serving the terminal device, the estimated sub-model (or the model parameters of the estimated sub-model) and the compressed sub-model (or the compressed sub-model) may be carried simultaneously or separately by one of the following information Model parameters of the model): downlink control signaling, MAC CE message, RRC message, broadcast message, downlink data transmission, downlink data transmission for artificial intelligence business transmission requirements.
在所述电子设备为除了服务所述终端设备的网络设备之外的其他设备的情况下,所述电子设备可以通过与终端设备之间的有线连接将上述估计子模型(或估计子模型的模型参数)以及压缩子模型(或压缩子模型的模型参数)同时发送或分别发送给所述终端设备。或者,电子设备通过与终端设备之间的其他无线连接将上述估计子模型(或估计子模型的模型参数)以及压缩子模型(或压缩子模型的模型参数)同时发送或分别发送给所述终端设备;其中,所述其他无线连接方式可以是蓝牙连接方式或WIFI连接方式等等,这里不进行穷举。In the case that the electronic device is other than the network device serving the terminal device, the electronic device may use the above estimated sub-model (or the model of the estimated sub-model) through a wired connection with the terminal device parameters) and the compressed sub-model (or the model parameters of the compressed sub-model) are sent to the terminal device at the same time or separately. Or, the electronic device sends the above-mentioned estimated sub-model (or the model parameters of the estimated sub-model) and the compressed sub-model (or the model parameters of the compressed sub-model) to the terminal at the same time or separately through other wireless connections with the terminal device device; wherein, the other wireless connection methods may be a Bluetooth connection method or a WIFI connection method, etc., which are not exhaustive here.
在又一种情况中,所述第一模型包含有估计子模型、信道生成子模型以及压缩子模型。In yet another case, the first model includes an estimation sub-model, a channel generation sub-model, and a compression sub-model.
这种情况下,所述终端设备接收估计子模型、压缩子模型以及信道生成子模型;所述终端设备基于所述估计子模型、 所述压缩子模型以及所述信道生成子模型,生成所述第一模型。具体可以是:所述终端设备接收所述电子设备发送的估计子模型的模型参数、压缩子模型的模型参数以及信道生成子模型的模型参数;所述终端设备基于所述估计子模型的模型参数、所述压缩子模型的模型参数以及所述信道生成子模型的模型参数得到所述第一模型。In this case, the terminal device receives the estimation sub-model, the compression sub-model and the channel generation sub-model; the terminal device generates the first model. Specifically, the terminal device receives the model parameters of the estimated sub-model, the model parameters of the compression sub-model and the model parameters of the channel generation sub-model sent by the electronic device; , model parameters of the compression sub-model, and model parameters of the channel generation sub-model to obtain the first model.
这里,所述终端设备可以同时接收所述电子设备发送的估计子模型、压缩子模型以及信道生成子模型。或者,所述终端设备可以分别接收所述电子设备发送的估计子模型、压缩子模型以及信道生成子模型,比如,估计子模型、压缩子模型以及信道生成子模型均分别接收;又或者,估计子模型、压缩子模型以及信道生成子模型中任意两个与剩余一个分别接收。举例来说,所述终端设备可以为先接收所述电子设备发送的估计子模型,再接收所述电子设备发送的信道生成子模型,最后接收电子设备发送的压缩子模型;或者,所述终端设备先接收所述电子设备发送的压缩子模型和信道生成子模型,再接收所述电子设备发送的估计子模型。需要指出的是,上述仅为示例性说明,不代表实际分别发送或接收上述估计子模型、压缩子模型以及信道生成子模型仅存在上述示例性的几种组合方式,只是本实施例不做穷举。Here, the terminal device may simultaneously receive the estimation sub-model, compression sub-model and channel generation sub-model sent by the electronic device. Alternatively, the terminal device may respectively receive the estimated sub-model, the compressed sub-model and the channel generation sub-model sent by the electronic device, for example, the estimated sub-model, the compressed sub-model and the channel generation sub-model are respectively received; or, the estimated Any two of the sub-model, compression sub-model and channel generation sub-model are received separately from the remaining one. For example, the terminal device may first receive the estimated submodel sent by the electronic device, then receive the channel generation submodel sent by the electronic device, and finally receive the compressed submodel sent by the electronic device; or, the terminal The device first receives the compressed submodel and the channel generation submodel sent by the electronic device, and then receives the estimated submodel sent by the electronic device. It should be pointed out that the above is only an exemplary description, and does not mean that there are only several combinations of the above-mentioned exemplary sub-models, compression sub-models, and channel generation sub-models that are actually sent or received respectively, but this embodiment is not exhaustive. lift.
在所述电子设备为服务所述终端设备的网络设备的情况下,所述终端设备可以同时接收或分别接收所述网络设备发送的估计子模型、压缩子模型以及信道生成子模型时,所述估计子模型、压缩子模型以及信道生成子模型由以下之一同时携带或分别携带:下行控制信令、MAC CE消息、RRC消息、广播消息、下行数据传输、针对人工智能类业务传输需求的下行数据传输。When the electronic device is a network device serving the terminal device, when the terminal device can simultaneously receive or separately receive the estimation submodel, compression submodel and channel generation submodel sent by the network device, the The estimation sub-model, compression sub-model and channel generation sub-model are carried by one of the following at the same time or separately: downlink control signaling, MAC CE message, RRC message, broadcast message, downlink data transmission, downlink for artificial intelligence business transmission requirements data transmission.
在所述电子设备为除了服务所述终端设备的网络设备之外的其他设备的情况下,所述电子设备可以通过与终端设备之间的有线连接将上述估计子模型、压缩子模型以及信道生成子模型,同时或分别发送给所述终端设备。或者,电子设备通过与终端设备之间的其他无线连接将上述估计子模型、压缩子模型以及信道生成子模型同时或分别发送给所述终端设备;其中,所述其他无线连接方式可以是蓝牙或WIFI等等,这里不进行穷举。In the case that the electronic device is other than the network device serving the terminal device, the electronic device can generate the above-mentioned estimated sub-model, compressed sub-model and channel through a wired connection with the terminal device. The sub-models are sent to the terminal devices simultaneously or separately. Alternatively, the electronic device sends the above-mentioned estimation submodel, compression submodel and channel generation submodel to the terminal device simultaneously or separately through other wireless connections with the terminal device; wherein, the other wireless connection methods may be Bluetooth or WIFI, etc., are not exhaustive here.
通过上述处理,所述终端设备可以接收到第一模型,进而可以执行前述S610~S620的处理。Through the above processing, the terminal device may receive the first model, and then may execute the foregoing processing of S610-S620.
在本方式中,所述方法还可以包括:所述终端设备接收所述第二模型。具体来说,所述终端设备可以接收所述第二模型的模型参数。再进一步的,所述终端设备可以接收电子设备发送的所述第二模型,具体可以是所述终端设备可以接收电子设备发送的所述第二模型的模型参数。In this manner, the method may further include: the terminal device receiving the second model. Specifically, the terminal device may receive model parameters of the second model. Still further, the terminal device may receive the second model sent by the electronic device, specifically, the terminal device may receive model parameters of the second model sent by the electronic device.
在所述电子设备为服务所述终端设备的网络设备的情况下,所述第二模型由以下之一携带:下行控制信令、MAC CE消息、RRC消息、广播消息、下行数据传输、针对人工智能类业务传输需求的下行数据传输。When the electronic device is a network device serving the terminal device, the second model is carried by one of the following: downlink control signaling, MAC CE message, RRC message, broadcast message, downlink data transmission, manual Downlink data transmission required for intelligent service transmission.
在所述电子设备为除了服务所述终端设备的网络设备之外的其他设备的情况下,该第二模型(或所述第二模型的模型参数)可以是通过有线连接方式传输的、或其他无线连接方式传输的。比如,电子设备通过与终端设备之间的有线连接将所述第二模型(或所述第二模型的模型参数)传输给所述终端设备。或者,电子设备通过与终端设备之间的其他无线连接将所述第二模型(或所述第二模型的模型参数)传输给所述终端设备;其中,所述其他无线连接方式可以是蓝牙或WIFI等等,这里不进行穷举。In the case that the electronic device is other than the network device serving the terminal device, the second model (or the model parameters of the second model) may be transmitted through a wired connection, or other transmitted over a wireless connection. For example, the electronic device transmits the second model (or the model parameters of the second model) to the terminal device through a wired connection with the terminal device. Alternatively, the electronic device transmits the second model (or the model parameters of the second model) to the terminal device through other wireless connections with the terminal device; wherein, the other wireless connection methods may be Bluetooth or WIFI, etc., are not exhaustive here.
需要说明的是,上述第二模型与所述第一模型也可以是同时发送的,还可以是分别发送的。It should be noted that the above-mentioned second model and the first model may also be sent at the same time, or may be sent separately.
在本方式中,所述方法还可以包括:所述终端设备接收所述第三模型。具体来说,所述终端设备可以接收所述第三模型的模型参数。再进一步的,所述终端设备可以接收电子设备发送的所述第三模型,具体可以是所述终端设备可以接收电子设备发送的所述第三模型的模型参数。In this manner, the method may further include: the terminal device receiving the third model. Specifically, the terminal device may receive model parameters of the third model. Still further, the terminal device may receive the third model sent by the electronic device, specifically, the terminal device may receive model parameters of the third model sent by the electronic device.
其中,所述第三模型用于对所述第一模型输出的第二信息进行数据变换处理后输入所述第二模型;所述第一模型、第二模型以及第三模型为联合训练得到的。Wherein, the third model is used to perform data transformation processing on the second information output by the first model and input it into the second model; the first model, the second model and the third model are obtained through joint training .
所述数据变换处理包括:卷积处理或傅里叶变换处理。比如,所述傅里叶变换处理具体可以包括:通过傅里叶变换转换到频域后相乘再通过傅里叶反变换转换到时域。The data transformation processing includes: convolution processing or Fourier transform processing. For example, the Fourier transform processing may specifically include: converting to the frequency domain through Fourier transform, multiplication, and then converting to the time domain through inverse Fourier transform.
在所述电子设备为服务所述终端设备的网络设备的情况下,所述第三模型由以下之一携带:下行控制信令、MAC CE消息、RRC消息、广播消息、下行数据传输、针对人工智能类业务传输需求的下行数据传输。When the electronic device is a network device serving the terminal device, the third model is carried by one of the following: downlink control signaling, MAC CE message, RRC message, broadcast message, downlink data transmission, manual Downlink data transmission required for intelligent business transmission.
在所述电子设备为除了服务所述终端设备的网络设备之外的其他设备的情况下,该第三模型(或所述第三模型的模型参数)可以是通过有线连接方式传输的、或其他无线连接方式传输的。比如,电子设备通过与终端设备之间的有线连接将所述第三模型(或所述第三模型的模型参数)传输给所述终端设备。或者,电子设备通过与终端设备之间的其他无线连接将所述第三模型(或所述第三模型的模型参数)传输给所述终端设备;其中,所述其他无线连接方式可以是蓝牙或WIFI等等,这里不进行穷举。In the case that the electronic device is other than the network device serving the terminal device, the third model (or the model parameters of the third model) may be transmitted through a wired connection, or other transmitted over a wireless connection. For example, the electronic device transmits the third model (or the model parameters of the third model) to the terminal device through a wired connection with the terminal device. Alternatively, the electronic device transmits the third model (or the model parameters of the third model) to the terminal device through other wireless connections with the terminal device; wherein, the other wireless connection methods may be Bluetooth or WIFI, etc., are not exhaustive here.
上述第一模型、第二模型以及第三模型可以为分别发送也可以为同时发送;或者上述第一模型、第二模型、第三模型也可以是三个模型均分别发送;又或者,还可以是其中任意两个组合同时发送,剩下一个分别发送等等。The above-mentioned first model, second model, and third model can be sent separately or simultaneously; or the above-mentioned first model, second model, and third model can also be sent separately; or, it is also possible It is any two combinations of which are sent at the same time, and the remaining one is sent separately, and so on.
需要说明的是,上述第一模型为所述终端设备在接收到第一信息时需要使用的模型。而本方式中所述终端设备除了可以接收第一模型之外,还可以接收第二模型和/或第三模型,其原因如下:It should be noted that the foregoing first model is a model that the terminal device needs to use when receiving the first information. However, in this method, besides the first model, the terminal device can also receive the second model and/or the third model, and the reasons are as follows:
以第一模型与第二模型为联合训练得到的为例来说,所述终端设备若要对第一模型以及第二模型进行整体评估,则需要得到所述第一模型以及所述第二模型,进而所述终端设备在完成所述第一模型以及所述第二模型的整体评估之后,可以决定是否使用本次接收到的所述第一模型以及所述第二模型。若所述终端设备对所述第一模型以及所述第二模型的整体评估结果较差(比如压缩率较低或者恢复信道信息的准确率较低等等),可以不使用上述第一模型。若所述终端设备确定不使用上述第一模型,所述终端设备可以自身对第一模型以及第二模型重新进行联合训练以更新第一模型以及第二模型的模型参数,或者,所述终端设备自己训练得到新的第一模型以及新的第二模型。另外,上述终端设备若重新进行联合训练或者更新后得到新的第一模型以及新的第二模型之后,还可以将其新的第一模型以及新的第二模型同步至网络设备;相应的,所述网络设备接收到新的第一模型以及新的第二模型之后,还可以替换自身原来使用的第一模型以及第二模型,并且还可以将新的第一模型以及新的第二模型发送给其他终端设备。关于其后续其他相关处理,本实施例不做穷举。通过以上处理,可以保证整个系统内使用性能最优的第一模型及其对应的第二模型,如此可以进一步保证整个系统的通信性能。Taking the joint training of the first model and the second model as an example, if the terminal device wants to evaluate the first model and the second model as a whole, it needs to obtain the first model and the second model , and then the terminal device may decide whether to use the first model and the second model received this time after completing the overall evaluation of the first model and the second model. If the overall evaluation result of the terminal device on the first model and the second model is poor (for example, the compression rate is low or the accuracy rate of recovered channel information is low, etc.), the first model may not be used. If the terminal device determines not to use the above-mentioned first model, the terminal device may re-train the first model and the second model jointly to update the model parameters of the first model and the second model, or the terminal device Train yourself to get a new first model and a new second model. In addition, if the above-mentioned terminal device obtains the new first model and the new second model after performing joint training or updating again, it can also synchronize the new first model and the new second model to the network device; correspondingly, After the network device receives the new first model and the new second model, it can also replace the original first model and the second model used by itself, and can also send the new first model and the new second model to other terminal equipment. Regarding other subsequent related processes, this embodiment does not exhaustively enumerate them. Through the above processing, it can be ensured that the first model with the best performance and its corresponding second model are used in the whole system, so that the communication performance of the whole system can be further guaranteed.
当第一模型、第二模型以及第三模型为联合训练得到的整体的时候,同样所述终端设备可以在接收到第一模型、第二模型以及第三模型后对其进行整体评估及其后续处理,具体的处理方式与前述相同,不做赘述。When the first model, the second model, and the third model are the whole obtained through joint training, the terminal device can also perform an overall evaluation on the first model, the second model, and the third model after receiving it and subsequent Processing, the specific processing method is the same as the above, and will not be repeated.
第二种方式、终端设备自身训练得到上述第一模型。In a second manner, the terminal device trains itself to obtain the above-mentioned first model.
所述终端设备采用训练样本对第一预设模型和第二预设模型进行联合训练,得到训练后的所述第一模型和所述第二模型;The terminal device uses training samples to jointly train the first preset model and the second preset model to obtain the trained first model and the second model;
其中,所述第一模型为训练后的所述第一预设模型,所述第二模型为训练后的所述第二预设模型。Wherein, the first model is the first preset model after training, and the second model is the second preset model after training.
在所述第二种方式中,训练可以采用第一损失函数或第二损失函数。下面对采用上述两种损失函数进行训练分别进行说明:In the second manner, the training may use the first loss function or the second loss function. The following describes the training using the above two loss functions:
所述训练采用的损失函数为第一损失函数;所述第一损失函数为基于所述第二预设模型的输出信息与所述第一预设模型的压缩预设子模型的输入信息之间的差异程度构建的。The loss function used in the training is a first loss function; the first loss function is based on the difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model degree of difference is constructed.
所述第二预设模型的输出信息与所述第一预设模型的压缩预设子模型的输入信息之间的差异程度为基于距离确定的,或者为基于相似程度确定的。The degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model is determined based on a distance, or determined based on a degree of similarity.
基于距离确定所述第二预设模型的输出信息与所述第一预设模型的压缩预设子模型的输入信息之间的差异程度的具体计算方式可以采用均方误差(MSE,Mean Squared Error)或归一化均方误差(NMSE)等方式,本实施例不做穷举。The specific calculation method for determining the degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model based on the distance can use mean square error (MSE, Mean Squared Error ) or normalized mean square error (NMSE), etc., which are not exhaustive in this embodiment.
举例来说,所述第二预设模型的输出信息可以为矩阵,相应的,所述压缩预设子模型的输入信息也可以为矩阵,这里,将所述第二预设模型的输出的矩阵称为矩阵1,将所述压缩预设子模型的输入的矩阵称为矩阵2;基于距离确定所述第二预设模型的输出信息与所述第一预设模型的压缩预设子模型的输入信息之间的差异程度的方式为MSE方式,比如其计算可以包括:将矩阵1与矩阵2进行计算得到差值,将差值的平方作为所述差异程度。For example, the output information of the second preset model may be a matrix, and correspondingly, the input information of the compressed preset sub-model may also be a matrix, and here, the output matrix of the second preset model It is called matrix 1, and the matrix of the input of the compressed preset submodel is called matrix 2; the output information of the second preset model and the compressed preset submodel of the first preset model are determined based on the distance The way of the degree of difference between the input information is the MSE way, for example, its calculation may include: calculating the difference between matrix 1 and matrix 2, and taking the square of the difference as the difference degree.
基于相似程度确定所述第二预设模型的输出信息与所述第一预设模型的压缩预设子模型的输入信息之间的差异程度的具体计算方式可以采用余弦相似度或余弦相似度平方等方式,本实施例不做穷举。The specific calculation method for determining the degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model based on the degree of similarity may use cosine similarity or cosine similarity squared etc., which are not exhaustive in this embodiment.
举例来说,所述第二预设模型的输出信息可以为R组特征向量序列信息,相应的,所述压缩预设子模型的输入信息也可以为R组特征向量序列信息,这里,将所述第二预设模型的输出的R组特征向量序列信息称为特征向量序列1,将所述压缩预设子模型的输入的R组特征向量序列信息称为特征向量序列2。基于相似程度确定所述第二预设模型的输出信息与所述第一预设模型的压缩预设子模型的输入信息之间的差异程度的方式可以是余弦相似度,比如其计算可以包括:特征向量序列1以及特征向量序列2的余弦夹角来确定相似程度,将该相似程度作为所述差异程度。For example, the output information of the second preset model may be R sets of feature vector sequence information, and correspondingly, the input information of the compressed preset sub-model may also be R sets of feature vector sequence information. Here, the The R sets of feature vector sequence information output by the second preset model are called feature vector sequence 1, and the R sets of feature vector sequence information input by the compressed preset sub-model are called feature vector sequence 2. The method of determining the degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model based on the degree of similarity may be cosine similarity, for example, its calculation may include: The cosine angle between the eigenvector sequence 1 and the eigenvector sequence 2 is used to determine the degree of similarity, and the degree of similarity is used as the degree of difference.
采用上述第一损失函数进行训练的处理中,由于第一预设模型中包含的子模型的不同以及是否包含用于模拟无线信道环境的第三预设模型进行联合训练会使得联合训练的方式有所不同,因此下面分四种情况分别进行说明:In the process of using the above-mentioned first loss function for training, due to the difference in the sub-models contained in the first preset model and whether the third preset model for simulating the wireless channel environment is included for joint training, the way of joint training will be different. Therefore, the following four situations are described separately:
情况一,所述第一预设模型中包括估计预设子模型和压缩预设子模型。 Case 1, the first preset model includes an estimation preset sub-model and a compression preset sub-model.
参见图8a,其中示意出第一预设模型800,第二预设模型810,以及所述第一预设模型800中包含的估计预设子模型801和压缩预设子模型802。上述第一预设模型800,第二预设模型810,以及所述第一预设模型800中包含的估计预设子模型801和压缩预设子模型802之间的输入输出关系可以为:估计预设子模型801的输入信息为第一训练样本920;所述估计预设子模型801的输出信息作为所述压缩预设子模型802的输入信息;所述压缩预设子模型802的输出信息作为所述第二预设模型810的输入信息。Referring to FIG. 8 a , it illustrates a first preset model 800 , a second preset model 810 , and an estimated preset sub-model 801 and a compressed preset sub-model 802 included in the first preset model 800 . The above-mentioned first preset model 800, the second preset model 810, and the input-output relationship between the estimated preset sub-model 801 and the compressed preset sub-model 802 contained in the first preset model 800 can be: estimated The input information of the preset submodel 801 is the first training sample 920; the output information of the estimated preset submodel 801 is used as the input information of the compressed preset submodel 802; the output information of the compressed preset submodel 802 As the input information of the second preset model 810 .
所述终端设备采用训练样本对第一预设模型和第二预设模型进行联合训练,包括:The terminal device uses training samples to jointly train the first preset model and the second preset model, including:
所述终端设备将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The terminal device inputs the first training sample into the estimated preset sub-model of the first preset model, and obtains initial information output by the estimated preset sub-model;
将所述初始信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的信息;inputting the initial information into the compressed preset sub-model of the first preset model to obtain compressed information output by the compressed preset sub-model;
将所述压缩后的信息输入所述第二预设模型,得到所述第二预设模型输出的恢复信息;inputting the compressed information into the second preset model to obtain restoration information output by the second preset model;
基于所述恢复信息以及所述初始信息之间的差异程度,确定所述第一损失函数;determining the first loss function based on the degree of difference between the restoration information and the initial information;
根据所述第一损失函数进行反向传导更新所述第一预设模型和所述第二预设模型。performing reverse conduction according to the first loss function to update the first preset model and the second preset model.
其中,所述第一训练样本可以为参考信号样本。所述参考信号样本可以为历史采集得到的原始参考信号、或者处理过的参考信号。再具体的,所述参考信号样本可以为下行参考信号样本。应理解,本实施例并不限定所述第一训练样本一定为所述下行参考信号样本,还可以采用上行参考信号样本或其他参考信号样本,只是本实施例不做穷举。Wherein, the first training samples may be reference signal samples. The reference signal samples may be original reference signals or processed reference signals obtained through historical acquisition. More specifically, the reference signal samples may be downlink reference signal samples. It should be understood that this embodiment does not limit that the first training samples must be the downlink reference signal samples, and uplink reference signal samples or other reference signal samples may also be used, which are not exhaustive in this embodiment.
需要指出的是,除了所述第一训练样本之外,输入所述第一预设模型的估计预设子模型的信息还可以无线信道或者场景相关的其他信息,比如可以包括以下至少之一:信道的信噪比、信干扰噪比、信道类型、带宽信息、时延信息等。关于在联合训练的过程中,输入以上信息的一种还是多种可以根据实际情况或实际场景来确定,这里不对其进行限定。It should be pointed out that, in addition to the first training samples, the information input into the estimated preset sub-model of the first preset model may also be wireless channel or other scene-related information, for example, may include at least one of the following: Channel signal-to-noise ratio, signal-to-interference-noise ratio, channel type, bandwidth information, delay information, etc. Whether one or more of the above information is input during the joint training process may be determined according to actual conditions or actual scenarios, and is not limited here.
所述第一预设模型的估计预设子模型的具体功能可以为:基于所述第一训练样本进行信道估计得到初始信息。其中,信道估计可以采用最小均方误差(MMSE)等算法。A specific function of the estimation preset sub-model of the first preset model may be: perform channel estimation based on the first training samples to obtain initial information. Wherein, the channel estimation may adopt algorithms such as minimum mean square error (MMSE).
上述初始信息可以为矩阵,该矩阵的维度这里不做限定,可以为二维或更多维度的矩阵。所述矩阵中的每一个位置上的数值用于表示对应多个维度的相应粒度下所对应的信道质量。其中,所述信道质量可以采用信号强度值来表征;所述信号强度值的单位可以是dBm,或所述信号强度值没有单位而是归一化之后所得到的数值。The aforementioned initial information may be a matrix, and the dimension of the matrix is not limited here, and may be a matrix of two or more dimensions. The value at each position in the matrix is used to represent the corresponding channel quality at the corresponding granularity corresponding to multiple dimensions. Wherein, the channel quality may be characterized by a signal strength value; the unit of the signal strength value may be dBm, or the signal strength value has no unit but a value obtained after normalization.
所述第一预设模型的压缩预设子模型对输入信息的数据量进行压缩。所述压缩子模型的输出信息与输入信息之间的压缩率可以为训练时确定的,比如,压缩率可以为千分之五、千分之二、百分之十等等,这里不进行穷举。所述压缩预设子模型得到的压缩后的信息包含的数据量小于其输入的初始信息的数据量。上述压缩后的信息与初始信息的形式为相同的,比如所述初始信息为矩阵,相应的所述压缩后的信息也为矩阵,所述初始信息与所述压缩后的信息的矩阵维度是相同的但数据量是不同的。The compressed preset sub-model of the first preset model compresses the data volume of the input information. The compression rate between the output information of the compressed sub-model and the input information can be determined during training, for example, the compression rate can be 5/1000, 2/1000, 10%, etc. lift. The compressed information obtained by compressing the preset sub-model contains less data than the input initial information. The form of the above-mentioned compressed information is the same as that of the initial information, for example, the initial information is a matrix, and the corresponding compressed information is also a matrix, and the matrix dimensions of the initial information and the compressed information are the same but the amount of data is different.
所述第二预设模型的功能可以是对其输入信息进行解压缩。本情况中,所述第二预设模型的输入信息为所述压缩后的信息,所述第二预设模型的输出为所述恢复信息。在理想状态下,第二预设模型的解压缩率应该使得其得到的恢复信息与初始信息包含相同的数据内容。The function of the second preset model may be to decompress its input information. In this case, the input information of the second preset model is the compressed information, and the output of the second preset model is the restored information. Ideally, the decompression rate of the second preset model should make the obtained restored information contain the same data content as the original information.
所述基于第一损失函数进行反向传导更新所述第一预设模型和所述第二预设模型具体可以指的是:基于第一损失函数所确定的差异程度来进行反向传导更新所述估计预设子模型的模型参数、所述压缩预设子模型的模型参数和所述第二预设模型的模型参数。The performing reverse conduction update of the first preset model and the second preset model based on the first loss function may specifically refer to performing reverse conduction update based on the degree of difference determined by the first loss function. The model parameters of the estimated preset sub-model, the model parameters of the compressed preset sub-model, and the model parameters of the second preset model.
针对上述训练还需要指出,关于上述训练收敛的方式可以包括以下至少之一:判断迭代训练的次数是否达到预设次数,判断差异程度是否小于预设门限值。其中,所述预设次数、所述预设门限值可以根据实际情况设置,不对其进行穷 举。也就是说,基于上述方式确定训练完成时,可以将训练完成后的第一预设模型作为第一模型,将训练完成后的第二预设模型作为第二模型。Regarding the above training, it should also be pointed out that the manner of the above training convergence may include at least one of the following: judging whether the number of iterative training reaches a preset number, and judging whether the degree of difference is smaller than a preset threshold. Wherein, the preset number of times and the preset threshold value can be set according to actual conditions, and are not exhaustive. That is to say, when it is determined that the training is completed based on the above manner, the first preset model after the training can be used as the first model, and the second preset model after the training can be used as the second model.
情况二,所述第一预设模型中包括估计预设子模型、预设信道生成子模型和压缩预设子模型。Case 2, the first preset model includes an estimation preset submodel, a preset channel generation submodel, and a compression preset submodel.
参见图8b,其中示意出第一预设模型800,第二预设模型810,以及所述第一预设模型800中包含的估计预设子模型801、压缩预设子模型802和信道生成预设子模型803。上述第一预设模型800,第二预设模型810,以及所述第一预设模型800中包含的估计预设子模型801、压缩预设子模型802和信道生成预设子模型803之间的输入输出关系可以为:估计预设子模型801的输入信息为第一训练样本920;所述估计预设子模型801的输出信息作为所述信道生成预设子模型803的输入信息;所述信道生成预设子模型803的输出信息作为压缩预设子模型802的输入信息;所述压缩预设子模型802的输出信息作为所述第二预设模型810的输入信息。Referring to Fig. 8b, it shows a first preset model 800, a second preset model 810, and the estimation preset sub-model 801, compression preset sub-model 802 and channel generation preset included in the first preset model 800. Set sub-model 803 . Between the above-mentioned first preset model 800, second preset model 810, and the estimation preset submodel 801, compression preset submodel 802 and channel generation preset submodel 803 contained in the first preset model 800 The input-output relationship of can be as follows: the input information of the estimated preset sub-model 801 is the first training sample 920; the output information of the estimated preset sub-model 801 is used as the input information of the channel generation preset sub-model 803; The output information of the channel generation preset submodel 803 is used as the input information of the compression preset submodel 802 ; the output information of the compression preset submodel 802 is used as the input information of the second preset model 810 .
所述终端设备采用训练样本对第一预设模型和第二预设模型进行联合训练,可以包括:The terminal device uses training samples to jointly train the first preset model and the second preset model, which may include:
所述终端设备将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The terminal device inputs the first training sample into the estimated preset sub-model of the first preset model, and obtains initial information output by the estimated preset sub-model;
将所述初始信息输入所述第一预设模型的信道生成预设子模型,得到所述信道生成预设子模型输出的初始信息的特征向量信息;inputting the initial information into the channel generation preset sub-model of the first preset model, and obtaining eigenvector information of the initial information output by the channel generation preset sub-model;
将所述初始信息的特征向量信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的特征向量信息;Inputting the eigenvector information of the initial information into the compressed preset sub-model of the first preset model to obtain the compressed eigenvector information output by the compressed preset sub-model;
将所述压缩后的特征向量信息输入所述第二预设模型,得到所述第二预设模型输出的恢复的特征向量信息;inputting the compressed feature vector information into the second preset model to obtain restored feature vector information output by the second preset model;
基于所述恢复的特征向量信息以及所述初始信息的特征向量信息之间的差异程度,确定第一损失函数;determining a first loss function based on the degree of difference between the restored eigenvector information and the eigenvector information of the initial information;
根据所述第一损失函数进行反向传导更新所述第一预设模型和所述第二预设模型。performing reverse conduction according to the first loss function to update the first preset model and the second preset model.
其中,关于第一训练样本的具体说明与前述情况一相同,因此不做重复说明。需要指出的是,除了所述第一训练样本之外,输入所述第一预设模型的估计预设子模型的信息还可以无线信道或者场景相关的其他信息,比如可以包括以下至少之一:信道的信噪比、信干扰噪比、信道类型、带宽信息、时延信息等。关于在联合训练的过程中,输入以上信息的一种还是多种可以根据实际情况或实际场景来确定,这里不对其进行限定。Wherein, the specific description about the first training sample is the same as the above-mentioned case 1, so repeated description will not be given. It should be pointed out that, in addition to the first training samples, the information input into the estimated preset sub-model of the first preset model may also be wireless channel or other scene-related information, for example, may include at least one of the following: Channel signal-to-noise ratio, signal-to-interference-noise ratio, channel type, bandwidth information, delay information, etc. Whether one or more of the above information is input during the joint training process may be determined according to actual conditions or actual scenarios, and is not limited here.
所述第一预设模型的估计预设子模型的具体功能可以为:基于第一训练样本进行信道估计得到所述初始信息。其中,信道估计可以采用最小均方误差(MMSE)等算法。上述估计预设子模型输出的初始信息可以为矩阵,该矩阵的维度这里不做限定,可以为二维或更多维度的矩阵。所述矩阵中的每一个位置上的数值用于表示对应多个维度的相应粒度下所对应的信道质量。其中,所述信道质量可以采用信号强度值来表征;所述信号强度值的单位可以是dBm,或所述信号强度值没有单位而是归一化之后所得到的数值。A specific function of the estimation preset sub-model of the first preset model may be: perform channel estimation based on the first training samples to obtain the initial information. Wherein, the channel estimation may adopt algorithms such as minimum mean square error (MMSE). The initial information output by the estimated preset sub-model above may be a matrix, and the dimension of the matrix is not limited here, and may be a two-dimensional or more dimensional matrix. The value at each position in the matrix is used to represent the corresponding channel quality at the corresponding granularity corresponding to multiple dimensions. Wherein, the channel quality may be characterized by a signal strength value; the unit of the signal strength value may be dBm, or the signal strength value has no unit but a value obtained after normalization.
所述信道生成预设子模型的功能可以是对所述初始信息进行特征分解,得到所述初始信息的特征向量信息。其中,所述初始信息的特征向量信息可以包括R组特征向量序列。对所述初始信息进行特征分解的方式可以采用奇异值分解(SVD,Singular Value Decomposition)的方式。A function of the channel generation preset submodel may be to perform eigendecomposition on the initial information to obtain eigenvector information of the initial information. Wherein, the eigenvector information of the initial information may include R groups of eigenvector sequences. The method of performing eigendecomposition on the initial information may adopt a method of Singular Value Decomposition (SVD, Singular Value Decomposition).
所述第一预设模型的压缩预设子模型对输入信息的数据量进行压缩。所述压缩子模型的输出信息与输入信息之间的压缩率可以为训练时确定的,比如,压缩率可以为千分之五、千分之二、百分之十等等,这里不进行穷举。在上述处理中,所述压缩预设子模型得到的压缩后的特征向量信息包含的数据量小于其输入的初始信息的特征向量信息的数据量。上述压缩后的特征向量信息与初始信息的特征向量信息的形式为相同的,比如初始信息的特征向量信息为R组特征向量序列,所述压缩后的特征向量信息也为R组特征向量序列但两者包含的数据量是不同的。The compressed preset sub-model of the first preset model compresses the data volume of the input information. The compression rate between the output information of the compressed sub-model and the input information can be determined during training, for example, the compression rate can be 5/1000, 2/1000, 10%, etc. lift. In the above processing, the compressed eigenvector information obtained by compressing the preset sub-model contains less data than the eigenvector information of the input initial information. The above compressed feature vector information is in the same form as the feature vector information of the initial information. For example, the feature vector information of the initial information is a sequence of R groups of feature vectors, and the compressed feature vector information is also a sequence of feature vectors of groups R but The amount of data contained in the two is different.
所述第二预设模型的功能可以是对其输入信息进行解压缩。在上述处理中,所述第二预设模型的输入信息为压缩后的特征向量信息,第二预设模型的输出为恢复的特征向量信息。第二预设模型的解压缩率应该使得其得到的恢复的特征向量信息与初始信息的特征向量信息包含相同或基本相同的数据。当然,上述第二预设模型的功能还可以包含有对输入信息进行恢复得到恢复的初始信息,此时第二预设模型的解压缩率应该使得其得到的恢复的初始信息与初始信息包含相同或基本相同的数据。The function of the second preset model may be to decompress its input information. In the above processing, the input information of the second preset model is compressed feature vector information, and the output of the second preset model is restored feature vector information. The decompression rate of the second preset model should be such that the obtained restored feature vector information contains the same or substantially the same data as the feature vector information of the initial information. Of course, the function of the above-mentioned second preset model can also include recovering the input information to obtain the restored initial information. At this time, the decompression rate of the second preset model should make the recovered initial information contain the same content as the initial information. or basically the same data.
所述基于第一损失函数所确定的差异程度来进行反向传导更新所述第一预设模型和所述第二预设模型具体可以指的是:基于第一损失函数所确定的差异程度来进行反向传导更新所述估计预设子模型的模型参数、所述信道生成预设子模型的模型参数、所述压缩预设子模型的模型参数和所述第二预设模型的模型参数。The performing reverse conduction to update the first preset model and the second preset model based on the degree of difference determined by the first loss function may specifically refer to: performing a reverse conduction based on the degree of difference determined by the first loss function performing reverse conduction to update model parameters of the estimated preset submodel, model parameters of the channel generation preset submodel, model parameters of the compression preset submodel, and model parameters of the second preset model.
关于上述训练收敛的确定方式与前述情况一相同,不做重复说明。The method for determining the convergence of the above training is the same as that of the above-mentioned case 1, and repeated explanations are not repeated.
情况三,所述终端设备采用训练样本对第一预设模型和第二预设模型进行联合训练,得到训练后的所述第一模型和所述第二模型,包括:In case three, the terminal device uses training samples to jointly train the first preset model and the second preset model to obtain the trained first model and the second model, including:
所述终端设备采用训练样本对第一预设模型、第二预设模型和第三预设模型进行联合训练,得到训练后的所述第一模型、所述第二模型和第三模型;其中,所述第三模型为训练后的第三预设模型。The terminal device uses training samples to jointly train the first preset model, the second preset model and the third preset model to obtain the trained first model, the second model and the third model; wherein , the third model is a trained third preset model.
参见图8c,其中示意出第一预设模型800,第二预设模型810,第三预设模型830,以及所述第一预设模型800中包含的估计预设子模型801、压缩预设子模型802。上述第一预设模型800,第二预设模型810,以及所述第一预设模型800中包含的估计预设子模型801和压缩预设子模型802之间的输入输出关系可以为:估计预设子模型801的输入信息为第一训练样本920;所述估计预设子模型801的输出信息作为压缩预设子模型802的输入信息;所述压缩预设子模型802的输出信息作为所述第三预设模型830的输入信息;所述第三预设模型的输出信息作为所述第二预设模型810的输出信息。Referring to FIG. 8c, it shows a first preset model 800, a second preset model 810, a third preset model 830, and the estimated preset sub-model 801 contained in the first preset model 800, the compression preset Submodel 802. The above-mentioned first preset model 800, the second preset model 810, and the input-output relationship between the estimated preset sub-model 801 and the compressed preset sub-model 802 contained in the first preset model 800 can be: estimated The input information of the preset submodel 801 is the first training sample 920; the output information of the estimated preset submodel 801 is used as the input information of the compressed preset submodel 802; the output information of the compressed preset submodel 802 is used as the The input information of the third preset model 830; the output information of the third preset model is used as the output information of the second preset model 810.
所述终端设备采用训练样本对第一预设模型、第二预设模型和第三预设模型进行联合训练,包括:The terminal device uses training samples to jointly train the first preset model, the second preset model and the third preset model, including:
所述终端设备将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The terminal device inputs the first training sample into the estimated preset sub-model of the first preset model, and obtains initial information output by the estimated preset sub-model;
将所述初始信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的信息;inputting the initial information into the compressed preset sub-model of the first preset model to obtain compressed information output by the compressed preset sub-model;
将所述压缩后的信息输入第三预设模型,得到所述第三预设模型输出的变换后的信息;inputting the compressed information into a third preset model to obtain transformed information output by the third preset model;
将所述变换后的信息输入所述第二预设模型,得到所述第二预设模型输出的恢复信息;inputting the transformed information into the second preset model to obtain restoration information output by the second preset model;
基于所述恢复信息以及所述初始信息之间的差异程度,确定第一损失函数;determining a first loss function based on the degree of difference between the restoration information and the initial information;
根据所述第一损失函数进行反向传导更新所述第一预设模型、第二预设模型和第三预设模型。performing reverse conduction according to the first loss function to update the first preset model, the second preset model and the third preset model.
关于第一训练样本的具体说明与前述情况一或情况二相同,因此不做重复说明。还需要指出的是,除了第一训练样本之外,输入所述第一预设模型的估计预设子模型的信息还可以无线信道或者场景相关的其他信息,比如可以包括以下至少之一:信道的信噪比、信干扰噪比、信道类型、带宽信息、时延信息等。关于在训练的过程中,输入以上信息的一种还是多种可以根据实际情况或实际场景相关,这里不对其进行限定。The specific description about the first training sample is the same as the above-mentioned case 1 or case 2, so no repeated description is given. It should also be pointed out that, in addition to the first training samples, the information input into the estimated preset sub-model of the first preset model can also be wireless channel or other information related to the scene, for example, it can include at least one of the following: channel Signal-to-noise ratio, signal-to-interference-noise ratio, channel type, bandwidth information, delay information, etc. In the process of training, whether one or more of the above information is input may be relevant according to the actual situation or the actual scene, and it is not limited here.
所述第一预设模型的估计预设子模型以及所述第一预设模型的压缩预设子模型的具体功能与前述情况一相同,因此不做重复说明。The specific functions of the estimation preset sub-model of the first preset model and the compression preset sub-model of the first preset model are the same as those in the first case, so the description will not be repeated.
在情况三中相对于情况一增加了第三预设模型,关于所述第三预设模型的功能为模拟信道环境,具体处理可以为对输入信息进行数据变换得到数据变换后的信息作为输出信息。其中,所述数据变换的具体处理方式可以包括卷积处理或者等效于卷积的数据处理;其中,所述等效于卷积的数据处理可以为多次傅里叶变换处理,比如,可以为将所述第三预设模型的输入信息通过傅里叶变换转换到频域后相乘再通过傅里叶反变换转换到时域,以此来等效时域卷积的处理。In case three, a third preset model is added relative to case one, and the function of the third preset model is to simulate the channel environment, and the specific processing can be to perform data transformation on input information to obtain data transformed information as output information . Wherein, the specific processing method of the data transformation may include convolution processing or data processing equivalent to convolution; wherein, the data processing equivalent to convolution may be multiple Fourier transform processing, for example, may In order to transform the input information of the third preset model into the frequency domain through Fourier transform, multiply and then transform into the time domain through inverse Fourier transform, the process of convolution in the time domain is equivalent.
所述第二预设模型的功能可以是对其输入信息进行解压缩。在第三种情况的处理中,所述第二预设模型的输入信息为变换后的信息,第二预设模型的输出为恢复信息。第二预设模型的解压缩率应该使得其得到的恢复信息与初始信息包含相同的数据。The function of the second preset model may be to decompress its input information. In the processing of the third case, the input information of the second preset model is transformed information, and the output of the second preset model is restored information. The decompression rate of the second preset model should make the obtained restored information contain the same data as the original information.
所述基于第一损失函数进行反向传导更新更新所述第一预设模型、所述第二预设模型和所述第三预设模型具体可以指的是:基于第一损失函数进行反向传导更新所述估计预设子模型的模型参数、所述压缩预设子模型的模型参数、所述第二预设模型的模型参数和所述第三预设模型的模型参数。The updating of the first preset model, the second preset model, and the third preset model based on the first loss function may specifically refer to: performing reverse conduction based on the first loss function Conductively updating model parameters of the estimated preset sub-model, model parameters of the compressed preset sub-model, model parameters of the second preset model, and model parameters of the third preset model.
关于上述训练收敛的方式与前述情况一或情况二相同,不做赘述。The manner of the above-mentioned training convergence is the same as that of the foregoing case 1 or case 2, and will not be repeated here.
情况四,与上述情况三不同在于所述第一预设模型中包括估计预设子模型、预设信道生成子模型和压缩预设子模型。Case 4 is different from the above case 3 in that the first preset model includes an estimation preset sub-model, a preset channel generation sub-model, and a compression preset sub-model.
参见图8d,其中示意出第一预设模型800,第二预设模型810,第三预设模型830,以及所述第一预设模型800中包含的估计预设子模型801、压缩预设子模型802和信道生成预设子模型803。上述第一预设模型800,第二预设模型810,第三预设模型830,以及所述第一预设模型800中包含的估计预设子模型801、压缩预设子模型802和信道生成预设子模型803之间的输入输出关系可以为:估计预设子模型801的输入信息为第一训练样本920;所述估计预设子模型801的输出信息作为所述信道生成预设子模型803的输入信息;所述信道生成预设子模型803的输出信息作为压缩预设子模型802的输入信息;所述压缩预设子模型802的输出信息作为所述第三预设模型830的输入信息;所述第三预设模型830的输出信息作为所述第二预设模型810的输入信息。Referring to Fig. 8d, it shows a first preset model 800, a second preset model 810, a third preset model 830, and the estimated preset sub-model 801 contained in the first preset model 800, the compression preset Submodel 802 and channel generation preset submodel 803 . The above-mentioned first preset model 800, second preset model 810, third preset model 830, and the estimation preset sub-model 801, compression preset sub-model 802 and channel generation included in the first preset model 800 The input-output relationship between the preset sub-models 803 may be: the input information of the estimated preset sub-model 801 is the first training sample 920; the output information of the estimated preset sub-model 801 is used as the channel generation preset sub-model The input information of 803; the output information of the channel generation preset submodel 803 is used as the input information of the compression preset submodel 802; the output information of the compression preset submodel 802 is used as the input of the third preset model 830 information; the output information of the third preset model 830 is used as the input information of the second preset model 810 .
所述终端设备采用训练样本对第一预设模型、第二预设模型和第三预设模型进行联合训练,包括:The terminal device uses training samples to jointly train the first preset model, the second preset model and the third preset model, including:
所述终端设备将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The terminal device inputs the first training sample into the estimated preset sub-model of the first preset model, and obtains initial information output by the estimated preset sub-model;
将所述初始信息输入所述第一预设模型的信道生成预设子模型,得到所述信道生成预设子模型输出的初始信息的特征向量信息;inputting the initial information into the channel generation preset sub-model of the first preset model, and obtaining eigenvector information of the initial information output by the channel generation preset sub-model;
将所述初始信息的特征向量信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的特征向量信息;Inputting the eigenvector information of the initial information into the compressed preset sub-model of the first preset model to obtain the compressed eigenvector information output by the compressed preset sub-model;
将所述压缩后的特征向量信息输入第三预设模型,得到所述第三预设模型输出的变换后的特征向量信息;Inputting the compressed feature vector information into a third preset model to obtain transformed feature vector information output by the third preset model;
将所述变换后的特征向量信息输入所述第二预设模型,得到所述第二预设模型输出的恢复的特征向量信息;inputting the transformed eigenvector information into the second preset model to obtain restored eigenvector information output by the second preset model;
基于所述恢复的特征向量信息以及所述初始信息的特征向量信息之间的差异程度,确定第一损失函数;determining a first loss function based on the degree of difference between the restored eigenvector information and the eigenvector information of the initial information;
根据所述第一损失函数进行反向传导更新所述第一预设模型、第二预设模型和第三预设模型。performing reverse conduction according to the first loss function to update the first preset model, the second preset model and the third preset model.
关于第一训练样本的具体说明与前述情况一、情况二、情况三中任意之一相同,因此不做重复说明。The specific description about the first training sample is the same as any one of the foregoing case 1, case 2, and case 3, so no repeated description is given.
所述第一预设模型的估计预设子模型的具体功能与前述情况一、情况二、情况三中任意之一相同。The specific function of the estimated preset sub-model of the first preset model is the same as any one of the foregoing case 1, case 2, and case 3.
所述信道生成预设子模型的功能可以是对所述初始信息进行特征分解,得到所述初始信息的特征向量信息。其中,所述初始信息的特征向量信息可以包括R组特征向量序列。对所述初始信息进行特征分解的方式可以采用奇异值分解(SVD,Singular Value Decomposition)的方式。A function of the channel generation preset submodel may be to perform eigendecomposition on the initial information to obtain eigenvector information of the initial information. Wherein, the eigenvector information of the initial information may include R groups of eigenvector sequences. The method of performing eigendecomposition on the initial information may adopt a method of Singular Value Decomposition (SVD, Singular Value Decomposition).
所述第一预设模型的压缩预设子模型对输入信息的数据量进行压缩。所述压缩子模型的输出信息与输入信息之间的压缩率可以为训练时确定的,比如,压缩率可以为千分之五、千分之二、百分之十等等,这里不进行穷举。The compressed preset sub-model of the first preset model compresses the data volume of the input information. The compression rate between the output information of the compressed sub-model and the input information can be determined during training, for example, the compression rate can be 5/1000, 2/1000, 10%, etc. lift.
所述第三预设模型的功能为模拟无线信道环境,具体处理可以为对输入信息进行数据变换得到数据变换后的信息作为输出信息。其中,所述数据变换的具体处理方式可以包括卷积处理或者等效于卷积的数据处理;其中,所述等效于卷积的数据处理可以为多次傅里叶变换处理,比如,可以为将所述第三预设模型的输入信息通过傅里叶变换转换到频域后相乘再通过傅里叶反变换转换到时域,以此来等效时域卷积的处理。The function of the third preset model is to simulate the wireless channel environment, and the specific processing may be to perform data transformation on input information to obtain information after data transformation as output information. Wherein, the specific processing method of the data transformation may include convolution processing or data processing equivalent to convolution; wherein, the data processing equivalent to convolution may be multiple Fourier transform processing, for example, may In order to transform the input information of the third preset model into the frequency domain through Fourier transform, multiply and then transform into the time domain through inverse Fourier transform, the process of convolution in the time domain is equivalent.
所述第二预设模型的功能可以是对其输入信息进行解压缩。所述第二预设模型的输入信息为变换后的特征向量信息,第二预设模型的输出为恢复的特征向量信息。第二预设模型的解压缩率应该使得其得到的恢复的特征向量信息与初始信息的特征向量信息包含接近或相同的数据。The function of the second preset model may be to decompress its input information. The input information of the second preset model is transformed feature vector information, and the output of the second preset model is restored feature vector information. The decompression rate of the second preset model should make the obtained restored feature vector information and the feature vector information of the initial information contain close to or the same data.
所述基于第一损失函数所确定的差异程度来进行反向传导更新更新所述第一预设模型、所述第二预设模型和所述第三预设模型具体可以指的是:基于第一损失函数所确定的差异程度来进行反向传导更新所述估计预设子模型的模型参数、所述信道生成预设子模型的模型参数、所述压缩预设子模型的模型参数、所述第二预设模型的模型参数和所述第三预设模型的模型参数。The performing reverse conduction update based on the degree of difference determined by the first loss function to update the first preset model, the second preset model, and the third preset model may specifically refer to: based on the first The degree of difference determined by a loss function is used to perform reverse conduction to update the model parameters of the estimated preset sub-model, the model parameters of the channel generation preset sub-model, the model parameters of the compressed preset sub-model, the The model parameters of the second preset model and the model parameters of the third preset model.
关于上述训练收敛的方式与前述实施例相同,不做重复说明。The manner of the above-mentioned training convergence is the same as that of the foregoing embodiment, and no repeated description is given.
以上针对联合训练采用第一损失函数的场景进行了说明,本实施例中还可以提供采用第二损失函数进行联合训练的场景,具体如下:The scenario where the first loss function is used for joint training is described above. In this embodiment, the scenario where the second loss function is used for joint training can also be provided, as follows:
所述训练采用的损失函数为第二损失函数;所述第二损失函数为基于所述第二预设模型的输出信息与所述第一预设模型的压缩预设子模型的输入信息之间的第一差异程度以及所述第一预设模型的估计预设子模型的输出信息与第二训练样本之间的第二差异程度构建的;其中,所述第二训练样本与输入所述估计预设子模型的第一训练样本相对应。The loss function used in the training is a second loss function; the second loss function is based on the difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model The first difference degree of the first preset model and the second difference degree between the output information of the estimated preset sub-model of the first preset model and the second training sample; wherein, the second training sample and the input of the estimated Corresponds to the first training sample of the preset sub-model.
所述第一差异程度为基于距离确定的,或者为基于相似程度确定的;和/或,所述第二差异程度为基于距离确定的,或者为基于相似程度确定的。The first degree of difference is determined based on a distance, or is determined based on a degree of similarity; and/or, the second degree of difference is determined based on a distance, or is determined based on a degree of similarity.
基于距离确定所述第二预设模型的输出信息与所述第一预设模型的压缩预设子模型的输入信息之间的第一差异程度的具体计算方式可以采用均方误差(MSE,Mean Squared Error)或归一化均方误差(NMSE)等方式,本实施例不做穷举。The specific calculation method for determining the first degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model based on the distance can use a mean square error (MSE, Mean Squared Error) or normalized mean square error (NMSE), etc., this embodiment is not exhaustive.
举例来说,所述第二预设模型的输出信息可以为矩阵,相应的,所述压缩预设子模型的输入信息也可以为矩阵,这里,将所述第二预设模型的输出的矩阵称为矩阵3,将所述压缩预设子模型的输入的矩阵称为矩阵4;基于距离确定所述第二预设模型的输出信息与所述第一预设模型的压缩预设子模型的输入信息之间的差异程度的方式MSE方式,比如:将矩阵3与矩阵4进行计算得到差值,将差值的平方作为所述差异程度。For example, the output information of the second preset model may be a matrix, and correspondingly, the input information of the compressed preset sub-model may also be a matrix, and here, the output matrix of the second preset model It is called matrix 3, and the matrix of the input of the compressed preset submodel is called matrix 4; the output information of the second preset model and the compressed preset submodel of the first preset model are determined based on the distance The way of the degree of difference between the input information is the MSE way, for example: calculate the difference between matrix 3 and matrix 4, and use the square of the difference as the degree of difference.
基于相似程度确定所述第二预设模型的输出信息与所述第一预设模型的压缩预设子模型的输入信息之间的第一差异程度的具体计算方式可以采用余弦相似度或余弦相似度平方等方式,本实施例不做穷举。The specific calculation method for determining the first degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model based on the degree of similarity may use cosine similarity or cosine similarity Degree square and other methods are not exhaustive in this embodiment.
举例来说,所述第二预设模型的输出信息可以为R组特征向量序列信息,相应的,所述压缩预设子模型的输入信息也可以为R组特征向量序列信息,这里,将所述第二预设模型的输出的R组特征向量序列信息称为特征向量序列3,将所述压缩预设子模型的输入的R组特征向量序列信息称为特征向量序列4。基于相似程度确定所述第二预设模型的输出信息与所述第一预设模型的压缩预设子模型的输入信息之间的差异程度的方式可以是余弦相似度,比如:特征向量序列3以及特征向量序列4的余弦夹角来确定相似程度,将该相似程度作为所述差异程度。For example, the output information of the second preset model may be R sets of feature vector sequence information, and correspondingly, the input information of the compressed preset sub-model may also be R sets of feature vector sequence information. Here, the The R group of feature vector sequence information output by the second preset model is called feature vector sequence 3, and the R group of feature vector sequence information input by the compressed preset sub-model is called feature vector sequence 4. The method of determining the degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model based on the degree of similarity may be cosine similarity, for example: feature vector sequence 3 and the cosine angle of the eigenvector sequence 4 to determine the degree of similarity, and use the degree of similarity as the degree of difference.
基于距离确定所述第一预设模型的估计预设子模型的输出信息与第二训练样本之间的第二差异程度的具体计算方式可以采用均方误差(MSE,Mean Squared Error)或归一化均方误差(NMSE)等方式,本实施例不做穷举。The specific calculation method for determining the second degree of difference between the output information of the estimated preset sub-model of the first preset model based on the distance and the second training sample can use mean square error (MSE, Mean Squared Error) or normalization The methods such as normalized mean square error (NMSE) are not exhaustive in this embodiment.
举例来说,所述估计预设子模型的输出信息可以为矩阵,相应的,所述第二训练样本也可以为矩阵,这里,将所述估计预设子模型的输出的矩阵称为矩阵5,将所述压缩预设子模型的输入的矩阵称为矩阵6;基于距离确定所述第一预设模型的估计预设子模型的输出信息与第二训练样本之间的第二差异程度的方式MSE方式,比如:将矩阵5与矩阵6进行计算得到差值,将差值的平方作为所述差异程度。For example, the output information of the estimated preset sub-model may be a matrix, and correspondingly, the second training sample may also be a matrix. Here, the output matrix of the estimated preset sub-model is called matrix 5 , the matrix of the input of the compressed preset sub-model is called matrix 6; the second degree of difference between the output information of the estimated preset sub-model of the first preset model and the second training sample is determined based on the distance In the MSE mode, for example: calculate the difference between matrix 5 and matrix 6, and use the square of the difference as the degree of difference.
基于相似程度所述第一预设模型的估计预设子模型的输出信息与第二训练样本之间的第二差异程度的具体计算方式的具体计算方式可以采用余弦相似度或余弦相似度平方等方式,本实施例不做穷举。Based on the specific calculation method of the second degree of difference between the output information of the estimated preset sub-model of the first preset model and the second training sample based on the degree of similarity, the specific calculation method can use cosine similarity or cosine similarity squared, etc. The methods are not exhaustive in this embodiment.
上述第一差异程度以及第二差异程度联合构建所述第二损失函数时,其联的方式可以是对第一差异程度以及第二差异程度等权重相加,比如两者各占50%;或者,其联合的方式可以是第一差异程度以及第二差异程度不等权重相加,比如可以对第一差异程度赋予更大权重,也就是对上述第二预设模型与压缩预设子模型之间的压缩恢复前后的差异情况赋予更大权重,或者可以对上述第二差异程度赋予更大权重,也就是对上述估计预设子模型的输出信息的准确度赋予更大权重;或者,其联合的方式可以是第一差异程度以及第二差异程度相乘的形式;或者其联合的方式可以是第一差异程度以及第二差异程度可以是通过交叉熵计算的形式,比如p1*log(第一差异程度)+p2*log(第二差异程度),其中p1和p2均可以根据实际情况设置,这里不对其进行限定。When the first degree of difference and the second degree of difference are combined to construct the second loss function, the method of connection may be to add the weights of the first degree of difference and the second degree of difference, for example, the two each account for 50%; or , the joint method can be the addition of unequal weights between the first difference degree and the second difference degree. The difference before and after the compression and recovery between the two can be assigned a greater weight, or the above-mentioned second degree of difference can be assigned a larger weight, that is, the accuracy of the output information of the above-mentioned estimated preset sub-model is assigned a larger weight; or, its combination The method can be in the form of multiplying the first degree of difference and the second degree of difference; or the joint method can be that the first degree of difference and the second degree of difference can be calculated by cross entropy, such as p1*log(first degree of difference)+p2*log (the second degree of difference), where both p1 and p2 can be set according to actual conditions, and are not limited here.
采用上述第二损失函数进行训练的处理中,由于第一预设模型中包含的子模型的不同以及是否包含用于模拟无线信道环境的第三预设模型进行联合训练的具体处理是不同的,因此分以下四种情况分别进行说明:In the process of using the above-mentioned second loss function for training, since the sub-models contained in the first preset model are different and whether the third preset model for simulating the wireless channel environment is included for joint training is different, Therefore, the following four situations are described separately:
情况五,终端设备采用训练样本对第一预设模型和第二预设模型进行联合训练得到训练后的第一模型和第二模型;其中,所述第一预设模型中包括估计预设子模型和压缩预设子模型。Case five, the terminal device uses training samples to jointly train the first preset model and the second preset model to obtain the trained first model and the second model; wherein, the first preset model includes estimated preset sub-models Model and compression preset submodels.
本情况中各个模型的组成以及各个模型之间的输入输出关系与前述情况一相同,具体可以参见图8a,这里不做重复说明。In this case, the composition of each model and the input-output relationship between each model are the same as the previous case 1. For details, please refer to FIG. 8 a , which will not be repeated here.
所述终端设备采用训练样本对第一预设模型和第二预设模型进行联合训练,包括:The terminal device uses training samples to jointly train the first preset model and the second preset model, including:
所述终端设备将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The terminal device inputs the first training sample into the estimated preset sub-model of the first preset model, and obtains initial information output by the estimated preset sub-model;
将所述初始信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的信息;inputting the initial information into the compressed preset sub-model of the first preset model to obtain compressed information output by the compressed preset sub-model;
将所述压缩后的信息输入所述第二预设模型,得到所述第二预设模型输出的恢复信息;inputting the compressed information into the second preset model to obtain restoration information output by the second preset model;
基于所述恢复信息与所述初始信息之间的第一差异程度,以及基于所述初始信息与第二训练样本之间的第二差异程度,确定第二损失函数;所述第二训练样本与所述第一训练样本相对应;Determine a second loss function based on a first degree of difference between the restored information and the initial information, and based on a second degree of difference between the initial information and a second training sample; the second training sample and The first training sample corresponds to;
根据所述第二损失函数进行反向传导更新所述第一预设模型和所述第二预设模型。performing reverse conduction according to the second loss function to update the first preset model and the second preset model.
其中,所述第一训练样本可以为参考信号样本。所述参考信号样本可以为历史采集得到的原始参考信号、或者处理过的参考信号。再具体的,所述参考信号样本可以为下行参考信号样本。应理解,本实施例并不限定所述第一训练样本一定为所述下行参考信号样本,还可以采用上行参考信号样本或其他参考信号样本,只是本实施例不做穷举。Wherein, the first training samples may be reference signal samples. The reference signal samples may be original reference signals or processed reference signals obtained through historical acquisition. More specifically, the reference signal samples may be downlink reference signal samples. It should be understood that this embodiment does not limit that the first training samples must be the downlink reference signal samples, and uplink reference signal samples or other reference signal samples may also be used, which are not exhaustive in this embodiment.
需要指出的是,除了所述第一训练样本之外,输入所述第一预设模型的估计预设子模型的信息还可以无线信道或者场景相关的其他信息,比如可以包括以下至少之一:信道的信噪比、信干扰噪比、信道类型、带宽信息、时延信息等。关于在联合训练的过程中,输入以上信息的一种还是多种可以根据实际情况或实际场景来确定,这里不对其进行限定。It should be pointed out that, in addition to the first training samples, the information input into the estimated preset sub-model of the first preset model may also be wireless channel or other scene-related information, for example, may include at least one of the following: Channel signal-to-noise ratio, signal-to-interference-noise ratio, channel type, bandwidth information, delay information, etc. Whether one or more of the above information is input during the joint training process may be determined according to actual conditions or actual scenarios, and is not limited here.
所述第一预设模型的估计预设子模型的具体功能可以为:基于第一训练样本进行信道估计得到初始信息。其中,信道估计可以采用最小均方误差(MMSE)等算法。The specific function of the estimation preset sub-model of the first preset model may be: perform channel estimation based on the first training samples to obtain initial information. Wherein, the channel estimation may adopt algorithms such as minimum mean square error (MMSE).
上述初始信息可以为矩阵,该矩阵的维度这里不做限定,可以为二维或更多维度的矩阵。所述矩阵中的每一个位置上的数值用于表示对应多个维度的相应粒度下所对应的信道质量。其中,所述信道质量可以采用信号强度值来表征;所述信号强度值的单位可以是分贝毫瓦(dBm,decibel relative to one milliwatt”),或所述信号强度值没有单位而是归一化之后所得到的数值。The aforementioned initial information may be a matrix, and the dimension of the matrix is not limited here, and may be a matrix of two or more dimensions. The value at each position in the matrix is used to represent the corresponding channel quality at the corresponding granularity corresponding to multiple dimensions. Wherein, the channel quality can be characterized by a signal strength value; the unit of the signal strength value can be decibel milliwatt (dBm, decibel relative to one milliwatt"), or the signal strength value has no unit but is normalized values obtained afterwards.
所述第一预设模型的压缩预设子模型对输入信息的数据量进行压缩。所述压缩子模型的输出信息与输入信息之间的压缩率可以为训练时确定的,比如,压缩率可以为千分之五、千分之二、百分之十等等,这里不进行穷举。所述压缩预设子模型得到的压缩后的信息包含的数据量小于其输入的初始信息的数据量。上述压缩后的信息与初始信息的形式为相同的,比如所述初始信息为矩阵,相应的所述压缩后的信息也为矩阵,所述初始信息与所述压缩后的信息的矩阵维度是相同的但数据量是不同的。The compressed preset sub-model of the first preset model compresses the data volume of the input information. The compression rate between the output information of the compressed sub-model and the input information can be determined during training, for example, the compression rate can be 5/1000, 2/1000, 10%, etc. lift. The compressed information obtained by compressing the preset sub-model contains less data than the input initial information. The form of the above-mentioned compressed information is the same as that of the initial information, for example, the initial information is a matrix, and the corresponding compressed information is also a matrix, and the matrix dimensions of the initial information and the compressed information are the same but the amount of data is different.
所述第二预设模型的功能可以是对其输入信息进行解压缩。第二预设模型的解压缩率应该使得其得到的恢复信息与初始信息包含相同的数据。The function of the second preset model may be to decompress its input information. The decompression rate of the second preset model should make the obtained restored information contain the same data as the original information.
所述基于第二损失函数进行反向传导更新所述第一预设模型和所述第二预设模型具体可以指的是:基于第二损失函数进行反向传导更新所述估计预设子模型的模型参数、所述压缩预设子模型的模型参数和所述第二预设模型的模型参数。The performing reverse conduction based on the second loss function to update the first preset model and the second preset model may specifically refer to: performing reverse conduction based on the second loss function to update the estimated preset sub-model , the model parameters of the compressed preset sub-model and the model parameters of the second preset model.
针对上述训练还需要指出,关于上述训练收敛的方式可以包括以下至少之一:判断迭代训练的次数是否达到预设次数,判断第一差异程度是否小于第一预设门限值,判断第二差异程度是否小于第二预设门限值。其中,所述预设次数、所述第一预设门限值以及第二预设门限值可以根据实际情况设置,不对其进行穷举。也就是说,基于上述方式确定训练完成时,可以将训练完成后的第一预设模型作为第一模型,将训练完成后的第二预设模型作为第二模型。For the above-mentioned training, it should also be pointed out that the way of the above-mentioned training convergence can include at least one of the following: judging whether the number of iterative training reaches the preset number of times, judging whether the first difference degree is less than the first preset threshold value, judging the second difference Whether the degree is smaller than the second preset threshold value. Wherein, the preset times, the first preset threshold value and the second preset threshold value can be set according to actual conditions, and are not exhaustive. That is to say, when it is determined that the training is completed based on the above manner, the first preset model after the training can be used as the first model, and the second preset model after the training can be used as the second model.
情况六、与情况五不同在于,所述第一预设模型中包括估计预设子模型、预设信道生成子模型和压缩预设子模型。Case 6 is different from Case 5 in that the first preset model includes an estimation preset sub-model, a preset channel generation sub-model, and a compression preset sub-model.
本情况中各个模型的组成以及各个模型之间的输入输出关系与前述情况二相同,具体可以参见图8b,这里不做重复说明。The composition of each model in this case and the input-output relationship between each model are the same as those in the second case, for details, please refer to FIG. 8 b , which will not be repeated here.
所述终端设备采用训练样本对第一预设模型和第二预设模型进行联合训练,包括:The terminal device uses training samples to jointly train the first preset model and the second preset model, including:
所述终端设备将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The terminal device inputs the first training sample into the estimated preset sub-model of the first preset model, and obtains initial information output by the estimated preset sub-model;
将所述初始信息输入所述第一预设模型的信道生成预设子模型,得到所述信道生成预设子模型输出的初始信息的特征向量信息;inputting the initial information into the channel generation preset sub-model of the first preset model, and obtaining eigenvector information of the initial information output by the channel generation preset sub-model;
将所述初始信息的特征向量信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的特征向量信息;Inputting the eigenvector information of the initial information into the compressed preset sub-model of the first preset model to obtain the compressed eigenvector information output by the compressed preset sub-model;
将所述压缩后的特征向量信息输入所述第二预设模型,得到所述第二预设模型输出的恢复的特征向量信息;inputting the compressed feature vector information into the second preset model to obtain restored feature vector information output by the second preset model;
基于所述恢复的特征向量信息以及所述初始信息的特征向量信息之间的第一差异程度,以及基于所述初始信息与第二训练样本之间的第二差异程度,确定第二损失函数;所述第二训练样本与所述第一训练样本对应;determining a second loss function based on a first degree of difference between the restored feature vector information and the feature vector information of the initial information, and based on a second degree of difference between the initial information and a second training sample; The second training sample corresponds to the first training sample;
根据所述第二损失函数进行反向传导更新所述第一预设模型和所述第二预设模型。performing reverse conduction according to the second loss function to update the first preset model and the second preset model.
关于第一训练样本的具体说明与前述情况五相同,因此不做重复说明。需要指出的是,除了第一训练样本之外,输入所述第一预设模型的估计预设子模型的信息还可以无线信道或者场景相关的其他信息,比如可以包括以下至少之一:信道的信噪比、信干扰噪比、信道类型、带宽信息、时延信息等。关于在联合训练的过程中,输入以上信息的一种还是多种可以根据实际情况或实际场景来确定,这里不对其进行限定。The specific description about the first training sample is the same as that of the fifth case above, so the description will not be repeated. It should be pointed out that, in addition to the first training samples, the information input into the estimated preset sub-model of the first preset model may also include other information related to wireless channels or scenes, for example, may include at least one of the following: Signal-to-noise ratio, signal-to-interference-noise ratio, channel type, bandwidth information, delay information, etc. Whether one or more of the above information is input during the joint training process may be determined according to actual conditions or actual scenarios, and is not limited here.
所述第一预设模型的估计预设子模型的具体功能与前述情况五相同。The specific function of the estimation preset sub-model of the first preset model is the same as the fifth case above.
所述信道生成预设子模型的功能可以是对所述初始信息进行特征分解,得到所述初始信息的特征向量信息。其中,所述初始信息的特征向量信息可以包括R组特征向量序列。对所述初始信息进行特征分解的方式可以采用奇异值分解(SVD,Singular Value Decomposition)的方式。A function of the channel generation preset submodel may be to perform eigendecomposition on the initial information to obtain eigenvector information of the initial information. Wherein, the eigenvector information of the initial information may include R groups of eigenvector sequences. The method of performing eigendecomposition on the initial information may adopt a method of Singular Value Decomposition (SVD, Singular Value Decomposition).
所述第一预设模型的压缩预设子模型对输入信息的数据量进行压缩。所述压缩子模型的输出信息与输入信息之间的压缩率可以为训练时确定的,比如,压缩率可以为千分之五、千分之二、百分之十等等,这里不进行穷举。在上述处理中,所述压缩预设子模型得到的压缩后的特征向量信息包含的数据量小于其输入的初始信息的特征向量信息的数据量。上述压缩后的特征向量信息与初始信息的特征向量信息的形式为相同的,比如初始信息的特征向量信息为R组特征向量序列,所述压缩后的特征向量信息也为R组特征向量序列但两者包含的数据量是不同的。The compressed preset sub-model of the first preset model compresses the data volume of the input information. The compression rate between the output information of the compressed sub-model and the input information can be determined during training, for example, the compression rate can be 5/1000, 2/1000, 10%, etc. lift. In the above processing, the compressed eigenvector information obtained by compressing the preset sub-model contains less data than the eigenvector information of the input initial information. The above compressed feature vector information is in the same form as the feature vector information of the initial information. For example, the feature vector information of the initial information is a sequence of R groups of feature vectors, and the compressed feature vector information is also a sequence of feature vectors of groups R but The amount of data contained in the two is different.
所述第二预设模型的功能可以是对其输入信息进行解压缩。在上述处理中,所述第二预设模型的输入信息为压缩后的特征向量信息,第二预设模型的输出为恢复的特征向量信息。第二预设模型的解压缩率应该使得其得到的恢复的特征向量信息与初始信息的特征向量信息包含相同或基本相同的数据。The function of the second preset model may be to decompress its input information. In the above processing, the input information of the second preset model is compressed feature vector information, and the output of the second preset model is restored feature vector information. The decompression rate of the second preset model should be such that the obtained restored feature vector information contains the same or substantially the same data as the feature vector information of the initial information.
根据所述第二损失函数进行反向传导更新所述第一预设模型和所述第二预设模型具体可以指的是:基于第二损失函数进行反向传导更新所述估计预设子模型的模型参数、所述信道生成预设子模型的模型参数、所述压缩预设子模型的模型参数和所述第二预设模型的模型参数。Performing reverse conduction according to the second loss function to update the first preset model and the second preset model may specifically refer to: performing reverse conduction based on the second loss function to update the estimated preset sub-model model parameters of the channel generation preset sub-model, model parameters of the compression preset sub-model and model parameters of the second preset model.
关于上述训练收敛的确定方式与前述情况五相同,不做重复说明。The method for determining the above-mentioned training convergence is the same as that of the above-mentioned case five, and repeated explanations are not repeated.
情况七,所述终端设备采用训练样本对第一预设模型和第二预设模型进行联合训练,得到训练后的所述第一模型和所述第二模型,包括:In case seven, the terminal device uses training samples to jointly train the first preset model and the second preset model to obtain the trained first model and the second model, including:
所述终端设备采用训练样本对第一预设模型、第二预设模型和第三预设模型进行联合训练,得到训练后的所述第一模型、所述第二模型和第三模型;其中,所述第三模型为训练后的第三预设模型。The terminal device uses training samples to jointly train the first preset model, the second preset model and the third preset model to obtain the trained first model, the second model and the third model; wherein , the third model is a trained third preset model.
所述终端设备采用训练样本对第一预设模型、第二预设模型和第三预设模型进行联合训练,包括:The terminal device uses training samples to jointly train the first preset model, the second preset model and the third preset model, including:
所述终端设备将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The terminal device inputs the first training sample into the estimated preset sub-model of the first preset model, and obtains initial information output by the estimated preset sub-model;
将所述初始信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的信息;inputting the initial information into the compressed preset sub-model of the first preset model to obtain compressed information output by the compressed preset sub-model;
将所述压缩后的信息输入所述预设模型的第三预设模型,得到所述第三预设模型输出的变换后的信息;inputting the compressed information into a third preset model of the preset model to obtain transformed information output by the third preset model;
将所述变换后的信息输入所述第二预设模型,得到所述第二预设模型输出的恢复信息;inputting the transformed information into the second preset model to obtain restoration information output by the second preset model;
基于所述恢复信息与所述初始信息之间的第一差异程度,以及基于所述初始信息与第二训练样本之间的第二差异程度,确定第二损失函数;所述第二训练样本与所述第一训练样本对应;Determine a second loss function based on a first degree of difference between the restored information and the initial information, and based on a second degree of difference between the initial information and a second training sample; the second training sample and The first training sample corresponds to;
根据所述第二损失函数进行反向传导更新所述第一预设模型、所述第二预设模型和所述第三预设模型。performing reverse conduction according to the second loss function to update the first preset model, the second preset model, and the third preset model.
本情况中各个模型的组成以及各个模型之间的输入输出关系与前述情况三相同,可以参见图8c,这里不做重复说明。The composition of each model in this case and the input-output relationship between each model are the same as those in the third case above, which can be referred to FIG. 8c, and repeated explanations are not repeated here.
关于第一训练样本的具体说明与前述情况五或情况六相同,因此不做重复说明。The specific description about the first training sample is the same as the foregoing case five or six, so no repeated description is given.
所述第一预设模型的估计预设子模型以及所述第一预设模型的压缩预设子模型的具体功能与前述情况五相同,因此不做重复说明。The specific functions of the estimation preset sub-model of the first preset model and the compression preset sub-model of the first preset model are the same as those of the fifth case above, so repeated descriptions will not be made.
在情况七中相对于情况五增加了第三预设模型,关于所述第三预设模型的功能为模拟信道环境,具体处理可以为对输入信息进行数据变换得到数据变换后的信息作为输出信息。其中,所述数据变换的具体处理方式可以包括卷积处理或者等效于卷积的数据处理;其中,所述等效于卷积的数据处理可以为多次傅里叶变换处理,比如,可以为将所述第三预设模型的输入信息通过傅里叶变换转换到频域后相乘再通过傅里叶反变换转换到时域,以此来等效时域卷积的处理。In case seven, a third preset model is added relative to case five. The function of the third preset model is to simulate the channel environment, and the specific processing can be to perform data transformation on input information to obtain data transformed information as output information . Wherein, the specific processing method of the data transformation may include convolution processing or data processing equivalent to convolution; wherein, the data processing equivalent to convolution may be multiple Fourier transform processing, for example, may In order to transform the input information of the third preset model into the frequency domain through Fourier transform, multiply and then transform into the time domain through inverse Fourier transform, the process of convolution in the time domain is equivalent.
所述第二预设模型的功能可以是对其输入信息进行解压缩。The function of the second preset model may be to decompress its input information.
所述基于第二损失函数所确定的差异程度来进行反向传导更新更新所述第一预设模型、所述第二预设模型和所述第 三预设模型具体可以指的是:基于第二损失函数进行反向传导更新所述估计预设子模型的模型参数、所述压缩预设子模型的模型参数、所述第二预设模型的模型参数和所述第三预设模型的模型参数。The performing reverse conduction update based on the degree of difference determined by the second loss function to update the first preset model, the second preset model, and the third preset model may specifically refer to: based on the first The second loss function performs reverse conduction to update the model parameters of the estimated preset sub-model, the model parameters of the compressed preset sub-model, the model parameters of the second preset model, and the model of the third preset model parameter.
关于上述训练收敛的方式与前述情况五或情况六相同,不做重复说明。The manner of the above-mentioned training convergence is the same as that of the foregoing case five or six, and no repeated description is made.
情况八,与上述情况七不同在于所述第一预设模型中包括估计预设子模型、预设信道生成子模型和压缩预设子模型。Case 8 is different from the above case 7 in that the first preset model includes an estimation preset submodel, a preset channel generation submodel, and a compression preset submodel.
所述终端设备采用训练样本对第一预设模型、第二预设模型和第三预设模型进行联合训练,包括:The terminal device uses training samples to jointly train the first preset model, the second preset model and the third preset model, including:
所述终端设备将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The terminal device inputs the first training sample into the estimated preset sub-model of the first preset model, and obtains initial information output by the estimated preset sub-model;
将所述初始信息输入所述第一预设模型的信道生成预设子模型,得到所述信道生成预设子模型输出的初始信息的特征向量信息;inputting the initial information into the channel generation preset sub-model of the first preset model, and obtaining eigenvector information of the initial information output by the channel generation preset sub-model;
将所述初始信息的特征向量信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的特征向量信息;Inputting the eigenvector information of the initial information into the compressed preset sub-model of the first preset model to obtain the compressed eigenvector information output by the compressed preset sub-model;
将所述压缩后的特征向量信息输入所述预设模型中的第三预设模型,得到所述第三预设模型输出的变换后的特征向量信息;inputting the compressed feature vector information into a third preset model among the preset models, to obtain transformed feature vector information output by the third preset model;
将所述变换后的特征向量信息输入所述第二预设模型,得到所述第二预设模型输出的恢复的特征向量信息;inputting the transformed eigenvector information into the second preset model to obtain restored eigenvector information output by the second preset model;
基于所述恢复的特征向量信息以及所述初始信息的特征向量信息之间的第一差异程度,以及基于所述初始信息与第二训练样本之间的第二差异程度,确定第二损失函数;所述第二训练样本与所述第一训练样本对应;determining a second loss function based on a first degree of difference between the restored feature vector information and the feature vector information of the initial information, and based on a second degree of difference between the initial information and a second training sample; The second training sample corresponds to the first training sample;
根据所述第二损失函数进行反向传导更新所述第一预设模型、所述第二预设模型和所述第三预设模型。performing reverse conduction according to the second loss function to update the first preset model, the second preset model, and the third preset model.
本情况中各个模型的组成以及各个模型之间的输入输出关系与前述情况四相同,可以参见图8d,这里不做重复说明。In this case, the composition of each model and the input-output relationship between each model are the same as the foregoing case 4, which can be referred to FIG. 8d , and repeated descriptions are not repeated here.
关于第一训练样本的具体说明与前述情况五、情况六、情况七中任意之一相同,因此不做重复说明。The specific description about the first training sample is the same as any one of the above-mentioned case 5, case 6, and case 7, so the description will not be repeated.
所述第一预设模型的估计预设子模型的具体功能与情况五、情况六、情况七中任意之一相同。The specific function of the estimated preset sub-model of the first preset model is the same as any one of the fifth, sixth, and seventh cases.
所述信道生成预设子模型的功能可以是对所述初始信息进行特征分解,得到所述初始信息的特征向量信息。其中,所述初始信息的特征向量信息可以包括R组特征向量序列。举例来说,对初始信息进行特征分解的方式可以采用奇异值分解(SVD,Singular Value Decomposition)的方式。A function of the channel generation preset submodel may be to perform eigendecomposition on the initial information to obtain eigenvector information of the initial information. Wherein, the eigenvector information of the initial information may include R groups of eigenvector sequences. For example, the method of performing eigendecomposition on the initial information may adopt a method of Singular Value Decomposition (SVD, Singular Value Decomposition).
所述第一预设模型的压缩预设子模型对输入信息的数据量进行压缩。所述压缩子模型的输出信息与输入信息之间的压缩率可以为训练时确定的,比如,压缩率可以为千分之五、千分之二、百分之十等等,这里不进行穷举。The compressed preset sub-model of the first preset model compresses the data volume of the input information. The compression rate between the output information of the compressed sub-model and the input information can be determined during training, for example, the compression rate can be 5/1000, 2/1000, 10%, etc. lift.
所述第三预设模型的功能为模拟无线信道环境,具体处理可以为对输入信息进行数据变换得到数据变换后的信息作为输出信息。其中,所述数据变换的具体处理方式可以包括卷积处理或者等效于卷积的数据处理;其中,所述等效于卷积的数据处理可以为多次傅里叶变换处理,比如,可以为将所述第三预设模型的输入信息通过傅里叶变换转换到频域后相乘再通过傅里叶反变换转换到时域,以此来等效时域卷积的处理。The function of the third preset model is to simulate the wireless channel environment, and the specific processing may be to perform data transformation on input information to obtain information after data transformation as output information. Wherein, the specific processing method of the data transformation may include convolution processing or data processing equivalent to convolution; wherein, the data processing equivalent to convolution may be multiple Fourier transform processing, for example, may In order to transform the input information of the third preset model into the frequency domain through Fourier transform, multiply and then transform into the time domain through inverse Fourier transform, the process of convolution in the time domain is equivalent.
所述第二预设模型的功能可以是对其输入信息进行解压缩。所述第二预设模型的输入信息为变换后的特征向量信息,第二预设模型的输出为恢复的特征向量信息。第二预设模型的解压缩率应该使得其得到的恢复的特征向量信息与初始信息的特征向量信息的包含接近或相同的数据。The function of the second preset model may be to decompress its input information. The input information of the second preset model is transformed feature vector information, and the output of the second preset model is restored feature vector information. The decompression rate of the second preset model should be such that the obtained restored feature vector information and the feature vector information of the initial information contain data that are close to or identical.
所述基于第一损失函数所确定的差异程度来进行反向传导更新更新所述第一预设模型、所述第二预设模型和所述第三预设模型具体可以指的是:基于第一损失函数所确定的差异程度来进行反向传导更新所述估计预设子模型的模型参数、所述信道生成预设子模型的模型参数、所述压缩预设子模型的模型参数、所述第二预设模型的模型参数和所述第三预设模型的模型参数。The performing reverse conduction update based on the degree of difference determined by the first loss function to update the first preset model, the second preset model, and the third preset model may specifically refer to: based on the first The degree of difference determined by a loss function is used to perform reverse conduction to update the model parameters of the estimated preset sub-model, the model parameters of the channel generation preset sub-model, the model parameters of the compressed preset sub-model, the The model parameters of the second preset model and the model parameters of the third preset model.
关于上述训练收敛的方式与前述情况五、情况六、情况七中任意之一相同,因此不做重复说明。The manner of the above-mentioned training convergence is the same as any one of the aforementioned cases 5, 6, and 7, so repeated explanations will not be made.
所述终端设备通过采用以上第二种方式可以得到自身联合训练后的第一模型、第二模型,或者得到联合训练后的第一模型、第二模型以及第三模型。进而可以执行前述S610~S620的处理。The terminal device may obtain the first model and the second model after its own joint training, or obtain the first model, the second model and the third model after joint training by adopting the above second method. Furthermore, the above-mentioned processing of S610 to S620 may be performed.
在上述第二种方式所提供的终端设备自身进行联合训练得到第一模型和第二模型,以及终端设备自身进行联合训练得到第一模型、第二模型和第三模型的处理中使用了训练样本,下面针对训练样本进行详细说明:The training samples are used in the processing of the joint training of the terminal device itself to obtain the first model and the second model, and the joint training of the terminal device itself to obtain the first model, the second model and the third model provided by the second method above. , the following is a detailed description of the training samples:
所述训练样本中可以包含第一训练样本。所述第一训练样本可以为参考信号样本。所述参考信号样本可以为历史采集得到的原始参考信号、或者处理过的参考信号。其中,所述原始参考信号可以指的是未经过无线信道传输的参考信号。获取处理参考信号的方法可以包括:将原始参考信号通过无线信道(或真实无线信道、或实际无线信道)后接收到的参考信号作为处理过的参考信号。或者,获取处理参考信号的方法可以包括:将原始参考信号通过模拟的无线信道后接收的参考信号作为处理过的参考信号。再进一步地,原始参考信号可以为下行参考信号,或者上行参考信号。The training samples may include a first training sample. The first training samples may be reference signal samples. The reference signal samples may be original reference signals or processed reference signals obtained through historical acquisition. Wherein, the original reference signal may refer to a reference signal that has not been transmitted through a wireless channel. The method for acquiring and processing the reference signal may include: using the reference signal received after the original reference signal passes through the wireless channel (or the real wireless channel, or the real wireless channel) as the processed reference signal. Alternatively, the method for obtaining a processed reference signal may include: using a reference signal received after the original reference signal passes through a simulated wireless channel as a processed reference signal. Still further, the original reference signal may be a downlink reference signal or an uplink reference signal.
所述第一训练样本分布在第一维度和/或第二维度。The first training samples are distributed in the first dimension and/or the second dimension.
其中,所述第一维度为时域维度;所述第一训练样本包括在所述时域维度中的m个时间单元内分布的第一信息样本;m为正整数。其中,所述m个时间单元中每个时间单元中可以分布有n个第一信息样本,n为正整数。所述每个时间单元可以包含有至少一个时隙、或至少一个符号(比如OFDM符号)。Wherein, the first dimension is a time domain dimension; the first training samples include first information samples distributed in m time units in the time domain dimension; m is a positive integer. Wherein, n first information samples may be distributed in each of the m time units, where n is a positive integer. Each time unit may include at least one time slot, or at least one symbol (such as an OFDM symbol).
举例来说,所述第一信息样本为下行参考信号样本,每个时间单元内包含的时隙数量可以为c个(c为正整数),在每c个时隙内有n个下行参考信号样本,c和n的组合可以是例如(1,1)(1,2)(1,3)(1,4)(1,6)(2,1)(4,1)(5,1)(8,1)(10,1)。For example, the first information sample is a downlink reference signal sample, the number of time slots contained in each time unit may be c (c is a positive integer), and there are n downlink reference signals in each c time slot A sample, combination of c and n can be e.g. (1,1)(1,2)(1,3)(1,4)(1,6)(2,1)(4,1)(5,1) (8,1)(10,1).
所述第二维度为频域维度;所述第一训练样本包括在所述频域维度中的x个频域资源上分布的第一信息样本;x为正整数。其中,所述x个频域资源中每个频域资源中可以分布有y个第一信息样本,y为正整数。所述每个频域资源可以包含有至少一个资源块(RB)、或至少一个子载波。The second dimension is a frequency domain dimension; the first training samples include first information samples distributed on x frequency domain resources in the frequency domain dimension; x is a positive integer. Wherein, y first information samples may be distributed in each of the x frequency domain resources, and y is a positive integer. Each frequency domain resource may include at least one resource block (RB), or at least one subcarrier.
举例来说,所述第一信息样本为下行参考信号样本,每个频域资源内包含的时隙数量可以为d个(d为正整数),在频域上每d个RB内有y个下行参考信号样本,d和y的组合可以是例如(1,1)(1,2)(2,1)(1,3)(3,1)(1,4)(4,1)(1,6)(6,1)。For example, the first information sample is a downlink reference signal sample, and the number of time slots contained in each frequency domain resource may be d (d is a positive integer), and there are y time slots in every d RBs in the frequency domain Downlink reference signal samples, the combination of d and y can be, for example, (1,1)(1,2)(2,1)(1,3)(3,1)(1,4)(4,1)(1 ,6)(6,1).
上述第一训练样本分布在第一维度和/或第二维度,可以理解为可以仅根据第一训练样本在频域维度上的分布情况来进行后续的训练,也可以仅根据第一训练样本在时域维度上的分布情况来进行后续的训练,还可以根据第一训练样本在频域维度以及时域维度上的分布情况来进行后续的训练。比如,一个第一训练样本在频域维度上包含10个RB、在时域 维度上包含1个时隙,每个RB中有3个第一信号样本,每个时隙有1个第一信号样本,则第一训练样本一共包含有30个第一信号样本。The above-mentioned first training samples are distributed in the first dimension and/or the second dimension. It can be understood that the subsequent training can be performed only according to the distribution of the first training samples in the frequency domain dimension, or only based on the distribution of the first training samples in the frequency domain. The subsequent training may be performed according to the distribution of the first training sample in the frequency domain and the time domain. For example, a first training sample contains 10 RBs in the frequency domain dimension and 1 time slot in the time domain dimension, each RB has 3 first signal samples, and each time slot has 1 first signal samples, the first training samples include a total of 30 first signal samples.
上述第一维度和第二维度,即时域维度和频域维度的大小可以相等、也可以不相等。另外,也可以将上述时域维度和频域维度合并成为一个维度,具体合并是可以是先时域维度再频域维度,也可以是先频域维度再时域维度,本实施例不对其进行限定。The sizes of the first dimension and the second dimension, the time domain dimension and the frequency domain dimension may be equal or unequal. In addition, the above-mentioned time-domain dimension and frequency-domain dimension can also be combined into one dimension. Specifically, the combination can be the time-domain dimension first and then the frequency-domain dimension, or the frequency-domain dimension first and then the time-domain dimension, which is not implemented in this embodiment limited.
需要注意的是,因为原始参考信号、或者处理过的参考信号都可以是通过复数来呈现,所以本实施例提供的方案上述第一训练样本可以在上述第一维度和第二维度的基础上额外增加复数的呈现形式(或可以理解为增加一个维度,该维度是将原始参考信号、或者处理过的参考信号的虚部和实部数据独立呈现所造成的),具体的:所述第一训练样本还分布在第三维度。所述第三维度为复数维度;所述第一训练样本包括第一信息样本的实部和第一信息样本的虚部。举例来说,假设一个第一训练样本中在时域维度上包含1个时间单元(比如1个时隙),频域维度上包含10个频域资源(比如10个RB),每个第一信息样本可以表示为实部和虚部,则第一训练样本可以为一个1×10×2的矩阵。It should be noted that, because the original reference signal or the processed reference signal can be represented by complex numbers, the solution provided by this embodiment can be based on the above-mentioned first dimension and second dimension. Increase the presentation form of complex numbers (or it can be understood as adding a dimension, which is caused by the independent presentation of the imaginary part and real part data of the original reference signal or the processed reference signal), specifically: the first training The samples are also distributed in the third dimension. The third dimension is a complex dimension; the first training samples include the real part of the first information sample and the imaginary part of the first information sample. For example, assuming that a first training sample contains 1 time unit (such as 1 time slot) in the time domain dimension, and contains 10 frequency domain resources (such as 10 RBs) in the frequency domain dimension, each first The information sample can be expressed as a real part and an imaginary part, and the first training sample can be a 1×10×2 matrix.
所述训练样本中还包含与所述第一训练样本对应的第二训练样本;所述第二训练样本由T个维度的矩阵构成;T为大于等于2的整数。这里,所述第二训练样本可以用于表征基于所述第一训练样本所期望得到的信道质量、或称为信道响应、或称为信道状态、或称为信道估计结果、或称为信道信息。The training samples also include a second training sample corresponding to the first training sample; the second training sample is composed of a matrix of T dimensions; T is an integer greater than or equal to 2. Here, the second training samples may be used to characterize the expected channel quality based on the first training samples, or channel response, or channel state, or channel estimation results, or channel information .
所述T个维度中包含第四维度和第五维度。The T dimensions include a fourth dimension and a fifth dimension.
所述T个维度的矩阵具体可以为M×N的二维矩阵;其中,M表示在第四维度下的第一粒度的数量,N表示在第五维度下的第二粒度的数量;M和N均为正整数。The matrix of the T dimensions may specifically be a two-dimensional matrix of M×N; wherein, M represents the number of first granularities in the fourth dimension, and N represents the number of second granularities in the fifth dimension; M and N are all positive integers.
也就是说,一个第二训练样本由大小为M×N的二维矩阵构成,其在第四维度上有M个第一粒度,在第五维度上有N个第二粒度;上述M和N可以相等也可以不相等。所述二维矩阵内具体的数值指示代表信道质量某一个第一粒度下接收的信号强度,这里所述二维矩阵内的数值的具体可以指的是信号强度值,其单位可以是dBm,或没有单位而是归一化之后所得到的数值。此外,也可以将M×N的二维矩阵合成成为1×(M×N)大小或者(M×N)×1大小的一维数据,具体变换是可以是先第四维度再第五维度,也可以是先第五维度再第四维度,本实施例不对其进行限定。That is to say, a second training sample consists of a two-dimensional matrix with a size of M×N, which has M first granularities in the fourth dimension and N second granularities in the fifth dimension; the above M and N May or may not be equal. The specific numerical indication in the two-dimensional matrix represents the received signal strength at a certain first granularity of the channel quality. The specific numerical value in the two-dimensional matrix here may refer to the signal strength value, and its unit may be dBm, or There is no unit but the value obtained after normalization. In addition, the two-dimensional matrix of M×N can also be synthesized into one-dimensional data of size 1×(M×N) or (M×N)×1. The specific transformation can be the fourth dimension first and then the fifth dimension. It may also be the fifth dimension first and then the fourth dimension, which is not limited in this embodiment.
可选地,所述第四维度为频域维度;所述第一粒度包含以下之一:L1个资源块RB,L2个子载波;L1和L2为正整数。或者,所述第四维度为时域维度;所述第一粒度包含以下之一:K1个微秒、K2个符号长度、K3个符号的采样点个数;K1、K2和K3为正整数。所述符号为正交频分复用符号(OFDM,Orthogonal Frequency Division Multiplexing)。这里,第四维度为时域维度的时候,所述第一粒度还可以称为时延粒度。Optionally, the fourth dimension is a frequency domain dimension; the first granularity includes one of the following: L1 resource blocks RB, L2 subcarriers; L1 and L2 are positive integers. Alternatively, the fourth dimension is a time domain dimension; the first granularity includes one of the following: K1 microseconds, K2 symbol length, and K3 symbol sampling points; K1, K2, and K3 are positive integers. The symbols are Orthogonal Frequency Division Multiplexing (OFDM, Orthogonal Frequency Division Multiplexing). Here, when the fourth dimension is a time domain dimension, the first granularity may also be called a delay granularity.
举例来说,在第一训练样本为参考信号样本或下行参考信号样本的时候,所述第二训练样本可以为与所述参考信号样本所对应的信道信息样本,或者还可以称为信道状态样本等等,这里不对其名称进行穷举。当第四维度是频域维度时,第一粒度可以是L1个RB(L1大于等于1,例如2RB,4RB,8RB),则一个第二训练样本在频域维度上的分布范围是M×L1个RB所对应的频域范围;或者第一粒度可以是L2个子载波(L2大于1,例如4个子载波,6个子载波,18个子载波),则一个第二训练样本在频域维度上的分布是M×L2个子载波对应的频域范围。当第四维度是时域维度时,第一粒度可以是时延粒度,例如一个第一粒度是K1个微秒、或者K2个符号长度、或者K3个符号的采样点个数,这里所述符号可以是一个OFDM符号;当第四维度是时域维度且第一粒度为K1个微秒时,一个第二训练样本在时域维度上的分布范围是M×K1个微秒对应的时域范围。For example, when the first training sample is a reference signal sample or a downlink reference signal sample, the second training sample may be a channel information sample corresponding to the reference signal sample, or may also be called a channel state sample Wait, I'm not going to exhaust the names here. When the fourth dimension is the frequency domain dimension, the first granularity can be L1 RBs (L1 is greater than or equal to 1, such as 2RB, 4RB, 8RB), and the distribution range of a second training sample in the frequency domain dimension is M×L1 The frequency domain range corresponding to each RB; or the first granularity can be L2 subcarriers (L2 is greater than 1, such as 4 subcarriers, 6 subcarriers, and 18 subcarriers), then the distribution of a second training sample on the frequency domain dimension is the frequency domain range corresponding to M×L2 subcarriers. When the fourth dimension is a time-domain dimension, the first granularity may be a delay granularity, for example, a first granularity is the number of sampling points of K1 microseconds, or K2 symbol lengths, or K3 symbols, where the symbols It can be an OFDM symbol; when the fourth dimension is the time domain dimension and the first granularity is K1 microseconds, the distribution range of a second training sample in the time domain dimension is the time domain range corresponding to M×K1 microseconds .
所述第五维度为空间域维度;相应的,所述第二粒度为一对收发天线或到达角度的间隔。The fifth dimension is a space domain dimension; correspondingly, the second granularity is an interval between a pair of transmitting and receiving antennas or an angle of arrival.
举例来说,第五维度为所述空间域维度,具体地可以是天线维度,例如第五维度上由N个天线对构成,相应的,第二粒度是一对收发天线。或者,第五维度为空间域维度,具体的可以是角度域维度,例如第五维度上由N个到达角度构成,第二粒度是上述N个到达角度之间的到达角度的间隔大小。For example, the fifth dimension is the space domain dimension, specifically, the antenna dimension, for example, the fifth dimension is composed of N antenna pairs, and correspondingly, the second granularity is a pair of transmitting and receiving antennas. Alternatively, the fifth dimension is a space domain dimension, specifically an angle domain dimension, for example, the fifth dimension is composed of N arrival angles, and the second granularity is the interval between the above N arrival angles.
再进一步地,所述二维矩阵中的第ij个位置的数值用于表示在所述第四维度下的第i个第一粒度以及第五维度的第j个第二粒度下的信道质量;i和j均为正整数。也就是说,在使用一个第一训练样本的情况下,用于表示第二训练样本的所述二维矩阵中某一个位置处的数值(或称为指示值)代表了在第四维度以及第五维度这样的组合下的所期望得到的信道质量情况。其中,信道质量或信道质量情况可以采用信号强度来表征,其数值(或称为指示值)的单位可以是dBm,或没有单位而是归一化之后所得到的数值。Still further, the value at the ijth position in the two-dimensional matrix is used to represent the channel quality at the i-th first granularity in the fourth dimension and the j-th second granularity in the fifth dimension; Both i and j are positive integers. That is to say, in the case of using a first training sample, the value (or referred to as an indicator value) at a certain position in the two-dimensional matrix used to represent the second training sample represents the The expected channel quality situation under such a combination of five dimensions. Wherein, the channel quality or the channel quality situation can be characterized by signal strength, and the unit of the value (or indicator value) can be dBm, or there is no unit but a value obtained after normalization.
例如,结合图9来说,在M×N的二维矩阵中,若第四维度表示频域维度,第五维度为空间域维度具体为天线维度,第一粒度为2RB,第二粒度为1对收发天线;若M×N的二维矩阵中的第ij个位置为第i=3j=6个位置,则为图9中所示出的第3行第6列上黑色方框所在位置,该位置处的数值(或称为指示值)可以用于表示第6对收发天线上的第3个2RB带宽(也就是第5个RB至第6个RB)上的信道质量(或信道质量情况)。另外,图9中还可以用S来表示第二训练样本的数量,S可以为大于等于1的整数,也就是说,第二训练样本可以包含一个或多个。For example, in conjunction with Figure 9, in the M×N two-dimensional matrix, if the fourth dimension represents the frequency domain dimension, the fifth dimension is the space domain dimension, specifically the antenna dimension, the first granularity is 2RB, and the second granularity is 1 For the transceiver antenna; if the ij position in the two-dimensional matrix of M×N is the i=3j=6 position, then it is the position of the black box on the 3rd row and the 6th column shown in Fig. 9, The value (or indicator value) at this position can be used to represent the channel quality (or channel quality situation) on the third 2RB bandwidth (that is, the fifth RB to the sixth RB) on the sixth pair of transceiver antennas ). In addition, in FIG. 9 , S may also be used to represent the number of second training samples, and S may be an integer greater than or equal to 1, that is, the second training samples may include one or more.
再例如,在图10中展示的M×N的二维矩阵中,第四维度表示时域维度,第四维度为时域维度的时候,所述第一粒度为1个时延粒度;第五维度为空间域维度具体为角度维度,第二粒度为1个角度基本粒度(比如可以是1个到达角度的间隔);若M×N的二维矩阵中的第ij个位置为第i=4j=5个位置,则为图10中所示出的中的第4行第5列上黑色方框所在位置,该位置处的数值(或指示值)可以表示第5个空间粒度(例如到达角度的间隔)内的第4个时延粒度上的信道质量(或信道质量情况)。另外,图10中还可以用S来表示第二训练样本的数量,S可以为大于等于1的整数,也就是说,第二训练样本可以包含一个或多个。For another example, in the M×N two-dimensional matrix shown in FIG. 10 , the fourth dimension represents the time domain dimension, and when the fourth dimension is the time domain dimension, the first granularity is one delay granularity; the fifth The dimension is the spatial domain dimension, specifically the angle dimension, and the second granularity is the basic granularity of 1 angle (for example, it can be the interval of 1 arrival angle); if the ij-th position in the M×N two-dimensional matrix is i=4j = 5 positions, then it is the position of the black box on the 4th row and 5th column shown in Fig. The channel quality (or channel quality situation) at the 4th delay granularity within the interval of . In addition, in FIG. 10 , S may also be used to represent the number of second training samples, and S may be an integer greater than or equal to 1, that is, the second training samples may include one or more.
所述T个维度中还包括第六维度。相应的,所述T个维度的矩阵为M×N×W的三维矩阵;其中,M表示在第四维度下的第一粒度的数量,N表示在第五维度下的第二粒度的数量,W表示在第六维度下的第三粒度的数量;M、N和W均为正整数。The T dimensions also include a sixth dimension. Correspondingly, the matrix of T dimensions is a three-dimensional matrix of M×N×W; wherein, M represents the number of first granularities in the fourth dimension, and N represents the number of second granularities in the fifth dimension, W represents the quantity of the third granularity under the sixth dimension; M, N and W are all positive integers.
所述三维矩阵中的第ijk个位置的数值用于表示在所述第四维度下的第i个第一粒度、第五维度的第j个第二粒度下、所述第六维度的第k个第三粒度下与所述第一训练样本所对应的信道质量;i、j和k均为正整数。The value of the ijkth position in the three-dimensional matrix is used to represent the i-th first granularity in the fourth dimension, the j-th second granularity in the fifth dimension, and the k-th in the sixth dimension channel quality corresponding to the first training sample at a third granularity; i, j and k are all positive integers.
其中,关于第四维度及其第一粒度,第五维度及其第二粒度的说明与前述实施例相同,这里不再重复说明。Wherein, the explanations about the fourth dimension and its first granularity, the fifth dimension and its second granularity are the same as those in the foregoing embodiments, and will not be repeated here.
本实施例中,所述第六维度可以为复数维度。这是由于所述第二训练样本可以用于表征基于所述第一训练样本所期 望得到的信道质量(或称为信道响应、或称为信道状态、或称为信道估计结果、或称为信道信息),而上述信道质量还可以通过复数来呈现,因此可以在所述第二训练样本的以上两个维度的基础上增加一个第六维度即复数维度,该复数维度是将所述第二训练样本中的信道质量的虚部和实部独立呈现所产生的。In this embodiment, the sixth dimension may be a complex dimension. This is because the second training samples can be used to characterize the expected channel quality based on the first training samples (or called channel response, or called channel state, or called channel estimation result, or called channel information), and the above-mentioned channel quality can also be presented by a complex number, so a sixth dimension, that is, a complex number dimension, can be added on the basis of the above two dimensions of the second training sample, and the complex number dimension is the second training sample. The imaginary and real parts of the channel quality in the samples are presented independently generated.
具体来说,所述第六维度为复数维度;所述第三粒度为1,在所述复数维度下的第三粒度的数量W为2。其中,所述第三粒度为1具体指的是一个实部或一个虚部,所述第三粒度的数量为2指的是在复数维度下可以存在2个第三粒度。Specifically, the sixth dimension is a complex dimension; the third granularity is 1, and the quantity W of the third granularity under the complex dimension is 2. Wherein, the third granularity being 1 specifically refers to a real part or an imaginary part, and the number of the third granularity being 2 means that there may be two third granularities in the complex dimension.
所述k为第一值的情况下,所述三维矩阵中的第ijk个位置的数值用于表示在所述第四维度下的第i个第一粒度、所述第五维度的第j个第二粒度下的信道质量的实部;When the k is the first value, the value of the ijkth position in the three-dimensional matrix is used to represent the i-th first granularity in the fourth dimension and the j-th granularity in the fifth dimension The real part of the channel quality at the second granularity;
所述k为第二值的情况下,所述三维矩阵中的第ijk个位置的数值用于表示在所述第四维度下的第i个第一粒度、所述第五维度的第j个第二粒度下的信道质量的虚部。When the k is the second value, the value of the ijkth position in the three-dimensional matrix is used to represent the i-th first granularity in the fourth dimension and the j-th granularity in the fifth dimension The imaginary part of the channel quality at the second granularity.
其中,所述第一值与所述第二值不同,比如可以设置第一值为1第二值为2,又或者,第一值可以为0第二值可以为1,再或者第一值可以为1第二值可以为0,只要第一值与第二值不同则在本实施例的保护范围内。Wherein, the first value is different from the second value, for example, the first value can be set to 1 and the second value can be 2, or the first value can be 0 and the second value can be 1, or the first value It can be 1 and the second value can be 0, as long as the first value is different from the second value, it is within the protection scope of this embodiment.
举例来说,M×N×W的三维矩阵中,第四维度表示时域维度,第四维度为时域维度的时候,所述第一粒度还可以称为时延粒度;第五维度为空间域维度具体为角度维度,第二粒度为到达角度的间隔;第六维度为复数维度,W为2,k为1表示实部k为2表示虚部。若i=4、j=5、k=1,则表示第4行第5列上的数值(或指示值)为第5个空间粒度(例如到达角度的间隔)内的第4个时延粒度上的信道质量(或信道质量情况)的实部。若i=4、j=5、k=2,则表示第4行第5列上的数值(或指示值)为第5个空间粒度(例如到达角度的间隔)内的第4个时延粒度上的信道质量(或信道质量情况)的虚部。For example, in a three-dimensional matrix of M×N×W, the fourth dimension represents the time domain dimension, and when the fourth dimension is the time domain dimension, the first granularity can also be called the delay granularity; the fifth dimension is the space The domain dimension is specifically the angle dimension, and the second granularity is the interval of the arrival angle; the sixth dimension is the complex number dimension, W is 2, k is 1 for the real part and 2 for the imaginary part. If i=4, j=5, k=1, it means that the value (or indicator value) on the 4th row and 5th column is the 4th delay granularity in the 5th spatial granularity (such as the interval of arrival angle) The real part of the channel quality (or channel quality situation) on . If i=4, j=5, k=2, it means that the value (or indicator value) on the 4th row and 5th column is the 4th delay granularity in the 5th spatial granularity (such as the interval of arrival angle) The imaginary part of the channel quality (or channel quality situation) on .
此外,还需要注意的是,上述第二训练样本还可以是在上述第四维度、第五维度和第六维度的基础上的拆分与组合,例如当第五维度是天线对维度时,还可以拆分成为发送天线子维度和接收天线子维度,从而扩展上述第二训练样本的维度,本实施例不再对拆分后各种可能存在的子维度进行穷举。In addition, it should be noted that the above-mentioned second training samples can also be split and combined on the basis of the above-mentioned fourth dimension, fifth dimension, and sixth dimension. For example, when the fifth dimension is an antenna pair dimension, the It can be split into sending antenna sub-dimensions and receiving antenna sub-dimensions, thereby expanding the dimension of the second training sample. This embodiment does not exhaustively enumerate various possible sub-dimensions after splitting.
在上述第二种方式中,终端设备自身对第一预设模型、第二预设模型进行联合训练得到训练后的第一模型和第二模型;或由所述终端设备自身对第一预设模型、第二预设模型和第三预设模块联合训练得到训练后的第一模型、第二模型以及第三模型。这种方式下,所述终端设备至少还可以发送训练后的第二模型。下面对终端设备进行模型发送的处理进行示例说明:In the above-mentioned second method, the terminal device itself performs joint training on the first preset model and the second preset model to obtain the trained first model and the second model; or the terminal device itself trains the first preset model The model, the second preset model and the third preset module are jointly trained to obtain the trained first model, the second model and the third model. In this way, the terminal device can at least send the trained second model. The following is an example of how the terminal device sends the model:
示例一、Example one,
所述终端设备自身对第一预设模型、第二预设模型进行联合训练得到训练后的第一模型和第二模型。完成上述训练之后,所述终端设备发送所述第二模型。具体可以是:所述终端设备向网络设备发送所述第二模型。再进一步地,还可以是:所述终端设备向所述网络设备发送所述第二模型的模型参数。The terminal device itself performs joint training on the first preset model and the second preset model to obtain the trained first model and the second model. After the foregoing training is completed, the terminal device sends the second model. Specifically, it may be: the terminal device sends the second model to the network device. Still further, it may also be: the terminal device sends the model parameters of the second model to the network device.
其中,所述网络设备可以是为所述终端设备提供服务的网络设备,比如接入网设备,具体可以是基站、eNB、gNB等。Wherein, the network device may be a network device that provides services for the terminal device, such as an access network device, and specifically may be a base station, eNB, gNB, and the like.
所述第二模型(或第二模型的模型参数)由以下至少之一携带:上行控制信令、RRC消息、上行数据传输、针对人工智能类业务类传输需求的上行数据传输。The second model (or model parameters of the second model) is carried by at least one of the following: uplink control signaling, RRC message, uplink data transmission, and uplink data transmission for the transmission requirements of artificial intelligence services.
这种示例中,所述终端设备可以自身保留所述第一模型,以用于对第一信息进行处理得到第二信息;相应的,网络设备由于可以接收到终端设备发来的第二模型,因此,所述网络设备可以基于所述第二模型对所述第二信息进行处理以得到信道信息。其中,所述信道信息还可以为信道信息的特征向量信息。In such an example, the terminal device may retain the first model itself for processing the first information to obtain the second information; correspondingly, since the network device can receive the second model sent by the terminal device, Therefore, the network device may process the second information based on the second model to obtain channel information. Wherein, the channel information may also be feature vector information of the channel information.
需要说明的是,由于一个网络设备可以服务多个终端设备,因此,所述网络设备可以保存多个终端设备发来的第二模型。以网络设备为基站、终端设备为手机为例来说,基站1可以服务3个手机,分别为手机1、手机2和手机3,基站1可以分别接收到手机1、手机2和手机3发来的第二模型。在基站1接收到手机2发来的第二信息的时候,所述基站1可以基于手机2发来的第二模型对手机2的第二信息进行处理以得到手机2所对应的信道信息。It should be noted that, since one network device can serve multiple terminal devices, the network device can store the second models sent by the multiple terminal devices. Taking the network device as the base station and the terminal device as the mobile phone as an example, base station 1 can serve three mobile phones, namely mobile phone 1, mobile phone 2 and mobile phone 3. of the second model. When the base station 1 receives the second information sent by the mobile phone 2, the base station 1 can process the second information of the mobile phone 2 based on the second model sent by the mobile phone 2 to obtain the channel information corresponding to the mobile phone 2.
示例二、Example two,
所述终端设备自身对第一预设模型、第二预设模型进行联合训练得到训练后的第一模型和第二模型。完成上述训练之后,在所述终端设备发送所述第二模型的基础上,还可以包括:所述终端设备还发送所述第一模型。The terminal device itself performs joint training on the first preset model and the second preset model to obtain the trained first model and the second model. After the above training is completed, on the basis that the terminal device sends the second model, the method may further include: the terminal device also sends the first model.
具体可以是:所述终端设备向网络设备发送所述第一模型。再进一步地,还可以是:所述终端设备向所述网络设备发送所述第一模型的模型参数。Specifically, it may be: the terminal device sends the first model to the network device. Still further, it may also be: the terminal device sends the model parameters of the first model to the network device.
其中,所述网络设备可以是为所述终端设备提供服务的网络设备,比如接入网设备,具体可以是基站、eNB、gNB等。Wherein, the network device may be a network device that provides services for the terminal device, such as an access network device, and specifically may be a base station, eNB, gNB, and the like.
所述第一模型(或第一模型的模型参数)由以下至少之一携带:上行控制信令、RRC消息、上行数据传输、针对人工智能类业务类传输需求的上行数据传输。The first model (or model parameters of the first model) is carried by at least one of the following: uplink control signaling, RRC message, uplink data transmission, and uplink data transmission for the transmission requirements of artificial intelligence services.
上述第一模型与第二模型可以同时发送,或者上述第一模型与第二模型可以分别发送,本实施例不对其进行限定。The above-mentioned first model and the second model may be sent at the same time, or the above-mentioned first model and the second model may be sent separately, which is not limited in this embodiment.
这种示例中,所述终端设备可以通过所述第一模型对第一信息进行处理得到第二信息;相应的,所述网络设备由于可以接收到终端设备发来的第二模型,因此,所述网络设备可以基于所述第二模型对所述第二信息进行处理以得到信道信息。其中,所述信道信息还可以为信道信息的特征向量信息。In this example, the terminal device can process the first information through the first model to obtain the second information; correspondingly, since the network device can receive the second model sent by the terminal device, the The network device may process the second information based on the second model to obtain channel information. Wherein, the channel information may also be feature vector information of the channel information.
进一步地,所述网络设备接收终端设备发来的第一模型和第二模型之后,所述网络设备可以对第一模型以及第二模型进行整体评估,在完成第一模型以及第二模型的整体评估之后,可以决定是否使用本次接收到的第一模型以及第二模型,若整体评估结果较差(比如压缩率较低或者恢复信道信息的准确率较低等等),可以不使用上述第一模型和第二模型。若网络设备决定不使用上述第一模型和第二模型,还可以自身对第一模型以及第二模型重新进行联合训练以更新第一模型以及第二模型的模型参数,或者,网络设备自己训练得到新的第一模型以及第二模型。需要指出,若网络设备重新联合训练第一模型以及第二模型,或者更新第一模型以及第二模型,则所述网络设备还需要将新的第一模型和第二模型发送至所述终端设备,或者网络设备将新的第一模型发送至终端设备。Further, after the network device receives the first model and the second model sent by the terminal device, the network device can conduct an overall evaluation of the first model and the second model, and after completing the overall evaluation of the first model and the second model After the evaluation, you can decide whether to use the first model and the second model received this time. If the overall evaluation result is poor (for example, the compression rate is low or the accuracy of the restored channel information is low, etc.), the above first model may not be used. A model and a second model. If the network device decides not to use the above-mentioned first model and the second model, it can also re-train the first model and the second model to update the model parameters of the first model and the second model, or the network device trains itself to obtain New first model as well as second model. It should be pointed out that if the network device jointly trains the first model and the second model again, or updates the first model and the second model, the network device also needs to send the new first model and the second model to the terminal device , or the network device sends the new first model to the terminal device.
需要说明的是,由于一个网络设备可以服务多个终端设备,因此,所述网络设备还可以保存多个终端设备发来的第一模型和第二模型。进而网络设备可以对每一个终端设备发来的第一模型和第二模型进行整体评估,可以从中选取整体 评估结果最优的目标第一模型及其对应的目标第二模型,然后网络设备可以自己保留使用目标第二模型,将目标第一模型发送给上述多个终端设备。整体评估结果是否最优可以通过压缩率以及恢复准确率等指标来判断。It should be noted that, since one network device can serve multiple terminal devices, the network device can also save the first model and the second model sent by the multiple terminal devices. Furthermore, the network device can conduct an overall evaluation of the first model and the second model sent by each terminal device, and can select the target first model with the best overall evaluation result and its corresponding target second model, and then the network device can itself The target second model is reserved, and the target first model is sent to the above-mentioned multiple terminal devices. Whether the overall evaluation result is optimal can be judged by indicators such as compression ratio and recovery accuracy.
应理解,本示例中所述网络设备还可以保存多个终端设备发来的第一模型和第二模型之后,不对每个终端设备发来的第一模型及第二模型进行处理,而仅是在接收到任意一个终端设备发来的第二信息之后,基于发来第二信息的终端设备的第二模型对该第二信息进行处理。It should be understood that after the network device in this example saves the first model and the second model sent by multiple terminal devices, it does not process the first model and the second model sent by each terminal device, but only After receiving the second information sent by any terminal device, the second information is processed based on the second model of the terminal device that sent the second information.
示例三、Example three,
所述终端设备自身对第一预设模型、第二预设模型进行联合训练得到训练后的第一模型和第二模型。本示例与示例二的不同之处在于,完成上述训练之后,在所述终端设备发送所述第二模型的基础上,所述终端设备可以发送所述第一模型中的估计子模型以及压缩子模型。比如,所述终端设备可以同时发送所述第一模型中的估计子模型以及压缩子模型;或者,所述终端设备可以分别发送所述第一模型中的估计子模型以及压缩子模型。The terminal device itself performs joint training on the first preset model and the second preset model to obtain the trained first model and the second model. The difference between this example and Example 2 is that after the above training is completed, on the basis of the terminal device sending the second model, the terminal device can send the estimation sub-model and the compression sub-model in the first model Model. For example, the terminal device may send the estimated sub-model and the compressed sub-model in the first model at the same time; or, the terminal device may send the estimated sub-model and the compressed sub-model in the first model respectively.
再具体来说,所述终端设备可以同时向网络设备发送所述第一模型中的估计子模型以及压缩子模型;或者,所述终端设备可以分别向网络设备发送所述第一模型中的估计子模型以及压缩子模型。More specifically, the terminal device may send the estimated sub-model and the compressed sub-model in the first model to the network device at the same time; or, the terminal device may send the estimated sub-model in the first model to the network device respectively. Submodels and compressed submodels.
进一步地,所述终端设备可以同时向网络设备发送所述第一模型中的估计子模型的模型参数以及压缩子模型的模型参数;或者,所述终端设备可以分别向网络设备发送所述第一模型中的估计子模型的模型参数以及压缩子模型的模型参数。Further, the terminal device may send the model parameters of the estimated sub-model and the model parameters of the compressed sub-model in the first model to the network device at the same time; or, the terminal device may send the first model parameters to the network device respectively. Model parameters for the estimated submodel and model parameters for the compressed submodel in the model.
所述估计子模型和所述压缩子模型可由以下至少之一同时携带:上行控制信令、RRC消息、上行数据传输、针对人工智能类业务类传输需求的上行数据传输;The estimation sub-model and the compression sub-model may be simultaneously carried by at least one of the following: uplink control signaling, RRC message, uplink data transmission, uplink data transmission for artificial intelligence business type transmission requirements;
所述估计子模型和所述压缩子模型可由以下至少之一分别携带:上行控制信令、RRC消息、上行数据传输、针对人工智能类业务类传输需求的上行数据传输。The estimation sub-model and the compression sub-model may be respectively carried by at least one of the following: uplink control signaling, RRC message, uplink data transmission, and uplink data transmission for the transmission requirements of artificial intelligence services.
这种示例中,所述终端设备可以通过所述第一模型中的估计子模型以及压缩子模型对第一信息进行处理得到第二信息;相应的,网络设备由于可以接收到终端设备发来的第二模型,因此,所述网络设备可以基于所述第二模型对所述第二信息进行处理以得到信道信息。其中,所述信道信息还可以为信道信息的特征向量信息。In this example, the terminal device can process the first information through the estimation sub-model and the compression sub-model in the first model to obtain the second information; correspondingly, the network device can receive the A second model. Therefore, the network device may process the second information based on the second model to obtain channel information. Wherein, the channel information may also be feature vector information of the channel information.
进一步地,所述网络设备接收终端设备发来的第一模型中的估计子模型以及压缩子模型和第二模型之后,所述网络设备可以对估计子模型、压缩子模型以及第二模型进行整体评估,在完成估计子模型、压缩子模型以及第二模型的整体评估之后,可以决定是否使用本次接收到的估计子模型、压缩子模型以及第二模型,若整体评估结果较差(比如压缩率较低或者恢复信道信息的准确率较低等等),可以不使用上述估计子模型、压缩子模型和第二模型。若网络设备决定不使用上述估计子模型、压缩子模型和第二模型,还可以自身对估计子模型、压缩子模型以及第二模型重新进行联合训练以更新估计子模型、压缩子模型以及第二模型的模型参数,或者,网络设备自己训练得到新的估计子模型、压缩子模型以及第二模型。需要指出,若网络设备重新联合训练或更新估计子模型、压缩子模型以及第二模型,则所述网络设备还需要将新的估计子模型、新的压缩子模型和新的第二模型发送至所述终端设备,或者网络设备将新的估计子模型和新的压缩子模型发送至终端设备。Further, after the network device receives the estimated sub-model, the compressed sub-model and the second model in the first model sent by the terminal device, the network device may integrate the estimated sub-model, the compressed sub-model and the second model Evaluation, after completing the overall evaluation of the estimated sub-model, compressed sub-model and the second model, you can decide whether to use the estimated sub-model, compressed sub-model and the second model received this time, if the overall evaluation results are poor (such as compressed rate is low or the accuracy of recovering channel information is low, etc.), the estimation sub-model, the compression sub-model and the second model may not be used. If the network device decides not to use the above estimation sub-model, compression sub-model and second model, it can also re-train the estimation sub-model, compression sub-model and second model by itself to update the estimation sub-model, compression sub-model and second The model parameters of the model, or the network device trains itself to obtain a new estimated sub-model, a compressed sub-model and a second model. It should be pointed out that if the network device jointly trains or updates the estimated sub-model, the compressed sub-model and the second model, the network device also needs to send the new estimated sub-model, the new compressed sub-model and the new second model to The terminal device or the network device sends the new estimated sub-model and the new compressed sub-model to the terminal device.
需要说明的是,由于一个网络设备可以服务多个终端设备,因此,所述网络设备可以保存多个终端设备发来的估计子模型、压缩子模型和第二模型。进而网络设备可以对每一个终端设备发来的估计子模型、压缩子模型和第二模型进行整体评估,可以从中选取整体评估结果最优的目标估计子模型、目标压缩子模型及其对应的目标第二模型,然后网络设备可以自己保留使用目标第二模型,将目标估计子模型和目标压缩子模型发送给上述多个终端设备。整体评估结果是否最优可以通过压缩率以及恢复准确率等指标来判断。It should be noted that, since one network device may serve multiple terminal devices, the network device may store the estimated sub-model, the compressed sub-model and the second model sent by the multiple terminal devices. Furthermore, the network device can conduct an overall evaluation of the estimation sub-model, compression sub-model and second model sent by each terminal device, and can select the target estimation sub-model, target compression sub-model and their corresponding target sub-models with the best overall evaluation results. The second model, and then the network device can reserve and use the target second model, and send the target estimation sub-model and the target compression sub-model to the above-mentioned multiple terminal devices. Whether the overall evaluation result is optimal can be judged by indicators such as compression ratio and recovery accuracy.
应理解,本示例中所述网络设备还可以保存多个终端设备发来的全部模型之后,不对每个终端设备发来的模型进行处理,而仅是在接收到任意一个终端设备发来的第二信息之后,基于发来第二信息的终端设备的第二模型对该第二信息进行处理。It should be understood that after the network device in this example saves all the models sent by multiple terminal devices, it does not process the models sent by each terminal device, but only receives the first model sent by any terminal device. After receiving the second information, process the second information based on the second model of the terminal device that sent the second information.
示例四、Example four,
所述终端设备自身对第一预设模型、第二预设模型进行联合训练得到训练后的第一模型和第二模型;其中,第一模型包含估计子模型、压缩子模型以及信道生成子模型。本示例与示例三的不同之处在于,完成上述训练之后,在所述终端设备发送所述第二模型的基础上,所述终端设备可以发送所述第一模型中的估计子模型、压缩子模型以及信道生成子模型。比如,所述终端设备可以同时发送所述第一模型中的估计子模型、压缩子模型以及信道生成子模型;或者,所述终端设备可以分别发送所述估计子模型、压缩子模型以及信道生成子模型。The terminal device itself performs joint training on the first preset model and the second preset model to obtain the trained first model and the second model; wherein, the first model includes an estimation sub-model, a compression sub-model and a channel generation sub-model . The difference between this example and Example 3 is that after the above training is completed, on the basis of the terminal device sending the second model, the terminal device can send the estimation sub-model, compression sub-model in the first model model and the channel generation submodel. For example, the terminal device may simultaneously send the estimation sub-model, the compression sub-model and the channel generation sub-model in the first model; or, the terminal device may separately send the estimation sub-model, the compression sub-model and the channel generation sub-model submodel.
再具体来说,所述终端设备可以同时向网络设备发送所述第一模型中的估计子模型、压缩子模型以及信道生成子模型;或者,所述终端设备可以分别向网络设备发送所述第一模型中的估计子模型、压缩子模型以及信道生成子模型;再或者,所述终端设备可以向网络设备同时发送所述第一模型中的估计子模型、压缩子模型以及信道生成子模型中的任意之二,再发送剩余的一个子模型。More specifically, the terminal device may send the estimation sub-model, the compression sub-model and the channel generation sub-model in the first model to the network device at the same time; or, the terminal device may send the first model to the network device respectively. The estimation sub-model, compression sub-model and channel generation sub-model in a model; or, the terminal device may simultaneously send the estimation sub-model, compression sub-model and channel generation sub-model in the first model to the network device Any two of , and then send the remaining sub-model.
所述估计子模型、压缩子模型以及信道生成子模型可由以下至少之一同时携带:上行控制信令、RRC消息、上行数据传输、针对人工智能类业务类传输需求的上行数据传输;The estimation sub-model, compression sub-model and channel generation sub-model may be simultaneously carried by at least one of the following: uplink control signaling, RRC message, uplink data transmission, uplink data transmission for artificial intelligence service class transmission requirements;
所述估计子模型、压缩子模型以及信道生成子模型可由以下至少之一分别携带:上行控制信令、RRC消息、上行数据传输、针对人工智能类业务类传输需求的上行数据传输。The estimation sub-model, compression sub-model and channel generation sub-model may be respectively carried by at least one of the following: uplink control signaling, RRC message, uplink data transmission, and uplink data transmission for the transmission requirements of artificial intelligence services.
所述估计子模型、压缩子模型以及信道生成子模型中的任意之二,可由以下至少之一同时携带:上行控制信令、RRC消息、上行数据传输、针对人工智能类业务类传输需求的上行数据传输。剩余的一个子模型可由以下至少之一携带:上行控制信令、RRC消息、上行数据传输、针对人工智能类业务类传输需求的上行数据传输。Any two of the estimation sub-model, compression sub-model and channel generation sub-model may be simultaneously carried by at least one of the following: uplink control signaling, RRC messages, uplink data transmission, and uplink transmission requirements for artificial intelligence services data transmission. The remaining sub-model can be carried by at least one of the following: uplink control signaling, RRC message, uplink data transmission, and uplink data transmission for the transmission requirements of artificial intelligence services.
这种示例中,所述终端设备可以通过所述第一模型中的估计子模型、压缩子模型以及信道生成子模型对第一信息进行处理得到第二信息;相应的,网络设备由于可以接收到终端设备发来的第二模型,因此,所述网络设备可以基于所述第二模型对所述第二信息进行处理以得到信道信息。其中,所述信道信息具体可以为:信道信息,又或者,可以为信道信息的特征向量信息。In this example, the terminal device can process the first information through the estimation sub-model, compression sub-model and channel generation sub-model in the first model to obtain the second information; correspondingly, the network device can receive The second model sent by the terminal device, therefore, the network device may process the second information based on the second model to obtain channel information. Wherein, the channel information may specifically be: channel information, or may be eigenvector information of the channel information.
进一步地,所述网络设备接收终端设备发来的第一模型中的估计子模型、压缩子模型以及信道生成子模型和第二模 型之后,所述网络设备可以对估计子模型、压缩子模型以及信道生成子模型以及第二模型进行整体评估,在完成整体评估之后的处理与前述示例三相似,这里不再赘述。Further, after the network device receives the estimated sub-model, the compressed sub-model, the channel generation sub-model and the second model in the first model sent by the terminal device, the network device may perform the estimated sub-model, the compressed sub-model and the second model. The channel generation sub-model and the second model perform an overall evaluation, and the processing after the overall evaluation is similar to the third example above, and will not be repeated here.
示例五、Example five,
所述终端设备自身对第一预设模型、第二预设模型以及第三预设模型进行联合训练得到训练后的第一模型、第二模型和第三模型。完成上述训练之后,在所述终端设备发送所述第二模型、以及第一模型的基础上,所述终端设备可以发送所述第三模型。The terminal device itself performs joint training on the first preset model, the second preset model and the third preset model to obtain the trained first model, the second model and the third model. After the above training is completed, on the basis that the terminal device sends the second model and the first model, the terminal device may send the third model.
再具体来说,所述终端设备可以同时向网络设备发送所述第一模型、第二模型和第三模型;或者,所述终端设备可以分别向网络设备发送所述第一模型、第二模型和第三模型;又或者,所述终端设备可以先向网络设备发送所述第一模型、第二模型和第三模型中的任意之二,再向网络设备发送剩余的一个模型。More specifically, the terminal device may send the first model, the second model and the third model to the network device at the same time; or, the terminal device may send the first model and the second model to the network device respectively and the third model; or, the terminal device may first send any two of the first model, the second model, and the third model to the network device, and then send the remaining one model to the network device.
所述第一模型、第二模型和第三模型可由以下至少之一同时携带:上行控制信令、RRC消息、上行数据传输、针对人工智能类业务类传输需求的上行数据传输;The first model, the second model, and the third model may be simultaneously carried by at least one of the following: uplink control signaling, RRC messages, uplink data transmission, and uplink data transmission for the transmission requirements of artificial intelligence services;
或者,所述第一模型、第二模型和第三模型可由以下至少之一分别携带:上行控制信令、RRC消息、上行数据传输、针对人工智能类业务类传输需求的上行数据传输;Alternatively, the first model, the second model and the third model may be respectively carried by at least one of the following: uplink control signaling, RRC message, uplink data transmission, uplink data transmission for artificial intelligence service class transmission requirements;
或者,所述第一模型、第二模型和第三模型中的任意之二以及剩余的一个模型分别由以下至少之一携带:上行控制信令、RRC消息、上行数据传输、针对人工智能类业务类传输需求的上行数据传输。Alternatively, any two of the first model, the second model, the third model, and the remaining one model are respectively carried by at least one of the following: uplink control signaling, RRC messages, uplink data transmission, and artificial intelligence services Uplink data transmission for class transmission requirements.
可选地,本示例中,所述第一模型可以包含估计子模型和压缩子模型,相应的,所述发送第一模型可以指的是,同时或分别发送估计子模型和压缩子模型,关于估计子模型和压缩子模型的携带方式与前述示例相同这里不做重复说明。Optionally, in this example, the first model may include an estimation sub-model and a compression sub-model. Correspondingly, sending the first model may refer to simultaneously or separately sending the estimation sub-model and the compression sub-model. Regarding The carrying manner of the estimation sub-model and the compression sub-model is the same as that of the previous example and will not be repeated here.
可选地,本示例中,所述第一模型可以包含估计子模型、信道生成子模型和压缩子模型,相应的,所述发送第一模型可以指的是,同时或分别发送估计子模型、信道生成子模型和压缩子模型,关于估计子模型、信道生成子模型和压缩子模型的携带方式与前述示例相同这里不做重复说明。Optionally, in this example, the first model may include an estimation submodel, a channel generation submodel, and a compression submodel. Correspondingly, sending the first model may refer to sending the estimation submodel, the The channel generation sub-model and the compression sub-model, the carrying manners of the estimation sub-model, the channel generation sub-model and the compression sub-model are the same as the previous examples and will not be repeated here.
这种示例中,所述终端设备可以通过所述第一模型中的估计子模型以及压缩子模型对第一信息进行处理得到第二信息;相应的,网络设备由于可以接收到终端设备发来的第二模型,因此,所述网络设备可以基于所述第二模型对所述第二信息进行处理以得到信道信息。其中,所述信道信息还可以为信道信息的特征向量信息。In this example, the terminal device can process the first information through the estimation sub-model and the compression sub-model in the first model to obtain the second information; correspondingly, the network device can receive the A second model. Therefore, the network device may process the second information based on the second model to obtain channel information. Wherein, the channel information may also be feature vector information of the channel information.
进一步地,所述网络设备接收终端设备发来的所述第一模型、第二模型和第三模型之后,所述网络设备可以对所述第一模型、第二模型和第三模型进行整体评估,在完成所述第一模型、第二模型和第三模型的整体评估之后,可以决定是否使用本次接收到的所述第一模型、第二模型和第三模型,若整体评估结果较差(比如压缩率较低或者恢复信道信息的准确率较低等等),可以不使用上述所述第一模型、第二模型和第三模型。若网络设备决定不使用上述所述第一模型、第二模型和第三模型,还可以自身对所述第一模型、第二模型和第三模型重新进行联合训练以更新估计子模型、压缩子模型以及第二模型的模型参数,或者,网络设备自己训练得到新的第一模型、新的第二模型和新的第三模型。需要指出,若网络设备重新联合训练或更新估计子模型、压缩子模型以及第二模型,则所述网络设备还需要将新的第一模型、新的第二模型和新的第三模型发送至所述终端设备,或者网络设备将新的第一模型发送至终端设备。Further, after the network device receives the first model, the second model and the third model sent by the terminal device, the network device may perform an overall evaluation on the first model, the second model and the third model , after completing the overall evaluation of the first model, the second model and the third model, it may be decided whether to use the first model, the second model and the third model received this time, if the overall evaluation result is poor (For example, the compression rate is low or the accuracy rate of recovering channel information is low, etc.), the above-mentioned first model, second model and third model may not be used. If the network device decides not to use the above-mentioned first model, second model and third model, it can also re-train the first model, second model and third model by itself to update the estimation sub-model, compression sub-model model and model parameters of the second model, or the network device trains itself to obtain a new first model, a new second model, and a new third model. It should be pointed out that if the network device jointly trains or updates the estimation sub-model, the compression sub-model and the second model, the network device also needs to send the new first model, the new second model and the new third model to The terminal device or the network device sends the new first model to the terminal device.
需要说明的是,由于一个网络设备可以服务多个终端设备,因此,所述网络设备可以保存多个终端设备发来的第一模型、第二模型和第三模型。进而网络设备可以对每一个终端设备发来的第一模型、第二模型和第三模型进行整体评估,可以从中选取整体评估结果最优的目标第一模型、目标第二模型和目标第三模型,然后网络设备可以自己保留使用目标第二模型,将目标第一模型发送给上述多个终端设备。整体评估结果是否最优可以通过压缩率以及恢复准确率等指标来判断。It should be noted that, since one network device can serve multiple terminal devices, the network device can store the first model, the second model and the third model sent by the multiple terminal devices. Furthermore, the network device can conduct an overall evaluation of the first model, the second model, and the third model sent by each terminal device, and can select the target first model, the target second model, and the target third model with the best overall evaluation results. , and then the network device can reserve and use the target second model, and send the target first model to the above-mentioned multiple terminal devices. Whether the overall evaluation result is optimal can be judged by indicators such as compression ratio and recovery accuracy.
可见,通过采用上述方案,可以在终端设备接收到第一信息的情况下,经由第一模型对第一信息进行处理得到第二信息并发送,使得接收端能够通过使用第二模型对该第二信息进行处理以得到的信道信息,由于该第一模型与该第二模型为联合训练得到的。由于第二信息的处理、传输及解析过程是采用联合训练得到的第一模型和第二模型来实现的,因此可以兼顾整个信息处理、传输及解析中的性能要求,保证了网络整体的性能。进一步地,由于上述方案采用了联合训练得到的第一模型和第二模型,因此可以使得第一模型与第二模型之间的功能相互兼容,使得第一模型以及第二模型的性能均可以达到较优的状态,进而基于该第一模型和第二模型来对第二信息的处理、传输及解析过程进行整体处理时,可以保证整体处理的性能,从而保证了网络整体的性能。It can be seen that by adopting the above solution, when the terminal device receives the first information, it can process the first information through the first model to obtain the second information and send it, so that the receiving end can use the second model to obtain the second information. The channel information obtained by processing the information is obtained through joint training of the first model and the second model. Since the processing, transmission, and analysis of the second information are realized by using the first model and the second model obtained through joint training, the performance requirements in the entire information processing, transmission, and analysis can be taken into account, and the overall performance of the network is guaranteed. Furthermore, since the above solution uses the first model and the second model obtained through joint training, the functions between the first model and the second model can be made compatible with each other, so that the performance of the first model and the second model can reach In a better state, when the processing, transmission and analysis process of the second information is processed as a whole based on the first model and the second model, the performance of the whole processing can be guaranteed, thereby ensuring the performance of the whole network.
图11是根据本申请一实施例的信息处理方法1100的示意性流程图。该方法可选地可以应用于图1所示的系统,但并不仅限于此。该方法包括以下内容的至少部分内容。Fig. 11 is a schematic flowchart of an information processing method 1100 according to an embodiment of the present application. The method can optionally be applied to the system shown in Fig. 1, but is not limited thereto. The method includes at least some of the following.
S1110、网络设备发送第一信息。S1110. The network device sends first information.
S1120、所述网络设备接收第二信息;其中,所述第二信息为所述第一信息经由第一模型处理得到的。S1120. The network device receives second information; wherein, the second information is obtained by processing the first information through a first model.
S1130、所述网络设备基于第二模型对所述第二信息进行处理得到信道信息;其中,所述第一模型和第二模型为联合训练得到的。S1130. The network device processes the second information based on a second model to obtain channel information; wherein, the first model and the second model are obtained through joint training.
上述S1110中,所述第一信息可以为参考信号,具体来说,所述第一信息可以为当前信道的参考信号,比如当前信道的下行参考信号。所述下行参考信号可以包括CSI-RS、DMRS、PT-RS中至少一种。In the above S1110, the first information may be a reference signal, specifically, the first information may be a reference signal of the current channel, such as a downlink reference signal of the current channel. The downlink reference signal may include at least one of CSI-RS, DMRS, and PT-RS.
所述第一信息可以分布在第一维度和/或第二维度。The first information may be distributed in the first dimension and/or the second dimension.
其中,所述第一维度为时域维度;所述第一信息分布在所述时域维度中的至少一个时间单元内。所述至少一个时间单元中每个时间单元可以包含以下之一:1个时隙、1个正交频分复用(OFDM,Orthogonal Frequency Division Multiplexing)符号。举例来说,所述第一信号为下行参考信号,所述下行参考信号在时域维度上可以分布在1个时隙上,或者所述下行参考信号在时域维度上可以分布在2个或4个时隙上。Wherein, the first dimension is a time domain dimension; the first information is distributed in at least one time unit in the time domain dimension. Each time unit in the at least one time unit may include one of the following: 1 time slot and 1 Orthogonal Frequency Division Multiplexing (OFDM, Orthogonal Frequency Division Multiplexing) symbol. For example, the first signal is a downlink reference signal, and the downlink reference signal may be distributed in one time slot in the time domain dimension, or the downlink reference signal may be distributed in two or on 4 time slots.
所述第二维度为频域维度;所述第一信息分布在所述频域维度中的至少一个频域资源上;其中,每个频域资源可以为以下之一:一个RB、一个子载波。举例来说,所述第一信号为下行参考信号,所述下行参考信号在频域维度上可以分布在1个RB上,或者所述下行参考信号在时域维度上可以分布在2个或4个RB上。The second dimension is a frequency domain dimension; the first information is distributed on at least one frequency domain resource in the frequency domain dimension; wherein, each frequency domain resource can be one of the following: one RB, one subcarrier . For example, the first signal is a downlink reference signal, and the downlink reference signal may be distributed in 1 RB in the frequency domain dimension, or the downlink reference signal may be distributed in 2 or 4 RBs in the time domain dimension. on RBs.
上述第一维度与第二维度可以合并使用,也就是说,所述第一信息可以分布在第一维度以及第二维度上;比如,所 述第一信息在频域维度中分布在a个RB上,在时域维度中分布在b个时隙中;a和b均为正整数。举例来说,所述第一信息为下行参考信号,该下行参考信号在频域上可以分布在4个RB上,在时域维度上可以分布在6个时隙中。The first dimension and the second dimension above can be used in combination, that is, the first information can be distributed on the first dimension and the second dimension; for example, the first information can be distributed on a RBs in the frequency domain dimension , distributed in b time slots in the time domain dimension; both a and b are positive integers. For example, the first information is a downlink reference signal, and the downlink reference signal may be distributed in 4 RBs in the frequency domain, and may be distributed in 6 time slots in the time domain dimension.
再进一步地,所述第一信息还可以表示为复数,也就是说,所述第一信息还分布在第三维度;所述第三维度为复数维度;所述第一信息包括第一信息样本的实部和第一信息样本的虚部。比如,所述第一信息的实部分布在频域资源的a个RB上以及时域资源的b个时隙上,所述第一信息的虚部分布在频域资源的a个RB上以及时域资源的b个时隙上。Still further, the first information can also be expressed as a complex number, that is, the first information is also distributed in the third dimension; the third dimension is a complex dimension; the first information includes the first information sample The real part of and the imaginary part of the first information sample. For example, the real part of the first information is distributed on the a RBs of the frequency domain resources and the b time slots of the time domain resources, and the imaginary part of the first information is distributed on the a RBs of the frequency domain resources. b time slots of the time domain resource.
可选地,所述网络设备在发送第一信息之前,还可以先发送的配置信息,该配置信息中可以配置供终端设备测量用的第一信息。以该第一信息为下行参考信号为例该配置信息可以是配置终端设备测量SSB或者CSI-RS等等。Optionally, before sending the first information, the network device may also send configuration information first, and the configuration information may be configured with first information for terminal device measurement. Taking the first information as an example of a downlink reference signal, the configuration information may be to configure the terminal device to measure SSB or CSI-RS and so on.
在完成S1110之后,所述网络设备执行S1120。其中,所述第二信息可以由以下信息中之一携带:随机接入过程中包含的信息,无线资源控制(RRC,Radio Resource Control)信令,上行控制信息(UCI,Uplink Control Information)。所述随机接入过程中包含的信息,包括以下之一:两步随机接入过程中的消息A;四步随机接入过程中的Msg1;四步随机接入过程中的Msg3。After completing S1110, the network device executes S1120. Wherein, the second information may be carried by one of the following information: information included in the random access process, radio resource control (RRC, Radio Resource Control) signaling, and uplink control information (UCI, Uplink Control Information). The information contained in the random access process includes one of the following: message A in the two-step random access process; Msg1 in the four-step random access process; Msg3 in the four-step random access process.
在一种示例中,S1120中所述第二信息为信道压缩信息;所述第二模型用于对所述信道压缩信息进行解压缩处理,得到信道信息。In an example, the second information in S1120 is channel compression information; the second model is used to decompress the channel compression information to obtain channel information.
相应的S1130中所述网络设备基于第二模型对所述第二信息进行处理得到信道信息,包括:所述网络设备将所述信道压缩信息输入所述第二模型,得到所述第二模型输出的所述信道信息。Correspondingly, in S1130, the network device processes the second information based on the second model to obtain channel information, including: the network device inputs the channel compression information into the second model, and obtains the second model output The channel information of .
所述第二信息为所述第一信息经由第一模型处理得到的。也就是说,所述第一模型用于基于输入的第一信息进行处理得到信道压缩信息。The second information is obtained by processing the first information through the first model. That is to say, the first model is used to process the input first information to obtain channel compression information.
需要指出的是,所述第一模型还可以称为编码模型或编码网络等,只要输入信息为第一信息以及输出信息为信道压缩信息的模型或神经网络均在本实施例保护范围内。It should be pointed out that the first model may also be called an encoding model or an encoding network, as long as the input information is the first information and the output information is the channel compression information, the model or neural network is within the protection scope of this embodiment.
所述信道信息可以用于表征基于所述第一信息所得到的信道质量、或信道响应、或信道状态、或信道估计结果。The channel information may be used to characterize the channel quality, or channel response, or channel state, or channel estimation result obtained based on the first information.
所述信道信息可以采用T个维度的矩阵表示,T为大于等于2的整数。The channel information may be represented by a matrix of T dimensions, where T is an integer greater than or equal to 2.
所述T个维度的矩阵具体可以为M×N的二维矩阵;其中,M表示在第四维度下的第一粒度的数量,N表示在第五维度下的第二粒度的数量;M和N均为正整数。也就是说,所述信道信息可以由大小为M×N的二维矩阵构成,其在第四维度上有M个第一粒度,在第五维度上有N个第二粒度;上述M和N可以相等也可以不相等。所述二维矩阵内具体的数值指示代表信道质量某一个第一粒度下接收的信号强度,这里所述二维矩阵内的数值的单位可以是dBm,或所述二维矩阵内的数值没有单位而是归一化之后所得到的数值。此外,也可以将M×N的二维矩阵合成成为1×(M×N)大小或者(M×N)×1大小的一维数据,具体变换是可以是先第四维度再第五维度,也可以是先第五维度再第四维度,本实施例不对其进行限定。The matrix of the T dimensions may specifically be a two-dimensional matrix of M×N; wherein, M represents the number of first granularities in the fourth dimension, and N represents the number of second granularities in the fifth dimension; M and N are all positive integers. That is to say, the channel information may be composed of a two-dimensional matrix with a size of M×N, which has M first granularities in the fourth dimension and N second granularities in the fifth dimension; the above M and N May or may not be equal. The specific numerical indication in the two-dimensional matrix represents the received signal strength at a certain first granularity of the channel quality, where the unit of the numerical value in the two-dimensional matrix may be dBm, or the numerical value in the two-dimensional matrix has no unit It is the value obtained after normalization. In addition, the two-dimensional matrix of M×N can also be synthesized into one-dimensional data of size 1×(M×N) or (M×N)×1. The specific transformation can be the fourth dimension first and then the fifth dimension. It may also be the fifth dimension first and then the fourth dimension, which is not limited in this embodiment.
所述第四维度可以为频域维度;所述第一粒度包含以下之一:L1个资源块RB,L2个子载波;L1和L2为正整数。或者,所述第四维度可以为时域维度;所述第一粒度包含以下之一:K1个微秒、K2个符号长度、K3个符号的采样点个数;K1、K2和K3为正整数。所述符号为正交频分复用符号(OFDM,Orthogonal Frequency Division Multiplexing)。这里,所述第四维度为时域维度的时候,所述第一粒度还可以称为时延粒度。The fourth dimension may be a frequency domain dimension; the first granularity includes one of the following: L1 resource blocks RB, L2 subcarriers; L1 and L2 are positive integers. Alternatively, the fourth dimension may be a time-domain dimension; the first granularity includes one of the following: K1 microseconds, K2 symbol lengths, and the number of sampling points of K3 symbols; K1, K2, and K3 are positive integers . The symbols are Orthogonal Frequency Division Multiplexing (OFDM, Orthogonal Frequency Division Multiplexing). Here, when the fourth dimension is a time domain dimension, the first granularity may also be called a delay granularity.
举例来说,当所述第四维度是频域维度时,所述第一粒度可以是L1个RB(L1大于等于1,例如2RB,4RB,8RB),则所述信道信息在频域维度上的分布范围是M×L1个RB所对应的频域范围;或者所述第一粒度可以是L2个子载波(L2大于1,例如4个子载波,6个子载波,18个子载波),则所述信道信息在频域维度上的分布是M×L2个子载波对应的频域范围。当所述第四维度是时域维度时,所述第一粒度可以是时延粒度,例如一个第一粒度是K1个微秒、或者K2个符号长度、或者K3个符号的采样点个数,这里所述符号可以是一个OFDM符号;当所述第四维度是时域维度且所述第一粒度为K1个微秒时,所述信道信息在时域维度上的分布范围是M×K1个微秒对应的时域范围。For example, when the fourth dimension is the frequency domain dimension, the first granularity may be L1 RBs (L1 is greater than or equal to 1, such as 2RB, 4RB, 8RB), then the channel information in the frequency domain dimension The distribution range is the frequency domain range corresponding to M×L1 RBs; or the first granularity can be L2 subcarriers (L2 is greater than 1, such as 4 subcarriers, 6 subcarriers, and 18 subcarriers), then the channel The distribution of information in the frequency domain dimension is the frequency domain range corresponding to M×L2 subcarriers. When the fourth dimension is a time-domain dimension, the first granularity may be a delay granularity, for example, a first granularity is the number of sampling points of K1 microseconds, or K2 symbol lengths, or K3 symbols, The symbol here may be an OFDM symbol; when the fourth dimension is the time domain dimension and the first granularity is K1 microseconds, the distribution range of the channel information on the time domain dimension is M×K1 The time domain range corresponding to microseconds.
所述第五维度可以为空间域维度;相应的,所述第二粒度为一对收发天线或到达角度的间隔。也就是说,所述第五维度为所述空间域维度,具体地可以是天线维度,所述第二粒度可以是一对收发天线。或者,所述第五维度为空间域维度,具体的可以是角度域维度,所述第二粒度可以是到达角度的间隔大小。The fifth dimension may be a space domain dimension; correspondingly, the second granularity is an interval between a pair of transmitting and receiving antennas or an angle of arrival. That is to say, the fifth dimension is the space domain dimension, specifically, it may be an antenna dimension, and the second granularity may be a pair of transmitting and receiving antennas. Alternatively, the fifth dimension is a space domain dimension, specifically, an angle domain dimension, and the second granularity may be an interval of arrival angles.
再进一步地,表征所述信道信息的二维矩阵中的第ij个位置的数值用于表示在所述第四维度下的第i个第一粒度以及第五维度的第j个第二粒度下的信道质量;i和j均为正整数。也就是说,用于表示所述信道信息的所述二维矩阵中某一个位置处的数值(或称为指示值)代表了在第四维度以及第五维度这样的组合下的信道质量。其中,所述信道质量可以采用信号强度值来表征;所述信号强度值的单位可以是dBm,或所述信号强度值没有单位而是归一化之后所得到的数值。Still further, the value of the ijth position in the two-dimensional matrix representing the channel information is used to represent the i-th first granularity in the fourth dimension and the j-th second granularity in the fifth dimension The channel quality of ; i and j are both positive integers. That is to say, a numerical value (or an indicator value) at a certain position in the two-dimensional matrix used to represent the channel information represents the channel quality under the combination of the fourth dimension and the fifth dimension. Wherein, the channel quality may be characterized by a signal strength value; the unit of the signal strength value may be dBm, or the signal strength value has no unit but a value obtained after normalization.
所述T个维度中还可以包括第六维度。所述T个维度的矩阵可以为M×N×W的三维矩阵;其中,M表示在第四维度下的第一粒度的数量,N表示在第五维度下的第二粒度的数量,W表示在第六维度下的第三粒度的数量;M、N和W均为正整数。The T dimensions may also include a sixth dimension. The matrix of T dimensions may be a three-dimensional matrix of M×N×W; wherein, M represents the number of first granularities in the fourth dimension, N represents the number of second granularities in the fifth dimension, and W represents The number of third granularities under the sixth dimension; M, N and W are all positive integers.
示例性的,所述第六维度可以为复数维度,所述第三粒度为1,在所述复数维度下的第三粒度的数量W为2。举例来说,所述第四维度表示时域维度的时候,所述第一粒度为时延粒度;所述第五维度为空间域维度具体为角度维度,所述第二粒度为到达角度的间隔;所述第六维度为复数维度,W为2,k为1表示实部,k为2表示虚部。i=4、j=5、k=1时,上述三维矩阵的第ijk个位置处的数值(或指示值)表示第5个空间粒度(例如到达角度的间隔)内的第4个时延粒度上的信道质量的实部。若i=4、j=5、k=2时,则上述三维矩阵的第ijk个位置处的数值(或指示值)表示第5个空间粒度内的第4个时延粒度上的信道质量的虚部。Exemplarily, the sixth dimension may be a complex dimension, the third granularity is 1, and the quantity W of the third granularity under the complex dimension is 2. For example, when the fourth dimension represents the time domain dimension, the first granularity is the delay granularity; the fifth dimension is the spatial domain dimension, specifically the angle dimension, and the second granularity is the interval of arrival angles ; The sixth dimension is a complex dimension, W is 2, k is 1 to indicate the real part, and k is 2 to indicate the imaginary part. When i=4, j=5, k=1, the value (or indicator value) at the ijkth position of the above-mentioned three-dimensional matrix represents the fourth delay granularity in the fifth spatial granularity (such as the interval of arrival angle) The real part of the channel quality on . If i=4, j=5, k=2, then the value (or indicator value) at the ijkth position of the above-mentioned three-dimensional matrix represents the channel quality on the 4th delay granularity in the 5th spatial granularity imaginary part.
后续的描述中,为了描述简单起见,以第四维度和第五维度构成的二维矩阵来举例说明上述信道信息,但需要明确的是上述信道信息的矩阵的维度不局限在二维。In the subsequent description, for the sake of simplicity, the above channel information is illustrated by using a two-dimensional matrix formed by the fourth dimension and the fifth dimension. However, it should be clarified that the dimension of the above channel information matrix is not limited to two dimensions.
在另一种示例中,S1120中的所述第二信息为信道压缩信息;所述信道压缩信息包含压缩的信道估计信息的特征向量信息。所述信道信息为信道信息的特征向量信息;所述第二模型用于对所述压缩的信道估计信息的特征向量信息进行解压缩处理,得到信道信息的特征向量信息。其中,所述信道信息的特征向量信息中包含R组特征向量序列信息;R为正整数。In another example, the second information in S1120 is channel compression information; the channel compression information includes eigenvector information of compressed channel estimation information. The channel information is eigenvector information of the channel information; the second model is used to decompress the compressed eigenvector information of the channel estimation information to obtain the eigenvector information of the channel information. Wherein, the eigenvector information of the channel information includes R groups of eigenvector sequence information; R is a positive integer.
相应的,在S1130中所述网络设备基于第二模型对所述第二信息进行处理得到信道信息,可以包括:Correspondingly, in S1130, the network device processes the second information based on the second model to obtain channel information, which may include:
所述网络设备将所述压缩的信道估计信息的特征向量信息输入所述第二模型,得到所述第二模型输出的所述信道信息的特征向量信息。The network device inputs the eigenvector information of the compressed channel estimation information into the second model, and obtains the eigenvector information of the channel information output by the second model.
所述第二信息可以为信道压缩信息;该信道压缩信息包括压缩的信道估计信息的特征向量信息。也就是说,所述第一模型用于对输入的所述第一信息进行处理得到压缩的信道估计信息的特征向量信息。The second information may be channel compression information; the channel compression information includes eigenvector information of compressed channel estimation information. That is to say, the first model is used to process the input first information to obtain eigenvector information of compressed channel estimation information.
其中,所述信道信息的特征向量信息包含R组特征向量序列信息;R为正整数。比如,R可以为1,则所述信道信息的特征向量信息包含1组特征向量序列信息。R可以为2,则所述信道信息的特征向量信息包含2组特征向量序列信息。上述R的取值可以根据实际情况来确定,又或者可以是在所述第一模型训练的时候指定。Wherein, the eigenvector information of the channel information includes R groups of eigenvector sequence information; R is a positive integer. For example, R may be 1, then the eigenvector information of the channel information includes a set of eigenvector sequence information. R may be 2, then the eigenvector information of the channel information includes 2 sets of eigenvector sequence information. The above value of R may be determined according to the actual situation, or may be specified during the training of the first model.
上述R组特征向量序列信息中,每一组特征向量序列信息中可以包括预设长度的特征序列。其中,不同组的特征向量序列信息所包含的特征序列的长度相同。所述预设长度可以根据实际情况设置或者可以是在训练的时候设置的,比如,可以为16、32、48、64、128、256中任意之一,当然还可以更长或更短,本实施例不对所述预设长度的全部可能的取值进行穷举。结合图7举例来说,预设长度为32(但是可以为比特bit),R为4,也就是信道信息的特征向量信息包含了4组特征向量序列信息,其中,每一组特征向量序列信息中包含了长度为32的特征序列。In the above R sets of feature vector sequence information, each set of feature vector sequence information may include a feature sequence of a preset length. Wherein, the lengths of the feature sequences included in the feature vector sequence information of different groups are the same. The preset length can be set according to the actual situation or can be set during training, for example, it can be any one of 16, 32, 48, 64, 128, 256, and of course it can be longer or shorter. The embodiment does not exhaustively list all possible values of the preset length. In conjunction with FIG. 7, for example, the preset length is 32 (but it can be a bit), and R is 4, that is, the feature vector information of the channel information includes 4 sets of feature vector sequence information, wherein each set of feature vector sequence information contains a feature sequence of length 32.
本示例中所述估计子模型所输出的信道估计信息与所述第二模型所输出的信道信息可以是不同的,所述估计子模型所输出的信道估计信息具体可以为信道信息的矩阵,比如采用T个维度的矩阵来表示;所述第二模型所输出的信道信息可以为信道信息的特征向量信息,比如可以包含R组特征向量序列信息。当然,所述第二模型所输出的信道信息与所述估计子模型所输出的信道估计信息也可能是相同的,比如可以都是信道信息的矩阵。In this example, the channel estimation information output by the estimation sub-model may be different from the channel information output by the second model, and the channel estimation information output by the estimation sub-model may specifically be a matrix of channel information, such as It is represented by a matrix of T dimensions; the channel information output by the second model may be eigenvector information of the channel information, for example, may include R groups of eigenvector sequence information. Of course, the channel information output by the second model and the channel estimation information output by the estimation sub-model may also be the same, for example, both may be a matrix of channel information.
以上对网络设备如何使用所述第二模型进行了详细说明,关于所述网络设备得到所述第二模型的方式可以有以下两种:第一种方式:所述网络设备直接获取的;第二种方式:所述网络设备训练得到的。下面针对这两种方式分别进行说明:The above describes in detail how the network device uses the second model. There are two ways for the network device to obtain the second model: the first way: the network device obtains it directly; the second One way: obtained by the network device training. The two methods are described below:
第一种方式、所述网络设备接收所述第二模型。In a first manner, the network device receives the second model.
具体来说,所述网络设备接收电子设备发送的所述第二模型;比如,可以是所述网络设备可以接收电子设备发送的所述第二模型的模型参数。Specifically, the network device receives the second model sent by the electronic device; for example, the network device may receive model parameters of the second model sent by the electronic device.
这里,所述电子设备可以为联合训练得到所述第一模型和所述第二模型的电子设备。Here, the electronic device may be an electronic device that obtains the first model and the second model through joint training.
示例性的,所述电子设备可以为所述终端设备,此时,所述网络设备可以为服务所述终端设备的接入网设备,比如基站、eNB、gNB等等。或者,所述电子设备可以为除了所述终端设备之外的其他设备,比如,可以是服务器、或台式机、或笔记本等等其他具备数据处理能力的设备,本实施例不进行穷举。Exemplarily, the electronic device may be the terminal device, and in this case, the network device may be an access network device serving the terminal device, such as a base station, eNB, gNB, and so on. Alternatively, the electronic device may be other devices than the terminal device, for example, it may be a server, or a desktop computer, or a notebook, or other devices capable of data processing, which are not exhaustive in this embodiment.
其中,在所述电子设备为所述终端设备的情况下,具体可以为:所述网络设备接收所述终端设备设备发送的所述第二模型。其中,所述第二模型由以下至少之一携带:上行控制信令、RRC消息、上行数据传输、针对人工智能类业务类传输需求的上行数据传输。Wherein, when the electronic device is the terminal device, specifically, the network device receives the second model sent by the terminal device. Wherein, the second model is carried by at least one of the following: uplink control signaling, RRC message, uplink data transmission, and uplink data transmission for the transmission requirements of artificial intelligence services.
在所述电子设备为其他设备的情况下,该第二模型(或所述第二模型的模型参数)可以是通过有线连接方式传输的、或其他无线连接方式传输的。比如,电子设备通过与网络设备之间的有线连接将所述第二模型(或所述第二模型的模型参数)传输给所述网络设备。或者,电子设备通过与网络设备之间的其他无线连接将所述第二模型(或所述第二模型的模型参数)传输给所述网络设备;其中,所述其他无线连接方式可以是蓝牙或WIFI等等,这里不进行穷举。In the case that the electronic device is other devices, the second model (or the model parameters of the second model) may be transmitted through a wired connection or other wireless connection. For example, the electronic device transmits the second model (or the model parameters of the second model) to the network device through a wired connection with the network device. Alternatively, the electronic device transmits the second model (or the model parameters of the second model) to the network device through other wireless connections with the network device; wherein, the other wireless connection methods may be bluetooth or WIFI, etc., are not exhaustive here.
在上述处理的基础上,所述网络设备还可以接收所述第一模型。比如,可以是所述网络设备可以接收电子设备发送的所述第一模型的模型参数。Based on the above processing, the network device may also receive the first model. For example, the network device may receive the model parameters of the first model sent by the electronic device.
所述电子设备可以为所述终端设备,此时,所述网络设备可以为服务所述终端设备的接入网设备,比如基站、eNB、gNB等等。或者,所述电子设备可以为除了所述终端设备之外的其他设备,比如,可以是服务器、或台式机、或笔记本等等其他具备数据处理能力的设备,本实施例不进行穷举。The electronic device may be the terminal device, and in this case, the network device may be an access network device serving the terminal device, such as a base station, eNB, gNB, and so on. Alternatively, the electronic device may be other devices than the terminal device, for example, it may be a server, or a desktop computer, or a notebook, or other devices capable of data processing, which are not exhaustive in this embodiment.
其中,在所述电子设备为所述终端设备的情况下,具体可以为:所述网络设备接收所述终端设备设备发送的所述第一模型。其中,所述第一模型由以下至少之一携带:上行控制信令、RRC消息、上行数据传输、针对人工智能类业务类传输需求的上行数据传输。Wherein, when the electronic device is the terminal device, specifically, the network device receives the first model sent by the terminal device. Wherein, the first model is carried by at least one of the following: uplink control signaling, RRC message, uplink data transmission, and uplink data transmission for the transmission requirements of artificial intelligence services.
在所述电子设备为其他设备的情况下,该第一模型(或所述第一模型的模型参数)可以是通过有线连接方式传输的、或其他无线连接方式传输的。比如,电子设备通过与网络设备之间的有线连接将所述第一模型(或所述第一模型的模型参数)传输给所述网络设备。或者,电子设备通过与网络设备之间的其他无线连接将所述第一模型(或所述第一模型的模型参数)传输给所述网络设备;其中,所述其他无线连接方式可以是蓝牙或WIFI等等,这里不进行穷举。In the case that the electronic device is other devices, the first model (or the model parameters of the first model) may be transmitted through a wired connection or other wireless connection. For example, the electronic device transmits the first model (or the model parameters of the first model) to the network device through a wired connection with the network device. Alternatively, the electronic device transmits the first model (or the model parameters of the first model) to the network device through other wireless connections with the network device; wherein, the other wireless connection methods may be Bluetooth or WIFI, etc., are not exhaustive here.
上述第一模型与前述第二模型可以是同时接收的,或者,上述第一模型与前述第二模型可以是分别接收的,本实施例不对其进行限定。The foregoing first model and the foregoing second model may be received at the same time, or the foregoing first model and the foregoing second model may be received separately, which is not limited in this embodiment.
上述第一模型可以包括:估计子模型和压缩子模型;或者,所述第一模型可以包括:估计子模型、信道生成子模型和压缩子模型。The foregoing first model may include: an estimation submodel and a compression submodel; or, the first model may include: an estimation submodel, a channel generation submodel, and a compression submodel.
需要指出的是,所述第一模型包含不同子模型的时候,不同子模型的功能可能存在部分不同,分别来说:It should be pointed out that when the first model includes different sub-models, the functions of different sub-models may be partially different, respectively:
上述第一模型可以包括:估计子模型和压缩子模型;The above-mentioned first model may include: an estimation sub-model and a compression sub-model;
其中,所述估计子模型用于对所述第一信息进行信道估计得到信道估计信息;所述压缩子模型用于对所述信道估计信息进行压缩得到所述第二信息。Wherein, the estimation sub-model is used to perform channel estimation on the first information to obtain channel estimation information; the compression sub-model is used to compress the channel estimation information to obtain the second information.
所述估计子模型还可以称为信道估计子模型或称为信道估计子神经网络,该估计子模型可采用全连接网络、卷积神经网络、残差网络、自注意力机制网络中的一种或者多种网络结构构建。所述压缩子模型可以称为信道压缩子模型或称为信道压缩子神经网络,该压缩子模型可采用全连接网络、卷积神经网络、残差网络、自注意力机制网络中的一种或者多种网络结构构建。The estimation sub-model can also be called a channel estimation sub-model or a channel estimation sub-neural network, and the estimation sub-model can use one of a fully connected network, a convolutional neural network, a residual network, and a self-attention mechanism network. Or a variety of network structure construction. The compression sub-model may be called a channel compression sub-model or a channel compression sub-neural network, and the compression sub-model may use one of a fully connected network, a convolutional neural network, a residual network, a self-attention mechanism network, or A variety of network structure construction.
所述估计子模型采用的估计的方法可以包括有最小均方误差(MMSE)等算法。The estimation method adopted by the estimation sub-model may include algorithms such as minimum mean square error (MMSE).
所述压缩子模型可以对输入信息的数据量进行压缩。所述压缩子模型的输出信息与输入信息之间的压缩率可以为训练时确定的,比如,压缩率可以为千分之五、千分之二、百分之十等等,这里不进行穷举。The compression sub-model can compress the data volume of the input information. The compression rate between the output information of the compressed sub-model and the input information can be determined during training, for example, the compression rate can be 5/1000, 2/1000, 10%, etc. lift.
上述第一模型可以包括:估计子模型、信道生成子模型和压缩子模型;The above-mentioned first model may include: an estimation sub-model, a channel generation sub-model and a compression sub-model;
所述估计子模型用于对所述第一信息进行信道估计得到信道估计信息;The estimation sub-model is used to perform channel estimation on the first information to obtain channel estimation information;
所述信道生成子模型用于对所述信道估计信息进行特征分解得到信道估计信息的特征向量信息;The channel generation sub-model is used to perform eigendecomposition on the channel estimation information to obtain eigenvector information of the channel estimation information;
所述压缩子模型用于对所述信道估计信息的特征向量信息进行压缩得到压缩的信道估计信息的特征向量信息。The compression sub-model is used to compress the eigenvector information of the channel estimation information to obtain compressed eigenvector information of the channel estimation information.
所述估计子模型的功能与前述实施例相同,不做重复说明。The function of the estimation sub-model is the same as that of the foregoing embodiment, and no repeated description is given.
所述信道生成子模型中进行特征分解的方式可以为奇异值分解(SVD,Singular Value Decomposition)方式。比如,在所述信道生成子模型的处理中,可以将输入的信道信息进行SVD特征分解,得到特征分解之后的信道估计信息的特征向量信息。所述信道信息可以为采用矩阵表示,具体的说明与前述实施例相同,不再赘述。The method of performing eigendecomposition in the channel generation sub-model may be a singular value decomposition (SVD, Singular Value Decomposition) method. For example, in the processing of the channel generation sub-model, the input channel information may be subjected to SVD eigendecomposition to obtain eigenvector information of channel estimation information after eigendecomposition. The channel information may be represented by a matrix, and the specific description is the same as that of the foregoing embodiment, and will not be repeated here.
所述压缩子模型可以为对输入信息的数据量进行压缩。所述压缩子模型的输出信息与输入信息之间的压缩率可以为训练时确定的,比如,压缩率可以为千分之五、千分之二、百分之十等等,这里不进行穷举。The compression sub-model may be to compress the data volume of the input information. The compression rate between the output information of the compressed sub-model and the input information can be determined during training, for example, the compression rate can be 5/1000, 2/1000, 10%, etc. lift.
本示例中所述估计子模型所输出的信道估计信息与所述第二模型所输出的信道信息可以是不同的,所述估计子模型所输出的信道估计信息具体可以为信道信息的矩阵,比如采用T个维度的矩阵来表示;所述第二模型所输出的信道信息可以为信道信息的特征向量信息,比如可以包含R组特征向量序列信息。当然,本示例中所述第二模型所输出的信道信息与所述估计子模型所输出的信道估计信息也可能是相同的,比如可以都是信道信息的矩阵。以下实施例中涉及到估计子模型输出的信道估计信息,以及第二模型输出的信道信息都可以是相同或不同的,不再重复说明。In this example, the channel estimation information output by the estimation sub-model may be different from the channel information output by the second model, and the channel estimation information output by the estimation sub-model may specifically be a matrix of channel information, such as It is represented by a matrix of T dimensions; the channel information output by the second model may be eigenvector information of the channel information, for example, may include R groups of eigenvector sequence information. Of course, in this example, the channel information output by the second model and the channel estimation information output by the estimation sub-model may also be the same, for example, both may be a matrix of channel information. In the following embodiments, the channel estimation information output by the estimation sub-model and the channel information output by the second model may be the same or different, and the description will not be repeated.
上述实施例中所述网络设备直接可以接收所述第一模型,实际处理中,由于所述第一模型可以包含多个子模型,因此,所述网络设备还可以分别接收多个子模型,然后将接收到的所述多个子模型合并得到所述第一模型。In the above embodiment, the network device can directly receive the first model. In actual processing, since the first model can contain multiple sub-models, the network device can also receive multiple sub-models respectively, and then receive The obtained multiple sub-models are combined to obtain the first model.
在一种情况中,所述第一模型包含有估计子模型以及压缩子模型。相应的,所述网络设备接收估计子模型以及压缩子模型;所述网络设备基于所述估计子模型以及所述压缩子模型,生成所述第一模型。具体可以是:所述网络设备接收所述电子设备发送的估计子模型的模型参数以及压缩子模型的模型参数;所述网络设备基于所述估计子模型的模型参数以及所述压缩子模型的模型参数得到所述第一模型。In one case, the first model includes an estimation sub-model and a compression sub-model. Correspondingly, the network device receives the estimated sub-model and the compressed sub-model; the network device generates the first model based on the estimated sub-model and the compressed sub-model. Specifically, the network device receives the model parameters of the estimated sub-model and the model parameters of the compressed sub-model sent by the electronic device; the network device based on the model parameters of the estimated sub-model and the model of the compressed sub-model parameters to obtain the first model.
这里,所述网络设备可以为同时接收所述电子设备发送的估计子模型以及压缩子模型;又或者,可以分别接收所述电子设备发送的估计子模型以及压缩子模型,比如,可以为先接收所述电子设备发送的估计子模型再接收所述电子设备发送的压缩子模型,或者,先接收所述电子设备发送的压缩子模型再接收所述电子设备发送的估计子模型。Here, the network device may receive the estimated sub-model and the compressed sub-model sent by the electronic device at the same time; or, it may receive the estimated sub-model and the compressed sub-model sent by the electronic device separately, for example, it may be received first The estimated sub-model sent by the electronic device then receives the compressed sub-model sent by the electronic device, or first receives the compressed sub-model sent by the electronic device and then receives the estimated sub-model sent by the electronic device.
在所述电子设备为所述终端设备的情况下,可以是由以下信息之一同时携带或分别携带估计子模型(或估计子模型的模型参数)以及压缩子模型(或压缩子模型的模型参数):上行控制信令、RRC消息、上行数据传输、针对人工智能类业务类传输需求的上行数据传输。In the case where the electronic device is the terminal device, the estimated sub-model (or the model parameters of the estimated sub-model) and the compressed sub-model (or the model parameters of the compressed sub-model) may be carried simultaneously or separately by one of the following information ): Uplink control signaling, RRC messages, uplink data transmission, and uplink data transmission for artificial intelligence business transmission requirements.
在所述电子设备为其他设备的情况下,电子设备可以通过与网络设备之间的有线连接将上述估计子模型(或估计子模型的模型参数)以及压缩子模型(或压缩子模型的模型参数)同时发送或分别发送给所述网络设备。或者,电子设备通过与网络设备之间的其他无线连接将上述估计子模型(或估计子模型的模型参数)以及压缩子模型(或压缩子模型的模型参数)同时发送或分别发送给所述网络设备;其中,所述其他无线连接方式可以是蓝牙或WIFI等等,这里不进行穷举。In the case that the electronic device is other devices, the electronic device can use the above-mentioned estimated sub-model (or the model parameters of the estimated sub-model) and the compressed sub-model (or the model parameters of the compressed sub-model) through a wired connection with the network device. ) are sent simultaneously or separately to the network devices. Alternatively, the electronic device sends the above-mentioned estimated sub-model (or the model parameters of the estimated sub-model) and the compressed sub-model (or the model parameters of the compressed sub-model) to the network simultaneously or separately through other wireless connections with the network device device; wherein, the other wireless connection methods may be bluetooth or WIFI, etc., which are not exhaustive here.
在又一种情况中,所述第一模型包含有估计子模型、信道生成子模型以及压缩子模型。In yet another case, the first model includes an estimation sub-model, a channel generation sub-model, and a compression sub-model.
这种情况下,所述网络设备接收估计子模型、压缩子模型以及信道生成子模型;所述网络设备基于所述估计子模型、所述压缩子模型以及所述信道生成子模型,生成所述第一模型。具体可以是:所述网络设备接收所述电子设备发送的估计子模型的模型参数、压缩子模型的模型参数以及信道生成子模型的模型参数;所述网络设备基于所述估计子模型的模型参数、压缩子模型的模型参数以及信道生成子模型的模型参数得到所述第一模型。In this case, the network device receives the estimation sub-model, the compression sub-model and the channel generation sub-model; the network device generates the first model. Specifically, the network device receives the model parameters of the estimated sub-model, the model parameters of the compression sub-model and the model parameters of the channel generation sub-model sent by the electronic device; the network device based on the model parameters of the estimated sub-model , model parameters of the compression sub-model and model parameters of the channel generation sub-model to obtain the first model.
这里,所述网络设备可以同时接收所述电子设备发送的估计子模型、压缩子模型以及信道生成子模型。或者,可以分别接收所述电子设备发送的估计子模型、压缩子模型以及信道生成子模型,比如,估计子模型、压缩子模型以及信道生成子模型均分别接收;又或者,估计子模型、压缩子模型以及信道生成子模型中任意两个与剩余一个分别接收。举例来说,所述网络设备可以为先接收所述电子设备发送的估计子模型,再接收所述电子设备发送的信道生成子模型,最后接收电子设备发送的压缩子模型;或者,先接收所述电子设备发送的压缩子模型和信道生成子模型,再接收所述电子设备发送的估计子模型。需要指出的是,上述仅为示例性说明,不代表实际分别发送或接收上述估计子模型、压缩子模型以及信道生成子模型仅存在上述示例性的几种组合方式,只是本实施例不做穷举。Here, the network device may simultaneously receive the estimation sub-model, compression sub-model and channel generation sub-model sent by the electronic device. Alternatively, the estimated sub-model, the compressed sub-model and the channel generation sub-model sent by the electronic device may be respectively received, for example, the estimated sub-model, the compressed sub-model and the channel generation sub-model are all received respectively; or, the estimated sub-model, compressed Any two of the sub-models and the channel generation sub-model are received separately from the remaining one. For example, the network device may first receive the estimated submodel sent by the electronic device, then receive the channel generation submodel sent by the electronic device, and finally receive the compressed submodel sent by the electronic device; or, first receive the The compressed sub-model and the channel generation sub-model sent by the electronic device, and then receive the estimated sub-model sent by the electronic device. It should be pointed out that the above is only an exemplary description, and does not mean that there are only several combinations of the above-mentioned exemplary sub-models, compression sub-models, and channel generation sub-models that are actually sent or received respectively, but this embodiment is not exhaustive. lift.
在所述电子设备为所述终端设备的情况下,所述网络设备同时接收或分别接收估计子模型、压缩子模型以及信道生成子模型时,所述估计子模型、压缩子模型以及信道生成子模型由以下之一同时或分别携带:上行控制信令、RRC消息、上行数据传输、针对人工智能类业务类传输需求的上行数据传输。When the electronic device is the terminal device, when the network device receives the estimated sub-model, the compressed sub-model and the channel generation sub-model simultaneously or respectively, the estimated sub-model, the compressed sub-model and the channel generation sub-model The model is carried simultaneously or separately by one of the following: uplink control signaling, RRC message, uplink data transmission, and uplink data transmission for the transmission requirements of artificial intelligence services.
在所述电子设备为其他设备的情况下,电子设备可以通过与所述网络设备之间的有线连接将上述估计子模型、压缩子模型以及信道生成子模型,同时或分别发送给所述网络设备。或者,电子设备通过与所述网络设备之间的其他无线连接将上述估计子模型、压缩子模型以及信道生成子模型同时或分别发送给所述网络设备;其中,所述其他无线连接方式可以是蓝牙或WIFI等等,这里不进行穷举。In the case that the electronic device is other devices, the electronic device may send the above-mentioned estimation sub-model, compression sub-model and channel generation sub-model to the network device simultaneously or separately through a wired connection with the network device . Alternatively, the electronic device sends the above estimated sub-model, compression sub-model and channel generation sub-model to the network device simultaneously or separately through other wireless connections with the network device; wherein, the other wireless connection methods may be Bluetooth or WIFI, etc., are not exhaustive here.
在本方式中,所述方法还可以包括:所述网络设备接收所述第三模型。具体来说,所述网络设备可以接收所述第三模型的模型参数。再进一步的,所述网络设备可以接收电子设备发送的所述第三模型,比如所述网络设备可以接收电子设备发送的所述第三模型的模型参数。In this manner, the method may further include: the network device receiving the third model. Specifically, the network device may receive model parameters of the third model. Still further, the network device may receive the third model sent by the electronic device, for example, the network device may receive model parameters of the third model sent by the electronic device.
其中,所述第三模型用于对所述第一模型输出的第二信息进行数据变换处理后输入所述第二模型;所述第一模型、第二模型以及第三模型为联合训练得到的。Wherein, the third model is used to perform data transformation processing on the second information output by the first model and input it into the second model; the first model, the second model and the third model are obtained through joint training .
所述数据变换处理包括:卷积处理或傅里叶变换处理。比如,所述傅里叶变换处理具体可以包括:通过傅里叶变换转换到频域后相乘再通过傅里叶反变换转换到时域。The data transformation processing includes: convolution processing or Fourier transform processing. For example, the Fourier transform processing may specifically include: converting to the frequency domain through Fourier transform, multiplication, and then converting to the time domain through inverse Fourier transform.
在所述电子设备为所述终端设备的情况下,所述第三模型由以下之一携带:上行控制信令、RRC消息、上行数据传输、针对人工智能类业务类传输需求的上行数据传输。In the case that the electronic device is the terminal device, the third model is carried by one of the following: uplink control signaling, RRC message, uplink data transmission, and uplink data transmission for artificial intelligence business transmission requirements.
在所述电子设备为其他设备的情况下,该第三模型(或所述第三模型的模型参数)可以是通过有线连接方式传输的、 或其他无线连接方式传输的。比如,电子设备通过与所述网络设备之间的有线连接将所述第三模型(或所述第三模型的模型参数)传输给所述网络设备。或者,电子设备通过与所述网络设备之间的其他无线连接将所述第三模型(或所述第三模型的模型参数)传输给所述网络设备;其中,所述其他无线连接方式可以是蓝牙或WIFI等等,这里不进行穷举。In the case that the electronic device is other devices, the third model (or the model parameters of the third model) may be transmitted through a wired connection or other wireless connection. For example, the electronic device transmits the third model (or the model parameters of the third model) to the network device through a wired connection with the network device. Alternatively, the electronic device transmits the third model (or the model parameters of the third model) to the network device through other wireless connections with the network device; wherein, the other wireless connection manner may be Bluetooth or WIFI, etc., are not exhaustive here.
上述第一模型、第二模型以及第三模型可以为分别发送也可以为同时发送;或者上述第一模型、第二模型、第三模型也可以是三个模型均分别发送;又或者,还可以是其中任意两个组合同时发送,剩下一个分别发送等等。The above-mentioned first model, second model, and third model can be sent separately or simultaneously; or the above-mentioned first model, second model, and third model can also be sent separately; or, it is also possible It is any two combinations of which are sent at the same time, and the remaining one is sent separately, and so on.
需要说明的是,上述第二模型为所述网络设备在接收到第二信息并处理得到信道信息时需要使用的模型。而本方式中除了第二模型还可以接收第一模型和/或第三模型,这是由于当第一模型与第二模型为联合训练得到的一个整体的时候,所述网络设备若要对第一模型以及第二模型进行整体评估,则需要得到第一模型以及第二模型,进而,所述网络设备在完成第一模型以及第二模型的整体评估之后,可以决定是否使用本次接收到的第一模型以及第二模型,若所述网络设备对第一模型以及第二模型的整体评估结果较差(比如压缩率较低或者恢复信道信息的准确率较低等等),可以不使用上述第一模型。若确定不使用上述第一模型,此时,所述网络设备可以自身对第一模型以及第二模型重新进行联合训练以更新第一模型以及第二模型的模型参数,或者,所述网络设备自己训练得到新的第一模型以及第二模型。当第一模型、第二模型以及第三模型为联合训练得到的整体的时候,同样所述网络设备可以在接收到第一模型、第二模型以及第三模型后对其进行整体评估以及相应的后续处理,具体的处理方式与前述相同,不做赘述。It should be noted that the foregoing second model is a model that the network device needs to use when receiving the second information and processing to obtain channel information. However, in this mode, besides the second model, the first model and/or the third model can also be received. This is because when the first model and the second model are a whole obtained through joint training, if the network device wants to The overall evaluation of the first model and the second model needs to obtain the first model and the second model, and then, after the network device completes the overall evaluation of the first model and the second model, it can decide whether to use the received The first model and the second model, if the overall evaluation results of the network device on the first model and the second model are poor (for example, the compression rate is low or the accuracy of recovering channel information is low, etc.), the above-mentioned model may not be used. first model. If it is determined not to use the above-mentioned first model, at this time, the network device may re-train the first model and the second model by itself to update the model parameters of the first model and the second model, or the network device itself A new first model and a new second model are obtained through training. When the first model, the second model, and the third model are the whole obtained through joint training, the network device can also evaluate the first model, the second model, and the third model as a whole and correspondingly Subsequent processing, the specific processing method is the same as the above, and will not be repeated.
第二种方式、所述网络设备自身训练得到上述第一模型。In a second manner, the network device trains itself to obtain the above-mentioned first model.
所述网络设备采用训练样本对第一预设模型和第二预设模型进行联合训练,得到训练后的所述第一模型和所述第二模型;The network device uses training samples to jointly train the first preset model and the second preset model to obtain the trained first model and the second model;
其中,所述第一模型为训练后的所述第一预设模型,所述第二模型为训练后的所述第二预设模型。Wherein, the first model is the first preset model after training, and the second model is the second preset model after training.
在所述第二种方式中,训练可以采用第一损失函数或第二损失函数。下面对采用上述两种损失函数进行训练分别进行说明:In the second manner, the training may use the first loss function or the second loss function. The following describes the training using the above two loss functions:
所述训练采用的损失函数为第一损失函数;所述第一损失函数为基于所述第二预设模型的输出信息与所述第一预设模型的压缩预设子模型的输入信息之间的差异程度构建的。The loss function used in the training is a first loss function; the first loss function is based on the difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model degree of difference is constructed.
所述第二预设模型的输出信息与所述第一预设模型的压缩预设子模型的输入信息之间的差异程度为基于距离确定的,或者为基于相似程度确定的。The degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model is determined based on a distance, or determined based on a degree of similarity.
基于距离确定所述第二预设模型的输出信息与所述第一预设模型的压缩预设子模型的输入信息之间的差异程度的具体计算方式可以采用均方误差(MSE,Mean Squared Error)或归一化均方误差(NMSE)等方式,本实施例不做穷举。The specific calculation method for determining the degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model based on the distance can use mean square error (MSE, Mean Squared Error ) or normalized mean square error (NMSE), etc., which are not exhaustive in this embodiment.
举例来说,所述第二预设模型的输出信息可以为矩阵,相应的,所述压缩预设子模型的输入信息也可以为矩阵,这里,将所述第二预设模型的输出的矩阵称为矩阵1,将所述压缩预设子模型的输入的矩阵称为矩阵2;基于距离确定所述第二预设模型的输出信息与所述第一预设模型的压缩预设子模型的输入信息之间的差异程度的方式MSE方式,比如:将矩阵1与矩阵2进行计算得到差值,将差值的平方作为所述差异程度。For example, the output information of the second preset model may be a matrix, and correspondingly, the input information of the compressed preset sub-model may also be a matrix, and here, the output matrix of the second preset model It is called matrix 1, and the matrix of the input of the compressed preset submodel is called matrix 2; the output information of the second preset model and the compressed preset submodel of the first preset model are determined based on the distance The way of the degree of difference between the input information is the MSE way, for example: calculate the difference between matrix 1 and matrix 2, and use the square of the difference as the degree of difference.
基于相似程度确定所述第二预设模型的输出信息与所述第一预设模型的压缩预设子模型的输入信息之间的差异程度的具体计算方式可以采用余弦相似度或余弦相似度平方等方式,本实施例不做穷举。The specific calculation method for determining the degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model based on the degree of similarity may use cosine similarity or cosine similarity squared etc., which are not exhaustive in this embodiment.
举例来说,所述第二预设模型的输出信息可以为R组特征向量序列信息,相应的,所述压缩预设子模型的输入信息也可以为R组特征向量序列信息,这里,将所述第二预设模型的输出的R组特征向量序列信息称为特征向量序列1,将所述压缩预设子模型的输入的R组特征向量序列信息称为特征向量序列2。基于相似程度确定所述第二预设模型的输出信息与所述第一预设模型的压缩预设子模型的输入信息之间的差异程度的方式可以是余弦相似度,比如:特征向量序列1以及特征向量序列2的余弦夹角来确定相似程度,将该相似程度作为所述差异程度。For example, the output information of the second preset model may be R sets of feature vector sequence information, and correspondingly, the input information of the compressed preset sub-model may also be R sets of feature vector sequence information. Here, the The R sets of feature vector sequence information output by the second preset model are called feature vector sequence 1, and the R sets of feature vector sequence information input by the compressed preset sub-model are called feature vector sequence 2. The method of determining the degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model based on the degree of similarity may be cosine similarity, for example: feature vector sequence 1 and the cosine angle of the eigenvector sequence 2 to determine the degree of similarity, and use the degree of similarity as the degree of difference.
采用上述第一损失函数进行训练的处理中,由于第一预设模型中包含的子模型的不同以及是否包含用于模拟无线信道环境的第三预设模型进行联合训练,下面分四种情况分别进行说明:In the process of using the above-mentioned first loss function for training, due to the difference in the sub-models contained in the first preset model and whether the third preset model for simulating the wireless channel environment is included for joint training, the following four cases are respectively Be explained:
情况一,所述第一预设模型中包括估计预设子模型和压缩预设子模型。 Case 1, the first preset model includes an estimation preset sub-model and a compression preset sub-model.
参见图8a,其中示意出第一预设模型800,第二预设模型810,以及所述第一预设模型800中包含的估计预设子模型801和压缩预设子模型802。上述第一预设模型800,第二预设模型810,以及所述第一预设模型300中包含的估计预设子模型801和压缩预设子模型802之间的输入输出关系可以为:估计预设子模型801的输入信息为第一训练样本920;所述估计预设子模型801的输出信息作为所述压缩预设子模型802的输入信息;所述压缩预设子模型802的输出信息作为所述第二预设模型810的输入信息。Referring to FIG. 8 a , it illustrates a first preset model 800 , a second preset model 810 , and an estimated preset sub-model 801 and a compressed preset sub-model 802 included in the first preset model 800 . The above-mentioned first preset model 800, the second preset model 810, and the input-output relationship between the estimated preset sub-model 801 and the compressed preset sub-model 802 included in the first preset model 300 can be: estimated The input information of the preset submodel 801 is the first training sample 920; the output information of the estimated preset submodel 801 is used as the input information of the compressed preset submodel 802; the output information of the compressed preset submodel 802 As the input information of the second preset model 810 .
本情况中,所述网络设备采用训练样本对第一预设模型和第二预设模型进行联合训练,包括:In this case, the network device uses training samples to jointly train the first preset model and the second preset model, including:
所述网络设备将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The network device inputs the first training sample into the estimated preset sub-model of the first preset model, and obtains initial information output by the estimated preset sub-model;
将所述初始信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的信息;inputting the initial information into the compressed preset sub-model of the first preset model to obtain compressed information output by the compressed preset sub-model;
将所述压缩后的信息输入所述第二预设模型,得到所述第二预设模型输出的恢复信息;inputting the compressed information into the second preset model to obtain restoration information output by the second preset model;
基于所述恢复信息以及所述初始信息之间的差异程度,确定所述第一损失函数;determining the first loss function based on the degree of difference between the restoration information and the initial information;
根据所述第一损失函数进行反向传导更新所述第一预设模型和所述第二预设模型。performing reverse conduction according to the first loss function to update the first preset model and the second preset model.
其中,所述第一训练样本可以为参考信号样本。所述参考信号样本可以为历史采集得到的原始参考信号、或者处理过的参考信号。再具体的,所述参考信号样本可以为下行参考信号样本。需要指出的是,本实施例并不限定所述第一训练样本一定为所述下行参考信号样本,还可以采用上行参考信号样本或其他参考信号样本,只是本实施例不做穷举。Wherein, the first training samples may be reference signal samples. The reference signal samples may be original reference signals or processed reference signals obtained through historical acquisition. More specifically, the reference signal samples may be downlink reference signal samples. It should be pointed out that this embodiment does not limit the first training samples to be the downlink reference signal samples, and uplink reference signal samples or other reference signal samples may also be used, but this embodiment does not make an exhaustive list.
还需要指出的是,除了第一训练样本之外,输入所述第一预设模型的估计预设子模型的信息还可以无线信道或者场景相关的其他信息,比如可以包括以下至少之一:信道的信噪比、信干扰噪比、信道类型、带宽信息、时延信息等。关于在训练的过程中,输入以上信息的一种还是多种可以根据实际情况或实际场景相关,这里不对其进行限定。It should also be pointed out that, in addition to the first training samples, the information input into the estimated preset sub-model of the first preset model can also be wireless channel or other information related to the scene, for example, it can include at least one of the following: channel Signal-to-noise ratio, signal-to-interference-noise ratio, channel type, bandwidth information, delay information, etc. In the process of training, whether one or more of the above information is input may be relevant according to the actual situation or the actual scene, and it is not limited here.
所述第一预设模型的估计预设子模型的具体功能可以为:基于第一训练样本进行信道估计得到所述初始信息。其中,信道估计可以采用最小均方误差(MMSE)等算法。A specific function of the estimation preset sub-model of the first preset model may be: perform channel estimation based on the first training samples to obtain the initial information. Wherein, the channel estimation may adopt algorithms such as minimum mean square error (MMSE).
上述初始信息可以为矩阵,该矩阵的维度这里不做限定,可以为二维或更多维度的矩阵。所述矩阵中的每一个位置上的数值用于表示对应多个维度的相应粒度下所对应的信道质量。其中,所述信道质量可以是以dBm为单位,又或者可 以为对信道质量进行归一化处理后的数值。The aforementioned initial information may be a matrix, and the dimension of the matrix is not limited here, and may be a matrix of two or more dimensions. The value at each position in the matrix is used to represent the corresponding channel quality at the corresponding granularity corresponding to multiple dimensions. Wherein, the channel quality may be in dBm, or may be a value after normalization processing of the channel quality.
所述第一预设模型的压缩预设子模型对输入信息的数据量进行压缩。所述压缩子模型的输出信息与输入信息之间的压缩率可以为训练时确定的,比如,压缩率可以为千分之五、千分之二、百分之十等等,这里不进行穷举。在上述处理中,所述压缩预设子模型得到的压缩后的信息包含的内容或数据量小于其输入的信息的数据量或内容。上述压缩后的信息与初始信息的形式为相同的,比如初始信息为矩阵,相应的所述压缩后的信息也为矩阵,所述初始信息与压缩后的信息的矩阵维度是相同的,但是数据量是不同的。The compressed preset sub-model of the first preset model compresses the data volume of the input information. The compression rate between the output information of the compressed sub-model and the input information can be determined during training, for example, the compression rate can be 5/1000, 2/1000, 10%, etc. lift. In the above processing, the compressed information obtained by compressing the preset sub-model contains less content or data volume than the input information. The form of the above-mentioned compressed information is the same as that of the initial information. For example, the initial information is a matrix, and the corresponding compressed information is also a matrix. The dimensions of the matrix of the initial information and the compressed information are the same, but the data The amount is different.
所述第二预设模型的功能可以是对其输入信息进行解压缩。所述第二预设模型的输入信息为压缩后的信息,第二预设模型的输出为恢复信息。在理想状态下,第二预设模型的解压缩率应该使得其得到的恢复信息与初始信息包含相同的数据内容。The function of the second preset model may be to decompress its input information. The input information of the second preset model is compressed information, and the output of the second preset model is restored information. Ideally, the decompression rate of the second preset model should make the obtained restored information contain the same data content as the original information.
所述基于第一损失函数进行反向传导更新所述第一预设模型和所述第二预设模型具体可以指的是:基于第一损失函数所确定的差异程度来进行反向传导更新所述估计预设子模型的模型参数、所述压缩预设子模型的模型参数和所述第二预设模型的模型参数。The performing reverse conduction update of the first preset model and the second preset model based on the first loss function may specifically refer to performing reverse conduction update based on the degree of difference determined by the first loss function. The model parameters of the estimated preset sub-model, the model parameters of the compressed preset sub-model, and the model parameters of the second preset model.
针对上述训练还需要指出,关于上述训练收敛的方式可以包括以下至少之一:判断迭代训练的次数是否达到预设次数,判断差异程度是否小于预设门限值。其中,所述预设次数、所述预设门限值可以根据实际情况设置,不对其进行穷举。也就是说,基于上述方式确定训练完成时,可以将训练完成后的第一预设模型作为第一模型,将训练完成后的第二预设模型作为第二模型。Regarding the above training, it should also be pointed out that the manner of the above training convergence may include at least one of the following: judging whether the number of iterative training reaches a preset number, and judging whether the degree of difference is smaller than a preset threshold. Wherein, the preset number of times and the preset threshold value can be set according to actual conditions, and are not exhaustive. That is to say, when it is determined that the training is completed based on the above manner, the first preset model after the training can be used as the first model, and the second preset model after the training can be used as the second model.
情况二,所述第一预设模型中包括估计预设子模型、预设信道生成子模型和压缩预设子模型。Case 2, the first preset model includes an estimation preset submodel, a preset channel generation submodel, and a compression preset submodel.
参见图8b,其中示意出第一预设模型800,第二预设模型810,以及所述第一预设模型800中包含的估计预设子模型801、压缩预设子模型802和信道生成预设子模型803。上述第一预设模型800,第二预设模型810,以及所述第一预设模型800中包含的估计预设子模型801、压缩预设子模型802和信道生成预设子模型803之间的输入输出关系可以为:估计预设子模型801的输入信息为第一训练样本920;所述估计预设子模型801的输出信息作为所述信道生成预设子模型803的输入信息;所述信道生成预设子模型803的输出信息作为压缩预设子模型802的输入信息;所述压缩预设子模型802的输出信息作为所述第二预设模型810的输入信息。Referring to Fig. 8b, it shows a first preset model 800, a second preset model 810, and the estimation preset sub-model 801, compression preset sub-model 802 and channel generation preset included in the first preset model 800. Set sub-model 803 . Between the above-mentioned first preset model 800, second preset model 810, and the estimation preset submodel 801, compression preset submodel 802 and channel generation preset submodel 803 contained in the first preset model 800 The input-output relationship of can be as follows: the input information of the estimated preset sub-model 801 is the first training sample 920; the output information of the estimated preset sub-model 801 is used as the input information of the channel generation preset sub-model 803; The output information of the channel generation preset submodel 803 is used as the input information of the compression preset submodel 802 ; the output information of the compression preset submodel 802 is used as the input information of the second preset model 810 .
所述网络设备采用训练样本对第一预设模型和第二预设模型进行联合训练,可以包括:The network device uses training samples to jointly train the first preset model and the second preset model, which may include:
所述网络设备将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The network device inputs the first training sample into the estimated preset sub-model of the first preset model, and obtains initial information output by the estimated preset sub-model;
将所述初始信息输入所述第一预设模型的信道生成预设子模型,得到所述信道生成预设子模型输出的初始信息的特征向量信息;inputting the initial information into the channel generation preset sub-model of the first preset model, and obtaining eigenvector information of the initial information output by the channel generation preset sub-model;
将所述初始信息的特征向量信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的特征向量信息;Inputting the eigenvector information of the initial information into the compressed preset sub-model of the first preset model to obtain the compressed eigenvector information output by the compressed preset sub-model;
将所述压缩后的特征向量信息输入所述第二预设模型,得到所述第二预设模型输出的恢复的特征向量信息;inputting the compressed feature vector information into the second preset model to obtain restored feature vector information output by the second preset model;
基于所述恢复的特征向量信息以及所述初始信息的特征向量信息之间的差异程度,确定第一损失函数;determining a first loss function based on the degree of difference between the restored eigenvector information and the eigenvector information of the initial information;
根据所述第一损失函数进行反向传导更新所述第一预设模型和所述第二预设模型。performing reverse conduction according to the first loss function to update the first preset model and the second preset model.
其中,关于第一训练样本的具体说明与前述情况一相同,因此不做重复说明。还需要指出的是,除了第一训练样本之外,输入所述第一预设模型的估计预设子模型的信息还可以无线信道或者场景相关的其他信息,比如可以包括以下至少之一:信道的信噪比、信干扰噪比、信道类型、带宽信息、时延信息等。关于在训练的过程中,输入以上信息的一种还是多种可以根据实际情况或实际场景相关,这里不对其进行限定。Wherein, the specific description about the first training sample is the same as the above-mentioned case 1, so repeated description will not be given. It should also be pointed out that, in addition to the first training samples, the information input into the estimated preset sub-model of the first preset model can also be wireless channel or other information related to the scene, for example, it can include at least one of the following: channel Signal-to-noise ratio, signal-to-interference-noise ratio, channel type, bandwidth information, delay information, etc. In the process of training, whether one or more of the above information is input may be relevant according to the actual situation or the actual scene, and it is not limited here.
所述第一预设模型的估计预设子模型的具体功能可以为:基于第一训练样本进行信道估计得到初始信息。其中,信道估计可以采用最小均方误差(MMSE)等算法。The specific function of the estimation preset sub-model of the first preset model may be: perform channel estimation based on the first training samples to obtain initial information. Wherein, the channel estimation may adopt algorithms such as minimum mean square error (MMSE).
所述信道生成预设子模型的功能可以是对所述初始信息进行特征分解,得到所述初始信息的特征向量信息。其中,所述初始信息的特征向量信息可以包括R组特征向量序列。对所述初始信息进行特征分解的方式可以采用奇异值分解(SVD,Singular Value Decomposition)的方式。A function of the channel generation preset submodel may be to perform eigendecomposition on the initial information to obtain eigenvector information of the initial information. Wherein, the eigenvector information of the initial information may include R groups of eigenvector sequences. The method of performing eigendecomposition on the initial information may adopt a method of Singular Value Decomposition (SVD, Singular Value Decomposition).
所述第一预设模型的压缩预设子模型对输入信息的数据量进行压缩。所述压缩子模型的输出信息与输入信息之间的压缩率可以为训练时确定的,比如,压缩率可以为千分之五、千分之二、百分之十等等,这里不进行穷举。在上述处理中,所述压缩预设子模型得到的压缩后的特征向量信息包含的内容或数据量小于其输入的初始信息的特征向量信息的数据量或内容。上述压缩后的特征向量信息与初始信息的特征向量信息的形式为相同的,比如初始信息的特征向量信息为R组特征向量序列,相应的所述压缩后的特征向量信息也为R组特征向量序列,但是两者包含的数据量是不同的。The compressed preset sub-model of the first preset model compresses the data volume of the input information. The compression rate between the output information of the compressed sub-model and the input information can be determined during training, for example, the compression rate can be 5/1000, 2/1000, 10%, etc. lift. In the above processing, the compressed feature vector information obtained by compressing the preset sub-model contains less content or data volume than the feature vector information of the input initial information. The form of the above-mentioned compressed feature vector information is the same as that of the initial information. For example, the feature vector information of the initial information is an R group of feature vector sequences, and the corresponding compressed feature vector information is also an R group of feature vectors. sequence, but the amount of data contained in the two is different.
所述第二预设模型的功能可以是对其输入信息进行解压缩得到恢复信息。在上述处理中,所述第二预设模型的输入信息为压缩后的特征向量信息,第二预设模型的输出为恢复的特征向量信息。第二预设模型的解压缩率应该使得其得到的恢复的特征向量信息与初始信息的特征向量信息包含相同的数据内容。The function of the second preset model may be to decompress its input information to obtain restoration information. In the above processing, the input information of the second preset model is compressed feature vector information, and the output of the second preset model is restored feature vector information. The decompression rate of the second preset model should make the obtained restored feature vector information contain the same data content as the feature vector information of the initial information.
所述基于第一损失函数所确定的差异程度来进行反向传导更新所述第一预设模型和所述第二预设模型具体可以指的是:基于第一损失函数所确定的差异程度来进行反向传导更新所述估计预设子模型的模型参数、所述信道生成预设子模型的模型参数、所述压缩预设子模型的模型参数和所述第二预设模型的模型参数。The performing reverse conduction to update the first preset model and the second preset model based on the degree of difference determined by the first loss function may specifically refer to: performing a reverse conduction based on the degree of difference determined by the first loss function performing reverse conduction to update model parameters of the estimated preset submodel, model parameters of the channel generation preset submodel, model parameters of the compression preset submodel, and model parameters of the second preset model.
关于上述训练收敛的确定方式与前述情况一相同,不做重复说明。The method for determining the convergence of the above training is the same as that of the above-mentioned case 1, and repeated explanations are not repeated.
情况三,所述网络设备采用训练样本对第一预设模型和第二预设模型进行联合训练,得到训练后的所述第一模型和所述第二模型,包括:In case three, the network device uses training samples to jointly train the first preset model and the second preset model to obtain the trained first model and the second model, including:
所述网络设备采用训练样本对第一预设模型、第二预设模型和第三预设模型进行联合训练,得到训练后的所述第一模型、所述第二模型和第三模型;其中,所述第三模型为训练后的第三预设模型。The network device uses training samples to jointly train the first preset model, the second preset model and the third preset model to obtain the trained first model, the second model and the third model; wherein , the third model is a trained third preset model.
参见图8c,其中示意出第一预设模型800,第二预设模型810,第三预设模型830,以及所述第一预设模型800中包含的估计预设子模型801、压缩预设子模型802。上述第一预设模型800,第二预设模型810,以及所述第一预设模型800中包含的估计预设子模型801和压缩预设子模型802之间的输入输出关系可以为:估计预设子模型801的输入信息为第一训练样本920;所述估计预设子模型801的输出信息作为压缩预设子模型802的输入信息;所述压缩预设子模型802 的输出信息作为所述第三预设模型830的输入信息;所述第三预设模型的输出信息作为所述第二预设模型810的输出信息。Referring to FIG. 8c, it shows a first preset model 800, a second preset model 810, a third preset model 830, and the estimated preset sub-model 801 contained in the first preset model 800, the compression preset Submodel 802. The above-mentioned first preset model 800, the second preset model 810, and the input-output relationship between the estimated preset sub-model 801 and the compressed preset sub-model 802 contained in the first preset model 800 can be: estimated The input information of the preset submodel 801 is the first training sample 920; the output information of the estimated preset submodel 801 is used as the input information of the compressed preset submodel 802; the output information of the compressed preset submodel 802 is used as the The input information of the third preset model 830; the output information of the third preset model is used as the output information of the second preset model 810.
所述网络设备采用训练样本对第一预设模型、第二预设模型和第三预设模型进行联合训练,包括:The network device uses training samples to jointly train the first preset model, the second preset model and the third preset model, including:
所述网络设备将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The network device inputs the first training sample into the estimated preset sub-model of the first preset model, and obtains initial information output by the estimated preset sub-model;
将所述初始信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的信息;inputting the initial information into the compressed preset sub-model of the first preset model to obtain compressed information output by the compressed preset sub-model;
将所述压缩后的信息输入第三预设模型,得到所述第三预设模型输出的变换后的信息;inputting the compressed information into a third preset model to obtain transformed information output by the third preset model;
将所述变换后的信息输入所述第二预设模型,得到所述第二预设模型输出的恢复信息;inputting the transformed information into the second preset model to obtain restoration information output by the second preset model;
基于所述恢复信息以及所述初始信息之间的差异程度,确定第一损失函数;determining a first loss function based on the degree of difference between the restoration information and the initial information;
根据所述第一损失函数进行反向传导更新所述第一预设模型、第二预设模型和第三预设模型。performing reverse conduction according to the first loss function to update the first preset model, the second preset model and the third preset model.
关于第一训练样本的具体说明与前述情况一或情况二相同,因此不做重复说明。还需要指出的是,除了第一训练样本之外,输入所述第一预设模型的估计预设子模型的信息还可以无线信道或者场景相关的其他信息,比如可以包括以下至少之一:信道的信噪比、信干扰噪比、信道类型、带宽信息、时延信息等。关于在训练的过程中,输入以上信息的一种还是多种可以根据实际情况或实际场景相关,这里不对其进行限定。The specific description about the first training sample is the same as the above-mentioned case 1 or case 2, so no repeated description is given. It should also be pointed out that, in addition to the first training samples, the information input into the estimated preset sub-model of the first preset model can also be wireless channel or other information related to the scene, for example, it can include at least one of the following: channel Signal-to-noise ratio, signal-to-interference-noise ratio, channel type, bandwidth information, delay information, etc. In the process of training, whether one or more of the above information is input may be relevant according to the actual situation or the actual scene, and it is not limited here.
所述第一预设模型的估计预设子模型以及所述第一预设模型的压缩预设子模型的具体功能与前述情况一相同,因此不做重复说明。The specific functions of the estimation preset sub-model of the first preset model and the compression preset sub-model of the first preset model are the same as those in the first case, so the description will not be repeated.
在情况三中相对于情况一增加了第三预设模型,关于所述第三预设模型的功能为模拟信道环境,具体处理可以为对输入信息进行数据变换得到数据变换后的信息作为输出信息。其中,所述数据变换的具体处理方式可以包括卷积处理或者等效于卷积的数据处理;其中,所述等效于卷积的数据处理可以为多次傅里叶变换处理,比如,可以为将所述第三预设模型的输入信息通过傅里叶变换转换到频域后相乘再通过傅里叶反变换转换到时域,以此来等效时域卷积的处理。In case three, a third preset model is added relative to case one, and the function of the third preset model is to simulate the channel environment, and the specific processing can be to perform data transformation on input information to obtain data transformed information as output information . Wherein, the specific processing method of the data transformation may include convolution processing or data processing equivalent to convolution; wherein, the data processing equivalent to convolution may be multiple Fourier transform processing, for example, may In order to transform the input information of the third preset model into the frequency domain through Fourier transform, multiply and then transform into the time domain through inverse Fourier transform, the process of convolution in the time domain is equivalent.
所述第二预设模型的功能可以是对其输入信息进行解压缩得到恢复信息。所述第二预设模型的输入信息为变换后的信息,第二预设模型的输出为恢复信息。第二预设模型的解压缩率应该使得其得到的恢复信息与初始信息包含接近或相同的数据内容。The function of the second preset model may be to decompress its input information to obtain restoration information. The input information of the second preset model is transformed information, and the output of the second preset model is restored information. The decompression rate of the second preset model should make the restored information obtained by it contain close to or the same data content as the original information.
所述基于第一损失函数进行反向传导更新更新所述第一预设模型、所述第二预设模型和所述第三预设模型具体可以指的是:基于第一损失函数进行反向传导更新所述估计预设子模型的模型参数、所述压缩预设子模型的模型参数、所述第二预设模型的模型参数和所述第三预设模型的模型参数。The updating of the first preset model, the second preset model, and the third preset model based on the first loss function may specifically refer to: performing reverse conduction based on the first loss function Conductively updating model parameters of the estimated preset sub-model, model parameters of the compressed preset sub-model, model parameters of the second preset model, and model parameters of the third preset model.
关于上述训练收敛的方式与前述情况一或情况二相同,不做赘述。The manner of the above-mentioned training convergence is the same as that of the foregoing case 1 or case 2, and will not be repeated here.
情况四,与上述情况三不同在于所述第一预设模型中包括估计预设子模型、预设信道生成子模型和压缩预设子模型。Case 4 is different from the above case 3 in that the first preset model includes an estimation preset sub-model, a preset channel generation sub-model, and a compression preset sub-model.
参见图8d,其中示意出第一预设模型800,第二预设模型810,第三预设模型830,以及所述第一预设模型800中包含的估计预设子模型801、压缩预设子模型802和信道生成预设子模型803。上述第一预设模型800,第二预设模型810,第三预设模型830,以及所述第一预设模型800中包含的估计预设子模型801、压缩预设子模型802和信道生成预设子模型803之间的输入输出关系可以为:估计预设子模型801的输入信息为第一训练样本920;所述估计预设子模型801的输出信息作为所述信道生成预设子模型803的输入信息;所述信道生成预设子模型803的输出信息作为压缩预设子模型802的输入信息;所述压缩预设子模型802的输出信息作为所述第三预设模型830的输入信息;所述第三预设模型830的输出信息作为所述第二预设模型810的输入信息。Referring to Fig. 8d, it shows a first preset model 800, a second preset model 810, a third preset model 830, and the estimated preset sub-model 801 contained in the first preset model 800, the compression preset Submodel 802 and channel generation preset submodel 803 . The above-mentioned first preset model 800, second preset model 810, third preset model 830, and the estimation preset sub-model 801, compression preset sub-model 802 and channel generation included in the first preset model 800 The input-output relationship between the preset sub-models 803 may be: the input information of the estimated preset sub-model 801 is the first training sample 920; the output information of the estimated preset sub-model 801 is used as the channel generation preset sub-model The input information of 803; the output information of the channel generation preset submodel 803 is used as the input information of the compression preset submodel 802; the output information of the compression preset submodel 802 is used as the input of the third preset model 830 information; the output information of the third preset model 830 is used as the input information of the second preset model 810 .
所述网络设备采用训练样本对第一预设模型、第二预设模型和第三预设模型进行联合训练,包括:The network device uses training samples to jointly train the first preset model, the second preset model and the third preset model, including:
所述网络设备将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The network device inputs the first training sample into the estimated preset sub-model of the first preset model, and obtains initial information output by the estimated preset sub-model;
将所述初始信息输入所述第一预设模型的信道生成预设子模型,得到所述信道生成预设子模型输出的初始信息的特征向量信息;inputting the initial information into the channel generation preset sub-model of the first preset model, and obtaining eigenvector information of the initial information output by the channel generation preset sub-model;
将所述初始信息的特征向量信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的特征向量信息;Inputting the eigenvector information of the initial information into the compressed preset sub-model of the first preset model to obtain the compressed eigenvector information output by the compressed preset sub-model;
将所述压缩后的特征向量信息输入第三预设模型,得到所述第三预设模型输出的变换后的特征向量信息;Inputting the compressed feature vector information into a third preset model to obtain transformed feature vector information output by the third preset model;
将所述变换后的特征向量信息输入所述第二预设模型,得到所述第二预设模型输出的恢复的特征向量信息;inputting the transformed eigenvector information into the second preset model to obtain restored eigenvector information output by the second preset model;
基于所述恢复的特征向量信息以及所述初始信息的特征向量信息之间的差异程度,确定第一损失函数;determining a first loss function based on the degree of difference between the restored eigenvector information and the eigenvector information of the initial information;
根据所述第一损失函数进行反向传导更新所述第一预设模型、第二预设模型和第三预设模型。performing reverse conduction according to the first loss function to update the first preset model, the second preset model and the third preset model.
关于第一训练样本的具体说明与前述情况一、情况二、情况三中任意之一相同,因此不做重复说明。The specific description about the first training sample is the same as any one of the foregoing case 1, case 2, and case 3, so no repeated description is given.
所述第一预设模型的估计预设子模型的具体功能与前述情况一、情况二、情况三中任意之一相同。The specific function of the estimated preset sub-model of the first preset model is the same as any one of the foregoing case 1, case 2, and case 3.
所述信道生成预设子模型的功能可以是对所述初始信息进行特征分解,得到所述初始信息的特征向量信息。其中,所述初始信息的特征向量信息可以包括R组特征向量序列。对所述初始信息进行特征分解的方式可以采用奇异值分解(SVD,Singular Value Decomposition)的方式。A function of the channel generation preset submodel may be to perform eigendecomposition on the initial information to obtain eigenvector information of the initial information. Wherein, the eigenvector information of the initial information may include R groups of eigenvector sequences. The method of performing eigendecomposition on the initial information may adopt a method of Singular Value Decomposition (SVD, Singular Value Decomposition).
所述第一预设模型的压缩预设子模型对输入信息的数据量进行压缩。所述压缩子模型的输出信息与输入信息之间的压缩率可以为训练时确定的,比如,压缩率可以为千分之五、千分之二、百分之十等等,这里不进行穷举。The compressed preset sub-model of the first preset model compresses the data volume of the input information. The compression rate between the output information of the compressed sub-model and the input information can be determined during training, for example, the compression rate can be 5/1000, 2/1000, 10%, etc. lift.
所述第三预设模型的功能为模拟无线信道环境,具体处理可以为对输入信息进行数据变换得到数据变换后的信息作为输出信息。其中,所述数据变换的具体处理方式可以包括卷积处理或者等效于卷积的数据处理;其中,所述等效于卷积的数据处理可以为多次傅里叶变换处理,比如,可以为将所述第三预设模型的输入信息通过傅里叶变换转换到频域后相乘再通过傅里叶反变换转换到时域,以此来等效时域卷积的处理。The function of the third preset model is to simulate the wireless channel environment, and the specific processing may be to perform data transformation on input information to obtain information after data transformation as output information. Wherein, the specific processing method of the data transformation may include convolution processing or data processing equivalent to convolution; wherein, the data processing equivalent to convolution may be multiple Fourier transform processing, for example, may In order to transform the input information of the third preset model into the frequency domain through Fourier transform, multiply and then transform into the time domain through inverse Fourier transform, the process of convolution in the time domain is equivalent.
所述第二预设模型的功能可以是对其输入信息进行解压缩得到恢复信息。所述第二预设模型的输入信息为变换后的特征向量信息,第二预设模型的输出为恢复的特征向量信息。第二预设模型的解压缩率应该使得其得到的恢复的特征向量信息与初始信息的特征向量信息包含接近或相同的数据内容。The function of the second preset model may be to decompress its input information to obtain restoration information. The input information of the second preset model is transformed feature vector information, and the output of the second preset model is restored feature vector information. The decompression rate of the second preset model should make the obtained restored feature vector information and the feature vector information of the initial information contain close to or the same data content.
所述基于第一损失函数所确定的差异程度来进行反向传导更新更新所述第一预设模型、所述第二预设模型和所述第三预设模型具体可以指的是:基于第一损失函数所确定的差异程度来进行反向传导更新所述估计预设子模型的模型参数、所述信道生成预设子模型的模型参数、所述压缩预设子模型的模型参数、所述第二预设模型的模型参数和所述第三预设 模型的模型参数。The performing reverse conduction update based on the degree of difference determined by the first loss function to update the first preset model, the second preset model, and the third preset model may specifically refer to: based on the first The degree of difference determined by a loss function is used to perform reverse conduction to update the model parameters of the estimated preset sub-model, the model parameters of the channel generation preset sub-model, the model parameters of the compressed preset sub-model, the The model parameters of the second preset model and the model parameters of the third preset model.
关于上述训练收敛的方式与前述实施例相同,不做重复说明。The manner of the above-mentioned training convergence is the same as that of the foregoing embodiment, and no repeated description is given.
以上针对联合训练采用第一损失函数的场景进行了说明,本实施例中还可以提供采用第二损失函数进行联合训练的场景,具体如下:The scenario where the first loss function is used for joint training is described above. In this embodiment, the scenario where the second loss function is used for joint training can also be provided, as follows:
所述训练采用的损失函数为第二损失函数;所述第二损失函数为基于所述第二预设模型的输出信息与所述第一预设模型的压缩预设子模型的输入信息之间的第一差异程度以及所述第一预设模型的估计预设子模型的输出信息与第二训练样本之间的第二差异程度构建的;其中,所述第二训练样本与输入所述估计预设子模型的第一训练样本相对应。The loss function used in the training is a second loss function; the second loss function is based on the difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model The first difference degree of the first preset model and the second difference degree between the output information of the estimated preset sub-model of the first preset model and the second training sample; wherein, the second training sample and the input of the estimated Corresponds to the first training sample of the preset sub-model.
所述第一差异程度为基于距离确定的,或者为基于相似程度确定的;和/或,所述第二差异程度为基于距离确定的,或者为基于相似程度确定的。The first degree of difference is determined based on a distance, or is determined based on a degree of similarity; and/or, the second degree of difference is determined based on a distance, or is determined based on a degree of similarity.
基于距离确定所述第二预设模型的输出信息与所述第一预设模型的压缩预设子模型的输入信息之间的第一差异程度的具体计算方式可以采用均方误差(MSE,Mean Squared Error)或归一化均方误差(NMSE)等方式,本实施例不做穷举。The specific calculation method for determining the first degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model based on the distance can use a mean square error (MSE, Mean Squared Error) or normalized mean square error (NMSE), etc., this embodiment is not exhaustive.
举例来说,所述第二预设模型的输出信息可以为矩阵,相应的,所述压缩预设子模型的输入信息也可以为矩阵,这里,将所述第二预设模型的输出的矩阵称为矩阵3,将所述压缩预设子模型的输入的矩阵称为矩阵4;基于距离确定所述第二预设模型的输出信息与所述第一预设模型的压缩预设子模型的输入信息之间的差异程度的方式MSE方式,比如:将矩阵3与矩阵4进行计算得到差值,将差值的平方作为所述差异程度。For example, the output information of the second preset model may be a matrix, and correspondingly, the input information of the compressed preset sub-model may also be a matrix, and here, the output matrix of the second preset model It is called matrix 3, and the matrix of the input of the compressed preset submodel is called matrix 4; the output information of the second preset model and the compressed preset submodel of the first preset model are determined based on the distance The way of the degree of difference between the input information is the MSE way, for example: calculate the difference between matrix 3 and matrix 4, and use the square of the difference as the degree of difference.
基于相似程度确定所述第二预设模型的输出信息与所述第一预设模型的压缩预设子模型的输入信息之间的第一差异程度的具体计算方式可以采用余弦相似度或余弦相似度平方等方式,本实施例不做穷举。The specific calculation method for determining the first degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model based on the degree of similarity may use cosine similarity or cosine similarity Degree square and other methods are not exhaustive in this embodiment.
举例来说,所述第二预设模型的输出信息可以为R组特征向量序列信息,相应的,所述压缩预设子模型的输入信息也可以为R组特征向量序列信息,这里,将所述第二预设模型的输出的R组特征向量序列信息称为特征向量序列3,将所述压缩预设子模型的输入的R组特征向量序列信息称为特征向量序列4。基于相似程度确定所述第二预设模型的输出信息与所述第一预设模型的压缩预设子模型的输入信息之间的差异程度的方式可以是余弦相似度,比如:特征向量序列3以及特征向量序列4的余弦夹角来确定相似程度,将该相似程度作为所述差异程度。For example, the output information of the second preset model may be R sets of feature vector sequence information, and correspondingly, the input information of the compressed preset sub-model may also be R sets of feature vector sequence information. Here, the The R group of feature vector sequence information output by the second preset model is called feature vector sequence 3, and the R group of feature vector sequence information input by the compressed preset sub-model is called feature vector sequence 4. The method of determining the degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model based on the degree of similarity may be cosine similarity, for example: feature vector sequence 3 and the cosine angle of the eigenvector sequence 4 to determine the degree of similarity, and use the degree of similarity as the degree of difference.
基于距离确定所述第一预设模型的估计预设子模型的输出信息与第二训练样本之间的第二差异程度的具体计算方式可以采用均方误差(MSE,Mean Squared Error)或归一化均方误差(NMSE)等方式,本实施例不做穷举。The specific calculation method for determining the second degree of difference between the output information of the estimated preset sub-model of the first preset model based on the distance and the second training sample can use mean square error (MSE, Mean Squared Error) or normalization The methods such as normalized mean square error (NMSE) are not exhaustive in this embodiment.
举例来说,所述估计预设子模型的输出信息可以为矩阵,相应的,所述第二训练样本也可以为矩阵,这里,将所述估计预设子模型的输出的矩阵称为矩阵5,将所述压缩预设子模型的输入的矩阵称为矩阵6;基于距离确定所述第一预设模型的估计预设子模型的输出信息与第二训练样本之间的第二差异程度的方式MSE方式,比如:将矩阵5与矩阵6进行计算得到差值,将差值的平方作为所述差异程度。For example, the output information of the estimated preset sub-model may be a matrix, and correspondingly, the second training sample may also be a matrix. Here, the output matrix of the estimated preset sub-model is called matrix 5 , the matrix of the input of the compressed preset sub-model is called matrix 6; the second degree of difference between the output information of the estimated preset sub-model of the first preset model and the second training sample is determined based on the distance In the MSE mode, for example: calculate the difference between matrix 5 and matrix 6, and use the square of the difference as the degree of difference.
基于相似程度所述第一预设模型的估计预设子模型的输出信息与第二训练样本之间的第二差异程度的具体计算方式的具体计算方式可以采用余弦相似度或余弦相似度平方等方式,本实施例不做穷举。Based on the specific calculation method of the second degree of difference between the output information of the estimated preset sub-model of the first preset model and the second training sample based on the degree of similarity, the specific calculation method can use cosine similarity or cosine similarity squared, etc. The methods are not exhaustive in this embodiment.
上述第一差异程度以及第二差异程度联合构建所述第二损失函数时,其联的方式可以是对第一差异程度以及第二差异程度等权重相加,比如两者各占50%;或者,其联合的方式可以是第一差异程度以及第二差异程度不等权重相加,比如可以对第一差异程度赋予更大权重,也就是对上述第二预设模型与压缩预设子模型之间的压缩恢复前后的差异情况赋予更大权重,或者可以对上述第二差异程度赋予更大权重,也就是对上述估计预设子模型的输出信息的准确度赋予更大权重;或者,其联合的方式可以是第一差异程度以及第二差异程度相乘的形式;或者其联合的方式可以是第一差异程度以及第二差异程度可以是通过交叉熵计算的形式,比如p1*log(第一差异程度)+p2*log(第二差异程度),其中p1和p2均可以根据实际情况设置,这里不对其进行限定。When the first degree of difference and the second degree of difference are combined to construct the second loss function, the method of connection may be to add the weights of the first degree of difference and the second degree of difference, for example, the two each account for 50%; or , the joint method can be the addition of unequal weights between the first difference degree and the second difference degree. The difference before and after the compression and recovery between the two can be assigned a greater weight, or the above-mentioned second degree of difference can be assigned a larger weight, that is, the accuracy of the output information of the above-mentioned estimated preset sub-model is assigned a larger weight; or, its combination The method can be in the form of multiplying the first degree of difference and the second degree of difference; or the joint method can be that the first degree of difference and the second degree of difference can be calculated by cross entropy, such as p1*log(first degree of difference)+p2*log (the second degree of difference), where both p1 and p2 can be set according to actual conditions, and are not limited here.
采用上述第二损失函数进行训练的处理中,由于第一预设模型中包含的子模型的不同以及是否包含用于模拟无线信道环境的第三预设模型进行联合训练的具体处理是不同的,因此分以下四种情况分别进行说明:In the process of using the above-mentioned second loss function for training, since the sub-models contained in the first preset model are different and whether the third preset model for simulating the wireless channel environment is included for joint training is different, Therefore, the following four situations are described separately:
情况五,所述网络设备采用训练样本对第一预设模型和第二预设模型进行联合训练得到训练后的第一模型和第二模型;其中,所述第一预设模型中包括估计预设子模型和压缩预设子模型。In case five, the network device uses training samples to jointly train the first preset model and the second preset model to obtain the trained first model and the second model; wherein, the first preset model includes estimated preset Set submodels and compress preset submodels.
本情况中各个模型的组成以及各个模型之间的输入输出关系与前述情况一相同,具体可以参见图8a,这里不做重复说明。In this case, the composition of each model and the input-output relationship between each model are the same as the previous case 1. For details, please refer to FIG. 8 a , which will not be repeated here.
所述网络设备采用训练样本对第一预设模型和第二预设模型进行联合训练,包括:The network device uses training samples to jointly train the first preset model and the second preset model, including:
所述网络设备将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The network device inputs the first training sample into the estimated preset sub-model of the first preset model, and obtains initial information output by the estimated preset sub-model;
将所述初始信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的信息;inputting the initial information into the compressed preset sub-model of the first preset model to obtain compressed information output by the compressed preset sub-model;
将所述压缩后的信息输入所述第二预设模型,得到所述第二预设模型输出的恢复信息;inputting the compressed information into the second preset model to obtain restoration information output by the second preset model;
基于所述恢复信息与所述初始信息之间的第一差异程度,以及基于所述初始信息与第二训练样本之间的第二差异程度,确定第二损失函数;所述第二训练样本与所述第一训练样本相对应;Determine a second loss function based on a first degree of difference between the restored information and the initial information, and based on a second degree of difference between the initial information and a second training sample; the second training sample and The first training sample corresponds to;
根据所述第二损失函数进行反向传导更新所述第一预设模型和所述第二预设模型。performing reverse conduction according to the second loss function to update the first preset model and the second preset model.
其中,所述第一训练样本可以为参考信号样本。所述参考信号样本可以为历史采集得到的原始参考信号、或者处理过的参考信号。再具体的,所述参考信号样本可以为下行参考信号样本。需要指出的是,本实施例并不限定所述第一训练样本一定为所述下行参考信号样本,还可以采用上行参考信号样本或其他参考信号样本,只是本实施例不做穷举。还需要指出的是,除了第一训练样本之外,输入所述第一预设模型的估计预设子模型的信息还可以无线信道或者场景相关的其他信息,比如可以包括以下至少之一:信道的信噪比、信干扰噪比、信道类型、带宽信息、时延信息等。关于在训练的过程中,输入以上信息的一种还是多种可以根据实际情况或实际场景相关,这里不对其进行限定。Wherein, the first training samples may be reference signal samples. The reference signal samples may be original reference signals or processed reference signals obtained through historical acquisition. More specifically, the reference signal samples may be downlink reference signal samples. It should be pointed out that this embodiment does not limit the first training samples to be the downlink reference signal samples, and uplink reference signal samples or other reference signal samples may also be used, but this embodiment does not make an exhaustive list. It should also be pointed out that, in addition to the first training samples, the information input into the estimated preset sub-model of the first preset model can also be wireless channel or other information related to the scene, for example, it can include at least one of the following: channel Signal-to-noise ratio, signal-to-interference-noise ratio, channel type, bandwidth information, delay information, etc. In the process of training, whether one or more of the above information is input may be relevant according to the actual situation or the actual scene, and it is not limited here.
所述第一预设模型的估计预设子模型的具体功能可以为:基于第一训练样本进行信道估计得到初始信息。其中,信道估计可以采用最小均方误差(MMSE)等算法。The specific function of the estimation preset sub-model of the first preset model may be: perform channel estimation based on the first training samples to obtain initial information. Wherein, the channel estimation may adopt algorithms such as minimum mean square error (MMSE).
上述初始信息可以为矩阵。该矩阵的维度这里不做限定,可以为二维或更多维度的矩阵。所述矩阵中的每一个位置上的数值用于表示对应多个维度的相应粒度下所对应的信道质量。The above initial information may be a matrix. The dimension of the matrix is not limited here, and may be a matrix of two or more dimensions. The value at each position in the matrix is used to represent the corresponding channel quality at the corresponding granularity corresponding to multiple dimensions.
所述第一预设模型的压缩预设子模型对输入信息的数据量进行压缩。所述压缩子模型的输出信息与输入信息之间的压缩率可以为训练时确定的,比如,压缩率可以为千分之五、千分之二、百分之十等等,这里不进行穷举。在上述处理中,所述压缩预设子模型得到的压缩后的信息包含的内容或数据量小于其输入的初始信息的数据量或内容。上述压缩后的信息与初始信息的形式为相同的,比如均为矩阵,所述初始信息与压缩后的信息的矩阵维度是相同的,但是数据内容(或数据量)是不同的。The compressed preset sub-model of the first preset model compresses the data volume of the input information. The compression rate between the output information of the compressed sub-model and the input information can be determined during training, for example, the compression rate can be 5/1000, 2/1000, 10%, etc. lift. In the above processing, the compressed information obtained by compressing the preset sub-model contains less content or data volume than the input initial information. The form of the above-mentioned compressed information is the same as that of the initial information, such as a matrix, and the matrix dimensions of the initial information and the compressed information are the same, but the data content (or data volume) is different.
所述第二预设模型的功能可以是对其输入信息进行解压缩得到恢复信息。第二预设模型的解压缩率应该使得其得到的恢复信息与初始信息包含相同的数据内容(或数据量)。The function of the second preset model may be to decompress its input information to obtain restoration information. The decompression rate of the second preset model should make the obtained restored information contain the same data content (or data amount) as the original information.
所述基于第二损失函数进行反向传导更新所述第一预设模型和所述第二预设模型具体可以指的是:基于第二损失函数进行反向传导更新所述估计预设子模型的模型参数、所述压缩预设子模型的模型参数和所述第二预设模型的模型参数。The performing reverse conduction based on the second loss function to update the first preset model and the second preset model may specifically refer to: performing reverse conduction based on the second loss function to update the estimated preset sub-model , the model parameters of the compressed preset sub-model and the model parameters of the second preset model.
针对上述训练还需要指出,关于上述训练收敛的方式可以包括以下至少之一:判断迭代训练的次数是否达到预设次数,判断第二损失函数确定的差异程度是否小于预设门限值。也就是说,基于上述方式确定训练完成时,可以将训练完成后的第一预设模型作为第一模型,将训练完成后的第二预设模型作为第二模型。Regarding the above training, it should also be pointed out that the way of the above training convergence may include at least one of the following: judging whether the number of iterative training reaches the preset number, and judging whether the degree of difference determined by the second loss function is less than a preset threshold. That is to say, when it is determined that the training is completed based on the above manner, the first preset model after the training can be used as the first model, and the second preset model after the training can be used as the second model.
情况六、与情况五不同在于,所述第一预设模型中包括估计预设子模型、预设信道生成子模型和压缩预设子模型。Case 6 is different from Case 5 in that the first preset model includes an estimation preset sub-model, a preset channel generation sub-model, and a compression preset sub-model.
本情况中各个模型的组成以及各个模型之间的输入输出关系与前述情况二相同,具体可以参见图8b,这里不做重复说明。The composition of each model in this case and the input-output relationship between each model are the same as those in the second case, for details, please refer to FIG. 8 b , which will not be repeated here.
所述网络设备采用训练样本对第一预设模型和第二预设模型进行联合训练,包括:The network device uses training samples to jointly train the first preset model and the second preset model, including:
所述网络设备将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The network device inputs the first training sample into the estimated preset sub-model of the first preset model, and obtains initial information output by the estimated preset sub-model;
将所述初始信息输入所述第一预设模型的信道生成预设子模型,得到所述信道生成预设子模型输出的初始信息的特征向量信息;inputting the initial information into the channel generation preset sub-model of the first preset model, and obtaining eigenvector information of the initial information output by the channel generation preset sub-model;
将所述初始信息的特征向量信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的特征向量信息;Inputting the eigenvector information of the initial information into the compressed preset sub-model of the first preset model to obtain the compressed eigenvector information output by the compressed preset sub-model;
将所述压缩后的特征向量信息输入所述第二预设模型,得到所述第二预设模型输出的恢复的特征向量信息;inputting the compressed feature vector information into the second preset model to obtain restored feature vector information output by the second preset model;
基于所述恢复的特征向量信息以及所述初始信息的特征向量信息之间的第一差异程度,以及基于所述初始信息与第二训练样本之间的第二差异程度,确定第二损失函数;所述第二训练样本与所述第一训练样本对应;determining a second loss function based on a first degree of difference between the restored feature vector information and the feature vector information of the initial information, and based on a second degree of difference between the initial information and a second training sample; The second training sample corresponds to the first training sample;
根据所述第二损失函数进行反向传导更新所述第一预设模型和所述第二预设模型。performing reverse conduction according to the second loss function to update the first preset model and the second preset model.
关于第一训练样本的具体说明与前述情况五相同,因此不做重复说明。还需要指出的是,除了第一训练样本之外,输入所述第一预设模型的估计预设子模型的信息还可以无线信道或者场景相关的其他信息,比如可以包括以下至少之一:信道的信噪比、信干扰噪比、信道类型、带宽信息、时延信息等。关于在训练的过程中,输入以上信息的一种还是多种可以根据实际情况或实际场景相关,这里不对其进行限定。The specific description about the first training sample is the same as that of the fifth case above, so the description will not be repeated. It should also be pointed out that, in addition to the first training samples, the information input into the estimated preset sub-model of the first preset model can also be wireless channel or other information related to the scene, for example, it can include at least one of the following: channel Signal-to-noise ratio, signal-to-interference-noise ratio, channel type, bandwidth information, delay information, etc. In the process of training, whether one or more of the above information is input may be relevant according to the actual situation or the actual scene, and it is not limited here.
所述第一预设模型的估计预设子模型的具体功能与前述情况五相同。The specific function of the estimation preset sub-model of the first preset model is the same as the fifth case above.
所述信道生成预设子模型的功能可以是对所述初始信息进行特征分解,得到所述初始信息的特征向量信息。其中,所述初始信息的特征向量信息可以包括R组特征向量序列。对所述初始信息进行特征分解的方式可以采用奇异值分解(SVD,Singular Value Decomposition)的方式。A function of the channel generation preset submodel may be to perform eigendecomposition on the initial information to obtain eigenvector information of the initial information. Wherein, the eigenvector information of the initial information may include R groups of eigenvector sequences. The method of performing eigendecomposition on the initial information may adopt a method of Singular Value Decomposition (SVD, Singular Value Decomposition).
所述第一预设模型的压缩预设子模型对输入信息的数据量进行压缩。所述压缩子模型的输出信息与输入信息之间的压缩率可以为训练时确定的,比如,压缩率可以为千分之五、千分之二、百分之十等等,这里不进行穷举。在上述处理中,所述压缩预设子模型输出的压缩后的特征向量信息包含的内容或数据量小于其输入的初始信息的特征向量信息的数据量或数据内容。上述压缩后的特征向量信息与初始信息的特征向量信息的形式为相同的,但是两者包含的数据量是不同的。The compressed preset sub-model of the first preset model compresses the data volume of the input information. The compression rate between the output information of the compressed sub-model and the input information can be determined during training, for example, the compression rate can be 5/1000, 2/1000, 10%, etc. lift. In the above processing, the content or data volume of the compressed feature vector information output by the compressed preset sub-model is smaller than the data volume or data content of the feature vector information of the input initial information. The form of the compressed feature vector information and the feature vector information of the initial information are the same, but the amount of data contained in the two is different.
所述第二预设模型的功能可以是对其输入信息进行解压缩得到恢复信息。在上述处理中,所述第二预设模型的输入信息为压缩后的特征向量信息,第二预设模型的输出为恢复的特征向量信息。第二预设模型的解压缩率应该使得其得到的恢复的特征向量信息与初始信息的特征向量信息包含相同的数据内容(或数据量)。The function of the second preset model may be to decompress its input information to obtain restoration information. In the above processing, the input information of the second preset model is compressed feature vector information, and the output of the second preset model is restored feature vector information. The decompression rate of the second preset model should make the obtained restored feature vector information contain the same data content (or data amount) as the feature vector information of the initial information.
根据所述第二损失函数进行反向传导更新所述第一预设模型和所述第二预设模型具体可以指的是:基于第二损失函数进行反向传导更新所述估计预设子模型的模型参数、所述信道生成预设子模型的模型参数、所述压缩预设子模型的模型参数和所述第二预设模型的模型参数。Performing reverse conduction according to the second loss function to update the first preset model and the second preset model may specifically refer to: performing reverse conduction based on the second loss function to update the estimated preset sub-model model parameters of the channel generation preset sub-model, model parameters of the compression preset sub-model and model parameters of the second preset model.
关于上述训练收敛的确定方式与前述情况五相同,不做重复说明。The method for determining the above-mentioned training convergence is the same as that of the above-mentioned case five, and repeated explanations are not repeated.
情况七,所述网络设备采用训练样本对第一预设模型和第二预设模型进行联合训练,得到训练后的所述第一模型和所述第二模型,包括:In case seven, the network device uses training samples to jointly train the first preset model and the second preset model to obtain the trained first model and the second model, including:
所述网络设备采用训练样本对第一预设模型、第二预设模型和第三预设模型进行联合训练,得到训练后的所述第一模型、所述第二模型和第三模型;其中,所述第三模型为训练后的第三预设模型。The network device uses training samples to jointly train the first preset model, the second preset model and the third preset model to obtain the trained first model, the second model and the third model; wherein , the third model is a trained third preset model.
所述网络设备采用训练样本对第一预设模型、第二预设模型和第三预设模型进行联合训练,包括:The network device uses training samples to jointly train the first preset model, the second preset model and the third preset model, including:
所述网络设备将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The network device inputs the first training sample into the estimated preset sub-model of the first preset model, and obtains initial information output by the estimated preset sub-model;
将所述初始信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的信息;inputting the initial information into the compressed preset sub-model of the first preset model to obtain compressed information output by the compressed preset sub-model;
将所述压缩后的信息输入所述预设模型的第三预设模型,得到所述第三预设模型输出的变换后的信息;inputting the compressed information into a third preset model of the preset model to obtain transformed information output by the third preset model;
将所述变换后的信息输入所述第二预设模型,得到所述第二预设模型输出的恢复信息;inputting the transformed information into the second preset model to obtain restoration information output by the second preset model;
基于所述恢复信息与所述初始信息之间的第一差异程度,以及基于所述初始信息与第二训练样本之间的第二差异程度,确定第二损失函数;所述第二训练样本与所述第一训练样本对应;Determine a second loss function based on a first degree of difference between the restored information and the initial information, and based on a second degree of difference between the initial information and a second training sample; the second training sample and The first training sample corresponds to;
根据所述第二损失函数进行反向传导更新所述第一预设模型、所述第二预设模型和所述第三预设模型。performing reverse conduction according to the second loss function to update the first preset model, the second preset model, and the third preset model.
本情况中各个模型的组成以及各个模型之间的输入输出关系与前述情况三相同,可以参见图8c,这里不做重复说明。The composition of each model in this case and the input-output relationship between each model are the same as those in the third case above, which can be referred to FIG. 8c, and repeated explanations are not repeated here.
关于第一训练样本的具体说明与前述情况五或情况六相同,因此不做重复说明。The specific description about the first training sample is the same as the foregoing case five or six, so no repeated description is given.
所述第一预设模型的估计预设子模型以及所述第一预设模型的压缩预设子模型的具体功能与前述情况五相同,因此不做重复说明。The specific functions of the estimation preset sub-model of the first preset model and the compression preset sub-model of the first preset model are the same as those of the fifth case above, so repeated descriptions will not be made.
在情况七中相对于情况五增加了第三预设模型,关于所述第三预设模型的功能为模拟信道环境,具体处理可以为对输入信息进行数据变换得到数据变换后的信息作为输出信息。其中,所述数据变换的具体处理方式可以包括卷积处理或者等效于卷积的数据处理;其中,所述等效于卷积的数据处理可以为多次傅里叶变换处理,比如,可以为将所述第三预设模型的输入信息通过傅里叶变换转换到频域后相乘再通过傅里叶反变换转换到时域,以此来等效时域卷积的处理。In case seven, a third preset model is added relative to case five. The function of the third preset model is to simulate the channel environment, and the specific processing can be to perform data transformation on input information to obtain data transformed information as output information . Wherein, the specific processing method of the data transformation may include convolution processing or data processing equivalent to convolution; wherein, the data processing equivalent to convolution may be multiple Fourier transform processing, for example, may In order to transform the input information of the third preset model into the frequency domain through Fourier transform, multiply and then transform into the time domain through inverse Fourier transform, the process of convolution in the time domain is equivalent.
所述第二预设模型的功能可以是对其输入信息进行解压缩得到恢复信息。The function of the second preset model may be to decompress its input information to obtain restoration information.
所述基于第二损失函数所确定的差异程度来进行反向传导更新更新所述第一预设模型、所述第二预设模型和所述第三预设模型具体可以指的是:基于第二损失函数进行反向传导更新所述估计预设子模型的模型参数、所述压缩预设子模型的模型参数、所述第二预设模型的模型参数和所述第三预设模型的模型参数。The performing reverse conduction update based on the degree of difference determined by the second loss function to update the first preset model, the second preset model, and the third preset model may specifically refer to: based on the first The second loss function performs reverse conduction to update the model parameters of the estimated preset sub-model, the model parameters of the compressed preset sub-model, the model parameters of the second preset model, and the model of the third preset model parameter.
关于上述训练收敛的方式与前述情况五或情况六相同,不做重复说明。The manner of the above-mentioned training convergence is the same as that of the foregoing case five or six, and no repeated description is made.
情况八,与上述情况七不同在于所述第一预设模型中包括估计预设子模型、预设信道生成子模型和压缩预设子模型。Case 8 is different from the above case 7 in that the first preset model includes an estimation preset submodel, a preset channel generation submodel, and a compression preset submodel.
所述网络设备采用训练样本对第一预设模型、第二预设模型和第三预设模型进行联合训练,包括:The network device uses training samples to jointly train the first preset model, the second preset model and the third preset model, including:
所述网络设备将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The network device inputs the first training sample into the estimated preset sub-model of the first preset model, and obtains initial information output by the estimated preset sub-model;
将所述初始信息输入所述第一预设模型的信道生成预设子模型,得到所述信道生成预设子模型输出的初始信息的特征向量信息;inputting the initial information into the channel generation preset sub-model of the first preset model, and obtaining eigenvector information of the initial information output by the channel generation preset sub-model;
将所述初始信息的特征向量信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的特征向量信息;Inputting the eigenvector information of the initial information into the compressed preset sub-model of the first preset model to obtain the compressed eigenvector information output by the compressed preset sub-model;
将所述压缩后的特征向量信息输入所述预设模型中的第三预设模型,得到所述第三预设模型输出的变换后的特征向量信息;inputting the compressed feature vector information into a third preset model among the preset models, to obtain transformed feature vector information output by the third preset model;
将所述变换后的特征向量信息输入所述第二预设模型,得到所述第二预设模型输出的恢复的特征向量信息;inputting the transformed eigenvector information into the second preset model to obtain restored eigenvector information output by the second preset model;
基于所述恢复的特征向量信息以及所述初始信息的特征向量信息之间的第一差异程度,以及基于所述初始信息与第二训练样本之间的第二差异程度,确定第二损失函数;所述第二训练样本与所述第一训练样本对应;determining a second loss function based on a first degree of difference between the restored feature vector information and the feature vector information of the initial information, and based on a second degree of difference between the initial information and a second training sample; The second training sample corresponds to the first training sample;
根据所述第二损失函数进行反向传导更新所述第一预设模型、所述第二预设模型和所述第三预设模型。performing reverse conduction according to the second loss function to update the first preset model, the second preset model, and the third preset model.
本情况中各个模型的组成以及各个模型之间的输入输出关系与前述情况四相同,可以参见图8d,这里不做重复说明。In this case, the composition of each model and the input-output relationship between each model are the same as the foregoing case 4, which can be referred to FIG. 8d , and repeated descriptions are not repeated here.
关于第一训练样本的具体说明与前述情况五、情况六、情况七中任意之一相同,因此不做重复说明。The specific description about the first training sample is the same as any one of the above-mentioned case 5, case 6, and case 7, so the description will not be repeated.
所述第一预设模型的估计预设子模型的具体功能与情况五、情况六、情况七中任意之一相同。The specific function of the estimated preset sub-model of the first preset model is the same as any one of the fifth, sixth, and seventh cases.
所述信道生成预设子模型的功能可以是对所述初始信息进行特征分解,得到所述初始信息的特征向量信息。其中,所述初始信息的特征向量信息可以包括R组特征向量序列。举例来说,对初始信息进行特征分解的方式可以采用奇异值分解(SVD,Singular Value Decomposition)的方式。A function of the channel generation preset submodel may be to perform eigendecomposition on the initial information to obtain eigenvector information of the initial information. Wherein, the eigenvector information of the initial information may include R groups of eigenvector sequences. For example, the method of performing eigendecomposition on the initial information may adopt a method of Singular Value Decomposition (SVD, Singular Value Decomposition).
所述第一预设模型的压缩预设子模型对输入信息的数据量进行压缩。所述压缩子模型的输出信息与输入信息之间的压缩率可以为训练时确定的,比如,压缩率可以为千分之五、千分之二、百分之十等等,这里不进行穷举。The compressed preset sub-model of the first preset model compresses the data volume of the input information. The compression rate between the output information of the compressed sub-model and the input information can be determined during training, for example, the compression rate can be 5/1000, 2/1000, 10%, etc. lift.
所述第三预设模型的功能为模拟无线信道环境,具体处理可以为对输入信息进行数据变换得到数据变换后的信息作为输出信息。其中,所述数据变换的具体处理方式可以包括卷积处理或者等效于卷积的数据处理;其中,所述等效于卷积的数据处理可以为多次傅里叶变换处理,比如,可以为将所述第三预设模型的输入信息通过傅里叶变换转换到频域后相乘再通过傅里叶反变换转换到时域,以此来等效时域卷积的处理。The function of the third preset model is to simulate the wireless channel environment, and the specific processing may be to perform data transformation on input information to obtain information after data transformation as output information. Wherein, the specific processing method of the data transformation may include convolution processing or data processing equivalent to convolution; wherein, the data processing equivalent to convolution may be multiple Fourier transform processing, for example, may In order to transform the input information of the third preset model into the frequency domain through Fourier transform, multiply and then transform into the time domain through inverse Fourier transform, the process of convolution in the time domain is equivalent.
所述第二预设模型的功能可以是对其输入信息进行解压缩得到恢复的特征向量信息。所述第二预设模型的输入信息为变换后的特征向量信息,第二预设模型的输出为恢复的特征向量信息。第二预设模型的解压缩率应该使得其得到的恢复的特征向量信息与初始信息的特征向量信息的包含接近或相同的数据内容。The function of the second preset model may be to decompress its input information to obtain restored feature vector information. The input information of the second preset model is transformed feature vector information, and the output of the second preset model is restored feature vector information. The decompression rate of the second preset model should be such that the obtained restored feature vector information and the feature vector information of the initial information contain data content close to or the same.
所述基于第一损失函数所确定的差异程度来进行反向传导更新更新所述第一预设模型、所述第二预设模型和所述第三预设模型具体可以指的是:基于第一损失函数所确定的差异程度来进行反向传导更新所述估计预设子模型的模型参数、所述信道生成预设子模型的模型参数、所述压缩预设子模型的模型参数、所述第二预设模型的模型参数和所述第三预设模型的模型参数。The performing reverse conduction update based on the degree of difference determined by the first loss function to update the first preset model, the second preset model, and the third preset model may specifically refer to: based on the first The degree of difference determined by a loss function is used to perform reverse conduction to update the model parameters of the estimated preset sub-model, the model parameters of the channel generation preset sub-model, the model parameters of the compressed preset sub-model, the The model parameters of the second preset model and the model parameters of the third preset model.
关于上述训练收敛的方式与前述情况五、情况六、情况七中任意之一相同,因此不做重复说明。The manner of the above-mentioned training convergence is the same as any one of the aforementioned cases 5, 6, and 7, so repeated explanations will not be made.
所述网络设备通过采用以上第二种方式可以得到自身联合训练后的第一模型、第二模型,或者得到联合训练后的第一模型、第二模型以及第三模型。进而可以执行前述S1110~S1130的处理。The network device can obtain the first model and the second model after its own joint training, or obtain the first model, the second model and the third model after joint training by adopting the above second method. Furthermore, the above-mentioned processing of S1110 to S1130 may be performed.
在上述第二种方式所提供的网络设备自身进行联合训练得到第一模型和第二模型,以及网络设备自身进行联合训练得到第一模型、第二模型和第三模型的处理中使用了训练样本,下面针对训练样本进行详细说明:The training samples are used in the process of the joint training of the network device itself to obtain the first model and the second model, and the joint training of the network device itself to obtain the first model, the second model and the third model provided by the second method above. , the following is a detailed description of the training samples:
所述训练样本中可以包含第一训练样本。所述第一训练样本可以为参考信号样本。所述参考信号样本可以为历史采集得到的原始参考信号、或者处理过的参考信号。其中,所述原始参考信号可以指的是未经过无线信道传输的参考信号。获取处理参考信号的方法可以包括:将原始参考信号通过无线信道(或真实无线信道、或实际无线信道)后接收到的参考信号作为处理过的参考信号。或者,获取处理参考信号的方法可以包括:将原始参考信号通过模拟的无线信道后接收的参考信号作为处理过的参考信号。再进一步地,原始参考信号可以为下行参考信号,或者上行参考信号。The training samples may include a first training sample. The first training samples may be reference signal samples. The reference signal samples may be original reference signals or processed reference signals obtained through historical acquisition. Wherein, the original reference signal may refer to a reference signal that has not been transmitted through a wireless channel. The method for acquiring and processing the reference signal may include: using the reference signal received after the original reference signal passes through the wireless channel (or the real wireless channel, or the real wireless channel) as the processed reference signal. Alternatively, the method for obtaining a processed reference signal may include: using a reference signal received after the original reference signal passes through a simulated wireless channel as a processed reference signal. Still further, the original reference signal may be a downlink reference signal or an uplink reference signal.
所述第一训练样本分布在第一维度和/或第二维度。The first training samples are distributed in the first dimension and/or the second dimension.
其中,所述第一维度为时域维度;所述第一训练样本包括在所述时域维度中的m个时间单元内分布的第一信息样本;m为正整数。其中,所述m个时间单元中每个时间单元中可以分布有n个第一信息样本,n为正整数。所述每个时间单元可以包含有至少一个时隙、或至少一个符号(比如OFDM符号)。Wherein, the first dimension is a time domain dimension; the first training samples include first information samples distributed in m time units in the time domain dimension; m is a positive integer. Wherein, n first information samples may be distributed in each of the m time units, where n is a positive integer. Each time unit may include at least one time slot, or at least one symbol (such as an OFDM symbol).
举例来说,所述第一信息样本为下行参考信号样本,每个时间单元内包含的时隙数量可以为c个(c为正整数),在每c个时隙内有n个下行参考信号样本,c和n的组合可以是例如(1,1)(1,2)(1,3)(1,4)(1,6)(2,1)(4,1)(5,1)(8,1)(10,1)。For example, the first information sample is a downlink reference signal sample, the number of time slots contained in each time unit may be c (c is a positive integer), and there are n downlink reference signals in each c time slot A sample, combination of c and n can be e.g. (1,1)(1,2)(1,3)(1,4)(1,6)(2,1)(4,1)(5,1) (8,1)(10,1).
所述第二维度为频域维度;所述第一训练样本包括在所述频域维度中的x个频域资源上分布的第一信息样本;x为正整数。其中,所述x个频域资源中每个频域资源中可以分布有y个第一信息样本,y为正整数。所述每个频域资源可以包含有至少一个资源块(RB)、或至少一个子载波。The second dimension is a frequency domain dimension; the first training samples include first information samples distributed on x frequency domain resources in the frequency domain dimension; x is a positive integer. Wherein, y first information samples may be distributed in each of the x frequency domain resources, and y is a positive integer. Each frequency domain resource may include at least one resource block (RB), or at least one subcarrier.
举例来说,所述第一信息样本为下行参考信号样本,每个频域资源内包含的时隙数量可以为d个(d为正整数),在频域上每d个RB内有y个下行参考信号样本,d和y的组合可以是例如(1,1)(1,2)(2,1)(1,3)(3,1)(1,4)(4,1)(1,6)(6,1)。For example, the first information sample is a downlink reference signal sample, and the number of time slots contained in each frequency domain resource may be d (d is a positive integer), and there are y time slots in every d RBs in the frequency domain Downlink reference signal samples, the combination of d and y can be, for example, (1,1)(1,2)(2,1)(1,3)(3,1)(1,4)(4,1)(1 ,6)(6,1).
上述第一训练样本分布在第一维度和/或第二维度,可以理解为可以仅使用第一训练样本在频域维度上的分布情况来进行后续的训练,也可以仅使用第一训练样本在时域维度上的分布情况来进行后续的训练,还可以使用第一训练样本在频域维度以及时域维度上的分布情况来进行后续的训练。比如,一个第一训练样本在频域维度上包含10个RB、在时域维度上包含1个时隙,每个RB中有3个第一信号样本,每个时隙有1个第一信号样本,则第一训练样本一共包含有30个第一信号样本。The above-mentioned first training samples are distributed in the first dimension and/or the second dimension. It can be understood that only the distribution of the first training samples in the frequency domain dimension can be used for subsequent training, or only the first training samples can be used in the frequency domain. The distribution of the first training sample in the frequency domain and the time domain can also be used for subsequent training. For example, a first training sample contains 10 RBs in the frequency domain dimension and 1 time slot in the time domain dimension, each RB has 3 first signal samples, and each time slot has 1 first signal samples, the first training samples include a total of 30 first signal samples.
上述第一维度和第二维度,即时域维度和频域维度的大小可以相等、也可以不相等。另外,也可以将上述时域维度和频域维度合并成为一个维度,具体合并是可以是先时域维度再频域维度,也可以是先频域维度再时域维度,本实施例不对其进行限定。The sizes of the first dimension and the second dimension, the time domain dimension and the frequency domain dimension may be equal or unequal. In addition, the above-mentioned time-domain dimension and frequency-domain dimension can also be combined into one dimension. Specifically, the combination can be the time-domain dimension first and then the frequency-domain dimension, or the frequency-domain dimension first and then the time-domain dimension, which is not implemented in this embodiment limited.
需要注意的是,因为原始参考信号、或者处理过的参考信号都可以是通过复数来呈现,所以本实施例提供的方案上述第一训练样本可以在上述第一维度和第二维度的基础上额外增加复数的呈现形式(或可以理解为增加一个维度,该维度是将原始参考信号、或者处理过的参考信号的虚部和实部数据独立呈现所造成的),具体的:所述第一训练样本还分布在第三维度。所述第三维度为复数维度;所述第一训练样本包括第一信息样本的实部和第一信息样本的虚部。举例来说,假设一个第一训练样本中在时域维度上包含1个时间单元(比如1个时隙),频域维度上包含10个频域资源(比如10个RB),每个第一信息样本可以表示为实部和虚部,则第一训练样本可以为一个1×10×2的矩阵。It should be noted that, because the original reference signal or the processed reference signal can be represented by complex numbers, the solution provided by this embodiment can be based on the above-mentioned first dimension and second dimension. Increase the presentation form of complex numbers (or it can be understood as adding a dimension, which is caused by the independent presentation of the imaginary part and real part data of the original reference signal or the processed reference signal), specifically: the first training The samples are also distributed in the third dimension. The third dimension is a complex dimension; the first training samples include the real part of the first information sample and the imaginary part of the first information sample. For example, assuming that a first training sample contains 1 time unit (such as 1 time slot) in the time domain dimension, and contains 10 frequency domain resources (such as 10 RBs) in the frequency domain dimension, each first The information sample can be expressed as a real part and an imaginary part, and the first training sample can be a 1×10×2 matrix.
所述训练样本中还包含与所述第一训练样本对应的第二训练样本;所述第二训练样本由T个维度的矩阵构成;T为大于等于2的整数。这里,所述第二训练样本可以用于表征基于所述第一训练样本所期望得到的信道质量、或称为信道响应、或称为信道状态、或称为信道估计结果、或称为信道信息。The training samples also include a second training sample corresponding to the first training sample; the second training sample is composed of a matrix of T dimensions; T is an integer greater than or equal to 2. Here, the second training samples may be used to characterize the expected channel quality based on the first training samples, or channel response, or channel state, or channel estimation results, or channel information .
所述T个维度中包含第四维度和第五维度。The T dimensions include a fourth dimension and a fifth dimension.
所述T个维度的矩阵具体可以为M×N的二维矩阵;其中,M表示在第四维度下的第一粒度的数量,N表示在第五维度下的第二粒度的数量;M和N均为正整数。The matrix of the T dimensions may specifically be a two-dimensional matrix of M×N; wherein, M represents the number of first granularities in the fourth dimension, and N represents the number of second granularities in the fifth dimension; M and N are all positive integers.
也就是说,一个第二训练样本由大小为M×N的二维矩阵构成,其在第四维度上有M个第一粒度,在第五维度上有N个第二粒度;上述M和N可以相等也可以不相等。所述二维矩阵内具体的数值指示代表信道质量某一个第一粒度下接收的信号强度,这里所述二维矩阵内的数值的单位可以是dBm,或所述二维矩阵内的数值没有单位而是归一化之后所得到的数值。此外,也可以将M×N的二维矩阵合成成为1×(M×N)大小或者(M×N)×1大小的一维数据,具体变换是可以是先第四维度再第五维度,也可以是先第五维度再第四维度,本实施例不对其进行限定。That is to say, a second training sample consists of a two-dimensional matrix with a size of M×N, which has M first granularities in the fourth dimension and N second granularities in the fifth dimension; the above M and N May or may not be equal. The specific numerical indication in the two-dimensional matrix represents the received signal strength at a certain first granularity of the channel quality, where the unit of the numerical value in the two-dimensional matrix may be dBm, or the numerical value in the two-dimensional matrix has no unit It is the value obtained after normalization. In addition, the two-dimensional matrix of M×N can also be synthesized into one-dimensional data of size 1×(M×N) or (M×N)×1. The specific transformation can be the fourth dimension first and then the fifth dimension. It may also be the fifth dimension first and then the fourth dimension, which is not limited in this embodiment.
可选地,所述第四维度为频域维度;所述第一粒度包含以下之一:L1个资源块RB,L2个子载波;L1和L2为正整数。或者,所述第四维度为时域维度;所述第一粒度包含以下之一:K1个微秒、K2个符号长度、K3个符号的采样点个数;K1、K2和K3为正整数。所述符号为正交频分复用符号(OFDM,Orthogonal Frequency Division Multiplexing)。这里,第四维度为时域维度的时候,所述第一粒度还可以称为时延粒度。Optionally, the fourth dimension is a frequency domain dimension; the first granularity includes one of the following: L1 resource blocks RB, L2 subcarriers; L1 and L2 are positive integers. Alternatively, the fourth dimension is a time domain dimension; the first granularity includes one of the following: K1 microseconds, K2 symbol length, and K3 symbol sampling points; K1, K2, and K3 are positive integers. The symbols are Orthogonal Frequency Division Multiplexing (OFDM, Orthogonal Frequency Division Multiplexing). Here, when the fourth dimension is a time domain dimension, the first granularity may also be called a delay granularity.
举例来说,在第一训练样本为参考信号样本或下行参考信号样本的时候,所述第二训练样本可以为与所述参考信号样本所对应的信道信息样本,或者还可以称为信道状态样本等等,这里不对其名称进行穷举。当第四维度是频域维度时,第一粒度可以是L1个RB(L1大于等于1,例如2RB,4RB,8RB),则一个第二训练样本所指示的频域范围是M×L1的频域范围;或者第一粒度可以是L2个子载波(L2大于1,例如4个子载波,6个子载波,18个子载波),则一个第二训练样本所指示的频域范围是M×L2的频域范围。当第四维度是时域维度时,第一粒度可以是时延粒度,例如一个第一粒度是K1个微秒、或者K2个符号长度、或者K3个符号的采样点个数,这里所述符号可以是一个OFDM符号;当第四维度是时域维度且第一粒度为K1个微秒时,一个第二训练样本所指示的时域范围是M×K1的时域范围。For example, when the first training sample is a reference signal sample or a downlink reference signal sample, the second training sample may be a channel information sample corresponding to the reference signal sample, or may also be called a channel state sample Wait, I'm not going to exhaust the names here. When the fourth dimension is the frequency domain dimension, the first granularity can be L1 RBs (L1 is greater than or equal to 1, such as 2RB, 4RB, 8RB), and the frequency domain range indicated by a second training sample is M×L1 or the first granularity can be L2 subcarriers (L2 is greater than 1, such as 4 subcarriers, 6 subcarriers, and 18 subcarriers), then the frequency domain range indicated by a second training sample is the frequency domain of M×L2 scope. When the fourth dimension is a time-domain dimension, the first granularity may be a delay granularity, for example, a first granularity is the number of sampling points of K1 microseconds, or K2 symbol lengths, or K3 symbols, where the symbols It may be an OFDM symbol; when the fourth dimension is the time domain dimension and the first granularity is K1 microseconds, the time domain range indicated by a second training sample is the time domain range of M×K1.
所述第五维度为空间域维度;相应的,所述第二粒度为一对收发天线或到达角度的间隔。The fifth dimension is a space domain dimension; correspondingly, the second granularity is an interval between a pair of transmitting and receiving antennas or an angle of arrival.
举例来说,第五维度为所述空间域维度,具体地可以是天线维度,例如第五维度上由N个天线对构成,相应的,第二粒度是一对收发天线。或者,第五维度为空间域维度,具体的可以是角度域维度,例如第五维度上由N个到达角度构成,第二粒度是上述N个到达角度之间的到达角度的间隔大小。For example, the fifth dimension is the space domain dimension, specifically, the antenna dimension, for example, the fifth dimension is composed of N antenna pairs, and correspondingly, the second granularity is a pair of transmitting and receiving antennas. Alternatively, the fifth dimension is a space domain dimension, specifically an angle domain dimension, for example, the fifth dimension is composed of N arrival angles, and the second granularity is the interval between the above N arrival angles.
再进一步地,所述二维矩阵中的第ij个位置的数值用于表示在所述第四维度下的第i个第一粒度以及第五维度的第j个第二粒度下的信道质量;i和j均为正整数。也就是说,在使用一个第一训练样本的情况下,用于表示第二训练样本的所述二维矩阵中某一个位置处的数值(或称为指示值)代表了在第四维度以及第五维度这样的组合下的所期望得到的信道质量情况。其中,信道质量或信道质量情况可以采用信号强度来表征,其数值(或称为指示值)的单位可以是dBm,或没有单位而是归一化之后所得到的数值。Still further, the value at the ijth position in the two-dimensional matrix is used to represent the channel quality at the i-th first granularity in the fourth dimension and the j-th second granularity in the fifth dimension; Both i and j are positive integers. That is to say, in the case of using a first training sample, the value (or referred to as an indicator value) at a certain position in the two-dimensional matrix used to represent the second training sample represents the The expected channel quality situation under such a combination of five dimensions. Wherein, the channel quality or the channel quality situation can be characterized by signal strength, and the unit of the value (or indicator value) can be dBm, or there is no unit but a value obtained after normalization.
例如,结合图9来说,在M×N的二维矩阵中,若第四维度表示频域维度,第五维度为空间域维度具体为天线维度,第一粒度为2RB,第二粒度为1对收发天线;若M×N的二维矩阵中的第ij个位置为第i=3j=6个位置,则为图9中所示出的第3行第6列上黑色方框所在位置,该位置处的数值(或称为指示值)可以用于表示第6对收发天线上的第3个2RB带宽(也就是第5个RB至第6个RB)上的信道质量(或信道质量情况)。另外,图9中还可以用S来表示第二训练样本的数量,S可以为大于等于1的整数,也就是说,第二训练样本可以包含一个或多个。For example, in conjunction with Figure 9, in the M×N two-dimensional matrix, if the fourth dimension represents the frequency domain dimension, the fifth dimension is the space domain dimension, specifically the antenna dimension, the first granularity is 2RB, and the second granularity is 1 For the transceiver antenna; if the ij position in the two-dimensional matrix of M×N is the i=3j=6 position, then it is the position of the black box on the 3rd row and the 6th column shown in Fig. 9, The value (or indicator value) at this position can be used to represent the channel quality (or channel quality situation) on the third 2RB bandwidth (that is, the fifth RB to the sixth RB) on the sixth pair of transceiver antennas ). In addition, in FIG. 9 , S may also be used to represent the number of second training samples, and S may be an integer greater than or equal to 1, that is, the second training samples may include one or more.
再例如,在图10中展示的M×N的二维矩阵中,第四维度表示时域维度,第四维度为时域维度的时候,所述第一粒度为1个时延粒度;第五维度为空间域维度具体为角度维度,第二粒度为1个角度基本粒度(比如可以是1个到达角度的间隔);若M×N的二维矩阵中的第ij个位置为第i=4j=5个位置,则为图10中所示出的中的第4行第5列上黑色方框所在位置,该位置处的数值(或指示值)可以表示第5个空间粒度(例如到达角度的间隔)内的第4个时延粒度上的信道质量(或信道质量情况)。For another example, in the M×N two-dimensional matrix shown in FIG. 10 , the fourth dimension represents the time domain dimension, and when the fourth dimension is the time domain dimension, the first granularity is one delay granularity; the fifth The dimension is the spatial domain dimension, specifically the angle dimension, and the second granularity is the basic granularity of 1 angle (for example, it can be the interval of 1 arrival angle); if the ij-th position in the M×N two-dimensional matrix is i=4j = 5 positions, then it is the position of the black box on the 4th row and 5th column shown in Fig. The channel quality (or channel quality situation) at the 4th delay granularity within the interval of .
所述T个维度中还包括第六维度。相应的,所述T个维度的矩阵为M×N×W的三维矩阵;其中,M表示在第四维度下的第一粒度的数量,N表示在第五维度下的第二粒度的数量,W表示在第六维度下的第三粒度的数量;M、N和W均为正整数。The T dimensions also include a sixth dimension. Correspondingly, the matrix of T dimensions is a three-dimensional matrix of M×N×W; wherein, M represents the number of first granularities in the fourth dimension, and N represents the number of second granularities in the fifth dimension, W represents the quantity of the third granularity under the sixth dimension; M, N and W are all positive integers.
所述三维矩阵中的第ijk个位置的数值用于表示在所述第四维度下的第i个第一粒度、第五维度的第j个第二粒度下、所述第六维度的第k个第三粒度下与所述第一训练样本所对应的信道质量;i、j和k均为正整数。The value of the ijkth position in the three-dimensional matrix is used to represent the i-th first granularity in the fourth dimension, the j-th second granularity in the fifth dimension, and the k-th in the sixth dimension channel quality corresponding to the first training sample at a third granularity; i, j and k are all positive integers.
其中,关于第四维度及其第一粒度,第五维度及其第二粒度的说明与前述实施例相同,这里不再重复说明。Wherein, the explanations about the fourth dimension and its first granularity, the fifth dimension and its second granularity are the same as those in the foregoing embodiments, and will not be repeated here.
本实施例中,所述第六维度可以为复数维度。这是由于所述第二训练样本可以用于表征基于所述第一训练样本所期望得到的信道质量(或称为信道响应、或称为信道状态、或称为信道估计结果、或称为信道信息),而上述信道质量还可以通过复数来呈现,因此可以在所述第二训练样本的以上两个维度的基础上增加一个第六维度即复数维度,该复数维度是将所述第二训练样本中的信道质量的虚部和实部独立呈现所产生的。In this embodiment, the sixth dimension may be a complex dimension. This is because the second training samples can be used to characterize the expected channel quality based on the first training samples (or called channel response, or called channel state, or called channel estimation result, or called channel information), and the above-mentioned channel quality can also be presented by a complex number, so a sixth dimension, that is, a complex number dimension, can be added on the basis of the above two dimensions of the second training sample, and the complex number dimension is the second training sample. The imaginary and real parts of the channel quality in the samples are presented independently generated.
具体来说,所述第六维度为复数维度;所述第三粒度为1,在所述复数维度下的第三粒度的数量W为2。其中,所述第三粒度为1具体指的是一个实部或一个虚部,所述第三粒度的数量为2指的是在复数维度下可以存在2个第三粒度。Specifically, the sixth dimension is a complex dimension; the third granularity is 1, and the quantity W of the third granularity under the complex dimension is 2. Wherein, the third granularity being 1 specifically refers to a real part or an imaginary part, and the number of the third granularity being 2 means that there may be two third granularities in the complex dimension.
所述k为第一值的情况下,所述三维矩阵中的第ijk个位置的数值用于表示在所述第四维度下的第i个第一粒度、所述第五维度的第j个第二粒度下的信道质量的实部;When the k is the first value, the value of the ijkth position in the three-dimensional matrix is used to represent the i-th first granularity in the fourth dimension and the j-th granularity in the fifth dimension The real part of the channel quality at the second granularity;
所述k为第二值的情况下,所述三维矩阵中的第ijk个位置的数值用于表示在所述第四维度下的第i个第一粒度、所述第五维度的第j个第二粒度下的信道质量的虚部。When the k is the second value, the value of the ijkth position in the three-dimensional matrix is used to represent the i-th first granularity in the fourth dimension and the j-th granularity in the fifth dimension The imaginary part of the channel quality at the second granularity.
其中,所述第一值与所述第二值不同,比如可以设置第一值为1第二值为2,又或者,第一值可以为0第二值可以为1,再或者第一值可以为1第二值可以为0,只要第一值与第二值不同则在本实施例的保护范围内。Wherein, the first value is different from the second value, for example, the first value can be set to 1 and the second value can be 2, or the first value can be 0 and the second value can be 1, or the first value It can be 1 and the second value can be 0, as long as the first value is different from the second value, it is within the protection scope of this embodiment.
举例来说,M×N×W的三维矩阵中,第四维度表示时域维度,第四维度为时域维度的时候,所述第一粒度还可以称为时延粒度;第五维度为空间域维度具体为角度维度,第二粒度为到达角度的间隔;第六维度为复数维度,W为2,k为1表示实部k为2表示虚部。若i=4、j=5、k=1,则表示第4行第5列上的数值(或指示值)为第5个空间粒度(例如到达角度的间隔)内的第4个时延粒度上的信道质量(或信道质量情况)的实部。若i=4、j=5、k=2,则表示第4行第5列上的数值(或指示值)为第5个空间粒度(例如到达角度的间隔)内的第4个时延粒度上的信道质量(或信道质量情况)的虚部。For example, in a three-dimensional matrix of M×N×W, the fourth dimension represents the time domain dimension, and when the fourth dimension is the time domain dimension, the first granularity can also be called the delay granularity; the fifth dimension is the space The domain dimension is specifically the angle dimension, and the second granularity is the interval of the arrival angle; the sixth dimension is the complex number dimension, W is 2, k is 1 for the real part and 2 for the imaginary part. If i=4, j=5, k=1, it means that the value (or indicator value) on the 4th row and 5th column is the 4th delay granularity in the 5th spatial granularity (such as the interval of arrival angle) The real part of the channel quality (or channel quality situation) on . If i=4, j=5, k=2, it means that the value (or indicator value) on the 4th row and 5th column is the 4th delay granularity in the 5th spatial granularity (such as the interval of arrival angle) The imaginary part of the channel quality (or channel quality situation) on .
此外,还需要注意的是,上述第二训练样本还可以是在上述第四维度、第五维度和第六维度的基础上的拆分与组合,例如当第五维度是天线对维度时,还可以拆分成为发送天线子维度和接收天线子维度,从而扩展上述第二训练样本的维度,本实施例不再对拆分后的各种可能存在的子维度进行穷举。In addition, it should be noted that the above-mentioned second training samples can also be split and combined on the basis of the above-mentioned fourth dimension, fifth dimension, and sixth dimension. For example, when the fifth dimension is an antenna pair dimension, the It can be split into sending antenna sub-dimensions and receiving antenna sub-dimensions, thereby expanding the dimension of the second training sample. This embodiment does not exhaustively enumerate various possible sub-dimensions after splitting.
在上述第二种方式中,网络设备自身对第一预设模型、第二预设模型进行联合训练得到训练后的第一模型和第二模型;或由所述网络设备自身对第一预设模型、第二预设模型和第三预设模块联合训练得到训练后的第一模型、第二模型以及第三模型。这种方式下,所述网络设备至少还可以发送训练后的第二模型。下面对网络设备进行模型发送的处理进行示例说明:In the above-mentioned second method, the network device itself performs joint training on the first preset model and the second preset model to obtain the trained first model and the second model; or the network device itself trains the first preset model The model, the second preset model and the third preset module are jointly trained to obtain the trained first model, the second model and the third model. In this way, the network device can at least send the trained second model. The following is an example of how network devices send models:
示例一、Example one,
所述网络设备自身对第一预设模型、第二预设模型进行联合训练得到训练后的第一模型和第二模型。完成上述训练之后,所述网络设备发送所述第二模型。具体可以是:所述网络设备向终端设备发送所述第二模型。再进一步地,还可以是:所述网络设备向所述终端设备发送所述第二模型的模型参数。The network device itself performs joint training on the first preset model and the second preset model to obtain the trained first model and the second model. After the above training is completed, the network device sends the second model. Specifically, it may be: the network device sends the second model to the terminal device. Still further, it may also be: the network device sends the model parameters of the second model to the terminal device.
其中,所述网络设备可以是为所述终端设备提供服务的网络设备,比如接入网设备,具体可以是基站、eNB、gNB等。Wherein, the network device may be a network device that provides services for the terminal device, such as an access network device, and specifically may be a base station, eNB, gNB, and the like.
所述第二模型(或第二模型的模型参数)由以下至少之一携带:下行控制信令、MAC CE消息、RRC消息、广播消息、下行数据传输、针对人工智能类业务传输需求的下行数据传输。The second model (or the model parameters of the second model) is carried by at least one of the following: downlink control signaling, MAC CE message, RRC message, broadcast message, downlink data transmission, downlink data for artificial intelligence business transmission requirements transmission.
这种示例中,所述网络设备可以自身保留所述第二模型,以用于对第二信息进行处理得到信道信息;相应的,终端设备由于可以接收到的第一模型,因此,所述终端设备可以基于所述第一模型对所述第一信息进行处理以得到第二信息。In this example, the network device may retain the second model by itself to process the second information to obtain channel information; correspondingly, due to the first model that the terminal device can receive, the terminal The device may process the first information based on the first model to obtain second information.
需要说明的是,由于一个网络设备可以服务多个终端设备,因此,所述网络设备可以向其服务的全部终端设备(或至少部分终端设备)发第一模型。以网络设备为基站、终端设备为手机为例来说,基站1可以服务3个手机,分别为手机1、手机2和手机3,基站1可以分别向手机1、手机2和手机3发第一模型。It should be noted that, since one network device may serve multiple terminal devices, the network device may send the first model to all terminal devices (or at least some terminal devices) it serves. Taking the network device as the base station and the terminal device as the mobile phone as an example, base station 1 can serve three mobile phones, namely mobile phone 1, mobile phone 2 and mobile phone 3, and base station 1 can send the first call to mobile phone 1, mobile phone 2 and mobile phone 3 respectively Model.
示例二、Example two,
所述网络设备自身对第一预设模型、第二预设模型进行联合训练得到训练后的第一模型和第二模型。完成上述训练之后,在所述网络设备发送所述第二模型的基础上,还可以包括:所述网络设备还发送所述第一模型。The network device itself performs joint training on the first preset model and the second preset model to obtain the trained first model and the second model. After the above training is completed, after the network device sends the second model, the method may further include: the network device also sends the first model.
具体可以是:所述网络设备向终端设备发送所述第一模型。再进一步地,还可以是:所述网络设备向所述终端设备发送所述第一模型的模型参数。Specifically, it may be: the network device sends the first model to the terminal device. Still further, it may also be: the network device sends the model parameters of the first model to the terminal device.
其中,所述网络设备可以是为所述终端设备提供服务的网络设备,比如接入网设备,具体可以是基站、eNB、gNB等。Wherein, the network device may be a network device that provides services for the terminal device, such as an access network device, and specifically may be a base station, eNB, gNB, and the like.
所述第一模型(或第一模型的模型参数)由以下至少之一携带:下行控制信令、MAC CE消息、RRC消息、广播消息、下行数据传输、针对人工智能类业务传输需求的下行数据传输。The first model (or the model parameters of the first model) is carried by at least one of the following: downlink control signaling, MAC CE message, RRC message, broadcast message, downlink data transmission, downlink data for artificial intelligence business transmission requirements transmission.
上述第一模型与第二模型可以同时发送,或者上述第一模型与第二模型可以分别发送,本实施例不对其进行限定。The above-mentioned first model and the second model may be sent at the same time, or the above-mentioned first model and the second model may be sent separately, which is not limited in this embodiment.
所述终端设备接收网络设备发来的第一模型和第二模型之后,所述终端设备可以对第一模型以及第二模型进行整体评估,在完成第一模型以及第二模型的整体评估之后,可以决定是否使用本次接收到的第一模型以及第二模型,若整体评估结果较差(比如压缩率较低或者恢复信道信息的准确率较低等等),可以不使用上述第一模型和第二模型。若终端设备决定不使用上述第一模型和第二模型,还可以自身对第一模型以及第二模型重新进行联合训练以更新第一模型以及第二模型的模型参数,或者,终端设备自己训练得到新的第一模型以及第二模型。需要指出,若终端设备重新联合训练第一模型以及第二模型,或者更新第一模型以及第二模型,则所述终端设备还需要将新的第一模型和第二模型发送至所述网络设备,或者将新的第一模型发送至网络设备。相应的,若网络设备接收到该新的第一模型和新的第二模型之后,确定该新的第一模型和新的第二模型的整体性能更优,还可以替换自身生成的第一模型和第二模型,并将新的第一模型和新的第二模型同步给自身服务的其他终端设备。否则,所述网络设备可以指定该终端设备不使用新的第一模型和新的第二模型,以保证网络设备与终端设备之间交互的信息能够得到正确的传输以及解析。After the terminal device receives the first model and the second model sent by the network device, the terminal device can perform an overall evaluation of the first model and the second model, and after completing the overall evaluation of the first model and the second model, It can be decided whether to use the first model and the second model received this time. If the overall evaluation result is poor (for example, the compression rate is low or the accuracy of recovering channel information is low, etc.), the above-mentioned first model and the second model may not be used. Second model. If the terminal device decides not to use the above-mentioned first model and the second model, it can also re-train the first model and the second model to update the model parameters of the first model and the second model, or the terminal device trains itself to obtain New first model as well as second model. It should be pointed out that if the terminal device jointly trains the first model and the second model again, or updates the first model and the second model, the terminal device also needs to send the new first model and the second model to the network device , or send the new first model to the network device. Correspondingly, if the network device determines that the overall performance of the new first model and the new second model is better after receiving the new first model and the new second model, it can also replace the first model generated by itself and the second model, and synchronize the new first model and the new second model to other terminal devices served by itself. Otherwise, the network device may specify that the terminal device does not use the new first model and the new second model, so as to ensure that information exchanged between the network device and the terminal device can be correctly transmitted and parsed.
示例三、Example three,
所述网络设备自身对第一预设模型、第二预设模型进行联合训练得到训练后的第一模型和第二模型。本示例与示例二的不同之处在于,完成上述训练之后,在所述网络设备发送所述第二模型的基础上,所述网络设备可以发送所述第一模型中的估计子模型以及压缩子模型。比如,所述网络设备可以同时发送所述第一模型中的估计子模型以及压缩子模型;或者,所述网络设备可以分别发送所述第一模型中的估计子模型以及压缩子模型。The network device itself performs joint training on the first preset model and the second preset model to obtain the trained first model and the second model. The difference between this example and Example 2 is that after the above training is completed, on the basis of the network device sending the second model, the network device can send the estimation sub-model and the compression sub-model in the first model Model. For example, the network device may send the estimated sub-model and the compressed sub-model in the first model at the same time; or, the network device may send the estimated sub-model and the compressed sub-model in the first model respectively.
再具体来说,所述网络设备可以同时向终端设备发送所述第一模型中的估计子模型以及压缩子模型;或者,所述网络设备可以分别向终端设备发送所述第一模型中的估计子模型以及压缩子模型。More specifically, the network device may send the estimated sub-model and the compressed sub-model in the first model to the terminal device at the same time; or, the network device may send the estimated sub-model in the first model to the terminal device respectively. Submodels and compressed submodels.
进一步地,所述网络设备可以同时向终端设备发送所述第一模型中的估计子模型的模型参数以及压缩子模型的模型参数;或者,所述网络设备可以分别向终端设备发送所述第一模型中的估计子模型的模型参数以及压缩子模型的模型参数。Further, the network device may send the model parameters of the estimation sub-model and the model parameters of the compression sub-model in the first model to the terminal device at the same time; or, the network device may send the first model parameters to the terminal device respectively. Model parameters for the estimated submodel and model parameters for the compressed submodel in the model.
所述估计子模型和所述压缩子模型可由以下至少之一同时携带:下行控制信令、MAC CE消息、RRC消息、广播消息、下行数据传输、针对人工智能类业务传输需求的下行数据传输;The estimation sub-model and the compression sub-model may be carried by at least one of the following: downlink control signaling, MAC CE message, RRC message, broadcast message, downlink data transmission, downlink data transmission for artificial intelligence business transmission requirements;
或者,所述估计子模型和所述压缩子模型可由以下至少之一分别携带:下行控制信令、MAC CE消息、RRC消息、广播消息、下行数据传输、针对人工智能类业务传输需求的下行数据传输。Alternatively, the estimation sub-model and the compression sub-model may be respectively carried by at least one of the following: downlink control signaling, MAC CE message, RRC message, broadcast message, downlink data transmission, downlink data for artificial intelligence business transmission requirements transmission.
进一步地,所述终端设备接收网络设备发来的第一模型中的估计子模型以及压缩子模型和第二模型之后,所述终端设备可以对估计子模型、压缩子模型以及第二模型进行整体评估,在完成估计子模型、压缩子模型以及第二模型的整体评估之后,可以决定是否使用本次接收到的估计子模型、压缩子模型以及第二模型,具体的处理方式与前述示例三相似,这里不做重复说明。Further, after the terminal device receives the estimated sub-model, the compressed sub-model and the second model in the first model sent by the network device, the terminal device may integrate the estimated sub-model, the compressed sub-model and the second model Evaluation, after completing the overall evaluation of the estimated sub-model, compressed sub-model, and second model, you can decide whether to use the estimated sub-model, compressed sub-model, and second model received this time. The specific processing method is similar to the previous example three , which will not be repeated here.
示例四、Example four,
所述网络设备自身对第一预设模型、第二预设模型进行联合训练得到训练后的第一模型和第二模型;其中,第一模型包含估计子模型、压缩子模型以及信道生成子模型。本示例与示例三的不同之处在于,完成上述训练之后,在所述网络设备发送所述第二模型的基础上,所述网络设备可以发送所述第一模型中的估计子模型、压缩子模型以及信道生成子模型。比如,所述网络设备可以同时发送所述第一模型中的估计子模型、压缩子模型以及信道生成子模型;或者,所述网络设备可以分别发送所述估计子模型、压缩子模型以及信道生成子模型。The network device itself performs joint training on the first preset model and the second preset model to obtain the trained first model and the second model; wherein, the first model includes an estimation sub-model, a compression sub-model and a channel generation sub-model . The difference between this example and Example 3 is that after the above training is completed, on the basis of the network device sending the second model, the network device can send the estimation sub-model, compression sub-model in the first model model and the channel generation submodel. For example, the network device may simultaneously send the estimation sub-model, the compression sub-model and the channel generation sub-model in the first model; or, the network device may separately send the estimation sub-model, the compression sub-model and the channel generation sub-model submodel.
再具体来说,所述网络设备可以同时向终端设备发送所述第一模型中的估计子模型、压缩子模型以及信道生成子模型;或者,所述网络设备可以分别向终端设备发送所述第一模型中的估计子模型、压缩子模型以及信道生成子模型;再或者,所述网络设备可以向终端设备同时发送所述第一模型中的估计子模型、压缩子模型以及信道生成子模型中的任意之二,再发送剩余的一个子模型。More specifically, the network device may send the estimation sub-model, the compression sub-model and the channel generation sub-model in the first model to the terminal device at the same time; or, the network device may send the first model to the terminal device respectively. The estimation sub-model, the compression sub-model and the channel generation sub-model in a model; or, the network device may simultaneously send the estimation sub-model, the compression sub-model and the channel generation sub-model in the first model to the terminal device Any two of , and then send the remaining sub-model.
所述估计子模型、压缩子模型以及信道生成子模型可由以下至少之一同时携带:下行控制信令、MAC CE消息、RRC消息、广播消息、下行数据传输、针对人工智能类业务传输需求的下行数据传输;The estimation sub-model, compression sub-model and channel generation sub-model may be simultaneously carried by at least one of the following: downlink control signaling, MAC CE message, RRC message, broadcast message, downlink data transmission, downlink for artificial intelligence business transmission requirements data transmission;
或者,所述估计子模型、压缩子模型以及信道生成子模型可由以下至少之一分别携带:下行控制信令、MAC CE消息、RRC消息、广播消息、下行数据传输、针对人工智能类业务传输需求的下行数据传输。Alternatively, the estimation sub-model, compression sub-model and channel generation sub-model may be respectively carried by at least one of the following: downlink control signaling, MAC CE message, RRC message, broadcast message, downlink data transmission, transmission requirements for artificial intelligence services downlink data transmission.
或者,所述估计子模型、压缩子模型以及信道生成子模型中的任意之二,可由以下至少之一同时携带:下行控制信令、MAC CE消息、RRC消息、广播消息、下行数据传输、针对人工智能类业务传输需求的下行数据传输。剩余的一个子模型可由以下至少之一携带:下行控制信令、MAC CE消息、RRC消息、广播消息、下行数据传输、针对人工智能类业务传输需求的下行数据传输。Alternatively, any two of the estimation submodel, the compression submodel, and the channel generation submodel may be simultaneously carried by at least one of the following: downlink control signaling, MAC CE message, RRC message, broadcast message, downlink data transmission, for Downlink data transmission required for artificial intelligence business transmission. The remaining sub-model can be carried by at least one of the following: downlink control signaling, MAC CE message, RRC message, broadcast message, downlink data transmission, downlink data transmission for artificial intelligence business transmission requirements.
所述终端设备接收网络设备发来的第一模型中的估计子模型、压缩子模型以及信道生成子模型和第二模型之后,所述终端设备可以对估计子模型、压缩子模型以及信道生成子模型以及第二模型进行整体评估,在完成整体评估之后的处理与前述示例三相似,这里不再赘述。After the terminal device receives the estimated sub-model, the compressed sub-model, the channel generation sub-model and the second model in the first model sent by the network device, the terminal device may perform the estimation sub-model, the compressed sub-model and the channel generation sub-model The overall evaluation is performed on the model and the second model, and the processing after the overall evaluation is similar to that of Example 3 above, and will not be repeated here.
示例五、Example five,
所述网络设备自身对第一预设模型、第二预设模型以及第三预设模型进行联合训练得到训练后的第一模型、第二模型和第三模型。完成上述训练之后,在所述终端设备发送所述第二模型、以及第一模型的基础上,所述网络设备可以发送所述第三模型。The network device itself performs joint training on the first preset model, the second preset model and the third preset model to obtain the trained first model, the second model and the third model. After the above training is completed, on the basis that the terminal device sends the second model and the first model, the network device may send the third model.
再具体来说,所述网络设备可以同时向终端设备发送所述第一模型、第二模型和第三模型;或者,所述网络设备可以分别向终端设备发送所述第一模型、第二模型和第三模型;又或者,所述网络设备可以先向终端设备发送所述第一模型、第二模型和第三模型中的任意之二,再向终端设备发送剩余的一个模型。More specifically, the network device may send the first model, the second model and the third model to the terminal device at the same time; or, the network device may send the first model and the second model to the terminal device respectively and the third model; or, the network device may first send any two of the first model, the second model, and the third model to the terminal device, and then send the remaining one model to the terminal device.
所述第一模型、第二模型和第三模型可由以下至少之一同时携带:下行控制信令、MAC CE消息、RRC消息、广播消息、下行数据传输、针对人工智能类业务传输需求的下行数据传输;The first model, the second model and the third model may be simultaneously carried by at least one of the following: downlink control signaling, MAC CE message, RRC message, broadcast message, downlink data transmission, downlink data for artificial intelligence business transmission requirements transmission;
或者,所述第一模型、第二模型和第三模型可由以下至少之一分别携带:下行控制信令、MAC CE消息、RRC消息、广播消息、下行数据传输、针对人工智能类业务传输需求的下行数据传输;Alternatively, the first model, the second model and the third model may be respectively carried by at least one of the following: downlink control signaling, MAC CE message, RRC message, broadcast message, downlink data transmission, and information for artificial intelligence business transmission requirements downlink data transmission;
或者,所述第一模型、第二模型和第三模型中的任意之二以及剩余的一个模型分别由以下至少之一携带:下行控制信令、MAC CE消息、RRC消息、广播消息、下行数据传输、针对人工智能类业务传输需求的下行数据传输。Or, any two of the first model, the second model and the third model and the remaining one model are respectively carried by at least one of the following: downlink control signaling, MAC CE message, RRC message, broadcast message, downlink data Transmission, downlink data transmission for artificial intelligence business transmission requirements.
可选地,本示例中,所述第一模型可以包含估计子模型和压缩子模型,相应的,所述发送第一模型可以指的是,同时或分别发送估计子模型和压缩子模型,关于估计子模型和压缩子模型的携带方式与前述示例相同这里不做重复说明。Optionally, in this example, the first model may include an estimation sub-model and a compression sub-model. Correspondingly, sending the first model may refer to simultaneously or separately sending the estimation sub-model and the compression sub-model. Regarding The carrying manner of the estimation sub-model and the compression sub-model is the same as that of the previous example and will not be repeated here.
可选地,本示例中,所述第一模型可以包含估计子模型、信道生成子模型和压缩子模型,相应的,所述发送第一模型可以指的是,同时或分别发送估计子模型、信道生成子模型和压缩子模型,关于估计子模型、信道生成子模型和压缩子模型的携带方式与前述示例相同这里不做重复说明。Optionally, in this example, the first model may include an estimation submodel, a channel generation submodel, and a compression submodel. Correspondingly, sending the first model may refer to sending the estimation submodel, the The channel generation sub-model and the compression sub-model, the carrying manners of the estimation sub-model, the channel generation sub-model and the compression sub-model are the same as the previous examples and will not be repeated here.
所述终端设备接收网络设备发来的所述第一模型、第二模型和第三模型之后,所述终端设备可以对所述第一模型、第二模型和第三模型进行整体评估,具体的处理与前述示例三相似,不做重复说明。After the terminal device receives the first model, the second model and the third model sent by the network device, the terminal device may perform an overall evaluation on the first model, the second model and the third model, specifically The processing is similar to the foregoing example three, and repeated descriptions are not repeated.
最后结合图12对前述终端设备执行的信息处理方法以及网络设备执行的信息处理方法进行示例性说明,所述网络设备为基站,具体可以为:Finally, with reference to FIG. 12 , the information processing method performed by the aforementioned terminal device and the information processing method performed by the network device are exemplarily described. The network device is a base station, which may specifically be:
S1201:基站训练得到第一模型和第二模型。S1201: The base station trains to obtain the first model and the second model.
比如,基站为主体采用训练样本对第一预设模型和第二预设模型进行联合训练得到第一模型和第二模型;其中,第一模型包含估计子模型和压缩子模型;或者,第一模型包含估计子模型、信道生成子模型和压缩子模型。For example, the base station uses training samples as the main body to jointly train the first preset model and the second preset model to obtain the first model and the second model; wherein, the first model includes an estimation sub-model and a compression sub-model; or, the first The model contains an estimation submodel, a channel generation submodel, and a compression submodel.
或者,所述基站为主体采用训练样本对第一预设模型、第二预设模型和第三预设模型进行训练得到了第一模型、第二模型和第三模型;其中,第一模型包含估计子模型和压缩子模型;或者,第一模型包含估计子模型、信道生成子模型和压缩子模型。Alternatively, the base station uses training samples as the main body to train the first preset model, the second preset model and the third preset model to obtain the first model, the second model and the third model; wherein, the first model includes An estimation submodel and a compression submodel; alternatively, the first model comprises an estimation submodel, a channel generation submodel and a compression submodel.
S1202:所述基站向终端设备发送第一模型。相应的,所述终端设备可以接收所述基站发来的第一模型。S1202: The base station sends the first model to the terminal device. Correspondingly, the terminal device may receive the first model sent by the base station.
本步骤可以指的是:所述基站至少向终端设备传输所述第一模型;又或者可以是:所述基站向所述终端设备传输估计子模型和压缩子模型;再或者可以是:所述基站向所述终端设备传输估计子模型、信道生成子模型和压缩子模型。This step may refer to: the base station transmits at least the first model to the terminal device; or it may be: the base station transmits the estimated sub-model and the compressed sub-model to the terminal device; or it may be: the The base station transmits the estimation sub-model, the channel generation sub-model and the compression sub-model to the terminal device.
关于所述基站传输估计子模型和压缩子模型这两个子模型,还是传输估计子模型、信道生成子模型和压缩子模型这三个子模型,与所述基站训练时确定的所要训练得到的子模型相关。举例来说,若基站训练得到了估计子模型和压缩子模型这两个子模型,则所述基站可以向终端设备传输包含估计子模型和压缩子模型这两个子模型的第一模型,或者,所述基站向终端设备直接传输估计子模型和压缩子模型这两个子模型。Regarding the two sub-models of the base station transmission estimation sub-model and compression sub-model, or the three sub-models of transmission estimation sub-model, channel generation sub-model and compression sub-model, the sub-models to be obtained by training determined during training of the base station relevant. For example, if the base station obtains the two submodels of the estimated submodel and the compressed submodel through training, the base station may transmit to the terminal device the first model including the two submodels of the estimated submodel and the compressed submodel, or, the The base station directly transmits the two sub-models, the estimated sub-model and the compressed sub-model, to the terminal equipment.
应理解,本步骤还可以包括:所述基站可以向终端设备发送第二模型。相应的,所述终端设备可以接收所述基站发来的第二模型。It should be understood that this step may also include: the base station may send the second model to the terminal device. Correspondingly, the terminal device may receive the second model sent by the base station.
另外,若基站训练得到了第一模型、第二模型和第三模型的情况下,本步骤还可以包括:所述基站向所述终端设备传输第三模型。相应的,所述终端设备可以接收所述基站发来的第三模型。In addition, if the base station has obtained the first model, the second model and the third model through training, this step may further include: the base station transmitting the third model to the terminal device. Correspondingly, the terminal device may receive the third model sent by the base station.
S1203:所述基站发送第一信息;相应的,所述终端设备接收所述第一信息。S1203: The base station sends first information; correspondingly, the terminal device receives the first information.
本步骤中,所述第一信息可以为下行参考信号,具体可以是当前信道的下行参考信号,比如可以为SSB或CSI-RS,本示例不对其具体内容进行限定。In this step, the first information may be a downlink reference signal, specifically a downlink reference signal of the current channel, such as SSB or CSI-RS, and this example does not limit its specific content.
S1204:所述终端设备基于所述第一模型对所述第一信息进行处理,得到第二信息。S1204: The terminal device processes the first information based on the first model to obtain second information.
具体的,若所述终端设备接收到的第一模型中不包含信道生成子模型,则所述第二信息为经过估计子模型以及压缩子模型处理后所输出的压缩后的信道信息。该信道信息可以表示为矩阵,关于矩阵的具体说明,第一模型中估计子模型和压缩子模型的具体处理与前述实施例相同,不再赘述。Specifically, if the first model received by the terminal device does not include a channel generation sub-model, the second information is compressed channel information output after being processed by an estimation sub-model and a compression sub-model. The channel information can be expressed as a matrix. Regarding the specific description of the matrix, the specific processing of the estimation sub-model and the compression sub-model in the first model is the same as the foregoing embodiment, and will not be repeated here.
若所述终端设备接收到的第一模型中包含信道生成子模型,则所述第二信息为经过估计子模型、信道生成子模型以及压缩子模型处理后所输出的压缩后的信道信息的特征向量信息。该压缩后的信道信息的特征向量信息可以表示为矩阵,关于矩阵的具体说明,第一模型中估计子模型和压缩子模型的具体处理与前述实施例相同,不再赘述。If the first model received by the terminal device includes a channel generation sub-model, the second information is the characteristics of the compressed channel information output after being processed by the estimation sub-model, channel generation sub-model and compression sub-model vector information. The eigenvector information of the compressed channel information can be expressed as a matrix. Regarding the specific description of the matrix, the specific processing of the estimation sub-model and the compression sub-model in the first model is the same as that of the foregoing embodiment, and will not be repeated here.
S1205:所述终端设备发送所述第二信息;相应的,所述基站接收所述第二信息。S1205: The terminal device sends the second information; correspondingly, the base station receives the second information.
S1206:所述基站基于所述第二模型对所述第二信息进行处理得到信道信息。S1206: The base station processes the second information based on the second model to obtain channel information.
其中,若所述第一模型中不包含信道生成子模型,相应的,所述信道信息具体可以为用于表示信道信息的矩阵,关于表示信道信息的矩阵在前述实施例中已经详述,这里不再重复。若所述第一模型中包含信道生成子模型,相应的,所述信道信息具体可以为信道信息的特征向量信息,关于信道信息的特征向量信息在前述实施例已做了详细说明,不再赘述。Wherein, if the first model does not include a channel generation sub-model, correspondingly, the channel information may specifically be a matrix representing channel information. The matrix representing channel information has been described in detail in the foregoing embodiments, and here Do not repeat. If the first model includes a channel generation sub-model, correspondingly, the channel information may specifically be the eigenvector information of the channel information, and the eigenvector information of the channel information has been described in detail in the foregoing embodiments, and will not be repeated here. .
再结合图13对前述终端设备执行的信息处理方法的实施例以及网络设备执行的信息处理方法进行示例性说明,所述网络设备为基站,具体可以为:In conjunction with FIG. 13, an exemplary description will be given of an embodiment of the information processing method performed by the aforementioned terminal device and an information processing method performed by the network device. The network device is a base station, and may specifically be:
S1301:终端设备训练得到第一模型和第二模型;S1301: The terminal device trains to obtain the first model and the second model;
比如,终端设备为主体采用训练样本对第一预设模型和第二预设模型进行联合训练得到第一模型和第二模型;其中,第一模型包含估计子模型和压缩子模型;或者,第一模型包含估计子模型、信道生成子模型和压缩子模型。For example, the terminal device is the main body and uses training samples to jointly train the first preset model and the second preset model to obtain the first model and the second model; wherein, the first model includes an estimation sub-model and a compression sub-model; or, the second A model includes an estimation submodel, a channel generation submodel and a compression submodel.
或者,所述终端设备为主体采用训练样本对第一预设模型、第二预设模型和第三预设模型进行训练得到了第一模型、第二模型和第三模型;其中,第一模型包含估计子模型和压缩子模型;或者,第一模型包含估计子模型、信道生成子模型和压缩子模型。Alternatively, the terminal device uses training samples as the main body to train the first preset model, the second preset model and the third preset model to obtain the first model, the second model and the third model; wherein, the first model An estimation submodel and a compression submodel are included; alternatively, the first model includes an estimation submodel, a channel generation submodel, and a compression submodel.
S1302:所述终端设备向所述基站发送第二模型。相应的,所述基站可以接收所述终端设备发来的第二模型。S1302: The terminal device sends the second model to the base station. Correspondingly, the base station may receive the second model sent by the terminal device.
本步骤可以指的是:所述终端设备至少向基站传输所述第二模型。This step may refer to: the terminal device at least transmits the second model to a base station.
应理解,本步骤还可以包括:所述终端设备向所述基站发送第一模型。相应的,所述基站可以接收所述终端设备发来的第一模型。这里,关于第一信息为整体发送还是分别发送各个子模型,可以与前述实施例提供的处理方式相同,本示例中不重复说明。It should be understood that this step may also include: the terminal device sending the first model to the base station. Correspondingly, the base station may receive the first model sent by the terminal device. Here, regarding whether the first information is sent as a whole or each sub-model is sent separately, the processing method may be the same as that provided in the foregoing embodiment, and the description will not be repeated in this example.
另外,若基站训练得到了第一模型、第二模型和第三模型的情况下,本步骤还可以包括:所述终端设备向所述基站传输第三模型。相应的,所述基站可以接收所述终端设备发来的第三模型。In addition, if the base station has obtained the first model, the second model and the third model through training, this step may further include: the terminal device transmitting the third model to the base station. Correspondingly, the base station may receive the third model sent by the terminal device.
S1303:所述基站发送第一信息;相应的,所述终端设备接收所述第一信息。S1303: The base station sends first information; correspondingly, the terminal device receives the first information.
本步骤中,所述第一信息具体可以为下行参考信号,比如可以为SSB或CSI-RS,本示例不对其具体内容进行限定。In this step, the first information may specifically be a downlink reference signal, such as an SSB or a CSI-RS, and this example does not limit its specific content.
S1304:所述终端设备基于所述第一模型对所述第一信息进行处理,得到第二信息。S1304: The terminal device processes the first information based on the first model to obtain second information.
具体的,若所述第一模型中不包含信道生成子模型,则所述第二信息为经过估计子模型以及压缩子模型处理后所输出的压缩后的信道信息。该信道信息可以表示为矩阵,关于矩阵的具体说明,第一模型中估计子模型和压缩子模型的具体处理与前述实施例相同,不再赘述。Specifically, if the first model does not include a channel generation sub-model, the second information is compressed channel information output after being processed by an estimation sub-model and a compression sub-model. The channel information can be expressed as a matrix. Regarding the specific description of the matrix, the specific processing of the estimation sub-model and the compression sub-model in the first model is the same as the foregoing embodiment, and will not be repeated here.
若所述第一模型中包含信道生成子模型,则所述第二信息为经过估计子模型、信道生成子模型以及压缩子模型处理后所输出的压缩后的信道信息的特征向量信息。该压缩后的信道信息的特征向量信息可以表示为矩阵,关于矩阵的具体说明,第一模型中估计子模型和压缩子模型的具体处理与前述实施例相同,不再赘述。If the first model includes a channel generation sub-model, the second information is the eigenvector information of the compressed channel information output after being processed by the estimation sub-model, the channel generation sub-model and the compression sub-model. The eigenvector information of the compressed channel information can be expressed as a matrix. Regarding the specific description of the matrix, the specific processing of the estimation sub-model and the compression sub-model in the first model is the same as that of the foregoing embodiment, and will not be repeated here.
S1305:所述终端设备发送所述第二信息;相应的,所述基站接收所述第二信息。S1305: The terminal device sends the second information; correspondingly, the base station receives the second information.
S1306:所述基站基于所述第二模型对所述第二信息进行处理得到信道信息。S1306: The base station processes the second information based on the second model to obtain channel information.
其中,若所述第一模型中不包含信道生成子模型,相应的,所述信道信息具体可以为用于表示信道信息的矩阵,关于表示信道信息的矩阵在前述实施例中已经详述,这里不再重复。若所述第一模型中包含信道生成子模型,相应的,所 述信道信息具体可以为信道信息的特征向量信息,关于信道信息的特征向量信息在前述实施例已做了详细说明,不再赘述。Wherein, if the first model does not include a channel generation sub-model, correspondingly, the channel information may specifically be a matrix representing channel information. The matrix representing channel information has been described in detail in the foregoing embodiments, and here Do not repeat. If the first model includes a channel generation sub-model, correspondingly, the channel information may specifically be the eigenvector information of the channel information, and the eigenvector information of the channel information has been described in detail in the foregoing embodiments, and will not be repeated here. .
图13所给出的示例与前述图12的示例的区别还包括:由于一个基站可以与多个终端设备进行通信,因此,所述基站侧可能接收到了多个终端设备发来的第二模型。这种情况下,所述基站侧可以是针对每一个终端设备采用该终端设备发来的第二模型对该终端设备发送的第二信息进行处理以得到信道信息。又或者,所述基站侧可以从多个终端设备发来的多个第二模型中指定一个作为目标第二模型,并至少将该目标第二模型所对应的目标第一模型发送给其他终端设备,从而使得该基站连接或服务的全部终端设备使用相同的目标第一模型以及目标第二模型进行后续的处理,这样还可以减少基站侧查找不同终端设备的第二模型的时间损耗。The difference between the example shown in FIG. 13 and the aforementioned example in FIG. 12 also includes: since one base station can communicate with multiple terminal devices, the base station side may have received the second model from multiple terminal devices. In this case, the base station side may use the second model sent by the terminal device to process the second information sent by the terminal device for each terminal device to obtain channel information. Alternatively, the base station may designate one of the multiple second models sent from multiple terminal devices as the target second model, and at least send the target first model corresponding to the target second model to other terminal devices , so that all terminal devices connected or served by the base station use the same target first model and target second model for subsequent processing, which can also reduce the time consumption of searching for different second models of terminal devices on the base station side.
可见,通过采用上述方案,可以在终端设备接收到第一信息的情况下,经由第一模型对第一信息进行处理得到第二信息并发送,使得接收端能够通过使用第二模型对该第二信息进行处理以得到的信道信息,由于该第一模型与该第二模型为联合训练得到的。由于第二信息的处理、传输及解析过程是采用联合训练得到的第一模型和第二模型来实现的,因此可以兼顾整个信息处理、传输及解析中的性能要求,保证了网络整体的性能。进一步地,由于上述方案采用了联合训练得到的第一模型和第二模型,因此可以使得第一模型与第二模型之间的功能相互兼容,使得第一模型以及第二模型的性能均可以达到较优的状态,进而基于该第一模型和第二模型来对第二信息的处理、传输及解析过程进行整体处理时,可以保证整体处理的性能,从而保证了网络整体的性能。It can be seen that by adopting the above solution, when the terminal device receives the first information, it can process the first information through the first model to obtain the second information and send it, so that the receiving end can use the second model to obtain the second information. The channel information obtained by processing the information is obtained through joint training of the first model and the second model. Since the processing, transmission, and analysis of the second information are realized by using the first model and the second model obtained through joint training, the performance requirements in the entire information processing, transmission, and analysis can be taken into account, and the overall performance of the network is guaranteed. Furthermore, since the above solution uses the first model and the second model obtained through joint training, the functions between the first model and the second model can be made compatible with each other, so that the performance of the first model and the second model can reach In a better state, when the processing, transmission and analysis process of the second information is processed as a whole based on the first model and the second model, the performance of the whole processing can be guaranteed, thereby ensuring the performance of the whole network.
图14是根据本申请一实施例的模型生成方法1400的示意性流程图。该方法可选地可以应用于图1所示的系统,但并不仅限于此。该方法包括以下内容的至少部分内容。Fig. 14 is a schematic flowchart of a model generation method 1400 according to an embodiment of the present application. The method can optionally be applied to the system shown in Fig. 1, but is not limited thereto. The method includes at least some of the following.
S1410、采用训练样本对第一预设模型和第二预设模型进行联合训练,得到训练后的所述第一模型和所述第二模型;S1410. Using training samples to jointly train the first preset model and the second preset model, to obtain the trained first model and the second model;
其中,所述第一模型为训练后的所述第一预设模型,所述第二模型为训练后的所述第二预设模型;所述第一模型用于对第一信息进行处理得到第二信息;所述第二模型用于对所述第二信息进行处理得到信道信息。Wherein, the first model is the first preset model after training, and the second model is the second preset model after training; the first model is used to process the first information to obtain Second information; the second model is used to process the second information to obtain channel information.
本实施例所提供的模型生成方法可以应用于电子设备,该电子设备可以是网络设备也可以是终端设备;所述网络设备可以为服务器、接入网设备等等;所述终端设备可以为智能手机、平板电脑、笔记本电脑、台式机(或台式电脑)等等。也就是说,具备数据处理能力的任意一种电子设备均能够执行本实施例提供的模型生成方法。The model generation method provided in this embodiment can be applied to electronic equipment, and the electronic equipment can be a network equipment or a terminal equipment; the network equipment can be a server, an access network equipment, etc.; the terminal equipment can be a smart Phones, tablets, laptops, desktops (or desktop computers), and more. That is to say, any electronic device capable of data processing can execute the model generation method provided in this embodiment.
在所述第二种方式中,训练可以采用第一损失函数或第二损失函数。下面对采用上述两种损失函数进行训练分别进行说明:In the second manner, the training may use the first loss function or the second loss function. The following describes the training using the above two loss functions:
所述训练采用的损失函数为第一损失函数;所述第一损失函数为基于所述第二预设模型的输出信息与所述第一预设模型的压缩预设子模型的输入信息之间的差异程度构建的。The loss function used in the training is a first loss function; the first loss function is based on the difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model degree of difference is constructed.
所述第二预设模型的输出信息与所述第一预设模型的压缩预设子模型的输入信息之间的差异程度为基于距离确定的,或者为基于相似程度确定的。The degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model is determined based on a distance, or determined based on a degree of similarity.
基于距离确定所述第二预设模型的输出信息与所述第一预设模型的压缩预设子模型的输入信息之间的差异程度的具体计算方式可以采用均方误差(MSE,Mean Squared Error)或归一化均方误差(NMSE)等方式,本实施例不做穷举。The specific calculation method for determining the degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model based on the distance can use mean square error (MSE, Mean Squared Error ) or normalized mean square error (NMSE), etc., which are not exhaustive in this embodiment.
举例来说,所述第二预设模型的输出信息可以为矩阵,相应的,所述压缩预设子模型的输入信息也可以为矩阵,这里,将所述第二预设模型的输出的矩阵称为矩阵1,将所述压缩预设子模型的输入的矩阵称为矩阵2;基于距离确定所述第二预设模型的输出信息与所述第一预设模型的压缩预设子模型的输入信息之间的差异程度的方式为MSE方式,比如其计算可以包括:将矩阵1与矩阵2进行计算得到差值,将差值的平方作为所述差异程度。For example, the output information of the second preset model may be a matrix, and correspondingly, the input information of the compressed preset sub-model may also be a matrix, and here, the output matrix of the second preset model It is called matrix 1, and the matrix of the input of the compressed preset submodel is called matrix 2; the output information of the second preset model and the compressed preset submodel of the first preset model are determined based on the distance The way of the degree of difference between the input information is the MSE way, for example, its calculation may include: calculating the difference between matrix 1 and matrix 2, and taking the square of the difference as the difference degree.
基于相似程度确定所述第二预设模型的输出信息与所述第一预设模型的压缩预设子模型的输入信息之间的差异程度的具体计算方式可以采用余弦相似度或余弦相似度平方等方式,本实施例不做穷举。The specific calculation method for determining the degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model based on the degree of similarity may use cosine similarity or cosine similarity squared etc., which are not exhaustive in this embodiment.
举例来说,所述第二预设模型的输出信息可以为R组特征向量序列信息,相应的,所述压缩预设子模型的输入信息也可以为R组特征向量序列信息,这里,将所述第二预设模型的输出的R组特征向量序列信息称为特征向量序列1,将所述压缩预设子模型的输入的R组特征向量序列信息称为特征向量序列2。基于相似程度确定所述第二预设模型的输出信息与所述第一预设模型的压缩预设子模型的输入信息之间的差异程度的方式可以是余弦相似度,比如其计算可以包括:特征向量序列1以及特征向量序列2的余弦夹角来确定相似程度,将该相似程度作为所述差异程度。For example, the output information of the second preset model may be R sets of feature vector sequence information, and correspondingly, the input information of the compressed preset sub-model may also be R sets of feature vector sequence information. Here, the The R sets of feature vector sequence information output by the second preset model are called feature vector sequence 1, and the R sets of feature vector sequence information input by the compressed preset sub-model are called feature vector sequence 2. The method of determining the degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model based on the degree of similarity may be cosine similarity, for example, its calculation may include: The cosine angle between the eigenvector sequence 1 and the eigenvector sequence 2 is used to determine the degree of similarity, and the degree of similarity is used as the degree of difference.
采用上述第一损失函数进行训练的处理中,由于第一预设模型中包含的子模型的不同以及是否包含用于模拟无线信道环境的第三预设模型进行联合训练会使得联合训练的方式有所不同,因此下面分四种情况分别进行说明:In the process of using the above-mentioned first loss function for training, due to the difference in the sub-models contained in the first preset model and whether the third preset model for simulating the wireless channel environment is included for joint training, the way of joint training will be different. Therefore, the following four situations are described separately:
情况一,所述第一预设模型中包括估计预设子模型和压缩预设子模型。 Case 1, the first preset model includes an estimation preset sub-model and a compression preset sub-model.
参见图8a,其中示意出第一预设模型800,第二预设模型810,以及所述第一预设模型800中包含的估计预设子模型801和压缩预设子模型802。上述第一预设模型800,第二预设模型810,以及所述第一预设模型800中包含的估计预设子模型801和压缩预设子模型802之间的输入输出关系可以为:估计预设子模型801的输入信息为第一训练样本920;所述估计预设子模型801的输出信息作为所述压缩预设子模型802的输入信息;所述压缩预设子模型802的输出信息作为所述第二预设模型810的输入信息。Referring to FIG. 8 a , it illustrates a first preset model 800 , a second preset model 810 , and an estimated preset sub-model 801 and a compressed preset sub-model 802 included in the first preset model 800 . The above-mentioned first preset model 800, the second preset model 810, and the input-output relationship between the estimated preset sub-model 801 and the compressed preset sub-model 802 contained in the first preset model 800 can be: estimated The input information of the preset submodel 801 is the first training sample 920; the output information of the estimated preset submodel 801 is used as the input information of the compressed preset submodel 802; the output information of the compressed preset submodel 802 As the input information of the second preset model 810 .
所述采用训练样本对第一预设模型和第二预设模型进行联合训练,包括:The joint training of the first preset model and the second preset model by using training samples includes:
将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;inputting the first training sample into the estimated preset sub-model of the first preset model, and obtaining initial information output by the estimated preset sub-model;
将所述初始信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的信息;inputting the initial information into the compressed preset sub-model of the first preset model to obtain compressed information output by the compressed preset sub-model;
将所述压缩后的信息输入所述第二预设模型,得到所述第二预设模型输出的恢复信息;inputting the compressed information into the second preset model to obtain restoration information output by the second preset model;
基于所述恢复信息以及所述初始信息之间的差异程度,确定所述第一损失函数;determining the first loss function based on the degree of difference between the restoration information and the initial information;
根据所述第一损失函数进行反向传导更新所述第一预设模型和所述第二预设模型。performing reverse conduction according to the first loss function to update the first preset model and the second preset model.
其中,所述第一训练样本可以为参考信号样本。所述参考信号样本可以为历史采集得到的原始参考信号、或者处理过的参考信号。再具体的,所述参考信号样本可以为下行参考信号样本。应理解,本实施例并不限定所述第一训练样本一定为所述下行参考信号样本,还可以采用上行参考信号样本或其他参考信号样本,只是本实施例不做穷举。Wherein, the first training samples may be reference signal samples. The reference signal samples may be original reference signals or processed reference signals obtained through historical acquisition. More specifically, the reference signal samples may be downlink reference signal samples. It should be understood that this embodiment does not limit that the first training samples must be the downlink reference signal samples, and uplink reference signal samples or other reference signal samples may also be used, which are not exhaustive in this embodiment.
需要指出的是,除了所述第一训练样本之外,输入所述第一预设模型的估计预设子模型的信息还可以无线信道或者 场景相关的其他信息,比如可以包括以下至少之一:信道的信噪比、信干扰噪比、信道类型、带宽信息、时延信息等。关于在联合训练的过程中,输入以上信息的一种还是多种可以根据实际情况或实际场景来确定,这里不对其进行限定。It should be pointed out that, in addition to the first training samples, the information input into the estimated preset sub-model of the first preset model may also be wireless channel or other scene-related information, for example, may include at least one of the following: Channel signal-to-noise ratio, signal-to-interference-noise ratio, channel type, bandwidth information, delay information, etc. Whether one or more of the above information is input during the joint training process may be determined according to actual conditions or actual scenarios, and is not limited here.
所述第一预设模型的估计预设子模型的具体功能可以为:基于所述第一训练样本进行信道估计得到初始信息。其中,信道估计可以采用最小均方误差(MMSE)等算法。A specific function of the estimation preset sub-model of the first preset model may be: perform channel estimation based on the first training samples to obtain initial information. Wherein, the channel estimation may adopt algorithms such as minimum mean square error (MMSE).
上述初始信息可以为矩阵,该矩阵的维度这里不做限定,可以为二维或更多维度的矩阵。所述矩阵中的每一个位置上的数值用于表示对应多个维度的相应粒度下所对应的信道质量。其中,所述信道质量可以采用信号强度值来表征;所述信号强度值的单位可以是dBm,或所述信号强度值没有单位而是归一化之后所得到的数值。The aforementioned initial information may be a matrix, and the dimension of the matrix is not limited here, and may be a matrix of two or more dimensions. The value at each position in the matrix is used to represent the corresponding channel quality at the corresponding granularity corresponding to multiple dimensions. Wherein, the channel quality may be characterized by a signal strength value; the unit of the signal strength value may be dBm, or the signal strength value has no unit but a value obtained after normalization.
所述第一预设模型的压缩预设子模型对输入信息的数据量进行压缩。所述压缩子模型的输出信息与输入信息之间的压缩率可以为训练时确定的,比如,压缩率可以为千分之五、千分之二、百分之十等等,这里不进行穷举。所述压缩预设子模型得到的压缩后的信息包含的数据量小于其输入的初始信息的数据量。上述压缩后的信息与初始信息的形式为相同的,比如所述初始信息为矩阵,相应的所述压缩后的信息也为矩阵,所述初始信息与所述压缩后的信息的矩阵维度是相同的但数据量是不同的。The compressed preset sub-model of the first preset model compresses the data volume of the input information. The compression rate between the output information of the compressed sub-model and the input information can be determined during training, for example, the compression rate can be 5/1000, 2/1000, 10%, etc. lift. The compressed information obtained by compressing the preset sub-model contains less data than the input initial information. The form of the above-mentioned compressed information is the same as that of the initial information, for example, the initial information is a matrix, and the corresponding compressed information is also a matrix, and the matrix dimensions of the initial information and the compressed information are the same but the amount of data is different.
所述第二预设模型的功能可以是对其输入信息进行解压缩。本情况中,所述第二预设模型的输入信息为所述压缩后的信息,所述第二预设模型的输出为所述恢复信息。在理想状态下,第二预设模型的解压缩率应该使得其得到的恢复信息与初始信息包含相同的数据内容。The function of the second preset model may be to decompress its input information. In this case, the input information of the second preset model is the compressed information, and the output of the second preset model is the restored information. Ideally, the decompression rate of the second preset model should make the obtained restored information contain the same data content as the original information.
所述基于第一损失函数进行反向传导更新所述第一预设模型和所述第二预设模型具体可以指的是:基于第一损失函数所确定的差异程度来进行反向传导更新所述估计预设子模型的模型参数、所述压缩预设子模型的模型参数和所述第二预设模型的模型参数。The performing reverse conduction update of the first preset model and the second preset model based on the first loss function may specifically refer to performing reverse conduction update based on the degree of difference determined by the first loss function. The model parameters of the estimated preset sub-model, the model parameters of the compressed preset sub-model, and the model parameters of the second preset model.
针对上述训练还需要指出,关于上述训练收敛的方式可以包括以下至少之一:判断迭代训练的次数是否达到预设次数,判断差异程度是否小于预设门限值。其中,所述预设次数、所述预设门限值可以根据实际情况设置,不对其进行穷举。也就是说,基于上述方式确定训练完成时,可以将训练完成后的第一预设模型作为第一模型,将训练完成后的第二预设模型作为第二模型。Regarding the above training, it should also be pointed out that the manner of the above training convergence may include at least one of the following: judging whether the number of iterative training reaches a preset number, and judging whether the degree of difference is smaller than a preset threshold. Wherein, the preset number of times and the preset threshold value can be set according to actual conditions, and are not exhaustive. That is to say, when it is determined that the training is completed based on the above manner, the first preset model after the training can be used as the first model, and the second preset model after the training can be used as the second model.
情况二,所述第一预设模型中包括估计预设子模型、预设信道生成子模型和压缩预设子模型。Case 2, the first preset model includes an estimation preset submodel, a preset channel generation submodel, and a compression preset submodel.
参见图8b,其中示意出第一预设模型800,第二预设模型810,以及所述第一预设模型800中包含的估计预设子模型801、压缩预设子模型802和信道生成预设子模型803。上述第一预设模型800,第二预设模型810,以及所述第一预设模型800中包含的估计预设子模型801、压缩预设子模型802和信道生成预设子模型803之间的输入输出关系可以为:估计预设子模型801的输入信息为第一训练样本920;所述估计预设子模型801的输出信息作为所述信道生成预设子模型803的输入信息;所述信道生成预设子模型803的输出信息作为压缩预设子模型802的输入信息;所述压缩预设子模型802的输出信息作为所述第二预设模型810的输入信息。Referring to Fig. 8b, it shows a first preset model 800, a second preset model 810, and the estimation preset sub-model 801, compression preset sub-model 802 and channel generation preset included in the first preset model 800. Set sub-model 803 . Between the above-mentioned first preset model 800, second preset model 810, and the estimation preset submodel 801, compression preset submodel 802 and channel generation preset submodel 803 contained in the first preset model 800 The input-output relationship of can be as follows: the input information of the estimated preset sub-model 801 is the first training sample 920; the output information of the estimated preset sub-model 801 is used as the input information of the channel generation preset sub-model 803; The output information of the channel generation preset submodel 803 is used as the input information of the compression preset submodel 802 ; the output information of the compression preset submodel 802 is used as the input information of the second preset model 810 .
采用训练样本对第一预设模型和第二预设模型进行联合训练,可以包括:Using training samples to jointly train the first preset model and the second preset model may include:
将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;inputting the first training sample into the estimated preset sub-model of the first preset model, and obtaining initial information output by the estimated preset sub-model;
将所述初始信息输入所述第一预设模型的信道生成预设子模型,得到所述信道生成预设子模型输出的初始信息的特征向量信息;inputting the initial information into the channel generation preset sub-model of the first preset model, and obtaining eigenvector information of the initial information output by the channel generation preset sub-model;
将所述初始信息的特征向量信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的特征向量信息;Inputting the eigenvector information of the initial information into the compressed preset sub-model of the first preset model to obtain the compressed eigenvector information output by the compressed preset sub-model;
将所述压缩后的特征向量信息输入所述第二预设模型,得到所述第二预设模型输出的恢复的特征向量信息;inputting the compressed feature vector information into the second preset model to obtain restored feature vector information output by the second preset model;
基于所述恢复的特征向量信息以及所述初始信息的特征向量信息之间的差异程度,确定第一损失函数;determining a first loss function based on the degree of difference between the restored eigenvector information and the eigenvector information of the initial information;
根据所述第一损失函数进行反向传导更新所述第一预设模型和所述第二预设模型。performing reverse conduction according to the first loss function to update the first preset model and the second preset model.
其中,关于第一训练样本的具体说明与前述情况一相同,因此不做重复说明。需要指出的是,除了所述第一训练样本之外,输入所述第一预设模型的估计预设子模型的信息还可以无线信道或者场景相关的其他信息,比如可以包括以下至少之一:信道的信噪比、信干扰噪比、信道类型、带宽信息、时延信息等。关于在联合训练的过程中,输入以上信息的一种还是多种可以根据实际情况或实际场景来确定,这里不对其进行限定。Wherein, the specific description about the first training sample is the same as the above-mentioned case 1, so repeated description will not be given. It should be pointed out that, in addition to the first training samples, the information input into the estimated preset sub-model of the first preset model may also be wireless channel or other scene-related information, for example, may include at least one of the following: Channel signal-to-noise ratio, signal-to-interference-noise ratio, channel type, bandwidth information, delay information, etc. Whether one or more of the above information is input during the joint training process may be determined according to actual conditions or actual scenarios, and is not limited here.
所述第一预设模型的估计预设子模型的具体功能可以为:基于第一训练样本进行信道估计得到初始信息。其中,信道估计可以采用最小均方误差(MMSE)等算法。上述估计预设子模型输出的初始信息可以为矩阵,该矩阵的维度这里不做限定,可以为二维或更多维度的矩阵。所述矩阵中的每一个位置上的数值用于表示对应多个维度的相应粒度下所对应的信道质量。其中,所述信道质量可以采用信号强度值来表征;所述信号强度值的单位可以是dBm,或所述信号强度值没有单位而是归一化之后所得到的数值。The specific function of the estimation preset sub-model of the first preset model may be: perform channel estimation based on the first training samples to obtain initial information. Wherein, the channel estimation may adopt algorithms such as minimum mean square error (MMSE). The initial information output by the estimated preset sub-model above may be a matrix, and the dimension of the matrix is not limited here, and may be a two-dimensional or more dimensional matrix. The value at each position in the matrix is used to represent the corresponding channel quality at the corresponding granularity corresponding to multiple dimensions. Wherein, the channel quality may be characterized by a signal strength value; the unit of the signal strength value may be dBm, or the signal strength value has no unit but a value obtained after normalization.
所述信道生成预设子模型的功能可以是对所述初始信息进行特征分解,得到所述初始信息的特征向量信息。其中,所述初始信息的特征向量信息可以包括R组特征向量序列。对所述初始信息进行特征分解的方式可以采用奇异值分解(SVD,Singular Value Decomposition)的方式。A function of the channel generation preset submodel may be to perform eigendecomposition on the initial information to obtain eigenvector information of the initial information. Wherein, the eigenvector information of the initial information may include R groups of eigenvector sequences. The method of performing eigendecomposition on the initial information may adopt a method of Singular Value Decomposition (SVD, Singular Value Decomposition).
所述第一预设模型的压缩预设子模型对输入信息的数据量进行压缩。所述压缩子模型的输出信息与输入信息之间的压缩率可以为训练时确定的,比如,压缩率可以为千分之五、千分之二、百分之十等等,这里不进行穷举。在上述处理中,所述压缩预设子模型得到的压缩后的特征向量信息包含的数据量小于其输入的初始信息的特征向量信息的数据量。上述压缩后的特征向量信息与初始信息的特征向量信息的形式为相同的,比如初始信息的特征向量信息为R组特征向量序列,所述压缩后的特征向量信息也为R组特征向量序列但两者包含的数据量是不同的。The compressed preset sub-model of the first preset model compresses the data volume of the input information. The compression rate between the output information of the compressed sub-model and the input information can be determined during training, for example, the compression rate can be 5/1000, 2/1000, 10%, etc. lift. In the above processing, the compressed eigenvector information obtained by compressing the preset sub-model contains less data than the eigenvector information of the input initial information. The above compressed feature vector information is in the same form as the feature vector information of the initial information. For example, the feature vector information of the initial information is a sequence of R groups of feature vectors, and the compressed feature vector information is also a sequence of feature vectors of groups R but The amount of data contained in the two is different.
所述第二预设模型的功能可以是对其输入信息进行解压缩。在上述处理中,所述第二预设模型的输入信息为压缩后的特征向量信息,第二预设模型的输出为恢复的特征向量信息。第二预设模型的解压缩率应该使得其得到的恢复的特征向量信息与初始信息的特征向量信息包含相同或基本相同的数据。The function of the second preset model may be to decompress its input information. In the above processing, the input information of the second preset model is compressed feature vector information, and the output of the second preset model is restored feature vector information. The decompression rate of the second preset model should be such that the obtained restored feature vector information contains the same or substantially the same data as the feature vector information of the initial information.
所述基于第一损失函数所确定的差异程度来进行反向传导更新所述第一预设模型和所述第二预设模型具体可以指的是:基于第一损失函数所确定的差异程度来进行反向传导更新所述估计预设子模型的模型参数、所述信道生成预设子模型的模型参数、所述压缩预设子模型的模型参数和所述第二预设模型的模型参数。The performing reverse conduction to update the first preset model and the second preset model based on the degree of difference determined by the first loss function may specifically refer to: performing a reverse conduction based on the degree of difference determined by the first loss function performing reverse conduction to update model parameters of the estimated preset submodel, model parameters of the channel generation preset submodel, model parameters of the compression preset submodel, and model parameters of the second preset model.
关于上述训练收敛的确定方式与前述情况一相同,不做重复说明。The method for determining the convergence of the above training is the same as that of the above-mentioned case 1, and repeated explanations are not repeated.
情况三,采用训练样本对第一预设模型和第二预设模型进行联合训练,得到训练后的所述第一模型和所述第二模型,包括:In the third case, the training sample is used to jointly train the first preset model and the second preset model to obtain the trained first model and the second model, including:
采用训练样本对第一预设模型、第二预设模型和第三预设模型进行联合训练,得到训练后的所述第一模型、所述第二模型和第三模型;其中,所述第三模型为训练后的第三预设模型。Using training samples to jointly train the first preset model, the second preset model and the third preset model to obtain the trained first model, the second model and the third model; wherein, the first preset model The three models are the third preset models after training.
参见图8c,其中示意出第一预设模型800,第二预设模型810,第三预设模型830,以及所述第一预设模型800中包含的估计预设子模型801、压缩预设子模型802。上述第一预设模型800,第二预设模型810,以及所述第一预设模型800中包含的估计预设子模型801和压缩预设子模型802之间的输入输出关系可以为:估计预设子模型801的输入信息为第一训练样本920;所述估计预设子模型801的输出信息作为压缩预设子模型802的输入信息;所述压缩预设子模型802的输出信息作为所述第三预设模型830的输入信息;所述第三预设模型的输出信息作为所述第二预设模型810的输出信息。Referring to FIG. 8c, it shows a first preset model 800, a second preset model 810, a third preset model 830, and the estimated preset sub-model 801 contained in the first preset model 800, the compression preset Submodel 802. The above-mentioned first preset model 800, the second preset model 810, and the input-output relationship between the estimated preset sub-model 801 and the compressed preset sub-model 802 contained in the first preset model 800 can be: estimated The input information of the preset submodel 801 is the first training sample 920; the output information of the estimated preset submodel 801 is used as the input information of the compressed preset submodel 802; the output information of the compressed preset submodel 802 is used as the The input information of the third preset model 830; the output information of the third preset model is used as the output information of the second preset model 810.
采用训练样本对第一预设模型、第二预设模型和第三预设模型进行联合训练,包括:Using training samples to jointly train the first preset model, the second preset model and the third preset model, including:
将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;inputting the first training sample into the estimated preset sub-model of the first preset model, and obtaining initial information output by the estimated preset sub-model;
将所述初始信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的信息;inputting the initial information into the compressed preset sub-model of the first preset model to obtain compressed information output by the compressed preset sub-model;
将所述压缩后的信息输入第三预设模型,得到所述第三预设模型输出的变换后的信息;inputting the compressed information into a third preset model to obtain transformed information output by the third preset model;
将所述变换后的信息输入所述第二预设模型,得到所述第二预设模型输出的恢复信息;inputting the transformed information into the second preset model to obtain restoration information output by the second preset model;
基于所述恢复信息以及所述初始信息之间的差异程度,确定第一损失函数;determining a first loss function based on the degree of difference between the restoration information and the initial information;
根据所述第一损失函数进行反向传导更新所述第一预设模型、第二预设模型和第三预设模型。performing reverse conduction according to the first loss function to update the first preset model, the second preset model and the third preset model.
关于第一训练样本的具体说明与前述情况一或情况二相同,因此不做重复说明。还需要指出的是,除了第一训练样本之外,输入所述第一预设模型的估计预设子模型的信息还可以无线信道或者场景相关的其他信息,比如可以包括以下至少之一:信道的信噪比、信干扰噪比、信道类型、带宽信息、时延信息等。关于在训练的过程中,输入以上信息的一种还是多种可以根据实际情况或实际场景相关,这里不对其进行限定。The specific description about the first training sample is the same as the above-mentioned case 1 or case 2, so no repeated description is given. It should also be pointed out that, in addition to the first training samples, the information input into the estimated preset sub-model of the first preset model can also be wireless channel or other information related to the scene, for example, it can include at least one of the following: channel Signal-to-noise ratio, signal-to-interference-noise ratio, channel type, bandwidth information, delay information, etc. In the process of training, whether one or more of the above information is input may be relevant according to the actual situation or the actual scene, and it is not limited here.
所述第一预设模型的估计预设子模型以及所述第一预设模型的压缩预设子模型的具体功能与前述情况一相同,因此不做重复说明。The specific functions of the estimation preset sub-model of the first preset model and the compression preset sub-model of the first preset model are the same as those in the first case, so the description will not be repeated.
在情况三中相对于第一种情况增加了第三预设模型,关于所述第三预设模型的功能为模拟信道环境,具体处理可以为对输入信息进行数据变换得到数据变换后的信息作为输出信息。其中,所述数据变换的具体处理方式可以包括卷积处理或者等效于卷积的数据处理;其中,所述等效于卷积的数据处理可以为多次傅里叶变换处理,比如,可以为将所述第三预设模型的输入信息通过傅里叶变换转换到频域后相乘再通过傅里叶反变换转换到时域,以此来等效时域卷积的处理。In case three, a third preset model is added relative to the first case. The function of the third preset model is to simulate the channel environment. The specific processing can be to perform data conversion on the input information to obtain the information after data conversion as Output information. Wherein, the specific processing method of the data transformation may include convolution processing or data processing equivalent to convolution; wherein, the data processing equivalent to convolution may be multiple Fourier transform processing, for example, may In order to transform the input information of the third preset model into the frequency domain through Fourier transform, multiply and then transform into the time domain through inverse Fourier transform, the process of convolution in the time domain is equivalent.
所述第二预设模型的功能可以是对其输入信息进行解压缩。在第三种情况的处理中,所述第二预设模型的输入信息为变换后的信息,第二预设模型的输出为恢复信息。第二预设模型的解压缩率应该使得其得到的恢复信息与初始信息包含相同的数据。The function of the second preset model may be to decompress its input information. In the processing of the third case, the input information of the second preset model is transformed information, and the output of the second preset model is restored information. The decompression rate of the second preset model should make the obtained restored information contain the same data as the original information.
所述基于第一损失函数进行反向传导更新更新所述第一预设模型、所述第二预设模型和所述第三预设模型具体可以指的是:基于第一损失函数进行反向传导更新所述估计预设子模型的模型参数、所述压缩预设子模型的模型参数、所述第二预设模型的模型参数和所述第三预设模型的模型参数。The updating of the first preset model, the second preset model, and the third preset model based on the first loss function may specifically refer to: performing reverse conduction based on the first loss function Conductively updating model parameters of the estimated preset sub-model, model parameters of the compressed preset sub-model, model parameters of the second preset model, and model parameters of the third preset model.
关于上述训练收敛的方式与前述情况一或情况二相同,不做赘述。The manner of the above-mentioned training convergence is the same as that of the foregoing case 1 or case 2, and will not be repeated here.
情况四,与上述情况三不同在于所述第一预设模型中包括估计预设子模型、预设信道生成子模型和压缩预设子模型。Case 4 is different from the above case 3 in that the first preset model includes an estimation preset sub-model, a preset channel generation sub-model, and a compression preset sub-model.
参见图8d,其中示意出第一预设模型800,第二预设模型810,第三预设模型830,以及所述第一预设模型800中包含的估计预设子模型801、压缩预设子模型802和信道生成预设子模型803。上述第一预设模型800,第二预设模型810,第三预设模型830,以及所述第一预设模型800中包含的估计预设子模型801、压缩预设子模型802和信道生成预设子模型803之间的输入输出关系可以为:估计预设子模型801的输入信息为第一训练样本920;所述估计预设子模型801的输出信息作为所述信道生成预设子模型803的输入信息;所述信道生成预设子模型803的输出信息作为压缩预设子模型802的输入信息;所述压缩预设子模型802的输出信息作为所述第三预设模型830的输入信息;所述第三预设模型830的输出信息作为所述第二预设模型810的输入信息。Referring to Fig. 8d, it shows a first preset model 800, a second preset model 810, a third preset model 830, and the estimated preset sub-model 801 contained in the first preset model 800, the compression preset Submodel 802 and channel generation preset submodel 803 . The above-mentioned first preset model 800, second preset model 810, third preset model 830, and the estimation preset sub-model 801, compression preset sub-model 802 and channel generation included in the first preset model 800 The input-output relationship between the preset sub-models 803 may be: the input information of the estimated preset sub-model 801 is the first training sample 920; the output information of the estimated preset sub-model 801 is used as the channel generation preset sub-model The input information of 803; the output information of the channel generation preset submodel 803 is used as the input information of the compression preset submodel 802; the output information of the compression preset submodel 802 is used as the input of the third preset model 830 information; the output information of the third preset model 830 is used as the input information of the second preset model 810 .
采用训练样本对第一预设模型、第二预设模型和第三预设模型进行联合训练,包括:Using training samples to jointly train the first preset model, the second preset model and the third preset model, including:
将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;inputting the first training sample into the estimated preset sub-model of the first preset model, and obtaining initial information output by the estimated preset sub-model;
将所述初始信息输入所述第一预设模型的信道生成预设子模型,得到所述信道生成预设子模型输出的初始信息的特征向量信息;inputting the initial information into the channel generation preset sub-model of the first preset model, and obtaining eigenvector information of the initial information output by the channel generation preset sub-model;
将所述初始信息的特征向量信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的特征向量信息;Inputting the eigenvector information of the initial information into the compressed preset sub-model of the first preset model to obtain the compressed eigenvector information output by the compressed preset sub-model;
将所述压缩后的特征向量信息输入第三预设模型,得到所述第三预设模型输出的变换后的特征向量信息;Inputting the compressed feature vector information into a third preset model to obtain transformed feature vector information output by the third preset model;
将所述变换后的特征向量信息输入所述第二预设模型,得到所述第二预设模型输出的恢复的特征向量信息;inputting the transformed eigenvector information into the second preset model to obtain restored eigenvector information output by the second preset model;
基于所述恢复的特征向量信息以及所述初始信息的特征向量信息之间的差异程度,确定第一损失函数;determining a first loss function based on the degree of difference between the restored eigenvector information and the eigenvector information of the initial information;
根据所述第一损失函数进行反向传导更新所述第一预设模型、第二预设模型和第三预设模型。performing reverse conduction according to the first loss function to update the first preset model, the second preset model and the third preset model.
关于第一训练样本的具体说明与前述情况一、情况二、情况三中任意之一相同,因此不做重复说明。The specific description about the first training sample is the same as any one of the foregoing case 1, case 2, and case 3, so no repeated description is given.
所述第一预设模型的估计预设子模型的具体功能与前述情况一、情况二、情况三中任意之一相同。The specific function of the estimated preset sub-model of the first preset model is the same as any one of the foregoing case 1, case 2, and case 3.
所述信道生成预设子模型的功能可以是对所述初始信息进行特征分解,得到所述初始信息的特征向量信息。其中,所述初始信息的特征向量信息可以包括R组特征向量序列。对所述初始信息进行特征分解的方式可以采用奇异值分解(SVD,Singular Value Decomposition)的方式。A function of the channel generation preset submodel may be to perform eigendecomposition on the initial information to obtain eigenvector information of the initial information. Wherein, the eigenvector information of the initial information may include R groups of eigenvector sequences. The method of performing eigendecomposition on the initial information may adopt a method of Singular Value Decomposition (SVD, Singular Value Decomposition).
所述第一预设模型的压缩预设子模型对输入信息的数据量进行压缩。所述压缩子模型的输出信息与输入信息之间的压缩率可以为训练时确定的,比如,压缩率可以为千分之五、千分之二、百分之十等等,这里不进行穷举。The compressed preset sub-model of the first preset model compresses the data volume of the input information. The compression rate between the output information of the compressed sub-model and the input information can be determined during training, for example, the compression rate can be 5/1000, 2/1000, 10%, etc. lift.
所述第三预设模型的功能为模拟无线信道环境,具体处理可以为对输入信息进行数据变换得到数据变换后的信息作 为输出信息。其中,所述数据变换的具体处理方式可以包括卷积处理或者等效于卷积的数据处理;其中,所述等效于卷积的数据处理可以为多次傅里叶变换处理,比如,可以为将所述第三预设模型的输入信息通过傅里叶变换转换到频域后相乘再通过傅里叶反变换转换到时域,以此来等效时域卷积的处理。The function of the third preset model is to simulate the wireless channel environment, and the specific processing may be to perform data transformation on the input information to obtain the information after data transformation as output information. Wherein, the specific processing method of the data transformation may include convolution processing or data processing equivalent to convolution; wherein, the data processing equivalent to convolution may be multiple Fourier transform processing, for example, may In order to transform the input information of the third preset model into the frequency domain through Fourier transform, multiply and then transform into the time domain through inverse Fourier transform, the process of convolution in the time domain is equivalent.
所述第二预设模型的功能可以是对其输入信息进行解压缩。所述第二预设模型的输入信息为变换后的特征向量信息,第二预设模型的输出为恢复的特征向量信息。第二预设模型的解压缩率应该使得其得到的恢复的特征向量信息与初始信息的特征向量信息包含接近或相同的数据。The function of the second preset model may be to decompress its input information. The input information of the second preset model is transformed feature vector information, and the output of the second preset model is restored feature vector information. The decompression rate of the second preset model should make the obtained restored feature vector information and the feature vector information of the initial information contain close to or the same data.
所述基于第一损失函数所确定的差异程度来进行反向传导更新更新所述第一预设模型、所述第二预设模型和所述第三预设模型具体可以指的是:基于第一损失函数所确定的差异程度来进行反向传导更新所述估计预设子模型的模型参数、所述信道生成预设子模型的模型参数、所述压缩预设子模型的模型参数、所述第二预设模型的模型参数和所述第三预设模型的模型参数。The performing reverse conduction update based on the degree of difference determined by the first loss function to update the first preset model, the second preset model, and the third preset model may specifically refer to: based on the first The degree of difference determined by a loss function is used to perform reverse conduction to update the model parameters of the estimated preset sub-model, the model parameters of the channel generation preset sub-model, the model parameters of the compressed preset sub-model, the The model parameters of the second preset model and the model parameters of the third preset model.
关于上述训练收敛的方式与前述实施例相同,不做重复说明。The manner of the above-mentioned training convergence is the same as that of the foregoing embodiment, and no repeated description is given.
以上针对联合训练采用第一损失函数的场景进行了说明,本实施例中还可以提供采用第二损失函数进行联合训练的场景,具体如下:The scenario where the first loss function is used for joint training is described above. In this embodiment, the scenario where the second loss function is used for joint training can also be provided, as follows:
所述训练采用的损失函数为第二损失函数;所述第二损失函数为基于所述第二预设模型的输出信息与所述第一预设模型的压缩预设子模型的输入信息之间的第一差异程度以及所述第一预设模型的估计预设子模型的输出信息与第二训练样本之间的第二差异程度构建的;其中,所述第二训练样本与输入所述估计预设子模型的第一训练样本相对应。The loss function used in the training is a second loss function; the second loss function is based on the difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model The first difference degree of the first preset model and the second difference degree between the output information of the estimated preset sub-model of the first preset model and the second training sample; wherein, the second training sample and the input of the estimated Corresponds to the first training sample of the preset sub-model.
所述第一差异程度为基于距离确定的,或者为基于相似程度确定的;和/或,所述第二差异程度为基于距离确定的,或者为基于相似程度确定的。The first degree of difference is determined based on a distance, or is determined based on a degree of similarity; and/or, the second degree of difference is determined based on a distance, or is determined based on a degree of similarity.
基于距离确定所述第二预设模型的输出信息与所述第一预设模型的压缩预设子模型的输入信息之间的第一差异程度的具体计算方式可以采用均方误差(MSE,Mean Squared Error)或归一化均方误差(NMSE)等方式,本实施例不做穷举。The specific calculation method for determining the first degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model based on the distance can use a mean square error (MSE, Mean Squared Error) or normalized mean square error (NMSE), etc., this embodiment is not exhaustive.
举例来说,所述第二预设模型的输出信息可以为矩阵,相应的,所述压缩预设子模型的输入信息也可以为矩阵,这里,将所述第二预设模型的输出的矩阵称为矩阵3,将所述压缩预设子模型的输入的矩阵称为矩阵4;基于距离确定所述第二预设模型的输出信息与所述第一预设模型的压缩预设子模型的输入信息之间的差异程度的方式MSE方式,比如:将矩阵3与矩阵4进行计算得到差值,将差值的平方作为所述差异程度。For example, the output information of the second preset model may be a matrix, and correspondingly, the input information of the compressed preset sub-model may also be a matrix, and here, the output matrix of the second preset model It is called matrix 3, and the matrix of the input of the compressed preset submodel is called matrix 4; the output information of the second preset model and the compressed preset submodel of the first preset model are determined based on the distance The way of the degree of difference between the input information is the MSE way, for example: calculate the difference between matrix 3 and matrix 4, and use the square of the difference as the degree of difference.
基于相似程度确定所述第二预设模型的输出信息与所述第一预设模型的压缩预设子模型的输入信息之间的第一差异程度的具体计算方式可以采用余弦相似度或余弦相似度平方等方式,本实施例不做穷举。The specific calculation method for determining the first degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model based on the degree of similarity may use cosine similarity or cosine similarity Degree square and other methods are not exhaustive in this embodiment.
举例来说,所述第二预设模型的输出信息可以为R组特征向量序列信息,相应的,所述压缩预设子模型的输入信息也可以为R组特征向量序列信息,这里,将所述第二预设模型的输出的R组特征向量序列信息称为特征向量序列3,将所述压缩预设子模型的输入的R组特征向量序列信息称为特征向量序列4。基于相似程度确定所述第二预设模型的输出信息与所述第一预设模型的压缩预设子模型的输入信息之间的差异程度的方式可以是余弦相似度,比如:特征向量序列3以及特征向量序列4的余弦夹角来确定相似程度,将该相似程度作为所述差异程度。For example, the output information of the second preset model may be R sets of feature vector sequence information, and correspondingly, the input information of the compressed preset sub-model may also be R sets of feature vector sequence information. Here, the The R group of feature vector sequence information output by the second preset model is called feature vector sequence 3, and the R group of feature vector sequence information input by the compressed preset sub-model is called feature vector sequence 4. The method of determining the degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model based on the degree of similarity may be cosine similarity, for example: feature vector sequence 3 and the cosine angle of the eigenvector sequence 4 to determine the degree of similarity, and use the degree of similarity as the degree of difference.
基于距离确定所述第一预设模型的估计预设子模型的输出信息与第二训练样本之间的第二差异程度的具体计算方式可以采用均方误差(MSE,Mean Squared Error)或归一化均方误差(NMSE)等方式,本实施例不做穷举。The specific calculation method for determining the second degree of difference between the output information of the estimated preset sub-model of the first preset model based on the distance and the second training sample can use mean square error (MSE, Mean Squared Error) or normalization The methods such as normalized mean square error (NMSE) are not exhaustive in this embodiment.
举例来说,所述估计预设子模型的输出信息可以为矩阵,相应的,所述第二训练样本也可以为矩阵,这里,将所述估计预设子模型的输出的矩阵称为矩阵5,将所述压缩预设子模型的输入的矩阵称为矩阵6;基于距离确定所述第一预设模型的估计预设子模型的输出信息与第二训练样本之间的第二差异程度的方式MSE方式,比如:将矩阵5与矩阵6进行计算得到差值,将差值的平方作为所述差异程度。For example, the output information of the estimated preset sub-model may be a matrix, and correspondingly, the second training sample may also be a matrix. Here, the output matrix of the estimated preset sub-model is called matrix 5 , the matrix of the input of the compressed preset sub-model is called matrix 6; the second degree of difference between the output information of the estimated preset sub-model of the first preset model and the second training sample is determined based on the distance In the MSE mode, for example: calculate the difference between matrix 5 and matrix 6, and use the square of the difference as the degree of difference.
基于相似程度所述第一预设模型的估计预设子模型的输出信息与第二训练样本之间的第二差异程度的具体计算方式的具体计算方式可以采用余弦相似度或余弦相似度平方等方式,本实施例不做穷举。Based on the specific calculation method of the second degree of difference between the output information of the estimated preset sub-model of the first preset model and the second training sample based on the degree of similarity, the specific calculation method can use cosine similarity or cosine similarity squared, etc. The methods are not exhaustive in this embodiment.
上述第一差异程度以及第二差异程度联合构建所述第二损失函数时,其联的方式可以是对第一差异程度以及第二差异程度等权重相加,比如两者各占50%;或者,其联合的方式可以是第一差异程度以及第二差异程度不等权重相加,比如可以对第一差异程度赋予更大权重,也就是对上述第二预设模型与压缩预设子模型之间的压缩恢复前后的差异情况赋予更大权重,或者可以对上述第二差异程度赋予更大权重,也就是对上述估计预设子模型的输出信息的准确度赋予更大权重;或者,其联合的方式可以是第一差异程度以及第二差异程度相乘的形式;或者其联合的方式可以是第一差异程度以及第二差异程度可以是通过交叉熵计算的形式,比如p1*log(第一差异程度)+p2*log(第二差异程度),其中p1和p2均可以根据实际情况设置,这里不对其进行限定。When the first degree of difference and the second degree of difference are combined to construct the second loss function, the method of connection may be to add the weights of the first degree of difference and the second degree of difference, for example, the two each account for 50%; or , the joint method can be the addition of unequal weights between the first difference degree and the second difference degree. The difference before and after the compression and recovery between the two can be assigned a greater weight, or the above-mentioned second degree of difference can be assigned a larger weight, that is, the accuracy of the output information of the above-mentioned estimated preset sub-model is assigned a larger weight; or, its combination The method can be in the form of multiplying the first degree of difference and the second degree of difference; or the joint method can be that the first degree of difference and the second degree of difference can be calculated by cross entropy, such as p1*log(first degree of difference)+p2*log (the second degree of difference), where both p1 and p2 can be set according to actual conditions, and are not limited here.
采用上述第二损失函数进行训练的处理中,由于第一预设模型中包含的子模型的不同以及是否包含用于模拟无线信道环境的第三预设模型进行联合训练的具体处理是不同的,因此分以下四种情况分别进行说明:In the process of using the above-mentioned second loss function for training, since the sub-models contained in the first preset model are different and whether the third preset model for simulating the wireless channel environment is included for joint training is different, Therefore, the following four situations are described separately:
情况五,终端设备采用训练样本对第一预设模型和第二预设模型进行联合训练得到训练后的第一模型和第二模型;其中,所述第一预设模型中包括估计预设子模型和压缩预设子模型。Case five, the terminal device uses training samples to jointly train the first preset model and the second preset model to obtain the trained first model and the second model; wherein, the first preset model includes estimated preset sub-models Model and compression preset submodels.
本情况中各个模型的组成以及各个模型之间的输入输出关系与前述情况一相同,具体可以参见图8a,这里不做重复说明。In this case, the composition of each model and the input-output relationship between each model are the same as the previous case 1. For details, please refer to FIG. 8 a , which will not be repeated here.
采用训练样本对第一预设模型和第二预设模型进行联合训练,包括:Using training samples to jointly train the first preset model and the second preset model, including:
将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;inputting the first training sample into the estimated preset sub-model of the first preset model, and obtaining initial information output by the estimated preset sub-model;
将所述初始信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的信息;inputting the initial information into the compressed preset sub-model of the first preset model to obtain compressed information output by the compressed preset sub-model;
将所述压缩后的信息输入所述第二预设模型,得到所述第二预设模型输出的恢复信息;inputting the compressed information into the second preset model to obtain restoration information output by the second preset model;
基于所述恢复信息与所述初始信息之间的第一差异程度,以及基于所述初始信息与第二训练样本之间的第二差异程度,确定第二损失函数;所述第二训练样本与所述第一训练样本相对应;Determine a second loss function based on a first degree of difference between the restored information and the initial information, and based on a second degree of difference between the initial information and a second training sample; the second training sample and The first training sample corresponds to;
根据所述第二损失函数进行反向传导更新所述第一预设模型和所述第二预设模型。performing reverse conduction according to the second loss function to update the first preset model and the second preset model.
其中,所述第一训练样本可以为参考信号样本。所述参考信号样本可以为历史采集得到的原始参考信号、或者处理 过的参考信号。再具体的,所述参考信号样本可以为下行参考信号样本。应理解,本实施例并不限定所述第一训练样本一定为所述下行参考信号样本,还可以采用上行参考信号样本或其他参考信号样本,只是本实施例不做穷举。Wherein, the first training samples may be reference signal samples. The reference signal sample may be an original reference signal obtained through historical acquisition, or a processed reference signal. More specifically, the reference signal samples may be downlink reference signal samples. It should be understood that this embodiment does not limit that the first training samples must be the downlink reference signal samples, and uplink reference signal samples or other reference signal samples may also be used, which are not exhaustive in this embodiment.
需要指出的是,除了所述第一训练样本之外,输入所述第一预设模型的估计预设子模型的信息还可以无线信道或者场景相关的其他信息,比如可以包括以下至少之一:信道的信噪比、信干扰噪比、信道类型、带宽信息、时延信息等。关于在联合训练的过程中,输入以上信息的一种还是多种可以根据实际情况或实际场景来确定,这里不对其进行限定。It should be pointed out that, in addition to the first training samples, the information input into the estimated preset sub-model of the first preset model may also be wireless channel or other scene-related information, for example, may include at least one of the following: Channel signal-to-noise ratio, signal-to-interference-noise ratio, channel type, bandwidth information, delay information, etc. Whether one or more of the above information is input during the joint training process may be determined according to actual conditions or actual scenarios, and is not limited here.
所述第一预设模型的估计预设子模型的具体功能可以为:基于第一训练样本进行信道估计得到初始信息。其中,信道估计可以采用最小均方误差(MMSE)等算法。The specific function of the estimation preset sub-model of the first preset model may be: perform channel estimation based on the first training samples to obtain initial information. Wherein, the channel estimation may adopt algorithms such as minimum mean square error (MMSE).
上述初始信息可以为矩阵,该矩阵的维度这里不做限定,可以为二维或更多维度的矩阵。所述矩阵中的每一个位置上的数值用于表示对应多个维度的相应粒度下所对应的信道质量。其中,所述信道质量可以采用信号强度值来表征;所述信号强度值的单位可以是dBm,或所述信号强度值没有单位而是归一化之后所得到的数值。The aforementioned initial information may be a matrix, and the dimension of the matrix is not limited here, and may be a matrix of two or more dimensions. The value at each position in the matrix is used to represent the corresponding channel quality at the corresponding granularity corresponding to multiple dimensions. Wherein, the channel quality may be characterized by a signal strength value; the unit of the signal strength value may be dBm, or the signal strength value has no unit but a value obtained after normalization.
所述第一预设模型的压缩预设子模型对输入信息的数据量进行压缩。所述压缩子模型的输出信息与输入信息之间的压缩率可以为训练时确定的,比如,压缩率可以为千分之五、千分之二、百分之十等等,这里不进行穷举。所述压缩预设子模型得到的压缩后的信息包含的数据量小于其输入的初始信息的数据量。上述压缩后的信息与初始信息的形式为相同的,比如所述初始信息为矩阵,相应的所述压缩后的信息也为矩阵,所述初始信息与所述压缩后的信息的矩阵维度是相同的但数据量是不同的。The compressed preset sub-model of the first preset model compresses the data volume of the input information. The compression rate between the output information of the compressed sub-model and the input information can be determined during training, for example, the compression rate can be 5/1000, 2/1000, 10%, etc. lift. The compressed information obtained by compressing the preset sub-model contains less data than the input initial information. The form of the above-mentioned compressed information is the same as that of the initial information, for example, the initial information is a matrix, and the corresponding compressed information is also a matrix, and the matrix dimensions of the initial information and the compressed information are the same but the amount of data is different.
所述第二预设模型的功能可以是对其输入信息进行解压缩。第二预设模型的解压缩率应该使得其得到的恢复信息与初始信息包含相同的数据。The function of the second preset model may be to decompress its input information. The decompression rate of the second preset model should make the obtained restored information contain the same data as the original information.
所述基于第二损失函数进行反向传导更新所述第一预设模型和所述第二预设模型具体可以指的是:基于第二损失函数进行反向传导更新所述估计预设子模型的模型参数、所述压缩预设子模型的模型参数和所述第二预设模型的模型参数。The performing reverse conduction based on the second loss function to update the first preset model and the second preset model may specifically refer to: performing reverse conduction based on the second loss function to update the estimated preset sub-model , the model parameters of the compressed preset sub-model and the model parameters of the second preset model.
针对上述训练还需要指出,关于上述训练收敛的方式可以包括以下至少之一:判断迭代训练的次数是否达到预设次数,判断第一差异程度是否小于第一预设门限值,判断第二差异程度是否小于第二预设门限值。其中,所述预设次数、所述第一预设门限值以及第二预设门限值可以根据实际情况设置,不对其进行穷举。也就是说,基于上述方式确定训练完成时,可以将训练完成后的第一预设模型作为第一模型,将训练完成后的第二预设模型作为第二模型。For the above-mentioned training, it should also be pointed out that the way of the above-mentioned training convergence can include at least one of the following: judging whether the number of iterative training reaches the preset number of times, judging whether the first difference degree is less than the first preset threshold value, judging the second difference Whether the degree is smaller than the second preset threshold value. Wherein, the preset times, the first preset threshold value and the second preset threshold value can be set according to actual conditions, and are not exhaustive. That is to say, when it is determined that the training is completed based on the above manner, the first preset model after the training can be used as the first model, and the second preset model after the training can be used as the second model.
情况六、与情况五不同在于,所述第一预设模型中包括估计预设子模型、预设信道生成子模型和压缩预设子模型。Case 6 is different from Case 5 in that the first preset model includes an estimation preset sub-model, a preset channel generation sub-model, and a compression preset sub-model.
本情况中各个模型的组成以及各个模型之间的输入输出关系与前述情况二相同,具体可以参见图8b,这里不做重复说明。The composition of each model in this case and the input-output relationship between each model are the same as those in the second case, for details, please refer to FIG. 8 b , which will not be repeated here.
采用训练样本对第一预设模型和第二预设模型进行联合训练,包括:Using training samples to jointly train the first preset model and the second preset model, including:
将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;inputting the first training sample into the estimated preset sub-model of the first preset model, and obtaining initial information output by the estimated preset sub-model;
将所述初始信息输入所述第一预设模型的信道生成预设子模型,得到所述信道生成预设子模型输出的初始信息的特征向量信息;inputting the initial information into the channel generation preset sub-model of the first preset model, and obtaining eigenvector information of the initial information output by the channel generation preset sub-model;
将所述初始信息的特征向量信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的特征向量信息;Inputting the eigenvector information of the initial information into the compressed preset sub-model of the first preset model to obtain the compressed eigenvector information output by the compressed preset sub-model;
将所述压缩后的特征向量信息输入所述第二预设模型,得到所述第二预设模型输出的恢复的特征向量信息;inputting the compressed feature vector information into the second preset model to obtain restored feature vector information output by the second preset model;
基于所述恢复的特征向量信息以及所述初始信息的特征向量信息之间的第一差异程度,以及基于所述初始信息与第二训练样本之间的第二差异程度,确定第二损失函数;所述第二训练样本与所述第一训练样本对应;determining a second loss function based on a first degree of difference between the restored feature vector information and the feature vector information of the initial information, and based on a second degree of difference between the initial information and a second training sample; The second training sample corresponds to the first training sample;
根据所述第二损失函数进行反向传导更新所述第一预设模型和所述第二预设模型。performing reverse conduction according to the second loss function to update the first preset model and the second preset model.
关于第一训练样本的具体说明与前述情况五相同,因此不做重复说明。需要指出的是,除了第一训练样本之外,输入所述第一预设模型的估计预设子模型的信息还可以无线信道或者场景相关的其他信息,比如可以包括以下至少之一:信道的信噪比、信干扰噪比、信道类型、带宽信息、时延信息等。关于在联合训练的过程中,输入以上信息的一种还是多种可以根据实际情况或实际场景来确定,这里不对其进行限定。The specific description about the first training sample is the same as that of the fifth case above, so the description will not be repeated. It should be pointed out that, in addition to the first training samples, the information input into the estimated preset sub-model of the first preset model may also include other information related to wireless channels or scenes, for example, may include at least one of the following: Signal-to-noise ratio, signal-to-interference-noise ratio, channel type, bandwidth information, delay information, etc. Whether one or more of the above information is input during the joint training process may be determined according to actual conditions or actual scenarios, and is not limited here.
所述第一预设模型的估计预设子模型的具体功能与前述情况五相同。The specific function of the estimation preset sub-model of the first preset model is the same as the fifth case above.
所述信道生成预设子模型的功能可以是对所述初始信息进行特征分解,得到所述初始信息的特征向量信息。其中,所述初始信息的特征向量信息可以包括R组特征向量序列。对所述初始信息进行特征分解的方式可以采用奇异值分解(SVD,Singular Value Decomposition)的方式。A function of the channel generation preset submodel may be to perform eigendecomposition on the initial information to obtain eigenvector information of the initial information. Wherein, the eigenvector information of the initial information may include R groups of eigenvector sequences. The method of performing eigendecomposition on the initial information may adopt a method of Singular Value Decomposition (SVD, Singular Value Decomposition).
所述第一预设模型的压缩预设子模型对输入信息的数据量进行压缩。所述压缩子模型的输出信息与输入信息之间的压缩率可以为训练时确定的,比如,压缩率可以为千分之五、千分之二、百分之十等等,这里不进行穷举。在上述处理中,所述压缩预设子模型得到的压缩后的特征向量信息包含的数据量小于其输入的初始信息的特征向量信息的数据量。上述压缩后的特征向量信息与初始信息的特征向量信息的形式为相同的,比如初始信息的特征向量信息为R组特征向量序列,所述压缩后的特征向量信息也为R组特征向量序列但两者包含的数据量是不同的。The compressed preset sub-model of the first preset model compresses the data volume of the input information. The compression rate between the output information of the compressed sub-model and the input information can be determined during training, for example, the compression rate can be 5/1000, 2/1000, 10%, etc. lift. In the above processing, the compressed eigenvector information obtained by compressing the preset sub-model contains less data than the eigenvector information of the input initial information. The above compressed feature vector information is in the same form as the feature vector information of the initial information. For example, the feature vector information of the initial information is a sequence of R groups of feature vectors, and the compressed feature vector information is also a sequence of feature vectors of groups R but The amount of data contained in the two is different.
所述第二预设模型的功能可以是对其输入信息进行解压缩。在上述处理中,所述第二预设模型的输入信息为压缩后的特征向量信息,第二预设模型的输出为恢复的特征向量信息。第二预设模型的解压缩率应该使得其得到的恢复的特征向量信息与初始信息的特征向量信息包含相同或基本相同的数据。The function of the second preset model may be to decompress its input information. In the above processing, the input information of the second preset model is compressed feature vector information, and the output of the second preset model is restored feature vector information. The decompression rate of the second preset model should be such that the obtained restored feature vector information contains the same or substantially the same data as the feature vector information of the initial information.
根据所述第二损失函数进行反向传导更新所述第一预设模型和所述第二预设模型具体可以指的是:基于第二损失函数进行反向传导更新所述估计预设子模型的模型参数、所述信道生成预设子模型的模型参数、所述压缩预设子模型的模型参数和所述第二预设模型的模型参数。Performing reverse conduction according to the second loss function to update the first preset model and the second preset model may specifically refer to: performing reverse conduction based on the second loss function to update the estimated preset sub-model model parameters of the channel generation preset sub-model, model parameters of the compression preset sub-model and model parameters of the second preset model.
关于上述训练收敛的确定方式与前述情况五相同,不做重复说明。The method for determining the above-mentioned training convergence is the same as that of the above-mentioned case five, and repeated explanations are not repeated.
情况七,采用训练样本对第一预设模型和第二预设模型进行联合训练,得到训练后的所述第一模型和所述第二模型,包括:Case 7, using training samples to jointly train the first preset model and the second preset model to obtain the trained first model and the second model, including:
采用训练样本对第一预设模型、第二预设模型和第三预设模型进行联合训练,得到训练后的所述第一模型、所述第二模型和第三模型;其中,所述第三模型为训练后的第三预设模型。Using training samples to jointly train the first preset model, the second preset model and the third preset model to obtain the trained first model, the second model and the third model; wherein, the first preset model The three models are the third preset models after training.
采用训练样本对第一预设模型、第二预设模型和第三预设模型进行联合训练,包括:Using training samples to jointly train the first preset model, the second preset model and the third preset model, including:
将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;inputting the first training sample into the estimated preset sub-model of the first preset model, and obtaining initial information output by the estimated preset sub-model;
将所述初始信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的信息;inputting the initial information into the compressed preset sub-model of the first preset model to obtain compressed information output by the compressed preset sub-model;
将所述压缩后的信息输入所述预设模型的第三预设模型,得到所述第三预设模型输出的变换后的信息;inputting the compressed information into a third preset model of the preset model to obtain transformed information output by the third preset model;
将所述变换后的信息输入所述第二预设模型,得到所述第二预设模型输出的恢复信息;inputting the transformed information into the second preset model to obtain restoration information output by the second preset model;
基于所述恢复信息与所述初始信息之间的第一差异程度,以及基于所述初始信息与第二训练样本之间的第二差异程度,确定第二损失函数;所述第二训练样本与所述第一训练样本对应;Determine a second loss function based on a first degree of difference between the restored information and the initial information, and based on a second degree of difference between the initial information and a second training sample; the second training sample and The first training sample corresponds to;
根据所述第二损失函数进行反向传导更新所述第一预设模型、所述第二预设模型和所述第三预设模型。performing reverse conduction according to the second loss function to update the first preset model, the second preset model, and the third preset model.
本情况中各个模型的组成以及各个模型之间的输入输出关系与前述情况三相同,可以参见图8c,这里不做重复说明。The composition of each model in this case and the input-output relationship between each model are the same as those in the third case above, which can be referred to FIG. 8c, and repeated explanations are not repeated here.
关于第一训练样本的具体说明与前述情况五或情况六相同,因此不做重复说明。The specific description about the first training sample is the same as the foregoing case five or six, so no repeated description is given.
所述第一预设模型的估计预设子模型以及所述第一预设模型的压缩预设子模型的具体功能与前述情况五相同,因此不做重复说明。The specific functions of the estimation preset sub-model of the first preset model and the compression preset sub-model of the first preset model are the same as those of the fifth case above, so repeated descriptions will not be made.
在情况七中相对于情况五增加了第三预设模型,关于所述第三预设模型的功能为模拟信道环境,具体处理可以为对输入信息进行数据变换得到数据变换后的信息作为输出信息。其中,所述数据变换的具体处理方式可以包括卷积处理或者等效于卷积的数据处理;其中,所述等效于卷积的数据处理可以为多次傅里叶变换处理,比如,可以为将所述第三预设模型的输入信息通过傅里叶变换转换到频域后相乘再通过傅里叶反变换转换到时域,以此来等效时域卷积的处理。In case seven, a third preset model is added relative to case five. The function of the third preset model is to simulate the channel environment, and the specific processing can be to perform data transformation on input information to obtain data transformed information as output information . Wherein, the specific processing method of the data transformation may include convolution processing or data processing equivalent to convolution; wherein, the data processing equivalent to convolution may be multiple Fourier transform processing, for example, may In order to transform the input information of the third preset model into the frequency domain through Fourier transform, multiply and then transform into the time domain through inverse Fourier transform, the process of convolution in the time domain is equivalent.
所述第二预设模型的功能可以是对其输入信息进行解压缩。The function of the second preset model may be to decompress its input information.
所述基于第二损失函数所确定的差异程度来进行反向传导更新更新所述第一预设模型、所述第二预设模型和所述第三预设模型具体可以指的是:基于第二损失函数进行反向传导更新所述估计预设子模型的模型参数、所述压缩预设子模型的模型参数、所述第二预设模型的模型参数和所述第三预设模型的模型参数。The performing reverse conduction update based on the degree of difference determined by the second loss function to update the first preset model, the second preset model, and the third preset model may specifically refer to: based on the first The second loss function performs reverse conduction to update the model parameters of the estimated preset sub-model, the model parameters of the compressed preset sub-model, the model parameters of the second preset model, and the model of the third preset model parameter.
关于上述训练收敛的方式与前述情况五或情况六相同,不做重复说明。The manner of the above-mentioned training convergence is the same as that of the foregoing case five or six, and no repeated description is made.
情况八,与上述情况七不同在于所述第一预设模型中包括估计预设子模型、预设信道生成子模型和压缩预设子模型。Case 8 is different from the above case 7 in that the first preset model includes an estimation preset submodel, a preset channel generation submodel, and a compression preset submodel.
采用训练样本对第一预设模型、第二预设模型和第三预设模型进行联合训练,包括:Using training samples to jointly train the first preset model, the second preset model and the third preset model, including:
将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;inputting the first training sample into the estimated preset sub-model of the first preset model, and obtaining initial information output by the estimated preset sub-model;
将所述初始信息输入所述第一预设模型的信道生成预设子模型,得到所述信道生成预设子模型输出的初始信息的特征向量信息;inputting the initial information into the channel generation preset sub-model of the first preset model, and obtaining eigenvector information of the initial information output by the channel generation preset sub-model;
将所述初始信息的特征向量信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的特征向量信息;Inputting the eigenvector information of the initial information into the compressed preset sub-model of the first preset model to obtain the compressed eigenvector information output by the compressed preset sub-model;
将所述压缩后的特征向量信息输入所述预设模型中的第三预设模型,得到所述第三预设模型输出的变换后的特征向量信息;inputting the compressed feature vector information into a third preset model among the preset models, to obtain transformed feature vector information output by the third preset model;
将所述变换后的特征向量信息输入所述第二预设模型,得到所述第二预设模型输出的恢复的特征向量信息;inputting the transformed eigenvector information into the second preset model to obtain restored eigenvector information output by the second preset model;
基于所述恢复的特征向量信息以及所述初始信息的特征向量信息之间的第一差异程度,以及基于所述初始信息与第二训练样本之间的第二差异程度,确定第二损失函数;所述第二训练样本与所述第一训练样本对应;determining a second loss function based on a first degree of difference between the restored feature vector information and the feature vector information of the initial information, and based on a second degree of difference between the initial information and a second training sample; The second training sample corresponds to the first training sample;
根据所述第二损失函数进行反向传导更新所述第一预设模型、所述第二预设模型和所述第三预设模型。performing reverse conduction according to the second loss function to update the first preset model, the second preset model, and the third preset model.
本情况中各个模型的组成以及各个模型之间的输入输出关系与前述情况四相同,可以参见图8d,这里不做重复说明。In this case, the composition of each model and the input-output relationship between each model are the same as the foregoing case 4, which can be referred to FIG. 8d , and repeated descriptions are not repeated here.
关于第一训练样本的具体说明与前述情况五、情况六、情况七中任意之一相同,因此不做重复说明。The specific description about the first training sample is the same as any one of the above-mentioned case 5, case 6, and case 7, so the description will not be repeated.
所述第一预设模型的估计预设子模型的具体功能与情况五、情况六、情况七中任意之一相同。The specific function of the estimated preset sub-model of the first preset model is the same as any one of the fifth, sixth, and seventh cases.
所述信道生成预设子模型的功能可以是对所述初始信息进行特征分解,得到所述初始信息的特征向量信息。其中,所述初始信息的特征向量信息可以包括R组特征向量序列。举例来说,对初始信息进行特征分解的方式可以采用奇异值分解(SVD,Singular Value Decomposition)的方式。A function of the channel generation preset submodel may be to perform eigendecomposition on the initial information to obtain eigenvector information of the initial information. Wherein, the eigenvector information of the initial information may include R groups of eigenvector sequences. For example, the method of performing eigendecomposition on the initial information may adopt a method of Singular Value Decomposition (SVD, Singular Value Decomposition).
所述第一预设模型的压缩预设子模型对输入信息的数据量进行压缩。所述压缩子模型的输出信息与输入信息之间的压缩率可以为训练时确定的,比如,压缩率可以为千分之五、千分之二、百分之十等等,这里不进行穷举。The compressed preset sub-model of the first preset model compresses the data volume of the input information. The compression rate between the output information of the compressed sub-model and the input information can be determined during training, for example, the compression rate can be 5/1000, 2/1000, 10%, etc. lift.
所述第三预设模型的功能为模拟无线信道环境,具体处理可以为对输入信息进行数据变换得到数据变换后的信息作为输出信息。其中,所述数据变换的具体处理方式可以包括卷积处理或者等效于卷积的数据处理;其中,所述等效于卷积的数据处理可以为多次傅里叶变换处理,比如,可以为将所述第三预设模型的输入信息通过傅里叶变换转换到频域后相乘再通过傅里叶反变换转换到时域,以此来等效时域卷积的处理。The function of the third preset model is to simulate the wireless channel environment, and the specific processing may be to perform data transformation on input information to obtain information after data transformation as output information. Wherein, the specific processing method of the data transformation may include convolution processing or data processing equivalent to convolution; wherein, the data processing equivalent to convolution may be multiple Fourier transform processing, for example, may In order to transform the input information of the third preset model into the frequency domain through Fourier transform, multiply and then transform into the time domain through inverse Fourier transform, the process of convolution in the time domain is equivalent.
所述第二预设模型的功能可以是对其输入信息进行解压缩。所述第二预设模型的输入信息为变换后的特征向量信息,第二预设模型的输出为恢复的特征向量信息。第二预设模型的解压缩率应该使得其得到的恢复的特征向量信息与初始信息的特征向量信息的包含接近或相同的数据。The function of the second preset model may be to decompress its input information. The input information of the second preset model is transformed feature vector information, and the output of the second preset model is restored feature vector information. The decompression rate of the second preset model should be such that the obtained restored feature vector information and the feature vector information of the initial information contain data that are close to or identical.
所述基于第一损失函数所确定的差异程度来进行反向传导更新更新所述第一预设模型、所述第二预设模型和所述第三预设模型具体可以指的是:基于第一损失函数所确定的差异程度来进行反向传导更新所述估计预设子模型的模型参数、所述信道生成预设子模型的模型参数、所述压缩预设子模型的模型参数、所述第二预设模型的模型参数和所述第三预设模型的模型参数。The performing reverse conduction update based on the degree of difference determined by the first loss function to update the first preset model, the second preset model, and the third preset model may specifically refer to: based on the first The degree of difference determined by a loss function is used to perform reverse conduction to update the model parameters of the estimated preset sub-model, the model parameters of the channel generation preset sub-model, the model parameters of the compressed preset sub-model, the The model parameters of the second preset model and the model parameters of the third preset model.
关于上述训练收敛的方式与前述情况五、情况六、情况七中任意之一相同,因此不做重复说明。The manner of the above-mentioned training convergence is the same as any one of the aforementioned cases 5, 6, and 7, so repeated explanations will not be made.
通过采用以上第二种方式可以得到联合训练后的第一模型、第二模型,或者得到联合训练后的第一模型、第二模型以及第三模型。By adopting the second method above, the first model and the second model after joint training, or the first model, the second model and the third model after joint training can be obtained.
在上述第二种方式所提供的联合训练得到第一模型和第二模型,以及联合训练得到第一模型、第二模型和第三模型的处理中使用了训练样本,下面针对训练样本进行详细说明:In the joint training provided by the second method above to obtain the first model and the second model, and the joint training to obtain the first model, the second model and the third model, the training samples are used, and the training samples are described in detail below :
所述训练样本中可以包含第一训练样本。所述第一训练样本可以为参考信号样本。所述参考信号样本可以为历史采集得到的原始参考信号、或者处理过的参考信号。其中,所述原始参考信号可以指的是未经过无线信道传输的参考信号。获取处理参考信号的方法可以包括:将原始参考信号通过无线信道(或真实无线信道、或实际无线信道)后接收到的参考信号作为处理过的参考信号。或者,获取处理参考信号的方法可以包括:将原始参考信号通过模拟的无线信道后接收 的参考信号作为处理过的参考信号。再进一步地,原始参考信号可以为下行参考信号,或者上行参考信号。The training samples may include a first training sample. The first training samples may be reference signal samples. The reference signal samples may be original reference signals or processed reference signals obtained through historical acquisition. Wherein, the original reference signal may refer to a reference signal that has not been transmitted through a wireless channel. The method for acquiring and processing the reference signal may include: using the reference signal received after the original reference signal passes through the wireless channel (or the real wireless channel, or the real wireless channel) as the processed reference signal. Or, the method for obtaining and processing the reference signal may include: using the reference signal received after the original reference signal passes through the simulated wireless channel as the processed reference signal. Still further, the original reference signal may be a downlink reference signal or an uplink reference signal.
所述第一训练样本分布在第一维度和/或第二维度。The first training samples are distributed in the first dimension and/or the second dimension.
其中,所述第一维度为时域维度;所述第一训练样本包括在所述时域维度中的m个时间单元内分布的第一信息样本;m为正整数。其中,所述m个时间单元中每个时间单元中可以分布有n个第一信息样本,n为正整数。所述每个时间单元可以包含有至少一个时隙、或至少一个符号(比如OFDM符号)。Wherein, the first dimension is a time domain dimension; the first training samples include first information samples distributed in m time units in the time domain dimension; m is a positive integer. Wherein, n first information samples may be distributed in each of the m time units, where n is a positive integer. Each time unit may include at least one time slot, or at least one symbol (such as an OFDM symbol).
举例来说,所述第一信息样本为下行参考信号样本,每个时间单元内包含的时隙数量可以为c个(c为正整数),在每c个时隙内有n个下行参考信号样本,c和n的组合可以是例如(1,1)(1,2)(1,3)(1,4)(1,6)(2,1)(4,1)(5,1)(8,1)(10,1)。For example, the first information sample is a downlink reference signal sample, the number of time slots contained in each time unit may be c (c is a positive integer), and there are n downlink reference signals in each c time slot A sample, combination of c and n can be e.g. (1,1)(1,2)(1,3)(1,4)(1,6)(2,1)(4,1)(5,1) (8,1)(10,1).
所述第二维度为频域维度;所述第一训练样本包括在所述频域维度中的x个频域资源上分布的第一信息样本;x为正整数。其中,所述x个频域资源中每个频域资源中可以分布有y个第一信息样本,y为正整数。所述每个频域资源可以包含有至少一个资源块(RB)、或至少一个子载波。The second dimension is a frequency domain dimension; the first training samples include first information samples distributed on x frequency domain resources in the frequency domain dimension; x is a positive integer. Wherein, y first information samples may be distributed in each of the x frequency domain resources, and y is a positive integer. Each frequency domain resource may include at least one resource block (RB), or at least one subcarrier.
举例来说,所述第一信息样本为下行参考信号样本,每个频域资源内包含的时隙数量可以为d个(d为正整数),在频域上每d个RB内有y个下行参考信号样本,d和y的组合可以是例如(1,1)(1,2)(2,1)(1,3)(3,1)(1,4)(4,1)(1,6)(6,1)。For example, the first information sample is a downlink reference signal sample, and the number of time slots contained in each frequency domain resource may be d (d is a positive integer), and there are y time slots in every d RBs in the frequency domain Downlink reference signal samples, the combination of d and y can be, for example, (1,1)(1,2)(2,1)(1,3)(3,1)(1,4)(4,1)(1 ,6)(6,1).
上述第一训练样本分布在第一维度和/或第二维度,可以理解为可以仅根据第一训练样本在频域维度上的分布情况来进行后续的训练,也可以仅根据第一训练样本在时域维度上的分布情况来进行后续的训练,还可以根据第一训练样本在频域维度以及时域维度上的分布情况来进行后续的训练。比如,一个第一训练样本在频域维度上包含10个RB、在时域维度上包含1个时隙,每个RB中有3个第一信号样本,每个时隙有1个第一信号样本,则第一训练样本一共包含有30个第一信号样本。The above-mentioned first training samples are distributed in the first dimension and/or the second dimension. It can be understood that the subsequent training can be performed only according to the distribution of the first training samples in the frequency domain dimension, or only based on the distribution of the first training samples in the frequency domain. The subsequent training may be performed according to the distribution of the first training sample in the frequency domain and the time domain. For example, a first training sample contains 10 RBs in the frequency domain dimension and 1 time slot in the time domain dimension, each RB has 3 first signal samples, and each time slot has 1 first signal samples, the first training samples include a total of 30 first signal samples.
上述第一维度和第二维度,即时域维度和频域维度的大小可以相等、也可以不相等。另外,也可以将上述时域维度和频域维度合并成为一个维度,具体合并是可以是先时域维度再频域维度,也可以是先频域维度再时域维度,本实施例不对其进行限定。The sizes of the first dimension and the second dimension, the time domain dimension and the frequency domain dimension may be equal or unequal. In addition, the above-mentioned time-domain dimension and frequency-domain dimension can also be combined into one dimension. Specifically, the combination can be the time-domain dimension first and then the frequency-domain dimension, or the frequency-domain dimension first and then the time-domain dimension, which is not implemented in this embodiment limited.
需要注意的是,因为原始参考信号、或者处理过的参考信号都可以是通过复数来呈现,所以本实施例提供的方案上述第一训练样本可以在上述第一维度和第二维度的基础上额外增加复数的呈现形式(或可以理解为增加一个维度,该维度是将原始参考信号、或者处理过的参考信号的虚部和实部数据独立呈现所造成的),具体的:所述第一训练样本还分布在第三维度。所述第三维度为复数维度;所述第一训练样本包括第一信息样本的实部和第一信息样本的虚部。举例来说,假设一个第一训练样本中在时域维度上包含1个时间单元(比如1个时隙),频域维度上包含10个频域资源(比如10个RB),每个第一信息样本可以表示为实部和虚部,则第一训练样本可以为一个1×10×2的矩阵。It should be noted that, because the original reference signal or the processed reference signal can be represented by complex numbers, the solution provided by this embodiment can be based on the above-mentioned first dimension and second dimension. Increase the presentation form of complex numbers (or it can be understood as adding a dimension, which is caused by the independent presentation of the imaginary part and real part data of the original reference signal or the processed reference signal), specifically: the first training The samples are also distributed in the third dimension. The third dimension is a complex dimension; the first training samples include the real part of the first information sample and the imaginary part of the first information sample. For example, assuming that a first training sample contains 1 time unit (such as 1 time slot) in the time domain dimension, and contains 10 frequency domain resources (such as 10 RBs) in the frequency domain dimension, each first The information sample can be expressed as a real part and an imaginary part, and the first training sample can be a 1×10×2 matrix.
所述训练样本中还包含与所述第一训练样本对应的第二训练样本;所述第二训练样本由T个维度的矩阵构成;T为大于等于2的整数。这里,所述第二训练样本可以用于表征基于所述第一训练样本所期望得到的信道质量、或称为信道响应、或称为信道状态、或称为信道估计结果、或称为信道信息。The training samples also include a second training sample corresponding to the first training sample; the second training sample is composed of a matrix of T dimensions; T is an integer greater than or equal to 2. Here, the second training samples may be used to characterize the expected channel quality based on the first training samples, or channel response, or channel state, or channel estimation results, or channel information .
所述T个维度中包含第四维度和第五维度。The T dimensions include a fourth dimension and a fifth dimension.
所述T个维度的矩阵具体可以为M×N的二维矩阵;其中,M表示在第四维度下的第一粒度的数量,N表示在第五维度下的第二粒度的数量;M和N均为正整数。The matrix of the T dimensions may specifically be a two-dimensional matrix of M×N; wherein, M represents the number of first granularities in the fourth dimension, and N represents the number of second granularities in the fifth dimension; M and N are all positive integers.
也就是说,一个第二训练样本由大小为M×N的二维矩阵构成,其在第四维度上有M个第一粒度,在第五维度上有N个第二粒度;上述M和N可以相等也可以不相等。所述二维矩阵内具体的数值指示代表信道质量某一个第一粒度下接收的信号强度,这里所述二维矩阵内的数值的具体可以指的是信号强度值,其单位可以是dBm,或没有单位而是归一化之后所得到的数值。此外,也可以将M×N的二维矩阵合成成为1×(M×N)大小或者(M×N)×1大小的一维数据,具体变换是可以是先第四维度再第五维度,也可以是先第五维度再第四维度,本实施例不对其进行限定。That is to say, a second training sample consists of a two-dimensional matrix with a size of M×N, which has M first granularities in the fourth dimension and N second granularities in the fifth dimension; the above M and N May or may not be equal. The specific numerical indication in the two-dimensional matrix represents the received signal strength at a certain first granularity of the channel quality. The specific numerical value in the two-dimensional matrix here may refer to the signal strength value, and its unit may be dBm, or There is no unit but the value obtained after normalization. In addition, the two-dimensional matrix of M×N can also be synthesized into one-dimensional data of size 1×(M×N) or (M×N)×1. The specific transformation can be the fourth dimension first and then the fifth dimension. It may also be the fifth dimension first and then the fourth dimension, which is not limited in this embodiment.
可选地,所述第四维度为频域维度;所述第一粒度包含以下之一:L1个资源块RB,L2个子载波;L1和L2为正整数。或者,所述第四维度为时域维度;所述第一粒度包含以下之一:K1个微秒、K2个符号长度、K3个符号的采样点个数;K1、K2和K3为正整数。所述符号为正交频分复用符号(OFDM,Orthogonal Frequency Division Multiplexing)。这里,第四维度为时域维度的时候,所述第一粒度还可以称为时延粒度。Optionally, the fourth dimension is a frequency domain dimension; the first granularity includes one of the following: L1 resource blocks RB, L2 subcarriers; L1 and L2 are positive integers. Alternatively, the fourth dimension is a time domain dimension; the first granularity includes one of the following: K1 microseconds, K2 symbol length, and K3 symbol sampling points; K1, K2, and K3 are positive integers. The symbols are Orthogonal Frequency Division Multiplexing (OFDM, Orthogonal Frequency Division Multiplexing). Here, when the fourth dimension is a time domain dimension, the first granularity may also be called a delay granularity.
举例来说,在第一训练样本为参考信号样本或下行参考信号样本的时候,所述第二训练样本可以为与所述参考信号样本所对应的信道信息样本,或者还可以称为信道状态样本等等,这里不对其名称进行穷举。当第四维度是频域维度时,第一粒度可以是L1个RB(L1大于等于1,例如2RB,4RB,8RB),则一个第二训练样本在频域维度上的分布范围是M×L1个RB所对应的频域范围;或者第一粒度可以是L2个子载波(L2大于1,例如4个子载波,6个子载波,18个子载波),则一个第二训练样本在频域维度上的分布是M×L2个子载波对应的频域范围。当第四维度是时域维度时,第一粒度可以是时延粒度,例如一个第一粒度是K1个微秒、或者K2个符号长度、或者K3个符号的采样点个数,这里所述符号可以是一个OFDM符号;当第四维度是时域维度且第一粒度为K1个微秒时,一个第二训练样本在时域维度上的分布范围是M×K1个微秒对应的时域范围。For example, when the first training sample is a reference signal sample or a downlink reference signal sample, the second training sample may be a channel information sample corresponding to the reference signal sample, or may also be called a channel state sample Wait, I'm not going to exhaust the names here. When the fourth dimension is the frequency domain dimension, the first granularity can be L1 RBs (L1 is greater than or equal to 1, such as 2RB, 4RB, 8RB), and the distribution range of a second training sample in the frequency domain dimension is M×L1 The frequency domain range corresponding to each RB; or the first granularity can be L2 subcarriers (L2 is greater than 1, such as 4 subcarriers, 6 subcarriers, and 18 subcarriers), then the distribution of a second training sample on the frequency domain dimension is the frequency domain range corresponding to M×L2 subcarriers. When the fourth dimension is a time-domain dimension, the first granularity may be a delay granularity, for example, a first granularity is the number of sampling points of K1 microseconds, or K2 symbol lengths, or K3 symbols, where the symbols It can be an OFDM symbol; when the fourth dimension is the time domain dimension and the first granularity is K1 microseconds, the distribution range of a second training sample in the time domain dimension is the time domain range corresponding to M×K1 microseconds .
所述第五维度为空间域维度;相应的,所述第二粒度为一对收发天线或到达角度的间隔。The fifth dimension is a space domain dimension; correspondingly, the second granularity is an interval between a pair of transmitting and receiving antennas or an angle of arrival.
举例来说,第五维度为所述空间域维度,具体地可以是天线维度,例如第五维度上由N个天线对构成,相应的,第二粒度是一对收发天线。或者,第五维度为空间域维度,具体的可以是角度域维度,例如第五维度上由N个到达角度构成,第二粒度是上述N个到达角度之间的到达角度的间隔大小。For example, the fifth dimension is the space domain dimension, specifically, the antenna dimension, for example, the fifth dimension is composed of N antenna pairs, and correspondingly, the second granularity is a pair of transmitting and receiving antennas. Alternatively, the fifth dimension is a space domain dimension, specifically an angle domain dimension, for example, the fifth dimension is composed of N arrival angles, and the second granularity is the interval between the above N arrival angles.
再进一步地,所述二维矩阵中的第ij个位置的数值用于表示在所述第四维度下的第i个第一粒度以及第五维度的第j个第二粒度下的信道质量;i和j均为正整数。也就是说,在使用一个第一训练样本的情况下,用于表示第二训练样本的所述二维矩阵中某一个位置处的数值(或称为指示值)代表了在第四维度以及第五维度这样的组合下的所期望得到的信道质量情况。其中,信道质量或信道质量情况可以采用信号强度来表征,其数值(或称为指示值)的单位可以是dBm,或没有单位而是归一化之后所得到的数值。Still further, the value at the ijth position in the two-dimensional matrix is used to represent the channel quality at the i-th first granularity in the fourth dimension and the j-th second granularity in the fifth dimension; Both i and j are positive integers. That is to say, in the case of using a first training sample, the value (or referred to as an indicator value) at a certain position in the two-dimensional matrix used to represent the second training sample represents the The expected channel quality situation under such a combination of five dimensions. Wherein, the channel quality or the channel quality situation can be characterized by signal strength, and the unit of the value (or indicator value) can be dBm, or there is no unit but a value obtained after normalization.
例如,结合图9来说,在M×N的二维矩阵中,若第四维度表示频域维度,第五维度为空间域维度具体为天线维度, 第一粒度为2RB,第二粒度为1对收发天线;若M×N的二维矩阵中的第ij个位置为第i=3j=6个位置,则为图9中所示出的第3行第6列上黑色方框所在位置,该位置处的数值(或称为指示值)可以用于表示第6对收发天线上的第3个2RB带宽(也就是第5个RB至第6个RB)上的信道质量(或信道质量情况)。另外,图9中还可以用S来表示第二训练样本的数量,S可以为大于等于1的整数,也就是说,第二训练样本可以包含一个或多个。For example, referring to FIG. 9, in the M×N two-dimensional matrix, if the fourth dimension represents the frequency domain dimension, the fifth dimension is the space domain dimension, specifically the antenna dimension, the first granularity is 2RB, and the second granularity is 1 For the transceiver antenna; if the ij position in the two-dimensional matrix of M×N is the i=3j=6 position, then it is the position of the black box on the 3rd row and the 6th column shown in Fig. 9, The value (or indicator value) at this position can be used to represent the channel quality (or channel quality situation) on the third 2RB bandwidth (that is, the fifth RB to the sixth RB) on the sixth pair of transceiver antennas ). In addition, in FIG. 9 , S may also be used to represent the number of second training samples, and S may be an integer greater than or equal to 1, that is, the second training samples may include one or more.
再例如,在图10中展示的M×N的二维矩阵中,第四维度表示时域维度,第四维度为时域维度的时候,所述第一粒度为1个时延粒度;第五维度为空间域维度具体为角度维度,第二粒度为1个角度基本粒度(比如可以是1个到达角度的间隔);若M×N的二维矩阵中的第ij个位置为第i=4j=5个位置,则为图10中所示出的中的第4行第5列上黑色方框所在位置,该位置处的数值(或指示值)可以表示第5个空间粒度(例如到达角度的间隔)内的第4个时延粒度上的信道质量(或信道质量情况)。For another example, in the M×N two-dimensional matrix shown in FIG. 10 , the fourth dimension represents the time domain dimension, and when the fourth dimension is the time domain dimension, the first granularity is one delay granularity; the fifth The dimension is the spatial domain dimension, specifically the angle dimension, and the second granularity is the basic granularity of 1 angle (for example, it can be the interval of 1 arrival angle); if the ij-th position in the M×N two-dimensional matrix is i=4j = 5 positions, then it is the position of the black box on the 4th row and 5th column shown in Fig. The channel quality (or channel quality situation) at the 4th delay granularity within the interval of .
所述T个维度中还包括第六维度。相应的,所述T个维度的矩阵为M×N×W的三维矩阵;其中,M表示在第四维度下的第一粒度的数量,N表示在第五维度下的第二粒度的数量,W表示在第六维度下的第三粒度的数量;M、N和W均为正整数。The T dimensions also include a sixth dimension. Correspondingly, the matrix of T dimensions is a three-dimensional matrix of M×N×W; wherein, M represents the number of first granularities in the fourth dimension, and N represents the number of second granularities in the fifth dimension, W represents the quantity of the third granularity under the sixth dimension; M, N and W are all positive integers.
所述三维矩阵中的第ijk个位置的数值用于表示在所述第四维度下的第i个第一粒度、第五维度的第j个第二粒度下、所述第六维度的第k个第三粒度下与所述第一训练样本所对应的信道质量;i、j和k均为正整数。The value of the ijkth position in the three-dimensional matrix is used to represent the i-th first granularity in the fourth dimension, the j-th second granularity in the fifth dimension, and the k-th in the sixth dimension channel quality corresponding to the first training sample at a third granularity; i, j and k are all positive integers.
其中,关于第四维度及其第一粒度,第五维度及其第二粒度的说明与前述实施例相同,这里不再重复说明。Wherein, the explanations about the fourth dimension and its first granularity, the fifth dimension and its second granularity are the same as those in the foregoing embodiments, and will not be repeated here.
本实施例中,所述第六维度可以为复数维度。这是由于所述第二训练样本可以用于表征基于所述第一训练样本所期望得到的信道质量(或称为信道响应、或称为信道状态、或称为信道估计结果、或称为信道信息),而上述信道质量还可以通过复数来呈现,因此可以在所述第二训练样本的以上两个维度的基础上增加一个第六维度即复数维度,该复数维度是将所述第二训练样本中的信道质量的虚部和实部独立呈现所产生的。In this embodiment, the sixth dimension may be a complex dimension. This is because the second training samples can be used to characterize the expected channel quality based on the first training samples (or called channel response, or called channel state, or called channel estimation result, or called channel information), and the above-mentioned channel quality can also be presented by a complex number, so a sixth dimension, that is, a complex number dimension, can be added on the basis of the above two dimensions of the second training sample, and the complex number dimension is the second training sample. The imaginary and real parts of the channel quality in the samples are presented independently generated.
具体来说,所述第六维度为复数维度;所述第三粒度为1,在所述复数维度下的第三粒度的数量W为2。其中,所述第三粒度为1具体指的是一个实部或一个虚部,所述第三粒度的数量为2指的是在复数维度下可以存在2个第三粒度。Specifically, the sixth dimension is a complex dimension; the third granularity is 1, and the quantity W of the third granularity under the complex dimension is 2. Wherein, the third granularity being 1 specifically refers to a real part or an imaginary part, and the number of the third granularity being 2 means that there may be two third granularities in the complex dimension.
所述k为第一值的情况下,所述三维矩阵中的第ijk个位置的数值用于表示在所述第四维度下的第i个第一粒度、所述第五维度的第j个第二粒度下的信道质量的实部;When the k is the first value, the value of the ijkth position in the three-dimensional matrix is used to represent the i-th first granularity in the fourth dimension and the j-th granularity in the fifth dimension The real part of the channel quality at the second granularity;
所述k为第二值的情况下,所述三维矩阵中的第ijk个位置的数值用于表示在所述第四维度下的第i个第一粒度、所述第五维度的第j个第二粒度下的信道质量的虚部。When the k is the second value, the value of the ijkth position in the three-dimensional matrix is used to represent the i-th first granularity in the fourth dimension and the j-th granularity in the fifth dimension The imaginary part of the channel quality at the second granularity.
其中,所述第一值与所述第二值不同,比如可以设置第一值为1第二值为2,又或者,第一值可以为0第二值可以为1,再或者第一值可以为1第二值可以为0,只要第一值与第二值不同则在本实施例的保护范围内。Wherein, the first value is different from the second value, for example, the first value can be set to 1 and the second value can be 2, or the first value can be 0 and the second value can be 1, or the first value It can be 1 and the second value can be 0, as long as the first value is different from the second value, it is within the protection scope of this embodiment.
举例来说,M×N×W的三维矩阵中,第四维度表示时域维度,第四维度为时域维度的时候,所述第一粒度还可以称为时延粒度;第五维度为空间域维度具体为角度维度,第二粒度为到达角度的间隔;第六维度为复数维度,W为2,k为1表示实部k为2表示虚部。若i=4、j=5、k=1,则表示第4行第5列上的数值(或指示值)为第5个空间粒度(例如到达角度的间隔)内的第4个时延粒度上的信道质量(或信道质量情况)的实部。若i=4、j=5、k=2,则表示第4行第5列上的数值(或指示值)为第5个空间粒度(例如到达角度的间隔)内的第4个时延粒度上的信道质量(或信道质量情况)的虚部。For example, in a three-dimensional matrix of M×N×W, the fourth dimension represents the time domain dimension, and when the fourth dimension is the time domain dimension, the first granularity can also be called the delay granularity; the fifth dimension is the space The domain dimension is specifically the angle dimension, and the second granularity is the interval of the arrival angle; the sixth dimension is the complex number dimension, W is 2, k is 1 for the real part and 2 for the imaginary part. If i=4, j=5, k=1, it means that the value (or indicator value) on the 4th row and 5th column is the 4th delay granularity in the 5th spatial granularity (such as the interval of arrival angle) The real part of the channel quality (or channel quality situation) on . If i=4, j=5, k=2, it means that the value (or indicator value) on the 4th row and 5th column is the 4th delay granularity in the 5th spatial granularity (such as the interval of arrival angle) The imaginary part of the channel quality (or channel quality situation) on .
此外,还需要注意的是,上述第二训练样本还可以是在上述第四维度、第五维度和第六维度的基础上的拆分与组合,例如当第五维度是天线对维度时,还可以拆分成为发送天线子维度和接收天线子维度,从而扩展上述第二训练样本的维度,本实施例不再对拆分后各种可能存在的子维度进行穷举。In addition, it should be noted that the above-mentioned second training samples can also be split and combined on the basis of the above-mentioned fourth dimension, fifth dimension, and sixth dimension. For example, when the fifth dimension is an antenna pair dimension, the It can be split into sending antenna sub-dimensions and receiving antenna sub-dimensions, thereby expanding the dimension of the second training sample. This embodiment does not exhaustively enumerate various possible sub-dimensions after splitting.
最后示例性的对神经网络及其在本实施例中的应用进行相关说明:Finally, an exemplary description of the neural network and its application in this embodiment:
关于基于神经网络如图5所示,其中,神经网络的基本结构包括:输入层,隐藏层和输出层。输入层负责接收数据,隐藏层对数据的处理,最后的结果在输出层产生。在这其中,各个节点代表一个处理单元,可以认为是模拟了一个神经元,多个神经元组成一层神经网络,多层的信息传递与处理构造出一个整体的神经网络。Regarding the neural network as shown in Figure 5, the basic structure of the neural network includes: an input layer, a hidden layer and an output layer. The input layer is responsible for receiving data, the hidden layer processes the data, and the final result is generated in the output layer. Among them, each node represents a processing unit, which can be regarded as simulating a neuron. Multiple neurons form a layer of neural network, and multiple layers of information transmission and processing construct an overall neural network.
进一步地,又结合神经网络深度学习算法,较多的隐藏层被引入,通过多隐藏层的神经网络逐层训练进行特征学习,极大地提升了神经网络的学习和处理能力,并在模式识别、信号处理、优化组合、异常探测等方面广泛被应用。随着深度学习的发展,卷积神经网络被进一步研究。在一个卷积神经网络中,其基本结构包括:输入层、多个卷积层、多个池化层、全连接层及输出层。示例性的,结合图15对本实施例中联合训练完成后得到的第一模型中包含的估计子模型、压缩子模型以及第二模型之间的输入输出进行说明,图15所示意出的输入所述第一模型的估计子模型的信息可以为参考信号,具体可以是长度为144的参考信号序列。所述估计子模型将接收到的输入的信息输入到自身的全连接层中进行处理得到其输出结果,比如图15中所示在估计子模型中的一个全连接层的输出维度为1024大小,最后的一个全连接层的输出维度为8192大小。该估计子模型输出结果可以输入到压缩子模型中以得到压缩子模型的输出,比如图15中所示,在压缩子模型中可以先通过部分全连接层进行处理可以得到输出维度为1024大小的结果,再进行另一部分全连接层的处理最终得到输出维度为256大小的输出结果。压缩子模型将其得到的输出结果输入到第二模型以得到第二模型恢复出来的最终结果,比如图15中所示,在第二模型中可以先通过部分全连接层进行处理可以得到输出维度为1024大小的结果,再进行另一部分全连接层的处理最终得到输出维度为2048大小的结果,最后一部分全连接层处理后可以得到输出维度为8192大小的最终结果。在图15中最终第二模型可以输出大小为8192的信道信息,或者,可以转化为[128,32,2]维度的信道信息矩阵。Furthermore, combined with the deep learning algorithm of the neural network, more hidden layers are introduced, and the feature learning is performed layer by layer through the multi-hidden layer neural network training, which greatly improves the learning and processing capabilities of the neural network, and in pattern recognition, Signal processing, optimal combination, anomaly detection, etc. are widely used. With the development of deep learning, convolutional neural networks are further studied. In a convolutional neural network, its basic structure includes: an input layer, multiple convolutional layers, multiple pooling layers, a fully connected layer, and an output layer. Exemplarily, in conjunction with FIG. 15, the input and output between the estimation sub-model, the compression sub-model and the second model included in the first model obtained after the joint training in this embodiment are described. The input shown in FIG. 15 is The information of the estimated sub-model of the first model may be a reference signal, specifically, a reference signal sequence with a length of 144. The estimation sub-model inputs the received input information into its own fully connected layer for processing to obtain its output result. For example, the output dimension of a fully connected layer in the estimation sub-model shown in FIG. 15 is 1024 in size, The output dimension of the last fully connected layer is 8192. The output result of the estimation sub-model can be input into the compression sub-model to obtain the output of the compression sub-model. As a result, another part of the fully connected layer is processed to finally obtain an output result with an output dimension of 256. The compressed sub-model inputs the output result obtained by it into the second model to obtain the final result restored by the second model. For example, as shown in Figure 15, in the second model, it can be processed through a part of the fully connected layer to obtain the output dimension For the result of 1024 size, another part of the fully connected layer is processed to finally get the result with the output dimension of 2048. After the last part of the fully connected layer is processed, the final result with the output dimension of 8192 can be obtained. In Figure 15, the final second model can output channel information with a size of 8192, or it can be transformed into a [128,32,2]-dimensional channel information matrix.
结合相关技术来说,在一些基于人工智能的关于无线通信系统的研究和设计中,存在以下两种设计思路:In terms of related technologies, in some research and design of wireless communication systems based on artificial intelligence, there are the following two design ideas:
第一种,基于AI的信道估计。如图16所示,以终端设备接收到的参考信号作为输入,通过终端设备中的基于AI的信道估计模块(或称为基于AI的信道估计模型)对该参考信号进行处理,得到信道估计结果。在图16所示的处理中所得到的信道估计结果是以待恢复的无线信道作为期望输出以实现无线信道的最佳估计作为目标。The first is AI-based channel estimation. As shown in Figure 16, the reference signal received by the terminal equipment is used as input, and the reference signal is processed by the AI-based channel estimation module (or AI-based channel estimation model) in the terminal equipment to obtain the channel estimation result . The channel estimation result obtained in the processing shown in FIG. 16 takes the wireless channel to be recovered as the expected output and achieves the best estimation of the wireless channel as the target.
第二种,基于AI的信道状态信息反馈。如图17所示,将编码端处理得到的信道状态信息输入编码端神经网络得到输出的反馈向量,该反馈向量可以是压缩后的信道状态信息;将所述反馈向量发送至解码端,编码端将所述反馈向量输入编码端神经网络得到输出的信道状态信息。本思路中以待反馈的信道信息为输出,以在发送端最大程度压缩上述信道 信息并在接收端最大程度恢复上述信道信息为目标,构建相应基于神经网络的解决方案;其中,所述待反馈的信道信息可以是完整的信道信息、或者部分信道信息、或者处理过的信道信息;所述处理过的信道信息可以为信道特征向量。The second is AI-based channel state information feedback. As shown in Figure 17, the channel state information processed by the encoding end is input into the encoding end neural network to obtain an output feedback vector, which may be compressed channel state information; the feedback vector is sent to the decoding end, and the encoding end Inputting the feedback vector into the neural network at the encoding end to obtain output channel state information. In this idea, the channel information to be fed back is taken as the output, and the above-mentioned channel information is compressed to the greatest extent at the sending end and the above-mentioned channel information is restored to the greatest extent at the receiving end as the goal, and a corresponding neural network-based solution is constructed; wherein, the to-be-feedback The channel information may be complete channel information, or partial channel information, or processed channel information; the processed channel information may be a channel feature vector.
以上两种设计思路对于单模块功能来说已经体现出个各种设计的增益与有效性。但是,对于通信系统来说单独一个模块的性能最佳并不一定是对整体通信系统解决方案的性能最佳。例如,当通过AI设计信道估计模块时,设计目标是利用获得的参考信号等信息最大程度的实现对信道的有效估计使得误差最小。当通过AI设计信道信息反馈时,设计目标是用最小的反馈开销做到最佳的信道信息反馈效果。分别做上述两种模块时,对于信道估计模块的AI训练,以尽可能高的减小信道估计误差,或者人为的控制信道估计的精度以较低神经网络复杂度的需求。但当联合信道估计模块和信道状态信息反馈模块一起分析时就会发现以下问题:上述信道估计误差最小化、以及为了使信道估计误差最小化所做的额外信道估计模型、算法复杂度是否有意义是存在疑问的;如果降低信道估计精度以降低实现复杂度,这部分降低的精度可能存在冗余;从信道状态信息反馈模块来看,在同样的反馈效率的假设下,依赖高精度信道做方案、模型设计也可能会带来额外的冗余;上述高精度信道信息也可能会对于信道状态信息的生成和反馈带来冗余的开销。The above two design ideas have already reflected the gains and effectiveness of various designs for single-module functions. However, the best performance of a single module for the communication system does not necessarily mean the best performance of the overall communication system solution. For example, when designing a channel estimation module through AI, the design goal is to use information such as reference signals obtained to maximize the effective estimation of the channel and minimize the error. When designing channel information feedback through AI, the design goal is to achieve the best channel information feedback effect with the minimum feedback overhead. When doing the above two modules separately, for the AI training of the channel estimation module, the channel estimation error should be reduced as high as possible, or the accuracy of artificially controlled channel estimation can meet the requirements of lower neural network complexity. However, when the joint channel estimation module and the channel state information feedback module are analyzed together, the following problems will be found: whether the above-mentioned channel estimation error minimization, and the additional channel estimation model and algorithm complexity made to minimize the channel estimation error are meaningful There are doubts; if the accuracy of channel estimation is reduced to reduce the complexity of implementation, this part of the reduced accuracy may be redundant; from the perspective of the channel state information feedback module, under the assumption of the same feedback efficiency, relying on high-precision channels to make solutions , Model design may also bring additional redundancy; the above-mentioned high-precision channel information may also bring redundant overhead to the generation and feedback of channel state information.
综上,可以看出在基于AI做无线通信系统信道估计以及信道信息反馈的处理中,由于基于不同功能划分多个模块(或模型)并且多个模块(或模型)分别做独立任务训练,可能会存在需要针对中间设计目标引入对于反馈方案存在影响的问题。In summary, it can be seen that in the processing of channel estimation and channel information feedback in wireless communication systems based on AI, since multiple modules (or models) are divided based on different functions and multiple modules (or models) perform independent task training, it is possible There will be problems that need to be introduced for intermediate design goals that have an impact on the feedback scheme.
而本实施例提供的方案,由于第二信息的处理、传输及解析过程是采用联合训练得到的第一模型和第二模型来实现的,因此可以兼顾整个信息处理、传输及解析中的性能要求,保证了网络整体的性能。进一步地,由于上述方案采用了联合训练得到的第一模型和第二模型,因此可以使得第一模型与第二模型之间的功能相互兼容,使得第一模型以及第二模型的性能均可以达到较优的状态,进而基于该第一模型和第二模型来对第二信息的处理、传输及解析过程进行整体处理时,可以保证整体处理的性能,从而保证了网络整体的性能。再进一步地,本实施例提供的方案通过联合训练模型,可以规避人为划分子模块并做独立任务训练时的训练目标、信息利用程度冗余等问题,可以让整个信道估计、信道信息反馈一体化设计的过程始终以用最小代价让对端恢复信道信息为目的,从而规避了对于非必须信道恢复的诉求与设计、计算开销的浪费,能够保证最终得到的多个模型整体配合达到最优的处理效果,保证了系统整体的性能。However, in the solution provided by this embodiment, since the processing, transmission and analysis of the second information are realized by using the first model and the second model obtained through joint training, it can take into account the performance requirements in the entire information processing, transmission and analysis. , to ensure the overall performance of the network. Furthermore, since the above solution uses the first model and the second model obtained through joint training, the functions between the first model and the second model can be made compatible with each other, so that the performance of the first model and the second model can reach In a better state, when the processing, transmission and analysis process of the second information is processed as a whole based on the first model and the second model, the performance of the whole processing can be guaranteed, thereby ensuring the performance of the whole network. Furthermore, the solution provided in this embodiment can avoid problems such as artificially dividing sub-modules and performing independent task training, such as training objectives and redundant information utilization, through the joint training model, and can integrate the entire channel estimation and channel information feedback. The design process is always aimed at allowing the opposite end to recover channel information with the minimum cost, thereby avoiding unnecessary channel recovery appeals and waste of design and calculation overhead, and ensuring that the overall coordination of multiple models obtained in the end can achieve optimal processing As a result, the overall performance of the system is guaranteed.
图18是根据本申请一实施例的终端设备1800的示意性框图。可以包括:Fig. 18 is a schematic block diagram of a terminal device 1800 according to an embodiment of the present application. Can include:
第一通信单元1801,用于接收第一信息;发送基于第一信息得到的第二信息;The first communication unit 1801 is configured to receive first information; send second information obtained based on the first information;
其中,所述第二信息为所述第一信息经由第一模型处理得到的,所述第二信息用于经由第二模型进行处理以得到信道信息;所述第一模型和第二模型为联合训练得到的。Wherein, the second information is obtained by processing the first information through the first model, and the second information is used for processing through the second model to obtain channel information; the first model and the second model are a joint obtained by training.
所述第二信息为信道压缩信息;The second information is channel compression information;
所述第一模型用于基于输入的所述第一信息进行处理得到信道压缩信息。The first model is used to process the input first information to obtain channel compression information.
所述第一模型包括:估计子模型和压缩子模型;The first model includes: an estimation sub-model and a compression sub-model;
其中,所述估计子模型用于基于所述第一信息进行信道估计得到信道估计信息;Wherein, the estimation sub-model is used to perform channel estimation based on the first information to obtain channel estimation information;
所述压缩子模型用于对所述信道估计信息进行压缩得到信道压缩信息。The compression sub-model is used to compress the channel estimation information to obtain channel compression information.
如图19所示,所述终端设备还包括:As shown in Figure 19, the terminal device also includes:
第一处理单元1802,用于将所述第一信息输入所述估计子模型,得到所述估计子模型输出的信道估计信息;将所述信道估计信息输入所述压缩子模型,得到所述压缩子模型输出的信道压缩信息。The first processing unit 1802 is configured to input the first information into the estimation sub-model to obtain channel estimation information output by the estimation sub-model; input the channel estimation information to the compression sub-model to obtain the compressed Channel compression information for the submodel output.
所述第一信息为参考信号。The first information is a reference signal.
所述第二信息为信道压缩信息;所述信道压缩信息包含压缩的信道估计信息的特征向量信息;The second information is channel compression information; the channel compression information includes eigenvector information of compressed channel estimation information;
所述第一模型用于对输入的所述第一信息进行处理得到压缩的信道估计信息的特征向量信息。The first model is used to process the input first information to obtain eigenvector information of compressed channel estimation information.
所述第一模型包括:估计子模型、信道生成子模型和压缩子模型;The first model includes: an estimation submodel, a channel generation submodel and a compression submodel;
其中,所述估计子模型用于对所述第一信息进行信道估计得到信道估计信息;Wherein, the estimation sub-model is used to perform channel estimation on the first information to obtain channel estimation information;
所述信道生成子模型用于对所述信道估计信息进行特征分解得到信道估计信息的特征向量信息;The channel generation sub-model is used to perform eigendecomposition on the channel estimation information to obtain eigenvector information of the channel estimation information;
所述压缩子模型用于对所述信道估计信息的特征向量信息进行压缩得到压缩的信道估计信息的特征向量信息。The compression sub-model is used to compress the eigenvector information of the channel estimation information to obtain compressed eigenvector information of the channel estimation information.
所述信道信息的特征向量信息包含R组特征向量序列信息;R为正整数。The eigenvector information of the channel information includes R groups of eigenvector sequence information; R is a positive integer.
所述第一处理单元1802,用于将所述第一信息输入所述估计子模型,得到所述估计子模型输出的信道估计信息;将所述信道估计信息输入所述信道生成子模型,得到所述信道生成子模型输出的信道估计信息的特征向量信息;将所述信道估计信息的特征向量信息输入所述压缩子模型,得到所述压缩子模型输出的压缩的信道估计信息的特征向量信息。The first processing unit 1802 is configured to input the first information into the estimation sub-model to obtain channel estimation information output by the estimation sub-model; input the channel estimation information to the channel generation sub-model to obtain The eigenvector information of the channel estimation information output by the channel generation sub-model; input the eigenvector information of the channel estimation information into the compression sub-model, and obtain the eigenvector information of the compressed channel estimation information output by the compression sub-model .
所述第一信息为参考信号;所述信道信息为所述信道信息的特征向量信息。The first information is a reference signal; the channel information is feature vector information of the channel information.
所述第一通信单元1801,用于接收所述第一模型。The first communication unit 1801 is configured to receive the first model.
所述第一模型由以下至少之一携带:下行控制信令、媒体接入控制MAC控制元素CE消息、无线资源控制RRC消息、广播消息、下行数据传输、针对人工智能类业务传输需求的下行数据传输。The first model is carried by at least one of the following: downlink control signaling, media access control MAC control element CE message, radio resource control RRC message, broadcast message, downlink data transmission, downlink data for artificial intelligence service transmission requirements transmission.
所述第一通信单元1801,用于所述终端设备接收估计子模型以及压缩子模型;The first communication unit 1801 is configured for the terminal device to receive an estimated sub-model and a compressed sub-model;
所述第一处理单元1802,用于基于所述估计子模型以及所述压缩子模型,生成所述第一模型。The first processing unit 1802 is configured to generate the first model based on the estimated sub-model and the compressed sub-model.
所述第一通信单元1801,用于接收估计子模型、压缩子模型以及信道生成子模型;The first communication unit 1801 is configured to receive an estimation sub-model, a compression sub-model and a channel generation sub-model;
所述第一处理单元1802,用于基于所述估计子模型、所述压缩子模型以及所述信道生成子模型,生成所述第一模型。The first processing unit 1802 is configured to generate the first model based on the estimation sub-model, the compression sub-model and the channel generation sub-model.
其中,所述估计子模型由以下之一携带:下行控制信令、MAC CE消息、RRC消息、广播消息、下行数据传输、针对人工智能类业务传输需求的下行数据传输;Wherein, the estimation sub-model is carried by one of the following: downlink control signaling, MAC CE message, RRC message, broadcast message, downlink data transmission, downlink data transmission for artificial intelligence business transmission requirements;
所述压缩子模型由以下之一携带:下行控制信令、MAC CE消息、RRC消息、广播消息、下行数据传输、针对人工智能类业务传输需求的下行数据传输;The compressed sub-model is carried by one of the following: downlink control signaling, MAC CE message, RRC message, broadcast message, downlink data transmission, downlink data transmission for artificial intelligence business transmission requirements;
所述信道生成子模型由以下之一携带:下行控制信令、MAC CE消息、RRC消息、广播消息、下行数据传输、针对人工智能类业务传输需求的下行数据传输。The channel generation sub-model is carried by one of the following: downlink control signaling, MAC CE message, RRC message, broadcast message, downlink data transmission, downlink data transmission for artificial intelligence business transmission requirements.
所述第一通信单元1801,用于接收所述第二模型。The first communication unit 1801 is configured to receive the second model.
所述第一通信单元1801,用于接收所述第三模型。The first communication unit 1801 is configured to receive the third model.
所述第三模型用于对所述第一模型输出的第二信息进行数据变换处理后输入所述第二模型;The third model is used to perform data conversion processing on the second information output by the first model and then input it into the second model;
所述第一模型、第二模型以及第三模型为联合训练得到的。The first model, the second model and the third model are obtained through joint training.
所述数据变换处理包括:卷积处理或傅里叶变换处理。The data transformation processing includes: convolution processing or Fourier transform processing.
所述第一处理单元,用于采用训练样本对第一预设模型和第二预设模型进行联合训练,得到训练后的所述第一模型和所述第二模型;The first processing unit is configured to use training samples to jointly train the first preset model and the second preset model to obtain the trained first model and the second model;
其中,所述第一模型为训练后的所述第一预设模型,所述第二模型为训练后的所述第二预设模型。Wherein, the first model is the first preset model after training, and the second model is the second preset model after training.
所述训练采用的损失函数为第一损失函数;The loss function used in the training is the first loss function;
所述第一损失函数为基于所述第二预设模型的输出信息与所述第一预设模型的压缩预设子模型的输入信息之间的差异程度构建的。The first loss function is constructed based on the degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model.
所述第二预设模型的输出信息与所述第一预设模型的压缩预设子模型的输入信息之间的差异程度为基于距离确定的,或者为基于相似程度确定的。The degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model is determined based on a distance, or determined based on a degree of similarity.
所述第一处理单元,用于将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The first processing unit is configured to input a first training sample into an estimated preset sub-model of the first preset model, and obtain initial information output by the estimated preset sub-model;
将所述初始信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的信息;inputting the initial information into the compressed preset sub-model of the first preset model to obtain compressed information output by the compressed preset sub-model;
将所述压缩后的信息输入所述第二预设模型,得到所述第二预设模型输出的恢复信息;inputting the compressed information into the second preset model to obtain restoration information output by the second preset model;
基于所述恢复信息以及所述初始信息之间的差异程度,确定所述第一损失函数;determining the first loss function based on the degree of difference between the restoration information and the initial information;
根据所述第一损失函数进行反向传导更新所述第一预设模型和所述第二预设模型。performing reverse conduction according to the first loss function to update the first preset model and the second preset model.
所述第一处理单元,用于将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The first processing unit is configured to input a first training sample into an estimated preset sub-model of the first preset model, and obtain initial information output by the estimated preset sub-model;
将所述初始信息输入所述第一预设模型的信道生成预设子模型,得到所述信道生成预设子模型输出的初始信息的特征向量信息;inputting the initial information into the channel generation preset sub-model of the first preset model, and obtaining eigenvector information of the initial information output by the channel generation preset sub-model;
将所述初始信息的特征向量信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的特征向量信息;Inputting the eigenvector information of the initial information into the compressed preset sub-model of the first preset model to obtain the compressed eigenvector information output by the compressed preset sub-model;
将所述压缩后的特征向量信息输入所述第二预设模型,得到所述第二预设模型输出的恢复的特征向量信息;inputting the compressed feature vector information into the second preset model to obtain restored feature vector information output by the second preset model;
基于所述恢复的特征向量信息以及所述初始信息的特征向量信息之间的差异程度,确定第一损失函数;determining a first loss function based on the degree of difference between the restored eigenvector information and the eigenvector information of the initial information;
根据所述第一损失函数进行反向传导更新所述第一预设模型和所述第二预设模型。performing reverse conduction according to the first loss function to update the first preset model and the second preset model.
所述第一处理单元,用于采用训练样本对第一预设模型、第二预设模型和第三预设模型进行联合训练,得到训练后的所述第一模型、所述第二模型和第三模型;The first processing unit is configured to use training samples to jointly train the first preset model, the second preset model and the third preset model to obtain the trained first model, the second model and the third model;
其中,所述第三模型为训练后的第三预设模型。Wherein, the third model is a trained third preset model.
所述第一处理单元,用于将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The first processing unit is configured to input a first training sample into an estimated preset sub-model of the first preset model, and obtain initial information output by the estimated preset sub-model;
将所述初始信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的信息;inputting the initial information into the compressed preset sub-model of the first preset model to obtain compressed information output by the compressed preset sub-model;
将所述压缩后的信息输入第三预设模型,得到所述第三预设模型输出的变换后的信息;inputting the compressed information into a third preset model to obtain transformed information output by the third preset model;
将所述变换后的信息输入所述第二预设模型,得到所述第二预设模型输出的恢复信息;inputting the transformed information into the second preset model to obtain restoration information output by the second preset model;
基于所述恢复信息以及所述初始信息之间的差异程度,确定第一损失函数;determining a first loss function based on the degree of difference between the restoration information and the initial information;
根据所述第一损失函数进行反向传导更新所述第一预设模型、第二预设模型和第三预设模型。performing reverse conduction according to the first loss function to update the first preset model, the second preset model and the third preset model.
所述第一处理单元,用于将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The first processing unit is configured to input a first training sample into an estimated preset sub-model of the first preset model, and obtain initial information output by the estimated preset sub-model;
将所述初始信息输入所述第一预设模型的信道生成预设子模型,得到所述信道生成预设子模型输出的初始信息的特征向量信息;inputting the initial information into the channel generation preset sub-model of the first preset model, and obtaining eigenvector information of the initial information output by the channel generation preset sub-model;
将所述初始信息的特征向量信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的特征向量信息;Inputting the eigenvector information of the initial information into the compressed preset sub-model of the first preset model to obtain the compressed eigenvector information output by the compressed preset sub-model;
将所述压缩后的特征向量信息输入第三预设模型,得到所述第三预设模型输出的变换后的特征向量信息;Inputting the compressed feature vector information into a third preset model to obtain transformed feature vector information output by the third preset model;
将所述变换后的特征向量信息输入所述第二预设模型,得到所述第二预设模型输出的恢复的特征向量信息;inputting the transformed eigenvector information into the second preset model to obtain restored eigenvector information output by the second preset model;
基于所述恢复的特征向量信息以及所述初始信息的特征向量信息之间的差异程度,确定第一损失函数;determining a first loss function based on the degree of difference between the restored eigenvector information and the eigenvector information of the initial information;
根据所述第一损失函数进行反向传导更新所述第一预设模型、第二预设模型和第三预设模型。performing reverse conduction according to the first loss function to update the first preset model, the second preset model and the third preset model.
所述训练采用的损失函数为第二损失函数;The loss function used in the training is a second loss function;
所述第二损失函数为基于所述第二预设模型的输出信息与所述第一预设模型的压缩预设子模型的输入信息之间的第一差异程度,以及所述第一预设模型的估计预设子模型的输出信息与第二训练样本之间的第二差异程度构建的;其中,所述第二训练样本与输入所述估计预设子模型的第一训练样本相对应。The second loss function is based on a first degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model, and the first preset The second degree of difference between the output information of the estimated preset sub-model of the model and the second training sample is constructed; wherein the second training sample corresponds to the first training sample input to the estimated preset sub-model.
所述第一差异程度为基于距离确定的,或者为基于相似程度确定的;和/或,The first degree of difference is determined based on a distance, or is determined based on a degree of similarity; and/or,
所述第二差异程度为基于距离确定的,或者为基于相似程度确定的。The second degree of difference is determined based on distance, or determined based on similarity.
所述第一处理单元,用于将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The first processing unit is configured to input a first training sample into an estimated preset sub-model of the first preset model, and obtain initial information output by the estimated preset sub-model;
将所述初始信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的信息;inputting the initial information into the compressed preset sub-model of the first preset model to obtain compressed information output by the compressed preset sub-model;
将所述压缩后的信息输入所述第二预设模型,得到所述第二预设模型输出的恢复信息;inputting the compressed information into the second preset model to obtain restoration information output by the second preset model;
基于所述恢复信息与所述初始信息之间的第一差异程度,以及基于所述初始信息与第二训练样本之间的第二差异程度,确定第二损失函数;所述第二训练样本与所述第一训练样本相对应;Determine a second loss function based on a first degree of difference between the restored information and the initial information, and based on a second degree of difference between the initial information and a second training sample; the second training sample and The first training sample corresponds to;
根据所述第二损失函数进行反向传导更新所述第一预设模型和所述第二预设模型。performing reverse conduction according to the second loss function to update the first preset model and the second preset model.
所述第一处理单元,用于将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出 的初始信息;The first processing unit is configured to input a first training sample into an estimated preset sub-model of the first preset model, and obtain initial information output by the estimated preset sub-model;
将所述初始信息输入所述第一预设模型的信道生成预设子模型,得到所述信道生成预设子模型输出的初始信息的特征向量信息;inputting the initial information into the channel generation preset sub-model of the first preset model, and obtaining eigenvector information of the initial information output by the channel generation preset sub-model;
将所述初始信息的特征向量信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的特征向量信息;Inputting the eigenvector information of the initial information into the compressed preset sub-model of the first preset model to obtain the compressed eigenvector information output by the compressed preset sub-model;
将所述压缩后的特征向量信息输入所述第二预设模型,得到所述第二预设模型输出的恢复的特征向量信息;inputting the compressed feature vector information into the second preset model to obtain restored feature vector information output by the second preset model;
基于所述恢复的特征向量信息以及所述初始信息的特征向量信息之间的第一差异程度,以及基于所述初始信息与第二训练样本之间的第二差异程度,确定第二损失函数;所述第二训练样本与所述第一训练样本对应;determining a second loss function based on a first degree of difference between the restored feature vector information and the feature vector information of the initial information, and based on a second degree of difference between the initial information and a second training sample; The second training sample corresponds to the first training sample;
根据所述第二损失函数进行反向传导更新所述第一预设模型和所述第二预设模型。performing reverse conduction according to the second loss function to update the first preset model and the second preset model.
所述第一处理单元,用于采用训练样本对第一预设模型、第二预设模型和第三预设模型进行联合训练,得到训练后的所述第一模型、所述第二模型和第三模型;The first processing unit is configured to use training samples to jointly train the first preset model, the second preset model and the third preset model to obtain the trained first model, the second model and the third model;
其中,所述第三模型为训练后的第三预设模型。Wherein, the third model is a trained third preset model.
所述第一处理单元,用于将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The first processing unit is configured to input a first training sample into an estimated preset sub-model of the first preset model, and obtain initial information output by the estimated preset sub-model;
将所述初始信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的信息;inputting the initial information into the compressed preset sub-model of the first preset model to obtain compressed information output by the compressed preset sub-model;
将所述压缩后的信息输入所述预设模型的第三预设模型,得到所述第三预设模型输出的变换后的信息;inputting the compressed information into a third preset model of the preset model to obtain transformed information output by the third preset model;
将所述变换后的信息输入所述第二预设模型,得到所述第二预设模型输出的恢复信息;inputting the transformed information into the second preset model to obtain restoration information output by the second preset model;
基于所述恢复信息与所述初始信息之间的第一差异程度,以及基于所述初始信息与第二训练样本之间的第二差异程度,确定第二损失函数;所述第二训练样本与所述第一训练样本对应;Determine a second loss function based on a first degree of difference between the restored information and the initial information, and based on a second degree of difference between the initial information and a second training sample; the second training sample and The first training sample corresponds to;
根据所述第二损失函数进行反向传导更新所述第一预设模型、所述第二预设模型和所述第三预设模型。performing reverse conduction according to the second loss function to update the first preset model, the second preset model, and the third preset model.
所述第一处理单元,用于将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The first processing unit is configured to input a first training sample into an estimated preset sub-model of the first preset model, and obtain initial information output by the estimated preset sub-model;
将所述初始信息输入所述第一预设模型的信道生成预设子模型,得到所述信道生成预设子模型输出的初始信息的特征向量信息;inputting the initial information into the channel generation preset sub-model of the first preset model, and obtaining eigenvector information of the initial information output by the channel generation preset sub-model;
将所述初始信息的特征向量信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的特征向量信息;Inputting the eigenvector information of the initial information into the compressed preset sub-model of the first preset model to obtain the compressed eigenvector information output by the compressed preset sub-model;
将所述压缩后的特征向量信息输入所述预设模型中的第三预设模型,得到所述第三预设模型输出的变换后的特征向量信息;inputting the compressed feature vector information into a third preset model among the preset models, to obtain transformed feature vector information output by the third preset model;
将所述变换后的特征向量信息输入所述第二预设模型,得到所述第二预设模型输出的恢复的特征向量信息;inputting the transformed eigenvector information into the second preset model to obtain restored eigenvector information output by the second preset model;
基于所述恢复的特征向量信息以及所述初始信息的特征向量信息之间的第一差异程度,以及基于所述初始信息与第二训练样本之间的第二差异程度,确定第二损失函数;所述第二训练样本与所述第一训练样本对应;determining a second loss function based on a first degree of difference between the restored feature vector information and the feature vector information of the initial information, and based on a second degree of difference between the initial information and a second training sample; The second training sample corresponds to the first training sample;
根据所述第二损失函数进行反向传导更新所述第一预设模型、所述第二预设模型和所述第三预设模型。performing reverse conduction according to the second loss function to update the first preset model, the second preset model, and the third preset model.
所述训练样本中包含第一训练样本。The training samples include the first training samples.
所述第一训练样本分布在第一维度和/或第二维度。The first training samples are distributed in the first dimension and/or the second dimension.
所述第一维度为时域维度;所述第一训练样本包括在所述时域维度中的m个时间单元内分布的第一信息样本;m为正整数。The first dimension is a time domain dimension; the first training samples include first information samples distributed in m time units in the time domain dimension; m is a positive integer.
所述第二维度为频域维度;所述第一训练样本包括在所述频域维度中的x个频域资源上分布的第一信息样本;x为正整数。The second dimension is a frequency domain dimension; the first training samples include first information samples distributed on x frequency domain resources in the frequency domain dimension; x is a positive integer.
所述第一训练样本还分布在第三维度;The first training samples are also distributed in the third dimension;
所述第三维度为复数维度;所述第一训练样本包括第一信息样本的实部和第一信息样本的虚部。The third dimension is a complex dimension; the first training samples include the real part of the first information sample and the imaginary part of the first information sample.
所述训练样本中还包含与所述第一训练样本对应的第二训练样本;The training samples also include a second training sample corresponding to the first training sample;
所述第二训练样本由T个维度的矩阵构成;T为大于等于2的整数。The second training sample is composed of a matrix of T dimensions; T is an integer greater than or equal to 2.
所述T个维度中包含第四维度和第五维度。The T dimensions include a fourth dimension and a fifth dimension.
所述T个维度的矩阵为M×N的二维矩阵;其中,M表示在第四维度下的第一粒度的数量,N表示在第五维度下的第二粒度的数量;M和N均为正整数。The matrix of the T dimensions is a two-dimensional matrix of M×N; wherein, M represents the quantity of the first granularity under the fourth dimension, and N represents the quantity of the second granularity under the fifth dimension; both M and N are is a positive integer.
所述二维矩阵中的第ij个位置的数值用于表示在所述第四维度下的第i个第一粒度以及第五维度的第j个第二粒度下的信道质量;i和j均为正整数。The value at the ijth position in the two-dimensional matrix is used to represent the channel quality at the i-th first granularity in the fourth dimension and the j-th second granularity in the fifth dimension; both i and j are is a positive integer.
所述T个维度中还包括第六维度。The T dimensions also include a sixth dimension.
所述T个维度的矩阵为M×N×W的三维矩阵;其中,M表示在第四维度下的第一粒度的数量,N表示在第五维度下的第二粒度的数量,W表示在第六维度下的第三粒度的数量;M、N和W均为正整数。The matrix of T dimensions is a three-dimensional matrix of M×N×W; wherein, M represents the number of first granularities in the fourth dimension, N represents the number of second granularities in the fifth dimension, and W represents the number of granularities in the fifth dimension. The quantity of the third granularity under the sixth dimension; M, N and W are all positive integers.
所述三维矩阵中的第ijk个位置的数值用于表示在所述第四维度下的第i个第一粒度、第五维度的第j个第二粒度下、所述第六维度的第k个第三粒度下的信道质量;i、j和k均为正整数。The value of the ijkth position in the three-dimensional matrix is used to represent the i-th first granularity in the fourth dimension, the j-th second granularity in the fifth dimension, and the k-th in the sixth dimension channel quality at the third granularity; i, j and k are all positive integers.
所述第四维度为频域维度;所述第一粒度包含以下之一:L1个资源块RB,L2个子载波;L1和L2为正整数。The fourth dimension is a frequency domain dimension; the first granularity includes one of the following: L1 resource blocks RB, L2 subcarriers; L1 and L2 are positive integers.
所述第四维度为时域维度;所述第一粒度包含以下之一:K1个微秒、K2个符号长度、K3个符号的采样点个数;K1、K2和K3为正整数。The fourth dimension is a time domain dimension; the first granularity includes one of the following: K1 microseconds, K2 symbol length, K3 number of sampling points of symbols; K1, K2 and K3 are positive integers.
所述符号为OFDM符号。The symbols are OFDM symbols.
所述第五维度为空间域维度;The fifth dimension is a spatial domain dimension;
所述第二粒度为一对收发天线或到达角度的间隔。The second granularity is an interval between a pair of transmitting and receiving antennas or an angle of arrival.
所述第六维度为复数维度;所述第三粒度为1,在所述复数维度下的第三粒度的数量W为2。The sixth dimension is a complex dimension; the third granularity is 1, and the quantity W of the third granularity under the complex dimension is 2.
所述k为第一值的情况下,所述三维矩阵中的第ijk个位置的数值用于表示在所述第四维度下的第i个第一粒度、所述第五维度的第j个第二粒度下的信道质量的实部;When the k is the first value, the value of the ijkth position in the three-dimensional matrix is used to represent the i-th first granularity in the fourth dimension and the j-th granularity in the fifth dimension The real part of the channel quality at the second granularity;
所述k为第二值的情况下,所述三维矩阵中的第ijk个位置的数值用于表示在所述第四维度下的第i个第一粒度、所述第五维度的第j个第二粒度下的信道质量的虚部。When the k is the second value, the value of the ijkth position in the three-dimensional matrix is used to represent the i-th first granularity in the fourth dimension and the j-th granularity in the fifth dimension The imaginary part of the channel quality at the second granularity.
所述第一通信单元,用于发送所述第二模型。The first communication unit is configured to send the second model.
所述第一通信单元,用于发送所述第一模型。The first communication unit is configured to send the first model.
所述第一通信单元,用于发送所述第一模型中的估计子模型以及压缩子模型。The first communication unit is configured to send the estimated sub-model and the compressed sub-model in the first model.
所述第一通信单元,用于发送所述第一模型中的估计子模型、压缩子模型以及信道生成子模型。The first communication unit is configured to send the estimation sub-model, the compression sub-model and the channel generation sub-model in the first model.
所述第一通信单元,用于发送所述第三模型。The first communication unit is configured to send the third model.
所述第一模型由以下至少之一携带:上行控制信令、RRC消息、上行数据传输、针对人工智能类业务类传输需求的上行数据传输;The first model is carried by at least one of the following: uplink control signaling, RRC messages, uplink data transmission, and uplink data transmission for artificial intelligence business transmission requirements;
所述第二模型由以下至少之一携带:上行控制信令、RRC消息、上行数据传输、针对人工智能类业务类传输需求的上行数据传输;The second model is carried by at least one of the following: uplink control signaling, RRC messages, uplink data transmission, and uplink data transmission for artificial intelligence business transmission requirements;
所述第三模型由以下至少之一携带:上行控制信令、RRC消息、上行数据传输、针对人工智能类业务类传输需求的上行数据传输;The third model is carried by at least one of the following: uplink control signaling, RRC messages, uplink data transmission, and uplink data transmission for artificial intelligence business transmission requirements;
所述估计子模型由以下至少之一携带:上行控制信令、RRC消息、上行数据传输、针对人工智能类业务类传输需求的上行数据传输;The estimation sub-model is carried by at least one of the following: uplink control signaling, RRC message, uplink data transmission, uplink data transmission for artificial intelligence business type transmission requirements;
所述压缩子模型由以下至少之一携带:上行控制信令、RRC消息、上行数据传输、针对人工智能类业务类传输需求的上行数据传输;The compressed sub-model is carried by at least one of the following: uplink control signaling, RRC messages, uplink data transmission, and uplink data transmission for artificial intelligence business transmission requirements;
所述信道生成子模型由以下至少之一携带:上行控制信令、RRC消息、上行数据传输、针对人工智能类业务类传输需求的上行数据传输。The channel generation sub-model is carried by at least one of the following: uplink control signaling, RRC messages, uplink data transmission, and uplink data transmission for the transmission requirements of artificial intelligence services.
本申请实施例的终端设备1800能够实现前述的方法实施例中的终端设备的对应功能。该终端设备中的各个模块(子模块、单元或组件等)对应的流程、功能、实现方式以及有益效果,可参见上述方法实施例中的对应描述,在此不再赘述。需要说明,关于申请实施例的终端设备400中的各个模块(子模块、单元或组件等)所描述的功能,可以由不同的模块(子模块、单元或组件等)实现,也可以由同一个模块(子模块、单元或组件等)实现。The terminal device 1800 in the embodiment of the present application can implement the corresponding functions of the terminal device in the foregoing method embodiments. For the processes, functions, implementations and beneficial effects corresponding to each module (submodule, unit or component, etc.) in the terminal device, refer to the corresponding description in the above method embodiment, and details are not repeated here. It should be noted that the functions described by the modules (submodules, units or components, etc.) in the terminal device 400 of the embodiment of the application can be realized by different modules (submodules, units or components, etc.), or by the same Module (submodule, unit or component, etc.) implementation.
图20是根据本申请一实施例的网络设备2000的示意性框图。该网络设备可以包括:Fig. 20 is a schematic block diagram of a network device 2000 according to an embodiment of the present application. This network equipment can include:
第二通信单元2001,用于发送第一信息;接收第二信息;其中,所述第二信息为所述第一信息经由第一模型处理得到的;The second communication unit 2001 is configured to send first information; receive second information; wherein, the second information is obtained by processing the first information through the first model;
第二处理单元2002,用于基于第二模型对所述第二信息进行处理得到信道信息;其中,所述第一模型和第二模型为联合训练得到的。The second processing unit 2002 is configured to process the second information based on a second model to obtain channel information; wherein, the first model and the second model are obtained through joint training.
所述第二信息为信道压缩信息;The second information is channel compression information;
所述第二模型用于对所述信道压缩信息进行解压缩处理,得到信道信息。The second model is used to decompress the channel compressed information to obtain channel information.
第二处理单元2002,用于将所述信道压缩信息输入所述第二模型,得到所述第二模型输出的所述信道信息。The second processing unit 2002 is configured to input the channel compression information into the second model, and obtain the channel information output by the second model.
所述第二信息为信道压缩信息;所述信道压缩信息包含压缩的信道估计信息的特征向量信息;所述信道信息为信道信息的特征向量信息;The second information is channel compression information; the channel compression information includes eigenvector information of compressed channel estimation information; the channel information is eigenvector information of channel information;
所述第二模型用于对所述压缩的信道估计信息的特征向量信息进行解压缩处理,得到信道信息的特征向量信息。The second model is used to decompress the compressed eigenvector information of the channel estimation information to obtain the eigenvector information of the channel information.
所述信道信息的特征向量信息中包含R组特征向量序列信息;R为正整数。The eigenvector information of the channel information includes R groups of eigenvector sequence information; R is a positive integer.
所述第二处理单元,用于将所述压缩的信道估计信息的特征向量信息输入所述第二模型,得到所述第二模型输出的所述信道信息的特征向量信息。The second processing unit is configured to input eigenvector information of the compressed channel estimation information into the second model, and obtain eigenvector information of the channel information output by the second model.
所述第一信息为参考信号。The first information is a reference signal.
所述第二通信单元2001,用于接收所述第二模型。The second communication unit 2001 is configured to receive the second model.
所述第二模型由以下至少之一携带:上行控制信令、RRC消息、上行数据传输、针对人工智能类业务类传输需求的上行数据传输。The second model is carried by at least one of the following: uplink control signaling, RRC message, uplink data transmission, and uplink data transmission for the transmission requirements of artificial intelligence services.
所述第二通信单元,用于接收所述第一模型。The second communication unit is configured to receive the first model.
所述第一模型由以下至少之一携带:上行控制信令、RRC消息、上行数据传输、针对人工智能类业务类传输需求的上行数据传输。The first model is carried by at least one of the following: uplink control signaling, RRC message, uplink data transmission, and uplink data transmission for the transmission requirements of artificial intelligence services.
所述第一模型包括:估计子模型和压缩子模型;The first model includes: an estimation sub-model and a compression sub-model;
其中,所述估计子模型用于对所述第一信息进行信道估计得到信道估计信息;Wherein, the estimation sub-model is used to perform channel estimation on the first information to obtain channel estimation information;
所述压缩子模型用于对所述信道估计信息进行压缩得到所述第二信息。The compression sub-model is used to compress the channel estimation information to obtain the second information.
所述第一模型包括:估计子模型、信道生成子模型和压缩子模型;The first model includes: an estimation submodel, a channel generation submodel and a compression submodel;
其中,所述估计子模型用于对所述第一信息进行信道估计得到信道估计信息;Wherein, the estimation sub-model is used to perform channel estimation on the first information to obtain channel estimation information;
所述信道生成子模型用于对所述信道估计信息进行特征分解得到信道估计信息的特征向量信息;The channel generation sub-model is used to perform eigendecomposition on the channel estimation information to obtain eigenvector information of the channel estimation information;
所述压缩子模型用于对所述信道估计信息的特征向量信息进行压缩得到所述第二信息。The compression sub-model is used to compress the eigenvector information of the channel estimation information to obtain the second information.
所述第二通信单元,用于接收所述估计子模型以及所述压缩子模型;The second communication unit is configured to receive the estimated sub-model and the compressed sub-model;
所述第二处理单元,用于基于所述估计子模型以及所述压缩子模型,生成所述第一模型。The second processing unit is configured to generate the first model based on the estimated sub-model and the compressed sub-model.
所述估计子模型用于对所述第一信息进行信道估计得到信道估计信息;The estimation sub-model is used to perform channel estimation on the first information to obtain channel estimation information;
所述压缩子模型用于对所述信道估计信息进行压缩得到信道压缩信息。The compression sub-model is used to compress the channel estimation information to obtain channel compression information.
所述第二通信单元,用于接收所述估计子模型、所述压缩子模型、信道生成子模型;The second communication unit is configured to receive the estimation sub-model, the compression sub-model, and the channel generation sub-model;
所述第二处理单元,用于基于所述估计子模型、所述压缩子模型以及所述信道生成子模型,生成所述第一模型。The second processing unit is configured to generate the first model based on the estimation sub-model, the compression sub-model and the channel generation sub-model.
所述估计子模型用于对所述第一信息进行信道估计得到信道估计信息;The estimation sub-model is used to perform channel estimation on the first information to obtain channel estimation information;
所述信道生成子模型用于对所述信道估计信息进行特征分解得到信道估计信息的特征向量信息;The channel generation sub-model is used to perform eigendecomposition on the channel estimation information to obtain eigenvector information of the channel estimation information;
所述压缩子模型用于对所述信道估计信息的特征向量信息进行压缩得到所述第二信息。The compression sub-model is used to compress the eigenvector information of the channel estimation information to obtain the second information.
所述估计子模型由以下之一携带:上行控制信令、RRC消息、上行数据传输、针对人工智能类业务类传输需求的上 行数据传输;The estimated sub-model is carried by one of the following: uplink control signaling, RRC message, uplink data transmission, uplink data transmission for artificial intelligence business class transmission requirements;
所述压缩子模型由以下之一携带:上行控制信令、RRC消息、上行数据传输、针对人工智能类业务类传输需求的上行数据传输;The compressed sub-model is carried by one of the following: uplink control signaling, RRC messages, uplink data transmission, and uplink data transmission for artificial intelligence business transmission requirements;
所述信道生成子模型由以下之一携带:上行控制信令、RRC消息、上行数据传输、针对人工智能类业务类传输需求的上行数据传输。The channel generation sub-model is carried by one of the following: uplink control signaling, RRC message, uplink data transmission, and uplink data transmission for the transmission requirements of artificial intelligence services.
所述第二通信单元,用于接收第三模型。The second communication unit is configured to receive the third model.
所述第三模型用于对所述第一模型输出的第二信息进行数据变换处理后输入所述第二模型;The third model is used to perform data conversion processing on the second information output by the first model and then input it into the second model;
所述第一模型、第二模型以及第三模型为联合训练得到的。The first model, the second model and the third model are obtained through joint training.
所述数据变换处理包括:卷积处理或傅里叶变换处理。The data transformation processing includes: convolution processing or Fourier transform processing.
所述第二处理单元,用于采用训练样本对第一预设模型和第二预设模型进行联合训练,得到训练后的所述第一模型和所述第二模型;The second processing unit is configured to use training samples to jointly train the first preset model and the second preset model to obtain the trained first model and the second model;
其中,所述第一模型为训练后的所述第一预设模型,所述第二模型为训练后的所述第二预设模型。Wherein, the first model is the first preset model after training, and the second model is the second preset model after training.
所述训练采用的损失函数为第一损失函数;The loss function used in the training is the first loss function;
所述第一损失函数为基于所述第二预设模型的输出信息与所述第一预设模型的压缩预设子模型的输入信息之间的差异程度构建的。The first loss function is constructed based on the degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model.
所述第二预设模型的输出信息与所述第一预设模型的压缩预设子模型的输入信息之间的差异程度为基于距离确定的,或者为基于相似程度确定的。The degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model is determined based on a distance, or determined based on a degree of similarity.
所述第二处理单元,用于将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The second processing unit is configured to input a first training sample into an estimated preset sub-model of the first preset model, and obtain initial information output by the estimated preset sub-model;
将所述初始信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的信息;inputting the initial information into the compressed preset sub-model of the first preset model to obtain compressed information output by the compressed preset sub-model;
将所述压缩后的信息输入所述第二预设模型,得到所述第二预设模型输出的恢复信息;inputting the compressed information into the second preset model to obtain restoration information output by the second preset model;
基于所述恢复信息以及所述初始信息之间的差异程度,确定所述第一损失函数;determining the first loss function based on the degree of difference between the restoration information and the initial information;
根据所述第一损失函数进行反向传导更新所述第一预设模型和所述第二预设模型。performing reverse conduction according to the first loss function to update the first preset model and the second preset model.
所述第二处理单元,用于将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The second processing unit is configured to input a first training sample into an estimated preset sub-model of the first preset model, and obtain initial information output by the estimated preset sub-model;
将所述初始信息输入所述第一预设模型的信道生成预设子模型,得到所述信道生成预设子模型输出的初始信息的特征向量信息;inputting the initial information into the channel generation preset sub-model of the first preset model, and obtaining the eigenvector information of the initial information output by the channel generation preset sub-model;
将所述初始信息的特征向量信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的特征向量信息;Inputting the eigenvector information of the initial information into the compressed preset sub-model of the first preset model to obtain the compressed eigenvector information output by the compressed preset sub-model;
将所述压缩后的特征向量信息输入所述第二预设模型,得到所述第二预设模型输出的恢复的特征向量信息;inputting the compressed feature vector information into the second preset model to obtain restored feature vector information output by the second preset model;
基于所述恢复的特征向量信息以及所述初始信息的特征向量信息之间的差异程度,确定第一损失函数;determining a first loss function based on the degree of difference between the restored eigenvector information and the eigenvector information of the initial information;
根据所述第一损失函数进行反向传导更新所述第一预设模型和所述第二预设模型。performing reverse conduction according to the first loss function to update the first preset model and the second preset model.
所述第二处理单元,用于采用训练样本对第一预设模型、第二预设模型和第三预设模型进行联合训练,得到训练后的所述第一模型、所述第二模型和第三模型;其中,所述第三模型为训练后的第三预设模型。The second processing unit is configured to use training samples to jointly train the first preset model, the second preset model and the third preset model to obtain the trained first model, the second model and the A third model; wherein, the third model is a trained third preset model.
所述第二处理单元,用于将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The second processing unit is configured to input a first training sample into an estimated preset sub-model of the first preset model, and obtain initial information output by the estimated preset sub-model;
将所述初始信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的信息;inputting the initial information into the compressed preset sub-model of the first preset model to obtain compressed information output by the compressed preset sub-model;
将所述压缩后的信息输入第三预设模型,得到所述第三预设模型输出的变换后的信息;inputting the compressed information into a third preset model to obtain transformed information output by the third preset model;
将所述变换后的信息输入所述第二预设模型,得到所述第二预设模型输出的恢复信息;inputting the transformed information into the second preset model to obtain restoration information output by the second preset model;
基于所述恢复信息以及所述初始信息之间的差异程度,确定第一损失函数;determining a first loss function based on the degree of difference between the restoration information and the initial information;
根据所述第一损失函数进行反向传导更新所述第一预设模型、所述第二预设模型和所述第三预设模型。performing reverse conduction according to the first loss function to update the first preset model, the second preset model, and the third preset model.
所述第二处理单元,用于将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The second processing unit is configured to input a first training sample into an estimated preset sub-model of the first preset model, and obtain initial information output by the estimated preset sub-model;
将所述初始信息输入所述第一预设模型的信道生成预设子模型,得到所述信道生成预设子模型输出的初始信息的特征向量信息;inputting the initial information into the channel generation preset sub-model of the first preset model, and obtaining the eigenvector information of the initial information output by the channel generation preset sub-model;
将所述初始信息的特征向量信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的特征向量信息;Inputting the eigenvector information of the initial information into the compressed preset sub-model of the first preset model to obtain the compressed eigenvector information output by the compressed preset sub-model;
将所述压缩后的特征向量信息输入第三预设模型,得到所述第三预设模型输出的变换后的特征向量信息;Inputting the compressed feature vector information into a third preset model to obtain transformed feature vector information output by the third preset model;
将所述变换后的特征向量信息输入所述第二预设模型,得到所述第二预设模型输出的恢复的特征向量信息;inputting the transformed eigenvector information into the second preset model to obtain restored eigenvector information output by the second preset model;
基于所述恢复的特征向量信息以及所述初始信息的特征向量信息之间的差异程度,确定第一损失函数;determining a first loss function based on the degree of difference between the restored eigenvector information and the eigenvector information of the initial information;
根据所述第一损失函数进行反向传导更新所述第一预设模型、所述第二预设模型和所述第三预设模型。performing reverse conduction according to the first loss function to update the first preset model, the second preset model, and the third preset model.
所述训练采用的损失函数为第二损失函数;The loss function used in the training is a second loss function;
所述第二损失函数为基于所述第二预设模型的输出信息与所述第一预设模型的压缩预设子模型的输入信息之间的第一差异程度以及所述第一预设模型的估计预设子模型的输出信息与第二训练样本之间的第二差异程度构建的;其中,所述第二训练样本与输入所述估计预设子模型的第一训练样本相对应。The second loss function is based on the first degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model and the first preset model The second difference degree between the output information of the estimated preset sub-model and the second training sample is constructed; wherein, the second training sample corresponds to the first training sample input into the estimated preset sub-model.
所述第一差异程度为基于距离确定的,或者为基于相似程度确定的;和/或,The first degree of difference is determined based on a distance, or is determined based on a degree of similarity; and/or,
所述第二差异程度为基于距离确定的,或者为基于相似程度确定的。The second degree of difference is determined based on distance, or determined based on similarity.
所述第二处理单元,用于将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The second processing unit is configured to input a first training sample into an estimated preset sub-model of the first preset model, and obtain initial information output by the estimated preset sub-model;
将所述初始信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的信息;inputting the initial information into the compressed preset sub-model of the first preset model to obtain compressed information output by the compressed preset sub-model;
将所述压缩后的信息输入所述第二预设模型,得到所述第二预设模型输出的恢复信息;inputting the compressed information into the second preset model to obtain restoration information output by the second preset model;
基于所述恢复信息与所述初始信息之间的第一差异程度,以及基于所述初始信息与第二训练样本之间的第二差异程度,确定第二损失函数;所述第二训练样本与所述第一训练样本对应;Determine a second loss function based on a first degree of difference between the restored information and the initial information, and based on a second degree of difference between the initial information and a second training sample; the second training sample and The first training sample corresponds to;
根据所述第二损失函数进行反向传导更新所述第一预设模型和所述第二预设模型。performing reverse conduction according to the second loss function to update the first preset model and the second preset model.
所述第二处理单元,用于将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The second processing unit is configured to input a first training sample into an estimated preset sub-model of the first preset model, and obtain initial information output by the estimated preset sub-model;
将所述初始信息输入所述第一预设模型的信道生成预设子模型,得到所述信道生成预设子模型输出的初始信息的特征向量信息;inputting the initial information into the channel generation preset sub-model of the first preset model, and obtaining eigenvector information of the initial information output by the channel generation preset sub-model;
将所述初始信息的特征向量信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的特征向量信息;inputting the eigenvector information of the initial information into the compressed preset sub-model of the first preset model, and obtaining the compressed eigenvector information output by the compressed preset sub-model;
将所述压缩后的特征向量信息输入所述第二预设模型,得到所述第二预设模型输出的恢复的特征向量信息;inputting the compressed feature vector information into the second preset model to obtain restored feature vector information output by the second preset model;
基于所述恢复的特征向量信息以及所述初始信息的特征向量信息之间的第一差异程度,以及基于所述初始信息与第二训练样本之间的第二差异程度,确定第二损失函数;所述第二训练样本与所述第一训练样本对应;determining a second loss function based on a first degree of difference between the restored feature vector information and the feature vector information of the initial information, and based on a second degree of difference between the initial information and a second training sample; The second training sample corresponds to the first training sample;
根据所述第二损失函数进行反向传导更新所述第一预设模型和所述第二预设模型。performing reverse conduction according to the second loss function to update the first preset model and the second preset model.
所述第二处理单元,用于采用训练样本对第一预设模型、第二预设模型和第三预设模型进行联合训练,得到训练后的所述第一模型、所述第二模型和第三模型;其中,所述第三模型为训练后的第三预设模型。The second processing unit is configured to use training samples to jointly train the first preset model, the second preset model and the third preset model to obtain the trained first model, the second model and the A third model; wherein, the third model is a trained third preset model.
所述第二处理单元,用于将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The second processing unit is configured to input a first training sample into an estimated preset sub-model of the first preset model, and obtain initial information output by the estimated preset sub-model;
将所述初始信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的信息;inputting the initial information into the compressed preset sub-model of the first preset model to obtain compressed information output by the compressed preset sub-model;
将所述压缩后的信息输入第三预设模型,得到所述第三预设模型输出的变换后的信息;inputting the compressed information into a third preset model to obtain transformed information output by the third preset model;
将所述变换后的信息输入所述第二预设模型,得到所述第二预设模型输出的恢复信息;inputting the transformed information into the second preset model to obtain restoration information output by the second preset model;
基于所述恢复信息与所述初始信息之间的第一差异程度,以及基于所述初始信息与第二训练样本之间的第二差异程度,确定第二损失函数;所述第二训练样本与所述第一训练样本对应;Determine a second loss function based on a first degree of difference between the restored information and the initial information, and based on a second degree of difference between the initial information and a second training sample; the second training sample and The first training sample corresponds to;
根据所述第二损失函数进行反向传导更新所述第一预设模型、所述第二预设模型和所述第三预设模型。performing reverse conduction according to the second loss function to update the first preset model, the second preset model, and the third preset model.
所述第二处理单元,用于将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The second processing unit is configured to input a first training sample into an estimated preset sub-model of the first preset model, and obtain initial information output by the estimated preset sub-model;
将所述初始信息输入所述第一预设模型的信道生成预设子模型,得到所述信道生成预设子模型输出的初始信息的特征向量信息;inputting the initial information into the channel generation preset sub-model of the first preset model, and obtaining eigenvector information of the initial information output by the channel generation preset sub-model;
将所述初始信息的特征向量信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的特征向量信息;Inputting the eigenvector information of the initial information into the compressed preset sub-model of the first preset model to obtain the compressed eigenvector information output by the compressed preset sub-model;
将所述压缩后的特征向量信息输入第三预设模型,得到所述第三预设模型输出的变换后的特征向量信息;Inputting the compressed feature vector information into a third preset model to obtain transformed feature vector information output by the third preset model;
将所述变换后的特征向量信息输入所述第二预设模型,得到所述第二预设模型输出的恢复的特征向量信息;inputting the transformed eigenvector information into the second preset model to obtain restored eigenvector information output by the second preset model;
基于所述恢复的特征向量信息以及所述初始信息的特征向量信息之间的第一差异程度,以及基于所述初始信息与第二训练样本之间的第二差异程度,确定第二损失函数;所述第二训练样本与所述第一训练样本对应;determining a second loss function based on a first degree of difference between the restored feature vector information and the feature vector information of the initial information, and based on a second degree of difference between the initial information and a second training sample; The second training sample corresponds to the first training sample;
根据所述第二损失函数进行反向传导更新所述第一预设模型、所述第二预设模型和所述第三预设模型。performing reverse conduction according to the second loss function to update the first preset model, the second preset model, and the third preset model.
所述训练样本中包含第一训练样本。The training samples include the first training samples.
所述第一训练样本分布在第一维度和/或第二维度。The first training samples are distributed in the first dimension and/or the second dimension.
所述第一维度为时域维度;The first dimension is a time domain dimension;
所述第一训练样本包括在所述时域维度中的m个时间单元内分布的第一信息样本;m为正整数。The first training samples include first information samples distributed in m time units in the time domain dimension; m is a positive integer.
所述第二维度为频域维度;The second dimension is a frequency domain dimension;
所述第一训练样本包括在所述频域维度中的x个频域资源上分布的第一信息样本;x为正整数。The first training samples include first information samples distributed on x frequency domain resources in the frequency domain dimension; x is a positive integer.
所述第一训练样本还分布在第三维度;The first training samples are also distributed in the third dimension;
所述第三维度为复数维度;所述第一训练样本包括第一信息样本的实部和第一信息样本的虚部。The third dimension is a complex dimension; the first training samples include the real part of the first information sample and the imaginary part of the first information sample.
所述训练样本中还包括与所述第一训练样本对应的第二训练样本;The training samples also include a second training sample corresponding to the first training sample;
所述第二训练样本由T个维度的矩阵构成;T为大于等于2的整数。The second training sample is composed of a matrix of T dimensions; T is an integer greater than or equal to 2.
所述T个维度中包含第四维度和第五维度。The T dimensions include a fourth dimension and a fifth dimension.
所述T个维度的矩阵为M×N的二维矩阵;其中,M表示在第四维度下的第一粒度的数量,N表示在第五维度下的第二粒度的数量;M和N均为正整数。The matrix of the T dimensions is a two-dimensional matrix of M×N; wherein, M represents the quantity of the first granularity under the fourth dimension, and N represents the quantity of the second granularity under the fifth dimension; both M and N are is a positive integer.
所述二维矩阵中的第ij个位置的数值用于表示在所述第四维度下的第i个第一粒度以及第五维度的第j个第二粒度下的信道质量;i和j均为正整数。The value at the ijth position in the two-dimensional matrix is used to represent the channel quality at the i-th first granularity in the fourth dimension and the j-th second granularity in the fifth dimension; both i and j are is a positive integer.
所述T个维度中还包括第六维度。The T dimensions also include a sixth dimension.
所述T个维度的矩阵为M×N×W的三维矩阵;其中,M表示在第四维度下的第一粒度的数量,N表示在第五维度下的第二粒度的数量,W表示在第六维度下的第三粒度的数量;M、N和W均为正整数。The matrix of T dimensions is a three-dimensional matrix of M×N×W; wherein, M represents the number of first granularities in the fourth dimension, N represents the number of second granularities in the fifth dimension, and W represents the number of granularities in the fifth dimension. The quantity of the third granularity under the sixth dimension; M, N and W are all positive integers.
所述三维矩阵中的第ijk个位置的数值用于表示在所述第四维度下的第i个第一粒度、第五维度的第j个第二粒度下、所述第六维度的第k个第三粒度下的信道质量;i、j和k均为正整数。The value of the ijkth position in the three-dimensional matrix is used to represent the i-th first granularity in the fourth dimension, the j-th second granularity in the fifth dimension, and the k-th in the sixth dimension channel quality at the third granularity; i, j and k are all positive integers.
所述第四维度为频域维度;所述第一粒度包含以下之一:L1个资源块RB,L2个子载波;L1和L2为正整数。The fourth dimension is a frequency domain dimension; the first granularity includes one of the following: L1 resource blocks RB, L2 subcarriers; L1 and L2 are positive integers.
所述第四维度为时域维度;所述第一粒度包含以下之一:K1个微秒、K2个符号长度、K3个符号的采样点个数;K1、K2和K3为正整数。The fourth dimension is a time domain dimension; the first granularity includes one of the following: K1 microseconds, K2 symbol length, K3 number of sampling points of symbols; K1, K2 and K3 are positive integers.
所述符号为OFDM符号。The symbols are OFDM symbols.
所述第五维度为空间域维度;The fifth dimension is a spatial domain dimension;
所述第二粒度为一对收发天线或到达角度的间隔。The second granularity is an interval between a pair of transmitting and receiving antennas or an angle of arrival.
所述第六维度为复数维度;所述第三粒度为1,在所述复数维度下的第三粒度的数量W为2。The sixth dimension is a complex dimension; the third granularity is 1, and the quantity W of the third granularity under the complex dimension is 2.
所述k为第一值的情况下,所述三维矩阵中的第ijk个位置的数值用于表示在所述第四维度下的第i个第一粒度、所述第五维度的第j个第二粒度下的信道质量的实部;When the k is the first value, the value of the ijkth position in the three-dimensional matrix is used to represent the i-th first granularity in the fourth dimension and the j-th granularity in the fifth dimension The real part of the channel quality at the second granularity;
所述k为第二值的情况下,所述三维矩阵中的第ijk个位置的数值用于表示在所述第四维度下的第i个第一粒度、所述第五维度的第j个第二粒度下的信道质量的虚部。When the k is the second value, the value of the ijkth position in the three-dimensional matrix is used to represent the i-th first granularity in the fourth dimension and the j-th granularity in the fifth dimension The imaginary part of the channel quality at the second granularity.
所述第二通信单元,用于发送所述第二模型。The second communication unit is configured to send the second model.
所述第二通信单元,用于发送所述第一模型。The second communication unit is configured to send the first model.
所述第二通信单元,用于所述估计子模型以及所述压缩子模型。The second communication unit is used for the estimation sub-model and the compression sub-model.
所述第二通信单元,用于发送所述估计子模型、所述压缩子模型以及所述信道生成子模型。The second communication unit is configured to send the estimation sub-model, the compression sub-model and the channel generation sub-model.
所述第二通信单元,用于发送所述第三模型。The second communication unit is configured to send the third model.
所述第一模型由以下至少之一携带:下行控制信令、MAC CE消息、RRC消息、广播消息、下行数据传输、针对人工智能类业务传输需求的下行数据传输;The first model is carried by at least one of the following: downlink control signaling, MAC CE message, RRC message, broadcast message, downlink data transmission, downlink data transmission for artificial intelligence business transmission requirements;
所述第二模型由以下至少之一携带:下行控制信令、MAC CE消息、RRC消息、广播消息、下行数据传输、针对人工智能类业务传输需求的下行数据传输;The second model is carried by at least one of the following: downlink control signaling, MAC CE message, RRC message, broadcast message, downlink data transmission, downlink data transmission for artificial intelligence business transmission requirements;
所述第三模型由以下至少之一携带:下行控制信令、MAC CE消息、RRC消息、广播消息、下行数据传输、针对人工智能类业务传输需求的下行数据传输;The third model is carried by at least one of the following: downlink control signaling, MAC CE message, RRC message, broadcast message, downlink data transmission, downlink data transmission for artificial intelligence business transmission requirements;
所述估计子模型由以下至少之一携带:下行控制信令、MAC CE消息、RRC消息、广播消息、下行数据传输、针对人工智能类业务传输需求的下行数据传输;The estimated sub-model is carried by at least one of the following: downlink control signaling, MAC CE message, RRC message, broadcast message, downlink data transmission, downlink data transmission for artificial intelligence business transmission requirements;
所述压缩子模型由以下至少之一携带:下行控制信令、MAC CE消息、RRC消息、广播消息、下行数据传输、针对人工智能类业务传输需求的下行数据传输;The compressed sub-model is carried by at least one of the following: downlink control signaling, MAC CE message, RRC message, broadcast message, downlink data transmission, downlink data transmission for artificial intelligence business transmission requirements;
所述信道生成子模型由以下至少之一携带:下行控制信令、MAC CE消息、RRC消息、广播消息、下行数据传输、针对人工智能类业务传输需求的下行数据传输。The channel generation sub-model is carried by at least one of the following: downlink control signaling, MAC CE message, RRC message, broadcast message, downlink data transmission, downlink data transmission for artificial intelligence business transmission requirements.
本申请实施例的网络设备2000能够实现前述的方法实施例中的网络设备的对应功能。该网络设备中的各个模块(子模块、单元或组件等)对应的流程、功能、实现方式以及有益效果,可参见上述方法实施例中的对应描述,在此不再赘述。需要说明,关于申请实施例的网络设备中的各个模块(子模块、单元或组件等)所描述的功能,可以由不同的模块(子模块、单元或组件等)实现,也可以由同一个模块(子模块、单元或组件等)实现。The network device 2000 in the embodiment of the present application can implement the corresponding functions of the network device in the foregoing method embodiments. For the procedures, functions, implementation methods and beneficial effects corresponding to each module (submodule, unit or component, etc.) in the network device, refer to the corresponding description in the above method embodiments, and details are not repeated here. It should be noted that the functions described by each module (submodule, unit or component, etc.) in the network device of the application embodiment can be realized by different modules (submodule, unit or component, etc.), or by the same module (submodule, unit or component, etc.) implementation.
本申请实施例还提供一种电子设备2100,如图21所示,包括:The embodiment of the present application also provides an electronic device 2100, as shown in FIG. 21 , including:
第三处理单元2101,用于采用训练样本对第一预设模型和第二预设模型进行联合训练,得到训练后的第一模型和第二模型;The third processing unit 2101 is configured to use training samples to jointly train the first preset model and the second preset model to obtain the trained first model and the second model;
其中,所述第一模型为训练后的所述第一预设模型,所述第二模型为训练后的所述第二预设模型;所述第一模型用于对第一信息进行处理得到第二信息;所述第二模型用于对所述第二信息进行处理得到信道信息。Wherein, the first model is the first preset model after training, and the second model is the second preset model after training; the first model is used to process the first information to obtain Second information; the second model is used to process the second information to obtain channel information.
所述训练采用的损失函数为第一损失函数;The loss function used in the training is the first loss function;
所述第一损失函数为基于所述第二预设模型的输出信息与所述第一预设模型的压缩预设子模型的输入信息之间的差异程度构建的。The first loss function is constructed based on the degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model.
所述第二预设模型的输出信息与所述压缩预设子模型的输入信息之间的差异程度为基于距离确定的,或者为基于相似程度确定的。The degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model is determined based on a distance, or determined based on a degree of similarity.
所述第三处理单元,用于将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The third processing unit is configured to input a first training sample into an estimated preset sub-model of the first preset model, and obtain initial information output by the estimated preset sub-model;
将所述初始信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的信息;inputting the initial information into the compressed preset sub-model of the first preset model to obtain compressed information output by the compressed preset sub-model;
将所述压缩后的信息输入所述第二预设模型,得到所述第二预设模型输出的恢复信息;inputting the compressed information into the second preset model to obtain restoration information output by the second preset model;
基于所述恢复信息以及所述初始信息之间的差异程度,确定所述第一损失函数;determining the first loss function based on the degree of difference between the restoration information and the initial information;
根据所述第一损失函数进行反向传导更新所述第一预设模型和所述第二预设模型。performing reverse conduction according to the first loss function to update the first preset model and the second preset model.
所述第三处理单元,用于将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The third processing unit is configured to input a first training sample into an estimated preset sub-model of the first preset model, and obtain initial information output by the estimated preset sub-model;
将所述初始信息输入所述第一预设模型的信道生成预设子模型,得到所述信道生成预设子模型输出的初始信息的特征向量信息;inputting the initial information into the channel generation preset sub-model of the first preset model, and obtaining eigenvector information of the initial information output by the channel generation preset sub-model;
将所述初始信息的特征向量信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的特征向量信息;Inputting the eigenvector information of the initial information into the compressed preset sub-model of the first preset model to obtain the compressed eigenvector information output by the compressed preset sub-model;
将所述压缩后的特征向量信息输入所述第二预设模型,得到所述第二预设模型输出的恢复的特征向量信息;inputting the compressed feature vector information into the second preset model to obtain restored feature vector information output by the second preset model;
基于所述恢复的特征向量信息以及所述初始信息的特征向量信息之间的差异程度,确定第一损失函数;determining a first loss function based on the degree of difference between the restored eigenvector information and the eigenvector information of the initial information;
根据所述第一损失函数进行反向传导更新所述第一预设模型和所述第二预设模型。performing reverse conduction according to the first loss function to update the first preset model and the second preset model.
所述第三处理单元,用于采用训练样本对第一预设模型、第二预设模型和第三预设模型进行联合训练,得到训练后的所述第一模型、所述第二模型和第三模型;其中,所述第三模型为训练后的第三预设模型。The third processing unit is configured to use training samples to jointly train the first preset model, the second preset model and the third preset model to obtain the trained first model, the second model and the A third model; wherein, the third model is a trained third preset model.
所述第三处理单元,用于将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The third processing unit is configured to input a first training sample into an estimated preset sub-model of the first preset model, and obtain initial information output by the estimated preset sub-model;
将所述初始信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的信息;inputting the initial information into the compressed preset sub-model of the first preset model to obtain compressed information output by the compressed preset sub-model;
将所述压缩后的信息输入第三预设模型,得到所述第三预设模型输出的变换后的信息;inputting the compressed information into a third preset model to obtain transformed information output by the third preset model;
将所述变换后的信息输入所述第二预设模型,得到所述第二预设模型输出的恢复信息;inputting the transformed information into the second preset model to obtain restoration information output by the second preset model;
基于所述恢复信息以及所述初始信息之间的差异程度,确定第一损失函数;determining a first loss function based on the degree of difference between the restoration information and the initial information;
根据所述第一损失函数进行反向传导更新所述第一预设模型、第二预设模型和第三预设模型。performing reverse conduction according to the first loss function to update the first preset model, the second preset model and the third preset model.
所述第三处理单元,用于将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The third processing unit is configured to input a first training sample into an estimated preset sub-model of the first preset model, and obtain initial information output by the estimated preset sub-model;
将所述初始信息输入所述第一预设模型的信道生成预设子模型,得到所述信道生成预设子模型输出的初始信息的特征向量信息;inputting the initial information into the channel generation preset sub-model of the first preset model, and obtaining eigenvector information of the initial information output by the channel generation preset sub-model;
将所述初始信息的特征向量信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的特征向量信息;Inputting the eigenvector information of the initial information into the compressed preset sub-model of the first preset model to obtain the compressed eigenvector information output by the compressed preset sub-model;
将所述压缩后的特征向量信息输入第三预设模型,得到所述第三预设模型输出的变换后的特征向量信息;Inputting the compressed feature vector information into a third preset model to obtain transformed feature vector information output by the third preset model;
将所述变换后的特征向量信息输入所述第二预设模型,得到所述第二预设模型输出的恢复的特征向量信息;inputting the transformed eigenvector information into the second preset model to obtain restored eigenvector information output by the second preset model;
基于所述恢复的特征向量信息以及所述初始信息的特征向量信息之间的差异程度,确定第一损失函数;determining a first loss function based on the degree of difference between the restored eigenvector information and the eigenvector information of the initial information;
根据所述第一损失函数进行反向传导更新所述第一预设模型、第二预设模型和第三预设模型。performing reverse conduction according to the first loss function to update the first preset model, the second preset model and the third preset model.
所述训练采用的损失函数为第二损失函数;The loss function used in the training is a second loss function;
所述第二损失函数为基于所述第二预设模型的输出信息与所述第一预设模型的压缩预设子模型的输入信息之间的第一差异程度以及所述第一预设模型的估计预设子模型的输出信息与第二训练样本之间的第二差异程度构建的;其中,所述第二训练样本与输入所述估计预设子模型的第一训练样本相对应。The second loss function is based on the first degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model and the first preset model The second difference degree between the output information of the estimated preset sub-model and the second training sample is constructed; wherein, the second training sample corresponds to the first training sample input into the estimated preset sub-model.
所述第一差异程度为基于距离确定的,或者为基于相似程度确定的;和/或,The first degree of difference is determined based on a distance, or is determined based on a degree of similarity; and/or,
所述第二差异程度为基于距离确定的,或者为基于相似程度确定的。The second degree of difference is determined based on distance, or determined based on similarity.
所述第三处理单元,用于将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The third processing unit is configured to input a first training sample into an estimated preset sub-model of the first preset model, and obtain initial information output by the estimated preset sub-model;
将所述初始信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的信息;inputting the initial information into the compressed preset sub-model of the first preset model to obtain compressed information output by the compressed preset sub-model;
将所述压缩后的信息输入所述第二预设模型,得到所述第二预设模型输出的恢复信息;inputting the compressed information into the second preset model to obtain restoration information output by the second preset model;
基于所述恢复信息与所述初始信息之间的第一差异程度,以及基于所述初始信息与第二训练样本之间的第二差异程度,确定第二损失函数;所述第二训练样本与所述第一训练样本对应;Determine a second loss function based on a first degree of difference between the restored information and the initial information, and based on a second degree of difference between the initial information and a second training sample; the second training sample and The first training sample corresponds to;
根据所述第二损失函数进行反向传导更新所述第一预设模型和所述第二预设模型。performing reverse conduction according to the second loss function to update the first preset model and the second preset model.
所述第三处理单元,用于将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The third processing unit is configured to input a first training sample into an estimated preset sub-model of the first preset model, and obtain initial information output by the estimated preset sub-model;
将所述初始信息输入所述第一预设模型的信道生成预设子模型,得到所述信道生成预设子模型输出的初始信息的特征向量信息;inputting the initial information into the channel generation preset sub-model of the first preset model, and obtaining eigenvector information of the initial information output by the channel generation preset sub-model;
将所述初始信息的特征向量信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的特征向量信息;Inputting the eigenvector information of the initial information into the compressed preset sub-model of the first preset model to obtain the compressed eigenvector information output by the compressed preset sub-model;
将所述压缩后的特征向量信息输入所述第二预设模型,得到所述第二预设模型输出的恢复的特征向量信息;inputting the compressed feature vector information into the second preset model to obtain restored feature vector information output by the second preset model;
基于所述恢复的特征向量信息以及所述初始信息的特征向量信息之间的第一差异程度,以及基于所述初始信息与第二训练样本之间的第二差异程度,确定第二损失函数;所述第二训练样本与所述第一训练样本对应;determining a second loss function based on a first degree of difference between the restored feature vector information and the feature vector information of the initial information, and based on a second degree of difference between the initial information and a second training sample; The second training sample corresponds to the first training sample;
根据所述第二损失函数进行反向传导更新所述第一预设模型和所述第二预设模型。performing reverse conduction according to the second loss function to update the first preset model and the second preset model.
所述第三处理单元,用于采用训练样本对第一预设模型、第二预设模型和第三预设模型进行联合训练,得到训练后的所述第一模型、所述第二模型和第三模型;其中,所述第三模型为训练后的第三预设模型。The third processing unit is configured to use training samples to jointly train the first preset model, the second preset model and the third preset model to obtain the trained first model, the second model and the A third model; wherein, the third model is a trained third preset model.
所述第三处理单元,用于将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The third processing unit is configured to input a first training sample into an estimated preset sub-model of the first preset model, and obtain initial information output by the estimated preset sub-model;
将所述初始信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的信息;inputting the initial information into the compressed preset sub-model of the first preset model to obtain compressed information output by the compressed preset sub-model;
将所述压缩后的信息输入第三预设模型,得到所述第三预设模型输出的变换后的信息;inputting the compressed information into a third preset model to obtain transformed information output by the third preset model;
将所述变换后的信息输入所述第二预设模型,得到所述第二预设模型输出的恢复信息;inputting the transformed information into the second preset model to obtain restoration information output by the second preset model;
基于所述恢复信息与所述初始信息之间的第一差异程度,以及基于所述初始信息与第二训练样本之间的第二差异程度,确定第二损失函数;所述第二训练样本与所述第一训练样本对应;Determine a second loss function based on a first degree of difference between the restored information and the initial information, and based on a second degree of difference between the initial information and a second training sample; the second training sample and The first training sample corresponds to;
根据所述第二损失函数进行反向传导更新所述第一预设模型、所述第二预设模型和所述第三预设模型。performing reverse conduction according to the second loss function to update the first preset model, the second preset model, and the third preset model.
所述第三处理单元,用于将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The third processing unit is configured to input a first training sample into an estimated preset sub-model of the first preset model, and obtain initial information output by the estimated preset sub-model;
将所述初始信息输入所述第一预设模型的信道生成预设子模型,得到所述信道生成预设子模型输出的初始信息的特征向量信息;inputting the initial information into the channel generation preset sub-model of the first preset model, and obtaining eigenvector information of the initial information output by the channel generation preset sub-model;
将所述初始信息的特征向量信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的特征向量信息;Inputting the eigenvector information of the initial information into the compressed preset sub-model of the first preset model to obtain the compressed eigenvector information output by the compressed preset sub-model;
将所述压缩后的特征向量信息输入第三预设模型,得到所述第三预设模型输出的变换后的特征向量信息;Inputting the compressed feature vector information into a third preset model to obtain transformed feature vector information output by the third preset model;
将所述变换后的特征向量信息输入所述第二预设模型,得到所述第二预设模型输出的恢复的特征向量信息;inputting the transformed eigenvector information into the second preset model to obtain restored eigenvector information output by the second preset model;
基于所述恢复的特征向量信息以及所述初始信息的特征向量信息之间的第一差异程度,以及基于所述初始信息与第二训练样本之间的第二差异程度,确定第二损失函数;所述第二训练样本与所述第一训练样本对应;determining a second loss function based on a first degree of difference between the restored feature vector information and the feature vector information of the initial information, and based on a second degree of difference between the initial information and a second training sample; The second training sample corresponds to the first training sample;
根据所述第二损失函数进行反向传导更新所述第一预设模型和所述第二预设模型。performing reverse conduction according to the second loss function to update the first preset model and the second preset model.
所述训练样本中包含第一训练样本。The training samples include the first training samples.
所述第一训练样本分布在第一维度和/或第二维度。The first training samples are distributed in the first dimension and/or the second dimension.
所述第一维度为时域维度;所述第一训练样本包括在所述时域维度中的m个时间单元内分布的第一信息样本;m为正整数。The first dimension is a time domain dimension; the first training samples include first information samples distributed in m time units in the time domain dimension; m is a positive integer.
所述第二维度为频域维度;所述第一训练样本包括在所述频域维度中的x个频域资源上分布的第一信息样本;x为正整数。The second dimension is a frequency domain dimension; the first training samples include first information samples distributed on x frequency domain resources in the frequency domain dimension; x is a positive integer.
所述第一训练样本还分布在第三维度;The first training samples are also distributed in the third dimension;
所述第三维度为复数维度;所述第一训练样本包括第一信息样本的实部和第一信息样本的虚部。The third dimension is a complex dimension; the first training samples include the real part of the first information sample and the imaginary part of the first information sample.
其中,所述训练样本中还包含与所述第一训练样本对应的第二训练样本;Wherein, the training samples also include a second training sample corresponding to the first training sample;
所述第二训练样本由T个维度的矩阵构成;T为大于等于2的整数。The second training sample is composed of a matrix of T dimensions; T is an integer greater than or equal to 2.
所述T个维度中包含第四维度和第五维度。The T dimensions include a fourth dimension and a fifth dimension.
所述T个维度的矩阵为M×N的二维矩阵;其中,M表示在第四维度下的第一粒度的数量,N表示在第五维度下的第二粒度的数量;M和N均为正整数。The matrix of the T dimensions is a two-dimensional matrix of M×N; wherein, M represents the quantity of the first granularity under the fourth dimension, and N represents the quantity of the second granularity under the fifth dimension; both M and N are is a positive integer.
所述二维矩阵中的第ij个位置的数值用于表示在所述第四维度下的第i个第一粒度以及第五维度的第j个第二粒度下的信道质量;i和j均为正整数。The value at the ijth position in the two-dimensional matrix is used to represent the channel quality at the i-th first granularity in the fourth dimension and the j-th second granularity in the fifth dimension; both i and j are is a positive integer.
所述T个维度中还包括第六维度。The T dimensions also include a sixth dimension.
所述T个维度的矩阵为M×N×W的三维矩阵;其中,M表示在第四维度下的第一粒度的数量,N表示在第五维度下的第二粒度的数量,W表示在第六维度下的第三粒度的数量;M、N和W均为正整数。The matrix of T dimensions is a three-dimensional matrix of M×N×W; wherein, M represents the number of first granularities in the fourth dimension, N represents the number of second granularities in the fifth dimension, and W represents the number of granularities in the fifth dimension. The quantity of the third granularity under the sixth dimension; M, N and W are all positive integers.
所述三维矩阵中的第ijk个位置的数值用于表示在所述第四维度下的第i个第一粒度、第五维度的第j个第二粒度下、所述第六维度的第k个第三粒度下的信道质量;i、j和k均为正整数。The value of the ijkth position in the three-dimensional matrix is used to represent the i-th first granularity in the fourth dimension, the j-th second granularity in the fifth dimension, and the k-th in the sixth dimension channel quality at the third granularity; i, j and k are all positive integers.
所述第四维度为频域维度;所述第一粒度包含以下之一:L1个资源块RB,L2个子载波;L1和L2为正整数。The fourth dimension is a frequency domain dimension; the first granularity includes one of the following: L1 resource blocks RB, L2 subcarriers; L1 and L2 are positive integers.
所述第四维度为时域维度;所述第一粒度包含以下之一:K1个微秒、K2个符号长度、K3个符号的采样点个数;K1、K2和K3为正整数。The fourth dimension is a time domain dimension; the first granularity includes one of the following: K1 microseconds, K2 symbol length, K3 number of sampling points of symbols; K1, K2 and K3 are positive integers.
所述符号为OFDM符号。The symbols are OFDM symbols.
所述第五维度为空间域维度;The fifth dimension is a spatial domain dimension;
所述第二粒度为一对收发天线或到达角度的间隔。The second granularity is an interval between a pair of transmitting and receiving antennas or an angle of arrival.
所述第六维度为复数维度;所述第三粒度为1,在所述复数维度下的第三粒度的数量W为2。The sixth dimension is a complex dimension; the third granularity is 1, and the quantity W of the third granularity under the complex dimension is 2.
所述k为第一值的情况下,所述三维矩阵中的第ijk个位置的数值用于表示在所述第四维度下的第i个第一粒度、所述第五维度的第j个第二粒度下的信道质量的实部;When the k is the first value, the value of the ijkth position in the three-dimensional matrix is used to represent the i-th first granularity in the fourth dimension and the j-th granularity in the fifth dimension The real part of the channel quality at the second granularity;
所述k为第二值的情况下,所述三维矩阵中的第ijk个位置的数值用于表示在所述第四维度下的第i个第一粒度、所述第五维度的第j个第二粒度下的信道质量的虚部。When the k is the second value, the value of the ijkth position in the three-dimensional matrix is used to represent the i-th first granularity in the fourth dimension and the j-th granularity in the fifth dimension The imaginary part of the channel quality at the second granularity.
图22是根据本申请实施例的通信设备2200示意性结构图。该通信设备2200包括处理器2210,处理器2210可以从存储器中调用并运行计算机程序,以使通信设备2200实现本申请实施例中的方法。Fig. 22 is a schematic structural diagram of a communication device 2200 according to an embodiment of the present application. The communication device 2200 includes a processor 2210, and the processor 2210 can invoke and run a computer program from a memory, so that the communication device 2200 implements the method in the embodiment of the present application.
可选地,通信设备2200还可以包括存储器2220。其中,处理器2210可以从存储器2220中调用并运行计算机程序,以使通信设备2200实现本申请实施例中的方法。Optionally, the communication device 2200 may further include a memory 2220 . Wherein, the processor 2210 may call and run a computer program from the memory 2220, so that the communication device 2200 implements the method in the embodiment of the present application.
其中,存储器2220可以是独立于处理器2210的一个单独的器件,也可以集成在处理器2210中。Wherein, the memory 2220 may be an independent device independent of the processor 2210 , or may be integrated in the processor 2210 .
可选地,通信设备2200还可以包括收发器2230,处理器2210可以控制该收发器2230与其他设备进行通信,具体地,可以向其他设备发送信息或数据,或接收其他设备发送的信息或数据。Optionally, the communication device 2200 may further include a transceiver 2230, and the processor 2210 may control the transceiver 2230 to communicate with other devices, specifically, may send information or data to other devices, or receive information or data sent by other devices .
其中,收发器2230可以包括发射机和接收机。收发器2230还可以进一步包括天线,天线的数量可以为一个或多个。Wherein, the transceiver 2230 may include a transmitter and a receiver. The transceiver 2230 may further include an antenna, and the number of antennas may be one or more.
可选地,该通信设备2200可为本申请实施例的网络设备,并且该通信设备2200可以实现本申请实施例的各个方法中由网络设备实现的相应流程,为了简洁,在此不再赘述。Optionally, the communication device 2200 may be the network device of the embodiment of the present application, and the communication device 2200 may implement the corresponding processes implemented by the network device in the methods of the embodiment of the present application. For the sake of brevity, details are not repeated here.
可选地,该通信设备2200可为本申请实施例的终端设备,并且该通信设备2200可以实现本申请实施例的各个方法中由终端设备实现的相应流程,为了简洁,在此不再赘述。Optionally, the communication device 2200 may be the terminal device of the embodiment of the present application, and the communication device 2200 may implement the corresponding processes implemented by the terminal device in each method of the embodiment of the present application. For the sake of brevity, details are not repeated here.
图23是根据本申请实施例的芯片2300的示意性结构图。该芯片2300包括处理器2310,处理器2310可以从存储器中调用并运行计算机程序,以实现本申请实施例中的方法。FIG. 23 is a schematic structural diagram of a chip 2300 according to an embodiment of the present application. The chip 2300 includes a processor 2310, and the processor 2310 can call and run a computer program from the memory, so as to implement the method in the embodiment of the present application.
可选地,芯片2300还可以包括存储器2320。其中,处理器2310可以从存储器2320中调用并运行计算机程序,以实现本申请实施例中由终端设备或者网络设备执行的方法。Optionally, the chip 2300 may also include a memory 2320 . Wherein, the processor 2310 may invoke and run a computer program from the memory 2320, so as to implement the method performed by the terminal device or the network device in the embodiment of the present application.
其中,存储器2320可以是独立于处理器2310的一个单独的器件,也可以集成在处理器2310中。Wherein, the memory 2320 may be an independent device independent of the processor 2310 , or may be integrated in the processor 2310 .
可选地,该芯片2300还可以包括输入接口2330。其中,处理器2310可以控制该输入接口2330与其他设备或芯片进行通信,具体地,可以获取其他设备或芯片发送的信息或数据。Optionally, the chip 2300 may also include an input interface 2330 . Wherein, the processor 2310 can control the input interface 2330 to communicate with other devices or chips, specifically, can obtain information or data sent by other devices or chips.
可选地,该芯片2300还可以包括输出接口2340。其中,处理器2310可以控制该输出接口2340与其他设备或芯片进行通信,具体地,可以向其他设备或芯片输出信息或数据。Optionally, the chip 2300 may also include an output interface 2340 . Wherein, the processor 2310 can control the output interface 2340 to communicate with other devices or chips, specifically, can output information or data to other devices or chips.
可选地,该芯片可应用于本申请实施例中的网络设备,并且该芯片可以实现本申请实施例的各个方法中由网络设备实现的相应流程,为了简洁,在此不再赘述。Optionally, the chip can be applied to the network device in the embodiment of the present application, and the chip can implement the corresponding processes implemented by the network device in the methods of the embodiment of the present application. For the sake of brevity, details are not repeated here.
可选地,该芯片可应用于本申请实施例中的终端设备,并且该芯片可以实现本申请实施例的各个方法中由终端设备实现的相应流程,为了简洁,在此不再赘述。Optionally, the chip can be applied to the terminal device in the embodiments of the present application, and the chip can implement the corresponding processes implemented by the terminal device in the methods of the embodiments of the present application. For the sake of brevity, details are not repeated here.
应用于网络设备和终端设备的芯片可以是相同的芯片或不同的芯片。Chips applied to network devices and terminal devices may be the same chip or different chips.
应理解,本申请实施例提到的芯片还可以称为系统级芯片,系统芯片,芯片系统或片上系统芯片等。It should be understood that the chip mentioned in the embodiment of the present application may also be called a system-on-chip, a system-on-chip, a system-on-a-chip, or a system-on-a-chip.
上述提及的处理器可以是通用处理器、数字信号处理器(digital signal processor,DSP)、现成可编程门阵列(field programmable gate array,FPGA)、专用集成电路(application specific integrated circuit,ASIC)或者其他可编程逻辑器件、晶体管逻辑器件、分立硬件组件等。其中,上述提到的通用处理器可以是微处理器或者也可以是任何常规的处理器等。The processor mentioned above can be a general-purpose processor, a digital signal processor (DSP), an off-the-shelf programmable gate array (FPGA), an application specific integrated circuit (ASIC) or Other programmable logic devices, transistor logic devices, discrete hardware components, etc. Wherein, the general-purpose processor mentioned above may be a microprocessor or any conventional processor or the like.
上述提及的存储器可以是易失性存储器或非易失性存储器,或可包括易失性和非易失性存储器两者。其中,非易失性存储器可以是只读存储器(read-only memory,ROM)、可编程只读存储器(programmable ROM,PROM)、可擦除可编程只读存储器(erasable PROM,EPROM)、电可擦除可编程只读存储器(electrically EPROM,EEPROM)或闪存。易失性存储器可以是随机存取存储器(random access memory,RAM)。The aforementioned memories may be volatile memories or nonvolatile memories, or may include both volatile and nonvolatile memories. Among them, the non-volatile memory can be read-only memory (read-only memory, ROM), programmable read-only memory (programmable ROM, PROM), erasable programmable read-only memory (erasable PROM, EPROM), electrically programmable Erases programmable read-only memory (electrically EPROM, EEPROM) or flash memory. The volatile memory may be random access memory (RAM).
应理解,上述存储器为示例性但不是限制性说明,例如,本申请实施例中的存储器还可以是静态随机存取存储器(static RAM,SRAM)、动态随机存取存储器(dynamic RAM,DRAM)、同步动态随机存取存储器(synchronous DRAM,SDRAM)、双倍数据速率同步动态随机存取存储器(double data rate SDRAM,DDR SDRAM)、增强型同步动态随机存取存储器(enhanced SDRAM,ESDRAM)、同步连接动态随机存取存储器(synch link DRAM,SLDRAM)以及直接内存总线随机存取存储器(Direct Rambus RAM,DR RAM)等等。也就是说,本申请实施例中的存储器旨在包括但不限 于这些和任意其它适合类型的存储器。It should be understood that the above-mentioned memory is illustrative but not restrictive. For example, the memory in the embodiment of the present application may also be a static random access memory (static RAM, SRAM), a dynamic random access memory (dynamic RAM, DRAM), Synchronous dynamic random access memory (synchronous DRAM, SDRAM), double data rate synchronous dynamic random access memory (double data rate SDRAM, DDR SDRAM), enhanced synchronous dynamic random access memory (enhanced SDRAM, ESDRAM), synchronous connection Dynamic random access memory (synch link DRAM, SLDRAM) and direct memory bus random access memory (Direct Rambus RAM, DR RAM), etc. That is, memory in embodiments of the present application is intended to include, but not be limited to, these and any other suitable types of memory.
图24是根据本申请实施例的通信系统2400的示意性框图。该通信系统2400包括终端设备2410和网络设备2420。Fig. 24 is a schematic block diagram of a communication system 2400 according to an embodiment of the present application. The communication system 2400 includes a terminal device 2410 and a network device 2420 .
其中,该终端设备2410可以用于实现上述方法中由终端设备实现的相应的功能,以及该网络设备2420可以用于实现上述方法中由网络设备实现的相应的功能。为了简洁,在此不再赘述。Wherein, the terminal device 2410 may be used to realize corresponding functions realized by the terminal device in the above method, and the network device 2420 may be used to realize corresponding functions realized by the network device in the above method. For the sake of brevity, details are not repeated here.
在上述实施例中,可以全部或部分地通过软件、硬件、固件或者其任意组合来实现。当使用软件实现时,可以全部或部分地以计算机程序产品的形式实现。该计算机程序产品包括一个或多个计算机指令。在计算机上加载和执行该计算机程序指令时,全部或部分地产生按照本申请实施例中的流程或功能。该计算机可以是通用计算机、专用计算机、计算机网络、或者其他可编程装置。该计算机指令可以存储在计算机可读存储介质中,或者从一个计算机可读存储介质向另一个计算机可读存储介质传输,例如,该计算机指令可以从一个网站站点、计算机、服务器或数据中心通过有线(例如同轴电缆、光纤、数字用户线(Digital Subscriber Line,DSL))或无线(例如红外、无线、微波等)方式向另一个网站站点、计算机、服务器或数据中心进行传输。该计算机可读存储介质可以是计算机能够存取的任何可用介质或者是包含一个或多个可用介质集成的服务器、数据中心等数据存储设备。该可用介质可以是磁性介质,(例如,软盘、硬盘、磁带)、光介质(例如,DVD)、或者半导体介质(例如固态硬盘(Solid State Disk,SSD))等。In the above embodiments, all or part of them may be implemented by software, hardware, firmware or any combination thereof. When implemented using software, it may be implemented in whole or in part in the form of a computer program product. The computer program product includes one or more computer instructions. When the computer program instructions are loaded and executed on the computer, the processes or functions according to the embodiments of the present application will be generated in whole or in part. The computer can be a general purpose computer, a special purpose computer, a computer network, or other programmable device. The computer instructions may be stored in or transmitted from one computer-readable storage medium to another computer-readable storage medium, e.g. (such as coaxial cable, optical fiber, digital subscriber line (Digital Subscriber Line, DSL)) or wireless (such as infrared, wireless, microwave, etc.) to another website site, computer, server or data center. The computer-readable storage medium may be any available medium that can be accessed by a computer, or a data storage device such as a server or a data center integrated with one or more available media. The available medium may be a magnetic medium (such as a floppy disk, a hard disk, or a magnetic tape), an optical medium (such as a DVD), or a semiconductor medium (such as a solid state disk (Solid State Disk, SSD)), etc.
应理解,在本申请的各种实施例中,上述各过程的序号的大小并不意味着执行顺序的先后,各过程的执行顺序应以其功能和内在逻辑确定,而不应对本申请实施例的实施过程构成任何限定。It should be understood that, in various embodiments of the present application, the sequence numbers of the above-mentioned processes do not mean the order of execution, and the execution order of the processes should be determined by their functions and internal logic, and should not be used in the embodiments of the present application. The implementation process constitutes any limitation.
所属领域的技术人员可以清楚地了解到,为描述的方便和简洁,上述描述的系统、装置和单元的具体工作过程,可以参考前述方法实施例中的对应过程,在此不再赘述。Those skilled in the art can clearly understand that for the convenience and brevity of the description, the specific working process of the above-described system, device and unit can refer to the corresponding process in the foregoing method embodiment, which will not be repeated here.
以上所述仅为本申请的具体实施方式,但本申请的保护范围并不局限于此,任何熟悉本技术领域的技术人员在本申请揭露的技术范围内,可轻易想到变化或替换,都应涵盖在本申请的保护范围之内。因此,本申请的保护范围应以该权利要求的保护范围为准。The above is only the specific implementation of the application, but the scope of protection of the application is not limited thereto. Anyone familiar with the technical field can easily think of changes or substitutions within the technical scope disclosed in the application, and should covered within the scope of protection of this application. Therefore, the protection scope of the present application should be based on the protection scope of the claims.

Claims (312)

  1. 一种信息处理方法,包括:An information processing method, comprising:
    终端设备接收第一信息;The terminal device receives the first information;
    所述终端设备发送基于第一信息得到的第二信息;The terminal device sends second information obtained based on the first information;
    其中,所述第二信息为所述第一信息经由第一模型处理得到的,所述第二信息用于经由第二模型进行处理以得到信道信息;所述第一模型和第二模型为联合训练得到的。Wherein, the second information is obtained by processing the first information through the first model, and the second information is used for processing through the second model to obtain channel information; the first model and the second model are a joint obtained by training.
  2. 根据权利要求1所述的方法,其中,所述第二信息为信道压缩信息;The method according to claim 1, wherein the second information is channel compression information;
    所述第一模型用于基于输入的所述第一信息进行处理得到信道压缩信息。The first model is used to process the input first information to obtain channel compression information.
  3. 根据权利要求2所述的方法,其中,所述第一模型包括:估计子模型和压缩子模型;The method of claim 2, wherein the first model comprises: an estimation sub-model and a compression sub-model;
    其中,所述估计子模型用于基于所述第一信息进行信道估计得到信道估计信息;Wherein, the estimation sub-model is used to perform channel estimation based on the first information to obtain channel estimation information;
    所述压缩子模型用于对所述信道估计信息进行压缩得到信道压缩信息。The compression sub-model is used to compress the channel estimation information to obtain channel compression information.
  4. 根据权利要求3所述的方法,其中,所述方法还包括:The method according to claim 3, wherein the method further comprises:
    所述终端设备将所述第一信息输入所述估计子模型,得到所述估计子模型输出的信道估计信息;The terminal device inputs the first information into the estimation sub-model, and obtains channel estimation information output by the estimation sub-model;
    所述终端设备将所述信道估计信息输入所述压缩子模型,得到所述压缩子模型输出的信道压缩信息。The terminal device inputs the channel estimation information into the compression sub-model, and obtains channel compression information output by the compression sub-model.
  5. 根据权利要求1-4任一项所述的方法,其中,所述第一信息为参考信号。The method according to any one of claims 1-4, wherein the first information is a reference signal.
  6. 根据权利要求1所述的方法,其中,所述第二信息为信道压缩信息;所述信道压缩信息包含压缩的信道估计信息的特征向量信息;The method according to claim 1, wherein the second information is channel compression information; the channel compression information includes eigenvector information of compressed channel estimation information;
    所述第一模型用于对输入的所述第一信息进行处理得到压缩的信道估计信息的特征向量信息。The first model is used to process the input first information to obtain eigenvector information of compressed channel estimation information.
  7. 根据权利要求6所述的方法,其中,所述第一模型包括:估计子模型、信道生成子模型和压缩子模型;The method according to claim 6, wherein the first model comprises: an estimation submodel, a channel generation submodel and a compression submodel;
    其中,所述估计子模型用于对所述第一信息进行信道估计得到信道估计信息;Wherein, the estimation sub-model is used to perform channel estimation on the first information to obtain channel estimation information;
    所述信道生成子模型用于对所述信道估计信息进行特征分解得到信道估计信息的特征向量信息;The channel generation sub-model is used to perform eigendecomposition on the channel estimation information to obtain eigenvector information of the channel estimation information;
    所述压缩子模型用于对所述信道估计信息的特征向量信息进行压缩得到压缩的信道估计信息的特征向量信息。The compression sub-model is used to compress the eigenvector information of the channel estimation information to obtain compressed eigenvector information of the channel estimation information.
  8. 根据权利要求7所述的方法,其中,The method according to claim 7, wherein,
    所述信道估计信息的特征向量信息包含R组特征向量序列信息;R为正整数。The eigenvector information of the channel estimation information includes R groups of eigenvector sequence information; R is a positive integer.
  9. 根据权利要求7或8所述的方法,其中,所述方法还包括:The method according to claim 7 or 8, wherein the method further comprises:
    所述终端设备将所述第一信息输入所述估计子模型,得到所述估计子模型输出的信道估计信息;The terminal device inputs the first information into the estimation sub-model, and obtains channel estimation information output by the estimation sub-model;
    所述终端设备将所述信道估计信息输入所述信道生成子模型,得到所述信道生成子模型输出的信道估计信息的特征向量信息;The terminal device inputs the channel estimation information into the channel generation sub-model, and obtains eigenvector information of the channel estimation information output by the channel generation sub-model;
    所述终端设备将所述信道估计信息的特征向量信息输入所述压缩子模型,得到所述压缩子模型输出的压缩的信道估计信息的特征向量信息。The terminal device inputs the eigenvector information of the channel estimation information into the compression sub-model, and obtains the eigenvector information of the compressed channel estimation information output by the compression sub-model.
  10. 根据权利要求6-9任一项所述的方法,其中,所述第一信息为参考信号;所述信道信息为所述信道信息的特征向量信息。The method according to any one of claims 6-9, wherein the first information is a reference signal; and the channel information is eigenvector information of the channel information.
  11. 根据权利要求1-10任一项所述的方法,其中,所述方法还包括:The method according to any one of claims 1-10, wherein the method further comprises:
    所述终端设备接收所述第一模型。The terminal device receives the first model.
  12. 根据权利要求9所述的方法,其中,所述第一模型由以下至少之一携带:下行控制信令、媒体接入控制MAC控制元素CE消息、无线资源控制RRC消息、广播消息、下行数据传输、针对人工智能类业务传输需求的下行数据传输。The method according to claim 9, wherein the first model is carried by at least one of the following: downlink control signaling, media access control MAC control element CE message, radio resource control RRC message, broadcast message, downlink data transmission , Downlink data transmission for artificial intelligence business transmission requirements.
  13. 根据权利要求1-10任一项所述的方法,其中,所述方法还包括:The method according to any one of claims 1-10, wherein the method further comprises:
    所述终端设备接收估计子模型以及压缩子模型;The terminal device receives the estimated sub-model and the compressed sub-model;
    所述终端设备基于所述估计子模型以及所述压缩子模型,生成所述第一模型。The terminal device generates the first model based on the estimated sub-model and the compressed sub-model.
  14. 根据权利要求1-10任一项所述的方法,其中,所述方法还包括:The method according to any one of claims 1-10, wherein the method further comprises:
    所述终端设备接收估计子模型、压缩子模型以及信道生成子模型;The terminal device receives an estimation sub-model, a compression sub-model and a channel generation sub-model;
    所述终端设备基于所述估计子模型、所述压缩子模型以及所述信道生成子模型,生成所述第一模型。The terminal device generates the first model based on the estimation sub-model, the compression sub-model and the channel generation sub-model.
  15. 根据权利要求13或14所述的方法,其中,所述估计子模型由以下之一携带:下行控制信令、MAC CE消息、RRC消息、广播消息、下行数据传输、针对人工智能类业务传输需求的下行数据传输;The method according to claim 13 or 14, wherein the estimated sub-model is carried by one of the following: downlink control signaling, MAC CE message, RRC message, broadcast message, downlink data transmission, transmission requirements for artificial intelligence services downlink data transmission;
    所述压缩子模型由以下之一携带:下行控制信令、MAC CE消息、RRC消息、广播消息、下行数据传输、针对人工智能类业务传输需求的下行数据传输;The compressed sub-model is carried by one of the following: downlink control signaling, MAC CE message, RRC message, broadcast message, downlink data transmission, downlink data transmission for artificial intelligence business transmission requirements;
    所述信道生成子模型由以下之一携带:下行控制信令、MAC CE消息、RRC消息、广播消息、下行数据传输、针对人工智能类业务传输需求的下行数据传输。The channel generation sub-model is carried by one of the following: downlink control signaling, MAC CE message, RRC message, broadcast message, downlink data transmission, downlink data transmission for artificial intelligence business transmission requirements.
  16. 根据权利要求11-15任一项所述的方法,其中,所述方法还包括:The method according to any one of claims 11-15, wherein the method further comprises:
    所述终端设备接收所述第二模型。The terminal device receives the second model.
  17. 根据权利要求11-16任一项所述的方法,其中,所述方法还包括:The method according to any one of claims 11-16, wherein the method further comprises:
    所述终端设备接收所述第三模型。The terminal device receives the third model.
  18. 根据权利要求17所述的方法,其中,所述第三模型用于对所述第一模型输出的第二信息进行数据变换处理后输入所述第二模型;The method according to claim 17, wherein the third model is used to input the second information output by the first model into the second model after data conversion processing;
    所述第一模型、第二模型以及第三模型为联合训练得到的。The first model, the second model and the third model are obtained through joint training.
  19. 根据权利要求18所述的方法,其中,所述数据变换处理包括:卷积处理或傅里叶变换处理。The method according to claim 18, wherein said data transformation processing comprises: convolution processing or Fourier transform processing.
  20. 根据权利要求1-10任一项所述的方法,其中,所述方法还包括:The method according to any one of claims 1-10, wherein the method further comprises:
    所述终端设备采用训练样本对第一预设模型和第二预设模型进行联合训练,得到训练后的所述第一模型和所述第二模型;The terminal device uses training samples to jointly train the first preset model and the second preset model to obtain the trained first model and the second model;
    其中,所述第一模型为训练后的所述第一预设模型,所述第二模型为训练后的所述第二预设模型。Wherein, the first model is the first preset model after training, and the second model is the second preset model after training.
  21. 根据权利要求20所述的方法,其中,所述训练采用的损失函数为第一损失函数;The method according to claim 20, wherein the loss function used in the training is a first loss function;
    所述第一损失函数为基于所述第二预设模型的输出信息与所述第一预设模型的压缩预设子模型的输入信息之间的差异程度构建的。The first loss function is constructed based on the degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model.
  22. 根据权利要求21所述的方法,其中,所述第二预设模型的输出信息与所述第一预设模型的压缩预设子模型的输入信息之间的差异程度为基于距离确定的,或者为基于相似程度确定的。The method of claim 21, wherein the degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model is determined based on distance, or determined on the basis of similarity.
  23. 根据权利要求21或22所述的方法,其中,所述终端设备采用训练样本对第一预设模型和第二预设模型进行联合训练,包括:The method according to claim 21 or 22, wherein the terminal device uses training samples to jointly train the first preset model and the second preset model, comprising:
    所述终端设备将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The terminal device inputs the first training sample into the estimated preset sub-model of the first preset model, and obtains initial information output by the estimated preset sub-model;
    将所述初始信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的信息;inputting the initial information into the compressed preset sub-model of the first preset model to obtain compressed information output by the compressed preset sub-model;
    将所述压缩后的信息输入所述第二预设模型,得到所述第二预设模型输出的恢复信息;inputting the compressed information into the second preset model to obtain restoration information output by the second preset model;
    基于所述恢复信息以及所述初始信息之间的差异程度,确定所述第一损失函数;determining the first loss function based on the degree of difference between the restoration information and the initial information;
    根据所述第一损失函数进行反向传导更新所述第一预设模型和所述第二预设模型。performing reverse conduction according to the first loss function to update the first preset model and the second preset model.
  24. 根据权利要求21或22所述的方法,其中,所述终端设备采用训练样本对第一预设模型和第二预设模型进行联合训练,包括:The method according to claim 21 or 22, wherein the terminal device uses training samples to jointly train the first preset model and the second preset model, comprising:
    所述终端设备将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The terminal device inputs the first training sample into the estimated preset sub-model of the first preset model, and obtains initial information output by the estimated preset sub-model;
    将所述初始信息输入所述第一预设模型的信道生成预设子模型,得到所述信道生成预设子模型输出的初始信息的特征向量信息;inputting the initial information into the channel generation preset sub-model of the first preset model, and obtaining eigenvector information of the initial information output by the channel generation preset sub-model;
    将所述初始信息的特征向量信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的特征向量信息;Inputting the eigenvector information of the initial information into the compressed preset sub-model of the first preset model to obtain the compressed eigenvector information output by the compressed preset sub-model;
    将所述压缩后的特征向量信息输入所述第二预设模型,得到所述第二预设模型输出的恢复的特征向量信息;inputting the compressed feature vector information into the second preset model to obtain restored feature vector information output by the second preset model;
    基于所述恢复的特征向量信息以及所述初始信息的特征向量信息之间的差异程度,确定第一损失函数;determining a first loss function based on the degree of difference between the restored eigenvector information and the eigenvector information of the initial information;
    根据所述第一损失函数进行反向传导更新所述第一预设模型和所述第二预设模型。performing reverse conduction according to the first loss function to update the first preset model and the second preset model.
  25. 根据权利要求21或22所述的方法,其中,所述终端设备采用训练样本对第一预设模型和第二预设模型进行联合训练,得到训练后的所述第一模型和所述第二模型,包括:The method according to claim 21 or 22, wherein the terminal device uses training samples to jointly train the first preset model and the second preset model to obtain the trained first model and the second preset model. models, including:
    所述终端设备采用训练样本对第一预设模型、第二预设模型和第三预设模型进行联合训练,得到训练后的所述第一模型、所述第二模型和第三模型;The terminal device uses training samples to jointly train the first preset model, the second preset model and the third preset model to obtain the trained first model, the second model and the third model;
    其中,所述第三模型为训练后的第三预设模型。Wherein, the third model is a trained third preset model.
  26. 根据权利要求25所述的方法,其中,所述终端设备采用训练样本对第一预设模型、第二预设模型和第三预设模型进行联合训练,包括:The method according to claim 25, wherein the terminal device uses training samples to jointly train the first preset model, the second preset model and the third preset model, comprising:
    所述终端设备将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The terminal device inputs the first training sample into the estimated preset sub-model of the first preset model, and obtains initial information output by the estimated preset sub-model;
    将所述初始信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的信息;inputting the initial information into the compressed preset sub-model of the first preset model to obtain compressed information output by the compressed preset sub-model;
    将所述压缩后的信息输入第三预设模型,得到所述第三预设模型输出的变换后的信息;inputting the compressed information into a third preset model to obtain transformed information output by the third preset model;
    将所述变换后的信息输入所述第二预设模型,得到所述第二预设模型输出的恢复信息;inputting the transformed information into the second preset model to obtain restoration information output by the second preset model;
    基于所述恢复信息以及所述初始信息之间的差异程度,确定第一损失函数;determining a first loss function based on the degree of difference between the restoration information and the initial information;
    根据所述第一损失函数进行反向传导更新所述第一预设模型、第二预设模型和第三预设模型。performing reverse conduction according to the first loss function to update the first preset model, the second preset model and the third preset model.
  27. 根据权利要求25所述的方法,其中,所述终端设备采用训练样本对第一预设模型、第二预设模型和第三预设模型进行联合训练,包括:The method according to claim 25, wherein the terminal device uses training samples to jointly train the first preset model, the second preset model and the third preset model, comprising:
    所述终端设备将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The terminal device inputs the first training sample into the estimated preset sub-model of the first preset model, and obtains initial information output by the estimated preset sub-model;
    将所述初始信息输入所述第一预设模型的信道生成预设子模型,得到所述信道生成预设子模型输出的初始信息的特征向量信息;inputting the initial information into the channel generation preset sub-model of the first preset model, and obtaining eigenvector information of the initial information output by the channel generation preset sub-model;
    将所述初始信息的特征向量信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的特征向量信息;Inputting the eigenvector information of the initial information into the compressed preset sub-model of the first preset model to obtain the compressed eigenvector information output by the compressed preset sub-model;
    将所述压缩后的特征向量信息输入第三预设模型,得到所述第三预设模型输出的变换后的特征向量信息;Inputting the compressed feature vector information into a third preset model to obtain transformed feature vector information output by the third preset model;
    将所述变换后的特征向量信息输入所述第二预设模型,得到所述第二预设模型输出的恢复的特征向量信息;inputting the transformed eigenvector information into the second preset model to obtain restored eigenvector information output by the second preset model;
    基于所述恢复的特征向量信息以及所述初始信息的特征向量信息之间的差异程度,确定第一损失函数;determining a first loss function based on the degree of difference between the restored eigenvector information and the eigenvector information of the initial information;
    根据所述第一损失函数进行反向传导更新所述第一预设模型、第二预设模型和第三预设模型。performing reverse conduction according to the first loss function to update the first preset model, the second preset model and the third preset model.
  28. 根据权利要求20所述的方法,其中,所述训练采用的损失函数为第二损失函数;The method according to claim 20, wherein the loss function used in the training is a second loss function;
    所述第二损失函数为基于所述第二预设模型的输出信息与所述第一预设模型的压缩预设子模型的输入信息之间的第一差异程度,以及所述第一预设模型的估计预设子模型的输出信息与第二训练样本之间的第二差异程度构建的;其中,所述第二训练样本与输入所述估计预设子模型的第一训练样本相对应。The second loss function is based on a first degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model, and the first preset The second degree of difference between the output information of the estimated preset sub-model of the model and the second training sample is constructed; wherein the second training sample corresponds to the first training sample input to the estimated preset sub-model.
  29. 根据权利要求28所述的方法,其中,所述第一差异程度为基于距离确定的,或者为基于相似程度确定的;和/或,The method according to claim 28, wherein the first degree of difference is determined based on a distance, or determined based on a degree of similarity; and/or,
    所述第二差异程度为基于距离确定的,或者为基于相似程度确定的。The second degree of difference is determined based on distance, or determined based on similarity.
  30. 根据权利要求28或29所述的方法,其中,所述终端设备采用训练样本对第一预设模型和第二预设模型进行联合训练,包括:The method according to claim 28 or 29, wherein the terminal device uses training samples to jointly train the first preset model and the second preset model, comprising:
    所述终端设备将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The terminal device inputs the first training sample into the estimated preset sub-model of the first preset model, and obtains initial information output by the estimated preset sub-model;
    将所述初始信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的信息;inputting the initial information into the compressed preset sub-model of the first preset model to obtain compressed information output by the compressed preset sub-model;
    将所述压缩后的信息输入所述第二预设模型,得到所述第二预设模型输出的恢复信息;inputting the compressed information into the second preset model to obtain restoration information output by the second preset model;
    基于所述恢复信息与所述初始信息之间的第一差异程度,以及基于所述初始信息与第二训练样本之间的第二差异程度,确定第二损失函数;所述第二训练样本与所述第一训练样本相对应;Determine a second loss function based on a first degree of difference between the restored information and the initial information, and based on a second degree of difference between the initial information and a second training sample; the second training sample and The first training sample corresponds to;
    根据所述第二损失函数进行反向传导更新所述第一预设模型和所述第二预设模型。performing reverse conduction according to the second loss function to update the first preset model and the second preset model.
  31. 根据权利要求28或29所述的方法,其中,所述终端设备采用训练样本对第一预设模型和第二预设模型进行联 合训练,包括:The method according to claim 28 or 29, wherein the terminal device uses training samples to jointly train the first preset model and the second preset model, comprising:
    所述终端设备将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The terminal device inputs the first training sample into the estimated preset sub-model of the first preset model, and obtains initial information output by the estimated preset sub-model;
    将所述初始信息输入所述第一预设模型的信道生成预设子模型,得到所述信道生成预设子模型输出的初始信息的特征向量信息;inputting the initial information into the channel generation preset sub-model of the first preset model, and obtaining eigenvector information of the initial information output by the channel generation preset sub-model;
    将所述初始信息的特征向量信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的特征向量信息;Inputting the eigenvector information of the initial information into the compressed preset sub-model of the first preset model to obtain the compressed eigenvector information output by the compressed preset sub-model;
    将所述压缩后的特征向量信息输入所述第二预设模型,得到所述第二预设模型输出的恢复的特征向量信息;inputting the compressed feature vector information into the second preset model to obtain restored feature vector information output by the second preset model;
    基于所述恢复的特征向量信息以及所述初始信息的特征向量信息之间的第一差异程度,以及基于所述初始信息与第二训练样本之间的第二差异程度,确定第二损失函数;所述第二训练样本与所述第一训练样本对应;determining a second loss function based on a first degree of difference between the restored feature vector information and the feature vector information of the initial information, and based on a second degree of difference between the initial information and a second training sample; The second training sample corresponds to the first training sample;
    根据所述第二损失函数进行反向传导更新所述第一预设模型和所述第二预设模型。performing reverse conduction according to the second loss function to update the first preset model and the second preset model.
  32. 根据权利要求28或29所述的方法,其中,所述终端设备采用训练样本对第一预设模型和第二预设模型进行联合训练,得到训练后的所述第一模型和所述第二模型,包括:The method according to claim 28 or 29, wherein the terminal device uses training samples to jointly train the first preset model and the second preset model to obtain the trained first model and the second preset model. models, including:
    所述终端设备采用训练样本对第一预设模型、第二预设模型和第三预设模型进行联合训练,得到训练后的所述第一模型、所述第二模型和第三模型;The terminal device uses training samples to jointly train the first preset model, the second preset model and the third preset model to obtain the trained first model, the second model and the third model;
    其中,所述第三模型为训练后的第三预设模型。Wherein, the third model is a trained third preset model.
  33. 根据权利要求32所述的方法,其中,所述终端设备采用训练样本对第一预设模型、第二预设模型和第三预设模型进行联合训练,包括:The method according to claim 32, wherein the terminal device uses training samples to jointly train the first preset model, the second preset model and the third preset model, comprising:
    所述终端设备将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The terminal device inputs the first training sample into the estimated preset sub-model of the first preset model, and obtains initial information output by the estimated preset sub-model;
    将所述初始信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的信息;inputting the initial information into the compressed preset sub-model of the first preset model to obtain compressed information output by the compressed preset sub-model;
    将所述压缩后的信息输入所述预设模型的第三预设模型,得到所述第三预设模型输出的变换后的信息;inputting the compressed information into a third preset model of the preset model to obtain transformed information output by the third preset model;
    将所述变换后的信息输入所述第二预设模型,得到所述第二预设模型输出的恢复信息;inputting the transformed information into the second preset model to obtain restoration information output by the second preset model;
    基于所述恢复信息与所述初始信息之间的第一差异程度,以及基于所述初始信息与第二训练样本之间的第二差异程度,确定第二损失函数;所述第二训练样本与所述第一训练样本对应;Determine a second loss function based on a first degree of difference between the restored information and the initial information, and based on a second degree of difference between the initial information and a second training sample; the second training sample and The first training sample corresponds to;
    根据所述第二损失函数进行反向传导更新所述第一预设模型、所述第二预设模型和所述第三预设模型。performing reverse conduction according to the second loss function to update the first preset model, the second preset model, and the third preset model.
  34. 根据权利要求32所述的方法,其中,所述终端设备采用训练样本对第一预设模型、第二预设模型和第三预设模型进行联合训练,包括:The method according to claim 32, wherein the terminal device uses training samples to jointly train the first preset model, the second preset model and the third preset model, comprising:
    所述终端设备将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The terminal device inputs the first training sample into the estimated preset sub-model of the first preset model, and obtains initial information output by the estimated preset sub-model;
    将所述初始信息输入所述第一预设模型的信道生成预设子模型,得到所述信道生成预设子模型输出的初始信息的特征向量信息;inputting the initial information into the channel generation preset sub-model of the first preset model, and obtaining eigenvector information of the initial information output by the channel generation preset sub-model;
    将所述初始信息的特征向量信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的特征向量信息;Inputting the eigenvector information of the initial information into the compressed preset sub-model of the first preset model to obtain the compressed eigenvector information output by the compressed preset sub-model;
    将所述压缩后的特征向量信息输入所述预设模型中的第三预设模型,得到所述第三预设模型输出的变换后的特征向量信息;inputting the compressed feature vector information into a third preset model among the preset models, to obtain transformed feature vector information output by the third preset model;
    将所述变换后的特征向量信息输入所述第二预设模型,得到所述第二预设模型输出的恢复的特征向量信息;inputting the transformed eigenvector information into the second preset model to obtain restored eigenvector information output by the second preset model;
    基于所述恢复的特征向量信息以及所述初始信息的特征向量信息之间的第一差异程度,以及基于所述初始信息与第二训练样本之间的第二差异程度,确定第二损失函数;所述第二训练样本与所述第一训练样本对应;determining a second loss function based on a first degree of difference between the restored feature vector information and the feature vector information of the initial information, and based on a second degree of difference between the initial information and a second training sample; The second training sample corresponds to the first training sample;
    根据所述第二损失函数进行反向传导更新所述第一预设模型、所述第二预设模型和所述第三预设模型。performing reverse conduction according to the second loss function to update the first preset model, the second preset model, and the third preset model.
  35. 根据权利要求20-34任一项所述的方法,其中,所述训练样本中包含第一训练样本。The method according to any one of claims 20-34, wherein the training samples include a first training sample.
  36. 根据权利要求35所述的方法,其中,The method of claim 35, wherein,
    所述第一训练样本分布在第一维度和/或第二维度。The first training samples are distributed in the first dimension and/or the second dimension.
  37. 根据权利要求35所述的方法,其中,The method of claim 35, wherein,
    所述第一维度为时域维度;所述第一训练样本包括在所述时域维度中的m个时间单元内分布的第一信息样本;m为正整数。The first dimension is a time domain dimension; the first training samples include first information samples distributed in m time units in the time domain dimension; m is a positive integer.
  38. 根据权利要求35所述的方法,其中,The method of claim 35, wherein,
    所述第二维度为频域维度;所述第一训练样本包括在所述频域维度中的x个频域资源上分布的第一信息样本;x为正整数。The second dimension is a frequency domain dimension; the first training samples include first information samples distributed on x frequency domain resources in the frequency domain dimension; x is a positive integer.
  39. 根据权利要求36-38任一项所述的方法,其中,所述第一训练样本还分布在第三维度;The method according to any one of claims 36-38, wherein the first training samples are also distributed in a third dimension;
    所述第三维度为复数维度;所述第一训练样本包括第一信息样本的实部和第一信息样本的虚部。The third dimension is a complex dimension; the first training samples include the real part of the first information sample and the imaginary part of the first information sample.
  40. 根据权利要求35-39任一项所述的方法,其中,所述训练样本中还包含与所述第一训练样本对应的第二训练样本;The method according to any one of claims 35-39, wherein the training samples further include a second training sample corresponding to the first training sample;
    所述第二训练样本由T个维度的矩阵构成;T为大于等于2的整数。The second training sample is composed of a matrix of T dimensions; T is an integer greater than or equal to 2.
  41. 根据权利要求40所述的方法,其中,The method of claim 40, wherein,
    所述T个维度中包含第四维度和第五维度。The T dimensions include a fourth dimension and a fifth dimension.
  42. 根据权利要求41所述的方法,其中,The method of claim 41, wherein,
    所述T个维度的矩阵为M×N的二维矩阵;其中,M表示在第四维度下的第一粒度的数量,N表示在第五维度下的第二粒度的数量;M和N均为正整数。The matrix of the T dimensions is a two-dimensional matrix of M×N; wherein, M represents the quantity of the first granularity under the fourth dimension, and N represents the quantity of the second granularity under the fifth dimension; both M and N are is a positive integer.
  43. 根据权利要求42所述的方法,其中,The method of claim 42, wherein,
    所述二维矩阵中的第ij个位置的数值用于表示在所述第四维度下的第i个第一粒度以及第五维度的第j个第二粒度下的信道质量;i和j均为正整数。The value at the ijth position in the two-dimensional matrix is used to represent the channel quality at the i-th first granularity in the fourth dimension and the j-th second granularity in the fifth dimension; both i and j are is a positive integer.
  44. 根据权利要求41-43任一项所述的方法,其中,所述T个维度中还包括第六维度。The method according to any one of claims 41-43, wherein the T dimensions further include a sixth dimension.
  45. 根据权利要求44所述的方法,其中,The method of claim 44, wherein,
    所述T个维度的矩阵为M×N×W的三维矩阵;其中,M表示在第四维度下的第一粒度的数量,N表示在第五维度 下的第二粒度的数量,W表示在第六维度下的第三粒度的数量;M、N和W均为正整数。The matrix of T dimensions is a three-dimensional matrix of M×N×W; wherein, M represents the number of first granularities in the fourth dimension, N represents the number of second granularities in the fifth dimension, and W represents the number of granularities in the fifth dimension. The quantity of the third granularity under the sixth dimension; M, N and W are all positive integers.
  46. 根据权利要求45所述的方法,其中,The method of claim 45, wherein,
    所述三维矩阵中的第ijk个位置的数值用于表示在所述第四维度下的第i个第一粒度、第五维度的第j个第二粒度下、所述第六维度的第k个第三粒度下的信道质量;i、j和k均为正整数。The value of the ijkth position in the three-dimensional matrix is used to represent the i-th first granularity in the fourth dimension, the j-th second granularity in the fifth dimension, and the k-th in the sixth dimension channel quality at the third granularity; i, j and k are all positive integers.
  47. 根据权利要求41-46任一项所述的方法,其中,The method according to any one of claims 41-46, wherein,
    所述第四维度为频域维度;所述第一粒度包含以下之一:L1个资源块RB,L2个子载波;L1和L2为正整数。The fourth dimension is a frequency domain dimension; the first granularity includes one of the following: L1 resource blocks RB, L2 subcarriers; L1 and L2 are positive integers.
  48. 根据权利要求41-46任一项所述的方法,其中,The method according to any one of claims 41-46, wherein,
    所述第四维度为时域维度;所述第一粒度包含以下之一:K1个微秒、K2个符号长度、K3个符号的采样点个数;K1、K2和K3为正整数。The fourth dimension is a time domain dimension; the first granularity includes one of the following: K1 microseconds, K2 symbol length, K3 number of sampling points of symbols; K1, K2 and K3 are positive integers.
  49. 根据权利要求48所述的方法,其中,所述符号为OFDM符号。The method of claim 48, wherein the symbols are OFDM symbols.
  50. 根据权利要求41-49任一项所述的方法,其中,所述第五维度为空间域维度;The method according to any one of claims 41-49, wherein the fifth dimension is a spatial domain dimension;
    所述第二粒度为一对收发天线或到达角度的间隔。The second granularity is an interval between a pair of transmitting and receiving antennas or an angle of arrival.
  51. 根据权利要求44-50任一项所述的方法,其中,The method according to any one of claims 44-50, wherein,
    所述第六维度为复数维度;所述第三粒度为1,在所述复数维度下的第三粒度的数量W为2。The sixth dimension is a complex dimension; the third granularity is 1, and the quantity W of the third granularity under the complex dimension is 2.
  52. 根据权利要求51所述的方法,其中,所述k为第一值的情况下,所述三维矩阵中的第ijk个位置的数值用于表示在所述第四维度下的第i个第一粒度、所述第五维度的第j个第二粒度下的信道质量的实部;The method according to claim 51, wherein, when the k is the first value, the value of the ijkth position in the three-dimensional matrix is used to represent the ith first value in the fourth dimension Granularity, the real part of the channel quality at the jth second granularity of the fifth dimension;
    所述k为第二值的情况下,所述三维矩阵中的第ijk个位置的数值用于表示在所述第四维度下的第i个第一粒度、所述第五维度的第j个第二粒度下的信道质量的虚部。When the k is the second value, the value of the ijkth position in the three-dimensional matrix is used to represent the i-th first granularity in the fourth dimension and the j-th granularity in the fifth dimension The imaginary part of the channel quality at the second granularity.
  53. 根据权利要求20-52任一项所述的方法,其中,所述方法还包括:The method according to any one of claims 20-52, wherein the method further comprises:
    所述终端设备发送所述第二模型。The terminal device sends the second model.
  54. 根据权利要求20-53任一项所述的方法,其中,所述方法还包括:The method according to any one of claims 20-53, wherein the method further comprises:
    所述终端设备发送所述第一模型。The terminal device sends the first model.
  55. 根据权利要求20-53任一项所述的方法,其中,所述方法还包括:The method according to any one of claims 20-53, wherein the method further comprises:
    所述终端设备发送所述第一模型中的估计子模型以及压缩子模型。The terminal device sends the estimated sub-model and the compressed sub-model in the first model.
  56. 根据权利要求20-53任一项所述的方法,其中,所述方法还包括:The method according to any one of claims 20-53, wherein the method further comprises:
    所述终端设备发送所述第一模型中的估计子模型、压缩子模型以及信道生成子模型。The terminal device sends the estimation sub-model, the compression sub-model and the channel generation sub-model in the first model.
  57. 根据权利要求20-56任一项所述的方法,其中,所述方法还包括:The method according to any one of claims 20-56, wherein the method further comprises:
    所述终端设备发送所述第三模型。The terminal device sends the third model.
  58. 根据权利要求53-57任一项所述的方法,其中,所述第一模型由以下至少之一携带:上行控制信令、RRC消息、上行数据传输、针对人工智能类业务类传输需求的上行数据传输;The method according to any one of claims 53-57, wherein the first model is carried by at least one of the following: uplink control signaling, RRC messages, uplink data transmission, and uplink transmission requirements for artificial intelligence services data transmission;
    所述第二模型由以下至少之一携带:上行控制信令、RRC消息、上行数据传输、针对人工智能类业务类传输需求的上行数据传输;The second model is carried by at least one of the following: uplink control signaling, RRC messages, uplink data transmission, and uplink data transmission for artificial intelligence business transmission requirements;
    所述第三模型由以下至少之一携带:上行控制信令、RRC消息、上行数据传输、针对人工智能类业务类传输需求的上行数据传输;The third model is carried by at least one of the following: uplink control signaling, RRC messages, uplink data transmission, and uplink data transmission for artificial intelligence business transmission requirements;
    所述估计子模型由以下至少之一携带:上行控制信令、RRC消息、上行数据传输、针对人工智能类业务类传输需求的上行数据传输;The estimation sub-model is carried by at least one of the following: uplink control signaling, RRC message, uplink data transmission, uplink data transmission for artificial intelligence business type transmission requirements;
    所述压缩子模型由以下至少之一携带:上行控制信令、RRC消息、上行数据传输、针对人工智能类业务类传输需求的上行数据传输;The compressed sub-model is carried by at least one of the following: uplink control signaling, RRC messages, uplink data transmission, and uplink data transmission for artificial intelligence business transmission requirements;
    所述信道生成子模型由以下至少之一携带:上行控制信令、RRC消息、上行数据传输、针对人工智能类业务类传输需求的上行数据传输。The channel generation sub-model is carried by at least one of the following: uplink control signaling, RRC messages, uplink data transmission, and uplink data transmission for the transmission requirements of artificial intelligence services.
  59. 一种信息处理方法,包括:An information processing method, comprising:
    网络设备发送第一信息;The network device sends the first information;
    所述网络设备接收第二信息;其中,所述第二信息为所述第一信息经由第一模型处理得到的;The network device receives second information; wherein, the second information is obtained by processing the first information through a first model;
    所述网络设备基于第二模型对所述第二信息进行处理得到信道信息;其中,所述第一模型和第二模型为联合训练得到的。The network device processes the second information based on a second model to obtain channel information; wherein, the first model and the second model are obtained through joint training.
  60. 根据权利要求59所述的方法,其中,所述第二信息为信道压缩信息;The method of claim 59, wherein the second information is channel compression information;
    所述第二模型用于对所述信道压缩信息进行解压缩处理,得到信道信息。The second model is used to decompress the channel compressed information to obtain channel information.
  61. 根据权利要求60所述的方法,其中,所述网络设备基于第二模型对所述第二信息进行处理得到信道信息,包括:The method according to claim 60, wherein the network device processes the second information based on a second model to obtain channel information, comprising:
    所述网络设备将所述信道压缩信息输入所述第二模型,得到所述第二模型输出的所述信道信息。The network device inputs the channel compression information into the second model to obtain the channel information output by the second model.
  62. 根据权利要求59所述的方法,其中,所述第二信息为信道压缩信息;所述信道压缩信息包含压缩的信道估计信息的特征向量信息;所述信道信息为信道信息的特征向量信息;The method according to claim 59, wherein the second information is channel compression information; the channel compression information includes eigenvector information of compressed channel estimation information; the channel information is eigenvector information of channel information;
    所述第二模型用于对所述压缩的信道估计信息的特征向量信息进行解压缩处理,得到信道信息的特征向量信息。The second model is used to decompress the compressed eigenvector information of the channel estimation information to obtain the eigenvector information of the channel information.
  63. 根据权利要求62所述的方法,其中,The method of claim 62, wherein,
    所述信道信息的特征向量信息中包含R组特征向量序列信息;R为正整数。The eigenvector information of the channel information includes R groups of eigenvector sequence information; R is a positive integer.
  64. 根据权利要求62或63所述的方法,其中,所述网络设备基于第二模型对所述第二信息进行处理得到信道信息,包括:The method according to claim 62 or 63, wherein the network device processes the second information based on a second model to obtain channel information, including:
    所述网络设备将所述压缩的信道估计信息的特征向量信息输入所述第二模型,得到所述第二模型输出的所述信道信息的特征向量信息。The network device inputs the eigenvector information of the compressed channel estimation information into the second model, and obtains the eigenvector information of the channel information output by the second model.
  65. 根据权利要求59-64任一项所述的方法,其中,所述第一信息为参考信号。The method according to any one of claims 59-64, wherein the first information is a reference signal.
  66. 根据权利要求59-65任一项所述的方法,其中,所述方法还包括:The method according to any one of claims 59-65, wherein the method further comprises:
    所述网络设备接收所述第二模型。The network device receives the second model.
  67. 根据权利要求66所述的方法,其中,The method of claim 66, wherein,
    所述第二模型由以下至少之一携带:上行控制信令、RRC消息、上行数据传输、针对人工智能类业务类传输需求的上行数据传输。The second model is carried by at least one of the following: uplink control signaling, RRC message, uplink data transmission, and uplink data transmission for the transmission requirements of artificial intelligence services.
  68. 根据权利要求66或67所述的方法,其中,所述方法还包括:The method according to claim 66 or 67, wherein the method further comprises:
    所述网络设备接收所述第一模型。The network device receives the first model.
  69. 根据权利要求68所述的方法,其中,The method of claim 68, wherein,
    所述第一模型由以下至少之一携带:上行控制信令、RRC消息、上行数据传输、针对人工智能类业务类传输需求的上行数据传输。The first model is carried by at least one of the following: uplink control signaling, RRC message, uplink data transmission, and uplink data transmission for the transmission requirements of artificial intelligence services.
  70. 根据权利要求68所述的方法,其中,所述第一模型包括:估计子模型和压缩子模型;The method of claim 68, wherein the first model comprises: an estimation sub-model and a compression sub-model;
    其中,所述估计子模型用于对所述第一信息进行信道估计得到信道估计信息;Wherein, the estimation sub-model is used to perform channel estimation on the first information to obtain channel estimation information;
    所述压缩子模型用于对所述信道估计信息进行压缩得到所述第二信息。The compression sub-model is used to compress the channel estimation information to obtain the second information.
  71. 根据权利要求68所述的方法,其中,所述第一模型包括:估计子模型、信道生成子模型和压缩子模型;The method of claim 68, wherein the first model comprises: an estimation submodel, a channel generation submodel, and a compression submodel;
    其中,所述估计子模型用于对所述第一信息进行信道估计得到信道估计信息;Wherein, the estimation sub-model is used to perform channel estimation on the first information to obtain channel estimation information;
    所述信道生成子模型用于对所述信道估计信息进行特征分解得到信道估计信息的特征向量信息;The channel generation sub-model is used to perform eigendecomposition on the channel estimation information to obtain eigenvector information of the channel estimation information;
    所述压缩子模型用于对所述信道估计信息的特征向量信息进行压缩得到所述第二信息。The compression sub-model is used to compress the eigenvector information of the channel estimation information to obtain the second information.
  72. 根据权利要求66或67所述的方法,其中,所述方法还包括:The method according to claim 66 or 67, wherein the method further comprises:
    所述网络设备接收所述估计子模型以及所述压缩子模型;The network device receives the estimated sub-model and the compressed sub-model;
    所述网络设备基于所述估计子模型以及所述压缩子模型,生成所述第一模型。The network device generates the first model based on the estimated sub-model and the compressed sub-model.
  73. 根据权利要求72所述的方法,其中,所述估计子模型用于对所述第一信息进行信道估计得到信道估计信息;The method according to claim 72, wherein the estimation sub-model is used to perform channel estimation on the first information to obtain channel estimation information;
    所述压缩子模型用于对所述信道估计信息进行压缩得到所述第二信息。The compression sub-model is used to compress the channel estimation information to obtain the second information.
  74. 根据权利要求66或67所述的方法,其中,所述方法还包括:The method according to claim 66 or 67, wherein the method further comprises:
    所述网络设备接收所述估计子模型、所述压缩子模型、信道生成子模型;The network device receives the estimation sub-model, the compression sub-model, and the channel generation sub-model;
    所述网络设备基于所述估计子模型、所述压缩子模型以及所述信道生成子模型,生成所述第一模型。The network device generates the first model based on the estimation sub-model, the compression sub-model and the channel generation sub-model.
  75. 根据权利要求74所述的方法,其中,The method of claim 74, wherein,
    所述估计子模型用于对所述第一信息进行信道估计得到信道估计信息;The estimation sub-model is used to perform channel estimation on the first information to obtain channel estimation information;
    所述信道生成子模型用于对所述信道估计信息进行特征分解得到信道估计信息的特征向量信息;The channel generation sub-model is used to perform eigendecomposition on the channel estimation information to obtain eigenvector information of the channel estimation information;
    所述压缩子模型用于对所述信道估计信息的特征向量信息进行压缩得到所述第二信息。The compression sub-model is used to compress the eigenvector information of the channel estimation information to obtain the second information.
  76. 根据权利要求72-75任一项所述的方法,其中,所述估计子模型由以下之一携带:上行控制信令、RRC消息、上行数据传输、针对人工智能类业务类传输需求的上行数据传输;The method according to any one of claims 72-75, wherein the estimation sub-model is carried by one of the following: uplink control signaling, RRC message, uplink data transmission, uplink data for artificial intelligence service class transmission requirements transmission;
    所述压缩子模型由以下之一携带:上行控制信令、RRC消息、上行数据传输、针对人工智能类业务类传输需求的上行数据传输;The compressed sub-model is carried by one of the following: uplink control signaling, RRC messages, uplink data transmission, and uplink data transmission for artificial intelligence business transmission requirements;
    所述信道生成子模型由以下之一携带:上行控制信令、RRC消息、上行数据传输、针对人工智能类业务类传输需求的上行数据传输。The channel generation sub-model is carried by one of the following: uplink control signaling, RRC message, uplink data transmission, and uplink data transmission for the transmission requirements of artificial intelligence services.
  77. 根据权利要求66-76任一项所述的方法,其中,所述方法还包括:The method according to any one of claims 66-76, wherein the method further comprises:
    所述网络设备接收第三模型。The network device receives a third model.
  78. 根据权利要求77所述的方法,其中,所述第三模型用于对所述第一模型输出的第二信息进行数据变换处理后输入所述第二模型;The method according to claim 77, wherein the third model is used to input the second information output by the first model into the second model after data conversion processing;
    所述第一模型、第二模型以及第三模型为联合训练得到的。The first model, the second model and the third model are obtained through joint training.
  79. 根据权利要求78所述的方法,其中,所述数据变换处理包括:卷积处理或傅里叶变换处理。The method according to claim 78, wherein said data transformation processing comprises: convolution processing or Fourier transform processing.
  80. 根据权利要求59-65任一项所述的方法,其中,所述方法还包括:The method according to any one of claims 59-65, wherein the method further comprises:
    所述网络设备采用训练样本对第一预设模型和第二预设模型进行联合训练,得到训练后的所述第一模型和所述第二模型;The network device uses training samples to jointly train the first preset model and the second preset model to obtain the trained first model and the second model;
    其中,所述第一模型为训练后的所述第一预设模型,所述第二模型为训练后的所述第二预设模型。Wherein, the first model is the first preset model after training, and the second model is the second preset model after training.
  81. 根据权利要求80所述的方法,其中,所述训练采用的损失函数为第一损失函数;The method according to claim 80, wherein the loss function used in the training is a first loss function;
    所述第一损失函数为基于所述第二预设模型的输出信息与所述第一预设模型的压缩预设子模型的输入信息之间的差异程度构建的。The first loss function is constructed based on the degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model.
  82. 根据权利要求81所述的方法,其中,所述第二预设模型的输出信息与所述第一预设模型的压缩预设子模型的输入信息之间的差异程度为基于距离确定的,或者为基于相似程度确定的。The method of claim 81, wherein the degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model is determined based on distance, or determined on the basis of similarity.
  83. 根据权利要求81或82所述的方法,其中,所述网络设备采用训练样本对第一预设模型和第二预设模型进行联合训练,包括:The method according to claim 81 or 82, wherein the network device uses training samples to jointly train the first preset model and the second preset model, comprising:
    所述网络设备将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The network device inputs the first training sample into the estimated preset sub-model of the first preset model, and obtains initial information output by the estimated preset sub-model;
    将所述初始信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的信息;inputting the initial information into the compressed preset sub-model of the first preset model to obtain compressed information output by the compressed preset sub-model;
    将所述压缩后的信息输入所述第二预设模型,得到所述第二预设模型输出的恢复信息;inputting the compressed information into the second preset model to obtain restoration information output by the second preset model;
    基于所述恢复信息以及所述初始信息之间的差异程度,确定所述第一损失函数;determining the first loss function based on the degree of difference between the restoration information and the initial information;
    根据所述第一损失函数进行反向传导更新所述第一预设模型和所述第二预设模型。performing reverse conduction according to the first loss function to update the first preset model and the second preset model.
  84. 根据权利要求81或82所述的方法,其中,所述网络设备采用训练样本对第一预设模型和第二预设模型进行联合训练,包括:The method according to claim 81 or 82, wherein the network device uses training samples to jointly train the first preset model and the second preset model, comprising:
    所述网络设备将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The network device inputs the first training sample into the estimated preset sub-model of the first preset model, and obtains initial information output by the estimated preset sub-model;
    将所述初始信息输入所述第一预设模型的信道生成预设子模型,得到所述信道生成预设子模型输出的初始信息的特征向量信息;inputting the initial information into the channel generation preset sub-model of the first preset model, and obtaining eigenvector information of the initial information output by the channel generation preset sub-model;
    将所述初始信息的特征向量信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后 的特征向量信息;Input the feature vector information of the initial information into the compressed preset sub-model of the first preset model, and obtain the compressed feature vector information output by the compressed preset sub-model;
    将所述压缩后的特征向量信息输入所述第二预设模型,得到所述第二预设模型输出的恢复的特征向量信息;inputting the compressed feature vector information into the second preset model to obtain restored feature vector information output by the second preset model;
    基于所述恢复的特征向量信息以及所述初始信息的特征向量信息之间的差异程度,确定第一损失函数;determining a first loss function based on the degree of difference between the restored eigenvector information and the eigenvector information of the initial information;
    根据所述第一损失函数进行反向传导更新所述第一预设模型和所述第二预设模型。performing reverse conduction according to the first loss function to update the first preset model and the second preset model.
  85. 根据权利要求81或82所述的方法,其中,所述网络设备采用训练样本对第一预设模型和第二预设模型进行联合训练,得到训练后的所述第一模型和所述第二模型,包括:The method according to claim 81 or 82, wherein the network device uses training samples to jointly train the first preset model and the second preset model to obtain the trained first model and the second preset model. models, including:
    所述网络设备采用训练样本对第一预设模型、第二预设模型和第三预设模型进行联合训练,得到训练后的所述第一模型、所述第二模型和第三模型;其中,所述第三模型为训练后的第三预设模型。The network device uses training samples to jointly train the first preset model, the second preset model and the third preset model to obtain the trained first model, the second model and the third model; wherein , the third model is a trained third preset model.
  86. 根据权利要求85所述的方法,其中,所述网络设备采用训练样本对第一预设模型、第二预设模型和第三预设模型进行联合训练,包括:The method according to claim 85, wherein the network device uses training samples to jointly train the first preset model, the second preset model and the third preset model, comprising:
    所述网络设备将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The network device inputs the first training sample into the estimated preset sub-model of the first preset model, and obtains initial information output by the estimated preset sub-model;
    将所述初始信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的信息;inputting the initial information into the compressed preset sub-model of the first preset model to obtain compressed information output by the compressed preset sub-model;
    将所述压缩后的信息输入第三预设模型,得到所述第三预设模型输出的变换后的信息;inputting the compressed information into a third preset model to obtain transformed information output by the third preset model;
    将所述变换后的信息输入所述第二预设模型,得到所述第二预设模型输出的恢复信息;inputting the transformed information into the second preset model to obtain restoration information output by the second preset model;
    基于所述恢复信息以及所述初始信息之间的差异程度,确定第一损失函数;determining a first loss function based on the degree of difference between the restoration information and the initial information;
    根据所述第一损失函数进行反向传导更新所述第一预设模型、所述第二预设模型和所述第三预设模型。performing reverse conduction according to the first loss function to update the first preset model, the second preset model, and the third preset model.
  87. 根据权利要求85所述的方法,其中,所述网络设备采用训练样本对第一预设模型、第二预设模型和第三预设模型进行联合训练,包括:The method according to claim 85, wherein the network device uses training samples to jointly train the first preset model, the second preset model and the third preset model, comprising:
    所述网络设备将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The network device inputs the first training sample into the estimated preset sub-model of the first preset model, and obtains initial information output by the estimated preset sub-model;
    将所述初始信息输入所述第一预设模型的信道生成预设子模型,得到所述信道生成预设子模型输出的初始信息的特征向量信息;inputting the initial information into the channel generation preset sub-model of the first preset model, and obtaining eigenvector information of the initial information output by the channel generation preset sub-model;
    将所述初始信息的特征向量信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的特征向量信息;Inputting the eigenvector information of the initial information into the compressed preset sub-model of the first preset model to obtain the compressed eigenvector information output by the compressed preset sub-model;
    将所述压缩后的特征向量信息输入第三预设模型,得到所述第三预设模型输出的变换后的特征向量信息;Inputting the compressed feature vector information into a third preset model to obtain transformed feature vector information output by the third preset model;
    将所述变换后的特征向量信息输入所述第二预设模型,得到所述第二预设模型输出的恢复的特征向量信息;inputting the transformed eigenvector information into the second preset model to obtain restored eigenvector information output by the second preset model;
    基于所述恢复的特征向量信息以及所述初始信息的特征向量信息之间的差异程度,确定第一损失函数;determining a first loss function based on the degree of difference between the restored eigenvector information and the eigenvector information of the initial information;
    根据所述第一损失函数进行反向传导更新所述第一预设模型、所述第二预设模型和所述第三预设模型。performing reverse conduction according to the first loss function to update the first preset model, the second preset model, and the third preset model.
  88. 根据权利要求80所述的方法,其中,所述训练采用的损失函数为第二损失函数;The method according to claim 80, wherein the loss function used in the training is a second loss function;
    所述第二损失函数为基于所述第二预设模型的输出信息与所述第一预设模型的压缩预设子模型的输入信息之间的第一差异程度以及所述第一预设模型的估计预设子模型的输出信息与第二训练样本之间的第二差异程度构建的;其中,所述第二训练样本与输入所述估计预设子模型的第一训练样本相对应。The second loss function is based on the first degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model and the first preset model The second difference degree between the output information of the estimated preset sub-model and the second training sample is constructed; wherein, the second training sample corresponds to the first training sample input into the estimated preset sub-model.
  89. 根据权利要求88所述的方法,其中,所述第一差异程度为基于距离确定的,或者为基于相似程度确定的;和/或,The method of claim 88, wherein the first degree of difference is determined based on a distance, or is determined based on a degree of similarity; and/or,
    所述第二差异程度为基于距离确定的,或者为基于相似程度确定的。The second degree of difference is determined based on distance, or determined based on similarity.
  90. 根据权利要求88或89所述的方法,其中,所述网络设备采用训练样本对第一预设模型和第二预设模型进行联合训练,包括:The method according to claim 88 or 89, wherein the network device uses training samples to jointly train the first preset model and the second preset model, comprising:
    所述网络设备将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The network device inputs the first training sample into the estimated preset sub-model of the first preset model, and obtains initial information output by the estimated preset sub-model;
    将所述初始信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的信息;inputting the initial information into the compressed preset sub-model of the first preset model to obtain compressed information output by the compressed preset sub-model;
    将所述压缩后的信息输入所述第二预设模型,得到所述第二预设模型输出的恢复信息;inputting the compressed information into the second preset model to obtain restoration information output by the second preset model;
    基于所述恢复信息与所述初始信息之间的第一差异程度,以及基于所述初始信息与第二训练样本之间的第二差异程度,确定第二损失函数;所述第二训练样本与所述第一训练样本对应;Determine a second loss function based on a first degree of difference between the restored information and the initial information, and based on a second degree of difference between the initial information and a second training sample; the second training sample and The first training sample corresponds to;
    根据所述第二损失函数进行反向传导更新所述第一预设模型和所述第二预设模型。performing reverse conduction according to the second loss function to update the first preset model and the second preset model.
  91. 根据权利要求88或89所述的方法,其中,所述网络设备采用训练样本对第一预设模型和第二预设模型进行联合训练,包括:The method according to claim 88 or 89, wherein the network device uses training samples to jointly train the first preset model and the second preset model, comprising:
    所述网络设备将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The network device inputs the first training sample into the estimated preset sub-model of the first preset model, and obtains initial information output by the estimated preset sub-model;
    将所述初始信息输入所述第一预设模型的信道生成预设子模型,得到所述信道生成预设子模型输出的初始信息的特征向量信息;inputting the initial information into the channel generation preset sub-model of the first preset model, and obtaining eigenvector information of the initial information output by the channel generation preset sub-model;
    将所述初始信息的特征向量信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的特征向量信息;Inputting the eigenvector information of the initial information into the compressed preset sub-model of the first preset model to obtain the compressed eigenvector information output by the compressed preset sub-model;
    将所述压缩后的特征向量信息输入所述第二预设模型,得到所述第二预设模型输出的恢复的特征向量信息;inputting the compressed feature vector information into the second preset model to obtain restored feature vector information output by the second preset model;
    基于所述恢复的特征向量信息以及所述初始信息的特征向量信息之间的第一差异程度,以及基于所述初始信息与第二训练样本之间的第二差异程度,确定第二损失函数;所述第二训练样本与所述第一训练样本对应;determining a second loss function based on a first degree of difference between the restored feature vector information and the feature vector information of the initial information, and based on a second degree of difference between the initial information and a second training sample; The second training sample corresponds to the first training sample;
    根据所述第二损失函数进行反向传导更新所述第一预设模型和所述第二预设模型。performing reverse conduction according to the second loss function to update the first preset model and the second preset model.
  92. 根据权利要求88或89所述的方法,其中,所述网络设备采用训练样本对第一预设模型和第二预设模型进行联合训练,得到训练后的所述第一模型和所述第二模型,包括:The method according to claim 88 or 89, wherein the network device uses training samples to jointly train the first preset model and the second preset model to obtain the trained first model and the second preset model. models, including:
    所述网络设备采用训练样本对第一预设模型、第二预设模型和第三预设模型进行联合训练,得到训练后的所述第一模型、所述第二模型和第三模型;其中,所述第三模型为训练后的第三预设模型。The network device uses training samples to jointly train the first preset model, the second preset model and the third preset model to obtain the trained first model, the second model and the third model; wherein , the third model is a trained third preset model.
  93. 根据权利要求92所述的方法,其中,所述网络设备采用训练样本对第一预设模型、第二预设模型和第三预设模型进行联合训练,包括:The method according to claim 92, wherein the network device uses training samples to jointly train the first preset model, the second preset model and the third preset model, comprising:
    所述网络设备将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The network device inputs the first training sample into the estimated preset sub-model of the first preset model, and obtains initial information output by the estimated preset sub-model;
    将所述初始信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的信息;inputting the initial information into the compressed preset sub-model of the first preset model to obtain compressed information output by the compressed preset sub-model;
    将所述压缩后的信息输入第三预设模型,得到所述第三预设模型输出的变换后的信息;inputting the compressed information into a third preset model to obtain transformed information output by the third preset model;
    将所述变换后的信息输入所述第二预设模型,得到所述第二预设模型输出的恢复信息;inputting the transformed information into the second preset model to obtain restoration information output by the second preset model;
    基于所述恢复信息与所述初始信息之间的第一差异程度,以及基于所述初始信息与第二训练样本之间的第二差异程度,确定第二损失函数;所述第二训练样本与所述第一训练样本对应;Determine a second loss function based on a first degree of difference between the restored information and the initial information, and based on a second degree of difference between the initial information and a second training sample; the second training sample and The first training sample corresponds to;
    根据所述第二损失函数进行反向传导更新所述第一预设模型、所述第二预设模型和所述第三预设模型。performing reverse conduction according to the second loss function to update the first preset model, the second preset model, and the third preset model.
  94. 根据权利要求92所述的方法,其中,所述网络设备采用训练样本对第一预设模型、第二预设模型和第三预设模型进行联合训练,包括:The method according to claim 92, wherein the network device uses training samples to jointly train the first preset model, the second preset model and the third preset model, comprising:
    所述网络设备将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The network device inputs the first training sample into the estimated preset sub-model of the first preset model, and obtains initial information output by the estimated preset sub-model;
    将所述初始信息输入所述第一预设模型的信道生成预设子模型,得到所述信道生成预设子模型输出的初始信息的特征向量信息;inputting the initial information into the channel generation preset sub-model of the first preset model, and obtaining eigenvector information of the initial information output by the channel generation preset sub-model;
    将所述初始信息的特征向量信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的特征向量信息;Inputting the eigenvector information of the initial information into the compressed preset sub-model of the first preset model to obtain the compressed eigenvector information output by the compressed preset sub-model;
    将所述压缩后的特征向量信息输入第三预设模型,得到所述第三预设模型输出的变换后的特征向量信息;Inputting the compressed feature vector information into a third preset model to obtain transformed feature vector information output by the third preset model;
    将所述变换后的特征向量信息输入所述第二预设模型,得到所述第二预设模型输出的恢复的特征向量信息;inputting the transformed eigenvector information into the second preset model to obtain restored eigenvector information output by the second preset model;
    基于所述恢复的特征向量信息以及所述初始信息的特征向量信息之间的第一差异程度,以及基于所述初始信息与第二训练样本之间的第二差异程度,确定第二损失函数;所述第二训练样本与所述第一训练样本对应;determining a second loss function based on a first degree of difference between the restored feature vector information and the feature vector information of the initial information, and based on a second degree of difference between the initial information and a second training sample; The second training sample corresponds to the first training sample;
    根据所述第二损失函数进行反向传导更新所述第一预设模型、所述第二预设模型和所述第三预设模型。performing reverse conduction according to the second loss function to update the first preset model, the second preset model, and the third preset model.
  95. 根据权利要求80-94任一项所述的方法,其中,所述训练样本中包含第一训练样本。The method according to any one of claims 80-94, wherein said training samples include a first training sample.
  96. 根据权利要求95所述的方法,其中,The method of claim 95, wherein,
    所述第一训练样本分布在第一维度和/或第二维度。The first training samples are distributed in the first dimension and/or the second dimension.
  97. 根据权利要求96所述的方法,其中,The method of claim 96, wherein,
    所述第一维度为时域维度;The first dimension is a time domain dimension;
    所述第一训练样本包括在所述时域维度中的m个时间单元内分布的第一信息样本;m为正整数。The first training samples include first information samples distributed in m time units in the time domain dimension; m is a positive integer.
  98. 根据权利要求96所述的方法,其中,The method of claim 96, wherein,
    所述第二维度为频域维度;The second dimension is a frequency domain dimension;
    所述第一训练样本包括在所述频域维度中的x个频域资源上分布的第一信息样本;x为正整数。The first training samples include first information samples distributed on x frequency domain resources in the frequency domain dimension; x is a positive integer.
  99. 根据权利要求96-98任一项所述的方法,其中,所述第一训练样本还分布在第三维度;The method according to any one of claims 96-98, wherein said first training samples are also distributed in a third dimension;
    所述第三维度为复数维度;所述第一训练样本包括第一信息样本的实部和第一信息样本的虚部。The third dimension is a complex dimension; the first training samples include the real part of the first information sample and the imaginary part of the first information sample.
  100. 根据权利要求95-99任一项所述的方法,其中,所述训练样本中还包括与所述第一训练样本对应的第二训练样本;The method according to any one of claims 95-99, wherein the training samples further include a second training sample corresponding to the first training sample;
    所述第二训练样本由T个维度的矩阵构成;T为大于等于2的整数。The second training sample is composed of a matrix of T dimensions; T is an integer greater than or equal to 2.
  101. 根据权利要求100所述的方法,其中,The method of claim 100, wherein,
    所述T个维度中包含第四维度和第五维度。The T dimensions include a fourth dimension and a fifth dimension.
  102. 根据权利要求101所述的方法,其中,The method of claim 101, wherein,
    所述T个维度的矩阵为M×N的二维矩阵;其中,M表示在第四维度下的第一粒度的数量,N表示在第五维度下的第二粒度的数量;M和N均为正整数。The matrix of the T dimensions is a two-dimensional matrix of M×N; wherein, M represents the quantity of the first granularity under the fourth dimension, and N represents the quantity of the second granularity under the fifth dimension; both M and N are is a positive integer.
  103. 根据权利要求102所述的方法,其中,The method of claim 102, wherein,
    所述二维矩阵中的第ij个位置的数值用于表示在所述第四维度下的第i个第一粒度以及第五维度的第j个第二粒度下的信道质量;i和j均为正整数。The value at the ijth position in the two-dimensional matrix is used to represent the channel quality at the i-th first granularity in the fourth dimension and the j-th second granularity in the fifth dimension; both i and j are is a positive integer.
  104. 根据权利要求101-103任一项所述的方法,其中,所述T个维度中还包括第六维度。The method according to any one of claims 101-103, wherein the T dimensions further include a sixth dimension.
  105. 根据权利要求104所述的方法,其中,The method of claim 104, wherein,
    所述T个维度的矩阵为M×N×W的三维矩阵;其中,M表示在第四维度下的第一粒度的数量,N表示在第五维度下的第二粒度的数量,W表示在第六维度下的第三粒度的数量;M、N和W均为正整数。The matrix of T dimensions is a three-dimensional matrix of M×N×W; wherein, M represents the number of first granularities in the fourth dimension, N represents the number of second granularities in the fifth dimension, and W represents the number of granularities in the fifth dimension. The quantity of the third granularity under the sixth dimension; M, N and W are all positive integers.
  106. 根据权利要求105所述的方法,其中,The method of claim 105, wherein,
    所述三维矩阵中的第ijk个位置的数值用于表示在所述第四维度下的第i个第一粒度、第五维度的第j个第二粒度下、所述第六维度的第k个第三粒度下的信道质量;i、j和k均为正整数。The value of the ijkth position in the three-dimensional matrix is used to represent the i-th first granularity in the fourth dimension, the j-th second granularity in the fifth dimension, and the k-th in the sixth dimension channel quality at the third granularity; i, j and k are all positive integers.
  107. 根据权利要求101-106任一项所述的方法,其中,The method according to any one of claims 101-106, wherein,
    所述第四维度为频域维度;所述第一粒度包含以下之一:L1个资源块RB,L2个子载波;L1和L2为正整数。The fourth dimension is a frequency domain dimension; the first granularity includes one of the following: L1 resource blocks RB, L2 subcarriers; L1 and L2 are positive integers.
  108. 根据权利要求101-106任一项所述的方法,其中,The method according to any one of claims 101-106, wherein,
    所述第四维度为时域维度;所述第一粒度包含以下之一:K1个微秒、K2个符号长度、K3个符号的采样点个数;K1、K2和K3为正整数。The fourth dimension is a time domain dimension; the first granularity includes one of the following: K1 microseconds, K2 symbol length, K3 number of sampling points of symbols; K1, K2 and K3 are positive integers.
  109. 根据权利要求108所述的方法,其中,所述符号为OFDM符号。The method of claim 108, wherein the symbols are OFDM symbols.
  110. 根据权利要求101-109任一项所述的方法,其中,所述第五维度为空间域维度;The method according to any one of claims 101-109, wherein the fifth dimension is a spatial domain dimension;
    所述第二粒度为一对收发天线或到达角度的间隔。The second granularity is an interval between a pair of transmitting and receiving antennas or an angle of arrival.
  111. 根据权利要求104-110任一项所述的方法,其中,The method according to any one of claims 104-110, wherein,
    所述第六维度为复数维度;所述第三粒度为1,在所述复数维度下的第三粒度的数量W为2。The sixth dimension is a complex dimension; the third granularity is 1, and the quantity W of the third granularity under the complex dimension is 2.
  112. 根据权利要求111所述的方法,其中,所述k为第一值的情况下,所述三维矩阵中的第ijk个位置的数值用于表示在所述第四维度下的第i个第一粒度、所述第五维度的第j个第二粒度下的信道质量的实部;The method according to claim 111, wherein, when the k is the first value, the value of the ijkth position in the three-dimensional matrix is used to represent the ith first in the fourth dimension Granularity, the real part of the channel quality at the jth second granularity of the fifth dimension;
    所述k为第二值的情况下,所述三维矩阵中的第ijk个位置的数值用于表示在所述第四维度下的第i个第一粒度、所述第五维度的第j个第二粒度下的信道质量的虚部。When the k is the second value, the value of the ijkth position in the three-dimensional matrix is used to represent the i-th first granularity in the fourth dimension and the j-th granularity in the fifth dimension The imaginary part of the channel quality at the second granularity.
  113. 根据权利要求80-112任一项所述的方法,其中,所述方法还包括:The method according to any one of claims 80-112, wherein the method further comprises:
    所述网络设备发送所述第二模型。The network device sends the second model.
  114. 根据权利要求80-113任一项所述的方法,其中,所述方法还包括:The method according to any one of claims 80-113, wherein the method further comprises:
    所述网络设备发送所述第一模型。The network device sends the first model.
  115. 根据权利要求80-113任一项所述的方法,其中,所述方法还包括:The method according to any one of claims 80-113, wherein the method further comprises:
    所述网络设备发送所述估计子模型以及所述压缩子模型。The network device sends the estimated sub-model and the compressed sub-model.
  116. 根据权利要求80-113任一项所述的方法,其中,所述方法还包括:The method according to any one of claims 80-113, wherein the method further comprises:
    所述网络设备发送所述估计子模型、所述压缩子模型以及所述信道生成子模型。The network device sends the estimation sub-model, the compression sub-model and the channel generation sub-model.
  117. 根据权利要求80-116任一项所述的方法,其中,所述方法还包括:The method according to any one of claims 80-116, wherein the method further comprises:
    所述网络设备发送所述第三模型。The network device sends the third model.
  118. 根据权利要求113-117任一项所述的方法,其中,所述第一模型由以下至少之一携带:下行控制信令、MAC CE消息、RRC消息、广播消息、下行数据传输、针对人工智能类业务传输需求的下行数据传输;The method according to any one of claims 113-117, wherein the first model is carried by at least one of the following: downlink control signaling, MAC CE message, RRC message, broadcast message, downlink data transmission, artificial intelligence Downlink data transmission required for similar business transmission;
    所述第二模型由以下至少之一携带:下行控制信令、MAC CE消息、RRC消息、广播消息、下行数据传输、针对人工智能类业务传输需求的下行数据传输;The second model is carried by at least one of the following: downlink control signaling, MAC CE message, RRC message, broadcast message, downlink data transmission, downlink data transmission for artificial intelligence business transmission requirements;
    所述第三模型由以下至少之一携带:下行控制信令、MAC CE消息、RRC消息、广播消息、下行数据传输、针对人工智能类业务传输需求的下行数据传输;The third model is carried by at least one of the following: downlink control signaling, MAC CE message, RRC message, broadcast message, downlink data transmission, downlink data transmission for artificial intelligence business transmission requirements;
    所述估计子模型由以下至少之一携带:下行控制信令、MAC CE消息、RRC消息、广播消息、下行数据传输、针对人工智能类业务传输需求的下行数据传输;The estimated sub-model is carried by at least one of the following: downlink control signaling, MAC CE message, RRC message, broadcast message, downlink data transmission, downlink data transmission for artificial intelligence business transmission requirements;
    所述压缩子模型由以下至少之一携带:下行控制信令、MAC CE消息、RRC消息、广播消息、下行数据传输、针对人工智能类业务传输需求的下行数据传输;The compressed sub-model is carried by at least one of the following: downlink control signaling, MAC CE message, RRC message, broadcast message, downlink data transmission, downlink data transmission for artificial intelligence business transmission requirements;
    所述信道生成子模型由以下至少之一携带:下行控制信令、MAC CE消息、RRC消息、广播消息、下行数据传输、针对人工智能类业务传输需求的下行数据传输。The channel generation sub-model is carried by at least one of the following: downlink control signaling, MAC CE message, RRC message, broadcast message, downlink data transmission, downlink data transmission for artificial intelligence business transmission requirements.
  119. 一种模型生成方法,包括:A method of model generation comprising:
    采用训练样本对第一预设模型和第二预设模型进行联合训练,得到训练后的第一模型和第二模型;performing joint training on the first preset model and the second preset model by using the training samples to obtain the trained first model and the second model;
    其中,所述第一模型为训练后的所述第一预设模型,所述第二模型为训练后的所述第二预设模型;所述第一模型用于对第一信息进行处理得到第二信息;所述第二模型用于对所述第二信息进行处理得到信道信息。Wherein, the first model is the first preset model after training, and the second model is the second preset model after training; the first model is used to process the first information to obtain Second information; the second model is used to process the second information to obtain channel information.
  120. 根据权利要求119所述的方法,其中,所述训练采用的损失函数为第一损失函数;The method according to claim 119, wherein the loss function used in the training is a first loss function;
    所述第一损失函数为基于所述第二预设模型的输出信息与所述第一预设模型的压缩预设子模型的输入信息之间的差异程度构建的。The first loss function is constructed based on the degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model.
  121. 根据权利要求120所述的方法,其中,所述第二预设模型的输出信息与所述压缩预设子模型的输入信息之间的差异程度为基于距离确定的,或者为基于相似程度确定的。The method according to claim 120, wherein the degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model is determined based on a distance, or is determined based on a degree of similarity .
  122. 根据权利要求120或121所述的方法,其中,所述采用训练样本对第一预设模型和第二预设模型进行联合训练,包括:The method according to claim 120 or 121, wherein the joint training of the first preset model and the second preset model using training samples comprises:
    将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;inputting the first training sample into the estimated preset sub-model of the first preset model, and obtaining initial information output by the estimated preset sub-model;
    将所述初始信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的信息;inputting the initial information into the compressed preset sub-model of the first preset model to obtain compressed information output by the compressed preset sub-model;
    将所述压缩后的信息输入所述第二预设模型,得到所述第二预设模型输出的恢复信息;inputting the compressed information into the second preset model to obtain restoration information output by the second preset model;
    基于所述恢复信息以及所述初始信息之间的差异程度,确定所述第一损失函数;determining the first loss function based on the degree of difference between the restoration information and the initial information;
    根据所述第一损失函数进行反向传导更新所述第一预设模型和所述第二预设模型。performing reverse conduction according to the first loss function to update the first preset model and the second preset model.
  123. 根据权利要求120或121所述的方法,其中,所述采用训练样本对第一预设模型和第二预设模型进行联合训练,包括:The method according to claim 120 or 121, wherein the joint training of the first preset model and the second preset model using training samples comprises:
    将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;inputting the first training sample into the estimated preset sub-model of the first preset model, and obtaining initial information output by the estimated preset sub-model;
    将所述初始信息输入所述第一预设模型的信道生成预设子模型,得到所述信道生成预设子模型输出的初始信息的特征向量信息;inputting the initial information into the channel generation preset sub-model of the first preset model, and obtaining eigenvector information of the initial information output by the channel generation preset sub-model;
    将所述初始信息的特征向量信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的特征向量信息;Inputting the eigenvector information of the initial information into the compressed preset sub-model of the first preset model to obtain the compressed eigenvector information output by the compressed preset sub-model;
    将所述压缩后的特征向量信息输入所述第二预设模型,得到所述第二预设模型输出的恢复的特征向量信息;inputting the compressed feature vector information into the second preset model to obtain restored feature vector information output by the second preset model;
    基于所述恢复的特征向量信息以及所述初始信息的特征向量信息之间的差异程度,确定第一损失函数;determining a first loss function based on the degree of difference between the restored eigenvector information and the eigenvector information of the initial information;
    根据所述第一损失函数进行反向传导更新所述第一预设模型和所述第二预设模型。performing reverse conduction according to the first loss function to update the first preset model and the second preset model.
  124. 根据权利要求120或121所述的方法,其中,所述采用训练样本对第一预设模型和第二预设模型进行联合训练,得到训练后的所述第一模型和所述第二模型,包括:The method according to claim 120 or 121, wherein said training samples are used to jointly train the first preset model and the second preset model to obtain the trained first model and the second model, include:
    采用训练样本对第一预设模型、第二预设模型和第三预设模型进行联合训练,得到训练后的所述第一模型、所述第二模型和第三模型;其中,所述第三模型为训练后的第三预设模型。Using training samples to jointly train the first preset model, the second preset model and the third preset model to obtain the trained first model, the second model and the third model; wherein, the first preset model The three models are the third preset models after training.
  125. 根据权利要求124所述的方法,其中,所述采用训练样本对第一预设模型、第二预设模型和第三预设模型进行联合训练,包括:The method according to claim 124, wherein the joint training of the first preset model, the second preset model and the third preset model using training samples comprises:
    将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;inputting the first training sample into the estimated preset sub-model of the first preset model, and obtaining initial information output by the estimated preset sub-model;
    将所述初始信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的信息;inputting the initial information into the compressed preset sub-model of the first preset model to obtain compressed information output by the compressed preset sub-model;
    将所述压缩后的信息输入第三预设模型,得到所述第三预设模型输出的变换后的信息;inputting the compressed information into a third preset model to obtain transformed information output by the third preset model;
    将所述变换后的信息输入所述第二预设模型,得到所述第二预设模型输出的恢复信息;inputting the transformed information into the second preset model to obtain restoration information output by the second preset model;
    基于所述恢复信息以及所述初始信息之间的差异程度,确定第一损失函数;determining a first loss function based on the degree of difference between the restoration information and the initial information;
    根据所述第一损失函数进行反向传导更新所述第一预设模型、第二预设模型和第三预设模型。performing reverse conduction according to the first loss function to update the first preset model, the second preset model and the third preset model.
  126. 根据权利要求124所述的方法,其中,所述采用训练样本对第一预设模型、第二预设模型和第三预设模型进行联合训练,包括:The method according to claim 124, wherein the joint training of the first preset model, the second preset model and the third preset model using training samples comprises:
    将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;inputting the first training sample into the estimated preset sub-model of the first preset model, and obtaining initial information output by the estimated preset sub-model;
    将所述初始信息输入所述第一预设模型的信道生成预设子模型,得到所述信道生成预设子模型输出的初始信息的特征向量信息;inputting the initial information into the channel generation preset sub-model of the first preset model, and obtaining eigenvector information of the initial information output by the channel generation preset sub-model;
    将所述初始信息的特征向量信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的特征向量信息;Inputting the eigenvector information of the initial information into the compressed preset sub-model of the first preset model to obtain the compressed eigenvector information output by the compressed preset sub-model;
    将所述压缩后的特征向量信息输入第三预设模型,得到所述第三预设模型输出的变换后的特征向量信息;Inputting the compressed feature vector information into a third preset model to obtain transformed feature vector information output by the third preset model;
    将所述变换后的特征向量信息输入所述第二预设模型,得到所述第二预设模型输出的恢复的特征向量信息;inputting the transformed eigenvector information into the second preset model to obtain restored eigenvector information output by the second preset model;
    基于所述恢复的特征向量信息以及所述初始信息的特征向量信息之间的差异程度,确定第一损失函数;determining a first loss function based on the degree of difference between the restored eigenvector information and the eigenvector information of the initial information;
    根据所述第一损失函数进行反向传导更新所述第一预设模型、第二预设模型和第三预设模型。performing reverse conduction according to the first loss function to update the first preset model, the second preset model and the third preset model.
  127. 根据权利要求119所述的方法,其中,所述训练采用的损失函数为第二损失函数;The method according to claim 119, wherein the loss function used in the training is a second loss function;
    所述第二损失函数为基于所述第二预设模型的输出信息与所述第一预设模型的压缩预设子模型的输入信息之间的第一差异程度以及所述第一预设模型的估计预设子模型的输出信息与第二训练样本之间的第二差异程度构建的;其中,所述第二训练样本与输入所述估计预设子模型的第一训练样本相对应。The second loss function is based on the first degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model and the first preset model The second difference degree between the output information of the estimated preset sub-model and the second training sample is constructed; wherein, the second training sample corresponds to the first training sample input into the estimated preset sub-model.
  128. 根据权利要求127所述的方法,其中,所述第一差异程度为基于距离确定的,或者为基于相似程度确定的;和/或,The method of claim 127, wherein the first degree of difference is determined based on a distance, or determined based on a degree of similarity; and/or,
    所述第二差异程度为基于距离确定的,或者为基于相似程度确定的。The second degree of difference is determined based on distance, or determined based on similarity.
  129. 根据权利要求127或128所述的方法,其中,采用训练样本对第一预设模型、第二预设模型和第三预设模型进行联合训练,包括:The method according to claim 127 or 128, wherein the joint training of the first preset model, the second preset model and the third preset model using training samples comprises:
    将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;inputting the first training sample into the estimated preset sub-model of the first preset model, and obtaining initial information output by the estimated preset sub-model;
    将所述初始信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的信息;inputting the initial information into the compressed preset sub-model of the first preset model to obtain compressed information output by the compressed preset sub-model;
    将所述压缩后的信息输入所述第二预设模型,得到所述第二预设模型输出的恢复信息;inputting the compressed information into the second preset model to obtain restoration information output by the second preset model;
    基于所述恢复信息与所述初始信息之间的第一差异程度,以及基于所述初始信息与第二训练样本之间的第二差异程度,确定第二损失函数;所述第二训练样本与所述第一训练样本对应;Determine a second loss function based on a first degree of difference between the restored information and the initial information, and based on a second degree of difference between the initial information and a second training sample; the second training sample and The first training sample corresponds to;
    根据所述第二损失函数进行反向传导更新所述第一预设模型和所述第二预设模型。performing reverse conduction according to the second loss function to update the first preset model and the second preset model.
  130. 根据权利要求127或128所述的方法,其中,所述采用训练样本对第一预设模型和第二预设模型进行联合训练,包括:The method according to claim 127 or 128, wherein the joint training of the first preset model and the second preset model using training samples comprises:
    将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;inputting the first training sample into the estimated preset sub-model of the first preset model, and obtaining initial information output by the estimated preset sub-model;
    将所述初始信息输入所述第一预设模型的信道生成预设子模型,得到所述信道生成预设子模型输出的初始信息的特征向量信息;inputting the initial information into the channel generation preset sub-model of the first preset model, and obtaining eigenvector information of the initial information output by the channel generation preset sub-model;
    将所述初始信息的特征向量信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的特征向量信息;Inputting the eigenvector information of the initial information into the compressed preset sub-model of the first preset model to obtain the compressed eigenvector information output by the compressed preset sub-model;
    将所述压缩后的特征向量信息输入所述第二预设模型,得到所述第二预设模型输出的恢复的特征向量信息;inputting the compressed feature vector information into the second preset model to obtain restored feature vector information output by the second preset model;
    基于所述恢复的特征向量信息以及所述初始信息的特征向量信息之间的第一差异程度,以及基于所述初始信息与第二训练样本之间的第二差异程度,确定第二损失函数;所述第二训练样本与所述第一训练样本对应;determining a second loss function based on a first degree of difference between the restored feature vector information and the feature vector information of the initial information, and based on a second degree of difference between the initial information and a second training sample; The second training sample corresponds to the first training sample;
    根据所述第二损失函数进行反向传导更新所述第一预设模型和所述第二预设模型。performing reverse conduction according to the second loss function to update the first preset model and the second preset model.
  131. 根据权利要求127或128所述的方法,其中,所述采用训练样本对第一预设模型和第二预设模型进行联合训练,得到训练后的所述第一模型和所述第二模型,包括:The method according to claim 127 or 128, wherein said training samples are used to jointly train the first preset model and the second preset model to obtain the trained first model and the second model, include:
    采用训练样本对第一预设模型、第二预设模型和第三预设模型进行联合训练,得到训练后的所述第一模型、所述第二模型和第三模型;其中,所述第三模型为训练后的第三预设模型。Using training samples to jointly train the first preset model, the second preset model and the third preset model to obtain the trained first model, the second model and the third model; wherein, the first preset model The three models are the third preset models after training.
  132. 根据权利要求131所述的方法,其中,所述采用训练样本对第一预设模型、第二预设模型和第三预设模型进行联合训练,包括:The method according to claim 131, wherein the joint training of the first preset model, the second preset model and the third preset model using training samples comprises:
    将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;inputting the first training sample into the estimated preset sub-model of the first preset model, and obtaining initial information output by the estimated preset sub-model;
    将所述初始信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的信息;inputting the initial information into the compressed preset sub-model of the first preset model to obtain compressed information output by the compressed preset sub-model;
    将所述压缩后的信息输入第三预设模型,得到所述第三预设模型输出的变换后的信息;inputting the compressed information into a third preset model to obtain transformed information output by the third preset model;
    将所述变换后的信息输入所述第二预设模型,得到所述第二预设模型输出的恢复信息;inputting the transformed information into the second preset model to obtain restoration information output by the second preset model;
    基于所述恢复信息与所述初始信息之间的第一差异程度,以及基于所述初始信息与第二训练样本之间的第二差异程度,确定第二损失函数;所述第二训练样本与所述第一训练样本对应;Determine a second loss function based on a first degree of difference between the restored information and the initial information, and based on a second degree of difference between the initial information and a second training sample; the second training sample and The first training sample corresponds to;
    根据所述第二损失函数进行反向传导更新所述第一预设模型、所述第二预设模型和所述第三预设模型。performing reverse conduction according to the second loss function to update the first preset model, the second preset model, and the third preset model.
  133. 根据权利要求131所述的方法,其中,所述采用训练样本对第一预设模型、第二预设模型和第三预设模型进行联合训练,包括:The method according to claim 131, wherein the joint training of the first preset model, the second preset model and the third preset model using training samples comprises:
    将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;inputting the first training sample into the estimated preset sub-model of the first preset model, and obtaining initial information output by the estimated preset sub-model;
    将所述初始信息输入所述第一预设模型的信道生成预设子模型,得到所述信道生成预设子模型输出的初始信息的特征向量信息;inputting the initial information into the channel generation preset sub-model of the first preset model, and obtaining eigenvector information of the initial information output by the channel generation preset sub-model;
    将所述初始信息的特征向量信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的特征向量信息;Inputting the eigenvector information of the initial information into the compressed preset sub-model of the first preset model to obtain the compressed eigenvector information output by the compressed preset sub-model;
    将所述压缩后的特征向量信息输入第三预设模型,得到所述第三预设模型输出的变换后的特征向量信息;Inputting the compressed feature vector information into a third preset model to obtain transformed feature vector information output by the third preset model;
    将所述变换后的特征向量信息输入所述第二预设模型,得到所述第二预设模型输出的恢复的特征向量信息;inputting the transformed eigenvector information into the second preset model to obtain restored eigenvector information output by the second preset model;
    基于所述恢复的特征向量信息以及所述初始信息的特征向量信息之间的第一差异程度,以及基于所述初始信息与第二训练样本之间的第二差异程度,确定第二损失函数;所述第二训练样本与所述第一训练样本对应;determining a second loss function based on a first degree of difference between the restored feature vector information and the feature vector information of the initial information, and based on a second degree of difference between the initial information and a second training sample; The second training sample corresponds to the first training sample;
    根据所述第二损失函数进行反向传导更新所述第一预设模型和所述第二预设模型。performing reverse conduction according to the second loss function to update the first preset model and the second preset model.
  134. 根据权利要求119-133任一项所述的方法,其中,所述训练样本中包含第一训练样本。The method according to any one of claims 119-133, wherein the training samples include a first training sample.
  135. 根据权利要求134所述的方法,其中,The method of claim 134, wherein,
    所述第一训练样本分布在第一维度和/或第二维度。The first training samples are distributed in the first dimension and/or the second dimension.
  136. 根据权利要求134所述的方法,其中,The method of claim 134, wherein,
    所述第一维度为时域维度;所述第一训练样本包括在所述时域维度中的m个时间单元内分布的第一信息样本;m为正整数。The first dimension is a time domain dimension; the first training samples include first information samples distributed in m time units in the time domain dimension; m is a positive integer.
  137. 根据权利要求134所述的方法,其中,The method of claim 134, wherein,
    所述第二维度为频域维度;所述第一训练样本包括在所述频域维度中的x个频域资源上分布的第一信息样本;x为正整数。The second dimension is a frequency domain dimension; the first training samples include first information samples distributed on x frequency domain resources in the frequency domain dimension; x is a positive integer.
  138. 根据权利要求135-137任一项所述的方法,其中,所述第一训练样本还分布在第三维度;The method according to any one of claims 135-137, wherein said first training samples are also distributed in a third dimension;
    所述第三维度为复数维度;所述第一训练样本包括第一信息样本的实部和第一信息样本的虚部。The third dimension is a complex dimension; the first training samples include the real part of the first information sample and the imaginary part of the first information sample.
  139. 根据权利要求134-138任一项所述的方法,其中,所述训练样本中还包含与所述第一训练样本对应的第二训练样本;The method according to any one of claims 134-138, wherein the training samples further include a second training sample corresponding to the first training sample;
    所述第二训练样本由T个维度的矩阵构成;T为大于等于2的整数。The second training sample is composed of a matrix of T dimensions; T is an integer greater than or equal to 2.
  140. 根据权利要求139所述的方法,其中,The method of claim 139, wherein,
    所述T个维度中包含第四维度和第五维度。The T dimensions include a fourth dimension and a fifth dimension.
  141. 根据权利要求140所述的方法,其中,The method of claim 140, wherein,
    所述T个维度的矩阵为M×N的二维矩阵;其中,M表示在第四维度下的第一粒度的数量,N表示在第五维度下的第二粒度的数量;M和N均为正整数。The matrix of the T dimensions is a two-dimensional matrix of M×N; wherein, M represents the quantity of the first granularity under the fourth dimension, and N represents the quantity of the second granularity under the fifth dimension; both M and N are is a positive integer.
  142. 根据权利要求141所述的方法,其中,The method of claim 141, wherein,
    所述二维矩阵中的第ij个位置的数值用于表示在所述第四维度下的第i个第一粒度以及第五维度的第j个第二粒度下的信道质量;i和j均为正整数。The value at the ijth position in the two-dimensional matrix is used to represent the channel quality at the i-th first granularity in the fourth dimension and the j-th second granularity in the fifth dimension; both i and j are is a positive integer.
  143. 根据权利要求140-142任一项所述的方法,其中,所述T个维度中还包括第六维度。The method according to any one of claims 140-142, wherein the T dimensions further include a sixth dimension.
  144. 根据权利要求143所述的方法,其中,The method of claim 143, wherein,
    所述T个维度的矩阵为M×N×W的三维矩阵;其中,M表示在第四维度下的第一粒度的数量,N表示在第五维度下的第二粒度的数量,W表示在第六维度下的第三粒度的数量;M、N和W均为正整数。The matrix of T dimensions is a three-dimensional matrix of M×N×W; wherein, M represents the number of first granularities in the fourth dimension, N represents the number of second granularities in the fifth dimension, and W represents the number of granularities in the fifth dimension. The quantity of the third granularity under the sixth dimension; M, N and W are all positive integers.
  145. 根据权利要求144所述的方法,其中,The method of claim 144, wherein,
    所述三维矩阵中的第ijk个位置的数值用于表示在所述第四维度下的第i个第一粒度、第五维度的第j个第二粒度下、所述第六维度的第k个第三粒度下的信道质量;i、j和k均为正整数。The value of the ijkth position in the three-dimensional matrix is used to represent the i-th first granularity in the fourth dimension, the j-th second granularity in the fifth dimension, and the k-th in the sixth dimension channel quality at the third granularity; i, j and k are all positive integers.
  146. 根据权利要求140-145任一项所述的方法,其中,The method according to any one of claims 140-145, wherein,
    所述第四维度为频域维度;所述第一粒度包含以下之一:L1个资源块RB,L2个子载波;L1和L2为正整数。The fourth dimension is a frequency domain dimension; the first granularity includes one of the following: L1 resource blocks RB, L2 subcarriers; L1 and L2 are positive integers.
  147. 根据权利要求140-145任一项所述的方法,其中,The method according to any one of claims 140-145, wherein,
    所述第四维度为时域维度;所述第一粒度包含以下之一:K1个微秒、K2个符号长度、K3个符号的采样点个数;K1、K2和K3为正整数。The fourth dimension is a time domain dimension; the first granularity includes one of the following: K1 microseconds, K2 symbol length, K3 number of sampling points of symbols; K1, K2 and K3 are positive integers.
  148. 根据权利要求147所述的方法,其中,所述符号为OFDM符号。The method of claim 147, wherein the symbols are OFDM symbols.
  149. 根据权利要求140-148任一项所述的方法,其中,所述第五维度为空间域维度;The method of any one of claims 140-148, wherein the fifth dimension is a spatial domain dimension;
    所述第二粒度为一对收发天线或到达角度的间隔。The second granularity is an interval between a pair of transmitting and receiving antennas or an angle of arrival.
  150. 根据权利要求143-149任一项所述的方法,其中,The method according to any one of claims 143-149, wherein,
    所述第六维度为复数维度;所述第三粒度为1,在所述复数维度下的第三粒度的数量W为2。The sixth dimension is a complex dimension; the third granularity is 1, and the quantity W of the third granularity under the complex dimension is 2.
  151. 根据权利要求150所述的方法,其中,所述k为第一值的情况下,所述三维矩阵中的第ijk个位置的数值用于表示在所述第四维度下的第i个第一粒度、所述第五维度的第j个第二粒度下的信道质量的实部;The method according to claim 150, wherein, when the k is the first value, the value of the ijkth position in the three-dimensional matrix is used to represent the ith first in the fourth dimension Granularity, the real part of the channel quality at the jth second granularity of the fifth dimension;
    所述k为第二值的情况下,所述三维矩阵中的第ijk个位置的数值用于表示在所述第四维度下的第i个第一粒度、所述第五维度的第j个第二粒度下的信道质量的虚部。When the k is the second value, the value of the ijkth position in the three-dimensional matrix is used to represent the i-th first granularity in the fourth dimension and the j-th granularity in the fifth dimension The imaginary part of the channel quality at the second granularity.
  152. 一种终端设备,包括:A terminal device comprising:
    第一通信单元,用于接收第一信息;发送基于第一信息得到的第二信息;a first communication unit, configured to receive first information; send second information obtained based on the first information;
    其中,所述第二信息为所述第一信息经由第一模型处理得到的,所述第二信息用于经由第二模型进行处理以得到信道信息;所述第一模型和第二模型为联合训练得到的。Wherein, the second information is obtained by processing the first information through the first model, and the second information is used for processing through the second model to obtain channel information; the first model and the second model are a joint obtained by training.
  153. 根据权利要求152所述的终端设备,其中,所述第二信息为信道压缩信息;The terminal device according to claim 152, wherein the second information is channel compression information;
    所述第一模型用于基于输入的所述第一信息进行处理得到信道压缩信息。The first model is used to process the input first information to obtain channel compression information.
  154. 根据权利要求153所述的终端设备,其中,所述第一模型包括:估计子模型和压缩子模型;The terminal device according to claim 153, wherein the first model comprises: an estimation sub-model and a compression sub-model;
    其中,所述估计子模型用于基于所述第一信息进行信道估计得到信道估计信息;Wherein, the estimation sub-model is used to perform channel estimation based on the first information to obtain channel estimation information;
    所述压缩子模型用于对所述信道估计信息进行压缩得到信道压缩信息。The compression sub-model is used to compress the channel estimation information to obtain channel compression information.
  155. 根据权利要求154所述的终端设备,其中,所述终端设备还包括:The terminal device according to claim 154, wherein the terminal device further comprises:
    第一处理单元,用于将所述第一信息输入所述估计子模型,得到所述估计子模型输出的信道估计信息;将所述信道估计信息输入所述压缩子模型,得到所述压缩子模型输出的信道压缩信息。The first processing unit is configured to input the first information into the estimation sub-model to obtain channel estimation information output by the estimation sub-model; input the channel estimation information to the compression sub-model to obtain the compression sub-model Channel compression information for the model output.
  156. 根据权利要求152-155任一项所述的终端设备,其中,所述第一信息为参考信号。The terminal device according to any one of claims 152-155, wherein the first information is a reference signal.
  157. 根据权利要求152所述的终端设备,其中,所述第二信息为信道压缩信息;所述信道压缩信息包含压缩的信道估计信息的特征向量信息;The terminal device according to claim 152, wherein the second information is channel compression information; the channel compression information includes eigenvector information of compressed channel estimation information;
    所述第一模型用于对输入的所述第一信息进行处理得到压缩的信道估计信息的特征向量信息。The first model is used to process the input first information to obtain eigenvector information of compressed channel estimation information.
  158. 根据权利要求157所述的终端设备,其中,所述第一模型包括:估计子模型、信道生成子模型和压缩子模型;The terminal device according to claim 157, wherein the first model comprises: an estimation submodel, a channel generation submodel, and a compression submodel;
    其中,所述估计子模型用于对所述第一信息进行信道估计得到信道估计信息;Wherein, the estimation sub-model is used to perform channel estimation on the first information to obtain channel estimation information;
    所述信道生成子模型用于对所述信道估计信息进行特征分解得到信道估计信息的特征向量信息;The channel generation sub-model is used to perform eigendecomposition on the channel estimation information to obtain eigenvector information of the channel estimation information;
    所述压缩子模型用于对所述信道估计信息的特征向量信息进行压缩得到压缩的信道估计信息的特征向量信息。The compression sub-model is used to compress the eigenvector information of the channel estimation information to obtain compressed eigenvector information of the channel estimation information.
  159. 根据权利要求158所述的终端设备,其中,The terminal device according to claim 158, wherein,
    所述信道信息的特征向量信息包含R组特征向量序列信息;R为正整数。The eigenvector information of the channel information includes R groups of eigenvector sequence information; R is a positive integer.
  160. 根据权利要求158或159所述的终端设备,其中,所述终端设备还包括:The terminal device according to claim 158 or 159, wherein the terminal device further comprises:
    第一处理单元,用于将所述第一信息输入所述估计子模型,得到所述估计子模型输出的信道估计信息;将所述信道估计信息输入所述信道生成子模型,得到所述信道生成子模型输出的信道估计信息的特征向量信息;将所述信道估计信息的特征向量信息输入所述压缩子模型,得到所述压缩子模型输出的压缩的信道估计信息的特征向量信息。A first processing unit, configured to input the first information into the estimation sub-model to obtain channel estimation information output by the estimation sub-model; input the channel estimation information to the channel generation sub-model to obtain the channel generating eigenvector information of channel estimation information output by the sub-model; inputting the eigenvector information of the channel estimation information into the compression sub-model to obtain the eigenvector information of the compressed channel estimation information output by the compression sub-model.
  161. 根据权利要求156-160任一项所述的终端设备,其中,所述第一信息为参考信号;所述信道信息为所述信道信息的特征向量信息。The terminal device according to any one of claims 156-160, wherein the first information is a reference signal; and the channel information is eigenvector information of the channel information.
  162. 根据权利要求152-161任一项所述的终端设备,其中,所述第一通信单元,用于接收所述第一模型。The terminal device according to any one of claims 152-161, wherein the first communication unit is configured to receive the first model.
  163. 根据权利要求162所述的终端设备,其中,所述第一模型由以下至少之一携带:下行控制信令、媒体接入控制MAC控制元素CE消息、无线资源控制RRC消息、广播消息、下行数据传输、针对人工智能类业务传输需求的下行数据传输。The terminal device according to claim 162, wherein the first model is carried by at least one of the following: downlink control signaling, media access control MAC control element CE message, radio resource control RRC message, broadcast message, downlink data Transmission, downlink data transmission for artificial intelligence business transmission requirements.
  164. 根据权利要求152-161任一项所述的终端设备,其中,所述第一通信单元,用于所述终端设备接收估计子模型以及压缩子模型;The terminal device according to any one of claims 152-161, wherein the first communication unit is used for the terminal device to receive the estimated sub-model and the compressed sub-model;
    所述第一处理单元,用于基于所述估计子模型以及所述压缩子模型,生成所述第一模型。The first processing unit is configured to generate the first model based on the estimated sub-model and the compressed sub-model.
  165. 根据权利要求152-161任一项所述的终端设备,其中,所述第一通信单元,用于接收估计子模型、压缩子模型以及信道生成子模型;The terminal device according to any one of claims 152-161, wherein the first communication unit is configured to receive the estimation sub-model, the compression sub-model and the channel generation sub-model;
    所述第一处理单元,用于基于所述估计子模型、所述压缩子模型以及所述信道生成子模型,生成所述第一模型。The first processing unit is configured to generate the first model based on the estimation sub-model, the compression sub-model and the channel generation sub-model.
  166. 根据权利要求164或165所述的终端设备,其中,所述估计子模型由以下之一携带:下行控制信令、MAC CE消息、RRC消息、广播消息、下行数据传输、针对人工智能类业务传输需求的下行数据传输;The terminal device according to claim 164 or 165, wherein the estimation sub-model is carried by one of the following: downlink control signaling, MAC CE message, RRC message, broadcast message, downlink data transmission, transmission for artificial intelligence services Required downlink data transmission;
    所述压缩子模型由以下之一携带:下行控制信令、MAC CE消息、RRC消息、广播消息、下行数据传输、针对人工智能类业务传输需求的下行数据传输;The compressed sub-model is carried by one of the following: downlink control signaling, MAC CE message, RRC message, broadcast message, downlink data transmission, downlink data transmission for artificial intelligence business transmission requirements;
    所述信道生成子模型由以下之一携带:下行控制信令、MAC CE消息、RRC消息、广播消息、下行数据传输、针对人工智能类业务传输需求的下行数据传输。The channel generation sub-model is carried by one of the following: downlink control signaling, MAC CE message, RRC message, broadcast message, downlink data transmission, downlink data transmission for artificial intelligence business transmission requirements.
  167. 根据权利要求162-166任一项所述的终端设备,其中,所述第一通信单元,用于接收所述第二模型。The terminal device according to any one of claims 162-166, wherein the first communication unit is configured to receive the second model.
  168. 根据权利要求162-167任一项所述的终端设备,其中,所述第一通信单元,用于接收所述第三模型。The terminal device according to any one of claims 162-167, wherein the first communication unit is configured to receive the third model.
  169. 根据权利要求168所述的终端设备,其中,所述第三模型用于对所述第一模型输出的第二信息进行数据变换处理后输入所述第二模型;The terminal device according to claim 168, wherein the third model is used to input the second information output by the first model into the second model after data conversion processing;
    所述第一模型、第二模型以及第三模型为联合训练得到的。The first model, the second model and the third model are obtained through joint training.
  170. 根据权利要求169所述的终端设备,其中,所述数据变换处理包括卷积处理或傅里叶变换处理。The terminal device according to claim 169, wherein said data transformation processing comprises convolution processing or Fourier transform processing.
  171. 根据权利要求152-161任一项所述的终端设备,其中,所述第一处理单元,用于采用训练样本对第一预设模型和第二预设模型进行联合训练,得到训练后的所述第一模型和所述第二模型;The terminal device according to any one of claims 152-161, wherein the first processing unit is configured to use training samples to jointly train the first preset model and the second preset model to obtain all said first model and said second model;
    其中,所述第一模型为训练后的所述第一预设模型,所述第二模型为训练后的所述第二预设模型。Wherein, the first model is the first preset model after training, and the second model is the second preset model after training.
  172. 根据权利要求171所述的终端设备,其中,所述训练采用的损失函数为第一损失函数;The terminal device according to claim 171, wherein the loss function used in the training is a first loss function;
    所述第一损失函数为基于所述第二预设模型的输出信息与所述第一预设模型的压缩预设子模型的输入信息之间的差异程度构建的。The first loss function is constructed based on the degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model.
  173. 根据权利要求172所述的终端设备,其中,所述第二预设模型的输出信息与所述第一预设模型的压缩预设子模型的输入信息之间的差异程度为基于距离确定的,或者为基于相似程度确定的。The terminal device according to claim 172, wherein the degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model is determined based on distance, Or determined based on the degree of similarity.
  174. 根据权利要求172或173所述的终端设备,其中,所述第一处理单元,用于将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The terminal device according to claim 172 or 173, wherein the first processing unit is configured to input a first training sample into the estimated preset sub-model of the first preset model to obtain the estimated preset sub-model The initial information output by the model;
    将所述初始信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的信息;inputting the initial information into the compressed preset sub-model of the first preset model to obtain compressed information output by the compressed preset sub-model;
    将所述压缩后的信息输入所述第二预设模型,得到所述第二预设模型输出的恢复信息;inputting the compressed information into the second preset model to obtain restoration information output by the second preset model;
    基于所述恢复信息以及所述初始信息之间的差异程度,确定所述第一损失函数;determining the first loss function based on the degree of difference between the restoration information and the initial information;
    根据所述第一损失函数进行反向传导更新所述第一预设模型和所述第二预设模型。performing reverse conduction according to the first loss function to update the first preset model and the second preset model.
  175. 根据权利要求172或173所述的终端设备,其中,所述第一处理单元,用于将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The terminal device according to claim 172 or 173, wherein the first processing unit is configured to input a first training sample into the estimated preset sub-model of the first preset model to obtain the estimated preset sub-model The initial information output by the model;
    将所述初始信息输入所述第一预设模型的信道生成预设子模型,得到所述信道生成预设子模型输出的初始信息的特征向量信息;inputting the initial information into the channel generation preset sub-model of the first preset model, and obtaining eigenvector information of the initial information output by the channel generation preset sub-model;
    将所述初始信息的特征向量信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的特征向量信息;Inputting the eigenvector information of the initial information into the compressed preset sub-model of the first preset model to obtain the compressed eigenvector information output by the compressed preset sub-model;
    将所述压缩后的特征向量信息输入所述第二预设模型,得到所述第二预设模型输出的恢复的特征向量信息;inputting the compressed feature vector information into the second preset model to obtain restored feature vector information output by the second preset model;
    基于所述恢复的特征向量信息以及所述初始信息的特征向量信息之间的差异程度,确定第一损失函数;determining a first loss function based on the degree of difference between the restored eigenvector information and the eigenvector information of the initial information;
    根据所述第一损失函数进行反向传导更新所述第一预设模型和所述第二预设模型。performing reverse conduction according to the first loss function to update the first preset model and the second preset model.
  176. 根据权利要求172或176所述的终端设备,其中,所述第一处理单元,用于采用训练样本对第一预设模型、第二预设模型和第三预设模型进行联合训练,得到训练后的所述第一模型、所述第二模型和第三模型;The terminal device according to claim 172 or 176, wherein the first processing unit is configured to use training samples to jointly train the first preset model, the second preset model, and the third preset model to obtain the training the first model, the second model and the third model after;
    其中,所述第三模型为训练后的第三预设模型。Wherein, the third model is a trained third preset model.
  177. 根据权利要求176所述的终端设备,其中,所述第一处理单元,用于将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The terminal device according to claim 176, wherein the first processing unit is configured to input a first training sample into an estimated preset sub-model of the first preset model, and obtain an output of the estimated preset sub-model the initial information;
    将所述初始信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的信息;inputting the initial information into the compressed preset sub-model of the first preset model to obtain compressed information output by the compressed preset sub-model;
    将所述压缩后的信息输入第三预设模型,得到所述第三预设模型输出的变换后的信息;inputting the compressed information into a third preset model to obtain transformed information output by the third preset model;
    将所述变换后的信息输入所述第二预设模型,得到所述第二预设模型输出的恢复信息;inputting the transformed information into the second preset model to obtain restoration information output by the second preset model;
    基于所述恢复信息以及所述初始信息之间的差异程度,确定第一损失函数;determining a first loss function based on the degree of difference between the restoration information and the initial information;
    根据所述第一损失函数进行反向传导更新所述第一预设模型、第二预设模型和第三预设模型。performing reverse conduction according to the first loss function to update the first preset model, the second preset model and the third preset model.
  178. 根据权利要求176所述的终端设备,其中,所述第一处理单元,用于将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The terminal device according to claim 176, wherein the first processing unit is configured to input a first training sample into an estimated preset sub-model of the first preset model, and obtain an output of the estimated preset sub-model the initial information;
    将所述初始信息输入所述第一预设模型的信道生成预设子模型,得到所述信道生成预设子模型输出的初始信息的特征向量信息;inputting the initial information into the channel generation preset sub-model of the first preset model, and obtaining eigenvector information of the initial information output by the channel generation preset sub-model;
    将所述初始信息的特征向量信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的特征向量信息;Inputting the eigenvector information of the initial information into the compressed preset sub-model of the first preset model to obtain the compressed eigenvector information output by the compressed preset sub-model;
    将所述压缩后的特征向量信息输入第三预设模型,得到所述第三预设模型输出的变换后的特征向量信息;Inputting the compressed feature vector information into a third preset model to obtain transformed feature vector information output by the third preset model;
    将所述变换后的特征向量信息输入所述第二预设模型,得到所述第二预设模型输出的恢复的特征向量信息;inputting the transformed eigenvector information into the second preset model to obtain restored eigenvector information output by the second preset model;
    基于所述恢复的特征向量信息以及所述初始信息的特征向量信息之间的差异程度,确定第一损失函数;determining a first loss function based on the degree of difference between the restored eigenvector information and the eigenvector information of the initial information;
    根据所述第一损失函数进行反向传导更新所述第一预设模型、第二预设模型和第三预设模型。performing reverse conduction according to the first loss function to update the first preset model, the second preset model and the third preset model.
  179. 根据权利要求171所述的终端设备,其中,所述训练采用的损失函数为第二损失函数;The terminal device according to claim 171, wherein the loss function used in the training is a second loss function;
    所述第二损失函数为基于所述第二预设模型的输出信息与所述第一预设模型的压缩预设子模型的输入信息之间的第一差异程度,以及所述第一预设模型的估计预设子模型的输出信息与第二训练样本之间的第二差异程度构建的;其中,所述第二训练样本与输入所述估计预设子模型的第一训练样本相对应。The second loss function is based on a first degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model, and the first preset The second degree of difference between the output information of the estimated preset sub-model of the model and the second training sample is constructed; wherein the second training sample corresponds to the first training sample input to the estimated preset sub-model.
  180. 根据权利要求179所述的终端设备,其中,所述第一差异程度为基于距离确定的,或者为基于相似程度确定的;和/或,The terminal device according to claim 179, wherein the first degree of difference is determined based on a distance, or determined based on a degree of similarity; and/or,
    所述第二差异程度为基于距离确定的,或者为基于相似程度确定的。The second degree of difference is determined based on distance, or determined based on similarity.
  181. 根据权利要求179或180所述的终端设备,其中,所述第一处理单元,用于将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The terminal device according to claim 179 or 180, wherein the first processing unit is configured to input a first training sample into the estimated preset sub-model of the first preset model to obtain the estimated preset sub-model The initial information output by the model;
    将所述初始信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的信息;inputting the initial information into the compressed preset sub-model of the first preset model to obtain compressed information output by the compressed preset sub-model;
    将所述压缩后的信息输入所述第二预设模型,得到所述第二预设模型输出的恢复信息;inputting the compressed information into the second preset model to obtain restoration information output by the second preset model;
    基于所述恢复信息与所述初始信息之间的第一差异程度,以及基于所述初始信息与第二训练样本之间的第二差异程度,确定第二损失函数;所述第二训练样本与所述第一训练样本相对应;Determine a second loss function based on a first degree of difference between the restored information and the initial information, and based on a second degree of difference between the initial information and a second training sample; the second training sample and The first training sample corresponds to;
    根据所述第二损失函数进行反向传导更新所述第一预设模型和所述第二预设模型。performing reverse conduction according to the second loss function to update the first preset model and the second preset model.
  182. 根据权利要求179或180所述的终端设备,其中,所述第一处理单元,用于将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The terminal device according to claim 179 or 180, wherein the first processing unit is configured to input a first training sample into the estimated preset sub-model of the first preset model to obtain the estimated preset sub-model The initial information output by the model;
    将所述初始信息输入所述第一预设模型的信道生成预设子模型,得到所述信道生成预设子模型输出的初始信息的特征向量信息;inputting the initial information into the channel generation preset sub-model of the first preset model, and obtaining eigenvector information of the initial information output by the channel generation preset sub-model;
    将所述初始信息的特征向量信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的特征向量信息;Inputting the eigenvector information of the initial information into the compressed preset sub-model of the first preset model to obtain the compressed eigenvector information output by the compressed preset sub-model;
    将所述压缩后的特征向量信息输入所述第二预设模型,得到所述第二预设模型输出的恢复的特征向量信息;inputting the compressed feature vector information into the second preset model to obtain restored feature vector information output by the second preset model;
    基于所述恢复的特征向量信息以及所述初始信息的特征向量信息之间的第一差异程度,以及基于所述初始信息与第二训练样本之间的第二差异程度,确定第二损失函数;所述第二训练样本与所述第一训练样本对应;determining a second loss function based on a first degree of difference between the restored feature vector information and the feature vector information of the initial information, and based on a second degree of difference between the initial information and a second training sample; The second training sample corresponds to the first training sample;
    根据所述第二损失函数进行反向传导更新所述第一预设模型和所述第二预设模型。performing reverse conduction according to the second loss function to update the first preset model and the second preset model.
  183. 根据权利要求179或180所述的终端设备,其中,所述第一处理单元,用于采用训练样本对第一预设模型、第二预设模型和第三预设模型进行联合训练,得到训练后的所述第一模型、所述第二模型和第三模型;The terminal device according to claim 179 or 180, wherein the first processing unit is configured to use training samples to jointly train the first preset model, the second preset model, and the third preset model to obtain the training the first model, the second model and the third model after;
    其中,所述第三模型为训练后的第三预设模型。Wherein, the third model is a trained third preset model.
  184. 根据权利要求183所述的终端设备,其中,所述第一处理单元,用于将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The terminal device according to claim 183, wherein the first processing unit is configured to input a first training sample into an estimated preset sub-model of the first preset model, and obtain an output of the estimated preset sub-model the initial information;
    将所述初始信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的信息;inputting the initial information into the compressed preset sub-model of the first preset model to obtain compressed information output by the compressed preset sub-model;
    将所述压缩后的信息输入所述预设模型的第三预设模型,得到所述第三预设模型输出的变换后的信息;inputting the compressed information into a third preset model of the preset model to obtain transformed information output by the third preset model;
    将所述变换后的信息输入所述第二预设模型,得到所述第二预设模型输出的恢复信息;inputting the transformed information into the second preset model to obtain restoration information output by the second preset model;
    基于所述恢复信息与所述初始信息之间的第一差异程度,以及基于所述初始信息与第二训练样本之间的第二差异程度,确定第二损失函数;所述第二训练样本与所述第一训练样本对应;Determine a second loss function based on a first degree of difference between the restored information and the initial information, and based on a second degree of difference between the initial information and a second training sample; the second training sample and The first training sample corresponds to;
    根据所述第二损失函数进行反向传导更新所述第一预设模型、所述第二预设模型和所述第三预设模型。performing reverse conduction according to the second loss function to update the first preset model, the second preset model, and the third preset model.
  185. 根据权利要求183所述的终端设备,其中,所述第一处理单元,用于将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The terminal device according to claim 183, wherein the first processing unit is configured to input a first training sample into an estimated preset sub-model of the first preset model, and obtain an output of the estimated preset sub-model the initial information;
    将所述初始信息输入所述第一预设模型的信道生成预设子模型,得到所述信道生成预设子模型输出的初始信息的特征向量信息;inputting the initial information into the channel generation preset sub-model of the first preset model, and obtaining eigenvector information of the initial information output by the channel generation preset sub-model;
    将所述初始信息的特征向量信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的特征向量信息;Inputting the eigenvector information of the initial information into the compressed preset sub-model of the first preset model to obtain the compressed eigenvector information output by the compressed preset sub-model;
    将所述压缩后的特征向量信息输入所述预设模型中的第三预设模型,得到所述第三预设模型输出的变换后的特征向量信息;inputting the compressed feature vector information into a third preset model among the preset models, to obtain transformed feature vector information output by the third preset model;
    将所述变换后的特征向量信息输入所述第二预设模型,得到所述第二预设模型输出的恢复的特征向量信息;inputting the transformed eigenvector information into the second preset model to obtain restored eigenvector information output by the second preset model;
    基于所述恢复的特征向量信息以及所述初始信息的特征向量信息之间的第一差异程度,以及基于所述初始信息与第二训练样本之间的第二差异程度,确定第二损失函数;所述第二训练样本与所述第一训练样本对应;determining a second loss function based on a first degree of difference between the restored feature vector information and the feature vector information of the initial information, and based on a second degree of difference between the initial information and a second training sample; The second training sample corresponds to the first training sample;
    根据所述第二损失函数进行反向传导更新所述第一预设模型、所述第二预设模型和所述第三预设模型。performing reverse conduction according to the second loss function to update the first preset model, the second preset model, and the third preset model.
  186. 根据权利要求171-185任一项所述的终端设备,其中,所述训练样本中包含第一训练样本。The terminal device according to any one of claims 171-185, wherein the training samples include a first training sample.
  187. 根据权利要求186所述的终端设备,其中,The terminal device according to claim 186, wherein,
    所述第一训练样本分布在第一维度和/或第二维度。The first training samples are distributed in the first dimension and/or the second dimension.
  188. 根据权利要求187所述的终端设备,其中,The terminal device according to claim 187, wherein,
    所述第一维度为时域维度;所述第一训练样本包括在所述时域维度中的m个时间单元内分布的第一信息样本;m为正整数。The first dimension is a time domain dimension; the first training samples include first information samples distributed in m time units in the time domain dimension; m is a positive integer.
  189. 根据权利要求187所述的终端设备,其中,The terminal device according to claim 187, wherein,
    所述第二维度为频域维度;所述第一训练样本包括在所述频域维度中的x个频域资源上分布的第一信息样本;x为正整数。The second dimension is a frequency domain dimension; the first training samples include first information samples distributed on x frequency domain resources in the frequency domain dimension; x is a positive integer.
  190. 根据权利要求187-189任一项所述的终端设备,其中,所述第一训练样本还分布在第三维度;The terminal device according to any one of claims 187-189, wherein the first training samples are also distributed in a third dimension;
    所述第三维度为复数维度;所述第一训练样本包括第一信息样本的实部和第一信息样本的虚部。The third dimension is a complex dimension; the first training samples include the real part of the first information sample and the imaginary part of the first information sample.
  191. 根据权利要求186-190任一项所述的终端设备,其中,所述训练样本中还包含与所述第一训练样本对应的第二训练样本;The terminal device according to any one of claims 186-190, wherein the training samples further include a second training sample corresponding to the first training sample;
    所述第二训练样本由T个维度的矩阵构成;T为大于等于2的整数。The second training sample is composed of a matrix of T dimensions; T is an integer greater than or equal to 2.
  192. 根据权利要求191所述的终端设备,其中,The terminal device according to claim 191, wherein,
    所述T个维度中包含第四维度和第五维度。The T dimensions include a fourth dimension and a fifth dimension.
  193. 根据权利要求192所述的终端设备,其中,The terminal device according to claim 192, wherein,
    所述T个维度的矩阵为M×N的二维矩阵;其中,M表示在第四维度下的第一粒度的数量,N表示在第五维度下的第二粒度的数量;M和N均为正整数。The matrix of the T dimensions is a two-dimensional matrix of M×N; wherein, M represents the quantity of the first granularity under the fourth dimension, and N represents the quantity of the second granularity under the fifth dimension; both M and N are is a positive integer.
  194. 根据权利要求193所述的终端设备,其中,The terminal device according to claim 193, wherein,
    所述二维矩阵中的第ij个位置的数值用于表示在所述第四维度下的第i个第一粒度以及第五维度的第j个第二粒度下的信道质量;i和j均为正整数。The value at the ijth position in the two-dimensional matrix is used to represent the channel quality at the i-th first granularity in the fourth dimension and the j-th second granularity in the fifth dimension; both i and j are is a positive integer.
  195. 根据权利要求192-194任一项所述的终端设备,其中,所述T个维度中还包括第六维度。The terminal device according to any one of claims 192-194, wherein the T dimensions further include a sixth dimension.
  196. 根据权利要求195所述的终端设备,其中,The terminal device according to claim 195, wherein,
    所述T个维度的矩阵为M×N×W的三维矩阵;其中,M表示在第四维度下的第一粒度的数量,N表示在第五维度下的第二粒度的数量,W表示在第六维度下的第三粒度的数量;M、N和W均为正整数。The matrix of T dimensions is a three-dimensional matrix of M×N×W; wherein, M represents the number of first granularities in the fourth dimension, N represents the number of second granularities in the fifth dimension, and W represents the number of granularities in the fifth dimension. The quantity of the third granularity under the sixth dimension; M, N and W are all positive integers.
  197. 根据权利要求196所述的终端设备,其中,The terminal device according to claim 196, wherein,
    所述三维矩阵中的第ijk个位置的数值用于表示在所述第四维度下的第i个第一粒度、第五维度的第j个第二粒度下、所述第六维度的第k个第三粒度下的信道质量;i、j和k均为正整数。The value of the ijkth position in the three-dimensional matrix is used to represent the i-th first granularity in the fourth dimension, the j-th second granularity in the fifth dimension, and the k-th in the sixth dimension channel quality at the third granularity; i, j and k are all positive integers.
  198. 根据权利要求192-197任一项所述的终端设备,其中,The terminal device according to any one of claims 192-197, wherein,
    所述第四维度为频域维度;所述第一粒度包含以下之一:L1个资源块RB,L2个子载波;L1和L2为正整数。The fourth dimension is a frequency domain dimension; the first granularity includes one of the following: L1 resource blocks RB, L2 subcarriers; L1 and L2 are positive integers.
  199. 根据权利要求192-197任一项所述的终端设备,其中,The terminal device according to any one of claims 192-197, wherein,
    所述第四维度为时域维度;所述第一粒度包含以下之一:K1个微秒、K2个符号长度、K3个符号的采样点个数;K1、K2和K3为正整数。The fourth dimension is a time domain dimension; the first granularity includes one of the following: K1 microseconds, K2 symbol length, K3 number of sampling points of symbols; K1, K2 and K3 are positive integers.
  200. 根据权利要求199所述的终端设备,其中,所述符号为OFDM符号。The terminal device of claim 199, wherein the symbols are OFDM symbols.
  201. 根据权利要求192-200任一项所述的终端设备,其中,所述第五维度为空间域维度;The terminal device according to any one of claims 192-200, wherein the fifth dimension is a spatial domain dimension;
    所述第二粒度为一对收发天线或到达角度的间隔。The second granularity is an interval between a pair of transmitting and receiving antennas or an angle of arrival.
  202. 根据权利要求195-201任一项所述的终端设备,其中,The terminal device according to any one of claims 195-201, wherein,
    所述第六维度为复数维度;所述第三粒度为1,在所述复数维度下的第三粒度的数量W为2。The sixth dimension is a complex dimension; the third granularity is 1, and the quantity W of the third granularity under the complex dimension is 2.
  203. 根据权利要求202所述的终端设备,其中,所述k为第一值的情况下,所述三维矩阵中的第ijk个位置的数值用于表示在所述第四维度下的第i个第一粒度、所述第五维度的第j个第二粒度下的信道质量的实部;The terminal device according to claim 202, wherein when the k is the first value, the value of the ijk-th position in the three-dimensional matrix is used to represent the i-th position in the fourth dimension A granularity, the real part of the channel quality at the jth second granularity of the fifth dimension;
    所述k为第二值的情况下,所述三维矩阵中的第ijk个位置的数值用于表示在所述第四维度下的第i个第一粒度、所述第五维度的第j个第二粒度下的信道质量的虚部。When the k is the second value, the value of the ijkth position in the three-dimensional matrix is used to represent the i-th first granularity in the fourth dimension and the j-th granularity in the fifth dimension The imaginary part of the channel quality at the second granularity.
  204. 根据权利要求171-203任一项所述的终端设备,其中,所述第一通信单元,用于发送所述第二模型。The terminal device according to any one of claims 171-203, wherein the first communication unit is configured to send the second model.
  205. 根据权利要求171-204任一项所述的终端设备,其中,所述第一通信单元,用于发送所述第一模型。The terminal device according to any one of claims 171-204, wherein the first communication unit is configured to send the first model.
  206. 根据权利要求171-204任一项所述的终端设备,其中,所述第一通信单元,用于发送所述第一模型中的估计子模型以及压缩子模型。The terminal device according to any one of claims 171-204, wherein the first communication unit is configured to send the estimation sub-model and the compression sub-model in the first model.
  207. 根据权利要求171-204任一项所述的终端设备,其中,所述第一通信单元,用于发送所述第一模型中的估计子模型、压缩子模型以及信道生成子模型。The terminal device according to any one of claims 171-204, wherein the first communication unit is configured to send the estimation sub-model, the compression sub-model and the channel generation sub-model in the first model.
  208. 根据权利要求171-207任一项所述的终端设备,其中,所述第一通信单元,用于发送所述第三模型。The terminal device according to any one of claims 171-207, wherein the first communication unit is configured to send the third model.
  209. 根据权利要求204-208任一项所述的终端设备,其中,所述第一模型由以下至少之一携带:上行控制信令、RRC消息、上行数据传输、针对人工智能类业务类传输需求的上行数据传输;The terminal device according to any one of claims 204-208, wherein the first model is carried by at least one of the following: uplink control signaling, RRC messages, uplink data transmission, and transmission requirements for artificial intelligence services Uplink data transmission;
    所述第二模型由以下至少之一携带:上行控制信令、RRC消息、上行数据传输、针对人工智能类业务类传输需求的上行数据传输;The second model is carried by at least one of the following: uplink control signaling, RRC messages, uplink data transmission, and uplink data transmission for artificial intelligence business transmission requirements;
    所述第三模型由以下至少之一携带:上行控制信令、RRC消息、上行数据传输、针对人工智能类业务类传输需求的上行数据传输;The third model is carried by at least one of the following: uplink control signaling, RRC messages, uplink data transmission, and uplink data transmission for artificial intelligence business transmission requirements;
    所述估计子模型由以下至少之一携带:上行控制信令、RRC消息、上行数据传输、针对人工智能类业务类传输需求的上行数据传输;The estimation sub-model is carried by at least one of the following: uplink control signaling, RRC message, uplink data transmission, uplink data transmission for artificial intelligence business type transmission requirements;
    所述压缩子模型由以下至少之一携带:上行控制信令、RRC消息、上行数据传输、针对人工智能类业务类传输需求的上行数据传输;The compressed sub-model is carried by at least one of the following: uplink control signaling, RRC messages, uplink data transmission, and uplink data transmission for artificial intelligence business transmission requirements;
    所述信道生成子模型由以下至少之一携带:上行控制信令、RRC消息、上行数据传输、针对人工智能类业务类传输需求的上行数据传输。The channel generation sub-model is carried by at least one of the following: uplink control signaling, RRC messages, uplink data transmission, and uplink data transmission for the transmission requirements of artificial intelligence services.
  210. 一种网络设备,包括:A network device comprising:
    第二通信单元,用于发送第一信息;接收第二信息;其中,所述第二信息为所述第一信息经由第一模型处理得到的;The second communication unit is configured to send the first information; receive the second information; wherein, the second information is obtained by processing the first information through the first model;
    第二处理单元,用于基于第二模型对所述第二信息进行处理得到信道信息;其中,所述第一模型和第二模型为联合训练得到的。The second processing unit is configured to process the second information based on a second model to obtain channel information; wherein, the first model and the second model are obtained through joint training.
  211. 根据权利要求210所述的网络设备,其中,所述第二信息为信道压缩信息;The network device according to claim 210, wherein the second information is channel compression information;
    所述第二模型用于对所述信道压缩信息进行解压缩处理,得到信道信息。The second model is used to decompress the channel compressed information to obtain channel information.
  212. 根据权利要求211所述的网络设备,其中,第二处理单元,用于将所述信道压缩信息输入所述第二模型,得到所述第二模型输出的所述信道信息。The network device according to claim 211, wherein the second processing unit is configured to input the channel compression information into the second model, and obtain the channel information output by the second model.
  213. 根据权利要求210所述的网络设备,其中,所述第二信息为信道压缩信息;所述信道压缩信息包含压缩的信道估计信息的特征向量信息;所述信道信息为信道信息的特征向量信息;The network device according to claim 210, wherein the second information is channel compression information; the channel compression information includes eigenvector information of compressed channel estimation information; the channel information is eigenvector information of channel information;
    所述第二模型用于对所述压缩的信道估计信息的特征向量信息进行解压缩处理,得到信道信息的特征向量信息。The second model is used to decompress the compressed eigenvector information of the channel estimation information to obtain the eigenvector information of the channel information.
  214. 根据权利要求213所述的网络设备,其中,The network device of claim 213, wherein:
    所述信道信息的特征向量信息中包含R组特征向量序列信息;R为正整数。The eigenvector information of the channel information includes R groups of eigenvector sequence information; R is a positive integer.
  215. 根据权利要求213或214所述的网络设备,其中,所述第二处理单元,用于将所述压缩的信道估计信息的特征向量信息输入所述第二模型,得到所述第二模型输出的所述信道信息的特征向量信息。The network device according to claim 213 or 214, wherein the second processing unit is configured to input the eigenvector information of the compressed channel estimation information into the second model, and obtain the output of the second model Eigenvector information of the channel information.
  216. 根据权利要求210-215任一项所述的网络设备,其中,所述第一信息为参考信号。The network device according to any one of claims 210-215, wherein the first information is a reference signal.
  217. 根据权利要求210-216任一项所述的网络设备,其中,所述第二通信单元,用于接收所述第二模型。The network device according to any one of claims 210-216, wherein the second communication unit is configured to receive the second model.
  218. 根据权利要求217所述的网络设备,其中,The network device of claim 217, wherein,
    所述第二模型由以下至少之一携带:上行控制信令、RRC消息、上行数据传输、针对人工智能类业务类传输需求的上行数据传输。The second model is carried by at least one of the following: uplink control signaling, RRC message, uplink data transmission, and uplink data transmission for the transmission requirements of artificial intelligence services.
  219. 根据权利要求217或218所述的网络设备,其中,所述第二通信单元,用于接收所述第一模型。The network device according to claim 217 or 218, wherein the second communication unit is configured to receive the first model.
  220. 根据权利要求219所述的网络设备,其中,The network device of claim 219, wherein,
    所述第一模型由以下至少之一携带:上行控制信令、RRC消息、上行数据传输、针对人工智能类业务类传输需求的上行数据传输。The first model is carried by at least one of the following: uplink control signaling, RRC message, uplink data transmission, and uplink data transmission for the transmission requirements of artificial intelligence services.
  221. 根据权利要求219所述的网络设备,其中,所述第一模型包括:估计子模型和压缩子模型;The network device of claim 219, wherein the first model comprises: an estimation sub-model and a compression sub-model;
    其中,所述估计子模型用于对所述第一信息进行信道估计得到信道估计信息;Wherein, the estimation sub-model is used to perform channel estimation on the first information to obtain channel estimation information;
    所述压缩子模型用于对所述信道估计信息进行压缩得到所述第二信息。The compression sub-model is used to compress the channel estimation information to obtain the second information.
  222. 根据权利要求219所述的网络设备,其中,所述第一模型包括:估计子模型、信道生成子模型和压缩子模型;The network device according to claim 219, wherein the first model comprises: an estimation submodel, a channel generation submodel and a compression submodel;
    其中,所述估计子模型用于对所述第一信息进行信道估计得到信道估计信息;Wherein, the estimation sub-model is used to perform channel estimation on the first information to obtain channel estimation information;
    所述信道生成子模型用于对所述信道估计信息进行特征分解得到信道估计信息的特征向量信息;The channel generation sub-model is used to perform eigendecomposition on the channel estimation information to obtain eigenvector information of the channel estimation information;
    所述压缩子模型用于对所述信道估计信息的特征向量信息进行压缩得到所述第二信息。The compression sub-model is used to compress the eigenvector information of the channel estimation information to obtain the second information.
  223. 根据权利要求217或218所述的网络设备,其中,所述第二通信单元,用于接收所述估计子模型以及所述压缩子模型;The network device according to claim 217 or 218, wherein the second communication unit is configured to receive the estimated sub-model and the compressed sub-model;
    所述第二处理单元,用于基于所述估计子模型以及所述压缩子模型,生成所述第一模型。The second processing unit is configured to generate the first model based on the estimated sub-model and the compressed sub-model.
  224. 根据权利要求223所述的网络设备,其中,所述估计子模型用于对所述第一信息进行信道估计得到信道估计信息;The network device according to claim 223, wherein the estimation sub-model is used to perform channel estimation on the first information to obtain channel estimation information;
    所述压缩子模型用于对所述信道估计信息进行压缩得到所述第二信息。The compression sub-model is used to compress the channel estimation information to obtain the second information.
  225. 根据权利要求217或218所述的网络设备,其中,所述第二通信单元,用于接收所述估计子模型、所述压缩子模型、信道生成子模型;The network device according to claim 217 or 218, wherein the second communication unit is configured to receive the estimation sub-model, the compression sub-model, and the channel generation sub-model;
    所述第二处理单元,用于基于所述估计子模型、所述压缩子模型以及所述信道生成子模型,生成所述第一模型。The second processing unit is configured to generate the first model based on the estimation sub-model, the compression sub-model and the channel generation sub-model.
  226. 根据权利要求225所述的网络设备,其中,The network device of claim 225, wherein,
    所述估计子模型用于对所述第一信息进行信道估计得到信道估计信息;The estimation sub-model is used to perform channel estimation on the first information to obtain channel estimation information;
    所述信道生成子模型用于对所述信道估计信息进行特征分解得到信道估计信息的特征向量信息;The channel generation sub-model is used to perform eigendecomposition on the channel estimation information to obtain eigenvector information of the channel estimation information;
    所述压缩子模型用于对所述信道估计信息的特征向量信息进行压缩得到所述第二信息。The compression sub-model is used to compress the eigenvector information of the channel estimation information to obtain the second information.
  227. 根据权利要求223-226任一项所述的网络设备,其中,所述估计子模型由以下之一携带:上行控制信令、RRC消息、上行数据传输、针对人工智能类业务类传输需求的上行数据传输;The network device according to any one of claims 223-226, wherein the estimation sub-model is carried by one of the following: uplink control signaling, RRC message, uplink data transmission, and uplink transmission requirements for artificial intelligence services data transmission;
    所述压缩子模型由以下之一携带:上行控制信令、RRC消息、上行数据传输、针对人工智能类业务类传输需求的上行数据传输;The compressed sub-model is carried by one of the following: uplink control signaling, RRC messages, uplink data transmission, and uplink data transmission for artificial intelligence business transmission requirements;
    所述信道生成子模型由以下之一携带:上行控制信令、RRC消息、上行数据传输、针对人工智能类业务类传输需求的上行数据传输。The channel generation sub-model is carried by one of the following: uplink control signaling, RRC message, uplink data transmission, and uplink data transmission for the transmission requirements of artificial intelligence services.
  228. 根据权利要求217-227任一项所述的网络设备,其中,所述第二通信单元,用于接收第三模型。The network device according to any one of claims 217-227, wherein the second communication unit is configured to receive the third model.
  229. 根据权利要求228所述的网络设备,其中,所述第三模型用于对所述第一模型输出的第二信息进行数据变换处理后输入所述第二模型;The network device according to claim 228, wherein the third model is used to input the second information output by the first model into the second model after data conversion processing;
    所述第一模型、第二模型以及第三模型为联合训练得到的。The first model, the second model and the third model are obtained through joint training.
  230. 根据权利要求229所述的网络设备,其中,所述数据变换处理包括:卷积处理或傅里叶变换处理。The network device according to claim 229, wherein said data transformation processing comprises: convolution processing or Fourier transform processing.
  231. 根据权利要求210-216任一项所述的网络设备,其中,所述第二处理单元,用于采用训练样本对第一预设模型和第二预设模型进行联合训练,得到训练后的所述第一模型和所述第二模型;The network device according to any one of claims 210-216, wherein the second processing unit is configured to use training samples to jointly train the first preset model and the second preset model to obtain the trained said first model and said second model;
    其中,所述第一模型为训练后的所述第一预设模型,所述第二模型为训练后的所述第二预设模型。Wherein, the first model is the first preset model after training, and the second model is the second preset model after training.
  232. 根据权利要求231所述的网络设备,其中,所述训练采用的损失函数为第一损失函数;The network device according to claim 231, wherein the loss function used in the training is a first loss function;
    所述第一损失函数为基于所述第二预设模型的输出信息与所述第一预设模型的压缩预设子模型的输入信息之间的差异程度构建的。The first loss function is constructed based on the degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model.
  233. 根据权利要求232所述的网络设备,其中,所述第二预设模型的输出信息与所述第一预设模型的压缩预设子模型的输入信息之间的差异程度为基于距离确定的,或者为基于相似程度确定的。The network device of claim 232, wherein the degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model is determined based on distance, Or determined based on the degree of similarity.
  234. 根据权利要求232或233所述的网络设备,其中,所述第二处理单元,用于将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The network device according to claim 232 or 233, wherein the second processing unit is configured to input the first training sample into the estimated preset sub-model of the first preset model to obtain the estimated preset sub-model The initial information output by the model;
    将所述初始信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的信息;inputting the initial information into the compressed preset sub-model of the first preset model to obtain compressed information output by the compressed preset sub-model;
    将所述压缩后的信息输入所述第二预设模型,得到所述第二预设模型输出的恢复信息;inputting the compressed information into the second preset model to obtain restoration information output by the second preset model;
    基于所述恢复信息以及所述初始信息之间的差异程度,确定所述第一损失函数;determining the first loss function based on the degree of difference between the restoration information and the initial information;
    根据所述第一损失函数进行反向传导更新所述第一预设模型和所述第二预设模型。performing reverse conduction according to the first loss function to update the first preset model and the second preset model.
  235. 根据权利要求232或233所述的网络设备,其中,所述第二处理单元,用于将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The network device according to claim 232 or 233, wherein the second processing unit is configured to input the first training sample into the estimated preset sub-model of the first preset model to obtain the estimated preset sub-model The initial information output by the model;
    将所述初始信息输入所述第一预设模型的信道生成预设子模型,得到所述信道生成预设子模型输出的初始信息的特征向量信息;inputting the initial information into the channel generation preset sub-model of the first preset model, and obtaining eigenvector information of the initial information output by the channel generation preset sub-model;
    将所述初始信息的特征向量信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的特征向量信息;Inputting the eigenvector information of the initial information into the compressed preset sub-model of the first preset model to obtain the compressed eigenvector information output by the compressed preset sub-model;
    将所述压缩后的特征向量信息输入所述第二预设模型,得到所述第二预设模型输出的恢复的特征向量信息;inputting the compressed feature vector information into the second preset model to obtain restored feature vector information output by the second preset model;
    基于所述恢复的特征向量信息以及所述初始信息的特征向量信息之间的差异程度,确定第一损失函数;determining a first loss function based on the degree of difference between the restored eigenvector information and the eigenvector information of the initial information;
    根据所述第一损失函数进行反向传导更新所述第一预设模型和所述第二预设模型。performing reverse conduction according to the first loss function to update the first preset model and the second preset model.
  236. 根据权利要求232或233所述的网络设备,其中,所述第二处理单元,用于采用训练样本对第一预设模型、第二预设模型和第三预设模型进行联合训练,得到训练后的所述第一模型、所述第二模型和第三模型;其中,所述第三模型为训练后的第三预设模型。The network device according to claim 232 or 233, wherein the second processing unit is configured to use training samples to jointly train the first preset model, the second preset model, and the third preset model to obtain the training After the first model, the second model and the third model; wherein, the third model is the third preset model after training.
  237. 根据权利要求236所述的网络设备,其中,所述第二处理单元,用于将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The network device according to claim 236, wherein the second processing unit is configured to input a first training sample into an estimated preset sub-model of the first preset model, and obtain an output of the estimated preset sub-model the initial information;
    将所述初始信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的信息;inputting the initial information into the compressed preset sub-model of the first preset model to obtain compressed information output by the compressed preset sub-model;
    将所述压缩后的信息输入第三预设模型,得到所述第三预设模型输出的变换后的信息;inputting the compressed information into a third preset model to obtain transformed information output by the third preset model;
    将所述变换后的信息输入所述第二预设模型,得到所述第二预设模型输出的恢复信息;inputting the transformed information into the second preset model to obtain restoration information output by the second preset model;
    基于所述恢复信息以及所述初始信息之间的差异程度,确定第一损失函数;determining a first loss function based on the degree of difference between the restoration information and the initial information;
    根据所述第一损失函数进行反向传导更新所述第一预设模型、所述第二预设模型和所述第三预设模型。performing reverse conduction according to the first loss function to update the first preset model, the second preset model, and the third preset model.
  238. 根据权利要求236所述的网络设备,其中,所述第二处理单元,用于将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The network device according to claim 236, wherein the second processing unit is configured to input a first training sample into an estimated preset sub-model of the first preset model, and obtain an output of the estimated preset sub-model the initial information;
    将所述初始信息输入所述第一预设模型的信道生成预设子模型,得到所述信道生成预设子模型输出的初始信息的特征向量信息;inputting the initial information into the channel generation preset sub-model of the first preset model, and obtaining eigenvector information of the initial information output by the channel generation preset sub-model;
    将所述初始信息的特征向量信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的特征向量信息;Inputting the eigenvector information of the initial information into the compressed preset sub-model of the first preset model to obtain the compressed eigenvector information output by the compressed preset sub-model;
    将所述压缩后的特征向量信息输入第三预设模型,得到所述第三预设模型输出的变换后的特征向量信息;Inputting the compressed feature vector information into a third preset model to obtain transformed feature vector information output by the third preset model;
    将所述变换后的特征向量信息输入所述第二预设模型,得到所述第二预设模型输出的恢复的特征向量信息;inputting the transformed eigenvector information into the second preset model to obtain restored eigenvector information output by the second preset model;
    基于所述恢复的特征向量信息以及所述初始信息的特征向量信息之间的差异程度,确定第一损失函数;determining a first loss function based on the degree of difference between the restored eigenvector information and the eigenvector information of the initial information;
    根据所述第一损失函数进行反向传导更新所述第一预设模型、所述第二预设模型和所述第三预设模型。performing reverse conduction according to the first loss function to update the first preset model, the second preset model, and the third preset model.
  239. 根据权利要求231所述的网络设备,其中,所述训练采用的损失函数为第二损失函数;The network device according to claim 231, wherein the loss function used in the training is a second loss function;
    所述第二损失函数为基于所述第二预设模型的输出信息与所述第一预设模型的压缩预设子模型的输入信息之间的第一差异程度以及所述第一预设模型的估计预设子模型的输出信息与第二训练样本之间的第二差异程度构建的;其中,所述第二训练样本与输入所述估计预设子模型的第一训练样本相对应。The second loss function is based on the first degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model and the first preset model The second difference degree between the output information of the estimated preset sub-model and the second training sample is constructed; wherein, the second training sample corresponds to the first training sample input into the estimated preset sub-model.
  240. 根据权利要求239所述的网络设备,其中,所述第一差异程度为基于距离确定的,或者为基于相似程度确定的;和/或,The network device according to claim 239, wherein the first degree of difference is determined based on a distance, or is determined based on a degree of similarity; and/or,
    所述第二差异程度为基于距离确定的,或者为基于相似程度确定的。The second degree of difference is determined based on distance, or determined based on similarity.
  241. 根据权利要求239或240所述的网络设备,其中,所述第二处理单元,用于将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The network device according to claim 239 or 240, wherein the second processing unit is configured to input the first training sample into the estimated preset sub-model of the first preset model to obtain the estimated preset sub-model The initial information output by the model;
    将所述初始信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的信息;inputting the initial information into the compressed preset sub-model of the first preset model to obtain compressed information output by the compressed preset sub-model;
    将所述压缩后的信息输入所述第二预设模型,得到所述第二预设模型输出的恢复信息;inputting the compressed information into the second preset model to obtain restoration information output by the second preset model;
    基于所述恢复信息与所述初始信息之间的第一差异程度,以及基于所述初始信息与第二训练样本之间的第二差异程度,确定第二损失函数;所述第二训练样本与所述第一训练样本对应;Determine a second loss function based on a first degree of difference between the restored information and the initial information, and based on a second degree of difference between the initial information and a second training sample; the second training sample and The first training sample corresponds to;
    根据所述第二损失函数进行反向传导更新所述第一预设模型和所述第二预设模型。performing reverse conduction according to the second loss function to update the first preset model and the second preset model.
  242. 根据权利要求239或240所述的网络设备,其中,所述第二处理单元,用于将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The network device according to claim 239 or 240, wherein the second processing unit is configured to input the first training sample into the estimated preset sub-model of the first preset model to obtain the estimated preset sub-model The initial information output by the model;
    将所述初始信息输入所述第一预设模型的信道生成预设子模型,得到所述信道生成预设子模型输出的初始信息的特征向量信息;inputting the initial information into the channel generation preset sub-model of the first preset model, and obtaining eigenvector information of the initial information output by the channel generation preset sub-model;
    将所述初始信息的特征向量信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的特征向量信息;Inputting the eigenvector information of the initial information into the compressed preset sub-model of the first preset model to obtain the compressed eigenvector information output by the compressed preset sub-model;
    将所述压缩后的特征向量信息输入所述第二预设模型,得到所述第二预设模型输出的恢复的特征向量信息;inputting the compressed feature vector information into the second preset model to obtain restored feature vector information output by the second preset model;
    基于所述恢复的特征向量信息以及所述初始信息的特征向量信息之间的第一差异程度,以及基于所述初始信息与第二训练样本之间的第二差异程度,确定第二损失函数;所述第二训练样本与所述第一训练样本对应;determining a second loss function based on a first degree of difference between the restored feature vector information and the feature vector information of the initial information, and based on a second degree of difference between the initial information and a second training sample; The second training sample corresponds to the first training sample;
    根据所述第二损失函数进行反向传导更新所述第一预设模型和所述第二预设模型。performing reverse conduction according to the second loss function to update the first preset model and the second preset model.
  243. 根据权利要求239或240所述的网络设备,其中,所述第二处理单元,用于采用训练样本对第一预设模型、第二预设模型和第三预设模型进行联合训练,得到训练后的所述第一模型、所述第二模型和第三模型;其中,所述第三模型为训练后的第三预设模型。The network device according to claim 239 or 240, wherein the second processing unit is configured to use training samples to jointly train the first preset model, the second preset model, and the third preset model to obtain the training After the first model, the second model and the third model; wherein, the third model is the third preset model after training.
  244. 根据权利要求243所述的网络设备,其中,所述第二处理单元,用于将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The network device according to claim 243, wherein the second processing unit is configured to input a first training sample into an estimated preset sub-model of the first preset model, and obtain an output of the estimated preset sub-model the initial information;
    将所述初始信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的信息;inputting the initial information into the compressed preset sub-model of the first preset model to obtain compressed information output by the compressed preset sub-model;
    将所述压缩后的信息输入第三预设模型,得到所述第三预设模型输出的变换后的信息;inputting the compressed information into a third preset model to obtain transformed information output by the third preset model;
    将所述变换后的信息输入所述第二预设模型,得到所述第二预设模型输出的恢复信息;inputting the transformed information into the second preset model to obtain restoration information output by the second preset model;
    基于所述恢复信息与所述初始信息之间的第一差异程度,以及基于所述初始信息与第二训练样本之间的第二差异程度,确定第二损失函数;所述第二训练样本与所述第一训练样本对应;Determine a second loss function based on a first degree of difference between the restored information and the initial information, and based on a second degree of difference between the initial information and a second training sample; the second training sample and The first training sample corresponds to;
    根据所述第二损失函数进行反向传导更新所述第一预设模型、所述第二预设模型和所述第三预设模型。performing reverse conduction according to the second loss function to update the first preset model, the second preset model, and the third preset model.
  245. 根据权利要求243所述的网络设备,其中,所述第二处理单元,用于将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The network device according to claim 243, wherein the second processing unit is configured to input a first training sample into an estimated preset sub-model of the first preset model, and obtain an output of the estimated preset sub-model the initial information;
    将所述初始信息输入所述第一预设模型的信道生成预设子模型,得到所述信道生成预设子模型输出的初始信息的特征向量信息;inputting the initial information into the channel generation preset sub-model of the first preset model, and obtaining eigenvector information of the initial information output by the channel generation preset sub-model;
    将所述初始信息的特征向量信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的特征向量信息;Inputting the eigenvector information of the initial information into the compressed preset sub-model of the first preset model to obtain the compressed eigenvector information output by the compressed preset sub-model;
    将所述压缩后的特征向量信息输入第三预设模型,得到所述第三预设模型输出的变换后的特征向量信息;Inputting the compressed feature vector information into a third preset model to obtain transformed feature vector information output by the third preset model;
    将所述变换后的特征向量信息输入所述第二预设模型,得到所述第二预设模型输出的恢复的特征向量信息;inputting the transformed eigenvector information into the second preset model to obtain restored eigenvector information output by the second preset model;
    基于所述恢复的特征向量信息以及所述初始信息的特征向量信息之间的第一差异程度,以及基于所述初始信息与第二训练样本之间的第二差异程度,确定第二损失函数;所述第二训练样本与所述第一训练样本对应;determining a second loss function based on a first degree of difference between the restored feature vector information and the feature vector information of the initial information, and based on a second degree of difference between the initial information and a second training sample; The second training sample corresponds to the first training sample;
    根据所述第二损失函数进行反向传导更新所述第一预设模型、所述第二预设模型和所述第三预设模型。performing reverse conduction according to the second loss function to update the first preset model, the second preset model, and the third preset model.
  246. 根据权利要求231-245任一项所述的网络设备,其中,所述训练样本中包含第一训练样本。The network device according to any one of claims 231-245, wherein the training samples include a first training sample.
  247. 根据权利要求246所述的网络设备,其中,The network device of claim 246, wherein:
    所述第一训练样本分布在第一维度和/或第二维度。The first training samples are distributed in the first dimension and/or the second dimension.
  248. 根据权利要求247所述的网络设备,其中,The network device of claim 247, wherein,
    所述第一维度为时域维度;The first dimension is a time domain dimension;
    所述第一训练样本包括在所述时域维度中的m个时间单元内分布的第一信息样本;m为正整数。The first training samples include first information samples distributed in m time units in the time domain dimension; m is a positive integer.
  249. 根据权利要求247所述的网络设备,其中,The network device of claim 247, wherein,
    所述第二维度为频域维度;The second dimension is a frequency domain dimension;
    所述第一训练样本包括在所述频域维度中的x个频域资源上分布的第一信息样本;x为正整数。The first training samples include first information samples distributed on x frequency domain resources in the frequency domain dimension; x is a positive integer.
  250. 根据权利要求247-249任一项所述的网络设备,其中,所述第一训练样本还分布在第三维度;The network device according to any one of claims 247-249, wherein the first training samples are also distributed in a third dimension;
    所述第三维度为复数维度;所述第一训练样本包括第一信息样本的实部和第一信息样本的虚部。The third dimension is a complex dimension; the first training samples include the real part of the first information sample and the imaginary part of the first information sample.
  251. 根据权利要求246-250任一项所述的网络设备,其中,所述训练样本中还包括与所述第一训练样本对应的第二训练样本;The network device according to any one of claims 246-250, wherein the training samples further include a second training sample corresponding to the first training sample;
    所述第二训练样本由T个维度的矩阵构成;T为大于等于2的整数。The second training sample is composed of a matrix of T dimensions; T is an integer greater than or equal to 2.
  252. 根据权利要求251所述的网络设备,其中,The network device of claim 251, wherein,
    所述T个维度中包含第四维度和第五维度。The T dimensions include a fourth dimension and a fifth dimension.
  253. 根据权利要求252所述的网络设备,其中,The network device of claim 252, wherein:
    所述T个维度的矩阵为M×N的二维矩阵;其中,M表示在第四维度下的第一粒度的数量,N表示在第五维度下的第二粒度的数量;M和N均为正整数。The matrix of the T dimensions is a two-dimensional matrix of M×N; wherein, M represents the quantity of the first granularity under the fourth dimension, and N represents the quantity of the second granularity under the fifth dimension; both M and N are is a positive integer.
  254. 根据权利要求253所述的网络设备,其中,The network device of claim 253, wherein:
    所述二维矩阵中的第ij个位置的数值用于表示在所述第四维度下的第i个第一粒度以及第五维度的第j个第二粒度下的信道质量;i和j均为正整数。The value at the ijth position in the two-dimensional matrix is used to represent the channel quality at the i-th first granularity in the fourth dimension and the j-th second granularity in the fifth dimension; both i and j are is a positive integer.
  255. 根据权利要求252-254任一项所述的网络设备,其中,所述T个维度中还包括第六维度。The network device according to any one of claims 252-254, wherein the T dimensions further include a sixth dimension.
  256. 根据权利要求255所述的网络设备,其中,The network device of claim 255, wherein,
    所述T个维度的矩阵为M×N×W的三维矩阵;其中,M表示在第四维度下的第一粒度的数量,N表示在第五维度下的第二粒度的数量,W表示在第六维度下的第三粒度的数量;M、N和W均为正整数。The matrix of T dimensions is a three-dimensional matrix of M×N×W; wherein, M represents the number of first granularities in the fourth dimension, N represents the number of second granularities in the fifth dimension, and W represents the number of granularities in the fifth dimension. The quantity of the third granularity under the sixth dimension; M, N and W are all positive integers.
  257. 根据权利要求256所述的网络设备,其中,The network device of claim 256, wherein:
    所述三维矩阵中的第ijk个位置的数值用于表示在所述第四维度下的第i个第一粒度、第五维度的第j个第二粒度下、所述第六维度的第k个第三粒度下的信道质量;i、j和k均为正整数。The value of the ijkth position in the three-dimensional matrix is used to represent the i-th first granularity in the fourth dimension, the j-th second granularity in the fifth dimension, and the k-th in the sixth dimension channel quality at the third granularity; i, j and k are all positive integers.
  258. 根据权利要求252-257任一项所述的网络设备,其中,The network device according to any one of claims 252-257, wherein,
    所述第四维度为频域维度;所述第一粒度包含以下之一:L1个资源块RB,L2个子载波;L1和L2为正整数。The fourth dimension is a frequency domain dimension; the first granularity includes one of the following: L1 resource blocks RB, L2 subcarriers; L1 and L2 are positive integers.
  259. 根据权利要求252-257任一项所述的网络设备,其中,The network device according to any one of claims 252-257, wherein,
    所述第四维度为时域维度;所述第一粒度包含以下之一:K1个微秒、K2个符号长度、K3个符号的采样点个数;K1、K2和K3为正整数。The fourth dimension is a time domain dimension; the first granularity includes one of the following: K1 microseconds, K2 symbol length, K3 number of sampling points of symbols; K1, K2 and K3 are positive integers.
  260. 根据权利要求259所述的网络设备,其中,所述符号为OFDM符号。The network device of claim 259, wherein the symbols are OFDM symbols.
  261. 根据权利要求252-260任一项所述的网络设备,其中,所述第五维度为空间域维度;The network device according to any one of claims 252-260, wherein the fifth dimension is a spatial domain dimension;
    所述第二粒度为一对收发天线或到达角度的间隔。The second granularity is an interval between a pair of transmitting and receiving antennas or an angle of arrival.
  262. 根据权利要求255-261任一项所述的网络设备,其中,The network device according to any one of claims 255-261, wherein,
    所述第六维度为复数维度;所述第三粒度为1,在所述复数维度下的第三粒度的数量W为2。The sixth dimension is a complex dimension; the third granularity is 1, and the quantity W of the third granularity under the complex dimension is 2.
  263. 根据权利要求262所述的网络设备,其中,所述k为第一值的情况下,所述三维矩阵中的第ijk个位置的数值用于表示在所述第四维度下的第i个第一粒度、所述第五维度的第j个第二粒度下的信道质量的实部;The network device according to claim 262, wherein, when the k is the first value, the value of the ijk-th position in the three-dimensional matrix is used to represent the i-th position in the fourth dimension A granularity, the real part of the channel quality at the jth second granularity of the fifth dimension;
    所述k为第二值的情况下,所述三维矩阵中的第ijk个位置的数值用于表示在所述第四维度下的第i个第一粒度、所述第五维度的第j个第二粒度下的信道质量的虚部。When the k is the second value, the value of the ijkth position in the three-dimensional matrix is used to represent the i-th first granularity in the fourth dimension and the j-th granularity in the fifth dimension The imaginary part of the channel quality at the second granularity.
  264. 根据权利要求231-263任一项所述的网络设备,其中,所述第二通信单元,用于发送所述第二模型。The network device according to any one of claims 231-263, wherein the second communication unit is configured to send the second model.
  265. 根据权利要求231-264任一项所述的网络设备,其中,所述第二通信单元,用于发送所述第一模型。The network device according to any one of claims 231-264, wherein the second communication unit is configured to send the first model.
  266. 根据权利要求231-264任一项所述的网络设备,其中,所述第二通信单元,用于所述估计子模型以及所述压缩子模型。The network device according to any one of claims 231-264, wherein the second communication unit is used for the estimation sub-model and the compression sub-model.
  267. 根据权利要求231-264任一项所述的网络设备,其中,所述第二通信单元,用于发送所述估计子模型、所述压缩子模型以及所述信道生成子模型。The network device according to any one of claims 231-264, wherein the second communication unit is configured to send the estimation sub-model, the compression sub-model and the channel generation sub-model.
  268. 根据权利要求231-267任一项所述的网络设备,其中,所述第二通信单元,用于发送所述第三模型。The network device according to any one of claims 231-267, wherein the second communication unit is configured to send the third model.
  269. 根据权利要求265-268任一项所述的网络设备,其中,所述第一模型由以下至少之一携带:下行控制信令、MAC CE消息、RRC消息、广播消息、下行数据传输、针对人工智能类业务传输需求的下行数据传输;The network device according to any one of claims 265-268, wherein the first model is carried by at least one of the following: downlink control signaling, MAC CE message, RRC message, broadcast message, downlink data transmission, manual Downlink data transmission required for intelligent business transmission;
    所述第二模型由以下至少之一携带:下行控制信令、MAC CE消息、RRC消息、广播消息、下行数据传输、针对人工智能类业务传输需求的下行数据传输;The second model is carried by at least one of the following: downlink control signaling, MAC CE message, RRC message, broadcast message, downlink data transmission, downlink data transmission for artificial intelligence business transmission requirements;
    所述第三模型由以下至少之一携带:下行控制信令、MAC CE消息、RRC消息、广播消息、下行数据传输、针对人工智能类业务传输需求的下行数据传输;The third model is carried by at least one of the following: downlink control signaling, MAC CE message, RRC message, broadcast message, downlink data transmission, downlink data transmission for artificial intelligence business transmission requirements;
    所述估计子模型由以下至少之一携带:下行控制信令、MAC CE消息、RRC消息、广播消息、下行数据传输、针对人工智能类业务传输需求的下行数据传输;The estimated sub-model is carried by at least one of the following: downlink control signaling, MAC CE message, RRC message, broadcast message, downlink data transmission, downlink data transmission for artificial intelligence business transmission requirements;
    所述压缩子模型由以下至少之一携带:下行控制信令、MAC CE消息、RRC消息、广播消息、下行数据传输、针对人工智能类业务传输需求的下行数据传输;The compressed sub-model is carried by at least one of the following: downlink control signaling, MAC CE message, RRC message, broadcast message, downlink data transmission, downlink data transmission for artificial intelligence business transmission requirements;
    所述信道生成子模型由以下至少之一携带:下行控制信令、MAC CE消息、RRC消息、广播消息、下行数据传输、针对人工智能类业务传输需求的下行数据传输。The channel generation sub-model is carried by at least one of the following: downlink control signaling, MAC CE message, RRC message, broadcast message, downlink data transmission, downlink data transmission for artificial intelligence business transmission requirements.
  270. 一种电子设备,包括:An electronic device comprising:
    第三处理单元,用于采用训练样本对第一预设模型和第二预设模型进行联合训练,得到训练后的第一模型和第二模型;The third processing unit is configured to use training samples to jointly train the first preset model and the second preset model to obtain the trained first model and the second model;
    其中,所述第一模型为训练后的所述第一预设模型,所述第二模型为训练后的所述第二预设模型;所述第一模型用于对第一信息进行处理得到第二信息;所述第二模型用于对所述第二信息进行处理得到信道信息。Wherein, the first model is the first preset model after training, and the second model is the second preset model after training; the first model is used to process the first information to obtain Second information; the second model is used to process the second information to obtain channel information.
  271. 根据权利要求270所述的电子设备,其中,所述训练采用的损失函数为第一损失函数;The electronic device according to claim 270, wherein the loss function used in the training is a first loss function;
    所述第一损失函数为基于所述第二预设模型的输出信息与所述第一预设模型的压缩预设子模型的输入信息之间的差异程度构建的。The first loss function is constructed based on the degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model.
  272. 根据权利要求271所述的电子设备,其中,所述第二预设模型的输出信息与所述压缩预设子模型的输入信息之间的差异程度为基于距离确定的,或者为基于相似程度确定的。The electronic device according to claim 271, wherein the degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model is determined based on a distance, or determined based on a degree of similarity of.
  273. 根据权利要求271或272所述的电子设备,其中,所述第三处理单元,用于将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The electronic device according to claim 271 or 272, wherein the third processing unit is configured to input a first training sample into the estimated preset sub-model of the first preset model to obtain the estimated preset sub-model The initial information output by the model;
    将所述初始信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的信息;inputting the initial information into the compressed preset sub-model of the first preset model to obtain compressed information output by the compressed preset sub-model;
    将所述压缩后的信息输入所述第二预设模型,得到所述第二预设模型输出的恢复信息;inputting the compressed information into the second preset model to obtain restoration information output by the second preset model;
    基于所述恢复信息以及所述初始信息之间的差异程度,确定所述第一损失函数;determining the first loss function based on the degree of difference between the restoration information and the initial information;
    根据所述第一损失函数进行反向传导更新所述第一预设模型和所述第二预设模型。performing reverse conduction according to the first loss function to update the first preset model and the second preset model.
  274. 根据权利要求271或272所述的电子设备,其中,所述第三处理单元,用于将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The electronic device according to claim 271 or 272, wherein the third processing unit is configured to input a first training sample into the estimated preset sub-model of the first preset model to obtain the estimated preset sub-model The initial information output by the model;
    将所述初始信息输入所述第一预设模型的信道生成预设子模型,得到所述信道生成预设子模型输出的初始信息的特征向量信息;inputting the initial information into the channel generation preset sub-model of the first preset model, and obtaining eigenvector information of the initial information output by the channel generation preset sub-model;
    将所述初始信息的特征向量信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的特征向量信息;Inputting the eigenvector information of the initial information into the compressed preset sub-model of the first preset model to obtain the compressed eigenvector information output by the compressed preset sub-model;
    将所述压缩后的特征向量信息输入所述第二预设模型,得到所述第二预设模型输出的恢复的特征向量信息;inputting the compressed feature vector information into the second preset model to obtain restored feature vector information output by the second preset model;
    基于所述恢复的特征向量信息以及所述初始信息的特征向量信息之间的差异程度,确定第一损失函数;determining a first loss function based on the degree of difference between the restored eigenvector information and the eigenvector information of the initial information;
    根据所述第一损失函数进行反向传导更新所述第一预设模型和所述第二预设模型。performing reverse conduction according to the first loss function to update the first preset model and the second preset model.
  275. 根据权利要求271或272所述的电子设备,其中,所述第三处理单元,用于采用训练样本对第一预设模型、第二预设模型和第三预设模型进行联合训练,得到训练后的所述第一模型、所述第二模型和第三模型;其中,所述第三模型为训练后的第三预设模型。The electronic device according to claim 271 or 272, wherein the third processing unit is configured to use training samples to jointly train the first preset model, the second preset model and the third preset model to obtain the training After the first model, the second model and the third model; wherein, the third model is the third preset model after training.
  276. 根据权利要求275所述的电子设备,其中,所述第三处理单元,用于将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The electronic device according to claim 275, wherein the third processing unit is configured to input a first training sample into an estimated preset sub-model of the first preset model, and obtain an output of the estimated preset sub-model the initial information;
    将所述初始信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的信息;inputting the initial information into the compressed preset sub-model of the first preset model to obtain compressed information output by the compressed preset sub-model;
    将所述压缩后的信息输入第三预设模型,得到所述第三预设模型输出的变换后的信息;inputting the compressed information into a third preset model to obtain transformed information output by the third preset model;
    将所述变换后的信息输入所述第二预设模型,得到所述第二预设模型输出的恢复信息;inputting the transformed information into the second preset model to obtain restoration information output by the second preset model;
    基于所述恢复信息以及所述初始信息之间的差异程度,确定第一损失函数;determining a first loss function based on the degree of difference between the restoration information and the initial information;
    根据所述第一损失函数进行反向传导更新所述第一预设模型、第二预设模型和第三预设模型。performing reverse conduction according to the first loss function to update the first preset model, the second preset model and the third preset model.
  277. 根据权利要求275所述的电子设备,其中,所述第三处理单元,用于将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The electronic device according to claim 275, wherein the third processing unit is configured to input a first training sample into an estimated preset sub-model of the first preset model, and obtain an output of the estimated preset sub-model the initial information;
    将所述初始信息输入所述第一预设模型的信道生成预设子模型,得到所述信道生成预设子模型输出的初始信息的特 征向量信息;Inputting the initial information into the channel generation preset sub-model of the first preset model to obtain the eigenvector information of the initial information output by the channel generation preset sub-model;
    将所述初始信息的特征向量信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的特征向量信息;Inputting the eigenvector information of the initial information into the compressed preset sub-model of the first preset model to obtain the compressed eigenvector information output by the compressed preset sub-model;
    将所述压缩后的特征向量信息输入第三预设模型,得到所述第三预设模型输出的变换后的特征向量信息;Inputting the compressed feature vector information into a third preset model to obtain transformed feature vector information output by the third preset model;
    将所述变换后的特征向量信息输入所述第二预设模型,得到所述第二预设模型输出的恢复的特征向量信息;inputting the transformed eigenvector information into the second preset model to obtain restored eigenvector information output by the second preset model;
    基于所述恢复的特征向量信息以及所述初始信息的特征向量信息之间的差异程度,确定第一损失函数;determining a first loss function based on the degree of difference between the restored eigenvector information and the eigenvector information of the initial information;
    根据所述第一损失函数进行反向传导更新所述第一预设模型、第二预设模型和第三预设模型。performing reverse conduction according to the first loss function to update the first preset model, the second preset model and the third preset model.
  278. 根据权利要求270所述的电子设备,其中,所述训练采用的损失函数为第二损失函数;The electronic device according to claim 270, wherein the loss function used in the training is a second loss function;
    所述第二损失函数为基于所述第二预设模型的输出信息与所述第一预设模型的压缩预设子模型的输入信息之间的第一差异程度以及所述第一预设模型的估计预设子模型的输出信息与第二训练样本之间的第二差异程度构建的;其中,所述第二训练样本与输入所述估计预设子模型的第一训练样本相对应。The second loss function is based on the first degree of difference between the output information of the second preset model and the input information of the compressed preset sub-model of the first preset model and the first preset model The second difference degree between the output information of the estimated preset sub-model and the second training sample is constructed; wherein, the second training sample corresponds to the first training sample input into the estimated preset sub-model.
  279. 根据权利要求278所述的电子设备,其中,所述第一差异程度为基于距离确定的,或者为基于相似程度确定的;和/或,The electronic device according to claim 278, wherein the first degree of difference is determined based on a distance, or determined based on a degree of similarity; and/or,
    所述第二差异程度为基于距离确定的,或者为基于相似程度确定的。The second degree of difference is determined based on distance, or determined based on similarity.
  280. 根据权利要求278或279所述的电子设备,其中,所述第三处理单元,用于将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The electronic device according to claim 278 or 279, wherein the third processing unit is configured to input the first training sample into the estimated preset sub-model of the first preset model to obtain the estimated preset sub-model The initial information output by the model;
    将所述初始信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的信息;inputting the initial information into the compressed preset sub-model of the first preset model to obtain compressed information output by the compressed preset sub-model;
    将所述压缩后的信息输入所述第二预设模型,得到所述第二预设模型输出的恢复信息;inputting the compressed information into the second preset model to obtain restoration information output by the second preset model;
    基于所述恢复信息与所述初始信息之间的第一差异程度,以及基于所述初始信息与第二训练样本之间的第二差异程度,确定第二损失函数;所述第二训练样本与所述第一训练样本对应;Determine a second loss function based on a first degree of difference between the restored information and the initial information, and based on a second degree of difference between the initial information and a second training sample; the second training sample and The first training sample corresponds to;
    根据所述第二损失函数进行反向传导更新所述第一预设模型和所述第二预设模型。performing reverse conduction according to the second loss function to update the first preset model and the second preset model.
  281. 根据权利要求278或279所述的电子设备,其中,所述第三处理单元,用于将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The electronic device according to claim 278 or 279, wherein the third processing unit is configured to input the first training sample into the estimated preset sub-model of the first preset model to obtain the estimated preset sub-model The initial information output by the model;
    将所述初始信息输入所述第一预设模型的信道生成预设子模型,得到所述信道生成预设子模型输出的初始信息的特征向量信息;inputting the initial information into the channel generation preset sub-model of the first preset model, and obtaining eigenvector information of the initial information output by the channel generation preset sub-model;
    将所述初始信息的特征向量信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的特征向量信息;Inputting the eigenvector information of the initial information into the compressed preset sub-model of the first preset model to obtain the compressed eigenvector information output by the compressed preset sub-model;
    将所述压缩后的特征向量信息输入所述第二预设模型,得到所述第二预设模型输出的恢复的特征向量信息;inputting the compressed feature vector information into the second preset model to obtain restored feature vector information output by the second preset model;
    基于所述恢复的特征向量信息以及所述初始信息的特征向量信息之间的第一差异程度,以及基于所述初始信息与第二训练样本之间的第二差异程度,确定第二损失函数;所述第二训练样本与所述第一训练样本对应;determining a second loss function based on a first degree of difference between the restored feature vector information and the feature vector information of the initial information, and based on a second degree of difference between the initial information and a second training sample; The second training sample corresponds to the first training sample;
    根据所述第二损失函数进行反向传导更新所述第一预设模型和所述第二预设模型。performing reverse conduction according to the second loss function to update the first preset model and the second preset model.
  282. 根据权利要求278或279所述的电子设备,其中,所述第三处理单元,用于采用训练样本对第一预设模型、第二预设模型和第三预设模型进行联合训练,得到训练后的所述第一模型、所述第二模型和第三模型;其中,所述第三模型为训练后的第三预设模型。The electronic device according to claim 278 or 279, wherein the third processing unit is configured to use training samples to jointly train the first preset model, the second preset model and the third preset model to obtain the training After the first model, the second model and the third model; wherein, the third model is the third preset model after training.
  283. 根据权利要求282所述的电子设备,其中,所述第三处理单元,用于将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The electronic device according to claim 282, wherein the third processing unit is configured to input a first training sample into an estimated preset sub-model of the first preset model, and obtain an output of the estimated preset sub-model the initial information;
    将所述初始信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的信息;inputting the initial information into the compressed preset sub-model of the first preset model to obtain compressed information output by the compressed preset sub-model;
    将所述压缩后的信息输入第三预设模型,得到所述第三预设模型输出的变换后的信息;inputting the compressed information into a third preset model to obtain transformed information output by the third preset model;
    将所述变换后的信息输入所述第二预设模型,得到所述第二预设模型输出的恢复信息;inputting the transformed information into the second preset model to obtain restoration information output by the second preset model;
    基于所述恢复信息与所述初始信息之间的第一差异程度,以及基于所述初始信息与第二训练样本之间的第二差异程度,确定第二损失函数;所述第二训练样本与所述第一训练样本对应;Determine a second loss function based on a first degree of difference between the restored information and the initial information, and based on a second degree of difference between the initial information and a second training sample; the second training sample and The first training sample corresponds to;
    根据所述第二损失函数进行反向传导更新所述第一预设模型、所述第二预设模型和所述第三预设模型。performing reverse conduction according to the second loss function to update the first preset model, the second preset model, and the third preset model.
  284. 根据权利要求282所述的电子设备,其中,所述第三处理单元,用于将第一训练样本输入所述第一预设模型的估计预设子模型,得到所述估计预设子模型输出的初始信息;The electronic device according to claim 282, wherein the third processing unit is configured to input a first training sample into an estimated preset sub-model of the first preset model, and obtain an output of the estimated preset sub-model the initial information;
    将所述初始信息输入所述第一预设模型的信道生成预设子模型,得到所述信道生成预设子模型输出的初始信息的特征向量信息;inputting the initial information into the channel generation preset sub-model of the first preset model, and obtaining eigenvector information of the initial information output by the channel generation preset sub-model;
    将所述初始信息的特征向量信息输入所述第一预设模型的压缩预设子模型,得到所述压缩预设子模型输出的压缩后的特征向量信息;Inputting the eigenvector information of the initial information into the compressed preset sub-model of the first preset model to obtain the compressed eigenvector information output by the compressed preset sub-model;
    将所述压缩后的特征向量信息输入第三预设模型,得到所述第三预设模型输出的变换后的特征向量信息;Inputting the compressed feature vector information into a third preset model to obtain transformed feature vector information output by the third preset model;
    将所述变换后的特征向量信息输入所述第二预设模型,得到所述第二预设模型输出的恢复的特征向量信息;inputting the transformed eigenvector information into the second preset model to obtain restored eigenvector information output by the second preset model;
    基于所述恢复的特征向量信息以及所述初始信息的特征向量信息之间的第一差异程度,以及基于所述初始信息与第二训练样本之间的第二差异程度,确定第二损失函数;所述第二训练样本与所述第一训练样本对应;determining a second loss function based on a first degree of difference between the restored feature vector information and the feature vector information of the initial information, and based on a second degree of difference between the initial information and a second training sample; The second training sample corresponds to the first training sample;
    根据所述第二损失函数进行反向传导更新所述第一预设模型和所述第二预设模型。performing reverse conduction according to the second loss function to update the first preset model and the second preset model.
  285. 根据权利要求270-284任一项所述的电子设备,其中,所述训练样本中包含第一训练样本。The electronic device according to any one of claims 270-284, wherein the training samples include a first training sample.
  286. 根据权利要求285所述的电子设备,其中,The electronic device of claim 285, wherein:
    所述第一训练样本分布在第一维度和/或第二维度。The first training samples are distributed in the first dimension and/or the second dimension.
  287. 根据权利要求285所述的电子设备,其中,The electronic device of claim 285, wherein:
    所述第一维度为时域维度;所述第一训练样本包括在所述时域维度中的m个时间单元内分布的第一信息样本;m为正整数。The first dimension is a time domain dimension; the first training samples include first information samples distributed in m time units in the time domain dimension; m is a positive integer.
  288. 根据权利要求285所述的电子设备,其中,The electronic device of claim 285, wherein:
    所述第二维度为频域维度;所述第一训练样本包括在所述频域维度中的x个频域资源上分布的第一信息样本;x为 正整数。The second dimension is a frequency domain dimension; the first training sample includes first information samples distributed on x frequency domain resources in the frequency domain dimension; x is a positive integer.
  289. 根据权利要求286-288任一项所述的电子设备,其中,所述第一训练样本还分布在第三维度;The electronic device according to any one of claims 286-288, wherein the first training samples are further distributed in a third dimension;
    所述第三维度为复数维度;所述第一训练样本包括第一信息样本的实部和第一信息样本的虚部。The third dimension is a complex dimension; the first training samples include the real part of the first information sample and the imaginary part of the first information sample.
  290. 根据权利要求285-289任一项所述的电子设备,其中,所述训练样本中还包含与所述第一训练样本对应的第二训练样本;The electronic device according to any one of claims 285-289, wherein the training samples further include a second training sample corresponding to the first training sample;
    所述第二训练样本由T个维度的矩阵构成;T为大于等于2的整数。The second training sample is composed of a matrix of T dimensions; T is an integer greater than or equal to 2.
  291. 根据权利要求290所述的电子设备,其中,The electronic device according to claim 290, wherein,
    所述T个维度中包含第四维度和第五维度。The T dimensions include a fourth dimension and a fifth dimension.
  292. 根据权利要求291所述的电子设备,其中,The electronic device of claim 291, wherein:
    所述T个维度的矩阵为M×N的二维矩阵;其中,M表示在第四维度下的第一粒度的数量,N表示在第五维度下的第二粒度的数量;M和N均为正整数。The matrix of the T dimensions is a two-dimensional matrix of M×N; wherein, M represents the quantity of the first granularity under the fourth dimension, and N represents the quantity of the second granularity under the fifth dimension; both M and N are is a positive integer.
  293. 根据权利要求292所述的电子设备,其中,The electronic device of claim 292, wherein:
    所述二维矩阵中的第ij个位置的数值用于表示在所述第四维度下的第i个第一粒度以及第五维度的第j个第二粒度下的信道质量;i和j均为正整数。The value at the ijth position in the two-dimensional matrix is used to represent the channel quality at the i-th first granularity in the fourth dimension and the j-th second granularity in the fifth dimension; both i and j are is a positive integer.
  294. 根据权利要求291-293任一项所述的电子设备,其中,所述T个维度中还包括第六维度。The electronic device according to any one of claims 291-293, wherein the T dimensions further include a sixth dimension.
  295. 根据权利要求294所述的电子设备,其中,The electronic device of claim 294, wherein:
    所述T个维度的矩阵为M×N×W的三维矩阵;其中,M表示在第四维度下的第一粒度的数量,N表示在第五维度下的第二粒度的数量,W表示在第六维度下的第三粒度的数量;M、N和W均为正整数。The matrix of T dimensions is a three-dimensional matrix of M×N×W; wherein, M represents the number of first granularities in the fourth dimension, N represents the number of second granularities in the fifth dimension, and W represents the number of granularities in the fifth dimension. The quantity of the third granularity under the sixth dimension; M, N and W are all positive integers.
  296. 根据权利要求295所述的电子设备,其中,The electronic device of claim 295, wherein:
    所述三维矩阵中的第ijk个位置的数值用于表示在所述第四维度下的第i个第一粒度、第五维度的第j个第二粒度下、所述第六维度的第k个第三粒度下的信道质量;i、j和k均为正整数。The value of the ijkth position in the three-dimensional matrix is used to represent the i-th first granularity in the fourth dimension, the j-th second granularity in the fifth dimension, and the k-th in the sixth dimension channel quality at the third granularity; i, j and k are all positive integers.
  297. 根据权利要求291-296任一项所述的电子设备,其中,The electronic device according to any one of claims 291-296, wherein,
    所述第四维度为频域维度;所述第一粒度包含以下之一:L1个资源块RB,L2个子载波;L1和L2为正整数。The fourth dimension is a frequency domain dimension; the first granularity includes one of the following: L1 resource blocks RB, L2 subcarriers; L1 and L2 are positive integers.
  298. 根据权利要求291-296任一项所述的电子设备,其中,The electronic device according to any one of claims 291-296, wherein,
    所述第四维度为时域维度;所述第一粒度包含以下之一:K1个微秒、K2个符号长度、K3个符号的采样点个数;K1、K2和K3为正整数。The fourth dimension is a time domain dimension; the first granularity includes one of the following: K1 microseconds, K2 symbol length, K3 number of sampling points of symbols; K1, K2 and K3 are positive integers.
  299. 根据权利要求298所述的电子设备,其中,所述符号为OFDM符号。The electronic device of claim 298, wherein the symbols are OFDM symbols.
  300. 根据权利要求291-299任一项所述的电子设备,其中,所述第五维度为空间域维度;The electronic device according to any one of claims 291-299, wherein the fifth dimension is a spatial domain dimension;
    所述第二粒度为一对收发天线或到达角度的间隔。The second granularity is an interval between a pair of transmitting and receiving antennas or an angle of arrival.
  301. 根据权利要求294-300任一项所述的电子设备,其中,The electronic device according to any one of claims 294-300, wherein,
    所述第六维度为复数维度;所述第三粒度为1,在所述复数维度下的第三粒度的数量W为2。The sixth dimension is a complex dimension; the third granularity is 1, and the quantity W of the third granularity under the complex dimension is 2.
  302. 根据权利要求301所述的电子设备,其中,所述k为第一值的情况下,所述三维矩阵中的第ijk个位置的数值用于表示在所述第四维度下的第i个第一粒度、所述第五维度的第j个第二粒度下的信道质量的实部;The electronic device according to claim 301, wherein, when the k is the first value, the value of the ijkth position in the three-dimensional matrix is used to represent the i-th position in the fourth dimension A granularity, the real part of the channel quality at the jth second granularity of the fifth dimension;
    所述k为第二值的情况下,所述三维矩阵中的第ijk个位置的数值用于表示在所述第四维度下的第i个第一粒度、所述第五维度的第j个第二粒度下的信道质量的虚部。When the k is the second value, the value of the ijkth position in the three-dimensional matrix is used to represent the i-th first granularity in the fourth dimension and the j-th granularity in the fifth dimension The imaginary part of the channel quality at the second granularity.
  303. 一种终端设备,包括:处理器和存储器,该存储器用于存储计算机程序,所述处理器用于调用并运行所述存储器中存储的计算机程序,以使所述终端设备执行如权利要求1至58中任一项所述的方法。A terminal device, comprising: a processor and a memory, the memory is used to store a computer program, and the processor is used to call and run the computer program stored in the memory, so that the terminal device executes claims 1 to 58 any one of the methods described.
  304. 一种网络设备,包括:处理器和存储器,该存储器用于存储计算机程序,所述处理器用于调用并运行所述存储器中存储的计算机程序,以使所述网络设备执行如权利要求59至118中任一项所述的方法。A network device, comprising: a processor and a memory, the memory is used to store a computer program, and the processor is used to invoke and run the computer program stored in the memory, so that the network device performs the tasks described in claims 59 to 118 any one of the methods described.
  305. 一种电子设备,包括:处理器和存储器,该存储器用于存储计算机程序,所述处理器用于调用并运行所述存储器中存储的计算机程序,以使所述网络设备执行如权利要求119至151中任一项所述的方法。An electronic device, comprising: a processor and a memory, the memory is used to store a computer program, the processor is used to call and run the computer program stored in the memory, so that the network device performs the any one of the methods described.
  306. 一种芯片,包括:处理器,用于从存储器中调用并运行计算机程序,使得安装有所述芯片的设备执行如权利要求1至58中任一项所述的方法。A chip, comprising: a processor, configured to invoke and run a computer program from a memory, so that a device equipped with the chip executes the method as claimed in any one of claims 1 to 58.
  307. 一种芯片,包括:处理器,用于从存储器中调用并运行计算机程序,使得安装有所述芯片的设备执行如权利要求59至118中任一项所述的方法。A chip, comprising: a processor for invoking and running a computer program from a memory, so that a device equipped with the chip executes the method as claimed in any one of claims 59 to 118.
  308. 一种芯片,包括:处理器,用于从存储器中调用并运行计算机程序,使得安装有所述芯片的设备执行如权利要求119至151中任一项所述的方法。A chip, comprising: a processor, configured to invoke and run a computer program from a memory, so that a device equipped with the chip executes the method as claimed in any one of claims 119 to 151.
  309. 一种计算机可读存储介质,用于存储计算机程序,当所述计算机程序被设备运行时使得所述设备执行如权利要求1至151中任一项所述的方法。A computer-readable storage medium for storing a computer program, which causes the device to execute the method according to any one of claims 1 to 151 when the computer program is executed by the device.
  310. 一种计算机程序产品,包括计算机程序指令,该计算机程序指令使得计算机执行如权利要求1至151中任一项所述的方法。A computer program product comprising computer program instructions for causing a computer to perform the method as claimed in any one of claims 1 to 151.
  311. 一种计算机程序,所述计算机程序使得计算机执行如权利要求1至151中任一项所述的方法。A computer program that causes a computer to perform the method according to any one of claims 1 to 151.
  312. 一种通信系统,包括:A communication system comprising:
    终端设备,用于执行如权利要求1至58中任一项所述的方法;A terminal device, configured to perform the method according to any one of claims 1 to 58;
    网络设备,用于执行如权利要求59至118中任一项所述的方法。A network device configured to execute the method according to any one of claims 59-118.
PCT/CN2021/132593 2021-11-23 2021-11-23 Information processing method, model generation method, and devices WO2023092310A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN202180100859.8A CN117678257A (en) 2021-11-23 2021-11-23 Information processing method, model generation method and equipment
PCT/CN2021/132593 WO2023092310A1 (en) 2021-11-23 2021-11-23 Information processing method, model generation method, and devices

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2021/132593 WO2023092310A1 (en) 2021-11-23 2021-11-23 Information processing method, model generation method, and devices

Publications (1)

Publication Number Publication Date
WO2023092310A1 true WO2023092310A1 (en) 2023-06-01

Family

ID=86538648

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/132593 WO2023092310A1 (en) 2021-11-23 2021-11-23 Information processing method, model generation method, and devices

Country Status (2)

Country Link
CN (1) CN117678257A (en)
WO (1) WO2023092310A1 (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111404849A (en) * 2020-03-20 2020-07-10 北京航空航天大学 OFDM channel estimation and signal detection method based on deep learning
CN111464220A (en) * 2020-03-10 2020-07-28 西安交通大学 Channel state information reconstruction method based on deep learning
CN112737985A (en) * 2020-12-25 2021-04-30 东南大学 Large-scale MIMO channel joint estimation and feedback method based on deep learning
US20210218601A1 (en) * 2020-01-10 2021-07-15 Samsung Electronics Co., Ltd. Method and apparatus of parameter tracking for csi estimation
CN113497770A (en) * 2020-04-01 2021-10-12 清华大学 Fast time-varying channel parameter estimation method and device for OFDM system

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20210218601A1 (en) * 2020-01-10 2021-07-15 Samsung Electronics Co., Ltd. Method and apparatus of parameter tracking for csi estimation
CN111464220A (en) * 2020-03-10 2020-07-28 西安交通大学 Channel state information reconstruction method based on deep learning
CN111404849A (en) * 2020-03-20 2020-07-10 北京航空航天大学 OFDM channel estimation and signal detection method based on deep learning
CN113497770A (en) * 2020-04-01 2021-10-12 清华大学 Fast time-varying channel parameter estimation method and device for OFDM system
CN112737985A (en) * 2020-12-25 2021-04-30 东南大学 Large-scale MIMO channel joint estimation and feedback method based on deep learning

Also Published As

Publication number Publication date
CN117678257A (en) 2024-03-08

Similar Documents

Publication Publication Date Title
WO2022033456A1 (en) Channel state information measurement feedback method, and related apparatus
WO2021217519A1 (en) Method and apparatus for adjusting neural network
WO2021142631A1 (en) Method, device and computer readable medium of communication
WO2021142605A1 (en) Method and apparatus for channel measurement
US20220263560A1 (en) Channel state information reporting method and communications apparatus
JP2022529124A (en) Improvement of communication efficiency
US20240030980A1 (en) Data transmission method and apparatus
US20230164004A1 (en) Channel parameter obtaining method and apparatus
WO2021081847A1 (en) Channel measurement method and communication apparatus
US20240048413A1 (en) Channel restoration method and receiving device
WO2021159309A1 (en) Channel measurement method and communication device
US20230136416A1 (en) Neural network obtaining method and apparatus
US20230239028A1 (en) Channel measurement method and communication apparatus
WO2023123062A1 (en) Quality evaluation method for virtual channel sample, and device
WO2023092310A1 (en) Information processing method, model generation method, and devices
US11784853B2 (en) Channel measurement method and communication apparatus
WO2021189302A1 (en) Updating method and communication apparatus
WO2023092307A1 (en) Communication method, model training method, and device
US20230379020A1 (en) Precoding method and apparatus
WO2022267633A1 (en) Information transmission method and apparatus
WO2023102814A1 (en) Wireless communication method, terminal device, and network device
WO2023070459A1 (en) Wireless communication method, terminal device, and network device
WO2024093867A1 (en) Precoding indication method, and communication apparatus
WO2022083307A1 (en) Channel measurement method and communication apparatus
WO2024061093A1 (en) Uplink precoding indication method and communication apparatus

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21965049

Country of ref document: EP

Kind code of ref document: A1

WWE Wipo information: entry into national phase

Ref document number: 202180100859.8

Country of ref document: CN