WO2021051893A1 - 一种基于生成式对抗网络的蒙特卡洛渲染图去噪模型、方法及装置 - Google Patents
一种基于生成式对抗网络的蒙特卡洛渲染图去噪模型、方法及装置 Download PDFInfo
- Publication number
- WO2021051893A1 WO2021051893A1 PCT/CN2020/094759 CN2020094759W WO2021051893A1 WO 2021051893 A1 WO2021051893 A1 WO 2021051893A1 CN 2020094759 W CN2020094759 W CN 2020094759W WO 2021051893 A1 WO2021051893 A1 WO 2021051893A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- rendering
- monte carlo
- denoising
- rendering image
- network
- Prior art date
Links
- 238000009877 rendering Methods 0.000 title claims abstract description 335
- 238000000034 method Methods 0.000 title claims abstract description 70
- 238000010276 construction Methods 0.000 claims abstract description 5
- 230000008569 process Effects 0.000 claims description 42
- 238000005070 sampling Methods 0.000 claims description 18
- 230000004927 fusion Effects 0.000 claims description 16
- 238000000605 extraction Methods 0.000 claims description 14
- 238000013527 convolutional neural network Methods 0.000 claims description 9
- 239000000463 material Substances 0.000 claims description 9
- 238000004590 computer program Methods 0.000 claims description 6
- 230000004913 activation Effects 0.000 claims description 5
- 238000013528 artificial neural network Methods 0.000 claims description 2
- 230000000694 effects Effects 0.000 description 8
- 230000014759 maintenance of location Effects 0.000 description 4
- 238000005516 engineering process Methods 0.000 description 3
- 230000000007 visual effect Effects 0.000 description 3
- 238000000342 Monte Carlo simulation Methods 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 238000005457 optimization Methods 0.000 description 2
- 238000007792 addition Methods 0.000 description 1
- 238000013135 deep learning Methods 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 230000002452 interceptive effect Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000001537 neural effect Effects 0.000 description 1
- 230000000306 recurrent effect Effects 0.000 description 1
- 230000016776 visual perception Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T5/00—Image enhancement or restoration
- G06T5/70—Denoising; Smoothing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T5/00—Image enhancement or restoration
- G06T5/20—Image enhancement or restoration using local operators
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T5/00—Image enhancement or restoration
- G06T5/60—Image enhancement or restoration using machine learning, e.g. neural networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/20—Image preprocessing
- G06V10/30—Noise filtering
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/77—Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
- G06V10/7715—Feature extraction, e.g. by transforming the feature space, e.g. multi-dimensional scaling [MDS]; Mappings, e.g. subspace methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/77—Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
- G06V10/774—Generating sets of training patterns; Bootstrap methods, e.g. bagging or boosting
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/77—Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
- G06V10/80—Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level
- G06V10/806—Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level of extracted features
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/82—Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20081—Training; Learning
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20084—Artificial neural networks [ANN]
Definitions
- the invention belongs to the field of image denoising, and in particular relates to a Monte Carlo rendering graph denoising model, method and device based on a generative confrontation network.
- Monte-Carlo Simulation (Monte-Carlo Simulation) rendering technology
- the variance convergence of the rendered image requires a large amount of sampling, it consumes a lot of time and computing resources.
- a lower sampling rate is used for rendering to obtain a noisy rendered image, and then certain denoising techniques are used to denoise the rendered image to obtain a noise-free and visual performance Better rendering.
- the more cutting-edge denoising technology for Monte Carlo rendering is mostly based on deep learning.
- the most commonly used is to use a convolutional neural network to denoise the Monte Carlo rendering.
- the L1 norm/L2 norm loss function of the Monte Carlo rendering and the target noise-free image are used as the goal of optimizing regression.
- the convolutional neural network is trained, and the trained convolutional neural model can realize the denoising of the Monte Carlo rendering.
- the purpose of the present invention is to provide a Monte Carlo rendering image denoising model based on a generative confrontation network and its establishment method.
- the established Monte Carlo rendering image denoising model can realize the denoising of the Monte Carlo rendering image containing noise. Noise, while achieving a good denoising effect on low-frequency details, it can also significantly improve the retention of high-frequency details to obtain a more visually realistic rendering.
- Another object of the present invention is to provide a denoising method and device for a Te Carlo rendering map.
- the denoising method and device use the Monte Carlo rendering map denoising model constructed as described above, which can realize the denoising of the Monte Carlo rendering map. Denoising, while achieving a good denoising effect on low-frequency details, it can also significantly improve the retention of high-frequency details to obtain a more visually realistic rendering.
- the first embodiment provides a method for constructing a Monte Carlo rendering graph denoising model based on a generative confrontation network, which includes the following steps:
- the generative confrontation network includes a denoising network and a discriminant network.
- the denoising network is used to denoise the input noise rendering image and auxiliary features, and output the denoising rendering image.
- the network is used to classify the input denoising rendering image and the target rendering image corresponding to the noise rendering image, and output the classification result;
- the training samples are used to tune the network parameters of the generative confrontation network. After the tuning is completed, the denoising network determined by the network parameters is used as the Monte Carlo rendering graph denoising model.
- the second embodiment provides a Monte Carlo rendering map denoising model based on a generative confrontation network, and the Monte Carlo rendering map denoising model is constructed by the construction method provided in the first embodiment.
- the Monte Carlo rendering image denoising model is a Monte Carlo rendering image denoising model M d , which is the Monte Carlo rendering image P d obtained by rendering using the diffuse path rendering process, and generating the Monte Carlo rendering image assist features when P d, and the target P d Monte Carlo rendering rendering corresponding training samples obtained as training;
- the third embodiment provides a Monte Carlo rendering image denoising method, including the following steps:
- the rendering process of the rendering engine is split into the diffuse path rendering process and the specular path rendering process;
- the denoising rendering image P d 'and the denoising rendering image P s ' are merged to obtain the final denoising rendering image.
- the fourth embodiment provides a denoising device for Monte Carlo rendering, including a computer memory, a computer processor, and a computer program stored in the computer memory and executable on the computer processor.
- the said Monte Carlo rendering image denoising model M s and the Monte Carlo rendering image denoising model M d are stored in the computer memory;
- the rendering process of the rendering engine is split into the diffuse path rendering process and the specular path rendering process;
- the diffuse path rendering process and the specular path rendering process are used for rendering respectively to obtain low sampling rate Monte Carlo rendering images P d and Monte Carlo rendering images P s , and generating Monte Carlo rendering images P d and Monte Carlo rendering images at the same time Auxiliary features corresponding to P s;
- the denoising rendering image P d 'and the denoising rendering image P s ' are merged to obtain the final denoising rendering image.
- the Monte Carlo rendering image denoising model has stronger denoising capabilities, and the denoising rendering image obtained after denoising can bring better denoising effects in terms of human visual perception.
- the denoising method and device for the Te Carlo rendering map use the Monte Carlo rendering map denoising model, which can achieve the rendering effect that can be achieved by using a lower sampling rate to achieve a high sampling rate, and the time for denoising is only limited to On the order of one second, it is far less than the rendering time required for multi-sampling (on the order of hundreds to thousands of seconds), which greatly saves rendering time and computing costs, thereby reducing the use of servers and reducing the industry cost of the entire rendering service. save resources.
- Figure 1 is a schematic diagram of the structure of a generative confrontation network
- Figure 2 is a schematic diagram of the training process of a generative confrontation network
- Fig. 3 is a schematic flow chart of a denoising method for a Monte Carlo rendering image.
- the Monte Carlo rendering image obtained often has a lot of noise.
- the following implementation provides a generative confrontation network based on The Monte Carlo rendering image denoising model and the method for establishing the same, a denoising method using the Monte Carlo rendering image denoising model, and a denoising device calling the Monte Carlo rendering image denoising model.
- An embodiment provides a method for establishing a Monte Carlo rendering graph denoising model based on a generative confrontation network, as shown in Figure 1 and Figure 2, which specifically includes the following processes:
- the goal that the Monte Carlo rendering image denoising model constructed in this embodiment can achieve is to perform a denoising operation on the input noise rendering image, and output the denoising rendering image whose image quality reaches the target rendering image.
- the present invention also considers adding other auxiliary features as the input of the Monte Carlo rendering image denoising model, so that the Monte Carlo rendering image denoising model can be used for denoising.
- auxiliary features include but are not limited to Normal Buffer, Depth Buffer, Material Texture Albedo Buffer.
- the noise rendering image and the corresponding auxiliary features, and the target rendering image corresponding to the noise rendering image are used as a training sample to construct a training sample set.
- the convolutional neural network is simply used to denoise the noise rendering image, and the obtained denoising rendering image lacks realism in details.
- this embodiment constructs Monte Carlo through adversarial learning Rendered image denoising model.
- the constructed generative confrontation network includes the denoising network Denoising Net and the discriminant network Critic Net.
- the Denoising Net denoising network is used to denoise the input noise rendering image and auxiliary features, and the output denoise Noise rendering map, the discrimination network Critic Net is used to classify the input denoising rendering map and the target rendering map corresponding to the noise rendering map, and output the classification results.
- the denoising network includes:
- the auxiliary graph feature extraction sub-network is a convolutional neural network including at least one convolutional layer, and is used to fuse input auxiliary features and output auxiliary feature maps;
- a rendering map feature extraction sub-network is a convolutional neural network including at least one convolutional layer, used for extracting features of the noise rendering map, and outputting the noise feature map;
- the feature fusion sub-network is a neural network that adopts the idea of residual error and uses convolutional layers to fuse and extract auxiliary feature maps and noise feature maps.
- the auxiliary graph feature extraction sub-network Encoder Net can be a convolutional neural network in which at least two convolutional layers Conv and an activation layer RelU are connected in sequence.
- the auxiliary feature fusion network Encoder Net can be as shown in Figure 1(c).
- the shown convolutional neural network specifically includes Conv k3n128s1, Leaky RelU, Conv k1n128s1, Leaky RelU, Conv k1n128s1, Leaky RelU, Conv k1n128s1, Leaky RelU and Conv k1n32k3 which are connected in sequence, where 128 is the convolution of 3Conv*RelU and Conv*K1n32s1. 3.
- the convolutional layer with 128 channels and 1 step size the explanation of other convolutional layers is similar, so I won't repeat them here.
- the feature fusion sub-network may include:
- the feature fusion unit is used to combine the auxiliary feature map and the noise feature map to output the modulation feature map, specifically including multiple auxiliary feature modulation modules CFM ResBlock, auxiliary feature modulation section CFM and convolutional layer connected in sequence, Among them, the input of the auxiliary feature modulation module CFM Block and the auxiliary feature modulation section CFM are the auxiliary feature map and the output of the previous layer, and the input of the first auxiliary feature modulation module CFM ResBlock is the noise feature map and the auxiliary feature map, and the convolutional layer The input of is the output of the auxiliary characteristic modulation section CFM, and the output is the modulation characteristic map;
- the output unit is used to perform feature fusion on the noise feature map output by the feature extraction unit and the modulation feature map output by the modulation unit, that is, the input is the feature map after the noise feature map and the modulation feature map are superimposed, and the output is the denoising rendering Figure.
- the auxiliary feature modulation module CFM ResBlock includes the auxiliary feature modulation section CFM, convolutional layer, activation layer, and superimposition operation.
- the auxiliary feature modulation section CFM is used to modulate the auxiliary feature and the last output feature, that is, the auxiliary feature modulation section CFM.
- the input of the feature modulation section CFM includes the auxiliary feature map and the output feature of the previous layer.
- the superimposition operation is used to superimpose the input of the auxiliary feature modulation module CFM ResBlock and the output of the final convolutional layer.
- the auxiliary characteristic modulation module CFM ResBlock includes the auxiliary characteristic modulation section CFM, Convk3n64s1, ReLU, the auxiliary characteristic modulation section CFM, Conv k3n64s1, and the superimposition operation ⁇ which are connected in sequence.
- the auxiliary characteristic modulation The input of section CFM includes the auxiliary feature map and the output feature of the previous layer.
- the superimposition operation is used to superimpose the input of the auxiliary feature modulation module CFM ResBlock and the output of Conv k3n64s1.
- the auxiliary feature modulation section CFM includes a convolution layer, a dot multiplication operation, and a superimposition operation.
- the input of the convolution layer is an auxiliary feature map
- the dot multiplication operation is used to point the output of the convolution layer and the output of the previous layer.
- the multiplication operation, the superposition operation is used to superimpose the output of the convolutional layer and the dot multiplication operation to output the feature map.
- the auxiliary feature modulation section CFM includes Conv k1n32s1, Leaky ReLU, Conv k1n64s1, dot multiplication ⁇ , and superposition operation ⁇ , where Conv k1n32s1, Leaky ReLU, and Conv k1n64s1 are connected in sequence ,
- the input of Conv k1n32s1 is the auxiliary feature map
- the dot multiplication operation ⁇ refers to the point multiplication of the output of the previous layer with the output ⁇ of Conv k1n64s1
- the superposition operation ⁇ refers to the result of the dot multiplication operation and the output ⁇ of Conv k1n64s1 Overlay.
- the fusion unit includes a convolutional layer and an activation layer, and is used to perform feature fusion on the noise feature map output by the feature extraction unit and the modulation feature map output by the modulation unit, and output a denoising feature map.
- the fusion unit includes Conv k3n64s1, ReLU, Conv k3n3s1, and ReLU connected in sequence.
- Critic Net is a network composed of convolutional layer, BN, activation layer and fully connected layer.
- the discrimination network Critic Net includes successively connected Conv, Leaky ReLU, multiple consecutive extraction units, fully connected layer Dense(100), Leaky ReLU, and fully connected layer Dense(1) , Where the extraction unit includes consecutive Conv, BN, and Leaky ReLU, and 100 in the fully connected layer Dense (100) indicates that the output dimension is 100.
- the training sample set is used to conduct confrontation training on the generative confrontation network, and the network parameters of the generation confrontation network are optimized.
- the role of the denoising network Denoising Net is to denoise the noise rendering image and generate the denoising rendering image.
- the purpose is to make the discrimination network Critic Net unable to distinguish the denoising rendering image and the target rendering image; and the role of the discrimination network CriticNet is to distinguish as much as possible
- the entire training is based on the confrontation process to make denoising
- the capabilities of the network DenoisingNet and the discrimination network CriticNet have been improved at the same time.
- the denoising network Denoising Net determined by the parameters is extracted as the Monte Carlo rendering image denoising model.
- the Monte Carlo rendering image denoising model can realize the denoising of noisy Monte Carlo rendering images. While achieving good denoising effects on low-frequency details, it can also significantly improve the retention of high-frequency details to gain Visually more realistic rendering.
- the generative confrontation network constructed above can also be trained by changing the training samples to obtain a Monte Carlo rendering image denoising model capable of processing other input images.
- Monte Carlo rendering is an improvement of traditional reverse ray tracing. It is mainly based on the principle of ray tracing. Therefore, when rendering, according to the material difference at the intersection point of the first ray and object of path tracing, the rendering engine can be changed.
- the rendering process is split into a diffuse path rendering process and a specular path rendering process.
- the diffuse path rendering process and the specular path rendering process are used to render separately, and you can get the Monte Carlo rendering P d and the Monte Carlo rendering P with noise. s .
- FIG P d Monte Carlo denoising denoising model renderings of Monte Carlo M d and P s rendering denoising rendering to Monte Carlo Noise model M s .
- the rendering process using rendering diffuse path obtained Monte Carlo rendering P d P d rendered as the noise (i.e. noisy Diffuse), P d renderings of noise, noise generated when the assist features renderings P d (Auxiliary feature), and the target rendering map corresponding to the noise rendering map P d as the training sample to conduct confrontation training on the above-mentioned generative confrontation network.
- confrontation training is completed, extract the denoising network Denoising Net and the auxiliary feature fusion network Encoder Net as Monte Carlo Rendered image denoising model M d .
- the Monte Carlo rendering image P s rendered by the specular path rendering process is used as the noise rendering image P s (that is, noisysy Specular), the noise rendering image P s , the auxiliary features when generating the noise rendering image P s , and the noise rendering image
- the target rendering corresponding to P s is used as the training sample to conduct confrontation training on the above-mentioned generative confrontation network.
- the denoising network Denoising Net and the auxiliary feature fusion network Encoder Net are extracted as the Monte Carlo rendering denoising model M s .
- Another embodiment provides a Monte Carlo rendering image denoising method, as shown in Figure 3, including the following steps:
- the rendering process of the rendering engine is split into a diffuse path rendering process and a specular path rendering process
- the auxiliary feature Auxiliary Feature corresponding to the Monte Carlo rendering image P d and the Monte Carlo rendering image P s includes, but is not limited to, the normal map Normal Buffer, the depth map Depth Buffer, and the material texture map Albedo Buffer.
- the Monte Carlo rendering image denoising model M d and the Monte Carlo rendering image denoising model M s are constructed and obtained according to the above-mentioned construction method, and will not be repeated here.
- This denoising method uses the Monte Carlo rendering image denoising models M d and M s to achieve rendering effects that can only be achieved by using a lower sampling rate to achieve a high sampling rate, and the denoising time is only on the order of one second. , Far less than the rendering time required for multi-sampling (on the order of hundreds to thousands of seconds), greatly saving rendering time and computing costs, thereby reducing the use of servers, reducing the industry cost of the entire rendering service, and saving resources.
- Another embodiment provides a denoising device for Monte Carlo rendering, including a computer memory, a computer processor, and a computer program stored in the computer memory and executable on the computer processor.
- the above-mentioned Monte Carlo rendering image denoising model M s and the Monte Carlo rendering image denoising model M d are stored in the computer memory;
- the rendering process of the rendering engine is split into the diffuse path rendering process and the specular path rendering process;
- the diffuse path rendering process and the specular path rendering process are used for rendering respectively to obtain low sampling rate Monte Carlo rendering images P d and Monte Carlo rendering images P s , and generating Monte Carlo rendering images P d and Monte Carlo rendering images at the same time Auxiliary features corresponding to P s;
- the denoising rendering image P d 'and the denoising rendering image P s ' are merged to obtain the final denoising rendering image.
- the denoising device uses the Monte Carlo rendering image denoising models M d and M s , which can achieve rendering effects that can only be achieved by using a lower sampling rate to achieve a high sampling rate, and the denoising time is only on the order of one second. , Far less than the rendering time required for multi-sampling (on the order of hundreds to thousands of seconds), greatly saving rendering time and computing costs, thereby reducing the use of servers, reducing the industry cost of the entire rendering service, and saving resources.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Evolutionary Computation (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Computing Systems (AREA)
- Artificial Intelligence (AREA)
- Multimedia (AREA)
- General Health & Medical Sciences (AREA)
- Health & Medical Sciences (AREA)
- Software Systems (AREA)
- Medical Informatics (AREA)
- Databases & Information Systems (AREA)
- Molecular Biology (AREA)
- Mathematical Physics (AREA)
- General Engineering & Computer Science (AREA)
- Life Sciences & Earth Sciences (AREA)
- Biomedical Technology (AREA)
- Computational Linguistics (AREA)
- Biophysics (AREA)
- Data Mining & Analysis (AREA)
- Image Processing (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
Description
Claims (8)
- 一种基于生成式对抗网络的蒙特卡洛渲染图去噪模型的构建方法,包括以下步骤:获取含有噪声的蒙特卡洛渲染图作为噪声渲染图,获取生成噪声渲染图时的辅助特征,以噪声渲染图和对应的辅助特征,以及噪声渲染图对应的目标渲染图作为一个训练样本;构建生成式对抗网络,所述生成式对抗网络包括去噪网络和判别网络,其中,所述去噪网络用于输入的噪声渲染图和辅助特征进行去噪,输出去噪渲染图,所述判别网络用于对输入的去噪渲染图和噪声渲染图对应的目标渲染图进行分类,输出分类结果;利用训练样本对所述生成式对抗网络的网络参数进行调优,调优结束后,以网络参数确定的去噪网络作为蒙特卡洛渲染图去噪模型。
- 如权利要求1所述的基于生成式对抗网络的蒙特卡洛渲染图去噪模型的构建方法,其特征在于,所述去噪网络包括:辅助图特征提取子网络,该辅助图特征提取子网络为包括至少一个卷积层的卷积神经网络,用于对输入的辅助特征进行融合,输出辅助特征图;渲染图特征提取子网络,该渲染图特征提取子网络为包括至少一个卷积层的卷积神经网络,用于提取噪声渲染图的特征,输出噪声特征图;特征融合子网络,该特征融合子网络为采用残差思想,利用卷积层对辅助特征图和噪声特征图进行融合提取的神经网络。
- 如权利要求2所述的基于生成式对抗网络的蒙特卡洛渲染图去噪模型的构建方法,其特征在于,所述特征融合子网络包括:特征融合单元,该特征融合单元用于对辅助特征图和噪声特征图进行 结合,输出调制特征图,具体包括依次连接的多个辅助特征调制模块CFM ResBlock、辅助特征调制节CFM以及卷积层,其中,辅助特征调制模块CFM Block和辅助特征调制节CFM的输入为辅助特征图和上一层的输出,第一个辅助特征调制模块CFM ResBlock的输入为噪声特征图和辅助特征图,卷积层的输入为辅助特征调制节CFM的输出,输出为调制特征图;输出单元,该输出单元用于对特征提取单元输出的噪声特征图和调制单元输出的调制特征图进行特征融合,即输入为噪声特征图和调制特征图叠加后的特征图,输出为去噪渲染图。
- 如权利要求1所述的基于生成式对抗网络的蒙特卡洛渲染图去噪模型的构建方法,其特征在于,所述判别网络为卷积层、BN、激活层以及全连接层组成的网络。
- 一种基于生成式对抗网络的蒙特卡洛渲染图去噪模型,其特征在于,所述蒙特卡洛渲染图去噪模型通过权利要求1~4任一项所述的构建方法构建获得。
- 如权利要求5所示的基于生成式对抗网络的蒙特卡洛渲染图去噪模型,其特征在于,所述蒙特卡洛渲染图去噪模型为蒙特卡洛渲染图去噪模型M d,其为利用diffuse路径渲染流程渲染得到的蒙特卡洛渲染图P d、生成该蒙特卡洛渲染图P d时的辅助特征,以及蒙特卡洛渲染图P d对应的目标渲染图作为训练样本训练得到;所述蒙特卡洛渲染图去噪模型为蒙特卡洛渲染图去噪模型M s,其为利用specular路径渲染流程渲染得到的蒙特卡洛渲染图P s、生成该蒙特卡洛渲染图P s时的辅助特征,以及蒙特卡洛渲染图P s对应的目标渲染图作为训练样本训练得到。
- 一种蒙特卡洛渲染图的去噪方法,包括以下步骤:根据路径追踪第一次光线和物体相交交点处的材质区别,将渲染引擎的渲染流程拆分为diffuse路径渲染流程和specular路径渲染流程;分别利用diffuse路径渲染流程和specular路径渲染流程进行渲染,得到含有噪声的蒙特卡洛渲染图P d和蒙特卡洛渲染图P s,同时生成蒙特卡洛渲染图P d和蒙特卡洛渲染图P s对应的辅助特征;将蒙特卡洛渲染图P d以及对应的辅助特征输入至权利要求6所述的蒙特卡洛渲染图去噪模型M d中,获得去噪渲染图P d’;将蒙特卡洛渲染图P s以及对应的辅助特征输入至权利要求6所述的蒙特卡洛渲染图去噪模型M s中,获得去噪渲染图P s’;融合去噪渲染图P d’和去噪渲染图P s’,得到最终去噪渲染图。
- 一种对蒙特卡洛渲染图的去噪装置,包括计算机存储器、计算机处理器以及存储在所述计算机存储器中并可在所述计算机处理器上执行的计算机程序,其特征在于,所述计算机存储器中存有权利要求6所述的蒙特卡洛渲染图去噪模型M s和蒙特卡洛渲染图去噪模型M d;所述计算机处理器执行所述计算机程序时实现以下步骤:根据路径追踪第一次光线和物体相交交点处的材质区别,将渲染引擎的渲染流程拆分为diffuse路径渲染流程和specular路径渲染流程;分别利用diffuse路径渲染流程和specular路径渲染流程进行渲染,得到低采样率的蒙特卡洛渲染图P d和蒙特卡洛渲染图P s,同时生成蒙特卡洛渲染图P d和蒙特卡洛渲染图P s对应的辅助特征;调用蒙特卡洛渲染图去噪模型M d对将蒙特卡洛渲染图P d以及对应的辅助特征进行去噪,获得去噪渲染图P d’;调用蒙特卡洛渲染图去噪模型M s对蒙特卡洛渲染图P s以及对应的辅助特征进行去噪,获得去噪渲染图P s’;融合去噪渲染图P d’和去噪渲染图P s’,得到最终去噪渲染图。
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US17/631,397 US20220335574A1 (en) | 2019-09-17 | 2020-06-05 | A monte carlo rendering image denoising model, method and device based on generative adversarial network |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910876687.8 | 2019-09-17 | ||
CN201910876687.8A CN110728636A (zh) | 2019-09-17 | 2019-09-17 | 一种基于生成式对抗网络的蒙特卡洛渲染图去噪模型、方法及装置 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2021051893A1 true WO2021051893A1 (zh) | 2021-03-25 |
Family
ID=69219064
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2020/094759 WO2021051893A1 (zh) | 2019-09-17 | 2020-06-05 | 一种基于生成式对抗网络的蒙特卡洛渲染图去噪模型、方法及装置 |
Country Status (3)
Country | Link |
---|---|
US (1) | US20220335574A1 (zh) |
CN (1) | CN110728636A (zh) |
WO (1) | WO2021051893A1 (zh) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113436111A (zh) * | 2021-07-21 | 2021-09-24 | 西北工业大学 | 一种基于网络结构搜索的高光谱遥感图像去噪方法 |
CN114742931A (zh) * | 2022-04-28 | 2022-07-12 | 北京字跳网络技术有限公司 | 渲染图像的方法、装置、电子设备及存储介质 |
CN118115634A (zh) * | 2024-03-18 | 2024-05-31 | 海南渔人映画文化传媒有限公司 | 一种数字动画渲染方法 |
Families Citing this family (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110728636A (zh) * | 2019-09-17 | 2020-01-24 | 杭州群核信息技术有限公司 | 一种基于生成式对抗网络的蒙特卡洛渲染图去噪模型、方法及装置 |
US11887279B2 (en) * | 2020-08-25 | 2024-01-30 | Sharif University Of Technology | Machine learning-based denoising of an image |
CN113628126B (zh) * | 2021-06-29 | 2022-03-01 | 光线云(杭州)科技有限公司 | 基于重要度特征图共享的实时蒙特卡洛路径追踪降噪方法、装置和计算机设备 |
US20230035541A1 (en) * | 2021-07-28 | 2023-02-02 | Oracle International Corporation | Optimizing a prognostic-surveillance system to achieve a user-selectable functional objective |
US20230169176A1 (en) * | 2021-11-28 | 2023-06-01 | International Business Machines Corporation | Graph exploration framework for adversarial example generation |
CN114331895A (zh) * | 2021-12-30 | 2022-04-12 | 电子科技大学 | 一种基于生成对抗网络的蒙特卡罗渲染图去噪方法 |
CN115983352B (zh) * | 2023-02-14 | 2023-06-16 | 北京科技大学 | 一种基于辐射场和生成对抗网络的数据生成方法及装置 |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109859147A (zh) * | 2019-03-01 | 2019-06-07 | 武汉大学 | 一种基于生成对抗网络噪声建模的真实图像去噪方法 |
CN110148088A (zh) * | 2018-03-14 | 2019-08-20 | 北京邮电大学 | 图像处理方法、图像去雨方法、装置、终端及介质 |
CN110223254A (zh) * | 2019-06-10 | 2019-09-10 | 大连民族大学 | 一种基于对抗生成网络的图像去噪方法 |
CN110728636A (zh) * | 2019-09-17 | 2020-01-24 | 杭州群核信息技术有限公司 | 一种基于生成式对抗网络的蒙特卡洛渲染图去噪模型、方法及装置 |
Family Cites Families (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR101907081B1 (ko) * | 2011-08-22 | 2018-10-11 | 삼성전자주식회사 | 3차원 점군의 물체 분리 방법 |
US20160321523A1 (en) * | 2015-04-30 | 2016-11-03 | The Regents Of The University Of California | Using machine learning to filter monte carlo noise from images |
US10572979B2 (en) * | 2017-04-06 | 2020-02-25 | Pixar | Denoising Monte Carlo renderings using machine learning with importance sampling |
US10475165B2 (en) * | 2017-04-06 | 2019-11-12 | Disney Enterprises, Inc. | Kernel-predicting convolutional neural networks for denoising |
US11557022B2 (en) * | 2017-07-27 | 2023-01-17 | Nvidia Corporation | Neural network system with temporal feedback for denoising of rendered sequences |
CN108765319B (zh) * | 2018-05-09 | 2020-08-14 | 大连理工大学 | 一种基于生成对抗网络的图像去噪方法 |
CN109740283A (zh) * | 2019-01-17 | 2019-05-10 | 清华大学 | 自主多智能体对抗仿真方法及系统 |
CN109872288B (zh) * | 2019-01-31 | 2023-05-23 | 深圳大学 | 用于图像去噪的网络训练方法、装置、终端及存储介质 |
-
2019
- 2019-09-17 CN CN201910876687.8A patent/CN110728636A/zh active Pending
-
2020
- 2020-06-05 WO PCT/CN2020/094759 patent/WO2021051893A1/zh active Application Filing
- 2020-06-05 US US17/631,397 patent/US20220335574A1/en active Pending
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110148088A (zh) * | 2018-03-14 | 2019-08-20 | 北京邮电大学 | 图像处理方法、图像去雨方法、装置、终端及介质 |
CN109859147A (zh) * | 2019-03-01 | 2019-06-07 | 武汉大学 | 一种基于生成对抗网络噪声建模的真实图像去噪方法 |
CN110223254A (zh) * | 2019-06-10 | 2019-09-10 | 大连民族大学 | 一种基于对抗生成网络的图像去噪方法 |
CN110728636A (zh) * | 2019-09-17 | 2020-01-24 | 杭州群核信息技术有限公司 | 一种基于生成式对抗网络的蒙特卡洛渲染图去噪模型、方法及装置 |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113436111A (zh) * | 2021-07-21 | 2021-09-24 | 西北工业大学 | 一种基于网络结构搜索的高光谱遥感图像去噪方法 |
CN113436111B (zh) * | 2021-07-21 | 2024-01-09 | 西北工业大学 | 一种基于网络结构搜索的高光谱遥感图像去噪方法 |
CN114742931A (zh) * | 2022-04-28 | 2022-07-12 | 北京字跳网络技术有限公司 | 渲染图像的方法、装置、电子设备及存储介质 |
CN118115634A (zh) * | 2024-03-18 | 2024-05-31 | 海南渔人映画文化传媒有限公司 | 一种数字动画渲染方法 |
Also Published As
Publication number | Publication date |
---|---|
CN110728636A (zh) | 2020-01-24 |
US20220335574A1 (en) | 2022-10-20 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2021051893A1 (zh) | 一种基于生成式对抗网络的蒙特卡洛渲染图去噪模型、方法及装置 | |
CN108198154B (zh) | 图像去噪方法、装置、设备及存储介质 | |
JP6961139B2 (ja) | 知覚的な縮小方法を用いて画像を縮小するための画像処理システム | |
CN109214990A (zh) | 一种基于Inception模型的深度卷积神经网络图像去噪方法 | |
US8285076B2 (en) | Methods and apparatus for visual sub-band decomposition of signals | |
KR20200132682A (ko) | 이미지 최적화 방법, 장치, 디바이스 및 저장 매체 | |
Ali et al. | Comparametric image compositing: Computationally efficient high dynamic range imaging | |
Liu et al. | Learning hadamard-product-propagation for image dehazing and beyond | |
CN111612891A (zh) | 模型生成方法、点云数据处理方法、装置、设备及介质 | |
Bachl et al. | City-GAN: Learning architectural styles using a custom Conditional GAN architecture | |
JP2009508234A (ja) | 2d/3d結合レンダリング | |
CN116645305A (zh) | 基于多注意力机制与Retinex的低光照图像增强方法 | |
Khan et al. | A deep hybrid few shot divide and glow method for ill-light image enhancement | |
CN115018968A (zh) | 图像渲染方法、装置、存储介质及电子设备 | |
DE102022100517A1 (de) | Verwenden von intrinsischen funktionen zum schattenentrauschen in raytracinganwendungen | |
CN106709888A (zh) | 一种基于人眼视觉模型的高动态范围图像产生方法 | |
Panetta et al. | Novel multi-color transfer algorithms and quality measure | |
Xu et al. | Artistic color virtual reality implementation based on similarity image restoration | |
CN112383366B (zh) | 一种数字荧光频谱的频谱监测方法、装置及存储介质 | |
Tang et al. | Feature comparison and analysis for new challenging research fields of image quality assessment | |
Titarenko et al. | Study of the ability of neural networks to extract and use semantic information when they are trained to reconstruct noisy images | |
CN113223128B (zh) | 用于生成图像的方法和装置 | |
Bae et al. | Non-iterative tone mapping with high efficiency and robustness | |
Zhang et al. | Fast Mesh Reconstruction from Single View Based on GCN and Topology Modification. | |
Fan et al. | Bidirectional image denoising with blurred image feature |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 20864376 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 20864376 Country of ref document: EP Kind code of ref document: A1 |
|
32PN | Ep: public notification in the ep bulletin as address of the adressee cannot be established |
Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 28.09.2022) |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 20864376 Country of ref document: EP Kind code of ref document: A1 |