JP7171087B2

JP7171087B2 - A mesh denoising method based on graph convolutional networks

Info

Publication number: JP7171087B2
Application number: JP2021107136A
Authority: JP
Inventors: 越凡沈; 友怡 ▲鄭▼
Original assignee: Zhejiang University ZJU
Current assignee: Zhejiang University ZJU
Priority date: 2020-09-09
Filing date: 2021-06-28
Publication date: 2022-11-15
Anticipated expiration: 2041-06-28
Also published as: CN112200733B; CN112200733A; JP2022045893A; LU500415B1

Description

本発明はコンピュータグラフィックスの分野に属し、特にグラフ畳み込みネットワークに基づくメッシュ雑音除去方法に関する。 The present invention belongs to the field of computer graphics, and more particularly to mesh denoising methods based on graph convolutional networks.

頂点位置又は面法線は本質的に３Ｄ信号であるため、メッシュ表面で雑音除去を行うタスクは２Ｄ画像に類似する。したがって、メッシュ雑音除去技術は画像における雑音除去技術に大きな示唆を受け、現在様々な低域通過及び特徴を保持するフィルタを導入してメッシュ雑音除去を行う。ここで、バイラテラルフィルタは最も広く適用されるフィルタの一つであるが、フィルタに基づく方法の一般的な欠点は、特徴が雑音により深刻に破壊されると、それら（特に弱い特徴）がこれらの方法で回復することが困難であることである。もう一つの方法は最適化されたメッシュ雑音除去方法に基づくことである。しかし仮定したメッシュのみを適用し、かつ異なる幾何学的特徴を有するメッシュに対して騒音モードをうまく要約することができない。 Since vertex positions or surface normals are inherently 3D signals, the task of denoising on a mesh surface is similar to 2D images. Therefore, mesh denoising techniques have been heavily inspired by denoising techniques in images, and currently various low-pass and feature-preserving filters are introduced to perform mesh denoising. Here, bilateral filters are one of the most widely applied filters, but a general drawback of filter-based methods is that if features (especially weak features) are severely corrupted by noise, they (especially weak features) will be is difficult to recover in any way. Another method is based on optimized mesh denoising method. However, it applies only hypothetical meshes and fails to summarize noise modes well for meshes with different geometric features.

これに対して、学習に基づく方法は基本的な特徴又は雑音モードに特定の仮説を与えず、かつ画像雑音除去に成功して適用される。しかし、画像とは異なり、３Ｄメッシュは一般的に不規則であるため、画像に基づく畳み込み演算は直接的に適用することができない。この問題を解決するために、本発明者らは新しい方法を提供し、不規則なメッシュブロックデータを直接グラフ畳み込みネットワークに入力する。パターン畳み込み演算により局部表面幾何学的図形に対して図形に基づく表示を行い、本発明者らのネットワークは他の従来の方法より雑音下源モデルの固有幾何学的特徴をよりよく捕捉することができる。 In contrast, learning-based methods do not give specific hypotheses to the underlying features or noise modes and are successfully applied to image denoising. However, unlike images, 3D meshes are generally irregular, so image-based convolution operations cannot be applied directly. To solve this problem, we present a new method to input irregular mesh block data directly into the graph convolutional network. The pattern convolution operation provides a graphical representation for the local surface geometry, and our network is better able to capture the intrinsic geometric features of the noisy source model than other conventional methods. can.

グラフ畳み込みネットワーク（ＧＣＮ）はユークリッド構造を処理するために導入される。ＧＣＮの早期動作は静的グラフ構造を必要とするため、変化トポロジを有するメッシュに拡張することができない。動的グラフ畳み込みの最新の研究によると、可変エッジはより良好に表現することができる。本発明者らの方法はブロックにおける静的グラフ構造及び畳み込み期間構造の動的グラフ構造を利用してブロックにおける幾何学的特徴を効果的に学習する。また他のいくつかはメッシュ開発の畳み込み操作であり、主に対象全体又は大型シーンを理解するために用いられ、かつ非常に深いネットワーク構造を必要とする。雑音除去作業はより局所ブロックに注目し、メッシュ面の双対空間に畳み込みを導入する。 A graph convolutional network (GCN) is introduced to process the Euclidean structure. Early operations of GCN require a static graph structure and thus cannot be extended to meshes with varying topologies. According to the latest research in dynamic graph convolution, variable edges can be represented better. Our method utilizes the static graph structure in the block and the dynamic graph structure of the convolutional term structure to effectively learn the geometric features in the block. Some others are mesh development convolution operations, which are mainly used for understanding whole objects or large scenes and require very deep network structures. The denoising operation focuses more on local blocks and introduces convolutions in the dual space of the mesh planes.

本発明はＧＣＮネットワークに基づくメッシュ雑音除去方法を提供し、メッシュ面の双対空間に回転不変のグラフ表示を使用することにより、グラフ畳み込みにより効果的な特徴学習を実現する。同時に本発明のアーキテクチャにおける静的及び動的なグラフ畳み込み演算が直列接続されることにより、隣接ノードの間の有効明示的な構造特徴及び潜在的な暗黙的特徴を学習する。 The present invention provides a mesh denoising method based on GCN networks, and uses rotation-invariant graph representations in the dual space of mesh faces to achieve efficient feature learning through graph convolution. At the same time, the static and dynamic graph convolution operations in our architecture are serially connected to learn effective explicit structural features and potential implicit features between adjacent nodes.

本発明は以下の技術的解決手段により実現される。 The present invention is realized by the following technical solutions.

グラフ畳み込みネットワークに基づくメッシュ雑音除去方法であって、以下のステップ１とステップ２を含み、
ステップ１では、雑音メッシュ内の各面に局所ブロックを生成しかつテンソル投票アルゴリズムを採用して局部ブロックを回転整列し、
ステップ２では、ステップ１で整列された局所ブロックをグラフに変換して示させ、トレーニングされたグラフ畳み込みニューラルネットワークに入力し、雑音のない法線方向を予測し、さらに予測された法線方向に基づいてメッシュモデルの頂点を更新し、雑音除去後のモデルを取得し、ここで、前記畳み込みニューラルネットワーク構造はＬ_ｅ層ＥｄｇｅＣｏｎｖ、Ｌ_ｄ層動的ＥｄｇｅＣｏｎｖ及びＬ_１層全結合（ＦＣ）層で構成される。 A mesh denoising method based on graph convolutional networks, comprising the following steps 1 and 2:
In step 1, generate a local block for each face in the noise mesh and employ a tensor voting algorithm to rotationally align the local block;
In step 2, the local blocks aligned in step 1 are transformed into a graph and shown, input to a trained graph convolutional neural network, noise-free normal directions are predicted, and the predicted normal directions are update the vertices of the mesh model based on to obtain the denoised model, wherein the convolutional neural network structure is L _e -layer EdgeConv, L _d -layer dynamic EdgeConv and L ₁ -layer fully connected (FC) layer Configured.

さらに、前記ステップ１が以下のサブステップ１．１～サブステップ１．３により実施され、
サブステップ１．１では、選択されたパッチ

に対して、その領域の面積の一定の割合に応じてバウンディングスフィアを画定し、バウンディングスフィア内の全てのパッチを該ブロック

中の面とし、
サブステップ１．２では、該ブロックにおける全ての面

に対する投票テンソル

を定義し、かつ特徴値及び単位特徴ベクトルを取得し、
サブステップ１．３では、サブステップ１．２で得られた特徴ベクトルに基づいて一つの回転行列Ｒ_ｉを構築し、かつ

中の各小面の重心及び法線をＲ_ｉ ^－１に乗算して新たなブロックデータ

を生成する。 Further, said step 1 is performed by the following substeps 1.1 to 1.3,
In substep 1.1 the selected patch

, define a bounding sphere according to a certain percentage of the area of that region, and all patches within the bounding sphere are added to the block

as the inside surface,
In sub-step 1.2 all faces in the block

vote tensor for

and obtain feature values and unit feature vectors,
in substep 1.3, constructing one rotation matrix R _i based on the feature vectors obtained in substep 1.2; and

Multiply R _i ⁻¹ by the centroid and normal of each facet in the new block data

to generate

さらに、前記ステップ２が以下のサブステップ２．１～サブステップ２．４によりされ、
サブステップ２．１では、入力された画像に対して静的ＥｄｇｅＣｏｎｖによる静的辺畳み込み処理を反復し、隣接面特徴を取得し、
サブステップ２．２では、サブステップ２．１で得られた結果に対して動的ＥｄｇｅＣｏｎｖによる動的辺畳み込み処理を反復し、特徴空間における最も隣接する特徴面を取得し、
サブステップ２．３では、グラフ畳み込みの後、学習された特徴を一緒に接続し、全結合層を介して特徴をまとめ、
サブステップ２．４では、対称性プール化により最も主要な特徴を選択して法線方向を最終的に予測する。 Further, step 2 is performed by the following substeps 2.1 to 2.4,
In sub-step 2.1, iterating the static edge convolution process with static EdgeConv on the input image to obtain adjacent surface features,
in substep 2.2, iterating the dynamic edge convolution process with dynamic EdgeConv on the result obtained in substep 2.1 to obtain the closest feature face in the feature space;
In sub-step 2.3, after graph convolution, connect the learned features together and aggregate the features via a fully connected layer,
In sub-step 2.4, symmetry pooling selects the most dominant features to finally predict the normal direction.

さらに、前記グラフ畳み込みニューラルネットワークのトレーニングデータセットは以下の方法で構築され、
テンソル投票アルゴリズムをデータセット中の無雑音モデルの各面に適用し、三つの特徴値λ_１、λ_２、λ_３を取得し、特徴値に基づいて全てのデータにおける全てのモデルの面を「面」及び「辺」の二組に分け、それぞれブロックデータを収集してトレーニングデータセットを構築する。 Further, a training dataset for said graph convolutional neural network is constructed in the following manner:
A tensor voting algorithm is applied to each face of the noiseless model in the dataset to obtain three feature values λ ₁ , λ ₂ , λ ₃ , and based on the feature values all model faces in all data are The block data is divided into two sets of "face" and "edge", and the block data is collected respectively to construct the training data set.

さらに、複数のカスケード接続されたグラフ畳み込みニューラルネットワークをトレーニングして雑音無しの法線方向を予測し、トレーニングするとき、カスケード接続されたネットワークの損失が低下しなくなるまで、前段のグラフ畳み込みニューラルネットワークに予測された法線方向を利用して雑音除去後のメッシュモデルを構築して新たなデータを生成することで次の段のグラフ畳み込みニューラルネットワークをトレーニングする。 Furthermore, when training multiple cascaded graph convolutional neural networks to predict noise-free normal directions, training the preceding graph convolutional neural networks until the loss of the cascaded networks no longer drops. The predicted normal direction is used to construct the denoised mesh model and generate new data to train the next stage graph convolutional neural network.

さらに、最終段のグラフ畳み込みニューラルネットワークにおいて、メッシュ上のバイラテラルフィルタ対の法線ベクトルにより反復最適化を行い、反復するたびに頂点を更新し、最終的に雑音除去後のメッシュモデルを取得する。 Furthermore, in the final-stage graph convolutional neural network, iterative optimization is performed using the normal vector of the bilateral filter pair on the mesh, the vertices are updated at each iteration, and finally the mesh model after noise removal is obtained. .

本発明の突出寄与は以下のとおりである。 The salient contributions of the present invention are as follows.

本発明はＧＣＮ－Ｄｅｎｏｉｓｅｒ、グラフ畳み込みネットワーク（ＧＣＮ）に基づく特徴保留のネットワーク雑音除去方法を提供する。従来の人為的な構造的特徴学習又はボクセルに基づく特徴学習に基づく学習に基づくメッシュの雑音低減方法とは異なり、本発明は三角メッシュ自体の構造を探索し、かつ図に示すように導入し、次に三角形の双対空間にグラフ畳み込み演算を導入する。本発明はこのように幾何学的特徴を自然にキャプチャすることができる図形表示形式を示し、同時にトレーニング段階及び推定段階に対していずれも軽量である。効果的な特徴学習を促進するために、このようなネットワークは静的及び動的エッジの畳み込みを同時に利用し、これにより明示的なメッシュ構造と未接続の隣接との間の潜在的な連絡から情報を学習することができる。未知雑音関数をよりよく推定するために、本発明は複数のＧＣＮのカスケード最適化例を導入し、面の雑音レス法線方向を段階的に推定する。本発明は複数の雑音データセットにおいて最良の結果を実現し、明確な特徴を含むＣＡＤモデルと異なる装置からキャプチャされた真の雑音を有する元の走査モデルを含む。本発明の方法は効果と効率との間の良好なバランスを達成すると同時に、現在最も好ましい結果を実現する。 The present invention provides a GCN-Denoiser, a feature-reserving network denoising method based on graph convolutional networks (GCN). Unlike conventional artificial structural feature learning or voxel-based feature learning-based learning-based mesh noise reduction methods, the present invention explores the structure of the triangular mesh itself and introduces, as shown, Next, we introduce a graph convolution operation in the dual space of triangles. The present invention thus presents a graphical representation format that can naturally capture geometric features, while at the same time being lightweight for both the training and estimation stages. In order to facilitate effective feature learning, such networks utilize static and dynamic edge convolutions simultaneously, thereby extracting from implicit connections between explicit mesh structures and unconnected neighbors. Information can be learned. To better estimate the unknown noise function, the present invention introduces a cascaded optimization example of multiple GCNs to estimate the noiseless normal direction of the surface step by step. The present invention achieves best results on multiple noise data sets, including CAD models with well-defined features and original scanned models with true noise captured from different devices. The method of the present invention achieves a good balance between effectiveness and efficiency while achieving the currently most favorable results.

本発明のネットワーク雑音除去のプロセス概略図である。1 is a process schematic diagram of network noise cancellation of the present invention; FIG. 本発明のＧＣＮネットワーク構造である。1 is the GCN network structure of the present invention; は本発明のネットワーク雑音除去効果図である。is the network noise elimination effect diagram of the present invention;

雑音はメッシュモデル表面の複雑な構成であるため、一般的に局所的な方法を用いてそれを推定する。本発明のターゲットはメッシュ三角形の双対領域において各パッチを、その一定の範囲内の雑音でのブロックを利用し、該パッチの元の雑音レス法線方向を予測し、かつ雑音なしのモデルを再構成し、具体的には以下のステップを含む。
ステップ１：雑音メッシュ内の各面に局所ブロックを生成しかつテンソル投票アルゴリズムを採用して局部ブロックを回転整列する。
ステップ２：ステップ１で整列された局所ブロックを図に示すように変換し、トレーニングされたグラフ畳み込みニューラルネットワークに入力し、雑音のない法線方向を予測し、さらに予測された法線方向に基づいてメッシュモデルの頂点を更新し、雑音除去後のモデルを取得する。ここで、前記畳み込みニューラルネットワーク構造はＬ_ｅ層ＥｄｇｅＣｏｎｖ、Ｌ_ｄ層動的ＥｄｇｅＣｏｎｖ及びＬ_１層全結合（ＦＣ）層で構成される。 Since noise is a complex composition of mesh model surfaces, we generally estimate it using local methods. The target of the present invention is to take each patch in the dual domain of mesh triangles, block it with noise within a certain range, predict the original noiseless normal direction of the patch, and reproduce the noiseless model. Specifically, it includes the following steps.
Step 1: Generate a local block for each face in the noise mesh and employ a tensor voting algorithm to rotationally align the local block.
Step 2: The local blocks aligned in step 1 are transformed as shown in the figure, input to a trained graph convolutional neural network to predict noise-free normal directions, and further based on the predicted normal directions update the vertices of the mesh model to get the denoised model. Here, the convolutional neural network structure is composed of an L _e layer EdgeConv, an L _d layer dynamic EdgeConv and an L ₁ fully connected (FC) layer.

図１は本発明における複数のグラフ畳み込みニューラルネットワークカスケードの雑音除去フローを説明する。以下、具体的な実施例を参照して本発明の方法をさらに説明する。 FIG. 1 illustrates the denoising flow of multiple graph convolutional neural network cascades in the present invention. The method of the present invention is further described below with reference to specific examples.

雑音メッシュに対して、まず入力三角形メッシュをＭ＝｛Ｖ、Ｆ｝と定義し、ここでＶ＝｛ｖ_ｋ｝_１Ｎ^ｖは全ての頂点を設定し、Ｆ＝｛ｆ_ｉ｝_１ ^Ｎｆは全ての面を設定する。Ｎ_ｖ及びＮ_ｆは、それぞれ頂点数及び面数である。Ｆにおける各面ｆ_ｉに対して、そのローカルブロックデータｐ_ｉを生成する。Ｍにおける全てのブロックの集合をＰ＝｛ｐ_ｉ｝_１ ^Ｎｆと定義する。同様に、面ｆ_ｉの法線方向をｎ_ｉと表し、その重心をｃ_ｉとして表し、かつその面積をａ_ｉとして表す。 For the noise mesh, first define the input triangular mesh as M={V, F}, where V={v _k } ₁ N ^v sets all vertices and F={f _i } ₁ ^Nf is Set all faces. _Nv and _Nf are the number of vertices and the number of faces, respectively. For each face f _i in F, generate its local block data p _i . Define the set of all blocks in M as P={p _i } ₁ ^Nf . Similarly, the normal direction of the face f _i is denoted by n _i , its centroid by c _i and its area by a _i .

ここで、ブロックｐ_ｉは小平面ｆ_ｉの重心ｃ_ｉ上に位置する半径ｒの球内の全ての小平面（ｆ_ｉを含む）を指し、即ちｐ_ｉは以下を満たすべきである。

Here, block p _i refers to all facets (including f _i ) within a sphere of radius r located on the centroid c _i of facet f _i , i.e. p _i should satisfy:

異なる位置にあるが同様の性質を有するブロックはニューラルネットワークに面倒をもたらし、深層学習方法に対して空間変換を学習しにくいためである。この問題を解決するために、本発明はテンソル投票理論を採用して面を一つの公共座標系に明確に整合し、すなわち当該面中の全ての面投票テンソルを定義し、かつ特徴値及び単位特徴ベクトルを取得し、具体的には以下のとおりである。 This is because blocks with similar properties but in different locations create trouble for neural networks and make it difficult to learn spatial transformations for deep learning methods. To solve this problem, the present invention adopts tensor voting theory to unambiguously align surfaces to one public coordinate system, i.e., define all surface voting tensors in the surface, and define feature values and units Get the feature vector, specifically:

まずｐ_ｉを原点［０、０、０］に変換し、次にそれを単位境界ボックスにスケーリングする。１つの投票テンソルＴ_ｉの面ｆ_ｉに対する定義は以下のとおりである。

First transform pi to the origin [ _0,0,0 ] and then scale it to the unit bounding box. The definition for face f _i of one voting tensor T _i is:

ここでμ_ｊ＝（ａ_ｊ／ａ_ｍ）ｅｘｐ（－｜｜ｃ_ｊ－ｃ_ｉ｜｜／σ）、σはパラメータであり、本実施例では１／３に設定され、ここでａ_ｍはｐ_ｉにおける最大三角形面積であり、かつｎ_ｊ′はｆ_ｊの投票法線ベクトルである。ｎ_ｊ′＝２（ｎ_ｊ・ｗ_ｊ）ｗ_ｊ－ｎ_ｊ、ここでｗ_ｊ＝ｎｏｒｍａｌｉｚｅ｛［（ｃ_ｊ－ｃ_ｉ）×ｎ_ｊ］×（ｃ_ｊ－ｃ_ｉ）｝である。Ｔ_ｉは半正則行列であるため、そのスペクトル分解により以下のように表すことができる。

where μ _j =(a _j /a _m )exp(−||c _j −c _i ||/σ), σ is a parameter, set to ⅓ in this embodiment, where a _m is is the maximum triangle area at p _i and n _j ' is the voting normal vector of f _j . n _j '=2(n _j ·w _j )w _j −n _j , where w _j =normalize{[(c _j −c _i )×n _j ]×(c _j −c _i )}. Since T _i is a semi-regular matrix, it can be expressed by its spectral decomposition as follows.

ここでλ_１≧λ_２≧λ_３はその特徴値であり、ｅ_１、ｅ_２及びｅ_３は対応する単位特徴ベクトルであり、それらは一組の直交基を構成する。 where λ ₁ ≧λ ₂ ≧λ ₃ are its feature values and e ₁ , e ₂ and e ₃ are the corresponding unit feature vectors, which constitute a set of orthogonal groups.

次に、１つの回転行列Ｒ_ｉ＝［ｅ_１、ｅ_２、ｅ_３］を構築し、かつｐ_ｉにおける各小面の重心及び法線方向をＲ_ｉ ^－１に乗算して新たなブロックデータ

を生成する。 Next, construct one rotation matrix R _i =[e ₁ ,e ₂ ,e ₃ ] and multiply R _i ⁻¹ by the centroid and normal direction of each facet at p _i to obtain the new block data

to generate

その後、各生成されたブロックに図構造を確立してそれを本発明者らの後のパターン畳み込みネットワークに適応させる。無向グラフＧ＝（Ｑ、Ｅ、Φ）を確立し、ここでブロック

内の各面ｆに図面上のノードｑ_ｉ∈Ｑ、及び一つの辺ｅ＝（ｑ_ｉ、ｑ_ｊ）∈Ｅを作成して対応する面ｆ_ｉ及びｆ_ｊが隣接する。Φはノード特徴を表し、一組のノード属性を含む。各のφ_ｉ∈Φ、対応面

．

及び

は整列後の平面ｆｉの重心及び法線方向を指す。ｄ_ｉはｆ_ｉの１環近傍における隣接する面の数であり、境界面を区別することに役立つ。 We then establish a diagram structure for each generated block to adapt it to our later pattern convolution network. Establish an undirected graph G = (Q, E, Φ), where block

For each face f in , create a node q _i εQ on the drawing and one edge e=(q _i , q _j )εE to adjoin the corresponding faces f _i and f _j . Φ represents a node feature and contains a set of node attributes. For each φ _i ∈ Φ, the corresponding surface

.

as well as

refers to the centroid and normal direction of the plane fi after alignment. d _i is the number of adjacent faces in one ring neighborhood of f _i and serves to distinguish the boundary faces.

図２に示すように、本発明のＧＣＮネットワークは複数の畳み込み層で構成されることを説明した。（ＭａｒｔｉｎＳｉｍｏｎｏｖｓｋｙａｎｄＮｉｋｏｓＫｏｍｏｄａｋｉｓ．２０１７．Ｄｙｎａｍｉｃｅｄｇｅ－ｃｏｎｄｉｔｉｏｎｅｄｆｉｌｔｅｒｓｉｎｃｏｎｖｏｌｕｔｉｏｎａｌｎｅｕｒａｌｎｅｔｗｏｒｋｓｏｎｇｒａｐｈｓ．Ｉｎ２０１７ＩＥＥＥＣｏｎｆｅｒｅｎｃｅｏｎＣｏｍｐｕｔｅｒＶｉｓｉｏｎａｎｄＰａｔｔｅｒｎＲｅｃｏｇｎｉｔｉｏｎ（ＣＶＰＲ）．２９－３８．）各層において、従来の畳み込みネットワークと類似し、本発明のＧＣＮは各ノードの隣接ノードの特徴をまとめて更新し、畳み込み操作とも呼ばれる。各図は面に一定の連通性を有するが、それらの構造はブロックに非常に異なる。したがって本発明者らはＥＣＣ（Ｅｄｇｅ－ＣｏｎｄｉｔｉｏｎｅｄＣｏｎｖｏｌｕｔｉｏｎ）ポリシーを用いて畳み込みプロセスにおける異なる構造を処理する。Ｇ_ｌ＝（Ｑ_ｌ、Ｅ_ｌ、Φ_ｌ）はグラフ畳み込みにおける第１層であり、φ^ｉはＧ_ｌにおける第_ｉ個のノードの特徴ベクトルである。以下の方式でノード特徴を更新する。

It has been described that the GCN network of the present invention is composed of multiple convolutional layers, as shown in FIG. （ＭａｒｔｉｎＳｉｍｏｎｏｖｓｋｙａｎｄＮｉｋｏｓＫｏｍｏｄａｋｉｓ．２０１７．Ｄｙｎａｍｉｃｅｄｇｅ－ｃｏｎｄｉｔｉｏｎｅｄｆｉｌｔｅｒｓｉｎｃｏｎｖｏｌｕｔｉｏｎａｌｎｅｕｒａｌｎｅｔｗｏｒｋｓｏｎｇｒａｐｈｓ．Ｉｎ２０１７ＩＥＥＥＣｏｎｆｅｒｅｎｃｅｏｎＣｏｍｐｕｔｅｒＶｉｓｉｏｎａｎｄＰａｔｔｅｒｎＲｅｃｏｇｎｉｔｉｏｎ（ＣＶＰＲ）．２９－３８．）各層において、従来の畳み込みネットワークと類似し, the GCN of the present invention collectively updates the features of each node's neighbors, also called a convolution operation. Each figure has a certain degree of connectivity in the plane, but their structure is very different in blocks. Therefore, we use an ECC (Edge-Conditioned Convolution) policy to handle different structures in the convolution process. G _l = (Q _l , E _l , Φ _l ) is the first layer in the graph convolution and φ ⁱ is the feature vector of the _{i th} node in G _l . Update the node features in the following manner.

ここでΨは特徴集合であり、ｈ_Θ ^ｌ＝Ｌｉｎｅａｒ_Θ ^ｌ（Ｆ_ｌ ^ｉ，Ｆ_ｌ ^ｊ－Ｆ_ｌ ^ｉ）である。ネットワーク内の各グラフリールのベース層に同様のＬｉｎｅａｒ_Θがある。 where Ψ is the feature set and h _Θ ^l = Linear _Θ ^l (F _l ⁱ , F _l ^j - F _l ⁱ ). There is a similar Linear _Θ at the base layer of each graph reel in the network.

幾何学的図形から連通性のマッピングは一対一の関数ではないため、元の図形構造のみを利用して畳み込みプロセスにおいていくつかの情報が失われる可能性がある。グラフノードの受信領域を豊富にするために、本発明はさらに畳み込みプロセスにおいて隣接しないグラフノードを接続することを可能にする。このようなグラフィックス変換は、動的ＥｄｇｅＣｏｎｖと呼ばれる。この技術案に対して、各ノードの近隣はいずれもノードのユークリッド距離に基づいてＫＮＮ（本実施例の実現においてＫ＝８である）により動的に計算される。
本発明のネットワークアーキテクチャはＬ_ｅ層の静的ＥｄｇｅＣｏｎｖ、Ｌ_ｄ層の動的ＥｄｇｅＣｏｎｖ及びＬ_１層の全結合（ＦＣ）層で構成される。グラフ畳み込みの各層の後に、学習された特徴を一緒に接続してプール化を行う。本実施例において、平均プール及び最大プールをいずれも対称関数として使用し、それらは最も重要な特徴を選択することができる。最後に、ＦＣ層は３Ｄベクトルを回帰し、すなわち本発明の予測する法線方向である。最後のＦＣ層以外、本発明のアーキテクチャにおける各層はバッチ処理正規化及び活性化関数ＬｅａｋｙＲｅＬＵを有する。 Since the geometry-to-connectivity mapping is not a one-to-one function, some information may be lost in the convolution process using only the original geometry structure. To enrich the receiving area of graph nodes, the present invention also allows connecting non-adjacent graph nodes in the convolution process. Such graphics transforms are called dynamic Edge Conv. For this scheme, each node's neighbors are dynamically computed by KNN (where K=8 in the implementation of this embodiment) based on the node's Euclidean distance.
The network architecture of the present invention consists of a static Edge Conv on the L _e layer, a dynamic Edge Conv on the L _d layer and a fully connected (FC) layer on the L ₁ layer. After each layer of graph convolution, the learned features are connected together for pooling. In this example, both the average pool and the maximum pool are used as symmetry functions, and they can select the most important features. Finally, the FC layer regresses the 3D vector, ie the normal direction we predict. Except for the final FC layer, each layer in our architecture has a batch processing normalization and activation function LeakyReLU.

好ましい解決手段として、カスケード接続されたＧＣＮ（ＧＣＮ_１、…、ＧＣＮ_Ｘ）で雑音無しの法線方向を段階的に回帰する。カスケード最適化における全てのＧＣＮは同じアーキテクチャを有するが、異なる静的ＥｄｇｅＣｏｎｖ、動的ＥｄｇｅＣｏｎｖ及びＦＣ層の数を有する。本実施例において、一つ目のＧＣＮに対して、Ｌｅ＝３、Ｌｄ＝３、かつＬｌ＝４を使用し、残りのＧＣＮに対して、Ｌｅ＝２、Ｌｄ＝２、Ｌｌ＝３を使用する。 A preferred solution is to stepwise regress the noise-free normal direction with cascaded GCNs (GCN ₁ , . . . , GCN _X ). All GCNs in the cascade optimization have the same architecture, but different numbers of static Edge Conv, dynamic Edge Conv and FC layers. In this example, we use Le=3, Ld=3, and Ll=4 for the first GCN, and Le=2, Ld=2, and Ll=3 for the remaining GCNs. do.

毎回の法線方向予測の更新後にいずれも頂点の位置を更新する必要があり、本発明の方法において頂点更新は以下の式に定義することができ、ここで、

には、該頂点の領域はその近傍頂点を含み、該頂点の領域はその隣接頂点の関連付けパッチを含む。

It is necessary to update the vertex position after each normal prediction update, and the vertex update in the method of the present invention can be defined as the following equation, where:

, the region of the vertex contains its neighboring vertices, and the region of the vertex contains the associated patches of its neighboring vertices.

ｋは頂点更新の反復回数を示し、上付きｇは目標法線方向を示し、前のｘ－１段のＧＣＮ更新において、ネットワーク出力の予測値であり、最終段の更新において、最適化後の法線方向を示す。ｅｉｊはパッチにおける二つの頂点の間の辺を示す。 k denotes the number of vertex update iterations, the superscript g denotes the target normal direction, in the previous x−1 stage GCN update, the predicted value of the network output, and in the final stage update, the post-optimization Indicates normal direction. eij indicates an edge between two vertices in the patch.

オフライントレーニングステップにおいて、ＧＣＮ_ｘの出力を用いてトレーニング集合の雑音メッシュに対して雑音除去を行い、次にこれらの更新されたメッシュから新しいデータを生成して次のＧＣＮ_ｘ＋１をトレーニングする。ネットワークの検証セットでの誤差が減少しない場合、カスケードＧＣＮを停止することができる。損失関数はネットワーク出力と基準値との間のＭＳＥであり、すなわちＲ^－１

である。ここで、

は面ｆ_ｉの真の雑音メッシュの法線方向であり、Ｒは上記対応する回転行列である。 In the offline training step, the output of GCN _x is used to denoise the noisy meshes of the training set and then new data is generated from these updated meshes to train the next GCN _x+1 . Cascaded GCN can be stopped if the error on the validation set of the network does not decrease. The loss function is the MSE between the network output and the reference value, i.e. R ⁻¹

is. here,

is the normal direction of the true noise mesh of face f _i and R is the corresponding rotation matrix above.

好ましい解決手段として、各３Ｄモデルに対して、異なるレベル及びタイプの雑音を生成してトレーニングする。テンソル投票アルゴリズムをデータセット中の無雑音モデルの各面に適用し、三つの特徴値λ１、λ２、λ３を取得する。各モデルに対して各面を四組に分け、｛ｆｉ｜λｉ２＜０．０１Λλｉ３＜０．００１｝は平面であり、｛ｆｉ｜λｉ２＞０．０１Λλｉ３＜０．１｝は辺表面であり、｛ｆｉ｜λｉ３＞０．１｝は角表面であり、残りは遷移面である。辺表面及び角表面は他の二種に比べて少なく、さらに二組に分けられ、平面及び遷移面で構成された特徴的な面は、辺表面及び角表面で構成された特徴面である。この二組の中にトレーニングデータとしてブロックを均一に収集する。このポリシーは全てのＧＣＮをトレーニングすることに適用される。 A preferred solution is to generate and train different levels and types of noise for each 3D model. A tensor voting algorithm is applied to each face of the noiseless model in the dataset to obtain three feature values λ1, λ2, λ3. For each model, each face is divided into four sets, {fi|λi 2<0.01Λλi 3<0.001} are planes and {fi|λi 2>0.01Λλi 3<0.1} are edges , {fi|λi 3>0.1} are corner surfaces and the rest are transition surfaces. Side surfaces and corner surfaces are fewer than the other two types, and are further divided into two sets. Characteristic surfaces composed of planes and transition surfaces are characteristic surfaces composed of edge surfaces and corner surfaces. Blocks are uniformly collected as training data in these two sets. This policy applies to training all GCNs.

好ましい解決手段として、一つの入力雑音メッシュの予測法線方向を与え、局所処理による隣接面の間の不連続性を回避するために、バイラテラルフィルタ（ＹｏｕｙｉＺｈｅｎｇ、ＨｏｎｇｂｏＦｕ、ＯｓｃａｒＫｉｎ－ＣｈｕｎｇＡｕ、ａｎｄＣｈｉｅｗ－ＬａｎＴａｉ．２０１１．Ｂｉｌａｌｎｏｒｍａｌｆｉｌｔｅｒｉｎｇｆｏｒｍｕｔｅｎｏｉｓｉｎｇ．ＩＥＥＥＴｒａｎｓｉｓｏｎｓｏｎＶｉｓｕａｌｉｚａｔｉｏｎａｎｄＣｏｍｐｕｔｅｒＧｒａｐｈｉｃｓ１７、１０（２０１１）、１５２１－１５３０．）を適用してＧＣＮ予測の法線方向を微調整することができる。

As a preferred solution, a bilateral filter (Youyi Zheng, Hongbo Fu, Oscar Kin-Chung Au , and Chiew-Lan Tai. 2011. Bilal normal filtering for mutenoising. IEEE Transions on Visualization and Computer Graphics 17, 10 (2011), 1521-1530.) can be applied to fine-tune the normal direction of the GCN prediction. can.

この操作反復はｍ回反復することができるが、最後のカスケードＧＣＮにおける法線出力のみに適用される。ここで

は反復最適化ｍ＋１回の法線方向であり、

はｍ回目の反復により得られた法線方向に基づいて頂点を更新した後のパッチに算出された法線方向である。Ω_ｉは隣接ｆ_ｉの集合であり、Ｗ_ｓとＷ_ｒはそれぞれσｓとσｒコアのガウス関数である。 This manipulation iteration can be repeated m times, but only applies to the normal output in the last cascaded GCN. here

is the normal direction of iterative optimization m+1 times, and

is the normal direction computed for the patch after updating the vertices based on the normal direction obtained by the m-th iteration. Ω _i is the set of neighbors f _i and W _s and W _r are Gaussian functions of σs and σr cores respectively.

図１及び３はそれぞれ本方法のスレーブ装置がキャプチャした実際の雑音の元の走査モデル及び明瞭な特徴を含むＣＡＤモデルの雑音除去結果を示し、最も左が入力された雑音モデルであり、中間が結果であり、最も右が元の雑音レス真値である。図から分かるように、本発明の方法は良好な雑音除去効果を有し、効果と効率との間の良好なバランスを達成することができる。 Figures 1 and 3 show the denoising results of the original scanned model and the CAD model containing the distinct features, respectively, of the actual noise captured by the slave device of the method, the leftmost being the input noise model and the middle Result, rightmost is the original noiseless true value. As can be seen, the method of the present invention has a good denoising effect and can achieve a good balance between effectiveness and efficiency.

Claims

A mesh denoising method based on graph convolutional networks, comprising the following steps 1 and 2:
In step 1, generate a local block for each face in the noise mesh and employ a tensor voting algorithm to rotationally align the local block;
In step 2, the local blocks aligned in step 1 are transformed into a graph and shown, input to a trained graph convolutional neural network, noise-free normal directions are predicted, and the predicted normal directions are update the vertices of the mesh model to obtain the denoised model, wherein the structure of the graph convolutional neural network is L _e -layer EdgeConv, L _d -layer dynamic EdgeConv and L ₁ -layer fully connected (FC) A method characterized by comprising layers.

said step 1 being performed by the following substeps 1.1 to 1.3;
In substep 1.1 the selected patch

, define a bounding sphere according to a constant fraction of the area of the region, and delineate all patches within the bounding sphere into the local block

as the inside surface,
In substep 1.2 all faces in the local block

vote tensor for

The graph convolutional network-based mesh denoising method of claim 1, wherein generating .

said step 2 is performed by the following substeps 2.1 to 2.4,
In sub-step 2.1, iterating the static edge convolution process with static EdgeConv on the input image to obtain adjacent surface features,
in substep 2.2, iterating the dynamic edge convolution process with dynamic EdgeConv on the result obtained in substep 2.1 to obtain the closest feature face in the feature space;
In sub-step 2.3, after graph convolution, connect the learned features together and aggregate the features via a fully connected layer,
The method of mesh denoising based on graph convolutional networks according to claim 1, characterized in that sub-step 2.4 selects the most dominant features by symmetry pooling to finally predict the normal direction.

A training data set for said graph convolutional neural network is constructed in the following manner,
A tensor voting algorithm is applied to each face of the noiseless model in the dataset to obtain three feature values λ ₁ , λ ₂ , λ ₃ , and based on the feature values all model faces in all data are The mesh denoising method based on graph convolutional network according to claim 1, characterized in that it divides into two sets of "faces" and "edges" and collects block data respectively to build a training data set.

When training multiple cascaded graph convolutional neural networks to predict noise-free normal directions and training, until the loss of the cascaded networks no longer drops, the previous graph convolutional neural network predicts The graph convolution according to claim 1, wherein the normal direction is used to build a mesh model after noise removal and generate new data to train a graph convolution neural network in the next stage. A network-based mesh denoising method.

In the final-stage graph convolutional neural network, iterative optimization is performed using the normal vector of the bilateral filter pair on the mesh, the vertices are updated at each iteration, and finally the mesh model after denoising is obtained. Graph convolutional network-based mesh denoising method according to claim 1.