JP2019220114A

JP2019220114A - Signal processor, convolutional neural network, signal processing method, and signal processing program

Info

Publication number: JP2019220114A
Application number: JP2018119299A
Authority: JP
Inventors: 崇之原; Takayuki Hara
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 2018-06-22
Filing date: 2018-06-22
Publication date: 2019-12-26
Anticipated expiration: 2038-06-22
Also published as: JP7031511B2

Abstract

To provide filter processing on a spherical signal with small computation complexity and high approximation precision.SOLUTION: When performing filter processing on a spherical surface signal having a value allocated to a point on a spherical surface, an axial decomposition part expands or contracts the spherical surface signal in one axial direction of polar coordinates while reflecting a distance on the spherical surface to decompose the signal into a plurality of partial signals. A filter processing part performs filter processing on the partial signals. Then an axial integration part performs axial integration processing to expand or contract and integrate the partial signals having been subjected to the filter processing while reflecting the distance on the spherical surface. Consequently, the filter processing on the spherical surface signal etc., with small computational complexity and high approximation precision is provided.SELECTED DRAWING: Figure 5

Description

本発明は、信号処理装置、畳み込みニューラルネットワーク、信号処理方法及び信号処理プログラムに関する。 The present invention relates to a signal processing device, a convolutional neural network, a signal processing method, and a signal processing program.

今日において、上下左右斜め等の全方位の情報を取得して出力又は記録する装置が知られている。方向を単位ベクトルで表現すれば、このような全方位の情報（信号）は、単位球面上に値を割り当てた信号（単位球面から信号値の集合への写像：球面信号）と見なすことができる。球面信号に対するフィルタ処理は、ノイズ除去及び特徴抽出等の用途の他、畳み込みニューラルネットワークの構成要素等の幅広い用途がある。 At present, there is known an apparatus which acquires, outputs, or records information in all directions, such as up, down, left, right, and oblique directions. If the direction is expressed by a unit vector, such omnidirectional information (signal) can be regarded as a signal (a mapping from a unit sphere to a set of signal values: spherical signal) in which values are assigned on a unit sphere. . Filter processing on a spherical signal has a wide range of uses such as components of a convolutional neural network in addition to uses such as noise removal and feature extraction.

しかし、球面信号等に対してフィルタ処理を施す従来の信号処理装置は、近似精度が低いうえ、計算量も多くなる問題があった。 However, the conventional signal processing device that performs a filtering process on a spherical signal or the like has a problem that the approximation accuracy is low and the amount of calculation is large.

本発明は、上述の課題に鑑みてなされたものであり、少ない計算量及び高い近似精度で球面信号等のフィルタ処理を可能とする信号処理装置、畳み込みニューラルネットワーク、信号処理方法及び信号処理プログラムの提供を目的とする。 The present invention has been made in view of the above-described problems, and has been made in consideration of a signal processing apparatus, a convolutional neural network, a signal processing method, and a signal processing program that can perform a filtering process on a spherical signal or the like with a small amount of calculation and high approximation accuracy. For the purpose of providing.

上述した課題を解決し、目的を達成するために、本発明は、球面上の点に値が割り当てられた球面信号に対してフィルタ処理を行う信号処理装置であって、球面信号を極座標の１つの軸方向について、球面上の距離を反映して伸縮し、複数の部分信号に分解する軸分解処理を行う軸分解部と、前記部分信号に対してフィルタ処理を行うフィルタ処理部と、フィルタ処理された部分信号を、球面上の距離を反映して伸縮して統合する軸統合処理を行う軸統合部と、を有する。 In order to solve the above-described problems and achieve the object, the present invention is a signal processing device that performs a filtering process on a spherical signal in which a value is assigned to a point on a spherical surface. An axis disassembly unit that performs an axis disassembly process that expands and contracts by reflecting a distance on a spherical surface and decomposes the signal into a plurality of partial signals in one axial direction; a filter processing unit that performs a filter process on the partial signals; And an axis integration unit that performs an axis integration process of integrating the expanded and contracted partial signals by reflecting and expanding the distance on the spherical surface.

本発明によれば、少ない計算量で、高い近似精度のフィルタ処理を可能とすることができるという効果を奏する。 Advantageous Effects of Invention According to the present invention, there is an effect that it is possible to perform filter processing with high approximation accuracy with a small amount of calculation.

図１は、正距円筒形式を説明するための図である。FIG. 1 is a diagram for explaining the equirectangular cylinder type. 図２は、第１の実施の形態の信号処理装置の構成図である。FIG. 2 is a configuration diagram of the signal processing device according to the first embodiment. 図３は、第１の実施の形態の信号処理装置の信号処理の流れを示すフローチャートである。FIG. 3 is a flowchart illustrating a flow of signal processing of the signal processing device according to the first embodiment. 図４は、第１の実施の形態の信号処理装置における軸分解処理を説明するための図である。FIG. 4 is a diagram for explaining axis disassembly processing in the signal processing device according to the first embodiment. 図５は、軸分解処理、平面フィルタ処理及び軸統合処理の流れを模式的に示した図である。FIG. 5 is a diagram schematically showing the flow of the axis decomposition processing, the plane filter processing, and the axis integration processing. 図６は、第２の実施の形態の信号処理装置における、インテグラルイメージの計算の仕方を説明するための図である。FIG. 6 is a diagram for explaining how to calculate an integral image in the signal processing device according to the second embodiment. 図７は、第２の実施の形態の信号処理装置における軸分解処理を説明するための図である。FIG. 7 is a diagram for explaining axis disassembly processing in the signal processing device according to the second embodiment. 図８は、第２の実施の形態の変形例となる信号処理装置のマルチスケールのフィルタ処理を説明するための図である。FIG. 8 is a diagram illustrating a multi-scale filter process of a signal processing device according to a modification of the second embodiment. 図９は、第３の実施の形態の信号処理装置の構成図である。FIG. 9 is a configuration diagram of a signal processing device according to the third embodiment. 図１０は、第３の実施の形態の信号処理装置の信号処理の流れを示すフローチャートである。FIG. 10 is a flowchart illustrating a flow of signal processing of the signal processing device according to the third embodiment. 図１１は、第３の実施の形態の信号処理装置における、フィルタの垂直分解処理及び水平分解処理を説明するための図である。FIG. 11 is a diagram for describing vertical and horizontal decomposition processing of a filter in the signal processing device according to the third embodiment. 図１２は、第３の実施の形態の信号処理装置における、水平フィルタ及び垂直フィルタを用いた信号処理の流れを示す模式図である。FIG. 12 is a schematic diagram illustrating a flow of signal processing using a horizontal filter and a vertical filter in the signal processing device according to the third embodiment. 図１３は、フィルタの行列Ｇを、複数の垂直方向フィルタ及び水平方向フィルタに分解した例を示す図である。FIG. 13 is a diagram showing an example in which a matrix G of filters is decomposed into a plurality of vertical filters and horizontal filters. 図１４は、第４の実施の形態の信号処理装置における、３次の単位球面に対する一連の処理の流れを示す模式図である。FIG. 14 is a schematic diagram illustrating a flow of a series of processes performed on a third-order unit spherical surface in the signal processing device according to the fourth embodiment. 図１５は、第５の実施の形態の信号処理装置の構成図である。FIG. 15 is a configuration diagram of a signal processing device according to the fifth embodiment.

以下、一例ではあるが、全天球画像処理（全天球カメラ装置）、音場処理、気象解析又は天体物理学等の方向性を持った信号の処理及び解析等に用いることが可能な実施の形態の信号処理装置の説明をする。 Hereinafter, although only one example, implementation that can be used for processing and analysis of directional signals such as omnidirectional image processing (omnidirectional camera device), sound field processing, weather analysis, or astrophysics, etc. The signal processing device according to the embodiment will be described.

（第１の実施の形態）
（概要）
上下左右斜め等の全方向の情報（信号）は、各方向を単位ベクトルで表現すると、単位球面上に値を割り当てた信号（単位球面から信号値の集合への写像）と見なすことができる。以下、各方向の信号を「球面信号」という。離散的な球面信号のフィルタ処理は、信号とフィルタのカーネルの畳み込み（或いは相関）演算である。正方格子上の信号値のフィルタ処理であれば、フィルタを１グリッドずつずらして畳み込み演算を行えば良い。しかし、２次以上の球面にそのようなグリッドを構成することは大変困難である。 (First Embodiment)
(Overview)
Information (signals) in all directions, such as up, down, left, right, and diagonal, can be regarded as signals in which values are assigned to unit spheres (mapping from a unit sphere to a set of signal values) when each direction is represented by a unit vector. Hereinafter, the signal in each direction is referred to as a “spherical signal”. Filtering a discrete spherical signal is a convolution (or correlation) operation of the signal and the kernel of the filter. In the case of filter processing of signal values on a square lattice, convolution operation may be performed by shifting the filter by one grid. However, it is very difficult to construct such a grid on a secondary or higher spherical surface.

すなわち、外接する正多面体との接点に信号点を取ることで、球面上に回転不変なグリッドを構成する。しかし、例えば３次元空間の正多面体は２０面体までしか存在しないため、２次球面では３次元回転に不変な２１点以上のグリッドは構成不可能である。このため、球面上でフィルタ処理を行うには、信号値又はフィルタカーネルを補間するか、或いはこれらに代わる近似処理が必要となる。 That is, a signal point is set at a contact point with a circumscribed regular polyhedron, thereby forming a rotation-invariant grid on the spherical surface. However, for example, a regular polyhedron in a three-dimensional space exists only up to an icosahedron, so that a secondary sphere cannot form a grid of 21 points or more that is invariant to three-dimensional rotation. Therefore, in order to perform a filtering process on a spherical surface, it is necessary to interpolate a signal value or a filter kernel or to perform an approximation process instead of the interpolation.

また、信号値又はフィルタカーネルの補間処理は計算量が多くなる。例えば、信号点数が「Ｎ個」、フィルタサイズが「Ｍ」の場合、最高でＮＭ個の点を補間処理で形成することとなる。これは、信号点数が少ない場合、又は、フィルタサイズが小さい場合以外は、実用的ではないことを意味する。 Further, the interpolation amount of the signal value or the filter kernel requires a large amount of calculation. For example, when the number of signal points is “N” and the filter size is “M”, at most NM points are formed by interpolation processing. This means that it is not practical unless the number of signal points is small or the filter size is small.

ここで、特許文献１（特許第５７３４３２７号公報）、特許文献２（特許第６０６７９３４号公報）、非特許文献１（T.C.Cohen,et al.,「Spherical CNNs」.arXiv:1801.10130,2018.）又は非特許文献２（C.Esteves,et al.,「3D object classification and retrieval with Spherical CNNs.」arXiv:1711.06721,2017）に、実用的な計算量の近似演算手法が開示されている。特許文献１、特許文献２、非特許文献１及び非特許文献２には、一般化されたフーリエ変換処理である球面調和関数及びウィグナー関数を用いたスペクトル領域でのフィルタ処理が開示されている。 Here, Patent Literature 1 (Japanese Patent No. 5734327), Patent Literature 2 (Japanese Patent No. 6067934), Non-Patent Literature 1 (TCCohen, et al., “Spherical CNNs”. ArXiv: 1801.10130, 2018.) or Non-Patent Document 2 (C. Esteves, et al., “3D object classification and retrieval with Spherical CNNs.” ArXiv: 1711.06721, 2017) discloses a practical calculation amount approximation method. Patent Literature 1, Patent Literature 2, Non-Patent Literature 1, and Non-Patent Literature 2 disclose a filtering process in a spectral domain using a spherical harmonic function and a Wigner function, which are generalized Fourier transform processes.

このフィルタ処理では、信号点を球面上で極力等密度となるように配置したうえで、スペクトル領域で信号値及びフィルタカーネルを近似表現してフィルタ演算（乗算）を行う。しかし、スペクトル領域への信号値の変換は、一般化高速フーリエ変換を用いた場合でも、信号点数Ｎに対して計算量はＯ（ＮｌｏｇＮ）となる。なお、「Ｏ」は、ランダウの漸近記法のオー（又はオミクロン）を示す。空間領域でフィルタ処理を行う場合は、カーネルサイズをＭとしてＯ（ＮＭ）であるため、カーネルサイズが小さい時は計算効率が悪い。また、一度スペクトル領域に変換することから、処理対象の信号値と同程度の量のメモリが必要となる。このようなことから、球面調和関数及びウィグナー関数を用いたスペクトル領域でのフィルタ処理は、信号点数の大きい球面信号のフィルタ処理に適用することは適当ではない。 In this filtering process, signal points are arranged so as to have the same density as much as possible on the spherical surface, and then a filter operation (multiplication) is performed by approximating the signal value and the filter kernel in the spectral domain. However, even when the generalized fast Fourier transform is used to convert the signal value to the spectral domain, the calculation amount is O (N log N) for the number N of signal points. Note that “O” indicates Landau's asymptotic notation o (or omicron). When filtering is performed in the spatial domain, since the kernel size is M and O (NM), the calculation efficiency is poor when the kernel size is small. Further, since the data is once converted into the spectral domain, a memory of the same amount as the signal value to be processed is required. For this reason, it is not appropriate to apply the filtering in the spectral domain using the spherical harmonic function and the Wigner function to the filtering of a spherical signal having a large number of signal points.

一方、特許文献３（特開２０１７−２０７９６０号公報）、及び、非特許文献３（W.Boomsma,et al.,「Spherical convolutions and their application in molecular modelling」,Advances in Neural Information Processing Systems,2017.）では、球面信号を平面（正方格子）に射影して、平面上で代替して行うフィルタ処理が開示されている。 On the other hand, Patent Document 3 (Japanese Patent Application Laid-Open No. 2017-207960) and Non-Patent Document 3 (W. Boomsma, et al., "Spherical convolutions and their application in molecular modeling", Advances in Neural Information Processing Systems, 2017. ) Discloses a filtering process in which a spherical signal is projected onto a plane (square lattice) and is alternately performed on the plane.

この平面上で代替して行うフィルタ処理の場合、平面射影した後は信号点の補間処理は必要ないため、畳み込みニューラルネットワークのように多段的にフィルタ処理を行う場合には計算効率が良い。しかし、球面フィルタ処理の近似精度を高めるために射影平面の数を増やすと計算量が増加する。また、射影平面のオーバーラップ部分での処理結果にアーティファクトが生じる。さらに、射影平面以上に大きいフィルタの適用は困難である。 In the case of the filter processing performed in place of this plane, the interpolation processing of the signal points is not required after the plane projection, so that when the filter processing is performed in multiple stages as in a convolutional neural network, the calculation efficiency is high. However, if the number of projection planes is increased in order to increase the approximation accuracy of the spherical filtering, the amount of calculation increases. In addition, artifacts occur in the processing result at the overlapping portion of the projection plane. Further, it is difficult to apply a filter larger than the projection plane.

また、非特許文献４（Y.C.Su,et al.,「Learning spherical convolution for fast features from 360 imagery」,Advances in Neural Information Processing Systems,2017.）には、図１に示すように、球面信号を極座標で等間隔にサンプリングする「正距円筒形式（球の表面を円筒に広げた図法）」に変換し、その上でフィルタ処理を行う方法が開示されている。なお、図１（ａ）及び図１（ｂ）に示す例は、ｚ軸と球の中心Ｏを通る動径ｖがなす角度を第１の角度θ（０≦θ≦πの範囲）、ｚ軸に垂直な平面のｘ軸と、この平面への動径ｖの射影がなす角度を第２の角度−φ（０≦φ＜２π）とし、球の表面の信号点Ｐを、球の表面を円筒に広げた正距円筒に対して投影（信号点Ｐ´）した例である。 In addition, Non-Patent Document 4 (YCSu, et al., “Learning spherical convolution for fast features from 360 imagery”, Advances in Neural Information Processing Systems, 2017.) discloses that a spherical signal is represented by polar coordinates as shown in FIG. There is disclosed a method of converting into an equirectangular form (a projection in which the surface of a sphere is expanded into a cylinder) in which sampling is performed at equal intervals, and then performing a filtering process. In the example shown in FIGS. 1A and 1B, the angle formed by the z-axis and the moving radius v passing through the center O of the sphere is represented by a first angle θ (0 ≦ θ ≦ π), z The angle between the x-axis of the plane perpendicular to the axis and the projection of the radial radius v onto this plane is defined as a second angle -φ (0 ≦ φ <2π), and the signal point P on the surface of the sphere is defined as the surface of the sphere This is an example of projecting (signal point P ′) on an equidistant cylinder in which is expanded into a cylinder.

正距円筒形式の場合、高緯度となるほど水平方向に画像が拡大されるため、拡大率に合わせてフィルタのカーネルを変形する必要がある。このため、高緯度となるほど、大きいサイズのフィルタ処理を、多くの点の信号に対して行うこととなるため、計算効率が悪い。 In the case of the equirectangular format, the image is enlarged in the horizontal direction as the latitude increases, so that the filter kernel needs to be modified according to the enlargement ratio. For this reason, the higher the latitude, the larger the size of the filter processing to be performed on the signals of many points, and the lower the calculation efficiency.

このように、上述の球面信号に対するフィルタ処理は、いずれも計算量が大きく、大きなカーネルでのフィルタ処理が困難となり、また、アーティファクトが発生する問題もある。 As described above, any of the above-described filtering processes on the spherical signal requires a large amount of calculation, which makes it difficult to perform the filtering process with a large kernel, and also has a problem that an artifact occurs.

第１の実施の形態の信号処理装置は、球面フィルタ処理を特定軸方向の平面フィルタ処理に還元する。これにより、少ない計算量で、アーティファクトを発生させることなく、フィルタサイズの制限を受けずに、高い近似精度で球面フィルタ処理を実現している。 The signal processing device according to the first embodiment reduces spherical filter processing to planar filter processing in a specific axis direction. As a result, the spherical filter processing is realized with high approximation accuracy with a small amount of calculation and without generating artifacts and without being limited by the filter size.

（信号処理装置の構成）
図２に、第１の実施の形態の信号処理装置の構成図を示す。この図２に示すように、信号処理装置は、信号処理部１（信号処理装置の一例）及び記憶部２を有している。また、信号処理部１は、信号取得部１１、軸分解部１２、平面フィルタ処理部１３（フィルタ処理部の一例）、軸統合部１４及び出力部１５を有している。記憶部２としては、例えばＲＡＭ（Random Access Memory）又はＲＯＭ（Read Only Memory）等の半導体記憶装置、ＨＤＤ（Hard Disk Drive）、又は、光記憶媒体等を用いることができる。信号取得部１１〜出力部１５は、ハードウェア又はソフトウェアで実現できる。ソフトウェアで実現する場合、例えば記憶部２に記憶されている信号処理プログラムを、信号処理部が読み出し、ＲＡＭ等に信号取得部１１〜出力部１５を展開して実行する。 (Configuration of signal processing device)
FIG. 2 shows a configuration diagram of the signal processing device according to the first embodiment. As shown in FIG. 2, the signal processing device includes a signal processing unit 1 (an example of a signal processing device) and a storage unit 2. In addition, the signal processing unit 1 includes a signal acquisition unit 11, an axis decomposition unit 12, a plane filter processing unit 13 (an example of a filter processing unit), an axis integration unit 14, and an output unit 15. As the storage unit 2, for example, a semiconductor storage device such as a random access memory (RAM) or a read only memory (ROM), a hard disk drive (HDD), or an optical storage medium can be used. The signal acquisition unit 11 to the output unit 15 can be realized by hardware or software. In the case of realization by software, for example, the signal processing unit reads a signal processing program stored in the storage unit 2, and develops and executes the signal acquisition unit 11 to the output unit 15 in a RAM or the like.

なお、信号取得部１１〜出力部１５のうち、一部をソフトウェアで実現し、他の部分を、例えばＩＣ（Integrated Circuit）等のハードウェアで実現してもよい。また、信号取得部１１〜出力部１５の各処理は、信号処理プログラム単体で実行しても良いし、他のプログラムに一部の処理を実行させてもよい。或いは、信号取得部１１〜出力部１５の各処理は、他のプログラムを介して間接的に実行させても良い。 A part of the signal acquisition unit 11 to the output unit 15 may be realized by software, and the other part may be realized by hardware such as an IC (Integrated Circuit). Further, each processing of the signal acquisition unit 11 to the output unit 15 may be executed by a single signal processing program, or a part of processing may be executed by another program. Alternatively, each processing of the signal acquisition unit 11 to the output unit 15 may be executed indirectly through another program.

また、信号処理プログラムは、インストール可能な形式または実行可能な形式のファイルでＣＤ−ＲＯＭ、フレキシブルディスク（ＦＤ）などのコンピュータ装置で読み取り可能な記録媒体に記録して提供してもよい。また、ＣＤ−Ｒ、ＤＶＤ（Digital Versatile Disk）、ブルーレイディスク（登録商標）、半導体メモリなどのコンピュータ装置で読み取り可能な記録媒体に記録して提供してもよい。また、インターネット等のネットワーク経由でインストールするかたちで提供してもよいし、機器内のＲＯＭ等に予め組み込んで提供してもよい。 In addition, the signal processing program may be provided by being recorded in a recording medium readable by a computer device such as a CD-ROM or a flexible disk (FD) in a file in an installable format or an executable format. Alternatively, the program may be provided by being recorded on a computer-readable recording medium such as a CD-R, a DVD (Digital Versatile Disk), a Blu-ray Disc (registered trademark), and a semiconductor memory. Further, it may be provided by being installed via a network such as the Internet, or may be provided by being incorporated in a ROM or the like in the device in advance.

（信号処理動作）
図３は、第１の実施の形態の信号処理装置の信号処理の流れを示すフローチャートである。この図３のフローチャートを用いて、入力された正距円筒形式（図１）の球面信号に対してフィルタ処理を行う例を説明する。まず、ステップＳ１では、信号取得部１１が記憶部２から処理対象の球面信号（図５（ａ））を取得する。この時、球面信号は、図１（ｂ）に示すように、正距円筒形式での正方格子グリッド上の値で表現されているものとする。図１の例は、図１（ａ）に示す球面信号「Ｐ」が、図１（ｂ）に示す正距円筒形式での正方格子グリッド上の値「Ｐ´」で表現された例を示している。 (Signal processing operation)
FIG. 3 is a flowchart illustrating a flow of signal processing of the signal processing device according to the first embodiment. An example of performing a filtering process on the input spherical signal in the form of an equirectangular cylinder (FIG. 1) will be described with reference to the flowchart of FIG. First, in step S <b> 1, the signal acquisition unit 11 acquires a spherical signal to be processed (FIG. 5A) from the storage unit 2. At this time, it is assumed that the spherical signal is represented by values on a square grid grid in the form of an equirectangular cylinder as shown in FIG. The example of FIG. 1 shows an example in which the spherical signal “P” shown in FIG. 1A is represented by a value “P ′” on a square grid grid in the equirectangular cylinder format shown in FIG. 1B. ing.

ステップＳ２では、軸分解部１２が、ステップＳ１で取得された球面信号を垂直（θ）方向に伸縮処理及び分解処理（図５（ｂ））する。具体的には、軸分解部１２は、まず、図４（ａ）に示すように球面信号の水平方向の信号点数をＮ_ｗ、フィルタのカーネルの垂直方向のサイズをＭ_ｈとし、球面信号の各行を中心としてＭ_ｈ×Ｎ_ｗのサイズの部分信号を抽出する。そして、軸分解部１２は、図４（ｂ）に示すように、抽出した部分信号を、図１（ａ）に示す座標系のθ（ｚ軸と球の中心Ｏを通る動径ｖがなす第１の角度θ（０≦θ≦πの範囲）））に対して、ｓｉｎθの割合で水平方向を縮小処理する。この縮小処理には、例えば最近傍補間処理、線形補間処理、バイキュービック補間処理等を用いることができる。 In step S2, the axis disassembling unit 12 performs expansion / contraction processing and decomposition processing on the spherical signal acquired in step S1 in the vertical (θ) direction (FIG. 5B). Specifically, the shaft decomposition section 12, first, the horizontal direction of the signal points of the spherical signal as shown in FIG. 4 (a) and N _w, the size of the vertical direction of the kernel of the filter and M _h, the spherical signal Each row extracting the size partial signal of M _h × N _w around the. Then, as shown in FIG. 4B, the axis disassembling unit 12 converts the extracted partial signal into a radial radius v passing through the coordinate system θ (the z axis and the center O of the sphere shown in FIG. 1A). With respect to the first angle θ (range of 0 ≦ θ ≦ π))), the horizontal direction is reduced at the rate of sin θ. For this reduction processing, for example, nearest neighbor interpolation processing, linear interpolation processing, bicubic interpolation processing, or the like can be used.

これにより、幾何学的な歪みを補正することができる。なお、厳密に言えば、カーネル幅で拡張した部分には、僅かな歪みが存在する。しかし、カーネル幅が小さければ、高い近似精度を得ることができる。また、カーネルが信号からはみ出る場合は、ゼロパディング（ゼロフィル）で対応してもよい。 Thereby, geometric distortion can be corrected. Strictly speaking, there is a slight distortion in the portion expanded by the kernel width. However, if the kernel width is small, high approximation accuracy can be obtained. When the kernel is out of the signal, zero padding (zero fill) may be used.

ステップＳ３では、平面フィルタ処理部１３が、軸分解した部分信号に対して、平面フィルタ処理（図５（ｃ））を施す。これは球面フィルタ処理の近似となっている。なお、部分信号の左右端は、巡回接続していると見なしてフィルタ処理を行うことが望ましい。この処理によって、球面信号の各行に対して、１×Ｎ_ｗのサイズの部分信号が得られる。 In step S3, the plane filter processing unit 13 performs plane filter processing (FIG. 5C) on the axis-decomposed partial signal. This is an approximation of spherical filtering. Note that it is desirable that the left and right ends of the partial signals be subjected to the filter processing assuming that they are cyclically connected. This process, for each row of the spherical signal, the size of the partial signals of the 1 × N _w can be obtained.

ステップＳ４では、軸統合部１４が、平面フィルタ処理された部分信号を伸縮して統合し、元の正距離円筒形式の信号に戻し、縮小比率の逆の１／ｓｉｎθで拡大して統合処理（図５（ｄ））して出力する（図５（ｅ））。 In step S4, the axis integration unit 14 expands / contracts the partial signals subjected to the plane filter processing, integrates them, returns the signals to the original equidistant cylindrical form, expands them by the inverse of the reduction ratio of 1 / sin θ, and performs integration processing ( 5 (d)) and output (FIG. 5 (e)).

（第１の実施の形態の効果）
第１の実施の形態の信号処理装置は、正距円筒形式の球面信号に対して少ない計算量でフィルタ処理を行うことができる。入力信号形式としては、説明上の便宜上、一例として正距円筒形式を示したが、軸分解処理及び軸統合処理の際に、極座標を反映すれば良いので任意の球面信号の表現形式を用いてもよい。 (Effects of the First Embodiment)
The signal processing device according to the first embodiment can perform a filtering process on a spherical signal in the equirectangular form with a small amount of calculation. As the input signal format, for the sake of convenience in explanation, an equirectangular cylinder format was shown as an example, but in the axis resolution processing and axis integration processing, it is sufficient to reflect polar coordinates, so any arbitrary spherical signal expression format is used. Is also good.

また、垂直方向に入力信号を軸分解処理する際に、図１（ａ）に示した極座標θに応じてリサイズすることで、球面上の距離を反映させることができる。このため、カーネルを変形処理することなく、一様にフィルタ処理を行うことができる。カーネルの変形処理を不用とすることができるため、非特許文献４（Y.C.Su,et al.,「Learning spherical convolution for fast features from 360 imagery」,Advances in Neural Information Processing Systems,2017.）のように、実際よりも大きなサイズでのフィルタ処理を不要とすることができ、計算量を削減することができる。 Further, when the input signal is subjected to the axis decomposition process in the vertical direction, the distance on the spherical surface can be reflected by resizing according to the polar coordinate θ shown in FIG. For this reason, it is possible to uniformly perform the filter processing without performing the deformation processing on the kernel. Since the kernel deformation processing can be made unnecessary, as in Non-Patent Document 4 (YCSu, et al., “Learning spherical convolution for fast features from 360 imagery”, Advances in Neural Information Processing Systems, 2017.) In addition, it is possible to eliminate the need for filter processing with a size larger than the actual size, and to reduce the amount of calculation.

なお、カーネルの変形処理の代わりにリサイズ処理が必要となるが、線形補間処理で行えば、１行あたり高々Ｎ_ｗＭ_ｈｓｉｎθ回ずつの加算と除算で実現できるため、総合的に少ない計算量とすることができる。 Note that resizing processing is required instead of kernel deformation processing. However, if linear interpolation processing is performed, it can be realized by addition and division at most N _w M _h sin θ per row, so that the total amount of calculation is small. It can be.

また、特許文献１（特許第５７３４３２７号公報）、特許文献２（特許第６０６７９３４号公報）、非特許文献１（T.C.Cohen,et al.,「Spherical CNNs.」arXiv:1801.10130,2018.）及び非特許文献２（C.Esteves,et al.,「3D object classification and retrieval with Spherical CNNs.」arXiv:1711.06721,2017.）で開示されている一般化されたフーリエ変換を使う方式では、信号点数Ｎ、カーネルサイズＭに対して計算量はＯ（ＮｌｏｇＮ）となる。 Patent Document 1 (Japanese Patent No. 5734327), Patent Document 2 (Japanese Patent No. 6067934), Non-Patent Document 1 (TCCohen, et al., “Spherical CNNs.” ArXiv: 1801.10130, 2018.) In a method using a generalized Fourier transform disclosed in Patent Document 2 (C. Esteves, et al., “3D object classification and retrieval with Spherical CNNs.” ArXiv: 1711.06721, 2017.), the number of signal points N, The calculation amount is O (N log N) for the kernel size M.

これに対して、第１の実施の形態の信号処理装置の場合、信号点数Ｎ、カーネルサイズＭに対して計算量はＯ（ＮＭ）であり、カーネルが小さい時にフーリエ変換処理を行う方式よりも計算量を削減することができる。 On the other hand, in the case of the signal processing device according to the first embodiment, the calculation amount is O (NM) for the number of signal points N and the kernel size M, which is smaller than the method of performing the Fourier transform processing when the kernel is small. The amount of calculation can be reduced.

また、特許文献３（特開２０１７−２０７９６０号公報）又は非特許文献３（W.Boomsma,et al.,「Spherical convolutions and their application in molecular modelling」,Advances in Neural Information Processing Systems,2017.）のような平面射影を行う方式に対しては、計算量の優劣は平面の分割数及び分割した平面のオーバーラップ率に依存する。しかし、第１の実施の形態の信号処理装置は、平面射影のオーバーラップ部分でのアーティファクトが発生する不都合を防止でき、また、射影した平面以上のサイズのフィルタ処理が困難となる不都合を防止できる。 Patent Document 3 (JP-A-2017-207960) or Non-Patent Document 3 (W. Boomsma, et al., “Spherical convolutions and their application in molecular modeling”, Advances in Neural Information Processing Systems, 2017.) For such a method of performing plane projection, the degree of calculation depends on the number of plane divisions and the overlap ratio of the divided planes. However, the signal processing device according to the first embodiment can prevent the inconvenience that an artifact occurs at the overlapping portion of the planar projection, and can prevent the inconvenience that it is difficult to perform the filtering process with a size larger than the projected plane. .

このような効果をまとめると、第１の実施の形態の信号処理装置は、球面フィルタ処理を、少ない計算量で、アーティファクトの発生なく、フィルタサイズの制限なく、高い近似精度で実現することができる。 Summarizing such effects, the signal processing device according to the first embodiment can realize the spherical filter processing with a small amount of calculation, no generation of artifacts, no limitation on filter size, and high approximation accuracy. .

（第２の実施の形態）
次に、第２の実施の形態の信号処理装置の説明をする。この第２の実施の形態の信号処理装置は、球面信号に対してフィルタ処理を行う際に、インテグラルイメージを内包し、大きいサイズのフィルタに対しても計算量の削減を可能とした例である。なお、この第２の実施の形態と上述の第１の実施の形態とでは、図２に示した軸分解部１２における、図３のフローチャートのステップＳ２の軸分解処理が異なる。このため、以下、両者の差異の説明のみ行い、重複説明は省略する。 (Second embodiment)
Next, a signal processing device according to a second embodiment will be described. The signal processing device according to the second embodiment is an example in which an integral image is included when filtering processing is performed on a spherical signal, and the amount of calculation can be reduced even for a large-sized filter. is there. Note that the second embodiment differs from the above-described first embodiment in the axis disassembling process of step S2 in the flowchart of FIG. 3 in the axis disassembly unit 12 shown in FIG. Therefore, hereinafter, only the difference between the two will be described, and redundant description will be omitted.

この第２の実施の形態の信号処理装置の場合、図３のフローチャートのステップＳ２となると、軸分解部１２が、ステップＳ１で取得した球面信号を垂直（θ）方向に伸縮し、分解する。この際、軸分解部１２は、まず、処理対象の正距円筒形式の球面信号に対して、非特許文献５（P.Viola and M.Jones,「Rapid object detection using a boosted cascade of simple features」,in IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2001,vol.1.）にも開示されているインテグラルイメージを作成する。このインテグラルイメージを用いる手法は、図６に示す正方格子上の信号に対して、原点（０，０）から位置（ｉ，ｊ）までの累積和をメモリに格納し、その値を用いて任意のボックスフィルタを高速に実現する手法である。 In the case of the signal processing device according to the second embodiment, when step S2 in the flowchart of FIG. 3 is reached, the axis decomposing unit 12 expands and contracts the spherical signal acquired in step S1 in the vertical (θ) direction and decomposes the spherical signal. At this time, first, the axis disassembling unit 12 applies a non-patent document 5 (P. Viola and M. Jones, “Rapid object detection using a boosted cascade of simple features”) to the spherical signal in the form of an equirectangular cylinder to be processed. , in IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2001, vol.1.). In the method using the integral image, a cumulative sum from the origin (0, 0) to the position (i, j) is stored in a memory with respect to the signal on the square lattice shown in FIG. This is a method for realizing an arbitrary box filter at high speed.

具体的には、軸分解部１２は、非負整数座標（ｉ，ｊ）の信号値をｆ（ｉ，ｊ）として、以下の（１）式に示す漸化式を計算してインテグラルイメージＦを求める。 Specifically, the axis disassembling unit 12 calculates a recurrence formula represented by the following expression (1) with the signal value of the non-negative integer coordinates (i, j) as f (i, j), and calculates the integral image F Ask for.

なお、（１）式のインテグラルイメージＦのいずれかの引数が負になる場合、インテグラルイメージＦの値は「０」とする。このように計算されたインテグラルイメージＦ（ｉ，ｊ）には原点（０，０）から位置（ｉ，ｊ）までの信号値ｆ（ｉ，ｊ）の和が記録される。従って、図６に示す位置（ｉ１，ｊ１）及び位置（ｉ０，ｊ０）に囲まれた矩形領域の信号値ｆの和を求めるには、以下の（２）式の右辺を計算すれば良い。 Note that when any argument of the integral image F in Expression (1) is negative, the value of the integral image F is set to “0”. The sum of the signal values f (i, j) from the origin (0,0) to the position (i, j) is recorded in the integral image F (i, j) thus calculated. Therefore, to obtain the sum of the signal values f of the rectangular area surrounded by the position (i1, j1) and the position (i0, j0) shown in FIG. 6, the right side of the following equation (2) may be calculated.

インテグラルイメージＦの算出に必要となる計算回数は、（１）式に基づいてＮ個の信号点に対して行う加算の演算が３Ｎ回と、（２）式に基づいて行う、特定の矩形領域の信号値の和の加減算の演算が３回となる。この演算は信号値の和を求める矩形領域の面積に依存性が無いため、特に、大きい領域の信号値の和を求める際に計算効率が良い。 The number of calculations required to calculate the integral image F is 3N times for addition performed on N signal points based on the equation (1) and a specific rectangular shape performed based on the equation (2). The operation of addition and subtraction of the sum of the signal values of the area is three times. Since this calculation does not depend on the area of the rectangular region for which the sum of the signal values is obtained, the calculation efficiency is particularly high when the sum of the signal values of a large region is obtained.

軸分解部１２は、ステップＳ２において、このようなインテグラルイメージＦを、正距円筒形式の球面信号に対して作成する。そして、軸分解部１２は、インテグラルイメージＦを用いて、垂直（θ）方向への伸縮処理及び分解処理を行う。座標θの行では、その行を中心にフィルタのカーネルの垂直方向のサイズＭ_ｈ個の行を対象とする。対象行をそれぞれ１／ｓｉｎθに比例する水平幅（四捨五入等で整数値に丸める）で分解し、分解領域内において信号点の和を算出する。 In step S2, the axis disassembling unit 12 creates such an integral image F with respect to the spherical signal in the equirectangular form. Then, the axis disassembling unit 12 uses the integral image F to perform expansion / contraction processing in the vertical (θ) direction and decomposition processing. The row coordinate theta, directed to a vertical size M _h rows of kernel filter centered on the line. Each of the target rows is decomposed with a horizontal width (rounded to an integer value by rounding or the like) proportional to 1 / sin θ, and the sum of signal points is calculated in the decomposed area.

図７（ａ）及び図７（ｂ）に示す例は、座標θｉの行を中心としたフィルタのカーネルの垂直方向のサイズＭ_ｈ個の対象行を、それぞれＮ_ｗ／ｓｉｎθ_ｉに比例する水平幅（四捨五入等で整数値に丸める）で分解し、分解領域内において信号点の和を算出した例である。 Example shown in FIG. 7 (a) and 7 (b), the horizontal proportional vertical size M _h pieces of target row of the kernel of the filter around the row coordinate .theta.i, each N _{w /} sinθ _i This is an example in which a signal is decomposed by a width (rounded to an integer value by rounding or the like) and the sum of signal points is calculated in the decomposed region.

同様に、図７（ｃ）及び図７（ｄ）に示す例は、座標θ_ｉ＋１の行を中心としたフィルタのカーネルの垂直方向のサイズＭ_ｈ個の対象行を、それぞれＮ_ｗ／ｓｉｎθ_ｉ＋１に比例する水平幅（四捨五入等で整数値に丸める）で分解し、分解領域内において信号点の和を算出した例である。 Similarly, the example shown in FIG. 7 (c) and FIG. 7 (d), the vertical size _{M h} pieces of target row of the kernel filter centered on coordinates theta _{i + 1} row, each _N w / _sinθ _{i + 1} This is an example in which the signal is decomposed with a horizontal width (rounded to an integer value by rounding or the like) proportional to, and the sum of signal points is calculated in the decomposed region.

水平分解幅は、１／ｓｉｎθに依存するため、θの値が変わると、図７（ｂ）及び図７（ｄ）を見比べて分かるように、分解幅が変わる。インテグラルイメージを用いることで、分解幅に依存することなく、一定の計算量で和を計算できるため、計算量を削減することができる。このように球面信号を軸分解した後は、上述の第１の実施の形態と同様のフィルタ処理が行われる。 Since the horizontal decomposition width depends on 1 / sin θ, when the value of θ changes, the decomposition width changes as can be seen by comparing FIGS. 7B and 7D. By using the integral image, the sum can be calculated with a constant calculation amount without depending on the decomposition width, so that the calculation amount can be reduced. After the spherical signal is axially decomposed as described above, the same filter processing as that in the first embodiment is performed.

なお、上述の例では、信号値の和を取る矩形の高さは１であるから、実際には垂直方向にインテグラルイメージを作成する必要はない。行毎に水平方向の１次元のインテグラルイメージ（ライン）を作成すればよい。すなわち、位置（ｉ，ｊ）の信号値をｆ（ｉ，ｊ）として、以下の漸化式（（３）式）を計算してインテグラルイメージＦを求める。 In the above example, since the height of the rectangle that takes the sum of the signal values is 1, it is not actually necessary to create an integral image in the vertical direction. A horizontal one-dimensional integral image (line) may be created for each row. That is, assuming that the signal value at the position (i, j) is f (i, j), the following recurrence formula (formula (3)) is calculated to obtain the integral image F.

また、位置（ｉ，ｊ１）、位置（ｉ，ｊ０）に囲まれた領域の信号値ｆの和を求めるには、以下の（４）式の右辺を計算すれば良い。 Further, in order to obtain the sum of the signal values f in the region surrounded by the position (i, j1) and the position (i, j0), the right side of the following equation (4) may be calculated.

なお、信号の垂直解像度に対して低い解像度でフィルタ処理を行う場合は、上述の（１）式の形式で水平及び垂直の２次元のインテグラルイメージを用いて、フィルタの解像度に合わせて軸分解信号を生成してもよい。このような手法を用いることで、空間的に広いフィルタ処理も、少ない計算量で可能とすることができる。 When filtering is performed at a resolution lower than the vertical resolution of the signal, the horizontal and vertical two-dimensional integral images are used in the form of the above-described formula (1) to separate the axes according to the resolution of the filter. A signal may be generated. By using such a method, spatially wide filter processing can be performed with a small amount of calculation.

（第２の実施の形態の効果）
第２の実施の形態の信号処理装置は、正距円筒形式の球面信号に対して少ない計算量でフィルタ処理を行うことができる。特に、インテグラルイメージを用いて軸分解を行う際の計算量を削減することができ、また、大きいカーネルサイズのフィルタ処理も少ない計算量で可能とすることができる。 (Effect of Second Embodiment)
The signal processing device according to the second embodiment can perform the filtering process on the spherical signal of the equirectangular form with a small amount of calculation. In particular, it is possible to reduce the amount of calculation when performing axis decomposition using an integral image, and it is also possible to perform filter processing with a large kernel size with a small amount of calculation.

（第２の実施の形態の第１の変形例）
次に、第２の実施の形態の第１の変形例を説明する。インテグラルイメージとしては、以下の（５）式の漸化式で生成されるインテグラルイメージを用いることもできる。 (First Modification of Second Embodiment)
Next, a first modification of the second embodiment will be described. As the integral image, an integral image generated by the recurrence formula of the following expression (5) can be used.

このとき、αのべき乗列に対する畳み込みを、以下の（６）式の右辺の計算で実現できる（特許文献４（特開２０１６−１０３０８９号公報）参照）。 At this time, the convolution with respect to the power sequence of α can be realized by the calculation on the right side of the following equation (6) (see Patent Document 4 (Japanese Patent Application Laid-Open No. 2006-103089)).

ここで、α＝ｅｘｐ（√−１ω）とすると、角周波数ωの信号との畳み込みが可能となる。なお、位相は、更にαを乗算処理することで調整可能である。または、インテグラルイメージＦを二次元ベクトル、αを実コンパニオン行列として、以下の（７）式及び（８）式の演算を行っても、角周波数ωのカーネルとの畳み込みが可能である。 Here, if α = exp (√−1ω), convolution with a signal of angular frequency ω is possible. Note that the phase can be adjusted by further multiplying by α. Alternatively, even when the integral image F is a two-dimensional vector and α is a real companion matrix, the convolution with the kernel of the angular frequency ω can be performed by performing the operations of the following expressions (7) and (8).

この方式において、Ｆ（ｉ，ｊ）の第二要素は、Ｆ（ｉ，ｊ−１）の第一要素と等しくなることから、Ｆに複素数を用いる前者の方式よりもメモリ使用量を半減できる。このようなべき乗型のインテグラルイメージを使えば、単純に信号値の和を取るボックスフィルタではなく、三角関数型のフィルタ係数を実現できる。例えば、軸分解で和を取る矩形の中心に三角関数の極大値が来るようにフィルタ係数を設計すれば、ガウシアンフィルタのような重みつきの平滑化効果を生むことができる。 In this method, since the second element of F (i, j) is equal to the first element of F (i, j-1), the memory usage can be reduced by half compared to the former method using a complex number for F. . If such a power-type integral image is used, a triangular function-type filter coefficient can be realized instead of a box filter that simply takes the sum of signal values. For example, if the filter coefficient is designed so that the maximum value of the trigonometric function is located at the center of the rectangle whose sum is obtained by axial decomposition, a weighted smoothing effect such as a Gaussian filter can be produced.

なお、垂直方向座標θに応じて水平方向拡大率が変わることから、上述のωを１／ｓｉｎθに比例させることが望ましい。また、特許文献４（特開２０１６−１０３０８９号公報）のようにシンク関数（ｓｉｎｃ関数）を複数の三角関数の和で近似し、複数のべき乗型のインテグラルイメージを用いてｓｉｎｃ関数に基づくリサイジングを実現することができる。以上のようなべき乗型のインテグラルイメージに基づく軸分解を用いれば、精度の高いリサイジングを高速に実現でき、結果として高精度のフィルタ処理を少ない計算量で実現することができる。 Since the horizontal enlargement ratio changes in accordance with the vertical coordinate θ, it is desirable to make the above-mentioned ω proportional to 1 / sin θ. Also, as in Patent Document 4 (JP-A-2006-103089), a sink function (sinc function) is approximated by the sum of a plurality of trigonometric functions, and a resynchronization based on the sinc function is performed using a plurality of power-type integral images. Sizing can be realized. By using the above-described axis decomposition based on the power-type integral image, highly accurate resizing can be realized at high speed, and as a result, highly accurate filter processing can be realized with a small amount of calculation.

（第２の実施の形態の第２の変形例）
インテグラルイメージを用いることで、マルチスケールのフィルタ処理を高速化することができる。例えば、画像信号において、カメラの近くで撮影された物体と遠くで撮影された物体は、大きさが違うが相似形である等のように、信号の中にスケールの異なる相似形の信号が含まれていることがある。そのような信号に対して、マルチスケールのフィルタ処理を施す応用が存在する。 (Second Modification of Second Embodiment)
The use of the integral image can speed up multi-scale filtering. For example, in an image signal, an object photographed near a camera and an object photographed far away include signals of similar shapes with different scales, such as different sizes but similar shapes. May have been. There is an application that performs multi-scale filtering on such a signal.

例えば、図８（ａ）に示すように、縦×横が３×３のフィルタ係数のフィルタを、複数のスケールで適用することを考える。図８（ｂ）〜図８（ｄ）及び図８（ｅ）〜図８（ｇ）に示すように、正距円筒形式上で異なる解像度で軸分解により3行を取り出す際、ひとつのインテグラルイメージから上述の手法で計算を行えば、計算量を低減することができる。この例では、フィルタ係数はスケール間で共有したが、スケールに応じてフィルタ係数を変更してもよい。 For example, as shown in FIG. 8A, it is considered that a filter having a filter coefficient of 3 × 3 in height and width is applied on a plurality of scales. As shown in FIGS. 8 (b) to 8 (d) and FIGS. 8 (e) to 8 (g), when taking out three rows by axial decomposition at different resolutions on the equirectangular form, one integral If the calculation is performed from the image by the above-described method, the amount of calculation can be reduced. In this example, the filter coefficient is shared between the scales, but the filter coefficient may be changed according to the scale.

（第３の実施の形態）
次に、第３の実施の形態の信号処理装置の説明をする。この第３の実施の形態の信号処理装置は、入力された球面信号に対して、フィルタのカーネルを水平及び垂直方向に分解して、フィルタ処理を施す例である（フィルタのカーネル分解）。 (Third embodiment)
Next, a signal processing device according to a third embodiment will be described. The signal processing device according to the third embodiment is an example in which a kernel of a filter is decomposed in a horizontal and a vertical direction with respect to an input spherical signal to perform a filter process (kernel decomposition of a filter).

図９は、第３の実施の形態の信号処理装置の構成図である。第３の実施の形態の信号処理装置の場合、図２と見比べてわかるように、平面フィルタ処理部１３の代りに、軸統合部１４の前段に水平フィルタ処理部２１を有すると共に、軸統合部１４の後段に垂直フィルタ処理部２２を有する。 FIG. 9 is a configuration diagram of a signal processing device according to the third embodiment. In the case of the signal processing device according to the third embodiment, as can be seen from comparison with FIG. 2, a horizontal filter processing unit 21 is provided in front of the axis integration unit 14 instead of the plane filter processing unit 13, and the axis integration unit A vertical filter processing unit 22 is provided at a stage subsequent to 14.

また、図１０は、第３の実施の形態の信号処理装置の信号処理の流れを示すフローチャートである。第３の実施の形態の信号処理装置の場合、図３と見比べて分かるように、ステップＳ３の平面フィルタ処理の代りに、ステップＳ４の軸統合処理の前処理として、ステップＳ１１の水平フィルタ処理を有する。また、ステップＳ３の平面フィルタ処理の代りに、ステップＳ４の軸統合処理の後処理として、ステップＳ１２の垂直フィルタ処理を有する。 FIG. 10 is a flowchart illustrating a flow of signal processing of the signal processing device according to the third embodiment. In the case of the signal processing device according to the third embodiment, as can be seen from comparison with FIG. 3, instead of the plane filter processing in step S3, the horizontal filter processing in step S11 is performed as preprocessing of the axis integration processing in step S4. Have. Further, instead of the plane filter processing in step S3, the post-axis integration processing in step S4 includes a vertical filter processing in step S12.

なお、上述の第１の実施の形態と第３の実施の形態とでは、この点のみが異なる。このため、以下、両者の差異の説明のみ行い、重複説明は省略する。 The first embodiment and the third embodiment described above differ only in this point. Therefore, hereinafter, only the difference between the two will be described, and redundant description will be omitted.

図１０のフローチャートにおいて、ステップＳ２では、軸分解部１２が入力信号を水平方向に分解することとしたが、第３の実施の形態の軸分解部１２は、入力信号を、垂直方向のカーネルサイズＭ_ｈ＝１として分解する。 In the flowchart of FIG. 10, in step S2, the axis decomposing unit 12 decomposes the input signal in the horizontal direction. However, the axis decomposing unit 12 of the third embodiment converts the input signal into a kernel size in the vertical direction. Decompose as M _h = 1.

次に、ステップＳ１１において、水平フィルタ処理部２１が、上述のように分解された部分信号に、水平方向のフィルタ（カーネルサイズが１×Ｍ_ｗ）を適用する。フィルタのカーネルは、例えば二次元のＭ_ｈ×Ｍ_ｗ行列Ｇで与えられているとき、以下の（９）式に示すフロベニウスノルムの二乗を最小化するように求めた垂直方向フィルタｈ（Ｍ_ｈ次元ベクトル）及び水平方向フィルタｗ（Ｍ_ｗ次元ベクトル）に、事前に分解しておく。 Next, in step S11, the horizontal filter processing unit 21 applies a filter in the horizontal direction (kernel size is 1 × M _w ) to the partial signal decomposed as described above. When the filter kernel is given by, for example, a two-dimensional M _h × M _w matrix G, a vertical filter h (M _h) obtained to minimize the square of the Frobenius norm shown in the following equation (9) Dimensional vector) and a horizontal filter w (M _w- dimensional vector).

（９）式に示すフロベニウスノルムの二乗を最小化する垂直方向フィルタｈ及び水平方向フィルタｗは、それぞれＧＧ^ＴとＧ^ＴＧの最大固有値αの固有ベクトルｅ_ｈ，ｅ_ｗの定数倍として得られる。比例定数は、ＧＧ^ＴとＧ^ＴＧの最大固有λの平方根であり、垂直方向フィルタｈ及び水平方向フィルタｗをそれぞれ何倍にするかの不定性がある。一例ではあるが、「ｈ＝λ^１／４ｅ_ｈ，ｗ＝λ^１／４ｅ_ｗ」とすることができる。 (9) vertical filter h and the horizontal direction filter w minimizes the square of the Frobenius norm in the expression, the eigenvectors e _h largest eigenvalues α respectively GG ^T and G ^{T _G,} is obtained as a constant multiple of e _w. Proportionality constant is the maximum intrinsic λ of the square root of the GG ^T and G ^{T G,} there is one of uncertainty to the vertical filters h and the horizontal direction filter w many times each. There is an example, but can be a ^{_{"h = λ 1/4 e h, w}} = λ 1/4 e w ".

図１１（ａ）及び図１１（ｂ）に、３×３のフィルタの２つの分解例を示す。図１１（ａ）に示す例は、水平方向にそれぞれ３つずつ並べられたフィルタ係数「−１、−２、−１」、「０、０、０」及び「１、２、１」を垂直方向に積層するかたちで形成された３×３の行列Ｇのフィルタを、上述の（９）式に基づいて垂直方向フィルタｈ及び水平方向フィルタｗに分解した例である。この例の場合、垂直方向フィルタｈは、「−１．３１６１、０、１．３１６１」として算出され、水平方向フィルタｗ^Ｔは、「０．７５９８、１．５１９７、０．７５９８」として算出された例である。なお、この場合の分解誤差は、「０．０」である。 FIGS. 11A and 11B show two decomposition examples of the 3 × 3 filter. In the example shown in FIG. 11A, three filter coefficients “−1, −2, −1”, “0, 0, 0” and “1, 2, 1” arranged three by one in the horizontal direction are vertically set. This is an example in which a filter of a 3 × 3 matrix G formed by laminating in the direction is decomposed into a vertical filter h and a horizontal filter w based on the above equation (9). In the case of this example, the vertical direction filter h is calculated as “−1.3161, 0, 1.3161”, and the horizontal direction filter w ^T is calculated as “0.7598, 1.5197, 0.7598”. This is an example. Note that the resolution error in this case is “0.0”.

図１１（ｂ）に示す例は、水平方向にそれぞれ３つずつ並べられたフィルタ係数「１、２、０」、「−１、４、１」及び「１、３、１」を垂直方向に積層するかたちで形成された３×３の行列Ｇのフィルタを、上述の（９）式に基づいて垂直方向フィルタｈ及び水平方向フィルタｗに分解した例である。この例の場合、垂直方向フィルタｈは、「０．８３９６、１．７３５１、１．３５２４」として算出され、水平方向フィルタｗ^Ｔは、「０．０８２４、２．２８６４、０．５５６９」として算出された例である。なお、この場合の分解誤差は、「３．２５９１」である。 In the example shown in FIG. 11B, three filter coefficients “1, 2, 0”, “−1, 4, 1”, and “1, 3, 1” arranged three by one in the horizontal direction are arranged in the vertical direction. This is an example in which a filter of a 3 × 3 matrix G formed in the form of lamination is decomposed into a vertical filter h and a horizontal filter w based on the above equation (9). In this example, the vertical filter h is calculated as “0.8396, 1.7351, 1.3524”, and the horizontal filter w ^T is calculated as “0.0824, 2.2864, 0.5569”. This is an example. The decomposition error in this case is “3.2591”.

図９のフローチャートのステップＳ１１では、水平フィルタ処理部２１が、このようにして得られた水平方向フィルタｗを部分信号に適用する。 In step S11 in the flowchart of FIG. 9, the horizontal filter processing unit 21 applies the horizontal filter w obtained in this manner to the partial signal.

次に、図９のフローチャートのステップＳ１２では、垂直フィルタ処理部２２が、軸統合処理された信号に、垂直方向フィルタｈを適用する。なお、正距円筒形式の垂直方向には幾何学的な歪みはないため、部分信号の軸統合処理後に垂直方向フィルタｈをそのまま適用しても誤差は発生しない。 Next, in step S12 of the flowchart of FIG. 9, the vertical filter processing unit 22 applies a vertical filter h to the signal subjected to the axis integration processing. Since there is no geometric distortion in the vertical direction of the equirectangular cylinder format, no error occurs even if the vertical filter h is applied as it is after the axis integration processing of the partial signals.

図１２は、第３の実施の形態の信号処理装置の信号処理の流れを模式的に示す図である。この図１２に示すように、まず、軸分解部１２が、正距円筒形式の入力信号（球面信号：図１２（ａ））を、垂直（θ）方向に伸縮処理及び分解処理する（図１２（ｂ））。水平フィルタ処理部２１は、分解処理された部分信号に水平方向フィルタｗを適用して水平フィルタ処理を行う。 FIG. 12 is a diagram schematically illustrating a flow of signal processing of the signal processing device according to the third embodiment. As shown in FIG. 12, first, the axis disassembling unit 12 expands and contracts the input signal (spherical signal: FIG. 12A) in the equirectangular format in the vertical (θ) direction (FIG. 12). (B)). The horizontal filter processing unit 21 performs a horizontal filter process by applying a horizontal filter w to the partial signal subjected to the decomposition process.

軸統合部１４は、水平フィルタ処理された部分信号を伸縮して統合し、元の正距離円筒形式の信号に戻すことで、統合信号を形成する。垂直フィルタ処理部２２は、統合信号に垂直方向フィルタｈを適用して垂直フィルタ処理（図１２（ｈ））を施して出力する（図１２（ｇ））。 The axis integrating unit 14 expands and contracts the partial signals subjected to the horizontal filter processing, integrates the partial signals, and returns the signals to the original equidistant cylindrical signal, thereby forming an integrated signal. The vertical filter processing unit 22 applies a vertical filter h to the integrated signal, performs a vertical filter process (FIG. 12H), and outputs the resultant signal (FIG. 12G).

（第３の実施の形態の効果）
このような第３の実施の形態の信号処理装置は、フィルタのカーネルを水平方向及び垂直方向に１次元分解して用いる。これにより、図１０のフローチャートのステップＳ２における軸分解処理を、１次元の信号への分解処理とすることができ、計算量の低減を図ることができる。また、フィルタのカーネルを水平方向及び垂直方向に１次元分解して用いることで、部分信号に幾何学的な歪みが発生する不都合を防止できる。 (Effect of Third Embodiment)
The signal processing apparatus according to the third embodiment uses the filter kernel one-dimensionally decomposed in the horizontal and vertical directions. Thereby, the axis disassembly processing in step S2 of the flowchart of FIG. 10 can be made into one-dimensional signal disassembly processing, and the amount of calculation can be reduced. Further, by using the filter kernel one-dimensionally decomposed in the horizontal direction and the vertical direction, it is possible to prevent the inconvenience of generating geometric distortion in the partial signal.

また、（９）式を用いて説明したように、フロベニウスノルムの二乗を最小化するように、フィルタの行列Ｇを垂直方向フィルタｈ及び水平方向フィルタｗに分解することで、二次元フィルタの近似誤差を極力低減することができる。なお、カーネル分解の基準は、（９）式以外であっても、Ｌ１ノルム（各次元の値の絶対値の和）を使ってロバスト化するなど、一般的な誤差関数を用いてもよい。 Further, as described using the equation (9), the filter matrix G is decomposed into a vertical filter h and a horizontal filter w so as to minimize the square of the Frobenius norm, thereby approximating the two-dimensional filter. Errors can be reduced as much as possible. The criterion of the kernel decomposition may be a formula other than the formula (9), or a general error function such as robustness using the L1 norm (sum of the absolute values of the values of each dimension) may be used.

また、画像信号におけるＲＧＢ（赤緑青）のように、複数のチャンネルを持つ信号に対するフィルタ処理については、「水平方向とチャンネル方向を合わせて一軸」、「垂直方向を一軸」、「水平方向を一軸」、又は「チャンネル方向と垂直方向を合わせて一軸」として、フィルタカーネルの行列Ｇを構成し、上述と同様の方法で軸分解を行えば良い。 Also, as for the filtering process for a signal having a plurality of channels, such as RGB (red, green, blue) in an image signal, "one axis in the horizontal and channel directions", "one axis in the vertical direction", "one axis in the horizontal direction" Or "one axis in which the channel direction and the vertical direction are combined" to form a matrix G of filter kernels and perform axis decomposition in the same manner as described above.

また、図１３に示すようにフィルタの行列Ｇを、複数の垂直方向フィルタ及び水平方向フィルタに分解してもよい。この場合、ＧＧ^Ｔ及びＧ^ＴＧの固有値を大きい順に所定個数取得し、その固有ベクトルをフィルタ係数とすれば良い。この図１３の例は、フィルタの行列Ｇを、ＲＧＢ（赤緑青）の各画像信号に対応させて垂直方向フィルタｈ_１，ｈ_２，ｈ_３、及び、水平方向フィルタｗ_１ ^Ｔ，ｗ_２ ^Ｔ，ｗ_３ ^Ｔに分解した例である。この例の場合、ｈ_１ｗ_１ ^Ｔ＋ｈ_２ｗ_２ ^Ｔの分解誤差は０．２７２３であるが、ｈ_１ｗ_１ ^Ｔ＋ｈ_２ｗ_２ ^Ｔ＋ｈ_３ｗ_３ ^Ｔとすることで、分解誤差を０．０とすることができる。 As shown in FIG. 13, the filter matrix G may be decomposed into a plurality of vertical filters and horizontal filters. In this case, the predetermined number obtaining the eigenvalues of GG ^T and G ^{T G} in descending order, may be the eigenvectors with the filter coefficients. In the example of FIG. 13, the matrix G of the filters is set to correspond to each of the RGB (red, green, blue) image signals, and the vertical filters h ₁ , h ₂ , h ₃ and the horizontal filters w ₁ ^T , w ₂ ^{T are used.} , W ₃ ^T. In this _example, the decomposition error of _{^{_{_{h 1 w 1 T + h 2}}}} w 2 T is _0.2723, With _{^{_{_{^{h 1 w 1 T + h 2}}}}} w 2 T + h 3 w 3 T, the decomposition error 0 .0.

このようにフィルタの行列Ｇを、複数の垂直方向フィルタ及び水平方向フィルタに分解することで、複数のチャンネルの信号に対するフィルタ処理を可能とすることができ、また、フィルタの分解誤差も低減することができる。 By decomposing the filter matrix G into a plurality of vertical filters and a plurality of horizontal filters in this manner, it is possible to perform filter processing on signals of a plurality of channels, and to reduce filter decomposition errors. Can be.

（第３の実施の形態の効果）
このように第３の実施の形態の信号処理装置は、フィルタを１次元分解することで、部分信号を１次元信号とすることができる。また、幾何学的な歪みの発生を防止でき、また、計算回数を低減できるため、高い近似精度で高速な球面フィルタ処理を可能とすることができる。特に、フィルタサイズが大きいほど、このような効果が顕著となる。 (Effect of Third Embodiment)
As described above, the signal processing device according to the third embodiment can convert a partial signal into a one-dimensional signal by performing one-dimensional decomposition of the filter. In addition, since generation of geometric distortion can be prevented and the number of calculations can be reduced, high-speed spherical filter processing with high approximation accuracy can be performed. In particular, such an effect becomes more conspicuous as the filter size is larger.

（第４の実施の形態）
次に、第４の実施の形態の信号処理装置の説明をする。第１〜第３の実施の形態の信号処理装置は、２次球面上の球面信号の信号処理を行う例であった。これに対して、第４の実施の形態の信号処理装置は、入力されたp次の球面信号に対して、フィルタ処理を施す例である。 (Fourth embodiment)
Next, a signal processing device according to a fourth embodiment will be described. The signal processing devices of the first to third embodiments are examples in which signal processing of a spherical signal on a secondary spherical surface is performed. On the other hand, the signal processing device according to the fourth embodiment is an example in which the input p-order spherical signal is subjected to filter processing.

ｐ次単位球面Ｓ^ｐとは、以下の（１０）式で示されるｐ＋１次元のユークリッド空間における単位球面である。 The p-order unit sphere S ^p, is the unit sphere in the following (10) p + 1-dimensional Euclidean space of Formula.

ｐ次球面信号ｆは、ｐ次単位球面Ｓ^ｐから信号値の集合Ｙへの写像である。集合Ｙは、多くの場合、実数或いは複素数の集合である。第４の実施の形態の信号処理装置では、離散的な球面信号である、ｐ次単位球面Ｓ^ｐ上の有限個の点に対して割り当てられている信号値に基づく信号処理を行う。球面信号の表現形式としては、一例として、Ｓ^２の正距円筒形式をＳ^ｐに拡張した形式を用いる。そのために、まず、Ｓ^ｐ上の点ｘをｐ個の角度変数θｉ｛ｉ＝１，２，・・・，ｐ｝を用いて、Ｓ^ｐ上の点の極座標表示を以下の（１１）式のように表現する。 The p-th spherical signal f is a mapping from the ^p-th unit spherical surface ^Sp to a set Y of signal values. The set Y is often a set of real numbers or complex numbers. The signal processing device according to the fourth embodiment performs signal processing based on signal values assigned to a finite number of points on a ^p-th unit spherical surface Sp, which is a discrete spherical signal. The representation of the spherical signal, as an example, using a format that extends Equirectangular format S ² to S ^p. Therefore, first, ^S the point x on the ^p p pieces of angle variable θi {i = 1,2, ···, p} by using the following (11) the polar coordinates of points on ^{S p} type Express as:

ただし、θ_ｉ∈［０２π］（ｉ＜ｐ）、θ_ｐ∈［０ π］である。この角度変数θ_ｉを等間隔に分解し、p次元の正方格子上の信号とする。この球面信号の形式をｐ次単位球面Ｓ^ｐの正距円筒形式と呼ぶことにする。 Here, θ _i ∈ [0 2π] (i <p) and θ _p ∈ [0 π]. The angle variable θ _i is decomposed at equal intervals to obtain a signal on a p-dimensional square lattice. It will be referred to as form of the spherical signal Equirectangular form of order p the unit sphere S ^p.

ここで、（１１）式をθ_ｉで偏微分してＬ２ノルム（各次元の値を２乗した和）を取ると、以下の（１２）式のようになる。 Here, when the expression (11) is partially differentiated with respect to θ _i and the L2 norm (the sum of squares of the values of each dimension) is obtained, the following expression (12) is obtained.

これは、正距円筒形式において、θ_ｉ方向では、長さが上式の比率で変化することを示している。 This is because, in equirectangular format, theta _i direction, shows that the length change in the ratio of the above equation.

第３の実施の形態の信号処理装置における、p次球面信号に対する処理例を説明する。第３の実施の形態の信号処理装置の構成としては、図２に示した第１の実施の形態の信号処理装置と同様であるが、軸分解部１２、平面フィルタ処理部１３及び軸統合部１４が異なる動作を示す。 An example of processing on a p-order spherical signal in the signal processing device according to the third embodiment will be described. The configuration of the signal processing device according to the third embodiment is the same as that of the signal processing device according to the first embodiment shown in FIG. 2, except that the axis decomposing unit 12, the plane filter processing unit 13, and the axis integrating unit 14 indicates a different operation.

すなわち、軸分解部１２は、ｐ次単位球面Ｓ^ｐの正距円筒形式のｐ個の軸に沿ってp次球面信号を分解する。まず、軸分解部１２は、θ_ｐの方向でp次球面信号を分解する。そして分解したp次球面信号を、｜ｓｉｎθ_ｐ｜の比率で縮小する。次に、軸分解部１２は、θ_ｐ−１の方向でp次球面信号を分解し、分解した信号を、｜ｓｉｎθ_ｐ−１｜の比率で縮小する。軸分解部１２は、このように軸を順次選択して分解し、これをθ_２軸の分解まで繰り返す。この分解により、上述の（１２）式の球面上と正距円筒形式での長さの比率（＝幾何学的な歪み）を補正できる。 That is, the shaft decomposition unit 12 decomposes the p-th order spherical signal along the p number of axes of equirectangular format p-th order unit sphere S ^p. First, the shaft decomposition unit 12 decomposes the p-th order spherical signal in the direction of theta _p. Then, the decomposed p-order spherical signal is reduced at a ratio of | sin θ _p |. Next, the axis decomposing unit 12 decomposes the p-th spherical signal in the direction of θ _p −1, and reduces the decomposed signal at a ratio of | sin θ _p−1 |. Shaft decomposition section 12, thus to decompose sequentially selects axis, repeating this until decomposition of theta ₂ axes. By this decomposition, it is possible to correct the ratio of the length on the spherical surface and the equirectangular form (= geometric distortion) in the above equation (12).

なお、信号の分解は、θ_ｉ軸であれば、θ_ｉ軸方向のフィルタのカーネルサイズの幅を持たせて分解する。カーネルサイズが１でなければ重複領域が存在し、その領域で僅かな歪みが発生するが、カーネル幅が小さければ高い近似精度を得ることができる。 If the signal is the θ _i axis, the signal is decomposed by giving a filter kernel width in the θ _i axis direction. If the kernel size is not 1, an overlapping area exists and a slight distortion occurs in that area. However, if the kernel width is small, high approximation accuracy can be obtained.

次に、平面フィルタ処理部１３は、上述のように分解された部分信号に対してp次元のカーネルのフィルタを適用する。軸統合部１４は、軸分解部１２とは逆の処理で、θ_ｉ軸に対して信号を統合して｜１／ｓｉｎθ_ｉ｜倍に拡大する処理を、元のp次元まで繰り返す。 Next, the plane filter processing unit 13 applies a p-dimensional kernel filter to the partial signal decomposed as described above. The axis integrating unit 14 repeats the processing of integrating the signals with respect to the θ _i axis and enlarging the signal to | 1 / sin θ _i | times to the original p dimension in a process opposite to that of the axis disassembling unit 12.

（３次単位球面の処理例）
図１４は、３次（ｐ＝３）の単位球面に対する一連の処理の流れを示す模式図である。軸分解部１２は、図１４（ａ）に示す３次単位球面Ｓ^３の正距円筒形式の３個の軸θ_１〜θ_３に沿って３次球面信号を分解する。まず、軸分解部１２は、θ_３の方向で３次球面信号を分解する。そして、軸分解部１２は、分解した３次球面信号を、図１４（ｂ）に示すように｜ｓｉｎθ_３｜の比率で縮小する。 (Example of processing of cubic unit spherical surface)
FIG. 14 is a schematic diagram showing a flow of a series of processes for a third-order (p = 3) unit spherical surface. Shaft decomposition unit 12 decomposes the third-order spherical signal along the three axes theta ₁ through? ₃ of equirectangular format cubic unit sphere S ³ shown in FIG. 14 (a). First, the shaft decomposition unit 12 decomposes the third-order spherical signal in the direction of the theta _3. Then, the axis disassembling unit 12 reduces the decomposed tertiary sphere signal at a ratio of | sin θ ₃ | as shown in FIG.

次に、軸分解部１２は、図１４（ｃ）に示すようにθ_２方向（＝θ_３−１方向）で３次球面信号を分解し、分解した信号を、｜ｓｉｎθ_２｜の比率で縮小する。軸分解部１２は、このように軸を順次選択して分解し、これをθ_２軸の分解まで繰り返す。上述のように、この分解処理により、（１２）式の球面上と正距円筒形式での長さの比率（＝幾何学的な歪み）を補正できる。 Next, the shaft decomposition unit 12 decomposes the third order spherical signal theta ₂ direction (= theta ₃ -1 direction) as shown in FIG. 14 (c), the decomposed signals, | sin [theta ₂ | in a ratio to shrink. Shaft decomposition section 12, thus to decompose sequentially selects axis, repeating this until decomposition of theta ₂ axes. As described above, by this decomposition processing, the ratio of the length on the spherical surface and the equirectangular form (= geometric distortion) in equation (12) can be corrected.

次に、図１４（ｄ）に示すように平面フィルタ処理部１３は、上述のように分解された部分信号に対して３次元のカーネルのフィルタを適用してフィルタ処理を行う。軸統合部１４は、軸分解部１２とは逆の処理で、図１４（ｅ）に示すように、θ_２軸に対して信号を統合して｜１／ｓｉｎθ_２｜倍に拡大する処理を、元の３次元まで繰り返す（θ_３方向で｜１／ｓｉｎθ_３｜の比率で拡大及び統合）。 Next, as shown in FIG. 14D, the plane filter processing unit 13 performs a filter process by applying a three-dimensional kernel filter to the partial signal decomposed as described above. Axis Integrated unit 14 is a reverse process to the shaft decomposition section 12, as shown in FIG. 14 (e), integrates the signal to theta ₂ axes | a processing for enlarging doubled _| 1 / sin [theta ₂ To the original three dimensions (enlargement and integration at the ratio of | 1 / sin θ ₃ | in the θ ₃ direction).

（第４の実施の形態の効果）
このような第４の実施の形態の信号処理装置は、ｐ次の球面信号のフィルタ処理を少ない計算量、及び、高い近似精度で実現することができる。なお、上述の第１の実施の形態に基づいて、この第４の実施の形態を行ったが、第２の実施の形態で説明したインテグラルイメージ処理も単位球面Ｓ^ｐに適用可能である。第２の実施の形態で説明したインテグラルイメージ処理を単位球面Ｓ^ｐに適用する場合、ｐ次元のインテグラルイメージへの拡張が必要となる。これは（１）式の代わりに、以下の漸化式である（１３）式に基づいてインテグラルイメージを作成すれば良い。 (Effect of Fourth Embodiment)
The signal processing device according to the fourth embodiment can realize the filtering of the p-order spherical signal with a small amount of calculation and high approximation accuracy. Incidentally, based on the first embodiment described above, were subjected to this fourth embodiment is applicable integral image processing in the unit sphere S ^p described in the second embodiment. When applying the integral image processing described in the second embodiment in the unit sphere S ^p, extension to p-dimensional integral image is required. This can be achieved by creating an integral image based on the following recurrence formula (13) instead of the formula (1).

また、第３の実施の形態で説明したフィルタ分解処理も単位球面Ｓ^ｐに適用可能である。第３の実施の形態で説明したフィルタ分解処理も単位球面Ｓ^ｐに適用する場合、例えば、ｐ階テンソルのＣＰ分解アルゴリズム等を用いて、ｐ次元のフィルタをｐ個の１次元フィルタに分解する。そして、最終的に分解された信号に対して、θ_ｉ方向のフィルタ処理を行い、その後、θ_ｉ軸に統合する度にθ_ｉ方向の１次元フィルタを適用する。 It is also applicable to the unit sphere S ^p filtering decomposition treatment also described in the third embodiment. When applied to a filter decomposition treatment also the unit sphere S ^p described in the third embodiment decomposed, for example, by using the CP decomposition algorithm p floor tensor like, the p-dimensional filter p number of one-dimensional filter . Then, on the final degraded signal, performs theta _i direction filtering, then applying the one-dimensional filter in the theta _i direction every time the integrated theta _i axis.

（第５の実施の形態）
次に、第５の実施の形態の信号処理装置の説明をする。この第５の実施の形態の信号処理装置は、入力された正距円筒形式の球面信号に対する多段階の球面フィルタ処理を、畳み込みニューラルネットワークの畳み込み層として用いる例である。 (Fifth embodiment)
Next, a signal processing device according to a fifth embodiment will be described. The signal processing device according to the fifth embodiment is an example in which a multi-stage spherical filter process on an input equirectangular spherical signal is used as a convolutional layer of a convolutional neural network.

（第５の実施の形態の構成）
図１５に、第５の実施の形態の信号処理部１の構成図を示す。この図１５に示すように、第５の実施の形態の信号処理部１は、上述の信号取得部１１と共に、球面フィルタ部３１、活性化関数適用部３２、出力部３３及び学習部３４を有する。 (Configuration of Fifth Embodiment)
FIG. 15 shows a configuration diagram of a signal processing unit 1 according to the fifth embodiment. As shown in FIG. 15, the signal processing unit 1 according to the fifth embodiment includes a spherical filter unit 31, an activation function application unit 32, an output unit 33, and a learning unit 34, in addition to the signal acquisition unit 11 described above. .

信号取得部１１は、記憶部２等から処理対象となる球面信号を取得する。球面フィルタ部３１は、第１の実施の形態〜第４の実施の形態で説明した、いずれかの球面フィルタ処理を行う。活性化関数適用部３２は、球面フィルタ部３１の処理結果に活性化関数を適用する。活性化関数としては、例えばシグモイド関数、双曲線関数、ＲｅＬＵ（Rectified Linear Unit：ランプ関数）等、ニューラルネットワークで一般的に用いられる活性化関数を用いることができる。 The signal acquisition unit 11 acquires a spherical signal to be processed from the storage unit 2 or the like. The spherical filter unit 31 performs any one of the spherical filter processes described in the first to fourth embodiments. The activation function application unit 32 applies an activation function to the processing result of the spherical filter unit 31. As the activation function, for example, an activation function generally used in a neural network, such as a sigmoid function, a hyperbolic function, and a ReLU (Rectified Linear Unit), can be used.

球面フィルタ処理と活性化関数の適用は特定回数繰り返す。これにより、単体の球面フィルタ処理では実現困難な非線形の変換処理を行うことができる。なお、例えばプーリング層又は正規化層等のように、一般的にニューラルネットワークで用いられる層を追加してもよい。 The spherical filtering and the application of the activation function are repeated a specific number of times. This makes it possible to perform a non-linear conversion process that is difficult to achieve with a single spherical filter process. Note that a layer generally used in a neural network, such as a pooling layer or a normalization layer, may be added.

出力部３３は、ニューラルネットワークで処理された信号を出力する。学習部３４は、入力信号と出力信号の複数の組に対して特定の損失関数を最小化するように、ニューラルネットワークのパラメータを調整する。このパラメータには、球面フィルタのフィルタ係数が含まれており、例えば確率的勾配降下法等の、一般的なニューラルネットワークの学習法を用いて値を求めることができる。 The output unit 33 outputs a signal processed by the neural network. The learning unit 34 adjusts the parameters of the neural network so as to minimize a specific loss function for a plurality of pairs of the input signal and the output signal. These parameters include the filter coefficients of the spherical filter, and the values can be obtained by using a general neural network learning method such as a stochastic gradient descent method.

（第５の実施の形態の効果）
このように、第５の実施の形態の信号処理装置は、球面フィルタ処理を用いた球面信号用のニューラルネットワークを実現することができる。 (Effect of Fifth Embodiment)
As described above, the signal processing device according to the fifth embodiment can realize a neural network for spherical signals using spherical filter processing.

（実施の形態の効果のまとめ）
上述の各実施の形態の効果を、以下に示す。 (Summary of effects of the embodiment)
The effects of each of the above embodiments will be described below.

まず、極座標の１つの軸方向について、球面上の距離を反映して伸縮し、複数の部分信号に分解する軸分解し、フィルタ処理を行う。これにより、球面信号に対するフィルタ処理を、少ない計算量、かつ、高い近似精度で実現できる。また、フィルタサイズの制限を受けることなく演算を行うことがき、また、アーティファクトの発生も抑制することができる。 First, in one axial direction of the polar coordinates, expansion and contraction are performed by reflecting the distance on the spherical surface, and the axis is decomposed into a plurality of partial signals, and filtering is performed. As a result, the filtering process on the spherical signal can be realized with a small amount of calculation and high approximation accuracy. In addition, the calculation can be performed without being limited by the filter size, and the occurrence of artifacts can be suppressed.

また、分解する方向に対して対応するフィルタのカーネルの幅を持たせて分解することで、処理の簡略性を保ったまま、球面上のフィルタ処理を高い近似精度で実現することができる。 In addition, by performing the decomposition while giving the width of the kernel of the filter corresponding to the decomposition direction, it is possible to realize the filter processing on the spherical surface with high approximation accuracy while maintaining the simplicity of the processing.

また、インテグラルイメージを用いて球面上の距離を反映した伸縮を行うことで、伸縮処理の高速化を行うことができる。 In addition, by performing the expansion and contraction reflecting the distance on the spherical surface using the integral image, the speed of the expansion and contraction processing can be increased.

また、フィルタのカーネルを極座標の各軸方向に沿って１次元分解し、軸統合処理の後に、統合した軸方向の１次元フィルタを適用する。これにより、部分信号のメモリ使用量ｗを低減できる。また、局所的な信号の幾何変形の影響をフィルタ処理が受けなくなるため、少ない計算量で近似精度の高い球面フィルタ処理を行うことができる。 In addition, the filter kernel is one-dimensionally decomposed along each axis direction of the polar coordinates, and after the axis integration processing, the integrated one-dimensional filter in the axial direction is applied. Thereby, the memory usage w of the partial signal can be reduced. In addition, since the filtering process is not affected by the local geometric deformation of the signal, the spherical filtering process with high approximation accuracy can be performed with a small amount of calculation.

また、ｐ次元の球面信号に対して軸分解ステップと軸統合ステップをｐ−１回ずつ順次繰り返すことで、ｐ次元の球面信号に対するフィルタ処理を少ない計算量かつ高い近似精度で実現することができる。 Further, by sequentially repeating the axis resolution step and the axis integration step p-1 times for the p-dimensional spherical signal, it is possible to realize the filtering process for the p-dimensional spherical signal with a small amount of calculation and high approximation accuracy. .

また、球面信号フィルタ処理を、畳み込みニューラルネットワークの畳み込み層として用いることで、球面信号を入力とする畳み込みニューラルネットワークを少ない計算量かつ高い近似精度で実現することができる。 Further, by using the spherical signal filter processing as a convolution layer of a convolutional neural network, a convolutional neural network that receives a spherical signal as an input can be realized with a small amount of calculation and high approximation accuracy.

最後に、上述の各実施の形態は、一例として提示したものであり、本発明の範囲を限定することは意図していない。これら新規な実施の形態は、その他の様々な形態で実施されることが可能であり、発明の要旨を逸脱しない範囲で、種々の省略、置き換え、変更を行うことも可能である。また、各実施の形態及び各実施の形態の変形は、発明の範囲や要旨に含まれると共に、特許請求の範囲に記載された発明とその均等の範囲に含まれるものである。 Finally, the above-described embodiments have been presented by way of example, and are not intended to limit the scope of the present invention. These new embodiments can be implemented in other various forms, and various omissions, replacements, and changes can be made without departing from the spirit of the invention. Further, each embodiment and modifications of each embodiment are included in the scope and spirit of the invention, and are also included in the invention described in the claims and equivalents thereof.

例えば、本発明は、情報処理技術分野における通常の知識を有した技術者であれば、ＡＳＩＣ（Application Specific Integrated Circuits）や、従来の回路モジュールを接続して構成した装置によって実施することが可能である。 For example, the present invention can be implemented by an ASIC (Application Specific Integrated Circuits) or an apparatus configured by connecting conventional circuit modules to a technician having ordinary knowledge in the information processing technology field. is there.

また、上述の各実施の形態に記載された各機能は、それぞれ一又は複数の処理回路（Circuit）によって実現することが可能である。なお、「処理回路」とは、ソフトウェアによって各機能を実行するようプログラムされたプロセッサ、各機能を実行するよう設計されたＡＳＩＣ、及び、回路モジュール等のハードウェアを含むものである。 Further, each function described in each of the above-described embodiments can be realized by one or a plurality of processing circuits (Circuit). The “processing circuit” includes a processor programmed to execute each function by software, an ASIC designed to execute each function, and hardware such as a circuit module.

１信号処理装置
２記憶部
１１信号取得部
１２軸分解部
１３平面フィルタ処理部
１４軸統合部
１５出力部
２１水平フィルタ処理部
２２垂直フィルタ処理部
３１球面フィルタ部
３２活性化関数適用部
３３出力部
３４学習部 DESCRIPTION OF SYMBOLS 1 Signal processing apparatus 2 Storage part 11 Signal acquisition part 12 Axis decomposition part 13 Planar filter processing part 14 Axis integration part 15 Output part 21 Horizontal filter processing part 22 Vertical filter processing part 31 Spherical filter part 32 Activation function application part 33 Output part 34 Learning Department

特許第５７３４３２７号公報Japanese Patent No. 5734327 特許第６０６７９３４号公報Japanese Patent No. 6067934 特開２０１７−２０７９６０号公報JP 2017-207960 A 特開２０１６−１０３０８９号公報JP-A-2006-103089

T.C.Cohen,et al.,「Spherical CNNs.」arXiv:1801.10130,2018.T.C.Cohen, et al., `` Spherical CNNs. '' ArXiv: 1801.10130, 2018. C.Esteves,et al.,「3D object classification and retrieval with Spherical CNNs.」arXiv:1711.06721,2017.C. Esteves, et al., `` 3D object classification and retrieval with Spherical CNNs. '' ArXiv: 1711.06721,2017. W.Boomsma,et al.,「Spherical convolutions and their application in molecular modelling」,Advances in Neural Information Processing Systems,2017.W. Boomsma, et al., `` Spherical convolutions and their application in molecular modeling '', Advances in Neural Information Processing Systems, 2017. Y.C.Su,et al.,「Learning spherical convolution for fast features from 360 imagery」,Advances in Neural Information Processing Systems,2017.Y.C.Su, et al., `` Learning spherical convolution for fast features from 360 imagery '', Advances in Neural Information Processing Systems, 2017. P.Viola and M.Jones,「Rapid object detection using a boosted cascade of simple features」,in IEEE Computer Society Conference on Computer Vision and Pattern Recognition,2001,vol.1.P. Viola and M. Jones, `` Rapid object detection using a boosted cascade of simple features '', in IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2001, vol.1.

Claims

A signal processing device that performs a filtering process on a spherical signal whose value is assigned to a point on a spherical surface,
An axis disassembly unit that performs an axis disassembly process that expands and contracts the spherical signal in one axial direction of polar coordinates by reflecting a distance on the spherical surface and decomposes the signal into a plurality of partial signals;
A filter processing unit that performs filter processing on the partial signal,
An axis integration unit that performs axis integration processing of integrating the filtered partial signals by expanding and contracting to reflect the distance on the spherical surface,
A signal processing device comprising:

The signal processing device according to claim 1, wherein the axis decomposing unit decomposes the image by giving a kernel width of a filter corresponding to a decomposing direction.

The signal processing device according to claim 1, wherein the axis disassembling unit performs expansion and contraction reflecting a distance on a spherical surface using an integral image.

The method according to claim 1, further comprising a one-dimensional filter processing unit that performs one-dimensional decomposition of the filter kernel along each axis direction of the polar coordinate, and applies a one-dimensional filter in the integrated axial direction after the axis integration unit. Item 2. The signal processing device according to item 1.

The axis decomposing unit, the filter processing unit, and the axis integrating unit perform the axis decomposing process and the filtering on a spherical signal in which a value is assigned to a point on a p-dimensional (p is an arbitrary natural number) hypersphere. The signal processing device according to any one of claims 1 to 4, wherein the processing and the axis integration processing are sequentially repeated p-1 times.

A convolutional neural network using the signal processing device according to any one of claims 1 to 5 as a convolutional layer.

A signal processing method of a signal processing device that performs a filtering process on a spherical signal whose value is assigned to a point on a spherical surface,
An axis disassembling step of performing an axis disassembly process of expanding and contracting the spherical signal in one axial direction of polar coordinates by reflecting a distance on the spherical surface and decomposing the spherical signal into a plurality of partial signals;
A filtering step of performing a filtering process on the partial signal,
An axis integration step of performing an axis integration process of integrating the filter-processed partial signal by expanding and contracting by reflecting a distance on a spherical surface,
A signal processing method comprising:

A signal processing program for causing a computer to perform a filtering process on a spherical signal whose value is assigned to a point on a spherical surface,
Said computer,
An axis disassembly unit that performs an axis disassembly process that expands and contracts the spherical signal in one axial direction of polar coordinates by reflecting a distance on the spherical surface and decomposes the signal into a plurality of partial signals;
A filter processing unit that performs filter processing on the partial signal,
The filtered partial signal, functioning as an axis integration unit that performs axis integration processing for integrating by expanding and contracting to reflect the distance on the spherical surface,
A signal processing program characterized by the following.