JP4281419B2

JP4281419B2 - Predistortion-type nonlinear distortion compensator

Info

Publication number: JP4281419B2
Application number: JP2003157249A
Authority: JP
Inventors: 満平川
Original assignee: Sumitomo Electric Industries Ltd
Current assignee: Sumitomo Electric Industries Ltd
Priority date: 2003-06-02
Filing date: 2003-06-02
Publication date: 2009-06-17
Anticipated expiration: 2023-06-02
Also published as: JP2004363733A

Description

【０００１】
【発明の属する技術分野】
本発明は、前置型非線形歪補償器に関し、特に、サポートベクトルマシン（以下「ＳＶＭ」という）を回帰分析に用いた前置型非線形歪補償器に関する。
【０００２】
【従来の技術】
近年、通信における非線形デバイス（以下「ＮＬＤ」という）の非線形歪を補償する技術として、前置型非線形歪補償器が注目されている。代表的な前置型非線形歪補償器として、ルックアップテーブルを用いたものと、ＮＬＤの逆特性を推定するものがある。ここで、ＮＬＤの典型的なものとしては、たとえば、増幅器が挙げられる。
【０００３】
ルックアップテーブルを用いた前置型非線形歪補償器は、増幅器の特性を入力信号の振幅値に対して微小区間に分け、その区間の代表値をテーブルに格納しておき、送信信号とそれに対応するテーブルの補償値を乗算して出力信号とするものである。また、前置型非線形歪補償器入力とＮＬＤ出力を比較し、その結果から適応的にテーブル内の補償値を更新する前置型非線形歪補償器も既に開発されている。
【０００４】
一方、増幅器の逆関数を推定する前置型非線形歪補償器として、ｐｔｈ−ｏｒｄｅｒ−ｐｒｅｄｉｓｔｏｒｔｅｒと呼ばれるものがある。この前置型非線形歪補償器は、信号変換器の逆関数を複素多項式近似するものである。複素多項式は、非線形関数の表現形式として最も一般的なボルテラ級数より導出され、歪補償が行なわれる。また、前置型非線形歪補償器入力とＮＬＤ出力を比較し、その結果から適応的に複素多項式係数を更新する前置型非線形歪補償器も既に開発されている。
【０００５】
しかし、ルックアップテーブルを用いた前置型非線形歪補償器では、テーブル内の値は離散値をとるため、値の不連続性により、フロアノイズが上昇する。フロアノイズを解消するためには、データ数を増やす必要があるが、データ数を増やすとメモリ容量が多くなるという問題がある。
【０００６】
ｐｔｈ−ｏｒｄｅｒ−ｐｒｅｄｉｓｔｏｒｔｅｒでは、計算精度を上げるために複素多項式の次数を増やす必要がある。このため、計算時間がかかり、解が収束しにくいという問題がある。
【０００７】
さらに、広帯域信号に対しては、ルックアップテーブル、ｐｔｈ−ｏｒｄｅｒ−ｐｒｅｄｉｓｔｏｒｔｅｒともに、信号帯域内の各周波数ごとに振幅、位相の非線形性を補償する必要があるため、ハードウェア化が困難であるという問題がある。
【０００８】
ところで、近年、入出力観測データからプロセスのモデリングを行うという問題は、非線形回帰問題やシステム同定などの多くの分野において研究の目的となっている。上記前置型非線形歪補償器の最適な特性を得るという問題も、ひずみを補償する対象となる増幅器を未知の非線形システムとみなせば、この補償する対象となる増幅器の特性の逆関数のモデリングの問題に帰着される。
【０００９】
このようなプロセスモデリングの問題に取り組む方法は、大きく分けてグローバルモデリングとローカルモデリングの２つに分けることができる。
【００１０】
ローカルモデリングには、重み平均法、局所重み回帰法や、データベース学習であるJust-in-time法などがある。
【００１１】
これに対して、グローバルモデリングは、データベースの全てに適合する一つの関数を作る手法である。このような関数の作成は、ニューラルネットワークモデリングや非線型統計回帰の他の方法で使われている手法である。
【００１２】
このようなグローバルモデリングに対する統計的学習理論は、１９６０年代から存在していたが、実用的なツールとして用いられるようになったのは、１９９０年代半ば以降にV.N.Vapnik等によって、サポートベクトルマシンが提案されたことを契機とする（たとえば、非特許文献１〜３を参照）。
【００１３】
このようなサポートベクトルマシンを用いた統計的学習理論によるプロセスモデリングには、一般的に、以下のような特徴がある。
【００１４】
（１）他の手法に比べると、収束速度に勝る。
（２）常に唯一の解を得ることができる。
【００１５】
（３）さまざまな学習問題に容易に適用できる。
（４）問題の複雑さを問題の次元とは独立であるサポートベクトルの個数によって特徴づけているために、いわゆる「次元の呪い（curse of dimensionality)」から逃れることができる。
【００１６】
さらに、上述したサポートベクトルマシンは、回帰問題の学習に用いることのできるサポートベクトル回帰（Support Vector Regression（以下、「SVR」と呼ぶ））に拡張されている。このようなＳＶＲについては、非特許文献４に開示がある。
【００１７】
さらに、このようなサポートベクトルマシンを上述した前置型非線形歪補償器に適用する試みが、非特許文献５に開示されている。
【００１８】
【非特許文献１】
V.N.Vapnik : The Nature of Statistical Learning Theory, Springer-Verlag, New York(1995)
【００１９】
【非特許文献２】
V.N.Vapnik : Statistical Learning Theory, J.Wiley & Sons, New York(1998)
【００２０】
【非特許文献３】
B.Schölkopf et al. ed., "Advances in Kernel Methods", The MIT Press, 1999.
【００２１】
【非特許文献４】
N.Cristianini, J.Shawe-Taylor "An Introduction to Support Vector Machines", Cambridge University Press, 2000.
【００２２】
【非特許文献５】
伊藤隆徳，大森浩充，佐野昭Ｂ−５−１２４「サポートベクトル法を用いた適応非線形歪補償」２００１年電子情報通信学会総合大会予稿集、ｐ．５２２，２００１年
【００２３】
【発明が解決しようとする課題】
しかしながら、サポートベクトルマシンをデジタル信号受信装置の前置型非線形歪補償器に用いる場合には、以下のような点を考慮する必要がある。
【００２４】
ここで、通信に利用されるＮＬＤのうち、最も伝送特性に影響を及ぼすのは増幅器である。
【００２５】
図２０は、一般的な増幅器の入出力特性を示す図である。
図２０に示すように、入力信号が小さい領域では線形的に動作するが、入力信号が大きくなるにつれて出力信号は飽和する。
【００２６】
通常は、このような増幅器の非線形歪による影響を軽減するために、増幅器に対してマージンを設定し、その分だけ増幅器への入力レベルを下げて動作させる。このマージンを「バックオフ」と呼ぶ。
【００２７】
所要バックオフの大きさは変調方式に依存する。たとえば、変調方式がπ／4シフトＱＰＳＫ（Quadrature Phase Shift Keying）であれば、振幅変動が小さいのでバックオフは小さくてもよい。しかしながら、たとえば変調方式が直交周波数分割多重（以下「ＯＦＤＭ（Orthogonal Frequency Division Multiplexing）」という）であれば、振幅変動が大きいのでバックオフを大きく設定する必要がある。
【００２８】
ところが、バックオフを大きく設定すると、増幅器の電力効率が大幅に劣化してしまう。そこで、ＯＦＤＭ信号のような振幅変動が大きい信号に対しても、なるべく小さいバックオフで動作させるのが望ましい。
【００２９】
しかしながら、バックオフを小さく設定すると、ＳＶＲのようなグローバルモデリングを用いた前置型非線形歪補償器では、補償できないほどに過剰に大きな歪を有する信号が、前置型非線形歪補償器に入力されることになる。
【００３０】
ここで、増幅器の逆特性、すなわち理想的に動作する前置型非線形歪補償器の特性は、図２０に示すとおりである。ここで、前置型非線形歪補償器の出力は、入力が大きい場合に非常に大きくなってしまう。これは同時に、また増幅器への入力自体が非常に大きくなることを意味する。
【００３１】
ただし、このように大きな入力が仮に増幅器に入力されても、増幅器の特性自体が飽和しているので、もはや、前置型非線形歪補償器では、補償を行なうことができない。この場合、特に、ＳＶＲのように補償動作を学習するようなシステムでは、有効な学習ができなくなるという問題が生じる。
【００３２】
本発明は、上述の課題を解決するためになされたもので、その目的は、振幅変動が大きい信号に対してもなるべく小さいバックオフで動作可能であり、なおかつ高精度で歪補償を可能とする前置型非線形歪補償器を提供することである。
【００３３】
【課題を解決するための手段】
本発明のある局面に従う前置型非線形歪補償器は、信号変換器の逆特性を用いて、入力信号を変換し、変換後の信号を信号変換器に供給する変換用サポートベクトルマシンと、信号変換器で変換された変換出力信号を受け、変換出力信号列により構成される信号ベクトルと、過去に抽出された変換出力信号列により構成される信号ベクトルとに基づいて、変換用サポートベクトルマシンにおける変換後の入力信号を、第１の不感帯を有する損失関数に基づいて推定する推定用サポートベクトルマシンと、変換用サポートベクトルマシンおよび推定用サポートベクトルマシンに接続され、変換用サポートベクトルマシンの出力および推定用サポートベクトルマシンの出力に基づいて、変換用サポートベクトルマシンおよび推定用サポートベクトルマシンの係数を更新するための係数更新手段と、入力信号としきい値とを比較し、入力信号がしきい値よりも大きい場合、当該入力信号を含むデータセットをサポートベクトルとして採用しないようにするデータ選定手段とを備える。
【００３４】
好ましくは、係数更新手段は、変換出力信号を受け、新たに受けた変換出力信号の信号列により構成されるベクトルであって、かつ、第２の不感帯の外にあるベクトルをサポートベクトルの候補として抽出し、既にサポートベクトルとして抽出されているベクトルの集合に加え、前記ベクトルの集合を新たにサポートベクトルの集合とみなし、前記ベクトルの集合の中から、前記第１の不感帯の外にあるベクトルをサポートベクトルとして再抽出する抽出手段と、前記抽出されたサポートベクトルに対する係数を計算する計算手段とを有する。
【００３５】
好ましくは、前記しきい値は、所定の定数である。
好ましくは、前記データ選定手段は、過去の所定数の入力信号の絶対値に基づいて、しきい値を設定する。
【００３６】
好ましくは、データ選定手段は、所定の期間内の過去の所定数の入力信号の絶対値に基づいて、しきい値を更新する。
【００３７】
好ましくは、信号変換器は、増幅器である。
【００３８】
【発明の実施の形態】
以下、図面を参照しつつ本発明の実施の形態について詳細に説明する。以下の説明では、同一の構成部分には同一の符号を付してあり、それらの名称および機能も同じである。したがって、それらについての詳細な説明は繰り返さない。
【００３９】
［サポートベクトルマシンによる回帰問題の学習の一般論］
以下、本発明の説明を行うにあたり、その前提となるサポートベクトルマシンによる回帰問題の学習について、まとめておく。
【００４０】
１−１．線形回帰問題の学習
１−１−１線形回帰問題
まず、サポートベクトルマシン（ＳＶＭ）の考えに基づき、サポートベクトル回帰（Support Vector Regression : SVR）の立場から線形回帰問題を解く方法について述べる。この問題は、以下のように定式化できる。
【００４１】
［問題］
ｍ入力１出力の未知線形回帰ｆ：ｘ→y があり、トレーニングデータ
（x₁，y₁），…，（x_n，y_n） x_i∈R^m, y_i∈R （1．1）
が与えられているとする。このとき、トレーニングデータを用いてこの線形回帰を近似する関数
f（x，w）＝w^Tx （1．2）
を求める。
【００４２】
１−１−２損失関数
ここで、線形ε不感損失関数（Linear ε-Insensitive Loss Function）Ｌε（x,y,f(x,w)）ならびに二次ε不感損失関数（Quadratic ε-Insensitive Loss Function）Ｌ₂ε（x,y,f(x,w)）を次のように定義する。
【００４３】
【数１】

図１は、このような線形ε不感損失関数および二次ε不感損失関数を示す図である。図１（ａ）は、線形ε不感損失関数であり、図１（ｂ）は、二次ε不感損失関数を示す。
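The two loss functions can be sketched in a few lines of Python. The defining equation itself is not reproduced in this text, so the forms below are the standard textbook definitions of the linear and quadratic ε-insensitive losses, taken here as an assumption. The function names are illustrative only.

```python
def linear_eps_loss(y, f, eps):
    """Linear epsilon-insensitive loss: zero inside the dead zone |y - f| <= eps,
    growing linearly with the excess error outside it (assumed standard form)."""
    return max(0.0, abs(y - f) - eps)


def quadratic_eps_loss(y, f, eps):
    """Quadratic epsilon-insensitive loss: the square of the linear loss."""
    return linear_eps_loss(y, f, eps) ** 2


# A point inside the dead zone contributes no loss at all.
inside = quadratic_eps_loss(y=1.0, f=0.95, eps=0.1)
# A point outside the dead zone is penalised by the squared excess error.
outside = quadratic_eps_loss(y=1.0, f=0.5, eps=0.1)
```

Both losses ignore errors smaller than ε, which is why data inside the dead zone do not become support vectors.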
【００４４】
１−１−３二次ε不感損失関数による回帰問題
以下では、二次ε不感損失を用いた場合のサポートベクトル回帰について考える。サポートベクトルマシンに基づいた主問題は、以下のように表現できる。
【００４５】
【数２】

この主問題は、次のように書き換えることができる。
【００４６】
【数３】

ただし、Ｃはモデルの複雑さと制約条件の緩和（トレーニング誤差）との間のトレードオフを設定する正則化パラメータである。
【００４７】
【数４】

式（1.10）に対するKarush-Kuhn-Tucker(KKT)条件により、以下の関係が成り立つ。
【００４８】
【数５】

これらの関係式を式（1.10）に代入すると次の双対問題に書き換えられる。
【００４９】
【数６】

ここで、係数αiを以下のように置き換えると、式（1.14）は、最終的に以下の式（1.15）のように書き換えることができる。また、式（1.11）により、線形回帰を近似する関数ｆ（ｘ、ｗ）＝ｗ^Tｘは、以下の式（1.16）のようになる。
【００５０】
【数７】

図２は、線形近似関数とサポートベクトルとの関係を示す概念図である。
【００５１】
図２において、「●」がサポートベクトルであり、これらの近似関数による誤差の大きさはεよりも大きい。「○」は近似関数による誤差の大きさがεよりも小さいデータであり、これらは直接近似関数には影響しない。
【００５２】
１−２非線形回帰問題の学習
１−２−１非線形回帰問題
以上の説明では、サポートベクトル回帰（ＳＶＲ）を用いた線形回帰について説明した。以下では、それを応用したSVRを用いた非線形回帰について説明する。この非線形回帰問題は、以下のように定式化できる。
［問題］
ｍ入力１出力の未知非線形回帰ｆ：ｘ→y があり、トレーニングデータ
（x₁，y₁），…，（x_n，y_n） x_i∈R^m, y_i∈R （1．17）
が与えられているとする。このとき、トレーニングデータを用いてこの非線形回帰を近似する関数ｆ（ｘ，ｗ）＝ｗ^Tｘを求める。ここでは、損失関数として、二次ε不感損失関数を用いることにする。
【００５３】
１−２−２特徴空間
一般的に複雑な問題に応用するには、入力空間内で考えるよりも表現能力が優れている空間で考える必要がある。このような手法は高次元化と呼ばれている。
【００５４】
ここで、非線形回帰問題に対応するために、入力空間から高次元特徴空間への非線形写像を導入する。入力ベクトルｘを入力空間Ｘから特徴空間Ｆヘ写像し、その特徴空間でのベクトルφ（ｘ）をこれまでの入力ベクトルｘとみなすことにより入力空間ではなく特徴空間で線形回帰問題を取り扱う。ただし、φ：Ｘ→Ｆは、入力空間からある特徴空間への非線形写像である。
【００５５】
この作業は、実質的には元の入力空間において非線形回帰問題を解いていることになる。
【００５６】
ヒルベルトシュミット理論によるとヒルベルト空間における内積は次のような等価表現ができる。
【００５７】
【数８】

以下に示すとおり、特徴空間で線形問題を考えると、式（1.21）となり、また、線形回帰を近似する関数ｆ（φ（ｘ），ｗ）＝ｗ^Tφ（ｘ）は、式（1.22）のようになる。この式（1.22）は、入力空間における線形回帰問題となっており、式（1.20）の関係により、最終的に式（1.23）となる。さらに、式（1.23）をベクトル表記すると、式（1.24）となり、この非線形回帰問題を解くと、式（1.25）の非線形近似関数を得ることができる。
【００５８】
【数９】

ここで、注目すべき点は、特徴空間内での内積計算を入力空間内での計算に置き換えることができるカーネル関数Ｋ（ｘ_i，ｘ_j）を用いることにより、双対問題の式（1．21）やそれを解くことによって得られる近似関数にはφ（ｘ）が含まれないので、実際には入力空間から特徴空間への非線形写像を計算する必要はない点である。これは、「カーネルトリック」と呼ばれている。
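A small numerical check illustrates the kernel trick. The homogeneous degree-2 polynomial kernel and its explicit feature map used below are a textbook example chosen for illustration. They are not taken from the patent's equations.

```python
import math


def poly2_kernel(x, z):
    """Degree-2 polynomial kernel, evaluated entirely in the 2-D input space."""
    return (x[0] * z[0] + x[1] * z[1]) ** 2


def phi(x):
    """Explicit map into a 3-D feature space whose inner product equals poly2_kernel."""
    return (x[0] ** 2, math.sqrt(2.0) * x[0] * x[1], x[1] ** 2)


def dot(a, b):
    """Plain Euclidean inner product."""
    return sum(ai * bi for ai, bi in zip(a, b))


x, z = (1.0, 2.0), (3.0, -1.0)
k_input = poly2_kernel(x, z)       # never forms phi(x) or phi(z)
k_feature = dot(phi(x), phi(z))    # same quantity via the explicit mapping
```

The two values agree, so the dual problem can be solved with kernel evaluations alone and the mapping φ never has to be computed.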
【００５９】
このような特徴があるため、高次元化を導入することにより、サポートベクトル回帰（SVR）では特徴空間の次元を非常に大きくとることもでき、さらにモデルの複雑さと特徴空間の次元とを独立に制御することが可能となる。このことより、SVRはいわゆる次元の呪いから逃れることができる。
【００６０】
図３は、非線形近似関数とサポートベクトルを示す概念図である。図３でも、「●」がサポートベクトルであり、これらの近似関数による誤差の大きさはεよりも大きい。「○」は近似関数による誤差の大きさがεよりも小さいデータであり、これらは直接近似関数には影響しない。
【００６１】
１−２−３サポートベクトルと係数の関係
二次ε不感損失関数を用いたＳＶＲ問題では、式（1.14）から式（1.15）に係数の置換えを行う前においては、式（1.11）〜（1.13）のＫＫＴ条件が成り立っていることから、係数の置換え後においては結局以下の関係式が成り立つ。
【００６２】
【数１０】

また、二次ε不感損失関数は、不感帯εを有するので、推定誤差の大きさがεよりも小さい、以下の式（1.27）を満たすデータセット（ｘ_i，ｙ_i）はサポートベクトルから外れる必要がある。このため、得られた解によるサポートベクトルの近似誤差の大きさは、式（1.28）のようになる。
【００６３】
【数１１】

さらに、ラグランジェ関数を式（1.10）のようにおいたので、以下の式（1.29）および式（1.30）の符号が等しくなる。このため、式（1.23）の解であるサポートベクトルの係数α_iは、式（1.31）を満たすことになる。
【００６４】
【数１２】

２−１サポートベクトル回帰の学習に対する本発明の方法
以上の説明の方法をそのまま実行しようとすると、ＳＶＲ問題においてトレーニングデータが多い場合、問題を解くために非常に多くの時間を費やしてしまう。そこで、本発明においては、ＳＶＲ問題を効率良く解くために、以下に説明するとおり、ニュートン・ラフソン（Newton−Raphson）法を変形させた基本的な更新式を求め、その更新式を用いた逐次形アルゴリズムを採用する。
【００６５】
２−１−１問題設定
ここでは、二次ε不感損失関数を使用する次の問題を解く方法について説明する。
【００６６】
［問題］
ｍ入力１出力の未知非線形回帰ｆ：ｘ→y があり、トレーニングデータ
（x₁，y₁），…，（x_n，y_n） x_i∈R^m, y_i∈R
が与えられているとする。このとき、トレーニングデータを用いてこの非線形回帰を近似する式（2.1）の関数を求めるために、式（2.2）で表される最大化問題を解く。
【００６７】
【数１３】

ＳＶＲでは、誤差を評価するときに不感帯を設けているので、この問題を解いて得られたｆ（ｘ，α）とトレーニングデータには、以下に述べるような関係が成り立っていなければならない。
【００６８】
すなわち、データセット（ｘ_i，ｙ_i）が不感帯の内側にあるならば、つまり、｜ｙ_i−ｆ（ｘ_i，α）｜≦εを満たすならば、係数はα_i＝０となる。
【００６９】
逆に、データセット（ｘ_i，ｙ_i）が不感帯の外側にあるならば、つまり、｜ｙ_i−ｆ（ｘ_i，α）｜＞εを満たすならば、係数α_iは、上述した式（1.31）を満たす。
【００７０】
以下に述べるサポートベクトル回帰の学習では、上記の関係を考慮して、不必要となったデータセットをサポートベクトルから除外していく方法をとっている。
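The relationship between the dead zone and the coefficients can be sketched as follows. It is assumed here that the approximating function is a kernel expansion f(x, α) = Σ α_i K(x_i, x), and a linear kernel is used only to keep the example short. The helper names are hypothetical.

```python
def linear_kernel(u, v):
    """Inner product in the input space (placeholder for any valid kernel)."""
    return sum(a * b for a, b in zip(u, v))


def svr_predict(x, support_vectors, alphas, kernel=linear_kernel):
    """Assumed form of the approximating function: a weighted sum of kernels."""
    return sum(a * kernel(sv, x) for sv, a in zip(support_vectors, alphas))


def split_by_dead_zone(data, predict, eps):
    """Separate data sets inside the dead zone (coefficient 0, not needed)
    from those outside it (support vector candidates)."""
    inside, outside = [], []
    for x, y in data:
        (inside if abs(y - predict(x)) <= eps else outside).append((x, y))
    return inside, outside


svs, alphas = [(1.0,), (2.0,)], [0.5, 0.25]
predict = lambda x: svr_predict(x, svs, alphas)
data = [((2.0,), 2.05), ((2.0,), 3.0)]
inside, outside = split_by_dead_zone(data, predict, eps=0.1)
```

Only the data sets returned in `outside` would need a nonzero coefficient; the rest can be dropped from the support vector set.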
【００７１】
２−１−２バッチ処理
以下では、はじめにｎ個のトレーニングデータからいくつかのデータセットを取り出し、その取り出したデータセットだけを用いてＳＶＲを求める方法について説明する。
【００７２】
（ニュートン・ラフソン法の修正）
以下、二次ε不感損失関数に対する双対問題をニュートン・ラフソン法を変形させて更新式を用いて解く方法について説明する。
【００７３】
式（2．2）の最大化問題をベクトル表記すると、式（2．3）のようになり、かつ、評価関数Ｗ（α）をテイラー展開すると、式（2．4）のようになる。
【００７４】
その上で、Ｗ（α）をαiの近傍で二次関数近似を行ったとして、上式（2．4）の第３項までで打ち切る。
【００７５】
【数１４】

この二次関数近似の最小値を与えるα＝αiは、以下の式（2．5）の条件により与えられ、そのような最小値を与えるα＝αiは、逐次的に以下の式（2．6）の関係を満たすことになる（ニュートン・ラフソン法）。
【００７６】
【数１５】

（αの更新方法（初期値０の場合））
ニュートン・ラフソン法に基づきαを求める際、初期値α⁰＝０の場合について考える。式（2．6）で、α⁰＝０とすると、以下の式（2．12）および式（2．13）を満たすことになるが、本発明においては、ε＝０とおいて、式（2．14）のように修正する。
【００７７】
【数１６】

その後、さらに式（2．6）を用いて更新すると、以下の式（2．15）および（2．１6）のようになる。さらに、αi¹≠０のとき、式（2．17）が成り立つので、式（2．16）は、式（2．18）および（2．19）で表されるようになる。
【００７８】
【数１７】

このように求めた係数ベクトルαで、以下の式（2．20）が成り立っている場合、式（2．21）のようになるので、上述した式（1．31）を満たしていることがわかる。
【００７９】
【数１８】

しかし、求めた解が式（2.20）を満たしていない場合、その解は式（1．31）を満たしておらず、解としてはふさわしくない。
【００８０】
（余分なデータセットの除去方法）
次に、式（2．20）を満たしていない場合について考えることにする。
【００８１】
式（2.20）を満たしていない場合は、求めた解は式（1.31）を満たしておらず、ＫＫＴ条件を満たさないので解ではないことになる。そこで、このような場合は、余分なデータセット（ｘ_i，ｙ_i）が現在のサポートベクトルの候補の中に存在していると考え、現在のサポートベクトルの候補の中から余分なデータセットを取り除くことにする。
【００８２】
以下では、余分なデータセットの選出方法について述べる。
まずはじめに、先ほどの更新式を用いて、以下の式（2.22）に基づいて、係数ベクトルαｎ_svを求める（初期値：０）。
【００８３】
次に、係数ベクトルαｎ_svの各要素はゼロではない（α_i≠０，ｉ＝1，…，ｎ_sv）と考え、さらに更新式（2.6）にステップサイズη（0＜η≦1）を導入して、以下の式（2.23）のような新たな更新式をつくる。
【００８４】
【数１９】

この更新式を用いて係数ベクトルαｎ_svを更新する際、更新前後で係数ベクトルαｎ_svの各要素の符号が変化しない範囲で更新を行うことにする。
【００８５】
次に、ステップサイズの決め方について説明する。
ステップサイズは、１に近ければ近いほど評価関数が大きな値となるので、できる限り１に近いことが望ましい。そこで、次のようにしてそのステップサイズを求める。
【００８６】
まず、以下の式（2.24）により、ステップサイズベクトルを計算する。
【００８７】
【数２０】

ただし、ステップサイズベクトルη＝［η₁，…，ηｎ_sv］^Tであり、η_iは、更新式（2.23）においてステップサイズη＝η_iで、係数ベクトルαｎ_svを更新するとα_i＝０となる値である。
【００８８】
つぎに、ステップサイズベクトルηの要素の中で、０以上で、かつ、最も小さい要素ηiをステップサイズとして更新を行う。このとき、α_i＝０であり、データセット（ｘ_i，ｙ_i）を余分なデータセットであるとし、以下の式（2.25）のようにサポートベクトルから外すことにする。
【００８９】
【数２１】

これに伴いサポートベクトルの個数が減少することになる（ｎ_sv←ｎ_sv−１）。
【００９０】
サポートベクトルの個数が減少した後、再び更新式（2.23）より係数ベクトルαｎ_svを更新する。このときも先ほどと同じように、更新前後で係数ベクトルαｎ_svの各要素の符号が変化しない範囲で更新を行うために式（2.24)からステップサイズηを求める。
【００９１】
ステップサイズベクトルηの要素の中で０以上で最も小さい要素η_iが１以上の場合、更新式（2.23）においてη＝１で更新した係数ベクトルαｎ_svは更新前後で各要素の符号が変化しないことがわかる。
【００９２】
このとき、係数ベクトルαｎ_svが決定する。逆にステップサイズベクトルηの要素の中で０以上で最も小さい要素η_iが１よりも小さい場合、先ほどと同じく更新式（2.23）においてステップサイズη＝η_iで係数ベクトルαｎ_svを更新し、データセット（ｘ_i，ｙ_i）をサポートベクトルから外すことにする。以降、係数ベクトルαｎ_svが決定するまでこの作業を繰り返し行う。
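The step-size rule described above can be sketched as follows. Equations (2.23) and (2.24) are not reproduced in this text, so the sketch assumes the update has the form α ← α + ηΔα, where Δα is the full Newton-Raphson correction computed elsewhere. Under that assumption the step that drives α_i to zero is η_i = −α_i/Δα_i. The function names are illustrative.

```python
def choose_step(alpha, delta):
    """Pick the step size for one update.

    Returns (eta, drop_index). drop_index is None when the full step
    eta = 1 changes no coefficient's sign; otherwise it is the index of
    the coefficient that reaches zero first and should be removed.
    """
    best_eta, best_idx = None, None
    for i, (a, d) in enumerate(zip(alpha, delta)):
        if d == 0.0:
            continue  # this coefficient does not move, so it never reaches zero
        eta_i = -a / d
        if eta_i >= 0.0 and (best_eta is None or eta_i < best_eta):
            best_eta, best_idx = eta_i, i
    if best_eta is None or best_eta >= 1.0:
        return 1.0, None
    return best_eta, best_idx


def apply_step(alpha, delta, eta, drop=None):
    """Apply alpha + eta * delta and remove the dropped coefficient, if any."""
    updated = [a + eta * d for a, d in zip(alpha, delta)]
    if drop is not None:
        del updated[drop]
    return updated


alpha, delta = [1.0, -0.5, 2.0], [-0.5, 2.0, 1.0]
eta, drop = choose_step(alpha, delta)          # second coefficient hits zero first
alpha_next = apply_step(alpha, delta, eta, drop)
```

In the procedure described above, this pair of calls would be repeated, with Δα recomputed for the reduced support vector set each time, until `choose_step` returns a full step.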
【００９３】
２−１−３逐次処理
以上の説明では、ｎ個のトレーニングデータからいくつかのデータセットを取り出し、その取り出したデータセットだけからＳＶＲを求める方法について説明した。しかし、バッチ処理では逆行列の計算を行っているため、扱うデータ数には限界がある。そこで、このような問題を回避するために、次に逐次処理について説明する。
【００９４】
ここでは、既に以下の式（2.26）のようなＳＶＲによる近似関数が存在しており、新たなトレーニングデータ（ｘ_s，ｙ_s）を用いて、このＳＶＲによる近似関数を更新する方法について説明する。
【００９５】
【数２２】

まず、はじめに、新たなトレーニングデータ（ｘ_s，ｙ_s）が不感帯の内側にあるかどうかを調べる。
【００９６】
もし、以下の式（2.27）が成り立つならば、不感帯の内側にあることになりＳＶＲの更新は必要でないが、新たなトレーニングデータ（ｘ_s，ｙ_s）が不感帯の外側にある場合、つまり以下の式（2.28）が成り立つ場合、SVRの更新が必要となる。
【００９７】
【数２３】

そこで、新たなトレーニングデータ（ｘ_s，ｙ_s）が不感帯の外側にある場合について考えることにする。
【００９８】
このとき、（ｘ₁，ｙ₁），…，（ｘｎ_sv，ｙｎ_sv），（ｘ_s，ｙ_s）をサポートベクトルであるとみなす。
【００９９】
先ほど示した式（2.23）を用いて、係数ベクトルαを更新しようとした場合、新たなトレーニングデータ（ｘ_s，ｙ_s）の係数α_sが決まっていないのでこの値の初期値を０とする。
【０１００】
しかし、このままでは、式（2.23）を計算する際、sign（α_s）＝０となり、このまま係数ベクトルαを更新して求めても、ＫＫＴ条件を満たさない。
【０１０１】
そこで、更新式（2.23）を次のように変更する。
【０１０２】
【数２４】

この後は、更新式（2.29）を用いてαを更新することを考える。ここでも、更新前後で、係数ベクトルαｎ_svの各要素の符号が変化しない範囲で更新を行うために式（2.24）を次のように変更し、ステップサイズηを求める。
【０１０３】
【数２５】

これ以降は、バッチ処理のところで行ったようにαの更新を行い、αを決定する。
【０１０４】
実際には、本発明においては、以下の前置型非線形歪補償システムの処理において説明するとおり、大量のトレーニングデータセットが与えられている場合についてのＳＶＲを求めるアルゴリズムを用いる。
【０１０５】
このアルゴリズムは、バッチ処理・逐次処理を一つにまとめた計算方法である。
【０１０６】
［非線型デバイス（ＮＬＤ）の飽和特性を特に考慮しない構成］
図４は、本発明の前置型非線形歪補償システムの構成を説明するにあたり、その理解の前提として、非線型デバイス（ＮＬＤ）の飽和特性を特に考慮しない構成を有する前置型非線形歪補償システム１０００を示す概略ブロック図である。
【０１０７】
図４に示した例では、増幅器１６の出力の歪を前置型非線形歪補償器により補償する構成となっている。
【０１０８】
図４を参照して、前置型非線形歪補償システム１０００は、無線周波数信号Ｓ_INを中間周波数信号に変換するダウンコンバータ２と、ダウンコンバータ２に接続され、ダウンコンバータ２の出力する中間周波数信号をサンプリングしてデジタル信号に変換するＡ／Ｄ変換器４と、Ａ／Ｄ変換器４に接続され、サンプリングされた中間周波数信号をベースバンド信号ｘ（ｔ）に変換する直交検波器６と、直交検波器６に接続され、信号ｘ（ｔ）を、後述する増幅器１６の逆特性を用いて、ベースバンド信号ｙ（ｔ）の実数部Ｒｅ｛ｙ（ｔ）｝に変換するＳＶＭ５４と、直交検波器６に接続され、信号ｘ（ｔ）を、増幅器１６の逆特性を用いて、ベースバンド信号ｙ（ｔ）の虚数部Ｉｍ｛ｙ（ｔ）｝に変換するＳＶＭ５８と、ＳＶＭ５４および５８に接続され、変換後のベースバンド信号ｙ（ｔ）を中間周波数信号に変換する直交変調器１０と、直交変調器１０に接続され、直交変調器１０の出力する中間周波数信号をアナログ信号に変換するＤ／Ａ変換器１２とを含む。
【０１０９】
なお、本願発明は、図４において、ダウンコンバータ２、Ａ／Ｄ変換器４および直交検波器６からなる受信部１の代わりに、無線送信機のベースバンド信号の出力部が接続されるような態様にも適用可能である。
【０１１０】
前置型非線形歪補償システム１０００は、さらに、Ｄ／Ａ変換器１２に接続され、アナログの中間周波数信号を無線周波数信号Ｓ₀に変換し、増幅器１６に供給するアップコンバータ１４と、増幅器１６に接続され、増幅器１６より出力される出力信号Ｓ_OUTを中間周波数信号に変換するダウンコンバータ１８と、ダウンコンバータ１８に接続され、中間周波数信号をデジタル信号に変換するＡ／Ｄ変換器２０と、Ａ／Ｄ変換器２０に接続され、デジタル信号をベースバンド信号ｚ（ｔ）に変換する直交検波器２２とを含む。
【０１１１】
前置型非線形歪補償システム１０００は、さらに、直交検波器２２の出力する信号ｚ（ｔ）と学習に用いたベースバンド信号ベクトルＶ＿ｚ（ｐ_i）（ｉ＝１，…，ｎ_R）とに基づいて、ＳＶＭ５４より出力されるベースバンド信号の実数部の推定値Ｒｅ｛ｙ′（ｔ）｝を出力するＳＶＭ２６と、直交検波器２２の出力する信号ｚ（ｔ）とベースバンド信号ベクトルＶ＿ｚ（ｑ_j）（ｊ＝１，…，ｎ_I）とに基づいて、ＳＶＭ５８より出力されるベースバンド信号の虚数部の推定値Ｉｍ｛ｙ′（ｔ）｝を出力するＳＶＭ６２と、ＳＶＭ５４およびＳＶＭ２６に接続され、ベースバンド信号の実数部の推定値Ｒｅ｛ｙ′（ｔ）｝とベースバンド信号ｙ（ｔ）の実数部Ｒｅ｛ｙ（ｔ）｝との差分を計算する減算器３０と、ＳＶＭ５８およびＳＶＭ６２に接続され、ベースバンド信号の虚数部の推定値Ｉｍ｛ｙ′（ｔ）｝とベースバンド信号ｙ（ｔ）の虚数部Ｉｍ｛ｙ（ｔ）｝との差分を計算する減算器６４と、直交検波器６および直交検波器２２に接続され、信号ベクトルおよびサポートベクトルのサイズｍと増幅器１６による遅延時間ｄとを求めるｍ・ｄ決定部４４と、減算器３０、ＳＶＭ５４、ＳＶＭ２６、直交検波器２２およびｍ・ｄ決定部４４に接続され、減算器３０の出力、直交検波器２２の出力およびｍ・ｄ決定部４４の出力に基づいて、ＳＶＭ５４およびＳＶＭ２６における、増幅器１６の逆特性を有する関数の係数α_R（i）を更新する係数更新部４２と、減算器６４、ＳＶＭ５８、ＳＶＭ６２、直交検波器２２およびｍ・ｄ決定部４４に接続され、減算器６４の出力、直交検波器２２の出力およびｍ・ｄ決定部４４の出力に基づいて、ＳＶＭ５８およびＳＶＭ６２における、増幅器１６の逆特性を有する関数の係数α_I（ｊ）を更新する係数更新部６６とを含む。
【０１１２】
ＳＶＭ２６は、後に説明する方法に従い、ベースバンド信号の実数部の推定値Ｒｅ｛ｙ′（ｔ）｝を求める。ＳＶＭ６２は、ＳＶＭ２６と同様の方法に従い、ベースバンド信号の虚数部の推定値Ｉｍ｛ｙ′（ｔ）｝を求める。
【０１１３】
係数更新部４２は、Ｒｅ｛ｙ（ｔ）｝とＲｅ｛ｙ′（ｔ）｝との誤差ｅ_R（ｔ）が最小になるように係数α_R（i）を制御する。係数更新部６６は、係数更新部４２と同様に、Ｉｍ｛ｙ（ｔ）｝とＩｍ｛ｙ′（ｔ）｝との誤差ｅ_I（ｔ）が最小になるようにα_I（ｊ）を制御する。
【０１１４】
なお、直交検波器６をアナログで実現し、ダウンコンバータ２の出力を直交検波器６で受け、直交検波器６の出力をＡ／Ｄ変換器４でデジタルに変換し、その出力をＳＶＭ５４およびＳＶＭ５８に供給する構成であっても構わない。また、直交変調器１０をアナログで実現し、ＳＶＭ５４およびＳＶＭ５８の出力をＤ／Ａ変換器１２が受け、Ｄ／Ａ変換器１２の出力を直交変調器１０で受け、直交変調器１０の出力をアップコンバータ１４に供給する構成であっても構わない。さらに、直交検波器２２をアナログで実現し、ダウンコンバータ１８の出力を直交検波器２２で受け、直交検波器２２の出力をＡ／Ｄ変換器２０で受け、Ａ／Ｄ変換器２０の出力をＳＶＭ２６およびＳＶＭ６２に供給する構成であっても構わない。
【０１１５】
（前置型非線形歪補償システム１０００の動作）
以下、前置型非線形歪補償システム１０００の動作について説明する。
【０１１６】
まず、図４において、信号Ｓ_IN、Ｓ₀およびＳ_OUTは、式（１）〜（３）でそれぞれ表わされる。
【０１１７】
Ｓ_IN＝Ｒｅ｛ｘ（ｔ）ｅｘｐ（ｊ２πｆ_RFｔ）｝
：前置型非線形歪補償器への入力信号 …（１）
Ｓ₀＝Ｒｅ｛ｙ（ｔ）ｅｘｐ（ｊ２πｆ_RFｔ）｝
：歪補償された信号 …（２）
Ｓ_OUT＝Ｒｅ｛ｚ（ｔ）ｅｘｐ（ｊ２πｆ_RFｔ）｝
：増幅器１６からの出力信号 …（３）
ｘ（ｔ）、ｙ（ｔ）およびｚ（ｔ）は、それぞれ帯域信号Ｓ_IN、Ｓ₀およびＳ_OUTの複素包絡線（ベースバンド信号）である。ｊは虚数単位を表わす。ｆ_RFは無線周波数（Radio Frequency）を表わす。ｔは時刻を表わす。
【０１１８】
［非線形歪補償器の動作原理］
Ｐ＝｜ｙ（ｔ）｜²／２として、増幅器１６のベースバンド領域における入出力特性をｆを複素関数としてｆ（Ｐ）と置くと、ベースバンド信号ｙ（ｔ）およびｚ（ｔ）の関係は、下記の式（４）のように表すことができる。
【０１１９】
さらに、一般に、非線形歪補償器では、ｆ（Ｐ）に対し、次式（５）の関係を満たす入出力特性ｇ（Ｐ）を求めることを目的とする。仮に、入出力特性ｇ（Ｐ）を得ることができれば、次式（６）および（７）が成立ち、Ｓ_INに対してＳ_OUTを線形化することができる。
【０１２０】
【数２６】

前置型非線形歪補償システム１０００では、ｇ（Ｐ）を、ＳＶＭ２６および係数更新部４２ならびにＳＶＭ６２および係数更新部６６にて推定する。具体的には、ｙ（ｔ）とｚ（ｔ）より推定したｙ′（ｔ）との誤差ｅ（ｔ）が、実部および虚部のそれぞれにおいて最小になるように、ＳＶＭ２６およびＳＶＭ６２の関数の係数を更新する。この計算が収束すれば、ＳＶＭ２６およびＳＶＭ６２により近似される特性は、増幅器１６の逆特性に相当することとなる。
【０１２１】
［逆特性関数の計算］
上述したような逆特性関数の計算のための条件を以下に整理しておく。
・非線形の歪補償のための計算は時間連続信号ではなく離散信号に対して行なわれる。ハードウェア化の際は、Ａ／Ｄ変換器を使用し、連続信号を離散信号に変換し、計算後、離散信号を連続信号にＤ／Ａ変換器により復元する。
・信号処理は帯域系ではなくベースバンド系または中間周波数信号にて行なわれる。ベースバンド系の場合には、増幅器１６は等価的に複素ベースバンド信号ｙ（ｔ）を受け、複素信号ｚ（ｔ）を出力する系と見なすことができる。
【０１２２】
［サポートベクトルマシンの動作説明］
時刻ｋＴ（Ｔ：Ａ／Ｄ変換およびＤ／Ａ変換のサンプリング間隔）のとき、サポートベクトルマシンＳＶＭ５４の出力Ｒｅ｛ｙ（ｋＴ）｝およびＳＶＭ５８の出力Ｉｍ｛ｙ（ｋＴ）｝、ならびにサポートベクトルマシンＳＶＭ２６の出力Ｒｅ｛ｙ′s（ｋＴ）｝およびＳＶＭ６２の出力Ｉｍ｛ｙ′s（ｋＴ）｝は、次式（８）〜（１１）により得られる。
【０１２３】
【数２７】

ここで、変数の接頭辞”Ｖ＿”は、その変数がベクトルであることを表わす。また、ｍは信号ベクトルおよび後述するサポートベクトルのサイズ、ｎ_R、ｎ_Iは、それぞれ実部および虚部のサポートベクトルマシンのサポートベクトルの数であり、ｐ_iはｉ番目のサポートベクトルを取得した時間であり、ｑ_jはｊ番目のサポートベクトルを取得した時間である。
【０１２４】
Ｖ＿ｚ（ｐ_iＴ）（ｉ＝１，…，ｎ_R），Ｖ＿ｚ（ｑ_jＴ）（ｊ＝１，…，ｎ_I）
は増幅器１６の入出力特性を学習した結果、時刻ｋＴまでに抽出されたサポートベクトルである。
【０１２５】
α_R（ｉ），α_I（ｊ）は、それぞれＶ＿ｚ（ｐ_iＴ），Ｖ＿ｚ（ｑ_jＴ）に対応した重み係数（実数）である。一般に、α_R（ｉ）（または、α_I（ｊ））が零の時、対応するＶ＿ｚ（ｐ_iＴ）（または、Ｖ＿ｚ（ｑ_jＴ））は計算に加える必要がなく廃却しても構わない。また、非零の係数α_R（ｉ）（または、α_I（ｊ））の個数をサポートベクトルマシンの次元と呼ぶ。例えば、非零の係数の個数が７３であればサポートベクトルマシンの次元数は７３である。さらに、この次元数は学習データを増やしても一定値以上に増えることがない。この上界値のことをＶＣ（Vapnik and Chervonenkis）次元と呼ぶ。
【０１２６】
Ｋ（Ｖ＿ｘ，Ｖ＿ｙ）は「内積カーネル」の関数であり、前置型非線形歪補償システム１０００では、内積カーネルの線形和である式（８）〜（１１）を、適切な係数α_R（ｉ），α_I（ｊ）を選択することにより、増幅器１６の入出力特性の逆特性ｇ（Ｐ）に適合させることを目標とする。このように未知の関数にシステムの入出力関数を適合させるような問題全般を総称して「回帰問題」という。次式（１５）〜（１７）は、一般的に用いられる内積カーネルの例である。
【０１２７】
【数２８】

どの内積カーネルが逆関数を表現するのに最も適しているかはシステムにより異なり、実際には動作前に何度か試験をした上で最も特性のよいものを選択する。もちろん、上記以外の関数でも逆関数ｇ（Ｐ）をよりよく推定でき、かつ期待する効果が得られるのであれば、その関数を使用してもよい。
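Since equations (15) to (17) are not reproduced in this text, the sketch below shows three inner-product kernels that are commonly used with support vector machines (polynomial, Gaussian and sigmoid). Whether these match the patent's own examples is an assumption, as are the parameter names. Complex baseband samples are assumed to be flattened into real and imaginary parts before the kernel is applied.

```python
import math


def dot(u, v):
    """Euclidean inner product of two real vectors."""
    return sum(a * b for a, b in zip(u, v))


def polynomial_kernel(u, v, degree=3, offset=1.0):
    """Polynomial kernel (u.v + offset) ** degree."""
    return (dot(u, v) + offset) ** degree


def gaussian_kernel(u, v, sigma=1.0):
    """Gaussian (RBF) kernel exp(-|u - v|^2 / (2 sigma^2))."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(u, v))
    return math.exp(-sq_dist / (2.0 * sigma ** 2))


def sigmoid_kernel(u, v, scale=1.0, shift=0.0):
    """Sigmoid kernel tanh(scale * u.v + shift)."""
    return math.tanh(scale * dot(u, v) + shift)


def stack_complex(samples):
    """Assumed encoding: flatten complex samples into [Re, Im, Re, Im, ...]."""
    out = []
    for s in samples:
        out.extend((s.real, s.imag))
    return out


v_a = stack_complex([1 + 2j])
v_b = stack_complex([3 + 4j])
k_poly = polynomial_kernel(v_a, v_b, degree=2)
```

Comparing a few such kernels on recorded amplifier data before operation is one way to carry out the trial selection that the text describes.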
【０１２８】
［係数更新部４２の動作説明］
以下では、サポートベクトルに対応する係数α_R（ｉ），α_I（ｊ）の更新計算を行う係数更新部４２および係数更新部６６の動作について説明する。
【０１２９】
上述したとおり、係数更新部４２は、Ｒｅ｛ｙ（ｔ）｝とＲｅ｛ｙ′（ｔ）｝との誤差ｅ_R（ｔ）が最小になるように係数α_R（ｉ）を制御する。一方、係数更新部６６は、Ｉｍ｛ｙ（ｔ）｝とＩｍ｛ｙ′（ｔ）｝との誤差ｅ_I（ｔ）が最小になるように係数α_I（ｊ）を制御する。
【０１３０】
【数２９】

なお、以下では、表記を簡単にするためＴ＝１としている。このようにしても一般性を失うことはない。
【０１３１】
係数α_R（ｉ）は、次式（２０）で表わされる評価関数Ｗを最大化するという最適化問題を解くことにより得られる。また、係数α_I（ｊ）についても同様である。
【０１３２】
【数３０】

ただし、定数Ｃと定数εは、通常、実験的に求められる。
【０１３３】
増幅器１６の入出力特性の学習が完了するまでの間はサポートベクトルの追加・削除が繰返されるが、これと対応して係数α_R（ｉ），α_I（ｊ）の値を更新する必要がある。
【０１３４】
以下では、係数更新部４２の動作を例にとって説明する。係数更新部６６の動作も、入力される変数が異なるのみで、基本的に係数更新部４２と同様である。
【０１３５】
なお、後に詳しく説明するように、係数更新部４２の更新の計算の特徴の１つは、以下のようにして逐次的に係数α_R（ｉ）の値を更新することである。
（１）更新の計算の途中において、係数更新部４２は、ｎ−１組のサポートベクトルＶ＿ｚ（ｐ_i）および係数α_R（ｉ）を取得している状態となる。
（２）係数更新部４２は、さらに、ｎ個目のサポートベクトルの候補としてデータＶ＿ｚ（ｐ_n）を入手する。
（３）Ｖ＿ｚ（ｐ_n）をサポートベクトルと見なし、その係数をα_R（ｎ）＝０とおく。
（４）Ｖ＿ｚ（ｐ_n）が加わったことによる最適解からのずれ（摂動）を補正するため、α_R（ｉ）（ｉ＝１，…，ｎ）に関して誤差分（摂動分）Δα_R（ｉ）を計算する。
【０１３６】
上述のように逐次サポートベクトルを抽出することにより、１回の係数更新において計算対象となる係数の数を削減できるので、計算量を削減することができる。
【０１３７】
［係数の更新の具体的な計算方法］
係数の更新方法の具体例を以下に示す。ただし、この具体例は実現の一例であって、必ずしもこの通りである必要はない。
【０１３８】
図５および図６は、このような係数の更新の具体的な計算方法を説明するためのフローチャートである。
【０１３９】
図５を参照して、係数更新部４２は、まず、取得済みのＮ個の学習データセットからｎ個のデータセット（Ｖ＿ｚ（Ｐ_i），Ｒｅ｛ｙｓ（Ｐ_i）｝）（ｉ＝１〜ｎ）を取り出し、これらをサポートベクトルに採用する（ステップＳ１００）。
【０１４０】
次に、係数更新部４２は、上記サポートベクトルに対応する係数の第１次近似と、その符号ベクトルＳ_Rを、以下の式（２１）および（２２）により計算する（ステップＳ１０２）。
【０１４１】
【数３１】

さらに、ステップサイズベクトル［η_R（１），…，η_R（ｎ）］^Tを以下の式（２４）および（２５）により計算する（ステップＳ１０４）。
【０１４２】
【数３２】

ステップサイズベクトルの要素の中で０以上でもっとも小さい要素をη_R（ν）とし（ステップＳ１０６）、要素η_R（ν）が１以上であるか否かを判定する（ステップＳ１０８）。
【０１４３】
まず、要素η_R（ν）が１未満である場合、続いて、要素η_R（ν）を用いて、以下の式（２６）により、係数ベクトル［α_R（１），…，α_R（ｎ）］^Tを更新する（ステップＳ１１０）。
【０１４４】
【数３３】

その上で、（Ｖ＿ｚ（Ｐν），Ｒｅ｛ｙｓ（Ｐν）｝）を余分なデータセットとして、サポートベクトルから削除し、係数α_R（Pν）も削除する（ステップＳ１１２）。このような削除に対応して、ｉ＞νについて、以下の表にしたがって、ｉ番目の学習データを取得した時間、および係数α_R（ｉ）の番号を読替える。
【０１４５】
【表１】

さらに、（ｎ−１）を新たにｎと読替える処理を行う（ステップＳ１１４）。以上の読替えの手続きの後、処理はステップＳ１０４に復帰する。
【０１４６】
一方、ステップＳ１０８において、要素η_R（ν）が１以上である場合、以下の式（２７）により、改めて係数ベクトルを計算する（ステップＳ１１６）。
【０１４７】
【数３４】

次に、図６を参照して、新しい学習データセット（Ｖ＿ｚ（ｋ），Ｒｅ｛ｙｓ（ｋ）｝）の取得が行われる（ステップＳ１１８）。
【０１４８】
さらに、以下の式（２８）により、誤差ｅ_Rが計算される（ステップＳ１２０）。続いて、誤差ｅ_Rとしきい値ε´との比較が行われて（ステップＳ１２２）、｜ｅ_R｜＜ε´であれば、処理は、ステップＳ１１８に復帰する。なお、ここで、ε≦ε´である。
【０１４９】
【数３５】

一方、｜ｅ_R｜≧ε´であれば、以下の式（２９）〜（３０）にしたがって、係数α_R（１），…，α_R（ｎ），および誤差ｅ_Rのそれぞれについて、符号を計算し、符号ベクトルを置き換え（ステップＳ１２４）、［α_R（１），…，α_R（ｎ），０］を新たに係数ベクトルに置き換えるとともに、（ｎ＋１）を新たにｎと置き換える（ステップＳ１２６）。
【０１５０】
【数３６】

（Ｖ＿ｚ（ｋ），Ｒｅ｛ｙｓ（ｋ）｝）をｎ番目のサポートベクトル（Ｖ＿ｚ（Ｐｎ），Ｒｅ｛ｙｓ（Ｐｎ）｝）として採用して、処理はステップＳ１０４に復帰する（ステップＳ１２８）。
【０１５１】
以上の処理により、係数更新部４２は、逐次適応的に係数ベクトルの更新を行う。
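The admission of a new learning data set in Fig. 6 can be sketched as follows. Equations (28) to (30) are not reproduced in this text. The sketch follows the prose description only: the error e_R is assumed to have been computed already, the new coefficient starts at 0, and the sign vector is extended with the sign of the error. The function names are hypothetical.

```python
def sign(v):
    """Return -1, 0 or 1 according to the sign of v."""
    return (v > 0) - (v < 0)


def admit_candidate(alphas, e_r, eps_prime):
    """Decide whether a new data set becomes a support vector candidate.

    Returns (alphas, signs, admitted). When |e_r| < eps_prime the data set
    lies inside the extended dead zone and nothing changes. Otherwise the
    coefficient vector grows by one zero entry and the sign vector takes
    the sign of the error for the new entry.
    """
    signs = [sign(a) for a in alphas]
    if abs(e_r) < eps_prime:
        return list(alphas), signs, False
    return list(alphas) + [0.0], signs + [sign(e_r)], True


kept = admit_candidate([0.5, -0.2], e_r=0.05, eps_prime=0.1)
grown = admit_candidate([0.5, -0.2], e_r=-0.3, eps_prime=0.1)
```

Using the sign of the error for the new entry avoids the problem noted earlier, where a coefficient initialized to zero has no usable sign of its own.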
【０１５２】
［サポートベクトルマシンＳＶＭ５４，５８の動作説明］
ＳＶＭ５４は、係数更新部４２で更新された係数α_R（ｉ）を用いて、上述した式（８）に従い、ベースバンド信号ｘ（ｔ）をベースバンド信号Ｒｅ｛ｙ（ｔ）｝に変換する。サポートベクトルマシンＳＶＭ５８は、上述したのと同様の手続きで、係数更新部６６で更新された係数α_I（ｉ）を用いて、上述した式（９）に従い、ベースバンド信号ｘ（ｔ）をベースバンド信号Ｉｍ｛ｙ（ｔ）｝に変換する。
【０１５３】
［逐次計算における特徴的手続きの説明］
以上、図５および図６により、サポートベクトルマシンを用いて、非線形なプロセスであって、当該プロセスを表現する変数が複素数で表される場合に、回帰問題を解くことにより、前置型非線形歪補償システムを実現するための構成について説明した。特に、上記係数更新部４２の行う処理において、特徴的な手続きについて、以下に説明を付け加えておく。
【０１５４】
図７は、１入力１出力の非線形回帰ｆ：ｘ→ｙに関して、サポートベクトルマシンによる近似学習がある程度進んだ状態において、サポートベクトルによる学習データの近似特性を示す概念図である。
【０１５５】
図７においては、「損失」は、ε不感２次損失関数に基づく近似特性を表しているものとする。つまり、「損失」は、不感帯の片側幅を近似特性に加減して得られた２つの不感帯境界値のうち、学習データに近い方の不感帯境界値を減算することにより得られる値と定義する。
【０１５６】
図７で、２重丸はサポートベクトルであるデータを表しており、１重丸は、サポートベクトルによる近似特性に対する不感帯の内側に存在し、サポートベクトルではないデータを表す。なお、サポートベクトルの定義により、図７中で２重丸で表されるデータが真のサポートベクトルであるときは、この真のサポートベクトルの外側には、サポートベクトルは存在しない。
【０１５７】
ここで、たとえば、サポートベクトルによる近似特性の上側であって、かつ、不感帯の外側にあるサポートベクトルのデータＤ１＝（ｘ₁，ｙ₁）について見てみると、このデータＤ１についての損失Ｌ１は、以下の式で表される。
【０１５８】
Ｌ１＝α₁／Ｃ
なお、ここで、α₁はサポートベクトルｘ₁に対する係数である。このとき、データＤ１の損失は、このデータがサポートベクトルによる近似特性の上側にあることに対応して、「上向き、かつ、外向き」であり、α₁＞０である。
【０１５９】
一方、たとえば、サポートベクトルによる近似特性の下側であって、かつ、不感帯の外側にあるサポートベクトルのデータＤ２＝（ｘ₂，ｙ₂）について見てみると、このデータＤ２についての損失Ｌ２も、Ｌ２＝α₂／Ｃで表される。
【０１６０】
このとき、データＤ２の損失は、このデータがサポートベクトルによる近似特性の下側にあることに対応して、「下向き、かつ、外向き」であり、α₂＜０である。
【０１６１】
（ステップＳ１２２におけるε´の値の設定）
ε´の値は、本来の不感帯を定義する値εと同じ値としても計算を実行することは可能である。しかし、本願発明では、ε´の値を本来の不感帯を定義する値εよりも大きくしている。
【０１６２】
そこで、以上の準備のもとに、図６のステップＳ１２２において、ε´の値を本来の不感帯を定義する値εよりも大きくする理由について、まず説明する。
【０１６３】
図８は、このようにε´の値を本来の不感帯を定義する値εよりも大きくすることの効果を説明するための概念図である。
【０１６４】
なお、εで決まる本来の不感帯に対し、εをステップＳ１２２のε´で置換えた場合に、ε´で決まる領域を本来の不感帯に対する「拡張不感帯」と定義する。
【０１６５】
拡張不感帯はステップＳ１２２のみにおいて用いられ、他のステップにおける不感帯は、εで決定される。
【０１６６】
図８においては、学習途中のサポートベクトルによる近似特性が直線ＬＳＶで表され、この近似特性よりもさらに近似が進んだ状態の近似特性が曲線ＣＳＶで表されているものとする。ただし、「学習途中のサポートベクトルによる近似特性」を直線としたのは、説明の便宜のためであって、より一般には、この特性も曲線で表される。
【０１６７】
まず、近似特性ＬＳＶについて、拡張不感帯をしきい値ε´（＝ε）で定義した場合は、データＤ３は、最終的にはサポートベクトルにはならないデータであるものの、近似特性ＬＳＶの段階では、サポートベクトルとして選択されることになる。これは、拡張不感帯を定義するしきい値を、しきい値εよりも大きなしきい値ε´（１）で定義した場合も同様である。
【０１６８】
これに対して、近似特性ＬＳＶについて、拡張不感帯をしきい値ε´（１）よりもさらに大きなしきい値ε´（２）で定義した場合は、データＤ３は、近似特性ＬＳＶの段階でも、サポートベクトルとして選択されない。
【０１６９】
上述したとおり、本発明においては、計算量を低減するために、逐次計算の途中において、サポートベクトルとして選択されなくなったデータを計算の対象から除外している。
【０１７０】
したがって、図６のステップＳ１２２において使用するしきい値ε´を、予め本来の不感帯に対応するしきい値εよりも大きなしきい値ε´としておけば、計算が進むにつれて、除外されていくはずのデータをそもそもサポートベクトルとして選択する確率が小さくなるので、一層の計算量の低減を期待できる。
【０１７１】
すなわち、１度、サポートベクトルとして抽出した古いデータセットが、学習（新しいサポートベクトルの追加）が進むにつれ、サポートベクトルとして不要となり除去されることが繰り返され、最終的には新しいサポートベクトルの追加が不要となって学習が終了することになる。しかしながら、「サポートベクトルの追加・除去の繰り返し」を低減することにより、計算量を削減するとともに学習の終了までの時間を短縮することが可能となる。
【０１７２】
なお、新しいサポートベクトルの追加の頻度が低下するにつれてしきい値ε´（≧ε）をεに近づけていくという処理を行うことも可能である。この場合、ある程度の近似精度までは早期に学習することになり、かつ、最終的な精度εも確実に得ることが可能となる。
【０１７３】
また、係数更新部４２や係数更新部６６において、直交検波器２２から受取る学習データの数をカウントするカウンタを備えて、一定数のデータを受取るごとにしきい値ε´を更新しつつ、εに近づけていくという処理を行うことも可能である。
【０１７４】
あるいは、係数更新部４２や係数更新部６６において、時間を計測するカウンタまたは抽出されたサポートベクトルの数をカウントするカウンタを有し、処理期間の長さに対応して、しきい値ε´を更新しつつ、εに近づけていくという処理を行うことも可能である。
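One way to bring ε′ toward ε as learning proceeds is sketched below. The text only states that ε′ is updated each time a fixed number of data has been received, or according to elapsed time or support vector count. The geometric shrink rule, the class name and the parameters `every` and `shrink` are assumptions made for illustration.

```python
def next_eps_prime(eps_prime, eps, shrink=0.5):
    """Move eps_prime a fixed fraction of the way toward eps, never below it."""
    return max(eps, eps + shrink * (eps_prime - eps))


class EpsPrimeScheduler:
    """Counts received learning data and tightens eps_prime every `every` items."""

    def __init__(self, eps, eps_prime, every=100, shrink=0.5):
        self.eps = eps
        self.eps_prime = max(eps, eps_prime)  # keep eps <= eps_prime
        self.every = every
        self.shrink = shrink
        self.count = 0

    def observe(self):
        """Register one received data set and return the current eps_prime."""
        self.count += 1
        if self.count % self.every == 0:
            self.eps_prime = next_eps_prime(self.eps_prime, self.eps, self.shrink)
        return self.eps_prime


sched = EpsPrimeScheduler(eps=0.1, eps_prime=0.5, every=2)
first = sched.observe()    # threshold unchanged after one data set
second = sched.observe()   # threshold tightened after the second
```

With such a schedule the approximation is learned coarsely at first with few support vectors, and the final accuracy ε is still reached because ε′ never drops below ε.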
【０１７５】
（サポートベクトルを削除する基準）
さらに、図５のステップＳ１１０からＳ１１４において、サポートベクトルを削除する基準について、さらに説明する。
【０１７６】
図９は、図８で示される状態から、近似特性の逐次計算を進めた場合に、余分となったサポートベクトルの除去を行う手続きを説明するための概念図である。
【０１７７】
以下の説明では、図８で示される状態で、しきい値εと近似特性ＬＳＶにより決まる不感帯の外にデータＤ３があるため、データＤ３がサポートベクトルとして選択されており、図９に示す状態に近似を進めると、このデータＤ３がサポートベクトルから除去され、一方、データＤ１が新たにサポートベクトルとして追加されるものとする。
【０１７８】
図１０は、図９において、データＤ３について、損失の変化を説明するための概念図である。
【０１７９】
なお、ここでは、直線ＬＳＶで表される近似特性で、このデータＤ３に対応する係数をα_i（１）とし、曲線ＣＳＶで表される近似特性でのデータＤ３に対応する係数をα_i（２）と表すものとする。
【０１８０】
直線ＬＳＶの不感帯の境界の外側にデータＤ３が存在することに対応して、直線ＬＳＶに対するデータＤ３の損失は、α_i（１）／Ｃ（α_i（１）＞０）となる。
【０１８１】
これに対して、曲線ＣＳＶの不感帯の境界の内側にデータＤ３が存在することに対応して、曲線ＣＳＶに対するデータＤ３の損失は、α_i（２）／Ｃ（α_i（２）＜０）となる。
【０１８２】
したがって、近似が進んで、サポートベクトルによる近似特性が、直線ＬＳＶ（より一般的には曲線）から、曲線ＣＳＶへと変化することによる損失の変化量を、−Δα_i／Ｃとすると、ステップＳ１０８で用いたステップサイズηは、η＝−α_i（１）／Δα_iと表される。
【０１８３】
このとき、０≦η＜１が成り立っていれば、図１０で示すように、直線ＬＳＶ（より一般的には曲線）の不感帯の境界と曲線ＣＳＶの不感帯の境界との間にデータＤ３が存在することになる。これは、言い換えると、「データＤ３は、サポートベクトルによる近似特性が、直線ＬＳＶ（より一般的には曲線）から、曲線ＣＳＶと変化することにより、サポートベクトルから除去されるデータとなった」ということができる。なお、以上の説明では、データが近似特性の上側にある場合について説明したが、データが近似特性の下側にある場合にも同様である。
【０１８４】
したがって、図５のステップＳ１０８におけるような基準で判断することにより、それまでサポートベクトルとして選択していたデータをサポートベクトルから除外するべきか否かを判断することが可能となる。
【０１８５】
上述の例では、入出力特性が不明な、非線形の入出力特性を有する増幅器１６の出力の補正に対して、サポートベクトルマシンを用いたが、本発明は増幅器１６に対してのみ適用可能なわけではない。一般的に、入力される信号が複素数で表現でき、かつ入出力特性が不明な信号の変換回路に対しても本発明を適用することが可能である。
【０１８６】
以上説明したように、前置型非線形歪補償システム１０００によると、ＳＶＭ２６およびＳＶＭ６２を用いて、増幅器１６の逆特性を有する信号を推定する。その信号とＳＶＭ５４およびＳＶＭ５８より出力される信号とに基づいて、係数更新部４２および係数更新部６６において、増幅器１６の逆特性を有する関数の係数が更新される。
【０１８７】
この構成では、ルックアップテーブルを必要としないため、ノイズフロアを低く抑えることができ、かつ、入力が複素信号である場合に適用可能であるので、簡単な構成で前置型非線形歪補償器を提供することができる。また、サポートベクトルマシンは、ｐｔｈ−ｏｒｄｅｒ−ｐｒｅｄｉｓｔｏｒｔｅｒに比べて、計算が簡単で収束時間が短い。さらに、学習する過程で、一度、サポートベクトルに加えられた信号ベクトルにおいて、不要となったサポートベクトルが削除されるため、高速処理が可能な前置型非線形歪補償器を提供することができる。
【０１８８】
また、広帯域信号が入力された場合、増幅器の非線形性が信号帯域に比較して狭帯域の場合にもデータベクトルのサイズｍを適切に設定することにより補償することが可能である。
【０１８９】
［本発明の実施の形態の前置型非線形歪補償システム２０００］
以上、図４に示した前置型非線形歪補償システム１０００の構成および動作について説明した。
【０１９０】
前置型非線形歪補償システム１０００は、上述のような特徴を有するものの、図２０に示したような入出力特性を有する増幅器において、バックオフを小さく取ると以下のような問題がある。
【０１９１】
すなわち、増幅器への入力が大きい場合、増幅器の飽和特性により、出力はある値に抑えられる。その飽和した出力が推定用ＳＶＭ２６および６２の入力となるので、推定用ＳＶＭ２６および６２の出力は変換用ＳＶＭ５４および５８の出力に比べて小さくなる。
【０１９２】
したがって、推定用ＳＶＭ２６および６２の出力と変換用ＳＶＭ５４および５８の出力との差分は、入力信号が小さいときには非常に小さな値となるが、入力が大きいときは非線形歪の影響を受け大きくなる。
【０１９３】
図４に示した前置型非線形歪補償システム１０００の方式では、各トレーニングデータについて、両ＳＶＭ出力の差分が予め設定した不感帯ε′よりも大きいとき、サポートベクトルとして採用する。過剰に大きな振幅のトレーニングデータにおける両ＳＶＭ出力の差分は、確実に不感帯ε′よりも大きくなり、サポートベクトルとして採用される。このため、このトレーニングデータにより全てのサポートベクトルは更新され、回帰関数の近似精度は悪くなる。このようなトレーニングデータが複数個採用されることにより、回帰関数の精度はますます悪くなり、場合によっては発散してしまう。
【０１９４】
図１１は、本発明の前置型非線形歪補償システム２０００の構成を説明するための概略ブロック図である。
【０１９５】
図４に示した前置型非線形歪補償システム１０００に比べて、前置型非線形歪補償システム２０００には、データ選定部７０がさらに設けられている。前置型非線形歪補償システム２０００のその他の構成は、前置型非線形歪補償システム１０００と同様であるので、同一部分には同一符号を付して、その説明は繰り返さない。
【０１９６】
なお、図１１において、非線形デバイス１６は、たとえば、図２０に示したような入出力特性を有する増幅器であるものとする。
【０１９７】
図１１を参照して、データ選定部７０は、入力されるベースバンド信号ｘ（ｔ）と、後に説明するしきい値ＴＨとを比較し、信号ｘ（ｔ）がしきい値よりも大きい場合、その信号ｘ（ｔ）を含むデータセットをサポートベクトルとして採用しないようにするものである。
【０１９８】
図１２は、データ選定部７０の構成を説明するための概略ブロック図である。
図１２を参照して、データ選定部７０は、ベースバンド信号ｘ（ｔ）を伝達する経路を選択するための選択部７０２と、少なくとも指定された個数のＮ_TH個のベースバンド信号ｘ（ｔ）を選択部７０２から受けて格納するためメモリ７０４と、メモリ７０４中に格納されたベースバンド信号ｘ（ｔ）からしきい値ＴＨを算出するためのしきい値決定部７０６と、選択部７０２からのベースバンド信号ｘ（ｔ）としきい値ＴＨとを比較して、しきい値ＴＨ以下のベースバンド信号ｘ（ｔ）を選択的に係数更新部４２および６６に与えるための比較部７０８とを備える。
【０１９９】
ここで、しきい値決定部７０６は、以下の式に基づいて、しきい値ＴＨを算出する。
【０２００】
【数３７】

ここで、上式において、Ｋ１およびＮ_THは、予め指定される定数である。
【０２０１】
なお、しきい値ＴＨを計算するためのベースバンド信号ｘ（ｔ）は、たとえば、一定の時間間隔ごとに少なくとも個数Ｎ_THがメモリ７０４に格納されて、しきい値決定部７０６が、この一定時間間隔ごとにしきい値ＴＨを更新することとしてもよい。
【０２０２】
また、しきい値ＴＨの決定の仕方は、上述した方法に限られず、たとえば、所定の定数としてもよいし、あるいは、以下の式のように、過去のベースバンド信号ｘ（ｔ）の絶対値の最大値に所定の定数Ｋ２を乗算することで、更新することとしてもよい。
【０２０３】
【数３８】

すなわち、データ選定部７０においてベースバンド信号ｘ（ｔ）を選択するためのしきい値ＴＨは、外部から任意に設定可能である。または、データ選定部７０におけるしきい値ＴＨは、過去の入力信号から、所定の計算により設定もしくは更新することが可能である。
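The two threshold rules described above can be sketched as follows. The equations themselves are not reproduced in this text, so the sketch follows the prose: the mean of the absolute values of the stored N_TH baseband samples multiplied by K1, or the maximum absolute value multiplied by K2. The function names and the sample values are illustrative.

```python
def threshold_from_mean(samples, k1):
    """TH = K1 * average of |x(t)| over the stored baseband samples."""
    return k1 * sum(abs(s) for s in samples) / len(samples)


def threshold_from_max(samples, k2):
    """TH = K2 * maximum of |x(t)| over the stored baseband samples."""
    return k2 * max(abs(s) for s in samples)


# Four stored complex baseband samples with magnitudes 5, 0, 1 and 2.
stored = [3 + 4j, 0j, 1 + 0j, -2 + 0j]
th_mean = threshold_from_mean(stored, k1=1.5)
th_max = threshold_from_max(stored, k2=0.8)
```

Recomputing either value at fixed intervals from the most recent samples gives the periodic update mentioned in the text.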
【０２０４】
次に、図１１に示した前置型非線形歪補償システム２０００について説明する。ただし、その動作は、基本的に、図５および図６で説明した前置型非線形歪補償システム１０００の動作と同様であるので、以下では、主として、前置型非線形歪補償システム２０００の動作の前置型非線形歪補償システム１０００の動作との相違点について説明する。
【０２０５】
図１３および図１４は、前置型非線形歪補償システム２０００の係数更新の動作を説明するためのフローチャートである。
【０２０６】
まず、図１３を参照して、サポートベクトル係数の第１次近似を行うため、データセットをＮセット取得する。その際、対応するベースバンド信号ｘ（ｔ）も併せて取得しておく（ステップＳ２００）。
【０２０７】
次に、データ選定部７０において、しきい値ＴＨを決定する（ステップＳ２０１）。ここでは、たとえば、上述したうち、個数Ｎ_THのベースバンド信号ｘ（ｔ）の絶対値の平均値に係数Ｋ１を乗じて得られるものとする。
【０２０８】
次に、各データセットについてベースバンド信号ｘ（ｔ）としきい値ＴＨとを比較し、入力信号がしきい値ＴＨよりも大きい場合は、そのデータを含むデータセットは回帰関数の精度を悪くするデータセットであるとみなし、サポートベクトルに採用しない（ステップＳ２０２）。
【０２０９】
残りのデータセットの数をｎとすると、データセットは、（Ｖ＿ｚ（Ｐ_i），Ｒｅ｛ｙｓ（Ｐ_i）｝）（ｉ＝１〜ｎ）である。係数更新部４２は、上記サポートベクトルに対応する係数の第１次近似と、その符号ベクトルＳ_Rを、図５におけるステップＳ１０２と同様にして計算する（ステップＳ２０３）。
【０２１０】
以下、図５のステップＳ１０４〜Ｓ１１６と同様の処理を、図１３においても、ステップＳ２０４〜Ｓ２１６として実行する。
【０２１１】
次に、図１４を参照して、新しい学習データセット（Ｖ＿ｚ（ｋ），Ｒｅ｛ｙｓ（ｋ）｝）の取得が行われる（ステップＳ２１８）。併せて、対応する入力信号のベースバンド信号ｘ（ｔ）も取得する。
【０２１２】
そのときの入力信号のベースバンド信号ｘ（ｔ）がしきい値ＴＨ以上である場合（ステップＳ２１９）、そのデータセットをサポートベクトルに採用しないで、処理は、ステップＳ２１８に復帰する。
【０２１３】
一方、ステップＳ２１９において、入力信号のベースバンド信号ｘ（ｔ）がしきい値ＴＨよりも小さい場合は、さらに、上述した式（２８）により、誤差ｅ_Rが計算される（ステップＳ２２０）。
以下、図６におけるステップＳ１２２〜Ｓ１２８と同様の処理が、図１４においても、ステップＳ２２２〜Ｓ２２８として実行される。
【０２１４】
以上の処理により、係数更新部４２は、逐次適応的に係数ベクトルの更新を行う。
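The data selection added in Figs. 13 and 14 can be sketched as two small gates. It is assumed here that the comparison with TH uses the magnitude of the complex baseband sample. The function names are hypothetical, and the error e_R is taken as already computed.

```python
def filter_batch(data_sets, th):
    """Step S202: drop data sets whose input magnitude exceeds TH.

    data_sets is a list of (x, payload) pairs, where x is the baseband
    input sample and payload stands for the associated learning data.
    """
    return [(x, payload) for x, payload in data_sets if abs(x) <= th]


def is_support_vector_candidate(x, e_r, th, eps_prime):
    """Steps S219 and S222 for one new data set.

    The data set is skipped when the input magnitude is TH or more, and
    otherwise admitted only if the error lies outside the extended dead zone.
    """
    if abs(x) >= th:
        return False
    return abs(e_r) >= eps_prime


batch = filter_batch([(0.8 + 0j, "a"), (0.9 + 0j, "b")], th=0.8)
ok = is_support_vector_candidate(0.5 + 0.5j, e_r=0.3, th=0.8, eps_prime=0.1)
too_large = is_support_vector_candidate(1.0 + 0j, e_r=0.3, th=0.8, eps_prime=0.1)
```

Rejecting oversized inputs before the error test keeps saturated amplifier outputs from ever entering the support vector set.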
【０２１５】
ＳＶＭ５４は、係数更新部４２で更新された係数α_R（ｉ）を用いて、上述した式（８）に従い、ベースバンド信号ｘ（ｔ）をベースバンド信号Ｒｅ｛ｙ（ｔ）｝に変換する。サポートベクトルマシンＳＶＭ５８は、上述したのと同様の手続きで、係数更新部６６で更新された係数α_I（ｉ）を用いて、上述した式（９）に従い、ベースバンド信号ｘ（ｔ）をベースバンド信号Ｉｍ｛ｙ（ｔ）｝に変換する。
【０２１６】
［シミュレーション結果］
以下、本発明の効果を示すために、シミュレーション結果を示す。
【０２１７】
以下のシミュレーションでも、ＮＬＤは増幅器とし、その入出力特性は次式で表されるものとする。
【０２１８】
【数３９】

その他の条件として、入力信号はＯＦＤＭ信号とし、そのキャリア数は１４０５であり、ＦＦＴサイズは２０４８であるものとする。
【０２１９】
信号電力対雑音電力比（以下、「ＳＮＲ」という）は、１００（ｄＢ）である。入力電力は１とし、しきい値ＴＨは固定で０．８としている。
【０２２０】
データセット数は２０００である。また、ｍ＝１とする。さらに、ｄは既知であるものとする。
【０２２１】
図１５は、入力信号をバックオフ６（ｄＢ）で、増幅器１６に通した後の信号スペクトラムである。すなわち、図１５においては、前置型非線形歪補償システムを設けることなく、増幅器１６で、入力信号を増幅した場合の特性のシミュレーション結果を示している。
【０２２２】
図１５において、横軸は周波数キャリア番号、縦軸は電力を表している。非線形歪の影響を受けて、ＳＮＲは、約２０（ｄＢ）にまで悪化している。
【０２２３】
さらに、図１６は、このような状況において、データ選定部７０が設けられていない前置型非線形歪補償システム１０００による信号スペクトラムを示す。
【０２２４】
増幅器１６のみの場合よりも信号帯域外のスペクトラムが上昇しており、逆にＳＮＲは小さくなってしまう。これは、過剰に大きな信号を要素にもつデータセットがサポートベクトルとして採用されてしまったからである。そのデータセットにより回帰関数は、もはや増幅器の逆特性を正確に近似できるものではなくなっている。そのデータセット以降のデータセットは、すべてサポートベクトルとして採用されることになり、サポートベクトル数が多くなる。併せて、サポートベクトル係数は極端に大きな数字となり、やがて発散する。
【０２２５】
図１７は、図１６の状態におけるサポートベクトル係数を表す図である。図１７において、横軸はサポートベクトル番号、縦軸はサポートベクトル係数値である。このとき、サポートベクトル数は約１４５０個と非常に多く、それに伴って計算量も莫大なものになっている。
【０２２６】
図１８は、データ選定部７０を搭載した本発明の前置型非線形歪補償システム２０００による信号スペクトラムを示す図である。増幅器１６のみの場合よりも信号帯域外のスペクトラムが下降しており、ＳＮＲは約３０（ｄＢ）に改善された。これは、発散を招く原因となる過剰に大きな信号を要素にもつデータセットをサポートベクトルに採用しなかったことが理由であると考えられる。
【０２２７】
したがって、バックオフが小さい場合でも、本発明による方式により非線形歪の影響を小さくすることができる。
【０２２８】
図１９は、図１８におけるサポートベクトル係数を表す図である。サポートベクトル数は７１個となっており、前置型非線形歪補償システム１０００と比較して大幅にサポートベクトル数が減少していることが確認できる。併せて、カーネル行列の大きさが小さくなったことにより、計算量も格段に減少している。
【０２２９】
今回開示された実施の形態はすべての点で例示であって制限的なものではないと考えられるべきである。本発明の範囲は上記した説明ではなくて特許請求の範囲によって示され、特許請求の範囲と均等の意味および範囲内でのすべての変更が含まれることが意図される。
【０２３０】
【発明の効果】
以上説明したとおり、本発明によれば、サポートベクトルマシンを用いて、増幅器等の信号の変換回路の逆特性を有する信号を推定するので、ルックアップテーブルを必要とせず、ノイズフロアを低く抑えることができる。
【０２３１】
しかも、入力が複素信号である場合に適用可能であるので、簡単な構成で前置型非線形歪補償器を提供することも可能となる。
【０２３２】
さらに、学習する過程で、一度、サポートベクトルに加えられた信号ベクトルにおいて、不要となったサポートベクトルが削除されるため、高速処理が可能である。
【０２３３】
また、本発明により、ＳＶＲのみを用いた前置型非線形歪補償器においては補償できない過剰に大きな信号をトレーニング用のデータセットから削除するので、回帰関数の精度を維持することができる。
【０２３４】
さらに、本発明により、データセットを削除したことにより計算が安定し、同時にサポートベクトル数を少なくすることができるため、計算量を大幅に削減することが可能となる。
【図面の簡単な説明】
【図１】線形ε不感損失関数および二次ε不感損失関数を示す図である。
【図２】線形近似関数とサポートベクトルとの関係を示す概念図である。
【図３】非線形近似関数とサポートベクトルを示す概念図である。
【図４】本発明の前置型非線形歪補償システム１０００の構成を説明するための概略ブロック図である。
【図５】係数の更新の具体的な計算方法を説明するための第１のフローチャートである。
【図６】係数の更新の具体的な計算方法を説明するための第２のフローチャートである。
【図７】サポートベクトルマシンによる近似学習がある程度進んだ状態において、サポートベクトルによる学習データの近似特性を示す概念図である。
【図８】 ε´の値を本来の不感帯を定義する値εよりも大きくすることの効果を説明するための概念図である。
【図９】近似特性の逐次計算を進めた場合に、余分となったサポートベクトルの除去を行う手続きを説明するための概念図である。
【図１０】図９において、データＤ３について、損失の変化を説明するための概念図である。
【図１１】本発明の前置型非線形歪補償システム２０００の構成を説明するための概略ブロック図である。
【図１２】データ選定部７０の構成を説明するための概略ブロック図である。
【図１３】前置型非線形歪補償システム２０００の係数更新の動作を説明するための第１のフローチャートである。
【図１４】前置型非線形歪補償システム２０００の係数更新の動作を説明するための第２のフローチャートである。
【図１５】入力信号をバックオフ６（ｄＢ）で、増幅器１６に通した後の信号スペクトラムである。
【図１６】データ選定部７０が設けられていない前置型非線形歪補償システム１０００による信号スペクトラムを示す。
【図１７】図１６の状態におけるサポートベクトル係数を表す図である。
【図１８】データ選定部７０を搭載した本発明の前置型非線形歪補償システム２０００による信号スペクトラムを示す図である。
【図１９】図１８におけるサポートベクトル係数を表す図である。
【図２０】一般的な増幅器の入出力特性を示す図である。
【符号の説明】
２，１８ダウンコンバータ、４，２０Ａ／Ｄ変換器、６，２２直交検波器、５４，５８，２６，６２ＳＶＭ、１０直交変調器、１２Ｄ／Ａ変換器、１４アップコンバータ、１６増幅器、３０，６４減算器、４２，６６係数更新部、７０データ選定部、１０００，２０００前置型非線形歪補償システム。
[0001]
[Technical field to which the invention belongs]
The present invention relates to a predistortion-type nonlinear distortion compensator, and more particularly to a predistortion-type nonlinear distortion compensator that uses a support vector machine (hereinafter referred to as "SVM") for regression analysis.
[0002]
[Prior art]
In recent years, predistortion-type nonlinear distortion compensators have attracted attention as a technique for compensating for the nonlinear distortion of nonlinear devices (hereinafter referred to as "NLDs") in communications. Typical predistortion-type nonlinear distortion compensators either use a look-up table or estimate the inverse characteristic of the NLD. A typical example of an NLD is an amplifier.
[0003]
A predistortion-type nonlinear distortion compensator using a look-up table divides the amplifier characteristic into small intervals with respect to the amplitude of the input signal, stores a representative value for each interval in a table, and multiplies the transmission signal by the corresponding compensation value from the table to produce the output signal. Compensators that compare the compensator input with the NLD output and adaptively update the compensation values in the table based on the result have also already been developed.
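The look-up-table scheme can be sketched as follows. The table layout, the bin-centre sampling and the hypothetical `gain_of` function that supplies the compensation value for a given amplitude are all assumptions made for illustration.

```python
def build_lut(gain_of, max_amplitude, bins):
    """Store one compensation value per amplitude interval.

    gain_of is a hypothetical function returning the compensation gain
    for a given input amplitude; each interval is represented by the
    value at its centre.
    """
    width = max_amplitude / bins
    table = [gain_of((i + 0.5) * width) for i in range(bins)]
    return table, width


def predistort(x, table, width):
    """Multiply the input sample by the compensation value of its interval."""
    index = min(int(abs(x) / width), len(table) - 1)  # clamp above max_amplitude
    return x * table[index]


table, width = build_lut(lambda a: 1.0 + a, max_amplitude=1.0, bins=4)
y_small = predistort(0.3 + 0j, table, width)
y_clamped = predistort(2.0 + 0j, table, width)
```

All inputs that fall in the same interval receive the same compensation value, so the applied gain changes in steps rather than continuously.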
[0004]
On the other hand, one predistortion-type nonlinear distortion compensator that estimates the inverse function of the amplifier is known as the pth-order predistorter. This compensator approximates the inverse function of the signal converter with a complex polynomial. The complex polynomial is derived from the Volterra series, the most general representation of a nonlinear function, and is used to perform the distortion compensation. Compensators that compare the compensator input with the NLD output and adaptively update the complex polynomial coefficients based on the result have also already been developed.
[0005]
However, in a front-end nonlinear distortion compensator using a look-up table, the table holds discrete values, and the discontinuities between those values raise the noise floor. Lowering the noise floor requires more table entries, but more entries require more memory.
[0006]
In the pth-order predistorter, the order of the complex polynomial must be increased to improve the calculation accuracy. As a result, the computation takes a long time and the solution is slow to converge.
[0007]
Furthermore, for a wideband signal, both the look-up table and the pth-order predistorter need to compensate for the amplitude and phase nonlinearity at each frequency within the signal band, which makes hardware implementation difficult.
[0008]
In recent years, the problem of modeling a process from input/output observation data has been a subject of research in many fields, such as nonlinear regression and system identification. The problem of obtaining the optimum characteristic of the front-end nonlinear distortion compensator described above likewise reduces to the problem of modeling the inverse function of the characteristic of the amplifier to be compensated, provided that this amplifier is regarded as an unknown nonlinear system.
[0009]
Methods for dealing with such process modeling problems can be broadly divided into two categories: global modeling and local modeling.
[0010]
Local modeling includes the weighted average method, locally weighted regression, and the Just-in-time method, which is a form of database learning.
[0011]
Global modeling, by contrast, creates a single function that fits all of the data in the database. Such functions are created in neural network modeling and in other methods of nonlinear statistical regression.
[0012]
Statistical learning theory for such global modeling has existed since the 1960s, but it came into use as a practical tool only after support vector machines were proposed by V. N. Vapnik et al. in the mid-1990s (see, for example, Non-Patent Documents 1 to 3).
[0013]
In general, process modeling based on statistical learning theory using such a support vector machine has the following characteristics.
[0014]
(1) Compared with other methods, the convergence speed is better.
(2) A unique solution can always be obtained.
[0015]
(3) It can be easily applied to various learning problems.
(4) Since the complexity of the problem is characterized by the number of support vectors, which is independent of the dimension of the problem, it is possible to escape from the so-called “curse of dimensionality”.
[0016]
Further, the above-described support vector machine is extended to support vector regression (hereinafter referred to as “SVR”) that can be used for learning regression problems. Such SVR is disclosed in Non-Patent Document 4.
[0017]
Furthermore, Non-Patent Document 5 discloses an attempt to apply such a support vector machine to the front-end nonlinear distortion compensator described above.
[0018]
[Non-Patent Document 1]
V. N. Vapnik: The Nature of Statistical Learning Theory, Springer-Verlag, New York (1995)
[0019]
[Non-Patent Document 2]
V. N. Vapnik: Statistical Learning Theory, John Wiley & Sons, New York (1998)
[0020]
[Non-Patent Document 3]
B. Schölkopf et al. (eds.), "Advances in Kernel Methods", The MIT Press, 1999.
[0021]
[Non-Patent Document 4]
N. Cristianini, J. Shawe-Taylor, "An Introduction to Support Vector Machines", Cambridge University Press, 2000.
[0022]
[Non-Patent Document 5]
Takanori Ito, Hiromitsu Omori, Akira Sano B-5-124 "Adaptive Nonlinear Distortion Compensation Using Support Vector Method" Proceedings of the 2001 IEICE General Conference, p. 522, 2001
[0023]
[Problems to be solved by the invention]
However, when a support vector machine is used in the front-end nonlinear distortion compensator of a digital signal receiving apparatus, the following points need to be considered.
[0024]
Here, among NLDs used for communication, an amplifier has the most influence on transmission characteristics.
[0025]
FIG. 20 is a diagram showing input / output characteristics of a general amplifier.
As shown in FIG. 20, the operation is linear in a region where the input signal is small, but the output signal saturates as the input signal increases.
[0026]
Usually, in order to reduce the influence of the nonlinear distortion of such an amplifier, a margin is set for the amplifier, and the input level to the amplifier is lowered accordingly. This margin is called “back-off”.
[0027]
The amount of back-off required depends on the modulation scheme. For example, with π/4-shift QPSK (Quadrature Phase Shift Keying) the back-off can be small because the amplitude fluctuation is small. With orthogonal frequency division multiplexing (hereinafter referred to as “OFDM”), however, a large back-off must be set because the amplitude fluctuation is large.
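The link between modulation scheme and back-off can be made concrete with the peak-to-average power ratio (PAPR), a standard measure of amplitude fluctuation that the patent text does not itself name. The sketch below computes the PAPR of a constant-envelope sequence and of a worst-case multicarrier symbol in which all subcarriers add in phase. Both test signals and the subcarrier count are illustrative assumptions.

```python
import cmath
import math


def papr_db(samples):
    """Peak-to-average power ratio of a complex sequence, in dB."""
    powers = [abs(s) ** 2 for s in samples]
    return 10.0 * math.log10(max(powers) / (sum(powers) / len(powers)))


def multicarrier_symbol(subcarriers):
    """Inverse DFT: time samples of a symbol with the given subcarrier values."""
    n = len(subcarriers)
    return [
        sum(c * cmath.exp(2j * math.pi * k * t / n)
            for k, c in enumerate(subcarriers)) / n
        for t in range(n)
    ]


# Constant-envelope phase-modulated samples: every sample has unit amplitude.
constant_envelope = [cmath.exp(1j * math.pi / 4 * k) for k in range(16)]
# Worst case for 16 subcarriers: identical values add coherently at t = 0.
ofdm_worst = multicarrier_symbol([1.0 + 0.0j] * 16)

print(papr_db(constant_envelope), papr_db(ofdm_worst))
```

A signal with a PAPR near 0 dB can be driven close to saturation, whereas a multicarrier signal needs headroom for its peaks unless the distortion is compensated.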
[0028]
However, a large back-off greatly degrades the power efficiency of the amplifier. It is therefore desirable to operate with as small a back-off as possible, even for signals with large amplitude fluctuations such as OFDM signals.
[0029]
However, if the back-off is set small, signals whose distortion is too large to be compensated by a front-end nonlinear distortion compensator using global modeling such as SVR will be input to the front-end nonlinear distortion compensator.
[0030]
Here, the inverse characteristic of the amplifier, that is, the characteristic of the pre-linear distortion compensator that operates ideally is as shown in FIG. Here, the output of the front-end nonlinear distortion compensator becomes very large when the input is large. This also means that the input to the amplifier itself is very large.
[0031]
However, even when such a large signal is input to the amplifier, the amplifier itself is saturated, so the front-end nonlinear distortion compensator can no longer compensate for the distortion. In this case, a system that learns the compensation operation, as SVR does, can no longer learn effectively.
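To see why compensation breaks down near saturation, consider the Rapp model, a widely used amplitude (AM/AM) model of a solid-state amplifier. It is used here purely as an illustrative stand-in: the patent does not name a specific amplifier model, and the saturation level `sat` and smoothness `p` below are arbitrary.

```python
import math


def rapp_amplifier(r, sat=1.0, p=2.0):
    """Rapp AM/AM model: output amplitude for input amplitude r >= 0."""
    return r / (1.0 + (r / sat) ** (2.0 * p)) ** (1.0 / (2.0 * p))


def rapp_inverse(u, sat=1.0, p=2.0):
    """Input amplitude that produces output amplitude u, or None if unreachable.

    The amplifier output never reaches `sat`, so no input exists for u >= sat.
    """
    if u >= sat:
        return None
    return u / (1.0 - (u / sat) ** (2.0 * p)) ** (1.0 / (2.0 * p))


for target in (0.5, 0.9, 0.99, 1.0):
    print(target, rapp_inverse(target))
```

In this model the drive level demanded by the ideal inverse grows steeply as the target output approaches saturation, and no drive level exists at or above it. Training samples taken in that region therefore carry no usable information about the inverse.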
[0032]
The present invention has been made to solve the problems described above. Its object is to provide a front-end nonlinear distortion compensator that can operate with as little back-off as possible even for signals with large amplitude fluctuations and that can compensate for distortion with high accuracy.
[0033]
[Means for Solving the Problems]
A front-end nonlinear distortion compensator according to an aspect of the present invention includes: a conversion support vector machine that converts an input signal using an inverse characteristic of a signal converter and supplies the converted signal to the signal converter; an estimation support vector machine that receives the converted output signal produced by the signal converter and, based on a signal vector composed of a sequence of the received converted output signal and on signal vectors composed of converted output signal sequences extracted in the past, estimates the input signal as converted by the conversion support vector machine, using a loss function having a first dead zone; coefficient updating means, connected to the conversion support vector machine and the estimation support vector machine, for updating the coefficients of the conversion support vector machine and the estimation support vector machine based on the output of the conversion support vector machine and the output of the estimation support vector machine; and data selection means that compares the input signal with a threshold value and, when the input signal is larger than the threshold value, does not adopt the data set containing that input signal as a support vector.
[0034]
Preferably, the coefficient updating means includes: extraction means that receives the converted output signal, takes as a support vector candidate any vector that is composed of a signal sequence of the newly received converted output signal and lies outside a second dead zone, adds the candidate to the set of vectors already extracted as support vectors, regards the resulting set as a new set of support vectors, and re-extracts from that set, as support vectors, the vectors lying outside the first dead zone; and calculation means that calculates coefficients for the extracted support vectors.
[0035]
Preferably, the threshold value is a predetermined constant.
Preferably, the data selection means sets a threshold value based on absolute values of a predetermined number of input signals in the past.
[0036]
Preferably, the data selection unit updates the threshold based on the absolute values of a predetermined number of past input signals within a predetermined period.
[0037]
Preferably, the signal converter is an amplifier.
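The data selection means can be pictured as a simple gate in front of the learning step. In the sketch below, a data set is passed on as a support vector candidate only when the magnitude of its input sample does not exceed the threshold. The adaptive variant, which sets the threshold to a fixed fraction of the largest magnitude among the last `window` inputs, is only one possible rule. The fraction, the window length, and all names are assumptions of this sketch, not values from the patent.

```python
from collections import deque


def select_data(inputs, outputs, threshold):
    """Keep only (input, output) pairs whose input magnitude is <= threshold."""
    return [(x, y) for x, y in zip(inputs, outputs) if abs(x) <= threshold]


class AdaptiveThreshold:
    """Threshold derived from the absolute values of recent inputs."""

    def __init__(self, window, fraction):
        self.history = deque(maxlen=window)  # magnitudes of the last `window` inputs
        self.fraction = fraction

    def update(self, x):
        """Record one input sample and return the current threshold."""
        self.history.append(abs(x))
        return self.fraction * max(self.history)


inputs = [0.2 + 0.1j, 1.5 + 0.0j, -0.4j, 0.9 + 0.9j]
outputs = [0.21 + 0.1j, 1.0 + 0.0j, -0.41j, 0.7 + 0.7j]
print(select_data(inputs, outputs, threshold=1.0))
```

With a threshold of 1.0, the second and fourth samples are discarded, so samples taken where the amplifier is likely saturated never enter the support vector set.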
[0038]
DETAILED DESCRIPTION OF THE INVENTION
Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings. In the following description, the same components are denoted by the same reference numerals, and their names and functions are also the same. Therefore, detailed description thereof will not be repeated.
[0039]
[General theory of learning regression problems with support vector machines]
In the following, the learning of regression problems using a support vector machine, which is the premise for explaining the present invention, will be summarized.
[0040]
1-1. Learning linear regression problems
1-1-1 Linear regression problem
First, a method for solving a linear regression problem from the standpoint of Support Vector Regression (SVR) based on the idea of Support Vector Machine (SVM) will be described. This problem can be formulated as follows.
[0041]
[Problem]
Suppose there is an unknown linear regression f: x → y with m inputs and one output, and that the training data
$$(x_1, y_1), \ldots, (x_n, y_n), \quad x_i \in \mathbb{R}^m,\ y_i \in \mathbb{R} \tag{1.1}$$
are given. The task is to find, from the training data, a function
$$f(x, w) = w^T x \tag{1.2}$$
that approximates this linear regression.
[0042]
1-1-2 Loss function
Here, the linear ε-insensitive loss function $L^{\varepsilon}(x, y, f(x, w))$ and the quadratic ε-insensitive loss function $L_2^{\varepsilon}(x, y, f(x, w))$ are defined as follows.
[0043]
[Expression 1]

FIG. 1 is a diagram showing the linear ε-insensitive loss function and the quadratic ε-insensitive loss function. FIG. 1A shows the linear ε-insensitive loss function, and FIG. 1B shows the quadratic ε-insensitive loss function.
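The equation image defining these losses is not reproduced in this text. In their standard textbook form, the linear ε-insensitive loss is max(0, |y − f(x)| − ε) and the quadratic ε-insensitive loss is its square. The sketch below assumes those definitions; the function names and sample values are chosen for illustration.

```python
def linear_eps_loss(y, prediction, eps):
    """Linear epsilon-insensitive loss: zero inside the dead zone |y - f| <= eps."""
    return max(0.0, abs(y - prediction) - eps)


def quadratic_eps_loss(y, prediction, eps):
    """Quadratic epsilon-insensitive loss: the square of the linear loss."""
    return linear_eps_loss(y, prediction, eps) ** 2


for error in (0.05, 0.1, 0.3):
    print(error, linear_eps_loss(error, 0.0, 0.1), quadratic_eps_loss(error, 0.0, 0.1))
```

Errors no larger than ε cost nothing under either loss, which is why data sets inside the dead zone do not become support vectors.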
[0044]
1-1-3 Regression problem with quadratic ε-insensitive loss function
In the following, support vector regression using the quadratic ε-insensitive loss is considered. The primal problem based on the support vector machine can be expressed as follows.
[0045]
[Expression 2]

This problem can be rewritten as follows:
[0046]
[Equation 3]

Where C is a regularization parameter that sets a trade-off between model complexity and constraint relaxation (training error).
[0047]
[Expression 4]

The following relationship is established according to the Karush-Kuhn-Tucker (KKT) condition for equation (1.10).
[0048]
[Equation 5]

Substituting these relations into equation (1.10) allows it to be rewritten as the following dual problem.
[0049]
[Formula 6]

Here, when the coefficients $\alpha_i$ are replaced as follows, expression (1.14) can finally be rewritten as expression (1.15) below. Further, by equation (1.11), the function $f(x, w) = w^T x$ approximating the linear regression is expressed by formula (1.16) below.
[0050]
[Expression 7]

FIG. 2 is a conceptual diagram showing the relationship between the linear approximation function and the support vector.
[0051]
In FIG. 2, “●” denotes a support vector, that is, data for which the magnitude of the error of the approximating function is larger than ε. “◯” denotes data for which the error of the approximating function is smaller than ε; such data do not directly affect the approximating function.
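The equation images for this derivation are not reproduced in this text. For orientation only, the textbook form of the problem for the quadratic ε-insensitive loss without a bias term (following the treatment in Non-Patent Document 4) is sketched below. The patent's own scaling constants and equation numbering may differ, and the symbols ξ, ξ̂, α̂ and β are notation assumed for this sketch.

```latex
% Primal problem (slack variables \xi_i, \hat{\xi}_i):
\min_{w,\,\xi,\,\hat{\xi}} \; \|w\|^2 + C \sum_{i=1}^{n} \left( \xi_i^2 + \hat{\xi}_i^2 \right)
\quad \text{s.t.} \quad
w^T x_i - y_i \le \varepsilon + \xi_i, \qquad
y_i - w^T x_i \le \varepsilon + \hat{\xi}_i .

% Dual problem after the substitution \beta_i = \hat{\alpha}_i - \alpha_i:
\max_{\beta} \; \sum_{i=1}^{n} y_i \beta_i
 - \varepsilon \sum_{i=1}^{n} |\beta_i|
 - \frac{1}{2} \sum_{i=1}^{n} \sum_{j=1}^{n} \beta_i \beta_j
   \left( x_i^T x_j + \frac{1}{C}\,\delta_{ij} \right),
\qquad
f(x) = \sum_{i=1}^{n} \beta_i \, x_i^T x .
```

In this form the regularization parameter C appears only as the term δ_ij / C added to the diagonal of the inner-product matrix, and the ε term penalizes every nonzero coefficient.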
[0052]
1-2 Learning nonlinear regression problems
1-2-1 Nonlinear regression problem
The above description covered linear regression using support vector regression (SVR). The following describes nonlinear regression using SVR, which builds on it. This nonlinear regression problem can be formulated as follows.
[Problem]
Suppose there is an unknown nonlinear regression f: x → y with m inputs and one output, and that the training data
$$(x_1, y_1), \ldots, (x_n, y_n), \quad x_i \in \mathbb{R}^m,\ y_i \in \mathbb{R} \tag{1.17}$$
are given. The task is to find, from the training data, a function $f(x, w) = w^T x$ that approximates this nonlinear regression. The quadratic ε-insensitive loss function is used as the loss function.
[0053]
1-2-2 Feature space
In general, to handle complex problems it is necessary to work in a space with greater expressive power than the input space. This approach is called mapping to a higher dimension.
[0054]
Here, to cope with the nonlinear regression problem, a nonlinear mapping from the input space to a high-dimensional feature space is introduced. The input vector x is mapped from the input space X to the feature space F, and the vector φ(x) in the feature space is treated in place of the input vector x used so far, so that the linear regression problem is handled in the feature space rather than in the input space. Here, φ: X → F is a nonlinear mapping from the input space to some feature space.
[0055]
This procedure is in essence equivalent to solving the nonlinear regression problem in the original input space.
[0056]
According to the Hilbert Schmidt theory, the inner product in the Hilbert space can be expressed as the following equivalent expression.
[0057]
[Equation 8]

When the linear problem is considered in the feature space as shown below, the dual problem of equation (1.21) is obtained, and the function $f(\varphi(x), w) = w^T \varphi(x)$ approximating the linear regression is given by equation (1.22). Equation (1.22) is linear in the feature space but nonlinear in the input space, and by the relationship of equation (1.20) it finally becomes equation (1.23). Furthermore, writing equation (1.23) in vector form gives equation (1.24), and solving this nonlinear regression problem yields the nonlinear approximating function of equation (1.25).
[0058]
[Equation 9]

Note that the kernel function $K(x_i, x_j)$ replaces the inner product in the feature space with a computation in the input space. Because neither the dual problem of equation (1.21) nor the approximating function obtained by solving it contains φ(x), the nonlinear mapping from the input space to the feature space never has to be computed in practice. This is called the “kernel trick”.
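A small numerical check makes the kernel trick concrete. For two-dimensional inputs, the homogeneous quadratic kernel K(x, z) = (x·z)² equals the inner product of the explicit feature vectors φ(x) = (x₁², √2·x₁x₂, x₂²). The choice of this particular kernel is an assumption made for illustration; the patent text at this point does not fix a kernel.

```python
import math


def quadratic_kernel(x, z):
    """K(x, z) = (x . z)^2, computed entirely in the 2-D input space."""
    return (x[0] * z[0] + x[1] * z[1]) ** 2


def feature_map(x):
    """Explicit map phi into the 3-D feature space matching quadratic_kernel."""
    return (x[0] ** 2, math.sqrt(2.0) * x[0] * x[1], x[1] ** 2)


def inner(a, b):
    """Ordinary inner product of two equal-length vectors."""
    return sum(p * q for p, q in zip(a, b))


x, z = (1.0, 2.0), (3.0, -1.0)
print(quadratic_kernel(x, z), inner(feature_map(x), feature_map(z)))
```

Both expressions agree up to rounding, yet only the second one ever forms the feature vectors. For kernels whose feature space is very large or infinite-dimensional, only the first form is practical.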
[0059]
Because of these features, support vector regression (SVR) can take the dimension of the feature space to be very large through this mapping to a higher dimension, and the complexity of the model and the dimension of the feature space can be controlled independently. This allows SVR to escape the so-called “curse of dimensionality”.
[0060]
FIG. 3 is a conceptual diagram showing a nonlinear approximating function and its support vectors. In FIG. 3 as well, “●” denotes a support vector, that is, data for which the magnitude of the error of the approximating function is larger than ε. “◯” denotes data for which the error of the approximating function is smaller than ε; such data do not directly affect the approximating function.
[0061]
1-2-3 Relationship between support vector and coefficient
In the SVR problem using the second-order ε insensitive loss function, the KKT conditions of the equations (1.11) to (1.13) are satisfied before the coefficient replacement from the equation (1.14) to the equation (1.15). After the replacement of the coefficients, the following relational expression is satisfied.
[0062]
[Expression 10]

Further, since the quadratic ε-insensitive loss function has a dead zone of width ε, any data set $(x_i, y_i)$ whose estimation error is smaller than ε in magnitude, that is, which satisfies equation (1.27) below, must be excluded from the support vectors. Therefore, the magnitude of the approximation error of the support vectors under the obtained solution is as shown in equation (1.28).
[0063]
[Expression 11]

Furthermore, since the Lagrangian function is set as shown in formula (1.10), the signs of formulas (1.29) and (1.30) below are equal. Therefore, the coefficient $\alpha_i$ of a support vector, being a solution of equation (1.23), satisfies equation (1.31).
[0064]
[Expression 12]

2-1 Method of the present invention for learning support vector regression
If the method described above were executed as it stands, solving the SVR problem would take a great deal of time when there is a large amount of training data. Therefore, in the present invention, to solve the SVR problem efficiently, a basic update formula is obtained by modifying the Newton-Raphson method, and a sequential-update algorithm using that update formula is adopted, as described below.
[0065]
2-1-1 Problem setting
Here, a method for solving the following problem using the second-order ε insensitive loss function will be described.
[0066]
[Problem]
Suppose there is an unknown nonlinear regression f: x → y with m inputs and one output, and that the training data
$$(x_1, y_1), \ldots, (x_n, y_n), \quad x_i \in \mathbb{R}^m,\ y_i \in \mathbb{R}$$
are given. To obtain, from the training data, the function of equation (2.1) that approximates this nonlinear regression, the maximization problem expressed by equation (2.2) is solved.
[0067]
[Formula 13]

In SVR, a dead zone is provided when evaluating an error. Therefore, f (x, α) obtained by solving this problem and the training data must have the following relationship.
[0068]
That is, when the data set $(x_i, y_i)$ is inside the dead zone, that is, when $|y_i - f(x_i, \alpha)| \le \varepsilon$, the coefficient is $\alpha_i = 0$.
[0069]
Conversely, when the data set $(x_i, y_i)$ is outside the dead zone, that is, when $|y_i - f(x_i, \alpha)| > \varepsilon$, the coefficient $\alpha_i$ satisfies equation (1.31) described above.
[0070]
In the learning of support vector regression described below, this relationship is taken into account by adopting a method that excludes unnecessary data sets from the support vectors.
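The relationship above can be sketched as follows: evaluate the approximating function, here assumed to take the usual kernel-expansion form f(x, α) = Σ_i α_i K(x_i, x), and then flag each data set as inside or outside the dead zone. The Gaussian kernel, its width, and the sample values are illustrative assumptions.

```python
import math


def gaussian_kernel(a, b, width=1.0):
    """Gaussian (RBF) kernel for scalar inputs."""
    return math.exp(-((a - b) ** 2) / (2.0 * width ** 2))


def predict(x, support_x, alpha, kernel=gaussian_kernel):
    """Kernel expansion f(x, alpha) = sum_i alpha_i * K(x_i, x)."""
    return sum(a * kernel(xi, x) for xi, a in zip(support_x, alpha))


def inside_dead_zone(x, y, support_x, alpha, eps):
    """True when |y - f(x, alpha)| <= eps, i.e. the data set needs no coefficient."""
    return abs(y - predict(x, support_x, alpha)) <= eps


support_x, alpha = [0.0, 2.0], [1.0, -0.5]
for x, y in [(0.0, 0.95), (1.0, 1.0)]:
    print((x, y), inside_dead_zone(x, y, support_x, alpha, eps=0.1))
```

Only the data sets flagged as outside the dead zone need to be kept as support vector candidates, which keeps the number of coefficients small.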
[0071]
2-1-2 Batch processing
In the following, a method will be described in which several data sets are extracted from n pieces of training data, and SVR is obtained using only the extracted data sets.
[0072]
(Modification of Newton-Raphson method)
Hereinafter, a method for solving the dual problem for the second-order ε-insensitive loss function by using the update equation by modifying the Newton-Raphson method will be described.
[0073]
When the maximization problem of equation (2.2) is expressed in vector, it becomes as in equation (2.3), and when the evaluation function W (α) is Taylor-expanded, it becomes as in equation (2.4).
[0074]
Then, assuming that W(α) can be approximated by a quadratic function in the vicinity of αi, the expansion in equation (2.4) above is truncated after the third term.
[0075]
[Expression 14]

The value α = αi giving the minimum of the quadratic approximation is given by the condition of equation (2.5) below, and the values α = αi giving such minima successively satisfy the relationship of equation (2.6) below (Newton-Raphson method).
[0076]
[Expression 15]

(How to update α (when the initial value is 0))
Consider the case where α is obtained by the Newton-Raphson method with the initial value $\alpha^0 = 0$. Setting $\alpha^0 = 0$ in equation (2.6) gives expressions (2.12) and (2.13) below. In the present invention, however, this is modified by setting ε = 0, as shown in expression (2.14).
[0077]
[Expression 16]

A further update using equation (2.6) then gives equations (2.15) and (2.16) below. In addition, when $\alpha_i^1 \neq 0$, formula (2.17) holds, so formula (2.16) can be expressed as formulas (2.18) and (2.19).
[0078]
[Expression 17]

When equation (2.20) below holds for the coefficient vector α thus obtained, equation (2.21) follows, and it can be seen that equation (1.31) above is satisfied.
[0079]
[Formula 18]

However, when the obtained solution does not satisfy the equation (2.20), the solution does not satisfy the equation (1.31) and is not suitable as a solution.
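As a sketch of how such an update can look, assume the dual objective has the standard form W(α) = yᵀα − ε Σ|α_i| − ½ αᵀHα with H = K + I/C, where K is the kernel matrix. The patent's equation images are not reproduced here, so this form and the two-step reading below are assumptions. The first step from α⁰ = 0 ignores ε (the sign of each α_i is still undefined), giving α¹ = H⁻¹y. The second step uses the signs of α¹, giving α² = H⁻¹(y − ε·sign(α¹)). The helper `solve` is a plain Gaussian elimination written for this example.

```python
def solve(matrix, rhs):
    """Solve matrix @ x = rhs by Gaussian elimination with partial pivoting."""
    n = len(rhs)
    aug = [row[:] + [b] for row, b in zip(matrix, rhs)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        tail = sum(aug[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (aug[r][n] - tail) / aug[r][r]
    return x


def sign(v):
    """Return -1, 0 or 1 according to the sign of v."""
    return (v > 0) - (v < 0)


def two_step_update(kernel_matrix, y, eps, c):
    """Return (alpha1, alpha2) for H = K + I/C as described in the lead-in."""
    n = len(y)
    h = [[kernel_matrix[i][j] + (1.0 / c if i == j else 0.0) for j in range(n)]
         for i in range(n)]
    alpha1 = solve(h, y)  # step 1: eps treated as 0
    shifted = [yi - eps * sign(a) for yi, a in zip(y, alpha1)]
    alpha2 = solve(h, shifted)  # step 2: signs fixed from alpha1
    return alpha1, alpha2


K = [[1.0, 0.5], [0.5, 1.0]]
alpha1, alpha2 = two_step_update(K, y=[1.0, -1.0], eps=0.1, c=10.0)
print(alpha1, alpha2)
```

When every element of α² keeps the sign it had in α¹, the residual of each data set equals ε·sign(α_i) + α_i/C, so its magnitude exceeds ε as required of a support vector. If a sign flips, the sign assumption used in step 2 is violated, which appears to correspond to the unsuitable case mentioned above.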
[0080]
(Removal method of excess data set)
Next, let us consider the case where equation (2.20) is not satisfied.
[0081]
If equation (2.20) is not satisfied, the obtained solution does not satisfy equation (1.31) and hence does not satisfy the KKT conditions, so it is not a solution. In such a case, an extra data set $(x_i, y_i)$ is regarded as being present among the current support vector candidates, and that extra data set is removed from the current support vector candidates.
[0082]
A method for selecting the extra data set is described below.
First, using the above update formula, the coefficient vector $\alpha_{n_{sv}}$ (initial value: 0) is calculated based on equation (2.22) below.
[0083]
Next, given that every element of the coefficient vector $\alpha_{n_{sv}}$ is nonzero ($\alpha_i \neq 0$, $i = 1, \ldots, n_{sv}$), a step size η (0 < η ≦ 1) is introduced into update formula (2.6) to create a new update formula, formula (2.23) below.
[0084]
[Equation 19]

The coefficient vector $\alpha_{n_{sv}}$ is updated using this update formula, within the range in which the sign of each element of $\alpha_{n_{sv}}$ does not change between before and after the update.
[0085]
Next, how to determine the step size will be described.
The closer the step size is to 1, the larger the evaluation function becomes. Therefore, it is desirable that the step size is as close to 1 as possible. Therefore, the step size is obtained as follows.
[0086]
First, the step size vector is calculated by the following equation (2.24).
[0087]
[Expression 20]

Here, the step size vector is $\eta = [\eta_1, \ldots, \eta_{n_{sv}}]^T$, and $\eta_i$ is the step size for which updating the coefficient vector $\alpha_{n_{sv}}$ by update formula (2.23) with $\eta = \eta_i$ gives $\alpha_i = 0$.
[0088]
Next, the update is performed using as the step size the smallest element $\eta_i$ of the step size vector η that is 0 or more. The update then gives $\alpha_i = 0$, so the data set $(x_i, y_i)$ is regarded as an extra data set and is removed from the support vectors, as shown in equation (2.25) below.
[0089]
[Expression 21]

As a result, the number of support vectors decreases ($n_{sv} \leftarrow n_{sv} - 1$).
[0090]
After the number of support vectors has decreased, the coefficient vector $\alpha_{n_{sv}}$ is updated again by update formula (2.23). As before, the step size η is obtained from equation (2.24) so that the update stays within the range in which the sign of each element of $\alpha_{n_{sv}}$ does not change between before and after the update.
[0091]
If the smallest element $\eta_i$ of the step size vector η that is 0 or more is itself 1 or more, this indicates that the sign of each element of the coefficient vector $\alpha_{n_{sv}}$ does not change when it is updated with η = 1 in update formula (2.23).
[0092]
In this case, the coefficient vector $\alpha_{n_{sv}}$ is determined. Conversely, if the smallest element $\eta_i$ of the step size vector η that is 0 or more is smaller than 1, the coefficient vector $\alpha_{n_{sv}}$ is updated by update formula (2.23) with step size $\eta = \eta_i$ as before, and the data set $(x_i, y_i)$ is removed from the support vectors. This process is repeated until the coefficient vector $\alpha_{n_{sv}}$ is determined.
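One pass of the removal procedure can be sketched under an assumption about the damped update, namely that formula (2.23) moves the coefficients a fraction η of the way toward the full Newton solution α*, that is, α ← α + η(α* − α). Under that assumption, the step that drives element i exactly to zero is η_i = α_i / (α_i − α*_i). The sketch also assumes, as in the text, that all current coefficients are nonzero. The function names and sample values are hypothetical.

```python
def step_sizes(alpha, target):
    """eta_i that makes alpha_i + eta_i * (target_i - alpha_i) equal zero."""
    sizes = []
    for a, t in zip(alpha, target):
        # An element that does not move never reaches zero: use infinity.
        sizes.append(a / (a - t) if a != t else float("inf"))
    return sizes


def damped_update(alpha, target):
    """Take the largest step (at most 1) that keeps every sign unchanged.

    Returns the updated coefficients and the index of the element driven to
    zero (the surplus data set), or None when the full step eta = 1 is taken.
    """
    sizes = step_sizes(alpha, target)
    candidates = [(eta, i) for i, eta in enumerate(sizes) if eta >= 0.0]
    eta, index = min(candidates) if candidates else (float("inf"), None)
    if eta >= 1.0:
        return list(target), None
    updated = [a + eta * (t - a) for a, t in zip(alpha, target)]
    updated[index] = 0.0  # remove rounding residue on the zeroed element
    return updated, index


print(damped_update([1.0, 2.0], [-1.0, 3.0]))
print(damped_update([1.0, 2.0], [0.5, 3.0]))
```

In the first call the first coefficient would change sign under a full step, so the step is cut to 0.5 and that data set is reported for removal. In the second call no sign changes, so the full step is taken and the coefficient vector is settled.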
[0093]
2-1-3 Sequential processing
In the above description, a method has been described in which several data sets are extracted from n pieces of training data, and the SVR is obtained only from the extracted data sets. However, since the inverse matrix is calculated in batch processing, the number of data to be handled is limited. In order to avoid such a problem, the sequential processing will be described next.
[0094]
Here, suppose that an approximating function by SVR such as formula (2.26) below already exists. A method of updating this SVR approximating function using new training data $(x_s, y_s)$ is described.
[0095]
[Expression 22]

First, it is checked whether the new training data $(x_s, y_s)$ is inside the dead zone.
[0096]
If equation (2.27) below holds, the data is inside the dead zone and the SVR need not be updated. If the new training data $(x_s, y_s)$ is outside the dead zone, that is, if equation (2.28) below holds, the SVR needs to be updated.
[0097]
[Expression 23]

Therefore, consider the case where the new training data $(x_s, y_s)$ is outside the dead zone.
[0098]
At this time, $(x_1, y_1), \ldots, (x_{n_{sv}}, y_{n_{sv}}), (x_s, y_s)$ are regarded as the support vectors.
[0099]
When the coefficient vector α is to be updated using formula (2.23) shown earlier, the coefficient $\alpha_s$ of the new training data $(x_s, y_s)$ has not yet been determined, so its initial value is set to 0.
[0100]
With this initial value, however, $\mathrm{sign}(\alpha_s) = 0$ when formula (2.23) is evaluated, and the KKT conditions are not satisfied if the coefficient vector α is updated as it stands.
[0101]
Therefore, the update formula (2.23) is changed as follows.
[0102]
[Expression 24]

Next, consider updating α using update formula (2.29). Here too, so that the update stays within the range in which the sign of each element of the coefficient vector $\alpha_{n_{sv}}$ does not change between before and after the update, equation (2.24) is changed as follows to obtain the step size η.
[0103]
[Expression 25]

Thereafter, α is updated as in the batch processing, and α is determined.
[0104]
In practice, the present invention uses an algorithm for obtaining SVR for a case where a large amount of training data sets are provided, as will be described in the following processing of the predistortion system.
[0105]
This algorithm is a calculation method that combines batch processing and sequential processing.
[0106]
[Configuration not taking into account saturation characteristics of non-linear devices (NLD)]
FIG. 4 is a schematic block diagram showing a front-end nonlinear distortion compensation system 1000, which serves as a premise for understanding the configuration of the front-end nonlinear distortion compensation system of the present invention.
[0107]
In the example shown in FIG. 4, the distortion of the output of the amplifier 16 is compensated by a front-end nonlinear distortion compensator.
[0108]
Referring to FIG. 4, the front-end nonlinear distortion compensation system 1000 includes: a down converter 2 that converts a radio frequency signal $S_{IN}$ into an intermediate frequency signal; an A/D converter 4, connected to the down converter 2, that samples the intermediate frequency signal output from the down converter 2 and converts it into a digital signal; a quadrature detector 6, connected to the A/D converter 4, that converts the sampled intermediate frequency signal into a baseband signal x(t); an SVM 54, connected to the quadrature detector 6, that converts the signal x(t) into the real part Re{y(t)} of a baseband signal y(t) using the inverse characteristic of an amplifier 16 described later; an SVM 58, connected to the quadrature detector 6, that converts the signal x(t) into the imaginary part Im{y(t)} of the baseband signal y(t) using the inverse characteristic of the amplifier 16; a quadrature modulator 10, connected to the SVMs 54 and 58, that converts the converted baseband signal y(t) into an intermediate frequency signal; and a D/A converter 12, connected to the quadrature modulator 10, that converts the intermediate frequency signal output from the quadrature modulator 10 into an analog signal.
[0109]
The present invention is also applicable to a configuration in which the baseband signal output unit of a radio transmitter is connected in place of the receiving unit 1 comprising the down converter 2, the A/D converter 4 and the quadrature detector 6 in FIG. 4.
[0110]
The front-end nonlinear distortion compensation system 1000 further includes: an up converter, connected to the D/A converter 12, that converts the analog intermediate frequency signal into a radio frequency signal $S_0$; the amplifier 16, which amplifies the radio frequency signal $S_0$; a down converter 18, connected to the amplifier 16, that converts the output signal $S_{OUT}$ of the amplifier 16 into an intermediate frequency signal; an A/D converter 20, connected to the down converter 18, that converts the intermediate frequency signal into a digital signal; and a quadrature detector 22, connected to the A/D converter 20, that converts the digital signal into a baseband signal z(t).
[0111]
The front-end nonlinear distortion compensation system 1000 further includes: an SVM 26 that receives the signal z(t) output from the quadrature detector 22 and the baseband signal vectors $V_z(p_i)$ ($i = 1, \ldots, N_R$) used for learning, and outputs an estimated value Re{y′(t)} of the real part of the baseband signal output from the SVM 54; an SVM 62 that receives the signal z(t) output from the quadrature detector 22 and the baseband signal vectors $V_z(q_j)$ ($j = 1, \ldots, N_I$), and outputs an estimated value Im{y′(t)} of the imaginary part of the baseband signal output from the SVM 58; a subtractor 30, connected to the SVM 54 and the SVM 26, that calculates the difference between the estimated value Re{y′(t)} of the real part of the baseband signal and the real part Re{y(t)} of the baseband signal y(t); a subtractor 64, connected to the SVM 58 and the SVM 62, that calculates the difference between the estimated value Im{y′(t)} of the imaginary part of the baseband signal and the imaginary part Im{y(t)} of the baseband signal y(t); an m·d determining unit 44, connected to the quadrature detector 6 and the quadrature detector 22, that determines the size m of the signal vectors and support vectors and the delay time d caused by the amplifier 16; a coefficient updating unit 42, connected to the subtractor 30, the SVM 54, the SVM 26, the quadrature detector 22 and the m·d determining unit 44, that updates the coefficients $\alpha_R(i)$ of the function having the inverse characteristic of the amplifier 16 in the SVM 54 and the SVM 26, based on the output of the subtractor 30, the output of the quadrature detector 22 and the output of the m·d determining unit 44; and a coefficient updating unit 66, connected to the subtractor 64, the SVM 58, the SVM 62, the quadrature detector 22 and the m·d determining unit 44, that updates the coefficients $\alpha_I(j)$ of the function having the inverse characteristic of the amplifier 16 in the SVM 58 and the SVM 62, based on the output of the subtractor 64, the output of the quadrature detector 22 and the output of the m·d determining unit 44.
[0112]
The SVM 26 obtains an estimated value Re {y ′ (t)} of the real part of the baseband signal according to a method described later. The SVM 62 obtains an estimated value Im {y ′ (t)} of the imaginary part of the baseband signal according to the same method as the SVM 26.
[0113]
The coefficient updating unit 42 controls the coefficients $\alpha_R(i)$ so that the error $e_R(t)$ between Re{y(t)} and Re{y′(t)} is minimized. Similarly, the coefficient updating unit 66 controls the coefficients $\alpha_I(j)$ so that the error $e_I(t)$ between Im{y(t)} and Im{y′(t)} is minimized.
[0114]
The quadrature detector 6 may instead be realized in analog form, in which case the output of the down converter 2 is received by the quadrature detector 6, the output of the quadrature detector 6 is converted to digital form by the A/D converter 4, and the result is supplied to the SVM 8. Likewise, the quadrature modulator 10 may be realized in analog form, in which case the outputs of the SVM 54 and the SVM 58 are received by the D/A converter 12, the output of the D/A converter 12 is received by the quadrature modulator 10, and the output of the quadrature modulator 10 is supplied to the up-converter 12. Further, the quadrature detector 22 may be realized in analog form, in which case the output of the down converter 18 is received by the quadrature detector 22, the output of the quadrature detector 22 is received by the A/D converter 20, and the output of the A/D converter 20 is supplied to the SVM 26 and the SVM 62.
[0115]
(Operation of Front-End Nonlinear Distortion Compensation System 1000)
Hereinafter, the operation of the front-end nonlinear distortion compensation system 1000 will be described.
[0116]
First, the signals S_IN, S_0, and S_OUT shown in the figure are represented by equations (1) to (3), respectively.
[0117]
S_IN = Re{x(t) exp(j2πf_RF t)}
: Input signal to the front-end nonlinear distortion compensator (1)
S_0 = Re{y(t) exp(j2πf_RF t)}
: Distortion-compensated signal (2)
S_OUT = Re{z(t) exp(j2πf_RF t)}
: Output signal from amplifier 16 (3)
Here, x(t), y(t), and z(t) are the complex envelopes (baseband signals) of the band signals S_IN, S_0, and S_OUT, respectively; j is the imaginary unit, f_RF is the radio frequency, and t is time.
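As a minimal illustration of the relation between a complex envelope and its band signal in equations (1) to (3), the following Python sketch evaluates Re{x(t) exp(j2πf_RF t)} for a single sample. The function name `band_signal` and the numerical values are hypothetical and chosen only for this example.

```python
import cmath
import math


def band_signal(envelope, f_rf, t):
    """Return Re{envelope * exp(j*2*pi*f_rf*t)} for one complex envelope sample."""
    return (envelope * cmath.exp(1j * 2.0 * math.pi * f_rf * t)).real


# Hypothetical values, for illustration only.
x = 0.5 + 0.5j      # complex envelope sample
f_rf = 1.0e3        # carrier frequency in Hz

s_at_zero = band_signal(x, f_rf, 0.0)              # carrier phase 0: equals Re{x}
s_at_quarter = band_signal(x, f_rf, 0.25 / f_rf)   # carrier phase pi/2: equals -Im{x}
```

At carrier phase 0 the band signal equals the real part of the envelope, and a quarter period later it equals the negative of the imaginary part, which is why the real and imaginary parts can be handled by separate support vector machines in the baseband.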
[0118]
[Operation principle of nonlinear distortion compensator]
Let P = |y(t)|², and let the input/output characteristic of the amplifier 16 in the baseband domain be f(P), where f(P) is a complex function. The relationship between the baseband signals y(t) and z(t) can then be expressed by the following equation (4).
[0119]
Furthermore, the general objective of a nonlinear distortion compensator is to obtain an input/output characteristic g(P) that satisfies the relationship of the following equation (5) with respect to f(P). If the input/output characteristic g(P) can be obtained, the following equations (6) and (7) hold, and S_OUT can be made linear with respect to S_IN.
[0120]
[Equation 26]

In the front-end nonlinear distortion compensation system 1000, g(P) is estimated by the SVM 26 and the coefficient updating unit 42, and by the SVM 62 and the coefficient updating unit 66. Specifically, the coefficients of the functions of the SVM 26 and the SVM 62 are updated so that the error e(t) between y(t) and y′(t), which is estimated from z(t), is minimized for the real part and the imaginary part, respectively. If this calculation converges, the characteristic approximated by the SVM 26 and the SVM 62 corresponds to the inverse characteristic of the amplifier 16.
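The principle can be sketched numerically. Equations (4) to (7) are not reproduced in this text, so the sketch below assumes the common memoryless baseband model z = y·f(|y|²) and a target of unit linear gain (z = x). The gain function `amp_gain` is a hypothetical toy amplifier, and the fixed-point iteration is only a stand-in for the inverse characteristic; it is not the SVM-based estimation described in this document.

```python
import cmath


def amp_gain(p):
    """Hypothetical complex gain f(P): gain compression plus a power-dependent phase shift."""
    return cmath.exp(1j * 0.1 * p) / (1.0 + 0.25 * p)


def amplifier(y):
    """Assumed baseband amplifier model: z = y * f(|y|^2)."""
    return y * amp_gain(abs(y) ** 2)


def predistort(x, iterations=50):
    """Find y with y * f(|y|^2) = x by fixed-point iteration (valid for small |x|)."""
    y = x
    for _ in range(iterations):
        y = x / amp_gain(abs(y) ** 2)
    return y


x = 0.4 + 0.3j          # input baseband sample, |x| = 0.5
y = predistort(x)       # distortion-compensated sample
z = amplifier(y)        # amplifier output, expected to reproduce x
z_uncompensated = amplifier(x)
```

With the compensation, the amplifier output z reproduces x, whereas feeding x directly into the toy amplifier yields a compressed and phase-rotated output.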
[0121]
[Inverse characteristic function calculation]
The conditions for calculating the inverse characteristic function as described above are summarized below.
・ Calculations for nonlinear distortion compensation are performed on discrete signals rather than on time-continuous signals. In a hardware implementation, an A/D converter converts the continuous signal into a discrete signal, and after the calculation the D/A converter restores the discrete signal to a continuous signal.
・ Signal processing is performed not on the band system but on the baseband system or the intermediate frequency signal. In the case of a baseband system, the amplifier 16 can be regarded as a system that receives a complex baseband signal y (t) equivalently and outputs a complex signal z (t).
[0122]
[Description of operation of support vector machine]
At time kT (T: sampling interval of A / D conversion and D / A conversion), the output Re {y (kT)} of the support vector machine SVM 54 and the output Im {y (kT)} of the SVM 58, and the support vector machine The output Re {y ′s (kT)} of the SVM 26 and the output Im {y ′s (kT)} of the SVM 62 are obtained by the following equations (8) to (11).
[0123]
[Expression 27]

Here, the variable prefix “V_” indicates that the variable is a vector; m is the size of the signal vector and of the support vectors described later; n_R and n_I are the numbers of support vectors of the support vector machines for the real part and the imaginary part, respectively; p_i is the time at which the i-th support vector was acquired; and q_j is the time at which the j-th support vector was acquired.
[0124]
V_z(p_i T) (i = 1, ..., n_R) and V_z(q_j T) (j = 1, ..., n_I) are the support vectors extracted by time kT as a result of learning the input/output characteristics of the amplifier 16.
[0125]
α_R(i) and α_I(j) are the weighting coefficients (real numbers) for V_z(p_i T) and V_z(q_j T), respectively. In general, when α_R(i) (or α_I(j)) is zero, the corresponding V_z(p_i T) (or V_z(q_j T)) need not be included in the calculation and may be discarded. The number of non-zero coefficients α_R(i) (or α_I(j)) is called the dimension of the support vector machine. For example, if the number of non-zero coefficients is 73, the support vector machine has 73 dimensions. Furthermore, the number of dimensions does not increase beyond a certain value even if the amount of learning data is increased. This upper bound is called the VC (Vapnik and Chervonenkis) dimension.
[0126]
K(V_x, V_y) is a function called an “inner product kernel.” In the front-end nonlinear distortion compensation system 1000, the goal is to make equations (8) to (11), which are linear sums of inner product kernels, match the inverse characteristic g(P) of the input/output characteristic of the amplifier 16 by selecting appropriate coefficients α_R(i) and α_I(j). Problems of this kind, in which the input/output function of a system is fitted to an unknown function, are generally called “regression problems.” The following equations (15) to (17) are examples of commonly used inner product kernels.
[0127]
[Expression 28]

Which inner product kernel is most suitable for expressing the inverse function varies depending on the system. In practice, the best characteristic is selected after performing several tests before operation. Of course, a function other than the above may be used as long as the inverse function g (P) can be estimated better and the expected effect can be obtained.
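The “linear sum of inner product kernels” can be sketched as follows. Equations (8) to (11) and (15) to (17) are not reproduced in this text, so the Gaussian and polynomial kernels, the absence of a bias term, and all names (`gaussian_kernel`, `polynomial_kernel`, `svm_output`, `dimension`) are assumptions made for illustration rather than the forms used in this document.

```python
import math


def gaussian_kernel(u, v, sigma=1.0):
    """Gaussian kernel on complex vectors (an assumed example kernel)."""
    d2 = sum(abs(a - b) ** 2 for a, b in zip(u, v))
    return math.exp(-d2 / (2.0 * sigma ** 2))


def polynomial_kernel(u, v, degree=3):
    """Polynomial kernel using the real part of the Hermitian inner product (assumed)."""
    inner = sum((complex(a) * complex(b).conjugate()).real for a, b in zip(u, v))
    return (1.0 + inner) ** degree


def svm_output(alphas, support_vectors, v, kernel):
    """Linear sum of kernels: sum_i alpha_i * K(sv_i, v); zero coefficients are skipped."""
    return sum(a * kernel(sv, v)
               for a, sv in zip(alphas, support_vectors) if a != 0.0)


def dimension(alphas):
    """Number of non-zero coefficients, called the dimension in the text."""
    return sum(1 for a in alphas if a != 0.0)


# Hypothetical support vectors of size m = 1 and real-valued coefficients.
alphas = [0.5, 0.0, -0.25]
support_vectors = [(1 + 0j,), (0 + 1j,), (-1 + 0j,)]
out = svm_output(alphas, support_vectors, (1 + 0j,), gaussian_kernel)
```

Because the second coefficient is zero, its support vector contributes nothing and the machine in this example has dimension 2.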
[0128]
[Description of operation of coefficient updating unit 42]
In the following, the operations of the coefficient updating unit 42 and the coefficient updating unit 66, which perform the update calculation of the coefficients α_R(i) and α_I(j) corresponding to the support vectors, will be described.
[0129]
As described above, the coefficient updating unit 42 controls the coefficient α_R(i) so that the error e_R(t) between Re{y(t)} and Re{y′(t)} is minimized. The coefficient updating unit 66, in turn, controls the coefficient α_I(j) so that the error e_I(t) between Im{y(t)} and Im{y′(t)} is minimized.
[0130]
[Expression 29]

In the following description, T = 1 is set for simplicity of description. Even in this way, generality is not lost.
[0131]
The coefficient α_R(i) is obtained by solving the optimization problem of maximizing the evaluation function W expressed by the following equation (20). The same applies to the coefficient α_I(j).
[0132]
[30]

Here, the constants C and ε are usually determined experimentally.
[0133]
Until learning of the input/output characteristics of the amplifier 16 is completed, support vectors are repeatedly added and deleted, and the values of the coefficients α_R(i) and α_I(j) must be updated accordingly.
[0134]
Hereinafter, the operation of the coefficient updating unit 42 will be described as an example. The operation of the coefficient updating unit 66 is basically the same as that of the coefficient updating unit 42 except that the input variables are different.
[0135]
As will be described in detail later, one feature of the update calculation of the coefficient updating unit 42 is that the value of the coefficient α_R(i) is updated sequentially as follows.
(1) At some point during the update calculation, the coefficient updating unit 42 holds n−1 pairs of support vectors V_z(p_i) and coefficients α_R(i).
(2) The coefficient updating unit 42 further acquires data V_z(p_n) as a candidate for the n-th support vector.
(3) V_z(p_n) is added as a support vector, with its coefficient set to α_R(n) = 0.
(4) To compensate for the deviation (perturbation) from the optimal solution caused by the addition of V_z(p_n), the correction (perturbation) Δα_R(i) to α_R(i) (i = 1, ..., n) is calculated.
[0136]
By extracting the support vectors sequentially as described above, the number of coefficients to be calculated in one coefficient update can be reduced, so that the amount of calculation can be reduced.
[0137]
[Specific calculation method for coefficient update]
A specific example of the coefficient updating method is shown below. This is only one possible realization, and the method need not be implemented in exactly this way.
[0138]
FIGS. 5 and 6 are flowcharts for explaining a specific calculation method for updating the coefficients.
[0139]
Referring to FIG. 5, the coefficient updating unit 42 first takes out n data sets (V_z(p_i), Re{ys(p_i)}) (i = 1 to n) and adopts them as support vectors (step S100).
[0140]
Next, the coefficient updating unit 42 calculates a first approximation of the coefficients corresponding to the support vectors, together with the sign vector S_R, by the following equations (21) and (22) (step S102).
[0141]
[31]

Furthermore, the step size vector [η_R(1), ..., η_R(n)]^T is calculated by the following equations (24) and (25) (step S104).
[0142]
[Expression 32]

The smallest non-negative element of the step size vector is taken as η_R(ν) (step S106), and it is determined whether the element η_R(ν) is 1 or more (step S108).
[0143]
First, if the element η_R(ν) is less than 1, the coefficient vector [α_R(1), ..., α_R(n)]^T is updated using the element η_R(ν) as follows (step S110).
[0144]
[Expression 33]

Then, (V_z(p_ν), Re{ys(p_ν)}) is removed from the support vectors as a redundant data set, and the coefficient α_R(ν) is also deleted (step S112). Following this deletion, for i > ν, the indices of the acquisition time p_i of the i-th learning data and of the coefficient α_R(i) are renumbered according to the following table.
[0145]
[Table 1]

Further, (n−1) is newly taken as n (step S114). After this renumbering, the process returns to step S104.
[0146]
On the other hand, when the element η_R(ν) is 1 or more in step S108, the coefficient vector is calculated again by the following equation (27) (step S116).
[0147]
[Expression 34]

Next, referring to FIG. 6, a new learning data set (V_z (k), Re {ys (k)}) is acquired (step S118).
[0148]
Further, the error e_R is calculated by the following equation (28) (step S120). Subsequently, the error e_R is compared with the threshold ε′ (step S122), and if |e_R| < ε′, the process returns to step S118. Here, ε ≦ ε′.
[0149]
[Expression 35]

On the other hand, if |e_R| ≧ ε′, the signs of the coefficients α_R(1), ..., α_R(n) and of the error e_R are calculated according to the following equations (29) to (30), and the sign vector is replaced (step S124). Then [α_R(1), ..., α_R(n), 0] is taken as the new coefficient vector, and (n+1) is newly taken as n (step S126).
[0150]
[Expression 36]

(V_z(k), Re{ys(k)}) is adopted as the n-th support vector (V_z(p_n), Re{ys(p_n)}) (step S128), and the process returns to step S104.
[0151]
Through the above processing, the coefficient updating unit 42 sequentially updates the coefficient vector adaptively.
[0152]
[Description of operation of support vector machines SVM 54 and 58]
Using the coefficient α_R(i) updated by the coefficient updating unit 42, the SVM 54 converts the baseband signal x(t) into the baseband signal Re{y(t)} according to the above-described equation (8). Following the same procedure, the support vector machine SVM 58 uses the coefficient α_I(j) updated by the coefficient updating unit 66 to convert the baseband signal x(t) into the baseband signal Im{y(t)} according to the above-described equation (9).
[0153]
[Description of characteristic procedures in sequential computation]
As described above with reference to FIGS. 5 and 6, a configuration has been described in which a support vector machine is used to solve a regression problem for a process whose variables are represented by complex numbers, thereby realizing the front-end nonlinear distortion compensation system. The characteristic procedures in the processing performed by the coefficient updating unit 42 are described further below.
[0154]
FIG. 7 is a conceptual diagram showing approximate characteristics of learning data based on a support vector in a state where approximate learning by a support vector machine has advanced to some extent with respect to a one-input one-output nonlinear regression f: x → y.
[0155]
In FIG. 7, “loss” refers to the loss with respect to the approximate characteristic under an ε-insensitive quadratic loss function. That is, “loss” is defined as the value obtained by subtracting, from the learning data value, the nearer of the two dead zone boundary values, which are obtained by adding and subtracting the one-sided dead zone width to and from the approximate characteristic.
[0156]
In FIG. 7, double circles represent data that are support vectors, and single circles represent data that is present inside the dead zone for the approximate characteristics of the support vectors and are not support vectors. If the data represented by double circles in FIG. 7 is a true support vector according to the definition of the support vector, no support vector exists outside the true support vector.
[0157]
Here, for example, for support vector data D1 = (x₁, y₁) located above the approximate characteristic given by the support vectors and outside the dead zone, the loss L1 is expressed by the following equation.
[0158]
L1 = α₁ / C
Here, α₁ is the coefficient for the support vector x₁. The loss of the data D1 is directed “upward and outward,” corresponding to the fact that this data lies above the approximate characteristic given by the support vectors, and α₁ > 0.
[0159]
On the other hand, for support vector data D2 = (x₂, y₂) located below the approximate characteristic given by the support vectors and outside the dead zone, the loss L2 is likewise expressed as L2 = α₂ / C.
[0160]
In this case, the loss of the data D2 is directed “downward and outward,” corresponding to the fact that this data lies below the approximate characteristic given by the support vectors, and α₂ < 0.
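The sign convention for the loss can be illustrated with a short Python sketch. It assumes that the loss is the signed distance from the data value to the nearer dead zone boundary (zero inside the dead zone) and that the coefficient is recovered as α = C × loss, which follows from L = α / C above. The function names are hypothetical.

```python
def signed_loss(y, f_x, eps):
    """Signed distance from y to the nearer dead zone boundary f_x +/- eps (0 inside)."""
    e = y - f_x
    if e > eps:
        return e - eps      # data above the dead zone: positive loss
    if e < -eps:
        return e + eps      # data below the dead zone: negative loss
    return 0.0              # data inside the dead zone: not a support vector


def coefficient_from_loss(loss, c):
    """Coefficient implied by the relation L = alpha / C."""
    return c * loss


# Hypothetical values: approximate characteristic f(x) = 1.0, eps = 0.1, C = 10.
loss_above = signed_loss(1.5, 1.0, 0.1)    # data like D1
loss_below = signed_loss(0.5, 1.0, 0.1)    # data like D2
loss_inside = signed_loss(1.05, 1.0, 0.1)  # single-circle data
alpha_above = coefficient_from_loss(loss_above, 10.0)
```

Data above the dead zone thus yields a positive coefficient, data below it a negative coefficient, and data inside it a zero coefficient.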
[0161]
(Setting of the value of ε ′ in step S122)
It is possible to perform the calculation even if the value of ε ′ is the same as the value ε that defines the original dead zone. However, in the present invention, the value of ε ′ is set larger than the value ε that defines the original dead zone.
[0162]
Therefore, based on the above preparation, first, the reason why the value of ε ′ is made larger than the value ε that defines the original dead zone in step S122 of FIG. 6 will be described.
[0163]
FIG. 8 is a conceptual diagram for explaining the effect of making the value of ε ′ larger than the value ε that defines the original dead zone.
[0164]
When ε is replaced with ε ′ in step S122 with respect to the original dead zone determined by ε, the region determined by ε ′ is defined as an “extended dead zone” with respect to the original dead zone.
[0165]
The extended dead zone is used only in step S122, and the dead zone in the other steps is determined by ε.
[0166]
In FIG. 8, it is assumed that the approximate characteristic based on the support vector in the middle of learning is represented by a straight line LSV, and the approximate characteristic in a state where the approximation is further advanced than this approximate characteristic is represented by a curve CSV. However, the reason that the “approximate characteristic by the support vector during learning” is a straight line is for convenience of explanation, and more generally, this characteristic is also expressed by a curve.
[0167]
First, for the approximate characteristic LSV, when the extended dead zone is defined by the threshold ε′ (= ε), the data D3, which does not ultimately become a support vector, is nevertheless selected as a support vector at the stage of the approximate characteristic LSV. The same holds when the extended dead zone is defined by a threshold ε′(1) larger than the threshold ε.
[0168]
On the other hand, for the approximate characteristic LSV, when the extended dead zone is defined by a threshold ε′(2) that is larger than the threshold ε′(1), the data D3 is not selected as a support vector even at the stage of the approximate characteristic LSV.
[0169]
As described above, in the present invention, in order to reduce the amount of calculation, data that is no longer selected as a support vector during the sequential calculation is excluded from the calculation target.
[0170]
Therefore, if the threshold ε′ used in step S122 of FIG. 6 is set in advance to a value larger than the threshold ε corresponding to the original dead zone, the probability that data which would later be excluded as the calculation proceeds is selected as a support vector in the first place is reduced, so that a further reduction in the amount of calculation can be expected.
[0171]
That is, learning is completed through repeated cycles in which an older data set, once extracted as a support vector, becomes unnecessary as learning (the addition of new support vectors) progresses and is removed from the support vectors. By reducing this repeated addition and removal of support vectors, the amount of calculation can be reduced and the time until the end of learning can be shortened.
[0172]
It is also possible to bring the threshold ε′ (≧ ε) closer to ε as the frequency with which new support vectors are added decreases. In this case, learning quickly reaches a certain degree of approximation accuracy, and the final accuracy ε can still be reliably obtained.
[0173]
In addition, the coefficient updating unit 42 and the coefficient updating unit 66 may include a counter that counts the number of learning data received from the quadrature detector 22, and may update the threshold ε′ so that it approaches ε every time a certain number of data has been received.
[0174]
Alternatively, the coefficient updating unit 42 and the coefficient updating unit 66 may include a counter that measures time or a counter that counts the number of extracted support vectors, and may update the threshold ε′ so that it approaches ε according to the length of the processing period.
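A counter-based schedule of this kind might be sketched as follows. The geometric shrink rule, the class name `ThresholdSchedule`, and its parameters are assumptions made for illustration; the text only requires that ε′ stay at or above ε and approach it as learning proceeds.

```python
class ThresholdSchedule:
    """Counts received learning data and moves eps' toward eps every `interval` samples."""

    def __init__(self, eps, eps_prime_start, shrink=0.5, interval=100):
        if eps_prime_start < eps:
            raise ValueError("eps' must not be smaller than eps")
        self.eps = eps
        self.eps_prime = eps_prime_start
        self.shrink = shrink        # fraction of the remaining gap kept at each update
        self.interval = interval    # number of samples between updates
        self.count = 0

    def observe(self):
        """Register one received sample and return the current eps'."""
        self.count += 1
        if self.count % self.interval == 0:
            self.eps_prime = self.eps + self.shrink * (self.eps_prime - self.eps)
        return self.eps_prime


# Hypothetical settings: eps = 0.01, initial eps' = 0.05, update every 2 samples.
schedule = ThresholdSchedule(eps=0.01, eps_prime_start=0.05, shrink=0.5, interval=2)
history = [schedule.observe() for _ in range(4)]
```

A time-based or support-vector-count-based variant would only change what `observe` counts.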
[0175]
(Criteria for deleting support vectors)
Further, the criteria for deleting the support vector in steps S110 to S114 in FIG. 5 will be further described.
[0176]
FIG. 9 is a conceptual diagram for explaining a procedure for removing an extra support vector when the approximate characteristic is sequentially calculated from the state shown in FIG.
[0177]
In the following description, it is assumed that, in the state shown in FIG. 8, the data D3 is selected as a support vector because it lies outside the dead zone determined by the threshold ε and the approximate characteristic LSV, and that, as the approximation proceeds from that state, the data D3 is removed from the support vectors while the data D1 is newly added as a support vector.
[0178]
FIG. 10 is a conceptual diagram for explaining a change in loss for data D3 in FIG.
[0179]
In this example, the coefficient corresponding to the data D3 under the approximate characteristic represented by the straight line LSV is denoted α_i(1), and the coefficient corresponding to the data D3 under the approximate characteristic represented by the curve CSV is denoted α_i(2).
[0180]
Corresponding to the fact that the data D3 lies outside the dead zone boundary of the straight line LSV, the loss of the data D3 with respect to the straight line LSV is α_i(1) / C (α_i(1) > 0).
[0181]
On the other hand, corresponding to the fact that the data D3 lies inside the dead zone boundary of the curve CSV, the loss of the data D3 with respect to the curve CSV is α_i(2) / C (α_i(2) < 0).
[0182]
Therefore, if the amount of change in loss that results when the approximation progresses and the approximate characteristic given by the support vectors changes from the straight line LSV (more generally, a curve) to the curve CSV is expressed as −Δα_i / C, the step size η used in step S108 is expressed as η = −α_i(1) / Δα_i.
[0183]
At this time, if 0 ≦ η < 1, the data D3 lies between the dead zone boundary of the straight line LSV (more generally, a curve) and the dead zone boundary of the curve CSV, as shown in the figure. In other words, it can be determined that the data D3 has ceased to be a support vector as a result of the approximate characteristic given by the support vectors changing from the straight line LSV (more generally, a curve) to the curve CSV. The above description covers the case where the data is on the upper side of the approximate characteristic, but the same applies when the data is on the lower side.
[0184]
Therefore, by determining based on the criteria as in step S108 of FIG. 5, it is possible to determine whether or not the data that has been selected as the support vector so far should be excluded from the support vector.
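The decision in steps S106 to S116 can be sketched as below. Equations (24) to (27) are not reproduced in this text, so the sketch rests on several assumptions: Δα is treated as an additive correction (α_new = α + Δα), η(i) = −α(i) / Δα(i) is the fraction of that correction at which α(i) reaches zero, the partial update is α + η(ν)·Δα, and coefficients that are still zero (newly added candidates) are not considered for removal. The function name `apply_step` is hypothetical.

```python
def apply_step(alpha, delta):
    """Apply a correction to the coefficients, removing one support vector if its
    coefficient would cross zero before the full step is taken.

    Returns (new_alpha, removed_index); removed_index is None when nothing is removed.
    """
    candidates = []
    for i, (a, d) in enumerate(zip(alpha, delta)):
        if a == 0.0 or d == 0.0:
            continue                      # assumed: skip new candidates and fixed entries
        eta = -a / d
        if eta >= 0.0:
            candidates.append((eta, i))
    if candidates:
        eta_nu, nu = min(candidates)      # smallest non-negative step size (step S106)
        if eta_nu < 1.0:                  # step S108: coefficient nu reaches zero early
            moved = [a + eta_nu * d for a, d in zip(alpha, delta)]
            del moved[nu]                 # steps S110 to S114: drop it and renumber
            return moved, nu
    return [a + d for a, d in zip(alpha, delta)], None   # step S116: full step


# Hypothetical coefficients; the last entry is a newly added candidate with alpha = 0.
partial, removed = apply_step([0.6, 0.2, 0.0], [-0.3, -0.4, 0.5])
full, none_removed = apply_step([0.6, 0.2], [-0.3, -0.1])
```

In the first call the second coefficient reaches zero at η = 0.5, so only half of the correction is applied and that support vector is dropped; in the second call every η is at least 1 and the full correction is applied.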
[0185]
In the above example, the support vector machine is used to correct the output of the amplifier 16, whose nonlinear input/output characteristic is unknown. However, the application of the present invention is not limited to the amplifier 16. More generally, the present invention can be applied to any signal conversion circuit whose input signal can be expressed as a complex number and whose input/output characteristics are unknown.
[0186]
As described above, according to the front-end nonlinear distortion compensation system 1000, a signal having the inverse characteristic of the amplifier 16 is estimated using the SVM 26 and the SVM 62. Based on the signal and the signals output from the SVM 54 and SVM 58, the coefficient updating unit 42 and the coefficient updating unit 66 update the coefficient of the function having the inverse characteristic of the amplifier 16.
[0187]
Since this configuration does not require a look-up table, the noise floor can be kept low, and the configuration can be applied when the input is a complex signal. Therefore, a front-end nonlinear distortion compensator can be provided with a simple configuration. Further, the support vector machine is simpler to compute and converges faster than the pth-order predistorter. Further, since support vectors that have become unnecessary during the learning process are deleted from among the signal vectors once added as support vectors, a front-end nonlinear distortion compensator capable of high-speed processing can be provided.
[0188]
When a wideband signal is input, it is possible to compensate by appropriately setting the data vector size m even when the nonlinearity of the amplifier is narrower than the signal band.
[0189]
[Front-End Nonlinear Distortion Compensation System 2000 of the Embodiment of the Present Invention]
The configuration and operation of the front-end nonlinear distortion compensation system 1000 shown in FIG. 4 have been described above.
[0190]
Although the front-end nonlinear distortion compensation system 1000 has the above-described characteristics, there are the following problems in the amplifier having the input / output characteristics as shown in FIG.
[0191]
That is, when the input to the amplifier is large, the output is suppressed to a certain value due to the saturation characteristic of the amplifier. Since the saturated output becomes the input of the estimation SVMs 26 and 62, the output of the estimation SVMs 26 and 62 is smaller than the output of the conversion SVMs 54 and 58.
[0192]
Therefore, the difference between the outputs of the estimation SVMs 26 and 62 and the outputs of the conversion SVMs 54 and 58 is very small when the input signal is small, but becomes large due to the influence of nonlinear distortion when the input is large.
[0193]
In the method of the front-end nonlinear distortion compensation system 1000 shown in FIG. 4, each training data set is adopted as a support vector when the difference between the two SVM outputs for it is larger than the preset dead zone ε′. For training data with an excessively large amplitude, the difference between the two SVM outputs is certain to exceed the dead zone ε′, so such data is adopted as a support vector. All support vectors are then updated by this training data, and the approximation accuracy of the regression function deteriorates. When several such training data sets are adopted, the accuracy of the regression function deteriorates further and, in some cases, the calculation diverges.
[0194]
FIG. 11 is a schematic block diagram for explaining the configuration of the front-end nonlinear distortion compensation system 2000 of the present invention.
[0195]
Compared to the front-end nonlinear distortion compensation system 1000 shown in FIG. 4, the front-end nonlinear distortion compensation system 2000 further includes a data selection unit 70. Since the other configuration of the front-end nonlinear distortion compensation system 2000 is the same as that of the front-end nonlinear distortion compensation system 1000, the same portions are denoted by the same reference numerals, and description thereof will not be repeated.
[0196]
In FIG. 11, the nonlinear device 16 is assumed to be an amplifier having input / output characteristics as shown in FIG.
[0197]
Referring to FIG. 11, the data selection unit 70 compares the input baseband signal x(t) with a threshold TH described later, and when the signal x(t) is larger than the threshold, the data set including that signal x(t) is not adopted as a support vector.
[0198]
FIG. 12 is a schematic block diagram for explaining the configuration of the data selection unit 70.
Referring to FIG. 12, the data selection unit 70 includes: a selection unit 702 for selecting a path over which the baseband signal x(t) is transmitted; a memory 704 for receiving from the selection unit 702 and storing at least a designated number N_TH of baseband signals x(t); a threshold determination unit 706 for calculating the threshold TH from the baseband signals x(t) stored in the memory 704; and a comparison unit 708 for comparing the baseband signal x(t) from the selection unit 702 with the threshold TH and selectively supplying baseband signals x(t) that are less than or equal to the threshold TH to the coefficient updating units 42 and 66.
[0199]
Here, the threshold value determination unit 706 calculates the threshold value TH based on the following equation.
[0200]
[Expression 37]

Here, in the above equation, K1 and N_TH are constants specified in advance.
[0201]
Note that the baseband signals x(t) used to calculate the threshold TH may, for example, be stored in the memory 704 in at least the number N_TH at fixed time intervals, and the threshold determination unit 706 may update the threshold TH at those fixed time intervals.
[0202]
Further, the method of determining the threshold TH is not limited to the method described above. For example, the threshold TH may be a predetermined constant, or it may be updated by multiplying the maximum of the absolute values of past baseband signals x(t) by a predetermined value K2, as in the following equation.
[0203]
[Formula 38]

That is, the threshold TH for selecting the baseband signal x (t) in the data selection unit 70 can be arbitrarily set from the outside. Alternatively, the threshold value TH in the data selection unit 70 can be set or updated by a predetermined calculation from a past input signal.
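The two threshold rules and the selection step can be sketched in Python as follows. The text states that the threshold is K1 times the average of the absolute values of N_TH stored samples, or alternatively K2 times the maximum absolute value of past samples. Comparing the magnitude |x(t)| of the complex sample against TH, the tuple layout of a data set, and all function names are assumptions made for this illustration.

```python
def threshold_from_mean(samples, k1):
    """TH = K1 * average of |x| over the stored samples."""
    return k1 * sum(abs(x) for x in samples) / len(samples)


def threshold_from_max(samples, k2):
    """TH = K2 * maximum of |x| over past samples."""
    return k2 * max(abs(x) for x in samples)


def select_data_sets(data_sets, th):
    """Keep data sets whose input sample magnitude does not exceed TH.

    Each data set is assumed to be a tuple whose first element is the input x.
    """
    return [d for d in data_sets if abs(d[0]) <= th]


# Hypothetical stored inputs with magnitudes 0.5, 1.0 and 2.0.
stored = [0.3 + 0.4j, 1.0 + 0.0j, 0.0 - 2.0j]
th_mean = threshold_from_mean(stored, k1=1.2)
th_max = threshold_from_max(stored, k2=0.5)
kept = select_data_sets([(x, None) for x in stored], th_mean)
```

With these values the sample of magnitude 2.0 exceeds the mean-based threshold and is withheld from the coefficient updating units.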
[0204]
Next, the operation of the front-end nonlinear distortion compensation system 2000 shown in FIG. 11 will be described. Since this operation is basically the same as that of the front-end nonlinear distortion compensation system 1000 described with reference to FIGS. 5 and 6, the following description focuses on the differences from the operation of the front-end nonlinear distortion compensation system 1000.
[0205]
FIGS. 13 and 14 are flowcharts for explaining the coefficient updating operation of the front-end nonlinear distortion compensation system 2000.
[0206]
First, referring to FIG. 13, N sets of data sets are acquired in order to perform first-order approximation of support vector coefficients. At that time, the corresponding baseband signal x (t) is also acquired (step S200).
[0207]
Next, the data selection unit 70 determines the threshold TH (step S201). Here, for example, of the methods described above, the threshold is assumed to be the average of the absolute values of N_TH baseband signals x(t) multiplied by the coefficient K1.
[0208]
Next, the baseband signal x(t) is compared with the threshold TH for each data set. If the input signal is larger than the threshold TH, the data set including that data is regarded as one that degrades the accuracy of the regression function and is not adopted as a support vector (step S202).
[0209]
Let the number of remaining data sets be n, and let the data sets be (V_z(p_i), Re{ys(p_i)}) (i = 1 to n). The coefficient updating unit 42 calculates a first approximation of the coefficients corresponding to the support vectors, together with the sign vector S_R, in the same manner as in step S102 in FIG. 5 (step S203).
[0210]
Hereinafter, processing similar to steps S104 to S116 in FIG. 5 is executed as steps S204 to S216 in FIG.
[0211]
Next, referring to FIG. 14, a new learning data set (V_z (k), Re {ys (k)}) is acquired (step S218). In addition, the baseband signal x (t) of the corresponding input signal is also acquired.
[0212]
If the baseband signal x (t) of the input signal at that time is equal to or greater than the threshold value TH (step S219), the process returns to step S218 without adopting the data set as a support vector.
[0213]
On the other hand, when the baseband signal x(t) of the input signal is smaller than the threshold TH in step S219, the error e_R is calculated by the above equation (28) (step S220).
Hereinafter, the same processing as steps S122 to S128 in FIG. 6 is executed as steps S222 to S228 in FIG.
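The order of the two checks in FIG. 14 can be summarized in a short sketch: the input magnitude is tested against TH first (step S219), and only then is the error tested against ε′ (step S222). The function name `should_adopt` and the use of |x(t)| for the comparison are assumptions made for illustration.

```python
def should_adopt(x, error, th, eps_prime):
    """Return True if a new learning data set should be added as a support vector."""
    if abs(x) >= th:
        return False                  # step S219: input too large, data set skipped
    return abs(error) >= eps_prime    # step S222: adopt only outside the extended dead zone


# Hypothetical values with TH = 0.8 and eps' = 0.05.
too_large = should_adopt(0.9 + 0.0j, 0.5, th=0.8, eps_prime=0.05)
inside_dead_zone = should_adopt(0.3 + 0.0j, 0.01, th=0.8, eps_prime=0.05)
adopted = should_adopt(0.3 + 0.0j, 0.2, th=0.8, eps_prime=0.05)
```

A large-amplitude sample is rejected even when its error is large, which is exactly the case that destabilized the system without the data selection unit.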
[0214]
Through the above processing, the coefficient updating unit 42 sequentially updates the coefficient vector adaptively.
[0215]
Using the coefficient α_R(i) updated by the coefficient updating unit 42, the SVM 54 converts the baseband signal x(t) into the baseband signal Re{y(t)} according to the above-described equation (8). Following the same procedure, the support vector machine SVM 58 uses the coefficient α_I(j) updated by the coefficient updating unit 66 to convert the baseband signal x(t) into the baseband signal Im{y(t)} according to the above-described equation (9).
[0216]
[Simulation results]
Hereinafter, simulation results will be shown in order to show the effects of the present invention.
[0217]
In the following simulation, it is assumed that NLD is an amplifier and its input / output characteristics are expressed by the following equation.
[0218]
[39]

As other conditions, the input signal is an OFDM signal, the number of carriers is 1405, and the FFT size is 2048.
[0219]
The signal power to noise power ratio (hereinafter referred to as “SNR”) is 100 (dB). The input power is 1, and the threshold TH is fixed at 0.8.
[0220]
The number of data sets is 2000. In addition, m = 1. Further, d is assumed to be known.
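To get a feel for these conditions, the following sketch generates one OFDM symbol with 1405 active carriers and an FFT size of 2048, normalizes it to unit power, and measures the fraction of samples whose magnitude exceeds TH = 0.8. The QPSK mapping, the carrier placement around DC, the random seed, and all function names are assumptions; the source does not state the modulation used in the simulation, and this is not a reproduction of it.

```python
import cmath
import random


def ifft(spectrum):
    """Recursive radix-2 inverse FFT without 1/N scaling (length must be a power of two)."""
    n = len(spectrum)
    if n == 1:
        return list(spectrum)
    even = ifft(spectrum[0::2])
    odd = ifft(spectrum[1::2])
    out = [0j] * n
    for k in range(n // 2):
        tw = cmath.exp(2j * cmath.pi * k / n) * odd[k]
        out[k] = even[k] + tw
        out[k + n // 2] = even[k] - tw
    return out


def ofdm_symbol(n_carriers=1405, fft_size=2048, seed=0):
    """One OFDM symbol with random QPSK carriers (assumed), scaled to unit mean power."""
    rng = random.Random(seed)
    spectrum = [0j] * fft_size
    half = n_carriers // 2
    for c in range(-half, n_carriers - half):      # active carriers centred on DC
        spectrum[c % fft_size] = complex(rng.choice((-1, 1)), rng.choice((-1, 1)))
    samples = ifft(spectrum)
    power = sum(abs(s) ** 2 for s in samples) / fft_size
    scale = power ** -0.5
    return [s * scale for s in samples]


symbol = ofdm_symbol()
mean_power = sum(abs(s) ** 2 for s in symbol) / len(symbol)
fraction_above_th = sum(1 for s in symbol if abs(s) > 0.8) / len(symbol)
```

Because an OFDM signal has a high peak-to-average power ratio, a noticeable share of samples exceeds a fixed threshold below the peak amplitude, and these are the samples that the data selection unit withholds from learning.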
[0221]
FIG. 15 shows the signal spectrum after the input signal is passed through the amplifier 16 with a back-off of 6 (dB). That is, FIG. 15 shows a simulation result of the characteristics when the input signal is amplified by the amplifier 16 without a front-end nonlinear distortion compensation system.
[0222]
In FIG. 15, the horizontal axis represents frequency carrier numbers, and the vertical axis represents power. Under the influence of nonlinear distortion, the SNR has deteriorated to about 20 (dB).
[0223]
FIG. 16 shows the signal spectrum obtained under the same conditions with the pre-linear distortion compensation system 1000, in which the data selection unit 70 is not provided.
[0224]
The spectrum outside the signal band is higher than that of the amplifier 16 alone, and the SNR is reduced. This is because a data set containing an excessively large signal as an element has been adopted as a support vector. Once such a data set is adopted, the regression function can no longer accurately approximate the inverse characteristic of the amplifier. Every subsequent data set is then adopted as a support vector, so the number of support vectors increases. At the same time, the support vector coefficients become extremely large and eventually diverge.
[0225]
FIG. 17 is a diagram illustrating the support vector coefficients in the state of FIG. 16. In FIG. 17, the horizontal axis represents the support vector number, and the vertical axis represents the support vector coefficient value. In this case, the number of support vectors is very large, about 1450, and the amount of calculation is enormous.
[0226]
FIG. 18 is a diagram showing the signal spectrum obtained with the pre-linear distortion compensation system 2000 of the present invention, which is equipped with the data selection unit 70. The spectrum outside the signal band is lower than that of the amplifier 16 alone, and the SNR is improved to about 30 (dB). This is considered to be because data sets containing an excessively large signal as an element, which cause divergence, were not adopted as support vectors.
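To put the decibel figures quoted for FIG. 15 and FIG. 18 in linear terms, the short sketch below converts them to power ratios. The helper name `db_to_power_ratio` is introduced here for illustration only.

```python
def db_to_power_ratio(db):
    """Convert a level in dB to a linear power ratio."""
    return 10.0 ** (db / 10.0)


snr_amplifier_only = db_to_power_ratio(20.0)   # about 20 dB, amplifier 16 alone
snr_with_selection = db_to_power_ratio(30.0)   # about 30 dB, with data selection
improvement = snr_with_selection / snr_amplifier_only
backoff = db_to_power_ratio(6.0)               # 6 dB backoff as a power ratio
```

An improvement from about 20 dB to about 30 dB corresponds to roughly a tenfold increase in the signal-to-noise power ratio, and a 6 dB backoff corresponds to a power ratio of about 4.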
[0227]
Therefore, even when the back-off is small, the influence of nonlinear distortion can be reduced by the method according to the present invention.
[0228]
FIG. 19 is a diagram illustrating the support vector coefficients in FIG. The number of support vectors is 71, and it can be confirmed that the number of support vectors is significantly reduced as compared with the pre-linear distortion compensation system 1000. At the same time, the amount of calculation has been remarkably reduced due to the smaller kernel matrix.
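The effect of the smaller support vector set on the kernel matrix can be checked with simple arithmetic. The sketch below assumes a square kernel matrix with one row and one column per support vector; the function name `kernel_matrix_entries` is hypothetical.

```python
def kernel_matrix_entries(n_support_vectors):
    """Number of entries in an n x n kernel matrix."""
    return n_support_vectors ** 2


without_selection = kernel_matrix_entries(1450)  # system 1000, about 1450 SVs
with_selection = kernel_matrix_entries(71)       # system 2000, 71 SVs
reduction = without_selection / with_selection
```

Under this assumption the kernel matrix shrinks from 2,102,500 entries to 5,041 entries, a reduction by a factor of more than 400.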
[0229]
The embodiment disclosed this time should be considered as illustrative in all points and not restrictive. The scope of the present invention is defined by the terms of the claims, rather than the description above, and is intended to include any modifications within the scope and meaning equivalent to the terms of the claims.
[0230]
[Effects of the invention]
As described above, according to the present invention, a signal having the inverse characteristic of a signal conversion circuit such as an amplifier is estimated using a support vector machine. No look-up table is required, so the noise floor can be kept low.
[0231]
In addition, since the present invention can be applied when the input is a complex signal, it is possible to provide a pre-linear distortion compensator with a simple configuration.
[0232]
Furthermore, in the learning process, unnecessary support vectors are deleted from the signal vectors once added to the support vectors, so that high-speed processing is possible.
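The deletion of unnecessary support vectors can be sketched as a two-stage test with two dead zones, following the wording of the claims: a new vector becomes a candidate only if it lies outside the wider dead zone ε′, and the enlarged set is then re-screened against the original dead zone ε. The function name `update_support_set`, the `residual` callable, and the choice to re-screen only when a candidate is added are assumptions of this sketch; in the actual procedure the coefficients are also recalculated, which is omitted here.

```python
def update_support_set(support_set, new_item, residual, eps, eps_prime):
    """Two-stage support vector selection (illustrative only).

    residual(item) returns the approximation error of the current
    regression function for that item; eps_prime is assumed to be >= eps.
    """
    if abs(residual(new_item)) <= eps_prime:
        return list(support_set)          # inside the second dead zone: ignore
    candidates = list(support_set) + [new_item]
    # Re-extract: keep only the vectors that lie outside the first dead zone.
    return [item for item in candidates if abs(residual(item)) > eps]


# Toy example: items are (input, target) pairs and the model outputs 0.
residual = lambda item: item[1] - 0.0
current = [(0, 0.5), (1, 0.05)]
unchanged = update_support_set(current, (2, 0.15), residual, eps=0.1, eps_prime=0.2)
pruned = update_support_set(current, (2, 0.3), residual, eps=0.1, eps_prime=0.2)
```

In the toy example the first new item falls inside the wider dead zone and is ignored, while the second is added and the re-screening removes the stored vector whose error is already within ε.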
[0233]
Further, according to the present invention, in a pre-linear distortion compensator that uses only SVR, excessively large signals that cannot be compensated are deleted from the training data set, so that the accuracy of the regression function can be maintained.
[0234]
Furthermore, according to the present invention, the calculation is stabilized by deleting the data set, and the number of support vectors can be reduced at the same time, so that the calculation amount can be greatly reduced.
[Brief description of the drawings]
FIG. 1 is a diagram illustrating a linear ε insensitive loss function and a second order ε insensitive loss function.
FIG. 2 is a conceptual diagram showing a relationship between a linear approximation function and a support vector.
FIG. 3 is a conceptual diagram showing a nonlinear approximation function and a support vector.
FIG. 4 is a schematic block diagram for explaining a configuration of a pre-linear distortion compensation system 1000 according to the present invention.
FIG. 5 is a first flowchart for explaining a specific calculation method of coefficient update.
FIG. 6 is a second flowchart for explaining a specific calculation method of coefficient update.
FIG. 7 is a conceptual diagram showing approximate characteristics of learning data by a support vector in a state where approximate learning by a support vector machine has progressed to some extent.
FIG. 8 is a conceptual diagram for explaining the effect of making the value of ε ′ larger than the value ε that defines the original dead zone.
FIG. 9 is a conceptual diagram for explaining a procedure for removing an extra support vector when successive calculation of approximate characteristics is advanced.
FIG. 10 is a conceptual diagram for explaining a change in loss for data D3 in FIG. 9;
FIG. 11 is a schematic block diagram for explaining a configuration of a pre-linear distortion compensation system 2000 according to the present invention.
FIG. 12 is a schematic block diagram for explaining a configuration of a data selection unit 70.
FIG. 13 is a first flowchart for explaining the coefficient update operation of the pre-linear distortion compensation system 2000.
FIG. 14 is a second flowchart for explaining the coefficient update operation of the pre-linear distortion compensation system 2000.
FIG. 15 is a signal spectrum after an input signal is passed through an amplifier 16 with backoff 6 (dB).
FIG. 16 shows a signal spectrum by a pre-linear distortion compensation system 1000 in which a data selection unit 70 is not provided.
FIG. 17 is a diagram illustrating support vector coefficients in the state of FIG. 16.
FIG. 18 is a diagram showing a signal spectrum by a pre-linear distortion compensation system 2000 of the present invention equipped with a data selection unit 70.
FIG. 19 is a diagram illustrating support vector coefficients in FIG.
FIG. 20 is a diagram illustrating input / output characteristics of a general amplifier.
[Explanation of symbols]
2, 18: down converter; 4, 20: A/D converter; 6, 22: quadrature detector; 54, 58, 26, 62: SVM; 10: quadrature modulator; 12: D/A converter; 14: up converter; 16: amplifier; 30, 64: subtractor; 42, 66: coefficient updating unit; 70: data selection unit; 1000, 2000: pre-linear distortion compensation system.

Claims

1. A pre-linear distortion compensator comprising:
a conversion support vector machine that converts an input signal using the inverse characteristic of a signal converter and supplies the converted signal to the signal converter;
an estimation support vector machine that receives the converted output signal converted by the signal converter and, based on a signal vector constituted by the converted output signal sequence and signal vectors constituted by converted output signal sequences extracted in the past, estimates the input signal after conversion by the conversion support vector machine, using a loss function having a first dead zone;
coefficient updating means, connected to the conversion support vector machine and the estimation support vector machine, for updating the coefficients of the conversion support vector machine and the estimation support vector machine based on the output of the conversion support vector machine and the output of the estimation support vector machine; and
data selection means for comparing the input signal with a threshold value and, when the input signal is larger than the threshold value, not adopting a data set including the input signal as a support vector,
wherein the data selection means sets the threshold value based on the absolute values of a predetermined number of past input signals.

2. The pre-linear distortion compensator according to claim 1, wherein the coefficient updating means includes:
extraction means for receiving the converted output signal, extracting, as a support vector candidate, a vector that is composed of the newly received signal sequence of the converted output signal and lies outside a second dead zone, adding the candidate to the set of vectors already extracted as support vectors, regarding the resulting set as a new set of support vectors, and re-extracting from that set, as support vectors, the vectors that lie outside the first dead zone; and
calculation means for calculating coefficients for the extracted support vectors.

3. The pre-linear distortion compensator according to claim 1, wherein the data selection means updates the threshold value based on the absolute values of a predetermined number of past input signals within a predetermined period.

4. The pre-linear distortion compensator according to any one of claims 1 to 3, wherein the signal converter is an amplifier.