WO2020137641A1 - Restoration device, restoration method, and program - Google Patents

Restoration device, restoration method, and program

Info

Publication number
WO2020137641A1
Authority
WO
WIPO (PCT)
Prior art keywords
signal
complexity
maximum number
clip signal
program
Prior art date
Application number
PCT/JP2019/049084
Other languages
French (fr)
Japanese (ja)
Inventor
江村 暁
Original Assignee
日本電信電話株式会社 (Nippon Telegraph and Telephone Corporation)
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 日本電信電話株式会社 (Nippon Telegraph and Telephone Corporation)
Publication of WO2020137641A1 publication Critical patent/WO2020137641A1/en

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208 Noise filtering
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208 Noise filtering
    • G10L21/0216 Noise filtering characterised by the method used for estimating noise
    • G10L21/0232 Processing in the frequency domain
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/27 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique
    • G10L25/30 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique using neural networks


Landscapes

  • Engineering & Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Quality & Reliability (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • Circuit For Audible Band Transducer (AREA)
  • Auxiliary Devices For Music (AREA)

Abstract

A waveform restoration process is performed reliably even under conditions in which the usable amount of calculation is limited. The waveform restoration device 10 restores a pre-clip signal by repeatedly estimating, from a post-clip signal, a pre-clip signal of a prescribed complexity while increasing the complexity by a prescribed update amount. A parameter determination unit 22 determines a maximum number of repetitions on the basis of the post-clip signal. A waveform restoration unit 12 repeatedly estimates the pre-clip signal, using the maximum number of repetitions as the maximum number of estimations.

Description

Restoration device, restoration method, and program
The present invention relates to a technique for restoring the signal before clipping from the signal after clipping.
When a signal is input to or output from a device, any portion whose amplitude exceeds the input/output range of the device is clipped to a constant value. Clipping can occur in a wide range of situations, for example when acquiring a signal from a sensor, when outputting a signal to some piece of equipment, or when feeding an analog signal into an A/D converter for digitization. Studies have therefore been made on restoring the signal waveform before clipping from the clipped signal.
As such a method, a method called SPADE (SParse Audio DEclipper) has been proposed (Non-Patent Document 1). SPADE is described below.
The symbols "-" and "^" used in the text should properly be written directly above the immediately preceding character, but owing to the limitations of plain-text notation they are written immediately after that character. In the mathematical formulas these symbols appear in their proper position, that is, directly above the character. For example, "z-" is written in the formulas as z̄, and "z^" is written in the formulas as ẑ.
The original signal (the signal before clipping) is represented by the signal vector x = [x1, …, xN], and the clipped signal is represented by the signal vector y = [y1, …, yN]. Each pair of samples of the signal before and after clipping satisfies the relationship of equation (1):

$$y_n = \begin{cases} \theta, & x_n \ge \theta \\ x_n, & |x_n| < \theta \\ -\theta, & x_n \le -\theta \end{cases} \qquad (1)$$
Here, θ is the clipping level. Each sample of the clipped signal belongs to one of three sets: the samples S+ clipped at the upper limit, the samples Sr that are not clipped, and the samples S- clipped at the lower limit.
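For reference, the clipping model of equation (1) and the index sets S+, Sr, and S- can be written compactly in NumPy as in the following sketch; the function names and the example sine wave are illustrative and are not part of the original disclosure.

```python
import numpy as np

def clip_signal(x, theta):
    """Clip each sample of x to the range [-theta, theta], as in equation (1)."""
    return np.clip(x, -theta, theta)

def clip_index_sets(y, theta):
    """Split the sample indices of a clipped frame y into the sets
    S+ (clipped at the upper limit), Sr (not clipped) and
    S- (clipped at the lower limit)."""
    s_plus = np.where(y >= theta)[0]
    s_minus = np.where(y <= -theta)[0]
    s_r = np.where(np.abs(y) < theta)[0]
    return s_plus, s_r, s_minus

# Example: a sine wave clipped at theta = 0.6
x = np.sin(2 * np.pi * 5 * np.arange(256) / 256)
theta = 0.6
y = clip_signal(x, theta)
s_plus, s_r, s_minus = clip_index_sets(y, theta)
print(len(s_plus), len(s_r), len(s_minus))
```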
In SPADE, a dictionary matrix D is first defined. Attention is then paid to the signal expression vector z obtained by applying the inverse matrix D^-1 of the dictionary matrix D to the signal vector x, and the complexity of the signal is measured by the number of non-zero elements of z, that is, by the L0 norm ||z||_0 of z. A DFT matrix (Discrete Fourier Transform matrix), a DCT matrix (Discrete Cosine Transform matrix), or the like is used as the dictionary matrix D.
SPADE denotes the complexity of the signal before clipping (hereinafter also simply the "complexity") by k, and takes a predetermined update amount s as the initial value of the complexity k. First, the input signal, that is, the clipped signal y, is converted by D^-1 into a signal expression vector z. The k elements of z with the largest absolute values are kept and all other elements are set to zero, which converts z into a signal expression vector z- of complexity k. This operation is called hard thresholding and is written z- = Hk(z) (corresponding to step 2 of Table 1 below). Next, D is applied to this signal expression vector z- to obtain the estimated signal vector x- = Dz-, which is the estimate of the pre-clipping signal vector x at this stage. In general, the estimated signal vector x- deviates from the input signal vector y even in the non-clipped part. A signal expression vector z^ satisfying the following two conditions is therefore sought (corresponding to step 3 of Table 1 below).
Condition 1: the clipped version of Dz^ matches y.
Condition 2: the distance between z^ and z- is minimized.
If the distance between z^ and z- is larger than a predetermined value, it is judged that the assumed complexity k is insufficient to represent the target signal; the complexity k is then increased by the update amount s and the above processing is repeated.
Implementing the above processing with the optimization method ADMM (Non-Patent Document 2) yields the algorithm shown in Table 1.
[Table 1: the SPADE algorithm implemented with ADMM]
In Table 1, the process of estimating z^ from z- is performed r times with the complexity k held fixed; r is normally set to 1. Further, max_iter is the maximum number of repetitions of the above process of estimating z^ from z-, and is given by equation (2):
[Equation (2)]
where ceil() is the round-up function and dim(z) is the dimension of the vector z.
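The following Python sketch illustrates the overall shape of a SPADE-style loop for a single frame. It assumes an orthonormal DFT dictionary (so D^-1 y is computed with an FFT), replaces the ADMM projection of Table 1 with a simple clipping-consistency projection, and uses an assumed form of equation (2) for max_iter; it is an illustration of the idea only, not the exact algorithm of Non-Patent Document 1.

```python
import numpy as np

def hard_threshold(z, k):
    """Hk: keep the k largest-magnitude elements of z and zero the rest."""
    out = np.zeros_like(z)
    if k > 0:
        idx = np.argsort(np.abs(z))[-k:]
        out[idx] = z[idx]
    return out

def declip_frame(y, theta, s=1, eps=0.1, r=1):
    """SPADE-style declipping of one frame y (illustrative sketch only).

    D is taken as the orthonormal DFT matrix, so D^-1 y is an FFT.  The ADMM
    projection of Table 1 is replaced by a simple clipping-consistency
    projection, and max_iter uses an assumed form of equation (2)."""
    L = len(y)
    s_plus = y >= theta
    s_minus = y <= -theta
    s_r = ~(s_plus | s_minus)

    k = s
    max_iter = r * int(np.ceil(L / s))               # assumption, not the patent's equation (2)
    z = np.fft.fft(y) / np.sqrt(L)                   # signal expression vector D^-1 y
    x_hat = y.astype(float).copy()
    for _ in range(max_iter):
        z_bar = hard_threshold(z, k)                 # complexity-k approximation (step 2)
        x_bar = np.real(np.fft.ifft(z_bar) * np.sqrt(L))
        # consistency with the clipped observation: agree with y on Sr,
        # stay >= theta on S+ and <= -theta on S- (stand-in for step 3)
        x_hat = x_bar.copy()
        x_hat[s_r] = y[s_r]
        x_hat[s_plus] = np.maximum(x_bar[s_plus], theta)
        x_hat[s_minus] = np.minimum(x_bar[s_minus], -theta)
        z_hat = np.fft.fft(x_hat) / np.sqrt(L)
        if np.linalg.norm(z_hat - z_bar) <= eps:     # assumed complexity was sufficient
            break
        z = z_hat
        k += s                                       # increase the assumed complexity k
    return x_hat
```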
SPADE is used in combination with ordinary frame-based signal processing. That is, as shown in FIG. 1, the clipped input signal is divided by the frame dividing unit 11 into overlapping frames of fixed length, each frame is windowed, and the SPADE processing described above is then applied by the waveform restoring unit 12. The frame synthesizing unit 13 applies frame synthesis to the processing results, and the restored pre-clipping signal is obtained.
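A minimal overlap-add sketch of this frame processing is shown below; the Hann window, hop size, and normalization are illustrative choices and are not specified in the text.

```python
import numpy as np

def frame_split(signal, frame_len, hop):
    """Divide the input into overlapping frames and apply a window
    (a rough stand-in for the frame dividing unit 11)."""
    window = np.hanning(frame_len)
    n_frames = 1 + (len(signal) - frame_len) // hop
    frames = [window * signal[i * hop:i * hop + frame_len] for i in range(n_frames)]
    return frames, window

def frame_synthesize(frames, window, hop, total_len):
    """Overlap-add the processed frames back into one signal
    (a rough stand-in for the frame synthesizing unit 13)."""
    out = np.zeros(total_len)
    norm = np.zeros(total_len)
    for i, frame in enumerate(frames):
        start = i * hop
        out[start:start + len(frame)] += frame
        norm[start:start + len(frame)] += window
    # divide by the summed window so that unmodified frames reconstruct the input
    return out / np.maximum(norm, 1e-8)
```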
According to Non-Patent Document 3, the process of estimating z^ from z- in step 3 of Table 1 is realized by applying a projection; the projection and the resulting update formula are given as follows.
[Equation: projection]
[Equation: update using the projection]
However, when the waveforms of multiple sensor signals have to be restored in real time, for example in IoT (Internet of Things) applications, SPADE has the problem that its amount of computation is large. This is because SPADE must advance the waveform restoration while successively increasing the assumed complexity k, and because the complexity of the input signal is unknown in the first place and fluctuates constantly.
In view of the above technical problems, an object of the present invention is to realize a technique that can reliably perform waveform restoration even in situations where the usable amount of computation is limited.
To solve the above problems, a restoration device according to one aspect of the present invention restores a pre-clip signal by repeatedly estimating, from a post-clip signal, a pre-clip signal of a predetermined complexity while increasing the complexity by a predetermined update amount, and includes a parameter determination unit that determines a maximum number of repetitions on the basis of the post-clip signal, and an estimation unit that repeatedly estimates the pre-clip signal using the maximum number of repetitions as the maximum number of estimations.
According to the waveform restoration technique of the present invention, the waveform restoration processing can be performed reliably even in situations where the usable amount of computation is limited.
FIG. 1 is a diagram illustrating the functional configuration of a conventional waveform restoration device.
FIG. 2 is a diagram illustrating the functional configuration of the waveform restoration device of the embodiments.
FIG. 3 is a diagram illustrating the processing procedure of the waveform restoration method of the embodiments.
Embodiments of the present invention are described in detail below. In the drawings, components having the same function are given the same reference numerals, and duplicate description is omitted.
[First embodiment]
The waveform restoration device of the first embodiment first estimates the complexity (sparsity) of the signal before clipping from the signal after clipping using a deep neural network (DNN; see Reference 1). In addition, the maximum number of repetitions of the estimation process is specified in advance as an index that limits the amount of computation, and the update amount s of the complexity k is controlled on the basis of the estimated sparsity. This makes it possible to perform the waveform restoration processing reliably even in situations where the usable amount of computation is limited.
[Reference 1] I. Goodfellow, Y. Bengio, and A. Courville, "Deep Learning", MIT Press, 2016.
The allowed maximum number of repetitions of the estimation process is assumed to be specified as α times the originally required max_iter (0 < α < 1).
As illustrated in FIG. 2, the waveform restoration device 10 of the first embodiment (hereinafter also referred to as the "restoration device") includes a sparsity estimation unit 21 and a parameter determination unit 22 in addition to a frame division unit 11, a waveform restoration unit 12 (hereinafter also referred to as the "estimation unit"), and a frame synthesis unit 13. The waveform restoration method of the first embodiment is realized by the waveform restoration device 10 performing the processing of each step illustrated in FIG. 3.
The waveform restoration device 10 is, for example, a special device configured by loading a special program into a known or dedicated computer having a central processing unit (CPU), a main storage device (RAM: Random Access Memory), and the like. The waveform restoration device 10 executes each process under the control of the central processing unit, for example. Data input to the waveform restoration device 10 and data obtained in each process are stored, for example, in the main storage device, and the data stored in the main storage device are read out to the central processing unit as necessary and used for other processing. At least part of each processing unit of the waveform restoration device 10 may be implemented by hardware such as an integrated circuit.
In step S11, the frame division unit 11 divides the input clipped signal into frames. The frame division unit 11 sends each clipped signal frame y to the waveform restoration unit 12 and the sparsity estimation unit 21.
In step S21, the sparsity estimation unit 21 obtains the signal expression vector of the clipped signal frame y as D^-1 y and, from the vector of the absolute values of its components, estimates the complexity of x^, that is, the sparsity K^, using a DNN trained in advance. The sparsity estimation unit 21 sends the estimated sparsity K^ to the parameter determination unit 22.
This DNN can be configured with two or three hidden layers. When the frame length is L and a DFT matrix is used as D, the first layer has L/2 dimensions. The training data used for the pre-training of this DNN consist of input data given to the DNN input layer and teacher data given to the DNN output layer. Each item of data is generated as follows (a sketch of this procedure is given after the list).
- Collect samples of the expected input signal x.
- Obtain the signal expression vector of the clipped signal y as D^-1 y, and use the vector of the absolute values of its components as the input data.
- Obtain the signal expression vector of the pre-clipping signal x as D^-1 x, compute the squared absolute value of each element, and sort the values in descending order. Accumulate the sum starting from the largest elements, and find the number of elements at which the sum exceeds 95% of the total. This number of elements is defined as the sparsity of the signal and is used as the teacher data.
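The generation of one training pair can be sketched as follows, again assuming an orthonormal DFT dictionary; the random example frame and the helper names are illustrative only.

```python
import numpy as np

def dnn_input(y):
    """DNN input feature: absolute values of the signal expression vector
    D^-1 y of a clipped frame (D = orthonormal DFT matrix).  Only the first
    L/2 magnitudes are kept, matching the L/2-dimensional first layer
    mentioned above."""
    L = len(y)
    z = np.fft.fft(y) / np.sqrt(L)
    return np.abs(z)[: L // 2]

def sparsity_label(x, energy_ratio=0.95):
    """Teacher label: number of largest |D^-1 x|^2 elements whose cumulative
    sum first exceeds 95% of the total."""
    L = len(x)
    z = np.fft.fft(x) / np.sqrt(L)
    power = np.sort(np.abs(z) ** 2)[::-1]        # squared magnitudes, descending
    cumulative = np.cumsum(power)
    return int(np.searchsorted(cumulative, energy_ratio * cumulative[-1]) + 1)

# One (input, teacher) pair from a clean frame x and a clipping level theta
x = np.random.randn(512)
theta = 0.7
y = np.clip(x, -theta, theta)
features, label = dnn_input(y), sparsity_label(x)
```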
In step S22, the parameter determination unit 22 corrects the SPADE parameters s and max_iter on the basis of the estimated sparsity K^ using equations (3) and (4):
[Equations (3) and (4)]
The parameter determination unit 22 sends the corrected parameters s_rev and max_iter_rev to the waveform restoration unit 12.
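Since equations (3) and (4) are not reproduced in this text, the following sketch only illustrates one plausible way such a correction could look: the repetition budget is cut to α times the original max_iter, and the update amount is chosen so that the complexity can reach the estimated sparsity K^ within that budget. The exact formulas of the patent may differ.

```python
import numpy as np

def correct_parameters(k_hat, frame_len, alpha, r=1, s_default=1):
    """Hypothetical correction of s and max_iter from the estimated sparsity K^.
    Equations (3) and (4) are not reproduced in this text, so the formulas
    below are illustrative assumptions only."""
    max_iter = int(np.ceil(frame_len / s_default)) * r        # assumed form of equation (2)
    max_iter_rev = max(1, int(np.ceil(alpha * max_iter)))     # budget: alpha times max_iter
    # pick the update amount so that the complexity can grow to about K^
    # within the allowed number of repetitions
    s_rev = max(1, int(np.ceil(k_hat / max_iter_rev)))
    return s_rev, max_iter_rev

s_rev, max_iter_rev = correct_parameters(k_hat=40, frame_len=512, alpha=0.25)
```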
In step S12, the waveform restoration unit 12 estimates the pre-clipping signal frame x by executing the SPADE processing with the corrected update amount s_rev and the corrected maximum number of repetitions max_iter_rev. The waveform restoration unit 12 sends the estimated pre-clipping signal frame x to the frame synthesis unit 13.
In step S13, the frame synthesis unit 13 applies frame synthesis processing to the estimated pre-clipping signal frames x and restores the pre-clipping signal.
[Second embodiment]
Inside SPADE, control is performed so that the update amount s of the complexity k is increased according to the iteration count i. As one example of such control, the update amount s can be switched in T stages at regular intervals. In this case, C is taken to be the largest natural number satisfying equation (5):
[Equation (5)]
and the update amount s of the complexity k is determined by the function s(i) of the iteration count i shown in equation (6):
[Equation (6)]
where floor() is the round-down function.
Each time the estimation of z^ from z- inside SPADE is repeated C times, s(i) increases as 1, 2, …, 2^T. Under this control, in the early iterations of SPADE the estimation of z^ from z- proceeds while the complexity k is increased slowly, and thereafter it proceeds while the complexity k is increased more rapidly.
In SPADE, setting the update amount s of the complexity k to a large value tends to reduce the amplitude of the restored signal. Increasing the complexity k slowly in the early iterations therefore improves the estimation accuracy of the restored signal.
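Equation (6) is likewise not reproduced here; the sketch below assumes a schedule in which the update amount doubles every C iterations up to 2^T, which matches the qualitative description above but is not necessarily the exact form of equation (6).

```python
import math

def staged_update_amount(i, C, T):
    """Hypothetical staged schedule for the update amount s(i): the amount
    doubles every C iterations up to 2^T (equation (6) is not reproduced
    here, so this exact form is an assumption)."""
    return 2 ** min(T, math.floor(i / C))

# s(i) for the first iterations with C = 10, T = 3: 1, 1, ..., 2, 2, ..., 4, ...
schedule = [staged_update_amount(i, C=10, T=3) for i in range(40)]
```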
[Modification]
C2 is taken to be the largest natural number satisfying equation (7):
[Equation (7)]
and the update amount s of the complexity k is determined by the function s(i) shown in equation (8):
[Equation (8)]
Although embodiments of the present invention have been described above, the specific configuration is not limited to these embodiments; it goes without saying that appropriate design changes and the like within a range not departing from the spirit of the present invention are also included in the present invention. The various kinds of processing described in the embodiments may be executed not only in time series in the described order, but also in parallel or individually according to the processing capability of the device executing the processing, or as needed.
[Program, recording medium]
When the various processing functions of each device described in the above embodiments are implemented by a computer, the processing contents of the functions that each device should have are described by a program. By executing this program on a computer, the various processing functions of the above devices are realized on the computer.
The program describing these processing contents can be recorded on a computer-readable recording medium. The computer-readable recording medium may be of any kind, for example a magnetic recording device, an optical disc, a magneto-optical recording medium, or a semiconductor memory.
This program is distributed, for example, by selling, transferring, or lending a portable recording medium such as a DVD or CD-ROM on which the program is recorded. The program may also be distributed by storing it in the storage device of a server computer and transferring it from the server computer to other computers via a network.
A computer that executes such a program first stores, for example, the program recorded on the portable recording medium or transferred from the server computer in its own storage device. When executing the processing, the computer reads the program stored in its own storage device and executes the processing according to the read program. As another mode of execution, the computer may read the program directly from the portable recording medium and execute processing according to it, or it may sequentially execute processing according to the received program each time the program is transferred to it from the server computer. The above processing may also be executed by a so-called ASP (Application Service Provider) type service, which realizes the processing functions only through execution instructions and acquisition of results, without transferring the program from the server computer to this computer. The program in this embodiment includes information that is provided for processing by an electronic computer and is equivalent to a program (such as data that is not a direct command to the computer but has the property of defining the processing of the computer).
Although in this embodiment the device is configured by executing a predetermined program on a computer, at least part of the processing contents may be implemented in hardware.
10, 90 Waveform restoration device
11 Frame division unit
12 Waveform restoration unit
13 Frame synthesis unit
21 Sparsity estimation unit
22 Parameter determination unit

Claims (5)

  1.  A restoration device that restores a pre-clip signal by repeatedly estimating a pre-clip signal of a predetermined complexity from a post-clip signal while increasing the complexity by a predetermined update amount, the restoration device comprising:
     a parameter determination unit that determines a maximum number of repetitions on the basis of the post-clip signal; and
     an estimation unit that repeatedly estimates the pre-clip signal using the maximum number of repetitions as a maximum number of times of the estimation.
  2.  The restoration device according to claim 1, further comprising:
     a sparsity estimation unit that estimates a sparsity of the pre-clip signal from the post-clip signal,
     wherein the parameter determination unit corrects the maximum number of repetitions and the update amount, which are determined in advance, on the basis of the sparsity of the pre-clip signal.
  3.  The restoration device according to claim 1 or 2, wherein
     the estimation unit increases the update amount according to the number of times the estimation has been repeated.
  4.  A restoration method that restores a pre-clip signal by repeatedly estimating a pre-clip signal of a predetermined complexity from a post-clip signal while increasing the complexity by a predetermined update amount, wherein
     a parameter determination unit determines a maximum number of repetitions on the basis of the post-clip signal, and
     an estimation unit repeatedly estimates the pre-clip signal using the maximum number of repetitions as a maximum number of times of the estimation.
  5.  A program for causing a computer to function as the restoration device according to any one of claims 1 to 3.
PCT/JP2019/049084 2018-12-28 2019-12-16 Restoration device, restoration method, and program WO2020137641A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2018246762A JP2020106713A (en) 2018-12-28 2018-12-28 Restoring device, restoring method, and program
JP2018-246762 2018-12-28

Publications (1)

Publication Number Publication Date
WO2020137641A1 true WO2020137641A1 (en) 2020-07-02

Family

ID=71129118

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2019/049084 WO2020137641A1 (en) 2018-12-28 2019-12-16 Restoration device, restoration method, and program

Country Status (2)

Country Link
JP (1) JP2020106713A (en)
WO (1) WO2020137641A1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2022070315A1 (en) * 2020-09-30 2022-04-07 日本電信電話株式会社 Waveform restoration device, waveform restoration method, and program

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5987407A (en) * 1997-10-28 1999-11-16 America Online, Inc. Soft-clipping postprocessor scaling decoded audio signal frame saturation regions to approximate original waveform shape and maintain continuity
US20110122930A1 (en) * 2009-11-25 2011-05-26 Al-Naffouri Tareq Y OFDM peak-to-average power ratio reduction method
US20130129115A1 (en) * 2009-02-26 2013-05-23 Paris Smaragdis System and Method for Dynamic Range Extension Using Interleaved Gains
US20130177089A1 (en) * 2012-01-09 2013-07-11 King Fahd University Of Petroleum And Minerals Method of performing peak reduction and clipping mitigation

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5987407A (en) * 1997-10-28 1999-11-16 America Online, Inc. Soft-clipping postprocessor scaling decoded audio signal frame saturation regions to approximate original waveform shape and maintain continuity
US20130129115A1 (en) * 2009-02-26 2013-05-23 Paris Smaragdis System and Method for Dynamic Range Extension Using Interleaved Gains
US20110122930A1 (en) * 2009-11-25 2011-05-26 Al-Naffouri Tareq Y OFDM peak-to-average power ratio reduction method
US20130177089A1 (en) * 2012-01-09 2013-07-11 King Fahd University Of Petroleum And Minerals Method of performing peak reduction and clipping mitigation

Also Published As

Publication number Publication date
JP2020106713A (en) 2020-07-09

Similar Documents

Publication Publication Date Title
Blanchard et al. GPU accelerated greedy algorithms for compressed sensing
TW202119293A (en) Method and system of quantizing artificial neural network and arti ficial neural network apparatus
JP2021504836A5 (en)
US20190065443A1 (en) Matrix generation program, matrix generation apparatus, and plagiarism detection program
US20210326756A1 (en) Methods of providing trained hyperdimensional machine learning models having classes with reduced elements and related computing systems
US10262680B2 (en) Variable sound decomposition masks
JP2017049907A (en) Neural network, learning method therefor and computer program
Llanas et al. Constructive approximation of discontinuous functions by neural networks
WO2020137641A1 (en) Restoration device, restoration method, and program
Kolountzakis Fourier pairs of discrete support with little structure
Kulesh et al. Adaptive metrics in the nearest neighbours method
US20210174815A1 (en) Quantization method of latent vector for audio encoding and computing device for performing the method
WO2022070315A1 (en) Waveform restoration device, waveform restoration method, and program
JPWO2020031281A1 (en) Information processing equipment, information processing methods, and programs
JP2019074621A (en) Signal separation device, signal separation method and program
Li et al. Dictionary learning with the ℓ1/2-regularizer and the coherence penalty and its convergence analysis
Addison et al. Blind source separation with non-stationary mixing using wavelets
CN114332561A (en) Super-resolution model training method, device, equipment and medium
Cyrus et al. Meta-interpretive Learning from Fractal Images
Ahmedov et al. Numerical integration of the integrals based on Haar wavelets
Hasenclever et al. Variational inference with orthogonal normalizing flows
JP7188589B2 (en) Restoration device, restoration method and program
Ojobor A Collocation method for second order boundary value problems
Gadzhiev On the variance of a centered random value roundoff error
JP6994572B2 (en) Data processing system and data processing method

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19905763

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 19905763

Country of ref document: EP

Kind code of ref document: A1