WO2023228371A1

WO2023228371A1 - Information processing device, information processing method, and program

Info

Publication number: WO2023228371A1
Application number: PCT/JP2022/021605
Authority: WO
Inventors: 秀明金; 哲也杵渕; 太一浅見
Original assignee: 日本電信電話株式会社
Priority date: 2022-05-26
Filing date: 2022-05-26
Publication date: 2023-11-30

Abstract

An information processing device according to one aspect of this invention comprises a processor and a storage unit. The storage unit comprises a first storage area and a second storage area. The first storage area stores event occurrence data about the location of occurrence of an event to be analyzed. The second storage area stores covariate data observed in an observation area of the event. The processor has a kernel function designation unit, a calculation method designation unit, and an intensity function estimation unit. The kernel function designation unit accepts designation of a kernel function in a Gaussian process. The calculation method designation unit accepts designation of a calculation method of an equivalent kernel function. The intensity function estimation unit calculates the equivalent kernel function on the basis of the designated kernel function and calculation method, and estimates an intensity function with respect to the covariate using the calculated equivalent kernel function.

Description

Information processing device, information processing method, and program

One aspect of the present invention relates to an information processing device, an information processing method, and a program that estimate the probability of occurrence of an event (intensity function) with respect to a covariate based on data regarding an event occurrence position and a covariate.

Consider a situation in which point events (hereinafter referred to as events) occur stochastically in a space where covariates are defined at arbitrary points. This situation can be expressed, for example, as (space, covariates, observed data) = (latitude/longitude, crowd density, location of accident event). As a technique for estimating the probability of occurrence of an event (also referred to as an intensity function) with respect to a covariate, a technique using a kernel density estimation method is known (see, for example, Non-Patent Document 1).

In recent years, methods that achieve higher accuracy than the kernel density estimation method have been reported. For example, it is known that a Bayesian estimation method using a Gaussian process as a prior distribution achieves higher accuracy than a kernel density estimation method (see, for example, Non-Patent Documents 2 and 3).

Techniques that use kernel density estimation to estimate intensity functions for covariates are known. However, no technique is yet known that estimates an intensity function for a covariate with a Bayesian estimation method using a Gaussian process as a prior distribution, which can be expected to be more accurate than the kernel density estimation method.

This invention was made in view of the above circumstances, and its purpose is to provide a technique capable of estimating an intensity function for a covariate by a Bayesian estimation method using a Gaussian process as a prior distribution.

An information processing device according to one aspect of the present invention includes a processor and a storage unit. The storage unit includes a first storage area and a second storage area. The first storage area stores event occurrence data regarding the occurrence position of the event to be analyzed. The second storage area stores covariate data observed within the observation area of the event. The processor includes a kernel function designation unit, a calculation method designation unit, and an intensity function estimation unit. The kernel function designation unit accepts designation of a kernel function in a Gaussian process. The calculation method designation unit accepts designation of a calculation method for the equivalent kernel function. The intensity function estimation unit calculates the equivalent kernel function based on the designated kernel function and calculation method, and estimates an intensity function for a covariate using the calculated equivalent kernel function.

According to one aspect of the present invention, it is possible to provide a technique capable of estimating an intensity function for a covariate based on a Bayesian estimation method using a Gaussian process as a prior distribution.

FIG. 1 is a functional block diagram showing an example of an information processing device according to an embodiment.
FIG. 2 is a functional block diagram showing an example of the information processing device 1 shown in FIG. 1.
FIG. 3 is a flowchart showing an example of a processing procedure of the information processing device 1.

Embodiments of the present invention will be described below with reference to the drawings.
<Configuration>
FIG. 1 is a functional block diagram showing an example of an information processing device according to an embodiment.
The information processing device 1 is a computer including a processor and a memory. The information processing device 1 includes a processor 11, an input/output interface 12, and a storage unit 13. The processor 11, the input/output interface 12, and the storage unit 13 are communicably connected to each other via a bus.
The processor 11 controls the information processing device 1. The processor 11 is an arithmetic processing device such as a CPU (Central Processing Unit) or an MPU (Micro Processing Unit).

The input/output interface 12 is an interface that allows the information processing device 1 to exchange information with the input device 2 and the output device 3. The input/output interface 12 may include a wired or wireless communication interface. That is, the information processing device 1, the input device 2, and the output device 3 may transmit and receive information via a network such as a LAN or the Internet.

The storage unit 13 is a storage medium. The storage unit 13 is configured, for example, by combining a non-volatile memory that can be written to and read from at any time, such as an HDD (Hard Disk Drive) or an SSD (Solid State Drive), a non-volatile memory such as a ROM (Read Only Memory), and a volatile memory such as a RAM (Random Access Memory). The storage unit 13 includes a program storage area and a data storage area. The program storage area stores application programs necessary for executing various processes in addition to the OS (Operating System) and middleware.

The input device 2 includes, for example, a keyboard, a pointing device, etc. for an owner of the information processing device 1 (for example, an assignee, a manager, a supervisor, etc.) to input instructions to the information processing device 1. Furthermore, the input device 2 may include a reader for reading data to be stored in the storage unit 13 from a memory medium such as a USB memory, and a disk device for reading such data from a disk medium. Furthermore, the input device 2 may include an image scanner.

The output device 3 includes a display that displays output data to be presented to the owner from the information processing device 1, a printer that prints the output data, and the like. The output device 3 may also include a writer for writing data to be input to another information processing device, such as a PC or a smartphone, onto a memory medium such as a USB memory, and a disk device for writing such data onto a disk medium.

FIG. 2 is a functional block diagram showing an example of the information processing device 1 shown in FIG. 1. In FIG. 2, the storage unit 13 stores a program 10 that causes the processor 11 to function as the information processing device 1. Furthermore, the storage unit 13 includes a first storage area 131, a second storage area 132, and a third storage area 133.
The first storage area 131 stores event occurrence data 100. The event occurrence data 100 is data regarding the occurrence position of the event to be analyzed, and includes at least the number of observed events, a series of event positions, and an observation area.
The second storage area 132 stores covariate data 101 observed within the observation area of the event to be analyzed.
The third storage area 133 stores the intensity function distribution 105 calculated by the processor 11.
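
As a rough illustration of what the first storage area holds, the event occurrence data 100 could be represented by a small container such as the following. This is only a sketch: the class name `EventOccurrenceData`, its field names, and the choice of an axis-aligned box for the observation area are assumptions made for illustration and are not taken from the embodiment.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class EventOccurrenceData:
    """Hypothetical container for the event occurrence data 100."""
    num_events: int                      # number of observed events
    positions: List[Tuple[float, ...]]   # series of event positions
    region: List[Tuple[float, float]]    # observation area as (low, high) per dimension

    def __post_init__(self) -> None:
        if self.num_events != len(self.positions):
            raise ValueError("num_events must equal the number of positions")
        for pos in self.positions:
            if len(pos) != len(self.region):
                raise ValueError("position dimension must match the region")
            for value, (low, high) in zip(pos, self.region):
                if not low <= value <= high:
                    raise ValueError("event position lies outside the observation area")


# Two events observed in the unit square (illustrative values only).
data = EventOccurrenceData(
    num_events=2,
    positions=[(0.2, 0.5), (0.9, 0.1)],
    region=[(0.0, 1.0), (0.0, 1.0)],
)
```

Checking the three items against each other when the data is loaded is a design choice of this sketch; the embodiment only states which items the data contains.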

The processor 11 includes a kernel function designation unit 102, a calculation method designation unit 103, an intensity function estimation unit 112, and an output control unit 114 as processing functions according to the embodiment. The kernel function designation unit 102, the calculation method designation unit 103, the intensity function estimation unit 112, and the output control unit 114 are functional processes realized by the calculation processing of the processor 11 based on the program 10.

The kernel function designation unit 102 accepts designation of a kernel function in a Gaussian process. The kernel function is specified by the user by operating the input device 2, for example.
The calculation method designation unit 103 accepts designation of the calculation method of the equivalent kernel function. The calculation method may also be specified by the user operating the input device 2, for example.

The intensity function estimation unit 112 calculates an equivalent kernel function based on the designated kernel function and calculation method. The intensity function estimation unit 112 then uses the calculated equivalent kernel function to estimate an intensity function for the covariate. The resulting intensity function distribution 105 is stored in the third storage area 133.

The output control unit 114 outputs the intensity function distribution 105 to the output device 3 via the input/output interface 12. The output device 3 visualizes and displays the calculated intensity function distribution 105 on, for example, a display.

Next, the operation of the above configuration will be explained.
<Operation>
(Overview)
First, an overview of the operation will be explained. In the embodiment, the processor 11 mainly executes the following processes (1) to (4) to estimate the intensity function for the covariate based on the Bayesian estimation method using a Gaussian process as the prior distribution.

(1) For a variable that follows a Gaussian process defined on the covariate space, the square of that variable is defined as the intensity function. Under this definition, the estimate of the square root of the intensity function that maximizes the posterior probability (the maximum a posteriori estimate, or MAP estimate) is given as the solution of a system of N simultaneous equations, where N is the number of observed data points. In other words, the representer theorem holds. This makes the estimate of the square root of the intensity function easy to compute numerically.

(2) Calculate the estimation error of the square root of the intensity function using the Laplace approximation. That is, the Hessian matrix of the log posterior probability distribution of the square root of the intensity function is evaluated at the MAP estimate. The inverse of this Hessian matrix multiplied by -1 is then used as the covariance matrix of the estimate of the square root of the intensity function.

(3) Under the Laplace approximation in (2), obtain the gamma distribution that the estimate of the intensity function follows. Obtaining this probability distribution of the estimate is the ultimate goal of intensity function estimation.
(4) Estimate the hyperparameters needed for estimating the intensity function from the observed data using the empirical Bayes method. The empirical Bayes method adopts as estimates the hyperparameter values that maximize the marginal likelihood. A typical example of a hyperparameter is a parameter of the kernel function in the Gaussian process.
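
In symbols, processes (1) and (2) can be summarized as follows. The notation is chosen here for illustration only (x for the Gaussian-process variable, k for the kernel function, D for the observed data, and a zero-mean prior), and is not taken from the numbered formulas of the embodiment.

```latex
\begin{align}
  \lambda(y) &= x(y)^2, \qquad x \sim \mathcal{GP}(0, k), \\
  \hat{x} &= \operatorname*{arg\,max}_{x} \; \log p(x \mid \mathcal{D}), \\
  \Sigma &= \Bigl( -\nabla_x^2 \log p(x \mid \mathcal{D}) \big|_{x = \hat{x}} \Bigr)^{-1}.
\end{align}
```

The first line is the definition in (1), the second is the MAP estimate, and the third is the Laplace-approximation covariance described in (2).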

FIG. 3 is a flowchart showing an example of the processing procedure of the information processing device 1. In FIG. 3, the processor 11 accepts the user's designation of a kernel function in a Gaussian process (step S21). Next, the processor 11 accepts the user's designation of the method for calculating the equivalent kernel function (step S22).

Next, the processor 11 calculates the equivalent kernel function based on the designated kernel function and calculation method (step S23). The processor 11 then estimates the intensity function for the covariate using the calculated equivalent kernel function (step S24).

(Details)
Next, details of the operation will be explained with reference to mathematical formulas.

[About event occurrence data]
Data regarding the occurrence position of the event to be analyzed is given as input. The event occurrence data includes the following (A), (B), and (C): the number of observed events, the series of event positions, and the observation region.

Note that the number of dimensions of the space in which events occur is arbitrary; for example, a one-dimensional space may be time, a two-dimensional space may be geographical space, and a three-dimensional space may be space and time combined.

[About covariate data]
The covariate data observed within the event observation region T (formula (C)) is given as a function (formula (E)) that takes an arbitrary point (formula (D)) within the observation region (formula (C)) as input and outputs the covariate at that point. In many applications, information about the covariates is available only at a finite number of points within the observation region (formula (C)). In that case, it is assumed that the function (formula (E)) is constructed using an interpolation technique such as a regression model or kriging.
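
When the covariate is known only at a finite set of points, some interpolation is needed before it can be evaluated anywhere in the observation region. The sketch below uses piecewise-linear interpolation over a one-dimensional region purely as a simple stand-in for the regression model or kriging mentioned above; the function name `make_covariate_function` and the sample values are hypothetical.

```python
import numpy as np


def make_covariate_function(points, values):
    """Return a function t -> covariate built by linear interpolation.

    points: 1-D locations where the covariate was observed
    values: covariate values observed at those locations
    """
    order = np.argsort(points)
    xs = np.asarray(points, dtype=float)[order]
    ys = np.asarray(values, dtype=float)[order]

    def covariate(t):
        # np.interp holds the end values constant outside [xs[0], xs[-1]].
        return np.interp(t, xs, ys)

    return covariate


# Hypothetical observations, e.g. crowd density measured at three locations.
covariate = make_covariate_function([0.0, 10.0, 5.0], [1.0, 3.0, 2.0])
```

The returned `covariate` can then be evaluated at event positions and at Monte Carlo sample points alike.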

[Specifying the kernel function in the Gaussian process]
To use a Gaussian process, a function called the kernel function, which determines the smoothness of the function to be modeled, is specified. The values of the parameters (hyperparameters) included in the kernel function are also specified.
Note that the function to be modeled in the embodiment is an intensity function for a covariate. The kernel function for two arbitrary points (formula (F)) in the covariate space is expressed as (formula (G)).

An example of a kernel function is the Gaussian kernel given by equation (1).
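
For reference, a Gaussian kernel is commonly written with an amplitude and a length scale as its hyperparameters. The sketch below assumes that common form; the exact parameterization of equation (1) is not reproduced in this text, so the parameter names `amplitude` and `length_scale` are assumptions.

```python
import numpy as np


def gaussian_kernel(y1, y2, amplitude=1.0, length_scale=1.0):
    """Gram matrix of a Gaussian kernel between two sets of covariates.

    y1: array of shape (n, d), y2: array of shape (m, d).
    Returns an (n, m) array with entries
    amplitude * exp(-||y1_i - y2_j||^2 / (2 * length_scale^2)).
    """
    y1 = np.atleast_2d(np.asarray(y1, dtype=float))
    y2 = np.atleast_2d(np.asarray(y2, dtype=float))
    sq_dist = ((y1[:, None, :] - y2[None, :, :]) ** 2).sum(axis=-1)
    return amplitude * np.exp(-sq_dist / (2.0 * length_scale ** 2))
```

A smaller `length_scale` lets the modeled intensity vary more quickly across covariate values, which is the sense in which the kernel function determines smoothness.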

Alternatively, as an example of the kernel function, there may be mentioned the kernel of Equation (2), which is expressed by the inner product of a finite-dimensional feature mapping vector (Equation (H)).
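
A kernel of this second kind is fully determined by its feature map. As an illustration only, the sketch below uses random Fourier features, which give a finite-dimensional approximation of a Gaussian kernel; this particular feature map is an assumption chosen for the example, and the names `make_fourier_features` and `feature_kernel` are hypothetical.

```python
import numpy as np


def make_fourier_features(dim, num_features, length_scale=1.0, seed=0):
    """Return a feature map phi: (n, dim) -> (n, num_features)."""
    rng = np.random.default_rng(seed)
    omega = rng.normal(scale=1.0 / length_scale, size=(dim, num_features))
    phase = rng.uniform(0.0, 2.0 * np.pi, size=num_features)

    def phi(y):
        y = np.atleast_2d(np.asarray(y, dtype=float))
        return np.sqrt(2.0 / num_features) * np.cos(y @ omega + phase)

    return phi


def feature_kernel(phi, y1, y2):
    """Kernel given by the inner product of finite-dimensional feature vectors."""
    return phi(y1) @ phi(y2).T
```

Because the kernel is an inner product of feature vectors, its Gram matrix is positive semi-definite by construction and has rank at most `num_features`.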

[About specification of the method for calculating the equivalent kernel function]
A method for calculating the equivalent kernel function is given as input. The specification consists of the type of calculation method and the number of Monte Carlo integration points (formula (I)).

Note that the options for the type of calculation method are type 1 and type 2, and type 2 can only be selected when the kernel function is given by the inner product of finite-dimensional feature mapping vectors.
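
The constraint between the calculation type and the kernel function can be checked when the designation is accepted. The following is a minimal sketch of such a check; the names `CalculationMethod` and `validate_method` and the flag `kernel_has_finite_features` are hypothetical and not part of the embodiment.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CalculationMethod:
    """Hypothetical record of the user's calculation-method designation."""
    method_type: int      # 1 or 2
    num_mc_points: int    # number of Monte Carlo integration points


def validate_method(method, kernel_has_finite_features):
    """Raise ValueError if the designation is inconsistent with the kernel."""
    if method.method_type not in (1, 2):
        raise ValueError("method_type must be 1 or 2")
    if method.num_mc_points <= 0:
        raise ValueError("num_mc_points must be positive")
    if method.method_type == 2 and not kernel_has_finite_features:
        raise ValueError(
            "type 2 requires a kernel given by finite-dimensional features"
        )
    return method


# Type 1 is always allowed; type 2 only with a feature-map kernel.
method = validate_method(CalculationMethod(1, 1000), kernel_has_finite_features=False)
```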

[About estimation of the intensity function]
Based on the information given above, the processor 11 calculates an equivalent kernel function (formula (J)).

Then, the processor 11 uses the equivalent kernel function (formula (J)) to estimate the intensity function (formula (K)) for the covariate.

First, the equivalent kernel function (Equation (J)) is defined as a solution to the integral equation of Equation (3).

In preparation for numerically solving the above integral equation, equation (4) is obtained by approximating the integral part by Monte Carlo integration.

Here, (formula (L)) is the covariate at the m-th sample point.
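
The Monte Carlo step relies on the standard approximation of an integral over the observation region by an average over sample points. Writing |T| for the volume of the region, M for the number of integration points, t_m for sample points drawn uniformly from the region, and f for a generic integrand (all notation assumed here for illustration), the approximation reads:

```latex
\int_{T} f\bigl(y(t)\bigr)\, dt \;\approx\; \frac{|T|}{M} \sum_{m=1}^{M} f\bigl(y(t_m)\bigr),
\qquad t_m \sim \mathrm{Uniform}(T).
```

The approximation error shrinks as the number of integration points M grows, at the cost of a larger linear system to solve.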

When type 1 is specified as the type of calculation method, the equivalent kernel function (formula (N)) is obtained by solving equation (4) as a matrix equation for the column-vector function (formula (M)).
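
The type 1 calculation can be sketched as a Nystrom-style solve. Since equations (3) and (4) are not reproduced in this text, the sketch assumes an integral equation of the form h(y, y') + c ∫ k(y, y(t)) h(y(t), y') dt = k(y, y'), with the constant `c` (default 2) being an assumption; the function names are hypothetical as well.

```python
import numpy as np


def equivalent_kernel_type1(kernel, y_samples, region_volume, c=2.0):
    """Nystrom-style solution of the discretized integral equation.

    Assumed equation (not taken from the source text):
        h(y, y') + c * (|T|/M) * sum_m k(y, y_m) h(y_m, y') = k(y, y')
    kernel:        callable (n, d), (m, d) -> (n, m) Gram matrix
    y_samples:     (M, d) covariates at the Monte Carlo sample points
    region_volume: |T|, volume of the observation region
    """
    y_samples = np.atleast_2d(np.asarray(y_samples, dtype=float))
    num_points = y_samples.shape[0]
    weight = c * region_volume / num_points
    gram = kernel(y_samples, y_samples)                 # (M, M)
    system = np.eye(num_points) + weight * gram

    def h(y1, y2):
        k12 = kernel(y1, y2)                            # (n, m)
        k1s = kernel(y1, y_samples)                     # (n, M)
        ks2 = kernel(y_samples, y2)                     # (M, m)
        # Column-vector function at the sample points: solve (I + w K) H = k.
        h_s2 = np.linalg.solve(system, ks2)             # (M, m)
        return k12 - weight * k1s @ h_s2

    return h


def rbf(a, b):
    """Unit Gaussian kernel used only for this illustration."""
    a, b = np.atleast_2d(a), np.atleast_2d(b)
    return np.exp(-0.5 * ((a[:, None, :] - b[None, :, :]) ** 2).sum(-1))


rng = np.random.default_rng(0)
y_m = rng.uniform(0.0, 1.0, size=(200, 1))   # covariates at M = 200 sample points
h = equivalent_kernel_type1(rbf, y_m, region_volume=1.0)
```

The cost is dominated by the M-by-M linear solve, so the number of Monte Carlo integration points trades accuracy against computation time.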

When type 2 is specified as the type of calculation method, on the premise that the kernel function is given by the inner product of finite-dimensional feature mapping vectors as in equation (2), the equivalent kernel function (formula (O)) is obtained in the form of equation (6).
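
For a feature-map kernel, the same assumed integral equation, h + c ∫ k h = k, admits a closed form in feature space. The sketch below illustrates this under that assumption; equation (6) itself is not reproduced in this text, and the names `equivalent_kernel_type2` and `poly_features` are hypothetical.

```python
import numpy as np


def equivalent_kernel_type2(phi, y_samples, region_volume, c=2.0):
    """Closed-form equivalent kernel for k(y, y') = phi(y)^T phi(y').

    Under the assumed integral equation h + c * integral(k h) = k, the
    ansatz h(y, y') = phi(y)^T B phi(y') gives B = (I + c A)^{-1}, where
    A is the integral of phi(y(t)) phi(y(t))^T over the observation region,
    approximated here by Monte Carlo integration.
    """
    feats = phi(y_samples)                                      # (M, D)
    num_points, num_features = feats.shape
    a_matrix = (region_volume / num_points) * feats.T @ feats   # (D, D)
    b_matrix = np.linalg.inv(np.eye(num_features) + c * a_matrix)

    def h(y1, y2):
        return phi(y1) @ b_matrix @ phi(y2).T

    return h


def poly_features(y):
    """Toy feature map (1, y, y^2) for scalar covariates of shape (n, 1)."""
    y = np.atleast_2d(np.asarray(y, dtype=float))
    return np.hstack([np.ones_like(y), y, y ** 2])


rng = np.random.default_rng(1)
y_m2 = rng.uniform(-1.0, 1.0, size=(100, 1))
h2 = equivalent_kernel_type2(poly_features, y_m2, region_volume=2.0)
```

Only a D-by-D matrix is inverted, where D is the feature dimension, so this route is cheaper than the M-by-M solve whenever D is much smaller than the number of integration points.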

Next, the MAP estimate of the square root of the intensity function is calculated by equation (7) using the equivalent kernel function (formula (J)).

Here, (formula (P)) is obtained by solving the simultaneous equations (8).
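
As an illustration of how such simultaneous equations can be solved numerically, the sketch below assumes that the MAP estimate is a weighted sum of equivalent kernel functions centered at the N event covariates, with weights v satisfying v_n · 2 Σ_j H[n, j] v_j = 1, where H is the Gram matrix of the equivalent kernel. This form follows from defining the intensity as the square of a zero-mean Gaussian-process variable, but equations (7) and (8) are not reproduced in this text, so it should be read as an assumption. The damped Newton solver and the name `solve_map_coefficients` are likewise choices made for this example.

```python
import numpy as np


def solve_map_coefficients(h_gram, max_iter=100, tol=1e-10):
    """Solve v_n * 2 * sum_j H[n, j] v_j = 1 for v > 0 by damped Newton.

    The system is the stationarity condition of the convex function
        f(v) = v^T H v - sum_n log(v_n),
    so Newton steps with backtracking are used to minimize f.
    h_gram: (N, N) symmetric positive semi-definite Gram matrix of the
            equivalent kernel at the covariates of the N observed events.
    """
    h_gram = np.asarray(h_gram, dtype=float)
    v = 1.0 / np.sqrt(2.0 * np.diag(h_gram))   # exact if H were diagonal

    def objective(u):
        return u @ h_gram @ u - np.log(u).sum()

    for _ in range(max_iter):
        grad = 2.0 * h_gram @ v - 1.0 / v
        if np.linalg.norm(grad) < tol:
            break
        hess = 2.0 * h_gram + np.diag(1.0 / v ** 2)
        step = np.linalg.solve(hess, -grad)
        t = 1.0
        # Backtrack until v stays positive and the objective does not increase.
        while np.any(v + t * step <= 0) or objective(v + t * step) > objective(v):
            t *= 0.5
            if t < 1e-12:
                break
        v = v + t * step
    return v


# Illustrative Gram matrix at six event covariates.
y_events = np.linspace(0.0, 1.0, 6)
H_events = 0.5 * np.exp(-0.5 * (y_events[:, None] - y_events[None, :]) ** 2 / 0.3 ** 2)
v_hat = solve_map_coefficients(H_events)
x_hat_at_events = 2.0 * H_events @ v_hat   # assumed MAP estimate at the event covariates
```

Under the assumed form, the estimate at any other covariate value is obtained by replacing the Gram matrix in the last line with equivalent kernel values between that covariate and the event covariates.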

Next, based on the Laplace approximation, the square root of the intensity function is assumed to follow a normal distribution whose mean is the MAP estimate, and its covariance matrix (formula (Q)) is calculated using formula (9).

Here, equation (10) holds.

Finally, the probability distribution followed by the estimate of the intensity function at each covariate value (formula (R)) is calculated as a gamma distribution whose scale parameter (formula (S)) and shape parameter (formula (T)) are given by formula (11).
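
One common way to arrive at a gamma distribution for the square of a normally distributed variable is moment matching. The sketch below illustrates that idea; whether formula (11) is derived in exactly this way is not stated in this text, so both the approach and the function name `gamma_from_gaussian_sqrt` are assumptions.

```python
def gamma_from_gaussian_sqrt(mean, variance):
    """Moment-match a gamma distribution to x^2 where x ~ N(mean, variance).

    Uses E[x^2]   = mean^2 + variance and
         Var[x^2] = 2 * variance^2 + 4 * mean^2 * variance.
    Returns (shape, scale) such that shape * scale = E[x^2] and
    shape * scale^2 = Var[x^2].
    """
    if variance <= 0:
        raise ValueError("variance must be positive")
    first = mean ** 2 + variance
    second_central = 2.0 * variance ** 2 + 4.0 * mean ** 2 * variance
    shape = first ** 2 / second_central
    scale = second_central / first
    return shape, scale


# MAP estimate 2.0 and Laplace variance 0.25 at some covariate value (illustrative).
shape, scale = gamma_from_gaussian_sqrt(mean=2.0, variance=0.25)
```

With a zero mean the match is exact, because the square of a zero-mean normal variable is a scaled chi-square variable, which is itself a gamma distribution.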

Note that during the estimation of the intensity function, the validity of the hyperparameters given when specifying the kernel function in the Gaussian process can also be evaluated based on the marginal likelihood function. When the hyperparameters are to be optimized, the hyperparameter values that maximize the marginal likelihood function are searched for, and equation (11) is recalculated using those values.
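
The search for hyperparameters can be as simple as evaluating the marginal likelihood on a grid of candidates. The sketch below shows that pattern only; the marginal likelihood of the embodiment is not reproduced in this text, so a toy stand-in function is used, and the names `select_hyperparameters` and `toy_log_marginal_likelihood` are hypothetical.

```python
import math


def select_hyperparameters(candidates, log_marginal_likelihood):
    """Return the candidate with the largest log marginal likelihood."""
    best, best_score = None, -math.inf
    for params in candidates:
        score = log_marginal_likelihood(params)
        if score > best_score:
            best, best_score = params, score
    return best, best_score


def toy_log_marginal_likelihood(params):
    """Toy stand-in for the marginal likelihood, peaked at length_scale = 0.5."""
    return -(math.log(params["length_scale"]) - math.log(0.5)) ** 2


grid = [{"length_scale": s} for s in (0.1, 0.25, 0.5, 1.0, 2.0)]
best_params, best_score = select_hyperparameters(grid, toy_log_marginal_likelihood)
```

In practice the callable passed in would rerun the intensity function estimation for each candidate, so a coarse grid or a gradient-based optimizer may be preferable when each evaluation is expensive.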

[About the estimated intensity function distribution]
The probability distribution of the intensity function calculated in the estimation of the intensity function is output. What is output is a function that, for any covariate value (formula (R)), returns the gamma distribution with the scale and shape parameters given by formula (11).

<Effect>
As described above, according to the embodiment, it is possible to estimate the intensity function for the covariate using the Bayesian estimation method using a Gaussian process as the prior distribution.
Note that the present invention is not limited to the above embodiments as they are. For example, the selection of the kernel function is not limited to equation (1) or equation (2).
Furthermore, the present invention can be embodied by modifying the constituent elements within the scope of the embodiments at the implementation stage. Furthermore, various inventions can be formed by appropriately combining the plurality of components disclosed in the above embodiments. For example, some components may be deleted from all the components shown in the embodiments. Furthermore, components from different embodiments may be combined as appropriate.

1...Information processing device
2...Input device
3...Output device
10...Program
11...Processor
12...Input/output interface
13...Storage unit
131...First storage area
132...Second storage area
133...Third storage area
100...Event occurrence data
101...Covariate data
102...Kernel function designation unit
103...Calculation method designation unit
105...Intensity function distribution
112...Intensity function estimation unit
114...Output control unit

Claims

1. An information processing device comprising a processor and a storage unit, wherein
the storage unit includes:
a first storage area that stores event occurrence data regarding the occurrence position of an event to be analyzed; and
a second storage area that stores covariate data observed within an observation area of the event, and
the processor includes:
a kernel function designation unit that accepts designation of a kernel function in a Gaussian process;
a calculation method designation unit that accepts designation of a calculation method of an equivalent kernel function; and
an intensity function estimation unit that calculates the equivalent kernel function based on the designated kernel function and calculation method, and estimates an intensity function for a covariate using the calculated equivalent kernel function.
2. The information processing device according to claim 1, wherein the event occurrence data includes at least the number of observed events, a series of event positions, and an observation area.
3. The information processing device according to claim 1, wherein the kernel function designation unit further accepts designation of a value of a hyperparameter of the kernel function.
4. The information processing device according to claim 1, wherein the calculation method designation unit accepts at least designation of a type of calculation method for the equivalent kernel function and designation of the number of Monte Carlo integration points.
5. An information processing method performed by an information processing device that includes a processor and a storage unit storing event occurrence data regarding the occurrence position of an event to be analyzed and covariate data observed within an observation area of the event, the method comprising:
a step in which the processor accepts designation of a kernel function in a Gaussian process;
a step in which the processor accepts designation of a calculation method of an equivalent kernel function;
a step in which the processor calculates the equivalent kernel function based on the designated kernel function and calculation method; and
a step in which the processor estimates an intensity function for a covariate using the calculated equivalent kernel function.
6. A program that causes a computer to function as each unit of the information processing device according to claim 4.