JP2021056918A

JP2021056918A - Data analysis device and data analysis method

Info

Publication number: JP2021056918A
Application number: JP2019181159A
Authority: JP
Inventors: 晋太郎高田; Shintaro Takada; 岳士上原; Takeshi Uehara
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 2019-10-01
Filing date: 2019-10-01
Publication date: 2021-04-08
Anticipated expiration: 2039-10-01
Also published as: JP7262359B2

Abstract

To achieve improvement in prediction accuracy of a work time.SOLUTION: A data analysis device is configured to implement: acquisition processing that acquires, for each work of a plurality of works, first learning data having an objective variable indicative of a work time, an explanation variable regarding a work environment, a worker variable indicative or presence or absence of a work implementation for each worker; prediction model generation processing that generates a first prediction model predicting the work time on the basis of the first learning data; calculation processing that inputs the first learning data to the first prediction model, and thereby calculates, for each work, a first prediction value of the work time and a first implication degree indicative of a degree in which the worker variable affects statistics of the first prediction value for each work; and learning data generation processing that generates an abstraction variable abstracting a work speed of a plurality of workers for each work on the basis of a distribution of respective first implication degrees of the plurality of works for each worker, and generates second learning data having the objective variable, explanation variable, and abstraction variable for each work.SELECTED DRAWING: Figure 5

Description

本発明は、データを分析するデータ分析装置およびデータ分析方法に関する。 The present invention relates to a data analyzer and a data analysis method for analyzing data.

近年、様々な分野においてビッグデータ解析の利用が普及している。特に、物流倉庫での商品ピッキング作業業務や、工場における組み立て作業業務など、人間が行う作業業務の効率向上のために、過去の大量の実績データをもとに、最適な商品配置や作業順序を業務改善施策として提案するサービスが注目されている。このようなサービスを実現する一つの手段として、過去の大量の実績データから、作業環境や作業内容における諸条件と作業時間の関係性を分析し、任意の作業条件での作業時間を予測可能な作業時間予測モデルを生成し、様々な作業条件の中から、最も作業時間が少なくなるような条件を、予測モデルを用いて探索する方式が挙げられる。 In recent years, the use of big data analysis has become widespread in various fields. In particular, in order to improve the efficiency of work performed by humans, such as product picking work in distribution warehouses and assembly work in factories, optimal product placement and work order are selected based on a large amount of past actual data. The service proposed as a business improvement measure is attracting attention. As one means of realizing such a service, it is possible to analyze the relationship between various conditions and working time in the working environment and work content from a large amount of past actual data, and predict the working time under arbitrary working conditions. A method of generating a work time prediction model and searching for the condition with the shortest work time from various work conditions using the prediction model can be mentioned.

たとえば、特許文献１は、指定された組立作業時間予測製品に対して、実績データ格納手段に格納されている実績データの中から類似の実績データを複数選択し、その得られた実績データのうち、類似製品の部品の種類を説明変数とすると共に組立作業時間を目的変数として重回帰分析を行うことにより、組立作業時間予測モデルを作製し、作製された組立作業時間予測モデルの有効性を統計的に判定し、有効である場合に該組立作業時間モデルより組立作業時間を予測し、また有効でない場合は従来手法により組立作業時間を予測する組立作業時間予測装置を開示する。 For example, in Patent Document 1, a plurality of similar actual data are selected from the actual data stored in the actual data storage means for the specified assembly work time prediction product, and among the obtained actual data. , By performing multiple regression analysis with the type of parts of similar products as the explanatory variable and the assembly work time as the objective variable, an assembly work time prediction model is created, and the effectiveness of the prepared assembly work time prediction model is statistically statistic. Disclosed is an assembly work time prediction device that predicts the assembly work time from the assembly work time model when it is valid, and predicts the assembly work time by a conventional method when it is not valid.

特開平０７−１６４２６７号公報Japanese Unexamined Patent Publication No. 07-164267

しかしながら、物流倉庫でのピッキング作業業務や、工場での組み立て作業業務などの作業時間は、作業員本人のスキルや、作業員個人にとって得意不得意な作業条件の存在など、個人性に大きく依存する。したがって、特許文献１で生成されたモデルは、作業員の個人性を反映しておらず、作業時間の予測精度に限界がある。本発明は、作業時間の予測精度の向上を図ることを目的とする。 However, the work time for picking work in a distribution warehouse and assembly work in a factory largely depends on individuality, such as the skills of the worker himself and the existence of work conditions that the worker is not good at. .. Therefore, the model generated in Patent Document 1 does not reflect the individuality of the worker, and the accuracy of predicting the working time is limited. An object of the present invention is to improve the accuracy of predicting working time.

本願において開示される発明の一側面となるデータ分析装置およびデータ分析方法は、プログラムを実行するプロセッサと、前記プログラムを記憶する記憶デバイスと、を有するデータ分析装置であって、前記プロセッサは、複数の作業の作業ごとに、作業時間を示す目的変数と、作業環境に関する説明変数と、作業員ごとの前記作業の実施の有無を示す作業員変数と、を有する第１学習データを取得する取得処理と、前記取得処理によって取得された第１学習データに基づいて、前記作業時間を予測する第１予測モデルを生成する予測モデル生成処理と、前記予測モデル生成処理によって生成された第１予測モデルに前記第１学習データを入力することにより、前記作業時間の第１予測値と、前記作業員変数が前記作業ごとの前記第１予測値の統計量に与える影響の度合いを示す第１影響度とを、前記作業ごとに算出する算出処理と、前記作業員ごとの前記複数の作業の各々の第１影響度の分布に基づいて、前記複数の作業員の作業速度を抽象化した抽象化変数を前記作業ごとに生成し、前記作業ごとに、前記目的変数と、前記説明変数と、前記抽象化変数と、を有する第２学習データを生成する学習データ生成処理と、を実行することを特徴とする。 The data analyzer and the data analysis method which are one aspect of the invention disclosed in the present application are data analyzers including a processor for executing a program and a storage device for storing the program, and the plurality of processors are included. Acquisition process for acquiring the first learning data having an objective variable indicating the work time, an explanatory variable for the work environment, and a worker variable indicating whether or not the work is performed for each worker. And the prediction model generation process that generates the first prediction model that predicts the working time based on the first training data acquired by the acquisition process, and the first prediction model generated by the prediction model generation process. By inputting the first training data, the first predicted value of the working time and the first influence degree indicating the degree of influence of the worker variable on the statistic of the first predicted value for each work. Based on the calculation process calculated for each work and the distribution of the first influence degree of each of the plurality of works for each worker, an abstract variable that abstracts the work speed of the plurality of workers is obtained. It is characterized in that a training data generation process for generating a second training data having the objective variable, the explanatory variable, and the abstraction variable is executed for each work. To do.

本発明の代表的な実施の形態によれば、作業時間の予測精度の向上を図ることができる。前述した以外の課題、構成及び効果は、以下の実施例の説明により明らかにされる。 According to a typical embodiment of the present invention, it is possible to improve the accuracy of predicting the working time. Issues, configurations and effects other than those described above will be clarified by the description of the following examples.

図１は、データ分析システムのシステム構成例を示すブロック図である。FIG. 1 is a block diagram showing a system configuration example of a data analysis system. 図２は、データ分析装置のハードウェア構成例を示すブロック図である。FIG. 2 is a block diagram showing a hardware configuration example of the data analyzer. 図３は、業務実績データの一例を示す説明図である。FIG. 3 is an explanatory diagram showing an example of business performance data. 図４は、商品情報の一例を示す説明図である。FIG. 4 is an explanatory diagram showing an example of product information. 図５は、図１に示した予測モデル生成処理の詳細な処理手順例を示すフローチャートである。FIG. 5 is a flowchart showing a detailed processing procedure example of the prediction model generation processing shown in FIG. 図６は、作業員ＩＤ使用学習データの一例を示す説明図である。FIG. 6 is an explanatory diagram showing an example of worker ID use learning data. 図７は、作業員属性使用学習データの一例を示す説明図である。FIG. 7 is an explanatory diagram showing an example of worker attribute use learning data. 図８は、中間学習データの一例を示す説明図である。FIG. 8 is an explanatory diagram showing an example of intermediate learning data. 図９は、第１影響度テーブルの一例を示す説明図である。FIG. 9 is an explanatory diagram showing an example of the first influence degree table. 図１０は、全作業における影響度分布情報を示す説明図である。FIG. 10 is an explanatory diagram showing influence degree distribution information in all operations. 図１１は、抽象化変数使用学習データの一例を示す説明図である。FIG. 11 is an explanatory diagram showing an example of learning data using abstract variables. 図１２は、第２影響度テーブルの一例を示す説明図である。FIG. 12 is an explanatory diagram showing an example of the second influence degree table. 図１３は、説明変数の影響度分布例を示すグラフである。FIG. 13 is a graph showing an example of the influence distribution of the explanatory variables. 図１４は、作業員特徴使用学習データの一例を示す説明図である。FIG. 14 is an explanatory diagram showing an example of worker feature use learning data. 図１５は、予測モデル生成処理によって生成された作業員特徴使用予測モデルを用いて業務改善施策を生成する業務改善施策生成処理手順例を示すフローチャートである。FIG. 15 is a flowchart showing an example of a business improvement measure generation processing procedure for generating a business improvement measure using a worker feature usage prediction model generated by the prediction model generation processing. 図１６は、ステップＳ１５０４における各種予測モデルの使い分け処理手順例を示すフローチャートである。FIG. 16 is a flowchart showing an example of a processing procedure for properly using various prediction models in step S1504. 図１７は、作業員属性・特徴データの一例を示す説明図である。FIG. 17 is an explanatory diagram showing an example of worker attribute / feature data.

以下、添付図面を用いて、データ分析システムについて説明する。以降の説明では、対象となる業務が物流倉庫における商品ピッキング作業であり、予測モデルの予測対象はピッキング作業に要する作業時間である場合を取り上げる。なお、本実施例は作業実績データから予測モデルを生成し、それを用いて業務改善施策を生成するサービス全般に適用可能なものであり、説明するユースケースに限定されるものではない。 Hereinafter, the data analysis system will be described with reference to the attached drawings. In the following description, the case where the target work is the product picking work in the distribution warehouse and the prediction target of the prediction model is the work time required for the picking work will be taken up. It should be noted that this embodiment can be applied to all services that generate a prediction model from work performance data and generate business improvement measures using the prediction model, and is not limited to the use cases to be described.

＜データ分析システムのシステム構成例＞
図１は、データ分析システムのシステム構成例を示すブロック図である。データ分析システム１００は、業務システム１０１と、データ分析装置１０２と、を有する。業務システム１０１およびデータ分析装置１０２は、ＬｏｃａｌＡｒｅａＮｅｔｗｏｒｋ（；ＬＡＮ）、ＷｉｄｅＡｒｅａＮｅｔｗｏｒｋ（ＷＡＮ）、インターネットのようなネットワークを介して相互に通信可能に接続される。 <System configuration example of data analysis system>
FIG. 1 is a block diagram showing a system configuration example of a data analysis system. The data analysis system 100 includes a business system 101 and a data analysis device 102. The business system 101 and the data analysis device 102 are communicably connected to each other via a network such as Local Area Network (; LAN), Wide Area Network (WAN), and the Internet.

業務システム１０１は、一つ以上の計算機を有しており、外部から入力されるなどした業務計画の実施及び実施結果を業務実績データとして、業務実績ＤＢ１１０に蓄積する。データ分析装置１０２は、モデル生成処理１２１と、業務改善施策生成処理１２２と、を実行する。 The business system 101 has one or more calculators, and stores the execution and execution results of the business plan input from the outside as business performance data in the business performance DB 110. The data analysis device 102 executes the model generation process 121 and the business improvement measure generation process 122.

モデル生成処理１２１は、業務システム１０１から業務内容及び業務結果等を含む業務実績データを業務実績ＤＢ１１０から取得して、予測モデルを生成する処理である。業務改善施策生成処理１２２は、あらたな業務実績データを、モデル生成処理１２１によって生成された予測モデルに与えることにより、業務評価指標の改善を実現するための業務改善施策を生成する処理である。 The model generation process 121 is a process of acquiring business performance data including business contents and business results from the business system 101 from the business performance DB 110 and generating a prediction model. The business improvement measure generation process 122 is a process of generating a business improvement measure for realizing improvement of the business evaluation index by giving new business performance data to the prediction model generated by the model generation process 121.

業務評価指標とは、ＫｅｙＰｅｒｆｏｒｍａｎｃｅＩｎｄｉｃａｔｏｒ（ＫＰＩ）等の業務を評価するための指標である。業務実績データは、業務に関連するパラメータである属性を複数含む。属性は、たとえば、タイムスタンプ、性別、金額、および作業時間のような特徴を含む。 The business evaluation index is an index for evaluating business such as Key Performance Indicator (KPI). Business performance data includes a plurality of attributes that are parameters related to business. Attributes include characteristics such as timestamp, gender, amount, and working hours.

業務システム１０１は、データ分析装置１０２から受け取った業務改善施策をもとに、次に実施される業務計画の変更を行い、再び業務の実施及び、業務実績データの蓄積を行う。データ分析システム１００は、以上で述べた業務実施と業務改善のサイクルを継続的に実行する。 Based on the business improvement measures received from the data analysis device 102, the business system 101 changes the business plan to be implemented next, executes the business again, and accumulates the business performance data. The data analysis system 100 continuously executes the cycle of business execution and business improvement described above.

＜データ分析装置１０２のハードウェア構成例＞
図２は、データ分析装置１０２のハードウェア構成例を示すブロック図である。データ分析装置１０２は、プロセッサ２０１と、記憶デバイス２０２と、入力デバイス２０３と、出力デバイス２０４と、通信インターフェース（通信ＩＦ）２０５と、を有する。プロセッサ２０１、記憶デバイス２０２、入力デバイス２０３、出力デバイス２０４、および通信ＩＦ２０５は、バス２０６により接続される。プロセッサ２０１は、データ分析装置１０２を制御する。記憶デバイス２０２は、プロセッサ２０１の作業エリアとなる。また、記憶デバイス２０２は、モデル生成処理１２１および業務改善施策生成処理１２２を実行する各種プログラムやデータを記憶する非一時的なまたは一時的な記録媒体である。記憶デバイス２０２としては、たとえば、ＲＯＭ（ＲｅａｄＯｎｌｙＭｅｍｏｒｙ）、ＲＡＭ（ＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ）、ＨＤＤ（ＨａｒｄＤｉｓｋＤｒｉｖｅ）、フラッシュメモリがある。入力デバイス２０３は、データを入力する。入力デバイス２０３としては、たとえば、キーボード、マウス、タッチパネル、テンキー、スキャナがある。出力デバイス２０４は、データを出力する。出力デバイス２０４としては、たとえば、ディスプレイ、プリンタがある。通信ＩＦ２０５は、ネットワークと接続し、データを送受信する。 <Hardware configuration example of data analyzer 102>
FIG. 2 is a block diagram showing a hardware configuration example of the data analysis device 102. The data analyzer 102 includes a processor 201, a storage device 202, an input device 203, an output device 204, and a communication interface (communication IF) 205. The processor 201, the storage device 202, the input device 203, the output device 204, and the communication IF 205 are connected by the bus 206. The processor 201 controls the data analyzer 102. The storage device 202 serves as a work area for the processor 201. Further, the storage device 202 is a non-temporary or temporary recording medium that stores various programs and data that execute the model generation process 121 and the business improvement measure generation process 122. Examples of the storage device 202 include a ROM (Read Only Memory), a RAM (Random Access Memory), an HDD (Hard Disk Drive), and a flash memory. The input device 203 inputs data. The input device 203 includes, for example, a keyboard, a mouse, a touch panel, a numeric keypad, and a scanner. The output device 204 outputs data. The output device 204 includes, for example, a display and a printer. The communication IF205 connects to the network and transmits / receives data.

記憶デバイス２０２は、具体的には、たとえば、業務実績データ２２１、商品情報２２２、作業員ＩＤ使用学習データ２２３、作業員属性使用学習データ２２４、中間学習データ２２５、第１影響度テーブル２２６、抽象化変数使用学習データ２２７、第２影響度テーブル２２８、作業員特徴使用学習データ２２９、作業員属性・特徴データ２３０、作業員ＩＤ使用予測モデル２３１、作業員属性使用予測モデル２３２、中間予測モデル２３３、抽象化変数使用予測モデル２３４、作業員特徴使用予測モデル２３５を記憶する。 Specifically, the storage device 202 includes, for example, business performance data 221 and product information 222, worker ID usage learning data 223, worker attribute usage learning data 224, intermediate learning data 225, first influence degree table 226, and abstraction. Abstract variable usage learning data 227, second influence degree table 228, worker feature usage learning data 229, worker attribute / feature data 230, worker ID usage prediction model 231, worker attribute usage prediction model 232, intermediate prediction model 233 , Abstract variable usage prediction model 234, worker feature usage prediction model 235 are stored.

業務実績データ２２１は、業務実績ＤＢ１１０から抽出されデータ分析装置１０２が受け取ったデータである。業務実績データ２２１は、モデル生成処理１２１による予測モデルの生成に用いられたり、業務改善施策生成処理１２２による業務改善施策の生成に用いられたりする。業務実績データ２２１の詳細は、図３で後述する。 The business record data 221 is data extracted from the business record DB 110 and received by the data analyzer 102. The business performance data 221 is used to generate a prediction model by the model generation process 121, or is used to generate a business improvement measure by the business improvement measure generation process 122. Details of the business performance data 221 will be described later with reference to FIG.

商品情報２２２は、商品の属性を示す情報を記憶する。商品情報の詳細は、図４で後述する。 The product information 222 stores information indicating the attributes of the product. Details of the product information will be described later in FIG.

作業員ＩＤ使用学習データ２２３は、目的変数を作業時間、説明変数を作業員の移動距離やピッキングした物品の数量や重量、体積などとして規定される作業を示す学習データであって、説明変数としてさらに作業員ＩＤ変数が付加された学習データである。作業員ＩＤ変数とは、作業員ＩＤで特定される作業員が対象となる作業を行ったか否かを示す変数である。作業員ＩＤ使用学習データの詳細は、図６で後述する。 The worker ID use learning data 223 is learning data indicating a work in which the objective variable is defined as the working time, the explanatory variable is the moving distance of the worker, the quantity, weight, volume, etc. of the picked article, and is used as the explanatory variable. Further, it is learning data to which a worker ID variable is added. The worker ID variable is a variable indicating whether or not the worker specified by the worker ID has performed the target work. The details of the worker ID use learning data will be described later with reference to FIG.

作業員属性使用学習データ２２４は、目的変数を作業時間、説明変数を作業員の移動距離やピッキングした物品の数量や重量、体積などとして規定される作業を示す学習データであって、説明変数としてさらに作業員属性変数が付加された学習データである。作業員属性変数とは、当該作業員の経験日数などの作業員の属性を示す変数である。作業員属性使用学習データには、作業員ＩＤ変数は含まれない。作業員属性使用学習データの詳細は、図７で後述する。 The worker attribute use learning data 224 is learning data indicating a work in which the objective variable is defined as the working time, the explanatory variable is the moving distance of the worker, the quantity, weight, volume, etc. of the picked article, and is used as the explanatory variable. It is learning data to which worker attribute variables are added. The worker attribute variable is a variable indicating the attribute of the worker such as the number of days of experience of the worker. The worker attribute use learning data does not include the worker ID variable. The details of the worker attribute use learning data will be described later in FIG.

中間学習データ２２５は、抽象化変数使用学習データ２２７を生成するための中間的な学習データである。中間学習データの詳細は、図８で後述する。 The intermediate learning data 225 is intermediate learning data for generating the learning data 227 using abstract variables. Details of the intermediate learning data will be described later with reference to FIG.

第１影響度テーブル２２６は、説明変数、作業員属性変数、および作業員ＩＤ変数の各影響度を作業別に規定したテーブルである。第１影響度テーブル２２６の詳細は、図９および図１０で後述する。 The first influence degree table 226 is a table in which each influence degree of the explanatory variable, the worker attribute variable, and the worker ID variable is defined for each work. Details of the first impact table 226 will be described later in FIGS. 9 and 10.

抽象化変数使用学習データ２２７は、目的変数を作業時間、説明変数を作業員の移動距離やピッキングした物品の数量や重量、体積などとして規定される作業を示す学習データであって、説明変数としてさらに作業員属性変数および作業員ＩＤ抽象化変数が追加された学習データである。すなわち、作業員属性使用学習データに作業員ＩＤ抽象化変数が追加された学習データである。 The abstract variable use learning data 227 is learning data indicating a work in which the objective variable is defined as the working time, the explanatory variable is the moving distance of the worker, the quantity, weight, volume, etc. of the picked article, and is used as the explanatory variable. Furthermore, it is the training data to which the worker attribute variable and the worker ID abstraction variable are added. That is, it is the learning data in which the worker ID abstraction variable is added to the worker attribute use learning data.

作業員ＩＤ抽象化変数とは、作業員ＩＤで特定される作業員を作業員ＩＤで特定せずに「作業が遅い」や「作業が速い」のような表現で抽象化した変数である。抽象化変数使用学習データの詳細は、図１１で後述する。 The worker ID abstract variable is a variable that is abstracted by expressions such as "work is slow" or "work is fast" without specifying the worker specified by the worker ID by the worker ID. Details of the learning data using abstract variables will be described later in FIG.

第２影響度テーブル２２８は、モデル生成処理１２１によって抽象化変数使用学習データ２２７を用いて生成された予測モデルに、改善対象の業務実績データ２２１を入力した結果出力されるデータテーブルである。第２影響度テーブル２２８の詳細は、図１２および図１３で後述する。 The second influence degree table 228 is a data table output as a result of inputting the business performance data 221 to be improved into the prediction model generated by the model generation process 121 using the abstract variable use learning data 227. Details of the second impact table 228 will be described later in FIGS. 12 and 13.

作業員特徴使用学習データ２２９は、抽象化変数使用学習データ２２７に、作業員特徴変数が追加された学習データである。作業員特徴変数とは、たとえば、「棚上段が得意」のように作業員の特徴を示す変数である。作業員特徴使用学習データ２２９の詳細は、図１４で後述する。 The worker feature use learning data 229 is learning data in which a worker feature variable is added to the abstraction variable use learning data 227. The worker characteristic variable is a variable that indicates the characteristics of the worker, for example, "I am good at the upper shelf". Details of the worker feature use learning data 229 will be described later with reference to FIG.

作業員属性・特徴データ２３０は、各作業員の作業員ＩＤに対して、作業員属性や作業員特徴がリスト化されたデータである。作業員属性・特徴データ２３０の詳細は、図１７で後述する。 The worker attribute / feature data 230 is data in which the worker attributes and worker characteristics are listed for the worker ID of each worker. Details of the worker attribute / feature data 230 will be described later with reference to FIG.

作業員ＩＤ使用予測モデル２３１は、モデル生成処理１２１によって作業員ＩＤ使用学習データ２２３（図６）を用いて生成された、作業時間を予測する予測モデルである。 The worker ID use prediction model 231 is a prediction model for predicting the work time, which is generated by the model generation process 121 using the worker ID use learning data 223 (FIG. 6).

作業員属性使用予測モデル２３２は、モデル生成処理１２１によって作業員属性使用学習データ２２４（図７）を用いて生成された、作業時間を予測する予測モデルである。 The worker attribute use prediction model 232 is a prediction model for predicting the work time, which is generated by the model generation process 121 using the worker attribute use learning data 224 (FIG. 7).

中間予測モデル２３３は、モデル生成処理１２１によって中間学習データ２２５（図８）を用いて生成された、作業時間を予測する中間的な予測モデルである。 The intermediate prediction model 233 is an intermediate prediction model for predicting the working time, which is generated by the model generation process 121 using the intermediate learning data 225 (FIG. 8).

抽象化変数使用予測モデル２３４は、モデル生成処理１２１によって抽象化変数使用学習データ２２７（図１１）を用いて生成された、作業時間を予測する予測モデルである。 The abstract variable use prediction model 234 is a prediction model for predicting the working time, which is generated by the model generation process 121 using the abstract variable use learning data 227 (FIG. 11).

作業員特徴使用予測モデル２３５は、モデル生成処理１２１によって作業員特徴使用学習データ２２９を用いて生成された、作業時間を予測する予測モデルである。 The worker feature use prediction model 235 is a prediction model for predicting the work time, which is generated by the model generation process 121 using the worker feature use learning data 229.

＜業務実績データ２２１＞
図３は、業務実績データ２２１の一例を示す説明図である。業務実績データ２２１は、作業ＩＤ３０１と、枝番３０２と、作業員ＩＤ３０３と、商品コード３０４と、場所コード３０５と、個数３０６と、作業開始時刻３０７と、作業終了時刻３０８と、を有する。作業員ＩＤ３０３、商品コード３０４、場所コード３０５、および個数３０６は、商品の配置、取り扱う商品、作業員の構成といった作業環境を示す。 <Business performance data 221>
FIG. 3 is an explanatory diagram showing an example of business performance data 221. The work record data 221 has a work ID 301, a branch number 302, a worker ID 303, a product code 304, a place code 305, a number 306, a work start time 307, and a work end time 308. The worker ID 303, the product code 304, the location code 305, and the number 306 indicate a work environment such as the arrangement of products, the products to be handled, and the composition of workers.

作業ＩＤ３０１は、作業を一意に特定する識別情報であり、作業員がピッキングを行う作業業務の１単位を示す。たとえば、同一の発送先の商品が対象となる一連のピッキング作業が該当する。 The work ID 301 is identification information that uniquely identifies the work, and indicates one unit of the work work that the worker picks. For example, a series of picking operations for products of the same shipping destination are applicable.

枝番３０２は、作業ＩＤ３０１で特定される作業を作業員ＩＤ別または商品コード別に細分化した番号である。以下、作業ＩＤ３０１をｘ（ｘは数字）またはｘ−ｙ（ｙは枝番３０２の数字）で表記する。作業ＩＤ３０１をｘのみで表記した場合は、その作業ＩＤ３０１の全枝番３０２を含むエントリが対象となり、作業ＩＤ３０１をｘ−ｙで表記した場合は、その作業ＩＤ３０１：ｘの枝番３０２：ｙのエントリが対象となる。 The branch number 302 is a number obtained by subdividing the work specified by the work ID 301 by the worker ID or the product code. Hereinafter, the work ID 301 is represented by x (x is a number) or xy (y is a number of the branch number 302). When the work ID 301 is represented by x only, the entry including all the branch numbers 302 of the work ID 301 is targeted, and when the work ID 301 is represented by xy, the branch number 302: y of the work ID 301: x Entry is targeted.

作業員ＩＤ３０３は、作業員を一意に特定する識別情報である。商品コード３０４は、作業員ＩＤ３０３で特定される作業員がピッキングした商品を一意に特定する識別情報である。場所コード３０５は、商品コード３０４で特定される商品が収納されている場所を示す識別情報である。場所コード３０５は、たとえば、ａ列−ｂ連−ｃ段（ａ、ｂ、ｃは文字列）で表記され、ａ列の棚のｂ連（奥行き）のｃ段（上、中、下などの棚の高さ）を意味する。 The worker ID 303 is identification information that uniquely identifies the worker. The product code 304 is identification information that uniquely identifies the product picked by the worker specified by the worker ID 303. The place code 305 is identification information indicating a place where the product specified by the product code 304 is stored. The place code 305 is represented by, for example, a row-b series-c stage (a, b, c are character strings), and the c stage (top, middle, bottom, etc.) of the b series (depth) of the shelf in row a. It means the height of the shelf).

個数３０６は、商品コード３０４で特定される商品をピッキングした数である。作業開始時刻３０７は、作業ＩＤ３０１：ｘ−ｙで特定される作業を開始した日付時刻である。作業終了時刻３０８は、作業ＩＤ３０１：ｘ−ｙで特定される作業を終了した日付時刻である。 The number 306 is the number of picked products specified by the product code 304. The work start time 307 is a date and time when the work specified by the work ID 301: xy is started. The work end time 308 is a date and time when the work specified by the work ID 301: xy is completed.

＜商品情報＞
図４は、商品情報の一例を示す説明図である。商品情報２２２は、商品コード３０４と、１個当たりの重量４０１と、１個当たりの体積４０２と、を有する。１個当たりの重量４０１は、商品コード３０４で特定される商品１個当たりの重さである。１個当たりの体積４０２は、商品コード３０４で特定される商品１個当たりの体積である。 <Product information>
FIG. 4 is an explanatory diagram showing an example of product information. The product information 222 has a product code 304, a weight 401 per piece, and a volume 402 per piece. The weight 401 per product is the weight per product specified by the product code 304. The volume 402 per product is the volume per product specified by the product code 304.

＜モデル生成処理１２１＞
図５は、図１に示したモデル生成処理１２１の詳細な処理手順例を示すフローチャートである。モデル生成処理１２１とは、業務実績データ２２１から学習データを加工、生成し、作業時間を予測する予測モデルを生成する処理である。 <Model generation process 121>
FIG. 5 is a flowchart showing a detailed processing procedure example of the model generation processing 121 shown in FIG. The model generation process 121 is a process of processing and generating learning data from the business performance data 221 to generate a prediction model for predicting the work time.

データ分析装置１０２は、モデル生成処理１２１を起動し（ステップＳ５０１）、記憶デバイス２０２から予測モデルを生成する際の学習データの対象期間分の業務実績データ２２１を取得する（ステップＳ５０２）。つぎに、データ分析装置１０２は、業務実績データ２２１を加工して、作業員ＩＤ使用学習データ２２３を生成する（ステップＳ５０３）。これにより、業務実績データ２２１は、作業員ＩＤ使用学習データ２２３に変換される。作業員ＩＤ使用学習データ２２３の一例を図６に示す。 The data analysis device 102 activates the model generation process 121 (step S501), and acquires the business performance data 221 for the target period of the learning data when the prediction model is generated from the storage device 202 (step S502). Next, the data analysis device 102 processes the work performance data 221 to generate the worker ID use learning data 223 (step S503). As a result, the work performance data 221 is converted into the worker ID use learning data 223. An example of the worker ID use learning data 223 is shown in FIG.

［作業員ＩＤ使用学習データ２２３］
図６は、作業員ＩＤ使用学習データ２２３の一例を示す説明図である。作業員ＩＤ使用学習データ２２３は、作業ＩＤ３０１と、目的変数６０１と、説明変数６０２と、作業員ＩＤ変数６０３と、を有する。目的変数６０１は、予測対象の値であり、本例では作業時間に該当する。作業時間は、同一の作業ＩＤ３０１のエントリの作業終了時刻３０８から作業開始時刻３０７を引いた時間である。枝番３０２により作業ＩＤ３０１に複数のエントリが存在する場合、各エントリの作業時間の総和が作業時間６０１となる。 [Worker ID usage learning data 223]
FIG. 6 is an explanatory diagram showing an example of the worker ID use learning data 223. The worker ID use learning data 223 has a work ID 301, an objective variable 601, an explanatory variable 602, and a worker ID variable 603. The objective variable 601 is a value to be predicted, and corresponds to the working time in this example. The work time is the time obtained by subtracting the work start time 307 from the work end time 308 of the entry of the same work ID 301. When a plurality of entries exist in the work ID 301 according to the branch number 302, the total work time of each entry is the work time 601.

説明変数６０２は、業務実績データ２２１における各種の値を集計して生成された変数の集合であり、当該作業ＩＤ３０１のエントリの特徴量を示す。具体的には、たとえば、説明変数６０２は、移動距離６２１と、数量６２２と、重量６２３と、体積６２４と、棚上段６２５と、棚中段６２６と、棚下段６２７と、を有する。移動距離６２１は、作業ＩＤ３０１で特定される作業をした作業員が当該作業において移動した距離である。データ分析装置１０２は、場所コード３０５に示した位置情報を含む倉庫内の地図情報を有しており、ピッキングした場所間の道なり距離を算出可能である。 The explanatory variable 602 is a set of variables generated by aggregating various values in the business performance data 221 and indicates the feature amount of the entry of the work ID 301. Specifically, for example, the explanatory variable 602 has a movement distance 621, a quantity 622, a weight 623, a volume 624, a shelf upper tier 625, a shelf middle tier 626, and a shelf lower tier 627. The movement distance 621 is the distance traveled by the worker who performed the work specified by the work ID 301 in the work. The data analysis device 102 has map information in the warehouse including the location information indicated by the location code 305, and can calculate the road distance between the picked locations.

数量６２２は、作業ＩＤ３０１で特定される作業でピッキングされた商品の数または量である。重量６２３は、作業ＩＤ３０１で特定される作業でピッキングされた商品の数量６２２分の重さである。体積６２４は、作業ＩＤ３０１で特定される作業でピッキングされた商品の数量６２２分の大きさである。棚上段６２５は、棚の上段からピッキングした商品の数量である。棚中段６２６は、棚の中段からピッキングした商品の数量である。棚下段６２７は、棚の下段からピッキングした商品の数量である。作業ＩＤ３０１のエントリにおいて、棚上段６２５〜棚下段６２７の値の合計が、数量６２２となる。 The quantity 622 is the number or quantity of goods picked in the work specified by the work ID 301. The weight 623 is a weight of 622 minutes of the quantity of goods picked in the work specified by the work ID 301. The volume 624 is the size of the quantity of goods picked by the work specified by the work ID 301, which is 622 minutes. The upper shelf 625 is the quantity of products picked from the upper shelf. The middle shelf 626 is the quantity of products picked from the middle shelf. The lower shelf 627 is the quantity of products picked from the lower shelf. In the entry of the work ID 301, the total of the values of the upper shelf 625 to the lower shelf 627 is the quantity 622.

なお、データ分析装置１０２は、図示しない在庫管理システムと通信可能であり、在庫管理システムは、商品の１個当たりの重量や体積、収納されている場所コード３０５を管理している。したがって、データ分析装置１０２は、在庫管理システムにアクセスすることで、商品ごとに数量６２２〜棚下段６２７を取得可能である。 The data analysis device 102 can communicate with an inventory management system (not shown), and the inventory management system manages the weight and volume of each product and the storage location code 305. Therefore, the data analysis device 102 can acquire the quantity 622 to the lower shelf 627 for each product by accessing the inventory management system.

作業員ＩＤ変数６０３は、その作業ＩＤ３０１で特定される作業を、作業員ＩＤ３０３（図６のＩＤ１、ＩＤ２、…）で特定される作業員のうちどの作業員がおこなったかを示す変数である。「ＩＤ＃」（＃は数字）は、作業員ＩＤ３０３を示す。たとえば、作業員ＩＤ変数６０３が「１」であれば、その作業ＩＤ３０１で特定される作業を、その作業員ＩＤ３０３で特定される作業員が行ったことを示す。作業員ＩＤ変数６０３が「０」であれば、その作業ＩＤ３０１で特定される作業を、その作業員ＩＤ３０３で特定される作業員が行っていないことを示す。ここで、図６の説明を終了する。 The worker ID variable 603 is a variable indicating which of the workers specified by the worker ID 303 (ID1, ID2, ... In FIG. 6) has performed the work specified by the work ID 301. “ID #” (where # is a number) indicates worker ID 303. For example, if the worker ID variable 603 is "1", it indicates that the work specified by the work ID 301 has been performed by the worker specified by the worker ID 303. If the worker ID variable 603 is "0", it indicates that the worker specified by the worker ID 303 is not performing the work specified by the work ID 301. Here, the description of FIG. 6 ends.

図５に戻り、データ分析装置１０２は、作業員ＩＤ使用学習データ２２３を用いて、作業員ＩＤ使用予測モデル２３１を生成する（ステップＳ５０４）。この処理は、作業員ＩＤ使用予測モデル２３１を、目的変数６０１である作業時間を予測する回帰モデルとして機械学習によって生成する処理であり、機械学習アルゴリズムの手法自体は問わない。 Returning to FIG. 5, the data analysis device 102 generates the worker ID use prediction model 231 using the worker ID use learning data 223 (step S504). This process is a process of generating the worker ID usage prediction model 231 as a regression model for predicting the working time, which is the objective variable 601 by machine learning, and the method itself of the machine learning algorithm does not matter.

この機械学習アルゴリズムの手法は、たとえば、説明変数６０２と作業員ＩＤ変数６０３に適当な係数を掛け合わせそれらの総和を予測値とし、学習データの目的変数６０１と予測値との二乗誤差が最小になるような各係数を求める線形回帰式を生成する手法でもよい。また、機械学習アルゴリズムの手法は、ＤＮＮ（ＤｅｅｐＮｅｕｒａｌＮｅｔｗｏｒｋ）などによる非線形な表現によって予測モデルを生成するような手法でもよい。このように、作業員ＩＤ使用予測モデル２３１を生成することで、学習データに十分な作業実績のサンプルを持つ作業員の作業について、様々な説明変数６０２の値に対応した作業時間の予測値の高精度化を図ることができる。 In this machine learning algorithm method, for example, the explanatory variable 602 and the worker ID variable 603 are multiplied by an appropriate coefficient to obtain the sum of them as the predicted value, and the square error between the objective variable 601 of the training data and the predicted value is minimized. It may be a method of generating a linear regression equation for obtaining each coefficient such that. Further, the method of the machine learning algorithm may be a method of generating a prediction model by a non-linear expression such as DNN (Deep Neural Network). In this way, by generating the worker ID usage prediction model 231, for the work of the worker who has a sample of the work results sufficient for the learning data, the predicted value of the work time corresponding to the values of various explanatory variables 602. High accuracy can be achieved.

つぎに、データ分析装置１０２は、業務実績データ２２１を加工して、作業員属性使用学習データ２２４を生成する（ステップＳ５０５）。図７に作業員属性使用学習データ２２４の一例を示す。 Next, the data analysis device 102 processes the work performance data 221 to generate the worker attribute use learning data 224 (step S505). FIG. 7 shows an example of the worker attribute use learning data 224.

［作業員属性使用学習データ２２４］
図７は、作業員属性使用学習データ２２４の一例を示す説明図である。作業員属性使用学習データ２２４は、作業員属性変数７００を使用した学習データである。具体的には、たとえば、作業員属性使用学習データ２２４は、作業ＩＤ３０１と、目的変数６０１と、説明変数６０２と、作業員属性変数７００と、を有する。作業員属性変数７００とは、作業員の属性を示す変数である。 [Worker attribute use learning data 224]
FIG. 7 is an explanatory diagram showing an example of worker attribute use learning data 224. The worker attribute use learning data 224 is learning data using the worker attribute variable 700. Specifically, for example, the worker attribute use learning data 224 has a work ID 301, an objective variable 601, an explanatory variable 602, and a worker attribute variable 700. The worker attribute variable 700 is a variable indicating the attributes of the worker.

作業員属性変数７００は、業務実績データ２２１に格納されている情報を作業員属性変数７００として利用してもよく、新たに作成された変数でもよい。前者の場合、作業員の年齢や性別、身長、作業員のレベルが例として挙げられる。後者については、図７にあるような経験日数７０１や付近の作業員の経験日数７０２が例として挙げられる。 The worker attribute variable 700 may use the information stored in the work performance data 221 as the worker attribute variable 700, or may be a newly created variable. In the former case, the age, gender, height, and worker level of the worker are examples. As for the latter, examples include 701 days of experience as shown in FIG. 7 and 702 days of experience of nearby workers.

経験日数７０１は、作業員が分析対象の業務に携わってからどれくらいの日数が経過しているかを示す作業員属性変数７００であり、たとえば過去の全期間に渡る業務実績データ２２１から、対象のエントリの作業員ＩＤ３０３が存在する最も古い作業日時を基点日時とし、基点から該当するエントリの作業日時までの日数を計算することで得られる。 The number of days of experience 701 is a worker attribute variable 700 indicating how many days have passed since the worker engaged in the work to be analyzed. For example, the target entry is obtained from the work performance data 221 over the entire past period. It is obtained by calculating the number of days from the base point to the work date and time of the corresponding entry, using the oldest work date and time in which the worker ID 303 exists as the base point date and time.

また、作業員ＩＤ３０３それぞれに経験日数７０１の初期値を外部から与えられるようにし、その初期値を基点日時にして対象のエントリでの経験日数を計算するようにしてもよい。これによって、業務実績データ２２１を収集する以前から業務に携わっているような作業員の経験日数７０１を正確に反映することができる。 Further, the initial value of the number of experience days 701 may be given to each worker ID 303 from the outside, and the number of days of experience in the target entry may be calculated using the initial value as the base point date and time. As a result, it is possible to accurately reflect the number of days of experience 701 of a worker who has been engaged in the work before collecting the work performance data 221.

経験日数７０１は日数に限らず、分単位、時間単位、月単位で表現してもよい。さらに、変数の型は連続値に限らず、分類値（たとえば、経験１ヶ月未満、経験６ヶ月以上、など）を用いてもよい。これによって、より業務の特性に特化した経験値を反映した変数とすることができる。 The number of days of experience 701 is not limited to the number of days, and may be expressed in minutes, hours, or months. Further, the type of the variable is not limited to the continuous value, and a classification value (for example, less than 1 month of experience, 6 months or more of experience, etc.) may be used. As a result, it is possible to make a variable that reflects the experience value that is more specialized in the characteristics of the business.

付近の作業員の経験日数７０２は、該当するエントリの作業を実施する際に、近く（たとえば、半径１０メートル以内などの所定範囲）に存在する作業員の経験日数７０１を反映した作業員属性変数７００である。具体的には、たとえば、データ分析装置１０２は、業務実績データ２２１の場所コード３０５、作業開始時刻３０７および作業終了時刻３０８を参照し、ピッキング対象の商品が近い場所に存在し作業時間帯が重なっている別の作業を担当する作業員の経験日数７０１を算出する。この算出結果が、付近の作業員の経験日数７０２となる。 The number of experience days 702 of a nearby worker is a worker attribute variable that reflects the number of experience days 701 of a nearby worker (for example, within a predetermined range such as within a radius of 10 meters) when performing the work of the corresponding entry. It is 700. Specifically, for example, the data analyzer 102 refers to the location code 305, the work start time 307, and the work end time 308 of the business performance data 221, and the products to be picked exist in close locations and the work time zones overlap. Calculate the number of days of experience 701 of the worker in charge of another work. The result of this calculation is the number of days of experience of nearby workers 702.

当該別の作業員が複数存在する場合は、データ分析装置１０２は、各別の作業員の経験日数７０１の平均値、中央値、最大値、最小値といった統計値を算出することにより、付近の作業員の経験日数７０２に設定する。特に、最小値が選択された場合、付近に新人の作業員が存在し、業務の支援をするような状況を反映できると考えられる。ここで、図６の説明を終了する。 When there are a plurality of the other workers, the data analyzer 102 calculates statistical values such as the average value, the median value, the maximum value, and the minimum value of the number of days of experience 701 of each different worker in the vicinity. The number of days of experience of the worker is set to 702. In particular, when the minimum value is selected, it is considered that a situation in which a new worker exists in the vicinity and supports the work can be reflected. Here, the description of FIG. 6 ends.

図５に戻り、データ分析装置１０２は、作業員属性使用学習データ２２４を用いて、作業員属性使用予測モデル２３２を生成する（ステップＳ５０６）。この処理は、ステップＳ５０４の作業員ＩＤ使用予測モデル２３１と同様、作業員属性使用予測モデル２３２を、目的変数６０１である作業時間を予測する回帰モデルとして機械学習によって生成する処理であり、機械学習アルゴリズムの手法は、ステップＳ５０４と同様の手法となる。 Returning to FIG. 5, the data analysis device 102 generates the worker attribute use prediction model 232 using the worker attribute use learning data 224 (step S506). Similar to the worker ID usage prediction model 231 in step S504, this process is a process of generating the worker attribute usage prediction model 232 as a regression model for predicting the working time, which is the objective variable 601 by machine learning. The algorithm method is the same as in step S504.

このように、作業員属性使用予測モデル２３２を生成することで、作業員ＩＤ３０３ではなく、作業員が持つ作業員属性を説明変数とした普遍的な予測モデルを得ることができる。これによって、学習データ内に実績が存在しない作業員の作業を予測するような場合において、その作業員の属性情報を説明変数として与えることで、精度良く作業時間を予測することができる。なお、作業員属性使用予測モデル２３２は、後述する図１６で使用される。 By generating the worker attribute usage prediction model 232 in this way, it is possible to obtain a universal prediction model in which the worker attribute possessed by the worker is used as an explanatory variable instead of the worker ID 303. As a result, when the work of a worker whose actual result does not exist in the learning data is predicted, the work time can be predicted accurately by giving the attribute information of the worker as an explanatory variable. The worker attribute usage prediction model 232 is used in FIG. 16, which will be described later.

つぎに、データ分析装置１０２は、中間学習データ２２５を生成する（ステップＳ５０７）。図８に中間学習データ２２５の一例を示す。 Next, the data analyzer 102 generates the intermediate learning data 225 (step S507). FIG. 8 shows an example of the intermediate learning data 225.

［中間学習データ２２５］
図８は、中間学習データ２２５の一例を示す説明図である。中間学習データ２２５は、図７の作業員属性使用学習データ２２４に、図６の作業員ＩＤ変数６０３が追加された学習データである。まず、図８に示すように、データ分析装置１０２は、中間学習データ２２５を生成する。ここで、図８の説明を終了する。 [Intermediate learning data 225]
FIG. 8 is an explanatory diagram showing an example of the intermediate learning data 225. The intermediate learning data 225 is learning data in which the worker ID variable 603 of FIG. 6 is added to the worker attribute use learning data 224 of FIG. First, as shown in FIG. 8, the data analyzer 102 generates the intermediate learning data 225. Here, the description of FIG. 8 ends.

図５に戻り、データ分析装置１０２は、中間学習データ２２５を用いて、中間予測モデル２３３を生成する（ステップＳ５０８）。この処理は、ステップＳ５０４の作業員ＩＤ使用予測モデル２３１と同様、中間予測モデル２３３を、目的変数６０１である作業時間を予測する回帰モデルとして機械学習によって生成する処理であり、機械学習アルゴリズムの手法は、ステップＳ５０４と同様の手法となる。 Returning to FIG. 5, the data analyzer 102 uses the intermediate learning data 225 to generate the intermediate prediction model 233 (step S508). Similar to the worker ID usage prediction model 231 in step S504, this process is a process of generating the intermediate prediction model 233 as a regression model for predicting the working time, which is the objective variable 601. Is the same method as in step S504.

つぎに、データ分析装置１０２は、中間予測モデル２３３を用いて、中間学習データ２２５の全作業に対して、各々の作業が入力である場合の予測結果に対する各変数の影響度（第１影響度）を算出する（ステップＳ５０９）。具体的には、たとえば、データ分析装置１０２は、第１影響度テーブル２２６を作成する。図９に第１影響度テーブル２２６の一例を示す。 Next, the data analyzer 102 uses the intermediate prediction model 233 to influence the influence of each variable on the prediction result when each work is an input (first influence degree) for all the works of the intermediate learning data 225. ) Is calculated (step S509). Specifically, for example, the data analyzer 102 creates the first impact table 226. FIG. 9 shows an example of the first influence degree table 226.

［第１影響度テーブル２２６］
図９は、第１影響度テーブル２２６の一例を示す説明図である。第１影響度テーブル２２６は、全作業において、中間学習データ２２５で生成された予測モデルを用いて算出された予測値に対する各変数の影響度（第１影響度）をテーブルの形式で例示した情報である。具体的には、たとえば、第１影響度テーブル２２６は、作業ＩＤ３０１と、目的変数６０１と、説明変数６０２の影響度９０１と、作業員属性変数７００の影響度９０２と、作業員ＩＤ変数６０３の影響度９０３と、予測値９０４と、を有する。 [First influence table 226]
FIG. 9 is an explanatory diagram showing an example of the first influence degree table 226. The first influence degree table 226 is information exemplifying the influence degree (first influence degree) of each variable on the predicted value calculated by using the prediction model generated by the intermediate learning data 225 in the form of a table in all the operations. Is. Specifically, for example, the first influence degree table 226 shows the work ID 301, the objective variable 601, the influence degree 901 of the explanatory variable 602, the influence degree 902 of the worker attribute variable 700, and the worker ID variable 603. It has an influence degree of 903 and a predicted value of 904.

予測値９０４は、作業ＩＤ３０１ごとに、図８の説明変数６０２、作業員属性変数７００、および作業員ＩＤ変数６０３を、中間学習データ２２５で生成された予測モデルに入力した場合の出力結果（作業時間）である。 The predicted value 904 is an output result (work) when the explanatory variable 602, the worker attribute variable 700, and the worker ID variable 603 of FIG. 8 are input to the prediction model generated by the intermediate learning data 225 for each work ID 301. Time).

説明変数６０２の影響度９０１、作業員属性変数７００の影響度９０２、および作業員ＩＤ変数６０３の影響度９０３は、算出された予測値９０４が、全作業の予測値９０４の平均値に対してどのような影響を各変数から受けて決定されているかを示した情報である。具体的には、たとえば、説明変数６０２の影響度９０１は、説明変数６０２の項目（移動距離６２１、数量６２２、重量６２３、体積６２４、棚上段６２５、棚中段６２６、棚下段６２７、…）ごとに影響度を有する。 The influence degree 901 of the explanatory variable 602, the influence degree 902 of the worker attribute variable 700, and the influence degree 903 of the worker ID variable 603 are such that the calculated predicted value 904 is the average value of the predicted values 904 of all works. It is information showing what kind of influence is received from each variable and determined. Specifically, for example, the degree of influence 901 of the explanatory variable 602 is set for each item of the explanatory variable 602 (movement distance 621, quantity 622, weight 623, volume 624, shelf upper tier 625, shelf middle tier 626, shelf lower tier 627, ...). Has an influence on.

作業員属性変数７００の影響度９０２は、作業員属性変数７００の項目（経験日数７０１、付近の作業員の経験日数７０２、…）ごとに影響度を有する。作業員ＩＤ変数６０３の影響度９０３は、作業員ＩＤ変数６０３の項目（ＩＤ１、ＩＤ２、…）ごとに影響度を有する。影響度の値は、大きいほどその作業に影響を与えていることを示す。 The influence degree 902 of the worker attribute variable 700 has an influence degree for each item of the worker attribute variable 700 (experience days 701, experience days 702 of nearby workers, ...). The degree of influence 903 of the worker ID variable 603 has a degree of influence for each item (ID1, ID2, ...) Of the worker ID variable 603. The larger the impact value, the more influential the work is.

たとえば、作業ＩＤ３０１が「１」のエントリの場合、作業員ＩＤ変数６０３の「ＩＤ１」という変数の影響度は、「＋５５」である。これは、すべての予測値９０４の平均値である「３００」に対して「＋５５」の影響を与え、同様にその他の変数の影響度が加算されて最終的に予測値９０４である「４３２」が決定された、ということを意味する。これによって、学習データの各作業それぞれで、各種変数の値が予測値９０４にどのような影響を与えているかが定量的に分かるようになる。 For example, when the work ID 301 is an entry of "1", the influence degree of the variable "ID1" of the worker ID variable 603 is "+55". This has an effect of "+55" on "300" which is the average value of all the predicted values 904, and similarly, the influence degree of other variables is added to finally "432" which is the predicted value 904. Means that has been decided. As a result, it becomes possible to quantitatively understand how the values of various variables affect the predicted value 904 in each work of the learning data.

このような影響度を算出する方法としては、たとえば公知の技術として、ＳＨＡＰ（ＳＨａｐｌｅｙＡｄｄｉｔｉｖｅｅｘＰｌａｎａｔｉｏｎｓ）や、ＬＩＭＥ（ＬｏｃａｌＩｎｔｅｒｐｒｅｔａｂｌｅＭｏｄｅｌ−ａｇｎｏｓｔｉｃＥｘｐｌａｉｎａｔｉｏｎｓ）のような手法がある。 As a method for calculating such an influence degree, for example, as a known technique, there are a method such as SHAP (Shapley Adaptive exPlanations) and LIMITE (Local Interpretable Model-agnostic Expressions).

これにより、データ分析装置１０２は、任意の予測モデルが出力した予測値９０４について、該当する作業が持つ各変数が予測値９０４にどれくらい影響を与えたかを、定量的な値で得ることができる。ここで、図９の説明を終了する。 As a result, the data analyzer 102 can obtain a quantitative value of how much each variable of the corresponding work affects the predicted value 904 with respect to the predicted value 904 output by an arbitrary prediction model. Here, the description of FIG. 9 ends.

つぎに、データ分析装置１０２は、抽象化変数使用学習データ２２７を生成する（ステップＳ５１０）。具体的には、たとえば、データ分析装置１０２は、第１影響度テーブル２２６を用いて、作業員ＩＤ変数６０３を作業員属性変数７００に加工する。図１０を用いて具体的に説明する。 Next, the data analyzer 102 generates the abstract variable use learning data 227 (step S510). Specifically, for example, the data analyzer 102 processes the worker ID variable 603 into the worker attribute variable 700 by using the first influence degree table 226. This will be specifically described with reference to FIG.

［影響度分布情報］
図１０は、全作業における影響度分布情報を示す説明図である。影響度分布情報１０００は、作業員ＩＤ変数名１００１と影響度分布１００２とを対応付けた情報である。データ分析装置１０２は、作業員ＩＤ３０３ごとに、当該作業員の作業員ＩＤ変数６０３とその影響度を用いて、全作業（作業ＩＤ３０１：１，２，…）における影響度の分布を求める。 [Impact distribution information]
FIG. 10 is an explanatory diagram showing influence degree distribution information in all operations. The influence degree distribution information 1000 is information in which the worker ID variable name 1001 and the influence degree distribution 1002 are associated with each other. The data analysis device 102 obtains the distribution of the degree of influence in all the works (work IDs 301: 1, 2, ...) For each worker ID 303 by using the worker ID variable 603 of the worker and the degree of influence thereof.

すなわち、作業員ＩＤ変数名１００１のある値（作業員ＩＤ３０３）の影響度分布１００２は、図９に示した第１影響度テーブル２２６の作業員ＩＤ変数６０３の影響度９０３における当該作業員ＩＤ変数６０３の影響度９０３の列である。たとえば、当該作業員ＩＤ３０３が「ＩＤ１」であれば、影響度分布１００２は、作業ＩＤ３０１の昇順に「＋５５，０，０，…」の影響度分布１００２となり、当該作業員ＩＤ３０３が「ＩＤ２」であれば、影響度分布１００２は、作業ＩＤ３０１の昇順に「０，−６１，０，…」の影響度分布１００２となる。 That is, the influence degree distribution 1002 of a certain value (worker ID 303) of the worker ID variable name 1001 is the worker ID variable in the influence degree 903 of the worker ID variable 603 of the first influence degree table 226 shown in FIG. It is a column of influence degree 903 of 603. For example, if the worker ID 303 is "ID1", the influence distribution 1002 becomes the influence distribution 1002 of "+55, 0, 0, ..." In the ascending order of the work ID 301, and the worker ID 303 is "ID2". If there is, the influence degree distribution 1002 becomes the influence degree distribution 1002 of "0, -61, 0, ..." In the ascending order of the work ID 301.

図１０では、影響度分布１００２において図８の作業員ＩＤ変数６０３が「０」の作業（不実施作業）における当該作業員ＩＤ変数６０３の影響度９０３が黒丸●、作業員ＩＤ変数６０３が「１」の作業（実施作業）における当該作業員ＩＤ変数６０３の影響度９０３が白丸○で表記される。 In FIG. 10, in the influence degree distribution 1002, the influence degree 903 of the worker ID variable 603 in the work (non-execution work) in which the worker ID variable 603 of FIG. 8 is “0” is a black circle ●, and the worker ID variable 603 is “ The degree of influence 903 of the worker ID variable 603 in the work (implementation work) of 1 ”is indicated by a white circle.

たとえば、図８の中間学習データ２２５の作業員ＩＤ変数６０３において「ＩＤ１」の列は、作業ＩＤ３０１の昇順に「１，０，０，…」である。したがって、作業員ＩＤ３０３が「１」の影響度分布１００２において、作業ＩＤ３０１が「１」の影響度「＋５８」に対応する作業員ＩＤ変数６０３の値は「１」であるため、作業ＩＤ３０１が「１」の影響度「＋５８」は白丸○として表記される。 For example, in the worker ID variable 603 of the intermediate learning data 225 in FIG. 8, the column of "ID1" is "1,0,0, ..." In the ascending order of the work ID 301. Therefore, in the influence degree distribution 1002 in which the worker ID 303 is "1", the value of the worker ID variable 603 corresponding to the influence degree "+58" in which the work ID 301 is "1" is "1", so that the work ID 301 is "1". The degree of influence "+58" of "1" is indicated as a white circle ○.

また、作業ＩＤ３０１が「２」の影響度「０」に対応する作業員ＩＤ変数６０３の値は「０」であるため、作業ＩＤ３０１が「２」の影響度「０」は黒丸●として表記される。同様に、作業ＩＤ３０１が「３」の影響度「０」に対応する作業員ＩＤ変数６０３の値は「０」であるため、作業ＩＤ３０１が「３」の影響度「０」は黒丸●として表記される。 Further, since the value of the worker ID variable 603 corresponding to the influence degree “0” of the work ID 301 is “2” is “0”, the influence degree “0” of the work ID 301 “2” is indicated as a black circle ●. To. Similarly, since the value of the worker ID variable 603 corresponding to the influence degree “0” of the work ID 301 is “3” is “0”, the influence degree “0” of the work ID 301 “3” is indicated by a black circle ●. Will be done.

このように、作業員ＩＤ変数名１００１ごとに影響度分布１００２を求めることにより、作業員ＩＤ変数６０３の影響度９０３がプラスに大きい作業員ＩＤ３０３と、マイナスに大きい作業員ＩＤ３０３、値が「０」の場合とほとんど差異がない作業員ＩＤ３０３と、を区別することが可能となる。 In this way, by obtaining the influence degree distribution 1002 for each worker ID variable name 1001, the worker ID 303 having a positively large influence degree 903 of the worker ID variable 603, the worker ID 303 having a negatively large influence degree, and the value being "0". It is possible to distinguish between the worker ID 303, which has almost no difference from the case of "."

たとえば、作業員ＩＤ３０３が「１」の作業員は、作業員ＩＤ変数６０３が「１」であるエントリの作業（たとえば、作業ＩＤ３０１が「１」）に参加すると、プラスに大きい影響を与えている、すなわち、作業時間が増加することが分かる。つまり、この作業員ＩＤ変数６０３以外の変数を使用しないと予測精度の低下を招き、作業員特有の影響を考慮、すなわち、作業員ＩＤ変数６０３を使用すると、機械学習によってより予測精度が高い予測モデルが学習されたものとみなせる。したがって、この作業員ＩＤ３０３が「１」の作業員が作業することは作業時間を長くする影響が発生していると解釈できる。 For example, a worker whose worker ID 303 is "1" has a great positive effect when he / she participates in the work of an entry in which the worker ID variable 603 is "1" (for example, the work ID 301 is "1"). That is, it can be seen that the working time increases. That is, if variables other than the worker ID variable 603 are not used, the prediction accuracy will be lowered, and the influence peculiar to the worker will be taken into consideration. That is, if the worker ID variable 603 is used, the prediction accuracy will be higher by machine learning. It can be considered that the model has been trained. Therefore, it can be interpreted that the work of the worker whose worker ID 303 is "1" has an effect of prolonging the work time.

同様に作業員ＩＤ３０３が「２」の作業員は、マイナスに大きな影響を与える作業員で、作業員ＩＤ３０３が「４５」の作業員は、特にこの作業員が作業することで特別な影響を与えていない作業員である、ということが分かる。具体的な判別方法としては、データ分析装置１０２は、作業員ＩＤ変数６０３が「０」（黒丸●）の場合の作業群が持つ影響度の平均値（以下、第１平均値）および分散と、作業員ＩＤ変数６０３が「１」（白丸○）の作業群が持つ影響度の平均値（以下、第２平均値）を求める。 Similarly, a worker whose worker ID 303 is "2" is a worker who has a large negative influence, and a worker whose worker ID 303 is "45" has a special influence especially when this worker works. It turns out that it is a worker who has not. As a specific discrimination method, the data analyzer 102 has an average value (hereinafter, first average value) and variance of the degree of influence of the work group when the worker ID variable 603 is “0” (black circle ●). , The average value (hereinafter, the second average value) of the degree of influence of the work group in which the worker ID variable 603 is “1” (white circle ○) is obtained.

データ分析装置１０２は、第１平均値と第２平均値との差分が分散の定数倍以上であるような場合に、プラスまたはマイナスに大きな影響を与えていると判断を行う。以上のような処理を行うことで、目的変数６０１（作業時間）の予測値９０４にプラスに影響を与える作業員、マイナスに影響を与える作業員、影響を与えない作業員の３種類に分類することが可能となる。 When the difference between the first average value and the second average value is a constant multiple of the variance or more, the data analyzer 102 determines that it has a large positive or negative effect. By performing the above processing, the worker is classified into three types: a worker who has a positive influence on the predicted value 904 of the objective variable 601 (working time), a worker who has a negative influence, and a worker who has no influence. It becomes possible.

そして、データ分析装置１０２は、本処理結果に基づいて、抽象化変数使用学習データ２２７を生成する。図１１に抽象化変数使用学習データ２２７の一例を示す。 Then, the data analysis device 102 generates the abstract variable use learning data 227 based on the present processing result. FIG. 11 shows an example of learning data 227 using abstract variables.

［抽象化変数使用学習データ２２７］
図１１は、抽象化変数使用学習データ２２７の一例を示す説明図である。抽象化変数使用学習データ２２７は、作業ＩＤ３０１と、目的変数６０１と、説明変数６０２と、作業員属性変数７００と、作業員ＩＤ抽象化変数１１０１と、を有する。すなわち、抽象化変数使用学習データ２２７は、図７に示した作業員属性使用学習データ２２４に、作業員ＩＤ抽象化変数１１０１が追加された学習データである。 [Learning data using abstract variables 227]
FIG. 11 is an explanatory diagram showing an example of learning data 227 using abstract variables. The abstraction variable use learning data 227 has a work ID 301, an objective variable 601 and an explanatory variable 602, a worker attribute variable 700, and a worker ID abstraction variable 1101. That is, the abstraction variable use learning data 227 is the learning data in which the worker ID abstraction variable 1101 is added to the worker attribute use learning data 224 shown in FIG. 7.

作業員ＩＤ抽象化変数１１０１とは、作業員ＩＤ３０３を抽象化した変数である。具体的には、たとえば、作業員ＩＤ抽象化変数１１０１は、「作業が遅い」および「作業が速い」という変数を有する。データ分析装置１０２は、作業員ＩＤ抽象化変数１１０１の値を、上述した影響を与えているか否かの判断結果に基づいて決定する。 The worker ID abstraction variable 1101 is a variable that abstracts the worker ID 303. Specifically, for example, the worker ID abstraction variable 1101 has variables "work is slow" and "work is fast". The data analysis device 102 determines the value of the worker ID abstraction variable 1101 based on the determination result of whether or not the above-mentioned influence is exerted.

たとえば、「作業が遅い」の値が「１」であれば、その作業ＩＤ３０１の作業は、作業が遅い作業員によりおこなわれたことを意味し、「作業が速い」の値が「１」であれば、その作業ＩＤ３０１の作業は、作業が速い作業員によりおこなわれたことを意味する。「作業が遅い」および「作業が速い」の値がともに「０」であれば、その作業ＩＤ３０１の作業は、平均的な速さの作業員により行われたことを意味する。このように、「作業が遅い」および「作業が速い」の値の組み合わせにより、作業員をその作業能力で抽象化することができる。ここで、図１１の説明を終了する。 For example, if the value of "work is slow" is "1", it means that the work of the work ID 301 was performed by a worker whose work is slow, and the value of "work is fast" is "1". If there is, it means that the work of the work ID 301 was performed by a worker who has a fast work. If the values of "slow work" and "fast work" are both "0", it means that the work of the work ID 301 was performed by an average speed worker. In this way, the combination of the values "slow work" and "fast work" allows the worker to be abstracted by his or her work ability. Here, the description of FIG. 11 ends.

図５に戻り、データ分析装置１０２は、抽象化変数使用学習データ２２７を用いて抽象化変数使用予測モデル２３４を生成する（ステップＳ５１１）。この処理は、ステップＳ５０４の作業員ＩＤ使用予測モデル２３１と同様、抽象化変数使用予測モデル２３４を、目的変数６０１である作業時間を予測する回帰モデルとして機械学習によって生成する処理であり、機械学習アルゴリズムの手法は、ステップＳ５０４と同様の手法となる。 Returning to FIG. 5, the data analyzer 102 generates the abstract variable usage prediction model 234 using the abstract variable usage learning data 227 (step S511). Similar to the worker ID usage prediction model 231 in step S504, this process is a process of generating the abstract variable usage prediction model 234 as a regression model for predicting the working time which is the objective variable 601 by machine learning. The method of the algorithm is the same as that of step S504.

このように、作業員ＩＤ変数６０３の代わりに、作業員ＩＤ抽象化変数１１０１を新たな説明変数として抽象化変数使用予測モデル２３４に適用することにより、より高精度な予測を実現することができる。特に、予測対象の作業員の実績が業務実績データ２２１に存在しないような場合において、当作業員が新人として新たに加入した作業員である場合に、作業員ＩＤ抽象化変数１１０１である「作業が遅い」の値を「１」とすることで、不慣れな新人が作業を行うことによる影響を考慮した予測を行うことができる。 In this way, by applying the worker ID abstraction variable 1101 as a new explanatory variable to the abstraction variable usage prediction model 234 instead of the worker ID variable 603, more accurate prediction can be realized. .. In particular, when the actual result of the worker to be predicted does not exist in the work actual data 221 and the worker is a newly joined worker as a new employee, the worker ID abstraction variable 1101 "work". By setting the value of "is slow" to "1", it is possible to make a prediction considering the influence of the work performed by an unfamiliar newcomer.

つぎに、データ分析装置１０２は、抽象化変数使用予測モデル２３４に対する作業別変数の影響度（第２影響度）を算出する（ステップＳ５１２）。 Next, the data analysis device 102 calculates the degree of influence (second degree of influence) of the work-specific variables on the abstraction variable usage prediction model 234 (step S512).

［第２影響度テーブル２２８］
図１２は、第２影響度テーブル２２８の一例を示す説明図である。第２影響度テーブル２２８は、ステップＳ５１２で算出された第２影響度を記憶するテーブルである。第２影響度テーブル２２８は、具体的には、たとえば、第２影響度テーブル２２８は、第１影響度テーブル２２６と同様、作業ＩＤ３０１と、目的変数６０１と、説明変数６０２の影響度９０１と、作業員属性変数７００の影響度９０２と、予測値９０４と、を有し、また、あらたに、作業員ＩＤ抽象化変数１１０１の影響度１２０１を有する。各影響度の算出は、第１影響度テーブル２２６と同様であるため、説明を省略する。ここで、図１２の説明を終了する。 [Second influence table 228]
FIG. 12 is an explanatory diagram showing an example of the second influence degree table 228. The second influence degree table 228 is a table for storing the second influence degree calculated in step S512. Specifically, the second influence degree table 228, for example, the second influence degree table 228 has the work ID 301, the objective variable 601 and the influence degree 901 of the explanatory variable 602 as in the case of the first influence degree table 226. It has an influence degree 902 of the worker attribute variable 700 and a predicted value 904, and newly has an influence degree 1201 of the worker ID abstraction variable 1101. Since the calculation of each influence degree is the same as that of the first influence degree table 226, the description thereof will be omitted. This is the end of the description of FIG.

図５に戻り、データ分析装置１０２は、第２影響度テーブル２２８を用いて、作業員特徴変数を生成する（ステップＳ５１３）。作業員特徴変数とは、作業員の各々が持つ平均的ではない特異な特徴を表す変数である。同様の傾向を示す作業員の業務実績データ２２１が複数ある場合には、作業員特徴変数は特徴的だが共通的に存在する個人性を表す変数である、と言える。ここで、図１３を用いて具体的な作業員特徴を発見する方法について説明する。 Returning to FIG. 5, the data analyzer 102 generates the worker characteristic variable using the second influence degree table 228 (step S513). Worker characteristic variables are variables that represent non-average and peculiar characteristics of each worker. When there are a plurality of worker work performance data 221 showing the same tendency, it can be said that the worker characteristic variable is a variable representing individuality that is characteristic but commonly exists. Here, a method of discovering specific worker characteristics will be described with reference to FIG.

［説明変数６０２の影響度分布］
図１３は、説明変数６０２の影響度分布例を示すグラフである。図１３の影響度分布１３００では、説明変数６０２、作業員属性変数７００、および作業員ＩＤ抽象化変数１１０１のうち、説明変数６０２の棚上段６２５の値を例に挙げて説明するが、他の変数についても同様に実行される。 [Influence distribution of explanatory variable 602]
FIG. 13 is a graph showing an example of the influence distribution of the explanatory variable 602. In the influence distribution 1300 of FIG. 13, among the explanatory variable 602, the worker attribute variable 700, and the worker ID abstraction variable 1101, the value of the upper shelf 625 of the explanatory variable 602 will be described as an example, but other The same is true for variables.

図１３において、影響度分布１３００の横軸は、変数の値（本例では、棚上段６２５の値）であり、縦軸は、影響度（本例では、図１２の第２影響度テーブル２２８の説明変数６０２の影響度９０１の棚上段６２５の値）である。 In FIG. 13, the horizontal axis of the influence degree distribution 1300 is the value of the variable (in this example, the value of the upper shelf 625), and the vertical axis is the influence degree (in this example, the second influence degree table 228 of FIG. 12). The value of the upper shelf 625 of the influence degree 901 of the explanatory variable 602).

データ分析装置１０２は、「棚上段６２５」を選択する。「棚上段６２５」は、該当する作業ＩＤ３０１の作業において、棚の上段に何個の商品が配置されているかを示す変数である。業務に関する事前知識として、該当する変数の値が増加するまたは減少することが作業時間にどのように影響を与えるかは、あらかじめ定義される。たとえば、棚上段６２５の商品の個数３０６が多いほど、作業が遅くなる傾向があると定義される。このような傾向の指定方法は、たとえば、該当する変数は目的変数６０１にプラスの相関があるといった条件を指定することで設定可能である。 The data analyzer 102 selects "upper shelf 625". The “shelf upper tier 625” is a variable indicating how many products are arranged on the shelf upper tier in the work of the corresponding work ID 301. As prior knowledge about the business, how the increase or decrease of the value of the corresponding variable affects the work time is defined in advance. For example, it is defined that the larger the number of products 306 on the upper shelf 625, the slower the work tends to be. The method of specifying such a tendency can be set by, for example, specifying a condition that the corresponding variable has a positive correlation with the objective variable 601.

データ分析装置１０２は、このように指定された条件を前提とし、図１３の影響度分布１００２を分析する。たとえば、影響度分布１３００によれば、棚上段６２５の値が「５」の場合の影響度は、作業によってバラつきを持つことが分かる。このバラつきの中には、棚上段６２５の値が大きいほど影響度もプラスに大きくなるという条件に反して、あまり影響度が大きくない作業が存在していることが分かる（図１３中における点線の丸）。 The data analyzer 102 analyzes the influence distribution 1002 of FIG. 13 on the premise of the conditions specified in this way. For example, according to the influence degree distribution 1300, it can be seen that the influence degree when the value of the upper shelf 625 is “5” varies depending on the work. It can be seen that among these variations, there is work that does not have a large influence, contrary to the condition that the larger the value of the upper shelf 625, the larger the influence (the dotted line in FIG. 13). Circle).

このことは、つまり、作業対象の商品が棚上段６２５にあっても作業時間に影響がなかった業務実績の作業であると言え、たとえば、実施した作業員の背が高い場合などの推察が可能となる。このように、データ分析装置１０２は、与えられた事前知識に反した影響度の傾向を持つ作業群を、類似の特徴を持つ（この場合、棚上段６２５での作業が得意な）グループとみなし、作業員特徴変数を、作業員個人の特徴を表す新たな変数として生成する。ここで、図１３の説明を終了する。 This means that even if the product to be worked on is on the upper shelf 625, it can be said that the work has a work record that does not affect the work time. For example, it is possible to infer that the worker who carried out the work is tall. It becomes. In this way, the data analyzer 102 regards the work group having a tendency of influence contrary to the given prior knowledge as a group having similar characteristics (in this case, good at work on the upper shelf 625). , Worker feature variables are generated as new variables that represent the characteristics of individual workers. Here, the description of FIG. 13 ends.

図５に戻り、データ分析装置１０２は、作業員特徴変数を用いて、作業員特徴使用学習データ２２９を生成する（ステップＳ５１４）。作業員特徴使用学習データ２２９の一例を図１４に示す。 Returning to FIG. 5, the data analyzer 102 generates the worker feature use learning data 229 using the worker feature variable (step S514). An example of the worker feature use learning data 229 is shown in FIG.

［作業員特徴使用学習データ２２９］
図１４は、作業員特徴使用学習データ２２９の一例を示す説明図である。抽象化変数使用学習データ２２７に、作業員特徴変数１４０１が説明変数として追加された学習データである。作業員特徴変数１４０１の一例である「棚上段が得意」の値が「１」であれば、その作業は、「棚上段が得意」な作業員により実施されたことを示し、「０」であれば、「棚上段が得意」な作業員により実施されなかったことを示す。 [Worker feature use learning data 229]
FIG. 14 is an explanatory diagram showing an example of worker feature use learning data 229. This is learning data in which a worker characteristic variable 1401 is added as an explanatory variable to the learning data 227 using abstract variables. If the value of "good at the upper shelf", which is an example of the worker characteristic variable 1401, is "1", it means that the work was performed by a worker who is "good at the upper shelf", and is set to "0". If there is, it indicates that it was not carried out by a worker who is "good at the upper shelf".

また、作業員特徴変数１４０１の型は、連続値でもよい。連続値の場合、データ分析装置１０２は、一定または特定の幅毎に影響度の分布を算出し、同様に作業員特徴を発見する。たとえば、「年齢」という説明変数６０２が選択された場合、データ分析装置１０２は、０〜２０歳、２１歳〜４０歳、４１歳〜５０歳、５１歳〜６０歳、６１歳以上、などの幅で影響度の値をまとめ、影響度分布１００２を生成する。どの変数に対して、どのような分析幅で上記分析を行うかはあらかじめユーザなどによって定義可能とする。ここで、図１４の説明を終了する。 Further, the type of the worker characteristic variable 1401 may be a continuous value. In the case of continuous values, the data analyzer 102 calculates the distribution of the degree of influence for each constant or specific width, and similarly discovers the worker characteristics. For example, when the explanatory variable 602 "age" is selected, the data analyzer 102 may be 0-20 years old, 21-40 years old, 41-50 years old, 51-60 years old, 61 years old or older, and so on. The value of the degree of influence is summarized by the width, and the degree of influence distribution 1002 is generated. It is possible for the user or the like to define in advance which variable and what kind of analysis width is used for the above analysis. Here, the description of FIG. 14 ends.

図５に戻り、データ分析装置１０２は、作業員特徴使用学習データ２２９を用いて、作業員特徴使用予測モデル２３５を生成する（ステップＳ５１５）。このように、作業員特徴使用予測モデル２３５を生成することで、新たな説明変数として作業員特徴変数１４０１を追加することができ、より高精度な予測を行うことが可能となる。特に、業務実績データ２２１には実績が存在しない作業員ではあるが、あらかじめその個人的特徴（たとえば、背が高く高い棚での作業を苦にしない）を作業員特徴変数１４０１の値として設定することにより、より高精度な予測が可能となる。 Returning to FIG. 5, the data analysis device 102 generates the worker feature use prediction model 235 using the worker feature use learning data 229 (step S515). By generating the worker feature usage prediction model 235 in this way, the worker feature variable 1401 can be added as a new explanatory variable, and more accurate prediction can be performed. In particular, although the worker has no actual record in the work record data 221, his / her personal characteristics (for example, work on a tall and tall shelf is not bothered) are set as the value of the worker characteristic variable 1401 in advance. This enables more accurate prediction.

以上で、作業時間を予測する各種予測モデルを生成する処理を完了する（ステップＳ５１６）。なお、データ分析装置１０２は、作業員特徴を分析する際に、業務実績データ２２１における作業時間の実績値と予測値９０４との差異が大きい作業に関しては、分析対象の業務実績データ２２１から除外するようにしてもよい。 This completes the process of generating various prediction models for predicting the working time (step S516). When analyzing the worker characteristics, the data analysis device 102 excludes the work in which the difference between the actual value of the work time and the predicted value 904 in the work record data 221 is large from the work record data 221 to be analyzed. You may do so.

たとえば、全作業における差異の平均値を基準にして、分散値の定数倍以上の作業が除外対象となる。これにより、事前知識とは異なる傾向を示す作業の発生要因が、個人的特徴によるものではなく、業務実勢データ取得時の異常値であるような場合における分析への悪影響を排除することができる。 For example, work that is a constant multiple of the variance value or more is excluded based on the average value of the differences in all the work. As a result, it is possible to eliminate an adverse effect on the analysis when the cause of the work showing a tendency different from the prior knowledge is not due to personal characteristics but an abnormal value at the time of acquiring the actual business data.

また、一つの作業に対して、事前知識とは異なる傾向示す変数が２つ以上発見された場合は、それらを組み合わせた作業員特徴変数１４０１として生成するようにしてもよい。たとえば、棚上段６２５での作業が得意で、かつ、１個当たりの重量６２３がしきい値以上の商品のピッキングが速い（事前知識として重量６２３の変数が大きいと作業が遅くなる条件のもと、それに反するような影響度の値を示した作業）ような作業員の作業が所定数以上あった場合、データ分析装置１０２は、新たな作業員特徴変数１４０１として「棚上段が得意＆重い荷物が得意」を生成する。これにより、データ分析装置１０２は、複数の条件が組み合わさった作業員の個人的特徴をより詳細に表す作業員特徴変数１４０１を生成することができる。 Further, when two or more variables showing a tendency different from the prior knowledge are found for one work, they may be combined and generated as a worker characteristic variable 1401. For example, under the condition that the work on the upper shelf 625 is good and the picking of products whose weight 623 per piece is equal to or more than the threshold value is fast (as a prior knowledge, if the variable of weight 623 is large, the work becomes slow). When there are more than a predetermined number of worker's work (work showing the value of the degree of influence contrary to it), the data analyzer 102 sets the new worker characteristic variable 1401 as "good at the upper shelf & heavy luggage". Is good at. As a result, the data analyzer 102 can generate the worker characteristic variable 1401 that represents the personal characteristics of the worker in which a plurality of conditions are combined in more detail.

＜業務改善施策生成処理１２２＞
図１５は、モデル生成処理１２１によって生成された作業員特徴使用予測モデル２３５を用いて業務改善施策を生成する業務改善施策生成処理手順例を示すフローチャートである。データ分析装置１０２は、まず業務改善施策生成処理１２２を起動し（ステップＳ１５０１）、改善対象となる業務実績データ２２１を記憶デバイス２０２から取得する（ステップＳ１５０２）。改善対象となる業務実績データ２２１は、業務を改善したい対象となる期間における業務実績データ２２１であり、たとえば業務改善施策生成処理１２２を実行する次の日以降の業務を改善したい場合は、直近過去１週間の業務実績データ２２１などが対象となる。 <Business improvement measure generation process 122>
FIG. 15 is a flowchart showing an example of a business improvement measure generation processing procedure for generating a business improvement measure using the worker feature usage prediction model 235 generated by the model generation process 121. The data analysis device 102 first activates the business improvement measure generation process 122 (step S1501), and acquires the business performance data 221 to be improved from the storage device 202 (step S1502). The business performance data 221 to be improved is the business performance data 221 in the target period for which the business is to be improved. For example, if the business is to be improved on or after the next day when the business improvement measure generation process 122 is executed, the latest past One week's business performance data 221 and the like are targeted.

直近過去１週間の作業環境（たとえば、商品の配置、取り扱う商品、作業員の構成）は以後の作業環境と等しいと仮定する。直近過去１週間の期間の業務を改善する施策は、今後の業務も改善が見込める施策であるという考えに基づく。 It is assumed that the working environment for the last week (for example, the arrangement of products, the products handled, and the composition of workers) is equal to the working environment after that. Measures to improve operations during the most recent past week are based on the idea that future operations are also expected to improve.

たとえば、データ分析装置１０２は、業務改善施策生成処理１２２を１週間おきに繰り返すことで、継続的に業務を改善し続けることができる。改善対象となる作業環境は、たとえば、商品の配置であるが、商品の配置、取り扱う商品、作業員の構成のうち少なくとも１つを改善対象としてもよい。 For example, the data analysis device 102 can continuously improve the business by repeating the business improvement measure generation process 122 every other week. The work environment to be improved is, for example, the arrangement of products, but at least one of the arrangement of products, the products to be handled, and the composition of workers may be the object of improvement.

データ分析装置１０２は、改善対象となる業務実績データ２２１に対して、商品の配置場所の変更処理を実行する（ステップＳ１５０３）。具体的には、たとえば、データ分析装置１０２は、改善対象となる業務実績データ２２１の商品の場所コード３０５を変更する。場所コード３０５の入替え方法については、特に限定することはなく、データ分析装置１０２がランダムに実行してもよいし、遺伝的アルゴリズムなどの組合せ最適化問題に適した各種アルゴリズムで実行してもよい。 The data analysis device 102 executes a process of changing the product placement location for the business performance data 221 to be improved (step S1503). Specifically, for example, the data analysis device 102 changes the location code 305 of the product of the business performance data 221 to be improved. The method of replacing the location code 305 is not particularly limited, and the data analyzer 102 may execute it randomly, or it may be executed by various algorithms suitable for the combinatorial optimization problem such as a genetic algorithm. ..

データ分析装置１０２は、当該変更後の改善対象となる業務実績データ２２１に応じて、作業員ＩＤ使用学習データ２２３の説明変数６０２を再計算する。たとえば、場所コード３０５が変更されたため、データ分析装置１０２は、棚上段６２５、棚中段６２６、棚下段６２７の値も変更する。また、データ分析装置１０２は、移動距離６２１を変更後の場所コード３０５に基づいて再計算する。なお、再計算対象は、場所コード３０５の変更によって変更が必要な説明変数６０２であれば、棚上段６２５、棚中段６２６、棚下段６２７および移動距離６２１に限られない。 The data analysis device 102 recalculates the explanatory variable 602 of the worker ID use learning data 223 according to the business performance data 221 to be improved after the change. For example, since the location code 305 has been changed, the data analyzer 102 also changes the values of the upper shelf 625, the middle shelf 626, and the lower shelf 627. Further, the data analysis device 102 recalculates the travel distance 621 based on the changed location code 305. The recalculation target is not limited to the shelf upper tier 625, the shelf middle tier 626, the shelf lower tier 627, and the moving distance 621 as long as the explanatory variable 602 needs to be changed by changing the location code 305.

また、場所コード３０５が変更されたため、付近の作業員も異なってくる。したがって、データ分析装置１０２は、作業員属性使用学習データ２２４の作業員属性変数７００である付近の作業員の経験日数７０２を、変更度の場所コード３０５に応じて再計算する。なお、再計算対象は、場所コード３０５の変更によって変更が必要な作業員属性変数７００であれば、付近の作業員の経験日数７０２に限られない。 Also, because the location code 305 has been changed, the nearby workers will also be different. Therefore, the data analysis device 102 recalculates the number of days of experience 702 of the nearby worker, which is the worker attribute variable 700 of the worker attribute use learning data 224, according to the location code 305 of the degree of change. The recalculation target is not limited to the number of days of experience 702 of nearby workers as long as the worker attribute variable 700 needs to be changed by changing the location code 305.

データ分析装置１０２は、適用する予測モデルを３種類の予測モデルから選択し、選択した予測モデルに、ステップＳ１５０３で得られた再計算後の作業員ＩＤ使用学習データ２２３を入力することにより、作業時間の予測値９０４を得る（ステップＳ１５０４）。３種類の予測モデルの使い分けについては図１６で後述する。 The data analysis device 102 selects the prediction model to be applied from three types of prediction models, and inputs the recalculated worker ID usage learning data 223 obtained in step S1503 into the selected prediction model to perform the work. A time prediction value of 904 is obtained (step S1504). The proper use of the three types of prediction models will be described later in FIG.

データ分析装置１０２は、それぞれの作業で得られた作業時間の予測値９０４に対して、総和をとり、１週間での総合計作業時間を計算する（ステップＳ１５０５）。データ分析装置１０２は、得られた総合計作業時間が予め定められた終了条件に一致するか否か判定する（ステップＳ１５０６）。 The data analysis device 102 sums up the predicted work time 904 obtained in each work and calculates the total work time in one week (step S1505). The data analyzer 102 determines whether or not the obtained total working time matches a predetermined end condition (step S1506).

ここで、予め定められた終了条件とは、業務実績データ２２１における作業時間と比較して改善がなされたか否かや、ステップＳ１５０３の実行回数、商品変更配置の変更による改善度合い、などの条件が適用される。終了条件に満たない場合（ステップＳ１５０６：Ｎｏ）、再度ステップＳ１５０３の処理に戻る。 Here, the predetermined end conditions include conditions such as whether or not improvement has been made compared to the work time in the business performance data 221, the number of executions of step S1503, the degree of improvement due to the change in the product change arrangement, and the like. Applies. If the end condition is not satisfied (step S1506: No), the process returns to step S1503 again.

終了条件を満たした場合（ステップＳ１５０６：Ｙｅｓ）、データ分析装置１０２は、改善施策を決定する（ステップＳ１５０７）。改善施策とは、たとえば、改善対象となる業務実績データ２２１が得られる作業環境である。場所コードの変更により改善対象となる業務実績データ２２１が得られた場合には、改善施策となる作業環境は、場所コードの変更後の商品の配置状態である。 When the end condition is satisfied (step S1506: Yes), the data analyzer 102 determines the improvement measure (step S1507). The improvement measure is, for example, a work environment in which business performance data 221 to be improved can be obtained. When the business performance data 221 to be improved is obtained by changing the place code, the work environment to be the improvement measure is the arrangement state of the products after the change of the place code.

取り扱う商品の商品コードの変更により改善対象となる業務実績データ２２１が得られた場合には、改善施策となる作業環境は、商品コードの変更後の商品の配置状態である。作業員の構成変更により改善対象となる業務実績データ２２１が得られた場合には、改善施策となる作業環境は、作業員の構成変更後の作業員配置である。 When the business performance data 221 to be improved is obtained by changing the product code of the product to be handled, the work environment to be the improvement measure is the arrangement state of the product after the change of the product code. When the work performance data 221 to be improved is obtained by changing the composition of the workers, the work environment to be the improvement measure is the arrangement of the workers after the composition of the workers is changed.

そして、データ分析装置１０２は、決定した改善施策を出力する（ステップＳ１５０８）。具体的には、たとえば、データ分析装置１０２は、決定した改善施策をディスプレイに表示したり、業務システム１０１などの他のコンピュータに送信したりする。そして、本処理を終了する（ステップＳ１５０８）。 Then, the data analysis device 102 outputs the determined improvement measure (step S1508). Specifically, for example, the data analysis device 102 displays the determined improvement measures on a display or transmits the determined improvement measures to another computer such as the business system 101. Then, this process ends (step S1508).

なお、本説明では直近過去１週間の業務を改善対象としたが、この期間に限定するものではない。また、改善対象の業務をこれから行う予定の作業内容とし、データ分析時点での作業環境を改善するようなユースケースにも適用可能である。この場合、実施予定の作業内容を行うにあたって、最も作業時間が短くなるような作業環境の条件（ピッキング対象の品物の配置、など）を図１５で求めることで実現できる。これによって、作業実施予定の内容に即して、作業環境をその都度改善していくことが可能となり、より高い改善効果を期待できる。 In this explanation, the work of the last week is targeted for improvement, but it is not limited to this period. In addition, it can be applied to use cases where the work to be improved is the work content to be performed in the future and the work environment at the time of data analysis is improved. In this case, it can be realized by obtaining the conditions of the work environment (arrangement of items to be picked, etc.) in which the work time is the shortest when performing the work contents to be carried out in FIG. As a result, it becomes possible to improve the work environment each time according to the contents of the work implementation schedule, and a higher improvement effect can be expected.

＜予測モデルの使い分け処理＞
ここで、上述したステップＳ１５０４における予測モデルの使い分け処理について説明する。 <Processing for proper use of prediction model>
Here, the process of properly using the prediction model in step S1504 described above will be described.

［予測モデルの使い分け処理］
図１６は、ステップＳ１５０４における各種予測モデルの使い分け処理手順例を示すフローチャートである。データ分析装置１０２は、あらかじめ保有する過去の業務実績データ２２１をもとに、当該業務実績データ２２１に存在する全作業員の作業員ＩＤ３０３に対応した作業員属性および作業員特徴について管理する作業員属性・特徴データ２３０の作成または更新を実行する（ステップＳ１６０１）。ステップＳ１６０１は、たとえば、１週間毎などの決まったタイミングで定期的に実行される。ここで、作業員属性・特徴データ２３０について図１７を用いて具体的に説明する。 [Proper use of prediction model]
FIG. 16 is a flowchart showing an example of a processing procedure for properly using various prediction models in step S1504. The data analysis device 102 is a worker who manages the worker attributes and worker characteristics corresponding to the worker ID 303 of all the workers existing in the work record data 221 based on the past work record data 221 held in advance. The attribute / feature data 230 is created or updated (step S1601). Step S1601 is periodically executed at a fixed timing such as every week. Here, the worker attribute / feature data 230 will be specifically described with reference to FIG.

［作業員属性・特徴データ２３０］
図１７は、作業員属性・特徴データ２３０の一例を示す説明図である。作業員属性・特徴データ２３０は、作業員ＩＤ３０３と、実績作業数１７０１と、作業員属性変数１７０２と、作業員ＩＤ抽象化変数１７０３と、作業員特徴変数１７０４と、を有する。 [Worker attribute / feature data 230]
FIG. 17 is an explanatory diagram showing an example of worker attribute / feature data 230. The worker attribute / feature data 230 has a worker ID 303, an actual number of works 1701, a worker attribute variable 1702, a worker ID abstraction variable 1703, and a worker feature variable 1704.

実績作業数１７０１は、学習データ内１７１１と、最新１か月以内１７１２と、を有する。学習データ内１７１１の値は、その作業員ＩＤ３０３の作業員についての作業員ＩＤ使用予測モデル２３１を生成する際の作業員ＩＤ使用学習データ２２３内の実績作業数１７０１である。最新１か月以内１７１２の値は、その作業員ＩＤ３０３の作業員についての最新１か月以内の業務実績データ２２１に含まれる実績作業数１７０１である。なお、図１７では、例として「最新１か月」としたが、１か月には限定されない。 The actual number of works 1701 includes 1711 in the learning data and 1712 within the latest one month. The value of 1711 in the learning data is the actual number of operations 1701 in the worker ID use learning data 223 when generating the worker ID usage prediction model 231 for the worker of the worker ID 303. The value of 1712 within the latest one month is 1701 of the actual number of works included in the work actual data 221 within the latest one month for the worker with the worker ID 303. In FIG. 17, the "latest one month" is set as an example, but the term is not limited to one month.

学習データ内１７１１および最新１か月以内１７１２の値は、たとえば、作業員ＩＤ使用学習データ２２３内に多く含まれる作業員と、作業員ＩＤ使用学習データ２２３には含まれないが直近の作業の業務実績データ２２１が存在する作業員と、直近の作業の業務実績データ２２１に存在しない作業員と、を判別するために用いられる。 The values of 1711 in the training data and 1712 within the latest one month are, for example, the workers who are often included in the worker ID usage learning data 223 and the values of the latest work which are not included in the worker ID usage learning data 223. It is used to discriminate between a worker who has the work record data 221 and a worker who does not exist in the work record data 221 of the latest work.

作業員属性変数１７０２は、たとえば、その作業ＩＤ３０１の作業員についての最新経験日数１７２１を含む。最新経験日数１７２１は、その作業ＩＤ３０１の作業員についての最新の時点における経験日数７０１を示す。 The worker attribute variable 1702 includes, for example, the latest number of days of experience 1721 for the worker with work ID 301. The latest number of days of experience 1721 indicates the number of days of experience 701 at the latest time point for the worker of the work ID 301.

作業員ＩＤ使用学習データ２２３は、作業が遅い１７２２、および作業が速い１７２３を含む。作業が遅い１７２２は、その作業ＩＤ３０１の作業員の作業が遅いか否かを示す。値が「１」であれば作業が遅いことを示し、値が「０」であれば作業が速いことを示す。同様に、作業が速い１７２３は、その作業ＩＤ３０１の作業員の作業が速いか否かを示す。値が「１」であれば作業が速いことを示し、値が「０」であれば作業が遅いことを示す。 The worker ID use learning data 223 includes 1722, which is slow in work, and 1723, which is fast in work. The slow work 1722 indicates whether or not the work of the worker with the work ID 301 is slow. A value of "1" indicates that the work is slow, and a value of "0" indicates that the work is fast. Similarly, the fast work 1723 indicates whether or not the work of the worker with the work ID 301 is fast. A value of "1" indicates that the work is fast, and a value of "0" indicates that the work is slow.

作業が遅い１７２２および作業が速い１７２３の値は、作業員特徴使用学習データ２２９の生成時に算出された値が用いられる。たとえば、図１０の影響度分布情報１０００において、影響度の平均値ａｖが、ａ≦ａｖ≦ｂ（ａは０より小さい。ｂは０より大きい）であれば、その作業員についての作業が遅い１７２２および作業が速い１７２３の値はともに「０」に設定される。 As the values of the slow work 1722 and the fast work 1723, the values calculated at the time of generating the worker feature use learning data 229 are used. For example, in the influence degree distribution information 1000 of FIG. 10, if the average value av of the influence degree is a ≦ av ≦ b (a is less than 0, b is larger than 0), the work for the worker is slow. The values of 1722 and 1723, which works fast, are both set to "0".

また、影響度の平均値ａｖが、ａ＞ａｖであれば、その作業員についての作業が遅い１７２２の値は「０」、作業が速い１７２３の値は「１」に設定される。また、影響度の平均値ａｖが、ｂ＜ａｖであれば、その作業員についての作業が遅い１７２２の値は「１」、作業が速い１７２３の値は「０」に設定される。 If the average value av of the degree of influence is a> av, the value of 1722, which is slow for the worker, is set to "0", and the value of 1723, which is fast, is set to "1". If the average value av of the degree of influence is b <av, the value of 1722, which is slow for the worker, is set to "1", and the value of 1723, which is fast, is set to "0".

また、作業員ＩＤ３０３が「４０」の作業員のように、学習データ内１７１１に業務実績データ２２１が存在しない（学習データ内１７１１の値が「０」）作業員に関しては、上述のように学習データから作業が遅い１７２２および作業が速い１７２３の値を設定することができない。この場合、データ分析装置１０２は、最新１ヶ月の業務実績データ２２１に対して影響度分布情報１０００を生成し、作業が遅い１７２２および作業が速い１７２３の値を設定する。 Further, as for the worker whose worker ID 303 is "40", the work performance data 221 does not exist in the learning data 1711 (the value of 1711 in the learning data is "0"), the worker learns as described above. It is not possible to set the values of 1722 for slow work and 1723 for fast work from the data. In this case, the data analysis device 102 generates the influence degree distribution information 1000 for the latest one-month work record data 221 and sets the values of 1722 for slow work and 1723 for fast work.

一方、作業員ＩＤ３０３が「５５」の作業員のように、業務実績データ２２１がほぼ存在しない、すなわち、所定数以下の作業員については、データ分析装置１０２は、作業が遅い１７２２および作業が速い１７２３の値を「０」に設定する。但し、業務に関する事前知識に基づき、実績がない作業員について作業員属性変数１７０２の初期値が定義されている場合は、この限りではない。詳細は後述する。 On the other hand, like the worker whose worker ID 303 is "55", the work performance data 221 is almost nonexistent, that is, for the workers having a predetermined number or less, the data analysis device 102 has a slow work of 1722 and a fast work. Set the value of 1723 to "0". However, this does not apply when the initial value of the worker attribute variable 1702 is defined for the worker who has no track record based on the prior knowledge about the work. Details will be described later.

作業員特徴変数１７０４についても、たとえば、データ分析装置１０２は、作業員特徴変数１４０１の「棚上段が得意」の値が「１」の作業ＩＤ３０１を作業員特徴使用学習データ２２９から特定する。そして、データ分析装置１０２は、特定した作業ＩＤ３０１の作業を実施した作業員のエントリにおいて、棚上段が得意１７３１の値に「１」を設定する。同様に、データ分析装置１０２は、作業員特徴変数１４０１の「棚上段が得意」の値が「０」の作業ＩＤ３０１を作業員特徴使用学習データ２２９から特定する。そして、データ分析装置１０２は、特定した作業ＩＤ３０１の作業を実施した作業員のエントリにおいて、棚上段が得意１７３１の値に「０」を設定する。 Regarding the worker characteristic variable 1704, for example, the data analyzer 102 specifies the work ID 301 in which the value of the worker characteristic variable 1401 “good at the upper shelf” is “1” from the worker feature use learning data 229. Then, the data analysis device 102 sets "1" to the value of 1731, which the upper shelf is good at, in the entry of the worker who has performed the work of the specified work ID 301. Similarly, the data analysis device 102 identifies the work ID 301 in which the value of the worker feature variable 1401 “good at the upper shelf” is “0” from the worker feature use learning data 229. Then, the data analysis device 102 sets "0" to the value of 1731, which the upper shelf is good at, in the entry of the worker who performed the work of the specified work ID 301.

このように、業務改善施策生成処理１２２の実行時における各作業員の最新の作業員属性変数１７０２、作業員ＩＤ抽象化変数１７０３および作業員特徴変数１７０４の値を管理することが可能となる。ここで、図１７の説明を終了する。 In this way, it is possible to manage the values of the latest worker attribute variable 1702, the worker ID abstraction variable 1703, and the worker feature variable 1704 of each worker at the time of executing the business improvement measure generation process 122. Here, the description of FIG. 17 ends.

図１６に戻り、データ分析装置１０２は、作業員属性・特徴データ２３０と、作業員ＩＤ使用予測モデル２３１、作業員属性使用予測モデル２３２、および作業員特徴使用予測モデル２３５とを用いて、作業時間の予測の実行を、ステップＳ１５０４により開始する（ステップＳ１６０１）。 Returning to FIG. 16, the data analyzer 102 works by using the worker attribute / feature data 230, the worker ID usage prediction model 231 and the worker attribute usage prediction model 232, and the worker feature usage prediction model 235. Execution of the time prediction is started in step S1504 (step S1601).

まず、データ分析装置１０２は、予測対象作業の作業員の作業員ＩＤ３０３について、学習データ内１７１１における実績作業数１７０１が所定件数（たとえば、１００件）以上か否か判定する（ステップＳ１６０２）。所定件数以上存在する場合（ステップＳ１６０２：Ｙｅｓ）、データ分析装置１０２は、適用する予測モデルを作業員ＩＤ使用予測モデル２３１に決定する（ステップＳ１６０３）。 First, the data analysis device 102 determines whether or not the actual number of work 1701 in the learning data 1711 is equal to or more than a predetermined number (for example, 100) with respect to the worker ID 303 of the worker of the work to be predicted (step S1602). When there are a predetermined number or more (step S1602: Yes), the data analyzer 102 determines the prediction model to be applied to the worker ID usage prediction model 231 (step S1603).

この場合、ステップＳ１５０５では、データ分析装置１０２は、作業員ＩＤ使用予測モデル２３１に、再計算後の作業員ＩＤ使用学習データ２２３を入力することにより、作業時間９０４の予測を実行することになる。 In this case, in step S1505, the data analyzer 102 predicts the working time 904 by inputting the recalculated worker ID usage learning data 223 into the worker ID usage prediction model 231. ..

一方、所定件数以上存在しない場合（ステップＳ１６０２：Ｎｏ）、データ分析装置１０２は、予測対象の作業員が、最新１ヶ月以内１７１２の実績作業数１７０１が所定件数（たとえば、５０件）以上存在しているか否か判定する（ステップＳ１６０４）。所定件数以上存在する場合（ステップＳ１６０４：Ｙｅｓ）、データ分析装置１０２は、適用する予測モデルを作業員特徴使用予測モデル２３５に決定する（ステップＳ１６０５）。 On the other hand, when there are no more than a predetermined number (step S1602: No), in the data analyzer 102, the number of workers to be predicted is 1712 within the latest one month, 1701 is present in a predetermined number (for example, 50) or more. It is determined whether or not (step S1604). When there are a predetermined number or more (step S1604: Yes), the data analyzer 102 determines the prediction model to be applied to the worker feature use prediction model 235 (step S1605).

この場合、ステップＳ１５０５では、データ分析装置１０２は、作業員特徴使用予測モデル２３５に、再計算後の作業員ＩＤ使用学習データ２２３に作業員ＩＤ抽象化変数１７０３および作業員特徴変数１７０４が追加された学習データを入力することにより、作業時間９０４の予測を実行することになる。 In this case, in step S1505, the data analyzer 102 adds the worker ID abstraction variable 1703 and the worker feature variable 1704 to the worker ID usage learning data 223 after recalculation to the worker feature usage prediction model 235. By inputting the training data, the prediction of the working time 904 is executed.

一方、所定件数以上存在しない場合（ステップＳ１６０４：Ｎｏ）、データ分析装置１０２は、適用する予測モデルを作業員属性使用予測モデル２３２に決定する（ステップＳ１６０６）。 On the other hand, when the number of cases does not exist more than a predetermined number (step S1604: No), the data analyzer 102 determines the prediction model to be applied to the worker attribute use prediction model 232 (step S1606).

この場合、ステップＳ１５０５では、データ分析装置１０２は、作業員属性使用予測モデル２３２に、再計算後の作業員ＩＤ使用学習データ２２３に作業員属性変数１７０２が追加された学習データを入力することにより、作業時間９０４の予測を実行することになる。 In this case, in step S1505, the data analyzer 102 inputs the training data in which the worker attribute variable 1702 is added to the worker ID usage learning data 223 after recalculation into the worker attribute usage prediction model 232. , The prediction of working time 904 will be executed.

以上の処理によって、データ分析装置１０２は、予測対象の作業の予測作業時間を算出し、本処理を終了する（ステップＳ１６０７）。このように、予測対象の作業員に応じて、使用する予測モデルを適切に使い分けることによって、作業員の業務実績データ２２１の件数に応じた精度の高い作業時間を予測することが可能となる。 Through the above processing, the data analysis device 102 calculates the predicted work time of the work to be predicted, and ends this processing (step S1607). In this way, by appropriately using the prediction model to be used according to the worker to be predicted, it is possible to predict the work time with high accuracy according to the number of work performance data 221 of the worker.

すなわち、本例では、ステップＳ１６０２において、過去の業務実績データ２２１の件数が所定件数（たとえば、１００件）以上（ステップＳ１６０２：Ｙｅｓ）の作業員に関しては、データ分析装置１０２は、作業員個人に特化した予測モデルである作業員ＩＤ使用予測モデル２３１により作業時間を予測する（ステップＳ１６０３）。 That is, in this example, in step S1602, with respect to the worker whose number of past business record data 221 is equal to or more than a predetermined number (for example, 100) (step S1602: Yes), the data analysis device 102 is set to the individual worker. The working time is predicted by the worker ID usage prediction model 231 which is a specialized prediction model (step S1603).

また、ステップＳ１６０４において、最新１ヶ月の業務実績データ２２１の件数が所定件数（たとえば、５０件）未満である未知の作業員については、データ分析装置１０２は、作業員個人の特徴ではなく、作業員属性使用予測モデル２３２を用いて作業時間を予測する（ステップＳ１６０６）。 Further, in step S1604, for an unknown worker whose latest one-month work record data 221 is less than a predetermined number (for example, 50), the data analysis device 102 is not a feature of the individual worker but a work. The working time is predicted using the member attribute usage prediction model 232 (step S1606).

この場合、特定の属性に関して業務に関する事前知識によって初期値が定義されてもよい。たとえば、事前知識によって「経験日数が少ない作業員は新人であり作業が遅いと見なす」ことができる場合は、データ分析装置１０２は、事前に、作業員属性・特徴データ２３０において、業務実績データ２２１の件数が所定件数（たとえば、５０件）未満の作業員の「作業が遅い」の値を「１」に設定する。 In this case, the initial value may be defined by prior knowledge about the business regarding a specific attribute. For example, if it is possible to "consider that a worker with a small number of days of experience is a newcomer and work is slow" by prior knowledge, the data analysis device 102 previously performs work performance data 221 in the worker attribute / feature data 230. The value of "work is slow" for workers whose number of cases is less than a predetermined number (for example, 50 cases) is set to "1".

学習データ内１７１１には存在しないが（ステップＳ１６０２：Ｎｏ）、最新１ヶ月以内１７１２には所定件数（たとえば、１００件）以上存在するような作業員の場合（ステップＳ１６０４：Ｙｅｓ）、データ分析装置１０２は、作業員属性・特徴データ２３０によってあらかじめ付与された変数の値を用い、作業員特徴使用予測モデル２３５を用いて作業時間を予測する（ステップＳ１６０５）。 In the case of a worker who does not exist in 1711 in the training data (step S1602: No), but exists in 1712 within the latest one month in 1712 or more (for example, 100 cases) (step S1604: Yes), the data analyzer. In 102, the working time is predicted using the worker feature usage prediction model 235 using the values of the variables given in advance by the worker attribute / feature data 230 (step S1605).

このように、何れの作業員の場合においても、データ分析装置１０２は、十分な作業量を備えた学習データによって生成された予測モデルで予測を実行する。したがって、精度の高い作業時間の予測が可能となる。以上で説明したデータ分析装置１０２を用いることで、作業時間予測の対象となる作業員の個人性を考慮した作業時間の予測を高精度に行うことが可能となる。 As described above, in the case of any worker, the data analysis device 102 executes the prediction with the prediction model generated by the learning data having a sufficient amount of work. Therefore, it is possible to predict the working time with high accuracy. By using the data analysis device 102 described above, it is possible to predict the work time with high accuracy in consideration of the individuality of the worker who is the target of the work time prediction.

また、上述したデータ分析装置１０２は、下記（１）〜（１５）のように構成することもできる。 Further, the above-mentioned data analysis device 102 can also be configured as described in (1) to (15) below.

（１）データ分析装置１０２では、プロセッサ２０１は、複数の作業の作業ごとに、作業時間を示す目的変数６０１と、作業環境に関する説明変数６０２と、作業員ごとの作業の実施の有無を示す作業員ＩＤ変数６０３と、を有する作業員ＩＤ使用学習データ２２３を第１学習データとして取得する取得処理と、取得処理によって取得された作業員ＩＤ使用学習データ２２３に基づいて、作業時間を予測する作業員ＩＤ使用予測モデル２３１を第１予測モデルとして生成する予測モデル生成処理と、予測モデル生成処理によって生成された作業員ＩＤ使用予測モデル２３１に作業員ＩＤ使用学習データを入力することにより、作業時間の第１予測値９０４と、作業員ＩＤ変数６０３が作業ごとの第１予測値９０４の統計量（たとえば、平均値。中央値など他の統計量でもよい。）に与える影響の度合いを示す第１影響度９０３とを、作業ごとに算出する算出処理と、作業員ごとの複数の作業の各々の第１影響度９０３の分布（影響度分布情報１０００）に基づいて、複数の作業員の作業速度を抽象化した作業員ＩＤ抽象化変数１１０１を作業ごとに生成し、作業ごとに、目的変数６０１と、説明変数６０２と、作業員ＩＤ抽象化変数１１０１と、を有する抽象化変数使用学習データ２２７を第２学習データとして生成する学習データ生成処理と、を実行する。 (1) In the data analysis device 102, the processor 201 has an objective variable 601 indicating the work time, an explanatory variable 602 regarding the work environment, and an operation indicating whether or not the work is performed for each worker for each of a plurality of work operations. The work of predicting the work time based on the acquisition process of acquiring the worker ID use learning data 223 having the member ID variable 603 and the worker ID use learning data 223 as the first learning data and the worker ID use learning data 223 acquired by the acquisition process. Working time by inputting worker ID usage learning data into the prediction model generation process that generates the worker ID usage prediction model 231 as the first prediction model and the worker ID usage prediction model 231 generated by the prediction model generation process. 904, and the worker ID variable 603, which indicates the degree of influence on the statistic of the first predicted value 904 for each work (for example, the average value or other statistic such as the median value). Work of a plurality of workers based on a calculation process for calculating one influence degree 903 for each work and a distribution of a first influence degree 903 (impact degree distribution information 1000) for each of a plurality of works for each worker. Worker ID abstraction variable 1101 that abstracts the speed is generated for each work, and for each work, there is an abstract variable use learning data having an objective variable 601, an explanatory variable 602, and a worker ID abstract variable 1101. The training data generation process of generating 227 as the second training data is executed.

これにより、作業ごとに、作業を実施した複数の作業員を作業速度に抽象化した説明変数を得ることができ、抽象化変数使用学習データ２２７を用いて生成される抽象化変数使用予測モデル２３４（第３予測モデル）の高精度化を図ることができる。 As a result, it is possible to obtain an explanatory variable that abstracts a plurality of workers who have performed the work to the work speed for each work, and the abstract variable use prediction model 234 generated by using the abstract variable use learning data 227. It is possible to improve the accuracy of (third prediction model).

（２）上記（１）のデータ分析装置１０２において、第１学習データは、作業を実施した複数の作業員の各々の経験日数７０１を作業ごとに示す作業員属性変数７００を含む。これにより、第１学習データは、中間学習データ２２５となる。 (2) In the data analysis device 102 of the above (1), the first learning data includes a worker attribute variable 700 indicating the number of experience days 701 of each of the plurality of workers who performed the work for each work. As a result, the first learning data becomes the intermediate learning data 225.

また、予測モデル生成処理では、プロセッサ２０１は、作業員属性変数７００を含む前記第１学習データである中間学習データ２２５に基づいて、中間予測モデル２３３を第１予測モデルとして生成する。 Further, in the prediction model generation process, the processor 201 generates the intermediate prediction model 233 as the first prediction model based on the intermediate learning data 225 which is the first training data including the worker attribute variable 700.

そして、算出処理では、プロセッサ２０１は、中間予測モデル２３３に中間学習データ２２５を作業ごとに入力することにより、作業時間の第１予測値９０４と、説明変数６０２、作業員属性変数７００および作業員ＩＤ変数６０３の各々が作業ごとの第１予測値９０４の統計量に与える影響の度合いを示す第１影響度９０３とを、作業ごとに算出する。 Then, in the calculation process, the processor 201 inputs the intermediate learning data 225 into the intermediate prediction model 233 for each work, so that the first predicted value 904 of the working time, the explanatory variable 602, the worker attribute variable 700, and the worker The first influence degree 903, which indicates the degree of influence that each of the ID variables 603 has on the statistic of the first predicted value 904 for each work, is calculated for each work.

そして、学習データ生成処理では、プロセッサ２０１は、作業員ごとの第１影響度９０３の分布（影響度分布情報１０００）に基づいて、作業員ＩＤ抽象化変数１１０１を作業ごとに生成し、抽象化変数使用学習データ２２７に、作業ごとの作業員属性変数７００を追加する。これにより、作業員の経験日数７０１を考慮した抽象化変数使用予測モデル２３４を生成することができる。 Then, in the learning data generation process, the processor 201 generates the worker ID abstraction variable 1101 for each work based on the distribution of the first influence degree 903 for each worker (impact degree distribution information 1000), and abstracts it. A worker attribute variable 700 for each work is added to the variable use learning data 227. As a result, it is possible to generate an abstract variable usage prediction model 234 in consideration of the number of days of experience of the worker 701.

（３）上記（２）のデータ分析装置１０２において、作業員属性変数７００は、作業員から所定距離内でかつ作業時間帯が重複する付近の作業員の経験日数７０２を含む。これにより、付近の作業員の経験日数７０２が大きいほど、その作業の作業員は、付近の作業員からの業務支援を受けやすいという状況を予測モデルに反映することができ、付近の作業員の経験日数７０２が小さいほど、その作業の作業員は、付近の作業員に業務支援をするという状況を予測モデルに反映することができる。 (3) In the data analysis device 102 of the above (2), the worker attribute variable 700 includes the number of experience days 702 of a worker within a predetermined distance from the worker and in the vicinity where the work time zones overlap. As a result, it is possible to reflect in the prediction model that the larger the number of days of experience 702 of a nearby worker is, the more likely the worker of that work is to receive work support from a nearby worker. The smaller the number of days of experience 702, the more the worker of the work can reflect the situation of providing work support to the nearby worker in the prediction model.

（４）上記（２）のデータ分析装置１０２において、予測モデル生成処理では、プロセッサ２０１は、作業員属性変数７００を含む第１学習データである作業員属性使用学習データ２２４に基づいて、作業時間を予測する作業員属性使用予測モデル２３２を生成する。 (4) In the data analysis device 102 of the above (2), in the prediction model generation process, the processor 201 has a working time based on the worker attribute use learning data 224, which is the first learning data including the worker attribute variable 700. The worker attribute usage prediction model 232 for predicting is generated.

これにより、作業員の経験日数７０１のような作業員属性を考慮した作業員属性使用予測モデル２３２により、そのような作業員属性を有する作業員の作業時間を、作業員個人を特定することなく予測することができる。 As a result, the worker attribute usage prediction model 232 that considers the worker attribute such as the number of days of experience of the worker 701 allows the working time of the worker having such a worker attribute to be determined without specifying the individual worker. Can be predicted.

（５）上記（１）のデータ分析装置１０２において、予測モデル生成処理では、プロセッサ２０１は、抽象化変数使用学習データ２２７に基づいて、作業時間を予測する抽象化変数使用予測モデル２３４を第３予測モデルとして生成する。 (5) In the prediction model generation process in the data analyzer 102 of the above (1), the processor 201 uses the abstract variable usage prediction model 234 that predicts the working time based on the abstract variable usage learning data 227. Generate as a prediction model.

また、算出処理では、プロセッサ２０１は、予測モデル生成処理によって生成された抽象化変数使用予測モデル２３４に抽象化変数使用学習データ２２７を入力することにより、作業時間の第２予測値９０４と、説明変数６０２が作業ごとの第２予測値９０４の統計量に与える影響の度合いを示す第２影響度９０１とを、作業ごとに算出する。 Further, in the calculation process, the processor 201 inputs the abstract variable use learning data 227 into the abstract variable use prediction model 234 generated by the prediction model generation process, thereby explaining that the working time is the second predicted value 904. The second influence degree 901, which indicates the degree of influence that the variable 602 has on the statistic of the second predicted value 904 for each work, is calculated for each work.

そして、学習データ生成処理では、プロセッサ２０１は、第２影響度９０１の分布１３００に基づいて、説明変数６０２に対する作業員の個人的な特徴を示す作業員特徴変数１４０１を作業ごとに生成し、抽象化変数使用学習データ２２７に、作業ごとの作業員特徴変数１４０１を追加した作業員特徴使用学習データ２２９を第３学習データとして生成する。 Then, in the learning data generation process, the processor 201 generates a worker characteristic variable 1401 indicating the personal characteristics of the worker with respect to the explanatory variable 602 based on the distribution 1300 of the second influence degree 901 for each work, and abstracts it. The worker feature use learning data 229 in which the worker feature variable 1401 for each work is added to the abstract variable use learning data 227 is generated as the third learning data.

これにより、作業員特徴変数１４０１を考慮した作業員特徴使用予測モデル２３５（第４予測モデル）を生成することができる。 As a result, the worker feature use prediction model 235 (fourth prediction model) in consideration of the worker feature variable 1401 can be generated.

（６）上記（５）のデータ分析装置１０２において、予測モデル生成処理では、プロセッサ２０１は、作業員特徴使用学習データ２２９に基づいて、作業時間を予測する作業員特徴使用予測モデル２３５を第４予測モデルとして生成する。 (6) In the prediction model generation process in the data analysis device 102 of the above (5), the processor 201 uses the worker feature use prediction model 235 that predicts the work time based on the worker feature use learning data 229. Generate as a prediction model.

これにより、棚上段が得意といった作業員の個人的な特徴を考慮した作業員特徴使用予測モデル２３５により、そのような特徴を有する作業員の作業時間を、作業員個人を特定することなく予測することができる。 As a result, the work time of a worker having such a characteristic is predicted without specifying the individual worker by the worker characteristic use prediction model 235 considering the individual characteristics of the worker such as being good at the upper shelf. be able to.

（７）上記（１）のデータ分析装置１０２において、プロセッサ２０１は、作業を実施した作業員、作業時間、作業環境を作業ごとに有する第１実績データ（業務実績データ２２１）に基づいて、目的変数６０１および説明変数６０２を生成し、作業ごとの作業員から作業員ＩＤ変数６０３を生成することにより、第１実績データを作業員ＩＤ使用学習データ２２３に変換する変換処理を実行する。 (7) In the data analyzer 102 of the above (1), the processor 201 aims at the first actual data (business actual data 221) having the worker who performed the work, the working time, and the working environment for each work. By generating the variable 601 and the explanatory variable 602 and generating the worker ID variable 603 from the worker for each work, the conversion process of converting the first actual data into the worker ID use learning data 223 is executed.

また、取得処理では、プロセッサ２０１は、第１実績データの作業環境の変更に基づいて作業時間を再計算することにより、第１実績データを第２実績データ（改善対象の業務実績データ２２１）に変換し、第２実績データに基づいて、作業ごとに、再計算された作業時間を示す目的変数６０１と、変更された作業環境に関する説明変数６０２と、作業環境の変更に基づく作業員ごとの作業の実施の有無を示す作業員ＩＤ変数６０３と、を有する作業員ＩＤ使用学習データ２２３を第４学習データとして取得する。 Further, in the acquisition process, the processor 201 converts the first actual data into the second actual data (business actual data 221 to be improved) by recalculating the working time based on the change in the working environment of the first actual data. Objective variable 601 indicating the work time recalculated for each work after conversion based on the second actual data, explanatory variable 602 for the changed work environment, and work for each worker based on the change in the work environment. The worker ID use learning data 223 having the worker ID variable 603 indicating whether or not the above is implemented is acquired as the fourth learning data.

そして、算出処理では、プロセッサ２０１は、作業員ＩＤ使用予測モデル２３１に、予測対象の作業員に関する第４学習データを入力することにより、予測対象の作業員に関する作業時間の第３予測値９０４を作業ごとに算出する。 Then, in the calculation process, the processor 201 inputs the fourth learning data regarding the worker to be predicted into the worker ID usage prediction model 231 to obtain the third predicted value 904 of the working time for the worker to be predicted. Calculate for each work.

これにより、改善対象の業務実績データ２２１から特定される改善業務を予測対象の作業員が実施した場合の各作業の作業時間の予測値を得ることができる。したがって、データ分析装置１０２の利用者は、予測対象の作業員の作業時間が改善されたか否かを特定することができる。 As a result, it is possible to obtain a predicted value of the work time of each work when the worker to be predicted performs the improvement work specified from the work performance data 221 to be improved. Therefore, the user of the data analysis device 102 can specify whether or not the working time of the worker to be predicted has been improved.

（８）上記（７）のデータ分析装置１０２において、算出処理では、プロセッサ２０１は、第４学習データの作業員ＩＤ変数６０３において予測対象の作業員が実施したことを示す作業の件数が第１所定件数以上存在する場合（Ｓ１６０２：Ｙｅｓ）、作業員ＩＤ使用予測モデル２３１に第４学習データを入力することにより、予測対象の作業員に関する作業時間の第３予測値９０４を作業ごとに算出する。 (8) In the data analysis device 102 of the above (7), in the calculation process, the processor 201 has the first number of operations indicating that the worker to be predicted has performed in the worker ID variable 603 of the fourth training data. When there are more than a predetermined number (S1602: Yes), the third predicted value 904 of the working time for the worker to be predicted is calculated for each work by inputting the fourth learning data into the worker ID usage prediction model 231. ..

これにより、予測対象の作業員が実施した作業が第１所定件数以上の第４学習データについては、作業員ＩＤ使用予測モデル２３１を優先適用することにより、予測対象の作業員に関する作業時間の予測精度の向上を図ることができる。 As a result, for the fourth learning data in which the work performed by the worker to be predicted is equal to or greater than the first predetermined number, the worker ID usage prediction model 231 is preferentially applied to predict the work time for the worker to be predicted. The accuracy can be improved.

（９）上記（８）のデータ分析装置１０２において、プロセッサ２０１は、作業時間の第３予測値９０４が所定の改善条件を充足するか否かを判定する判定処理（ステップＳ１５０７）と、判定処理によって改善条件を充足する判定された場合、第２実績データに関する作業環境を改善施策に決定する決定処理（ステップＳ１５０８）と、決定処理による決定結果を出力する出力処理（ステップＳ１５０９）と、を実行する。 (9) In the data analysis device 102 of the above (8), the processor 201 has a determination process (step S1507) for determining whether or not the third predicted value 904 of the working time satisfies a predetermined improvement condition, and a determination process. When it is determined that the improvement conditions are satisfied, a decision process (step S1508) for determining the work environment related to the second actual data as an improvement measure and an output process (step S1509) for outputting the decision result by the decision process are executed. To do.

これにより、データ分析装置１０２は、予測対象の作業員が実施する作業について適切な改善施策を提示することができる。 As a result, the data analysis device 102 can present appropriate improvement measures for the work performed by the worker to be predicted.

（１０）上記（４）のデータ分析装置１０２において、プロセッサ２０１は、作業を実施した作業員、作業時間、作業環境を作業ごとに有する第１実績データ（業務実績データ２２１）に基づいて、目的変数６０１および説明変数６０２を生成し、作業ごとの作業員から作業員ＩＤ変数６０３を生成することにより、第１実績データを作業員ＩＤ使用学習データ２２３に変換する変換処理を実行する。 (10) In the data analyzer 102 of the above (4), the processor 201 aims at the first actual data (business actual data 221) having the worker who performed the work, the working time, and the working environment for each work. By generating the variable 601 and the explanatory variable 602 and generating the worker ID variable 603 from the worker for each work, the conversion process of converting the first actual data into the worker ID use learning data 223 is executed.

また、取得処理では、プロセッサ２０１は、第１実績データの作業環境の変更に基づいて作業時間を再計算することにより、第１実績データを第２実績データ（改善対象の業務実績データ２２１）に変換し、第２実績データに基づいて、作業ごとに、再計算された作業時間を示す目的変数６０１と、変更された作業環境に関する説明変数６０２と、作業環境の変更に基づく作業員ごとの作業の実施の有無を示す作業員ＩＤ変数６０３と、を有し、かつ、作業員属性変数１７０２を追加した第４学習データを取得する。 Further, in the acquisition process, the processor 201 converts the first actual data into the second actual data (business actual data 221 to be improved) by recalculating the working time based on the change in the working environment of the first actual data. The objective variable 601 indicating the work time recalculated for each work after conversion based on the second actual data, the explanatory variable 602 for the changed work environment, and the work for each worker based on the change in the work environment. Acquires the fourth learning data having the worker ID variable 603 indicating whether or not the above is performed, and adding the worker attribute variable 1702.

そして、算出処理では、プロセッサ２０１は、作業員属性使用予測モデル２３２に、予測対象の作業員に関する第４学習データを入力することにより、予測対象の作業員に関する作業時間の第３予測値９０４を作業ごとに算出する、 Then, in the calculation process, the processor 201 inputs the fourth learning data regarding the worker to be predicted into the worker attribute use prediction model 232 to obtain the third predicted value 904 of the work time related to the worker to be predicted. Calculate for each work,

これにより、改善対象の業務実績データ２２１から特定される改善業務を予測対象の作業員が実施した場合に、予測対象の作業員の作業員属性を考慮して各作業の作業時間の予測値を得ることができる。したがって、データ分析装置１０２の利用者は、予測対象の作業員の作業時間が改善されたか否かを高精度に特定することができる。 As a result, when the worker to be predicted performs the improvement work specified from the work performance data 221 to be improved, the predicted value of the work time of each work is calculated in consideration of the worker attributes of the worker to be predicted. Obtainable. Therefore, the user of the data analysis device 102 can accurately identify whether or not the working time of the worker to be predicted has been improved.

（１１）上記（１０）のデータ分析装置１０２において、算出処理では、プロセッサ２０１は、第４学習データの作業員ＩＤ変数６０３において予測対象の作業員が所定期間内（たとえば、直近過去１か月以内１７１２）で実施したことを示す作業の件数が第２所定件数以上存在しない場合（ステップＳ１６０４：Ｎｏ）、作業員属性使用予測モデル２３２に、予測対象の作業員に関する第４学習データを入力することにより、予測対象の作業員に関する作業時間の第３予測値９０４を作業ごとに算出する。 (11) In the data analysis device 102 of the above (10), in the calculation process, the processor 201 has the worker ID variable 603 of the fourth training data that the worker to be predicted is within a predetermined period (for example, the latest one month). When the number of works indicating that the work has been performed within 1712) does not exist in excess of the second predetermined number (step S1604: No), the fourth learning data regarding the worker to be predicted is input to the worker attribute usage prediction model 232. As a result, the third predicted value 904 of the working time for the worker to be predicted is calculated for each work.

これにより、予測対象の作業員が実施した作業が第２所定件数未満の第４学習データについては、作業員個人を特定するのではなく作業員属性を用いた作業員属性使用予測モデル２３２を優先適用することにより、予測対象の作業員に関する作業時間の予測精度の向上を図ることができる。 As a result, for the fourth learning data in which the work performed by the worker to be predicted is less than the second predetermined number, the worker attribute usage prediction model 232 using the worker attribute is prioritized instead of specifying the individual worker. By applying it, it is possible to improve the prediction accuracy of the work time for the worker to be predicted.

（１２）上記（５）のデータ分析装置１０２において、プロセッサ２０１は、作業を実施した作業員、作業時間、作業環境を作業ごとに有する第１実績データ（業務実績データ２２１）に基づいて、目的変数６０１および説明変数６０２を生成し、作業ごとの作業員から作業員ＩＤ変数６０３を生成することにより、第１実績データを作業員ＩＤ使用学習データ２２３に変換する変換処理を実行する。 (12) In the data analyzer 102 of the above (5), the processor 201 aims at the first actual data (business actual data 221) having the worker who performed the work, the working time, and the working environment for each work. By generating the variable 601 and the explanatory variable 602 and generating the worker ID variable 603 from the worker for each work, the conversion process of converting the first actual data into the worker ID use learning data 223 is executed.

また、取得処理では、プロセッサ２０１は、第１実績データの作業環境の変更に基づいて作業時間を再計算することにより、第１実績データを第２実績データ（改善対象の業務実績データ２２１）に変換し、第２実績データに基づいて、作業ごとに、再計算された作業時間を示す目的変数６０１と、変更された作業環境に関する説明変数６０２と、作業環境の変更に基づく作業員ごとの作業の実施の有無を示す作業員ＩＤ変数６０３と、を有し、かつ、作業員ＩＤ抽象化変数１７０３を追加した第４学習データを取得する。 Further, in the acquisition process, the processor 201 converts the first actual data into the second actual data (business actual data 221 to be improved) by recalculating the working time based on the change in the working environment of the first actual data. The objective variable 601 indicating the work time recalculated for each work after conversion based on the second actual data, the explanatory variable 602 for the changed work environment, and the work for each worker based on the change in the work environment. Acquires the fourth training data having the worker ID variable 603 indicating whether or not the above is performed, and adding the worker ID abstraction variable 1703.

そして、算出処理では、プロセッサ２０１は、抽象化変数使用予測モデル２３４に、予測対象の作業員に関する第４学習データを入力することにより、予測対象の作業員に関する作業時間の第３予測値９０４を作業ごとに算出する。 Then, in the calculation process, the processor 201 inputs the fourth learning data regarding the worker to be predicted into the abstraction variable usage prediction model 234 to obtain the third predicted value 904 of the working time for the worker to be predicted. Calculate for each work.

これにより、改善対象の業務実績データ２２１から特定される改善業務を予測対象の作業員が実施した場合に、予測対象の作業員を抽象化した作業速度を考慮して各作業の作業時間の予測値を得ることができる。したがって、データ分析装置１０２の利用者は、予測対象の作業員の作業時間が改善されたか否かを高精度に特定することができる。 As a result, when the worker to be predicted performs the improvement work specified from the work performance data 221 to be improved, the work time of each work is predicted in consideration of the work speed that abstracts the worker to be predicted. You can get the value. Therefore, the user of the data analysis device 102 can accurately identify whether or not the working time of the worker to be predicted has been improved.

（１３）上記（１２）のデータ分析装置１０２において、算出処理では、プロセッサ２０１は、第４学習データの作業員ＩＤ変数６０３において予測対象の作業員が所定期間内（たとえば、直近過去１か月以内１７１２）で実施したことを示す作業の件数が第２所定件数以上存在する場合（ステップＳ１６０４：Ｙｅｓ）、抽象化変数使用予測モデル２３４に、予測対象の作業員に関する第４学習データを入力することにより、予測対象の作業員に関する作業時間の第３予測値９０４を作業ごとに算出する。 (13) In the data analyzer 102 of the above (12), in the calculation process, in the calculation process, the processor 201 has the worker to be predicted in the worker ID variable 603 of the fourth training data within a predetermined period (for example, the latest one month). When the number of works indicating that the work has been performed within 1712) is equal to or greater than the second predetermined number (step S1604: Yes), the fourth learning data regarding the worker to be predicted is input to the abstract variable usage prediction model 234. As a result, the third predicted value 904 of the working time for the worker to be predicted is calculated for each work.

これにより、予測対象の作業員が実施した作業が第２所定件数以上の第４学習データについては、予測対象の作業員の作業速度を抽象化した抽象化変数使用予測モデル２３４を優先適用することにより、予測対象の作業員に関する作業時間の予測精度の向上を図ることができる。 As a result, for the fourth training data in which the work performed by the worker to be predicted is equal to or greater than the second predetermined number, the abstract variable usage prediction model 234 that abstracts the work speed of the worker to be predicted is preferentially applied. As a result, it is possible to improve the prediction accuracy of the work time of the worker to be predicted.

（１４）上記（６）のデータ分析装置１０２において、プロセッサ２０１は、作業を実施した作業員、作業時間、作業環境を作業ごとに有する第１実績データ（業務実績データ２２１）に基づいて、目的変数６０１および説明変数６０２を生成し、作業ごとの作業員から作業員ＩＤ変数６０３を生成することにより、第１実績データを作業員ＩＤ使用学習データ２２３に変換する変換処理を実行する。 (14) In the data analyzer 102 of the above (6), the processor 201 aims at the first actual data (business actual data 221) having the worker who performed the work, the working time, and the working environment for each work. By generating the variable 601 and the explanatory variable 602 and generating the worker ID variable 603 from the worker for each work, the conversion process of converting the first actual data into the worker ID use learning data 223 is executed.

また、取得処理では、プロセッサ２０１は、第１実績データの作業環境の変更に基づいて作業時間を再計算することにより、第１実績データを第２実績データ（改善対象の業務実績データ２２１）に変換し、第２実績データに基づいて、作業ごとに、再計算された作業時間を示す目的変数６０１と、変更された作業環境に関する説明変数６０２と、作業環境の変更に基づく作業員ごとの作業の実施の有無を示す作業員ＩＤ変数６０３と、を有し、かつ、作業員特徴変数１７０４を追加した第４学習データを取得する。 Further, in the acquisition process, the processor 201 converts the first actual data into the second actual data (business actual data 221 to be improved) by recalculating the working time based on the change in the working environment of the first actual data. The objective variable 601 indicating the work time recalculated for each work after conversion based on the second actual data, the explanatory variable 602 for the changed work environment, and the work for each worker based on the change in the work environment. Acquires the fourth learning data having the worker ID variable 603 indicating whether or not the above is performed, and adding the worker characteristic variable 1704.

そして、算出処理では、プロセッサ２０１は、作業員特徴使用予測モデル２３５に、予測対象の作業員に関する前記第４学習データを入力することにより、予測対象の作業員に関する作業時間の第３予測値を前記作業ごとに算出する。 Then, in the calculation process, the processor 201 inputs the fourth learning data regarding the worker to be predicted into the worker feature use prediction model 235 to obtain a third predicted value of the work time related to the worker to be predicted. Calculated for each of the above operations.

これにより、改善対象の業務実績データ２２１から特定される改善業務を予測対象の作業員が実施した場合に、予測対象の作業員の個人的な特徴を考慮して各作業の作業時間の予測値を得ることができる。したがって、データ分析装置１０２の利用者は、予測対象の作業員の作業時間が改善されたか否かを高精度に特定することができる。 As a result, when the worker to be predicted carries out the improvement work specified from the work performance data 221 to be improved, the predicted value of the work time of each work is taken into consideration in consideration of the personal characteristics of the worker to be predicted. Can be obtained. Therefore, the user of the data analysis device 102 can accurately identify whether or not the working time of the worker to be predicted has been improved.

（１５）上記（１４）のデータ分析装置１０２において、算出処理では、プロセッサ２０１は、第４学習データの作業員ＩＤ変数６０３において予測対象の作業員が所定期間内（たとえば、直近過去１か月以内１７１２）で実施したことを示す作業の件数が第２所定件数以上存在する場合（ステップＳ１６０４：Ｙｅｓ）、作業員特徴使用予測モデル２３５に、予測対象の作業員に関する第４学習データを入力することにより、予測対象の作業員に関する作業時間の第３予測値９０４を作業ごとに算出する。 (15) In the data analyzer 102 of the above (14), in the calculation process, in the calculation process, the processor 201 has the worker to be predicted in the worker ID variable 603 of the fourth training data within a predetermined period (for example, the latest past one month). When the number of works indicating that the work has been performed within 1712) is equal to or greater than the second predetermined number (step S1604: Yes), the fourth learning data regarding the worker to be predicted is input to the worker feature usage prediction model 235. As a result, the third predicted value 904 of the working time for the worker to be predicted is calculated for each work.

これにより、予測対象の作業員が実施した作業が第２所定件数以上の第４学習データについては、予測対象の作業員の個人的な特徴を用いた作業員特徴使用予測モデル２３５を優先適用することにより、予測対象の作業員に関する作業時間の予測精度の向上を図ることができる。 As a result, for the fourth learning data in which the work performed by the worker to be predicted is equal to or greater than the second predetermined number, the worker feature use prediction model 235 using the personal characteristics of the worker to be predicted is preferentially applied. As a result, it is possible to improve the prediction accuracy of the work time of the worker to be predicted.

なお、本発明は前述した実施例に限定されるものではなく、添付した特許請求の範囲の趣旨内における様々な変形例及び同等の構成が含まれる。たとえば、前述した実施例は本発明を分かりやすく説明するために詳細に説明したものであり、必ずしも説明した全ての構成を備えるものに本発明は限定されない。また、ある実施例の構成の一部を他の実施例の構成に置き換えてもよい。また、ある実施例の構成に他の実施例の構成を加えてもよい。また、各実施例の構成の一部について、他の構成の追加、削除、または置換をしてもよい。 The present invention is not limited to the above-described embodiment, and includes various modifications and equivalent configurations within the scope of the attached claims. For example, the above-described examples have been described in detail in order to explain the present invention in an easy-to-understand manner, and the present invention is not necessarily limited to those having all the described configurations. Further, a part of the configuration of one embodiment may be replaced with the configuration of another embodiment. Further, the configuration of another embodiment may be added to the configuration of one embodiment. In addition, other configurations may be added, deleted, or replaced with respect to a part of the configurations of each embodiment.

また、前述した各構成、機能、処理部、処理手段等は、それらの一部又は全部を、たとえば集積回路で設計する等により、ハードウェアで実現してもよく、プロセッサがそれぞれの機能を実現するプログラムを解釈し実行することにより、ソフトウェアで実現してもよい。 Further, each of the above-described configurations, functions, processing units, processing means, etc. may be realized by hardware by designing a part or all of them by, for example, an integrated circuit, and the processor realizes each function. It may be realized by software by interpreting and executing the program to be executed.

各機能を実現するプログラム、テーブル、ファイル等の情報は、メモリ、ハードディスク、ＳＳＤ（ＳｏｌｉｄＳｔａｔｅＤｒｉｖｅ）等の記憶装置、又は、ＩＣ（ＩｎｔｅｇｒａｔｅｄＣｉｒｃｕｉｔ）カード、ＳＤカード、ＤＶＤ（ＤｉｇｉｔａｌＶｅｒｓａｔｉｌｅＤｉｓｃ）の記録媒体に格納することができる。 Information such as programs, tables, and files that realize each function is recorded in a memory, hard disk, storage device such as SSD (Solid State Drive), or IC (Integrated Circuit) card, SD card, DVD (Digital Versaille Disc). It can be stored in a medium.

また、制御線や情報線は説明上必要と考えられるものを示しており、実装上必要な全ての制御線や情報線を示しているとは限らない。実際には、ほとんど全ての構成が相互に接続されていると考えてよい。 In addition, the control lines and information lines indicate those that are considered necessary for explanation, and do not necessarily indicate all the control lines and information lines necessary for implementation. In practice, it can be considered that almost all configurations are interconnected.

１００データ分析システム
１０１業務システム
１０２データ分析装置
１１０業務実績ＤＢ
１２１モデル生成処理
１２２業務改善施策生成処理
２０１プロセッサ
２０２記憶デバイス
２２１業務実績データ
２２２商品情報
２２３使用学習データ
２２４作業員属性使用学習データ
２２５中間学習データ
２２６第１影響度テーブル
２２７抽象化変数使用学習データ
２２８第２影響度テーブル
２２９作業員特徴使用学習データ
２３０作業員属性・特徴データ
２３１使用予測モデル
２３２作業員属性使用予測モデル
２３３中間予測モデル
２３４抽象化変数使用予測モデル
２３５作業員特徴使用予測モデル 100 Data analysis system 101 Business system 102 Data analysis device 110 Business performance DB
121 Model generation processing 122 Business improvement measure generation processing 201 Processor 202 Storage device 221 Business performance data 222 Product information 223 Usage learning data 224 Worker attribute usage learning data 225 Intermediate learning data 226 First impact table 227 Abstract variable usage learning data 228 Second influence table 229 Worker feature usage learning data 230 Worker attribute / feature data 231 Usage prediction model 232 Worker attribute usage prediction model 233 Intermediate prediction model 234 Abstract variable usage prediction model 235 Worker feature usage prediction model

Claims

A data analyzer having a processor that executes a program and a storage device that stores the program.
The processor
Acquisition of first learning data having an objective variable indicating the work time, an explanatory variable related to the work environment, and a worker variable indicating whether or not the work is performed for each worker for each work of a plurality of works. Processing and
A prediction model generation process that generates a first prediction model that predicts the working time based on the first learning data acquired by the acquisition process.
By inputting the first training data into the first prediction model generated by the prediction model generation process, the first prediction value of the work time and the worker variable are the first prediction value for each work. A calculation process for calculating the first degree of influence, which indicates the degree of influence on the statistic, for each work, and
Based on the distribution of the first influence degree of each of the plurality of works for each worker, an abstract variable that abstracts the work speeds of the plurality of workers is generated for each work, and for each work, A training data generation process for generating a second training data having the objective variable, the explanatory variable, and the abstraction variable.
A data analyzer characterized by performing.

The data analyzer according to claim 1.
The first learning data includes a worker attribute variable indicating the number of days of experience of each of the plurality of workers who performed the work for each work.
In the prediction model generation process, the processor generates the first prediction model based on the first learning data including the worker attribute variable.
In the calculation process, the processor inputs the first learning data including the worker attribute variable into the first prediction model for each work, so that the first predicted value of the working time and the explanatory variable are obtained. , The first degree of influence indicating the degree of influence that each of the worker attribute variable and the worker variable has on the statistic of the first predicted value for each work is calculated for each work.
In the learning data generation process, the processor generates the abstract variable for each work based on the distribution of the first influence degree for each worker, and the second training data is used for each work. Add the worker attribute variable,
A data analyzer characterized by this.

The data analyzer according to claim 2.
The worker attribute variable includes the number of days of experience of another worker who is within a predetermined distance from the worker and whose working hours overlap.
A data analyzer characterized by this.

The data analyzer according to claim 2.
In the prediction model generation process, the processor generates a second prediction model that predicts the work time based on the first learning data including the worker attribute variable.
A data analyzer characterized by this.

The data analyzer according to claim 1.
In the prediction model generation process, the processor generates a third prediction model that predicts the working time based on the second learning data.
In the calculation process, the processor inputs the second training data into the third prediction model generated by the prediction model generation process, so that the second prediction value of the work time and the explanatory variable are the work. The second degree of influence, which indicates the degree of influence on the statistic of the second predicted value for each, is calculated for each of the operations.
In the learning data generation process, the processor generates a worker characteristic variable indicating the personal characteristics of the worker with respect to the explanatory variable based on the distribution of the second influence degree for each work, and the first. 2 Generate third learning data by adding the worker characteristic variable for each work to the training data.
A data analyzer characterized by this.

The data analyzer according to claim 5.
In the prediction model generation process, the processor generates a fourth prediction model that predicts the work time based on the third learning data including the worker characteristic variable.
A data analyzer characterized by this.

The data analyzer according to claim 1.
The processor
The objective variable and the explanatory variable are generated based on the first actual data having the worker who performed the work, the work time, and the work environment for each work, and the work is performed from the worker for each work. By generating a member variable, a conversion process for converting the first actual data into the first learning data is executed.
In the acquisition process, the processor converts the first actual data into the second actual data by recalculating the working time based on the change in the working environment of the first actual data, and the second actual data. Based on the actual data, the objective variable indicating the recalculated work time for each work, the explanatory variable for the changed work environment, and the execution of the work for each worker based on the change in the work environment. Acquire the fourth training data having the worker variable indicating the presence or absence, and
In the calculation process, the processor inputs the fourth learning data regarding the worker to be predicted into the first prediction model, and thereby obtains the third predicted value of the work time for the worker to be predicted. Calculate for each,
A data analyzer characterized by this.

The data analyzer according to claim 7.
In the calculation process, when the processor has more than the first predetermined number of operations indicating that the worker to be predicted has performed in the worker variable of the fourth learning data, the first prediction model By inputting the fourth learning data into, the third predicted value of the working time for the worker to be predicted is calculated for each of the tasks.
A data analyzer characterized by this.

The data analyzer according to claim 8.
The processor
Judgment processing for determining whether or not the third predicted value of the working time satisfies a predetermined improvement condition, and
When it is determined by the determination process that the improvement condition is satisfied, the determination process for determining the work environment related to the second actual data as an improvement measure, and the determination process.
Output processing that outputs the determination result by the determination processing and
A data analyzer characterized by performing.

The data analyzer according to claim 4.
The processor
The objective variable and the explanatory variable are generated based on the first actual data having the worker who performed the work, the work time, and the work environment for each work, and the work is performed from the worker for each work. By generating a member variable, a conversion process for converting the first actual data into the first learning data is executed.
In the acquisition process, the processor converts the first actual data into the second actual data by recalculating the working time based on the change in the working environment of the first actual data, and the second actual data. Based on the actual data, the objective variable indicating the recalculated work time for each work, the explanatory variable for the changed work environment, and the execution of the work for each worker based on the change in the work environment. Acquire the fourth training data having the worker variable indicating the presence or absence and adding the worker attribute variable.
In the calculation process, the processor inputs the fourth learning data regarding the worker to be predicted into the second prediction model, and thereby obtains the third predicted value of the work time for the worker to be predicted. Calculate for each,
A data analyzer characterized by this.

The data analyzer according to claim 10.
In the calculation process, when the number of operations indicating that the worker to be predicted has performed within the predetermined period in the worker variable of the fourth training data does not exist in the second predetermined number or more, the processor said. By inputting the fourth learning data regarding the worker to be predicted into the second prediction model, the third predicted value of the working time for the worker to be predicted is calculated for each work.
A data analyzer characterized by this.

The data analyzer according to claim 5.
The processor
The objective variable and the explanatory variable are generated based on the first actual data having the worker who performed the work, the work time, and the work environment for each work, and the work is performed from the worker for each work. By generating a member variable, a conversion process for converting the first actual data into the first learning data is executed.
In the acquisition process, the processor converts the first actual data into the second actual data by recalculating the working time based on the change in the working environment of the first actual data, and the second actual data. Based on the actual data, the objective variable indicating the recalculated work time for each work, the explanatory variable for the changed work environment, and the execution of the work for each worker based on the change in the work environment. Acquire the fourth training data having the worker variable indicating the presence / absence and adding the abstraction variable.
In the calculation process, the processor inputs the fourth learning data regarding the worker to be predicted into the third prediction model, and thereby obtains the third predicted value of the work time for the worker to be predicted. Calculate for each,
A data analyzer characterized by this.

The data analyzer according to claim 12.
In the calculation process, when the processor has a second predetermined number or more of operations indicating that the worker to be predicted has performed within a predetermined period in the worker variable of the fourth training data, the said processor. By inputting the fourth learning data regarding the worker to be predicted into the third prediction model, the third predicted value of the working time for the worker to be predicted is calculated for each work.
A data analyzer characterized by this.

The data analyzer according to claim 6.
The processor
The objective variable and the explanatory variable are generated based on the first actual data having the worker who performed the work, the work time, and the work environment for each work, and the work is performed from the worker for each work. By generating a member variable, a conversion process for converting the first actual data into the first learning data is executed.
In the acquisition process, the processor converts the first actual data into the second actual data by recalculating the working time based on the change in the working environment of the first actual data, and the second actual data. Based on the actual data, the objective variable indicating the recalculated work time for each work, the explanatory variable for the changed work environment, and the execution of the work for each worker based on the change in the work environment. Acquire the fourth training data having the worker variable indicating the presence or absence and adding the worker characteristic variable.
In the calculation process, the processor inputs the fourth learning data regarding the worker to be predicted into the fourth prediction model, and thereby obtains the third predicted value of the work time for the worker to be predicted. Calculate for each,
A data analyzer characterized by this.

A data analysis method executed by a data analyzer having a processor that executes a program and a storage device that stores the program.
The data analysis method is
The processor
Acquisition of first learning data having an objective variable indicating the work time, an explanatory variable related to the work environment, and a worker variable indicating whether or not the work is performed for each worker for each work of a plurality of works. Processing and
A prediction model generation process that generates a first prediction model that predicts the working time based on the first learning data acquired by the acquisition process.
By inputting the first training data into the first prediction model generated by the prediction model generation process, the first prediction value of the work time and the worker variable are the first prediction value for each work. A calculation process for calculating the first degree of influence, which indicates the degree of influence on the statistic, for each work, and
Based on the distribution of the first influence degree of each of the plurality of works for each worker, an abstract variable that abstracts the work speeds of the plurality of workers is generated for each work, and for each work, A training data generation process for generating a second training data having the objective variable, the explanatory variable, and the abstraction variable.
A data analysis method characterized by performing.